Lakshmi kanakamu’s Post


Microsoft certified Azure Data engineer | Database Administrator

𝗣𝗼𝗹𝘆𝗕𝗮𝘀𝗲 is a feature in Azure Synapse Analytics that lets you access and query external data stored in Azure Blob Storage or Azure Data Lake Storage directly with T-SQL. It makes Extract, Load, and Transform (ELT) operations efficient, because the data is loaded first and transformed inside the SQL pool.

Setting it up takes a handful of steps:
1. Create a master key for the database
2. Create a database scoped credential
3. Create an external data source
4. Create an external file format
5. Create a schema
6. Create an external table
7. Query the data

𝗞𝗲𝘆 𝗙𝗲𝗮𝘁𝘂𝗿𝗲𝘀 𝗼𝗳 𝗣𝗼𝗹𝘆𝗕𝗮𝘀𝗲:
1. 𝗗𝗮𝘁𝗮 𝗩𝗶𝗿𝘁𝘂𝗮𝗹𝗶𝘇𝗮𝘁𝗶𝗼𝗻: PolyBase lets you query external data without moving it into the data warehouse, so you can access external data and join it with relational tables in your SQL pool.
2. 𝗘𝘅𝘁𝗲𝗿𝗻𝗮𝗹 𝗧𝗮𝗯𝗹𝗲𝘀: You can create external tables that reference data stored in Azure Blob Storage or Azure Data Lake Storage. These tables can be queried just like regular tables in your SQL pool.
3. 𝗦𝘂𝗽𝗽𝗼𝗿𝘁𝗲𝗱 𝗙𝗼𝗿𝗺𝗮𝘁𝘀: PolyBase supports delimited text files (UTF-8 and UTF-16), Hadoop file formats (RCFile, ORC, Parquet), and compressed files (Gzip, Snappy).
4. 𝗦𝗰𝗮𝗹𝗮𝗯𝗶𝗹𝗶𝘁𝘆: It uses the massively parallel processing (MPP) architecture of Azure Synapse Analytics, so reads from external storage are spread across compute nodes and scale to large data sets.
5. 𝗥𝗲𝗱𝘂𝗰𝗲𝗱 𝗘𝗧𝗟: With PolyBase you can minimize the need for traditional Extract, Transform, and Load (ETL) processes, because data can be loaded directly into staging tables and transformed within the SQL pool.
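To make the seven steps concrete, here is a minimal sketch that prints one T-SQL batch per step, plus an optional CTAS load into a staging table for the "Reduced ETL" point. It assumes a dedicated SQL pool, a Blob Storage account accessed with a storage account key, and a comma-delimited file with a header row. Every object name (PolyBaseCred, SalesBlob, CsvFormat, ext.Sales, dbo.SalesStaging), the storage account, container, folder, and column list are hypothetical placeholders, not something from a real environment.

```python
def polybase_statements(storage_account="mystorageaccount",
                        container="mycontainer",
                        folder="/sales/"):
    """Return the T-SQL batches for a PolyBase setup, in step order.

    All names and paths are hypothetical placeholders. Run each string
    as its own batch (CREATE SCHEMA must be alone in its batch).
    """
    location = f"wasbs://{container}@{storage_account}.blob.core.windows.net"
    return [
        # 1. Master key: protects the secret stored in the credential
        "CREATE MASTER KEY ENCRYPTION BY PASSWORD = '<strong password>';",
        # 2. Database scoped credential: holds the storage account key
        "CREATE DATABASE SCOPED CREDENTIAL PolyBaseCred "
        "WITH IDENTITY = 'user', SECRET = '<storage account key>';",
        # 3. External data source: points at the storage container
        "CREATE EXTERNAL DATA SOURCE SalesBlob "
        f"WITH (TYPE = HADOOP, LOCATION = '{location}', "
        "CREDENTIAL = PolyBaseCred);",
        # 4. External file format: comma-delimited text, skip header row
        "CREATE EXTERNAL FILE FORMAT CsvFormat "
        "WITH (FORMAT_TYPE = DELIMITEDTEXT, "
        "FORMAT_OPTIONS (FIELD_TERMINATOR = ',', FIRST_ROW = 2));",
        # 5. Schema: keeps external tables apart from regular tables
        "CREATE SCHEMA ext;",
        # 6. External table: column definitions over the files in `folder`
        "CREATE EXTERNAL TABLE ext.Sales "
        "(SaleId INT, Amount DECIMAL(10, 2), SaleDate DATE) "
        f"WITH (LOCATION = '{folder}', DATA_SOURCE = SalesBlob, "
        "FILE_FORMAT = CsvFormat);",
        # 7. Query: the external table reads like a regular table
        "SELECT TOP 10 * FROM ext.Sales;",
        # Optional: load into a staging table with CTAS, transform later
        "CREATE TABLE dbo.SalesStaging "
        "WITH (DISTRIBUTION = ROUND_ROBIN, HEAP) "
        "AS SELECT * FROM ext.Sales;",
    ]


if __name__ == "__main__":
    for number, statement in enumerate(polybase_statements(), start=1):
        print(f"-- Batch {number}\n{statement}\n")
```

Replace the placeholders before running the output against a SQL pool, and keep real passwords and storage keys out of scripts. For Parquet or ORC files, the file format in step 4 would use a different FORMAT_TYPE.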
