COURSE OBJECTIVE:
During this course you will learn to:
• Master Azure Synapse Analytics architecture and key concepts.
• Build data pipelines with Synapse Pipelines.
• Leverage dedicated SQL pools & serverless Spark pools for data warehousing & big data analysis.
• Develop data models and perform SQL queries for analysis.
• Analyze data with Spark and Delta Lake.
• Visualize & report data using Power BI.
• Monitor & optimize data pipelines for performance.
• Design & build data warehouse models (star/snowflake schemas).
• Load data efficiently into dedicated SQL pools.
• Perform complex queries on large data sets.
• Manage & secure Synapse Analytics data warehouses.
• Process large data sets with serverless Spark pools.
• Utilize Spark SQL & DataFrames for data exploration & transformation.
• Implement Delta Lake for reliable data storage & version control.
• Work with streaming data using Synapse SQL Streaming.
• Integrate machine learning models with Spark MLlib & other frameworks.
• Preprocess & prepare data for machine learning tasks.
• Train & evaluate machine learning models within Synapse Analytics.
• Deploy & manage machine learning models in production.
• Understand the business value of data analytics & big data projects.
• Learn best practices for building & deploying data solutions with Synapse Analytics.
• Prepare for data engineer, analyst, & architect roles using Azure Synapse Analytics.
TARGET AUDIENCE:
This course is destinated to administrators and data specialists looking to mplement a Data Analytics Solution with Azure Synapse Analytics.
COURSE PREREQUISITES:
The participants should have familiarity with notebooks that use different languages and a Spark engine, such as Databricks, Jupyter Notebooks, Zeppelin notebooks and more. They should also have some experience with SQL, Python, and Azure tools, such as Data Factory.
COURSE CONTENT:
MODULE 1: Introduction to Azure Synapse Analytics
• Identify the business problems that Azure Synapse Analytics addresses.
• Describe core capabilities of Azure Synapse Analytics.
• Determine when to use Azure Synapse Analytics.
MODULE 2: Use Azure Synapse serverless SQL pool to query files in a data lake
• Identify capabilities and use cases for serverless SQL pools in Azure Synapse Analytics
• Query CSV, JSON, and Parquet files using a serverless SQL pool
• Create external database objects in a serverless SQL pool
MODULE 3: Analyze data with Apache Spark in Azure Synapse Analytics
• Identify core features and capabilities of Apache Spark.
• Configure a Spark pool in Azure Synapse Analytics.
• Run code to load, analyze, and visualize data in a Spark notebook.
MODULE 4: Use Delta Lake in Azure Synapse Analytics
• Describe core features and capabilities of Delta Lake.
• Create and use Delta Lake tables in a Synapse Analytics Spark pool.
• Create Spark catalog tables for Delta Lake data.
• Use Delta Lake tables for streaming data.
• Query Delta Lake tables from a Synapse Analytics SQL pool.
MODULE 5: Analyze data in a relational data warehouse
• Design a schema for a relational data warehouse.
• Create fact, dimension, and staging tables.
• Use SQL to load data into data warehouse tables.
• Use SQL to query relational data warehouse tables.
MODULE 6: Build a data pipeline in Azure Synapse Analytics
• Describe core concepts for Azure Synapse Analytics pipelines.
• Create a pipeline in Azure Synapse Studio.
• Implement a data flow activity in a pipeline.
• Initiate and monitor pipeline runs.
FOLLOW ON COURSES:
Not available. Please contact.