TRDP-203
Data Engineering in a Microsoft Azure environment
Description
In this course, the student will be introduced to data engineering for batch and real-time analytics solutions using Azure data platform technologies. Students will first understand the basic compute and storage technologies used to build an analytics solution. Students will learn how to interactively explore data stored in files placed in a data warehouse. They will learn different data ingestion techniques to load data using the Apache Spark capability in Azure Synapse Analytics or Azure Databricks, or how to ingest data using Azure Data Factory or Azure Synapse pipelines. Students will also learn how to transform data in different ways using the same technologies they use for data ingestion. They understand the importance of implementing security to ensure that data is protected at rest or in transit. The student will then show how to create a real-time analytics system to create real-time analytics solutions.
Suggested For
The primary target audience for this course is data scientists, data architects and business intelligence professionals who want to learn about data design and building analytics solutions using existing data platform technologies on Microsoft Azure. The secondary audience for this course is data analysts and data scientists who work with analytics solutions built on Microsoft Azure.
Outline
- Explore compute and storage options in Azure for your data processing workloads.
- Run interactive queries using serverless SQL pools
- Explore and transform data in Azure Databricks
- Explore, transform and load data into the data warehouse using Apache Spark
- Ingest and load data into the data warehouse
- Transform data using Azure Data Factory or Azure Synapse Pipelines
- Integrate data from notebooks using Azure Data Factory or Azure Synapse Pipelines
- Support Hybrid Transactional Analytical Processing (HTAP) using Azure Synapse Link
- End-to-end security with Azure Synapse Analytics
- Real-time stream processing with Stream Analytics
- Create a stream processing solution using Event Hubs and Azure Databricks
Prerequisites
The course is in English, so a basic knowledge of the language is required.