ADF Course Content
ADF Course Content
o Importance of ETL
o What is OLTP
o Limitation of OLTP Systems
o What is Data warehouse
o Limitations of traditional data warehouse
o What is ETL(Extract Transformation and Loading)
o What is ELT(Extract Loading and Transformation)
o ETL Tools in the Market
o Importance of Transformation
o What is ADF
o Why ADF
o Advantages compared to traditional ETL tools
o Different Modern Data warehouse systems in Market
Introduction to the Azure Data Engineering Products
o Azure SQL
o Azure Synapse
o Azure Data Lake
o Azure Data Factory
o Azure Databricks
o Azure Keyvault
o Azure Cosmos DB
Roles and Responsibilities of Azure Data Factory and Data Engineer
o Different types of data
▪ Structured
▪ Unstructured
▪ Semi-Structured
▪ JSON
▪ Parquet
▪ AVRO
▪ ORC
▪ Scheduled Triggers
o Connectors overview
o Data Transformations using Mapping Data flows
Introduction to Azure Data Factory
o Difference between ADF version1 and version2
o Building Blocks of ADF
▪ Pipelines
▪ Activities
▪ Datasets
▪ Linked Services
▪ Integration Runtimes
▪ Hot
▪ Cool
▪ Achieve
● General
● Source
● Logging Settings
● User Properties
Scenario3
o How copy the files with some pattern to another container
Scenario4
o Conditional Split with if condition
▪ Used activities
● Get Metadata
● Filter
● Foreach
● If Condition
Scenario5
o Send an email on Pipeline Failure
▪ Web Activity
● Settings
Scenario6
o Migrate all tables of database into datalake with single pipeline
▪ Lookup Activity --> Foreach Activity --> Copy Activity --> with
Parameters
Scenario7
o How to apply more than one conditional split in ADF Activities
▪ Switch Activity
Scenario8
o How to run the pipeline until file arrives in Datalake Container
▪ Until Activity
Scenario9
o How to load the filtered data to the azure sql table/Synapse table
▪ Script Activity
Scenario11
o Use cases of Execute pipeline activity
o Use cases of Validation Activity
o Use cases of Wait Activity
o Use cases of Set and Append Variable Activity
Recap of all the activities
o Copy Data, Append Variable, Delete, Execute Pipeline, Get Metadata,
Lookup, Stored Procedure, Script, Set Variable, Validation, Web Activity, Wait,
Filter Activity, For Each Activity, If Condition, Switch, Until
Parameterization
o Parameterize datasets, Linked Services
o Bulk load of data from Azure SQL to Data Lake Storage
Data Flows Introduction
o ELT(Extract Load and Transform)
o Two DataFlows in ADF
o DataFlow Activity in Pipeline
o Mapping Data Flows
o Configuring Mapping Dataflows as Dataflow Activity in PipeLine
o DataFlow Debug Explanation
o Introduction to Transformations
▪ Source
▪ Sink
▪ Union
▪ Filter
▪ Select
▪ Derived Column
▪ Join, etc.
o Difference between source , sink of “Copy Data” activity And source , sink of
“Mapping Data Flows”.
Dataflow Scenario1
o Source as (datalake) Sink as (SQL table) --> by using dataflow source and
sink transformation
Extract data from RDBMS to Datalake --> Apply Filter
o Filter Transformation (Source --> Filter --> Sink) Execute with Dataflow
Activity
Data Flow Scenario on Select Transformation
o Select Transformation
▪ Flow
▪ Columns condition
▪ Column pattern
▪ Name
▪ Position
▪ Any row
▪ First row
▪ Last row
▪ Ungrouping column
▪ Descending”
Stringify Transformation
o Reading Json data
o Reading a single Json record
o Reading array of documents
o Converting complex types into String using Stringify and derived column
transformations
Parse Transformation
o Parse transformation in Mapping DataFlows
o How to handle and parse string collection( delimited string values )
o How to handle and parse xml data
o How to handle and parse json data
Flatten Transformation
o Converting Complex Json nested structures into CSV/Text file.
o Complex data processing (transformations).
o How parse nested Json records
o How to flatten array of values into multiple rows.
o If data has complex structures, what are supportive data store formats .
o How to write into json format.
o How to write into flatten file format( csv).
o Transformations used to process data.
▪ Parse
▪ Flatten
▪ Derived Column
▪ Select
▪ Sink