19/08/2025
Data Engineering ETL (extract transform load) With Azure Data Factory
by - joey gomez de jesus jr
📞whatsapp +639602727215
Donate on Paypal - https://paypal.me/awareit
Donate on Gcash - 0994009301
Business Insight Engineering Free Training
🛠️ Part 1 – Azure Deployment Environment (ADE)
https://youtu.be/HvA3nY1FkMI
🚪 Part 2 – Intro to Azure Databricks (14-day trial)
https://www.youtube.com/watch?v=JFxnBOzoxIk
🤖 Part 3 – AI-Assisted Dev with Databricks, SQL Server, PySpark
https://www.youtube.com/watch?v=nuUyia7Hoag
🔄 Part 4 – Load SQL Server Data into Databricks with PySpark + AI
https://www.youtube.com/watch?v=k-3QW_XqFMg
🔐 Part 5 – Storing Secrets in Azure Key Vault for Databricks
https://www.youtube.com/watch?v=IS0cHXlOWQI
🛠️ Part 6 – Scheduling Azure Databricks jobs
https://youtu.be/X2nUjBIi6GQ
🧩 Part 7 – Connecting Database Development Tools to Azure Databricks Datawarehouse
https://www.youtube.com/watch?v=VLBICcmr2NE
📊 Part 8 - Charts and Graph in Azure databricks
https://youtu.be/NGxMRzetyug
🏭 Part 9 - Data Engineering ETL (extract transform load) With Azure Data Factory
https://youtu.be/YmlA2Dg12GU
Azure Data Factory (ADF) is a cloud-based data integration service from Microsoft that lets you build, orchestrate, and automate data pipelines across diverse systems—whether on-premises or in the cloud.
🛠️ What It Does
- ETL/ELT Workflows: Extract, transform, and load data from multiple sources into a centralized destination
- Data Movement: Seamlessly copy data between databases, APIs, file systems, and cloud storage
- Data Transformation: Clean, enrich, and reshape data using built-in data flows or external compute services like Azure Databricks
- Scheduling & Automation: Trigger pipelines based on time, events, or conditions—fully automated and scalable
- Monitoring & Management: Track pipeline health, performance, and errors through a visual dashboard
🌐 Key Features
- No-code/low-code interface for building complex workflows
- Broad connectivity to over 90 data sources (SQL, REST, Blob, Salesforce, etc.)
- Integrated security via Entra ID and role-based access control
- Custom triggers for real-time or batch processing
- Compression, validation, and preview tools during data copy operations
🧩 Real-World Use Case
Imagine a health coalition pulling data from hospital systems, survey platforms, and cloud logs. Azure Data Factory can:
- Ingest all that data
- Clean and normalize it
- Load it into a dashboard for real-time advocacy insights
🧭 Step-by-Step ETL Guide: CSV → Blob → Azure SQL
🔧 Prerequisites
- Azure Subscription
- Azure Blob Storage with CSV uploaded
- Azure SQL Database with destination table created
- Azure Data Factory instance
🪄 1. Create Linked Services
These are your connection bridges.
- Blob Storage Linked Service
- Go to ADF Studio → Manage → Linked Services → New
- Choose Azure Blob Storage
- Authenticate via account key or managed identity
- SQL Database Linked Service
- Choose Azure SQL Database
- Provide server name, database name, credentials
📁 2. Define Datasets
These describe your source and destination formats.
- Source Dataset (CSV)
- Type: Delimited Text
- Point to the blob container and file path
- Define column delimiter, encoding, and schema (optional)
- Sink Dataset (SQL Table)
- Type: Azure SQL Table
- Point to the destination table
- Define schema mapping if needed
🔄 3. Build the Pipeline
This is your ritual flow.
- Go to Author → New Pipeline
- Add a Copy Data activity
- Source: Blob CSV dataset
- Sink: SQL dataset
- Optional: Add Mapping tab to align columns
- (Optional) Add Data Flow if you need transformations
⏰ 4. Trigger Ex*****on
Choose how and when your ritual runs.
- Manual Trigger: Click Debug or Trigger Now
- Scheduled Trigger: Add a Time-based trigger
- Event Trigger: Trigger when a new blob is uploaded
📊 5. Monitor Pipeline
Track your movement’s pulse.
- Go to Monitor tab in ADF Studio
- View run history, success/failure, duration
- Drill into activity logs for diagnostics
🧠 Bonus: SQL Table Creation Example
CREATE TABLE Customers (
CustomerID INT,
Name VARCHAR(100),
Email VARCHAR(100),
JoinDate DATE
);
📚 Reference Tutorials
- Microsoft’s official walkthrough
- SQLShack’s practical guide
- GitHub demo pipeline
- "From Raw to Ritual: ETL Data Engineering with Azure Data Factory"
- "Pipeline Power: Transforming Data with Azure Data Factory"
-- "ETL Unleashed: Scalable Data Engineering in the Cloud"
- "Azure Data Factory Masterclass: ETL for Real-Time Impact"
- "Blob to Dashboard: ETL Magic with Azure Data Factory"
- "Orchestrate the Flow: Data Engineering with Azure’s ETL Engine"
https://youtu.be/YmlA2Dg12GU
Data Engineering ETL (extract transform load) With Azure Data Factoryby - joey gomez de jesus jr📞whatsapp +639602727215Donate on Paypal - https://paypal.me...