Program Outline
Azure Data Engineering with Databricks
A hands-on data engineering program covering end-to-end pipeline development on Azure, from raw ingestion through transformation to serving. Participants work with Apache Spark, Delta Lake, and Azure Data Factory under real pipeline design constraints, not simplified examples.
Delivery
Virtual or On-site
Duration
5 days
Best Fit
Audience Profile
Who This Program Is For
Data engineers and architects building or modernizing data platforms on Azure
Overview
Program Summary
Build scalable data pipelines using Azure Databricks, Delta Lake, and Apache Spark, covering medallion architecture and Azure Data Factory integration.
Course Outline
Complete Module Sequence
Review the full module sequence for this program, including the primary topic coverage in each module where available.
Module 1
Delta Lake and Medallion Architecture
Design and implement Bronze-Silver-Gold pipeline patterns using Delta Lake, schema management, and incremental processing for scalable data platforms.
- Bronze-Silver-Gold pipeline patterns with Delta Lake
- Schema management
- Incremental processing for scalable data platforms
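To make the pattern concrete, here is a minimal PySpark sketch of a Bronze landing step followed by an incremental Silver upsert. It assumes a Databricks runtime (or a Spark session with the delta-spark package); the paths and column names (/mnt/raw/orders, order_id, order_ts) are illustrative placeholders, not course materials.

```python
# Minimal Bronze -> Silver sketch. Assumes a Databricks runtime (or a local
# Spark session with the delta-spark package); paths and columns are hypothetical.
from pyspark.sql import SparkSession, functions as F
from delta.tables import DeltaTable

spark = SparkSession.builder.getOrCreate()

# Bronze: land raw files as-is, adding only load metadata.
raw = spark.read.format("json").load("/mnt/raw/orders/")   # hypothetical landing path
(raw.withColumn("_ingested_at", F.current_timestamp())
    .write.format("delta").mode("append").save("/mnt/bronze/orders"))

# Silver: enforce types, deduplicate, then upsert incrementally with MERGE.
bronze = spark.read.format("delta").load("/mnt/bronze/orders")
clean = (bronze
         .withColumn("order_ts", F.to_timestamp("order_ts"))
         .dropDuplicates(["order_id"]))

if DeltaTable.isDeltaTable(spark, "/mnt/silver/orders"):
    (DeltaTable.forPath(spark, "/mnt/silver/orders").alias("t")
        .merge(clean.alias("s"), "t.order_id = s.order_id")
        .whenMatchedUpdateAll()
        .whenNotMatchedInsertAll()
        .execute())
else:
    clean.write.format("delta").mode("overwrite").save("/mnt/silver/orders")
```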
Module 2
Apache Spark Optimization
Write and tune Spark jobs for cost-efficient processing at enterprise scale, covering partitioning, caching, and cluster configuration.
- Writing and tuning Spark jobs for cost-efficient processing at enterprise scale
- Partitioning and caching
- Cluster configuration
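As a hedged illustration of the tuning levers this module works through, the sketch below sets common shuffle and adaptive-execution options, broadcasts a small dimension table to avoid shuffling the large side of a join, caches a reused DataFrame, and writes a partitioned Delta table. The table names, columns, and setting values are hypothetical examples, not recommended defaults.

```python
# Illustrative tuning sketch for a Spark 3.x / Databricks session;
# table names, column names, and settings are hypothetical.
from pyspark.sql import SparkSession, functions as F
from pyspark.sql.functions import broadcast

spark = SparkSession.builder.getOrCreate()

# Shuffle and adaptive-execution settings are tuned per workload, not fixed globally.
spark.conf.set("spark.sql.shuffle.partitions", "200")
spark.conf.set("spark.sql.adaptive.enabled", "true")   # let AQE coalesce small partitions

orders = spark.table("bronze.orders")        # large fact table (hypothetical)
countries = spark.table("ref.countries")     # small dimension (hypothetical)

# Broadcast the small side so the large table is not shuffled for the join.
joined = orders.join(broadcast(countries), "country_code")

# Cache only when the same DataFrame feeds multiple downstream actions.
joined.cache()
daily = joined.groupBy("order_date").agg(F.sum("amount").alias("revenue"))
by_country = joined.groupBy("country_code").count()
daily.show()
by_country.show()

# Partition the output by a low-cardinality column that queries filter on.
(joined.write.format("delta")
       .mode("overwrite")
       .partitionBy("order_date")
       .saveAsTable("silver.orders_enriched"))
```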
Module 3
Azure Data Factory Orchestration
Build ADF-triggered pipeline workflows that invoke Databricks compute, monitor job runs, and handle failures with retry and alerting logic.
- ADF-triggered pipeline workflows that invoke Databricks compute
- Job run monitoring
- Failure handling with retry and alerting logic
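The sketch below shows one plausible shape of the Databricks side of an ADF Notebook activity: parameters arrive via notebook widgets, a processing step runs inside a simple retry loop, and a final failure is re-raised so ADF marks the activity failed and its own retry and alerting rules take over. The widget names, paths, and retry policy are hypothetical, and dbutils is provided by the Databricks notebook environment rather than imported.

```python
# Databricks-notebook side of an ADF "Notebook" activity (a sketch only).
# dbutils is injected by the Databricks notebook environment, not imported;
# widget names, paths, and the retry policy below are hypothetical.
import time
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Parameters passed from the ADF activity's baseParameters.
dbutils.widgets.text("source_path", "")
dbutils.widgets.text("target_table", "")
source_path = dbutils.widgets.get("source_path")
target_table = dbutils.widgets.get("target_table")

def load_increment() -> int:
    """One processing attempt; returns the number of rows appended."""
    df = spark.read.format("json").load(source_path)
    df.write.format("delta").mode("append").saveAsTable(target_table)
    return df.count()

# Simple in-notebook retry with backoff. If every attempt fails, re-raise so the
# ADF activity is marked failed and its own retry/alerting rules take over.
last_error = None
for attempt in range(1, 4):
    try:
        rows = load_increment()
        dbutils.notebook.exit(f'{{"status": "ok", "rows": {rows}}}')  # result visible to ADF
    except Exception as exc:   # broad catch is acceptable for a sketch
        last_error = exc
        time.sleep(30 * attempt)
raise RuntimeError(f"All retries failed: {last_error}")
```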
Module 4
Data Quality and Unity Catalog
Implement data quality checks, lineage tracking, and catalog-level governance using Databricks Unity Catalog for enterprise data reliability.
- Data quality checks
- Lineage tracking
- Catalog-level governance with Databricks Unity Catalog
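As an illustrative sketch only, the snippet below splits a Unity Catalog table into valid and quarantined rows, adds a Delta CHECK constraint so future bad writes are rejected at the table, and emits simple counts a monitoring job could track. The catalog, schema, and rule names are hypothetical; on supported Databricks compute, Unity Catalog records lineage for these reads and writes automatically, so no extra code is needed for lineage tracking.

```python
# Data-quality sketch for a workspace with Unity Catalog enabled; the
# catalog/schema/table names and the rules themselves are hypothetical.
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.getOrCreate()

df = spark.table("main.silver.orders")   # three-level Unity Catalog name

# Rule-based split: rows that violate any rule go to a quarantine table.
rules = (F.col("order_id").isNotNull()
         & (F.col("amount") >= 0)
         & F.col("order_date").isNotNull())
valid = df.filter(rules)
invalid = df.filter(~rules)

valid.write.format("delta").mode("overwrite").saveAsTable("main.gold.orders")
invalid.write.format("delta").mode("append").saveAsTable("main.quarantine.orders")

# A Delta CHECK constraint rejects violating rows at write time from now on.
spark.sql("""
    ALTER TABLE main.gold.orders
    ADD CONSTRAINT non_negative_amount CHECK (amount >= 0)
""")

# Simple metrics a monitoring job or dashboard could track over time.
print({"total_rows": df.count(), "quarantined_rows": invalid.count()})
```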
Coverage Areas
Topic Coverage
- Delta Lake and medallion architecture design
- Apache Spark optimization and performance tuning
- Azure Data Factory orchestration with Databricks
- Data quality and Unity Catalog governance
Customization
Adapt This Program for Your Team
We can adapt this program around your team structure, platform priorities, delivery goals, and the scenarios your people need to work through in practice.
- Align datasets and schemas to the team's existing sources
- Include Unity Catalog configuration and access control
- Add a structured streaming module (Event Hubs or Kafka)
