
Data Engineering

This training presents, through detailed explanations and real-world examples, the roles, responsibilities, architectures, tools, and best practices essential for modern data platforms. Hands-on labs provide participants the opportunity to work with leading technologies such as Snowflake, Apache Spark, dbt, and Airflow to build, optimize, and orchestrate scalable data pipelines.

Day 1: Data Engineering Foundations

  • Understand the core responsibilities of data engineers and key architectural patterns
  • Explore Snowflake as a modern cloud data warehouse and learn its deployment models
  • Learn best practices in data modeling, including relational vs dimensional models and schema design
  • Configure a Snowflake environment with proper security and access control
  • Design and create initial data models using dimensional modeling concepts
  • Set up dbt for version-controlled data transformation workflows
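The Day 1 labs build dimensional models in Snowflake and dbt; the core idea — a central fact table of measures joined to descriptive dimension tables — can be sketched in plain Python. Table and column names below are illustrative placeholders, not course materials:

```python
# A minimal star schema: one fact table referencing two dimension tables.
# Names (dim_customer, fact_orders, ...) are illustrative placeholders.
dim_customer = {1: {"name": "Acme Corp", "region": "EMEA"},
                2: {"name": "Globex", "region": "APAC"}}
dim_date = {20240101: {"year": 2024, "quarter": "Q1"}}

fact_orders = [  # each row holds foreign keys plus an additive measure
    {"customer_id": 1, "date_id": 20240101, "amount": 120.0},
    {"customer_id": 2, "date_id": 20240101, "amount": 80.0},
    {"customer_id": 1, "date_id": 20240101, "amount": 50.0},
]

def revenue_by_region(facts, customers):
    """Aggregate a fact-table measure along a dimension attribute."""
    totals = {}
    for row in facts:
        region = customers[row["customer_id"]]["region"]
        totals[region] = totals.get(region, 0.0) + row["amount"]
    return totals

print(revenue_by_region(fact_orders, dim_customer))
# {'EMEA': 170.0, 'APAC': 80.0}
```

In the labs, the same join-and-aggregate pattern is expressed as SQL in dbt models against Snowflake tables.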

Day 2: Data Processing and Workflow Management

  • Dive deeper into Apache Spark for efficient batch processing and performance tuning
  • Learn data ingestion and transformation patterns with Spark and dbt
  • Design, schedule, and monitor workflows using Apache Airflow for orchestration
  • Implement optimized Spark data transformations with DataFrames and RDDs
  • Build and orchestrate an end-to-end batch pipeline using Spark, dbt, and Airflow
  • Apply benchmarking and code review techniques to evaluate pipeline efficiency
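Airflow expresses a pipeline as a DAG of tasks executed in dependency order. Stripped of scheduling, retries, and operators, that core idea can be sketched with the standard library; the task names here are illustrative, mirroring the extract–transform–load chain built in the Day 2 lab:

```python
from graphlib import TopologicalSorter  # stdlib since Python 3.9

# Each task maps to the set of upstream tasks it depends on,
# the same wiring an Airflow DAG declares with extract >> transform >> load.
dag = {
    "load_warehouse": {"run_dbt_models"},
    "run_dbt_models": {"spark_transform"},
    "spark_transform": {"extract_raw"},
    "extract_raw": set(),
}

def run_pipeline(dag):
    """Execute each task only after all of its upstream tasks have finished."""
    order = list(TopologicalSorter(dag).static_order())
    for task in order:
        print(f"running {task}")  # a real runner would invoke the task here
    return order

run_pipeline(dag)
# runs extract_raw, then spark_transform, then run_dbt_models, then load_warehouse
```

Airflow adds scheduling, retries, backfills, and monitoring on top of this ordering, which is what the lab exercises explore.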

Day 3: Advanced Topics and Capstone Project

  • Explore real-time data processing using Spark Streaming and event-driven architectures
  • Extract and visualize data using tools like Tableau or Looker with Snowflake as the backend
  • Complete a capstone project integrating multiple tools, followed by peer feedback and presentation
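Spark Streaming's windowed aggregations group events into time buckets before aggregating. The underlying tumbling-window idea can be sketched in plain Python, assuming a stream of (timestamp, event) pairs; the sample events are illustrative:

```python
# Tumbling-window count: bucket events into fixed, non-overlapping windows,
# the same grouping Spark Structured Streaming performs at scale.
events = [  # (epoch_seconds, event_type) -- illustrative sample data
    (0, "click"), (3, "click"), (7, "view"),
    (11, "click"), (14, "view"), (21, "click"),
]

def tumbling_window_counts(events, window_seconds):
    """Count events per fixed-size window, keyed by window start time."""
    counts = {}
    for ts, _ in events:
        window_start = (ts // window_seconds) * window_seconds
        counts[window_start] = counts.get(window_start, 0) + 1
    return counts

print(tumbling_window_counts(events, 10))
# {0: 3, 10: 2, 20: 1}
```

A real streaming engine additionally handles late-arriving events, watermarks, and incremental state, which is where the Day 3 material picks up.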

This course is available online and onsite, and is fully customizable to your needs.
*The course is also available in French.


Learning outcomes:

This training will equip data engineering professionals to design and manage scalable data systems, automate workflows, and transform raw data into actionable insights. It will also enable your teams to optimize data management processes, improve data quality, accelerate analysis, and reduce operational costs.

Your profile and prerequisites:

  • Data Engineers
  • Software researchers interested in data engineering and management systems.

With knowledge of:

  • Programming skills (1+ year experience).
  • Basic SQL knowledge.
  • Comfort with command line.
  • Familiarity with Python is a plus.

Learning outcomes:

You will learn how to effectively apply SRE hard and soft skills in your work and architecture.

  1. Understand what SRE is, why it is important, and learn how it can be applied in practice with the Digital Highway for Software Delivery.
  2. Learn how to understand the inner workings of your application in production by applying SLO engineering principles and Observability.
  3. Learn how to continuously deliver software into production and how to embrace the shift-right paradigm through Continuous Verification and Rollbacks.
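SLO engineering turns a reliability target into an error budget — the fraction of requests (1 − SLO target) that are allowed to fail. A minimal calculation of the budget remaining, with illustrative numbers:

```python
def error_budget_remaining(slo_target, total_requests, failed_requests):
    """Fraction of the error budget still unspent (negative means it is blown)."""
    budget = (1.0 - slo_target) * total_requests  # allowed failures for the period
    return (budget - failed_requests) / budget

# A 99.9% availability SLO over 1,000,000 requests allows ~1,000 failures.
remaining = error_budget_remaining(0.999, 1_000_000, 400)
print(f"{remaining:.0%} of the error budget remains")
# 60% of the error budget remains
```

Teams then spend the remaining budget deliberately, e.g. on riskier releases, and slow down when it nears zero.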

Your profile and prerequisites:

  • Software engineers 
  • DevOps engineers
  • System engineers
  • ML Architects

With knowledge of:

  • Software Engineering skills (OOP, Scripting, ad hoc code, …)
  • System Engineering skills (OS, Network, Deployment, Security, Monitoring, …)
  • Advantageous: Performance Analysis, Release Engineering, APM/Infra Monitoring, Distributed/Reliable Architecture Design