Data Engineering

Data Engineering Learning Journey

Develop Your Data Engineering Skills

From Concepts to Cutting-Edge Data Engineering

Learn how to take data and AI ideas from concept to prototype to production-ready application. Acquire the skills to develop and run data and AI solutions at enterprise scale with ease! Take part in a specific training or advance through the entire journey. Learn how to build secure data platforms and reliable AI applications that are engineered for scale.


The Learning Journey for Data Engineers

How do you become a data engineering expert? Start here! We’ve put together a carefully crafted learning journey for data engineers. Knowing that engineers love to figure things out on their own, we packed the program with opportunities to learn hands-on by working through real-life situations. Plus, there’s plenty of practical philosophy, too.

We’ll teach you how to use Docker to ease your deployments and how to navigate code written by data scientists (Advanced Python and Data Science in Production). You will learn to use Apache Airflow, Apache Spark, and Kafka like a forklift to move data around. We won’t shy away from proven technologies either, like Elasticsearch, and we stay on the cutting edge with others, like Apache Flink.

Download Data Engineering Training Guide

Download the Xebia Guide for a complete overview of the available training sessions and Data Engineering learning journeys.


Learning Journey

Junior Data Engineer

Learning Goals for a Junior Data Engineer

  • Writes correct and clean code with guidance
  • Participates in the technical design of features with guidance
  • Knows how to integrate CI/CD concepts into their daily coding
  • Able to create simple pipelines without guidance
  • Knows how containerization works, and what it simplifies
  • Can build and push container images

Training Courses

  • Public dbt Learn Training / 2 days / Public
    Build models that shape your data from raw to transformed.
  • Python for Data Engineers / 2 days / Public & In-Company
    This 2-day Xebia training gives you the tools to make your code simple, beautiful, and truly Pythonic.
  • Data Processing at Scale / 2 days / In-Company
    This training dives deep into one of the most popular and scalable tools on the market for large-scale data transformation: Apache Spark!
  • Docker & Kubernetes / 3 days / In-Company
    This training takes you through everything you need to know to package applications into containers and run them on Kubernetes.
  • Professional Scala Development / 2 days / Public & In-Company


Learning Journey

Medior Data Engineer

Learning Goals for a Medior Data Engineer

  • Understands and makes well-reasoned design decisions and trade-offs in their area
  • Able to quickly get familiar with larger codebases
  • Able to create complex pipelines without guidance

Training Courses

  • Apache Airflow / 1 day / Public & In-Company
    This 1-day Xebia training teaches you the internals, terminology, and best practices of writing DAGs, with hands-on experience in writing and maintaining data pipelines.
  • Optimizing Apache Spark & Tuning Best Practices / 2 days / In-Company
    Drawing on the experience we gained at the largest Apache Spark users in the world, we give you an in-depth overview of the do’s and don’ts of one of the most popular analytics engines out there.
  • Create Data Science Products / 2 days / In-Company
    In this course you’ll learn how to productionize data science models efficiently.
  • Concurrency in Scala / 2 days / Public & In-Company

Learning Journey

Senior Data Engineer

Learning Goals for a Senior Data Engineer

  • Go-to expert in one area; understands the broad architecture of the entire system
  • Provides technical advice and weighs in on technical decisions that impact other teams or the company at large

Training Courses



Learning Journey

Cloud Data Engineer

Learning Goals for a Cloud Data Engineer

  • Go-to expert for data engineering in the cloud; understands the services that simplify the architecture of the entire landscape
  • Provides technical advice and weighs in on technical decisions that impact the cloud infrastructure at the company level

Training Courses



Develop the skills of your organization

Find the right courses to grow your team’s Data & AI skills, or design learning journeys at scale to empower your entire organization.

In-Company Training Programs

Data Pipelines with Apache Airflow

Yes, we’re book authors too.

Our experienced data engineers Bas Harenslak and Julian de Ruiter explain how to use Apache Airflow to create efficient and automated pipelines. They draw on their consulting experience at companies like Heineken and Unilever to present relevant use cases and applications.

You will find the following content in the book:

  • Framework foundation and best practices
  • Airflow’s execution and dependency system
  • Testing Airflow DAGs
  • Running Airflow in production


We’re hiring!

Come join our growing team of AI Experts. Check our careers page.

What Are We Looking For

Get in touch with the experts

Have any questions?

Contact Giovanni Lanzani, our Managing Director of Learning and Development, if you want to know more. He’ll be happy to help you!