Streaming Architecture at Scale

While some applications have the luxury of processing data in batches, others, such as those in the IoT ecosystem, need events to be ingested and processed as they occur. This training focuses on two key players on the streaming side of data processing: Apache Kafka and Apache Spark.

 

Clients we've helped

  • DSM
  • DuPont
  • Booking.com
  • LEGO
  • Airbus
  • Merck
  • Ahold Delhaize
  • Credit Suisse
  • Shell
  • ING Bank
  • Danone
  • Nike
  • TomTom
  • Verizon

What you'll learn

  • Fundamentals of queue messaging systems
  • Fundamentals of the Kafka architecture
  • Fundamentals of Spark Streaming, including concepts such as checkpointing, watermarking, and streaming windows
  • How to consume and process events from Kafka with Spark
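
To give a feel for two of the concepts listed above, here is a minimal plain-Python sketch of tumbling windows combined with a watermark. It is a simplified illustration, not how Spark implements them: the window length, the allowed lateness, and the function names are assumptions chosen for the example, and the watermark is updated per event rather than per micro-batch.

```python
from collections import defaultdict

WINDOW_SECONDS = 60    # tumbling window length (assumed for this sketch)
ALLOWED_LATENESS = 30  # how far the watermark trails the newest event (assumed)


def window_start(event_time):
    """Return the start of the tumbling window that contains event_time."""
    return event_time - (event_time % WINDOW_SECONDS)


def count_per_window(event_times):
    """Count events per window, dropping events whose window already closed.

    The watermark is the largest event time seen so far minus the allowed
    lateness. An event is dropped when the end of its window is at or
    before the watermark.
    """
    counts = defaultdict(int)
    dropped = []
    max_seen = None
    for t in event_times:
        max_seen = t if max_seen is None else max(max_seen, t)
        watermark = max_seen - ALLOWED_LATENESS
        if window_start(t) + WINDOW_SECONDS <= watermark:
            dropped.append(t)  # too late: its window is already finalized
            continue
        counts[window_start(t)] += 1
    return dict(counts), dropped
```

Calling `count_per_window([5, 50, 70, 130, 20, 125])` counts the first four events and the last one, while the event at time 20 arrives after its window has closed and is dropped.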

The schedule

Kafka
  • How streaming topics work
  • The basics of messaging systems
  • Watermarks
  • The concept of topics
  • Design considerations for a messaging system
  • Run a Kafka cluster with docker-compose
Spark
  • Set up Spark as a consumer for Kafka
  • Process incoming events in real time
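
As a rough preview of the Spark part of the schedule, the sketch below shows one way to read a Kafka topic with Spark Structured Streaming and count events per time window. It is not the course material: the helper names, the window and watermark durations, and the console sink are assumptions for illustration, and it presumes a Spark session started with the Kafka connector package (`spark-sql-kafka-0-10`) available.

```python
def kafka_source_options(bootstrap_servers, topic):
    """Build options for Spark's Kafka source (hypothetical helper)."""
    return {
        "kafka.bootstrap.servers": bootstrap_servers,
        "subscribe": topic,
        "startingOffsets": "latest",
    }


def start_windowed_count(spark, bootstrap_servers, topic, checkpoint_dir):
    """Start a streaming query that counts Kafka events per 5-minute window."""
    # Imported here so this module loads even where PySpark is not installed.
    from pyspark.sql.functions import col, window

    events = (
        spark.readStream
        .format("kafka")
        .options(**kafka_source_options(bootstrap_servers, topic))
        .load()
        # Kafka delivers the payload as bytes; cast it and keep the timestamp.
        .selectExpr("CAST(value AS STRING) AS value", "timestamp")
    )

    counts = (
        events
        .withWatermark("timestamp", "10 minutes")  # tolerate late events
        .groupBy(window(col("timestamp"), "5 minutes"))
        .count()
    )

    return (
        counts.writeStream
        .outputMode("update")
        .format("console")  # print results; swap for a real sink as needed
        .option("checkpointLocation", checkpoint_dir)  # progress for restarts
        .start()
    )
```

With a broker from a local docker-compose setup, a call might look like `start_windowed_count(spark, "localhost:9092", "events", "/tmp/checkpoints")`, where the address, topic name, and path are placeholders.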

This online course is perfect for

IT engineers and architects who work with data stream processing architectures. Basic experience with Python and Apache Spark is required. If you’re not quite there yet, we recommend the Python for Data Engineers and Data Processing at Scale courses, respectively, as preparation for this training.

What will you learn during Streaming Architecture at Scale?

After this training, you will understand how queue messaging systems work, how to route incoming events in real time with Apache Kafka, and how to process them in real time with Apache Spark.

Meet your trainer

Andrew Snare

Big data hacker

Andrew is a Big Data Hacker at GoDataDriven. He is an experienced software engineer with a deep understanding of numerous technologies and languages.

Andrew is a certified Cloudera, Databricks, and Cassandra instructor. He also enjoys sharing his experiences on stage, for example at Goto Conference.

Flexible delivery

The Right Format For Your Preferred Learning Style

  • In-Classroom & In-Company Training
  • Online, Instructor-Led Training
  • Hybrid and Blended Learning
  • Self-Paced Training

Get in touch with the experts

Have any questions?

Contact Gert-Jan Steltenpool, the sales director of GoDataDriven Academy, if you want to know more. He’ll be happy to help you!

You can also reach him by phone at +31 6 4214 0783.
