Big Data Training

Increase your skills and experience with Hadoop and Big Data solutions and become truly independent of consultancy services. Our skilled trainers have world-class knowledge and years of experience in the field of Big Data.

Data is one of the main ingredients for business intelligence and innovation. For modern organizations, it is essential to develop the competences to manage and use large volumes of data. Data has become as important as the traditional factors of production: land, labor and capital. Classroom training provided by experienced practitioners is one of the most efficient ways to build up knowledge and skills.

Our Big Data training enables professionals to design and implement scalable data infrastructures by using state-of-the-art technology.

Training curriculum

Brochure Big Data Training 2017


Training is organized through our parent company's training department, Xebia Education.

Training for Cloudera and Apache Hadoop

GoDataDriven is the exclusive Cloudera training partner in the Netherlands. We offer public classes and in-house training that fully prepare candidates for the official certification program.

Cloudera Developer for Apache Hadoop

Learn how to import data into your Apache Hadoop cluster and process it with Spark, Hive, Flume, Sqoop, Impala, and other Hadoop ecosystem tools. This four-day hands-on training course delivers the key concepts and expertise participants need to ingest and process data on a Hadoop cluster using the most up-to-date tools and techniques. Employing Hadoop ecosystem projects such as Spark, Hive, Flume, Sqoop, and Impala, this training course is the best preparation for the real-world challenges faced by Hadoop developers. Participants learn to identify which tool is the right one to use in each situation, and gain hands-on experience in developing with those tools.

In 2017, this training is scheduled for: 10 – 13 April, 3 – 6 July, and 10 – 13 October.

Four days, € 2695. Find out more about this training, including full programme, dates and registration.

Cloudera Data Analyst

This training shows analysts and database administrators how to apply traditional data analytics and business intelligence skills to Big Data. Learn the tools data professionals need to access, manipulate, and analyze complex data sets using SQL and familiar scripting languages.

In 2017, this training is scheduled for: 13 – 16 February, 1 – 4 May, and 11 – 14 September.

Four days, € 2695. Find out more about this training, including full programme, dates and registration.

Databricks Spark Training

Apache Spark is a powerful open source processing engine built around speed, ease of use, and sophisticated analytics. Since its release, Spark has seen rapid adoption by enterprises across a wide range of industries. Internet powerhouses such as Yahoo, Baidu, and Tencent have eagerly deployed Spark at massive scale, collectively processing multiple petabytes of data on clusters of over 8,000 nodes. It has quickly become the largest open source community in big data, with over 1,000 contributors from 200+ organizations.

Databricks was founded by the team that created and continues to drive Apache Spark. The company has trained over 20,000 users on Apache Spark, and has the largest number of customers deploying Spark to date.

Databricks Spark Programming

Training for data engineers, analysts, architects, software engineers, IT operations, and technical managers interested in a thorough, hands-on overview of the Apache Spark platform. Each topic includes slide and lecture content along with hands-on use of Spark through the elegant Databricks web-based notebook environment. Inspired by tools like IPython/Jupyter and Matlab, Databricks notebooks allow attendees to code jobs, run data analysis queries, and generate visualizations using their own Spark cluster, accessed through a web browser.

In 2017, this training is scheduled for: 8 – 10 May.

Three days, € 2100. Find out more about the Databricks Spark training.

Neo4j Training

Neo4j is the most popular graph database, implemented in Java. Neo4j stores data structured in graphs rather than in tables.

Neo4j Masterclass

Despite the large number of players in the NoSQL space, graph technology has been by far the fastest growing category of database over the last three years, according to industry monitor DB-Engines. Neo4j is currently the undisputed leader in the graph technology space, so now is the time to get your hands dirty with this great database. During this two-day interactive training, we will take you on a tour through Neo4j and get you ready to use it in your projects.

In 2017, this training is scheduled for: 31 January – 1 February, 15 – 16 May, and 14 – 15 September.

Two days, € 1195. Find out more about this training, including full programme, dates and registration.

Custom Big Data and NoSQL training

If you have a particular training interest, get in touch to discuss a custom-made training. Our data specialists look forward to sharing their expertise!


The material of the training as well as the instructions were really good. The hands-on sessions were very useful.

Amarjeet Jha
ACN