Big Data and Data Science training

Develop your skills and become truly independent from consultancy companies.

The most experienced trainers to develop your Big Data and Data Science skills

Increase your skills and experience with Hadoop and Big Data solutions and become truly independent from consultancy services. Our skilled trainers have world-class knowledge and years of experience in the field of Big Data and Data Science.

Training curriculum

Training is organized through our parent company's training department, Xebia Education.

Training for Apache Hadoop

GoDataDriven is the exclusive Cloudera training partner in the Netherlands. We offer public classes and in-house training that fully prepares candidates for the official certification program.

Cloudera Developer for Apache Hadoop

This four-day training delivers the key concepts and expertise necessary for developers to create robust data processing applications using Apache Hadoop. You will develop your own MapReduce jobs and learn how to debug and maintain MapReduce programs.

Four days, € 2695. Find out more about this training, including full programme, dates and registration.

Cloudera Administrator for Apache Hadoop

This four-day training provides you with a comprehensive understanding of all the steps necessary to operate and maintain a Hadoop cluster. From installation and configuration through management, scaling and advanced tuning, this training is the best preparation for the real-world challenges faced by Hadoop administrators.

Four days, € 2695. Find out more about this training, including full programme, dates and registration.

Cloudera Apache HBase

Apache HBase is a distributed, scalable NoSQL database built on Apache Hadoop. In this four-day course you will learn how HBase enables you to store and access massive quantities of multi-structured data, serve data to many users and applications in real time, and provide fast, random read/write access to users and applications.

Four days, € 2695. Find out more about this training, including full programme, dates and registration.

Cloudera Developer for Apache Spark I

This four-day course enables you to build complete, unified Big Data applications combining batch, streaming, and interactive analytics on all your data. With Spark, developers can write sophisticated parallel applications that support faster, better decisions and real-time actions across a wide variety of use cases, architectures, and industries.

Four days, € 2695. Find out more about this training, including full programme, dates and registration.

Cloudera Data Analyst

This four-day course shows analysts and database administrators how to apply traditional data analytics and business intelligence skills to Big Data. Learn the tools data professionals need to access, manipulate, and analyze complex data sets using SQL and familiar scripting languages.

Four days, € 2695. Find out more about this training, including full programme, dates and registration.

Cloudera Building and Designing Big Data Applications

This four-day course prepares developers, engineers and architects to analyze and solve real-world problems using Apache Hadoop and associated tools in the enterprise data hub. Go beyond MapReduce to use additional elements of the enterprise data hub and develop applications that are highly relevant to your business.

Four days, € 2695. Find out more about this training, including full programme, dates and registration.

Databricks Spark training

This three-day training is for data engineers, analysts, and architects; software engineers; IT operations staff; and technical managers interested in a thorough, hands-on overview of the Apache Spark platform. Each topic includes slide and lecture content along with hands-on use of Spark through the Databricks web-based notebook environment. Inspired by tools like IPython/Jupyter and MATLAB, Databricks notebooks allow attendees to code jobs, write data analysis queries, and generate visualizations using their own Spark cluster, accessed through a web browser.

Three days, € 2100. Find out more about the Databricks Spark training.

Training for Apache Cassandra

Apache Cassandra is an open-source distributed database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure.

Cassandra Core Concepts, Skills, and Tools

This three-day training teaches the fundamentals of Cassandra 2.0 architecture, installation, configuration, administration, and tooling. You will become familiar with the Cassandra data model and the Cassandra Query Language, and learn how compaction works.

Three days, € 1795. Find out more about this training, including full programme, dates and registration.

Cassandra Operations and Performance

This three-day course teaches the specific operational and performance-tuning skills needed to administer an Apache Cassandra cluster. You will gain hands-on experience in implementing clusters that span multiple data centers and learn how to tune them for performance.

Three days, € 1795. Find out more about this training, including full programme, dates and registration.

Neo4j Training

Neo4j is the most popular graph database and is implemented in Java. It stores data structured as graphs rather than in tables.

Introduction to Neo4j

This one-day course helps developers and executives to understand graph databases and the core functionality of the Neo4j graph database. With a mixture of theory and hands-on practice sessions, you will quickly learn how easy it is to work with a powerful graph database using the Cypher query language.

One day, € 199. Find out more about this training, including full programme, dates and registration.

Graph Data Modeling with Neo4j

In this one-day course you will learn how to design and implement a graph data model and associated queries, and how to apply the property graph model to solve common modeling problems. You will also learn how to evolve an existing graph in a controlled manner to accommodate new or changing requirements.

One day, € 299. Find out more about this training, including full programme, dates and registration.

Custom Big Data and NoSQL training

We offer a number of training programs that were developed as in-house training for our clients. If you are interested in one of the following courses for your organization, please contact our training staff at Xebia Education. If you have a particular training interest, get in touch to discuss a custom-made training. Our data specialists look forward to sharing their expertise!

Big Data network analysis using Hadoop and Neo4j

Networks are everywhere! Whether the networks are social, financial or organizational, modelling data with networks and graph theory is becoming increasingly common. This training will get you started with large-scale network analysis using Hadoop for Big Data processing and the popular Neo4j graph database for interactive network analysis and pattern exploration.

During this class you will work hands-on with a Hadoop cluster and Neo4j infrastructure on a substantial, real-life dataset.

Topics:
  • Introduction to Hadoop and Neo4j
  • HiveQL query language (SQL on a Hadoop cluster)
  • Cypher graph query language (query language of Neo4j)
  • Hands-on: analyzing a large dataset (hundreds of thousands of emails) using Hive and Neo4j

Architecting NoSQL-based solutions

The hype around NoSQL and related technology is starting to settle. The endless product landscape is starting to show who came out on top and which technologies are here to stay. If you haven't done so yet, now is the time to take a serious look at NoSQL-based solutions.

In this one-day, highly interactive workshop, you will be introduced to everything you need to know about NoSQL as a software architect. This includes the strengths and weaknesses of different NoSQL product offerings, how consistency works in non-relational databases, and which design considerations to keep in mind when going NoSQL.

Topics:
  • Opening: why NoSQL, and an introduction to consistency models
  • Document databases (including MongoDB)
  • Apache Hadoop, HBase and Big Data processing
  • Graph databases (including Neo4j)
  • Case studies: during the day we will review two real-life case studies of successful NoSQL technology implementations
  • NoSQL systems: hardware, maintenance, operations, development

The material of the training as well as the instructions were really good. The hands-on sessions were very useful.

Amarjeet Jha
ACN