The role of Data Engineer can be considered as one of the most sought after positions. What does it mean to be a data engineer and how does it differ from being a software engineer or being a data scientist? Fokko Driesprong, data engineer at GoDataDriven, took the time to give us a glimpse of his daily life.
As soon as I have arrived at the client’s office and have brewed myself a cup of coffee, the first thing I do is open Slack to see if there are any notifications from Apache Airflow. Usually, these are low-priority notifications, as all urgent notifications are always taken care of upon the first notification. If it’s an easy fix, I’ll get on it right away. If not, I’ll schedule it for later or delegate the issue to a team member.
At 9.30 am, the team gathers for the daily stand-up. This is where we discuss yesterday’s progression, today’s schedule, and any issues we’re having.
On Thursdays every other week we discuss the sprint schedule to determine what we’ll be working on.
Picking The Best Technology for Any Project
My job requires me to work on very diverse projects. The thing that I enjoy most is employing the latest technology for every client, preferably open-source.
For instance, right now I’m working on containerising event stream collectors, to run them on a Kubernetes cluster. It’s always a joy to work with our in-house developed open-source clickstream collector, Divolte, but I don’t mind using other collectors too.
For another client, I recently implemented a Druid platform. There, I had the opportunity to work with Deck.gl, a Superset integration to visualise data in 3D on a map. The cool thing is that Superset was developed by AirBnB as a data exploration tool and that this specific integration was developed by Uber.
Business-Critical Systems with Terabytes of Data
More and more of my work takes place in the cloud, for example setting up date infrastructure on AWS or GCP. When configuring buckets, I make sure that sensitive information – such as personal details – is properly protected using service accounts.
Because I often work on business-critical systems with terabytes of data flowing through them, it’s important for me to be well versed in the underlying technology. If the systems crash, end users no longer get recommendations or the managers’ figures run out of date. When that happens, people get nervous. Naturally, that’s something I always try to prevent.
New tools are made available every day, so needless to say, I’m constantly working to keep my knowledge up-to-date. I read a lot of blogs and I keep up with interesting projects on Github. At GoDataDriven, continuous knowledge sharing is ensured by for example our biweekly knowledge exchanges on Tuesdays and the GoDataDriven Fridays. During this day, everyone gets time to work on their own projects. Me and my colleagues receive a generous training budget, which I use to attend conferences such as Berlin Buzzwords, Spark Summit or Strata Hadoop. As colleagues, we also regularly consult with each other to discuss solutions to challenging issues.
I think it’s important for any organization to develop the ability to work with the system or product we developed independently. That’s why I don’t mind spending time with the client behind the MacBook to give them instructions.
Interested in working alongside Fokko?
Your passion is data and analytics. You help our clients with uncovering, organizing, and making sense of their data. You feel comfortable operating in the sweet spot between analytics translation, data science, and data engineering. After unearthing the needs of the business, you make sure the right data is available for creating a valuable solution with analytics. Your beautiful data visualizations allow for a quick understanding, providing the business with direct, actionable insights.
As a machine learning engineer, you work together with data scientists and data engineers on the team to build production-ready, scalable applications driven by data and AI in the Amsterdam area.
You’re a scientist or engineer who is not afraid of maths and who understands how to take models into production. Bringing something from laptop to cluster takes skill and courage and you’re not afraid to do just that.
You are passionate about helping organizations drive their success with Data & AI. You feel comfortable operating at the sweet spot between UX, Tech and Business. And, after uncovering the needs of the business, you’re not afraid to take ownership and make sure the right solution goes into production.
You’re an engineer at heart with a pragmatic attitude and the responsibility of someone maintaining production systems. You easily switch between scripting and structured programming in typed languages. You understand failure modes in distributed systems and you’re passionate about provisioning and automation. And you know how to build robust systems.
A role as data engineer sounds great but there are a lot of new skills and technologies to master and you do not have the time. However, you already have more than two years of experience in a job as a software engineer or dealing with data in another way that is not Excel. We offer you the chance to start as a data engineer at GoDataDriven, kicked off by a month-long intensive boot camp with a lot of 1:1 training with our very own data engineers.
We are looking for senior data scientists who feel at home at the intersection between science, mathematics, machine learning, business, and coaching. You have experience in building value from data through ML and software.
You’re a true leader in marketing with a proven track record in writing, designing and producing compelling content and events. You see, hear and feel stories that need to be shared and make it happen. You find it hard not to share your excitement about new technology with anyone you meet.
Are you ready to take responsibility for the digital campaign management of this AI consultancy boutique? Does your mind fizz with result-driven ideas and do you walk your talk? Then this might be the role for you.