Godatadriven blogs

Data Engineering

Data Engineering Data Science and AI Kedro MLops Open Source
The surprising impact of Kedro’s data catalog
Jordi Smit on 24 February 2023
Azure Data Engineering Python
adfPy: an intuitive way to build data pipelines with Azure Data Factory
Daniel van der Ende on 25 July 2022
Apache Spark Data Engineering Data Science and AI Python
Devil’s in the details: Data Leakage
Erdem Başeğmez on 12 July 2022
Apache Spark Data Engineering dbt
DBT’s missing software engineering piece: unit tests
Cor Zuurmond on 27 May 2022
Azure Data Engineering Python
Deploying a Python Azure function as .zip
Jelle Jan Bankert on 11 May 2022
Apache Spark Data Engineering
Real distributed image processing with Apache Spark
Kris Geusebroek on 25 April 2022
Data Engineering MLops
How to deploy your python project on Databricks
Rogier van der Geer on 20 April 2022
Azure Data Engineering Python
Deploying an Azure Function with Terraform
Niels Zeilemaker on 11 March 2022
Analytics Engineering Data Engineering Open Source
Airbyte, the open-source data ingester
[email protected] on 09 March 2022
Analytics Engineering Data Engineering Data Governance Data Platforms dbt Python
dbt + SODA: how to manage your data at scale
Guillermo Sánchez Dionis on 08 March 2022
Azure Data Engineering Data Platforms Python
Putting the Factory in Azure Data Factory: Dynamically generated Pipelines
Daniel van der Ende on 21 December 2021
Data Engineering Data Platforms
Data Mesh – a review
Niels Zeilemaker on 20 December 2021
Data Engineering Data Science and AI Python
Python 3.10 Introduces better error messaging
Herbert van Leeuwen on 09 September 2021
Data Engineering Data Science and AI Python
Python 3.10 introduces Pattern Matching
Giovanni Lanzani on 10 August 2021
Data Engineering
An Agile Approach to Building Data Pipelines
Steven Nooijen on 24 June 2021
Analytics Engineering data Data Engineering Data Platforms dbt
Build data pipelines using dbt on Databricks
Data Engineering
Using Draw.io diagrams as Grafana Dashboard
godatadriven on 19 February 2021
Apache Spark Data Engineering
Why Dask if I may ask?
Roel Bertens on 18 February 2021
Apache Spark Data Engineering Data Platforms Open Source
Making joins faster in DataFusion based on table statistics
Daniël Heres on 22 December 2020
Data Engineering Data Science and AI
BaaS: Backtest, optimize and discover
Diederik Greveling on 06 October 2020
Apache Spark Data Engineering Data Platforms Open Source
Spark on Kubernetes with Argo and Helm
godatadriven on 02 August 2020
Apache Airflow Data Engineering Open Source Technology
Highlights of the Apache Airflow 1.10.10 release
godatadriven on 12 April 2020
Data Engineering Data Science and AI
To the moon with BaaS
Diederik Greveling on 10 April 2020
AWS Data Engineering
Distributed training a DIY AWS SageMaker model
godatadriven on 28 March 2020