Godatadriven blogs

Tools & Tech

Azure Build Cloud Data Engineer Data Engineering Python
Deploying a Python Azure function as .zip
Jelle Jan Bankert on 11 May 2022
Open source
Encouraging open source contributions lowers your security risks
Giovanni Lanzani on 18 April 2022
Azure Build Cloud Data Engineer Data Engineering Python
Deploying an Azure Function with Terraform
Niels Zeilemaker on 11 March 2022
Analytics Engineering Data Engineering Open source
Airbyte, the open-source data ingester
lassebenninga@godatadriven.com on 09 March 2022
dbt Open source Tools & Tech
10 data tools to watch in 2022
Niels Zeilemaker on 28 December 2021
Azure Cloud Data Engineer Data Engineering Data Platforms Python
Putting the Factory in Azure Data Factory: Dynamically generated Pipelines
Daniel van der Ende on 21 December 2021
Tools & Tech
Minimal pyproject.toml example
Niels Zeilemaker on 10 December 2021
Dask Data Engineering Spark
Why Dask if I may ask?
Roel Bertens on 18 February 2021
Data Engineering Data Platforms Open source Spark
Making joins faster in DataFusion based on table statistics
Daniël Heres on 22 December 2020
Analytics Translator Azure Tools & Tech
How to build your own Covid-19 search engine
godatadriven on 29 October 2020
Data Engineering Data Platforms Open source Spark
Spark on Kubernetes with Argo and Helm
godatadriven on 02 August 2020
Data Science Open source Tools & Tech
The strength of the data community and the beauty of open source
Giovanni Lanzani on 12 June 2020
Airflow Data Engineering Open source Tools & Tech
Highlights of the Apache Airflow 1.10.10 release
godatadriven on 12 April 2020
AWS Data Engineering
Distributed training a DIY AWS SageMaker model
godatadriven on 28 March 2020
Data Engineering Open source Senior Data Engineer Spark
B.EFFICIENT – Large scale Spark optimisation
godatadriven on 06 March 2020
General Open source
GoDataDriven Open Source Contribution for Q4 2019
godatadriven on 07 February 2020
Airflow General
Early Access of Apache Airflow book
godatadriven on 30 October 2019
General Open source
GoDataDriven Open Source Contribution for Q3 2019
Barend Garvelink on 21 October 2019
Airflow Data Engineering Tools & Tech
Deploying Apache Airflow on Azure Kubernetes Service
godatadriven on 28 June 2019
Airflow Tools & Tech
Highlights from the new Apache Avro 1.9.0 release
godatadriven on 14 May 2019
Airflow General
Introducing Pylint-Airflow
godatadriven on 12 May 2019
Airflow Tools & Tech
Testing and debugging Apache Airflow
godatadriven on 21 February 2019
Airflow Tools & Tech
The Zen of Python and Apache Airflow
godatadriven on 17 February 2019
Airflow Tools & Tech
GoDataDriven Open Source Contribution for January 2019, the Apache Edition
Giovanni Lanzani on 13 February 2019