Data Engineering
Data Science and AI
Kedro
MLops
Open Source
The surprising impact of Kedro’s data catalog
Jordi Smit
on 24 February 2023
Apache Spark
data
Open Source
Python
Streamlining Data Science Workflows with a Feature Catalog
Roel Bertens
on 09 February 2023
Docker
MLops
Open Source
Python
How to create a Devcontainer for your Python project 🐳
Jeroen Overschie
on 21 November 2022
Data Science and AI
MLops
Open Source
Python
How Streamlit will help you get your machine learning products used
Daniel Willemsen
on 01 August 2022
Azure
Data Engineering
Python
adfPy: an intuitive way to build data pipelines with Azure Data Factory
Daniel van der Ende
on 25 July 2022
Python
Protocols in Python: Why You Need Them
Rogier van der Geer
on 25 July 2022
Apache Spark
Data Engineering
Data Science and AI
Python
Devil’s in the details: Data Leakage
Erdem Başeğmez
on 12 July 2022
Apache Spark
Data Engineering
dbt
DBT’s missing software engineering piece: unit tests
Cor Zuurmond
on 27 May 2022
Google Cloud Platform
Python
Overengineering our assessment sending process
Jordi Smit
on 16 May 2022
Azure
Data Engineering
Python
Deploying a Python Azure function as .zip
Jelle Jan Bankert
on 11 May 2022
Apache Spark
Data Engineering
Real distributed image processing with Apache Spark
Kris Geusebroek
on 25 April 2022
Open Source
Encouraging open source contributions lowers your security risks
Giovanni Lanzani
on 18 April 2022
Python
A Practical Guide to Setuptools and Pyproject.toml
Rogier van der Geer
on 18 March 2022
Azure
Data Engineering
Python
Deploying an Azure Function with Terraform
Niels Zeilemaker
on 11 March 2022
Analytics Engineering
Data Engineering
Open Source
Airbyte, the open-source data ingester
lassebenninga@godatadriven.com
on 09 March 2022
Analytics Engineering
Data Engineering
Data Governance
Data Platforms
dbt
Python
dbt + SODA: how to manage your data at scale
Guillermo Sánchez Dionis
on 08 March 2022
dbt
Open Source
Technology
10 data tools to watch in 2022
Niels Zeilemaker
on 28 December 2021
Azure
Data Engineering
Data Platforms
Python
Putting the Factory in Azure Data Factory: Dynamically generated Pipelines
Daniel van der Ende
on 21 December 2021
Technology
Minimal pyproject.toml example
Niels Zeilemaker
on 10 December 2021
Data Science and AI
Python
Doing business, the Bayesian way (Part 2)
Vadim Nelidov
on 07 December 2021
Data Science and AI
Python
Doing business, the Bayesian way (Part 1)
Vadim Nelidov
on 30 November 2021
Data Engineering
Data Science and AI
Python
Python 3.10 Introduces better error messaging
Herbert van Leeuwen
on 09 September 2021
Data Engineering
Data Science and AI
Python
Python 3.10 introduces Pattern Matching
Giovanni Lanzani
on 10 August 2021
Data Science and AI
Python
Shadow and Solar Panels: a Short Analysis
Rogier van der Geer
on 21 May 2021