GoDataDriven Blog

Latest from GoDataDriven

Deleting Your Commit History?
Jochem Loedeman on 09 March 2023
  • Data Engineering
  • Data Science and AI
  • Kedro
  • MLops
  • Open Source
The surprising impact of Kedro’s data catalog
Jordi Smit on 24 February 2023
  • Apache Spark
  • data
  • Open Source
  • Python
Streamlining Data Science Workflows with a Feature Catalog
Roel Bertens on 09 February 2023
Empowering Medical Students to solve a real-world medical problem with AI
James Hayward on 08 February 2023
  • Docker
  • MLops
  • Open Source
  • Python
How to create a Devcontainer for your Python project 🐳
Jeroen Overschie on 21 November 2022
  • MLops
MLOps: why and how to build end-to-end product teams
Daniel Willemsen on 07 November 2022
Optimizing TopK queries in DataFusion
Daniël Heres on 28 September 2022
  • Data Science and AI
  • MLops
  • Open Source
  • Python
How Streamlit will help you get your machine learning products used
Daniel Willemsen on 01 August 2022
DropBlox: Coding Challenge at PyCon DE & PyData Berlin 2022
Yke Rusticus on 27 July 2022
  • Azure
  • Data Engineering
  • Python
adfPy: an intuitive way to build data pipelines with Azure Data Factory
Daniel van der Ende on 25 July 2022
  • Python
Protocols in Python: Why You Need Them
Rogier van der Geer on 25 July 2022
  • Apache Spark
  • Data Engineering
  • Data Science and AI
  • Python
Devil’s in the details: Data Leakage
Erdem Başeğmez on 12 July 2022
  • Apache Spark
  • Data Engineering
  • dbt
DBT’s missing software engineering piece: unit tests
Cor Zuurmond on 27 May 2022
  • Google Cloud Platform
  • Python
Overengineering our assessment sending process
Jordi Smit on 16 May 2022
  • Azure
  • Data Engineering
  • Python
Deploying a Python Azure function as .zip
Jelle Jan Bankert on 11 May 2022
  • Apache Spark
  • Data Engineering
Real distributed image processing with Apache Spark
Kris Geusebroek on 25 April 2022
  • Data Engineering
  • MLops
How to deploy your python project on Databricks
Rogier van der Geer on 20 April 2022
  • Open Source
Encouraging open source contributions lowers your security risks
Giovanni Lanzani on 18 April 2022
  • Analytics Translation
Analytics Translator in Practice: Google Translate Analogy
Mahmoud Khodier on 06 April 2022
  • Data Democratization
Data Literacy: why training is important — and not enough to stay relevant
Renald Buter on 24 March 2022
  • Python
A Practical Guide to Setuptools and Pyproject.toml
Rogier van der Geer on 18 March 2022
  • Analytics Engineering
  • Data Democratization
  • Data Platforms
Data analysts, unleash your inner engineer
Fanny Kassapian on 16 March 2022
  • Azure
  • Data Engineering
  • Python
Deploying an Azure Function with Terraform
Niels Zeilemaker on 11 March 2022
  • Data Science and AI
Why you should care about Data Centric AI
Rens Dimmendaal on 10 March 2022