GoDataDriven blogs

data

General posts

data
GoDataDriven Summer Specials 2014!
Giovanni Lanzani on 13 June 2014
data Hadoop
Configuring Samba4 and Cloudera Manager
godatadriven on 30 May 2014
data
Local and Pseudo-distributed CDH5 Hadoop on your laptop
godatadriven on 22 April 2014
data Hadoop
Refactor Hadoop job: old to new API
godatadriven on 28 March 2014
data
Setting up cross realm trust between Active Directory and Kerberos KDC
godatadriven on 13 March 2014
data
The performance impact of vectorized operations
Giovanni Lanzani on 03 March 2014
data
Merge Mahout item based recommendations results from different algorithms
Giovanni Lanzani on 28 February 2014
data
Kerberos basics and installing a KDC
godatadriven on 28 February 2014
data
Some recommendations in Neo4j
godatadriven on 14 February 2014
data
Convert chararray user ID’s to integers with pig
Giovanni Lanzani on 06 January 2014
data
Bare metal Hadoop provisioning with Ansible and Cobbler
godatadriven on 30 July 2013
data
I Mapreduced a Neo store
godatadriven on 17 June 2013
data
Monotonically increasing row IDs with MapReduce
godatadriven on 30 May 2013
data
Graph partitioning in MapReduce with Cascading (part 2)
godatadriven on 30 May 2013
data
Graph partitioning in MapReduce with Cascading (part 1)
godatadriven on 30 May 2013