Godatadriven blogs

blogposts

General
Can external people be productive data scientists?
godatadriven 11 March 2015
General
Distance calculation with Impala (or Hive)
godatadriven 05 February 2015
General
About Big Data
godatadriven 27 January 2015
General
Variable Selection in Machine Learning
godatadriven 05 December 2014
Data Science General
Data Science – It’s all about people!
godatadriven 20 November 2014
General
Data SaaS
Giovanni Lanzani 13 November 2014
General
Casino Gambling Simulations in R
godatadriven 20 October 2014
General
Replace Hive CLI with Beeline on a cluster with Sentry
godatadriven 11 August 2014
General
Hive, Impala, JDBC and Kerberos
godatadriven 08 August 2014
General
Upgrade secure cluster CDH4.5.0 to CDH5.0.3
godatadriven 01 August 2014
General
Automated install of CDH5 Hadoop on your laptop with Ansible
godatadriven 07 July 2014
General
GoDataDriven Summer Specials 2014!
Giovanni Lanzani 13 June 2014
General
Configuring Samba4 and Cloudera Manager
godatadriven 30 May 2014
General
Local and Pseudo-distributed CDH5 Hadoop on your laptop
godatadriven 22 April 2014
General
Refactor Hadoop job: old to new API
godatadriven 28 March 2014
Build Data Engineering
Setting up Kerberos authentication for Hadoop with Cloudera Manager
godatadriven 18 March 2014
General
Setting up cross realm trust between Active Directory and Kerberos KDC
godatadriven 13 March 2014
General
The performance impact of vectorized operations
Giovanni Lanzani 03 March 2014
General
Merge Mahout item based recommendations results from different algorithms
Giovanni Lanzani 28 February 2014
General
Kerberos basics and installing a KDC
godatadriven 28 February 2014
General
Some recommendations in Neo4j
godatadriven 14 February 2014
General
Convert chararray user ID’s to integers with pig
Giovanni Lanzani 06 January 2014
General
Bare metal Hadoop provisioning with Ansible and Cobbler
godatadriven 30 July 2013
General
I Mapreduced a Neo store
godatadriven 17 June 2013
General
Monotonically increasing row IDs with MapReduce
godatadriven 30 May 2013
General
Graph partitioning in MapReduce with Cascading (part 2)
godatadriven 30 May 2013
General
Graph partitioning in MapReduce with Cascading (part 1)
godatadriven 30 May 2013
Page 7 of 7