Win a copy of The Little Book of Impediments (e-book only) this week in the Agile and Other Processes forum!
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic

Hadoop and Statistics

 
Santosh U Prabhu
Greenhorn
Posts: 5
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
We are currently doing a POC for a Hadoop implementation. Most of the users submitting their resumes seem to have statistical backgrounds. Is statistics a necessity for learning Hadoop? I am just beginning to learn Hadoop and I can already see that there is a learning curve and a differing way of approach to implementing Hadoop.
 
Carlos Morillo
Ranch Hand
Posts: 221
Java Python Scala
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
I'd say it depends on the use case. Likely that's for the Analytics and BI aspect or consumers of the output of MapReduce Jobs.

You need UNIX/Linux skills to install and manage a Hadoop cluster.

You need Java skills to understand the framework and to write MapReduce jobs but you can also use some other programming languages as well.

You need some SQL skills to play with Hive.

You need to understand RDBMS to understand their limitations and how NoSQL Databases such as HBase (Hadoop Database) solve certain kind of problems.

At the end there has to be some consumer to get insights and make decisions and these are Analytics and BI software such as Datameer, Tableau, etc.


HTH,

Carlos.
 
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic