
Prerequisites for learning Hadoop?

David Payne
Ranch Hand

Joined: May 27, 2012
Posts: 35
Can someone tell me what the prerequisites for learning Hadoop are? I know Java and C++ well. Does knowledge of these or any other languages help in learning Hadoop?

Thanks.
chris webster
Bartender

Joined: Mar 01, 2009
Posts: 1848

From What Is Apache Hadoop:

Working directly with Java APIs can be tedious and error prone. It also restricts usage of Hadoop to Java programmers. Hadoop offers two solutions for making Hadoop programming easier.

Pig is a programming language that simplifies the common tasks of working with Hadoop: loading data, expressing transformations on the data, and storing the final results. Pig's built-in operations can make sense of semi-structured data, such as log files, and the language is extensible using Java to add support for custom data types and transformations.

Hive enables Hadoop to operate as a data warehouse. It superimposes structure on data in HDFS and then permits queries over the data using a familiar SQL-like syntax. As with Pig, Hive's core capabilities are extensible.
Choosing between Hive and Pig can be confusing. Hive is more suitable for data warehousing tasks, with predominantly static structure and the need for frequent analysis. Hive's closeness to SQL makes it an ideal point of integration between Hadoop and other business intelligence tools.

There are also plenty of video tutorials and other material on the Cloudera site.


No more Blub for me, thank you, Vicar.
Sachin rakesh
Greenhorn

Joined: Apr 04, 2012
Posts: 14
Hadoop deals with the analysis of big data. Since the tool is built in Java, knowing an object-oriented programming language is an added advantage. Beyond that, you should be comfortable with the concepts of web analytics, data analysis, data warehousing, and distributed computing.
David Payne
Ranch Hand

Joined: May 27, 2012
Posts: 35
Sachin rakesh wrote: Hadoop deals with the analysis of big data. Since the tool is built in Java, knowing an object-oriented programming language is an added advantage. Beyond that, you should be comfortable with the concepts of web analytics, data analysis, data warehousing, and distributed computing.


Can you recommend some books on these subjects? Is there a Hadoop book that covers all of these topics?
Sachin rakesh
Greenhorn

Joined: Apr 04, 2012
Posts: 14
Hadoop: The Definitive Guide by Tom White. This book is targeted at Hadoop beginners, so try that one. You can also reach Hadoop experts by posting your question on a Hadoop forum.
Happy learning.
David Payne
Ranch Hand

Joined: May 27, 2012
Posts: 35
Sachin rakesh wrote: Hadoop: The Definitive Guide by Tom White. This book is targeted at Hadoop beginners, so try that one. You can also reach Hadoop experts by posting your question on a Hadoop forum.
Happy learning.


Right now, all I know is the Java part of Hadoop. So, approximately how much time would it take to learn Hadoop and become proficient enough to do entry-level "company projects"?
David Payne
Ranch Hand

Joined: May 27, 2012
Posts: 35
bounce
Junilu Lacar
Bartender

Joined: Feb 26, 2001
Posts: 5018

David,

Please don't post your questions in multiple forums. This question is essentially a repeat of this: http://www.coderanch.com/t/584427/java/java/much-time-approx-it-learn


Junilu - [How to Ask Questions] [How to Answer Questions]
momin shakeeb
Greenhorn

Joined: Jul 03, 2013
Posts: 2
I want to learn Hadoop, but I don't know any object-oriented language such as Java or .NET. Is it necessary to learn one of these first? I'm planning to learn .NET first. Is that right? And what should I do to learn data warehousing?
Hussein Baghdadi
clojure forum advocate
Bartender

Joined: Nov 08, 2003
Posts: 3479

Personally, I would say that you don't need to know an OOP language to start coding for Hadoop. In fact, that is a good thing!

Why?

Because big data crunching is about processing massive amounts of streaming data: filtering, piping, and aggregating. Functional programming languages are the perfect fit for this. In functional programming languages you deal with data structures, lazy evaluation, and functions.

When using an OOP language for big data, you will get the same feeling you get when trying to bridge the gap between a database and objects (the object-relational impedance mismatch).
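
To make the filter, pipe, and aggregate idea concrete, here is a minimal sketch in plain Python. It is not Hadoop code; it only shows the style on a small scale. The log format ("LEVEL message") and all function names are assumptions made up for this illustration.

```python
from collections import Counter


def parse(lines):
    """Lazily split each non-empty line into a (level, message) pair."""
    for line in lines:
        level, _, message = line.strip().partition(" ")
        if level:  # skip blank lines
            yield level, message


def only_errors(records):
    """Filter step: keep only records whose level is ERROR."""
    return (rec for rec in records if rec[0] == "ERROR")


def count_by_message(records):
    """Aggregate step: count how often each message occurs."""
    return Counter(message for _, message in records)


def error_summary(lines):
    """Pipe the steps together; nothing is read until the final count."""
    return count_by_message(only_errors(parse(lines)))
```

Each step is a small function over a stream of plain tuples, and the generators are lazy, so the input could be a file far larger than memory. There are no classes to map the data onto, which is the point being made above.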
Hussein Baghdadi
clojure forum advocate
Bartender

Joined: Nov 08, 2003
Posts: 3479

That said, you can use Hadoop with many programming languages, not only Java.
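
One route to using another language is Hadoop Streaming, where the mapper and reducer are ordinary programs that read lines from standard input and write tab-separated key/value lines to standard output. The sketch below is a word count written in that style in Python. The function names are hypothetical, and the functions take any iterable of lines so they can be tried without a cluster; in a real job each would sit in its own script reading `sys.stdin`.

```python
from itertools import groupby


def mapper(lines):
    """Map step: emit 'word<TAB>1' for every word in every input line."""
    for line in lines:
        for word in line.split():
            yield f"{word.lower()}\t1"


def reducer(sorted_lines):
    """Reduce step: sum the counts for each word.

    Assumes the input is sorted by key, as it is when Hadoop hands
    mapper output to a reducer, so equal words arrive next to each other.
    """
    pairs = (line.rstrip("\n").split("\t", 1) for line in sorted_lines)
    for word, group in groupby(pairs, key=lambda pair: pair[0]):
        total = sum(int(count) for _, count in group)
        yield f"{word}\t{total}"
```

Locally, the shuffle between the two steps can be imitated with `sorted(mapper(lines))` before passing the result to `reducer`.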
momin shakeeb
Greenhorn

Joined: Jul 03, 2013
Posts: 2
Thank you, Hussein Baghdadi, for your kind reply. I would also like to know how to learn the web analytics, data analysis, data warehousing, and distributed computing concepts that are needed for Hadoop. What should I do to learn these things? Does SQL Server 2008 cover any of them, or do I have to study Oracle DBA, or something else? I don't know.
 