Greetings, Ladies and Gentlemen!
I need some guidelines about how to start with Hadoop. I need a bunch of two things: a book and an installation. Their versions should match
Can you give me some advice on this?
I have 32bit Windows 7 with Debian 7 inside my VirtualBox.
Questions:
Is there a big difference between Hadoop releases? I mean 1x and 2x.
There is a Hadoop download on Apache site. But I have seen many opinions that are saying that direct installation is a big pain. Is this correct? I just want to setup a single-node cluster to play with it.
There are Cloudera packs. But unfortunately they are for 64bit machines, as far as I understand.
There is a Horton sandbox I have been downloading for the last 30 minutes.
Something else?
What you can recommend me?
And also I need a book that describes the version I am going to install more or less precisely. Hadoop Definitive Guide is from 2012 - is it still up to date? I cannot figure which version of Hadoop it describes.