• Post Reply Bookmark Topic Watch Topic
  • New Topic
programming forums Java Mobile Certification Databases Caching Books Engineering Micro Controllers OS Languages Paradigms IDEs Build Tools Frameworks Application Servers Open Source This Site Careers Other Pie Elite all forums
this forum made possible by our volunteer staff, including ...
Marshals:
  • Campbell Ritchie
  • Jeanne Boyarsky
  • Ron McLeod
  • Paul Clapham
  • Liutauras Vilda
Sheriffs:
  • paul wheaton
  • Rob Spoor
  • Devaka Cooray
Saloon Keepers:
  • Stephan van Hulst
  • Tim Holloway
  • Carey Brown
  • Frits Walraven
  • Tim Moores
Bartenders:
  • Mikalai Zaikin

Hadoop - mean time to productivity

 
Ranch Hand
Posts: 100
  • Mark post as helpful
  • send pies
    Number of slices to send:
    Optional 'thank-you' note:
  • Quote
  • Report post to moderator
Chuck,

In your experience(s), how quickly does one go from Hadoop newbie to using it with success?

Sometimes, I feel like I've avoided trying/getting into Hadoop because there seems to be a lot to it - clusters, hdfs, etc. Could I spend a weekend fooling around and end up pretty comfortable?
 
Author
Posts: 12617
IntelliJ IDE Ruby
  • Mark post as helpful
  • send pies
    Number of slices to send:
    Optional 'thank-you' note:
  • Quote
  • Report post to moderator
Depends on what you mean by "comfortable". I'd say no.
 
andrew ennamorato
Ranch Hand
Posts: 100
  • Mark post as helpful
  • send pies
    Number of slices to send:
    Optional 'thank-you' note:
  • Quote
  • Report post to moderator
Good question. Comfortable enough to setup a small cluster, put some files on HDFS, run a few m/r tasks, etc. Not be an expert or anything, but enough to try out a few ideas and see how it works.
 
author
Posts: 12
  • Mark post as helpful
  • send pies
    Number of slices to send:
    Optional 'thank-you' note:
  • Quote
  • Report post to moderator
I've seen a number of courses in universities where students are expected to get up to speed on Hadoop in about 2-4 weeks. My memory is a bit vague on this one, but I do remember somewhere that a mid-term homework assignment was to implement PageRank over Wikipedia articles using Hadoop. I would certainly consider that a "comfortable" level.

Of course, your learning curve will vary depending on your background and available resources. The courses I referred to above almost always require "distributed systems" as a prerequisite. The classes also usually have a test cluster already set up. If you're setting one up yourself, factor in some time on learning systems administration.
 
andrew ennamorato
Ranch Hand
Posts: 100
  • Mark post as helpful
  • send pies
    Number of slices to send:
    Optional 'thank-you' note:
  • Quote
  • Report post to moderator
Chuck,

Good to know. I've been wanting to pick up a copy of your book just for the M/R chapters anyway (maybe to use as a guide for CouchDB's internal M/R functionality), so if I don't win one I'll spring for it regardless.

Thanks,

andy
 
David Newton
Author
Posts: 12617
IntelliJ IDE Ruby
  • Mark post as helpful
  • send pies
    Number of slices to send:
    Optional 'thank-you' note:
  • Quote
  • Report post to moderator
Any of the MapReduce papers out there would be more than adequate for that, I think.
 
reply
    Bookmark Topic Watch Topic
  • New Topic