• Post Reply Bookmark Topic Watch Topic
  • New Topic
programming forums Java Mobile Certification Databases Caching Books Engineering Micro Controllers OS Languages Paradigms IDEs Build Tools Frameworks Application Servers Open Source This Site Careers Other Pie Elite all forums
this forum made possible by our volunteer staff, including ...
Marshals:
  • Campbell Ritchie
  • Ron McLeod
  • Paul Clapham
  • Devaka Cooray
  • Liutauras Vilda
Sheriffs:
  • Jeanne Boyarsky
  • paul wheaton
  • Henry Wong
Saloon Keepers:
  • Stephan van Hulst
  • Tim Holloway
  • Tim Moores
  • Carey Brown
  • Mikalai Zaikin
Bartenders:
  • Lou Hamers
  • Piet Souris
  • Frits Walraven

Mahout in Action - evolution of the library and the book

 
Greenhorn
Posts: 3
Eclipse IDE Chrome Java
  • Mark post as helpful
  • send pies
    Number of slices to send:
    Optional 'thank-you' note:
  • Quote
  • Report post to moderator
First of all greetings to the authors and thanks for being available for questions.

In the past few years we've seen quite some progress in all Hadoop related projects, including Mahout.
How can the books keep up with the pace of change in the Hadoop ecosystem?
Do you think as Mahout is still heavily under construction and many more features and algorithms yet to come, can we expect a second release of the book later on? Is it in plan to follow the roadmap and release an update of the book too after Mahout reaches certain milestones in the future?

Thank you for your answers.

Robert
 
author
Posts: 21
  • Mark post as helpful
  • send pies
    Number of slices to send:
    Optional 'thank-you' note:
  • Quote
  • Report post to moderator
I really don't know, but I can take wild guesses.

Things are changing fast. Hadoop is an excellent tool for its purpose and is actually getting somewhat mature. It is not an ideal tool for machine learning algorithms. I would not be surprised if some of the other distributed computing frameworks that are emerging, which are designed for a bit more general purpose application, become more popular within a few years for stuff like this. But that's still a few years off at least.

And if that changes, I would not be surprised if Mahout (or another project) changes to reimplement on another framework.

For now I think Mahout has figured out its identity: clustering, classification, collaborative filtering on top of Hadoop. It implements a lot of stuff, and in my opinion has a fair bit of work to do to polish and document what's there. I do not anticipate big changes in what it does, but I do anticipate refinement.

There are no plans for a second edition of the book at this point, as it would be years away at least. The final version of the book is written for Mahout 0.5, which is recent as of a few months ago, and that should remain a useful guide for versions of Mahout for the next 1-2 years.
 
Greenhorn
Posts: 11
  • Mark post as helpful
  • send pies
    Number of slices to send:
    Optional 'thank-you' note:
  • Quote
  • Report post to moderator
Please note that the examples from the book are available on-line and we will keep these examples up-to-date with the latest Mahout. Some of the examples are integrated into Mahout so we won't be able to change Mahout too much without updating those.
 
Why fit in when you were born to stand out? - Seuss. Tiny ad:
We need your help - Coderanch server fundraiser
https://coderanch.com/wiki/782867/Coderanch-server-fundraiser
reply
    Bookmark Topic Watch Topic
  • New Topic