This week's book giveaway is in the Reactive Progamming forum.
We're giving away four copies of Reactive Streams in Java: Concurrency with RxJava, Reactor, and Akka Streams and have Adam Davis on-line!
See this thread for details.
Win a copy of Reactive Streams in Java: Concurrency with RxJava, Reactor, and Akka Streams this week in the Reactive Progamming forum!
  • Post Reply Bookmark Topic Watch Topic
  • New Topic
programming forums Java Mobile Certification Databases Caching Books Engineering Micro Controllers OS Languages Paradigms IDEs Build Tools Frameworks Application Servers Open Source This Site Careers Other all forums
this forum made possible by our volunteer staff, including ...
Marshals:
  • Campbell Ritchie
  • Liutauras Vilda
  • Junilu Lacar
  • Jeanne Boyarsky
  • Bear Bibeault
Sheriffs:
  • Knute Snortum
  • Tim Cooke
  • Devaka Cooray
Saloon Keepers:
  • Ron McLeod
  • Stephan van Hulst
  • Tim Moores
  • Tim Holloway
  • Carey Brown
Bartenders:
  • Piet Souris
  • Frits Walraven
  • Ganesh Patekar

How closely coupled is Mahout to Hadoop and MapReduce?

 
Ranch Hand
Posts: 74
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Hi,

Thanks for your answers to my "Questions about Mahout's maturity" post.

Are the algorithms implemented by Mahout all implemented in Hadoop, i.e., does using Mahout imply that the problem can be implemented in Hadoop and is therefore amenable to implementation in MapReduce? As Mahout is about scalability, I suppose this question is getting at whether all of its scalability is, in the end, based on MapReduce.

Also, is there something in the book about running on Google AppEngine (GAE) and Amazon EC2?

Thanks,
Glenn
 
Ranch Hand
Posts: 172
Redhat Ruby C++
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
I was about to ask the same thing. About one of the wuestions that you made, I read is that the GAE would do MapReduce jogs soon.
 
author
Posts: 21
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Most of it is based on Hadoop / MapReduce, yes. Not all of it is though, in particular a lot of the recommender code, which also has a significant non-distributed presence.

I don't think you can run Hadoop on GAE? Or at least I have not heard that you can, nor tried. I have personally run it on EC2. The book has a few pages on running Hadoop jobs on EC2; it's generally quite straightforward if you understand what's going on when you run it locally.
 
Ranch Hand
Posts: 63
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Most of mahout algorithms can be run on a in memory mode...
You need not have Map Reduce, however Map Reduce is required to run large datasets
 
Uh oh, we're definitely being carded. Here, show him this tiny ad:
Java file APIs (DOC, XLS, PDF, and many more)
https://products.aspose.com/total/java
  • Post Reply Bookmark Topic Watch Topic
  • New Topic
Boost this thread!