This week's book giveaway is in the OO, Patterns, UML and Refactoring forum. We're giving away four copies of Refactoring for Software Design Smells: Managing Technical Debt and have Girish Suryanarayana, Ganesh Samarthyam & Tushar Sharma on-line! See this thread for details.
It seems that I was able to create custom indexes based on InputSplit. Performances have been greatly improved on my test environment, but is there any hadoop guru here that could review my implementation to make sure I did it on the right way ? E.g I will not get undesired side effects when using on production ?
Indexing on mapreduce