• Post Reply Bookmark Topic Watch Topic
  • New Topic

lucene estimate index size, search time  RSS feed

 
mark smith
Ranch Hand
Posts: 258
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
hi

I search a way to estimate indexing time, index size, search time with lucene library.

I have some number for 500 files and i would like to estimate value for 5000, 500 000, 5 000 000 , 5 000 000 000 documents.

I search on the web and i don't found any good way to estimate theses number.

thanks
 
Ulf Dittmer
Rancher
Posts: 42972
73
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Having worked on a few Lucene projects, I'd put the indexing time and index size at O(n), and search time at O(log n), where n is the number of files. That's assuming that the base set of 500 files is representative of the entire set.

I've also found that having an in-depth understanding of Lucene has made a big difference for fine-tuning indexing time and search time - reading Lucene in Action is a must for serious users.
 
  • Post Reply Bookmark Topic Watch Topic
  • New Topic
Boost this thread!