• Post Reply Bookmark Topic Watch Topic
  • New Topic

desktop search engine like lucene  RSS feed

 
archit thakur
Greenhorn
Posts: 24
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
hi all,
i am trying to build the desktop search engine in java like lucene, where should i start from?
 
Ulf Dittmer
Rancher
Posts: 42972
73
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Determine the requirements. If those are "just like Lucene", then you'd be better off just using Lucene, or you'd have many man-years of work ahead of you.
 
archit thakur
Greenhorn
Posts: 24
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
yes my requirements are just like lucene .. but i want to build it(may be the little version of it) .. and as far as there is a question using lucene only, i am building it to get the knowledge of the subject .
 
Ulf Dittmer
Rancher
Posts: 42972
73
  • Likes 1
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Then I suggest to start by reading up on text analysis; not sure what would be a good book on that. But "Lucene in Action" should be very helpful.
 
William Brogden
Author and all-around good cowpoke
Rancher
Posts: 13078
6
  • Likes 1
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
The concepts you need to explore in order to build a desktop search engine.

1. Parsing text to locate words, and (if you are ambitious) larger text structures such as paragraphs, pages, chapters, etc.

2. Creating a dictionary of found words plus the index structure(s) which point to where they originate.

3. Lookup of words in response to user queries. (advanced)handling alternate forms, plurals, etc

4. (advanced) logical operations on index results - AND OR NOT

5. getting from logical hits to retrieved text and presenting results

Have fun

Bill
 
archit thakur
Greenhorn
Posts: 24
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
thanks ulf and bill .
 
It is sorta covered in the JavaRanch Style Guide.
  • Post Reply Bookmark Topic Watch Topic
  • New Topic
Boost this thread!