
Ideas/inputs for a basic design for searching topics in a set of PDF books

 
Samar Chauhan
Greenhorn
Posts: 8
Let me define the design problem: create an intranet search facility for particular topics in an ever-increasing pool of PDF books. What should I consider when choosing between a Java desktop client and a browser front end, searching a database or the file system through Lucene/Solr? Adding a PDF should be user-friendly (i.e. indexing should happen along with the upload). Can I have your inputs for the basic design?
 
David Newton
Author
Rancher
Posts: 12617
Upload a file and it gets indexed? That seems simple enough, no? Searching is as easy as entering terms, or a full Lucene query, and executing it.

I guess I'm not sure what the question is.
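
For illustration only, the "upload a file and it gets indexed, then search by terms" flow can be sketched with a toy in-memory inverted index. This is not Lucene and not anyone's actual implementation: the class `TinyIndex` and its methods are hypothetical names, it assumes the PDF text has already been extracted to a plain string by some other tool, and requiring every query term to match is a choice made for this sketch rather than a description of Lucene's query semantics.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical toy index: maps each lowercased term to the documents containing it.
class TinyIndex {
    private final Map<String, Set<String>> postings = new HashMap<>();

    // Called at upload time: index the already-extracted text of one document.
    void add(String docName, String text) {
        for (String term : text.toLowerCase().split("\\W+")) {
            if (!term.isEmpty()) {
                postings.computeIfAbsent(term, k -> new TreeSet<>()).add(docName);
            }
        }
    }

    // Returns the documents that contain every term in the query.
    Set<String> search(String query) {
        Set<String> result = null;
        for (String term : query.toLowerCase().split("\\W+")) {
            if (term.isEmpty()) {
                continue;
            }
            Set<String> docs = postings.getOrDefault(term, Collections.emptySet());
            if (result == null) {
                result = new TreeSet<>(docs);   // first term: start from its postings
            } else {
                result.retainAll(docs);         // later terms: intersect
            }
        }
        return result == null ? new TreeSet<>() : result;
    }
}
```

A real deployment would hand both steps to Lucene or Solr, which add analysis, ranking and a persistent index; the sketch only shows why indexing at upload time makes the later search a cheap lookup.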
 
Otis Gospodnetic
Author
Greenhorn
Posts: 23
Samar Chauhan wrote: Let me define the design problem: create an intranet search facility for particular topics in an ever-increasing pool of PDF books. What should I consider when choosing between a Java desktop client and a browser front end, searching a database or the file system through Lucene/Solr? Adding a PDF should be user-friendly (i.e. indexing should happen along with the upload). Can I have your inputs for the basic design?


Pointer: Solr Cell - http://wiki.apache.org/solr/ExtractingRequestHandler

Otis
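
As a rough sketch of what using Solr Cell from Java might look like: the handler described on the linked wiki page accepts a document over HTTP at `/update/extract`, with `literal.id` supplying the document id and `commit=true` making it searchable right away. The base URL, the document id, and the class and method names below are assumptions for illustration; the `java.net.http` classes require Java 11 or later.

```java
import java.io.FileNotFoundException;
import java.net.URI;
import java.net.URLEncoder;
import java.net.http.HttpRequest;
import java.nio.charset.StandardCharsets;
import java.nio.file.Path;

// Hypothetical helper for posting a PDF to Solr's extracting request handler.
class SolrCellUpload {

    // Builds the extract URL; solrBase (e.g. "http://localhost:8983/solr") is an assumption.
    static URI extractUri(String solrBase, String docId) {
        String id = URLEncoder.encode(docId, StandardCharsets.UTF_8);
        return URI.create(solrBase + "/update/extract?literal.id=" + id + "&commit=true");
    }

    // Builds (but does not send) a POST request whose body is the raw PDF file.
    static HttpRequest uploadRequest(String solrBase, String docId, Path pdf)
            throws FileNotFoundException {
        return HttpRequest.newBuilder(extractUri(solrBase, docId))
                .header("Content-Type", "application/pdf")
                .POST(HttpRequest.BodyPublishers.ofFile(pdf))
                .build();
    }
}
```

The request would then be sent with `HttpClient.newHttpClient().send(request, HttpResponse.BodyHandlers.ofString())`. An upload form in a browser or a desktop client could both call the same endpoint, so the choice of front end does not change the indexing side.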
 
Samar Chauhan
Greenhorn
Posts: 8
Thanks, Otis and David. Sorry if the question was not clear. I posed a scenario in order to: 1) compare the power and applicability of Lucene and Solr; 2) get feedback on storing the files in the file system versus as BLOBs in a database, because here the BLOB content itself has to be searched, unlike other database searches where you search some simpler data type and then pull out the BLOB content, so I would like the authors and others to share their experience on this; 3) similarly, hear the pros and cons of a Java desktop client versus a browser with reference to Lucene/Solr (ease and speed of development, security). Thanks a lot. I gathered a lot of information on Lucene throughout this week, which I had always wanted to do, and I am looking forward to the book very eagerly.
 