• Post Reply
  • Bookmark Topic Watch Topic
  • New Topic

Lucene Installation

 
Pat Magic
Greenhorn
Posts: 4
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Hi everyone,
I am trying to install lucene on my website. I use Tomcat + IIS and can't find proper doc on how to make Lucene work with them. I've build the files successfully but now don't know what to do next. Please Help me!
Pat
Xyz
 
Pat Magic
Greenhorn
Posts: 4
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Ok,
Well, since I've posted my question, I figured out a way to get some of the lucent classes to work. I am still having trouble with the HTMLParser. I unzipped all the jar files in my Tomcat classes folder, and can't seem to find the HTMLParser class. I get this error on the command line when compiling
ParserThread.java.
Pat
 
Carl Trusiak
Sheriff
Posts: 3341
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
If you downloaded RC2, HTMLParser is a portion of the demo program distibuted with Lucene and not a portion of the Lucene package. When you extract all files from rc2, HTMLParser.class is under lucene-1.2-rc2/demo/classes/org/apache/lucene/HtmlParser directory. If you wish to reuse this or extend it's functionality in your system, add the classes under org/apache/HtmlParser to the lucene jar.
 
Pat Magic
Greenhorn
Posts: 4
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Thanks Carl,
I got the IndexHTML to index the files that I want. on the command prompt: java IndexHTML -create -index index <directory>
Now, since I don't use jhtml but jsp on my site, I don't know how to query my indexes. I tried the SearchFiles.java but it will not query. Could you please Help Me!
Pat
Xyz
 
Pat Magic
Greenhorn
Posts: 4
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
It's me again,
finally got the SearchFiles.java to search my HTML pages index. The only thing is that when I query the index, the hits.doc("path") returns a null value on prompt and not the path to my directory. So I changed temporarily the path to hits.doc("title"), bringing back the title of the document. If you know how come i can't get the path, please tell me. Maybe it's the way I index the html pages?
Anyways. Thanks for the trouble.
PAt
 
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic