
Lucene and Office 2007

Has anyone indexed Office 2007 documents into Lucene?

I'm trying to index a .docx document and allow full-text searching of it. Indexing works for *.doc documents, but full-text search doesn't seem to work for *.docx documents.

If someone could point me in the right direction, I would appreciate it.
Which library are you using for reading DOC files - Apache POI? If so, that doesn't support the XML-based Office files yet. You could try the beta version of POI, which does support DOCX to a certain degree.
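If the POI beta isn't an option, one possible workaround is to pull the text out yourself. A .docx file is a ZIP archive whose main body lives in word/document.xml, with the visible text held in w:t elements inside w:p paragraphs. Below is a minimal JDK-only sketch of that idea. It is not POI's API: the class name DocxText and the method extract are made up for illustration, and it ignores headers, footers, footnotes, tabs and text boxes.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical helper (not part of POI or Lucene): extracts body text from a .docx stream.
final class DocxText {
    // Namespace of WordprocessingML elements such as w:p (paragraph) and w:t (text run).
    static final String W_NS =
            "http://schemas.openxmlformats.org/wordprocessingml/2006/main";

    // Returns the body text, one line per paragraph, or "" if the main part is missing.
    static String extract(InputStream docx) throws Exception {
        try (ZipInputStream zip = new ZipInputStream(docx)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                if (!"word/document.xml".equals(entry.getName())) {
                    continue;
                }
                // readAllBytes (Java 9+) stops at the end of the current ZIP entry.
                byte[] xmlBytes = zip.readAllBytes();

                DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
                factory.setNamespaceAware(true);
                // Refuse DOCTYPE declarations to avoid XXE on untrusted files.
                factory.setFeature(
                        "http://apache.org/xml/features/disallow-doctype-decl", true);
                Document xml = factory.newDocumentBuilder()
                        .parse(new ByteArrayInputStream(xmlBytes));

                StringBuilder text = new StringBuilder();
                NodeList paragraphs = xml.getElementsByTagNameNS(W_NS, "p");
                for (int i = 0; i < paragraphs.getLength(); i++) {
                    if (i > 0) {
                        text.append('\n');
                    }
                    // Concatenate every text run inside this paragraph.
                    NodeList runs = ((Element) paragraphs.item(i))
                            .getElementsByTagNameNS(W_NS, "t");
                    for (int j = 0; j < runs.getLength(); j++) {
                        text.append(runs.item(j).getTextContent());
                    }
                }
                return text.toString();
            }
        }
        return "";
    }

    public static void main(String[] args) throws Exception {
        try (InputStream in = Files.newInputStream(Paths.get(args[0]))) {
            System.out.println(extract(in));
        }
    }
}
```

The returned string could then be added to the Lucene document as a tokenized field, the same way you already handle the text from your *.doc files.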
I'll take a look at that :-) Thanks.


