search inside documents

 
Ranch Hand
Posts: 38
Hi all,
I need to search programmatically (from Java code)
inside XLS, DOC, and PDF files stored in BLOB fields.

Is there a product that would help with this?

Does UltraSearch work well for this?

Any ideas?

Thanks
 
Rancher
Posts: 43081
I'm not aware of a product or project that can search through all of these. Just for kicks, you could try opening each of these files as a byte stream and looking for your search string that way.
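A rough sketch of that byte-stream idea, using only the JDK (Java 9+ for `readAllBytes`). The class and method names (`ByteSearch`, `containsText`) are made up for illustration. It assumes the stream comes from something like `java.sql.Blob.getBinaryStream()`, and it tries the search term in both ISO-8859-1 and UTF-16LE because binary Office files may store text either way. Expect misses: PDF content streams are usually compressed, so a raw byte scan often will not see their text.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

class ByteSearch {

    /** Returns the first index of needle within data, or -1 if absent. */
    static int indexOf(byte[] data, byte[] needle) {
        if (needle.length == 0) {
            return 0;
        }
        outer:
        for (int i = 0; i <= data.length - needle.length; i++) {
            for (int j = 0; j < needle.length; j++) {
                if (data[i + j] != needle[j]) {
                    continue outer;
                }
            }
            return i;
        }
        return -1;
    }

    /**
     * Reads the whole stream into memory and looks for the term encoded
     * as ISO-8859-1 or UTF-16LE. Naive: no decompression, no parsing.
     */
    static boolean containsText(InputStream in, String term) throws IOException {
        byte[] data = in.readAllBytes();
        return indexOf(data, term.getBytes(StandardCharsets.ISO_8859_1)) >= 0
            || indexOf(data, term.getBytes(StandardCharsets.UTF_16LE)) >= 0;
    }

    public static void main(String[] args) throws IOException {
        // Stand-in for blob.getBinaryStream(); the bytes here are fake.
        byte[] fake = "junk quarterly report junk".getBytes(StandardCharsets.UTF_16LE);
        System.out.println(containsText(new ByteArrayInputStream(fake), "report"));
    }
}
```

This loads the whole BLOB into memory, so it only suits small documents; for anything serious, extract the text with a format-aware library first.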

The AccessingFileFormats wiki page has links to a few libraries that can access these file types individually: jExcelAPI and POI for Excel, JPedal for PDF, and POI for Word.

Not sure what this has to do with Oracle, though (which is what this forum is all about).
 
Bartender
Posts: 10336
UltraSearch does this. However, it is not just a tool for searching inside proprietary file types; it's designed to build indices of searchable content (i.e. it is an "information retrieval" product), so it may be far more complex than you need. If an IR product is what you are looking for, though, you should probably look at Lucene as well as UltraSearch.
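To give a feel for what "building an index of searchable content" means, here is a toy inverted index in plain Java. It is not how UltraSearch or Lucene are implemented, and the class name `TinyIndex` is invented for this sketch. It assumes the text has already been extracted from each document by some other means, and it simply maps each lower-cased term to the IDs of the documents that contain it.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

class TinyIndex {

    // term -> sorted IDs of the documents containing that term
    private final Map<String, Set<Integer>> postings = new HashMap<>();

    /** Splits the text on non-alphanumeric characters and records each term. */
    void add(int docId, String text) {
        for (String term : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!term.isEmpty()) {
                postings.computeIfAbsent(term, k -> new TreeSet<>()).add(docId);
            }
        }
    }

    /** Returns the IDs of documents containing the term (case-insensitive). */
    Set<Integer> search(String term) {
        return postings.getOrDefault(term.toLowerCase(Locale.ROOT), Collections.emptySet());
    }

    public static void main(String[] args) {
        TinyIndex index = new TinyIndex();
        index.add(1, "Quarterly sales report");
        index.add(2, "Sales forecast");
        System.out.println(index.search("sales"));
    }
}
```

The point of the index is that a query is a map lookup rather than a scan of every BLOB; a real IR product adds ranking, stemming, phrase queries, and persistent storage on top of this.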
[ April 12, 2006: Message edited by: Paul Sturrock ]
 