Files to HTML (or XML) converter

 
avihai marchiano
Ranch Hand
Posts: 342
Hey,

Our application needs to convert many file types (doc, ppt, pdf, ...) to XML or HTML in order to extract data from those files.

Does anyone know of a tool (one that integrates with Java) for converting files to HTML or XML?

P.S. We currently pay a lot for the Stellent solution, but we are looking for a cheaper and easier one.

Thank you.
 
Joe Ess
Bartender
Posts: 9443
I use OpenOffice to convert DOC files to HTML. It would probably work on PPT files as well. It will not convert PDFs.
 
avihai marchiano
Ranch Hand
Posts: 342
Thank you for your answer, but I am looking for a tool (library) that supports various file types.
 
Joe Ess
Bartender
Posts: 9443
OpenOffice provides APIs for several languages, including Java.
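
For anyone who wants a starting point, here is a rough sketch of one way to drive the conversion from Java. Note that it does not use the OpenOffice API mentioned above; it simply launches the office suite as an external process, which is a simpler substitute. It assumes a LibreOffice-style `soffice` executable on the PATH that accepts `--headless --convert-to html --outdir`; older OpenOffice builds may use different switches, so check your installation. The class name `OfficeToHtml` and its methods are made up for illustration.

```java
import java.io.File;
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

public class OfficeToHtml {

    // Builds the command line for a headless conversion to HTML.
    // Assumption: "soffice" is on the PATH and understands these
    // LibreOffice-style switches.
    static List<String> buildCommand(File input, File outDir) {
        List<String> cmd = new ArrayList<>();
        cmd.add("soffice");
        cmd.add("--headless");
        cmd.add("--convert-to");
        cmd.add("html");
        cmd.add("--outdir");
        cmd.add(outDir.getPath());
        cmd.add(input.getPath());
        return cmd;
    }

    // Runs the conversion and returns the process exit code.
    static int convert(File input, File outDir)
            throws IOException, InterruptedException {
        Process p = new ProcessBuilder(buildCommand(input, outDir))
                .inheritIO() // show the tool's own messages on our console
                .start();
        return p.waitFor();
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 2) {
            System.err.println("usage: OfficeToHtml <input-file> <output-dir>");
            return;
        }
        int exit = convert(new File(args[0]), new File(args[1]));
        System.out.println("soffice exit code: " + exit);
    }
}
```

The resulting HTML lands in the output directory, where it can be parsed to extract the data. As noted earlier in the thread, PDF input is not expected to work this way.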
 