Win a copy of Kotlin in Action this week in the Kotlin forum!
  • Post Reply Bookmark Topic Watch Topic
  • New Topic

PDF to Text converter  RSS feed

 
anju murthy
Greenhorn
Posts: 1
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Hello All,
We want to convert pdf file to a text/html/xml file. This pdf converter has to support Japanese. Is there any such converter which is not an installable. We need a java based jar file which runs on unix. This should not be dependent on any graphics like X11. If anyone has any idea regarding this, please do let us know...
We have tried one tool called "Multivalent". This makes use of X11 graphics. So when we run it on unix, currently it is giving us "OutOfMemoryError". We think that this could be because of the fonts which are being used by the Multivalent.jar. The same works fine when run on Windows.
Can anyone throw some light on this?
Thanks and regards,
Anju
 
Sean Sullivan
Ranch Hand
Posts: 427
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Have you tried www.pdfbox.org ?
 
It is sorta covered in the JavaRanch Style Guide.
  • Post Reply Bookmark Topic Watch Topic
  • New Topic
Boost this thread!