
guidance

 
rishi reddy
Ranch Hand
Posts: 30
Hi,

I have information in PDF files as well as Word documents. I need to convert the contents of these PDF files and Word documents into plain text files.

Could anyone let me know how to do this?

rishi
 
Scott Selikoff
author
Saloon Keeper
Posts: 4028
18
Eclipse IDE Flex Google Web Toolkit
You need to use a library (a JAR) to do the conversion. Some are free, such as iText, and others are commercial. Google is the best way to find them: just search for "java pdf library".
 
Ulf Dittmer
Rancher
Posts: 42968
73
The AccessingFileFormats wiki page has a bunch of information on the subject. Look into Jakarta POI for converting DOC to text, and JPedal or PDFTextStream for extracting text from PDFs.
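
To make the overall shape concrete, here is a rough sketch of how such libraries could be wired into a small converter that writes a .txt file next to each source document. It uses only the JDK. The names TextConversionSketch, TextExtractor, register and convert are made up for illustration and are not part of POI, JPedal, PDFTextStream or iText; the actual library call would go inside each extractor, and its exact form depends on the library and version you pick.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper for illustration; not part of any library named in this thread.
class TextConversionSketch {

    /** Turns one document into plain text; a real implementation would wrap a library call. */
    interface TextExtractor {
        String extract(Path source) throws IOException;
    }

    // Maps a lower-case file extension ("pdf", "doc") to the extractor that handles it.
    private final Map<String, TextExtractor> extractors = new HashMap<>();

    void register(String extension, TextExtractor extractor) {
        extractors.put(extension.toLowerCase(Locale.ROOT), extractor);
    }

    /** Returns the extension without the dot, lower-cased, or "" if there is none. */
    static String extensionOf(Path file) {
        String name = file.getFileName().toString();
        int dot = name.lastIndexOf('.');
        return dot < 0 ? "" : name.substring(dot + 1).toLowerCase(Locale.ROOT);
    }

    /** Extracts the text of the source file and writes it beside it as <name>.txt. */
    Path convert(Path source) throws IOException {
        TextExtractor extractor = extractors.get(extensionOf(source));
        if (extractor == null) {
            throw new IllegalArgumentException("No extractor registered for " + source);
        }
        String text = extractor.extract(source);

        // Swap the original extension (if any) for ".txt".
        String name = source.getFileName().toString();
        int dot = name.lastIndexOf('.');
        String base = dot < 0 ? name : name.substring(0, dot);
        Path target = source.resolveSibling(base + ".txt");

        Files.write(target, text.getBytes(StandardCharsets.UTF_8));
        return target;
    }
}
```

In use, you would register one extractor for "pdf" and one for "doc", each delegating to whichever library you choose, and then call convert on every file. Keeping the library behind a small interface means you can switch PDF libraries later without touching the rest of the code.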

By the way, you should make the topic of your posts more descriptive - "guidance" conveys nothing.
[ June 16, 2006: Message edited by: Ulf Dittmer ]
 