• Post Reply Bookmark Topic Watch Topic
  • New Topic

how to extract text from pdf using jsp  RSS feed

 
Santosh Kumar
Ranch Hand
Posts: 30
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
hi guys

i want to extract text from pdf files how can that be done using jsp/servlets and is it possible to search a pdf file for some keywords.
please help m e solve the problem


regards
santosh
 
Ben Souther
Sheriff
Posts: 13411
Firefox Browser Redhat VI Editor
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
There is nothing built into JSP for this.

I know there are quite a few libraries out there for working with PDFs from a Java app (Most of the ones I've seen are for creating, not reading, PDFs).

Try Googling with "PDF JAVA PARSE" or some variation.

Note: Often the text in a PDF is just part of an image. In this case you won't be able to extract it.
 
Senthil B Kumar
Ranch Hand
Posts: 160
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
http://www.pdflib.com/products/tet/index.html

http://www.developer.com/java/other/article.php/626501
  • Post Reply Bookmark Topic Watch Topic
  • New Topic
Boost this thread!