• Post Reply Bookmark Topic Watch Topic
  • New Topic
programming forums Java Mobile Certification Databases Caching Books Engineering Micro Controllers OS Languages Paradigms IDEs Build Tools Frameworks Application Servers Open Source This Site Careers Other all forums
this forum made possible by our volunteer staff, including ...
Marshals:
  • Campbell Ritchie
  • Paul Clapham
  • Liutauras Vilda
  • Knute Snortum
  • Bear Bibeault
Sheriffs:
  • Devaka Cooray
  • Jeanne Boyarsky
  • Junilu Lacar
Saloon Keepers:
  • Ron McLeod
  • Stephan van Hulst
  • Tim Moores
  • Carey Brown
  • salvin francis
Bartenders:
  • Tim Holloway
  • Piet Souris
  • Frits Walraven

Pdf to html

 
Ranch Hand
Posts: 46
Hibernate Tomcat Server Java
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
hi all,
can any one give a a code which can extract a pdf file to an html one..
i have done in converting pdf to text file and also done images extraction in it..
but thats not it.
i need to get the css from the pdf and also the output must be a like the original pdf
thanks
Devan
 
Rancher
Posts: 43011
76
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
The short answer is: it can't be done.

The long answer is: Check out the PDFRenderer library. Since it can render PDFs on screen, it must somehow be able to extract formatting information. Since it's open source, you can study it to find out how it does it. Once you have the formatting information, you can generate CSS from it. I'd estimate it to be a week's worth of time before this works for even simple documents, much less for random documents.
 
Devan Brahma
Ranch Hand
Posts: 46
Hibernate Tomcat Server Java
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Hi,,
Thanks for the reply Ulf Dittmer
i wil Check out the PDFRenderer library.


thanks,
Devan
 
Skool. Stay in. Smartness. Tiny ad:
Java file APIs (DOC, XLS, PDF, and many more)
https://products.aspose.com/total/java
  • Post Reply Bookmark Topic Watch Topic
  • New Topic
Boost this thread!