iText: PDF conversion into other formats

 
Gian Franco
blacksmith
Ranch Hand
Posts: 979
Hello Bruno,

Does iText support converting PDF into other formats?

If not, do you have any experience with, or preference among, conversion tools?

Cheers,

Gian
 
Bruno Lowagie
author
Ranch Hand
Posts: 88
Gian Franco wrote: Does iText support converting PDF into other formats?


Given the nature of PDF (PDF Creation is supposed to be a "One-Way Process"),
I don't believe in converting PDF into other formats (unless you're talking about
rendering PDF to a raster format).

iText can make a best-effort attempt at extracting text from a PDF, and if the PDF
is "tagged", it can convert the PDF to XML, but I don't trust any software that
claims it can convert PDF to Word, Excel, RTF, HTML, ...

It's sufficient to look inside the PDF and to inspect the PDF syntax to understand
why they are promising something that is (in many cases) impossible.
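
To make that concrete, here is a minimal sketch. It is not iText code: the class name, the sample stream, and the regex-based reader are all made up for illustration, and it ignores fonts, encodings, string escapes, and every operator except Td and Tj. It reads a hand-written page content stream and lists what is actually stored: short strings painted at coordinates, with nothing that says two of them form a table row.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class ContentStreamPeek {

    /** One piece of text plus the line-start position at which it is painted. */
    static final class Fragment {
        final double x;
        final double y;
        final String text;

        Fragment(double x, double y, String text) {
            this.x = x;
            this.y = y;
            this.text = text;
        }
    }

    // Hypothetical, hand-written page content: a heading and a two-cell "row".
    static final String SAMPLE = String.join("\n",
        "BT",
        "/F1 12 Tf",
        "72 720 Td",
        "(Invoice) Tj",
        "0 -24 Td",
        "(Item) Tj",
        "200 0 Td",
        "(Price) Tj",
        "ET");

    // Matches either "tx ty Td" (move to a new line start) or "(string) Tj" (show text).
    // Literal strings containing parentheses or backslashes are deliberately not handled.
    private static final Pattern OP = Pattern.compile(
        "(-?\\d+(?:\\.\\d+)?)\\s+(-?\\d+(?:\\.\\d+)?)\\s+Td|\\(([^()\\\\]*)\\)\\s*Tj");

    static List<Fragment> fragments(String content) {
        List<Fragment> out = new ArrayList<>();
        double x = 0;
        double y = 0;
        Matcher m = OP.matcher(content);
        while (m.find()) {
            if (m.group(3) != null) {
                // Tj: record the string at the current line-start position.
                out.add(new Fragment(x, y, m.group(3)));
            } else {
                // Td: offsets are relative to the start of the current line.
                x += Double.parseDouble(m.group(1));
                y += Double.parseDouble(m.group(2));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        for (Fragment f : fragments(SAMPLE)) {
            System.out.println(f.x + ", " + f.y + "  " + f.text);
        }
    }
}
```

"Item" and "Price" come out as two unrelated fragments that merely share a y coordinate. Any converter has to guess that they belong to the same row, and real PDFs are far messier than this sample.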

Gian Franco wrote: Or, if not, do you have any experience/preferences in conversion tools?


No, that would be against my religion ;-)
 
Gian Franco
blacksmith
Ranch Hand
Posts: 979
Bruno Lowagie wrote: No, that would be against my religion ;-)

I was thinking of PDF -> TIFF. We have two systems where TIFF is the
prevailing format, so we're thinking of adding PDF conversion to them...

Cheers,

Gian
 
Lester Burnham
Rancher
Posts: 1337
See http://www.coderanch.com/t/497492/java/java/Convert-PDF-files-Tiff-files for one approach to do PDF -> TIFF conversion using PDFBox.
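
Whichever library does the rendering, the last step is encoding the resulting raster as TIFF. Below is a minimal sketch of only that step, assuming Java 9 or later, where the standard ImageIO ships a TIFF writer. The placeholder image stands in for a page rendered by a PDF library, and the class and method names are made up for illustration.

```java
import java.awt.Color;
import java.awt.Graphics2D;
import java.awt.image.BufferedImage;
import java.io.File;
import java.io.IOException;
import javax.imageio.ImageIO;

final class TiffWriteSketch {

    /** Stand-in for a rendered page: a white canvas with a black outline. */
    static BufferedImage placeholderPage(int width, int height) {
        BufferedImage img = new BufferedImage(width, height, BufferedImage.TYPE_INT_RGB);
        Graphics2D g = img.createGraphics();
        try {
            g.setColor(Color.WHITE);
            g.fillRect(0, 0, width, height);
            g.setColor(Color.BLACK);
            g.drawRect(10, 10, width - 20, height - 20);
        } finally {
            g.dispose();
        }
        return img;
    }

    /** Encodes one raster as a single-page TIFF file. */
    static void writeTiff(BufferedImage page, File target) throws IOException {
        // ImageIO.write returns false when no writer for the format is installed,
        // which is the case on Java 8 and earlier without an extra plugin.
        if (!ImageIO.write(page, "tiff", target)) {
            throw new IOException("No TIFF writer available in this Java runtime");
        }
    }

    public static void main(String[] args) throws IOException {
        File out = File.createTempFile("page", ".tif");
        writeTiff(placeholderPage(200, 300), out);
        System.out.println("Wrote " + out + " (" + out.length() + " bytes)");
    }
}
```

In a real pipeline, `placeholderPage` would be replaced by whatever call your rendering library offers for turning a PDF page into a `BufferedImage`; the rendering, not the TIFF encoding, is the hard part.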
 
Bruno Lowagie
author
Ranch Hand
Posts: 88
Gian Franco wrote: I was thinking of PDF -> TIFF, we have two systems where the latter
format prevails so we're thinking of adding conversion of the former...


OK, you are referring to conversion in the sense of "rendering".
I saw that Paulo (co-developer of iText) has been adding TIFF creation functionality,
but it's untested and undocumented, and certainly not capable of rendering PDFs to TIFFs yet.

Right now, I would say that PDF-to-TIFF isn't a priority for us, because:
- PDF rendering is not our core business
- there are other tools that do this, and why would we want to compete?

Then again: we've always said the same about parsing PDF, and
the PDF parser is getting better and better with every new release...
And if we can parse a PDF, we could render it, although I don't believe
any third-party company can compete with Adobe when it comes to viewers:
stuff like transparency is just too difficult. Some people prefer other tools
to Adobe Reader, saying that Adobe Reader is bloated, but as soon as
the PDF uses special features, you'll see that third-party viewers fall short
of Adobe Reader.
 