posted 15 years ago
Hello,
I am using a commercial library that extracts text from PDF files. The library is available for many programming languages, so its API is the least common denominator among them. The only way for the library to read a PDF document is from the file system: its sole open_document() call takes the name of the file as its parameter.
In my program, the PDF file to be parsed exists as a byte array in memory. I would like to parse it without saving it to disk. Is there an easy way to define an in-memory file system and present the in-memory byte array as a file to the text extraction library?
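To make the constraint concrete, here is a minimal sketch of the temporary-file workaround I would like to avoid. It is in Python purely for illustration. The open_document() below is a stand-in stub I wrote so the sketch runs on its own, not the real library call, and parse_from_memory() is a name I made up.

```python
import os
import tempfile


def open_document(path):
    """Hypothetical stand-in for the library's open_document() call.

    The real library only accepts a file name; this stub simply reads the
    file back so the sketch can run without the library installed.
    """
    with open(path, "rb") as f:
        return f.read()


def parse_from_memory(pdf_bytes):
    """Write the bytes to a temporary file, pass its name to the library,
    then delete the file again."""
    # mkstemp returns an open file descriptor and the path of the new file.
    fd, path = tempfile.mkstemp(suffix=".pdf")
    try:
        # Close the file before the library opens it by name.
        with os.fdopen(fd, "wb") as tmp:
            tmp.write(pdf_bytes)
        return open_document(path)
    finally:
        # Clean up even if the library call raises.
        os.remove(path)
```

This works, but every document makes a round trip through the disk, which is exactly what I am hoping an in-memory file system would let me skip.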
Thank you,
Michael