How to unzip such files

Eugene Aris
Greenhorn

Joined: Sep 20, 2013
Posts: 4
Hello all, I'm trying to figure out what this compression could be:

http://www.mediafire.com/download/0d4aa1c81r87adc/03338.htm.zip
http://www.mediafire.com/download/f3z9l9v051jflkx/00777.htm.zip
http://www.mediafire.com/download/bb88p6wc1apypam/00204.htm.zip

Inflater does not recognize the format. 7-Zip still opens the files, but it shows their properties incorrectly and cannot extract any of them.
Each file contains packed text, and the compression method is the same for all of them.
Any ideas or tools to unpack the files?
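
For readers following along: when a format is unknown, a common first step is to look at the leading bytes of each file. The sketch below is not from the original poster. It checks a few well-known signatures (ZIP local header `PK\3\4`, gzip `1F 8B`, bzip2 `BZh`, and the two-byte zlib header). The names `MagicSniffer` and `guessFormat` are made up for illustration, and an "unknown" result does not by itself tell you whether the data is corrupt or just custom-framed.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;

class MagicSniffer {

    /** Best-guess container label based only on the leading bytes. */
    static String guessFormat(byte[] head) {
        // ZIP local file header: 'P' 'K' 0x03 0x04
        if (head.length >= 4 && head[0] == 'P' && head[1] == 'K'
                && head[2] == 3 && head[3] == 4) {
            return "zip";
        }
        // gzip: 0x1F 0x8B
        if (head.length >= 2 && (head[0] & 0xFF) == 0x1F && (head[1] & 0xFF) == 0x8B) {
            return "gzip";
        }
        // bzip2: 'B' 'Z' 'h'
        if (head.length >= 3 && head[0] == 'B' && head[1] == 'Z' && head[2] == 'h') {
            return "bzip2";
        }
        // zlib: low nibble of the first byte is 8 (deflate) and the
        // 16-bit big-endian header value is a multiple of 31
        if (head.length >= 2 && (head[0] & 0x0F) == 8
                && (((head[0] & 0xFF) << 8) | (head[1] & 0xFF)) % 31 == 0) {
            return "zlib";
        }
        return "unknown";
    }

    public static void main(String[] args) throws IOException {
        for (String name : args) {
            byte[] head;
            try (InputStream in = Files.newInputStream(Paths.get(name))) {
                head = in.readNBytes(8); // Java 11+
            }
            StringBuilder hex = new StringBuilder();
            for (byte b : head) {
                hex.append(String.format("%02X ", b & 0xFF));
            }
            System.out.println(name + ": " + hex + "-> " + guessFormat(head));
        }
    }
}
```

Running it as `java MagicSniffer.java 00204.htm.zip` prints the first eight bytes in hex next to the guess, which is often enough to tell a renamed gzip or zlib stream from a real ZIP archive.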
Jeanne Boyarsky
author & internet detective
Marshal

Joined: May 26, 2003
Posts: 30753
    

Maybe they are corrupt? I tried opening one on the command line and got this message:

Jeanne-Boyarskys-MacBook-Pro:Downloads nyjeanne$ unzip *zip
Archive: 00204.htm.zip
End-of-central-directory signature not found. Either this file is not
a zipfile, or it constitutes one disk of a multi-part archive. In the
latter case the central directory and zipfile comment will be found on
the last disk(s) of this archive.
unzip: cannot find zipfile directory in one of 00204.htm.zip or
00204.htm.zip.zip, and cannot find 00204.htm.zip.ZIP, period.
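
The message above means `unzip` could not find the end-of-central-directory (EOCD) record, which a regular ZIP archive carries near its end. As a rough illustration only, the sketch below scans the tail of a file for the EOCD signature `PK\5\6` (the record is at least 22 bytes long and may be followed by a comment of up to 65535 bytes). The names `EocdCheck` and `findEocd` are hypothetical, and a missing signature only shows that the file is not a complete standard ZIP, not what it actually is.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Paths;

class EocdCheck {

    private static final int EOCD_MIN_SIZE = 22;   // fixed part of the record
    private static final int MAX_COMMENT = 0xFFFF; // longest possible archive comment

    /** Returns the offset of the last EOCD signature in the tail, or -1 if none. */
    static int findEocd(byte[] data) {
        int lowest = Math.max(0, data.length - EOCD_MIN_SIZE - MAX_COMMENT);
        // Scan backwards so the record closest to the end of the file wins.
        for (int i = data.length - EOCD_MIN_SIZE; i >= lowest; i--) {
            if (data[i] == 'P' && data[i + 1] == 'K'
                    && data[i + 2] == 5 && data[i + 3] == 6) {
                return i;
            }
        }
        return -1;
    }

    public static void main(String[] args) throws IOException {
        for (String name : args) {
            byte[] data = Files.readAllBytes(Paths.get(name));
            int at = findEocd(data);
            System.out.println(name + ": "
                    + (at >= 0 ? "EOCD signature at offset " + at : "no EOCD signature found"));
        }
    }
}
```

If a file starts with a ZIP local header but has no EOCD record, it may be truncated; if it has neither, it is probably not a ZIP container at all, whatever its extension says.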


Eugene Aris
Greenhorn

Joined: Sep 20, 2013
Posts: 4
Jeanne, the thing is that they are correct, just packed in an unusual way. Maybe a header was added or something similar, which could only be detected by someone with hands-on experience of archiving and zip techniques.

Each such file, once unzipped, contains readable text in HTML or XML, and there's no doubt it is fine. How to unpack them is the unclear part.
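
One way to test the "extra header" guess is to try inflating the data from successive offsets and see whether any of them yields output. The following is only a sketch under that assumption, not a known solution for these files. `OffsetInflateProbe`, `tryInflate` and `findStream` are illustrative names. With `nowrap` set to `true` (raw deflate, no zlib header or checksum) random bytes can occasionally inflate to garbage, so any hit should be checked by eye.

```java
import java.io.ByteArrayOutputStream;
import java.util.zip.DataFormatException;
import java.util.zip.Inflater;

class OffsetInflateProbe {

    /** Inflates data starting at offset; returns the output, or null on failure. */
    static byte[] tryInflate(byte[] data, int offset, boolean nowrap) {
        Inflater inflater = new Inflater(nowrap);
        try {
            inflater.setInput(data, offset, data.length - offset);
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            byte[] buf = new byte[8192];
            while (!inflater.finished()) {
                int n = inflater.inflate(buf);
                if (n == 0 && !inflater.finished()) {
                    return null; // truncated input or a preset dictionary is needed
                }
                out.write(buf, 0, n);
            }
            return out.toByteArray();
        } catch (DataFormatException e) {
            return null; // not a valid stream at this offset
        } finally {
            inflater.end();
        }
    }

    /** First offset below maxOffset that inflates to non-empty output, or -1. */
    static int findStream(byte[] data, int maxOffset, boolean nowrap) {
        int limit = Math.min(maxOffset, data.length);
        for (int offset = 0; offset < limit; offset++) {
            byte[] out = tryInflate(data, offset, nowrap);
            if (out != null && out.length > 0) {
                return offset;
            }
        }
        return -1;
    }
}
```

A typical use would be to read a file with `Files.readAllBytes`, call `findStream(bytes, 64, false)` and then `findStream(bytes, 64, true)`, and print the result of `tryInflate` at any offset that is found. If nothing inflates, the data may use a different algorithm, be encrypted, or be corrupt.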
Steve Luke
Bartender

Joined: Jan 28, 2003
Posts: 4181
    

How do you know they are correct and have no doubts they are fine if you can't unpack them? If you know there is a special sauce needed to unpack them, maybe you should ask whoever created the files what the special sauce is?


Steve
Eugene Aris
Greenhorn

Joined: Sep 20, 2013
Posts: 4
They wouldn't tell; they leave it to others to guess the clue.
Steve Luke
Bartender

Joined: Jan 28, 2003
Posts: 4181
    

Who is 'they'? How do you know for sure it isn't corrupt? Why do you think you should pursue trying to open these files when the people who posted them don't want you to have them (otherwise they would tell you how to open them)?
Eugene Aris
Greenhorn

Joined: Sep 20, 2013
Posts: 4
Oh well, I just asked the experts. Why all these questions if you cannot help?
There's legacy code and no source code, and there are tons of situations when you cannot ask the developer about the specifics, so I would suggest posting solutions rather than discussing abstract matters of 'why' and 'how on Earth'.
Steve Luke
Bartender

Joined: Jan 28, 2003
Posts: 4181
    

Eugene Aris wrote: Oh well, I just asked the experts. Why all these questions if you cannot help?

I am not sure if I can help. I might be able to, but it could be a dead end if, as Jeanne suggested and all your previous attempts indicate, the files are corrupt. So the questions are meant to get more information and see whether this is worth putting effort into.


Eugene Aris wrote: There's legacy code and no source code, and there are tons of situations when you cannot ask the developer about the specifics, so I would suggest posting solutions rather than discussing abstract matters of 'why' and 'how on Earth'.

But the 'how' requires time, and since you aren't willing to provide answers, I personally am not willing to put that effort in if the result is likely to be what you have already been told (the files are corrupt).