I am having a really weird issue on Unix where some of the characters coming from a Windows CMS (where else) show up as a ?
These characters are not normal, in the sense that I can see the byte values assigned to them are really messed up.
Has anyone faced this issue before? If so, how did you go about solving it?
Here are the details.
I am using the following to get the input stream from a web file:
h = (HttpURLConnection) webserver.openConnection();
h.connect();
// Get the input stream
in = h.getInputStream();
When I then download the file as HTML from that stream, the weird character is translated into a ?. Here is a dump of the bytes around it:
Byte:83-->Char:S-->HexCode:3
Byte:121-->Char:y-->HexCode:9
Byte:110-->Char:n-->HexCode:E
Byte:100-->Char:d-->HexCode:4
Byte:114-->Char:r-->HexCode:2
Byte:111-->Char:o-->HexCode:F
Byte:109-->Char:m-->HexCode:D
Byte:101-->Char:e-->HexCode:5
Byte:32-->Char: -->HexCode:0
Byte:-106-->Char:?-->HexCode:6
Byte:32-->Char: -->HexCode:0
Byte:70-->Char:F-->HexCode:6
Byte:114-->Char:r-->HexCode:2
Byte:111-->Char:o-->HexCode:F
Byte:109-->Char:m-->HexCode:D
Byte:32-->Char: -->HexCode:0
Notice the byte value of -106 on that character (0x96 when read as unsigned). I have tried using readers with the charset set to ISO-8859-1 as well as UTF-8, but both gave the same result.
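One thing that may be worth ruling out: byte 0x96 is an en dash in the windows-1252 code page, a C1 control character in ISO-8859-1, and not valid on its own in UTF-8. The sketch below decodes that single byte with all three charsets so the difference is visible. The class name Cp1252Check and the helper decode are made up for illustration, and windows-1252 being the CMS's real encoding is only an assumption, since the post does not say which encoding the server sends.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class Cp1252Check {
    // Decode raw bytes with an explicitly named charset (hypothetical helper).
    static String decode(byte[] raw, Charset cs) {
        return new String(raw, cs);
    }

    public static void main(String[] args) {
        // The problem byte from the dump: -106 signed, 0x96 unsigned.
        byte[] raw = {(byte) -106};

        // Assumption: the Windows CMS actually emits windows-1252.
        Charset cp1252 = Charset.forName("windows-1252");

        Charset[] candidates = {StandardCharsets.ISO_8859_1, StandardCharsets.UTF_8, cp1252};
        for (Charset cs : candidates) {
            String s = decode(raw, cs);
            // Print the code point rather than the character itself, so the
            // terminal's own encoding cannot turn it into a ? again.
            System.out.printf("%-12s -> U+%04X%n", cs.name(), (int) s.charAt(0));
        }
    }
}
```

If the windows-1252 line shows U+2013 (en dash) while the other two show U+0096 and U+FFFD, the reader charset is probably the culprit. In that case the ? may also come from the output side, so the file would need to be written with an explicit charset that can represent an en dash, such as UTF-8.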
I am currently running out of options so please help.
Thanks