Win a copy of The Java Performance Companion this week in the Performance forum!
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic

UTF-8, Superscripts and XML

 
Nagendra Prasad
Ranch Hand
Posts: 219
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Hello Again,
I have figured out that a small set of characters (normal english ones), have their superscript notation available in unicode and therefore it is possible to convert them into their HEX equivalents.
Now, I have an XML Document where there are <sup> tags. Are these to be
treated as unicode markup? or how do we derive the superscript notation
for a character that is enclosed by this tag?
And how can we determine if this <sup> tag is not a part of normal markup
I was planning on reading the XML as an input stream encoded in UTF-8.
Would I be able to differntiate the <sup> as a superscript tag or a
general markup one...
I am not sure if I have explained myself particularly well.. but if
there are question, please ask.. i shall try and be more comprehensible.
Thanks!
 
jason adam
Chicken Farmer ()
Ranch Hand
Posts: 1932
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Moving this post the XML forum, misunderstood the other question, and am going to leave it in I/O as a separate issue. Sorry!!
 
William Brogden
Author and all-around good cowpoke
Rancher
Posts: 13064
6
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
If I understand your question, <sup> in your document is an ordinary XML tag - nothing magic about it. Perhaps the document is intended to be used with a matching XSLT that translates the contents into the special characters, or maybe just uses the HTML <sup>.
Bill
 
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic