• Post Reply Bookmark Topic Watch Topic
  • New Topic

How can i get the range(ASCII or Unicode) of charcters belonging to Different Languag  RSS feed

 
Vishal Angrish
Greenhorn
Posts: 19
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Hi,
Let me clear my requirement first.
We are creating the data to be faxed & to pre-format the data, we get the width of each character(to calculate the starting pixel of the String) & to do this first we want to store the width of each character, Now the Language can be changed & in that case i want to know in advance the range of ascci values attained by the charset of the Language Choosed.
Thanx,
Angrish
 
Cindy Glass
"The Hood"
Sheriff
Posts: 8521
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Wouldn't you need to know the font size and style also?
I gotta think that there is a better way . . .
 
Vishal Angrish
Greenhorn
Posts: 19
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
OfCourse Font Name,Style & Size are required, But thats not the problem, I think my expalanation for the problem was not quite clear. Basically we want to know the ascii value of all the characters supported by any Language, Which is charset of that language, in advance, So as we can store tham at a place for later use.
You see font has nothing to do with this. Though we are doing it in a way like, we are storing any range of characters in advance, but rather on fly when we get any of the new charater(including digits and symbols) which is not in our collection, we store it, So there may be a chance like after few hunderds or thousands words our collection will get the full range of charsets supported by the Language.(But this is just a fix for the show stopper Problem).
 
David Patterson
Ranch Hand
Posts: 65
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
I don't think the problem is solvable. If by "language" you mean a human language that supports literature, and you are doing text processing, the characters of that language are not likely to be collected together. I've been doing some text processing in Latin, and thought that the alphabet needed for Latin woudl be the same as needed for English. Well, now it turns out some texts use an "a-e-ligature" -- a single character that is an a welded to an e. It has a unicode value, but it is far from the normal a-z letters.
The best reference for unicode is
http://www.unicode.org/charts/
Some languages have their own code charts, but I'm not sure if you can guarantee that they don't also use other symbols from related languages.
I know the International Phonetic Alphabet symbols are scattered across many pages, even for the fairly standard symbols, because many of the IPA symbols were taken from other languages.
Hope this helps.
David Patterson
PatterD Consulting, Inc.
patterd1@comcast.net
 
Consider Paul's rocket mass heater.
  • Post Reply Bookmark Topic Watch Topic
  • New Topic
Boost this thread!