Re: Questions about character sets and their encodings



> IA5String: I've heard this is just 7-bit ASCII, but is that true?

Mostly.  The letters and digits are the same as in ASCII.  The difference is
in a few symbol positions: where ASCII has the dollar sign, IA5String may have
a generic "currency symbol" instead.
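
As a rough illustration of the point above, here is a minimal Python sketch 
that checks whether a string fits in the 7-bit IA5 range.  The function name 
is_ia5 is made up for this example, and the check only looks at code values; 
it cannot tell whether a receiver will display position 0x24 as a dollar sign 
or as a currency symbol.

```python
def is_ia5(text: str) -> bool:
    """Return True if every character fits in 7 bits (code values 0..127).

    Hypothetical helper for illustration only. It checks code values,
    not how a given IA5 version renders symbol positions such as 0x24.
    """
    return all(ord(ch) < 0x80 for ch in text)


if __name__ == "__main__":
    print(is_ia5("cn=Mark Wahl"))   # plain letters, digits, punctuation
    print(is_ia5("caf\u00e9"))      # contains a character above 0x7F
```

Note that "US$5" passes this check, yet the octet 0x24 in it is exactly the 
position that may be shown differently on the other end.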

>        TeletexString: I understand that one can select different
> character sets with escape codes; does anybody have the set of escape
> codes and character sets they select?

The selectable character sets are Japanese Kanji (JIS C 6226-1983, set No. 87),
Chinese (GB 2312-80, set No. 58), and Greek.  The shifting codes that select
them are given in ITU-T Rec. T.61(88).  IMHO it would probably be much easier
to use UniversalString or BMPString instead of T.61 for non-Western-European
character sets.

> encodings.  Also, what is the tag for UniversalString?  

I think UniversalString is UNIVERSAL 28 and BMPString is UNIVERSAL 30.  
BMPString is a two-byte-per-character encoding of the subset of UniversalString
characters that lie in group 0, plane 0 (the Basic Multilingual Plane).
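
To make the tag numbers and the two-byte versus four-byte difference concrete, 
here is a small Python sketch that builds the DER encoding of each type.  The 
function names are made up for this example.  It assumes the content octets 
are big-endian UCS-2 for BMPString and big-endian UCS-4 for UniversalString, 
and uses Python's utf-16-be and utf-32-be codecs to produce them.

```python
def der_length(n: int) -> bytes:
    """Encode a DER definite length (short form below 128, else long form)."""
    if n < 0x80:
        return bytes([n])
    body = n.to_bytes((n.bit_length() + 7) // 8, "big")
    return bytes([0x80 | len(body)]) + body


def encode_bmp_string(text: str) -> bytes:
    """Hypothetical helper: DER BMPString, tag UNIVERSAL 30 (0x1E)."""
    # BMPString can only carry characters from plane 0.
    if any(ord(ch) > 0xFFFF for ch in text):
        raise ValueError("character outside the Basic Multilingual Plane")
    content = text.encode("utf-16-be")      # two octets per character
    return bytes([0x1E]) + der_length(len(content)) + content


def encode_universal_string(text: str) -> bytes:
    """Hypothetical helper: DER UniversalString, tag UNIVERSAL 28 (0x1C)."""
    content = text.encode("utf-32-be")      # four octets per character
    return bytes([0x1C]) + der_length(len(content)) + content


if __name__ == "__main__":
    print(encode_bmp_string("Hi").hex())
    print(encode_universal_string("Hi").hex())
```

For the same text the UniversalString content is twice as long as the 
BMPString content, which is the main practical reason to prefer BMPString 
when all the characters are in plane 0.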


Mark Wahl, Enterprise Directory Integration
Critical Angle Inc.