Re: UTF-8 syntax

From: Jean-Marc Desperrier (jean-marc.desperrier@certplus.com)
Date: Mon Dec 09 2002 - 10:28:30 CST


Jean-Marc Desperrier wrote:

> No character will ever be mapped to the 5- and 6-byte forms
> (they no longer exist in the Unicode Standard).

And according to the following text, they no longer exist in
ISO/IEC 10646 either:

http://www.unicode.org/unicode/reports/tr28/#relation

With the publication of Amendment 1 to ISO/IEC 10646-1:2000 and the
Unicode Standard, Version 3.2, the two standards are fully synchronized.
[...]

Notable among the architectural changes to ISO/IEC 10646 approved in
Amendment 1 are:
- The range of characters available for private use has been restricted
to those characters accessible via UTF-16, and the intent not to encode
characters past Plane 16 has been clarified. This guarantees the
interoperability of UTF-8 and UTF-16, and the equivalence of UTF-32 and
UCS-4.
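
To make the consequence for UTF-8 concrete, here is a small sketch (not taken from either standard; the names `utf8_length` and `is_obsolete_lead_byte` are made up for this example). It assumes the Plane 16 limit quoted above, i.e. a highest code point of U+10FFFF, and shows that nothing within that limit needs more than 4 bytes, so the lead bytes of the old 5- and 6-byte forms never occur.

```python
# Sketch only: helper names are hypothetical, chosen for illustration.
MAX_CODE_POINT = 0x10FFFF  # last code point of Plane 16, the UTF-16 limit


def utf8_length(code_point: int) -> int:
    """Return how many bytes UTF-8 needs for a code point."""
    if not 0 <= code_point <= MAX_CODE_POINT:
        raise ValueError("code point is past Plane 16")
    if 0xD800 <= code_point <= 0xDFFF:
        # Surrogates belong to UTF-16 only and are not encodable in UTF-8.
        raise ValueError("surrogate code points are not encodable")
    if code_point < 0x80:
        return 1
    if code_point < 0x800:
        return 2
    if code_point < 0x10000:
        return 3
    return 4  # everything up to U+10FFFF fits in the 4-byte form


def is_obsolete_lead_byte(byte: int) -> bool:
    """True for the lead bytes of the old 5- and 6-byte forms (0xF8..0xFD)."""
    return 0xF8 <= byte <= 0xFD


if __name__ == "__main__":
    for cp in (0x41, 0xE9, 0x20AC, 0x10000, MAX_CODE_POINT):
        # Cross-check against Python's own UTF-8 encoder.
        assert utf8_length(cp) == len(chr(cp).encode("utf-8"))
        print(f"U+{cp:04X}: {utf8_length(cp)} byte(s)")
```

A decoder written to this limit can reject any byte for which `is_obsolete_lead_byte` is true without looking further; Python's built-in `utf-8` codec, for one, raises `UnicodeDecodeError` on such input.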

