Re: UTF-8 syntax

New Message Reply About this list Date view Thread view Subject view Author view

From: Jean-Marc Desperrier (jean-marc.desperrier@certplus.com)
Date: Mon Dec 09 2002 - 04:28:04 CST


Charles Lindsey a dit :

>The latest UTF-8 draft draft-yergeau-rfc2279bis-02.txt includes a syntax
>for UTF-8. I have changed our syntax in 2.4.2 to use the same rules.
>This makes no technical difference to our draft.
>
> UTF8-5 = %xF8 %x88-BF 3(UTF8-tail) / %xF9-FB 4(UTF8-tail)
> UTF8-6 = %xFC %x84-BF 4(UTF8-tail) / %xFD 5(UTF8-tail)
> UTF8-xtra-char = UTF8-2 / UTF8-3 / UTF8-4 / UTF8-5 / UTF8-6
>
>
No character will ever be mapped to the 5 and 6 character form, they can
be removed from the draft, the yergeau draft is playing overly secure by
keeping them, I don't think the usefor draft needs to be concerned by
the motivations that make that some people still don't want to remove
them now and would better go in the direction of simplifying things by
restricting the forms that must be supported (they do not exist anymore
in the UNICODE standard).


New Message Reply About this list Date view Thread view Subject view Author view


This archive was generated by hypermail 2b29.