From: Jean-Marc Desperrier (jean-marc.desperrier@certplus.com)
Date: Mon Dec 09 2002 - 04:28:04 CST
Charles Lindsey a dit :
>The latest UTF-8 draft draft-yergeau-rfc2279bis-02.txt includes a syntax
>for UTF-8. I have changed our syntax in 2.4.2 to use the same rules.
>This makes no technical difference to our draft.
>
> UTF8-5 = %xF8 %x88-BF 3(UTF8-tail) / %xF9-FB 4(UTF8-tail)
> UTF8-6 = %xFC %x84-BF 4(UTF8-tail) / %xFD 5(UTF8-tail)
> UTF8-xtra-char = UTF8-2 / UTF8-3 / UTF8-4 / UTF8-5 / UTF8-6
>
>
No character will ever be mapped to the 5 and 6 character form, they can
be removed from the draft, the yergeau draft is playing overly secure by
keeping them, I don't think the usefor draft needs to be concerned by
the motivations that make that some people still don't want to remove
them now and would better go in the direction of simplifying things by
restricting the forms that must be supported (they do not exist anymore
in the UNICODE standard).