From: Marc Mutz (mutz@kde.org)
Date: Mon Jun 03 2002 - 18:47:04 CDT
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
On Monday 03 June 2002 18:30, Charles Lindsey wrote:
<snip>
> >Um, ISO or Unicode in the brackets ?
<snip>
> I chose to mention ISO 10646 because AIUI that still pays lip service
> to the possibility of 32 bit codes, whereas the Unicode people seem
> to have gone quite away from them.
>
> Practically speaking, of course, they are dead. But UTF-8 still
> nominally covers them (and that includes the latest UTF-8 draft).
Umm? I don't think surrogates were invented for UTF-16 without reason,
and the BMP is becoming full already...
AFAIK, UTF-8 does not encode a surrogate pair as two 16-bit
characters, but rather the _value_ the pair represents, thus already
leaving the 16-bit target space.
Correct me if I'm wrong, though.
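To make the point concrete, here is a minimal sketch in Python. The
code point U+10400 is just an arbitrarily chosen example outside the
BMP, and the helper name surrogate_pair is made up for illustration:

```python
def surrogate_pair(cp):
    """Split a code point above U+FFFF into its UTF-16 surrogate pair."""
    v = cp - 0x10000
    return 0xD800 + (v >> 10), 0xDC00 + (v & 0x3FF)

cp = 0x10400  # arbitrary example code point outside the BMP
high, low = surrogate_pair(cp)

# UTF-16 needs two 16-bit code units (the surrogate pair) for it ...
utf16 = chr(cp).encode("utf-16-be")

# ... whereas UTF-8 encodes the code point value itself, in four bytes,
# not the two surrogates one after the other.
utf8 = chr(cp).encode("utf-8")

print(hex(high), hex(low))  # 0xd801 0xdc00
print(utf16.hex())          # d801dc00
print(utf8.hex())           # f0909080
```

Encoding each surrogate separately would give two three-byte
sequences (six bytes in total), which is not what UTF-8 specifies.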
Marc
- --
Marc Mutz <mutz@kde.org>
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.0.7 (GNU/Linux)
iD8DBQE8+//43oWD+L2/6DgRArcmAJ43Oxtm9NOzfAWnJpUscO3pUew9uwCfVgTs
6TX8hfl3vMwNYfkdch846V8=
=Mo9J
-----END PGP SIGNATURE-----