Re: UTF-8 over RFC 2047 (Re: Call for Usefor to recharter)

New Message Reply About this list Date view Thread view Subject view Author view

From: D. J. Bernstein (djb@cr.yp.to)
Date: Tue Jan 14 2003 - 17:13:19 CST


Keith Moore writes:
> there will still be a need for canonicalization of certain fields

No. All text is already, at a minimum, C-normalized. Keyboard interfaces
get it right, and other programs don't randomly switch accent positions.

Anyway, your hypothetical Unicode normalization problems would also
arise in normalizing RFC 2047, so you can't use normalization as an
argument against moving to UTF-8. (The big problem with normalizing RFC
2047, of course, is the horrible mess of character encodings.)

Your ``sufficient testing through ordinary usage'' argument is equally
silly. Testing UTF-8 is much simpler than testing RFC 2047.

---D. J. Bernstein, Associate Professor, Department of Mathematics,
Statistics, and Computer Science, University of Illinois at Chicago


New Message Reply About this list Date view Thread view Subject view Author view


This archive was generated by hypermail 2b29.