From: Claus Färber (list-ietf-wg-apps-usefor@faerber.muc.de)
Date: Sun Aug 04 2002 - 09:44:00 CDT
Erland Sommarskog <sommar-usefor@algonet.se> wrote:
> Nah. There is one obvious problem, the Turkish i and I. There might
> be a few more. But it should be possible to define sensible rules
> for a general case-insensitivity for the Latin/Cyrillic/Greek scripts.
"Turkish" is a language, not a script, and that's the whole problem:
Most of the special casing rules apply to languages, not scripts.
So you need language information to handle casing correctly. This can't
be done with domain, mailbox and newsgroup names as the language tags
would result in different names for the same character sequence (and
there's no way to look up metadata that's not part of the name).
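To illustrate the point, here is a minimal Python sketch. It assumes Python's built-in str.lower() as the language-neutral mapping; turkish_lower() is a hypothetical helper written only for this example, not part of any standard or library. The same character sequence lowercases to two different strings depending on which language you assume:

```python
def default_lower(name: str) -> str:
    """Language-neutral lowercasing, as provided by Python's str.lower()."""
    return name.lower()


def turkish_lower(name: str) -> str:
    """Hypothetical helper: lowercasing under Turkish rules.

    Turkish pairs dotted I (U+0130) with i, and dotless I with
    dotless i (U+0131), so both are remapped before the generic step.
    """
    return name.replace("\u0130", "i").replace("I", "\u0131").lower()


name = "TITLE"
print(default_lower(name))  # lowercase form under language-neutral rules
print(turkish_lower(name))  # lowercase form under Turkish rules
print(default_lower(name) == turkish_lower(name))
```

Nothing in the name itself says which of the two functions is the right one to apply.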
The best solution is to ignore the problem and to accept that a small
set of character pairs that some people view as upper and lower case
counterparts won't be seen by the computer as such.
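As a rough sketch of what "ignoring the problem" could look like in practice, assuming Python's str.casefold() as the language-neutral comparison (names_match() is a hypothetical name used only here):

```python
def names_match(a: str, b: str) -> bool:
    """Compare two names with language-neutral case folding only."""
    return a.casefold() == b.casefold()


# Ordinary case differences are still folded away.
print(names_match("Comp.Lang", "comp.lang"))

# The Turkish pairs are not treated as case counterparts:
print(names_match("I", "\u0131"))  # I vs. dotless i
print(names_match("\u0130", "i"))  # dotted capital I vs. i
```

The first comparison matches; the two Turkish pairs do not, which is exactly the small set of mismatches one would have to accept.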
Claus
-- 
------------------------ http://www.faerber.muc.de/ ------------------------
OpenPGP: DSS 1024/639680F0 E7A8 AADB 6C8A 2450 67EA AF68 48A5 0E63 6396 80F0