Re: Newsgroup names and Unicode, attempt 3

New Message Reply About this list Date view Thread view Subject view Author view

From: Charles Lindsey (chl@clw.cs.man.ac.uk)
Date: Tue Jul 03 2001 - 04:40:55 CDT


In <20010702150013.G76346@demon.net> "Clive D.W. Feather" <clive@demon.net> writes:

> In English, o-dieresis is two characters normally rendered as one glyph
> In French, c-cedilla is one character and one glyph
> In Arabic, "ibn" might be three characters that are rendered as three
> glyphs in some contexts, but as one in others

That was looking good until you got to the arabic bit :-( . But would it
always be three glyphs after going through NFKC?

But does anyone know of a usage where "ch" would be regarded as one glyph?

>Also, "glyph" includes font changes, so that several different glyphs
>represent the same character or grapheme.

But fortunately, that does not affect us.

>I suspect we may want to steer well clear of this confusion.

It depends whether it is less confusing than other alternatives on offer.
It is certainly less confusing than the two defined usages of "grapheme".

-- 
Charles H. Lindsey ---------At Home, doing my own thing------------------------
Tel: +44 161 436 6131 Fax: +44 161 436 6133   Web: http://www.cs.man.ac.uk/~chl
Email: chl@clw.cs.man.ac.uk      Snail: 5 Clerewood Ave, CHEADLE, SK8 3JU, U.K.
PGP: 2C15F1A9      Fingerprint: 73 6D C2 51 93 A0 01 E7 65 E8 64 7E 14 A4 AB A5


New Message Reply About this list Date view Thread view Subject view Author view


This archive was generated by hypermail 2b29.