From: Charles Lindsey (chl@clw.cs.man.ac.uk)
Date: Tue Jul 03 2001 - 04:40:55 CDT
In <20010702150013.G76346@demon.net> "Clive D.W. Feather" <clive@demon.net> writes:
> In English, o-dieresis is two characters normally rendered as one glyph
> In French, c-cedilla is one character and one glyph
> In Arabic, "ibn" might be three characters that are rendered as three
> glyphs in some contexts, but as one in others
That was looking good until you got to the arabic bit :-( . But would it
always be three glyphs after going through NFKC?
But does anyone know of a usage where "ch" would be regarded as one glyph?
>Also, "glyph" includes font changes, so that several different glyphs
>represent the same character or grapheme.
But fortunately, that does not affect us.
>I suspect we may want to steer well clear of this confusion.
It depends whether it is less confusing than other alternatives on offer.
It is certainly less confusing than the two defined usages of "grapheme".
-- Charles H. Lindsey ---------At Home, doing my own thing------------------------ Tel: +44 161 436 6131 Fax: +44 161 436 6133 Web: http://www.cs.man.ac.uk/~chl Email: chl@clw.cs.man.ac.uk Snail: 5 Clerewood Ave, CHEADLE, SK8 3JU, U.K. PGP: 2C15F1A9 Fingerprint: 73 6D C2 51 93 A0 01 E7 65 E8 64 7E 14 A4 AB A5