[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Lack of clarity in dash-escaped?



On Wed, Nov 30, 2005 at 02:55:55PM +0000, Ian G wrote:

> I think what you mean is that the whitespace
> should not be included in the calculation,
> but it doesn't matter whether they are stripped
> from the document itself.

Yes, that is precisely what I mean.
 
> It is an issue, yes.  We discussed this a while
> back and came to the conclusion that *only* ascii
> whitespace was to be stripped/ignored, as the
> alternate was too hard to define.  That's why
> the specific characters to be stripped are in
> the spec - to stop people looking for cyrillic
> spaces or different sized spaces.

Unfortunately, unlike canonical ascii, there is no one-to-one correspondence
between unicode characters and glyph visuals. It would be nice to have some
canonical form for unicode text which is human-readable, yet has a unique
binary representation, but it's not easy and not the job of this WG, IMHO.

-- 
Daniel