From: Charles Lindsey (chl@clw.cs.man.ac.uk)
Date: Wed Jun 26 2002 - 04:48:48 CDT
In <3D182FFC.4010506@certplus.com> Jean-Marc Desperrier <jean-marc.desperrier@certplus.com> writes:
>>Andrew took the time to do this, and out of 91610 only 49 appeared to be
>>UTF8, of which only 4 were false positives. So, even 1% is off by several
>>orders of magnitude. Given a group with steady traffic (say 300 articles a
>>day) that's only one error every 10 months. Well within what I'd consider
>>an acceptable failure rate for something the user can probably manually
>>override with a single command.
>>
>Out of 91610, how many contained characters above 0x80?
ITYM 0x80-0x9f. Yes, it would be an interesting question, though I would
expect there to be quite a lot.
>I don't know what his feed is, or whether non-US hierarchies are properly
>represented.
Don't worry there. He has access to the biggest feed on the planet.
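
The counting question above (how many articles contain 8-bit bytes at all, and how many of those fall in 0x80-0x9f) could be answered with something like the following. This is only a minimal sketch in Python, assuming article bodies are available as raw bytes. The names `classify` and `tally` and the bucket labels are hypothetical, and the strict UTF-8 decode is a stand-in, not necessarily the test Andrew used.

```python
from collections import Counter


def classify(body: bytes) -> str:
    """Put one article body into a coarse bucket (labels are illustrative)."""
    # Pure 7-bit text: nothing to detect.
    if all(b < 0x80 for b in body):
        return "ascii"
    # Strict decode: any malformed UTF-8 sequence raises UnicodeDecodeError.
    # Checked before the 0x80-0x9f test because valid UTF-8 continuation
    # bytes (0x80-0xbf) overlap that range.
    try:
        body.decode("utf-8")
        return "valid-utf8"
    except UnicodeDecodeError:
        pass
    # 8-bit text that is not UTF-8: note whether it uses the 0x80-0x9f range.
    if any(0x80 <= b <= 0x9F for b in body):
        return "8bit-with-0x80-0x9f"
    return "8bit-other"


def tally(bodies):
    """Count how many bodies land in each bucket."""
    return Counter(classify(b) for b in bodies)
```

A false positive in this sketch is an article such as the two Latin-1 bytes 0xc3 0xa9, which also happen to form a valid UTF-8 sequence and so land in the "valid-utf8" bucket.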
-- 
Charles H. Lindsey ---------At Home, doing my own thing------------------------
Tel: +44 161 436 6131 Fax: +44 161 436 6133   Web: http://www.cs.man.ac.uk/~chl
Email: chl@clw.cs.man.ac.uk      Snail: 5 Clerewood Ave, CHEADLE, SK8 3JU, U.K.
PGP: 2C15F1A9      Fingerprint: 73 6D C2 51 93 A0 01 E7 65 E8 64 7E 14 A4 AB A5