[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: well-formedness error
At 10:27 AM -0700 6/18/04, Tim Bray wrote:
You know, we could specify that Atom MUST always be encoded in UTF-8
and/or that the root element must be <Atøm>. Then, we'd have
belt-and-suspenders safety in the face of the most deranged encoding
breakage. No, that's probably not a serious suggestion. -Tim
Disclaimer: I'm an interop person, not a developer.
Given the number of edge cases this thread has brought out (many of
which have bitten developers over the years), I think that mandating
UTF-8 is a reasonable serious suggestion. It would eliminate all the
edge conditions by saying "if you create something using other than
UTF-8, I assure you I will not be able to figure it out". That should
cause folks on the creation side to fall into place quickly.
--Paul Hoffman, Director
--Internet Mail Consortium