
Re: well-formedness error




At 10:27 AM -0700 6/18/04, Tim Bray wrote:
You know, we could specify that Atom MUST always be encoded in UTF-8 and/or that the root element must be <Atøm>. Then, we'd have belt-and-suspenders safety in the face of the most deranged encoding breakage. No, that's probably not a serious suggestion. -Tim

Disclaimer: I'm an interop person, not a developer.


Given the number of edge cases this thread has brought out (many of which have bitten developers over the years), I think that mandating UTF-8 is a reasonable, serious suggestion. It would eliminate all of these edge conditions by saying, in effect, "if you create something using anything other than UTF-8, I assure you I will not be able to figure it out". That should cause folks on the creation side to fall into line quickly.
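
As a rough illustration of what the receiving side of such a rule might look like, here is a minimal Python sketch. It is only a sketch under stated assumptions: the function name accept_utf8_only and the choice to signal rejection with ValueError are hypothetical and come from no Atom draft or library. It refuses any document whose XML declaration names an encoding other than UTF-8, and any document whose bytes are not valid UTF-8.

```python
import re

# Hypothetical helper for illustration only; not taken from any Atom draft.
# Matches an XML declaration at the start of the document and captures the
# declared encoding name, if there is one.
_XML_DECL_ENCODING = re.compile(
    rb'^<\?xml[^>]*?encoding\s*=\s*["\']([A-Za-z][A-Za-z0-9._-]*)["\']'
)

_UTF8_BOM = b"\xef\xbb\xbf"


def accept_utf8_only(data: bytes) -> str:
    """Return the decoded document text, or raise ValueError if it is not UTF-8."""
    # A UTF-8 byte order mark is permitted; strip it before looking further.
    if data.startswith(_UTF8_BOM):
        data = data[len(_UTF8_BOM):]

    # Reject documents that explicitly declare some other encoding.
    match = _XML_DECL_ENCODING.match(data)
    if match and match.group(1).lower() != b"utf-8":
        declared = match.group(1).decode("ascii")
        raise ValueError(f"declared encoding {declared!r} is not UTF-8")

    # Reject documents whose bytes are not well-formed UTF-8. This also
    # catches UTF-16 input, whose byte order mark is not valid UTF-8.
    try:
        return data.decode("utf-8", errors="strict")
    except UnicodeDecodeError as exc:
        raise ValueError("document is not valid UTF-8") from exc
```

With a check like this, the consumer never has to guess at an encoding: a document either passes or is refused outright, which is the "I will not be able to figure it out" behavior described above.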

--Paul Hoffman, Director
--Internet Mail Consortium