MD> Well, yes, but: Assume a protocol is defined as accepting only UTF-8
MD> and UTF-16 (I understand that that's what you and Chris would prefer).
MD> There may be some XML parsers that understand exactly these two and
MD> nothing else, but your average XML parser understands more character
MD> encodings, starting with iso-8859-1. And as you say above, there is
MD> no standard way to enforce the restriction to UTF-8 and UTF-16,
Actually there is - if anything else is used, the xml encoding
declaration must be used. (well, unless unwisely overridden by snme
other protocol).
MD> and
MD> you may be able to tell a parser, but then you can't distinguish
MD> between a forbidden encoding and broken syntax.
Its as simple as looking for the encoding pseudo-attribute on the xml
declaration.
MD> So whether a protocol says 'UTF-8 only' or 'only UTF-8 and UTF-16',
MD> it's all just the same.
Not at all.