[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Well-formedness statistics



On Wed, 23 Jun 2004 06:59:08 -0700, Dare Obasanjo <dareo@xxxxxxxxxxxxx> wrote:
> So you count a document served as text/plain as well-formed XML?

I can find nothing in the XML specification stating that XML documents
MUST NOT be served as text/plain.  I can find nothing in RFC 3023
stating that XML documents MUST NOT be served as text/plain.  RFC 2046
states that "us-ascii" is the default character encoding for all
"text/*" content types.  RFC 2646 updates 2046 but does nothing to
change the default character encoding.

Therefore, if an XML document
1. is served as "text/*",
2. does not specify a charset parameter in the HTTP Content-type, and
2. is parseable by a conformant XML parser using a "us-ascii" encoding
then I count it as well-formed.

This is the situation as best I understand it.  I would be happy if
someone could provide further clarification on how I should handle XML
served as "text/plain", "text/css", or "text/javascript".

Note that this does not affect the non-well-formedness of your feed at
http://www.25hoursaday.com/rss10.xml, which is served as "text/xml"
and therefore falls squarely under the jurisdiction of RFC 3023.

-- 
Cheers,
-Mark