[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: well-formedness error



> -----Original Message-----
> From: Paul Hoffman / IMC [mailto:phoffman@xxxxxxx] 
> Sent: Wednesday, June 16, 2004 3:29 PM
> To: Dare Obasanjo; danny666@xxxxxxxxxxx
> Cc: atom-syntax@xxxxxxx
> Subject: RE: well-formedness error
> 
> At 1:53 PM -0700 6/16/04, Dare Obasanjo wrote:
> >  3 would be rejected flat out since their MIME type claimed they 
> >weren't XML
> 
> OK, that's bad. Out of curiosity, what did those three claim to be?

application/octet-stream, text/plain & text/html actually. 

> >  4 would be rejected because their MIME contained incorrect 
> or missing 
> >charset information
> 
> That seems irrelevant as long as the XML received had the 
> encoding in the ?xml entry correct for the object itself.

Nope. Read RFC 3023. Specifically 

"if the media type given in the Content-Type HTTP header is text/xml,
text/xml-external-parsed-entity, or a subtype like
text/AnythingAtAll+xml, then the encoding attribute of the XML
declaration within the document is ignored completely, and the encoding
is

1. the encoding given in the charset parameter of the Content-Type HTTP
header, or 
2. us-ascii."

All 4 feeds claimed to be text/xml with no charset parameter meaning
that I was supposed to treat them as us-ascii even though their actual
encodings were UTF-8, Windows-1252 and ISO 8859-1. 

Many realize this is a bug in RFC 3023 but no one seems to have
initiated the process to get the RFC fixed. 

--
PITHY WORDS OF WISDOM 
If you think practice makes perfect-you haven't had a child taking
violin lessons.       

This posting is provided "AS IS" with no warranties, and confers no
rights.