[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: well-formedness error




On Sun, 20 Jun 2004 09:06:12 +0200, Julian Reschke <julian.reschke@xxxxxx> wrote:


The important thing here is "implementation". From an HTTP protocol point of view, the extension is irrelevant.

Correct.


So for an IETF protocol that works on top of HTTP, I'd expect it to be
silent about that topic

Yep.


de facto standard for MIME type serving today. Apache does it. IIS does it. I'm sure Sun One, WebSphere and others do the same as well.

That's fine. So just use it.

It doesn't work, because the MIME type being served is without the 'charset' parameter.


You can't have that content-type on statically served web pages. You just can't.

That's simply incorrect. "Even" IIS let's you do that.

If you're an administrator with Remote Desktop logon; yes. If you're an ordinary user with no Remote Desktop logon, you're basically screwed or left with how kind your hosting service is. I think it's an enormous flaw in IIS that this even can't be configured at the folder level as Apache with .htaccess, but even in Apache, 'AllowOverride' can be set to 'None', which renders .htaccess files useless.


As long as it isn't text/*, that's fine.
And then you're stuck with US-ASCII, no matter what the characters in the XML is encoded with, and no matter what you state in the processing instruction. Which means that statically served web pages can _only_ be in english, and never include terms or words from other languages.

Again, that's incorrect. It would really be good if you re-read RFC3023 before making these claims.

I'm claiming that 'text/*' binds you to US-ASCII. Is that incorrect?


"application/..." doesn't have a default encoding.

Ok, so serving '.atom' files as 'application/octet-stream' is OK? This is the default content-type on many web-servers for unknown extensions. For others it is 'text/plain' which is even worse than 'text/xml'. All of the content-types are without 'charset' information.


So XML content served as "application/..." simply doesn't have
transport-level encoding information, and thus you can use any
encoding you like, as long as it's properly declared in the body.

True. I'd still say it's pretty nasty to serve Atom feeds as 'application/octet-stream'.


Yes, that's the whole problem. 'text/*' demands US-ASCII.

...unless you declare something else.

..Which you can't (in this scenario) because you don't have the rights to do so.


Anyway, this is only a problem when you use "text/*", so don't
do that.

If you give your feed the name 'atom.xml' it will get served as 'text/xml'. If you give it the name 'atom.atom' it will either be served as 'application/octet-stream' or more likely 'text/plain'. Neither of these do anything good imho. They are both worse than charset-less 'text/plain'.


Again and again, that's incorrect. The server can be configured to send a custom MIME type (I *did* it, so I know).

I have probably not been clear enough about which users I'm talking about, but there are users -- a lot of users -- that have absolutely no way of configuring this themselves. If their ISP is kind enough to do this for them, they're extremely lucky. Most of them will just say «sorry, we can't do that».


It's a really simple solution for Atom: Just exclude these users from the target audience. If Atom does this explicitly, we don't have to care about them. But if we don't exclude them, we need to do all sorts of research to find out if they fit inside the 80/20 rule (which I think will be an impossible task) or we will just have to take them in and try to do the best possible work for them.

I have to say that I'm not only inclined, but aching to drop these users off the train in 90mph. And I hope they hurt themselves in the fall as well.

--
Asbjørn Ulsberg         -=|=-        asbjornu@xxxxxxxxxxx
«He's a loathsome offensive brute, yet I can't look away»