[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Slug header encoding



* John Panzer wrote:
>I don't think you're saying this, but it sounds like you're saying that 
>you can simply write the UTF-8 byte sequences in the header.  For the 
>record: The problem here is that HTTP defines header fields to be 
>Latin-1.  Coincidentally, I am currently engaged in debugging a problem 
>in which someone is sending UTF-8 encoded bytes via an HTTP header, 
>which then get corrupted, somewhere inside either Apache or mod_jk.  

I would argue RFC 2616 is less than clear in this regard and as far as I
can tell there is little consensus among deployed servers and agents how
to interpret this. I would certainly hope a future version of RFC 2616
requires servers to use UTF-8 for the protocol-defined text parts of the
messages and clients to assume a different encoding only if it is not
UTF-8 encoded to accomodate legacy applications as necessary.
-- 
Björn Höhrmann · mailto:bjoern@xxxxxxxxxxxx · http://bjoern.hoehrmann.de
Weinh. Str. 22 · Telefon: +49(0)621/4309674 · http://www.bjoernsworld.de
68309 Mannheim · PGP Pub. KeyID: 0xA4357E78 · http://www.websitedev.de/