[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: rough sketch of a potential solution
On Monday, November 17, 2003, at 03:08 AM, Martin Duerst wrote:
Log file format is in any event a locale-specific issue, so it's not
that it would be appropriate for an MTA to log an unencoded UTF-8
address to a
log file on a system that used some other charset in its locale.
First, it's not a system that uses a locale, it's just a certain
Well, that depends on whether the application maintains its own log
file or whether it uses a common logging facility like syslog.
In addition to that, assuming addresses in many different (languages
and) scripts, you better use UTF-8 or another encoding that is able
to encode all of Unicode/ISO 10646.
I still view logging as a local matter, however I would have no problem
with recommending that MTAs that log addresses do so in a way that
permits display of addresses without decoding if local system tools and
the log facility permit this.
> Essentially this proposal appear to add a i18n-layer for e-mail, a
> "presentation" layer if you wish, on top of RFC 2822 and SMTP.
> this may be preferable for ASCII people, having i18n as an "add-on"
> appear rather fragile to me. But that's only my initial reaction.
Even if we were designing a new mail system from scratch to handle
requirements, I think we would need a lookup across the net at the
that the message is composed (or submitted) in order to handle the
at least one recipient can't transcribe the sender's "native" address.
As I tried to explain in another mail, I'm not really clear on this.
The sender should not send an address that the receiver doesn't know
how to read.
The problem is that there may be multiple recipients and the sender may
not know which recipients can read which languages, or even if this is
known, the sender may not know alternative addresses for some of the
recipients. Consider a reply to a message that was originally sent to
a list of recipients, some of whom the sender knows and some of whom
the sender doesn't know. Either the original message or the reply
could be in multiple languages (say with multipart/alternative)