[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: 10646, and all that
> > Or, are you saying that the following specification:
> >
> > Content-type: text/plain; charset=ASCII
> > Content-language: French
> >
> > is better than:
> >
> > Content-type: text/plain; charset=ISO646-FR
> >
>
> Yes! Actually I prefer a language= parameter, but I definitely want
> to separate the language specification from the character set.
That's contradicts with the current policy to use the name US-ASCII
to the only true ASCII.
> If the language is combined with a charset, then a decent mail reader has to
> support LOTS of charsets. Especially when you start listing combinations of
> languages -- say, to indicate a mixed English/Japanese text.
If the language is encoded in a charset, and if the receiver want
to display the charset correctly, the receiver should provide support
for the charset.
If the language is encoded in a charset, and if the receiver does not
want to display the charset correctly, the receiver does not have to
provide support for the charset.
If the language is not encoded in a charset, if no language information
is provided otherwise, and if the receiver want to display the charset
correctly, the receiver can't.
> If language is a parameter to content-type, then a mail reader can display
> the message (perhaps sub-optimally) if it knows the character set. If it's
> smart enough to take the language parameter into account, the mail reader
> can pick a font that is tuned to that language.
That is no different from the above examples.
> Furthermore, separating the language parameter from the charset makes it
> easier to do multilingual text using a construct similar to
> multipart/alternative -- the mail reader can let the user choose to read the
> French rather than the Spanish version of the text.
Then, you must provide very precise definitions of all the languages
in the world.
> The
> reader doesn't really care about the character set, as long as her mail
> reader supports it.
So, let's encode language information into charset.
Masataka Ohta