From: Jean-Marc Desperrier (jean-marc.desperrier@certplus.com)
Date: Thu Jul 04 2002 - 17:17:12 CDT
I've just been experimenting sending a message with raw UTF-8 bit in the
newsgroups fr.test.
The interesting point is that I've put some japanese in the title, so
this title has three or four characters inside the forbidden range of
0x80 and 0x9F.
It displays fine inside Google Groups.
http://groups.google.fr/groups?oe=UTF-8&selm=ag1l0q%24pi9%241%40s1.read.news.oleane.net
There's another article with the same title, but a ISO-8859-1 body, and
inside the thread panel Google wrongly displays the title as ISO-8859-1,
but gets it right in the plain view of the message, where you both the
title in UTF-8 and the body correctly.
The most interesting is not that.
I received in my mail box three response from Autoresponder about this
message.
This is the same number as when I sent a message with only a 7 bit
subject.//
So the forbidden characters did not cause any of the message to be lost.
But even better, for two of the messages, the title was not barbled and
arrived correctly in my mail box.
Garbled :
From: "Mailgate.ORG Autoresponder System" <nobody@mailgate.org>
To: "Jean-Marc Desperrier" <jean-marc.desperrier@certplus.com>
Subject: Answer from Mailgate Autoresponder (was: sujet en utf-8 :
accentué fin)
Correct :
From: devnull@news.mediaWays.net (feedme.news.mediaways.net autoreflector)
To: Jean-Marc Desperrier <jean-marc.desperrier@certplus.com>
Subject: Answer from reflector (Re: sujet en utf-8 : accentué
にほんご fin)
From: do-not-reply@franconews.org (franconews.org autorepondeur)
To: Jean-Marc Desperrier <jean-marc.desperrier@certplus.com>
Subject: Reponse du robot (Re: sujet en utf-8 : accentué にほんご fin)
I think that for moderators of newsgroups where the use of 8 bit is
standard, just making sure their own mail server does not garble the
0x80 and 0x9F range, and that the server that redirects the message to
them does not do either, might be simpler than encapsulation.
If you sent the message to a moderated newsgroups in the Big 5, in raw
utf-8, the headers willbe garbled, and you'll simply learn not to do
that outside of your local hierarchy when everything is set up so that
it works.