[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Reviewing philosophies and assumptions
> The Japanese use of 2022 for mail is well documented, but in Japanese.
> ...
> Well, I am a bit unhappy about talking on issues
> related to Japan, being no Japanese; maybe the people coming
> from Japan could enlighten us?
Akira Kato has asked me to forward this message:
-------------------------------------------------------------------------------
One of the reference which is available to US people is
J. Murai and A. Kato: Researches in Network Developement of JUNET
Proc. of SIGCOMM '87 Workshop, ACM, 1987
More precisely, our Kanji mail exchange code is defined as follows:
1) Use ISO 2022 based encoding in 7bit environment and only G0 is used --
The announsor of ``ESC 2/0 4/1'' is assumed.
2) Designate appropriate code set into G0 (At the top of text,
ASCII or JIS-Roman is assumed)--
ASCII is designated to G0 by ``ESC 2/8 4/2'';
JIS-Roman is designated to G0 by ``ESC 2/8 4/10'';
JIS-Kanji is designated to G0 by ``ESC 2/4 4/2'';
3) C0 characters can be inserted when ASCII or JIS-Roman is designated
to G0.
Not OK: Kanji C0 Kanji
OK: Kanji ``ESC 2/8 4/2'' C0 ``ESC 2/4 4/2'' Kanji
Notes:
1) is introduced because it requires only 7bit transport channel. Prior
to the 8bit through tty implementation, MSB's might be truncated
during the transmission. Especially RFC822 only permit to transfer
ASCII (0-127) characters.
Processing of Locking Shifts were avoided because it introduce
additional states.
2) JIS Kanji can be designated by ``ESC 2/4 2/8 4/2'' in ISO 2022.
However, we do not use ``ESC 2/4 2/8 4/2'' but ``ESC 2/4 4/2'',
which is also permitted by ISO 2022.
3) This is special rule to JUNET (and current Japanese academic
networks). This rule makes the amount of backtrack being limited
to the length of a line when the text is accessed randomly.
4) The differences between ASCII and JIS-Roman are:
ASCII JIS-Roman
5/12 backslash YEN-mark
7/12 vertical bar vertical bar separated in center
7/14 tilde upper score
and the differences can be negligible for most cases in JUNET.
5) Old JIS-Kanji (1978) is still used in the net with designate
sequence of ``ESC 2/4 4/0'', which is not recommended.
6) As our encoding requires only 7bit channel, we can exchange emails
across foreign gateways provided if the very ends can handle the
Kanji according to the rule above. No patch is required to sendmail.
MH has been slightly changed to handle Kanji; the patch is available
via anonymous ftp from, for example, sh.wide.ad.jp [133.4.11.11]:
~ftp/JAPAN. Other Kanji handling software is also available in
~ftp/JAPAN or ~ftp/japanese on the same host.
-- Akira Kato