[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
draft on mnemonic encoding
Here is a rough draft on the mnemonig encoding.
Keld
Network Working Group
Request For Comments: DRAFT
Philippe-Andre' Prindeville
Te'le'com Paris
Keld Simonsen
Danish Unix Users' Group
March 1991
A Portable, Extensible Message Encoding Format
for Alphabetic Scripts
Status of the Memo
This memo proposes an elective encoding format that
permits the exchange of textual messages based on
alphabet scripts, e.g. Latin, Greek, Cyrillian, Hebrew,
Arabic, Katakana, Hiragana, Bopomofo and certain special
symbols etc.
This supplements the set defined in [1], and is the
preferred format for exchanging alphabetic messages.
Distribution of this memo is unlimited.
Acknowledgements
This memo was inspired by [1],[2], and [3], as well as by
conversations with Justin Bur of l'E'cole Polytechnique
Federale de Lausanne (EPFL) and people active within
l'Association Franc,ais d'Utilisateurs Unix (AFUU), and
les Re'seaux Associe's pour la Recherche Europeen (RARE).
Introduction
As the Internet grows in size, the number and scope of
users increases. TCP/IP has become a player in the pan-
European networking arena [4], and the number of other
networks that the Internet connects to is also mounting.
In short, the Internet is becoming Internationalized.
With this expanding circle of users, the breadth of their
needs similarly increases.
One of the most popular services of the Internet,
electronic mail (email), is also perhaps one of the least
adequate to meet this new demand. Issues of addressing
and gatewaying have been conceived and implemented, but
email still bears the constraint that messages be
composed of the 7-bit ASCII graphical character set. For
non-Anglophones, this is simply not adequate. A new
technique must be forthcoming.
For the remainder of this document, we shall take the
subset of ASCII characters that have ISO, EBCDIC, and
Teletext equivalents, and refer to it as NVT (though such
a set exists, and is called invariant ISO 646 [5]). We
regard this as the minimal universal character character
set. Its contents are given in the Appendix.
RFC xxx Encoding of International Characters March 1991
Considerations
When approaching the problem, we identified a few major
considerations. The solution:
o+ must render reasonable results on an
NVT terminal;" Not all users will have access to
resources that can display the complete set (indeed,
few are expected to have the full set available); still
others will continue to use the ubiquitous NVT
terminal. In such instances, this encoding must yield
acceptable results.
o+ must be extensible, to incorporate future insights;
Work continues on the definition and cataloging of
national character sets. One fairly extensive list,
ISO 10646 [6], is being compiled at the time of this
writing. Symbols will probably be added in the future:
at such time, the must by accomodated. Therefore,
expandability is needed.
o+ must work with existing MTAs;
System software is costly and difficult to install;
further, current mail addressing techniques (e.g. MX
[7]) offer little or impractical control of the routing
of messages. As a result, mail may be carried by
obsolete Message Transfer Agents (MTAs). Further,
message encoding is a presentation-level service, and
is best dealt with by the User Agent (UA).
o+ must be transparent to cryptographic or authenticating
services;
Standards are emerging for the end-to-end security and
authenticity of electronic messages [8]. This is
typically done by the UA. In order not to perturb such
facilities, any encoding mechanism must be layered atop
encryption and be impervious to manipulations as may
occur at a transit MTA (EBCDIC-ASCII translation, line
wrapping, etc).
o+ must align with internet methodology;
As mentioned in the first point above, the user,
implementor, or system administrator may not have
access to adequate encoding/decoding or rendering
facilities. He may be obliged to view or
enter/manipulate encoded text by hand. In order to
support this, an encoding format should be simple and
intuitive.
o+ should interoperate with a broad range of systems;
The current networking environment contains many
diverse types of systems with varied interchange
formats (e.g. BITNET, X.500, UUCP). To interoperate
with the greatest number of them, exchange must be
based on the most common assumptions: a limited
character set, limited line lengths, etc.
o+ should be simple an unambiguous;
Any solution that is to have widespread acceptance must
be simple and unambiguous; indeed, the latter
Prindeville & Simonsen [Page 2]
RFC xxx Encoding of International Characters March 1991
frequenctly precludes the former.
o+ must be coherent with existing standards;
It is beyond our charter to create yet another registry
of character sets. A few standards for character set
registration and encoding exist; (insert footnotes
here). Of these, [X] seems to be the most complete.
We rejected the adoptation of the ISO Latin character
sets [9] because they are not extensible, do not
interoperate with existing 7-bit mailers, and do not
yield a reasonable rendering on an NVT terminal.
We also rejected UNICODE [10], a significantly more
comprehensive encoding format, because it did not use
(given an NVT interpretation of the bit patterns) a
human-readable encoding, nor did it interoperate with
existing mail readers (User Agents, or UAs) and MTAs.
This 16-bit encoding format would also double the size of
most messages exchanged in the Internet, an important
factor.
Message Format
As in [1], the message exists as a series of parts, each
part being a group of non-empty lines containing NVT
characters. Each part may or may not be encoded using
this format; we concern ourselves in this document solely
with those that are. Within the relevant parts of the
message, ordinary NVT text may have occurrences of the
following sequence: a shift character (see below),
followed by a string of NVT characters that represent a
non-NVT symbol, as given in the Appendix.
Encoding Specifiers
A message in this format bears the Encoding: field in
the message header, with the following parameters:
Count
As per [1], this indicates a number of lines of
text, called a part, that use this encoding. A
part explicitly may not include empty lines.
Absence of the count indicates that all remaining
parts in the document use the named format.
Keyword
The keyword is given as Mnemonic.
Options
The options may specify:
version n
where n is a two decimal numbers separated
by a period. The current version is 1.0.
shift c[d]
where c specifies the non-locking shift
character. This may be any non-alphanumeric
Prindeville & Simonsen [Page 3]
RFC xxx Encoding of International Characters March 1991
NVT character, except space. The default
values is `&' (ampersand).
Unknown options are ignored.
Cleartext (i.e. unencoded) may be interspersed with
encoded text.
The non-locking shift may be doubled in cleartext to
represent a single occurence of that symbol. There is
currently no locking-shift capability.
Implementation Notes
The Appendix represents a subset of symbols in widespread
use as taken from a readily available catalogue [6]. It
is possible that certain needed symbols may have been
overlooked and omitted. Implementors are encouraged to
register new names as they are needed with the Internet
Naming Authority.
It is also possible that the recipient's rendering
software may be required to display a symbol that is not
in its set. Two possible solutions exist here: Firstly,
the symbol may be displayed as a composite of a base and
some modifiers, such as a Latin letter accompanied by a
diacritical mark or accent, or a Chinese major stroke
modified by its what's-its. This method requires fairly
sophisticated rendering software and high quality display
devices. Or, secondly, the mnemonic name may be
displayed, either with the (non-locking) shift character,
or preferrably with some display attribute (such as
colour, slant, or embolding) that makes it apparent that
the rendering is not possible.
The conversion to and from the user's character encodings
to this format should be made in the mailer reader, or UA
software. As stated above, this is done so as not to
require modification of the MTAs and so that encryption
and/or authentication may be performed transparently
without concern for crypto-checksums being invalidated by
translating gateways (e.g. certain BITNET MTAs
gratuitously perform character mappings on transit mail).
The message should be stored in the recipients mailbox in
the original format if possible. This avoids translation
that may irrecoverably lose information. The consequence
of this is that the contents of the mailbox might not
easily be manipulated by system text tools, such as
string searching programs (such as the UNIX utility
grep).
On systems that have rich character sets and file-system
facilities for attaching content type labels to files, or
where the universal text format is not ASCII, files may
be stored in decoded format, with characters not present
in the system's character set encoded according to the
specifications of this RFC.
Prindeville & Simonsen [Page 4]
RFC xxx Encoding of International Characters March 1991
Appendix: Mnemonic Names
This is the set of associations of strings to symbols.
The mnemonics are taken from [11].
All 26 letters1 (in upper or lower case) may be suffixed
with one of the following characters to yield an accented
letter. Similarly, quote (') followed by any of these
characters denotes the accent by itself, and double quote
(") character may be used in place of a letter to signify
a non-spacing (or dead-key) accent, but its use is not
recommended and requires private agreement.
! Grave
> Circumflex
? Tilde
- Macron
( Breve
: Dieresis
, Cedilla
_ Underline2
" Double acute
/ Stroke
; Ogonek
< Caron
Notes:
2 Not to be used as a general underlining character, but
only to denote symbols that include an underline.
The following list contains the character mnemonic and
the encoding and long descriptive name of ISO 10646.
SP /032/032/032/032 SPACE
! /032/032/032/033 EXCLAMATION MARK
" /032/032/032/034 QUOTATION MARK
Nb /032/032/032/035 NUMBER SIGN
DO /032/032/032/036 DOLLAR SIGN
% /032/032/032/037 PERCENT SIGN
& /032/032/032/038 AMPERSAND
' /032/032/032/039 APOSTROPHE
( /032/032/032/040 LEFT PARENTHESIS
) /032/032/032/041 RIGHT PARENTHESIS
* /032/032/032/042 ASTERISK
+ /032/032/032/043 PLUS SIGN
, /032/032/032/044 COMMA
- /032/032/032/045 HYPHEN-MINUS
. /032/032/032/046 FULL STOP
/ /032/032/032/047 SOLIDUS
0 /032/032/032/048 DIGIT ZERO
1 /032/032/032/049 DIGIT ONE
2 /032/032/032/050 DIGIT TWO
3 /032/032/032/051 DIGIT THREE
4 /032/032/032/052 DIGIT FOUR
5 /032/032/032/053 DIGIT FIVE
6 /032/032/032/054 DIGIT SIX
7 /032/032/032/055 DIGIT SEVEN
8 /032/032/032/056 DIGIT EIGHT
Prindeville & Simonsen [Page 5]
RFC xxx Encoding of International Characters March 1991
9 /032/032/032/057 DIGIT NINE
: /032/032/032/058 COLON
; /032/032/032/059 SEMICOLON
< /032/032/032/060 LESS-THAN SIGN
= /032/032/032/061 EQUALS SIGN
> /032/032/032/062 GREATER-THAN SIGN
? /032/032/032/063 QUESTION MARK
At /032/032/032/064 COMMERCIAL AT
A /032/032/032/065 LATIN CAPITAL LETTER A
B /032/032/032/066 LATIN CAPITAL LETTER B
C /032/032/032/067 LATIN CAPITAL LETTER C
D /032/032/032/068 LATIN CAPITAL LETTER D
E /032/032/032/069 LATIN CAPITAL LETTER E
F /032/032/032/070 LATIN CAPITAL LETTER F
G /032/032/032/071 LATIN CAPITAL LETTER G
H /032/032/032/072 LATIN CAPITAL LETTER H
I /032/032/032/073 LATIN CAPITAL LETTER I
J /032/032/032/074 LATIN CAPITAL LETTER J
K /032/032/032/075 LATIN CAPITAL LETTER K
L /032/032/032/076 LATIN CAPITAL LETTER L
M /032/032/032/077 LATIN CAPITAL LETTER M
N /032/032/032/078 LATIN CAPITAL LETTER N
O /032/032/032/079 LATIN CAPITAL LETTER O
P /032/032/032/080 LATIN CAPITAL LETTER P
Q /032/032/032/081 LATIN CAPITAL LETTER Q
R /032/032/032/082 LATIN CAPITAL LETTER R
S /032/032/032/083 LATIN CAPITAL LETTER S
T /032/032/032/084 LATIN CAPITAL LETTER T
U /032/032/032/085 LATIN CAPITAL LETTER U
V /032/032/032/086 LATIN CAPITAL LETTER V
W /032/032/032/087 LATIN CAPITAL LETTER W
X /032/032/032/088 LATIN CAPITAL LETTER X
Y /032/032/032/089 LATIN CAPITAL LETTER Y
Z /032/032/032/090 LATIN CAPITAL LETTER Z
<( /032/032/032/091 LEFT SQUARE BRACKET
// /032/032/032/092 REVERSE SOLIDUS
)> /032/032/032/093 RIGHT SQUARE BRACKET
'> /032/032/032/094 CIRCUMFLEX ACCENT
_ /032/032/032/095 LOW LINE
'! /032/032/032/096 GRAVE ACCENT
a /032/032/032/097 LATIN SMALL LETTER A
b /032/032/032/098 LATIN SMALL LETTER B
c /032/032/032/099 LATIN SMALL LETTER C
d /032/032/032/100 LATIN SMALL LETTER D
e /032/032/032/101 LATIN SMALL LETTER E
f /032/032/032/102 LATIN SMALL LETTER F
g /032/032/032/103 LATIN SMALL LETTER G
h /032/032/032/104 LATIN SMALL LETTER H
i /032/032/032/105 LATIN SMALL LETTER I
j /032/032/032/106 LATIN SMALL LETTER J
k /032/032/032/107 LATIN SMALL LETTER K
l /032/032/032/108 LATIN SMALL LETTER L
m /032/032/032/109 LATIN SMALL LETTER M
n /032/032/032/110 LATIN SMALL LETTER N
o /032/032/032/111 LATIN SMALL LETTER O
p /032/032/032/112 LATIN SMALL LETTER P
Prindeville & Simonsen [Page 6]
RFC xxx Encoding of International Characters March 1991
q /032/032/032/113 LATIN SMALL LETTER Q
r /032/032/032/114 LATIN SMALL LETTER R
s /032/032/032/115 LATIN SMALL LETTER S
t /032/032/032/116 LATIN SMALL LETTER T
u /032/032/032/117 LATIN SMALL LETTER U
v /032/032/032/118 LATIN SMALL LETTER V
w /032/032/032/119 LATIN SMALL LETTER W
x /032/032/032/120 LATIN SMALL LETTER X
y /032/032/032/121 LATIN SMALL LETTER Y
z /032/032/032/122 LATIN SMALL LETTER Z
(! /032/032/032/123 LEFT CURLY BRACKET
!! /032/032/032/124 VERTICAL LINE
!) /032/032/032/125 RIGHT CURLY BRACKET
'? /032/032/032/126 TILDE
NS /032/032/032/160 NO-BREAK SPACE
!I /032/032/032/161 INVERTED EXCLAMATION MARK
Ct /032/032/032/162 CENT SIGN
Pd /032/032/032/163 POUND SIGN
Cu /032/032/032/164 CURRENCY SIGN
Ye /032/032/032/165 YEN SIGN
BB /032/032/032/166 BROKEN BAR
SE /032/032/032/167 SECTION SIGN
': /032/032/032/168 DIAERESIS
Co /032/032/032/169 COPYRIGHT SIGN
-a /032/032/032/170 FEMININE ORDINAL INDICATOR
<< /032/032/032/171 LEFT POINTING DOUBLE ANGLE QUOTATION MARK
NO /032/032/032/172 NOT SIGN
-- /032/032/032/173 SOFT HYPHEN
Rg /032/032/032/174 REGISTERED SIGN
'- /032/032/032/175 MACRON
DG /032/032/032/176 DEGREE SIGN
+- /032/032/032/177 PLUS-MINUS SIGN
2S /032/032/032/178 SUPERSCRIPT TWO
3S /032/032/032/179 SUPERSCRIPT THREE
'' /032/032/032/180 ACUTE ACCENT
My /032/032/032/181 MICRO SIGN
PI /032/032/032/182 PILCROW SIGN
.M /032/032/032/183 MIDDLE DOT
', /032/032/032/184 CEDILLA
1S /032/032/032/185 SUPERSCRIPT ONE
-o /032/032/032/186 MASCULINE ORDINAL INDICATOR
>> /032/032/032/187 RIGHT POINTING DOUBLE ANGLE QUOTATION MARK
14 /032/032/032/188 VULGAR FRACTION ONE QUARTER
12 /032/032/032/189 VULGAR FRACTION ONE HALF
34 /032/032/032/190 VULGAR FRACTION THREE QUARTERS
?I /032/032/032/191 INVERTED QUESTION MARK
A! /032/032/032/192 LATIN CAPITAL LETTER A WITH GRAVE
A' /032/032/032/193 LATIN CAPITAL LETTER A WITH ACUTE
A> /032/032/032/194 LATIN CAPITAL LETTER A WITH CIRCUMFLEX
A? /032/032/032/195 LATIN CAPITAL LETTER A WITH TILDE
A: /032/032/032/196 LATIN CAPITAL LETTER A WITH DIAERESIS
AA /032/032/032/197 LATIN CAPITAL LETTER A WITH RING ABOVE
AE /032/032/032/198 LATIN CAPITAL LETTER AE
C, /032/032/032/199 LATIN CAPITAL LETTER C WITH CEDILLA
E! /032/032/032/200 LATIN CAPITAL LETTER E WITH GRAVE
E' /032/032/032/201 LATIN CAPITAL LETTER E WITH ACUTE
Prindeville & Simonsen [Page 7]
RFC xxx Encoding of International Characters March 1991
E> /032/032/032/202 LATIN CAPITAL LETTER E WITH CIRCUMFLEX
E: /032/032/032/203 LATIN CAPITAL LETTER E WITH DIAERESIS
I! /032/032/032/204 LATIN CAPITAL LETTER I WITH GRAVE
I' /032/032/032/205 LATIN CAPITAL LETTER I WITH ACUTE
I> /032/032/032/206 LATIN CAPITAL LETTER I WITH CIRCUMFLEX
I: /032/032/032/207 LATIN CAPITAL LETTER I WITH DIAERESIS
D- /032/032/032/208 LATIN CAPITAL LETTER ETH (Icelandic)
N? /032/032/032/209 LATIN CAPITAL LETTER N WITH TILDE
O! /032/032/032/210 LATIN CAPITAL LETTER O WITH GRAVE
O' /032/032/032/211 LATIN CAPITAL LETTER O WITH ACUTE
O> /032/032/032/212 LATIN CAPITAL LETTER O WITH CIRCUMFLEX
O? /032/032/032/213 LATIN CAPITAL LETTER O WITH TILDE
O: /032/032/032/214 LATIN CAPITAL LETTER O WITH DIAERESIS
*X /032/032/032/215 MULTIPLICATION SIGN
O/ /032/032/032/216 LATIN CAPITAL LETTER O WITH STROKE
U! /032/032/032/217 LATIN CAPITAL LETTER U WITH GRAVE
U' /032/032/032/218 LATIN CAPITAL LETTER U WITH ACUTE
U> /032/032/032/219 LATIN CAPITAL LETTER U WITH CIRCUMFLEX
U: /032/032/032/220 LATIN CAPITAL LETTER U WITH DIAERESIS
Y' /032/032/032/221 LATIN CAPITAL LETTER Y WITH ACUTE
TH /032/032/032/222 LATIN CAPITAL LETTER THORN (Icelandic)
ss /032/032/032/223 LATIN SMALL LETTER SHARP S (German)
a! /032/032/032/224 LATIN SMALL LETTER A WITH GRAVE
a' /032/032/032/225 LATIN SMALL LETTER A WITH ACUTE
a> /032/032/032/226 LATIN SMALL LETTER A WITH CIRCUMFLEX
a? /032/032/032/227 LATIN SMALL LETTER A WITH TILDE
a: /032/032/032/228 LATIN SMALL LETTER A WITH DIAERESIS
aa /032/032/032/229 LATIN SMALL LETTER A WITH RING ABOVE
ae /032/032/032/230 LATIN SMALL LETTER AE
c, /032/032/032/231 LATIN SMALL LETTER C WITH CEDILLA
e! /032/032/032/232 LATIN SMALL LETTER E WITH GRAVE
e' /032/032/032/233 LATIN SMALL LETTER E WITH ACUTE
e> /032/032/032/234 LATIN SMALL LETTER E WITH CIRCUMFLEX
e: /032/032/032/235 LATIN SMALL LETTER E WITH DIAERESIS
i! /032/032/032/236 LATIN SMALL LETTER I WITH GRAVE
i' /032/032/032/237 LATIN SMALL LETTER I WITH ACUTE
i> /032/032/032/238 LATIN SMALL LETTER I WITH CIRCUMFLEX
i: /032/032/032/239 LATIN SMALL LETTER I WITH DIAERESIS
d- /032/032/032/240 LATIN SMALL LETTER ETH (Icelandic)
n? /032/032/032/241 LATIN SMALL LETTER N WITH TILDE
o! /032/032/032/242 LATIN SMALL LETTER O WITH GRAVE
o' /032/032/032/243 LATIN SMALL LETTER O WITH ACUTE
o> /032/032/032/244 LATIN SMALL LETTER O WITH CIRCUMFLEX
o? /032/032/032/245 LATIN SMALL LETTER O WITH TILDE
o: /032/032/032/246 LATIN SMALL LETTER O WITH DIAERESIS
-: /032/032/032/247 DIVISION SIGN
o/ /032/032/032/248 LATIN SMALL LETTER O WITH STROKE
u! /032/032/032/249 LATIN SMALL LETTER U WITH GRAVE
u' /032/032/032/250 LATIN SMALL LETTER U WITH ACUTE
u> /032/032/032/251 LATIN SMALL LETTER U WITH CIRCUMFLEX
u: /032/032/032/252 LATIN SMALL LETTER U WITH DIAERESIS
y' /032/032/032/253 LATIN SMALL LETTER Y WITH ACUTE
th /032/032/032/254 LATIN SMALL LETTER THORN (Icelandic)
y: /032/032/032/255 LATIN SMALL LETTER Y WITH DIAERESIS
A- /032/032/033/033 LATIN CAPITAL LETTER A WITH MACRON
C> /032/032/033/034 LATIN CAPITAL LETTER C WITH CIRCUMFLEX
Prindeville & Simonsen [Page 8]
RFC xxx Encoding of International Characters March 1991
C. /032/032/033/035 LATIN CAPITAL LETTER C WITH DOT ABOVE
E- /032/032/033/036 LATIN CAPITAL LETTER E WITH MACRON
E. /032/032/033/037 LATIN CAPITAL LETTER E WITH DOT ABOVE
G> /032/032/033/039 LATIN CAPITAL LETTER G WITH CIRCUMFLEX
'6 /032/032/033/041 LEFT SINGLE QUOTATION MARK
"6 /032/032/033/042 LEFT DOUBLE QUOTATION MARK
G( /032/032/033/043 LATIN CAPITAL LETTER G WITH BREVE
<- /032/032/033/044 LEFTWARD ARROW
-! /032/032/033/045 UPWARD ARROW
-> /032/032/033/046 RIGHTWARD ARROW
-v /032/032/033/047 DOWNWARD ARROW
a- /032/032/033/049 LATIN SMALL LETTER A WITH MACRON
c> /032/032/033/050 LATIN SMALL LETTER C WITH CIRCUMFLEX
c. /032/032/033/051 LATIN SMALL LETTER C WITH DOT ABOVE
e- /032/032/033/052 LATIN SMALL LETTER E WITH MACRON
e. /032/032/033/053 LATIN SMALL LETTER E WITH DOT ABOVE
g> /032/032/033/055 LATIN SMALL LETTER G WITH CIRCUMFLEX
'9 /032/032/033/057 RIGHT SINGLE QUOTATION MARK
"9 /032/032/033/058 RIGHT DOUBLE QUOTATION MARK
g( /032/032/033/059 LATIN SMALL LETTER G WITH BREVE
G. /032/032/033/065 LATIN CAPITAL LETTER G WITH DOT ABOVE
G, /032/032/033/066 LATIN CAPITAL LETTER G WITH CEDILLA
H> /032/032/033/067 LATIN CAPITAL LETTER H WITH CIRCUMFLEX
I? /032/032/033/070 LATIN CAPITAL LETTER I WITH TILDE
I- /032/032/033/071 LATIN CAPITAL LETTER I WITH MACRON
I. /032/032/033/072 LATIN CAPITAL LETTER I WITH DOT ABOVE
'0 /032/032/033/074 RING ABOVE
HB /032/032/033/080 HORIZONTAL BAR
g. /032/032/033/081 LATIN SMALL LETTER G WITH DOT ABOVE
g, /032/032/033/082 LATIN SMALL LETTER G WITH CEDILLA
h> /032/032/033/083 LATIN SMALL LETTER H WITH CIRCUMFLEX
TM /032/032/033/084 TRADE MARK SIGN
Md /032/032/033/085 MUSIC NOTE
i? /032/032/033/086 LATIN SMALL LETTER I WITH TILDE
i- /032/032/033/087 LATIN SMALL LETTER I WITH MACRON
18 /032/032/033/092 VULGAR FRACTION ONE EIGHTH
38 /032/032/033/093 VULGAR FRACTION THREE EIGHTHS
58 /032/032/033/094 VULGAR FRACTION FIVE EIGHTHS
78 /032/032/033/095 VULGAR FRACTION SEVEN EIGHTHS
Om /032/032/033/096 OHM SIGN
I; /032/032/033/097 LATIN CAPITAL LETTER I WITH OGONEK
J> /032/032/033/098 LATIN CAPITAL LETTER J WITH CIRCUMFLEX
K, /032/032/033/099 LATIN CAPITAL LETTER K WITH CEDILLA
H/ /032/032/033/100 LATIN CAPITAL LETTER H WITH STROKE
IJ /032/032/033/102 LATIN CAPITAL LIGATURE IJ
L. /032/032/033/103 LATIN CAPITAL LETTER L WITH MIDDLE DOT
L, /032/032/033/104 LATIN CAPITAL LETTER L WITH CEDILLA
N, /032/032/033/105 LATIN CAPITAL LETTER N WITH CEDILLA
OE /032/032/033/106 LATIN CAPITAL LIGATURE OE
O- /032/032/033/107 LATIN CAPITAL LETTER O WITH MACRON
T/ /032/032/033/109 LATIN CAPITAL LETTER T WITH STROKE
NG /032/032/033/110 LATIN CAPITAL LETTER ENG (Lappish)
'n /032/032/033/111 LATIN SMALL LETTER N PRECEDED BY APOSTROPHE
kk /032/032/033/112 LATIN SMALL LETTER KRA (Greenlandic)
i; /032/032/033/113 LATIN SMALL LETTER I WITH OGONEK
j> /032/032/033/114 LATIN SMALL LETTER J WITH CIRCUMFLEX
Prindeville & Simonsen [Page 9]
RFC xxx Encoding of International Characters March 1991
k, /032/032/033/115 LATIN SMALL LETTER K WITH CEDILLA
h/ /032/032/033/116 LATIN SMALL LETTER H WITH STROKE
i. /032/032/033/117 LATIN SMALL LETTER I WITH NO DOT
ij /032/032/033/118 LATIN SMALL LIGATURE IJ
l. /032/032/033/119 LATIN SMALL LETTER L WITH MIDDLE DOT
l, /032/032/033/120 LATIN SMALL LETTER L WITH CEDILLA
n, /032/032/033/121 LATIN SMALL LETTER N WITH CEDILLA
oe /032/032/033/122 LATIN SMALL LIGATURE OE
o- /032/032/033/123 LATIN SMALL LETTER O WITH MACRON
t/ /032/032/033/125 LATIN SMALL LETTER T WITH STROKE
ng /032/032/033/126 LATIN SMALL LETTER ENG
A; /032/032/033/161 LATIN CAPITAL LETTER A WITH OGONEK
'( /032/032/033/162 BREVE
L/ /032/032/033/163 LATIN CAPITAL LETTER L WITH STROKE
L< /032/032/033/165 LATIN CAPITAL LETTER L WITH CARON
S' /032/032/033/166 LATIN CAPITAL LETTER S WITH ACUTE
S> /032/032/033/168 LATIN CAPITAL LETTER S WITH CIRCUMFLEX
S< /032/032/033/169 LATIN CAPITAL LETTER S WITH CARON
S, /032/032/033/170 LATIN CAPITAL LETTER S WITH CEDILLA
T< /032/032/033/171 LATIN CAPITAL LETTER T WITH CARON
Z' /032/032/033/172 LATIN CAPITAL LETTER Z WITH ACUTE
Z< /032/032/033/174 LATIN CAPITAL LETTER Z WITH CARON
Z. /032/032/033/175 LATIN CAPITAL LETTER Z WITH DOT ABOVE
a; /032/032/033/177 LATIN SMALL LETTER A WITH OGONEK
'; /032/032/033/178 OGONEK
l/ /032/032/033/179 LATIN SMALL LETTER L WITH STROKE
l< /032/032/033/181 LATIN SMALL LETTER L WITH CARON
s' /032/032/033/182 LATIN SMALL LETTER S WITH ACUTE
'< /032/032/033/183 CARON
s> /032/032/033/184 LATIN SMALL LETTER S WITH CIRCUMFLEX
s< /032/032/033/185 LATIN SMALL LETTER S WITH CARON
s, /032/032/033/186 LATIN SMALL LETTER S WITH CEDILLA
t< /032/032/033/187 LATIN SMALL LETTER T WITH CARON
z' /032/032/033/188 LATIN SMALL LETTER Z WITH ACUTE
'" /032/032/033/189 DOUBLE ACUTE ACCENT
z< /032/032/033/190 LATIN SMALL LETTER Z WITH CARON
z. /032/032/033/191 LATIN SMALL LETTER Z WITH DOT ABOVE
R' /032/032/033/192 LATIN CAPITAL LETTER R WITH ACUTE
R, /032/032/033/193 LATIN CAPITAL LETTER R WITH CEDILLA
A( /032/032/033/195 LATIN CAPITAL LETTER A WITH BREVE
L' /032/032/033/197 LATIN CAPITAL LETTER L WITH ACUTE
C' /032/032/033/198 LATIN CAPITAL LETTER C WITH ACUTE
C< /032/032/033/200 LATIN CAPITAL LETTER C WITH CARON
E; /032/032/033/202 LATIN CAPITAL LETTER E WITH OGONEK
E< /032/032/033/204 LATIN CAPITAL LETTER E WITH CARON
D< /032/032/033/207 LATIN CAPITAL LETTER D WITH CARON
D/ /032/032/033/208 LATIN CAPITAL LETTER D WITH STROKE
N' /032/032/033/209 LATIN CAPITAL LETTER N WITH ACUTE
N< /032/032/033/210 LATIN CAPITAL LETTER N WITH CARON
U? /032/032/033/212 LATIN CAPITAL LETTER U WITH TILDE
O" /032/032/033/213 LATIN CAPITAL LETTER O WITH DOUBLE ACUTE
U- /032/032/033/214 LATIN CAPITAL LETTER U WITH MACRON
U( /032/032/033/215 LATIN CAPITAL LETTER U WITH BREVE
R< /032/032/033/216 LATIN CAPITAL LETTER R WITH CARON
U0 /032/032/033/217 LATIN CAPITAL LETTER U WITH RING ABOVE
U; /032/032/033/218 LATIN CAPITAL LETTER U WITH OGONEK
Prindeville & Simonsen [Page 10]
RFC xxx Encoding of International Characters March 1991
U" /032/032/033/219 LATIN CAPITAL LETTER U WITH DOUBLE ACUTE
W> /032/032/033/220 LATIN CAPITAL LETTER W WITH CIRCUMFLEX
Y> /032/032/033/221 LATIN CAPITAL LETTER Y WITH CIRCUMFLEX
T, /032/032/033/222 LATIN CAPITAL LETTER T WITH CEDILLA
Y: /032/032/033/223 LATIN CAPITAL LETTER Y WITH DIAERESIS
r' /032/032/033/224 LATIN SMALL LETTER R WITH ACUTE
r, /032/032/033/225 LATIN SMALL LETTER R WITH CEDILLA
a( /032/032/033/227 LATIN SMALL LETTER A WITH BREVE
l' /032/032/033/229 LATIN SMALL LETTER L WITH ACUTE
c' /032/032/033/230 LATIN SMALL LETTER C WITH ACUTE
c< /032/032/033/232 LATIN SMALL LETTER C WITH CARON
e; /032/032/033/234 LATIN SMALL LETTER E WITH OGONEK
e< /032/032/033/236 LATIN SMALL LETTER E WITH CARON
d< /032/032/033/239 LATIN SMALL LETTER D WITH CARON
d/ /032/032/033/240 LATIN SMALL LETTER D WITH STROKE
n' /032/032/033/241 LATIN SMALL LETTER N WITH ACUTE
n< /032/032/033/242 LATIN SMALL LETTER N WITH CARON
u? /032/032/033/244 LATIN SMALL LETTER U WITH TILDE
o" /032/032/033/245 LATIN SMALL LETTER O WITH DOUBLE ACUTE
u- /032/032/033/246 LATIN SMALL LETTER U WITH MACRON
u( /032/032/033/247 LATIN SMALL LETTER U WITH BREVE
r< /032/032/033/248 LATIN SMALL LETTER R WITH CARON
u0 /032/032/033/249 LATIN SMALL LETTER U WITH RING ABOVE
u; /032/032/033/250 LATIN SMALL LETTER U WITH OGONEK
u" /032/032/033/251 LATIN SMALL LETTER U WITH DOUBLE ACUTE
w> /032/032/033/252 LATIN SMALL LETTER W WITH CIRCUMFLEX
y> /032/032/033/253 LATIN SMALL LETTER Y WITH CIRCUMFLEX
t, /032/032/033/254 LATIN SMALL LETTER T WITH CEDILLA
'. /032/032/033/255 DOT ABOVE
a< /032/032/034/032 LATIN SMALL LETTER A WITH CARON
A< /032/032/034/033 LATIN CAPITAL LETTER A WITH CARON
a_ /032/032/034/034 LATIN SMALL LETTER A WITH LINE BELOW
A_ /032/032/034/035 LATIN CAPITAL LETTER A WITH LINE BELOW
'a /032/032/034/048 LATIN SMALL LETTER A PRECEDED BY APOSTROPHE
'A /032/032/034/049 LATIN CAPITAL LETTER A PRECEDED BY APOSTROPHE
a1 /032/032/034/052 LATIN SMALL LETTER A WITH MACRON AND DIAERESIS
A1 /032/032/034/053 LATIN CAPITAL LETTER A WITH MACRON AND DIAERESIS
a2 /032/032/034/054 LATIN SMALL LETTER A WITH MACRON AND DOT ABOVE
A2 /032/032/034/055 LATIN CAPITAL LETTER A WITH MACRON AND DOT ABOVE
a3 /032/032/034/056 LATIN SMALL LETTER AE WITH MACRON
A3 /032/032/034/057 LATIN CAPITAL LETTER AE WITH MACRON
b. /032/032/034/086 LATIN SMALL LETTER B WITH DOT ABOVE
B. /032/032/034/087 LATIN CAPITAL LETTER B WITH DOT ABOVE
b_ /032/032/034/088 LATIN SMALL LETTER B WITH LINE BELOW
B_ /032/032/034/089 LATIN CAPITAL LETTER B WITH LINE BELOW
d_ /032/032/034/096 LATIN SMALL LETTER D WITH LINE BELOW
D_ /032/032/034/097 LATIN CAPITAL LETTER D WITH LINE BELOW
d. /032/032/034/098 LATIN SMALL LETTER D WITH DOT BELOW
D. /032/032/034/099 LATIN CAPITAL LETTER D WITH DOT BELOW
d; /032/032/034/100 LATIN SMALL LETTER D WITH OGONEK
D; /032/032/034/101 LATIN CAPITAL LETTER D WITH OGONEK
e( /032/032/034/106 LATIN SMALL LETTER E WITH BREVE
E( /032/032/034/107 LATIN CAPITAL LETTER E WITH BREVE
e_ /032/032/034/108 LATIN SMALL LETTER E WITH LINE BELOW
E_ /032/032/034/109 LATIN CAPITAL LETTER E WITH LINE BELOW
;S /032/032/034/126 HIGH OGONEK
Prindeville & Simonsen [Page 11]
RFC xxx Encoding of International Characters March 1991
e? /032/032/034/168 LATIN SMALL LETTER E WITH TILDE
E? /032/032/034/169 LATIN CAPITAL LETTER E WITH TILDE
f. /032/032/034/180 LATIN SMALL LETTER F WITH DOT ABOVE
F. /032/032/034/181 LATIN CAPITAL LETTER F WITH DOT ABOVE
g< /032/032/034/182 LATIN SMALL LETTER G WITH CARON
G< /032/032/034/183 LATIN CAPITAL LETTER G WITH CARON
g- /032/032/034/184 LATIN SMALL LETTER G WITH MACRON
G- /032/032/034/185 LATIN CAPITAL LETTER G WITH MACRON
g/ /032/032/034/188 LATIN SMALL LETTER G WITH STROKE
G/ /032/032/034/189 LATIN CAPITAL LETTER G WITH STROKE
h: /032/032/034/192 LATIN SMALL LETTER H WITH DIAERESIS
H: /032/032/034/193 LATIN CAPITAL LETTER H WITH DIAERESIS
h. /032/032/034/194 LATIN SMALL LETTER H WITH DOT ABOVE
H. /032/032/034/195 LATIN CAPITAL LETTER H WITH DOT ABOVE
h, /032/032/034/196 LATIN SMALL LETTER H WITH CEDILLA
H, /032/032/034/197 LATIN CAPITAL LETTER H WITH CEDILLA
h; /032/032/034/198 LATIN SMALL LETTER H WITH OGONEK
H; /032/032/034/199 LATIN CAPITAL LETTER H WITH OGONEK
i< /032/032/034/204 LATIN SMALL LETTER I WITH CARON
I< /032/032/034/205 LATIN CAPITAL LETTER I WITH CARON
i( /032/032/034/206 LATIN SMALL LETTER I WITH BREVE
I( /032/032/034/207 LATIN CAPITAL LETTER I WITH BREVE
j( /032/032/034/224 LATIN SMALL LETTER J WITH BREVE
J( /032/032/034/225 LATIN CAPITAL LETTER J WITH BREVE
k' /032/032/034/226 LATIN SMALL LETTER K WITH ACUTE
K' /032/032/034/227 LATIN CAPITAL LETTER K WITH ACUTE
k< /032/032/034/228 LATIN SMALL LETTER K WITH CARON
K< /032/032/034/229 LATIN CAPITAL LETTER K WITH CARON
k_ /032/032/034/230 LATIN SMALL LETTER K WITH LINE BELOW
K_ /032/032/034/231 LATIN CAPITAL LETTER K WITH LINE BELOW
k. /032/032/034/232 LATIN SMALL LETTER K WITH DOT BELOW
K. /032/032/034/233 LATIN CAPITAL LETTER K WITH DOT BELOW
k; /032/032/034/234 LATIN SMALL LETTER K WITH OGONEK
K; /032/032/034/235 LATIN CAPITAL LETTER K WITH OGONEK
l_ /032/032/034/240 LATIN SMALL LETTER L WITH LINE BELOW
L_ /032/032/034/241 LATIN CAPITAL LETTER L WITH LINE BELOW
m' /032/032/034/248 LATIN SMALL LETTER M WITH ACUTE
M' /032/032/034/249 LATIN CAPITAL LETTER M WITH ACUTE
m. /032/032/034/250 LATIN SMALL LETTER M WITH DOT ABOVE
M. /032/032/034/251 LATIN CAPITAL LETTER M WITH DOT ABOVE
n. /032/032/035/034 LATIN SMALL LETTER N WITH DOT ABOVE
N. /032/032/035/035 LATIN CAPITAL LETTER N WITH DOT ABOVE
n_ /032/032/035/038 LATIN SMALL LETTER N WITH LINE BELOW
N_ /032/032/035/039 LATIN CAPITAL LETTER N WITH LINE BELOW
o< /032/032/035/046 LATIN SMALL LETTER O WITH CARON
O< /032/032/035/047 LATIN CAPITAL LETTER O WITH CARON
o( /032/032/035/048 LATIN SMALL LETTER O WITH BREVE
O( /032/032/035/049 LATIN CAPITAL LETTER O WITH BREVE
o_ /032/032/035/050 LATIN SMALL LETTER O WITH LINE BELOW
O_ /032/032/035/051 LATIN CAPITAL LETTER O WITH LINE BELOW
o; /032/032/035/064 LATIN SMALL LETTER O WITH OGONEK
O; /032/032/035/065 LATIN CAPITAL LETTER O WITH OGONEK
o1 /032/032/035/068 LATIN SMALL LETTER O WITH MACRON AND OGONEK
O1 /032/032/035/069 LATIN CAPITAL LETTER O WITH MACRON AND OGONEK
p' /032/032/035/098 LATIN SMALL LETTER P WITH ACUTE
P' /032/032/035/099 LATIN CAPITAL LETTER P WITH ACUTE
Prindeville & Simonsen [Page 12]
RFC xxx Encoding of International Characters March 1991
r. /032/032/035/100 LATIN SMALL LETTER R WITH DOT ABOVE
R. /032/032/035/101 LATIN CAPITAL LETTER R WITH DOT ABOVE
r_ /032/032/035/102 LATIN SMALL LETTER R WITH LINE BELOW
R_ /032/032/035/103 LATIN CAPITAL LETTER R WITH LINE BELOW
s. /032/032/035/110 LATIN SMALL LETTER S WITH DOT ABOVE
S. /032/032/035/111 LATIN CAPITAL LETTER S WITH DOT ABOVE
s; /032/032/035/114 LATIN SMALL LETTER S WITH OGONEK
S; /032/032/035/115 LATIN CAPITAL LETTER S WITH OGONEK
t_ /032/032/035/160 LATIN SMALL LETTER T WITH LINE BELOW
T_ /032/032/035/161 LATIN CAPITAL LETTER T WITH LINE BELOW
t. /032/032/035/162 LATIN SMALL LETTER T WITH DOT BELOW
T. /032/032/035/163 LATIN CAPITAL LETTER T WITH DOT BELOW
u< /032/032/035/170 LATIN SMALL LETTER U WITH CARON
U< /032/032/035/171 LATIN CAPITAL LETTER U WITH CARON
v? /032/032/035/214 LATIN SMALL LETTER V WITH TILDE
V? /032/032/035/215 LATIN CAPITAL LETTER V WITH TILDE
w' /032/032/035/220 LATIN SMALL LETTER W WITH ACUTE
W' /032/032/035/221 LATIN CAPITAL LETTER W WITH ACUTE
w. /032/032/035/222 LATIN SMALL LETTER W WITH DOT ABOVE
W. /032/032/035/223 LATIN CAPITAL LETTER W WITH DOT ABOVE
w: /032/032/035/224 LATIN SMALL LETTER W WITH DIAERESIS
W: /032/032/035/225 LATIN CAPITAL LETTER W WITH DIAERESIS
x. /032/032/035/230 LATIN SMALL LETTER X WITH DOT ABOVE
X. /032/032/035/231 LATIN CAPITAL LETTER X WITH DOT ABOVE
x: /032/032/035/232 LATIN SMALL LETTER X WITH DIAERESIS
X: /032/032/035/233 LATIN CAPITAL LETTER X WITH DIAERESIS
y! /032/032/035/236 LATIN SMALL LETTER Y WITH GRAVE
Y! /032/032/035/237 LATIN CAPITAL LETTER Y WITH GRAVE
y. /032/032/035/238 LATIN SMALL LETTER Y WITH DOT ABOVE
Y. /032/032/035/239 LATIN CAPITAL LETTER Y WITH DOT ABOVE
z> /032/032/035/244 LATIN SMALL LETTER Z WITH CIRCUMFLEX
Z> /032/032/035/245 LATIN CAPITAL LETTER Z WITH CIRCUMFLEX
z( /032/032/035/246 LATIN SMALL LETTER Z WITH BREVE
Z( /032/032/035/247 LATIN CAPITAL LETTER Z WITH BREVE
z_ /032/032/035/248 LATIN SMALL LETTER Z WITH LINE BELOW
Z_ /032/032/035/249 LATIN CAPITAL LETTER Z WITH LINE BELOW
z/ /032/032/035/252 LATIN SMALL LETTER Z WITH STROKE
Z/ /032/032/035/253 LATIN CAPITAL LETTER Z WITH STROKE
ez /032/032/035/254 LATIN SMALL LETTER EZH WITH CARON
EZ /032/032/035/255 LATIN CAPITAL LETTER EZH WITH CARON
g' /032/032/036/033 LATIN SMALL LETTER G WITH ACUTE
G' /032/032/036/034 LATIN CAPITAL LETTER G WITH ACUTE
'b /032/032/036/084 LATIN SMALL LETTER B PRECEDED BY APOSTROPHE
'B /032/032/036/085 LATIN CAPITAL LETTER B PRECEDED BY APOSTROPHE
'd /032/032/036/096 LATIN SMALL LETTER D PRECEDED BY APOSTROPHE
'D /032/032/036/097 LATIN CAPITAL LETTER D PRECEDED BY APOSTROPHE
'g /032/032/036/162 LATIN SMALL LETTER G PRECEDED BY APOSTROPHE
'G /032/032/036/163 LATIN CAPITAL LETTER G PRECEDED BY APOSTROPHE
'j /032/032/036/174 LATIN SMALL LETTER J PRECEDED BY APOSTROPHE
'J /032/032/036/175 LATIN CAPITAL LETTER J PRECEDED BY APOSTROPHE
'y /032/032/036/235 LATIN SMALL LETTER Y PRECEDED BY APOSTROPHE
'Y /032/032/036/236 LATIN CAPITAL LETTER Y PRECEDED BY APOSTROPHE
ed /032/032/036/239 LATIN SMALL LETTER EDZ
ED /032/032/036/240 LATIN CAPITAL LETTER EDZ
Vs /032/032/037/032 SPACE SYMBOL
1M /032/032/037/033 EM-SPACE
Prindeville & Simonsen [Page 13]
RFC xxx Encoding of International Characters March 1991
1N /032/032/037/034 EN-SPACE
3M /032/032/037/035 THREE-PER-EM SPACE
4M /032/032/037/036 FOUR-PER-EM SPACE
6M /032/032/037/037 SIX-PER-EM SPACE
1H /032/032/037/038 HAIR SPACE
1T /032/032/037/039 THIN SPACE
-1 /032/032/037/040 HYPHEN
-N /032/032/037/041 EN-DASH
-2 /032/032/037/042 MINUS SIGN
-M /032/032/037/043 EM-DASH
-3 /032/032/037/044 QUOTATION DASH
'1 /032/032/037/045 SINGLE PRIME
'2 /032/032/037/046 DOUBLE PRIME
'3 /032/032/037/047 TRIPLE PRIME
9' /032/032/037/048 SINGLE HIGH-REVERSED-9 QUOTATION MARK
9" /032/032/037/049 DOUBLE HIGH-REVERSED-9 QUOTATION MARK
.9 /032/032/037/050 SINGLE LOW-9 QUOTATION MARK
:9 /032/032/037/051 DOUBLE LOW-9 QUOTATION MARK
<1 /032/032/037/052 SINGLE LEFT-POINTING ANGLE QUOTATION MARK
>1 /032/032/037/053 SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
</ /032/032/037/054 LEFT-POINTING ANGLE BRACKET
/> /032/032/037/055 RIGHT-POINTING ANGLE BRACKET
15 /032/032/037/056 VULGAR FRACTION ONE FIFTH
25 /032/032/037/057 VULGAR FRACTION TWO FIFTHS
35 /032/032/037/058 VULGAR FRACTION THREE FIFTHS
45 /032/032/037/059 VULGAR FRACTION FOUR FIFTHS
16 /032/032/037/060 VULGAR FRACTION ONE SIXTH
13 /032/032/037/061 VULGAR FRACTION ONE THIRD
23 /032/032/037/062 VULGAR FRACTION TWO THIRDS
56 /032/032/037/063 VULGAR FRACTION FIVE SIXTHS
*- /032/032/037/064 MIDDLE ASTERISK
/- /032/032/037/065 DAGGER
/= /032/032/037/066 DOUBLE-DAGGER
-X /032/032/037/067 MALTESE CROSS
%0 /032/032/037/068 PER-MILLE SIGN
co /032/032/037/069 CARE-OF SIGN
PO /032/032/037/070 SOUND RECORDING COPYRIGHT SIGN
Rx /032/032/037/071 PRESCRIPTION SIGN
AO /032/032/037/072 ANGSTROEM SIGN
oC /032/032/037/073 CENTIGRADE DEGREE SIGN
Ml /032/032/037/074 MALE SIGN
Fm /032/032/037/075 FEMALE SIGN
Tl /032/032/037/076 TELEPHONE SIGN
TR /032/032/037/077 TELEPHONE RECORDER SIGN
MX /032/032/037/078 MUSICAL SHARP SIGN
Mb /032/032/037/079 MUSICAL FLAT SIGN
Mx /032/032/037/080 MUSICAL NATURAL SIGN
XX /032/032/037/081 BALLOT CROSS SIGN
OK /032/032/037/082 CHECK MARK
M2 /032/032/037/083 DOUBLE MUSICAL NOTES
!2 /032/032/037/084 DOUBLE EXCLAMATION MARKS
=2 /032/032/037/085 DOUBLE LOW LINE
Ca /032/032/037/086 CARET
.. /032/032/037/087 TWO-DOT LEADER
.3 /032/032/037/088 HORIZONTAL ELLIPSIS
:3 /032/032/037/089 VERTICAL ELLIPSIS
Prindeville & Simonsen [Page 14]
RFC xxx Encoding of International Characters March 1991
.: /032/032/037/090 THEREFORE SIGN
:. /032/032/037/091 BECAUSE SIGN
-+ /032/032/037/092 MINUS-PLUS SIGN
!= /032/032/037/093 NOT EQUAL-TO SIGN
=3 /032/032/037/094 IDENTICAL-TO SIGN
?1 /032/032/037/095 DIFFERENCE-BETWEEN SIGN
?2 /032/032/037/096 ALMOST-EQUALS SIGN
?- /032/032/037/097 ASYMTOTICALLY-EQUALS SIGN
?= /032/032/037/098 SIMILAR-TO SIGN
=< /032/032/037/099 LESS-THAN OR EQUAL-TO SIGN
>= /032/032/037/100 GREATER-THAN OR EQUAL-TO SIGN
0( /032/032/037/101 PROPORTIONAL-TO SIGN
00 /032/032/037/102 INFINITY SIGN
PP /032/032/037/103 PARALLEL-TO SIGN
-T /032/032/037/104 ORTHOGONAL-TO SIGN
-L /032/032/037/105 RIGHT ANGLE SIGN
-V /032/032/037/106 ANGLE SIGN
AN /032/032/037/107 LOGICAL-AND SIGN
OR /032/032/037/108 LOGICAL-OR SIGN
.P /032/032/037/109 PRODUCT DOT SIGN
nS /032/032/037/110 SUPERSCRIPT LATIN SMALL LETTER N
dP /032/032/037/111 PARTIAL DIFFERENTIAL SIGN
f( /032/032/037/112 FUNCTION SIGN
In /032/032/037/113 INTEGRAL SIGN
Io /032/032/037/114 CONTOUR INTEGRAL SIGN
RT /032/032/037/117 RADICAL SIGN
*P /032/032/037/118 REPEATED PRODUCT SIGN
+Z /032/032/037/119 SUMMATION SIGN
FA /032/032/037/120 FOR-ALL SIGN
TE /032/032/037/121 THERE-EXISTS SIGN
GF /032/032/037/122 GAMMA FUNCTION SIGN
DE /032/032/037/123 INCREMENT SIGN
NB /032/032/037/124 NABLA
(U /032/032/037/125 INTERSECTION SIGN
)U /032/032/037/126 UNION SIGN
(C /032/032/037/160 PROPER SUBSET SIGN
)C /032/032/037/161 PROPER SUPERSET SIGN
(_ /032/032/037/162 SUBSET SIGN
)_ /032/032/037/163 SUPERSET SIGN
(- /032/032/037/164 ELEMENT-OF SIGN
-) /032/032/037/165 HAS AN ELEMENT SIGN
<> /032/032/037/166 LEFT AND RIGHT-POINTING ARROW
UD /032/032/037/167 UP AND DOWN-POINTING ARROW
Ub /032/032/037/168 UP AND DOWN-POINTING ARROW WITH LINE BELOW
<= /032/032/037/169 IMPLIED-BY SIGN
=> /032/032/037/170 IMPLIES SIGN
== /032/032/037/171 IF-AND-ONLY-IF SIGN
/0 /032/032/037/172 EMPTY SIGN
OL /032/032/037/173 SOLID LOZENGE
0u /032/032/037/176 SMILING FACE WHITE
0U /032/032/037/177 SMILING FACE BLACK
SU /032/032/037/178 RADIANT SUN
0: /032/032/037/179 DOTTED CIRCLE
OS /032/032/037/180 SQUARE EMPTY
fS /032/032/037/181 SQUARE SOLID
Or /032/032/037/182 RECTANGLE EMPTY
Prindeville & Simonsen [Page 15]
RFC xxx Encoding of International Characters March 1991
SR /032/032/037/183 RECTANGLE SOLID
uT /032/032/037/184 UPWARDS-POINTING TRIANGLE EMPTY
UT /032/032/037/185 UPWARDS-POINTING TRIANGLE SOLID
dT /032/032/037/186 DOWNWARDS-POINTING TRIANGLE EMPTY
Dt /032/032/037/187 DOWNWARDS-POINTING TRIANGLE SOLID
PL /032/032/037/188 LEFTWARDS POINTER SOLID
PR /032/032/037/189 RIGHTWARDS POINTER SOLID
*1 /032/032/037/190 STAR EMPTY
*2 /032/032/037/191 STAR SOLID
VV /032/032/037/192 BOX DRAWINGS HEAVY VERTICAL
HH /032/032/037/193 BOX DRAWINGS HEAVY HORIZONTAL
DR /032/032/037/194 BOX DRAWINGS HEAVY DOWN AND RIGHT
LD /032/032/037/195 BOX DRAWINGS HEAVY DOWN AND LEFT
UR /032/032/037/196 BOX DRAWINGS HEAVY UP AND RIGHT
UL /032/032/037/197 BOX DRAWINGS HEAVY UP AND LEFT
VR /032/032/037/198 BOX DRAWINGS HEAVY VERTICAL AND RIGHT
VL /032/032/037/199 BOX DRAWINGS HEAVY VERTICAL AND LEFT
DH /032/032/037/200 BOX DRAWINGS HEAVY HORIZONTAL AND DOWN
UH /032/032/037/201 BOX DRAWINGS HEAVY HORIZONTAL AND UP
VH /032/032/037/202 BOX DRAWINGS HEAVY VERTICAL AND HORIZONTAL
TB /032/032/037/203 BOX DRAWING SOLID UPPER HALF BLOCK
LB /032/032/037/204 BOX DRAWING SOLID LOWER HALF BLOCK
FB /032/032/037/205 BOX DRAWING SOLID FULL BLOCK
sB /032/032/037/206 BOX DRAWING SOLID SMALL SQUARE
EH /032/032/037/207 EMPTY HOUSE SIGN
vv /032/032/037/208 BOX DRAWINGS LIGHT VERTICAL
hh /032/032/037/209 BOX DRAWINGS LIGHT HORIZONTAL
dr /032/032/037/210 BOX DRAWINGS LIGHT DOWN AND RIGHT
dl /032/032/037/211 BOX DRAWINGS LIGHT DOWN AND LEFT
ur /032/032/037/212 BOX DRAWINGS LIGHT UP AND RIGHT
ul /032/032/037/213 BOX DRAWINGS LIGHT UP AND LEFT
vr /032/032/037/214 BOX DRAWINGS LIGHT VERTICAL AND RIGHT
vl /032/032/037/215 BOX DRAWINGS LIGHT VERTICAL AND LEFT
dh /032/032/037/216 BOX DRAWINGS LIGHT HORIZONTAL AND DOWN
uh /032/032/037/217 BOX DRAWINGS LIGHT HORIZONTAL AND UP
vh /032/032/037/218 BOX DRAWINGS LIGHT VERTICAL AND HORIZONTAL
.S /032/032/037/219 BOX DRAWING LIGHT SHADE (25%)
:S /032/032/037/220 BOX DRAWING MEDIUM SHADE (50%)
?S /032/032/037/221 BOX DRAWING DARK SHADE (75%)
lB /032/032/037/222 BOX DRAWING SOLID LEFT HALF BLOCK
RB /032/032/037/223 BOX DRAWING SOLID RIGHT HALF BLOCK
cC /032/032/037/224 CLUB SYMBOL
cD /032/032/037/225 DIAMOND SYMBOL
Dr /032/032/037/226 BOX DRAWINGS DOWN HEAVY AND RIGHT LIGHT
Dl /032/032/037/227 BOX DRAWINGS DOWN HEAVY AND LEFT LIGHT
Ur /032/032/037/228 BOX DRAWINGS UP HEAVY AND RIGHT LIGHT
Ul /032/032/037/229 BOX DRAWINGS UP HEAVY AND LEFT LIGHT
Vr /032/032/037/230 BOX DRAWINGS VERTICAL HEAVY AND RIGHT LIGHT
Vl /032/032/037/231 BOX DRAWINGS VERTICAL HEAVY AND LEFT LIGHT
dH /032/032/037/232 BOX DRAWINGS HORIZONTAL HEAVY AND DOWN LIGHT
uH /032/032/037/233 BOX DRAWINGS HORIZONTAL HEAVY AND UP LIGHT
vH /032/032/037/234 BOX DRAWINGS VERTICAL LIGHT AND HORIZONTAL HEAVY
Ob /032/032/037/235 CIRCLE BULLET EMPTY
Sb /032/032/037/236 CIRCLE BULLET SOLID
Sn /032/032/037/237 CIRCLE BULLET NEGATIVE
Pt /032/032/037/238 PESETA SYMBOL
Prindeville & Simonsen [Page 16]
RFC xxx Encoding of International Characters March 1991
NI /032/032/037/239 REVERSED NOT SIGN
cH /032/032/037/240 HEART SYMBOL
cS /032/032/037/241 SPADE SYMBOL
dR /032/032/037/242 BOX DRAWINGS DOWN LIGHT AND RIGHT HEAVY
dL /032/032/037/243 BOX DRAWINGS DOWN LIGHT AND LEFT HEAVY
uR /032/032/037/244 BOX DRAWINGS UP LIGHT AND RIGHT HEAVY
uL /032/032/037/245 BOX DRAWINGS UP LIGHT AND LEFT HEAVY
vR /032/032/037/246 BOX DRAWINGS VERTICAL LIGHT AND RIGHT HEAVY
vL /032/032/037/247 BOX DRAWINGS VERTICAL LIGHT AND LEFT HEAVY
Dh /032/032/037/248 BOX DRAWINGS HORIZONTAL LIGHT AND DOWN HEAVY
Uh /032/032/037/249 BOX DRAWINGS HORIZONTAL LIGHT AND UP HEAVY
Vh /032/032/037/250 BOX DRAWINGS VERTICAL HEAVY AND HORIZONTAL LIGHT
0m /032/032/037/251 MEDIUM CIRCLE EMPTY
0M /032/032/037/252 MEDIUM CIRCLE SOLID
Ic /032/032/037/253 MEDIUM CIRCLE NEGATIVE
SM /032/032/037/254 SERVICE MARK SIGN
CG /032/032/037/255 CONGRUENCE SIGN
Ci /032/032/038/037 CIRCLE
(A /032/032/038/041 ARC SIGN
>V /032/032/038/046 RIGHTWARDS VECTOR ABOVE
!< /032/032/038/049 NOT LESS-THAN SIGN
<* /032/032/038/056 MUCH-LESS-THAN SIGN
!> /032/032/038/065 NOT GREATER-THAN SIGN
*> /032/032/038/072 MUCH-GREATER-THAN SIGN
<7 /032/032/038/094 CEILING SIGN LEFT
7< /032/032/038/095 FLOOR SIGN LEFT
>7 /032/032/038/110 CEILING SIGN RIGHT
7> /032/032/038/111 FLOOR SIGN RIGHT
I2 /032/032/038/121 DOUBLE INTEGRAL SIGN
0. /032/032/038/164 DOT IN RING
HI /032/032/038/177 HAS-AN-IMAGE SIGN
:: /032/032/038/193 PROPORTION SIGN
FD /032/032/038/209 FORWARD DIAGONAL
LZ /032/032/038/223 LOZENGE
BD /032/032/038/225 BACKWARD DIAGONAL
1R /032/032/039/032 ROMAN NUMERAL ONE
2R /032/032/039/033 ROMAN NUMERAL TWO
3R /032/032/039/034 ROMAN NUMERAL THREE
4R /032/032/039/035 ROMAN NUMERAL FOUR
5R /032/032/039/036 ROMAN NUMERAL FIVE
6R /032/032/039/037 ROMAN NUMERAL SIX
7R /032/032/039/038 ROMAN NUMERAL SEVEN
8R /032/032/039/039 ROMAN NUMERAL EIGHT
9R /032/032/039/040 ROMAN NUMERAL NINE
aR /032/032/039/041 ROMAN NUMERAL TEN
bR /032/032/039/042 ROMAN NUMERAL ELEVEN
cR /032/032/039/043 ROMAN NUMERAL TWELVE
IO /032/032/040/161 CYRILLIC CAPITAL LETTER IO
D% /032/032/040/162 CYRILLIC CAPITAL LETTER DJE (Serbocroatian)
G% /032/032/040/163 CYRILLIC CAPITAL LETTER GJE (Macedonian)
IE /032/032/040/164 CYRILLIC CAPITAL LETTER UKRAINIAN IE
DS /032/032/040/165 CYRILLIC CAPITAL LETTER DZE (Macedonian)
II /032/032/040/166 CYRILLIC CAPITAL LETTER BYELORUSSIAN-UKRAINIAN I
YI /032/032/040/167 CYRILLIC CAPITAL LETTER YI (Ukrainian)
J% /032/032/040/168 CYRILLIC CAPITAL LETTER JE
LJ /032/032/040/169 CYRILLIC CAPITAL LETTER LJE
Prindeville & Simonsen [Page 17]
RFC xxx Encoding of International Characters March 1991
NJ /032/032/040/170 CYRILLIC CAPITAL LETTER NJE
Ts /032/032/040/171 CYRILLIC CAPITAL LETTER TSHE (Serbocroatian)
KJ /032/032/040/172 CYRILLIC CAPITAL LETTER KJE (Macedonian)
V% /032/032/040/174 CYRILLIC CAPITAL LETTER SHORT U (Byelorussian)
DZ /032/032/040/175 CYRILLIC CAPITAL LETTER DZHE
A= /032/032/040/176 CYRILLIC CAPITAL LETTER A
B= /032/032/040/177 CYRILLIC CAPITAL LETTER BE
V= /032/032/040/178 CYRILLIC CAPITAL LETTER VE
G= /032/032/040/179 CYRILLIC CAPITAL LETTER GHE
D= /032/032/040/180 CYRILLIC CAPITAL LETTER DE
E= /032/032/040/181 CYRILLIC CAPITAL LETTER IE
Z% /032/032/040/182 CYRILLIC CAPITAL LETTER ZHE
Z= /032/032/040/183 CYRILLIC CAPITAL LETTER ZE
I= /032/032/040/184 CYRILLIC CAPITAL LETTER I
J= /032/032/040/185 CYRILLIC CAPITAL LETTER SHORT I
K= /032/032/040/186 CYRILLIC CAPITAL LETTER KA
L= /032/032/040/187 CYRILLIC CAPITAL LETTER EL
M= /032/032/040/188 CYRILLIC CAPITAL LETTER EM
N= /032/032/040/189 CYRILLIC CAPITAL LETTER EN
O= /032/032/040/190 CYRILLIC CAPITAL LETTER O
P= /032/032/040/191 CYRILLIC CAPITAL LETTER PE
R= /032/032/040/192 CYRILLIC CAPITAL LETTER ER
S= /032/032/040/193 CYRILLIC CAPITAL LETTER ES
T= /032/032/040/194 CYRILLIC CAPITAL LETTER TE
U= /032/032/040/195 CYRILLIC CAPITAL LETTER U
F= /032/032/040/196 CYRILLIC CAPITAL LETTER EF
H= /032/032/040/197 CYRILLIC CAPITAL LETTER HA
C= /032/032/040/198 CYRILLIC CAPITAL LETTER TSE
C% /032/032/040/199 CYRILLIC CAPITAL LETTER CHE
S% /032/032/040/200 CYRILLIC CAPITAL LETTER SHA
Sc /032/032/040/201 CYRILLIC CAPITAL LETTER SHCHA
=" /032/032/040/202 CYRILLIC CAPITAL HARD SIGN
Y= /032/032/040/203 CYRILLIC CAPITAL LETTER YERU
%" /032/032/040/204 CYRILLIC CAPITAL SOFT SIGN
JE /032/032/040/205 CYRILLIC CAPITAL LETTER E
JU /032/032/040/206 CYRILLIC CAPITAL LETTER YU
JA /032/032/040/207 CYRILLIC CAPITAL LETTER YA
a= /032/032/040/208 CYRILLIC SMALL LETTER A
b= /032/032/040/209 CYRILLIC SMALL LETTER BE
v= /032/032/040/210 CYRILLIC SMALL LETTER VE
g= /032/032/040/211 CYRILLIC SMALL LETTER GHE
d= /032/032/040/212 CYRILLIC SMALL LETTER DE
e= /032/032/040/213 CYRILLIC SMALL LETTER IE
z% /032/032/040/214 CYRILLIC SMALL LETTER ZHE
z= /032/032/040/215 CYRILLIC SMALL LETTER ZE
i= /032/032/040/216 CYRILLIC SMALL LETTER I
j= /032/032/040/217 CYRILLIC SMALL LETTER SHORT I
k= /032/032/040/218 CYRILLIC SMALL LETTER KA
l= /032/032/040/219 CYRILLIC SMALL LETTER EL
m= /032/032/040/220 CYRILLIC SMALL LETTER EM
n= /032/032/040/221 CYRILLIC SMALL LETTER EN
o= /032/032/040/222 CYRILLIC SMALL LETTER O
p= /032/032/040/223 CYRILLIC SMALL LETTER PE
r= /032/032/040/224 CYRILLIC SMALL LETTER ER
s= /032/032/040/225 CYRILLIC SMALL LETTER ES
t= /032/032/040/226 CYRILLIC SMALL LETTER TE
Prindeville & Simonsen [Page 18]
RFC xxx Encoding of International Characters March 1991
u= /032/032/040/227 CYRILLIC SMALL LETTER U
f= /032/032/040/228 CYRILLIC SMALL LETTER EF
h= /032/032/040/229 CYRILLIC SMALL LETTER HA
c= /032/032/040/230 CYRILLIC SMALL LETTER TSE
c% /032/032/040/231 CYRILLIC SMALL LETTER CHE
s% /032/032/040/232 CYRILLIC SMALL LETTER SHA
sc /032/032/040/233 CYRILLIC SMALL LETTER SHCHA
=' /032/032/040/234 CYRILLIC SMALL HARD SIGN
y= /032/032/040/235 CYRILLIC SMALL LETTER YERU
%' /032/032/040/236 CYRILLIC SMALL SOFT SIGN
je /032/032/040/237 CYRILLIC SMALL LETTER E
ju /032/032/040/238 CYRILLIC SMALL LETTER YU
ja /032/032/040/239 CYRILLIC SMALL LETTER YA
N0 /032/032/040/240 NUMERO SIGN
io /032/032/040/241 CYRILLIC SMALL LETTER IO
d% /032/032/040/242 CYRILLIC SMALL LETTER DJE (Serbocroatian)
g% /032/032/040/243 CYRILLIC SMALL LETTER GJE (Macedonian)
ie /032/032/040/244 CYRILLIC SMALL LETTER UKRAINIAN IE
ds /032/032/040/245 CYRILLIC SMALL LETTER DZE (Macedonian)
ii /032/032/040/246 CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
yi /032/032/040/247 CYRILLIC SMALL LETTER YI (Ukrainian)
j% /032/032/040/248 CYRILLIC SMALL LETTER JE
lj /032/032/040/249 CYRILLIC SMALL LETTER LJE
nj /032/032/040/250 CYRILLIC SMALL LETTER NJE
ts /032/032/040/251 CYRILLIC SMALL LETTER TSHE (Serbocroatian)
kj /032/032/040/252 CYRILLIC SMALL LETTER KJE (Macedonian)
v% /032/032/040/254 CYRILLIC SMALL LETTER SHORT U (Byelorussian)
dz /032/032/040/255 CYRILLIC SMALL LETTER DZHE
i3 /032/032/042/160 GREEK IOTA BELOW
;; /032/032/042/161 GREEK DAISA PNEUMATA (rough)
,, /032/032/042/162 GREEK PSILI PNEUMATA (smooth)
!* /032/032/042/164 GREEK VARIA
?* /032/032/042/165 GREEK PERISPOMENI
;' /032/032/042/166 GREEK DAISA AND ACUTE ACCENT
,' /032/032/042/167 GREEK PSILI AND ACUTE ACCENT
;! /032/032/042/168 GREEK DAISA AND VARIA
,! /032/032/042/169 GREEK PSILI AND VARIA
?; /032/032/042/170 GREEK PERISPOMENI AND DAISA
?, /032/032/042/171 GREEK PERISPOMENI AND PSILI
!: /032/032/042/174 GREEK VARIA AND DIAERESIS
?: /032/032/042/175 GREEK PERISPOMENI AND DIAERESIS
I3 /032/032/042/176 GREEK CAPITAL LETTER IOTA WITH PERISPOMENI AND PSILI
'% /032/032/042/181 ACUTE ACCENT AND DIAERESIS (Tonos and Dialytica)
A% /032/032/042/182 GREEK CAPITAL LETTER ALPHA WITH ACUTE
E% /032/032/042/184 GREEK CAPITAL LETTER EPSILON WITH ACUTE
Y% /032/032/042/185 GREEK CAPITAL LETTER ETA WITH ACUTE
I% /032/032/042/186 GREEK CAPITAL LETTER IOTA WITH ACUTE
O% /032/032/042/188 GREEK CAPITAL LETTER OMICRON WITH ACUTE
U% /032/032/042/190 GREEK CAPITAL LETTER UPSILON WITH ACUTE
W% /032/032/042/191 GREEK CAPITAL LETTER OMEGA WITH ACUTE
A* /032/032/042/193 GREEK CAPITAL LETTER ALPHA
B* /032/032/042/194 GREEK CAPITAL LETTER BETA
G* /032/032/042/195 GREEK CAPITAL LETTER GAMMA
D* /032/032/042/196 GREEK CAPITAL LETTER DELTA
E* /032/032/042/197 GREEK CAPITAL LETTER EPSILON
Z* /032/032/042/198 GREEK CAPITAL LETTER ZETA
Prindeville & Simonsen [Page 19]
RFC xxx Encoding of International Characters March 1991
Y* /032/032/042/199 GREEK CAPITAL LETTER ETA
H* /032/032/042/200 GREEK CAPITAL LETTER THETA
I* /032/032/042/201 GREEK CAPITAL LETTER IOTA
K* /032/032/042/202 GREEK CAPITAL LETTER KAPPA
L* /032/032/042/203 GREEK CAPITAL LETTER LAMDA
M* /032/032/042/204 GREEK CAPITAL LETTER MU
N* /032/032/042/205 GREEK CAPITAL LETTER NU
C* /032/032/042/206 GREEK CAPITAL LETTER XI
O* /032/032/042/207 GREEK CAPITAL LETTER OMICRON
P* /032/032/042/208 GREEK CAPITAL LETTER PI
R* /032/032/042/209 GREEK CAPITAL LETTER RHO
S* /032/032/042/211 GREEK CAPITAL LETTER SIGMA
T* /032/032/042/212 GREEK CAPITAL LETTER TAU
U* /032/032/042/213 GREEK CAPITAL LETTER UPSILON
F* /032/032/042/214 GREEK CAPITAL LETTER PHI
X* /032/032/042/215 GREEK CAPITAL LETTER CHI
Q* /032/032/042/216 GREEK CAPITAL LETTER PSI
W* /032/032/042/217 GREEK CAPITAL LETTER OMEGA
J* /032/032/042/218 GREEK CAPITAL LETTER IOTA WITH DIAERESIS
V* /032/032/042/219 GREEK CAPITAL LETTER UPSILON WITH DIAERESIS
a% /032/032/042/220 GREEK SMALL LETTER ALPHA WITH ACUTE
e% /032/032/042/221 GREEK SMALL LETTER EPSILON WITH ACUTE
y% /032/032/042/222 GREEK SMALL LETTER ETA WITH ACUTE
i% /032/032/042/223 GREEK SMALL LETTER IOTA WITH ACUTE
a* /032/032/042/225 GREEK SMALL LETTER ALPHA
b* /032/032/042/226 GREEK SMALL LETTER BETA
g* /032/032/042/227 GREEK SMALL LETTER GAMMA
d* /032/032/042/228 GREEK SMALL LETTER DELTA
e* /032/032/042/229 GREEK SMALL LETTER EPSILON
z* /032/032/042/230 GREEK SMALL LETTER ZETA
y* /032/032/042/231 GREEK SMALL LETTER ETA
h* /032/032/042/232 GREEK SMALL LETTER THETA
i* /032/032/042/233 GREEK SMALL LETTER IOTA
k* /032/032/042/234 GREEK SMALL LETTER KAPPA
l* /032/032/042/235 GREEK SMALL LETTER LAMDA
m* /032/032/042/236 GREEK SMALL LETTER MU
n* /032/032/042/237 GREEK SMALL LETTER NU
c* /032/032/042/238 GREEK SMALL LETTER XI
o* /032/032/042/239 GREEK SMALL LETTER OMICRON
p* /032/032/042/240 GREEK SMALL LETTER PI
r* /032/032/042/241 GREEK SMALL LETTER RHO
*s /032/032/042/242 GREEK SMALL LETTER FINAL SIGMA
s* /032/032/042/243 GREEK SMALL LETTER SIGMA
t* /032/032/042/244 GREEK SMALL LETTER TAU
u* /032/032/042/245 GREEK SMALL LETTER UPSILON
f* /032/032/042/246 GREEK SMALL LETTER PHI
x* /032/032/042/247 GREEK SMALL LETTER CHI
q* /032/032/042/248 GREEK SMALL LETTER PSI
w* /032/032/042/249 GREEK SMALL LETTER OMEGA
j* /032/032/042/250 GREEK SMALL LETTER IOTA WITH DIAERESIS
v* /032/032/042/251 GREEK SMALL LETTER UPSILON WITH DIAERESIS
o% /032/032/042/252 GREEK SMALL LETTER OMICRON WITH ACUTE
u% /032/032/042/253 GREEK SMALL LETTER UPSILON WITH ACUTE
w% /032/032/042/254 GREEK SMALL LETTER OMEGA WITH ACUTE
p+ /032/032/044/035 ARABIC LETTER PEH
v+ /032/032/044/040 ARABIC LETTER VEH
Prindeville & Simonsen [Page 20]
RFC xxx Encoding of International Characters March 1991
gf /032/032/044/052 ARABIC LETTER GAF
,+ /032/032/044/172 ARABIC COMMA
;+ /032/032/044/187 ARABIC SEMICOLON
?+ /032/032/044/191 ARABIC QUESTION MARK
H' /032/032/044/193 ARABIC LETTER HAMZA
aM /032/032/044/194 ARABIC LETTER ALEF WITH MADDA ABOVE
aH /032/032/044/195 ARABIC LETTER ALEF WITH HAMZA ABOVE
wH /032/032/044/196 ARABIC LETTER WAW WITH HAMZA ABOVE
ah /032/032/044/197 ARABIC LETTER ALEF WITH HAMZA BELOW
yH /032/032/044/198 ARABIC LETTER YEH WITH HAMZA ABOVE
a+ /032/032/044/199 ARABIC LETTER ALEF
b+ /032/032/044/200 ARABIC LETTER BEH
tm /032/032/044/201 ARABIC LETTER TEH MARBUTA
t+ /032/032/044/202 ARABIC LETTER TEH
tk /032/032/044/203 ARABIC LETTER THEH
g+ /032/032/044/204 ARABIC LETTER JEEM
hk /032/032/044/205 ARABIC LETTER HAH
x+ /032/032/044/206 ARABIC LETTER KHAH
d+ /032/032/044/207 ARABIC LETTER DAL
dk /032/032/044/208 ARABIC LETTER THAL
r+ /032/032/044/209 ARABIC LETTER RA
z+ /032/032/044/210 ARABIC LETTER ZAIN
s+ /032/032/044/211 ARABIC LETTER SEEN
sn /032/032/044/212 ARABIC LETTER SHEEN
c+ /032/032/044/213 ARABIC LETTER SAD
dd /032/032/044/214 ARABIC LETTER DAD
tj /032/032/044/215 ARABIC LETTER TAH
zH /032/032/044/216 ARABIC LETTER ZAH
e+ /032/032/044/217 ARABIC LETTER AIN
i+ /032/032/044/218 ARABIC LETTER GHAIN
++ /032/032/044/224 ARABIC TATWEEL
f+ /032/032/044/225 ARABIC LETTER FEH
q+ /032/032/044/226 ARABIC LETTER QAF
k+ /032/032/044/227 ARABIC LETTER KAF
l+ /032/032/044/228 ARABIC LETTER LAM
m+ /032/032/044/229 ARABIC LETTER MEEM
n+ /032/032/044/230 ARABIC LETTER NOON
h+ /032/032/044/231 ARABIC LETTER HEH
w+ /032/032/044/232 ARABIC LETTER WAW
j+ /032/032/044/233 ARABIC LETTER ALEF MAKSURA
y+ /032/032/044/234 ARABIC LETTER YEH
:+ /032/032/044/235 ARABIC FATHATAN
"+ /032/032/044/236 ARABIC DAMMATAN
=+ /032/032/044/237 ARABIC KASRATAN
/+ /032/032/044/238 ARABIC FATHA
'+ /032/032/044/239 ARABIC DAMMA
1+ /032/032/044/240 ARABIC KASRA
3+ /032/032/044/241 ARABIC SHADDA
0+ /032/032/044/242 ARABIC SUKUN
A+ /032/032/045/224 HEBREW LETTER ALEF
B+ /032/032/045/225 HEBREW LETTER BET
G+ /032/032/045/226 HEBREW LETTER GIMEL
D+ /032/032/045/227 HEBREW LETTER DALET
H+ /032/032/045/228 HEBREW LETTER HE
W+ /032/032/045/229 HEBREW LETTER VAV
Z+ /032/032/045/230 HEBREW LETTER ZAYIN
Prindeville & Simonsen [Page 21]
RFC xxx Encoding of International Characters March 1991
X+ /032/032/045/231 HEBREW LETTER HET
Tj /032/032/045/232 HEBREW LETTER TET
J+ /032/032/045/233 HEBREW LETTER YOD
K% /032/032/045/234 HEBREW LETTER FINAL KAF
K+ /032/032/045/235 HEBREW LETTER KAF
L+ /032/032/045/236 HEBREW LETTER LAMED
M% /032/032/045/237 HEBREW LETTER FINAL MEM
M+ /032/032/045/238 HEBREW LETTER MEM
N% /032/032/045/239 HEBREW LETTER FINAL NUN
N+ /032/032/045/240 HEBREW LETTER NUN
S+ /032/032/045/241 HEBREW LETTER SAMEKH
E+ /032/032/045/242 HEBREW LETTER AYIN
P% /032/032/045/243 HEBREW LETTER FINAL PE
P+ /032/032/045/244 HEBREW LETTER PE
Zj /032/032/045/245 HEBREW LETTER FINAL TSADI
ZJ /032/032/045/246 HEBREW LETTER TSADI
Q+ /032/032/045/247 HEBREW LETTER QOF
R+ /032/032/045/248 HEBREW LETTER RESH
Sh /032/032/045/249 HEBREW LETTER SIN
T+ /032/032/045/250 HEBREW LETTER TAV
IS /032/032/046/032 IDEOGRAPHIC SPACE
,_ /032/032/046/033 IDEOGRAPHIC COMMA
._ /032/032/046/034 IDEOGRAPHIC FULL STOP
+" /032/032/046/035 DITTO MARK
+_ /032/032/046/036 IDEOGRAPHIC DITTO MARK
*_ /032/032/046/037 IDEOGRAPHIC REPETITION MARK
;_ /032/032/046/038 IDEOGRAPHIC CLOSING MARK
0_ /032/032/046/039 IDEOGRAPHIC NUMBER ZERO
<+ /032/032/046/042 LEFT-POINTING DOUBLE ANGLE BRACKET
>+ /032/032/046/043 RIGHT-POINTING DOUBLE ANGLE BRACKET
<' /032/032/046/044 IDEOGRAPHIC LEFT BRACKET
>' /032/032/046/045 IDEOGRAPHIC RIGHT BRACKET
<" /032/032/046/046 IDEOGRAPHIC LEFT DOUBLE BRACKET
>" /032/032/046/047 IDEOGRAPHIC RIGHT DOUBLE BRACKET
(" /032/032/046/048 LEFT BOLDFACE SQUARE BRACKET
)" /032/032/046/049 RIGHT BOLDFACE SQUARE BRACKET
=/ /032/032/046/050 POSTAL MARK
=_ /032/032/046/051 GETA MARK
(' /032/032/046/052 LEFT TORTOISE-SHELL BRACKET
)' /032/032/046/053 RIGHT TORTOISE-SHELL BRACKET
KM /032/032/046/054 KOME MARK
b4 /032/032/046/069 BOPOMOFO LETTER B
p4 /032/032/046/070 BOPOMOFO LETTER P
m4 /032/032/046/071 BOPOMOFO LETTER M
f4 /032/032/046/072 BOPOMOFO LETTER F
d4 /032/032/046/073 BOPOMOFO LETTER D
t4 /032/032/046/074 BOPOMOFO LETTER T
n4 /032/032/046/075 BOPOMOFO LETTER N
l4 /032/032/046/076 BOPOMOFO LETTER L
g4 /032/032/046/077 BOPOMOFO LETTER G
k4 /032/032/046/078 BOPOMOFO LETTER K
h4 /032/032/046/079 BOPOMOFO LETTER H
j4 /032/032/046/080 BOPOMOFO LETTER J
q4 /032/032/046/081 BOPOMOFO LETTER Q
x4 /032/032/046/082 BOPOMOFO LETTER X
zh /032/032/046/083 BOPOMOFO LETTER ZH
Prindeville & Simonsen [Page 22]
RFC xxx Encoding of International Characters March 1991
ch /032/032/046/084 BOPOMOFO LETTER CH
sh /032/032/046/085 BOPOMOFO LETTER SH
r4 /032/032/046/086 BOPOMOFO LETTER R
z4 /032/032/046/087 BOPOMOFO LETTER Z
c4 /032/032/046/088 BOPOMOFO LETTER C
s4 /032/032/046/089 BOPOMOFO LETTER S
a4 /032/032/046/090 BOPOMOFO LETTER A
o4 /032/032/046/091 BOPOMOFO LETTER O
e4 /032/032/046/092 BOPOMOFO LETTER E
eh /032/032/046/093 BOPOMOFO LETTER EH
ai /032/032/046/094 BOPOMOFO LETTER AI
ei /032/032/046/095 BOPOMOFO LETTER EI
au /032/032/046/096 BOPOMOFO LETTER AU
ou /032/032/046/097 BOPOMOFO LETTER OU
an /032/032/046/098 BOPOMOFO LETTER AN
en /032/032/046/099 BOPOMOFO LETTER EN
aN /032/032/046/100 BOPOMOFO LETTER ANG
eN /032/032/046/101 BOPOMOFO LETTER ENG
er /032/032/046/102 BOPOMOFO LETTER ER
i4 /032/032/046/103 BOPOMOFO LETTER I
u4 /032/032/046/104 BOPOMOFO LETTER U
iu /032/032/046/105 BOPOMOFO LETTER IU
A5 /032/032/047/033 HIRAGANA LETTER SMALL A
a5 /032/032/047/034 HIRAGANA LETTER A
I5 /032/032/047/035 HIRAGANA LETTER SMALL I
i5 /032/032/047/036 HIRAGANA LETTER I
U5 /032/032/047/037 HIRAGANA LETTER SMALL U
u5 /032/032/047/038 HIRAGANA LETTER U
E5 /032/032/047/039 HIRAGANA LETTER SMALL E
e5 /032/032/047/040 HIRAGANA LETTER E
O5 /032/032/047/041 HIRAGANA LETTER SMALL O
o5 /032/032/047/042 HIRAGANA LETTER O
ka /032/032/047/043 HIRAGANA LETTER KA
ga /032/032/047/044 HIRAGANA LETTER GA
ki /032/032/047/045 HIRAGANA LETTER KI
gi /032/032/047/046 HIRAGANA LETTER GI
ku /032/032/047/047 HIRAGANA LETTER KU
gu /032/032/047/048 HIRAGANA LETTER GU
ke /032/032/047/049 HIRAGANA LETTER KE
ge /032/032/047/050 HIRAGANA LETTER GE
ko /032/032/047/051 HIRAGANA LETTER KO
go /032/032/047/052 HIRAGANA LETTER GO
sa /032/032/047/053 HIRAGANA LETTER SA
za /032/032/047/054 HIRAGANA LETTER ZA
si /032/032/047/055 HIRAGANA LETTER SI
zi /032/032/047/056 HIRAGANA LETTER ZI
su /032/032/047/057 HIRAGANA LETTER SU
zu /032/032/047/058 HIRAGANA LETTER ZU
se /032/032/047/059 HIRAGANA LETTER SE
ze /032/032/047/060 HIRAGANA LETTER ZE
so /032/032/047/061 HIRAGANA LETTER SO
zo /032/032/047/062 HIRAGANA LETTER ZO
ta /032/032/047/063 HIRAGANA LETTER TA
da /032/032/047/064 HIRAGANA LETTER DA
ti /032/032/047/065 HIRAGANA LETTER TI
di /032/032/047/066 HIRAGANA LETTER DI
Prindeville & Simonsen [Page 23]
RFC xxx Encoding of International Characters March 1991
tU /032/032/047/067 HIRAGANA LETTER SMALL TU
tu /032/032/047/068 HIRAGANA LETTER TU
du /032/032/047/069 HIRAGANA LETTER DU
te /032/032/047/070 HIRAGANA LETTER TE
de /032/032/047/071 HIRAGANA LETTER DE
to /032/032/047/072 HIRAGANA LETTER TO
do /032/032/047/073 HIRAGANA LETTER DO
na /032/032/047/074 HIRAGANA LETTER NA
ni /032/032/047/075 HIRAGANA LETTER NI
nu /032/032/047/076 HIRAGANA LETTER NU
ne /032/032/047/077 HIRAGANA LETTER NE
no /032/032/047/078 HIRAGANA LETTER NO
ha /032/032/047/079 HIRAGANA LETTER HA
ba /032/032/047/080 HIRAGANA LETTER BA
pa /032/032/047/081 HIRAGANA LETTER PA
hi /032/032/047/082 HIRAGANA LETTER HI
bi /032/032/047/083 HIRAGANA LETTER BI
pi /032/032/047/084 HIRAGANA LETTER PI
hu /032/032/047/085 HIRAGANA LETTER HU
bu /032/032/047/086 HIRAGANA LETTER BU
pu /032/032/047/087 HIRAGANA LETTER PU
he /032/032/047/088 HIRAGANA LETTER HE
be /032/032/047/089 HIRAGANA LETTER BE
pe /032/032/047/090 HIRAGANA LETTER PE
ho /032/032/047/091 HIRAGANA LETTER HO
bo /032/032/047/092 HIRAGANA LETTER BO
po /032/032/047/093 HIRAGANA LETTER PO
ma /032/032/047/094 HIRAGANA LETTER MA
mi /032/032/047/095 HIRAGANA LETTER MI
mu /032/032/047/096 HIRAGANA LETTER MU
me /032/032/047/097 HIRAGANA LETTER ME
mo /032/032/047/098 HIRAGANA LETTER MO
yA /032/032/047/099 HIRAGANA LETTER SMALL YA
ya /032/032/047/100 HIRAGANA LETTER YA
yU /032/032/047/101 HIRAGANA LETTER SMALL YU
yu /032/032/047/102 HIRAGANA LETTER YU
yO /032/032/047/103 HIRAGANA LETTER SMALL YO
yo /032/032/047/104 HIRAGANA LETTER YO
ra /032/032/047/105 HIRAGANA LETTER RA
ri /032/032/047/106 HIRAGANA LETTER RI
ru /032/032/047/107 HIRAGANA LETTER RU
re /032/032/047/108 HIRAGANA LETTER RE
ro /032/032/047/109 HIRAGANA LETTER RO
wA /032/032/047/110 HIRAGANA LETTER SMALL WA
wa /032/032/047/111 HIRAGANA LETTER WA
wi /032/032/047/112 HIRAGANA LETTER WI
we /032/032/047/113 HIRAGANA LETTER WE
wo /032/032/047/114 HIRAGANA LETTER WO
n5 /032/032/047/115 HIRAGANA LETTER N
"5 /032/032/047/122 HIRAGANA-KATAKANA VOICED SOUND MARK
05 /032/032/047/123 HIRAGANA-KATAKANA SEMI-VOICED SOUND MARK
*5 /032/032/047/124 HIRAGANA ITERATION MARK
+5 /032/032/047/125 HIRAGANA VOICED ITERATION MARK
a6 /032/032/047/161 KATAKANA LETTER SMALL A
A6 /032/032/047/162 KATAKANA LETTER A
i6 /032/032/047/163 KATAKANA LETTER SMALL I
Prindeville & Simonsen [Page 24]
RFC xxx Encoding of International Characters March 1991
I6 /032/032/047/164 KATAKANA LETTER I
u6 /032/032/047/165 KATAKANA LETTER SMALL U
U6 /032/032/047/166 KATAKANA LETTER U
e6 /032/032/047/167 KATAKANA LETTER SMALL E
E6 /032/032/047/168 KATAKANA LETTER E
o6 /032/032/047/169 KATAKANA LETTER SMALL O
O6 /032/032/047/170 KATAKANA LETTER O
Ka /032/032/047/171 KATAKANA LETTER KA
Ga /032/032/047/172 KATAKANA LETTER GA
Ki /032/032/047/173 KATAKANA LETTER KI
Gi /032/032/047/174 KATAKANA LETTER GI
Ku /032/032/047/175 KATAKANA LETTER KU
Gu /032/032/047/176 KATAKANA LETTER GU
Ke /032/032/047/177 KATAKANA LETTER KE
Ge /032/032/047/178 KATAKANA LETTER GE
Ko /032/032/047/179 KATAKANA LETTER KO
Go /032/032/047/180 KATAKANA LETTER GO
Sa /032/032/047/181 KATAKANA LETTER SA
Za /032/032/047/182 KATAKANA LETTER ZA
Si /032/032/047/183 KATAKANA LETTER SI
Zi /032/032/047/184 KATAKANA LETTER ZI
Su /032/032/047/185 KATAKANA LETTER SU
Zu /032/032/047/186 KATAKANA LETTER ZU
Se /032/032/047/187 KATAKANA LETTER SE
Ze /032/032/047/188 KATAKANA LETTER ZE
So /032/032/047/189 KATAKANA LETTER SO
Zo /032/032/047/190 KATAKANA LETTER ZO
Ta /032/032/047/191 KATAKANA LETTER TA
Da /032/032/047/192 KATAKANA LETTER DA
Ti /032/032/047/193 KATAKANA LETTER TI
Di /032/032/047/194 KATAKANA LETTER DI
TU /032/032/047/195 KATAKANA LETTER SMALL TU
Tu /032/032/047/196 KATAKANA LETTER TU
Du /032/032/047/197 KATAKANA LETTER DU
Te /032/032/047/198 KATAKANA LETTER TE
De /032/032/047/199 KATAKANA LETTER DE
To /032/032/047/200 KATAKANA LETTER TO
Do /032/032/047/201 KATAKANA LETTER DO
Na /032/032/047/202 KATAKANA LETTER NA
Ni /032/032/047/203 KATAKANA LETTER NI
Nu /032/032/047/204 KATAKANA LETTER NU
Ne /032/032/047/205 KATAKANA LETTER NE
No /032/032/047/206 KATAKANA LETTER NO
Ha /032/032/047/207 KATAKANA LETTER HA
Ba /032/032/047/208 KATAKANA LETTER BA
Pa /032/032/047/209 KATAKANA LETTER PA
Hi /032/032/047/210 KATAKANA LETTER HI
Bi /032/032/047/211 KATAKANA LETTER BI
Pi /032/032/047/212 KATAKANA LETTER PI
Hu /032/032/047/213 KATAKANA LETTER HU
Bu /032/032/047/214 KATAKANA LETTER BU
Pu /032/032/047/215 KATAKANA LETTER PU
He /032/032/047/216 KATAKANA LETTER HE
Be /032/032/047/217 KATAKANA LETTER BE
Pe /032/032/047/218 KATAKANA LETTER PE
Ho /032/032/047/219 KATAKANA LETTER HO
Prindeville & Simonsen [Page 25]
RFC xxx Encoding of International Characters March 1991
Bo /032/032/047/220 KATAKANA LETTER BO
Po /032/032/047/221 KATAKANA LETTER PO
Ma /032/032/047/222 KATAKANA LETTER MA
Mi /032/032/047/223 KATAKANA LETTER MI
Mu /032/032/047/224 KATAKANA LETTER MU
Me /032/032/047/225 KATAKANA LETTER ME
Mo /032/032/047/226 KATAKANA LETTER MO
YA /032/032/047/227 KATAKANA LETTER SMALL YA
Ya /032/032/047/228 KATAKANA LETTER YA
YU /032/032/047/229 KATAKANA LETTER SMALL YU
Yu /032/032/047/230 KATAKANA LETTER YU
YO /032/032/047/231 KATAKANA LETTER SMALL YO
Yo /032/032/047/232 KATAKANA LETTER YO
Ra /032/032/047/233 KATAKANA LETTER RA
Ri /032/032/047/234 KATAKANA LETTER RI
Ru /032/032/047/235 KATAKANA LETTER RU
Re /032/032/047/236 KATAKANA LETTER RE
Ro /032/032/047/237 KATAKANA LETTER RO
WA /032/032/047/238 KATAKANA LETTER SMALL WA
Wa /032/032/047/239 KATAKANA LETTER WA
Wi /032/032/047/240 KATAKANA LETTER WI
We /032/032/047/241 KATAKANA LETTER WE
Wo /032/032/047/242 KATAKANA LETTER WO
N6 /032/032/047/243 KATAKANA LETTER N
Vu /032/032/047/244 KATAKANA LETTER VU
KA /032/032/047/245 KATAKANA LETTER SMALL KA
KE /032/032/047/246 KATAKANA LETTER SMALL KE
-6 /032/032/047/252 HIRAGANA-KATAKANA PROLONGED SOUND MARK
*6 /032/032/047/253 KATAKANA ITERATION MARK
+6 /032/032/047/254 KATAKANA VOICED ITERATION MARK
ff /032/032/060/040 LATIN SMALL LIGATURE FF
fi /032/032/060/041 LATIN SMALL LIGATURE FI
fl /032/032/060/042 LATIN SMALL LIGATURE FL
ft /032/032/060/045 LATIN SMALL LIGATURE FT
st /032/032/060/046 LATIN SMALL LIGATURE ST
Iu /032/032/060/048 INTEGRAL SIGN UPPER PART
Il /032/032/060/049 INTEGRAL SIGN LOWER PART
NU /032/032/032/000 NULL (NUL)
SH /032/032/032/001 START OF HEADING (SOH)
SX /032/032/032/002 START OF TEXT (STX)
EX /032/032/032/003 END OF TEXT (ETX)
ET /032/032/032/004 END OF TRANSMISSION (EOT)
EQ /032/032/032/005 ENQUIRY (ENQ)
AK /032/032/032/006 ACKNOWLEDGE (ACK)
BL /032/032/032/007 BELL (BEL)
BS /032/032/032/008 BACKSPACE (BS)
HT /032/032/032/009 CHARACTER TABULATION (HT)
LF /032/032/032/010 LINE FEED (LF)
VT /032/032/032/011 LINE TABULATION (VT)
FF /032/032/032/012 FORM FEED (FF)
CR /032/032/032/013 CARRIAGE RETURN (CR)
SO /032/032/032/014 SHIFT OUT (SO)
SI /032/032/032/015 SHIFT IN (SI)
DL /032/032/032/016 DATALINK ESCAPE (DLE)
D1 /032/032/032/017 DEVICE CONTROL ONE (DC1)
D2 /032/032/032/018 DEVICE CONTROL TWO (DC2)
Prindeville & Simonsen [Page 26]
RFC xxx Encoding of International Characters March 1991
D3 /032/032/032/019 DEVICE CONTROL THREE (DC3)
D4 /032/032/032/020 DEVICE CONTROL FOUR (DC4)
NK /032/032/032/021 NEGATIVE ACKNOWLEDGE (NAK)
SY /032/032/032/022 SYNCRONOUS IDLE (SYN)
EB /032/032/032/023 END OF TRANSMISSION BLOCK (ETB)
CN /032/032/032/024 CANCEL (CAN)
EM /032/032/032/025 END OF MEDIUM (EM)
SB /032/032/032/026 SUBSTITUTE (SUB)
EC /032/032/032/027 ESCAPE (ESC)
FS /032/032/032/028 FILE SEPARATOR (IS4)
GS /032/032/032/029 GROUP SEPARATOR (IS3)
RS /032/032/032/030 RECORD SEPARATOR (IS2)
US /032/032/032/031 UNIT SEPARATOR (IS1)
DT /032/032/032/127 DELETE (DEL)
PA /032/032/032/128 PADDING CHARACTER (PAD)
HO /032/032/032/129 HIGH OCTET PRESET (HOP)
BH /032/032/032/130 BREAK PERMITTED HERE (BPH)
NH /032/032/032/131 NO BREAK HERE (NBH)
IN /032/032/032/132 INDEX (IND)
NL /032/032/032/133 NEXT LINE (NEL)
SA /032/032/032/134 START OF SELECTED AREA (SSA)
ES /032/032/032/135 END OF SELECTED AREA (ESA)
HS /032/032/032/136 CHARACTER TABULATION SET (HTS)
HJ /032/032/032/137 CHARACTER TABULATION WITH JUSTIFICATION (HTJ)
VS /032/032/032/138 LINE TABULATION SET (VTS)
PD /032/032/032/139 PARTIAL LINE FORWARD (PLD)
PU /032/032/032/140 PARTIAL LINE BACKWARD (PLU)
RI /032/032/032/141 REVERSE LINE FEED (RI)
S2 /032/032/032/142 SINGLE-SHIFT TWO (SS2)
S3 /032/032/032/143 SINGLE-SHIFT THREE (SS3)
DC /032/032/032/144 DEVICE CONTROL STRING (DCS)
P1 /032/032/032/145 PRIVATE USE ONE (PU1)
P2 /032/032/032/146 PRIVATE USE TWO (PU2)
TS /032/032/032/147 SET TRANSMIT STATE (STS)
CC /032/032/032/148 CANCEL CHARACTER (CCH)
MW /032/032/032/149 MESSAGE WAITING (MW)
SG /032/032/032/150 START OF GUARDED AREA (SPA)
EG /032/032/032/151 END OF GUARDED AREA (EPA)
SS /032/032/032/152 START OF STRING (SOS)
GC /032/032/032/153 SINGLE GRAPHIC CHARACTER INTRODUCER (SGCI)
SC /032/032/032/154 SINGLE CHARACTER INTRODUCER (SCI)
CI /032/032/032/155 CONTROL SEQUENCE INTRODUCER (CSI)
ST /032/032/032/156 STRING TERMINATOR (ST)
OC /032/032/032/157 OPERATING SYSTEM COMMAND (OSC)
PM /032/032/032/158 PRIVACY MESSAGE (PM)
AC /032/032/032/159 APPLICATION PROGRAM COMMAND (APC)
__ /032/032/052/032 indicates unfinished
"! /032/032/052/033 NON-SPACING GRAVE ACCENT (ISO IR 70 193)
"' /032/032/052/034 NON-SPACING ACUTE ACCENT (ISO IR 70 194)
"> /032/032/052/035 NON-SPACING CIRCUMFLEX ACCENT (ISO IR 70 195)
"? /032/032/052/036 NON-SPACING TILDE (ISO IR 70 196)
"- /032/032/052/037 NON-SPACING MACRON (ISO IR 70 197)
"( /032/032/052/038 NON-SPACING BREVE (ISO IR 70 198)
". /032/032/052/039 NON-SPACING DOT ABOVE (ISO IR 70 199)
": /032/032/052/040 NON-SPACING DIAERESIS (ISO IR 70 200)
"/ /032/032/052/041 NON-SPACING SOLIDUS (ISO IR 99 201)
Prindeville & Simonsen [Page 27]
RFC xxx Encoding of International Characters March 1991
"0 /032/032/052/042 NON-SPACING RING ABOVE (ISO IR 70 202)
", /032/032/052/043 NON-SPACING CEDILLA (ISO IR 70 203)
"_ /032/032/052/044 NON-SPACING UNDERLINE (ISO IR 99 216)
"" /032/032/052/045 NON-SPACING DOUBLE ACCUTE ACCENT (ISO IR 70 205)
"< /032/032/052/046 NON-SPACING CARON (ISO IR 70 207)
"; /032/032/052/047 NON-SPACING OGONEK (ISO IR 53 208)
"= /032/032/052/048 NON-SPACING DOUBLE UNDERLINE (ISO IR 53 217)
"1 /032/032/052/049 NON-SPACING DIAERESIS WITH ACCENT (ISO IR 70 192)
"2 /032/032/052/050 NON-SPACING UMLAUT (ISO 5426 201)
Fd /032/032/052/051 FILLED FORWARD DIAGONAL (ANSI X3.110-1983 218)
Bd /032/032/052/052 FILLED BACKWARD DIAGONAL (ANSI X3.110-1983 219)
Fl /032/032/052/053 Dutch guilder sign (IBM CP 437 159)
Li /032/032/052/054 Italian Lira sign (HP ROMAN 8 175)
/f /032/032/052/055 VULGAR FRACTION BAR (MacIntosh 218)
0s /032/032/052/056 SUBSCRIPT ZERO (ISO IR 50 096)
1s /032/032/052/057 SUBSCRIPT ONE (ISO IR 50 097)
2s /032/032/052/058 SUBSCRIPT TWO (ISO IR 50 098)
3s /032/032/052/059 SUBSCRIPT THREE (ISO IR 50 099)
4s /032/032/052/060 SUBSCRIPT FOUR (ISO IR 50 100)
5s /032/032/052/061 SUBSCRIPT FIVE (ISO IR 50 101)
6s /032/032/052/062 SUBSCRIPT SIX (ISO IR 50 102)
7s /032/032/052/063 SUBSCRIPT SEVEN (ISO IR 50 103)
8s /032/032/052/064 SUBSCRIPT EIGHT (ISO IR 50 104)
9s /032/032/052/065 SUBSCRIPT NINE (ISO IR 50 105)
0S /032/032/052/066 SUPERSCRIPT ZERO (ISO IR 50 112)
4S /032/032/052/067 SUPERSCRIPT FOUR (ISO IR 50 116)
5S /032/032/052/068 SUPERSCRIPT FIVE (ISO IR 50 117)
6S /032/032/052/069 SUPERSCRIPT SIX (ISO IR 50 118)
7S /032/032/052/070 SUPERSCRIPT SEVEN (ISO IR 50 119)
8S /032/032/052/071 SUPERSCRIPT EIGHT (ISO IR 50 120)
9S /032/032/052/072 SUPERSCRIPT NINE (ISO IR 50 121)
+S /032/032/052/073 SUPERSCRIPT PLUS (ISO IR 50 106)
-S /032/032/052/074 SUPERSCRIPT MINUS (ISO IR 50 107)
1h /032/032/052/075 ABSTRACT SYMBOL H ONE (HOOK) (JIS C 6229-1984 060)
2h /032/032/052/076 ABSTRACT SYMBOL H TWO (FORK) (JIS C 6229-1984 093)
3h /032/032/052/077 ABSTRACT SYMBOL H THREE (CHAIR) (JIS C 6229-1984 062)
4h /032/032/052/078 ABSTRACT SYMBOL H FOUR (LONG VERTICAL MARK) (JIS C 6229-1984 125)
1j /032/032/052/079 SYMBOL ONE (ISO 2033-1983 058)
2j /032/032/052/080 SYMBOL TWO (ISO 2033-1983 059)
3j /032/032/052/081 SYMBOL THREE (ISO 2033-1983 060)
4j /032/032/052/082 SYMBOL FOUR (ISO 2033-1983 061)
UA /032/032/052/083 Unit space A (ISO IR 8-1 064)
UB /032/032/052/084 Unit space B (ISO IR 8-1 096)
yf /032/032/052/085 ARABIC LETTER YEH FINAL (CODAR U 090)
yr /032/032/052/086 OLD NORSE YR (DIN 31624 251)
.6 /032/032/052/087 KATAKANA FULL STOP (JIS C 6220 033)
<6 /032/032/052/088 KATAKANA OPENING BRACKET (JIS C 6220 034)
>6 /032/032/052/089 KATAKANA CLOSING BRACKET (JIS C 6220 035)
,6 /032/032/052/090 KATAKANA COMMA (JIS C 6220 036)
&6 /032/032/052/091 KATAKANA CONJUNCTION SYMBOL (JIS C 6220 037)
(S /032/032/052/092 LEFT PARENTHESIS SUPERSCRIPT (CSA Z243.4-1985-gr 168)
)S /032/032/052/093 RIGHT PARENTHESIS SUPERSCRIPT (CSA Z243.4-1985-gr 169)
References
Prindeville & Simonsen [Page 28]
RFC xxx Encoding of International Characters March 1991
[1]
D. Robinson, R. Ullman, ``Encoding Header Field for
Internet Messages,'' RFC 1154, April 1990.
[2]
M. Sirbu, "Content-Type Header Field for Internet
Messages,'' RFC 1049, March 1988.
[3]
J.W. van Wingen, ``Networks and Coded Character Sets''
in Computer Networks and ISDN Systems, Nos. 3-5,
November 1990.
[4]
R. Blokzijl, ``RIPE: IP coordination in Europe'' in
Computer Networks and ISDN Systems, Nos. 3-5, November
1990.
[5]
Mumble
[6]
Mumble.
[7]
RFC mumble.
[8]
J. Linn, ``Privacy Enhancement for the Internet
Electronic Mail: Part I - Message Encipherment and
Authentication Procedures,'' RFC 1113, August 1989.
[9]
International Organization for Standardization,
Information Processing - 8-bit single-byte coded
graphic character sets - Part 1: Latin alphabet No. 1,
ISO 8859-1 , 1987 (and successive standards).
[10]
International Organization for Standardization, Insert
title here, ISO 10646 . Date.
[11]
IEEE 1003.2 Draft 11 POSIX Shell and Utilities
informative annex F: example national locales and
charmaps, March 1991.
Prindeville & Simonsen [Page 29]