[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Non-ASCII hdrs
> Keld says:
> I have several ways of doing things like that up my sleeve.
> I have romanisation tables for CJK.
> I have some pattern description for CJK.
> I have dictionary numbers for CJK.
>
> Would any of these be better than the <charset><row><column>
> names?
>
> Not in the general case since there are multiple languages using the
> same ideograms. I'm not a Japanese-speaker and cannot read Kana at
> all. I can read most Kanji because I do know Chinese.
>
> Erik and Mark have already commented that there is no 1-1 mapping
> between romanisation and kanji for Japanese. This is also true for
> Chinese because of homonyms within any given dialect and the many
> mutually unintelligible dialects of Chinese.
>
The chinese GB standard has been tabled with chinese romanisation
and the japanese with japanese romanisation.
Also JIS X0212 has been tabled with chinese romanization.
> The several dictionaries of Chinese that I use have different dictionary
> numbers for the same character.
We have chosen two of those dictiotionaries for the numbering
and have each of the charactes coded with those two numbers.
> What is meant by "pattern description" ?? Do you mean radicals ?
> If so that is unlikely to work also. Research done in Asia and at
> places like PARC have shown that the existing radical-encoding schemes
> all have many-to-one mappings in parts of their encoding tables. That
> isn't likely to be useful.
Pattern description is not radical/stroke. We do have radical stroke too.
Patern description is based on the left downward corner of
the character, as far as I know.
keld