Overview of CJK Lexical Resources
The tables below list CJKI's lexical resources, which currently contains over 24 million entries. Not included is most of our extensive data for CJK single character input methods, nor our data for European languages, such as our Spanish resources. Our Simplified <> Traditional Chinese mapping tables have also been excluded.
Language | Description | Entries | Remarks |
---|---|---|---|
J<>E | Companies and organizations | 600,000 |
|
J<>E | Personal names | 1,584,000 |
|
J<>E | Personal name variants | 3,500,000 |
|
J<>E | Place names | 86,400 |
|
J<>C | Personal Names | 1,412,100 |
|
J<>C | Place names | 84,100 |
|
J<>K | Personal Names | 1,563,000 |
|
J<>K | Place names | 87,700 |
|
J | General vocabulary monolingual | 300,000 |
excludes proper nouns |
J>E | General vocabulary kanji | 40,000 |
based on NJECD to be expanded |
E>J | General vocabulary bilingual | 82,000 |
|
J | Phonological/phonemic, general/proper | 130,000 |
|
J>E | General vocabulary bilingual | 110,000 |
|
J | General vocabulary katakana | 50,000 |
some English |
J | Pornographic terms | 720 |
some English |
J<>E | Technical terms | 1,000,000 |
max. 1.5 million |
J<>E | Other | 50,000 |
|
Total: | 10,680,020 |
Language | Description | Entries | Remarks |
---|---|---|---|
SC<>E | Personal names | 1,560,300 |
|
SC<>E | Personal name variants | 243,000 |
|
TC<>E | Personal names | 1,560,300 |
|
SC<>E | Place names | 84,700 |
|
TC<>E | Place names | 84,700 |
|
SC<>J | Personal names | 1,412,100 |
|
TC<>J | Personal names | 1,412,100 |
|
SC<>J | Place names | 84,100 |
|
TC<>J | Place names | 84,100 |
|
SC<>K | Personal names | 1,774,800 |
|
TC<>K | Personal names | 1,774,800 |
|
SC<>K | Place names | 108,800 |
|
TC<>K | Place names | 108,800 |
|
SC<>E | Companies and organizations | 55,000 |
|
TC<>E | Companies and organizations | 55,000 |
|
SC<>E | Computer terms | 100,000 |
|
TC<>E | Computer terms | 100,000 |
|
SC<>E | Technical terms | 4,750,000 |
|
SC<>J | Technical terms | 820,000 |
|
SC | General vocabulary monolingual | 250,000 |
excludes proper nouns |
TC | General vocabulary monolingual | 250,000 |
excludes proper nouns |
E>SC | General vocabulary bilingual | 80,000 |
|
SC>E | General vocabulary bilingual | 700,000 |
|
E>TC | General vocabulary bilingual | 85,000 |
|
CAN | Cantonese input method | 25,000 |
|
Total | 17,562,600
|
Language | Description | Entries | Remarks |
---|---|---|---|
CJKEA | Place/personal names multilingual | 150,000 |
|
CJE | Technical terms multilingual | 150,000 |
under development, eventually 500,000 |
Total | 300,000 |
Language | Description | Entries | Remarks |
---|---|---|---|
AE | Romanized name variants | 6,500,000 |
|
A | Arabic name variants | 220,000 |
|
CJKEA | Place/personal names multilingual | 150,000 |
Arabic partially available |
EA | Romanized place names | 6,000 |
|
Total | 6,876,000 |
Language | Description | Entries | Remarks |
---|---|---|---|
K<>E | Personal names | 1,138,500 |
|
K<>E | Place names | 81,700 |
|
K<>J | Personal names | 1,563,000 |
|
K<>J | Place names | 87,700 |
|
K<>SC | Personal names | 1,774,800 |
|
K<>SC | Place names | 108,800 |
|
K<>TC | Personal names | 1,774,800 |
|
K<>TC | Place names | 108,800 |
|
K | Companies and organizations | 30,000 |
some English in progress |
K | Pornographic terms | 610 |
some English |
K | General vocabulary monolingual | 100,000 |
in progress |
E>K | General vocabulary bilingual | 80,000 |
in progress |
K | Korean input method | 11,172 |
|
Total | 6,859,882 |