
The CJK Dictionary Institute (CJKI), which specializes in CJK computational lexicography, is engaged in the continuous expansion of a comprehensive CJK lexical database called DESK. Currently, DESK has over two million Japanese, one million Simplified Chinese and one million Traditional Chinese entries, and includes a rich set of grammatical and semantic attributes required for developing information retrieval applications, input method editors, and electronic dictionaries.
| Description | Simplified Chinese | Traditional Chinese |
|---|---|---|
| General vocabulary | 250,000 | 250,000 |
| Companies and organizations | 50,000 | 50,000 |
| Personal names | 650,000 | 650,000 |
| Place names | 170,000 | 170,000 |
| Famous people's names | 60,000 | - |
| Computer terminology | 45,000 | 45,000 |
| Single character | 18,000 | 14,000 |
| Others | 120,000 | 120,000 |
| Total | 1,363,000 | 1,299,000 |
| Description | |
|---|---|
| General vocabulary | 390,000 |
| Katakana loanwords | 50,000 |
| Companies and organizations | 600,000 |
| Personal names | 570,000 |
| Place names | 90,000 |
| Famous people's names | 20,000 |
| Computer terminology | 50,000 |
| Technical terminology | 250,000 |
| Single characters | 17,000 |
| Orthographical variants | 80,000 |
| Total | 2,117,000 |