Content
CEDICT is merely a text file; other programs are needed to search and display it. This project is considered a standard Chinese-English reference on the Internet and is used by several other Chinese-English projects. The Unihan Database uses CEDICT data for most of its information about character compounds, but this is auxiliary and is explicitly not a part of the main Unicode database. CEDICT is not used for Unihan's definitions and pronunciations of individual characters.
The basic format of a CEDICT entry is:
Traditional Simplified /English equivalent 1/equivalent 2/
中國 中国 /China/Middle Kingdom/
CEDICT is now primarily encoded in UTF-8, but compatibility versions are available in GB2312 and Big5. The compatibility versions omit either the Traditional or the Simplified characters respectively.
Features:
* Traditional and Simplified Chinese
* Pinyin
* English
As of June 2008, it has 65.312 Chinese entries.
History
Sub-projects
CEDICT has shown the way to some other projects, such , the Chinese-German free dictionary. A CFDICT Chinese-French dictionary and a Hungrarian-Chinese dictionary project are under discussion. Some older CEDICT data is also found in the Adsotrans dictionary.
No comments:
Post a Comment