With over 185,000 entries, Adso is the largest open source Chinese-English dictionary compilation on the Internet. It differs from other projects in providing part of speech and ontological data on word entries, and in reviewing user contributions. Project data is generated collaboratively by users and drawn from related projects including CEDICT and the Linguistic Data Consortium.
The Adso software engine provides text segmentation, hanzi-to-pinyin, gist translation, annotation, gist extraction and semantic analysis services. It is heavily used as a translation aid for Chinese-English translation. Adso also supports a specially-defined XML language which customizes software output. This has made it useful as preprocessor for statistical machine translation software such as GIZA++ or for reverse-index search engines such as Lucene.