Resources for English-Chinese CLIR

Doug Oard has a technical report on Resources for Chinese-English Cross-Language IR, which lists resources and tools on Encoding, character set, and Language identification, Character set conversion, segmentation and compound splitting, proper name, and mapping terms between languages. For each resource or tool, its availabilty, cost, format, domain coverage, size, and other information are provided. Here is the HTML version of his report.

In addition to the resources listed in Doug's report, I found the following resources that might be useful in English-Chinese CLIR. My intention is to find as many publicly available and free resources on the web as possible and make good use of them in CLIR research.

English-Chinese Bilingual Dictionary

see also Marjorie Chan's page on Chinese dictionaries

Paul Denisowski's CEDICT

Yick Yam Lam's E-C bilingual dictionary

Eng-Chi

Online English-Chinese bilingual dictionaries

Technical terminology

Proper name resourses

English-Chinese Machine Translation Software

Free Online E-C Translation Service

Commercial E-C MT software

Popular E-C MT softwares in Mainland China

TranStar

E-C bidirectional MT system. It claims to have 250,000 entries in their general dictionary. 15 specialty dictionaries with in total 500,000 entries. Trial version downloadable.

KingSoft

Bidirectional. 17,000,000 words in standard version. Has also Business and Economics version and Science and Technology version. Trial version downloadable.

Other MT systerms and their trial versions can be found here.

English-Chinese bilingual corpora

English-Chinese parallel texts collected by Wang Lixun available at http://web.bham.ac.uk/lxw715/EnglishChineseCorpus.html

Most are novels.