Author name: paco.tam@chilin.hk

HK ITC supports Chilin (HK) Ltd to develop an AI-Mediated API for MT Post-Editing

Chilin (HK) Ltd. announced today that it has signed a sizable matching fund agreement under the Enterprise Support Scheme (ESS) with the Innovation and Technology Commission (ITC) of the Hong Kong Special Administrative Region Government. Launched in 2015, the ESS aims to provide funding support for local companies to conduct in-house research and development (R&D)

New Chinese Buzzwords for 2021 Released

The top five new Chinese terms in the Pan-Chinese media of Beijing, Hong Kong and Taipei are “Lie Flat Approach”, “Metaverse”, “Cardboard Bed”, “Xinjiang as China Stopper” and “Nuclear Waste Disposal”, according to the 2021 LIVAC Pan-Chinese New Word Rosters released by City University of Hong Kong (CityU). The rosters are based on the LIVAC Synchronous Corpus of Chinese developed by Professor Benjamin T’sou Ka-yin,

Simplified Chinese vs Traditional Chinese – NOT THAT CONFUSING!

The apparently confusing DIFFERENCES can be resolved at two different levels. We must distinguish between (1) the WRITING system and (2) the WRITTEN language. For (1), an average person educated in an alphabetic language can easily recognize differences between the common alphabets found among languages such as English, Russian, Greek, Arabic and Hindi. For example, the word Livac will be transliterated into these

RWS Publishes Report on State of Translation Tools Market

“Translation Studies in East Asia – Tradition, Transition and Transcendence (2021 East)” was a major conference on Chinese-related translation topics recently held in Hong Kong. The conference was organized by The Hong Kong Polytechnic University’s Department of Chinese and Bilingual Studies and The Hong Kong Translation Society. Chilin (HK) founder Dr. Benjamin Tsou was a

How is the LiVaC New Buzzword Roster generated?

In our recent blog “New words and language processing by computer”, we presented the newest Chinese buzzwords selected by Chilin. In this blog, we explore how Chilin’s LiVaC system collects and filters new Chinese buzzwords. The LiVaC system has collected over 2 million words used in Chinese newspapers over the last 25 years. It collects news regularly

Improvement of Chinese-to-English MT of Pharmaceutical Texts

In our last blog, we discussed how Chilin offered English-Chinese parallel sentence pairs on the TAUS Data Marketplace. Our first offerings are related to the Pharmaceutical-Biotechnology domain. In the English-to-Chinese case, we found that adding Chilin parallel sentence data to the base Google model increased the BLEU score by almost 5 points – within

Improvement of English-to-Chinese MT of Pharmaceutical Texts

In our last blog, we discussed how Chilin offered English-Chinese parallel sentence pairs on the TAUS Data Marketplace. Our first offerings are related to the Pharmaceutical-Biotechnology domain. Does this data actually improve machine translation? The answer is “yes”, as we show below. Chilin conducted a test using the parallel sentence data and the AutoML Translation tool

Top Quality English-Chinese Pharmaceutical and Biotechnology Training Data available for Machine Translation

In our previous blog entries, we discussed translation hurdles (blog) and how parallel word databases can be helpful (blog). We now introduce what resources in lexical databases may be available. Chilin now provides English-Chinese parallel sentences on the TAUS Data Marketplace. Two large datasets related to Pharmaceuticals and Biotechnology are now available. Chilin’s parallel sentence data is

How many Chinese words are needed for Natural Language Processing?

In the first Chilin blog entry we noted that OOV (Out-Of-Vocabulary) words were a major problem for natural language processing in general. Actually, there are also other important challenges. One is the size of the lexicon and the relevance of the subjects covered by the words. The actual requirements will depend on the target users and usage.
