5.3.1. Overview

Activities on LR are reported all over the world, for the various languages spoken in many different countries.

There is of course a high level of activity in the US, with the support of NSF and DARPA programs, with LDC producing and distributing the LR and NIST conducting evaluations (see also the Transnational initiatives chapter). Australia and Canada also devote coordinated efforts to this area for the English language and its varieties.

Japan and Korea devote a large, coordinated effort to their respective languages. Activity in China also appears to be very strong, although perhaps less coordinated. Hong Kong and Taiwan have several very active laboratories in the field and have produced large amounts of data. Among Chinese languages, Mandarin is the most widely covered.

A strong political interest in national languages is evident in India, where the national TDIL (Technology Development for Indian Languages) program addresses the development of several written and spoken language processing systems, and the corresponding resources, for the 19 official Indian languages. The same interest is evident in South Africa, where the NHN (National HLT Network) conducted a survey of the existing LRT for the 11 national languages and is considering how to provide such LRT for all of them. Interest in regional languages is also strong in Thailand, within the cultural heritage framework and under the coordination of NECTEC.

For the Arabic language, an initiative for better coordination has been launched by Egypt through the Arabic Language Technology Center (ALTEC). Comparable initiatives have been taken for African languages in Nigeria (the African Languages Technology Initiative, Alt-i) and in Djibouti (the African HLT Association).

A large amount of activity is also reported in many other countries, for many different languages, as the feedback received shows.