Language Resources Collection

Since 1995, ELRA has served the language engineering community with a mission: providing commercial and academic research and development units with high-quality language resource databases.

ELRA is the recognised leader in Europe for collecting, producing, validating, and distributing speech, text, and terminology language resources used to develop, train, and test language engineering and Human Language Technology (HLT) systems. ELRA also participates in evaluation campaigns at national, European and international levels. ELRA’s operational body, ELDA (Evaluations and Language resources Distribution Agency), is responsible for carrying out those projects.

Existing language resources are first located and their distribution terms negotiated. They are then made available in the catalogue of language resources, which you can consult on this web site.

ELRA has committed to creating a structured catalogue of language resources. A set of description forms, which cover all the features one needs to know about a specific resource, was prepared to help providers describe their language resources in a more uniform and consistent way. LR description forms can be viewed here.

Our team has experience in language data collection for speech- and text-based systems, including the management of, and participation in, several European Commission-funded language resource projects (LE1-1019 - ELRA; LE4-8335 - LRs P&P; LRE 62-050 MulText, etc.).

We have also been indirectly involved in many European projects where language resources were to be created (SpeechDAT; SpeechDAT-East; SpeechDAT-Car, etc.). In addition, some of our team members have participated in language resource collection projects in the United States.

Our experience ranges from telephony-based recording to parallel corpus development and lexicon development for speech translation systems. Our team members have benefited from working in industrial, corporate, and academic settings.

Because we understand your language resource needs and have the background necessary to assist in your language resource design, collection and implementation efforts, ELRA offers a language resource collection service.

We are able to work with your institution on a case-by-case basis in order to develop a proposal that will respond to your specific needs.

For more information regarding this service, please contact:

Khalid Choukri
- Email: choukri elda.org
- Tel: (+33) 1 43 13 33 33
- Fax: (+33) 1 43 13 33 30