35 Language Resources (Page 1 of 2)

« Previous | Next »Order by:

 2007 CoNLL Shared Task - Arabic & English    
  • Arabic
  • English

ID: ELRA-W0123

ISLRN: 505-782-255-628-8

2007 CoNLL Shared Task - Arabic & English consists of dependency treebanks in two languages used as part of the CoNLL 2007 shared task on multi-lingual dependency parsing and domain adaptation. The languages covered in this release are: Arabic and English. The Conference on Computational Natur...

MEMBERacademiccommercial
Licence: Non Commercial Use - Non Standard Licence Terms
NON MEMBERacademiccommercial
Licence: Non Commercial Use - Non Standard Licence Terms
 Amharic-English bilingual corpus    
  • Amharic
  • English

ID: ELRA-W0074

ISLRN: 590-255-335-719-0

The Amharic-English bilingual corpus contains parallel text from legal and news domains in Amharic script, in transliterated form and in English. The size of the corpus is of 232,653 words in Amharic and 291,701 in English. This parallel corpus contains documents from two domains, namely legal a...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
2000.00 € submit
Licence: Commercial Use - ELRA VAR
2000.00 € submit
2000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
4000.00 € submit
Licence: Commercial Use - ELRA VAR
4000.00 € submit
4000.00 € submit
 Bilingual (Spanish-English) Speech synthesis HTS models    
  • English
  • Spanish; Castilian

ID: ELRA-S0335

ISLRN: 277-380-359-561-3

This database contains Bilingual (English and Spanish) Festival HTS models. Models were trained with 9h of speech from 2 female bilingual speakers and 2 male bilingual speakers. Each speaker recorded 2h 15 min per language. The speech data can be found in the TC-STAR Bilingual Voice-Conversion S...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
0.00 € submit
Licence: Commercial Use - ELRA VAR
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
0.00 € submit
Licence: Commercial Use - ELRA VAR
0.00 € submit
0.00 € submit
 CESTA Evaluation Package    
  • Arabic
  • English
  • French

ID: ELRA-E0020

ISLRN: 809-316-046-724-8

The CESTA Evaluation Package was produced within the French national project CESTA (Evaluation of MT systems), as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The CESTA project enabled to carry out a campaign for the evaluation of machi...

MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
150.00 € submit
500.00 € submit
NON MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
300.00 € submit
1000.00 € submit
 ECPC Corpus (European Comparable and Parallel Corpora of Parliamentary Speeches Archive) – set 1    
  • English
  • Spanish; Castilian

ID: ELRA-W0128

ISLRN: 036-939-425-010-1

The European Comparable and Parallel Corpora of Parliamentary Speeches Archive (ECPC), compiled at the Universitat Jaume I (Spain), is a collection of XML metatextually tagged corpora containing speeches from three European chambers (the European Parliament, the British House of Commons, and the ...

MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use, Share Alike - CC-BY-NC-SA
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use, Share Alike - CC-BY-NC-SA
0.00 € submit
0.00 € submit
 English-Nepali Parallel Corpus    
  • English
  • Nepali (macrolanguage)

ID: ELRA-W0077

ISLRN: 853-487-663-161-6

The Nepali Monolingual written corpus is one of the 3 resources that constitute the Nepali National Corpus. The Nepali National Corpus was produced in 2006 in the framework of the project Bhasha Sanchar (“language communication”), also known as Nelralec, for Nepali Language Resources and Localiza...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
0.00 € submit
 English-Persian parallel corpus    
  • English
  • Persian

ID: ELRA-W0118

ISLRN: 074-825-114-781-7

The English-Persian parallel corpus contains more than 200,000 aligned sentences across a variety of text types from the domains of art, law, culture, science, religion, literature, medicine, idioms, politics and others. It is an extension of the English-Persian parallel corpus already distribute...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1000.00 € submit
5000.00 € submit
Licence: Commercial Use - ELRA VAR
5000.00 € submit
5000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1200.00 € submit
6000.00 € submit
Licence: Commercial Use - ELRA VAR
6000.00 € submit
6000.00 € submit
 English-Persian parallel Corpus    
  • English
  • Persian

ID: ELRA-W0051

ISLRN: 671-618-321-687-7

Please refer to ELRA-W0118 for the latest version of this corpus. This version consists of about 3,500,000 English and Persian (Farsi) words aligned at sentence level (about 100,000 sentences, distributed over 50,021 entries). The format of the files is Unicode. It has been originally created wi...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
500.00 € submit
2500.00 € submit
Licence: Commercial Use - ELRA VAR
2500.00 € submit
2500.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
600.00 € submit
3000.00 € submit
Licence: Commercial Use - ELRA VAR
3000.00 € submit
3000.00 € submit
 English-Vietnamese Parallel Corpus    
  • English
  • Vietnamese

ID: ELRA-W0124

ISLRN: 838-483-738-912-8

This is a corpus of 500,000 English-Vietnamese sentence pairs, built to develop SMT (Statistical Machine Translation) systems. The parallel corpus contains English documents translated by professional translators into Vietnamese. The source texts include books, dictionaries, newspapers, online ne...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
600.00 € submit
1200.00 € submit
Licence: Commercial Use - ELRA VAR
6000.00 € submit
6000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1000.00 € submit
2000.00 € submit
Licence: Commercial Use - ELRA VAR
8000.00 € submit
8000.00 € submit
 EUROPARL Corpus Parallel Corpora: Portuguese-English    
  • English
  • Portuguese

ID: ELRA-W0090

ISLRN: 435-502-922-727-2

The EUROPARL Corpus (Portuguese-English subpart of the parallel corpora), was extracted from the proceedings of the European Parliament. It contains transcriptions of sessions dating back from 1996 to 2011, with a total of approximately 58,324,562 tokens of European Portuguese (L1) and 49,216,896...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
0.00 € submit
Licence: Commercial Use - ELRA VAR
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
0.00 € submit
Licence: Commercial Use - ELRA VAR
0.00 € submit
0.00 € submit
 TAXI - Multilingual telephone dialog database    
  • English
  • German

ID: ELRA-S0137

ISLRN: 734-992-877-270-4

TAXI was produced by BAS, in collaboration with the German research centre for artificial intelligence, DFKI. This speech database contains recordings which consist of dialogues, 94 on the whole (spontaneous speech), between a German speaking cab dispatcher and his clients, who always answered in...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
127.82 € submit
383.03 € submit
Licence: Commercial Use - ELRA VAR
383.03 € submit
383.03 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
255.65 € submit
511.29 € submit
Licence: Commercial Use - ELRA VAR
511.29 € submit
511.29 € submit
 TC-STAR 2005 Evaluation Package - SLT Chinese-to-English    
  • Chinese
  • English

ID: ELRA-E0007

ISLRN: 401-569-014-653-6

TC-STAR is a European integrated project focusing on Speech-to-Speech Translation (SST). To encourage significant breakthrough in all SST technologies, annual open competitive evaluations are organized. Automatic Speech Recognition (ASR), Spoken Language Translation (SLT) and Text-To-Speech (TTS)...

MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
500.00 € submit
500.00 € submit
NON MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
750.00 € submit
750.00 € submit

Special offers are also available. Check here for details.

 TC-STAR 2005 Evaluation Package - SLT English-to-Spanish    
  • English
  • Spanish; Castilian

ID: ELRA-E0005

ISLRN: 191-146-101-327-2

TC-STAR is a European integrated project focusing on Speech-to-Speech Translation (SST). To encourage significant breakthrough in all SST technologies, annual open competitive evaluations are organized. Automatic Speech Recognition (ASR), Spoken Language Translation (SLT) and Text-To-Speech (TTS)...

MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
500.00 € submit
500.00 € submit
NON MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
750.00 € submit
750.00 € submit

Special offers are also available. Check here for details.

 TC-STAR 2005 Evaluation Package - SLT Spanish-to-English    
  • English
  • Spanish; Castilian

ID: ELRA-E0006

ISLRN: 235-338-796-186-1

TC-STAR is a European integrated project focusing on Speech-to-Speech Translation (SST). To encourage significant breakthrough in all SST technologies, annual open competitive evaluations are organized. Automatic Speech Recognition (ASR), Spoken Language Translation (SLT) and Text-To-Speech (TTS)...

MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
500.00 € submit
500.00 € submit
NON MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
750.00 € submit
750.00 € submit

Special offers are also available. Check here for details.

 TC-STAR 2006 Evaluation Package – End-to-End    
  • English
  • Spanish; Castilian

ID: ELRA-E0031

ISLRN: 096-175-302-473-2

TC-STAR is a European integrated project focusing on Speech-to-Speech Translation (SST). To encourage significant breakthrough in all SST technologies, annual open competitive evaluations are organized. Automatic Speech Recognition (ASR), Spoken Language Translation (SLT) Text-To-Speech (TTS) are...

MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
500.00 € submit
500.00 € submit
NON MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
750.00 € submit
750.00 € submit

Special offers are also available. Check here for details.

 TC-STAR 2006 Evaluation Package - SLT Spanish-to-English - CORTES    
  • English
  • Spanish; Castilian

ID: ELRA-E0015-01

ISLRN: 538-224-560-510-1

TC-STAR is a European integrated project focusing on Speech-to-Speech Translation (SST). To encourage significant breakthrough in all SST technologies, annual open competitive evaluations are organized. Automatic Speech Recognition (ASR), Spoken Language Translation (SLT) and Text-To-Speech (TTS)...

MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
500.00 € submit
500.00 € submit
NON MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
750.00 € submit
750.00 € submit

Special offers are also available. Check here for details.

 TC-STAR 2006 Evaluation Package - SLT Spanish-to-English - EPPS    
  • English
  • Spanish; Castilian

ID: ELRA-E0015-02

ISLRN: 646-849-092-206-1

TC-STAR is a European integrated project focusing on Speech-to-Speech Translation (SST). To encourage significant breakthrough in all SST technologies, annual open competitive evaluations are organized. Automatic Speech Recognition (ASR), Spoken Language Translation (SLT) and Text-To-Speech (TTS)...

MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
500.00 € submit
500.00 € submit
NON MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
750.00 € submit
750.00 € submit

Special offers are also available. Check here for details.

 TC-STAR 2007 Evaluation Package – End-to-End    
  • English
  • Spanish; Castilian

ID: ELRA-E0032

ISLRN: 197-528-212-495-6

TC-STAR is a European integrated project focusing on Speech-to-Speech Translation (SST). To encourage significant breakthrough in all SST technologies, annual open competitive evaluations are organized. Automatic Speech Recognition (ASR), Spoken Language Translation (SLT) Text-To-Speech (TTS) are...

MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
500.00 € submit
500.00 € submit
NON MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
750.00 € submit
750.00 € submit

Special offers are also available. Check here for details.

 TC-STAR 2007 Evaluation Package - SLT Chinese-to-English    
  • Chinese
  • English

ID: ELRA-E0030

ISLRN: 113-138-629-916-7

TC-STAR is a European integrated project focusing on Speech-to-Speech Translation (SST). To encourage significant breakthrough in all SST technologies, annual open competitive evaluations are organized. Automatic Speech Recognition (ASR), Spoken Language Translation (SLT) and Text-To-Speech (TTS)...

MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
500.00 € submit
500.00 € submit
NON MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
750.00 € submit
750.00 € submit

Special offers are also available. Check here for details.

« Previous | Next »