55 Language Resources (Page 2 of 3)


 GlobalPhone Multilingual Model Package    
  • Arabic
  • Bulgarian
  • Chinese
  • Croatian
  • Czech
  • French
  • German
  • Hausa
  • Japanese
  • Korean
  • Polish
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Swahili (macrolanguage)
  • Swedish
  • Tamil
  • Thai
  • Turkish
  • Ukrainian
  • Vietnamese

ID: ELRA-S0399

ISLRN: 204-945-263-927-6

The GlobalPhone Multilingual Model Package contains about 22 hours of transcribed read speech spoken by native speakers in 22 languages. The data are sampled from the GlobalPhone Speech and Text Data available in the ELRA Catalogue, i.e.: Arabic (ELRA-S0192), Bulgarian (ELRA-S0319), Chinese-Manda...

MEMBER
Licence: Non Commercial Use - ELRA END USER: 1200.00 € (academic), 6000.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 6000.00 € (academic), 6000.00 € (commercial)
NON MEMBER
Licence: Non Commercial Use - ELRA END USER: 1400.00 € (academic), 7200.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 7200.00 € (academic), 7200.00 € (commercial)

 LC-STAR Standard Arabic Phonetic lexicon
  • Arabic

ID: ELRA-S0247

ISLRN: 716-224-338-250-4

The LC-STAR Standard Arabic Phonetic lexicon was created within the scope of the LC-STAR project (IST 2001-32216) which was sponsored by the European Commission. The lexicon comprises 110,271 entries, distributed over three categories: - a set of 52,981 common word entries. This set is extracte...

MEMBER
Licence: Non Commercial Use - ELRA END USER: 21250.00 € (academic), 28000.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 28000.00 € (academic), 28000.00 € (commercial)
NON MEMBER
Licence: Non Commercial Use - ELRA END USER: 27625.00 € (academic), 36400.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 36400.00 € (academic), 36400.00 € (commercial)

 "Le Monde Diplomatique" Arabic tagged corpus
  • Arabic

ID: ELRA-W0049

ISLRN: 124-139-628-259-2

This corpus contains 102,960 vowelised, lemmatised and tagged words (58 texts from Le Monde Diplomatique Arabic, see also ELRA-W0036-04). Each text is associated with 3 files: raw text in Arabic, vowelised text in Arabic, and one XML file containing the morphological annotation of the text. ...

MEMBER
Licence: Non Commercial Use - ELRA END USER: 185.00 € (academic), 975.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 975.00 € (academic), 975.00 € (commercial)
NON MEMBER
Licence: Non Commercial Use - ELRA END USER: 400.00 € (academic), 2000.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 2000.00 € (academic), 2000.00 € (commercial)

 "Le Monde Diplomatique" Text corpus in Arabic
  • Arabic

ID: ELRA-W0036-04

ISLRN: 231-368-326-920-2

Electronic archiving of "Le Monde Diplomatique" articles in Arabic from 2000. The corpus is available in HTML. Each HTML file contains one article. Number of articles available per year : • 2000: 61 articles (November and December available only) (75,305 words) • 2001: 346 articles (479,435 ...

MEMBER
Licence: Non Commercial Use - ELRA END USER: 46.00 € (academic), 46.00 € (commercial)
NON MEMBER
Licence: Non Commercial Use - ELRA END USER: 69.00 € (academic), 69.00 € (commercial)

 MAURDOR Evaluation Package
  • Arabic
  • English
  • French

ID: ELRA-E0045

ISLRN: 364-018-517-901-2

The MAURDOR project consists of evaluating systems for the automatic processing of written documents. The collected written documents are scanned documents (printed, typewritten or manuscripts). In order to get images for the evaluation of automatic analysis systems, 10,000 original documents were c...

MEMBER
Licence: Non Commercial Use - ELRA END USER: 500.00 € (academic), 10000.00 € (commercial)
Licence: Evaluation Use - ELRA EVALUATION: 5000.00 €
Licence: Commercial Use - ELRA VAR: 10000.00 € (academic), 10000.00 € (commercial)
NON MEMBER
Licence: Non Commercial Use - ELRA END USER: 750.00 € (academic), 15000.00 € (commercial)
Licence: Evaluation Use - ELRA EVALUATION: 7500.00 €
Licence: Commercial Use - ELRA VAR: 15000.00 € (academic), 15000.00 € (commercial)

 MEDAR Evaluation Package
  • Arabic
  • English

ID: ELRA-E0040

ISLRN: 631-407-723-040-2

The MEDAR Evaluation Package was produced within the project MEDAR (MEDiterranean ARabic language and speech technology), supported by the European Commission's ICT programme; the project ran from February 1st 2008 until July 31st 2010. The project addressed International Cooperation be...

MEMBER
Licence: Evaluation Use - ELRA EVALUATION: 0.00 € (academic), 0.00 € (commercial)
NON MEMBER
Licence: Evaluation Use - ELRA EVALUATION: 0.00 € (academic), 0.00 € (commercial)

 Multilingual Dictionary of Sports – English-French-Arabic trilingual database
  • Arabic
  • English
  • French

ID: ELRA-T0372-04

ISLRN: 351-230-082-450-3

This dictionary was produced within the French national project EuRADic (European and Arabic Dictionaries and Corpora), as part of the Technolangue programme funded by the French Ministry of Industry. A needs study in the field of sport terminology, which covered an overall category of users, ...

MEMBER
Licence: Non Commercial Use - ELRA END USER: 150.00 € (academic), 1500.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 1500.00 € (academic), 1500.00 € (commercial)
NON MEMBER
Licence: Non Commercial Use - ELRA END USER: 200.00 € (academic), 2000.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 2000.00 € (academic), 2000.00 € (commercial)

 Multilingual Dictionary of Sports – English-French-Greek-Arabic-German-Spanish-Portuguese multilingual database
  • Arabic
  • English
  • French
  • German
  • Modern Greek (1453-)
  • Portuguese
  • Spanish; Castilian

ID: ELRA-T0372-01

ISLRN: 753-372-742-011-3

This dictionary was produced within the French national project EuRADic (European and Arabic Dictionaries and Corpora), as part of the Technolangue programme funded by the French Ministry of Industry. A needs study in the field of sport terminology, which covered an overall category of users, ...

MEMBER
Licence: Non Commercial Use - ELRA END USER: 300.00 € (academic), 3000.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 3000.00 € (academic), 3000.00 € (commercial)
NON MEMBER
Licence: Non Commercial Use - ELRA END USER: 400.00 € (academic), 4000.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 4000.00 € (academic), 4000.00 € (commercial)

 NE3L named entities Arabic corpus
  • Arabic

ID: ELRA-W0078

ISLRN: 398-979-151-557-0

The NE3L project (Named Entities 3 Languages) consisted of annotating corpora in several languages with named entities. The text data were extracted from newspapers and cover various topics. Three languages were annotated: Arabic, Chinese and Russian. For this project, 5...

MEMBER
Licence: Non Commercial Use - ELRA END USER: 5000.00 € (academic), 5000.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 5000.00 € (academic), 5000.00 € (commercial)
NON MEMBER
Licence: Non Commercial Use - ELRA END USER: 5000.00 € (academic), 5000.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 5000.00 € (academic), 5000.00 € (commercial)

 NEMLAR Broadcast News Speech Corpus
  • Arabic

ID: ELRA-S0219

ISLRN: 479-507-036-103-9

This corpus was produced within the NEMLAR project (http://www.nemlar.org). Two other resources, produced within the same project, are also available: NEMLAR Written Corpus (ELRA-W0042) and the NEMLAR Speech Synthesis Corpus (ELRA-S0220). The NEMLAR Broadcast News Speech Corpus consists of about...

MEMBER
Licence: Non Commercial Use - ELRA END USER: 150.00 € (academic), 500.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 2000.00 € (academic), 2000.00 € (commercial)
NON MEMBER
Licence: Non Commercial Use - ELRA END USER: 300.00 € (academic), 1000.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 4000.00 € (academic), 4000.00 € (commercial)

Special offers are also available. Check here for details.

 NEMLAR Speech Synthesis Corpus    
  • Arabic

ID: ELRA-S0220

ISLRN: 361-216-121-305-9

This corpus was produced within the NEMLAR project (http://www.nemlar.org). Two other resources, produced within the same project, are also available: NEMLAR Written Corpus (ELRA-W0042) and the NEMLAR Broadcast News Speech Corpus (ELRA-S0219). The NEMLAR Speech Synthesis Corpus contains the reco...

MEMBER
Licence: Non Commercial Use - ELRA END USER: 500.00 € (academic), 1250.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 5000.00 € (academic), 5000.00 € (commercial)
NON MEMBER
Licence: Non Commercial Use - ELRA END USER: 1000.00 € (academic), 2500.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 10000.00 € (academic), 10000.00 € (commercial)

Special offers are also available. Check here for details.

 NEMLAR Written Corpus    
  • Arabic

ID: ELRA-W0042

ISLRN: 050-693-158-326-9

This corpus was produced within the NEMLAR project (http://www.nemlar.org). Two other resources, produced within the same project, are also available: NEMLAR Broadcast News Speech Corpus (ELRA-S0219) and the NEMLAR Speech Synthesis Corpus (ELRA-S0220). The NEMLAR Written Corpus consists of about...

MEMBER
Licence: Non Commercial Use - ELRA END USER: 150.00 € (academic), 250.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 1000.00 € (academic), 1000.00 € (commercial)
NON MEMBER
Licence: Non Commercial Use - ELRA END USER: 300.00 € (academic), 500.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 2000.00 € (academic), 2000.00 € (commercial)

Special offers are also available. Check here for details.

 NetDC Arabic BNSC (Broadcast News Speech Corpus)    
  • Arabic

ID: ELRA-S0157

ISLRN: 663-177-513-755-1

The NetDC Arabic BNSC (Broadcast News Speech Corpus) is a corpus developed by ELDA in the framework of the European-funded project Network of Data Centres (NetDC). The project was done in collaboration with the LDC (Linguistic Data Consortium), which has produced a similar corpus from the news br...

MEMBER
Licence: Non Commercial Use - ELRA END USER: 100.00 € (academic), 1350.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 1350.00 € (academic), 1350.00 € (commercial)
NON MEMBER
Licence: Non Commercial Use - ELRA END USER: 200.00 € (academic), 2700.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 2700.00 € (academic), 2700.00 € (commercial)

 Normalized Arabic Fragments for Inestimable Stemming (NAFIS)
  • Arabic

ID: ELRA-W0127

ISLRN: 305-450-745-774-1

Normalized Arabic Fragments for Inestimable Stemming (NAFIS) is an Arabic stemming gold standard corpus composed of a collection of sentences, selected to be representative of Arabic stemming tasks and manually annotated. Indeed, NAFIS is: Comprehensive: The content of NAFIS can be generalized...

MEMBER
Licence: Non Commercial Use - ELRA END USER: 0.00 € (academic), 0.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 0.00 € (academic), 0.00 € (commercial)
NON MEMBER
Licence: Non Commercial Use - ELRA END USER: 0.00 € (academic), 0.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 0.00 € (academic), 0.00 € (commercial)

 OrienTel Arabic as spoken in Israel database
  • Arabic

ID: ELRA-S0190

ISLRN: 627-343-367-534-7

The OrienTel Arabic as spoken in Israel database comprises 750 Arabic speakers (375 males, 375 females) recorded over the Israeli fixed and mobile telephone network. This database is partitioned into 2 DVDs. The speech databases made within the OrienTel project were validated by SPEX, the Netherl...

MEMBER
Licence: Non Commercial Use - ELRA END USER: 37875.00 € (academic), 40000.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 40000.00 € (academic), 40000.00 € (commercial)
NON MEMBER
Licence: Non Commercial Use - ELRA END USER: 39843.00 € (academic), 43125.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 43125.00 € (academic), 43125.00 € (commercial)

 OrienTel Egypt MCA (Modern Colloquial Arabic) database
  • Arabic

ID: ELRA-S0221

ISLRN: 036-535-444-454-5

The OrienTel Egypt MCA (Modern Colloquial Arabic) database comprises 750 Egyptian speakers (398 males, 352 females) recorded over the Egyptian fixed and mobile telephone network. This database is partitioned into 1 CD and 1 DVD. The speech databases made within the OrienTel project were validated...

MEMBER
Licence: Non Commercial Use - ELRA END USER: 18000.00 € (academic), 24000.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 24000.00 € (academic), 24000.00 € (commercial)
NON MEMBER
Licence: Non Commercial Use - ELRA END USER: 22500.00 € (academic), 30000.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 30000.00 € (academic), 30000.00 € (commercial)

This resource is also available in a bundle. Check here for bundled pricing.

 OrienTel Egypt MSA (Modern Standard Arabic) database    
  • Arabic

ID: ELRA-S0222

ISLRN: 830-378-677-910-7

The OrienTel Egypt MSA (Modern Standard Arabic) database comprises 500 Egyptian speakers (254 males, 246 females) recorded over the Egyptian fixed and mobile telephone network. This database is partitioned into 1 CD and 1 DVD. The speech databases made within the OrienTel project were validated b...

MEMBER
Licence: Non Commercial Use - ELRA END USER: 12000.00 € (academic), 16000.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 16000.00 € (academic), 16000.00 € (commercial)
NON MEMBER
Licence: Non Commercial Use - ELRA END USER: 15000.00 € (academic), 20000.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 20000.00 € (academic), 20000.00 € (commercial)

This resource is also available in a bundle. Check here for bundled pricing.

 OrienTel Jordan MCA (Modern Colloquial Arabic) database    
  • Arabic

ID: ELRA-S0289

ISLRN: 172-662-950-237-0

The OrienTel Jordan MCA (Modern Colloquial Arabic) database comprises 757 Jordanian speakers (393 males, 364 females) recorded over the Jordanian fixed and mobile telephone network. This database is stored on 1 DVD. The speech databases made within the OrienTel project were validated by SPEX, the...

MEMBER
Licence: Non Commercial Use - ELRA END USER: 18000.00 € (academic), 24000.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 24000.00 € (academic), 24000.00 € (commercial)
NON MEMBER
Licence: Non Commercial Use - ELRA END USER: 22500.00 € (academic), 30000.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 30000.00 € (academic), 30000.00 € (commercial)

This resource is also available in a bundle. Check here for bundled pricing.

 OrienTel Jordan MSA (Modern Standard Arabic) database    
  • Arabic

ID: ELRA-S0290

ISLRN: 259-713-018-372-4

The OrienTel Jordan MSA (Modern Standard Arabic) database comprises 556 Jordanian speakers (288 males, 268 females) recorded over the Jordanian fixed and mobile telephone network. This database is stored on 1 DVD. The speech databases made within the OrienTel project were validated by SPEX, the N...

MEMBER
Licence: Non Commercial Use - ELRA END USER: 12000.00 € (academic), 16000.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 16000.00 € (academic), 16000.00 € (commercial)
NON MEMBER
Licence: Non Commercial Use - ELRA END USER: 15000.00 € (academic), 20000.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 20000.00 € (academic), 20000.00 € (commercial)

This resource is also available in a bundle. Check here for bundled pricing.

 OrienTel Morocco MCA (Modern Colloquial Arabic) database    
  • Arabic

ID: ELRA-S0183

ISLRN: 613-578-868-832-2

The OrienTel Morocco MCA (Modern Colloquial Arabic) database comprises 772 Moroccan speakers (383 males, 389 females) recorded over the Moroccan fixed and mobile telephone network. This database is partitioned into 1 CD and 1 DVD. The speech databases made within the OrienTel project were validat...

MEMBER
Licence: Non Commercial Use - ELRA END USER: 18000.00 € (academic), 24000.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 24000.00 € (academic), 24000.00 € (commercial)
NON MEMBER
Licence: Non Commercial Use - ELRA END USER: 22500.00 € (academic), 30000.00 € (commercial)
Licence: Commercial Use - ELRA VAR: 30000.00 € (academic), 30000.00 € (commercial)

This resource is also available in a bundle. Check here for bundled pricing.
