117 Language Resources (Page 2 of 6)

 Collins Multilingual database (MLD) – WordBank with audio files    
  • Arabic
  • Chinese
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Finnish
  • French
  • German
  • Italian
  • Japanese
  • Korean
  • Modern Greek (1453-)
  • Norwegian
  • Polish
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Swedish
  • Thai
  • Turkish
  • Vietnamese

ID: ELRA-S0382

ISLRN: 309-438-781-042-2

The Collins Multilingual database covers Real Life Daily vocabulary. It is composed of a multilingual lexicon in 32 languages (the WordBank, see ELRA-T0376) and a multilingual set of sentences in 28 languages (the PhraseBank, see ELRA-T0377). This version includes the corresponding audio files c...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER
3640.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER
5200.00 €
 Comprehensive Wordlist of Simplified Chinese    
  • Chinese

ID: ELRA-L0109

ISLRN: 159-767-888-341-3

Comprehensive monolingual wordlist for Simplified Chinese. Pinyin is provided, making this database ideal for speech-related applications such as speech synthesis.

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 10800.00 € | 18000.00 €
Licence: Commercial Use - ELRA VAR | 21600.00 € | 36000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 13500.00 € | 22500.00 €
Licence: Commercial Use - ELRA VAR | 27000.00 € | 45000.00 €
 Comprehensive Word List of Traditional Chinese    
  • Chinese

ID: ELRA-L0110

ISLRN: 378-715-589-213-1

Comprehensive monolingual wordlist for Traditional Chinese. Zhuyin is provided, making this database ideal for speech-related applications such as speech synthesis.

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 10800.00 € | 18000.00 €
Licence: Commercial Use - ELRA VAR | 21600.00 € | 36000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 13500.00 € | 22500.00 €
Licence: Commercial Use - ELRA VAR | 27000.00 € | 45000.00 €
 Comprehensive Word Lists for Chinese, Japanese, Korean and Arabic    
  • Arabic
  • Chinese
  • Japanese
  • Korean

ID: ELRA-M0071

ISLRN: 476-146-877-598-3

Comprehensive monolingual word lists for both Simplified and Traditional Chinese, Japanese, Korean and Arabic, including a full-form Arabic word list. For Simplified and Traditional Chinese, Japanese and Korean, we provide readings as well, making them ideal for speech-related applications such...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 30000.00 € | 50000.00 €
Licence: Commercial Use - ELRA VAR | 60000.00 € | 100000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 37500.00 € | 62500.00 €
Licence: Commercial Use - ELRA VAR | 75000.00 € | 125000.00 €
 Database of Chinese Full Names    
  • Chinese

ID: ELRA-L0106

ISLRN: 356-835-468-182-0

Covers Chinese full names of real people, including celebrities. Includes pinyin readings.

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 4500.00 € | 7500.00 €
Licence: Commercial Use - ELRA VAR | 9000.00 € | 15000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 5625.00 € | 9375.00 €
Licence: Commercial Use - ELRA VAR | 11250.00 € | 18750.00 €
 Database of Chinese Names    
  • Chinese

ID: ELRA-L0129

ISLRN: 792-499-131-789-4

Chinese name components, accompanied by accurate pinyin readings, gender codes, and flags denoting whether a name is a given name, a surname, or both.

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 6000.00 € | 10000.00 €
Licence: Commercial Use - ELRA VAR | 12000.00 € | 20000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 7500.00 € | 12500.00 €
Licence: Commercial Use - ELRA VAR | 15000.00 € | 25000.00 €
 Database of Chinese Name Variants    
  • Chinese

ID: ELRA-L0105

ISLRN: 379-237-021-386-4

Provides comprehensive coverage of the major Chinese romanization systems and their variants and, if needed, can be expanded considerably with dialectal variants (Cantonese, Hakka, Hokkien, etc.).

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 12000.00 € | 20000.00 €
Licence: Commercial Use - ELRA VAR | 24000.00 € | 40000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 15000.00 € | 25000.00 €
Licence: Commercial Use - ELRA VAR | 30000.00 € | 50000.00 €
 ECI/MCI (European Corpus Initiative/Multilingual Corpus I)    
  • Albanian
  • Bulgarian
  • Chinese
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • French
  • German
  • Italian
  • Japanese
  • Latin
  • Lithuanian
  • Malay (macrolanguage)
  • Modern Greek (1453-)
  • Norwegian
  • Portuguese
  • Russian
  • Scottish Gaelic; Gaelic
  • Serbian
  • Spanish; Castilian
  • Swedish
  • Turkish
  • Uzbek

ID: ELRA-W0004

ISLRN: 511-168-567-582-5

The European Corpus Initiative (ECI) was founded to oversee the acquisition and preparation of a large multilingual corpus, and supports existing and projected national and international efforts to carefully design, collect and publish large-scale multilingual written and spoken corpora. ECI has ...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 50.00 € | 50.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 50.00 € | 50.00 €
 English-Chinese-Vietnamese Trilingual Parallel Corpus    
  • Chinese
  • English
  • Vietnamese

ID: ELRA-W0314

ISLRN: 637-630-726-817-9

The English-Chinese-Vietnamese Trilingual Parallel Corpus consists of 20,046 trilingual sets of sentence pairs. The corpus is provided in XML format and is annotated according to TEI-encoding guidelines.

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 150.00 € | 500.00 €
Licence: Commercial Use - ELRA VAR | 1000.00 € | 1000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 225.00 € | 750.00 €
Licence: Commercial Use - ELRA VAR | 1500.00 € | 1500.00 €
 English-to-Simplified Chinese Dictionary    
  • Chinese
  • English

ID: ELRA-M0055

ISLRN: 407-348-028-638-3

80,000 headwords (expandable to 100,000) covering general vocabulary and important proper names.

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 2250.00 € | 3750.00 €
Licence: Commercial Use - ELRA VAR | 4500.00 € | 7500.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 2813.00 € | 4688.00 €
Licence: Commercial Use - ELRA VAR | 5625.00 € | 9375.00 €
 GlobalPhone 2000 Speaker Package    
  • Arabic
  • Bulgarian
  • Chinese
  • Croatian
  • Czech
  • French
  • German
  • Hausa
  • Japanese
  • Korean
  • Polish
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Swahili (macrolanguage)
  • Swedish
  • Tamil
  • Thai
  • Turkish
  • Ukrainian
  • Vietnamese

ID: ELRA-S0400

ISLRN: 331-592-378-424-7

The GlobalPhone 2000 Speaker Package contains transcribed read speech spoken by 2000 native speakers in 22 languages. The data are sampled from the GlobalPhone Speech and Text Data available in the ELRA Catalogue, i.e.: Arabic (ELRA-S0192), Bulgarian (ELRA-S0319), Chinese-Mandarin (ELRA-S0193), C...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 1200.00 € | 6000.00 €
Licence: Commercial Use - ELRA VAR | 6000.00 € | 6000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 1400.00 € | 7200.00 €
Licence: Commercial Use - ELRA VAR | 7200.00 € | 7200.00 €
 GlobalPhone Chinese-Mandarin    
  • Chinese

ID: ELRA-S0193

ISLRN: 976-318-571-969-1

The GlobalPhone corpus developed in collaboration with the Karlsruhe Institute of Technology (KIT) was designed to provide read speech data for the development and evaluation of large continuous speech recognition systems in the most widespread languages of the world, and to provide a uniform, mu...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 600.00 € | 3000.00 €
Licence: Commercial Use - ELRA VAR | 3000.00 € | 3000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 700.00 € | 3600.00 €
Licence: Commercial Use - ELRA VAR | 3600.00 € | 3600.00 €

Special offers are also available. Check here for details.

 GlobalPhone Chinese-Mandarin Pronunciation Dictionary      
  • Chinese

ID: ELRA-S0363

ISLRN: 457-511-870-286-9

The GlobalPhone pronunciation dictionaries, created within the framework of the multilingual speech and language corpus GlobalPhone, were developed in collaboration with the Karlsruhe Institute of Technology (KIT). The GlobalPhone pronunciation dictionaries contain the pronunciations of all wo...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 600.00 € | 3000.00 €
Licence: Commercial Use - ELRA VAR | 3000.00 € | 3000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 700.00 € | 3600.00 €
Licence: Commercial Use - ELRA VAR | 3600.00 € | 3600.00 €

Special offers are also available. Check here for details.

 GlobalPhone Chinese-Shanghai    
  • Chinese

ID: ELRA-S0194

ISLRN: 879-999-559-792-7

The GlobalPhone corpus developed in collaboration with the Karlsruhe Institute of Technology (KIT) was designed to provide read speech data for the development and evaluation of large continuous speech recognition systems in the most widespread languages of the world, and to provide a uniform, mu...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 300.00 € | 1800.00 €
Licence: Commercial Use - ELRA VAR | 1800.00 € | 1800.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 355.00 € | 2125.00 €
Licence: Commercial Use - ELRA VAR | 2125.00 € | 2125.00 €

Special offers are also available. Check here for details.

 GlobalPhone Multilingual Model Package    
  • Arabic
  • Bulgarian
  • Chinese
  • Croatian
  • Czech
  • French
  • German
  • Hausa
  • Japanese
  • Korean
  • Polish
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Swahili (macrolanguage)
  • Swedish
  • Tamil
  • Thai
  • Turkish
  • Ukrainian
  • Vietnamese

ID: ELRA-S0399

ISLRN: 204-945-263-927-6

The GlobalPhone Multilingual Model Package contains about 22 hours of transcribed read speech spoken by native speakers in 22 languages. The data are sampled from the GlobalPhone Speech and Text Data available in the ELRA Catalogue, i.e.: Arabic (ELRA-S0192), Bulgarian (ELRA-S0319), Chinese-Manda...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 1200.00 € | 6000.00 €
Licence: Commercial Use - ELRA VAR | 6000.00 € | 6000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 1400.00 € | 7200.00 €
Licence: Commercial Use - ELRA VAR | 7200.00 € | 7200.00 €
 Hanzi Pinyin Database for Simplified Chinese    
  • Chinese

ID: ELRA-L0104

ISLRN: 292-895-602-975-4

Covers entries of general vocabulary, along with high-frequency technical terms and proper nouns. In addition to broad coverage and a high level of accuracy, the database has several special features including explicit codes to indicate headword type and part-of-speech, coverage of all polyphones, ...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 9000.00 € | 15000.00 €
Licence: Commercial Use - ELRA VAR | 18000.00 € | 30000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 11250.00 € | 18750.00 €
Licence: Commercial Use - ELRA VAR | 22500.00 € | 37500.00 €
 Hong Kong Cantonese Speech Recognition Corpus (Desktop)    
  • Chinese

ID: ELRA-S0228-75

ISLRN: 083-033-068-532-0

This corpus comprises 101,964 entries uttered by 51 speakers, recorded over 4 channels (desktop). Speech is stored as 16-bit samples at 44.1 kHz, for a total of 24.18 hours of speech per channel.

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 6000.00 € | 6000.00 €
Licence: Commercial Use - ELRA VAR | 6000.00 € | 6000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 6000.00 € | 6000.00 €
Licence: Commercial Use - ELRA VAR | 6000.00 € | 6000.00 €
 Korean-Chinese Database of Proper Nouns    
  • Chinese
  • Korean

ID: ELRA-M0070

ISLRN: 207-127-841-003-9

A large, comprehensive database of Korean-Chinese personal and place names, covering not only native Korean proper nouns but also Japanese, Chinese and Western proper nouns.

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 4500.00 € | 7500.00 €
Licence: Commercial Use - ELRA VAR | 9000.00 € | 15000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 5625.00 € | 9375.00 €
Licence: Commercial Use - ELRA VAR | 11250.00 € | 18750.00 €
 LC-STAR Mandarin Chinese Phonetic lexicon      
  • Chinese

ID: ELRA-S0256

ISLRN: 103-062-804-789-9

The LC-STAR Mandarin Chinese Phonetic lexicon was created within the scope of the LC-STAR project (IST 2001-32216) which was sponsored by the European Commission. The lexicon comprises 104,368 entries, distributed over three categories: - a set of 38,098 common word entries. This set is extract...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 27000.00 € | 40000.00 €
Licence: Commercial Use - ELRA VAR | 40000.00 € | 40000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 38000.00 € | 50000.00 €
Licence: Commercial Use - ELRA VAR | 50000.00 € | 50000.00 €
