27 Language Resources (Page 1 of 2)


 Collins Multilingual database (MLD) – PhraseBank with audio files    
  • Arabic
  • Chinese
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Finnish
  • French
  • German
  • Hindi
  • Italian
  • Japanese
  • Korean
  • Modern Greek (1453-)
  • Norwegian
  • Persian
  • Polish
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Swedish
  • Thai
  • Turkish
  • Vietnamese

ID: ELRA-S0383

ISLRN: 398-655-047-044-5

The Collins Multilingual database covers Real Life Daily vocabulary. It is composed of a multilingual lexicon in 32 languages (the WordBank, see ELRA-T0376) and a multilingual set of sentences in 28 languages (the PhraseBank, see ELRA-T0377). This version includes the audio files corresponding t...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 3360.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 4480.00 €
 Collins Multilingual database (MLD) – WordBank with audio files    
  • Arabic
  • Chinese
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Finnish
  • French
  • German
  • Italian
  • Japanese
  • Korean
  • Modern Greek (1453-)
  • Norwegian
  • Polish
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Swedish
  • Thai
  • Turkish
  • Vietnamese

ID: ELRA-S0382

ISLRN: 309-438-781-042-2

The Collins Multilingual database covers Real Life Daily vocabulary. It is composed of a multilingual lexicon in 32 languages (the WordBank, see ELRA-T0376) and a multilingual set of sentences in 28 languages (the PhraseBank, see ELRA-T0377). This version includes the corresponding audio files c...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 3640.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 5200.00 €
 GlobalPhone 2000 Speaker Package    
  • Arabic
  • Bulgarian
  • Chinese
  • Croatian
  • Czech
  • French
  • German
  • Hausa
  • Japanese
  • Korean
  • Polish
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Swahili (macrolanguage)
  • Swedish
  • Tamil
  • Thai
  • Turkish
  • Ukrainian
  • Vietnamese

ID: ELRA-S0400

ISLRN: 331-592-378-424-7

The GlobalPhone 2000 Speaker Package contains transcribed read speech spoken by 2000 native speakers in 22 languages. The data are sampled from the GlobalPhone Speech and Text Data available in the ELRA Catalogue, i.e.: Arabic (ELRA-S0192), Bulgarian (ELRA-S0319), Chinese-Mandarin (ELRA-S0193), C...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 1200.00 € | 6000.00 €
Licence: Commercial Use - ELRA VAR | 6000.00 € | 6000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 1400.00 € | 7200.00 €
Licence: Commercial Use - ELRA VAR | 7200.00 € | 7200.00 €
 GlobalPhone Korean    
  • Korean

ID: ELRA-S0200

ISLRN: 520-329-707-787-0

The GlobalPhone corpus developed in collaboration with the Karlsruhe Institute of Technology (KIT) was designed to provide read speech data for the development and evaluation of large continuous speech recognition systems in the most widespread languages of the world, and to provide a uniform, mu...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 600.00 € | 3000.00 €
Licence: Commercial Use - ELRA VAR | 3000.00 € | 3000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 700.00 € | 3600.00 €
Licence: Commercial Use - ELRA VAR | 3600.00 € | 3600.00 €

Special offers are also available for this resource.

 GlobalPhone Multilingual Model Package    
  • Arabic
  • Bulgarian
  • Chinese
  • Croatian
  • Czech
  • French
  • German
  • Hausa
  • Japanese
  • Korean
  • Polish
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Swahili (macrolanguage)
  • Swedish
  • Tamil
  • Thai
  • Turkish
  • Ukrainian
  • Vietnamese

ID: ELRA-S0399

ISLRN: 204-945-263-927-6

The GlobalPhone Multilingual Model Package contains about 22 hours of transcribed read speech spoken by native speakers in 22 languages. The data are sampled from the GlobalPhone Speech and Text Data available in the ELRA Catalogue, i.e.: Arabic (ELRA-S0192), Bulgarian (ELRA-S0319), Chinese-Manda...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 1200.00 € | 6000.00 €
Licence: Commercial Use - ELRA VAR | 6000.00 € | 6000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 1400.00 € | 7200.00 €
Licence: Commercial Use - ELRA VAR | 7200.00 € | 7200.00 €
 Korean Speech Data by Mobile Phone - 357 Hours    
  • Korean

ID: ELRA-S0460

ISLRN: 452-991-963-242-3

357 hours of Korean speech data collected by mobile phone, recorded by 999 Korean speakers in a quiet environment and rich in content. All texts are transcribed by professional annotators. The sentence accuracy rate is 95%. It can be used for speech recognition, machine translation and voiceprint recognition...

MEMBER | academic | commercial
Licence: Commercial Use - ELRA VAR | 57655.50 € | 57655.50 €
NON MEMBER | academic | commercial
Licence: Commercial Use - ELRA VAR | 57655.50 € | 57655.50 €

Special offers are also available for this resource.

 Korean Speech Data by Mobile Phone - 516 Hours    
  • Korean

ID: ELRA-S0431

ISLRN: 988-295-249-804-1

Korean audio data with a total duration of 516 hours. Recorded texts include daily language, various interactive sentences, home commands, on-board commands, etc. Of the 1,077 speakers, 49% are male and 51% are female. Each speaker contributes around half an hour of recordings. Format: 16 kHz, 16-bit, ...

MEMBER | academic | commercial
Licence: Commercial Use - ELRA VAR | 83334.00 € | 83334.00 €
NON MEMBER | academic | commercial
Licence: Commercial Use - ELRA VAR | 83334.00 € | 83334.00 €

Special offers are also available for this resource.

 Korean Speech Data by Mobile Phone_Reading - 197 Hours    
  • Korean

ID: ELRA-S0475

ISLRN: 330-652-829-242-5

This corpus contains recordings of 291 local Korean speakers, made in a quiet indoor environment. The recordings cover economics, entertainment, news, spoken language, figures and letters, with 400 sentences for each speaker. Recording devices are mainstream Android phones and iPhones. Format: 16 kHz, 16-bit, uncompressed WAV, mono chann...

MEMBER | academic | commercial
Licence: Commercial Use - ELRA VAR | 31815.50 € | 31815.50 €
NON MEMBER | academic | commercial
Licence: Commercial Use - ELRA VAR | 31815.50 € | 31815.50 €

Special offers are also available for this resource.

 Korean Speech Recognition Corpus (desktop) – digit string (110 people)    
  • Korean

ID: ELRA-S0228-52

ISLRN: 331-652-936-814-0

This corpus comprises 13,200 Korean digit strings uttered by 110 speakers of different dialects, ages and various educational levels, recorded over 4 channels. Speech samples are stored as a sequence of 16-bit 48kHz WAV for 18.89 hours of speech per channel. The total capacity of the data is 24.2...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 4000.00 € | 4000.00 €
Licence: Commercial Use - ELRA VAR | 4000.00 € | 4000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 4000.00 € | 4000.00 €
Licence: Commercial Use - ELRA VAR | 4000.00 € | 4000.00 €
 Korean Speech Recognition Corpus (Desktop+Mobile)    
  • Korean

ID: ELRA-S0228-103

ISLRN: 852-908-669-816-4

This corpus comprises 32,247 entries uttered by 52 speakers (26 males and 26 females), recorded over 3 channels (desktop and mobile in quiet office). Speech samples are stored as a sequence of 16-bit 48kHz for a total of 15.76 hours of speech per channel.

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 5000.00 € | 5000.00 €
Licence: Commercial Use - ELRA VAR | 5000.00 € | 5000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 5000.00 € | 5000.00 €
Licence: Commercial Use - ELRA VAR | 5000.00 € | 5000.00 €
 Korean Speech Recognition corpus (Desktop) - name, digit string, place, sentences    
  • Korean

ID: ELRA-S0228-62

ISLRN: 832-286-766-358-8

This corpus comprises 83,756 entries uttered by 150 speakers (66 males and 84 females), recorded over 4 channels (desktop in quiet office). Speech samples are stored as a sequence of 16-bit 48kHz for a total of 29.65 hours of speech per channel. This set combines ELRA-S0228-50, ELRA-S0228-51, ELR...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 15000.00 € | 15000.00 €
Licence: Commercial Use - ELRA VAR | 15000.00 € | 15000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 15000.00 € | 15000.00 €
Licence: Commercial Use - ELRA VAR | 15000.00 € | 15000.00 €
 Korean Speech Recognition Corpus (desktop) – person name (150 people)    
  • Korean

ID: ELRA-S0228-50

ISLRN: 146-478-758-877-6

This corpus comprises 1,500 Korean person names uttered by 150 speakers of different dialects, ages and various educational levels, recorded over 4 channels. Speech samples are stored as a sequence of 16-bit 48kHz WAV for 1.69 hours of speech per channel. The total capacity of the data is 2 Gb. ...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 4000.00 € | 4000.00 €
Licence: Commercial Use - ELRA VAR | 4000.00 € | 4000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 4000.00 € | 4000.00 €
Licence: Commercial Use - ELRA VAR | 4000.00 € | 4000.00 €
 Korean Speech Recognition Corpus (desktop) – place name (150 people)    
  • Korean

ID: ELRA-S0228-51

ISLRN: 227-476-631-391-1

This corpus comprises 1,500 Korean place names uttered by 150 speakers of different dialects, ages and various educational levels, recorded over 4 channels. Speech samples are stored as a sequence of 16-bit 48kHz WAV for 1.65 hours of speech per channel. The total capacity of the data is 2 Gb. E...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 4000.00 € | 4000.00 €
Licence: Commercial Use - ELRA VAR | 4000.00 € | 4000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 4000.00 € | 4000.00 €
Licence: Commercial Use - ELRA VAR | 4000.00 € | 4000.00 €
 Korean Speech Recognition Corpus (desktop) – single Korean sentences (40 people)    
  • Korean

ID: ELRA-S0228-53

ISLRN: 892-259-683-285-7

This corpus comprises 4,800 Korean sentences uttered by 40 speakers of different dialects, ages and various educational levels, recorded over 4 channels. Speech samples are stored as a sequence of 16-bit 48kHz WAV for 7.43 hours of speech per channel. The total capacity of the data is 9.82 Gb. E...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 3000.00 € | 3000.00 €
Licence: Commercial Use - ELRA VAR | 3000.00 € | 3000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 3000.00 € | 3000.00 €
Licence: Commercial Use - ELRA VAR | 3000.00 € | 3000.00 €
 Korean Speecon database    
  • Korean

ID: ELRA-S0177

ISLRN: 429-596-342-929-3

The Korean Speecon database is divided into 2 sets: 1) The first set comprises the recordings of 568 adult Korean speakers (259 males, 309 females), recorded over 4 microphone channels in 4 recording environments (office, entertainment, car, public place). 2) The second set comprises the record...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 50000.00 € | 67000.00 €
Licence: Commercial Use - ELRA VAR | 67000.00 € | 67000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 60000.00 € | 75000.00 €
Licence: Commercial Use - ELRA VAR | 75000.00 € | 75000.00 €
 Korean-Vietnamese Parallel Corpus    
  • Korean
  • Vietnamese

ID: ELRA-W0313

ISLRN: 365-128-449-700-7

The Korean-Vietnamese Parallel Corpus consists of 200,000 sentence pairs, with an average length of 15 words per sentence. The corpus is provided in XML format and is annotated according to TEI-encoding guidelines.

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 200.00 € | 400.00 €
Licence: Commercial Use - ELRA VAR | 1400.00 € | 1400.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 300.00 € | 600.00 €
Licence: Commercial Use - ELRA VAR | 2100.00 € | 2100.00 €
 LILA Korean database    
  • Korean

ID: ELRA-S0295

ISLRN: 391-771-784-796-1

The LILA Korean database collected in South Korea was recorded within the scope of the LILA project. It contains the recordings of 1,000 Korean speakers (500 males and 500 females) recorded over the Korean mobile telephone network. The following acoustic conditions were selected as representativ...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 43125.00 € | 47500.00 €
Licence: Commercial Use - ELRA VAR | 47500.00 € | 47500.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 46405.00 € | 51875.00 €
Licence: Commercial Use - ELRA VAR | 51875.00 € | 51875.00 €
 Multilingual Corpus    
  • Chinese
  • English
  • Korean

ID: ELRA-W0035

ISLRN: 731-151-596-869-3

Multilingual parallel corpus produced by Kaist Korterm containing 60 000 expressions in Korean, Chinese and English.

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 750.00 € | 3000.00 €
Licence: Commercial Use - ELRA VAR | 3000.00 € | 3000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 1500.00 € | 6000.00 €
Licence: Commercial Use - ELRA VAR | 6000.00 € | 6000.00 €
 Parallel Corpora & Domains (bilingual and multilingual)    
  • Arabic
  • Chinese
  • Danish
  • Dutch; Flemish
  • English
  • Finnish
  • French
  • German
  • Hebrew
  • Italian
  • Japanese
  • Korean
  • Modern Greek (1453-)
  • Northern Sami
  • Norwegian
  • Polish
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Swedish
  • Turkish

ID: ELRA-W0336

ISLRN: 471-919-856-164-1

Parallel corpora for nearly 400 language pairs and numerous multilingual combinations, including 10 million bilingual segments and 90 million tokens in 20 languages: Arabic, Chinese (Simplified), Danish, Dutch, English, Finnish, French, German, Greek, Hebrew, Italian, Japanese, Korean, North Sami...

MEMBER | academic | commercial
Licence: Commercial Use - ELRA VAR | 0.10 € | 0.10 €
NON MEMBER | academic | commercial
Licence: Commercial Use - ELRA VAR | 0.11 € | 0.11 €

Special offers are also available for this resource.

 Phonetically Balanced Sentences    
  • Korean

ID: ELRA-S0129

ISLRN: 134-396-214-473-8

Large acoustic corpus in Korean produced by Kaist Korterm. 20 native Korean speakers (male and female) each read 539 sentences once, plus a set of 50 common sentences. Information about the speakers, such as their size and level of education, is provided. The recordings took place in a soundproof room. T...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 500.00 € | 2000.00 €
Licence: Commercial Use - ELRA VAR | 2000.00 € | 2000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 1000.00 € | 4000.00 €
Licence: Commercial Use - ELRA VAR | 4000.00 € | 4000.00 €
