82 Language Resources (Page 1 of 5)

« Previous | Next »Order by:

 aGender    
  • German

ID: ELRA-S0365

ISLRN: 038-476-412-610-4

aGender contains speech sample recordings over public telephone lines with read and (semi-)spontaneous speech. Native German speakers called a voice portal from their private phone, and read text + answered some open questions. The purpose of the corpus is the automatic detection of gender and/or...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
327.00 € submit
8127.00 € submit
Licence: Commercial Use - ELRA VAR
8127.00 € submit
8127.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
455.00 € submit
8255.00 € submit
Licence: Commercial Use - ELRA VAR
8255.00 € submit
8255.00 € submit
 Alcohol Language Corpus (BAS ALC)    
  • German

ID: ELRA-S0299

ISLRN: 780-368-852-139-3

ALC contains recordings of German speakers that are either intoxicated or sober. The type of speech ranges from read single digits to full conversation style. Recordings were done during drinking test where speakers drank beer or wine to reach a self-chosen level of alcoholic intoxication. The ac...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
510.00 € submit
510.00 € submit
Licence: Commercial Use - ELRA VAR
510.00 € submit
510.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1020.00 € submit
1020.00 € submit
Licence: Commercial Use - ELRA VAR
1020.00 € submit
1020.00 € submit
 ANITA (Audio eNhancement In Telecom Applications)    
  • English
  • French
  • German
  • Spanish; Castilian

ID: ELRA-S0156

ISLRN: 537-894-870-719-4

ANITA (Audio eNhancement In secured Telecommunication Applications) is a European project launched on the initiative of EADS TELECOM with the objective of reducing audio acoustics noise in secured communications in adverse environments (sirens, alarms, engines, water pumps, stress situations, etc...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1000.00 € submit
2000.00 € submit
Licence: Commercial Use - ELRA VAR
2000.00 € submit
2000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1500.00 € submit
2500.00 € submit
Licence: Commercial Use - ELRA VAR
2500.00 € submit
2500.00 € submit
 Austrian SpeechDat(AT) FDB-1000 database    
  • German

ID: ELRA-S0142

ISLRN: 989-950-794-642-6

The SpeechDat(AT) FDB-1000 database contains the recordings of 1,000 Austrian speakers (544 males, 456 females) recorded over the Austrian fixed telephone network. The database is partitioned into 5 CD-ROMs, in ISO 9660 format. Speech samples are stored as sequences of 8-bit 8 kHz A-law, uncompr...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
16000.00 € submit
20000.00 € submit
Licence: Commercial Use - ELRA VAR
20000.00 € submit
20000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
20000.00 € submit
25000.00 € submit
Licence: Commercial Use - ELRA VAR
25000.00 € submit
25000.00 € submit

This resource is also available in a bundle. Check here for bundled pricing.

 Austrian SpeechDat(AT) MDB-1000 database    
  • German

ID: ELRA-S0143

ISLRN: 294-112-593-120-6

The Austrian SpeechDat(AT) MDB-1000 database contains the recordings of 1,000 Austrian speakers (543 males, 457 females) recorded over the Austrian mobile telephone network. The database is partitioned into 5 CD-ROMs, in ISO 9660 format. Speech samples are stored as sequences of 8-bit 8 kHz A-la...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
24000.00 € submit
30000.00 € submit
Licence: Commercial Use - ELRA VAR
30000.00 € submit
30000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
30000.00 € submit
35000.00 € submit
Licence: Commercial Use - ELRA VAR
35000.00 € submit
35000.00 € submit

This resource is also available in a bundle. Check here for bundled pricing.

 BAS PHATT 1.0.X (sub-set)    
  • German

ID: ELRA-S0282-01

ISLRN: 704-844-083-488-7

The Ph@ttSessionz speech database, funded by the German Ministry of Science and Education (BMBF), contains recordings of 864 adolescent speakers of German (age range 12-20). The recordings were performed via the WWW in public schools (Gymnasium) in 41 locations in Germany. The speech material rec...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
512.00 € submit
2512.00 € submit
Licence: Commercial Use - ELRA VAR
2512.00 € submit
2512.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
512.00 € submit
3512.00 € submit
Licence: Commercial Use - ELRA VAR
3512.00 € submit
3512.00 € submit
 BAS PHATT 1.1.X (complete corpus)    
  • German

ID: ELRA-S0282-02

ISLRN: 847-046-185-654-8

The Ph@ttSessionz speech database, funded by the German Ministry of Science and Education (BMBF), contains recordings of 864 adolescent speakers of German (age range 12-20). The recordings were performed via the WWW in public schools (Gymnasium) in 41 locations in Germany. The speech material rec...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1917.30 € submit
3917.30 € submit
Licence: Commercial Use - ELRA VAR
3917.30 € submit
3917.30 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
3834.75 € submit
6834.75 € submit
Licence: Commercial Use - ELRA VAR
6834.75 € submit
6834.75 € submit
 BITS Logatome Synthesis Corpus – BITS-LG    
  • German

ID: ELRA-S0217

ISLRN: 887-235-135-658-7

BITS stands for "BAS Infrastructures for Technical Speech Processing" and was funded by the German Ministry of Science and Education during 2003-2005. The BITS synthesis corpus consists of two parts: a set of logatome recordings for controlled diphone synthesis (ELRA-S0217) and a set of sentence ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
627.17 € submit
4627.17 € submit
Licence: Commercial Use - ELRA VAR
4627.17 € submit
4627.17 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
754.35 € submit
9000.00 € submit
Licence: Commercial Use - ELRA VAR
9000.00 € submit
9000.00 € submit
 BITS Unit Selection Synthesis Corpus    
  • German

ID: ELRA-S0224

ISLRN: 553-776-339-039-5

BITS stands for "BAS Infrastructures for Technical Speech Processing" and was funded by the German Ministry of Science and Education during 2003-2005. The BITS synthesis corpus consists of two parts: a set of logatome recordings for controlled diphone synthesis (ELRA-S0217) and a set of sentenc...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
627.17 € submit
4627.17 € submit
Licence: Commercial Use - ELRA VAR
4627.17 € submit
4627.17 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
754.35 € submit
9000.00 € submit
Licence: Commercial Use - ELRA VAR
9000.00 € submit
9000.00 € submit
 deL1L2IM corpus    
  • German

ID: ELRA-W0083

ISLRN: 339-799-085-669-8

The deL1L2IM corpus, created between May and August 2012 and last updated in August 2014, has been collected within the framework of a PhD project on the development of a learning method implying conversations with an artificial companion. This PhD work is presented as a qualitative investigation...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
0.00 € submit
Licence: Commercial Use - ELRA VAR
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
0.00 € submit
Licence: Commercial Use - ELRA VAR
0.00 € submit
0.00 € submit
 Erlanger Bahnansage - ERBA    
  • German

ID: ELRA-S0013

ISLRN: 839-887-396-051-4

Over 10.000 utterances read by over 100 German speakers (60 male and 40 female), in the domain of train inquiries. All recordings were made in a quiet office room (4 CDROMs).

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
511.29 € submit
6263.29 € submit
Licence: Commercial Use - ELRA VAR
6263.29 € submit
6263.29 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1022.58 € submit
6774.58 € submit
Licence: Commercial Use - ELRA VAR
6774.58 € submit
6774.58 € submit
 EUROM1g German    
  • German

ID: ELRA-S0014-03

ISLRN: 348-315-352-666-4

The first really multilingual speech database produced in Europe. Equivalent corpora for each of the European languages: same number of speakers selected in the same way, and recorded in the same conditions with common file formats. Initially eight European countries have made recordings: Italy, ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
800.00 € submit
800.00 € submit
Licence: Commercial Use - ELRA VAR
800.00 € submit
800.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1600.00 € submit
1600.00 € submit
Licence: Commercial Use - ELRA VAR
1600.00 € submit
1600.00 € submit
 German SpeechDat-Car    
  • German

ID: ELRA-S0122

ISLRN: 090-326-240-961-9

The German SpeechDat-Car database comprises 338 German speakers recorded over the mobile telephone network. This database is partitioned into 17 DVDs and 1 CD. The speech databases made within the SpeechDat-Car project were validated by SPEX, the Netherlands, to assess their compliance with the S...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
90000.00 € submit
90000.00 € submit
Licence: Commercial Use - ELRA VAR
90000.00 € submit
90000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
120000.00 € submit
120000.00 € submit
Licence: Commercial Use - ELRA VAR
120000.00 € submit
120000.00 € submit
 German SpeechDat(II) MDB-1000    
  • German

ID: ELRA-S0096

ISLRN: 222-030-211-822-7

The German SpeechDat(II) MDB-1000 comprises 1295 German speakers (663 males, 610 females, 22 speakers with gender not specified) recorded over the German mobile telephone network. The MDB-1000 database is partitioned into 8 CDs in ISO 9660 format. The speech databases made within the SpeechDat(I...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
20000.00 € submit
28000.00 € submit
Licence: Commercial Use - ELRA VAR
28000.00 € submit
28000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
25000.00 € submit
35000.00 € submit
Licence: Commercial Use - ELRA VAR
35000.00 € submit
35000.00 € submit
 German Speecon database    
  • German

ID: ELRA-S0216

ISLRN: 578-285-282-393-9

The German Speecon database is divided into 2 sets: 1) The first set comprises the recordings of 562 adult German speakers (272 males, 290 females), recorded over 4 microphone channels in 4 recording environments (office, entertainment, car, public place). 2) The second set comprises the record...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
50000.00 € submit
67000.00 € submit
Licence: Commercial Use - ELRA VAR
67000.00 € submit
67000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
60000.00 € submit
75000.00 € submit
Licence: Commercial Use - ELRA VAR
75000.00 € submit
75000.00 € submit
 GlobalPhone 2000 Speaker Package    
  • Arabic
  • Bulgarian
  • Chinese
  • Croatian
  • Czech
  • French
  • German
  • Hausa
  • Japanese
  • Korean
  • Polish
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Swahili (macrolanguage)
  • Swedish
  • Tamil
  • Thai
  • Turkish
  • Ukrainian
  • Vietnamese

ID: ELRA-S0400

ISLRN: 331-592-378-424-7

The GlobalPhone 2000 Speaker Package contains transcribed read speech spoken by 2000 native speakers in 22 languages. The data are sampled from the GlobalPhone Speech and Text Data available in the ELRA Catalogue, i.e.: Arabic (ELRA-S0192), Bulgarian (ELRA-S0319), Chinese-Mandarin (ELRA-S0193), C...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1200.00 € submit
6000.00 € submit
Licence: Commercial Use - ELRA VAR
6000.00 € submit
6000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1400.00 € submit
7200.00 € submit
Licence: Commercial Use - ELRA VAR
7200.00 € submit
7200.00 € submit
 GlobalPhone German    
  • German

ID: ELRA-S0198

ISLRN: 937-733-002-847-8

The GlobalPhone corpus developed in collaboration with the Karlsruhe Institute of Technology (KIT) was designed to provide read speech data for the development and evaluation of large continuous speech recognition systems in the most widespread languages of the world, and to provide a uniform, mu...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
600.00 € submit
3000.00 € submit
Licence: Commercial Use - ELRA VAR
3000.00 € submit
3000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
700.00 € submit
3600.00 € submit
Licence: Commercial Use - ELRA VAR
3600.00 € submit
3600.00 € submit

Special offers are also available. Check here for details.

 GlobalPhone Multilingual Model Package    
  • Arabic
  • Bulgarian
  • Chinese
  • Croatian
  • Czech
  • French
  • German
  • Hausa
  • Japanese
  • Korean
  • Polish
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Swahili (macrolanguage)
  • Swedish
  • Tamil
  • Thai
  • Turkish
  • Ukrainian
  • Vietnamese

ID: ELRA-S0399

ISLRN: 204-945-263-927-6

The GlobalPhone Multilingual Model Package contains about 22 hours of transcribed read speech spoken by native speakers in 22 languages. The data are sampled from the GlobalPhone Speech and Text Data available in the ELRA Catalogue, i.e.: Arabic (ELRA-S0192), Bulgarian (ELRA-S0319), Chinese-Manda...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1200.00 € submit
6000.00 € submit
Licence: Commercial Use - ELRA VAR
6000.00 € submit
6000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1400.00 € submit
7200.00 € submit
Licence: Commercial Use - ELRA VAR
7200.00 € submit
7200.00 € submit
 Hempel    
  • German

ID: ELRA-S0162

ISLRN: 683-410-635-177-8

This corpus contains 3,909 recordings via public phone lines (fixed network only) of 3,909 German speakers with a total of 184,240 spoken words. The contents are free monologues answering the question: "Was haben Sie in der letzten Stunde gemacht?" (What did you do within the last hour?). 25.5 ho...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
755.00 € submit
4755.00 € submit
Licence: Commercial Use - ELRA VAR
4755.00 € submit
4755.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1010.00 € submit
5010.00 € submit
Licence: Commercial Use - ELRA VAR
5010.00 € submit
5010.00 € submit
 Karl May Korpus (KMK)    
  • German

ID: ELRA-W0016

ISLRN: 628-817-117-400-1

The "Karl-May-Korpus" is a monolingual German corpus, available in an SGML-tagged ASCII text format. It contains the works of the German author Karl May (1842-1912) and consists of around 1.6 million words (divided into 9 subcorpora of about 180,000 words each). The corpus was created between 199...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
400.00 € submit
2500.00 € submit
Licence: Commercial Use - ELRA VAR
2500.00 € submit
2500.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
800.00 € submit
3500.00 € submit
Licence: Commercial Use - ELRA VAR
3500.00 € submit
3500.00 € submit

« Previous | Next »