Text (671)
Audio (151)
Video (11)
True (180)
TMX (6)
TEI (3)

Resource Type:

Corpus:
Lexical/Conceptual:
Tool/Service:
Language Description:

Media Type:

Text:
Audio:
Image:
Video:
Text Numerical:
Text N-Gram:

812 Language Resources (Page 1 of 41)

« Previous | Next »Order by:

 2007 CoNLL Shared Task - Arabic & English    
  • Arabic
  • English

ID: ELRA-W0123

ISLRN: 505-782-255-628-8

2007 CoNLL Shared Task - Arabic & English consists of dependency treebanks in two languages used as part of the CoNLL 2007 shared task on multi-lingual dependency parsing and domain adaptation. The languages covered in this release are: Arabic and English. The Conference on Computational Natur...

MEMBERacademiccommercial
Licence: Non Commercial Use - Non Standard Licence Terms
NON MEMBERacademiccommercial
Licence: Non Commercial Use - Non Standard Licence Terms
 A Bilingual English-Ukrainian Lexicon of Named Entities Extracted from Wikipedia    
  • English
  • Ukrainian

ID: ELRA-M0104

ISLRN: 110-617-195-245-4

The bilingual English-Ukrainian lexicon of named entities uses Wikipedia metadata as a source. The extracted named entity pairs are classified into five classes: PERSON, ORGANIZATION, LOCATION, PRODUCT, and MISC (miscellaneous). The lexicon consists of 624,168 pairs and comes in two formats: csv ...

MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
0.00 € submit
0.00 € submit
 Accented English GlobalPhone    
  • English

ID: ELRA-S0389

ISLRN: 574-579-221-841-3

The Accented English part of the GlobalPhone resources contains 63 recording sessions of Bulgarian, Chinese, German, and Indian native speakers reading 37 English sentences each, produced in GlobalPhone-style, i.e. 16kHz PCM encoded audio recordings of utterance-segmented read speech from the new...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
600.00 € submit
3000.00 € submit
Licence: Commercial Use - ELRA VAR
3000.00 € submit
3000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
700.00 € submit
3600.00 € submit
Licence: Commercial Use - ELRA VAR
3600.00 € submit
3600.00 € submit
 ACCOR - English    
  • English

ID: ELRA-S0001

ISLRN: 936-783-643-804-4

ACCOR is a unique acoustic and articulatory database recorded as part of the ESPRIT- ACCOR project investigating cross-language acoustic-articulatory correlations in coarticulatory processes. The European Languages covered are: Catalan, English, French, German, Irish Gaelic, Italian and Swedish. ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
25.00 € submit
Licence: Commercial Use - ELRA VAR
25.00 € submit
25.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
75.00 € submit
Licence: Commercial Use - ELRA VAR
75.00 € submit
75.00 € submit
 ACL RD-TEC: A Reference Dataset for Terminology Extraction and Classification Research in Computational Linguistics    
  • English

ID: ELRA-T0375

ISLRN: 699-305-362-089-6

Automatic Term Recognition (ATR) is a research task that deals with the identification of domain-specific terms. Terms, in simple words, are textual realization of significant concepts in an expertise domain. Additionally, domain-specific terms may be classified into a number of categories, in wh...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
1000.00 € submit
Licence: Commercial Use - ELRA VAR
1000.00 € submit
1000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
3000.00 € submit
Licence: Commercial Use - ELRA VAR
3000.00 € submit
3000.00 € submit
 American/Canadian English Speech Recognition Corpus (headset+mobile)    
  • English

ID: ELRA-S0228-102

ISLRN: 992-319-311-431-0

This corpus comprises 12,974 entries uttered by 30 speakers (15 males and 15 females), recorded over 2 channels (headset and mobile in noisy restaurant/shopping mall/info center/hospital/station/car). Speech samples are stored as a sequence of 16-bit 48kHz for a total of 12 hours of speech per ch...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
3600.00 € submit
3600.00 € submit
Licence: Commercial Use - ELRA VAR
3600.00 € submit
3600.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
3600.00 € submit
3600.00 € submit
Licence: Commercial Use - ELRA VAR
3600.00 € submit
3600.00 € submit
 American Children Speech Data by Microphone - 50 Hours    
  • English

ID: ELRA-S0468

ISLRN: 178-575-028-743-8

It is recorded by 219 American children native speakers. The recording texts are mainly storybook, children's song, spoken expressions, etc. 350 sentences for each speaker. Each sentence contain 4.5 words in average. Each sentence is repeated 2.1 times in average. The recording device is hi-fi Bl...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
28785.00 € submit
28785.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
28785.00 € submit
28785.00 € submit

Special offers are also available. Check here for details.

 American English Conversational Speech Recognition Corpus (Multi-Channel)    
  • English

ID: ELRA-S0228-93

ISLRN: 576-996-121-023-5

This corpus was recorded by 20 speakers (10 males and 10 females), over 7 channels (multi-channel in quiet office/home). Speech samples are stored as a sequence of 16-bit 16 kHz for a total of 10 hours of speech per channel.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5040.00 € submit
5040.00 € submit
Licence: Commercial Use - ELRA VAR
5040.00 € submit
5040.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5040.00 € submit
5040.00 € submit
Licence: Commercial Use - ELRA VAR
5040.00 € submit
5040.00 € submit
 American English Speech Data by Mobile Phone - 800 Hours    
  • English

ID: ELRA-S0437

ISLRN: 629-877-109-625-1

1842 American native speakers participated in the recording with authentic accent. The recorded script is designed by linguists, based on scenes, and cover a wide range of topics including generic, interactive, on-board and home. The text is manually proofread with high accuracy. It matches with ...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
136800.00 € submit
136800.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
136800.00 € submit
136800.00 € submit

Special offers are also available. Check here for details.

 American English Speech Data by Mobile Phone_Reading - 215 Hours    
  • English

ID: ELRA-S0467

ISLRN: 921-365-371-849-5

The data set contains 349 American English speakers' speech data, all of whom are American locals. It is recorded in quiet environment. The recording contents cover various categories like economics, entertainment, news and spoken language. It is manually transcribed and annotated with the starti...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
34722.50 € submit
34722.50 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
34722.50 € submit
34722.50 € submit

Special offers are also available. Check here for details.

 American English Speech Recognition Corpus (Desktop)    
  • English

ID: ELRA-S0228-79

ISLRN: 254-019-000-249-3

This corpus comprises 49,990 entries uttered by 50 speakers (25 males and 25 females), recorded over 2 channels (desktop in quiet office). Speech samples are stored as a sequence of 16-bit 16kHz for a total of 24.9 hours of speech per channel.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5400.00 € submit
5400.00 € submit
Licence: Commercial Use - ELRA VAR
5400.00 € submit
5400.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5400.00 € submit
5400.00 € submit
Licence: Commercial Use - ELRA VAR
5400.00 € submit
5400.00 € submit
 American English Speech Recognition Corpus (Desktop)    
  • English

ID: ELRA-S0228-113

ISLRN: 703-568-790-770-5

This corpus was recorded in both quiet and noisy environments over 2 channels and collected from a total of 50 speakers, including 24 males and 26 females, all of whom have been carefully screened to ensure their standard and clear pronunciation. The audio scripts cover information such as text m...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5400.00 € submit
5400.00 € submit
Licence: Commercial Use - ELRA VAR
5400.00 € submit
5400.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5400.00 € submit
5400.00 € submit
Licence: Commercial Use - ELRA VAR
5400.00 € submit
5400.00 € submit
 American English Speech Recognition Corpus (Mobile) - 14.67 hours    
  • English

ID: ELRA-S0228-73

ISLRN: 817-988-141-738-4

This corpus comprises 14,988 entries uttered by 50 speakers (23 males and 27 females), recorded over the mobile telephone network. Speech samples are stored as a sequence of 16-bit 16 kHz for a total of 14.67 hours of speech.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5400.00 € submit
5400.00 € submit
Licence: Commercial Use - ELRA VAR
5400.00 € submit
5400.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5400.00 € submit
5400.00 € submit
Licence: Commercial Use - ELRA VAR
5400.00 € submit
5400.00 € submit
 American English Wake-up Words Speech Recognition Corpus (Mobile)    
  • English

ID: ELRA-S0228-58

ISLRN: 968-856-860-742-9

The corpus contains the recordings of 38,718 utterances of American English mobile Keywords speech data which were from 149 speakers(72 males and 77 females). Each speaker was designed to record 1 session, totally 260 utterances in quiet or noisy environments. The total pure recording time is abo...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2430.00 € submit
2430.00 € submit
Licence: Commercial Use - ELRA VAR
2430.00 € submit
2430.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2430.00 € submit
2430.00 € submit
Licence: Commercial Use - ELRA VAR
2430.00 € submit
2430.00 € submit
 American Spanish Recognition Corpus (Desktop+Mobile)    
  • English

ID: ELRA-S0228-68

ISLRN: 100-009-143-020-4

This corpus comprises 33,527 entries uttered by 40 speakers (21 males and 19 females), recorded over 2 channels (desktop in quiet office and mobile in noisy restaurant). Speech samples are stored as a sequence of 16-bit 16kHz for a total of 14.7 hours of speech per channel.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
4320.00 € submit
4320.00 € submit
Licence: Commercial Use - ELRA VAR
4320.00 € submit
4320.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
4320.00 € submit
4320.00 € submit
Licence: Commercial Use - ELRA VAR
4320.00 € submit
4320.00 € submit
 Amharic-English bilingual corpus    
  • Amharic
  • English

ID: ELRA-W0074

ISLRN: 590-255-335-719-0

The Amharic-English bilingual corpus contains parallel text from legal and news domains in Amharic script, in transliterated form and in English. The size of the corpus is of 232,653 words in Amharic and 291,701 in English. This parallel corpus contains documents from two domains, namely legal...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
2000.00 € submit
Licence: Commercial Use - ELRA VAR
2000.00 € submit
2000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
4000.00 € submit
Licence: Commercial Use - ELRA VAR
4000.00 € submit
4000.00 € submit
 ANITA (Audio eNhancement In Telecom Applications)    
  • English
  • French
  • German
  • Spanish; Castilian

ID: ELRA-S0156

ISLRN: 537-894-870-719-4

ANITA (Audio eNhancement In secured Telecommunication Applications) is a European project launched on the initiative of EADS TELECOM with the objective of reducing audio acoustics noise in secured communications in adverse environments (sirens, alarms, engines, water pumps, stress situations, etc...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1000.00 € submit
2000.00 € submit
Licence: Commercial Use - ELRA VAR
2000.00 € submit
2000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1500.00 € submit
2500.00 € submit
Licence: Commercial Use - ELRA VAR
2500.00 € submit
2500.00 € submit
 Annotated tweet corpus in Arabizi, French and English    
  • Arabic
  • English
  • French

ID: ELRA-W0323

ISLRN: 482-848-308-105-6

The annotated tweet corpus in Arabizi, French and English was built by ELDA on behalf of INSA Rouen Normandie (Normandie Université, LITIS team), in the framework of the SAPhIRS project (System for the Analysis of Information Propagation in Social Networks), funded by the DGE (Direction Générale ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
7000.00 € submit
Licence: Commercial Use - ELRA VAR
7000.00 € submit
7000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
10000.00 € submit
Licence: Commercial Use - ELRA VAR
10000.00 € submit
10000.00 € submit
 ArabLEX: Database of Arabic Place Names (DAP)    
  • Arabic
  • English

ID: ELRA-M0105

ISLRN: 161-842-321-771-2

This database is part of the ArabLEX set of data which consists of the Database of Arabic General Vocabulary (DAG), Database of Arabic Place Names (DAP), Database of Foreign Names in Arabic (DAF) and Database of Arab Names (DAN) available from ELRA under references, respectively, ELRA-L0131, ELRA...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5000.00 € submit
15000.00 € submit
Licence: Commercial Use - ELRA VAR
15000.00 € submit
15000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
7000.00 € submit
22000.00 € submit
Licence: Commercial Use - ELRA VAR
22000.00 € submit
22000.00 € submit

Special offers are also available. Check here for details.

 ArabLEX: Database of Arab Names (DAN)    
  • Arabic
  • English

ID: ELRA-M0107

ISLRN: 773-974-582-139-4

This database is part of the ArabLEX set of data which consists of the Database of Arabic General Vocabulary (DAG), Database of Arabic Place Names (DAP), Database of Foreign Names in Arabic (DAF) and Database of Arab Names (DAN) available from ELRA under references, respectively, ELRA-L0131, ELRA...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
15000.00 € submit
45000.00 € submit
Licence: Commercial Use - ELRA VAR
45000.00 € submit
45000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
24000.00 € submit
71000.00 € submit
Licence: Commercial Use - ELRA VAR
71000.00 € submit
71000.00 € submit

Special offers are also available. Check here for details.

« Previous | Next »