12 Language Resources
- Bulgarian
- Danish
- Dutch; Flemish
- German
- Japanese
- Portuguese
- Slovenian
- Spanish; Castilian
- Swedish
- Turkish
ID: ELRA-W0086
ISLRN: 578-227-532-044-0
2006 CoNLL Shared Task - Ten Languages consists of dependency treebanks in ten languages used as part of the CoNLL 2006 shared task on multi-lingual dependency parsing. The languages covered in this release are: Bulgarian, Danish, Dutch, German, Japanese, Portuguese, Slovene, Spanish, Swedish and...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 0.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 0.00 € |
- Bulgarian
- Czech
- Dutch; Flemish
- English
- Finnish
- French
- German
- Hungarian
- Italian
- Persian
- Portuguese
- Russian
- Spanish; Castilian
- Swedish
ID: ELRA-E0036
ISLRN: 378-279-085-589-0
The Cross-Language Evaluation Forum (CLEF) promotes R&D in multilingual information access (MLIA) by (i) developing an infrastructure for the testing, tuning and evaluation of information retrieval systems operating on European languages in both monolingual and cross-language contexts, and (ii) c...
MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 150.00 € | 500.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 300.00 € | 1000.00 € |

Special offers are also available. Check here for details.
- Bulgarian
- Dutch; Flemish
- English
- Finnish
- French
- German
- Italian
- Portuguese
- Romanian; Moldavian; Moldovan
- Spanish; Castilian
ID: ELRA-E0038
ISLRN: 394-993-527-034-7
The Cross-Language Evaluation Forum (CLEF) promotes R&D in multilingual information access (MLIA) by (i) developing an infrastructure for the testing, tuning and evaluation of information retrieval systems operating on European languages in both monolingual and cross-language contexts, and (ii) c...
MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 150.00 € | 500.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 300.00 € | 1000.00 € |

Special offers are also available. Check here for details.
- Arabic
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Finnish
- French
- German
- Hindi
- Italian
- Japanese
- Korean
- Modern Greek (1453-)
- Norwegian
- Persian
- Polish
- Portuguese
- Russian
- Spanish; Castilian
- Swedish
- Thai
- Turkish
- Vietnamese
ID: ELRA-S0383
ISLRN: 398-655-047-044-5
The Collins Multilingual database covers Real Life Daily vocabulary. It is composed of a multilingual lexicon in 32 languages (the WordBank, see ELRA-T0376) and a multilingual set of sentences in 28 languages (the PhraseBank, see ELRA-T0377). This version includes the audio files corresponding t...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
3360.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
4480.00 €
|
- Arabic
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Finnish
- French
- German
- Italian
- Japanese
- Korean
- Modern Greek (1453-)
- Norwegian
- Polish
- Portuguese
- Russian
- Spanish; Castilian
- Swedish
- Thai
- Turkish
- Vietnamese
ID: ELRA-S0382
ISLRN: 309-438-781-042-2
The Collins Multilingual database covers Real Life Daily vocabulary. It is composed of a multilingual lexicon in 32 languages (the WordBank, see ELRA-T0376) and a multilingual set of sentences in 28 languages (the PhraseBank, see ELRA-T0377). This version includes the corresponding audio files c...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
3640.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5200.00 €
|
- Dutch; Flemish
ID: ELRA-W0019
ISLRN: 440-290-917-102-7
The Dutch PAROLE Distributable Corpus is a 3-million-word selection from the 20-million-word Dutch PAROLE Reference corpus. The Dutch corpus was annotated and checked according to the common core PAROLE tagset. The Dutch data were also checked for type. The Dutch PAROLE Distributable...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 270.00 € | 800.00 € |
Licence: Commercial Use - ELRA VAR | 1600.00 € | 1600.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 300.00 € | 1300.00 € |
Licence: Commercial Use - ELRA VAR | 2500.00 € | 2500.00 € |

Special offers are also available. Check here for details.
- Dutch; Flemish
ID: ELRA-S0010
ISLRN: 117-997-161-308-7
The Dutch Polyphone corpus contains telephone speech from 5050 speakers. The corpus comprises 222,075 speech files (based on 44 or, in a few cases, 43 items per speaker), all of which have been orthographically transcribed. The data were collected in 8-bit A-law digital form, directly off an ISDN tel...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 12000.00 € | 25000.00 € |
Licence: Commercial Use - ELRA VAR | 25000.00 € | 25000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 20000.00 € | 35000.00 € |
Licence: Commercial Use - ELRA VAR | 35000.00 € | 35000.00 € |
- Albanian
- Bulgarian
- Chinese
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- French
- German
- Italian
- Japanese
- Latin
- Lithuanian
- Malay (macrolanguage)
- Modern Greek (1453-)
- Norwegian
- Portuguese
- Russian
- Scottish Gaelic; Gaelic
- Serbian
- Spanish; Castilian
- Swedish
- Turkish
- Uzbek
ID: ELRA-W0004
ISLRN: 511-168-567-582-5
The European Corpus Initiative (ECI) was founded to oversee the acquisition and preparation of a large multilingual corpus, and supports existing and projected national and international efforts to carefully design, collect and publish large-scale multilingual written and spoken corpora. ECI has ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 50.00 € | 50.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 50.00 € | 50.00 € |
- Dutch; Flemish
ID: ELRA-S0020
ISLRN: 819-542-178-821-7
The 4 CD-ROMs contain over 20 hours of speech. It is a corpus of read speech material in Dutch, recorded on PCM tape under fairly good conditions. These 4 CD-ROMs contain speech from 238 speakers who read: · 2 short texts (the famous North wind text, and a longer text, "de Koning" by Godfried Bo...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 400.00 € | 1600.00 € |
Licence: Commercial Use - ELRA VAR | 1600.00 € | 1600.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 800.00 € | 3200.00 € |
Licence: Commercial Use - ELRA VAR | 3200.00 € | 3200.00 € |
- Dutch; Flemish
- English
- French
- German
ID: ELRA-S0238
ISLRN: 189-835-264-931-4
In 1996, some 75 Dutch people participated in recording a multi-purpose continuous speech database. Most of them were recruited from the TNO Human Factors Research Institute, where the recordings were made. The main part of the database consisted of Dutch sentences. However, most speakers partici...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 400.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 500.00 € |
- Danish
- Dutch; Flemish
- English
- French
- German
- Italian
- Modern Greek (1453-)
- Portuguese
- Spanish; Castilian
ID: ELRA-W0023
ISLRN: 963-635-729-341-8
The MLCC text corpus has two main components: one set to allow comparable studies to be carried out in different languages and one set as the basis for translation studies. The first set is referred to as the Polylingual Document Collection, a collection of newspaper articles from financial new...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 1600.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 3600.00 € |
- Dutch; Flemish
- English
- Finnish
- French
- German
- Italian
- Portuguese
- Russian
- Spanish; Castilian
- Swedish
ID: ELRA-E0008
ISLRN: 317-005-302-361-6
The CLEF Test Suite contains the data used for the main tracks of the CLEF campaigns carried out from 2000 to 2003: Multilingual text retrieval, Bilingual text retrieval, Monolingual text retrieval, and Domain-specific text retrieval. The CLEF Test Suite is composed of: • The multilingual docum...
MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 150.00 € | 500.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 300.00 € | 1000.00 € |

Special offers are also available. Check here for details.