Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Tool/Service: | |
Language Description: |
Media Type:
Text: | |
Audio: | |
Image: | |
Video: | |
Text Numerical: | |
Text N-Gram: |
73 Language Resources (Page 1 of 4)
« Previous | Next »Order by:
- Bulgarian
- Danish
- Dutch; Flemish
- German
- Japanese
- Portuguese
- Slovenian
- Spanish; Castilian
- Swedish
- Turkish
ID: ELRA-W0086
ISLRN: 578-227-532-044-02006 CoNLL Shared Task - Ten Languages consists of dependency treebanks in ten languages used as part of the CoNLL 2006 shared task on multi-lingual dependency parsing. The languages covered in this release are: Bulgarian, Danish, Dutch, German, Japanese, Portuguese, Slovene, Spanish, Swedish and...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
0.00 €
|
- Portuguese
ID: ELRA-S0490-15
ISLRN: 839-027-842-220-5Human voice recordings of single-word lemmas and multiword expressions, besides IPA (International Phonetic Alphabet) and alternative scripts (Japanese – Romaji/Kanji/Hiragana; Chinese – Pinyin; Arabic and Hebrew – w/out diacritics), distributed as distinct sets (from ELRA-S0490-01 to ELRA-S0490-...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
925.00 €
|
925.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
971.25 €
|
971.25 €
|
Special offers are also available. Check here for details.
- Portuguese
ID: ELRA-S0490-16
ISLRN: 707-840-365-826-0Human voice recordings of single-word lemmas and multiword expressions, besides IPA (International Phonetic Alphabet) and alternative scripts (Japanese – Romaji/Kanji/Hiragana; Chinese – Pinyin; Arabic and Hebrew – w/out diacritics), distributed as distinct sets (from ELRA-S0490-01 to ELRA-S0490-...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
767.60 €
|
767.60 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
805.98 €
|
805.98 €
|
Special offers are also available. Check here for details.
- Portuguese
ID: ELRA-L0160
ISLRN: 318-935-224-974-8As a complement to the generic vocabulary provided in ELRA-L0148, language variants of Portuguese are provided with the following features: Tense, Mood, Person, Number, Gender, Pronominal Clitics. - Portuguese (Portugal): 51,000 lemmas / 3,780,000 forms - Portuguese (Brazil): 36,000 lemmas / ...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
88000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
88000.00 €
|
- Portuguese
ID: ELRA-L0148
ISLRN: 984-017-567-921-2The series of Bitext Lexical Datasets includes Lemmas, POS tagging, Frequency, Named Entities and Offensive features. Depending on the dataset and language, other syntactic and morphological features are also provided. The Bitext Lexical Dataset - Portuguese consists of 40,000 lemmas (3,500,000 f...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
67000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
67000.00 €
|
- Portuguese
ID: ELRA-L0034
ISLRN: 654-505-941-943-8BrasiLEX is a multifunctional monolingual lexicon of the Brazilian variety of Portuguese, developed by the Natural Language Group of INESC. It has about 65,000 entries (lemmas) and 1,600 correspondent inflexion paradigms. The set of entries includes compound words and the inflexion paradigms incl...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
3000.00 €
|
25000.00 €
|
Licence: Commercial Use - ELRA VAR |
25000.00 €
|
25000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5000.00 €
|
30000.00 €
|
Licence: Commercial Use - ELRA VAR |
30000.00 €
|
30000.00 €
|
This resource is also available in a bundle. Check here for bundled pricing.
- Portuguese
ID: ELRA-S0445
ISLRN: 767-329-448-534-2The data volumn is 1044 hours and is recorded by 2038 Brazilian native speakers. The recording text is designed by linguistic experts, which covers general interactive, in-car and home category. The texts are manually proofread with high accuracy. Recording devices are mainstream Android phones a...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
247950.00 €
|
247950.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
247950.00 €
|
247950.00 €
|
Special offers are also available. Check here for details.
- Portuguese
ID: ELRA-S0228-74
ISLRN: 403-396-918-176-7This corpus comprises 99,804 entries uttered by 50 speakers (25 males and 25 females), recorded over 4 channels (desktop in quiet office/home). Speech samples are stored as a sequence of 16-bit 44.1kHz for a total of 37.3 hours of speech per channel.
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5400.00 €
|
5400.00 €
|
Licence: Commercial Use - ELRA VAR |
5400.00 €
|
5400.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5400.00 €
|
5400.00 €
|
Licence: Commercial Use - ELRA VAR |
5400.00 €
|
5400.00 €
|
- Portuguese
ID: ELRA-L0094
ISLRN: 408-817-203-152-3CEPLEXicon is a lexicon based on two different corpora of child speech – Santos corpus (Santos, 2006, Santos et al., 2014, see http://www.clul.ul.pt/resources/546?lang=en) and Freitas corpus (Freitas, 1997, Freitas et al. 2012). This lexicon results from the automatic tagging of the two corpora, ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
0.00 €
|
Licence: Commercial Use - ELRA VAR |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
0.00 €
|
Licence: Commercial Use - ELRA VAR |
0.00 €
|
0.00 €
|
- Portuguese
ID: ELRA-W0062
ISLRN: 368-672-631-502-0The CINTIL-DeepBank (Branco et al., 2010) is a corpus of sentences annotated with their full-fledged deep grammatical representations, composed of 10,039 sentences and 110,166 tokens taken from different sources and domains: news (8,861 sentences; 101,430 tokens), and novels (399 sentences; 3,082...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
3000.00 €
|
Licence: Commercial Use - ELRA VAR |
3000.00 €
|
3000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
3000.00 €
|
Licence: Commercial Use - ELRA VAR |
3000.00 €
|
3000.00 €
|
- Portuguese
ID: ELRA-W0061
ISLRN: 133-035-138-613-6The CINTIL-DependencyBank (Silva and Branco, 2012) is a corpus of sentences annotated with their syntactic dependency graphs and grammatical function tags composed of 10,039 sentences and 110,166 tokens taken from different sources and domains: news (8,861 sentences; 101,430 tokens), novels (399 ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
3000.00 €
|
Licence: Commercial Use - ELRA VAR |
3000.00 €
|
3000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
3000.00 €
|
Licence: Commercial Use - ELRA VAR |
3000.00 €
|
3000.00 €
|
- Portuguese
ID: ELRA-W0056
ISLRN: 723-486-478-286-6The CINTIL-PropBank is a corpus of sentences annotated with their constituency structure and semantic role tags, composed of 10,039 sentences and 110,166 tokens taken from different sources and domains: news (8,861 sentences; 101,430 tokens), and novels (399 sentences; 3,082 tokens). In addition,...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
3000.00 €
|
Licence: Commercial Use - ELRA VAR |
3000.00 €
|
3000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
3000.00 €
|
Licence: Commercial Use - ELRA VAR |
3000.00 €
|
3000.00 €
|
- Portuguese
ID: ELRA-W0055
ISLRN: 411-691-515-701-9The CINTIL-TreeBank is a corpus of syntactic constituency trees of Portuguese texts composed of 10,039 sentences and 110,166 tokens taken from different sources and domains: news (8,861 sentences; 101,430 tokens), novels (399 sentences; 3,082 tokens). In addition, there are 779 sentences (5,654 t...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
3000.00 €
|
Licence: Commercial Use - ELRA VAR |
3000.00 €
|
3000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
3000.00 €
|
Licence: Commercial Use - ELRA VAR |
3000.00 €
|
3000.00 €
|
- Bulgarian
- Czech
- Dutch; Flemish
- English
- Finnish
- French
- German
- Hungarian
- Italian
- Persian
- Portuguese
- Russian
- Spanish; Castilian
- Swedish
ID: ELRA-E0036
ISLRN: 378-279-085-589-0The Cross-Language Evaluation Forum (CLEF) promotes R&D in multilingual information access (MLIA) by (i) developing an infrastructure for the testing, tuning and evaluation of information retrieval systems operating on European languages in both monolingual and cross-language contexts, and (ii) c...
MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
150.00 €
|
500.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
300.00 €
|
1000.00 €
|
Special offers are also available. Check here for details.
- Bulgarian
- Dutch; Flemish
- English
- Finnish
- French
- German
- Italian
- Portuguese
- Romanian; Moldavian; Moldovan
- Spanish; Castilian
ID: ELRA-E0038
ISLRN: 394-993-527-034-7The Cross-Language Evaluation Forum (CLEF) promotes R&D in multilingual information access (MLIA) by (i) developing an infrastructure for the testing, tuning and evaluation of information retrieval systems operating on European languages in both monolingual and cross-language contexts, and (ii) c...
MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
150.00 €
|
500.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
300.00 €
|
1000.00 €
|
Special offers are also available. Check here for details.
- Arabic
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Finnish
- French
- German
- Hindi
- Italian
- Japanese
- Korean
- Modern Greek (1453-)
- Norwegian
- Persian
- Polish
- Portuguese
- Russian
- Spanish; Castilian
- Swedish
- Thai
- Turkish
- Vietnamese
ID: ELRA-T0377
ISLRN: 452-383-219-228-0The Collins Multilingual database covers Real Life Daily vocabulary. It is composed of a multilingual lexicon in 32 languages (the WordBank, distributed separately under reference ELRA-T0376) and a multilingual set of sentences in 28 languages (the PhraseBank). The PhraseBank consists of 2,000 p...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
1680.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
2240.00 €
|
- Arabic
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Finnish
- French
- German
- Hindi
- Italian
- Japanese
- Korean
- Modern Greek (1453-)
- Norwegian
- Persian
- Polish
- Portuguese
- Russian
- Spanish; Castilian
- Swedish
- Thai
- Turkish
- Vietnamese
ID: ELRA-S0383
ISLRN: 398-655-047-044-5The Collins Multilingual database covers Real Life Daily vocabulary. It is composed of a multilingual lexicon in 32 languages (the WordBank, see ELRA-T0376) and a multilingual set of sentences in 28 languages (the PhraseBank, see ELRA-T0377). This version includes the audio files corresponding t...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
3360.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
4480.00 €
|
- Arabic
- Bengali
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Finnish
- French
- German
- Hindi
- Italian
- Japanese
- Korean
- Malayalam
- Modern Greek (1453-)
- Norwegian
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Russian
- Spanish; Castilian
- Swedish
- Tamil
- Thai
- Turkish
- Ukrainian
- Vietnamese
ID: ELRA-T0376
ISLRN: 990-814-402-335-7The Collins Multilingual database covers Real Life Daily vocabulary. It is composed of a multilingual lexicon in 32 languages (the WordBank) and a multilingual set of sentences in 28 languages (the PhraseBank, distributed separately under reference ELRA-T0377). The WordBank contains 10,000 words...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
2400.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
3600.00 €
|
- Arabic
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Finnish
- French
- German
- Italian
- Japanese
- Korean
- Modern Greek (1453-)
- Norwegian
- Polish
- Portuguese
- Russian
- Spanish; Castilian
- Swedish
- Thai
- Turkish
- Vietnamese
ID: ELRA-S0382
ISLRN: 309-438-781-042-2The Collins Multilingual database covers Real Life Daily vocabulary. It is composed of a multilingual lexicon in 32 languages (the WordBank, see ELRA-T0376) and a multilingual set of sentences in 28 languages (the PhraseBank, see ELRA-T0377). This version includes the corresponding audio files c...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
3640.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5200.00 €
|
- Portuguese
ID: ELRA-S0367
ISLRN: 499-311-025-331-2The CORAL corpus was collected in the framework of a national project sponsored by the PRAXIS XXI program, by a consortium formed by INESC, CLUL, FLUL (Faculdade de Letras da Universidade de Lisboa), and FCSH-UNL (Faculdade de Ciências Sociais e Humanas da Universidade Nova de Lisboa). The purpos...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
200.00 €
|
750.00 €
|
Licence: Commercial Use - ELRA VAR |
750.00 €
|
750.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
400.00 €
|
1500.00 €
|
Licence: Commercial Use - ELRA VAR |
1500.00 €
|
1500.00 €
|
« Previous | Next »