Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Tool/Service: | |
Language Description: |
Media Type:
Text: | |
Audio: | |
Image: | |
Video: | |
Text Numerical: | |
Text N-Gram: |
39 Language Resources (Page 2 of 2)
« Previous | Next » Order by:
- Macedonian
ID: ELRA-L0084
ISLRN: 580-487-347-384-8MACPLEX comprises two dictionaries: a dictionary of lemmas (89,026 entries) and a dictionary of word forms (1,480,201 entries). Morphological information (PoS, gender, case, definiteness, number for nouns, tense, person, etc. for verbs) is available for each entry. Out of the 1,480,201 word forms...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
2000.00 €
|
3000.00 €
|
Licence: Commercial Use - ELRA VAR |
8000.00 €
|
8000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
2500.00 €
|
4000.00 €
|
Licence: Commercial Use - ELRA VAR |
10000.00 €
|
10000.00 €
|
This resource is also available in a bundle. Check here for bundled pricing.
- English
- French
- German
- Italian
- Spanish; Castilian
ID: ELRA-L0010
ISLRN: 346-384-408-181-3This CD-ROM contains a set of lexicons developed in the MULTEXT project financed by the European Commission (LRE 62-050). The set contains the following languages: English 66,214 Word forms French 306,795 Word forms German 233,861 Word forms Italian 145,530 Word forms Spanish 510,710 Word...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
2000.00 €
|
Licence: Commercial Use - ELRA VAR |
2000.00 €
|
2000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
5000.00 €
|
Licence: Commercial Use - ELRA VAR |
5000.00 €
|
5000.00 €
|
- Arabic
ID: ELRA-W0078
ISLRN: 398-979-151-557-0The NE3L project (Named Entities 3 Languages) consisted in annotating several corpora with different languages with named entities. Text format data were extracted from newspapers and deal with various topics. 3 different languages were annotated: Arabic, Chinese and Russian. For this project, 5...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5000.00 €
|
5000.00 €
|
Licence: Commercial Use - ELRA VAR |
5000.00 €
|
5000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5000.00 €
|
5000.00 €
|
Licence: Commercial Use - ELRA VAR |
5000.00 €
|
5000.00 €
|
- Chinese
ID: ELRA-W0079
ISLRN: 187-154-782-686-9The NE3L project (Named Entities 3 Languages) consisted in annotating several corpora with different languages with named entities. Text format data were extracted from newspapers and deal with various topics. 3 different languages were annotated: Arabic, Chinese and Russian. For this project, 5...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5000.00 €
|
5000.00 €
|
Licence: Commercial Use - ELRA VAR |
5000.00 €
|
5000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5000.00 €
|
5000.00 €
|
Licence: Commercial Use - ELRA VAR |
5000.00 €
|
5000.00 €
|
- Russian
ID: ELRA-W0080
ISLRN: 024-620-556-146-2The NE3L project (Named Entities 3 Languages) consisted in annotating several corpora with different languages with named entities. Text format data were extracted from newspapers and deal with various topics. 3 different languages were annotated: Arabic, Chinese and Russian. For this project, 5...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5000.00 €
|
5000.00 €
|
Licence: Commercial Use - ELRA VAR |
5000.00 €
|
5000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5000.00 €
|
5000.00 €
|
Licence: Commercial Use - ELRA VAR |
5000.00 €
|
5000.00 €
|
- Mongolian
ID: ELRA-W0120
ISLRN: 492-817-146-504-9This is a corpus of Mongolian text mostly from domains like online or printed daily newspapers, literature, and laws. The collected raw texts was reduced from 5 to 4.8 million words after cleaning. The cleaned corpus comprises: - 144 texts from laws until 2009, - 288 texts from literature t...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
5000.00 €
|
Licence: Commercial Use - ELRA VAR |
5000.00 €
|
5000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
7000.00 €
|
Licence: Commercial Use - ELRA VAR |
7000.00 €
|
7000.00 €
|
- Czech
- Estonian
- Latvian
- Polish
- Slovak
- Slovenian
- Ukrainian
ID: ELRA-S0043
ISLRN: 246-224-540-110-4The ONOMASTICA project was a European-wide research initiative within the scope of the Linguistic Research and Engineering Programme, the aim of which was the construction of a multi-language pronunciation lexicon of proper names. That project covered eleven European languages: Danish, Dutch, Eng...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
400.00 €
|
3000.00 €
|
Licence: Commercial Use - ELRA VAR |
3000.00 €
|
3000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
800.00 €
|
6000.00 €
|
Licence: Commercial Use - ELRA VAR |
6000.00 €
|
6000.00 €
|
- Italian
ID: ELRA-L0072-01
ISLRN: 388-991-977-669-9This lexicon is subdivided into five different subsets: L0072-01 Full lexicon L0072-02 Phonetic layer L0072-03 Morphological layer L0072-04 Syntactic layer L0072-05 Semantic layer PAROLE-SIMPLE-CLIPS is a four-level, general purpose lexicon that has been elaborated over three different projects....
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
1500.00 €
|
12000.00 €
|
Licence: Commercial Use - ELRA VAR |
12000.00 €
|
12000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
2000.00 €
|
15600.00 €
|
Licence: Commercial Use - ELRA VAR |
15600.00 €
|
15600.00 €
|
- Italian
ID: ELRA-L0072-03
ISLRN: 969-348-626-482-1This lexicon is subdivided into five different subsets: L0072-01 Full lexicon L0072-02 Phonetic layer L0072-03 Morphological layer L0072-04 Syntactic layer L0072-05 Semantic layer PAROLE-SIMPLE-CLIPS is a four-level, general purpose lexicon that has been elaborated over three different projects....
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
375.00 €
|
3000.00 €
|
Licence: Commercial Use - ELRA VAR |
3000.00 €
|
3000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
500.00 €
|
3900.00 €
|
Licence: Commercial Use - ELRA VAR |
3900.00 €
|
3900.00 €
|
- Italian
ID: ELRA-L0072-02
ISLRN: 180-476-916-198-4This lexicon is subdivided into five different subsets: L0072-01 Full lexicon L0072-02 Phonetic layer L0072-03 Morphological layer L0072-04 Syntactic layer L0072-05 Semantic layer PAROLE-SIMPLE-CLIPS is a four-level, general purpose lexicon that has been elaborated over three different projects....
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
600.00 €
|
4800.00 €
|
Licence: Commercial Use - ELRA VAR |
4800.00 €
|
4800.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
800.00 €
|
6200.00 €
|
Licence: Commercial Use - ELRA VAR |
6200.00 €
|
6200.00 €
|
- Italian
ID: ELRA-L0072-05
ISLRN: 035-041-279-714-6This lexicon is subdivided into five different subsets: L0072-01 Full lexicon L0072-02 Phonetic layer L0072-03 Morphological layer L0072-04 Syntactic layer L0072-05 Semantic layer PAROLE-SIMPLE-CLIPS is a four-level, general purpose lexicon that has been elaborated over three different projects....
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
150.00 €
|
1200.00 €
|
Licence: Commercial Use - ELRA VAR |
1200.00 €
|
1200.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
200.00 €
|
1600.00 €
|
Licence: Commercial Use - ELRA VAR |
1600.00 €
|
1600.00 €
|
- Italian
ID: ELRA-L0072-04
ISLRN: 737-815-447-915-0This lexicon is subdivided into five different subsets: L0072-01 Full lexicon L0072-02 Phonetic layer L0072-03 Morphological layer L0072-04 Syntactic layer L0072-05 Semantic layer PAROLE-SIMPLE-CLIPS is a four-level, general purpose lexicon that has been elaborated over three different projects....
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
375.00 €
|
3000.00 €
|
Licence: Commercial Use - ELRA VAR |
3000.00 €
|
3000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
500.00 €
|
3900.00 €
|
Licence: Commercial Use - ELRA VAR |
3900.00 €
|
3900.00 €
|
- Portuguese
ID: ELRA-W0060
ISLRN: 294-303-577-819-2The PTPARL Corpus contains 1,076 texts consisting of adapted transcriptions of the Portuguese Parliament sessions. The corpus contains 1,000,441 tokens. The corpus is delivered in one file, in two different formats. The txt version has one sentence per line, an identification number for each ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
0.00 €
|
Licence: Commercial Use - ELRA VAR |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
0.00 €
|
Licence: Commercial Use - ELRA VAR |
0.00 €
|
0.00 €
|
- Spanish; Castilian
ID: ELRA-L0012
ISLRN: 901-988-016-499-7General vocabulary Entries: 60000 Format: ASCII format with ISO 8859-1 character set. Available versions include atribute-value pairs and tag-style encoding. The Spanish gilcUB-M-Dictionary is a full form lexicon derived from 60,000 lemmas of general vocabulary (9,700 verbs, 35,500 nouns, 14,300...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
6500.00 €
|
8250.00 €
|
Licence: Commercial Use - ELRA VAR |
8250.00 €
|
8250.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
8225.00 €
|
10300.00 €
|
Licence: Commercial Use - ELRA VAR |
10300.00 €
|
10300.00 €
|
- Khasi
ID: ELRA-W0321
ISLRN: 926-738-235-188-8The Tham Khasi annotated corpus is a Khasi corpus, an Austro-Asiatic language, comprising of Khasi sentences extracted from textbooks prescribed for students in secondary, higher secondary, graduation, and post-graduation in the year 2015-2016. In the corpus, each word is separated by a space and...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
0.00 €
|
Licence: Commercial Use - ELRA VAR |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
0.00 €
|
Licence: Commercial Use - ELRA VAR |
0.00 €
|
0.00 €
|
- Italian
ID: ELRA-L0013-01
ISLRN: 273-356-234-601-1A Generic monolingual Italian dictionary of 87,000 canonical forms. Multi-word terms contain morphological coding for the headword.
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
19140.00 €
|
47850.00 €
|
Licence: Commercial Use - ELRA VAR |
47850.00 €
|
47850.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
20880.00 €
|
52200.00 €
|
Licence: Commercial Use - ELRA VAR |
52200.00 €
|
52200.00 €
|
- Portuguese
ID: ELRA-W0050
ISLRN: 176-775-844-396-0CINTIL-Corpus Internacional do Português is a linguistically interpreted written and spoken corpus of European Portuguese. It is composed of one million annotated tokens, each one of which verified by human expert annotators. The annotation comprises information on part-of-speech, open class lemm...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
10000.00 €
|
Licence: Commercial Use - ELRA VAR |
10000.00 €
|
10000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
15000.00 €
|
Licence: Commercial Use - ELRA VAR |
15000.00 €
|
15000.00 €
|
- English
- Hebrew
- Italian
- Latin
- Portuguese
- Romanian; Moldavian; Moldovan
- Spanish; Castilian
ID: ELRA-M0050
ISLRN: 431-556-012-743-8MWN.PT - MultiWordnet of Portuguese (version 1) spans over 17,200 manually validated concepts/synsets, linked under the semantic relations of hyponymy and hypernymy. These concepts are made of over 21,000 word senses/word forms and 16,000 lemmas from both European and American variants of Portugu...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
100.00 €
|
1000.00 €
|
Licence: Commercial Use - ELRA VAR |
1500.00 €
|
1500.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
200.00 €
|
2000.00 €
|
Licence: Commercial Use - ELRA VAR |
3000.00 €
|
3000.00 €
|
- Italian
ID: ELRA-W0040
ISLRN: 008-645-281-539-8The VIT, Venice Italian Treebank is the effort of the collaboration of people working at the Laboratory of Computational Linguistics of the University of Venice in the years 1995-2005. It is partly the result of annotation carried out internally with no specific project in mind and no financial s...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
3000.00 €
|
7000.00 €
|
Licence: Commercial Use - ELRA VAR |
7000.00 €
|
7000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
4000.00 €
|
10000.00 €
|
Licence: Commercial Use - ELRA VAR |
10000.00 €
|
10000.00 €
|
« Previous | Next »