39 Language Resources (Page 2 of 2)

« Previous | Next » Order by:

 Macedonian Morphological Lexicon (MACPLEX)    
  • Macedonian

ID: ELRA-L0084

ISLRN: 580-487-347-384-8

MACPLEX comprises two dictionaries: a dictionary of lemmas (89,026 entries) and a dictionary of word forms (1,480,201 entries). Morphological information (PoS, gender, case, definiteness, number for nouns, tense, person, etc. for verbs) is available for each entry. Out of the 1,480,201 word forms...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2000.00 € submit
3000.00 € submit
Licence: Commercial Use - ELRA VAR
8000.00 € submit
8000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2500.00 € submit
4000.00 € submit
Licence: Commercial Use - ELRA VAR
10000.00 € submit
10000.00 € submit

This resource is also available in a bundle. Check here for bundled pricing.

 MULTEXT Lexicons    
  • English
  • French
  • German
  • Italian
  • Spanish; Castilian

ID: ELRA-L0010

ISLRN: 346-384-408-181-3

This CD-ROM contains a set of lexicons developed in the MULTEXT project financed by the European Commission (LRE 62-050). The set contains the following languages: English 66,214 Word forms French 306,795 Word forms German 233,861 Word forms Italian 145,530 Word forms Spanish 510,710 Word...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
2000.00 € submit
Licence: Commercial Use - ELRA VAR
2000.00 € submit
2000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
5000.00 € submit
Licence: Commercial Use - ELRA VAR
5000.00 € submit
5000.00 € submit
 NE3L named entities Arabic corpus    
  • Arabic

ID: ELRA-W0078

ISLRN: 398-979-151-557-0

The NE3L project (Named Entities 3 Languages) consisted in annotating several corpora with different languages with named entities. Text format data were extracted from newspapers and deal with various topics. 3 different languages were annotated: Arabic, Chinese and Russian. For this project, 5...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5000.00 € submit
5000.00 € submit
Licence: Commercial Use - ELRA VAR
5000.00 € submit
5000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5000.00 € submit
5000.00 € submit
Licence: Commercial Use - ELRA VAR
5000.00 € submit
5000.00 € submit
 NE3L named entities Chinese corpus    
  • Chinese

ID: ELRA-W0079

ISLRN: 187-154-782-686-9

The NE3L project (Named Entities 3 Languages) consisted in annotating several corpora with different languages with named entities. Text format data were extracted from newspapers and deal with various topics. 3 different languages were annotated: Arabic, Chinese and Russian. For this project, 5...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5000.00 € submit
5000.00 € submit
Licence: Commercial Use - ELRA VAR
5000.00 € submit
5000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5000.00 € submit
5000.00 € submit
Licence: Commercial Use - ELRA VAR
5000.00 € submit
5000.00 € submit
 NE3L named entities Russian corpus    
  • Russian

ID: ELRA-W0080

ISLRN: 024-620-556-146-2

The NE3L project (Named Entities 3 Languages) consisted in annotating several corpora with different languages with named entities. Text format data were extracted from newspapers and deal with various topics. 3 different languages were annotated: Arabic, Chinese and Russian. For this project, 5...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5000.00 € submit
5000.00 € submit
Licence: Commercial Use - ELRA VAR
5000.00 € submit
5000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5000.00 € submit
5000.00 € submit
Licence: Commercial Use - ELRA VAR
5000.00 € submit
5000.00 € submit
 NUM 5M Mongolian written corpus    
  • Mongolian

ID: ELRA-W0120

ISLRN: 492-817-146-504-9

This is a corpus of Mongolian text mostly from domains like online or printed daily newspapers, literature, and laws. The collected raw texts was reduced from 5 to 4.8 million words after cleaning. The cleaned corpus comprises: - 144 texts from laws until 2009, - 288 texts from literature t...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
5000.00 € submit
Licence: Commercial Use - ELRA VAR
5000.00 € submit
5000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
7000.00 € submit
Licence: Commercial Use - ELRA VAR
7000.00 € submit
7000.00 € submit
 ONOMASTICA-COPERNICUS DATABASE      
  • Czech
  • Estonian
  • Latvian
  • Polish
  • Slovak
  • Slovenian
  • Ukrainian

ID: ELRA-S0043

ISLRN: 246-224-540-110-4

The ONOMASTICA project was a European-wide research initiative within the scope of the Linguistic Research and Engineering Programme, the aim of which was the construction of a multi-language pronunciation lexicon of proper names. That project covered eleven European languages: Danish, Dutch, Eng...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
400.00 € submit
3000.00 € submit
Licence: Commercial Use - ELRA VAR
3000.00 € submit
3000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
800.00 € submit
6000.00 € submit
Licence: Commercial Use - ELRA VAR
6000.00 € submit
6000.00 € submit
 PAROLE-SIMPLE-CLIPS PISA Italian Lexicon – Full lexicon    
  • Italian

ID: ELRA-L0072-01

ISLRN: 388-991-977-669-9

This lexicon is subdivided into five different subsets: L0072-01 Full lexicon L0072-02 Phonetic layer L0072-03 Morphological layer L0072-04 Syntactic layer L0072-05 Semantic layer PAROLE-SIMPLE-CLIPS is a four-level, general purpose lexicon that has been elaborated over three different projects....

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1500.00 € submit
12000.00 € submit
Licence: Commercial Use - ELRA VAR
12000.00 € submit
12000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2000.00 € submit
15600.00 € submit
Licence: Commercial Use - ELRA VAR
15600.00 € submit
15600.00 € submit
 PAROLE-SIMPLE-CLIPS PISA Italian Lexicon – Morphological layer    
  • Italian

ID: ELRA-L0072-03

ISLRN: 969-348-626-482-1

This lexicon is subdivided into five different subsets: L0072-01 Full lexicon L0072-02 Phonetic layer L0072-03 Morphological layer L0072-04 Syntactic layer L0072-05 Semantic layer PAROLE-SIMPLE-CLIPS is a four-level, general purpose lexicon that has been elaborated over three different projects....

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
375.00 € submit
3000.00 € submit
Licence: Commercial Use - ELRA VAR
3000.00 € submit
3000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
500.00 € submit
3900.00 € submit
Licence: Commercial Use - ELRA VAR
3900.00 € submit
3900.00 € submit
 PAROLE-SIMPLE-CLIPS PISA Italian Lexicon – Phonetic layer    
  • Italian

ID: ELRA-L0072-02

ISLRN: 180-476-916-198-4

This lexicon is subdivided into five different subsets: L0072-01 Full lexicon L0072-02 Phonetic layer L0072-03 Morphological layer L0072-04 Syntactic layer L0072-05 Semantic layer PAROLE-SIMPLE-CLIPS is a four-level, general purpose lexicon that has been elaborated over three different projects....

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
600.00 € submit
4800.00 € submit
Licence: Commercial Use - ELRA VAR
4800.00 € submit
4800.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
800.00 € submit
6200.00 € submit
Licence: Commercial Use - ELRA VAR
6200.00 € submit
6200.00 € submit
 PAROLE-SIMPLE-CLIPS PISA Italian Lexicon – Semantic layer    
  • Italian

ID: ELRA-L0072-05

ISLRN: 035-041-279-714-6

This lexicon is subdivided into five different subsets: L0072-01 Full lexicon L0072-02 Phonetic layer L0072-03 Morphological layer L0072-04 Syntactic layer L0072-05 Semantic layer PAROLE-SIMPLE-CLIPS is a four-level, general purpose lexicon that has been elaborated over three different projects....

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
150.00 € submit
1200.00 € submit
Licence: Commercial Use - ELRA VAR
1200.00 € submit
1200.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
200.00 € submit
1600.00 € submit
Licence: Commercial Use - ELRA VAR
1600.00 € submit
1600.00 € submit
 PAROLE-SIMPLE-CLIPS PISA Italian Lexicon – Syntactic layer    
  • Italian

ID: ELRA-L0072-04

ISLRN: 737-815-447-915-0

This lexicon is subdivided into five different subsets: L0072-01 Full lexicon L0072-02 Phonetic layer L0072-03 Morphological layer L0072-04 Syntactic layer L0072-05 Semantic layer PAROLE-SIMPLE-CLIPS is a four-level, general purpose lexicon that has been elaborated over three different projects....

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
375.00 € submit
3000.00 € submit
Licence: Commercial Use - ELRA VAR
3000.00 € submit
3000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
500.00 € submit
3900.00 € submit
Licence: Commercial Use - ELRA VAR
3900.00 € submit
3900.00 € submit
 PTPARL Corpus    
  • Portuguese

ID: ELRA-W0060

ISLRN: 294-303-577-819-2

The PTPARL Corpus contains 1,076 texts consisting of adapted transcriptions of the Portuguese Parliament sessions. The corpus contains 1,000,441 tokens. The corpus is delivered in one file, in two different formats. The txt version has one sentence per line, an identification number for each ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
0.00 € submit
Licence: Commercial Use - ELRA VAR
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
0.00 € submit
Licence: Commercial Use - ELRA VAR
0.00 € submit
0.00 € submit
 Spanish gilcUB-M Dictionary    
  • Spanish; Castilian

ID: ELRA-L0012

ISLRN: 901-988-016-499-7

General vocabulary Entries: 60000 Format: ASCII format with ISO 8859-1 character set. Available versions include atribute-value pairs and tag-style encoding. The Spanish gilcUB-M-Dictionary is a full form lexicon derived from 60,000 lemmas of general vocabulary (9,700 verbs, 35,500 nouns, 14,300...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
6500.00 € submit
8250.00 € submit
Licence: Commercial Use - ELRA VAR
8250.00 € submit
8250.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
8225.00 € submit
10300.00 € submit
Licence: Commercial Use - ELRA VAR
10300.00 € submit
10300.00 € submit
 Tham Khasi annotated corpus    
  • Khasi

ID: ELRA-W0321

ISLRN: 926-738-235-188-8

The Tham Khasi annotated corpus is a Khasi corpus, an Austro-Asiatic language, comprising of Khasi sentences extracted from textbooks prescribed for students in secondary, higher secondary, graduation, and post-graduation in the year 2015-2016. In the corpus, each word is separated by a space and...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
0.00 € submit
Licence: Commercial Use - ELRA VAR
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
0.00 € submit
Licence: Commercial Use - ELRA VAR
0.00 € submit
0.00 € submit
 THAMUS Generic Italian Dictionary - canonical forms    
  • Italian

ID: ELRA-L0013-01

ISLRN: 273-356-234-601-1

A Generic monolingual Italian dictionary of 87,000 canonical forms. Multi-word terms contain morphological coding for the headword.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
19140.00 € submit
47850.00 € submit
Licence: Commercial Use - ELRA VAR
47850.00 € submit
47850.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
20880.00 € submit
52200.00 € submit
Licence: Commercial Use - ELRA VAR
52200.00 € submit
52200.00 € submit
 The CINTIL Corpus – International Corpus of Portuguese    
  • Portuguese

ID: ELRA-W0050

ISLRN: 176-775-844-396-0

CINTIL-Corpus Internacional do Português is a linguistically interpreted written and spoken corpus of European Portuguese. It is composed of one million annotated tokens, each one of which verified by human expert annotators. The annotation comprises information on part-of-speech, open class lemm...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
10000.00 € submit
Licence: Commercial Use - ELRA VAR
10000.00 € submit
10000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
15000.00 € submit
Licence: Commercial Use - ELRA VAR
15000.00 € submit
15000.00 € submit
 The MWN.PT - MultiWordnet of Portuguese    
  • English
  • Hebrew
  • Italian
  • Latin
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Spanish; Castilian

ID: ELRA-M0050

ISLRN: 431-556-012-743-8

MWN.PT - MultiWordnet of Portuguese (version 1) spans over 17,200 manually validated concepts/synsets, linked under the semantic relations of hyponymy and hypernymy. These concepts are made of over 21,000 word senses/word forms and 16,000 lemmas from both European and American variants of Portugu...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
100.00 € submit
1000.00 € submit
Licence: Commercial Use - ELRA VAR
1500.00 € submit
1500.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
200.00 € submit
2000.00 € submit
Licence: Commercial Use - ELRA VAR
3000.00 € submit
3000.00 € submit
 Venice Italian Treebank (VIT)    
  • Italian

ID: ELRA-W0040

ISLRN: 008-645-281-539-8

The VIT, Venice Italian Treebank is the effort of the collaboration of people working at the Laboratory of Computational Linguistics of the University of Venice in the years 1995-2005. It is partly the result of annotation carried out internally with no specific project in mind and no financial s...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
3000.00 € submit
7000.00 € submit
Licence: Commercial Use - ELRA VAR
7000.00 € submit
7000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
4000.00 € submit
10000.00 € submit
Licence: Commercial Use - ELRA VAR
10000.00 € submit
10000.00 € submit

« Previous | Next »