Search and Browse – ELRA Catalogue

Macedonian lexicon of derived adjectives (MACPLEX_ADJDERV) text

Macedonian

ID: ELRA-L0091

MACPLEX_ADJDERV contains 12,073 lemmas and 281,488 word forms (10,233 with suffix –чки, 1,840 with suffix –билен). The lexicon is available in Unicode.

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	270.00 €	400.00 €
Licence: Commercial Use - ELRA VAR	1100.00 €	1100.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	340.00 €	550.00 €
Licence: Commercial Use - ELRA VAR	1400.00 €	1400.00 €

This resource is also available in a bundle. Check here for bundled pricing.

Macedonian lexicon of participles (MACPLEX_ADJPARTIC) text

Macedonian

ID: ELRA-L0092

ISLRN: 375-087-750-051-4

MACPLEX_ADJPARTIC contains 19,552 lemmas and 1,251,328 word forms. The lemmas are derived from verbs. The lexicon is available in Unicode.

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	440.00 €	660.00 €
Licence: Commercial Use - ELRA VAR	1800.00 €	1800.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	550.00 €	900.00 €
Licence: Commercial Use - ELRA VAR	2200.00 €	2200.00 €

This resource is also available in a bundle. Check here for bundled pricing.

Macedonian lexicon of proper nouns (MACPLEX_PROPERS) text

Macedonian

ID: ELRA-L0090

ISLRN: 039-112-943-391-5

MACPLEX_PROPERS contains 15,422 lemmas and 157,321 word forms (2,516 first names, 12,322 last names, 148 other human names, 426 companies and 22 brands). Adjectives related to proper nouns are derived. The lexicon is available in Unicode.

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	350.00 €	500.00 €
Licence: Commercial Use - ELRA VAR	1400.00 €	1400.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	450.00 €	700.00 €
Licence: Commercial Use - ELRA VAR	1800.00 €	1800.00 €

This resource is also available in a bundle. Check here for bundled pricing.

Macedonian lexicon of toponyms (MACPLEX_TOPO) text

Macedonian

ID: ELRA-L0089

ISLRN: 310-855-448-682-2

MACPLEX_TOPO lexicon contains 1,398 lemmas and 40,246 word forms (787 places, 428 regions, 68 waters, 47 peoples, 45 mountains, 27 lands). New words related to toponyms (their inhabitants and related adjectives) are derived. The lexicon is available in Unicode.

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	45.00 €	50.00 €
Licence: Commercial Use - ELRA VAR	125.00 €	125.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	45.00 €	60.00 €
Licence: Commercial Use - ELRA VAR	150.00 €	150.00 €

This resource is also available in a bundle. Check here for bundled pricing.

Macedonian Morphological Lexicon (MACPLEX) text

Macedonian

ID: ELRA-L0084

ISLRN: 580-487-347-384-8

MACPLEX comprises two dictionaries: a dictionary of lemmas (89,026 entries) and a dictionary of word forms (1,480,201 entries). Morphological information (PoS, gender, case, definiteness, number for nouns, tense, person, etc. for verbs) is available for each entry. Out of the 1,480,201 word forms...

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	2000.00 €	3000.00 €
Licence: Commercial Use - ELRA VAR	8000.00 €	8000.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	2500.00 €	4000.00 €
Licence: Commercial Use - ELRA VAR	10000.00 €	10000.00 €

This resource is also available in a bundle. Check here for bundled pricing.

Modern French Corpus including Anaphors Tagging text

French

ID: ELRA-W0032

ISLRN: 488-420-763-510-8

The corpus that includes the tagging of the anaphors was created by the CRISTAL-GRESEC (Stendhal-Grenoble 3 University, France) team and XRCE (Xerox Research Centre Europe, France) in the framework of the call launched by the DGLF-LF (national institution for the French language and the languages...

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	250.00 €	250.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	1000.00 €	1000.00 €

MULTEXT Lexicons text

English
French
German
Italian
Spanish; Castilian

ID: ELRA-L0010

ISLRN: 346-384-408-181-3

This CD-ROM contains a set of lexicons developed in the MULTEXT project financed by the European Commission (LRE 62-050). The set contains the following languages: English 66,214 Word forms French 306,795 Word forms German 233,861 Word forms Italian 145,530 Word forms Spanish 510,710 Word...

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	2000.00 €
Licence: Commercial Use - ELRA VAR	2000.00 €	2000.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	5000.00 €
Licence: Commercial Use - ELRA VAR	5000.00 €	5000.00 €

NE3L named entities Arabic corpus text

Arabic

ID: ELRA-W0078

ISLRN: 398-979-151-557-0

The NE3L project (Named Entities 3 Languages) consisted in annotating several corpora with different languages with named entities. Text format data were extracted from newspapers and deal with various topics. 3 different languages were annotated: Arabic, Chinese and Russian. For this project, 5...

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	5000.00 €	5000.00 €
Licence: Commercial Use - ELRA VAR	5000.00 €	5000.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	5000.00 €	5000.00 €
Licence: Commercial Use - ELRA VAR	5000.00 €	5000.00 €

NE3L named entities Chinese corpus text

Chinese

ID: ELRA-W0079

ISLRN: 187-154-782-686-9

The NE3L project (Named Entities 3 Languages) consisted in annotating several corpora with different languages with named entities. Text format data were extracted from newspapers and deal with various topics. 3 different languages were annotated: Arabic, Chinese and Russian. For this project, 5...

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	5000.00 €	5000.00 €
Licence: Commercial Use - ELRA VAR	5000.00 €	5000.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	5000.00 €	5000.00 €
Licence: Commercial Use - ELRA VAR	5000.00 €	5000.00 €

NE3L named entities Russian corpus text

Russian

ID: ELRA-W0080

ISLRN: 024-620-556-146-2

The NE3L project (Named Entities 3 Languages) consisted in annotating several corpora with different languages with named entities. Text format data were extracted from newspapers and deal with various topics. 3 different languages were annotated: Arabic, Chinese and Russian. For this project, 5...

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	5000.00 €	5000.00 €
Licence: Commercial Use - ELRA VAR	5000.00 €	5000.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	5000.00 €	5000.00 €
Licence: Commercial Use - ELRA VAR	5000.00 €	5000.00 €

NUM 5M Mongolian written corpus text

Mongolian

ID: ELRA-W0120

ISLRN: 492-817-146-504-9

This is a corpus of Mongolian text mostly from domains like online or printed daily newspapers, literature, and laws. The collected raw texts was reduced from 5 to 4.8 million words after cleaning. The cleaned corpus comprises: - 144 texts from laws until 2009, - 288 texts from literature t...

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	5000.00 €
Licence: Commercial Use - ELRA VAR	5000.00 €	5000.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	7000.00 €
Licence: Commercial Use - ELRA VAR	7000.00 €	7000.00 €

ONOMASTICA-COPERNICUS DATABASE text

Czech
Estonian
Latvian
Polish
Slovak
Slovenian
Ukrainian

ID: ELRA-S0043

ISLRN: 246-224-540-110-4

The ONOMASTICA project was a European-wide research initiative within the scope of the Linguistic Research and Engineering Programme, the aim of which was the construction of a multi-language pronunciation lexicon of proper names. That project covered eleven European languages: Danish, Dutch, Eng...

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	400.00 €	3000.00 €
Licence: Commercial Use - ELRA VAR	3000.00 €	3000.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	800.00 €	6000.00 €
Licence: Commercial Use - ELRA VAR	6000.00 €	6000.00 €

PANACEA English-French and English-Greek parallel corpus acquired for Environment domain text

English
French
Modern Greek (1453-)

ID: ELRA-W0057

ISLRN: 870-946-931-293-7

The PANACEA English-French and English-Greek parallel corpus was acquired in the framework of the PANACEA project (Platform for Automatic, Normalized Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies), under the European Commission's Seventh Framework...

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	0.00 €

PANACEA English-French and English-Greek parallel corpus acquired for Labour Legislation domain text

English
French
Modern Greek (1453-)

ID: ELRA-W0058

ISLRN: 428-891-110-719-1

The PANACEA English-French and English-Greek parallel corpus was acquired in the framework of the PANACEA project (Platform for Automatic, Normalized Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies), under the European Commission's Seventh Framework...

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	0.00 €

PANACEA Environment English monolingual corpus text

English

ID: ELRA-W0063

ISLRN: 732-466-154-657-8

The PANACEA Environment English monolingual corpus was acquired in the framework of the PANACEA project (Platform for Automatic, Normalized Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies), under the European Commission's Seventh Framework Programme...

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	0.00 €

PANACEA Environment French monolingual corpus text

French

ID: ELRA-W0065

ISLRN: 400-316-779-360-9

The PANACEA Environment French monolingual corpus was acquired in the framework of the PANACEA project (Platform for Automatic, Normalized Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies), under the European Commission's Seventh Framework Programme....

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	0.00 €

PANACEA Environment Greek monolingual corpus text

Modern Greek (1453-)

ID: ELRA-W0067

ISLRN: 305-175-858-715-1

The PANACEA Environment Greek monolingual corpus was acquired in the framework of the PANACEA project (Platform for Automatic, Normalized Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies), under the European Commission's Seventh Framework Programme. ...

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	0.00 €

PANACEA Environment Italian monolingual corpus text

Italian

ID: ELRA-W0069

ISLRN: 843-358-936-298-5

The PANACEA Environment Italian monolingual corpus was acquired in the framework of the PANACEA project (Platform for Automatic, Normalized Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies), under the European Commission's Seventh Framework Programme...

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	0.00 €

PANACEA Environment Spanish monolingual corpus text

Spanish; Castilian

ID: ELRA-W0071

ISLRN: 154-034-915-247-9

The PANACEA Environment Spanish monolingual corpus was acquired in the framework of the PANACEA project (Platform for Automatic, Normalized Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies), under the European Commission's Seventh Framework Programme...

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	0.00 €

PANACEA Labour English monolingual corpus text

English

ID: ELRA-W0064

ISLRN: 655-029-501-158-4

The PANACEA Labour English monolingual corpus was acquired in the framework of the PANACEA project (Platform for Automatic, Normalized Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies), under the European Commission's Seventh Framework Programme. ...

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	0.00 €

Corpus:
Lexical/Conceptual:
Tool/Service:
Language Description:

Text:
Audio:
Image:
Video:
Text Numerical:
Text N-Gram:

Resource Type:

Media Type:

59 Language Resources (Page 2 of 3)