12 Language Resources

Order by:

 A Bilingual English-Ukrainian Lexicon of Named Entities Extracted from Wikipedia    
  • English
  • Ukrainian

ID: ELRA-M0104

ISLRN: 110-617-195-245-4

The bilingual English-Ukrainian lexicon of named entities uses Wikipedia metadata as a source. The extracted named entity pairs are classified into five classes: PERSON, ORGANIZATION, LOCATION, PRODUCT, and MISC (miscellaneous). The lexicon consists of 624,168 pairs and comes in two formats: csv ...

MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
0.00 € submit
0.00 € submit
 CALEM (Comprehensive Arabic LEMmas)    
  • Arabic

ID: ELRA-L0133

ISLRN: 462-532-124-988-8

Comprehensive Arabic LEMmas is a lexicon covering a large list of Arabic lemmas and their corresponding inflected word forms (stems) with details (POS + Root). Each lexical entry represents a lemma followed by all its possible stems and each stem is enriched by its morphological features, especia...

MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use, No Derivatives - CC-BY-NC-ND
0.00 € submit
0.00 € submit
Licence: Commercial Use - ELRA VAR
5000.00 € submit
5000.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use, No Derivatives - CC-BY-NC-ND
0.00 € submit
0.00 € submit
Licence: Commercial Use - ELRA VAR
7500.00 € submit
7500.00 € submit
 CroaTPAS    
  • Croatian
  • English

ID: ELRA-M0108

ISLRN: 649-554-159-147-9

CroaTPAS (Croatian Typed Predicate Argument Structures) is a bi-lingual lexicon in Croatian and English. It was created by manual annotation from the Croatian Web as Corpus and pattern creation using the Skema editor on the Sketch Engine platform. CroaTPAS is tailor-made to represent verb polysem...

MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
0.00 € submit
0.00 € submit
 DA-EN Danish Ministry of Higher Education and Science 2 (Processed)    
  • Danish
  • English

ID: ELRA-W0157

ISLRN: 026-863-463-067-1

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Parallel texts Danish-English from the Danish Ministry o...

MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
0.00 € submit
0.00 € submit
 DA-EN Danish Ministry of Higher Education and Science 3 (Processed)    
  • Danish
  • English

ID: ELRA-W0155

ISLRN: 625-397-811-990-4

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Parallel texts Danish-English from the Danish Ministry o...

MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
0.00 € submit
0.00 € submit
 DA-EN Danish Ministry of Higher Education and Science 4 (Processed)    
  • Danish
  • English

ID: ELRA-W0172

ISLRN: 560-401-490-272-1

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Parallel texts Danish-English from the Danish Ministry o...

MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
0.00 € submit
0.00 € submit
 DA-EN Danish Ministry of Higher Education and Science (Processed)    
  • Danish
  • English

ID: ELRA-W0166

ISLRN: 222-781-852-505-9

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Parallel texts Danish-English from the Danish Ministry o...

MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
0.00 € submit
0.00 € submit
 ECPC Corpus (European Comparable and Parallel Corpora of Parliamentary Speeches Archive) – set 1    
  • English
  • Spanish; Castilian

ID: ELRA-W0128

ISLRN: 036-939-425-010-1

The European Comparable and Parallel Corpora of Parliamentary Speeches Archive (ECPC), compiled at the Universitat Jaume I (Spain), is a collection of XML metatextually tagged corpora containing speeches from three European chambers (the European Parliament, the British House of Commons, and the ...

MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use, Share Alike - CC-BY-NC-SA
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use, Share Alike - CC-BY-NC-SA
0.00 € submit
0.00 € submit
 Ema-lon Manipuri Corpus (including word embedding and language model)    
  • English
  • Manipuri

ID: ELRA-W0316

ISLRN: 588-170-827-016-7

The Ema-lon Manipuri Corpus consists of a set of resources for Manipuri language (locally known as Meiteilon) for the purpose of machine translation. The main source for these resources is the Sangai Express news website. The resources that constitute the present corpus are listed below: 1. EM C...

MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
0.00 € submit
0.00 € submit
 MADED (Moroccan Arabic Dialect Electronic Dictionary)    
  • Arabic

ID: ELRA-L0134

ISLRN: 977-057-254-691-5

Moroccan Arabic Dialect Electronic Dictionary (MADED) is an electronic lexicon containing almost 11,500 entries. They are written in Arabic script wherein each Modern Standard Arabic (MSA) lemma is provided with its corresponding Moroccan Arabic equivalent. In addition, MADED entries are annotate...

MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use, No Derivatives - CC-BY-NC-ND
0.00 € submit
0.00 € submit
Licence: Commercial Use - ELRA VAR
1000.00 € submit
1000.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use, No Derivatives - CC-BY-NC-ND
0.00 € submit
0.00 € submit
Licence: Commercial Use - ELRA VAR
2000.00 € submit
2000.00 € submit
 MORV (Moroccan Morphological vocabulary)    
  • Arabic

ID: ELRA-L0135

ISLRN: 064-194-729-767-0

The Moroccan Morphological vocabulary is a lexicon containing more than 4.6 M entries describing a given Moroccan Arabic word with fourteen (14) morphological and semantic features: the word orthographic form, the segmentation (prefix and suffix), part-of-speech (POS), gender, number, tense and t...

MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use, No Derivatives - CC-BY-NC-ND
0.00 € submit
0.00 € submit
Licence: Commercial Use - ELRA VAR
6000.00 € submit
6000.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use, No Derivatives - CC-BY-NC-ND
0.00 € submit
0.00 € submit
Licence: Commercial Use - ELRA VAR
12000.00 € submit
12000.00 € submit
 T-PAS    
  • Croatian
  • English

ID: ELRA-M0109

ISLRN: 432-666-503-743-8

T-PAS (Typed Predicate Argument Structures) is a digital lexicon consisting of a corpus-derived collection of Italian verb argument structures, whose arguments have been manually annotated with a set of hierarchically organised semantic labels called Semantic Types. T-PAS is primarily tailored f...

MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
0.00 € submit
0.00 € submit