Resource Type:

Corpus:
Lexical/Conceptual:
Tool/Service:
Language Description:

Media Type:

Text:
Audio:
Image:
Video:
Text Numerical:
Text N-Gram:

101 Language Resources (Page 2 of 6)

« Previous | Next »Order by:

 ECPC Corpus (European Comparable and Parallel Corpora of Parliamentary Speeches Archive) – set 1    
  • English
  • Spanish; Castilian

ID: ELRA-W0128

ISLRN: 036-939-425-010-1

The European Comparable and Parallel Corpora of Parliamentary Speeches Archive (ECPC), compiled at the Universitat Jaume I (Spain), is a collection of XML metatextually tagged corpora containing speeches from three European chambers (the European Parliament, the British House of Commons, and the ...

MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use, Share Alike - CC-BY-NC-SA
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use, Share Alike - CC-BY-NC-SA
0.00 € submit
0.00 € submit
 Ema-lon Manipuri Corpus (including word embedding and language model)    
  • English
  • Manipuri

ID: ELRA-W0316

ISLRN: 588-170-827-016-7

The Ema-lon Manipuri Corpus consists of a set of resources for Manipuri language (locally known as Meiteilon) for the purpose of machine translation. The main source for these resources is the Sangai Express news website. The resources that constitute the present corpus are listed below: 1. EM C...

MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
0.00 € submit
0.00 € submit
 Employment in Poland 2009 report in EN-PL (Processed)    
  • English
  • Polish

ID: ELRA-W0242

ISLRN: 062-316-276-801-8

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. The report "Employment in Poland 2009 – Entrepreneurship...

MEMBERacademiccommercial
Licence: Attribution, Other - Open Under-PSI
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Other - Open Under-PSI
0.00 € submit
0.00 € submit
 English-Danish Parallel corpus from Tatoeba project (Processed)    
  • Danish
  • English

ID: ELRA-W0214

ISLRN: 893-698-207-679-6

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Parallel corpus from English-Danish translations from ta...

MEMBERacademiccommercial
Licence: Attribution - CC-BY-2.0
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution - CC-BY-2.0
0.00 € submit
0.00 € submit
 English-Estonian corpus from Finnish Information Bank (Processed)    
  • English
  • Estonian

ID: ELRA-W0218

ISLRN: 492-203-674-156-9

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. http://www.infopankki.fi - Finland in your language - In...

MEMBERacademiccommercial
Licence: Attribution - CC-BY-4.0
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution - CC-BY-4.0
0.00 € submit
0.00 € submit
 English-Finnish corpus from Finnish Information Bank (Processed)    
  • English
  • Finnish

ID: ELRA-W0217

ISLRN: 894-719-306-863-7

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. http://www.infopankki.fi - Finland in your language - In...

MEMBERacademiccommercial
Licence: Attribution - CC-BY-4.0
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution - CC-BY-4.0
0.00 € submit
0.00 € submit
 English-Icelandic parallel corpus from Statistics Iceland (Processed)    
  • English
  • Icelandic

ID: ELRA-W0219

ISLRN: 968-796-585-795-9

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. English-Icelandic parallel corpus compiled from parallel...

MEMBERacademiccommercial
Licence: Attribution, Other - Open Under-PSI
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Other - Open Under-PSI
0.00 € submit
0.00 € submit
 English-Norwegian parallel corpus from Forbruker Europa, 2017 release (Processed)    
  • English
  • Norwegian Bokmål

ID: ELRA-W0195

ISLRN: 153-210-190-637-8

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Forbruker Europa is the Norwegian office of the European...

MEMBERacademiccommercial
Licence: Attribution - CC-BY-4.0
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution - CC-BY-4.0
0.00 € submit
0.00 € submit
 English-Slovak parallel corpus of texts from The Ministry of Culture of the Slovak Republic (Processed)    
  • English
  • Slovak

ID: ELRA-W0188

ISLRN: 632-640-184-652-7

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Dataset of various English-Slovak legal texts within age...

MEMBERacademiccommercial
Licence: Attribution, Other - Public Domain
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Other - Public Domain
0.00 € submit
0.00 € submit
 English-Swedish corpus from Finnish Information Bank (Processed)    
  • English
  • Swedish

ID: ELRA-W0222

ISLRN: 800-702-006-351-7

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. http://www.infopankki.fi - Finland in your language - In...

MEMBERacademiccommercial
Licence: Attribution - CC-BY-4.0
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution - CC-BY-4.0
0.00 € submit
0.00 € submit
 English-Swedish parallel corpus from the translation of 'Sweden a Pocket Guide' book (Processed)    
  • English
  • Swedish

ID: ELRA-W0130

ISLRN: 790-580-207-032-9

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. A guide for foreigners who move to Sweden. Source langua...

MEMBERacademiccommercial
Licence: Attribution - CC-BY-2.5-SE
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution - CC-BY-2.5-SE
0.00 € submit
0.00 € submit
 General Romanian-English bilingual corpus (Processed)    
  • English
  • Romanian; Moldavian; Moldovan

ID: ELRA-W0193

ISLRN: 206-680-247-212-6

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Romanian – English corpus built from a Wikipedia dump.

MEMBERacademiccommercial
Licence: Attribution, Share Alike - CC-BY-SA-3.0
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Share Alike - CC-BY-SA-3.0
0.00 € submit
0.00 € submit
 German Political Speeches Corpus    
  • German

ID: ELRA-W0330

ISLRN: 381-445-879-769-5

This corpus consists of a collection of political speeches in German crawled from the online archive of the German presidency (Bundespraësident) and the Chancellery (Bundesregierung). For the German Presidency the speeches are available from July 1, 1984 to February 17, 2012 and the corpus con...

MEMBERacademiccommercial
Licence: Attribution, Share Alike - CC-BY-SA
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Share Alike - CC-BY-SA
0.00 € submit
0.00 € submit
 Glissando-ca    
  • Catalan; Valencian

ID: ELRA-S0407

ISLRN: 780-617-066-913-1

Glissando-ca includes more than 12 hours of speech in Catalan, recorded under optimal acoustic conditions, orthographically transcribed, phonetically aligned and annotated with prosodic information (location of the stressed syllables and prosodic phrasing). The corpus was recorded by 8 profession...

MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use, Share Alike - CC-BY-NC-SA
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use, Share Alike - CC-BY-NC-SA
0.00 € submit
0.00 € submit
 Glissando-sp    
  • Spanish; Castilian

ID: ELRA-S0406

ISLRN: 024-286-962-247-6

Glissando-sp includes more than 12 hours of speech in Spanish, recorded under optimal acoustic conditions, orthographically transcribed, phonetically aligned and annotated with prosodic information (location of the stressed syllables and prosodic phrasing). The corpus was recorded by 8 profession...

MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use, Share Alike - CC-BY-NC-SA
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use, Share Alike - CC-BY-NC-SA
0.00 € submit
0.00 € submit
 Greek anti-corruption legislation and National Anti-Corruption Plan (greek-english) (Processed)    
  • English
  • Modern Greek (1453-)

ID: ELRA-W0164

ISLRN: 919-659-714-668-6

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Greek laws, ratification of International Conventions ag...

MEMBERacademiccommercial
Licence: Attribution - CC-BY-4.0
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution - CC-BY-4.0
0.00 € submit
0.00 € submit
 Greek-English parallel corpus from the website of the Prime Minister of the Hellenic Republic (Processed)    
  • English
  • Modern Greek (1453-)

ID: ELRA-W0272

ISLRN: 763-048-196-707-6

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Greek-English parallel corpus from the website of the Pr...

MEMBERacademiccommercial
Licence: Attribution, Other - Open Under-PSI
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Other - Open Under-PSI
0.00 € submit
0.00 € submit
 Hallituskausi 2007-2011 -- Finnish-English Translation Memory (Processed)    
  • English
  • Finnish

ID: ELRA-W0220

ISLRN: 645-363-039-955-3

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. The "Hallituskausi 2007–2011" translation memory is inte...

MEMBERacademiccommercial
Licence: Attribution - CC-BY-4.0
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution - CC-BY-4.0
0.00 € submit
0.00 € submit
 Hallituskausi 2011-2015 -- Finnish-English Translation Memory (Processed)    
  • English
  • Finnish

ID: ELRA-W0221

ISLRN: 751-465-762-980-9

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Information on the "Hallituskausi 2011–" translation mem...

MEMBERacademiccommercial
Licence: Attribution - CC-BY-4.0
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution - CC-BY-4.0
0.00 € submit
0.00 € submit
 How2Sign Dataset      
  • American Sign Language
  • English

ID: ELRA-S0416

ISLRN: 583-408-694-292-6

The How2Sign dataset consists of a parallel corpus of speech and transcriptions of instructional videos and their corresponding American Sign Language (ASL) translation videos and annotations. It has been produced by recording 11 persons (6 males and 5 females) with various hearing status (5 self...

MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
NON MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0

« Previous | Next »