236 Language Resources (Page 1 of 12)


 ACL RD-TEC: A Reference Dataset for Terminology Extraction and Classification Research in Computational Linguistics    
  • English

ID: ELRA-T0375

ISLRN: 699-305-362-089-6

Automatic Term Recognition (ATR) is a research task that deals with the identification of domain-specific terms. Terms, in simple words, are textual realizations of significant concepts in an expertise domain. Additionally, domain-specific terms may be classified into a number of categories, in wh...

MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 0.00 €
  • commercial: 1000.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 1000.00 €
  • commercial: 1000.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 0.00 €
  • commercial: 3000.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 3000.00 €
  • commercial: 3000.00 €
 Al-Hayat Arabic Corpus    
  • Arabic

ID: ELRA-W0030

ISLRN: 365-777-769-398-7

The corpus was developed in the course of a research project at the University of Essex, in collaboration with the Open University. The corpus contains Al-Hayat newspaper articles with value added for Language Engineering and Information Retrieval applications development purposes. The data have ...

MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 480.00 €
  • commercial: 960.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 960.00 €
  • commercial: 960.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 720.00 €
  • commercial: 1440.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 1440.00 €
  • commercial: 1440.00 €
 Amaryllis Corpus - Evaluation Package    
  • French

ID: ELRA-W0029

ISLRN: 786-395-313-491-8

Launched at the end of 1995, the AMARYLLIS project aimed at evaluating information retrieval software for French text corpora in order to provide a methodology for the evaluation of other similar tools. AMARYLLIS was organised by the Institut de l'Information Scientifique et Technique (INIST) wit...

MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 45.00 €
  • commercial: 100.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 45.00 €
  • commercial: 100.00 €
 An-Nahar Newspaper Text Corpus    
  • Arabic

ID: ELRA-W0027

ISLRN: 083-457-618-309-8

The An-Nahar Lebanon Newspaper Text Corpus comprises articles in standard Arabic from 1995 to 2000 (6 years) stored as HTML files on CD-ROM media. Each year contains 45 000 articles and 24 million words. Each article includes information such as title, newspaper's name, date, country, type, page, ...

MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 2016.00 €
  • commercial: 3192.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 3024.00 €
  • commercial: 4788.00 €

Special offers are also available.

 Arabic dictionary of inflected words    
  • Arabic

ID: ELRA-L0098

ISLRN: 049-623-948-389-2

The Arabic dictionary of inflected words consists of a list of 6 million inflected forms, fully vowelized, generated in compliance with the grammatical rules of Arabic and tagged with grammatical information which includes POS and grammatical features, including number, gender, case, definiteness...

MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 3000.00 €
  • commercial: 10000.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 10000.00 €
  • commercial: 10000.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 4500.00 €
  • commercial: 15000.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 15000.00 €
  • commercial: 15000.00 €
 Arabic Morphological Dictionary    
  • Arabic

ID: ELRA-L0088

ISLRN: 472-591-121-577-5

The Arabic Morphological Dictionary contains 4,912,749 entries, including:
  • 3,374,852 nouns
  • 1,537,699 verbs
  • 198 grammatical words

The dictionary is stored on 1 CD. All files are provided as plain text in UTF-8 character encoding, which represents about 154 Mb of data. The dictionary form...

MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 250.00 €
  • commercial: 6000.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 6000.00 €
  • commercial: 6000.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 450.00 €
  • commercial: 12000.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 12000.00 €
  • commercial: 12000.00 €
 Arboretum treebank    
  • Danish

ID: ELRA-W0084

ISLRN: 025-729-182-451-2

The Arboretum treebank is a morphologically and syntactically annotated repository of Danish sentences, taken from Korpus 90 and Korpus 2000, both compiled by the Society for Danish Language and Literature (http://ordnet.dk/korpusdk/fakta), and containing samples of written Danish from the 90'ies...

MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 1500.00 €
  • commercial: 7000.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 7000.00 €
  • commercial: 7000.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 2200.00 €
  • commercial: 10000.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 10000.00 €
  • commercial: 10000.00 €
 ARCADE/ROMANSEVAL corpus    
  • English
  • French
  • Italian

ID: ELRA-W0018

ISLRN: 681-769-134-114-2

The ARCADE/ROMANSEVAL corpus was used as a reference corpus in two international competitions:
  • ARCADE, an exercise on multilingual text alignment financed by AUPELF-UREF
  • ROMANSEVAL, part of the SENSEVAL exercise sponsored by ACL-SIGLEX and EURALEX, on word sense disambiguation.

The corpus con...

MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 0.00 €
  • commercial: 2000.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 2000.00 €
  • commercial: 2000.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 0.00 €
  • commercial: 5000.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 5000.00 €
  • commercial: 5000.00 €
 A "scientific" corpus of modern French ("La Recherche" magazine) - Complete version    
  • French

ID: ELRA-W0025-02

ISLRN: 798-363-116-656-4

This "scientific" corpus of modern French was produced by the University of Nantes (France) within the European Commission funded project LRsP&P (Language Resources Production & Packaging - LE4-8335). The corpus contains all articles published in La Recherche magazine in 1998, including issues 30...

MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 400.00 €
  • commercial: 3000.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 3000.00 €
  • commercial: 3000.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 500.00 €
  • commercial: 5000.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 5000.00 €
  • commercial: 5000.00 €
 A "scientific" corpus of modern French ("La Recherche" magazine) - Raw data    
  • French

ID: ELRA-W0025-01

ISLRN: 508-941-013-339-7

This "scientific" corpus of modern French was produced by the University of Nantes (France) with funding from ELRA in the framework of the European Commission project LRsP&P (Language Resources Production & Packaging - LE4-8335). The corpus contains all articles published in La Recherche mag...

MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 240.00 €
  • commercial: 1200.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 1200.00 €
  • commercial: 1200.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 310.00 €
  • commercial: 1500.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 1500.00 €
  • commercial: 1500.00 €
 Automobile Engineering    
  • English
  • French
  • German
  • Spanish; Castilian

ID: ELRA-T0097

ISLRN: 536-306-764-088-7

Cards available: 1420
Languages: German, English, French, Spanish
Card Description: Each card in this terminological database contains a definition, relation between concepts, graphics, abbreviations, notes, sub-domains, sources, grammatical labels.

MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 1746.60 €
  • commercial: 1746.60 €
Licence: Commercial Use - ELRA VAR
  • academic: 1746.60 €
  • commercial: 1746.60 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 2911.00 €
  • commercial: 2911.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 2911.00 €
  • commercial: 2911.00 €
 BAS GEO1      
  • German

ID: ELRA-S0164

ISLRN: 853-731-110-167-7

BAS GEO1 is a simple database about the most important location names of Germany, Austria and Switzerland together with their canonical pronunciation coded in SAMPA. BAS GEO1 may be used as a basis for automatic speech recognition of German postal addresses or to feed a speech synthesis algorithm...

MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 172.82 €
  • commercial: 1400.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 1400.00 €
  • commercial: 1400.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 255.65 €
  • commercial: 2800.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 2800.00 €
  • commercial: 2800.00 €
 BDLEX      
  • French

ID: ELRA-S0004

ISLRN: 613-587-811-827-8

BDLEX consists of a lexical database developed within the French GDR-PRC CHM at IRIT (IMH-PT team), Paul Sabatier University, Toulouse. The data cover lexical, phonological, and morphological information. The database BDLEX consists of about 440,000 inflected forms (generated from about 50,000 c...

MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 1000.00 €
  • commercial: 3000.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 3000.00 €
  • commercial: 3000.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 2000.00 €
  • commercial: 5000.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 5000.00 €
  • commercial: 5000.00 €
 BioLexicon    
  • English

ID: ELRA-T0373

ISLRN: 152-047-849-795-0

BioLexicon is a large-scale English terminological resource which has been developed to address the needs emerging in text mining efforts in the biomedical domain. It contains information on:
  • terminological nouns, including nominalised verbs and proper names (e.g., gene names)
  • terminological ...

MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 200.00 €
  • commercial: 4500.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 9000.00 €
  • commercial: 9000.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 320.00 €
  • commercial: 6000.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 11000.00 €
  • commercial: 11000.00 €
 Biology Database    
  • English
  • Korean

ID: ELRA-T0365

ISLRN: 987-153-588-577-9

This bilingual terminology database produced by Kaist Korterm consists of 31 884 entries in Korean and English in the field of biology.

MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 1063.00 €
  • commercial: 6377.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 6377.00 €
  • commercial: 6377.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 2126.00 €
  • commercial: 12754.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 12754.00 €
  • commercial: 12754.00 €
 BrasiLEX Brazilian Portuguese lexicon    
  • Portuguese

ID: ELRA-L0034

ISLRN: 654-505-941-943-8

BrasiLEX is a multifunctional monolingual lexicon of the Brazilian variety of Portuguese, developed by the Natural Language Group of INESC. It has about 65,000 entries (lemmas) and 1,600 corresponding inflexion paradigms. The set of entries includes compound words and the inflexion paradigms incl...

MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 3000.00 €
  • commercial: 25000.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 25000.00 €
  • commercial: 25000.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 5000.00 €
  • commercial: 30000.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 30000.00 €
  • commercial: 30000.00 €

This resource is also available in a bundle.

 British English Source Lexicon (BESL) version 2.2    
  • English

ID: ELRA-L0058

ISLRN: 875-872-158-794-8

BESL is a complete database of the English lexicon. It consists of over 230,000 lemmas, over 350,000 word forms, 60,000 proper nouns, 3,000 abbreviations, and 58,000 multi-word compound nouns. Each headword is provided with a full listing of all inflected forms and other morphological variation. ...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  • 7000.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  • 10000.00 €
 Bulgarian Linguistic Database    
  • Bulgarian

ID: ELRA-L0075

ISLRN: 450-247-052-039-5

This database contains 81,647 entries in Bulgarian with a linguistic environment tool (for WINDOWS XP). The data may be used for morphological analysis and synthesis, syntactic agreement checking, phonetic stress determining. Structure of entries: Local linguistic variant File format: MS ACCESS ...

MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 2000.00 €
  • commercial: 10000.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 10000.00 €
  • commercial: 10000.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 4000.00 €
  • commercial: 16000.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 16000.00 €
  • commercial: 16000.00 €
 Bulgarian Morphological Dictionary    
  • Bulgarian

ID: ELRA-L0030

ISLRN: 611-552-122-892-7

This dictionary contains 67500 entries divided into 242 inflectional types (including proper nouns), morphosyntactic information for each entry, and a morphological engine (MS DOS and WINDOWS 95/NT) for morphological analysis and generation. The data may be used for morphological analysis and syn...

MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 45.00 €
  • commercial: 6000.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 6000.00 €
  • commercial: 6000.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 100.00 €
  • commercial: 12000.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 12000.00 €
  • commercial: 12000.00 €
 Catalan Corpus of News Articles    
  • Catalan; Valencian

ID: ELRA-W0047

ISLRN: 000-089-517-382-8

The Catalan Corpus of News Articles comprises articles in Catalan from 1 January 1999 to 31 March 2007. These articles are grouped per trimester, with no chronological ordering within each trimester. The DVD contains one folder per year. Each folder has been divided into subfolders, containing the archives per tri...

MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 2975.00 €
  • commercial: 14855.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 14855.00 €
  • commercial: 14855.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
  • academic: 3930.00 €
  • commercial: 19315.00 €
Licence: Commercial Use - ELRA VAR
  • academic: 19315.00 €
  • commercial: 19315.00 €
