Resource Type:

Corpus:
Lexical/Conceptual:
Tool/Service:
Language Description:

Media Type:

Text:
Audio:
Image:
Video:
Text Numerical:
Text N-Gram:

262 Language Resources (Page 2 of 14)

« Previous | Next »Order by:

 Bitext Lexical Dataset - Italian    
  • Italian

ID: ELRA-L0145

ISLRN: 799-050-908-518-3

The series of Bitext Lexical Datasets includes Lemmas, POS tagging, Frequency, Named Entities and Offensive features. Depending on the dataset and language, other syntactic and morphological features are also provided. The Bitext Lexical Dataset - Italian consists of 65,000 lemmas (1,400,000 form...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
67000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
67000.00 € submit
 Bitext Lexical Dataset - Language Variants - Arabic    
  • Arabic

ID: ELRA-L0151

ISLRN: 898-259-529-174-3

As a complement to the generic vocabulary provided in ELRA-L0136, language variants of Arabic are provided with the following features: Voice, Tense, Mood, Person, Number, Gender, Case, Definiteness, Pronominal Clitics, Category (except for Arabic MSA). Variants are distributed as follows: - A...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
185000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
185000.00 € submit
 Bitext Lexical Dataset - Language Variants - Chinese    
  • Chinese

ID: ELRA-L0152

ISLRN: 345-861-801-718-8

As a complement to the generic vocabulary provided in ELRA-L0137 and ELRA-L0138, the following language variants of Chinese are provided: - Chinese Simplified: 74,000 lemmas (forms) - Chinese Traditional: 74,000 lemmas (forms)

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
78000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
78000.00 € submit
 Bitext Lexical Dataset - Language Variants - Dutch    
  • Dutch; Flemish

ID: ELRA-L0153

ISLRN: 740-197-371-258-0

As a complement to the generic vocabulary provided in ELRA-L0139, language variants of Dutch are provided with the following features: Tense, Mood, Person, Number, Gender, Contraction. Variants are distributed as follows: - Dutch Netherlands: 106,000 lemmas / 586,000 forms - Dutch Belgium: 97...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
74000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
74000.00 € submit
 Bitext Lexical Dataset - Language Variants - English    
  • English

ID: ELRA-L0154

ISLRN: 200-337-685-053-0

As a complement to the generic vocabulary provided in ELRA-L0140, language variants of English are provided with the following features: Tense, Person, Number, Gender, Degree, Contraction. Variants are distributed as follows: - English US: 63,000 lemmas / 188,000 forms - English UK: 63,000 lem...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
96000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
96000.00 € submit
 Bitext Lexical Dataset - Language Variants - Finnish    
  • Finnish

ID: ELRA-L0155

ISLRN: 921-047-661-179-3

As a complement to the generic vocabulary provided in ELRA-L0141, language variants of Finnish are provided with the following features: Voice, Tense, Mood, Person, Number, Case, Degree, Pronominal Clitics, Formality. Variants are distributed as follows: - Finnish Standard: 74,000 lemmas / 74,0...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
94000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
94000.00 € submit
 Bitext Lexical Dataset - Language Variants - French    
  • French

ID: ELRA-L0156

ISLRN: 484-371-030-999-7

As a complement to the generic vocabulary provided in ELRA-L0142, language variants of French are provided with the following features: Tense, Mood, Person, Number, Gender, Contraction, Pronominal Clitics. Variants are distributed as follows: - French (France): 76,000 lemmas / 1,450,000 forms ...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
88000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
88000.00 € submit
 Bitext Lexical Dataset - Language Variants - German    
  • German

ID: ELRA-L0157

ISLRN: 423-414-945-503-9

As a complement to the generic vocabulary provided in ELRA-L0143, language variants of German are provided with the following features: Tense, Mood, Person, Number, Gender, Case, Degree, Contraction. Variants are distributed as follows: - German (Germany): 101,000 lemmas / 2,600,000 forms - G...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
74000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
74000.00 € submit
 Bitext Lexical Dataset - Language Variants - Italian    
  • Italian

ID: ELRA-L0158

ISLRN: 291-311-446-172-3

As a complement to the generic vocabulary provided in ELRA-L0145, language variants of Italian are provided with the following features: Tense, Mood, Person, Number, Gender, Contraction, Pronominal Clitics. Variants are distributed as follows: - Italian (Italy): 82,000 lemmas / 1,470,000 forms...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
74000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
74000.00 € submit
 Bitext Lexical Dataset - Language Variants - Norwegian    
  • Norwegian

ID: ELRA-L0159

ISLRN: 990-947-280-161-4

As a complement to the generic vocabulary provided in ELRA-L0147, language variants of Norwegian are provided with the following features: Tense, Person, Number, Gender, Case, Degree, Definiteness. Variants are distributed as follows: - Norwegian (Bokmal): 45,000 lemmas / 500,000 forms - Norwe...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
88000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
88000.00 € submit
 Bitext Lexical Dataset - Language Variants - Portuguese    
  • Portuguese

ID: ELRA-L0160

ISLRN: 318-935-224-974-8

As a complement to the generic vocabulary provided in ELRA-L0148, language variants of Portuguese are provided with the following features: Tense, Mood, Person, Number, Gender, Pronominal Clitics. - Portuguese (Portugal): 51,000 lemmas / 3,780,000 forms - Portuguese (Brazil): 36,000 lemmas / ...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
88000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
88000.00 € submit
 Bitext Lexical Dataset - Language Variants - Spanish    
  • Spanish; Castilian

ID: ELRA-L0161

ISLRN: 310-440-658-038-5

En complément du vocabulaire général fourni dans ELRA-L0149, les variantes linguistiques of espagnol sont fournies avec les informations suivantes: temps, mode, personne, nombre, genre, clitiques pronominaux. Les variantes sont réparties comme suit: - espagnol (Espagne): 85,000 lemmes / 1,340,...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
124000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
124000.00 € submit
 Bitext Lexical Dataset - Malay    
  • Malay (macrolanguage)

ID: ELRA-L0146

ISLRN: 841-153-293-824-7

The series of Bitext Lexical Datasets includes Lemmas, POS tagging, Frequency, Named Entities and Offensive features. Depending on the dataset and language, other syntactic and morphological features are also provided. The Bitext Lexical Dataset -Malay consists of 45,000 lemmas (120,000 forms) as...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
67000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
67000.00 € submit
 Bitext Lexical Dataset - Norwegian (Bokmal)    
  • Norwegian

ID: ELRA-L0147

ISLRN: 906-018-621-709-5

The series of Bitext Lexical Datasets includes Lemmas, POS tagging, Frequency, Named Entities and Offensive features. Depending on the dataset and language, other syntactic and morphological features are also provided. The Bitext Lexical Dataset - Norwegian (Bokmal) consists of 45,000 lemmas (500...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
67000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
67000.00 € submit
 Bitext Lexical Dataset - Portuguese    
  • Portuguese

ID: ELRA-L0148

ISLRN: 984-017-567-921-2

The series of Bitext Lexical Datasets includes Lemmas, POS tagging, Frequency, Named Entities and Offensive features. Depending on the dataset and language, other syntactic and morphological features are also provided. The Bitext Lexical Dataset - Portuguese consists of 40,000 lemmas (3,500,000 f...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
67000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
67000.00 € submit
 Bitext Lexical Dataset - Spanish    
  • Spanish; Castilian

ID: ELRA-L0149

ISLRN: 337-528-420-161-1

The series of Bitext Lexical Datasets includes Lemmas, POS tagging, Frequency, Named Entities and Offensive features. Depending on the dataset and language, other syntactic and morphological features are also provided. The Bitext Lexical Dataset - Spanish consists of 60,000 lemmas (2,500,000 form...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
67000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
67000.00 € submit
 Bitext Lexical Dataset - Ukrainian    
  • Ukrainian

ID: ELRA-L0150

ISLRN: 395-379-970-824-1

The series of Bitext Lexical Datasets includes Lemmas, POS tagging, Frequency, Named Entities and Offensive features. Depending on the dataset and language, other syntactic and morphological features are also provided. The Bitext Lexical Dataset - Ukrainian consists of 40,000 lemmas (650,000 form...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
72000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
72000.00 € submit
 Bitext Synonym Data - General Language    
  • English

ID: ELRA-L0202

ISLRN: 470-885-612-363-1

The Bitext Synonym Data - General Language includes 31,723 entries and more than 100,000 synonyms for English language. This dataset is a set of synonyms developed to augment the English version of Wordnet, a powerful open-source lexical database, released in 2005. All synonyms can be linked to B...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
65000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
65000.00 € submit
 Bitext Synthetic Data - Automotive (English language)    
  • English

ID: ELRA-L0162

ISLRN: 969-007-860-723-7

The Bitext Synthetic Data consist of pre-built training data for intent detection and are provided for 20 verticals for English language (see ELRA-L0162 to ELRA-L0181). They cover the most common intents for each vertical and include a large number of example utterances for each intent, with opti...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
26000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
26000.00 € submit
 Bitext Synthetic Data - Automotive (Spanish language)    
  • Spanish; Castilian

ID: ELRA-L0182

ISLRN: 745-867-265-444-7

The Bitext Synthetic Data consist of pre-built training data for intent detection and are provided for 20 verticals for Spanish language (see ELRA-L0182 to ELRA-L0201). They cover the most common intents for each vertical and include a large number of example utterances for each intent, with opti...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
26000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
26000.00 € submit

« Previous | Next »