Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Tool/Service: | |
Language Description: |
Media Type:
Text: | |
Audio: | |
Image: | |
Video: | |
Text Numerical: | |
Text N-Gram: |
262 Language Resources (Page 2 of 14)
« Previous | Next »Order by:
- Italian
ID: ELRA-L0145
ISLRN: 799-050-908-518-3The series of Bitext Lexical Datasets includes Lemmas, POS tagging, Frequency, Named Entities and Offensive features. Depending on the dataset and language, other syntactic and morphological features are also provided. The Bitext Lexical Dataset - Italian consists of 65,000 lemmas (1,400,000 form...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
67000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
67000.00 €
|
- Arabic
ID: ELRA-L0151
ISLRN: 898-259-529-174-3As a complement to the generic vocabulary provided in ELRA-L0136, language variants of Arabic are provided with the following features: Voice, Tense, Mood, Person, Number, Gender, Case, Definiteness, Pronominal Clitics, Category (except for Arabic MSA). Variants are distributed as follows: - A...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
185000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
185000.00 €
|
- Chinese
ID: ELRA-L0152
ISLRN: 345-861-801-718-8As a complement to the generic vocabulary provided in ELRA-L0137 and ELRA-L0138, the following language variants of Chinese are provided: - Chinese Simplified: 74,000 lemmas (forms) - Chinese Traditional: 74,000 lemmas (forms)
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
78000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
78000.00 €
|
- Dutch; Flemish
ID: ELRA-L0153
ISLRN: 740-197-371-258-0As a complement to the generic vocabulary provided in ELRA-L0139, language variants of Dutch are provided with the following features: Tense, Mood, Person, Number, Gender, Contraction. Variants are distributed as follows: - Dutch Netherlands: 106,000 lemmas / 586,000 forms - Dutch Belgium: 97...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
74000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
74000.00 €
|
- English
ID: ELRA-L0154
ISLRN: 200-337-685-053-0As a complement to the generic vocabulary provided in ELRA-L0140, language variants of English are provided with the following features: Tense, Person, Number, Gender, Degree, Contraction. Variants are distributed as follows: - English US: 63,000 lemmas / 188,000 forms - English UK: 63,000 lem...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
96000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
96000.00 €
|
- Finnish
ID: ELRA-L0155
ISLRN: 921-047-661-179-3As a complement to the generic vocabulary provided in ELRA-L0141, language variants of Finnish are provided with the following features: Voice, Tense, Mood, Person, Number, Case, Degree, Pronominal Clitics, Formality. Variants are distributed as follows: - Finnish Standard: 74,000 lemmas / 74,0...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
94000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
94000.00 €
|
- French
ID: ELRA-L0156
ISLRN: 484-371-030-999-7As a complement to the generic vocabulary provided in ELRA-L0142, language variants of French are provided with the following features: Tense, Mood, Person, Number, Gender, Contraction, Pronominal Clitics. Variants are distributed as follows: - French (France): 76,000 lemmas / 1,450,000 forms ...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
88000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
88000.00 €
|
- German
ID: ELRA-L0157
ISLRN: 423-414-945-503-9As a complement to the generic vocabulary provided in ELRA-L0143, language variants of German are provided with the following features: Tense, Mood, Person, Number, Gender, Case, Degree, Contraction. Variants are distributed as follows: - German (Germany): 101,000 lemmas / 2,600,000 forms - G...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
74000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
74000.00 €
|
- Italian
ID: ELRA-L0158
ISLRN: 291-311-446-172-3As a complement to the generic vocabulary provided in ELRA-L0145, language variants of Italian are provided with the following features: Tense, Mood, Person, Number, Gender, Contraction, Pronominal Clitics. Variants are distributed as follows: - Italian (Italy): 82,000 lemmas / 1,470,000 forms...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
74000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
74000.00 €
|
- Norwegian
ID: ELRA-L0159
ISLRN: 990-947-280-161-4As a complement to the generic vocabulary provided in ELRA-L0147, language variants of Norwegian are provided with the following features: Tense, Person, Number, Gender, Case, Degree, Definiteness. Variants are distributed as follows: - Norwegian (Bokmal): 45,000 lemmas / 500,000 forms - Norwe...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
88000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
88000.00 €
|
- Portuguese
ID: ELRA-L0160
ISLRN: 318-935-224-974-8As a complement to the generic vocabulary provided in ELRA-L0148, language variants of Portuguese are provided with the following features: Tense, Mood, Person, Number, Gender, Pronominal Clitics. - Portuguese (Portugal): 51,000 lemmas / 3,780,000 forms - Portuguese (Brazil): 36,000 lemmas / ...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
88000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
88000.00 €
|
- Spanish; Castilian
ID: ELRA-L0161
ISLRN: 310-440-658-038-5En complément du vocabulaire général fourni dans ELRA-L0149, les variantes linguistiques of espagnol sont fournies avec les informations suivantes: temps, mode, personne, nombre, genre, clitiques pronominaux. Les variantes sont réparties comme suit: - espagnol (Espagne): 85,000 lemmes / 1,340,...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
124000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
124000.00 €
|
- Malay (macrolanguage)
ID: ELRA-L0146
ISLRN: 841-153-293-824-7The series of Bitext Lexical Datasets includes Lemmas, POS tagging, Frequency, Named Entities and Offensive features. Depending on the dataset and language, other syntactic and morphological features are also provided. The Bitext Lexical Dataset -Malay consists of 45,000 lemmas (120,000 forms) as...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
67000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
67000.00 €
|
- Norwegian
ID: ELRA-L0147
ISLRN: 906-018-621-709-5The series of Bitext Lexical Datasets includes Lemmas, POS tagging, Frequency, Named Entities and Offensive features. Depending on the dataset and language, other syntactic and morphological features are also provided. The Bitext Lexical Dataset - Norwegian (Bokmal) consists of 45,000 lemmas (500...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
67000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
67000.00 €
|
- Portuguese
ID: ELRA-L0148
ISLRN: 984-017-567-921-2The series of Bitext Lexical Datasets includes Lemmas, POS tagging, Frequency, Named Entities and Offensive features. Depending on the dataset and language, other syntactic and morphological features are also provided. The Bitext Lexical Dataset - Portuguese consists of 40,000 lemmas (3,500,000 f...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
67000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
67000.00 €
|
- Spanish; Castilian
ID: ELRA-L0149
ISLRN: 337-528-420-161-1The series of Bitext Lexical Datasets includes Lemmas, POS tagging, Frequency, Named Entities and Offensive features. Depending on the dataset and language, other syntactic and morphological features are also provided. The Bitext Lexical Dataset - Spanish consists of 60,000 lemmas (2,500,000 form...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
67000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
67000.00 €
|
- Ukrainian
ID: ELRA-L0150
ISLRN: 395-379-970-824-1The series of Bitext Lexical Datasets includes Lemmas, POS tagging, Frequency, Named Entities and Offensive features. Depending on the dataset and language, other syntactic and morphological features are also provided. The Bitext Lexical Dataset - Ukrainian consists of 40,000 lemmas (650,000 form...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
72000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
72000.00 €
|
- English
ID: ELRA-L0202
ISLRN: 470-885-612-363-1The Bitext Synonym Data - General Language includes 31,723 entries and more than 100,000 synonyms for English language. This dataset is a set of synonyms developed to augment the English version of Wordnet, a powerful open-source lexical database, released in 2005. All synonyms can be linked to B...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
65000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
65000.00 €
|
- English
ID: ELRA-L0162
ISLRN: 969-007-860-723-7The Bitext Synthetic Data consist of pre-built training data for intent detection and are provided for 20 verticals for English language (see ELRA-L0162 to ELRA-L0181). They cover the most common intents for each vertical and include a large number of example utterances for each intent, with opti...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
26000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
26000.00 €
|
- Spanish; Castilian
ID: ELRA-L0182
ISLRN: 745-867-265-444-7The Bitext Synthetic Data consist of pre-built training data for intent detection and are provided for 20 verticals for Spanish language (see ELRA-L0182 to ELRA-L0201). They cover the most common intents for each vertical and include a large number of example utterances for each intent, with opti...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
26000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
26000.00 €
|