12 Language Resources

 GlobalPhone Chinese-Mandarin Pronunciation Dictionary      
  • Chinese

ID: ELRA-S0363

ISLRN: 457-511-870-286-9

The GlobalPhone pronunciation dictionaries, created within the framework of the multilingual speech and language corpus GlobalPhone, were developed in collaboration with the Karlsruhe Institute of Technology (KIT). The GlobalPhone pronunciation dictionaries contain the pronunciations of all wo...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 600.00 € | 3000.00 €
Licence: Commercial Use - ELRA VAR | 3000.00 € | 3000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 700.00 € | 3600.00 €
Licence: Commercial Use - ELRA VAR | 3600.00 € | 3600.00 €

 LC-STAR Mandarin Chinese Phonetic lexicon      
  • Chinese

ID: ELRA-S0256

ISLRN: 103-062-804-789-9

The LC-STAR Mandarin Chinese Phonetic lexicon was created within the scope of the LC-STAR project (IST 2001-32216), which was sponsored by the European Commission. The lexicon comprises 104,368 entries, distributed over three categories: a set of 38,098 common word entries. This set is extract...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 27000.00 € | 40000.00 €
Licence: Commercial Use - ELRA VAR | 40000.00 € | 40000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 38000.00 € | 50000.00 €
Licence: Commercial Use - ELRA VAR | 50000.00 € | 50000.00 €

 NE3L named entities Chinese corpus    
  • Chinese

ID: ELRA-W0079

ISLRN: 187-154-782-686-9

The NE3L project (Named Entities 3 Languages) consisted of annotating corpora in several languages with named entities. The text data were extracted from newspapers and cover various topics. Three languages were annotated: Arabic, Chinese and Russian. For this project, 5...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 5000.00 € | 5000.00 €
Licence: Commercial Use - ELRA VAR | 5000.00 € | 5000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 5000.00 € | 5000.00 €
Licence: Commercial Use - ELRA VAR | 5000.00 € | 5000.00 €

 Original Short-Message Data Collation II in Chinese    
  • Chinese

ID: ELRA-W0045-05

ISLRN: 004-512-635-005-4

This corpus comprises 2,604,901 characters, corresponding to 202,277 daily life short messages (SMS). This subset contains the original messages. All data have been proofread manually.

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 11942.00 € | 11942.00 €
Licence: Commercial Use - ELRA VAR | 11942.00 € | 11942.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 11942.00 € | 11942.00 €
Licence: Commercial Use - ELRA VAR | 11942.00 € | 11942.00 €

 Original Short-Message Data Collation II in Chinese (named entities)    
  • Chinese

ID: ELRA-W0045-08

ISLRN: 753-094-616-225-9

This corpus comprises 2,604,901 characters, corresponding to 202,277 daily life short messages (SMS). This subset contains original messages together with named entities. All data have been proofread manually.

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 14928.00 € | 14928.00 €
Licence: Commercial Use - ELRA VAR | 14928.00 € | 14928.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 14928.00 € | 14928.00 €
Licence: Commercial Use - ELRA VAR | 14928.00 € | 14928.00 €

 Original Short-Message Data Collation II in Chinese (participles)    
  • Chinese

ID: ELRA-W0045-07

ISLRN: 747-585-323-393-8

This corpus comprises 2,604,901 characters, corresponding to 202,277 daily life short messages (SMS). This subset contains original messages together with participles. All data have been proofread manually.

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 14928.00 € | 14928.00 €
Licence: Commercial Use - ELRA VAR | 14928.00 € | 14928.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 14928.00 € | 14928.00 €
Licence: Commercial Use - ELRA VAR | 14928.00 € | 14928.00 €

 Original Short-Message Data Collation II in Chinese (PinYin)    
  • Chinese

ID: ELRA-W0045-06

ISLRN: 745-287-055-486-8

This corpus comprises 2,604,901 characters, corresponding to 202,277 daily life short messages (SMS). This subset contains original messages together with PinYin transcription. All data, including the PinYin transcription, have been proofread manually.

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 14928.00 € | 14928.00 €
Licence: Commercial Use - ELRA VAR | 14928.00 € | 14928.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 14928.00 € | 14928.00 €
Licence: Commercial Use - ELRA VAR | 14928.00 € | 14928.00 €

 Original Short-Message Data Collation I in Chinese    
  • Chinese

ID: ELRA-W0045-01

ISLRN: 453-260-875-772-3

This corpus comprises 5,891,275 characters, corresponding to 51,568 short messages (SMS) from radio/TV stations and 213,694 daily life short messages. This subset contains the original messages. All data have been proofread manually.

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 14928.00 € | 14928.00 €
Licence: Commercial Use - ELRA VAR | 14928.00 € | 14928.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 14928.00 € | 14928.00 €
Licence: Commercial Use - ELRA VAR | 14928.00 € | 14928.00 €

 Original Short-Message Data Collation I in Chinese (named entities)    
  • Chinese

ID: ELRA-W0045-04

ISLRN: 169-161-744-054-8

This corpus comprises 5,891,275 characters, corresponding to 51,568 short messages (SMS) from radio/TV stations and 213,694 daily life short messages. This subset contains original messages together with named entities. All data have been proofread and tagged manually.

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 18659.00 € | 18659.00 €
Licence: Commercial Use - ELRA VAR | 18659.00 € | 18659.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 18659.00 € | 18659.00 €
Licence: Commercial Use - ELRA VAR | 18659.00 € | 18659.00 €

 Original Short-Message Data Collation I in Chinese (participles)    
  • Chinese

ID: ELRA-W0045-03

ISLRN: 327-586-643-099-5

This corpus comprises 5,891,275 characters, corresponding to 51,568 short messages (SMS) from radio/TV stations and 213,694 daily life short messages. This subset contains original messages together with participles. All data have been proofread manually.

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 18659.00 € | 18659.00 €
Licence: Commercial Use - ELRA VAR | 18659.00 € | 18659.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 18659.00 € | 18659.00 €
Licence: Commercial Use - ELRA VAR | 18659.00 € | 18659.00 €

 Original Short-Message Data Collation I in Chinese (PinYin)    
  • Chinese

ID: ELRA-W0045-02

ISLRN: 910-780-238-099-2

This corpus comprises 5,891,275 characters, corresponding to 51,568 short messages (SMS) from radio/TV stations and 213,694 daily life short messages. This subset contains original messages together with PinYin transcription. All data, including the PinYin transcription, have been proofread manually.

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 18659.00 € | 18659.00 €
Licence: Commercial Use - ELRA VAR | 18659.00 € | 18659.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 18659.00 € | 18659.00 €
Licence: Commercial Use - ELRA VAR | 18659.00 € | 18659.00 €

 The Lancaster Corpus of Mandarin Chinese (LCMC)    
  • Chinese

ID: ELRA-W0039

ISLRN: 990-638-120-277-2

The Lancaster Corpus of Mandarin Chinese (LCMC) is designed as a Chinese match for the FLOB and FROWN corpora for modern British and American English. The corpus is suitable for use in both monolingual research into modern Mandarin Chinese and cross-linguistic contrast of Chinese and British/Ame...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 7500.00 €
Licence: Commercial Use - ELRA VAR | 7500.00 € | 7500.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 12000.00 €
Licence: Commercial Use - ELRA VAR | 12000.00 € | 12000.00 €