Cantonese Conversational Speech Data by Mobile Phone and Voice Recorder - 607 Hours
View resource name in all available languages
Base de données orales de conversations en cantonais par téléphone portable et enregistreur vocal - 607 heures
ID:
ELRA-S0427
995 local Cantonese speakers participated in the recording, and conducted face-to-face communication in a natural way. They had free discussion on a number of given topics, with a wide range of fields; the voice was natural and fluent, in line with the actual dialogue scene. Text is transcribed manually, with high accuracy.
Format:Mobile phone: 16kHz, 16bit, mono channel, .wav; Voice recorder: 44.1kHz, 16bit, dual channel, .wav;
Environment:quiet indoor environment, without echo
Recording Content:dozens of topics are specified, and the speakers make dialogue under those topics while the recording is performed
Demographics:995 Cantonese; 45% speakers of all are in the age group of 26-45; 504 speakers of them spoke in groups of two speakers, 195 speakers of them spoke in groups of three speakers, 196 speakers of them spoke in groups of four speakers, and the other 100 speakers spoke in groups of five speakers
Annotation:annotating for the transcription text, speaker identification and gender
Device:mobile phone and voice recorder
Language:Cantonese
Application Scenario:Voice Recognition, Voice Print Recognition
Accuracy rate:95%
View resource description in
French
995 locuteurs cantonais locaux ont participé à l'enregistrement du corpus et ont mené une conversation en face à face de manière naturelle. Ils ont poursuivi des discussions libres sur un certain nombre de thèmes donnés, avec un large éventail de domaines. La voix était naturelle et fluide, en ligne avec la scène de dialogue réelle. Le texte est transcrit manuellement, avec une précision élevée.
Format : téléphone portable : 16 kHz, 16 bits, canal mono, .wav ; Enregistreur vocal : 44,1 kHz, 16 bits, double canal, .wav ;
Environnement : environnement intérieur calme, sans écho
Contenu des enregistrements : des dizaines de thèmes sont spécifiés, et les locuteurs dialoguent sur ces thèmes pendant l'enregistrement
Données démographiques : 995 cantonais ; 45% des locuteurs sont dans la tranche d'âge des 26-45 ans ; 504 locuteurs ont parlé en groupes de deux locuteurs, 195 locuteurs ont parlé en groupes de trois locuteurs, 196 locuteurs ont parlé en groupes de quatre locuteurs et les 100 autres locuteurs ont parlé en groupes de cinq locuteurs
Annotation : annotation pour le texte de transcription, l'identification du locuteur et le sexe
Supports d'enregistrement : téléphone portable et enregistreur vocal
Langue : cantonais
Applications : reconnaissance vocale, reconnaissance d'impression vocale
Taux de précision : 95 %
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
98030.50 €
|
98030.50 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
98030.50 €
|
98030.50 €
|
Special offer:
-
Subsets of this dataset, based on a reduced number of entries, may be requested upon demand. This offer also applies for the datasets listed below.
- Chinese Speaking English Speech Data by Mobile phone - 593 Hours
- Mandarin Heavy Accent Speech Data by Mobile Phone - 662 Hours
- Mandarin Heavy Accent Speech Data by Mobile Phone - 131 Hours
- Uyghur Speech Data by Mobile Phone - 738 Hours
- Sichuan Dialect Speech Data by Mobile Phone - 794 Hours
- Cantonese Dialect Speech Data by Mobile Phone - 1,652 Hours
- Shanghai Dialect Speech Data by Mobile Phone - 1,030 Hours
- Japanese Speech Data by Mobile Phone_R - 234 Hours
- Korean Speech Data by Mobile Phone_Reading - 197 Hours
- British Children Speech Data by Microphone - 55 Hours
- German Speech Data by Mobile Phone_Reading - 211 Hours
- Italian Speech Data by Mobile Phone_Reading - 215 Hours
- Thai Speech Data by Mobile Phone_Reading - 292 Hours
- Indonesian Speech Data by Mobile Phone_R - 359 Hours
- Malay Speech Data by Mobile Phone_Reading - 134 Hours
- American Children Speech Data by Microphone - 50 Hours
- American English Speech Data by Mobile Phone_Reading - 215 Hours
- British English Speech Data by Mobile Phone_Reading - 199 Hours
- French Speech Data by Mobile Phone_Reading - 231 Hours
- Spanish Speech Data by Mobile Phone_R - 227 Hours
- Hindi Speech Data by Mobile Phone_R - 240 Hours
- Spanish Speech Data by Mobile Phone - 338 Hours
- Italian Speech Data Collected by Mobile Phone - 347 Hours
- Korean Speech Data by Mobile Phone - 357 Hours
- Japanese Speech Data by Mobile Phone - 261 Hours
- Chinese Children Speech data by Mobile phone - 3,255 Hours
- Mixed Speech with Chinese and English Data by Mobile Phone - 1,535 Hours
- Indian English Speech Data by Mobile Phone - 1,012 Hours
- Wuhan Dialect Speech Data by Mobile Phone - 997 Hours
- Kunming Dialect Speech Data by Mobile Phone - 1,002 Hours
- Changsha Dialect Speech Data by Mobile Phone - 997 Hours
- Hindi Speech Data by Mobile Phone - 759 Hours
- Japanese Speech Data By Mobile Phone - 474 Hours
- Italian Speech Data by Mobile Phone - 1,441 Hours
- German Speech Data by Mobile Phone - 1,796 Hours
- British English Speech Data by Mobile Phone - 831 Hours
- Spanish Speech Data by Mobile Phone - 435 Hours
- French Speech Data by Mobile Phone - 769 Hours
- Brazilian Portuguese Speech Data by Mobile Phone - 1,044 Hours
- Non-Hispanic Spanish Speech Data by Mobile Phone - 762 Hours
- Russian Speech Data by Mobile Phone - 1,002 Hours
- German Speaking English Speech Data by Mobile Phone - 535 Hours
- French Speaking English Speech Data by Mobile Phone - 520 Hours
- Spanish Speaking English Speech Data by Mobile Phone - 388 Hours
- Indonesian Speech Data by Mobile Phone - 639 Hours
- Malay Speech Data by Mobile Phone - 370 Hours
- American English Speech Data by Mobile Phone - 800 Hours
- Mandarin Conversational Speech Data by Mobile Phone and Voice Recorder - 1,351 Hours
- Chinese Children Speaking English Speech Data by Mobile Phone - 464 Hours
- Chinese Speaking English Speech Data by Mobile Phone - 502 Hours
- Mandarin Strong Accent Speech Data by Mobile Phone - 1,025 Hours
- Vietnamese Speech Data by Mobile Phone - 760 Hours
- Korean Speech Data by Mobile Phone - 516 Hours
- Latin American Speaking English Speech Data by Mobile Phone - 117 Hours
- Italian Speaking English Speech Data by Mobile Phone - 227 Hours
- Portuguese Speaking English Speech Data by Mobile Phone - 209 Hours
- Russian Speaking English Speech Data by Mobile Phone - 230 Hours
- Singaporean Speaking English Speech Data by Mobile Phone - 201 Hours
- Australian English Speech Data by Mobile Phone - 199 Hours
- Canadian Speaking English Speech Data by Mobile Phone - 207 Hours
- Japanese Speaking English Speech Data by Mobile Phone - 207 Hours
- Mandarin Mobile Telephony Conversational Speech Collection Data - 2,657 Hours
- Sichuan Dialect Conversational Speech Data by Mobile Phone - 800 Hours
- Chinese Digital Speech Data by Mobile Phone - 11,010 People
- Wake-up Words Speech Data by Microphone - 1,027 People
- Mandarin Speech Data by Mobile Phone - 2,028 Hours