Resource Type:

Corpus:
Lexical/Conceptual:
Tool/Service:
Language Description:

Media Type:

Text:
Audio:
Image:
Video:
Text Numerical:
Text N-Gram:

7 Language Resources

Order by:

 Pashto phonetic lexicon      
  • Pushto; Pashto

ID: ELRA-S0392

ISLRN: 186-827-325-462-6

This is a phonetic lexicon of 21,560 tokens in Pashto with their phonetic transcription in IPA. It covers the major dialect of the TRAD Pashto Broadcast News Speech Corpus (see ELRA Catalogue reference ELRA-S0381) from which the most frequent words were extracted. The pronunciation dictionary of ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
600.00 € submit
5000.00 € submit
Licence: Commercial Use - ELRA VAR
5000.00 € submit
5000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
700.00 € submit
6000.00 € submit
Licence: Commercial Use - ELRA VAR
6000.00 € submit
6000.00 € submit
 TRAD Pashto Broadcast News Speech Corpus    
  • Pushto; Pashto

ID: ELRA-S0381

ISLRN: 918-508-885-913-7

This corpus contains transcribed broadcast news recordings in Pashto. Recordings are collected from 5 sources: Ashna TV, Azadi Radio, Deewa Radio, Mashaal Radio and Shamshad TV. The corpus contains 108 hours of recordings covering more than 1,000 speakers. Transcriptions are provided together ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2000.00 € submit
20000.00 € submit
Licence: Commercial Use - ELRA VAR
20000.00 € submit
20000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
3500.00 € submit
28000.00 € submit
Licence: Commercial Use - ELRA VAR
28000.00 € submit
28000.00 € submit
 TRAD Pashto-English News Articles Parallel corpus    
  • English
  • Pushto; Pashto

ID: ELRA-W0097

ISLRN: 612-936-517-010-2

This is a parallel corpus, which contains 10,000 Pashto words translated into English by two different translators. The source texts have been collected from the following news websites: Azadiradio, Mashaal and Voice of America Pashto. The content has also been translated into French (see ELRA-W...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
350.00 € submit
1000.00 € submit
Licence: Commercial Use - ELRA VAR
1000.00 € submit
1000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
500.00 € submit
2000.00 € submit
Licence: Commercial Use - ELRA VAR
2000.00 € submit
2000.00 € submit
 TRAD Pashto-English Parallel corpus of transcribed Broadcast News Speech - Test data    
  • English
  • Pushto; Pashto

ID: ELRA-W0095

ISLRN: 006-102-605-738-4

This is a parallel corpus, which contains 10,000 Pashto words translated into English. The source texts come from 3 broadcast news transcriptions of the TRAD Pashto Broadcast News Speech Corpus (ELRA-S0381). These texts are VOA Ashna TV programs recorded on 15/01/2011, 18/01/2011 and 19/01/2011. ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
350.00 € submit
1000.00 € submit
Licence: Commercial Use - ELRA VAR
1000.00 € submit
1000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
500.00 € submit
2000.00 € submit
Licence: Commercial Use - ELRA VAR
2000.00 € submit
2000.00 € submit
 TRAD Pashto-French News Articles Parallel corpus    
  • French
  • Pushto; Pashto

ID: ELRA-W0096

ISLRN: 649-628-149-051-7

This is a parallel corpus, which contains 10,000 Pashto words translated into French by two different translators. The source texts have been collected from the following news websites: Azadiradio, Mashaal and Voice of America Pashto. The content has also been translated into English (see ELRA-W...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
350.00 € submit
1000.00 € submit
Licence: Commercial Use - ELRA VAR
1000.00 € submit
1000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
500.00 € submit
2000.00 € submit
Licence: Commercial Use - ELRA VAR
2000.00 € submit
2000.00 € submit
 TRAD Pashto-French Parallel corpus of transcribed Broadcast News Speech - Test data    
  • French
  • Pushto; Pashto

ID: ELRA-W0094

ISLRN: 547-897-479-723-3

This is a parallel corpus, which contains 10,000 Pashto words translated into French by two different translators. The source texts come from 3 broadcast news transcriptions of the TRAD Pashto Broadcast News Speech Corpus (ELRA-S0381). These texts are VOA Ashna TV programs recorded on 15/01/2011,...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
350.00 € submit
1000.00 € submit
Licence: Commercial Use - ELRA VAR
1000.00 € submit
1000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
500.00 € submit
2000.00 € submit
Licence: Commercial Use - ELRA VAR
2000.00 € submit
2000.00 € submit
 TRAD Pashto Monolingual text Corpus    
  • Pushto; Pashto

ID: ELRA-W0092

ISLRN: 394-903-293-388-0

This is a monolingual text corpus in Pashto. The corpus contains about 112,000,000 tokens collected from 46 different blogs and websites. Identified and negotiated or freely available sources have been crawled in 2012, cleaned and XML-formatted. Pashto is an indo-iranian language spoken by th...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1200.00 € submit
3500.00 € submit
Licence: Commercial Use - ELRA VAR
3500.00 € submit
3500.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2000.00 € submit
5000.00 € submit
Licence: Commercial Use - ELRA VAR
5000.00 € submit
5000.00 € submit