English-Nepali Parallel Corpus – ELRA Catalogue

Last view: 2024-04-18

1178 Last view: 2024-04-18

Last update: 2020-03-06

4 Last update: 2020-03-06

Last download: 2024-04-11

116 Last download: 2024-04-11

English-Nepali Parallel Corpus

View resource name in all available languages

Corpus parallèle anglais-népalais

ISLRN: 853-487-663-161-6

ID:

ELRA-W0077

The Nepali Monolingual written corpus is one of the 3 resources that constitute the Nepali National Corpus. The Nepali National Corpus was produced in 2006 in the framework of the project Bhasha Sanchar (“language communication”), also known as Nelralec, for Nepali Language Resources and Localization for Education and Communication; funded by the EU Asia IT&C programme, reference number ASIE/2004/091-777.

This corpus consists of a collection of national development texts in English and Nepali and is divided as follows:
- a small set of data aligned at the sentence level (27,060 English words; 21,756 Nepali words), provided in the tmx format (xml file):
- a larger set of texts aligned at the document level (617,340 English words; 596,571 Nepali words), provided in raw text and in the original word processing format.
- an additional set of monolingual data in Nepali (386,879 words in Nepali), provided in raw text and in the original word processing format.

View resource description in French

Le corpus écrit monolingue népalais est l’une des 3 ressources qui composent le Corpus National Népalais. Le Corpus National Népalais a été produit en 2006 dans le cadre du projet Bhasha Sanchar (“communication langagière”), également connu sous le nom de Nelralec (Ressources linguistiques népalaises et localisation pour l’éducation et la communication); il a été financé par le programme de l’Union Européenne Asia IT&C, sous la référence ASIE/2004/091-777.

Ce corpus consiste en une collection de textes dans le domaine du développement national en anglais et népalais et est réparti comme suit:
- un petit ensemble de données alignées au niveau de la phrase (27 060 mots anglais; 21 756 mots népalais), fourni au format tmx (fichier xml):
- un plus grand ensemble de textes alignés au niveau du document (617 340 mots anglais; 596 571 mots népalais), fourni en texte brut dans le format de traitement word d’origine.
- un ensemble additionnel de données monolingues en népalais (386 879 mots en népalais), fournis en texte brut et dans le format de traitement word d’origine.

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	0.00 €

DistributionAvailability start date 15/01/2014 Distribution access medium Downloadable Contact Person

Valérie Mapelli

text

Bilingual text corpusLanguages

Nepali (macrolanguage) (1,005,206 Words) English (644,400 Words)

Linguality

Linguality type: Bilingual

Multi-linguality type: Parallel

Size

1,649,606 Words

Resource Creation

Creation ended: 01/01/2006

Funding Project

Nelralec (Bhasha Sanchar)

Funding Type: Other

Project ended: 01/01/2006

Metadata

Created: 05/12/2005

Metadata Language: French, English (fr, en)

Version

Version: 1.0

Last Updated: 01/15/2014

People who looked at this resource also viewed the following:

Resources from the same project