English to urdu hierarchical phrase-based statistical machine translation

Nadeem Khan, Waqas Anwar, Usama Ijaz Bajwa, Nadir Durrani

Research output: Contribution to conferencePaperpeer-review

Abstract

This paper addresses the Hierarchical Phraseóbased (HPB) models which are used in development of different Statistió cal Machine Translation (SMT) Systems for many modern languages. Any SMT System needs large parallel coró pora for accurate performance. Therefore, availability of a large parallel corpus is a preórequisite for designing a reliable, roó bust SMT system between any two lanó guages. The HPB models have shown strong capability of generalization and reó ordering, which in turn gets improved reó sults for the sparse resourced languages. This paper considers English as Source and Urdu as target language for experió ments. For this study, Hierarchical phraseó based Baseline SMT system is used for English to Urdu translation. At the end auó tomatic evaluation of system is performed by using BLEU and NIST as evaluation metrics. Average BLEU evaluation score the developed system got is 13% which is a good competitive score for any sparse reó sourced language.
Original languageEnglish
Pages72-76
Number of pages5
Publication statusPublished - Oct 2013
Externally publishedYes
EventProceedings of the 4th Workshop on South and Southeast Asian Natural Language Processing - Nagoya, Japan
Duration: 1 Oct 2013 → …

Conference

ConferenceProceedings of the 4th Workshop on South and Southeast Asian Natural Language Processing
Country/TerritoryJapan
CityNagoya
Period1/10/13 → …

Fingerprint

Dive into the research topics of 'English to urdu hierarchical phrase-based statistical machine translation'. Together they form a unique fingerprint.

Cite this