Development of a Syntax-Based Model for English - Igbo  Statistical Machine Translation

Esan Adebimpe; John B.  Oladosu

Authors

Esan Adebimpe Federal University Oye-Ekiti, Ekiti state, Nigeria
John B. Oladosu LAUTECH, Ogbomoso, Oyo State, Nigeria

Keywords:

Syntax, Religious, Language model, Translation model, Domain and Corpora

Abstract

Semantic errors occurred due to syntactical difference between English and Igbo languages in existing statistical machine translators. Therefore, a syntax-based model was developed in this research for English-Igbo statistical machine translation.
Parallel corpus was obtained from the religious domain and word alignments were made on the English and Igbo corpora with
GIZA++. The Hidden Markov Model uses the word alignments produced by GIZA++ to estimate a maximum likelihood translation table. The Language model for the target language was built using IRSTLM toolkit and the model was tuned using Minimum error rate training (MERT). The developed SMT system was evaluated using BLEU and NIST and the results were compared to an existing related work. Results showed that the developed model outperformed the previous system by up to 0.3 BLEU score and 3.0 NIST scores respectively.

Development of a Syntax-Based Model for English - Igbo Statistical Machine Translation

Authors

Keywords:

Abstract

Author Biographies

Esan Adebimpe , Federal University Oye-Ekiti, Ekiti state, Nigeria

John B. Oladosu, LAUTECH, Ogbomoso, Oyo State, Nigeria

Downloads

Published

How to Cite

Issue

Section

Most read articles by the same author(s)

EDITORIAL BOARD

Current Issue

Information