Learning Better Classification-based Reordering Model for Phrase-based Translation

Reordering is of a challenging issue in phrase-based statistical machine translation systems. This paper proposed three techniques to optimize classification-based reordering models for phrase-based translation under the bracket transduction grammar framework. First, a forced decoding technique is adopted to learn reordering samples for maximum entropy model training. Secondly, additional features are learned from the context of two consecutive phrases to enhance the prediction ability of the reordering classifier. Thirdly, the reordering model score is integrated as two feature functions (STRAIGHT and INVERTED) into the log-linear model to improve its discriminative ability. Experimental result demonstrates significant improvements over the baseline in two translation tasks such as Chinese to English and Chinese to Japanese translation.

Langue:: Anglais

Périodicité:: 4 fois par an
Sujets de la revue:: Informatique, Informatique, autres

RSS Feed de la revue

Learning Better Classification-based Reordering Model for Phrase-based Translation

Li Fuxue

Xiao Tong

Zhu Jingbo

Publié en ligne: 12 avr. 2018

Pages: 145 - 152

DOI: https://doi.org/10.21307/ijanmc-2017-082

Mots clésstatistical machine translation, word reordering, log linear model, feature selection

© 2017 Li Fuxue et al., published by Sciendo

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Mots clés
statistical machine translation, word reordering, log linear model, feature selection