Learning Better Classification-based Reordering Model for Phrase-based Translation
Published online: 12 Apr. 2018
Pages: 145 - 152
DOI: https://doi.org/10.21307/ijanmc-2017-082
© 2017 Li Fuxue et al., published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Reordering is a challenging issue in phrase-based statistical machine translation systems. This paper proposes three techniques to optimize classification-based reordering models for phrase-based translation under the bracket transduction grammar framework. First, a forced decoding technique is adopted to learn reordering samples for training the maximum entropy model. Second, additional features are learned from the context of two consecutive phrases to improve the predictive power of the reordering classifier. Third, the reordering model score is integrated into the log-linear model as two feature functions (STRAIGHT and INVERTED) to improve its discriminative ability. Experimental results demonstrate significant improvements over the baseline on two translation tasks: Chinese-to-English and Chinese-to-Japanese.
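To make the third technique concrete, the following is a minimal sketch of how a maximum entropy reordering score might be split into two log-linear feature functions. It is not the authors' implementation. The feature names (such as `left_src_last=de`), the weights, and the choice that only the feature matching the chosen order fires are all assumptions made for illustration.

```python
import math

ORDERS = ("STRAIGHT", "INVERTED")


def maxent_probs(weights, features):
    """Return P(order | features) for each order under a maxent model.

    weights maps (feature_name, order) -> weight; missing entries count as 0.
    """
    scores = {o: sum(weights.get((f, o), 0.0) for f in features) for o in ORDERS}
    top = max(scores.values())  # subtract the max for numerical stability
    exps = {o: math.exp(s - top) for o, s in scores.items()}
    z = sum(exps.values())
    return {o: e / z for o, e in exps.items()}


def reordering_features(probs, order):
    """Split the reordering score into two feature functions.

    Assumption: only the feature matching the chosen order carries the
    log-probability; the other feature is 0 for this merge.
    """
    log_p = math.log(probs[order])
    return {
        "reorder_straight": log_p if order == "STRAIGHT" else 0.0,
        "reorder_inverted": log_p if order == "INVERTED" else 0.0,
    }


def loglinear_score(feature_values, lambdas):
    """Weighted sum of feature values, as in a log-linear translation model."""
    return sum(lambdas.get(name, 0.0) * v for name, v in feature_values.items())


# Hypothetical classifier weights over context features of two adjacent phrases.
example_weights = {
    ("left_src_last=de", "INVERTED"): 1.2,
    ("right_tgt_first=of", "INVERTED"): 0.8,
    ("left_src_last=de", "STRAIGHT"): -0.3,
}
example_features = ["left_src_last=de", "right_tgt_first=of"]

example_probs = maxent_probs(example_weights, example_features)
example_score = loglinear_score(
    reordering_features(example_probs, "INVERTED"),
    {"reorder_straight": 0.5, "reorder_inverted": 0.9},  # hypothetical tuned weights
)
```

Keeping the two orders as separate features lets the tuning procedure assign them different weights, so the decoder can, for example, penalize inversions more heavily than monotone merges; a single combined reordering feature would force both orders to share one weight.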