TY - GEN
T1 - Understanding and Improving Morphological Learning in the Neural Machine Translation Decoder
AU - Dalvi, Fahim
AU - Durrani, Nadir
AU - Sajjad, Hassan
AU - Belinkov, Yonatan
AU - Vogel, Stephan
N1 - Publisher Copyright:
©2017 AFNLP.
PY - 2017
Y1 - 2017
N2 - End-to-end training makes the neural machine translation (NMT) architecture simpler, yet elegant compared to traditional statistical machine translation (SMT). However, little is known about linguistic patterns of morphology, syntax and semantics learned during the training of NMT systems, and more importantly, which parts of the architecture are responsible for learning each of these phenomena. In this paper we i) analyze how much morphology an NMT decoder learns, and ii) investigate whether injecting target morphology into the decoder helps it produce better translations. To this end we present three methods: i) joint generation, ii) joint-data learning, and iii) multi-task learning. Our results show that explicit morphological information helps the decoder learn target language morphology and improves the translation quality by 0.2–0.6 BLEU points.
AB - End-to-end training makes the neural machine translation (NMT) architecture simpler, yet elegant compared to traditional statistical machine translation (SMT). However, little is known about linguistic patterns of morphology, syntax and semantics learned during the training of NMT systems, and more importantly, which parts of the architecture are responsible for learning each of these phenomena. In this paper we i) analyze how much morphology an NMT decoder learns, and ii) investigate whether injecting target morphology into the decoder helps it produce better translations. To this end we present three methods: i) joint generation, ii) joint-data learning, and iii) multi-task learning. Our results show that explicit morphological information helps the decoder learn target language morphology and improves the translation quality by 0.2–0.6 BLEU points.
UR - https://www.scopus.com/pages/publications/105019298458
M3 - Conference contribution
AN - SCOPUS:105019298458
T3 - 8th International Joint Conference on Natural Language Processing - Proceedings of the IJCNLP 2017, System Demonstrations
SP - 142
EP - 151
BT - 8th International Joint Conference on Natural Language Processing - Proceedings of the IJCNLP 2017, System Demonstrations
PB - Association for Computational Linguistics (ACL)
T2 - 8th International Joint Conference on Natural Language Processing, IJCNLP 2017
Y2 - 27 November 2017 through 1 December 2017
ER -