Kyunghyun Cho, Bart Van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau [email protected], Fethi Bougares, Holger Schwenk, Yoshua Bengio, Université de Montréal Jacobs University Germany, Université du Maine France, Université de Montréal CIFAR Senior Fellow (2014)
This paper proposes a novel neural network model called RNN Encoder-Decoder for statistical machine translation. The model consists of two recurrent neural networks (RNN) where one encodes a source sequence into a fixed-length vector and the other decodes it back into a target sequence. The proposed RNN Encoder-Decoder jointly optimizes the conditional probabilities of the target sequence based on the source sequence, significantly improving the performance of a statistical machine translation system. The model captures semantically and syntactically meaningful representations of phrases, and qualitative analyses demonstrate its ability to better handle linguistic regularities compared to traditional translation models. The evaluation focuses on English to French translation using the WMT'14 workshop datasets and reveals improved BLEU scores when integrating features from the RNN Encoder-Decoder into existing systems.
This paper employs the following methods:
The following datasets were used in this research:
The authors identified the following limitations: