Dual Learning for Machine Translation

This is an unofficial implementation based on Dual Learning for Machine Translation upon OpenNMT.

The whole workflow works like:

Read raw sentences (from language A)
Translate with model A-->B to get K-best translations (in language B)
Translate with model B-->A to get 1-best translation (in language A)
Build batches to feed into trainer (all together K batches)
Train model A-->B with averaged gradient calculated based on K batches
Train model B-->A with averaged gradient calculated based on K batches
Read raw sentences (from language B)
iterate as above mentioned

Quickstart

./run.dual.sh

to see the training on demo data.

Notes

Preprocess the data.

We provide several scripts (./tools/scripts) to create the preprocessed data. We need to filter raw sentences (e.g. remove sentences with more than one ), sort and randomize batch sentences, and add mono-lingual data into preprocessed file. In this demo, we do not add mono-lingual data here.

Train the model.

We use two GPUs to help decoding and training. Specially, GPU1 is used for decoder_ab & trainer_ab, GPU2 is used for decoder_ba & trainer_ba. We set the batch_size to 32, which can be hold in 8G memory. We use SGD as default optimizer and set a small learning rate (0.01). K-best translation is set to 2. You may refer to ./run.dual.sh for details.

Performance.

We use both log P and BLEU score as the reward function. In the case of BLEU, the performance (evaluated by ppl. based on validation set) increases a little and then begins to decrease. The training speed is slow since there needs (1+K)2decode_whole_training_set+(1+1)2training. There is no parallelization at this time.

Name		Name	Last commit message	Last commit date
Latest commit History 1,434 Commits
benchmark		benchmark
data		data
docs		docs
onmt		onmt
rocks		rocks
test		test
tools		tools
.Dockerfile		.Dockerfile
.dokx		.dokx
.gitignore		.gitignore
.luacheckrc		.luacheckrc
.luacov		.luacov
.travis.yml		.travis.yml
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.md		LICENSE.md
README.md		README.md
STYLE.md		STYLE.md
codecov.yml		codecov.yml
dual.lua		dual.lua
lm.lua		lm.lua
mkdocs.yml		mkdocs.yml
preprocess.lua		preprocess.lua
run.dual.sh		run.dual.sh
tag.lua		tag.lua
train.lua		train.lua
translate.lua		translate.lua

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dual Learning for Machine Translation

Quickstart

Notes

About

Releases

Packages

Languages

License

monsieurzhang/OpenNMT

Folders and files

Latest commit

History

Repository files navigation

Dual Learning for Machine Translation

Quickstart

Notes

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages