Mask-Predict
September 5, 2019 · View on GitHub
Download model
| Description | Dataset | Model |
|---|---|---|
| MASK-PREDICT | [WMT14 English-German] | download (.tar.bz2) |
| MASK-PREDICT | [WMT14 German-English] | download (.tar.bz2) |
| MASK-PREDICT | [WMT16 English-Romanian] | download (.tar.bz2) |
| MASK-PREDICT | [WMT16 Romanian-English] | download (.tar.bz2) |
| MASK-PREDICT | [WMT17 English-Chinese] | download (.tar.bz2) |
| MASK-PREDICT | [WMT17 Chinese-English] | download (.tar.bz2) |
Preprocess
text=PATH_YOUR_DATA
output_dir=PATH_YOUR_OUTPUT
src=source_language
tgt=target_language
model_path=PATH_TO_MASKPREDICT_MODEL_DIR
python preprocess.py --source-lang {tgt} --trainpref text/valid --testpref {output_dir}/data-bin --workers 60 --srcdict {model_path}/maskPredict_{src}{src}.txt --tgtdict {model_path}/maskPredict_{src}{tgt}.txt
Train
model_dir=PLACE_TO_SAVE_YOUR_MODEL
python train.py {model_dir}
Evaluation
python generate_cmlm.py {model_dir}/checkpoint_best_average.pt --task translation_self --remove-bpe --max-sentences 20 --decoding-iterations 10 --decoding-strategy mask_predict
License
MASK-PREDICT is CC-BY-NC 4.0. The license applies to the pre-trained models as well.
Citation
Please cite as:
@inproceedings{ghazvininejad2019MaskPredict,
title = {Mask-Predict: Parallel Decoding of Conditional Masked Language Models},
author = {Marjan Ghazvininejad, Omer Levy, Yinhan Liu, Luke Zettlemoyer},
booktitle = {Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing},
year = {2019},
}