- Create the conda environment: `conda create --name curriculum_nmt python=3.7` (then activate it with `conda activate curriculum_nmt`)
- Install the requirements listed in `requirements.txt`, e.g. `pip install -r requirements.txt`
- Run `bash run_iwslt.sh download` to download the IWSLT dataset
- Run `bash run_iwslt.sh vocab` to generate the vocab files. This produces `iwslt_vocab.json` and `iwslt_word_freq.json`
- Train the model locally on IWSLT with `bash run_iwslt.sh train_local` (uses the "none" ordering)
- Train the model locally on IWSLT with a chosen scoring and pacing function, e.g. `bash run_iwslt.sh train_local rarity linear` for "rarity" ordering with "linear" pacing (see `scoring.py` and `pacing.py` for more options, and the sketch after this list for the general idea)
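
The actual scoring and pacing implementations live in `scoring.py` and `pacing.py`; the sketch below is only an illustration of what a "rarity" scoring function and a "linear" pacing function typically look like in curriculum training. The function names, the assumed token-to-count format of `iwslt_word_freq.json`, and the exact formulas are assumptions for exposition, not the repository's API.

```python
# Hedged sketch: illustrative rarity scoring + linear pacing, not the repo's code.
import json
import math

def load_word_freq(path="iwslt_word_freq.json"):
    # Assumption: the file maps each token to its corpus count.
    with open(path) as f:
        return json.load(f)

def rarity_score(tokens, word_freq, total_count):
    # A common "rarity" heuristic: sum of negative log unigram probabilities,
    # so sentences built from rare words receive higher (harder) scores.
    return sum(-math.log(word_freq.get(tok, 1) / total_count) for tok in tokens)

def linear_pacing(step, total_steps, start_fraction=0.1):
    # "Linear" pacing: the fraction of the sorted training data exposed to the
    # model grows linearly from start_fraction to 1.0 over training.
    frac = start_fraction + (1.0 - start_fraction) * (step / total_steps)
    return min(1.0, frac)

def visible_examples(sorted_examples, step, total_steps):
    # sorted_examples is assumed to be ordered easiest-to-hardest by rarity_score.
    cutoff = int(linear_pacing(step, total_steps) * len(sorted_examples))
    return sorted_examples[:max(1, cutoff)]
```

At each training step the model would then sample batches only from `visible_examples(...)`, so easy (frequent-word) sentences dominate early training and harder ones are introduced gradually.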
- Fine-Tuning by Curriculum Learning for Non-Autoregressive Neural Machine Translation (arXiv)
- On The Power of Curriculum Learning in Training Deep Networks (arXiv, code)
- Competence-based Curriculum Learning for Neural Machine Translation (arXiv)
- Improving Neural Machine Translation Models with Monolingual Data (arXiv)