Train Model from scratch with GPU

This is a simple script used to run this code from scratch. Using the default settings, you can get macro f1 score 0.70769.

The data preprocess steps are no the same as the one I used during the competition, so the final f1 score may be not the same:

Some major differences:

Modify the file paths in preprocess.sh:

Refer to 1 to get the data file path

Refer to 2 to get the embedding file path

You can try different vocab size

Then run

bash preprocess.sh

We will create all the files needed under ./data folder.

Change your workdir to parent folder, and run the training scripts:

bash bash/elmo_train.sh

After training, we can get the predicted results of test files:

bash bash/elmo_inference.sh

Provide feedback