
Transformer for English-to-Chinese Translation

This repository implements the transformer architecture introduced in the paper Attention Is All You Need. The implementation is adapted from a tutorial by Umar Jamil. More details can be found in Project Report.pdf.
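For orientation, the overall shape of such an encoder-decoder model can be sketched with PyTorch's built-in transformer module. This is an illustration of the architecture only, not the repository's actual code; the hyperparameters are the base defaults from the paper, and positional encodings are omitted for brevity.

import torch
import torch.nn as nn

class TranslationTransformer(nn.Module):
    def __init__(self, src_vocab, tgt_vocab, d_model=512, nhead=8,
                 num_layers=6, dim_ff=2048, dropout=0.1):
        super().__init__()
        self.src_embed = nn.Embedding(src_vocab, d_model)
        self.tgt_embed = nn.Embedding(tgt_vocab, d_model)
        self.transformer = nn.Transformer(
            d_model=d_model, nhead=nhead,
            num_encoder_layers=num_layers, num_decoder_layers=num_layers,
            dim_feedforward=dim_ff, dropout=dropout, batch_first=True)
        self.proj = nn.Linear(d_model, tgt_vocab)

    def forward(self, src, tgt):
        # Causal mask keeps the decoder from attending to future tokens.
        tgt_mask = self.transformer.generate_square_subsequent_mask(
            tgt.size(1)).to(tgt.device)
        out = self.transformer(self.src_embed(src), self.tgt_embed(tgt),
                               tgt_mask=tgt_mask)
        return self.proj(out)  # per-token logits over the target vocabulary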

The model is trained on the English-Chinese section of the OPUS-100 dataset on Hugging Face.

  • Training Data: 1 million samples
  • Validation Data: 2,000 samples
  • Test Data: 2,000 samples
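For reference, these splits can be pulled straight from the Hugging Face Hub with the datasets library. A minimal sketch; the dataset identifier follows the Hub's current naming and may differ from what the repository's scripts use:

from datasets import load_dataset

# English-Chinese portion of OPUS-100; each example is a
# {"translation": {"en": ..., "zh": ...}} sentence pair.
ds = load_dataset("Helsinki-NLP/opus-100", "en-zh")
print(ds["train"][0]["translation"])
print(len(ds["train"]), len(ds["validation"]), len(ds["test"]))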

Installation

Requirements

  • Hardware: The model was trained and evaluated on an NVIDIA RTX 3070 GPU. Hardware with lower specifications may still work, but training will likely take longer.

  • System: Ubuntu 22.04

Dependency Installation

  • Anaconda: Download and install Anaconda.

Package Installation

Create and activate a new Conda environment named transformer:

conda create -n transformer python=3.10
conda activate transformer

Clone the repository from GitHub:

git clone https://github.com/QZJGeorge/STATS507-Final-Project.git

Install the required Python packages:

cd STATS507-Final-Project
pip install -r requirements.txt

Model Training

For detailed training parameters, refer to config.py. To start training, run the following command, which trains the transformer model and saves its weights:

python3 train.py
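The exact settings live in config.py. Purely as a hypothetical illustration of the kinds of parameters involved (the names and values below are assumptions, not the repository's actual configuration):

# Hypothetical example only -- see config.py for the real values.
config = {
    "batch_size": 8,      # reduce this if you run out of GPU memory
    "num_epochs": 20,
    "lr": 1e-4,
    "seq_len": 350,       # maximum tokens per sentence
    "d_model": 512,       # embedding dimension from the original paper
    "lang_src": "en",
    "lang_tgt": "zh",
}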

Alternatively, you can skip training by downloading our pre-trained model and placing it in the Helsinki-NLP/opus-100_weights folder.
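Once the checkpoint is in place, it can be restored with torch.load. A minimal sketch, assuming model is the transformer instance defined by the repository; the filename and the state-dict key are placeholders, not the checkpoint's actual names:

import torch

# Placeholder path -- substitute the downloaded checkpoint's filename.
state = torch.load("Helsinki-NLP/opus-100_weights/checkpoint.pt",
                   map_location="cpu")
model.load_state_dict(state["model_state_dict"])  # key name is an assumption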

Model Evaluation

The validation and test datasets were combined, resulting in a total of 4,000 samples. The model's performance is evaluated using BERTScore.

To run the evaluation, execute the following command:

python3 evaluate.py
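For reference, BERTScore can also be computed directly with the bert-score package. A minimal sketch with placeholder sentence pairs:

from bert_score import score

cands = ["今天天气很好。"]    # placeholder model output
refs = ["今天的天气很好。"]   # placeholder reference translation
P, R, F1 = score(cands, refs, lang="zh")
print(f"Precision={P.mean().item():.3f}  "
      f"Recall={R.mean().item():.3f}  F1={F1.mean().item():.3f}")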

Our pre-trained model achieved a BERTScore precision of 0.795, recall of 0.766, and F1 of 0.779.

Troubleshooting

If you encounter the error torch.cuda.OutOfMemoryError, open config.py and reduce the batch size.

Developer

Zhijie Qiao [email protected]

License

Distributed under the MIT License.
