GitHub - espoirMur/medical-image-captionning: medical image captioning with pytorch

Medical Image Captioning

This is the code for the submission for the Medical Image Captioning Challenge.

This project was done by a team of Big Data and Text Analytics Msc Student at the university of Essex as part of the group project course.

Our Solution

Nothing is said that has not been said before. Terence (ca 195 - 159 BC ) The Eunuch Prol.

Solve this challenge we used and Encoder Decoder Architecture.

Encoder

The encoder consist of a pretrained reset50 model to which we removed the final layers and add a batch normalization to return a feature vector of size 512 for each image.

Decoder

The decoder is a 8 layers LSTM that takes the images representation and the and predict the corresponding captions.

Training

We trained the model using 30 epoch, and using the cross entropy loss .

The result yield a bleu score of 0.2. on our validation set.

We did not train the encoder cnn, but we trained only the decoder LSTM.

Instructions to reproduce the results

Installing packages:

We have used Python3.8.5 and poetry to install the packages and the dependencies.

If you have poetry installed you can simply run the following command:

poetry shell To create the project virtual environment.

Run :

poetry install

Download the dataset.

The dataset is large and we used dagshub and dvc to version control it.

Add the remote with the following command:

dvc remote add origin https://dagshub.com/espoirMur/image-clef-2022-essex-submission.dvc

Pull the dataset from the remote repository using the following command :

dvc pull -r origin

Go grab a coffee and wait for the dataset to be downloaded.

Check the folder data/raw/training-images for the training images and data/raw/validation-images for the validation images.

The corresponding captions are in the data/raw/caption-prediction folders.

Setup MLFLOW to reproduce the experiments:

We used the MLflow to reproduce the experiments.

You need to have a daghubs account or mlflow account to run the experiments. Once you have let say a dagshub account you can run the following command to log your experiments:

MLFLOW_TRACKING_URI=your-daghubs-url\
MLFLOW_TRACKING_USERNAME=username-from-dagshub \
MLFLOW_TRACKING_PASSWORD=your_token \

Training the model

The code for training the model is saved under src/models/train_model.py check it out and edit the hyperparameters to your liking.

If everything is okay for you then run the following command:

python src/models/train_model.py and wait for the model to train.

Visualize the results

To visualize the results of the training of the model, please refer to the following notebook:

playground-file

Did I miss something?:

Please let us know , if you have any other questions or suggestions.

References:

Here are the majors references :

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
.dvc		.dvc
docs		docs
evaluation/ImageCLEF-ConceptDetection-Evaluation		evaluation/ImageCLEF-ConceptDetection-Evaluation
minutes		minutes
mlruns/0		mlruns/0
models		models
notebooks		notebooks
references		references
reports		reports
src		src
.DS_Store		.DS_Store
.dvcignore		.dvcignore
.gitignore		.gitignore
.python-version		.python-version
ImageCLEF-ConceptDetection-Evaluation.zip		ImageCLEF-ConceptDetection-Evaluation.zip
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
checkpoints.dvc		checkpoints.dvc
data.dvc		data.dvc
image-2.png		image-2.png
poetry.lock		poetry.lock
predict-playground.ipynb		predict-playground.ipynb
predict.py		predict.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.py		setup.py
test_environment.py		test_environment.py
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Medical Image Captioning

Our Solution

Encoder

Decoder

Training

Instructions to reproduce the results

Installing packages:

Download the dataset.

Setup MLFLOW to reproduce the experiments:

Training the model

Visualize the results

Did I miss something?:

References:

About

Releases

Packages

Languages

License

espoirMur/medical-image-captionning

Folders and files

Latest commit

History

Repository files navigation

Medical Image Captioning

Our Solution

Encoder

Decoder

Training

Instructions to reproduce the results

Installing packages:

Download the dataset.

Setup MLFLOW to reproduce the experiments:

Training the model

Visualize the results

Did I miss something?:

References:

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages