Skip to content

LukaLmelias/DeepCCA_thesis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

80 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

image

This project aims at predicting chemical structure of metabolites from LC-MS/MS spectra using Deep Canonical Correlation Analysis(DeepCCA). DeepCCA is a deep learning extension of CCA. This work is done in three phases as outlined below.

1. Data processing and Embeddings

In this notebook, we clean, intergrate and generate embeddings of structure and spectra dataset.

2. Model development

This notebook contains DeepCCA optimization codes.

3. Prediction and Evaluation for model development

Here we perfom a cross modal retrieval. It takes in the spectra embeddings and outputs the most likely structure of that spectra. Next we evaluate using Tanimoto scores whether the predicted structure is similar to the true structure.

Training the final model

After selecting best performing hyperparameters, we train the final model in this notebook

Final model predictions and Evaluation

The final model is used to predict the structures of query spectrum in this notebook

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published