Publication link :
Code doi (Zenodo) : https://doi.org/10.5281/zenodo.13318653
Automated Infrared spectrum analysis to identify 17 functional groups using deep learning. At present our model can identify below mentioned groups. Our model can be extended to incorporate identification of other functional groups as well.
- alkane
- methyl
- alkene
- alkyne
- alcohols
- amines
- nitriles
- aromatics
- alkyl halides
- esters
- ketones
- aldehydes
- carboxylic acids
- ether
- acyl halides
- amides
- nitro
Our data pipeline processes Jcamp files and structures the data into dataframes.
Our published models were trained on NIST SRD 35 and Chemotion datasets.
We cannot provide NIST SRD 35 dataset as it is a commericial dataset.
We provide our open access Chemotion dataset which can be downloaded via Radar4Chem : https://radar4chem.radar-service.eu/radar/en/dataset/OGoEQGlsZGElrgst
-
Clone the repository:
git clone https://github.com/devpunjabi/ir_analysis_kit.git
-
Change to the project directory:
cd ir_analysis_kit
-
Create and activate an anaconda virtual environment:
conda create --name ir python=3.8 conda activate ir
-
Install the required dependencies:
pip install -r requirements.txt
- Navigate to data folder.
cd ir_analysis/data
-
Follow the instructions in data_preproc.ipynb to create preprocessed datasets.
-
Export dataset file as environment variable
export DF=~/data/preprocessed_dataset.pk
- Navigate to utils folder
cd ir_analysis/utils
-
Update config.json with desired parameters. Default parameters in the file belong to our best model.
-
Run K-fold cross validation with
python cv.py
- Alternative : train and test for 1 fold of the dataset
python train_test.py --config config.json --results
- Trained models are saved under models folder with experiment names mentioned in config.json
Contributions are welcome! Please follow these steps to contribute:
-Fork the repository.
-Create a new branch: git checkout -b feature-branch-name.
-Make your changes and commit them: git commit -m 'New feature'.
-Push to the branch: git push origin feature-branch-name.
-Open a pull request.
This project is licensed under the MIT License
Dev Punjabi - [email protected],
Nicole Jung - [email protected]
Institution : https://www.ibcs.kit.edu/