This repository contains the data and the scripts used for the manuscript "Machine Learning Approach To Vertical Energy Gap in Redox Processes".
We suggest running the following command to create a conda environment called VEGPred
:
conda env create -f environment.yml
- The feature list calculated using Hartree-Fock (HF) and semi-empirical (EMP1) methods for each system are in their respective folders. For example,
- The HF features for QM cutoff 0.0 Å for benzene are in the
benzene/feature_list_0.0.dat
folder. - The EMP1 features for QM cutoff 7.5 Å for lumiflavin are in the
lumiflavin/EMP1_feature_list_7.5.dat
folder.
- The HF features for QM cutoff 0.0 Å for benzene are in the
- The workflow is organized as follows:
- Add ML models with parameter list in the
models_hyperparam_opt.py
file. - Run
models_hyperparam_opt.py > model_list.dat
to create list of best models for each system. - Select one ML model from
model_list.dat
and add it to the system dictionary inopt_model_test.py
. - Run
save_models_and_plot.py
to produce heatmaps, parity plots, and learning curves as presented in the manuscript.
- Add ML models with parameter list in the