################ PLEASE NOTE ################################################################
The current workflow may be unable to deal with multiple chained PDB bank protein structures
-
This Colab version notebook is currently experiencing some instability due to dependencies and incompatible issues. This will be solved later, for now, you are advised to do this Dock-MD-FEP separately with three independent notebooks and
-
The third stage is (This is only a test to assure yourself that the FEP is good to go, to run the actual job please use a GPU platform. Remember to change the platform to CPU during the test and back to CUDA in the real job).
-
The second stage is a free MD (https://github.com/pablo-arantes/making-it-rain/blob/main/Protein_ligand.ipynb) or at least an MD input generation process that you can finish from here ,
-
The very first stage you can finish from here ############################################################################################
Using the code locally, you only need to do the installation once so that you can skip the installation cell afterwards.
RTX30 series or a better GPU is recommended for the local computer.
This workflow requires a dedicated Python 3.8 environment before installation.
conda create --name Dock-MD-FEP python=3.8
conda activate Dock-MD-FEP
conda install jupyter --yes
(Dock-MD-FEP) jupyter lab Dock-MD-FEP_local_installation.ipynb
Several little things should be modified compared to using this online, like the path name etc. Smina instead of Gnia is used because of possible GPU incompatibilities. The rest of the code remains the same.
There might be an error from the first import cell, but it doesn't matter for the dock and MD simulation, it may only affect analysis later which you could do locally alternatively.
############################## Updated on 24/12/2022####################################
$Google_Drive_Path
is the path in your Google Drive where you want to store the simulation data, and you should provide it.
You can use this workflow purely for docking purposes, docking plus MD simulation, or docking, MD and absolute binding free energy calculation.
After you've imported Google Drive, just fill in the four lines that define your working path, providing your PDB ID and small molecule structure, and the rest will run automatically with just one click.
-- MD , all MD input use Amber Gaff2 force field to deal with small molecule.
Up until before FEP simulation, the workflow procedes with one MD (MD0), one docking(dock1), MD(1), then the second docking (docking2), and a final MD(MD2), it will cost a total 25 ns of MD simulation.
MD 0 , equilibration 1 ns, production 2 ns. OpenMM as simulation engine.
MD0 result in $Google_Drive_Path (PDB bank structures MD result)
1st docking result in $Google_Drive_Path and $Google_Drive_Path/MD1
MD 1, equilibration 2 ns, production 5.0 ns. OpenMM as simulation engine.
MD1 result in $Google_Drive_Path/MD1 (Docked small molecule MD result)
2nd docking result in $Google_Drive_Path/MD1 and $Google_Drive_Path/MD1/MD2
MD2, equilibration 5 ns, production 10 ns. OpenMM as a simulation engine.
MD2 result in $Google_Drive_Path/MD1/MD2 (Docked small molecule MD result)
Dock, gnina, --exaustiveness=200. The best pose with docking score and best CNN score was carried on to FEP
FEP, stop the simulation when reaching error < 0.1 KT, Simulation engine OpenMM with Yank python library.
Free energy of binding(benzene) : -11.229 +- 0.352 kT (-6.694 +- 0.210 kcal/mol)
Open the Dock-MD-FEP-restart.ipynb, after pasting the working directory path of last simulation. Import the drive manually, then run all the other cells at one time.
A CUDA version related error CUDA_ERROR_UNSUPPORTED_PTX_VERSION (222)
Please refer to #2 or the openMM issue link openmm/openmm#3585
Those with access to AI studio can try this workflow with automatic analysis. (Extend the simulation steps if you have enough GPU time)
The user is expected to upload the ligand.mol2 + starting_end.pdb to the default path, then submit the job to the queue.
https://aistudio.baidu.com/clusterprojectdetail/7677668