diff --git a/README.md b/README.md index 2c370d9..be6caaa 100644 --- a/README.md +++ b/README.md @@ -126,6 +126,12 @@ from DeepPurpose import DTI as models from DeepPurpose.utils import * from DeepPurpose.dataset import * +SAVE_PATH='./saved_path' +import os +if not os.path.exists(SAVE_PATH): + os.makedirs(SAVE_PATH) + + # Load Data, an array of SMILES for drug, an array of Amino Acid Sequence for Target and an array of binding values/0-1 label. # e.g. ['Cc1ccc(CNS(=O)(=O)c2ccc(s2)S(N)(=O)=O)cc1', ...], ['MSHHWGYGKHNGPEHWHKDFPIAKGERQSPVDIDTH...', ...], [0.46, 0.49, ...] # In this example, BindingDB with Kd binding score is used. @@ -182,6 +188,13 @@ from DeepPurpose import CompoundPred as models from DeepPurpose.utils import * from DeepPurpose.dataset import * + +SAVE_PATH='./saved_path' +import os +if not os.path.exists(SAVE_PATH): + os.makedirs(SAVE_PATH) + + # load AID1706 Assay Data X_drugs, _, y = load_AID1706_SARS_CoV_3CL() @@ -312,7 +325,7 @@ model.train(train, val, test) ### Case Study 2 (a): Antiviral Drugs Repurposing for SARS-CoV2 3CLPro, using One Line. - Given a new target sequence (e.g. SARS-CoV2 3CL Protease), retrieve a list of repurposing drugs from a curated drug library of 81 antiviral drugs. The Binding Score is the Kd values. Results aggregated from five pretrained model on BindingDB dataset! (Caution: this currently is for educational purposes. The pretrained DTI models only cover a small dataset and thus cannot generalize to every new unseen protein. For best use case, train your own model with customized data.) + Given a new target sequence (e.g., SARS-CoV2 3CL Protease), retrieve a list of repurposing drugs from a curated drug library of 81 antiviral drugs. The Binding Score is the Kd values. Results aggregated from five pretrained model on BindingDB dataset! (Caution: this currently is for educational purposes. The pretrained DTI models only cover a small dataset and thus cannot generalize to every new unseen protein. For the best use case, train your own model with customized data.)
Click here for the code! @@ -348,7 +361,7 @@ Drug Repurposing Result for SARS-CoV2 3CL Protease ### Case Study 2(b): Repurposing using Customized training data, with One Line. -Given a new target sequence (e.g. SARS-CoV 3CL Pro), training on new data (AID1706 Bioassay), and then retrieve a list of repurposing drugs from a proprietary library (e.g. antiviral drugs). The model can be trained from scratch or finetuned from the pretraining checkpoint! +Given a new target sequence (e.g., SARS-CoV 3CL Pro), training on new data (AID1706 Bioassay), and then retrieve a list of repurposing drugs from a proprietary library (e.g., antiviral drugs). The model can be trained from scratch or finetuned from the pretraining checkpoint!
Click here for the code! @@ -468,7 +481,7 @@ DeepPurpose supports the following dataset loaders for now and more will be adde |SARS-CoV2 3to5_exonuclease|```load_SARS_CoV2_3to5_exonuclease()```| |SARS-CoV2 endoRNAse|```load_SARS_CoV2_endoRNAse()```| -DeepPurpose also supports to read from users' txt file. It assumes the following data format. +DeepPurpose also supports reading from users' txt file. It assumes the following data format.
Click here for the format expected! @@ -529,7 +542,7 @@ from DeepPurpose import dataset X_drug, y = dataset.read_file_protein_function(PATH) ``` -For drug drug pairs: +For drug-drug pairs: ``` Drug1_SMILES Drug1_SMILES_ Score/Label Drug2_SMILES Drug2_SMILES_ Score/Label @@ -542,7 +555,7 @@ from DeepPurpose import dataset X_drug, X_target, y = dataset.read_file_training_dataset_drug_drug_pairs(PATH) ``` -For protein protein pairs: +For protein-protein pairs: ``` Target1_Seq Target1_Seq_ Score/Label Target2_Seq Target2_Seq_ Score/Label @@ -604,11 +617,11 @@ net = models.model_pretrained(model = 'MPNN_CNN_DAVIS') or net = models.model_pretrained(FILE_PATH) ``` -The list of avaiable pretrained models: +The list of available pretrained models: Model name consists of first the drug encoding, then the target encoding and then the trained dataset. -Note that for DTI models, the BindingDB and DAVIS are trained on log scale. But DeepPurpose allows you to specify conversion between log scale (e.g. pIC50) and original scale by the variable `convert_y`. +Note that for DTI models, the BindingDB and DAVIS are trained on the log scale. But DeepPurpose allows you to specify conversion between log scale (e.g., pIC50) and original scale by the variable `convert_y`.
Click here for the models supported!