The following figure illustrates the binding score distributions and quartiles in the BDB and KIBA datasets. A strong peak at
We create train/validation/test splits out of BDB and KIBA with warm and cold biomolecules (see the manuscript for more details). We report the average number of proteins, ligands, and interactions in the training and test sets in the following table, alongside standard deviations in the parentheses.
Dataset | Fold | #Proteins | #Ligands | #Interactions |
---|---|---|---|---|
BDB | Train | 403.4 |
740.8 |
17988.2 |
BDB | Validation | 355.0 |
170.0 |
1494.2 |
BDB | Warm | 354.4 |
179.6 |
1494.4 |
BDB | Cold Ligand | 376.0 |
84.8 |
2448.8 |
BDB | Cold Protein | 43.6 |
264.8 |
2360.0 |
BDB | Cold Both | 41.4 |
30.8 |
274.6 |
KIBA | Train | 200.6 |
1834.6 |
77264.4 |
KIBA | Validation | 193.0 |
1467.2 |
6650.2 |
KIBA | Warm | 192.0 |
1476.2 |
6650.6 |
KIBA | Cold Ligand | 193.0 |
140.0 |
6810.0 |
KIBA | Cold Protein | 14.6 |
1296.0 |
6259.6 |
KIBA | Cold Both | 14.0 |
100.2 |
468.6 |