The following two repos were also part of my project. All three are required to repeat my steps:
Prediction Of Binding Affinity: prediction of DTI
ChemVAE Fork: encoding of the small molecules
Additionally, PortT5 is needed:
[ProtT5] (https://github.com/agemagician/ProtTrans): encoding of the proteins
- Clone this repo
- Download the MayaChemTools collection, and place the folder next to the folder of this repo (or you can specify another location in the config file)
- Install CD-Hit
- Ensure the requirements are met, we found that the best way is to create an RDKit environment and install the other requirements to it
- Check in the src/config.ini file if all files are chosen correctly, including the raw data file
- Run by using "python main.py"
BindingDB: Database of DTI data
RDKit: Open-Source Cheminformatics Software, install using the information in the following link https://www.rdkit.org/docs/Install.html
MayaChemTools: collection of Perl and Python scripts, modules, and classes to support a variety of day-to-day computational discovery needs
CD-Hit: a widely used program for clustering and comparing protein or nucleotide sequences