BionoiNet is a deep learning-based framework for ligand-binding site classification. It transforms 3-d structures of ligand-binding sites into 2-d images, and then these images are processed by a deep neural network (DNN) to generate classification results. The pipeline of BionoiNet is shown below:
The 2-d images of ligand-binding sites are generated using the Bionoi software and the DNN is implemented with the APIs of Pytorch. The datasets are divided into 5 folds for cross-validation and the sequence identity between the training and validation sets is less than 20%.
Install the dependency package using this file: dependency/environment.yml
. This denpendency file is exported by Anaconda. To install the environment:
conda env create -f environment.yml
- bionoi: a software that transforms ligand-binding sites (.mol2 files) into Voronoi diagrams.
- bionoi_autoencoder: autoencoder that trained on Bionoi images in an un-supervised manner. The trained autoencoder can be used to produce latent space vectors of Bionoi images for machine learning applications.
- bionoi_cnn_homology_reduced: a convolutional neural network (CNN) trained on the Voronoi representations of ligand-binding sites for classification.
- dependency: dependency python packages.
- homology_reduced_folds: files containing folds such that the sequence identity between train and validation is less than 20%.
- lecacy: codes produced during research, not used in final version.
- Unzip the .mol2 files located at
/homology_reduced_folds/mols.zip
. - Download this repository and put it where you extract the .mol2 files. The layout of the project should be like following:
.
├── BionoiNet
└── homology_reduced_mols
├── control
├── fold_1
├── fold_2
├── ...
└── fold_5
├── heme
├── fold_1
├── fold_2
├── ...
└── fold_5
└── nucleotide
├── fold_1
├── fold_2
├── ...
└── fold_5
- Run Bionoi by executing
/bionoi/img_gen.sh
to transform .mol2 files to images. - Train the convolutional neural network (CNN) at
/bionoi_cnn_homology_reduced/
for cross-validation:
python homology_reduced_cnn_cv_resnet18.py -op control_vs_nucleotide -root_dir ../../bionoi_output/residue_type/ -batch_size 32 -result_file_suffix 1st
- Grab a cup of coffee and wait for results.
If you find this work useful, please cite our papers in your work :)
Shi W, Lemoine JM, Shawky MA, Singha M, Pu L, Yang S, Ramanujam J, Brylinski M. (2020) BionoiNet: Ligand-binding site classification with off-the-shelf deep neural network. Bioinformatics 36 (10): 3077-3083.
Feinstein J, Shi W, Ramanujam J, Brylinski M. Bionoi: A Voronoi Diagram-Based Representation of Ligand-Binding Sites in Proteins for Machine Learning Applications. Methods Mol Biol. 2021;2266:299-312. doi: 10.1007/978-1-0716-1209-5_17. PMID: 33759134.