About the datasets/docking.csv #12

Open
Lyu6PosHao opened this issue Oct 15, 2024 · 2 comments
Comments

@Lyu6PosHao

Hello, thanks for your great work.

I have a question: in both the paper and the tutorial, the number of SMILES in docking.csv is reported as 152296. However, docking.csv actually contains 105338 SMILES.

This is a little confusing. Is it a typo?
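For anyone who wants to reproduce the count, here is a minimal sketch using only the Python standard library. The column name `smiles` is an assumption about the file's header, not something confirmed in this thread; pass a different `column` if the header differs.

```python
import csv


def count_smiles(path, column="smiles"):
    """Return (non-empty entries, unique entries) for `column` in a CSV file.

    The default column name "smiles" is an assumption about the header.
    """
    total = 0
    unique = set()
    with open(path, newline="") as handle:
        reader = csv.DictReader(handle)
        if reader.fieldnames is None or column not in reader.fieldnames:
            raise KeyError(f"column {column!r} not found in {reader.fieldnames}")
        for row in reader:
            # Short rows yield None for missing fields, so fall back to "".
            value = (row[column] or "").strip()
            if value:
                total += 1
                unique.add(value)
    return total, len(unique)
```

Calling `count_smiles("datasets/docking.csv")` returns the number of non-empty entries and the number of distinct strings, which also shows whether duplicates contribute to the count.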

By the way, is the score in docking.csv calculated by mode=qvina or mode=smina? Which mode should I choose?

Thanks

@akshat998
Collaborator

Hi @Lyu6PosHao,

Yes, that was a typo! During the revisions, we added additional filters to enhance the synthesizability of the molecules, which reduced the number of molecules. It looks like we overlooked updating this number in the manuscript.

All generative models were run with QVina, and Smina was used to re-score the top molecule.
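The two-stage pattern described above (search with one scorer, then re-score the best candidate with another) can be sketched roughly as follows. This is only an illustration: `search_score` and `rescore` are hypothetical callables standing in for a QVina-style and a Smina-style scoring call, and the sketch assumes that lower (more negative) scores are better.

```python
from typing import Callable, Iterable, Tuple


def search_then_rescore(
    candidates: Iterable[str],
    search_score: Callable[[str], float],  # hypothetical fast scorer
    rescore: Callable[[str], float],       # hypothetical second scorer
) -> Tuple[str, float, float]:
    """Pick the best candidate by `search_score`, then re-score only that one.

    Assumes lower scores are better.
    """
    # Score every candidate once with the fast scorer.
    scores = {smiles: search_score(smiles) for smiles in candidates}
    if not scores:
        raise ValueError("no candidates given")
    best = min(scores, key=scores.get)
    # Only the top candidate is passed to the second scorer.
    return best, scores[best], rescore(best)
```

Re-scoring only the top candidate keeps the second scorer out of the search loop, so it is called once per run.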

Just a quick side note: for the docking tasks, we recommend running calculations for 1SYH and 4LDE. We’re currently re-working 6Y2F since the calculations aren’t very stable :)

Thanks!
Akshat

@Lyu6PosHao
Author

I see, thanks a lot!
