Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

confusing contact_head in RnaFmPreTrainingHeads #1

Open
xzdlj opened this issue Dec 1, 2024 · 3 comments
Open

confusing contact_head in RnaFmPreTrainingHeads #1

xzdlj opened this issue Dec 1, 2024 · 3 comments

Comments

@xzdlj
Copy link

xzdlj commented Dec 1, 2024

Thank you very much for providing such an excellent library!

As stated in the README and the original paper, the only pretraining dataset for the RnaFM model consists of sequences from RNAcentral, which do not include labels for contact prediction. However, I noticed that the RnaFmPreTrainingHeads class includes a contact_head that appears to require labels to compute its loss. This is quite confusing to me.

If I have misunderstood or missed anything, please feel free to point it out.

@ZhiyuanChen
Copy link
Member

Thank you for raising the very first issue regarding MultiMolecule.

Unfortunately, I don’t have a definitive answer at the moment. We’ve made every effort to ensure that our converted checkpoint matches the official one provided by the authors. From what I know, the official checkpoint does include a concat_head.

If I had to take an educated guess, I would say the authors simply copied the structure from ESM and may have forgotten to remove the concat_head.

@xzdlj
Copy link
Author

xzdlj commented Dec 3, 2024

Many thanks for the response!

If I had to take an educated guess, I would say the authors simply copied the structure from ESM and may have forgotten to remove the concat_head.

I believe the guess makes a lot of sense.
I’ve also raised the issue in the official repository, so hopefully, we might receive a conclusive answer soon.

@ZhiyuanChen
Copy link
Member

No worries! I hope you’ve found MultiMolecule useful and helpful.

And don’t forget to let me know if you think anything can be added—new models, datasets, tools, or features for training and fine-tuning neural networks.
Anything you find interesting is welcome! 🤔

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants