Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Does mpg support no cleavage? #609

Open
lydiayliu opened this issue Nov 13, 2022 · 7 comments
Open

Does mpg support no cleavage? #609

lydiayliu opened this issue Nov 13, 2022 · 7 comments
Assignees
Labels
enhancement New feature or request

Comments

@lydiayliu
Copy link
Collaborator

But for immunopeptidome, we known those peptides have affinity to HLA, so moPepGen would be a perfect fit for generating custom DB to search for it.

I've been wondering about what you said here. I think some tweaks are required for mpg to generate neoantigens. I just realized that I used trypsin as the enzyme for calling neoantigens. Even though MHCFlurry can chop the peptides generated by mpg into arbitrary shorter peptides (could be bad if the fragments dont actually contain the variant), it would probably be more ideal to have a "no cleavage" or "native peptides" mode for mpg (and turn off the chopping function in MHCFlurry if possible :P)

@zhuchcn
Copy link
Member

zhuchcn commented Nov 14, 2022

Not impossible. Instead of creating the peptide cleavage graph, we can create a peptide kmer graph. But this would need a lot of work, because we'll have to create a new model for this kind of graph.

@lydiayliu
Copy link
Collaborator Author

Why don't we just forget about the cleavage graph and go straight from the peptide variant graph? Traverse the graph for each length of peptide, or each "frame" of peptide

@zhuchcn
Copy link
Member

zhuchcn commented Nov 16, 2022

I think that should work! We then don't need to worry about the miscleavages. Just need to write a new traverser.

@lydiayliu lydiayliu added the enhancement New feature or request label Feb 8, 2023
@hsiaoyi0504
Copy link

Is there any update for usage of generating customized database for immunopeptidome?

@zhuchcn
Copy link
Member

zhuchcn commented Oct 16, 2024

Thanks for being interested in moPepGen @hsiaoyi0504. My plan now is generating non-canonical peptides with at least 1 variant event, and with up to X number of consecutive reference amino acids. X will be the max length of peptides you would usually expect from your MS data. This should capture all the possible non-canonical peptides, but will require some extensive work.

@zhuchcn zhuchcn self-assigned this Oct 16, 2024
@hsiaoyi0504
Copy link

hsiaoyi0504 commented Nov 5, 2024

Or instead of getting peptides from callVariant, can we also have a function to get the protein fasta directly? I think proteomics search software could generate non redundant database based on that as well.

@zhuchcn
Copy link
Member

zhuchcn commented Nov 6, 2024

Along with the FASTA file, callVariant generates a table containing the information per peptide entry, such as the amino acids carrying variant(s). So technically you can take the table and generate all peptides of 8-11 length harboring any variant. But note that depending on how this is done, some variant combinations may be lost.

I have a (still active) branch czhu-feat-no-cleavage for generating non-canonical peptides with no enzyme. Feel free to try it but I haven't run any fuzz test on it so it is not guaranteed to work.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants