Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Insertion counts for small insertions that change the reading frame #836

Open
CBeelen opened this issue May 10, 2022 · 0 comments
Open

Insertion counts for small insertions that change the reading frame #836

CBeelen opened this issue May 10, 2022 · 0 comments
Milestone

Comments

@CBeelen
Copy link
Contributor

CBeelen commented May 10, 2022

Sample JLAT1-HIV_S217 from run 190501_M05995 has an insertion of length 2 in GP120. Because this insertion is in the middle of a codon and changes the reading frame, there is a non-covered amino acid in amino.csv at this position (at reference amino position 60, query nuc positions 5753 and 5754 are insertions).
However, no insertion counts are recorded in amino.csv and nuc.csv. This is because the code that records insertions treats them as chunks of 3 bases, and therefore misses this small insertion of length 2, see here:
The best solution would probably be to switch the insertion counting to 1-based positions to better reflect insertions whose length is not a multiple of three.

@CBeelen CBeelen added this to the 7.16 milestone May 10, 2022
@CBeelen CBeelen modified the milestones: 7.16, 7.17 May 25, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant