Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

BUG: gtdb taxonomy parser #191

Merged
merged 2 commits into from
Jun 10, 2024
Merged

Conversation

nbokulich
Copy link
Collaborator

Evidently #169 introduced a bug into the way taxonomy is parsed from GTDB files. Taxonomy labels were being split on whitespace, so only the genus name was listed under the species rank.

This fixes this issue by also including the species label, and introduces a simple test for the parser, just in case the taxonomy format changes in the future.

I manually tested and confirmed that this works with versions 202, 214, 220. The taxonomy format is consistent — see the test for the structure.

FYI @mikerobeson

@nbokulich nbokulich requested a review from misialq June 7, 2024 13:39
Copy link
Contributor

@misialq misialq left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good, thanks @nbokulich!

@nbokulich
Copy link
Collaborator Author

is it worth patching the latest release? @lizgehret ?

@nbokulich nbokulich added the bug Something isn't working label Jun 7, 2024
@nbokulich nbokulich merged commit 4bf4576 into bokulich-lab:master Jun 10, 2024
4 checks passed
lizgehret pushed a commit that referenced this pull request Jun 20, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants