Completeness and contamination missing from fetch_ncbi.py #80

amardeepranu · 2023-12-19T22:33:13Z

Are we supposed to compile this data ourselves? fetch_ena.py seems to generate this data, but fetch_ncbi.py does not.

amardeepranu · 2023-12-19T22:51:13Z

Ah, looks like it's done here: https://github.com/EBI-Metagenomics/genomes-pipeline/blob/853487f6dda1420fd8b6b41dd4aff5c8540c7e37/subworkflows/prepare_data.nf#L27-L29

Is there a reason the CHECKM step isn't done for the ENA data as well, specifically for data that isn't pulled using the fetch_ena.py script?

amardeepranu · 2023-12-19T23:02:49Z

https://github.com/EBI-Metagenomics/genomes-pipeline/blob/853487f6dda1420fd8b6b41dd4aff5c8540c7e37/workflows/genomes_annotation.nf#L90-L99

I also see here that all NCBI data is ignored. Can I edit this to include NCBI data without issue? Thanks

tgurbich · 2024-01-16T10:33:44Z

Hi @amardeepranu,

All genomes, regardless of where they were downloaded from, should be passed to the pipeline using the --ena_genomes flag. If any of your genomes were fetched using the fetch_ncbi.py script, you need to run CheckM on them and combine all completeness and contamination results into one file for all ENA and NCBI genomes. Pass the path to this combined file to the pipeline using the --ena_genomes_checkm flag.

We will adjust this in future releases to avoid confusion.
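
For anyone following the advice above, the combining step could look something like this. It is only a sketch: it assumes both CheckM result files are already CSVs with a `genome,completeness,contamination` header (please check this against the file you have for your ENA genomes), and the function name `combine_checkm` and the file names are made up for illustration, not part of the pipeline.

```python
import csv

# Assumed column layout; verify it against your existing ENA CheckM file.
HEADER = ["genome", "completeness", "contamination"]


def combine_checkm(inputs, output):
    """Merge several CheckM CSVs into one, keeping the first row seen per genome.

    Returns the number of distinct genomes written.
    """
    seen = {}
    for path in inputs:
        with open(path, newline="") as handle:
            reader = csv.DictReader(handle)
            missing = set(HEADER) - set(reader.fieldnames or [])
            if missing:
                raise ValueError(f"{path} is missing columns: {sorted(missing)}")
            for row in reader:
                # setdefault keeps the earliest entry if a genome appears twice
                seen.setdefault(row["genome"], [row[col] for col in HEADER])

    with open(output, "w", newline="") as handle:
        writer = csv.writer(handle)
        writer.writerow(HEADER)
        writer.writerows(seen.values())
    return len(seen)


if __name__ == "__main__":
    # Hypothetical file names: one CSV per source, merged into a single file.
    total = combine_checkm(["ena_checkm.csv", "ncbi_checkm.csv"], "combined_checkm.csv")
    print(f"Wrote {total} genomes to combined_checkm.csv")
```

The resulting `combined_checkm.csv` is the file whose path would go to `--ena_genomes_checkm`. Running CheckM on the NCBI genomes and getting its output into this CSV layout is a separate step that this sketch does not cover.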
