Error because of large chromosomes #132

Closed
MargaridaMCM opened this issue Mar 11, 2022 · 3 comments

@MargaridaMCM

As part of the pipeline, Bowtie alignment files are indexed with samtools using the BAI format. The problem is that a BAI index can only handle chromosomes up to 512 Mbp (2^29 bp) long. Larger chromosomes need a CSI index instead (also produced by samtools), which does not have this hardcoded limit.

The marsupial opossum's chromosomes 1 and 2 are larger than 512 Mbp. The same is true for other marsupials and several plants.

Would it be possible to update the pipeline to use a CSI index instead of a BAI index? I realise the answer depends on which downstream tools use the .bai file. If the .bai file is only required for samtools' own use, the fix would be straightforward: pass the -c option when indexing the BAM file with samtools.
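To illustrate the limit described above, here is a minimal Python sketch that flags sequences too long for a BAI index. The function name and the example lengths are hypothetical and only for illustration; the 2^29 bp threshold is the 512 Mbp limit mentioned in the issue.

```python
# A BAI index covers coordinates up to 2**29 bp (the "512 Mbp" limit).
BAI_MAX_LENGTH = 2**29  # 536,870,912 bp


def sequences_exceeding_bai(chrom_lengths):
    """Return the names of sequences that are too long for a BAI index.

    chrom_lengths: dict mapping sequence name -> length in bp.
    """
    return [name for name, length in chrom_lengths.items()
            if length > BAI_MAX_LENGTH]


# Illustrative lengths only (hypothetical values, not a real assembly).
example_lengths = {
    "chr1": 700_000_000,
    "chr2": 540_000_000,
    "chr3": 500_000_000,
}

too_long = sequences_exceeding_bai(example_lengths)
print(too_long)  # ['chr1', 'chr2']
```

If the returned list is non-empty, the BAM file would need a CSI index (samtools index -c) rather than a BAI index.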

@MargaridaMCM MargaridaMCM added the bug Something isn't working label Mar 11, 2022
@lpantano
Contributor

Hi @MargaridaMCM, this is interesting. I have never had to work with chromosomes this big. Would you mind making the changes and seeing what happens? I am not sure whether all the tools are compatible with this...

@MargaridaMCM
Author

Hi @lpantano - I ran dev, forcing samtools to build a CSI index instead of a BAI index, and that solved the problem! The pipeline finished successfully, and all the expected output files were generated.

I forced CSI indexing by editing ../../nf-core/modules/samtools/index/main.nf and replacing samtools index with samtools index -c. I know this is not the proper way to do it in DSL 2; it should be specified as a parameter somewhere. But I don't feel confident (yet) with the DSL 2 syntax.
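Rather than hard-coding the flag, one option would be to choose it from the reference's sequence lengths. The following is a minimal Python sketch of that idea, not the pipeline's actual implementation: it reads a FASTA index (.fai, tab-separated, with the name in column 1 and the length in column 2) and adds -c only when some sequence exceeds the BAI limit. The function names and file names are hypothetical.

```python
import tempfile
from pathlib import Path

BAI_MAX_LENGTH = 2**29  # 512 Mbp limit of the BAI format


def read_fai_lengths(fai_path):
    """Parse a .fai file into a dict of sequence name -> length in bp."""
    lengths = {}
    with open(fai_path) as handle:
        for line in handle:
            fields = line.rstrip("\n").split("\t")
            if len(fields) >= 2:
                lengths[fields[0]] = int(fields[1])
    return lengths


def samtools_index_command(bam_path, fai_path):
    """Build (but do not run) a samtools index command as a list of arguments."""
    lengths = read_fai_lengths(fai_path)
    cmd = ["samtools", "index"]
    if any(length > BAI_MAX_LENGTH for length in lengths.values()):
        cmd.append("-c")  # request a CSI index instead of BAI
    cmd.append(str(bam_path))
    return cmd


# Demo with a throwaway .fai file holding made-up lengths.
with tempfile.TemporaryDirectory() as tmp:
    fai = Path(tmp) / "genome.fa.fai"
    fai.write_text("chr1\t700000000\t6\t60\t61\nchr2\t1000\t20\t60\t61\n")
    print(samtools_index_command("sample.bam", fai))
```

The returned list could be passed to subprocess.run, but the sketch stops short of executing samtools.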

@apeltzer apeltzer added this to the 2.4.0 milestone Aug 8, 2024
@apeltzer apeltzer added this to smrnaseq Aug 8, 2024
@atrigila atrigila self-assigned this Aug 27, 2024
@atrigila
Contributor

Closed via #391

@github-project-automation github-project-automation bot moved this from In progress to Done in smrnaseq Aug 27, 2024
nschcolnicov pushed a commit that referenced this issue Oct 10, 2024
Important! Template update for nf-core/tools v2.9