Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Start indexing while still reading the reference #436

Open
marcelm opened this issue Aug 22, 2024 · 0 comments
Open

Start indexing while still reading the reference #436

marcelm opened this issue Aug 22, 2024 · 0 comments

Comments

@marcelm
Copy link
Collaborator

marcelm commented Aug 22, 2024

I have incorporated strobealign into a pipeline and was looking at this log output:

This is strobealign 0.13.0
Estimated read length: 151 bp
Time reading reference: 8.06 s
Reference size: 3099.92 Mbp (195 contigs; largest: 248.96 Mbp)
Indexing ...
  Time counting seeds: 6.70 s
  Time generating seeds: 15.56 s
  Time sorting seeds: 13.83 s
  Time generating hash table index: 7.88 s
Total time indexing: 43.98 s

Reading the reference was quite slow, probably because this was run on a freshly allocated cluster node where the reference was not in the filesystem cache. In cases like these, it would save a couple of seconds if we started indexing the reference while we are still reading it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant