Increasing the number of threads #7

Open
yliasolom opened this issue Nov 22, 2022 · 2 comments
Comments

@yliasolom

Good afternoon! Could you please help? I have a large Avena genome, about 3 Gb in size. When I run reasonaTE, it takes a huge amount of time. Could you give me a clue on how to increase the number of threads for running this tool? Thank you in advance for your response.

@DerKevinRiehl
Owner

DerKevinRiehl commented Nov 26, 2022

Dear Ylia Solomennikowa,
To help you better, I need to know which step of reasonaTE you are at.
Are you still at Step 2) Annotate genome with annotation tools? My answers below are for step 2:

I see two ways to speed things up, depending on your environment (you are probably working on a Linux cluster?):

  1. You could run reasonaTE in single-tool mode and run the individual tools in parallel, each in its own thread or SSH session.

This means that, instead of writing

conda activate transposon_annotation_tools_env
reasonaTE -mode annotate -projectFolder workspace -projectName testProject -tool all

you could run the tools in parallel with the "&" operator. However, you need to wait until all sessions have finished (for example with the shell's "wait" builtin) before you move on to the next steps:

conda activate transposon_annotation_tools_env
reasonaTE -mode annotate -projectFolder workspace -projectName testProject -tool helitronScanner &
reasonaTE -mode annotate -projectFolder workspace -projectName testProject -tool ltrHarvest &
reasonaTE -mode annotate -projectFolder workspace -projectName testProject -tool mitefind &
reasonaTE -mode annotate -projectFolder workspace -projectName testProject -tool mitetracker &
reasonaTE -mode annotate -projectFolder workspace -projectName testProject -tool must &
reasonaTE -mode annotate -projectFolder workspace -projectName testProject -tool repeatmodel &
reasonaTE -mode annotate -projectFolder workspace -projectName testProject -tool repMasker &
reasonaTE -mode annotate -projectFolder workspace -projectName testProject -tool sinefind &
reasonaTE -mode annotate -projectFolder workspace -projectName testProject -tool sinescan &
reasonaTE -mode annotate -projectFolder workspace -projectName testProject -tool tirvish &
reasonaTE -mode annotate -projectFolder workspace -projectName testProject -tool transposonPSI &
reasonaTE -mode annotate -projectFolder workspace -projectName testProject -tool NCBICDD1000 &
  2. The tools RepeatMasker and RepeatModeler in particular take a lot of time. You could also install and run them separately, using their own parallelization options (this depends on your environment). After they have finished, copy their output into the reasonaTE folder structure as described on the reasonaTE page.
    According to: https://blaxter-lab-documentation.readthedocs.io/en/latest/repeatmodeler.html
    you could run RepeatMasker in parallel with the "-pa" flag.
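
The first suggestion (one background job per tool, then wait for all of them) could be scripted roughly as follows. This is only a sketch, not part of reasonaTE: the function name run_annotation_tools, the REASONATE_CMD override and the annotate_<tool>.log file names are assumptions made for illustration, while the reasonaTE arguments are the ones from the commands above.

```shell
# Sketch: start one reasonaTE annotation job per tool in the background,
# then wait for every job and report the tools that failed.
# REASONATE_CMD is a hypothetical override (defaults to reasonaTE) so the
# loop can be dry-run with a stand-in command.
# The body runs in a subshell "( ... )" so its variables do not leak out.
run_annotation_tools() (
    project_folder=$1
    project_name=$2
    shift 2
    cmd=${REASONATE_CMD:-reasonaTE}

    started=""
    for tool in "$@"; do
        # One background job per tool; output goes to a per-tool log file
        # (log file name is an assumption, not a reasonaTE convention).
        "$cmd" -mode annotate -projectFolder "$project_folder" \
            -projectName "$project_name" -tool "$tool" \
            > "annotate_${tool}.log" 2>&1 &
        started="$started $!:$tool"
    done

    failed=0
    for job in $started; do
        pid=${job%%:*}
        tool=${job#*:}
        # wait returns the exit status of that specific background job.
        if ! wait "$pid"; then
            echo "annotation failed for tool: $tool" >&2
            failed=1
        fi
    done
    exit "$failed"
)
```

After activating the conda environment, it would be called for example as: run_annotation_tools workspace testProject helitronScanner ltrHarvest mitefind mitetracker. The function returns non-zero if any tool failed, so the next steps can be skipped in that case. Note that it starts all listed tools at once, so memory and CPU use add up.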

Please let me know which step you are at, and we can find ways to speed it up.
I hope this already helps.
Best regards, and looking forward to your answer, Kevin

@Ramkyeri

Ramkyeri commented Apr 3, 2023

Dear Ylia Solomennikowa,

I have an issue installing reasonaTE. How did you install RepeatMasker and RepeatModeler?
Docker uses all of the memory.
