-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
signal 11 error with Bowtie2 hg38 alignment #300
Comments
Based on the error message:
It leads me to believe that the steps preceding You could run the first part of the command manually and see if it offers further insight:
I do not believe you need to reinstall PEPATAC. However, given that the above command does rely on genome assets via refgenie, it may be worth clearing them and re-pulling them using the instructions here: https://pepatac.databio.org/en/latest/run-conda/ |
Hi @oliviacwhite, did you have any luck with this? |
Hi @donaldcampbelljr, I was able to get my bowtie2 command to work correctly. Thank you so much for your help. I ended up removing the refgenie assets and re-pulling / building them. I used a slurm job submission with bowtie2_index which allowed it to be properly pulled and not kill the command from lack of memory. This helped everything and after renaming some files to fix an error in my looper run, I was able to get it to run successfully all the way through on the example data. Unfortunately, I have now run into another problem I can't seem to fix. Now I am trying to get the summary directory after the looper run by running
I can see the error here clearly, as the command includes --config None when the config should be leading to the appropriate file /path/to/pepatac/examples/test_project/test_config_refgenie.yaml . I would appreciate help determining why the looper runp command does not find the configuration .yaml file, as I would expect it to. That being said, I tried a work around using the following command that specifies the --config pathway:
Now, I can't seem to figure out what is going wrong here (something in the pepatac_collator.py script or in the stats.yaml file that was generated after the normal looper run where the sample names are not recognized?) and do not know how to fix this specific error. Any help is greatly appreciated! |
Hi @oliviacwhite, I suspect the test folder might actually be out of date and contain config files that were useable only with older looper versions. Could you check out the tutorial folder, specifically the .looper_tutorial_refgenie.yaml file: And ensure you have looper 1.6.0. installed: Example commands: I can look at this sometime after the holidays (next week). Thanks. |
So, if I understand correctly, I should try the tutorial (after downloading the correct tutorial data files), as the test_project configuration files may be out of date to work with looper? Thank you for your help! If you have more time after the holidays, I'd appreciate it. |
Yes, I recommend trying to get the tutorial to work with the requirements listed in the |
Hi @donaldcampbelljr, I have worked on running
I don't understand what KeyError: tutorial2 means, or how I can fix it, so that the pipeline finds the proper samples so that it can make the summary folder from the looper run. Thanks! |
Just to confirm, did you run the sample-level items first using |
Yes, first I did
I am not sure if having DEFAULT_SAMPLE_NAME is causing issues, as that is also the name of the |
I'm wondering if you edit the yaml file and replace Also, which versions of pipestat and piper are you using? This looks like an older bug that was solved a few releases ago. |
Yes, replacing When I look at the summary and report files, it looks mostly good, however, when I look at the summary reports, it says STATUS:Missing, instead of STATUS:Completed, as it should. Additionally, a DEFAULT_SAMPLE_NAME record appears (see first screenshot), in addition to tutorial1 and tutorial2, which contains some files (generally those that are in the summary directory, such as Library complexity), which are likewise missing from the tutorial1 and tutorial2 reported statistics pages (see second screenshot). I have included some screenshots of what is happened on these pages. Re: pipestat and piper, these runs were done with pipestat version 0.11.0 and piper version 0.14.0. As the requirements.txt file of pepatac specifies pipestat==0.6.0, I installed that version instead and tried running the tutorial files again, but it made no difference and the DEFAULT_SAMPLE_NAME issues persisted. At this point, I am not positive that any essential information is really missing from my results, but I would prefer to have this optimized and without small errors before I run PEPATAC on my larger data set. |
I believe it cannot find the status because it was setting the sample_name to DEFAULT_SAMPLE_NAME and thus, the status flags have that name in the path instead of the correct name, i.e. The additional record_identifier is unfortunately a placeholder that is made during project summaries for older versions of Looper. Could you attach the PEPATAC_log.md file for one of those tutorial runs here? I might be able to see why its still using the default sample name. Another suggestion, though it will be a bit more work on your end: On the dev branch, we have a version of PEPATAC that is running with the latest versions of Looper, Pipestat, and Pypiper and have fixed some of these small bugs. If you would like, you could pull the development branch ( |
Here is one PEPATAC_log.md file. I will look into your other suggestion as well. |
Ok, looking at the log file, it shows that the pipeline was still actually using Piper (pypiper) v0.14.3. If you downgrade that to 0.14.0, the sample names should report correctly. |
Hey @oliviacwhite, Did you have any luck with the above suggestions? |
I am trying to run the PEPATAC using conda. Currently, I am at the looper run examples/test_project/test_config_refgenie.yaml step and it runs fine until it reaches the bowtie2 hg38 genome alignment, where it outputs a signal 11 error (attached below). I run this command using a slurm job so I don't think it is a memory availability issue. My only guess is that there is something wrong with the bowtie2_index as I pulled it from refgenie using refgenie pull hg38/bowtie2_index . The reason I think this is that it consistently downloaded the index (reaching 100%), but then said that it was killed. I think my bowtie2_index for hg38 is downloaded correctly, as it has the .bt2 files in the directory, however, I have to guess that something is going wrong that I don't understand. I can't think of what to try because of the signal 11 index besides completely removing refgenie and pepatac from my terminal and restarting, but I am worried about running into the same issues. Any help is greatly appreciated!
Command completed. Elapsed time: 0:00:00. Running peak memory: 0.103GB.
PID: 2971905; Command: samtools; Return code: 1; Memory used: 0.0GB
PID: 2971903; Command: bowtie2; Return code: 1; Memory used: 0.009GB
PID: 2971904; Command: samtools; Return code: 1; Memory used: 0.002GB
Child process 2971879 (perl) was already terminated.
Starting cleanup: 3 files; 3 conditional files for cleanup
Cleaning up flagged intermediate files. . .
Conditional flag found: []
These conditional files were left in place:
Pipeline failed at: (11-18 12:51:19) elapsed: 0.0 TIME
Total time: 0:00:06
Failure reason: Subprocess returned nonzero result. Check above output for details
Traceback (most recent call last):
File "/path/to/pepatac/pipelines/pepatac.py", line 2779, in
sys.exit(main())
File "/path/to/pepatac/pipelines/pepatac.py", line 1112, in main
pm.run([cmd, cmd2], rmdup_bam, follow=check_alignment_genome)
File "/path/to/miniconda3/envs/pepatac/lib/python3.9/site-packages/pypiper/manager.py", line 1036, in run
list_ret, maxmem = self.callprint(
File "/path/to/miniconda3/envs/pepatac/lib/python3.9/site-packages/pypiper/manager.py", line 1316, in callprint
self._triage_error(SubprocessError(msg), nofail)
File "/path/to/miniconda3/envs/pepatac/lib/python3.9/site-packages/pypiper/manager.py", line 2539, in _triage_error
self.fail_pipeline(e)
File "/path/to/miniconda3/envs/pepatac/lib/python3.9/site-packages/pypiper/manager.py", line 2009, in fail_pipeline
raise exc
pypiper.exceptions.SubprocessError: Subprocess returned nonzero result. Check above output for details
The text was updated successfully, but these errors were encountered: