Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Downsampling step that makes hard links for bams that are already lower coverage than the target triggers re-runs when overlap clipped bai's have a later modification date than the bams #12

Open
eriqande opened this issue May 31, 2024 · 0 comments

Comments

@eriqande
Copy link
Owner

CH brought this to my attention. When and if things need to be re-run, if the BAI has a later modification time than the BAM, then the hard-linked files at this step:

https://github.com/eriqande/mega-non-model-wgs-snakeflow/blob/main/workflow/rules/downsample.smk#L54-L57

trigger re-runs because all the output files have to be later than all the input files, and the BAI's for input are later than the BAMs for output.

I had done the hard linking just to save space, but it is clear that the easy way to fix this is to just cp then bams and the bais (i.e. cp instead of ln there).

In the meantime, touching all the offending bams and bais to have the same modification date (which is before any other downstream products) can get around triggering a re-run because of this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant