-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Workflow for large number of genomes (Population) #52
Comments
Hi @djakubosky, Thanks for reaching out, these are really good questions, especially for large cohorts. I've made some bullet points that might help guide you. If you find this helpful or come up with your own tricks I'd like to add them to a wiki.
|
Hi @zeeev One more thing- I will be able to assess quality of variants somewhat indirectly using their reproducibility in pairs of twins in my cohort- happy to share these results with you to give you something slightly more "formal" to illustrate these FN/FP rates- see manuscript here if interested. |
Yes mergeSV can be used to merge within, and then between. You mentioned that you have families in your cohort? I'd joint call closely related individuals (up to 3/4). --Zev P.S. nice paper! |
Thanks for the info! Is there a limit to how many VCFs can be merged with
regards to memory concerns?
…On Tue, Oct 1, 2019 at 3:15 PM Zev Kronenberg ***@***.***> wrote:
@djakubosky <https://github.com/djakubosky>,
Yes mergeSV can be used to merge within, and then between.
You mentioned that you have families in your cohort? I'd joint call
closely related individuals (up to 3/4).
--Zev
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#52?email_source=notifications&email_token=ADSS7NRISMLA7R753XVSOPTQMPDXRA5CNFSM4I27V4BKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAC534A#issuecomment-537255408>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/ADSS7NUELXJZZE6QJEEAPQDQMPDXRANCNFSM4I27V4BA>
.
|
Hi, I was curious about the suggested workflow for WHAMG on ~1K genomes. My assumption is that this is what you would want to do:
Does this sound reasonable? If this is the proposed approach might be helpful to add a little more detail in the wiki!
Thanks for a nice tool!
The text was updated successfully, but these errors were encountered: