Minigraph does not find variants in simulated data #119

Open
agolicz opened this issue Nov 29, 2024 · 1 comment

Comments

agolicz commented Nov 29, 2024

Hello,
We are trying to understand minigraph behavior.

We built a minigraph graph from 6 simulated assemblies (SVs > 100 bp only, no SNPs, simulated with VISOR). The resulting graph contains very few, extremely large nodes, and many of the SVs we used for the simulations are missing from it even though they are longer than 100 bp: we get fewer than 10 SVs per chromosome, whereas the simulations used more than 3700 SVs per chromosome on average.

We verified that the simulated data is not the issue, because it works well with other pangenome graph building pipelines (Minigraph-Cactus and PGGB). minigraph also works perfectly well with real-world data, so something about the simulated data is causing this and we do not know what. Could you help explain this?
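For anyone who wants to reproduce the per-chromosome comparison, here is a minimal sketch of how the counts could be tallied. It assumes both the simulated SV set and the variants recovered from the graph are available as tab-separated, BED-like text with the sequence name in the first column; the function name and the example file names are hypothetical and not part of minigraph or VISOR.

```python
from collections import Counter


def count_records_per_chrom(lines):
    """Count non-comment, non-empty BED-like records per sequence name.

    `lines` is any iterable of text lines; the first tab-separated
    column is assumed to be the chromosome/sequence name.
    """
    counts = Counter()
    for line in lines:
        line = line.strip()
        # Skip blank lines and comment/header lines.
        if not line or line.startswith("#"):
            continue
        chrom = line.split("\t", 1)[0]
        counts[chrom] += 1
    return counts


if __name__ == "__main__":
    # Hypothetical inputs: replace with your own truth set and graph-derived calls.
    truth = ["chr1\t100\t400", "chr1\t900\t1500", "chr2\t50\t700"]
    found = ["chr1\t100\t400"]
    truth_counts = count_records_per_chrom(truth)
    found_counts = count_records_per_chrom(found)
    for chrom in sorted(truth_counts):
        print(chrom, truth_counts[chrom], found_counts.get(chrom, 0))
```

With real files, the same function can be applied to open file handles, e.g. `count_records_per_chrom(open("simulated_svs.bed"))`, where the file name is only a placeholder.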

It looks like others may have faced a similar problem: #62

jp-jong commented Nov 30, 2024

Similar to #118. I also encountered a similar problem when I ran my own simulation (although at a very small simulated size).
