You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I simulated reads ~1million 2x150bp with rescaled error rates using the -E parameter. Using the golden bam file, I ran samtools calmd to generate the MD and NM (edit distance to the reference) tags because it doesn't contain them by default.
Here is a plot of the NM values for -E 0.1 (expecting a median of 150*0.1 = 15):
median is 6
And for -E 0.3 (the maximum)
median is 10 but would have expected 45
This is using the default error model. Because at the very least -E seems to rescale the error rate at higher values this isn't bug per say but it could be critically misleading. That is, if you are benchmarking a tool it could mislead you into thinking the tool can tolerate much higher error rates than it can in reality.
The text was updated successfully, but these errors were encountered:
This is very interesting. Thank you for performing this detailed analysis!
I will look into this, and see if there might be an oversight in the simulator that causes the error rates to appear lower than expected. I'll follow up on this soon.
I simulated reads ~1million 2x150bp with rescaled error rates using the
-E
parameter. Using the golden bam file, I ransamtools calmd
to generate the MD and NM (edit distance to the reference) tags because it doesn't contain them by default.full set of options:
Here is a plot of the NM values for -E 0.1 (expecting a median of 150*0.1 = 15):
median is 6
And for -E 0.3 (the maximum)
median is 10 but would have expected 45
This is using the default error model. Because at the very least
-E
seems to rescale the error rate at higher values this isn't bug per say but it could be critically misleading. That is, if you are benchmarking a tool it could mislead you into thinking the tool can tolerate much higher error rates than it can in reality.The text was updated successfully, but these errors were encountered: