notthathime/NUMTs-detection

 
 


NUMTs-detection

This repository contains scripts for data processing and analysis used in the manuscript:

Wei W, Schon K, Elgar G, Orioli A, Tanguy M, Giess A, Tischkowitz M, Caulfield M, Chinnery PF. Nuclear-embedded mitochondrial DNA sequences in 66,083 human genomes. Nature 2022:in press.

  1. NUMT and breakpoint detection

NUMTs_detection.sh

searchBreakpoint_fromblatoutputs.py

searchNumtCluster_fromDiscordantReads.py

groupNumtCluster_fromMultipleSamples.py

  2. Enrichment analysis

enrichment_creatingRefgenome.py

enrichment_simulation.py

  3. mtDNA variant calling

mtVariantCalling.sh

mtVariantCalling_MToolBox.conf

  4. NUMT methylation detection

nanopolish_methylationDetection.sh

  5. NUMT variant calling

VarDetection_fromDiscSplitReads.sh

generateVariantTable.Human.py

generateVariantTable.HumanChimp.py

  6. Circos plots

circos_allNUMTs.conf

confs

Data Availability

Whole-genome sequence data supporting the findings of this study can be analysed in the Genomics England data warehouse. Access information: https://www.genomicsengland.co.uk/understanding-genomics/data/

About

Detecting NUMTs from WGS
