Supplementary Data for
Pandurang Kolekar1*, Vidya Balagopal2*, Li Dong1*, Yanling Liu1, Scott Foy1, Quang Tran1, Heather Mulder1, Anna LW Huskey2, Emily Plyler1, Zhikai Liang1, Jingqun Ma2, Joy Nakitandwe3, Jiali Gu2, Maria Namwanje2, Jamie Maciaszek2, Debbie Payne-Turner2, Saradhi Mallampati2, Lu Wang2, John Easton1, Jeffery M. Klco2#, Xiaotu Ma1#
1: Department of Computational Biology, St. Jude Children’s Research Hospital, Memphis, TN, USA
2: Department of Pathology, St. Jude Children’s Research Hospital, Memphis, TN, USA
3: Department of Pathology and Laboratory Medicine, Diagnostics Institute,
Cleveland Clinic, Cleveland, OH, USA
*: Contributed equally
#: correspondence authors
The repository consists of the following files,
.
├── Allelic_imbalance_figures.zip
├── CNVkit_Tumor_diagrams.pdf
├── CNVkit_output_files.zip
├── README.md
└── example
└── SJAML030006_D1.ai.jpg
- This zipped folder consists of allelic imbalance plots (format: jpg) for all the samples used for analytical validation
- These plots were manually reviewed to determine Loss of Heterozygosity (LOH) events in tumor samples
- Files are named as follows,
<Case ID>_<Sample type>.ai.jpg
- Sample type
D1
indicates tumor sample whereasG1
indicates normal sample - Representative figure for the sample SJAML030006_D1:
- Figure Description
- The figure consists of allelic frequencies of SNPs in chromosome-wise manner
- The Y axes on the left denotes allele frequency of SNPs shown as points/dots and secondary Y-axes on right indicate depth of coverage (capped at 5000) shown by orange bars.
- The X axis indicate genomic position on the chromosome
- In the above figure, it can be clearly seen that minor allele frequencies (MAF) of SNPs on chromosome 13 are deviated from their expected MAF of 0.5, as observed in rest of the chromosomes.
- This PDF file consists of chromosome-wise CNV calls for sample one per page. The red region indicates copy number gain whereas blue region indicate copy number loss. The intensity of color is proportional to log2 ratio of the call.
- The description of the figure is available from CNVkit documentation here
- This zipped folder consists of output files (format: text) generated by the CNVkit v0.9.10 (Talevich E. et al., 2016) for tumor samples of 113 cases sequenced for analytical validation of the SJPedPanel
- Files are named as follows,
<Case ID>_<Sample type>_Tumor_.cn[rs]
*.cns
file consists of segment level Copy Number variation (CNV) calls with log2 ratios*.cnr
file consists of region/bin level CNV calls
- The details of file format are described in CNVkit documentation available here
- Preprint available on medRxiv: https://doi.org/10.1101/2023.11.27.23299068