Skip to content

Single Sample SVs

John Garza edited this page Jun 13, 2019 · 7 revisions

Introduction

Tutorials

Command Line

INSERT STEP-BY-STEP WALKTHROUGH

CWL Workflow

INSERT LINK TO DOCKER IMAGES/REPOS AND CWL

Steps

INSERT PROCESS DIAGRAM

Inputs

Name Description Example Required
bam Aligned sequencing results to be analyzed for SVs
cnvkit_diagram Create an ideogram of copy ratios on chromosomes as a pdf false
cnvkit_drop_low_coverage Helps avoid false positive deletions in low quality tumor samples false
cnvkit_male_reference Use/assume a male reference false
cnvkit_method Sequencing protocol used wgs
cnvkit_reference_cnn A copy number reference file against which potential copy number variants will be evaluated
cnvkit_scatter_plot Create a whole genome copy ratio profile as a pdf scatter plot false
cnvkit_vcf_name Custom name to use for the cnvkit output vcf cnvkit_output
manta_call_regions bgzip-compressed, tabix-indexed BED file specifiying regions to which variant analysis will be restricted
manta_non_wgs When true, activates settings appropriate for whole exome sequencing false
manta_output_contigs if true, outputs assembled contig sequences in final VCF files, in the INFO field CONTIG true
maximum_sv_pop_freq Population frequency above which variants will be filtered out
merge_estimate_sv_distance When evaluating variants to be merged, estimate distance based on the size of the sv
merge_max_distance Maximum distance of variants to consider for merging
merge_min_sv_size Minimum size of SVs to merge
merge_min_svs Minimum number of sv calls needed to be merged
merge_same_strand Require merged SVs to be on the same strand
merge_same_type Require merged SVs to be of the same type
merge_sv_pop_freq_db bed file containing allele frequencies for a population
reference Reference sequence
smoove_exclude_regions Regions to be ignored when calling SVs through smoove (a wrapper for lumpy)
sv_filter_interval_lists One or more interval lists defining regions to keep in the output vcf, labeled with the source of the intervals
variants_to_table_fields one or more of any standard VCF column (CHROM, ID, QUAL) or any binding in the INFO field (e.g., AC=10) to add to the tsv report
variants_to_table_genotype_fields one or more of any binding in the FORMAT field (e.g., GQ, PL) to add to the tsv report
vep_cache_dir Location of a local ensembl cache to be used by vep
vep_ensembl_assembly Which (species) assembly vep should use GRCh38
vep_ensembl_species Which species vep should use homo_sapiens
vep_ensembl_version Which ensembl release vep should use 95
vep_to_table_fields VEP CSQ annotation fields to add to the tsv report

Outputs

Name Source Description
annotated_tsvs GATK VariantsToTable tsv files containing specified SV fields and annotations
cn_diagram CNVkit ideogram of copy ratios on chromosomes
cn_scatter_plot CNVkit whole genome copy ratio profile
cnvkit_vcf CNVkit final cnvkit output, converted to vcf format
filtered_vcfs Various filters SV VCF, filtered by variant population frequency and the above interval lists
manta_all_candidates Manta Unscored SV and indel candidates
manta_diploid_variants Manta SVs and indels scored and genotyped under a diploid model
manta_small_candidates Manta simple insertion and deletion variants less than the minimum scored variant size (50 by default)
manta_somatic_variants Manta SVs and indels scored under a somatic variant model
manta_tumor_only_variants Manta Subset of the candidateSV.vcf.gz file after removing redundant candidates and small indels less than the minimum scored variant size (50 by default)
merged_annotated_svs Suvivor, VEP SV calls from Manta, CNVkit, and Smoove(lumpy), merged by Survivor and annotated by VEP
smoove_output_variants Smoove (Lumpy) SV calls from Smoove, a wrapper for Lumpy
sv_pop_filtered_vcf Various filters SV VCF, filtered by variant population frequency
tumor_antitarget_coverage CNVkit Coverage in the antitarget regions from bam read depths
tumor_bin_level_ratios CNVkit table of copy number ratios
tumor_segmented_ratios CNVkit discrete copy number segments from the above table
tumor_target_coverage CNVkit Coverage in the target regions from bam read depths
Clone this wiki locally