Aligning RNA-sequencing reads to a sex chromosome informed reference genome. The effects of expression estimates when aligning RNAseq reads to a reference genome that is sensitive to the sex chromosome complement of the reference genome compared to using a default reference genome that does not account for the sex of the sample.
The human X and Y chromosomes share an evolutionary origin and sequence homology, including regions of 100% identity, that can confound estimates of transcript abundance if sex chromosome complement is not taken into account when aligning reads. Here we present XY_RNAseq, a pipeline for aligning RNAseq reads to an apportiate sex chromosome complement reference to infer differential expression between male XY and female XX RNAseq samples.
Please see our preprint for more information:
Aligning RNA-Seq reads to a sex chromosome informed reference genome increases ability to detect sex differences in gene expression 2019. Olney KC; Brotman SM; Valverde-Vesling V; Andrews J; Wilson MA.
If you use XY_RNAseq or discuss/correct for bias in mapping on the sex chromosomes, please cite this preprint.
-
Sex chromosome complement reference genomes - Download and create sex chromosome complement reference genomes and gene annotation files
-
Differential expression pipeline - Suggested pipeline for processing raw RNAseq fastq files for differential expression with HISAT and STAR read aligners followed by limma/voom for differential expression analysis
Creating sex chromosome complement reference genomes for genetic male XY and genetic female XX samples. Run the following command to download the ensembl GRCh38 human reference genome and create two sex chromosome complement reference genomes.
Reference genome | usage |
---|---|
Y-masked | for aligning genetic female XX individuals |
YPARs-masked | for aligning genetic male XX individuals |
default | the default ensemble GRCh38 human reference genome will include sequences for all chromosomes including the sex chromosomes, X and Y |
To download and create the sex chromosome complement reference genomes run the following script:
sh downloadGRCh38HumanReferenceGenomeCreateSexChrComplement.sh
Note: The GRCh38 human reference genome is large and will take several hours to download depending on processing and memory available.
We have put together a workflow for inferring differential expression between males XY and females XX using two read aligners, HISAT and STAR, and limma/voom for computing differential expression. These tools are publicly available and we ask that if you use this workflow to cite the tools used listed in the table below.
To run our suggested workflow for differential expression please see DifferentialExpression folder.
Tool | usage | citation |
---|---|---|
Trimmomatic | Trim RNA-sequences for quality | Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30: 2114–2120. |
HISAT | RNAseq read aligner | Kim D, Langmead B, Salzberg SL. HISAT: a fast spliced aligner with low memory requirements. Nat Methods. 2015;12: 357–360. |
STAR | RNAseq read aligner | Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29: 15–21 |
bamtools | analyzing and processing BAM files | Barnett DW, Garrison EK, Quinlan AR, Strömberg MP, Marth GT. BamTools: a C++ API and toolkit for analyzing and managing BAM files. Bioinformatics. 2011;27: 1691–1692. |
FeatureCounts | obtain raw transcriptome counts | Liao Y, Smyth GK, Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2014;30: 923–930. |
Limma/voom | differenital expression analysis | Law CW, Chen Y, Shi W, Smyth GK. voom: Precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol. 2014;15: R29. |
Name | github ID | |
---|---|---|
Kimberly Olney | [email protected] | @olneykimberly |
Sarah Brotman | [email protected] | @sarahbrotman |
Valeria Valverde-Vesling | ||
Jocelyn Andrews | ||
Melissa A. Wilson | [email protected] | @sexchrlab |