Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Understand BioQC analysis and evaluate whether we could use it in hermes #9

Open
cicdguy opened this issue Aug 5, 2021 · 0 comments

Comments

@cicdguy
Copy link
Contributor

cicdguy commented Aug 5, 2021

BioQC is a method to detect tissue heterogeneity in gene expression data (Zhang et al., BMC Genomics, 2017). Enrichment for unexpected tissues can be a consequence of mislabeled samples, technical reasons (e.g. imperfect dissection) or biological reasons (e.g. immune infiltration).

In biokitr:

  • 378 tissue-specific gene signatures derived from public and in-house data are considered
  • Signatures are tested for enrichment in highly expressed genes for each sample using the Wilcoxon-Mann-Whitney test
  • Enrichment scores Q are defined based on the -log10(p-value)
  • Signatures with Q > 2 in at least one sample are shown in the heatmap (the top 40 if there are more)

Internal information:
See https://drive.google.com/file/d/18U_ShdB0ATSWSwdXohUvlA3aC4ZnRmWE/view minute 17
BEDA/biokitr/blob/ab4c130410539f51fbfe680b7e5bc93f5a5163e7/R/gse_bioqc.R#L24
BEDA/biokitr/blob/ab4c130410539f51fbfe680b7e5bc93f5a5163e7/R/gse_bioqc.R#L67

To do: just design.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant