Skip to content

Latest commit

 

History

History

HM450K_data_analysis

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 

HM450K Analysis

The folder fun/ contains functions dedicated to HM450K data analysis.
The folder data/ contains data that could be useful for HM450K analysis.

Functions

  • probeseq_match.R - Counts HM450K probe sequence matches on hg19 genome assembly scaffolds.

Data

HM450K_probes_annotation.csv - A CSV file listing HM450K probes purposes.

  • The first column "ID" contains the ID of probes.
  • The second column "Target" contains probes' target types. The target can be either:
    • A type of methylation: "CpG", "CpH" (any other C-dinucleotide different from a CpG) or "SNP" (when captured by genotyping probes).
    • A preparation step involved in the Infinium Human Methylation 450K method.
  • The third column "Purpose" defines the purpose of probes: The intensity of a probe can either be used for methylation analysis (Analysis), or for quality control (QC).

HM450K_probes_filters.csv - A CSV file listing HM450K probes that are either cross-reactive, or polymorphic, or both cross-reactive and polymorphic.

HM450K_genotyping_probes_hg19.bed - A BED file containing the coordinates of the HM450K genotyping 'rs' probes on the hg19 genome assembly version.

  • The first column provides the ID of rs probes, which also match Human dbSNP ID.
  • The second column states on which chromosome the probe is located.
  • The third column matches the 1-based position of the SNP.

A more detailed introduction to the 'rs' genotyping probes used in HM450K can be found in this paper:
"There are 65 probes placed on the 450K chip querying high-frequency SNPs (with 59 of these on the EPIC chip; their probe identifiers start with “rs”). Just as for CpG sites, a β-value is calculated for each SNP locus, based on fluorescence intensities from two probes targeting either the wild type or the common mutant variant. These β-values usually fall into one of three disjunct clusters, corresponding to the heterozygous and the two homozygous genotypes (AB, AA, or BB). The specific combination of SNPs across these 65 probes serves as a genetic fingerprint: fingerprints of samples from the same donor match but differ between individuals – with the exception of monozygotic twins – thereby enabling one to check for discrepancies with the metadata."
(N.B.: The SNP probe ID rs13369115 has been merged with another SNP (rs60784560) under the new ID rs10155413. More details here.)