download and extract the zip folder
cd into the extracted directory and run,
for Linux shell,
python setup.py sdist bdist_wheel
pip install dist/*whl
for Windows command line,
python setup.py sdist bdist_wheel
for %i in (dist\*.whl) do pip install %i
It takes vcf file as input
from VariantAnnotator import *
ParseObj = Parser("yourVCFfile.txt")
variants = ParseObj.parseInput()
ANNOTATE(variants, output="myOutputFile.xlsx")
It pulls following annotations for each variant within the vcf file
- Gene name
- Variant biotype
- Regulatory significance: Checks if the variant is within the regu;atory regions of the gene
- Regulatory feature id: ENSEMBL regulatory feature id
- Amino acids: Reports associated amino acid mutation
- Codons: Reports associated codon mutation
- rsid: variant dbSNP database id
- Uniprot id: Variant Uniprot database id
- Cosmic id: Variant Cosmic database id
- Clinvar id: Variant Clinvar database id
- PharmGKB ID: Variant PharmGKB database id
- SIFT: SIFT scare for the amino acid mutation
- Polyphen: Polyphen score for amino acid mutation
- American: Frequency of the variant in American population
- South Asian: Frequency of the variant in South Asian population
- East Asian: Frequency of the variant in East Asian population
- African: Frequency of the variant in African population
- European: Frequency of the variant in African population
- Pubmed: Pubmed id for the corresponding variant
Below is the example output excel sheet image
- Error handelling
- Add protein structure-function annotations