AHY123/DSC291-Final-Project

More Comprehensive Motif Database for Gene Regulatory Network

In principle, larger and more comprehensive motif databases should improve the identification of transcription factors (TFs) that affect gene expression, but significant challenges remain. One challenge is the variety of databases available, many of which are tailored to specific organisms such as human and mouse. Popular examples include HOCOMOCO and JASPAR, which are widely used but may differ in the range and accuracy of their motif collections. Another challenge is database coverage. Newer tools such as SCENIC+ claim to curate and cluster the most comprehensive motif collection, with over 30,000 motifs, to enhance TF identification (Bravo González-Blas et al., 2023). We hypothesize that the motif database provided by SCENIC+ will identify more TFs that significantly influence gene expression than databases such as HOCOMOCO. By comparing the performance of these databases, we aim to evaluate whether SCENIC+ offers a measurable improvement in gene regulatory network (GRN) prediction accuracy and TF identification.

Environment

Please install the environment first by following the instructions on the Dictys GitHub page.

Motif Threshold Calculation

Here is a method that helps calculate the threshold of a given motif matrix.
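The repository's own threshold method is not reproduced here. As an illustration only, the sketch below uses one common heuristic: set the threshold to a fixed fraction of the best log-odds score the matrix can reach. The function names, the uniform background of 0.25, the natural logarithm, and the default `fraction=0.6` are all assumptions made for this example.

```python
import math


def max_log_odds_score(matrix, background=0.25):
    """Best achievable log-odds score: sum of log(p_best / background) per position.

    `matrix` is a list of [A, C, G, T] rows holding counts or probabilities;
    each row is normalized before scoring.
    """
    total = 0.0
    for row in matrix:
        row_sum = sum(row)
        if row_sum <= 0:
            raise ValueError("each matrix row must have a positive sum")
        best = max(row) / row_sum  # always >= 0.25 for a 4-letter alphabet
        total += math.log(best / background)
    return total


def motif_threshold(matrix, fraction=0.6):
    """Hypothetical heuristic: a fixed fraction of the maximum log-odds score."""
    return fraction * max_log_odds_score(matrix)


if __name__ == "__main__":
    example = [[1.0, 0.0, 0.0, 0.0], [0.25, 0.25, 0.25, 0.25]]
    print(motif_threshold(example, fraction=0.5))
```

A fraction of the maximum keeps thresholds comparable across motifs of different lengths, but the right value depends on the downstream scanner and should be checked against the method linked above.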

Here is a method that helps convert the SCENIC+ motif database into the .motif format.
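The conversion can be sketched roughly as follows. This example assumes the SCENIC+ motifs are stored in cluster-buster style (a `>name` line followed by rows of A, C, G, T counts) and that the target is a HOMER-style .motif file whose header carries a consensus, a name, and a threshold. The function names are hypothetical and the threshold is passed in rather than computed.

```python
def parse_cluster_buster(text):
    """Parse cluster-buster style text into {name: [[A, C, G, T], ...]}."""
    motifs = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].strip()
            motifs[name] = []
        elif name is not None:
            motifs[name].append([float(x) for x in line.split()])
    return motifs


def to_motif_format(name, counts, threshold):
    """Render one motif as a HOMER-style block (assumed target layout)."""
    consensus = ""
    rows = []
    for row in counts:
        total = sum(row)
        probs = [value / total for value in row]
        # Use the most probable base as a simple consensus letter.
        consensus += "ACGT"[probs.index(max(probs))]
        rows.append("\t".join(f"{p:.3f}" for p in probs))
    header = f">{consensus}\t{name}\t{threshold:.3f}"
    return "\n".join([header] + rows) + "\n"


if __name__ == "__main__":
    text = ">example_motif\n10 0 0 0\n0 5 5 0\n"
    for motif_name, counts in parse_cluster_buster(text).items():
        print(to_motif_format(motif_name, counts, threshold=2.0), end="")
```

Rows are normalized to probabilities because the assumed target format stores per-position base frequencies rather than raw counts.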

Here is a method that helps uncapitalize the motif names in the database so that they are compatible with some particular nomenclature.
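As a rough illustration of the renaming step, the sketch below assumes that each motif name begins with a gene symbol followed by an underscore, and that the target nomenclature capitalizes only the first letter of the symbol (as in mouse gene names, e.g. `Gata1`). Both assumptions, and the function names, are made for this example; the method linked above may differ.

```python
def uncapitalize_motif_name(name, sep="_"):
    """Turn the leading gene symbol into 'Xxxx' form and keep the rest unchanged."""
    gene, found, rest = name.partition(sep)
    return gene.capitalize() + found + rest


def rename_motif_headers(text):
    """Apply the renaming to the name column of each '>' header line.

    Assumes tab-separated headers with the motif name in the second column.
    """
    out = []
    for line in text.splitlines():
        if line.startswith(">"):
            fields = line.split("\t")
            if len(fields) > 1:
                fields[1] = uncapitalize_motif_name(fields[1])
            line = "\t".join(fields)
        out.append(line)
    return "\n".join(out) + "\n"


if __name__ == "__main__":
    print(uncapitalize_motif_name("GATA1_HUMAN.H11MO.0.A"))
```

Only the part before the first separator is changed, so database-specific suffixes in the motif name are preserved.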

Run Gene Regulatory Network

Please refer to this notebook to learn how to prepare the data.

Please refer to this notebook to learn how to set up the Makefile.

Please refer to this notebook to learn how to run the whole GRN.

Please refer to this notebook to learn how to get the data.

Reference

Dictys: dynamic gene regulatory network dissects developmental continuum with single-cell multiomics. Nature Methods (2023).

GitHub page: https://github.com/pinellolab/dictys/tree/master
