HiCAT-human

HiCAT-human is a modified version of our previous HOR annotation tool, HiCAT. It automatically annotates centromere HOR patterns from both HiFi reads and assemblies of multiple human samples.

Dependencies

Python 3.9.13

Development environment: Linux

Development tool: PyCharm

Packages	Version
biopython	1.79
joblib	1.1.0
lastz	1.04.22
matplotlib	3.5.1
numpy	1.22.3
pandas	1.4.0
python-edlib	1.3.9
python-levenshtein	0.12.2
scikit-learn	1.0.2
seqtk	1.2
setuptools	61.2.0

StringDecomposer version 1.1.2 (included in the HiCAT-human source code)
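
Because the versions above are pinned, it can help to confirm that an environment matches them before running the tool. The following is a minimal sketch, not part of HiCAT-human: the names `PINNED`, `TOOLS`, `installed_version`, and `check_environment` are hypothetical. It compares the Python packages from the table against what is installed and checks only that the command-line tools lastz and seqtk are on the PATH. python-edlib and python-levenshtein are left out on the assumption that their conda package names may not match their Python distribution names.

```python
import shutil
from importlib import metadata

# Pinned versions copied from the dependency table above (Python packages only).
PINNED = {
    "biopython": "1.79",
    "joblib": "1.1.0",
    "matplotlib": "3.5.1",
    "numpy": "1.22.3",
    "pandas": "1.4.0",
    "scikit-learn": "1.0.2",
    "setuptools": "61.2.0",
}

# Command-line tools from the same table; only their presence on PATH is checked.
TOOLS = ["lastz", "seqtk"]


def installed_version(name):
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(name)
    except metadata.PackageNotFoundError:
        return None


def check_environment(pinned=PINNED, tools=TOOLS,
                      lookup=installed_version, which=shutil.which):
    """Return a list of human-readable mismatches (empty if everything matches)."""
    problems = []
    for name, wanted in pinned.items():
        found = lookup(name)
        if found is None:
            problems.append(f"{name}: not installed (expected {wanted})")
        elif found != wanted:
            problems.append(f"{name}: found {found}, expected {wanted}")
    for tool in tools:
        if which(tool) is None:
            problems.append(f"{tool}: not found on PATH")
    return problems


if __name__ == "__main__":
    for line in check_environment() or ["environment matches the pinned versions"]:
        print(line)
```

A version mismatch does not necessarily mean the tool will fail; the table records the environment the tool was developed in, so the report is best read as a list of differences to investigate.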

Quick start

HiCAT-human is a tool to automatically annotate centromere HOR patterns from both reads and assemblies of multiple human samples.

Installation

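# Clone the repository and enter it
git clone https://github.com/xjtu-omics/HiCAT-human.git
cd HiCAT-human
# Install the pinned dependencies listed in requirements.txt
conda install -y --file requirements.txt
# Build the bundled StringDecomposer
cd ./stringdecomposer && make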

Overview

HiCAT-human consists of four modules:

reads: annotates HORs in reads.
reads_aggregate: aggregates the reads annotation results.
assembly: annotates HORs in assemblies.
assembly_match: matches the assembly annotation results to the reads annotation results.

Input data

Reads HOR annotation: whole-genome HiFi reads.
Assembly HOR annotation: a haplotype-resolved human genome assembly.

For detailed usage, see the documentation on the HiCAT-human wiki.

Contact

If you have any questions, please feel free to contact: [email protected], [email protected], [email protected]

Reference

Please cite the following paper when you use HiCAT-human in your work:

Gao S, Zhang Y, Bush SJ, Wang B, Yang X, Ye K. Centromere landscapes resolved from hundreds of human genomes. Genomics, Proteomics & Bioinformatics. 2024 Oct 18:qzae071. https://doi.org/10.1093/gpbjnl/qzae071
