quality control of sequencing data

This list collects quality-control tools for short-read and long-read sequencing data.

read simulation

  • [wgsim] - [C, NA, NA] - [Read simulator.]
  • [badread] - [Python, v0.4.1, 2024.2] - [a long read simulator that can imitate many types of read problems.]

basecall

  • [Dorado] - [C++, v0.8.3, 2024.11] - [v0.9.0, 2024.12, with new dorado polish command] - [Oxford Nanopore's Basecaller.]

qc

correct

consensus sequences from long reads

  • [medaka] - [Python, v2.0.0, 2024.9] - [a tool to create consensus sequences and variant calls from nanopore sequencing data.]

fetch sequence from public databases

  • SRA Toolkit - [C/C++, v3.1.1, 2024.5] - [The SRA Toolkit and SDK from NCBI is a collection of tools and libraries for using data in the INSDC Sequence Read Archives.]
  • KingFisher - [Python, v0.4.1, 2024.1] - [Easier download/extract of FASTA/Q read data and metadata from the ENA, NCBI, AWS or GCP.]
  • iSeq - [Shell, v1.2.0, 2024.10] - [2024.10, Bioinformatics] - [iSeq: An integrated tool to fetch public sequencing data.]
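
As a rough illustration of fetching a run with the SRA Toolkit, the sketch below builds the two command lines that are typically chained: `prefetch` to download the archive and `fasterq-dump` to convert it to FASTQ. The helper names `build_sra_commands` and `fetch_run`, the accession, the output directory, and the thread count are all hypothetical choices made for this example, and the flags should be checked against the installed toolkit version.

```python
import shutil
import subprocess


def build_sra_commands(accession, outdir="fastq", threads=4):
    """Return the argv lists for downloading one run and converting it to FASTQ.

    Hypothetical helper: it only assembles the commands, it does not run them.
    """
    # prefetch downloads the SRA archive for the given accession.
    prefetch = ["prefetch", accession]
    # fasterq-dump converts the archive to FASTQ; --split-files writes
    # paired-end mates to separate files.
    dump = [
        "fasterq-dump", accession,
        "--outdir", outdir,
        "--threads", str(threads),
        "--split-files",
    ]
    return [prefetch, dump]


def fetch_run(accession, outdir="fastq", threads=4):
    """Run the commands in order, failing early if a tool is not on PATH."""
    for argv in build_sra_commands(accession, outdir, threads):
        if shutil.which(argv[0]) is None:
            raise FileNotFoundError(f"{argv[0]} not found; is the SRA Toolkit installed?")
        subprocess.run(argv, check=True)


# Example (needs network access and an installed SRA Toolkit):
# fetch_run("SRR000001", outdir="fastq", threads=4)
```

Keeping command construction separate from execution makes it possible to inspect or log the exact commands before any download starts.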