-
Notifications
You must be signed in to change notification settings - Fork 7
Filter
Young edited this page Apr 25, 2023
·
4 revisions
Basecalled Nanopore sequencing reads can have varying levels of quality. As such, Nanopore fastq files are run through porechop to remove any remaining barcodes and adapter sequences and filtlong to reduce the sample set to the highest quality subset of reads. Illumina fastq files are run through fastp to remove poor quality reads. PHIX removal is not part of this workflow.
---
Donut Falls
---
flowchart LR
A[nanopore fastq] -. optional .-> B[porechop]
B --> C[filtlong]
A -. default .-> C
C --> H[gzip compression]
H --> D[filtered nanopore reads]
F[illumina fastq] --> G[fastp]
G --> E[filtered illumina reads]
Relevant parameters (params) and their default values
- It is recommended to adjust params.filtlong_options to fit the needs of the inputs
params.fastp_options = ''
params.filtlong_options = '--min_length 1000 --keep_percent 95'
params.enable_porechop = false
params.porechop_options = ''