Design principles

Build the pipeline up bit by bit.
Follow Nextflow training best practices.
Start with a few files in a local directory, then move to AWS Batch.
Document and tag every step.

The skeleton

Let's start with a template that borrows from, but is not exactly the same as, an nf-core template.
- In particular, I'd like to use a few of the base configuration files from nf-core, so we can direct different resources to different processes depending on their labels.
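
As a rough illustration of label-based resources, a base configuration in the nf-core style might look like the sketch below. The label names follow the nf-core convention, but the CPU, memory, and time values are illustrative assumptions, not the values used in this repository.

```nextflow
// conf/base.config (sketch): all resource values here are illustrative assumptions
process {
    // Defaults for any process that carries no matching label
    cpus   = 1
    memory = 2.GB
    time   = 1.h

    // Processes declare e.g. `label 'process_medium'` and pick up these settings
    withLabel: process_low {
        cpus   = 2
        memory = 4.GB
    }
    withLabel: process_medium {
        cpus   = 4
        memory = 8.GB
    }
    withLabel: process_high {
        cpus   = 8
        memory = 16.GB
    }
}
```

Keeping resources in configuration rather than in the process definitions means the same modules can later run on AWS Batch with only a profile change.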

TAG: 00-Skeleton

This should run with nextflow run main.nf --input data/samplesheet.csv --outdir results
The only result at this stage should be the channel containing the example files.
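
For readers new to Nextflow, here is a minimal sketch of how a samplesheet can be turned into a channel of files and printed. The column names (sample, fastq_1, fastq_2) are assumed from the common nf-core layout and may not match data/samplesheet.csv in this repository.

```nextflow
// Sketch only: the column names sample, fastq_1 and fastq_2 are assumptions
workflow {
    Channel
        .fromPath(params.input, checkIfExists: true)   // the CSV passed via --input
        .splitCsv(header: true)                        // one map per row, keyed by column name
        .map { row -> tuple([id: row.sample], [file(row.fastq_1), file(row.fastq_2)]) }
        .view()                                        // print each [meta, reads] item
}
```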

Fastqc and trimming.

I decided to see whether the FASTQC_TRIMGALORE subworkflow (from nf-core/cutandrun) would work.

It seems to work, and makes for another good tag point.

TAG: 01-TrimGalore

If you check out this tag, you should be able to run it with nextflow run main.nf --input data/samplesheet.csv --outdir results

This version should place the trimmed reads and the FastQC results into the outdir.
It also records software versions.

TAG: 02-Bowtie2_index

I still needed to build the genome indexes (both bowtie2 and faidx), so I cobbled together a PREPARE_GENOME subworkflow based on one of the nf-core pipelines. I'm not sure why the bowtie2/build module from nf-core uses the process_high label, so I changed it to process_medium.

Should be runnable with

nextflow run main.nf --input data/samplesheet.csv --outdir results/ --fasta data/genome/ecoli_rel606.fasta

TAG: 03-bowtie

This runs bowtie2 and samtools. It took a bit of work to realise that the bowtie2 index needed to be a value channel, not a queue channel, so that it could be re-used for every sample.
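
The difference can be shown with a toy example. This is a sketch with stand-in processes (BUILD_INDEX and ALIGN are hypothetical names, not the modules used in this pipeline): a queue channel holding one index is consumed after the first sample, whereas a value channel can be read any number of times.

```nextflow
// Toy stand-ins: these are not the real bowtie2 modules from the pipeline
process BUILD_INDEX {
    input:
    val genome

    output:
    path 'index'

    script:
    """
    mkdir index
    touch index/${genome}.1.bt2
    """
}

process ALIGN {
    input:
    val sample
    path index

    output:
    stdout

    script:
    """
    echo "aligning ${sample} against ${index}"
    """
}

workflow {
    samples = Channel.of('s1', 's2', 's3')   // queue channel: one item per sample
    genome  = Channel.of('ecoli')            // queue channel with a single item

    // BUILD_INDEX emits a queue channel with one item. Passed to ALIGN as is,
    // it would be used up by the first sample and ALIGN would run only once.
    // first() turns it into a value channel that every sample can read.
    index = BUILD_INDEX(genome).first()

    ALIGN(samples, index).view()
}
```

Using collect() instead of first() also yields a value channel, which is handy when the index consists of several files that should travel together.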

Should be runnable with

nextflow run main.nf --input data/samplesheet.csv --outdir results/ --fasta data/genome/ecoli_rel606.fasta

TAG: 04-VCF

The final VCF files should go into results/03_vcf/vcf.

I used a slightly different method for the final variant calling, but the basic idea is the same.

The final pipeline can be run with bash s3-run.sh. Please change your group name before running.

GTK-teaching/bl5632-nf-lenski

Folders and files

Latest commit

History

Repository files navigation

Design principles

The skeleton

TAG: 00-Skeleton

Fastqc and trimming.

TAG: 01-TrimGalore

TAG: 02-Bowtie2_index

TAG: 03-bowtie

Tag: 04-VCF

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages