Updating workflow files to work with Igenome #388

nschcolnicov · 2024-08-26T17:09:43Z

PR checklist

github-actions · 2024-08-26T17:11:26Z

`nf-core lint` overall result: Passed ✅ ⚠️

Posted for pipeline commit 7d534cd

+| ✅ 214 tests passed       |+
#| ❔   1 tests were ignored |#
!| ❗   3 tests had warnings |!

❗ Test warnings:

pipeline_todos - TODO string in main.nf: Optionally add in-text citation tools to this list.
pipeline_todos - TODO string in main.nf: Optionally add bibliographic entries to this list.
pipeline_todos - TODO string in main.nf: Only uncomment below if logic in toolCitationText/toolBibliographyText has been filled!

❔ Tests ignored:

nextflow_config - Config default ignored: params.fastp_known_mirna_adapters

✅ Tests passed:

files_exist - File found: .gitattributes
files_exist - File found: .gitignore
files_exist - File found: .nf-core.yml
files_exist - File found: .editorconfig
files_exist - File found: .prettierignore
files_exist - File found: .prettierrc.yml
files_exist - File found: CHANGELOG.md
files_exist - File found: CITATIONS.md
files_exist - File found: CODE_OF_CONDUCT.md
files_exist - File found: LICENSE or LICENSE.md or LICENCE or LICENCE.md
files_exist - File found: nextflow_schema.json
files_exist - File found: nextflow.config
files_exist - File found: README.md
files_exist - File found: .github/.dockstore.yml
files_exist - File found: .github/CONTRIBUTING.md
files_exist - File found: .github/ISSUE_TEMPLATE/bug_report.yml
files_exist - File found: .github/ISSUE_TEMPLATE/config.yml
files_exist - File found: .github/ISSUE_TEMPLATE/feature_request.yml
files_exist - File found: .github/PULL_REQUEST_TEMPLATE.md
files_exist - File found: .github/workflows/branch.yml
files_exist - File found: .github/workflows/ci.yml
files_exist - File found: .github/workflows/linting_comment.yml
files_exist - File found: .github/workflows/linting.yml
files_exist - File found: assets/email_template.html
files_exist - File found: assets/email_template.txt
files_exist - File found: assets/sendmail_template.txt
files_exist - File found: assets/nf-core-smrnaseq_logo_light.png
files_exist - File found: conf/modules.config
files_exist - File found: conf/test.config
files_exist - File found: conf/test_full.config
files_exist - File found: docs/images/nf-core-smrnaseq_logo_light.png
files_exist - File found: docs/images/nf-core-smrnaseq_logo_dark.png
files_exist - File found: docs/output.md
files_exist - File found: docs/README.md
files_exist - File found: docs/README.md
files_exist - File found: docs/usage.md
files_exist - File found: main.nf
files_exist - File found: assets/multiqc_config.yml
files_exist - File found: conf/base.config
files_exist - File found: conf/igenomes.config
files_exist - File found: .github/workflows/awstest.yml
files_exist - File found: .github/workflows/awsfulltest.yml
files_exist - File found: modules.json
files_exist - File not found check: .github/ISSUE_TEMPLATE/bug_report.md
files_exist - File not found check: .github/ISSUE_TEMPLATE/feature_request.md
files_exist - File not found check: .github/workflows/push_dockerhub.yml
files_exist - File not found check: .markdownlint.yml
files_exist - File not found check: .nf-core.yaml
files_exist - File not found check: .yamllint.yml
files_exist - File not found check: bin/markdown_to_html.r
files_exist - File not found check: conf/aws.config
files_exist - File not found check: docs/images/nf-core-smrnaseq_logo.png
files_exist - File not found check: lib/Checks.groovy
files_exist - File not found check: lib/Completion.groovy
files_exist - File not found check: lib/NfcoreTemplate.groovy
files_exist - File not found check: lib/Utils.groovy
files_exist - File not found check: lib/Workflow.groovy
files_exist - File not found check: lib/WorkflowMain.groovy
files_exist - File not found check: lib/WorkflowSmrnaseq.groovy
files_exist - File not found check: parameters.settings.json
files_exist - File not found check: pipeline_template.yml
files_exist - File not found check: Singularity
files_exist - File not found check: lib/nfcore_external_java_deps.jar
files_exist - File not found check: .travis.yml
nextflow_config - Config variable found: manifest.name
nextflow_config - Config variable found: manifest.nextflowVersion
nextflow_config - Config variable found: manifest.description
nextflow_config - Config variable found: manifest.version
nextflow_config - Config variable found: manifest.homePage
nextflow_config - Config variable found: timeline.enabled
nextflow_config - Config variable found: trace.enabled
nextflow_config - Config variable found: report.enabled
nextflow_config - Config variable found: dag.enabled
nextflow_config - Config variable found: process.cpus
nextflow_config - Config variable found: process.memory
nextflow_config - Config variable found: process.time
nextflow_config - Config variable found: params.outdir
nextflow_config - Config variable found: params.input
nextflow_config - Config variable found: params.validationShowHiddenParams
nextflow_config - Config variable found: params.validationSchemaIgnoreParams
nextflow_config - Config variable found: manifest.mainScript
nextflow_config - Config variable found: timeline.file
nextflow_config - Config variable found: trace.file
nextflow_config - Config variable found: report.file
nextflow_config - Config variable found: dag.file
nextflow_config - Config variable (correctly) not found: params.nf_required_version
nextflow_config - Config variable (correctly) not found: params.container
nextflow_config - Config variable (correctly) not found: params.singleEnd
nextflow_config - Config variable (correctly) not found: params.igenomesIgnore
nextflow_config - Config variable (correctly) not found: params.name
nextflow_config - Config variable (correctly) not found: params.enable_conda
nextflow_config - Config timeline.enabled had correct value: true
nextflow_config - Config report.enabled had correct value: true
nextflow_config - Config trace.enabled had correct value: true
nextflow_config - Config dag.enabled had correct value: true
nextflow_config - Config manifest.name began with nf-core/
nextflow_config - Config variable manifest.homePage began with https://github.com/nf-core/
nextflow_config - Config dag.file ended with .html
nextflow_config - Config variable manifest.nextflowVersion started with >= or !>=
nextflow_config - Config manifest.version ends in dev: 2.3.2dev
nextflow_config - Config params.custom_config_version is set to master
nextflow_config - Config params.custom_config_base is set to https://raw.githubusercontent.com/nf-core/configs/master
nextflow_config - Lines for loading custom profiles found
nextflow_config - nextflow.config contains configuration profile test
nextflow_config - Config default value correct: params.protocol= custom
nextflow_config - Config default value correct: params.umitools_extract_method= string
nextflow_config - Config default value correct: params.umitools_method= dir
nextflow_config - Config default value correct: params.skip_umi_extract_before_dedup= true
nextflow_config - Config default value correct: params.mature= https://github.com/nf-core/test-datasets/raw/smrnaseq/miRBase/mature.fa
nextflow_config - Config default value correct: params.hairpin= https://github.com/nf-core/test-datasets/raw/smrnaseq/miRBase/hairpin.fa
nextflow_config - Config default value correct: params.save_aligned_mirna_quant= true
nextflow_config - Config default value correct: params.three_prime_adapter= AGATCGGAAGAGCACACGTCTGAACTCCAGTCA
nextflow_config - Config default value correct: params.trim_fastq= true
nextflow_config - Config default value correct: params.fastp_min_length= 17
nextflow_config - Config default value correct: params.fastp_max_length= 100
nextflow_config - Config default value correct: params.min_trimmed_reads= 10
nextflow_config - Config default value correct: params.save_merged= true
nextflow_config - Config default value correct: params.phred_offset= 33
nextflow_config - Config default value correct: params.custom_config_version= master
nextflow_config - Config default value correct: params.custom_config_base= https://raw.githubusercontent.com/nf-core/configs/master
nextflow_config - Config default value correct: params.max_cpus= 16
nextflow_config - Config default value correct: params.max_memory= 128.GB
nextflow_config - Config default value correct: params.max_time= 240.h
nextflow_config - Config default value correct: params.publish_dir_mode= copy
nextflow_config - Config default value correct: params.max_multiqc_email_size= 25.MB
nextflow_config - Config default value correct: params.validate_params= true
nextflow_config - Config default value correct: params.pipelines_testdata_base_path= https://raw.githubusercontent.com/nf-core/test-datasets/
files_unchanged - .gitattributes matches the template
files_unchanged - .prettierrc.yml matches the template
files_unchanged - CODE_OF_CONDUCT.md matches the template
files_unchanged - LICENSE matches the template
files_unchanged - .github/.dockstore.yml matches the template
files_unchanged - .github/CONTRIBUTING.md matches the template
files_unchanged - .github/ISSUE_TEMPLATE/bug_report.yml matches the template
files_unchanged - .github/ISSUE_TEMPLATE/config.yml matches the template
files_unchanged - .github/ISSUE_TEMPLATE/feature_request.yml matches the template
files_unchanged - .github/PULL_REQUEST_TEMPLATE.md matches the template
files_unchanged - .github/workflows/branch.yml matches the template
files_unchanged - .github/workflows/linting_comment.yml matches the template
files_unchanged - .github/workflows/linting.yml matches the template
files_unchanged - assets/email_template.html matches the template
files_unchanged - assets/email_template.txt matches the template
files_unchanged - assets/sendmail_template.txt matches the template
files_unchanged - assets/nf-core-smrnaseq_logo_light.png matches the template
files_unchanged - docs/images/nf-core-smrnaseq_logo_light.png matches the template
files_unchanged - docs/images/nf-core-smrnaseq_logo_dark.png matches the template
files_unchanged - docs/README.md matches the template
files_unchanged - .gitignore matches the template
files_unchanged - .prettierignore matches the template
actions_ci - '.github/workflows/ci.yml' is triggered on expected events
actions_ci - '.github/workflows/ci.yml' checks minimum NF version
actions_awstest - '.github/workflows/awstest.yml' is triggered correctly
actions_awsfulltest - .github/workflows/awsfulltest.yml is triggered correctly
actions_awsfulltest - .github/workflows/awsfulltest.yml does not use -profile test
readme - README Nextflow minimum version badge matched config. Badge: 23.04.0, Config: 23.04.0
readme - README Zenodo placeholder was replaced with DOI.
pipeline_name_conventions - Name adheres to nf-core convention
template_strings - Did not find any Jinja template strings (250 files)
schema_lint - Schema lint passed
schema_lint - Schema title + description lint passed
schema_lint - Input mimetype lint passed: 'text/csv'
schema_params - Schema matched params returned from nextflow config
system_exit - No System.exit calls found
actions_schema_validation - Workflow validation passed: awstest.yml
actions_schema_validation - Workflow validation passed: branch.yml
actions_schema_validation - Workflow validation passed: fix-linting.yml
actions_schema_validation - Workflow validation passed: linting.yml
actions_schema_validation - Workflow validation passed: clean-up.yml
actions_schema_validation - Workflow validation passed: ci.yml
actions_schema_validation - Workflow validation passed: linting_comment.yml
actions_schema_validation - Workflow validation passed: awsfulltest.yml
actions_schema_validation - Workflow validation passed: download_pipeline.yml
actions_schema_validation - Workflow validation passed: release-announcements.yml
merge_markers - No merge markers found in pipeline files
modules_json - Only installed modules found in modules.json
multiqc_config - assets/multiqc_config.yml found and not ignored.
multiqc_config - assets/multiqc_config.yml contains report_section_order
multiqc_config - assets/multiqc_config.yml contains export_plots
multiqc_config - assets/multiqc_config.yml contains report_comment
multiqc_config - assets/multiqc_config.yml follows the ordering scheme of the minimally required plugins.
multiqc_config - assets/multiqc_config.yml contains a matching 'report_comment'.
multiqc_config - assets/multiqc_config.yml contains 'export_plots: true'.
modules_structure - modules directory structure is correct 'modules/nf-core/TOOL/SUBTOOL'
base_config - conf/base.config found and not ignored.
modules_config - conf/modules.config found and not ignored.
modules_config - NFCORE_SMRNASEQ found in conf/modules.config and Nextflow scripts.
modules_config - CAT_FASTQ found in conf/modules.config and Nextflow scripts.
modules_config - FASTP_LENGTH_FILTER found in conf/modules.config and Nextflow scripts.
modules_config - INDEX_GENOME found in conf/modules.config and Nextflow scripts.
modules_config - UMITOOLS_EXTRACT found in conf/modules.config and Nextflow scripts.
modules_config - MIRTRACE_RUN found in conf/modules.config and Nextflow scripts.
modules_config - NFCORE_SMRNASEQ found in conf/modules.config and Nextflow scripts.
modules_config - NFCORE_SMRNASEQ found in conf/modules.config and Nextflow scripts.
modules_config - NFCORE_SMRNASEQ found in conf/modules.config and Nextflow scripts.
modules_config - NFCORE_SMRNASEQ found in conf/modules.config and Nextflow scripts.
modules_config - NFCORE_SMRNASEQ found in conf/modules.config and Nextflow scripts.
modules_config - NFCORE_SMRNASEQ found in conf/modules.config and Nextflow scripts.
modules_config - NFCORE_SMRNASEQ found in conf/modules.config and Nextflow scripts.
modules_config - NFCORE_SMRNASEQ found in conf/modules.config and Nextflow scripts.
modules_config - NFCORE_SMRNASEQ found in conf/modules.config and Nextflow scripts.
modules_config - EDGER_QC found in conf/modules.config and Nextflow scripts.
modules_config - SEQCLUSTER_SEQUENCES found in conf/modules.config and Nextflow scripts.
modules_config - NFCORE_SMRNASEQ found in conf/modules.config and Nextflow scripts.
modules_config - MIRTOP_QUANT found in conf/modules.config and Nextflow scripts.
modules_config - NFCORE_SMRNASEQ found in conf/modules.config and Nextflow scripts.
modules_config - NFCORE_SMRNASEQ found in conf/modules.config and Nextflow scripts.
modules_config - SAMTOOLS_INDEX found in conf/modules.config and Nextflow scripts.
modules_config - NFCORE_SMRNASEQ found in conf/modules.config and Nextflow scripts.
modules_config - NFCORE_SMRNASEQ found in conf/modules.config and Nextflow scripts.
modules_config - NFCORE_SMRNASEQ found in conf/modules.config and Nextflow scripts.
modules_config - NFCORE_SMRNASEQ found in conf/modules.config and Nextflow scripts.
modules_config - MULTIQC found in conf/modules.config and Nextflow scripts.
nfcore_yml - Repository type in .nf-core.yml is valid: pipeline
nfcore_yml - nf-core version in .nf-core.yml is set to the latest version: 2.14.1

Run details

nf-core/tools version 2.14.1
Run at 2024-08-28 16:02:21

atrigila

I added some comments to the code that can be addressed in this same PR or in another one, if you think that is better.

The test case for this PR could be issue #359, which I think your solution addressed. We can add a simple nf-test that checks for the presence of mirdeep2 files when using genome as input.
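
As a rough illustration of the nf-test idea above, a pipeline-level test could look something like the sketch below. This is not part of the PR: the file location, the `genome` value, the `mirtrace_species` value, and the `mirdeep2` output directory name are all assumptions made for illustration and would need to be matched to what the pipeline actually accepts and publishes.

```groovy
// Hypothetical nf-test sketch (for example tests/igenomes_mirdeep2.nf.test); not part of this PR.
nextflow_pipeline {

    name "Genome input produces miRDeep2 output"
    script "main.nf"

    test("mirdeep2 files are present when a genome is supplied") {

        when {
            params {
                // Assumed values: use whichever iGenomes key and species the test data supports.
                genome           = "GRCh37"
                mirtrace_species = "hsa"
                outdir           = "$outputDir"
            }
        }

        then {
            // The run must finish without errors.
            assert workflow.success
            // Assumed output location; adjust to the directory the pipeline really publishes to.
            assert new File("$outputDir/mirdeep2").exists()
        }
    }
}
```

A test along these lines would fail if the genome-dependent branch were skipped again, which is the behaviour reported in #359.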

atrigila · 2024-08-27T13:54:59Z

main.nf

+        params.fasta,
+        params.mirtrace_species,
+        params.bowtie_index,


Interesting. I thought these had to be defined in the PIPELINE_INITIALISATION and the outputs from PIPELINE_INITIALISATION have to be passed to the main workflow. See:
https://github.com/nf-core/taxprofiler/blob/5d3ee5513a84f92773c8376c55b5f4da39835307/main.nf#L74-L91
or
https://github.com/nf-core/phaseimpute/blob/f2823b024800155cd87d70f406794324c4123dfd/main.nf#L140-L151
but I see sarek and rnaseq have different approaches as well.

nschcolnicov · 2024-08-28

I'll leave it as it is, as we discussed. Since there doesn't seem to be a standard for how to set up the new template, we can always change it later if we have a reason to do so.

atrigila · 2024-08-27T14:43:21Z

workflows/smrnaseq.nf

    ch_reads_for_mirna = FASTQ_FASTQC_UMITOOLS_FASTP.out.reads

    // even if bowtie index is specified, there still needs to be a fasta.
    // without fasta, no genome analysis.
-    if(params.fasta) {
+    if(fasta) {


Perhaps since we are changing this for all fasta, we could adhere to adding the ch_ prefix:

> 2.5 Input channel name structure
>
> Input channel names SHOULD signify the input object type. For example, a single value input channel will be prefixed with val_, whereas input channels with multiple elements (e.g. meta map + file) should be prefixed with ch_.

https://nf-co.re/docs/guidelines/components/subworkflows

If this breaks things or you think it is too complicated we can open a new issue and address it separately.

nschcolnicov · 2024-08-27

I originally tackled this issue by converting value variables to value channels, and converting file string variables to path channels, but it created a lot of issues downstream. I'd rather tackle the conversion of all of these variables into channels in #390, to make sure that these are the only changes implemented in the branch.

nschcolnicov · 2024-08-27

This fasta variable is just equal to params.fasta, so I will rename it to val_fasta.
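
The naming convention discussed in this thread can be sketched roughly as follows. This is a minimal, hypothetical example rather than the pipeline's actual code: the workflow name `NAMING_SKETCH`, the sample tuple, and the log messages are made up for illustration, and it assumes (as in the diff above) that plain parameter values are handed to the workflow and tested directly with `if`.

```groovy
// Hypothetical sketch of the val_ / ch_ prefixes; not the real smrnaseq workflow.
params.fasta            = null
params.mirtrace_species = null

workflow NAMING_SKETCH {
    take:
    val_fasta             // plain value (path string or null), hence the val_ prefix
    val_mirtrace_species  // plain value (species code or null), hence the val_ prefix
    ch_reads              // channel of [ meta, reads ] tuples, hence the ch_ prefix

    main:
    // Plain values can be tested directly, as the diff does with if(fasta).
    if (val_fasta) {
        log.info "Genome analysis enabled with ${val_fasta}"
    }
    if (val_mirtrace_species) {
        log.info "miRTrace enabled for ${val_mirtrace_species}"
    }

    emit:
    reads = ch_reads
}

workflow {
    // Illustrative input only; the file name is a placeholder.
    ch_reads = Channel.of([ [id: 'sample1'], 'sample1.fastq.gz' ])
    NAMING_SKETCH(params.fasta, params.mirtrace_species, ch_reads)
}
```

Keeping `val_fasta` as a plain value is what lets the `if` check work; once these inputs become real channels (the change deferred to #390), such checks would need to move into channel operators instead.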

atrigila · 2024-08-27T14:43:59Z

workflows/smrnaseq.nf

@@ -181,8 +184,8 @@ workflow NFCORE_SMRNASEQ {
    //
    // SUBWORKFLOW: MIRTRACE
    //
-    if (params.mirtrace_species) {
-            MIRTRACE(ch_mirtrace_inputs)
+    if (mirtrace_species) {


Same comment as before: perhaps add the val_ prefix to this name.

https://nf-co.re/docs/guidelines/components/subworkflows

apeltzer · 2024-08-27T14:51:51Z

I think we can tackle the upstream points in two stages: let's update here already to match what @atrigila mentioned in terms of recommendations (ch_ and val_) for the points we need for this PR, and then have a subsequent PR where we make sure to update it everywhere else necessary to match nf-core guidelines/recommendations more closely 👍🏻 Might come in handy too when we switch the local modules to nf-core/modules.

apeltzer

Ok with the changes, would also go for changing the ch_ and val_ prefixes where necessary.

nschcolnicov · 2024-08-28T17:12:07Z

> I added some comments to the code that can be addressed in this same PR or in another one, if you think that is better.
>
> The test case for this PR could be issue #359, which I think your solution addressed. We can add a simple nf-test that checks for the presence of mirdeep2 files when using genome as input.

I'll look at this separately.

Initialized changes

4adf3d5

Updated mirtrace_species handling

10b88c6

nschcolnicov force-pushed the fix_igenome branch from 65d5b4c to 10b88c6 on August 27, 2024 12:05

nschcolnicov changed the title from "Initialized changes" to "Updating workflow files to work with Igenome" on Aug 27, 2024

nschcolnicov force-pushed the fix_igenome branch from d83bf81 to 10b88c6 on August 27, 2024 12:13

nschcolnicov added 2 commits August 27, 2024 12:13

Pulled from dev

c5e9123

Fix variable name

cc697dc

nschcolnicov marked this pull request as ready for review August 27, 2024 13:42

nschcolnicov requested review from atrigila and apeltzer August 27, 2024 13:42

atrigila reviewed Aug 27, 2024

apeltzer approved these changes Aug 27, 2024

Pulled from dev

0573675

nschcolnicov force-pushed the fix_igenome branch 2 times, most recently from eb05af3 to aff93c1, on August 28, 2024 15:52

Addressing PR comments

7d534cd

nschcolnicov force-pushed the fix_igenome branch from aff93c1 to 7d534cd on August 28, 2024 16:00

nschcolnicov merged commit 228212c into nf-core:dev Aug 28, 2024
16 checks passed
