-
Notifications
You must be signed in to change notification settings - Fork 51
Add 'final reads for profiling' saving mechanism #272
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
TestsFor expecting:
TODO: Documentation 1✅ preprocessing Expect: in analysis_ready_fastqs only run merged files (3 total) nextflow run ../main.nf -profile singularity,test_noprofiling --input data/samplesheet.csv --databases data/database.csv --hostremoval_reference data/genome.fasta --outdir ./results -ansi-log false --publish_dir_mode 'symlink' -resume --save_analysis_ready_fastqs PASSED 2✅ preprocessing Expect: in analysis_ready_fastqs only host removed files (5 total) nextflow run ../main.nf -profile singularity,test_noprofiling --input data/samplesheet.csv --databases data/database.csv --hostremoval_reference data/genome.fasta --outdir ./results -ansi-log false --publish_dir_mode 'symlink' -resume --save_analysis_ready_fastqs --perform_runmerging false PASSED 3✅ preprocessing Expect: in analysis_ready_fastqs only complexity or filtlong (_filtered) files (5 files) bbduknextflow run ../main.nf -profile singularity,test_noprofiling --input data/samplesheet.csv --databases data/database.csv --hostremoval_reference data/genome.fasta --outdir ./results -ansi-log false --publish_dir_mode 'symlink' -resume --save_analysis_ready_fastqs --perform_shortread_hostremoval false --perform_longread_hostremoval false --perform_runmerging false PASSED prinseqnextflow run ../main.nf -profile singularity,test_noprofiling --input data/samplesheet.csv --databases data/database.csv --hostremoval_reference data/genome.fasta --outdir ./results -ansi-log false --publish_dir_mode 'symlink' -resume --save_analysis_ready_fastqs --perform_shortread_hostremoval false --perform_longread_hostremoval false --perform_runmerging false --shortread_complexityfilter_tool prinseqplusplus PASSED 3✅ preprocessing Expect: in analysis_ready_fastqs only preprocessed reads (5 total - 8 if don't perform read collapsing) fastpExpect 5 nextflow run ../main.nf -profile singularity,test_noprofiling --input data/samplesheet.csv --databases data/database.csv --hostremoval_reference data/genome.fasta --outdir ./results -ansi-log false --publish_dir_mode 'symlink' -resume --save_analysis_ready_fastqs --perform_runmerging false --perform_shortread_hostremoval false --perform_longread_hostremoval false --perform_shortread_complexityfilter false --longread_qc_skipqualityfilter false PASSED Expect 8(?) nextflow run ../main.nf -profile singularity,test_noprofiling --input data/samplesheet.csv --databases data/database.csv --hostremoval_reference data/genome.fasta --outdir ./results -ansi-log false --publish_dir_mode 'symlink' -resume --save_analysis_ready_fastqs --perform_runmerging false --perform_shortread_hostremoval false --perform_longread_hostremoval false --perform_shortread_complexityfilter false --longread_qc_skipqualityfilter false --shortread_qc_mergepairs false PASSED and adapterremovalExpect 5 nextflow run ../main.nf -profile singularity,test_noprofiling --input data/samplesheet.csv --databases data/database.csv --hostremoval_reference data/genome.fasta --outdir ./results -ansi-log false --publish_dir_mode 'symlink' -resume --save_analysis_ready_fastqs --perform_runmerging false --perform_shortread_hostremoval false --perform_longread_hostremoval false --perform_shortread_complexityfilter false --longread_qc_skipqualityfilter false --shortread_qc_tool 'adapterremoval' PASSED Expect 8 nextflow run ../main.nf -profile singularity,test_noprofiling --input data/samplesheet.csv --databases data/database.csv --hostremoval_reference data/genome.fasta --outdir ./results -ansi-log false --publish_dir_mode 'symlink' -resume --save_analysis_ready_fastqs --perform_runmerging false --perform_shortread_hostremoval false --perform_longread_hostremoval false --perform_shortread_complexityfilter false --longread_qc_skipqualityfilter false --shortread_qc_tool 'adapterremoval' --shortread_qc_mergepairs false PASSED 4✅ preprocessing Expect: in analysis_ready_fastqs host removed reads (*unmapped) (5 files) nextflow run ../main.nf -profile singularity,test_noprofiling --input data/samplesheet.csv --databases data/database.csv --hostremoval_reference data/genome.fasta --outdir ./results -ansi-log false --publish_dir_mode 'symlink' -resume --save_analysis_ready_fastqs --perform_runmerging false --perform_shortread_complexityfilter false --longread_qc_skipqualityfilter false PASSED 5✅ preprocessing Expect: in analysis_ready_fastqs run merged reads (3 files) nextflow run ../main.nf -profile singularity,test_noprofiling --input data/samplesheet.csv --databases data/database.csv --hostremoval_reference data/genome.fasta --outdir ./results -ansi-log false --publish_dir_mode 'symlink' -resume --save_analysis_ready_fastqs --perform_shortread_complexityfilter false --longread_qc_skipqualityfilter false PASSED 6✅ preprocessing Expect: in analysis_ready_fastqs run merged reads (3) nextflow run ../main.nf -profile singularity,test_noprofiling --input data/samplesheet.csv --databases data/database.csv --hostremoval_reference data/genome.fasta --outdir ./results -ansi-log false --publish_dir_mode 'symlink' -resume --save_analysis_ready_fastqs --perform_shortread_complexityfilter false --longread_qc_skipqualityfilter false --perform_shortread_hostremoval false --perform_longread_hostremoval false PASSED 7❌ preprocessing Expect: in analysis_ready_fastqs run merged reads (5 - 2 samples, 2612 and 2612 unmerged) nextflow run ../main.nf -profile singularity,test_noprofiling --input data/samplesheet.csv --databases data/database.csv --hostremoval_reference data/genome.fasta --outdir ./results -ansi-log false --publish_dir_mode 'symlink' -resume --save_analysis_ready_fastqs --perform_shortread_qc false --perform_longread_qc false --perform_shortread_complexityfilter false --longread_qc_skipqualityfilter false PASSED 8❌ preprocessing Expect: in analysis_ready_fastqs run merged reads (4). No nanopore expected as part of preprocessing nextflow run ../main.nf -profile singularity,test_noprofiling --input data/samplesheet.csv --databases data/database.csv --hostremoval_reference data/genome.fasta --outdir ./results -ansi-log false --publish_dir_mode 'symlink' -resume --save_analysis_ready_fastqs --perform_shortread_qc false --perform_longread_qc false --perform_shortread_hostremoval false --perform_longread_hostremoval false PASSED Missing ERR* from filtlong? 9❌ preprocessing Expect: in analysis_ready_fastqs run merged reads (5 - run-cat but not merged) nextflow run ../main.nf -profile singularity,test_noprofiling --input data/samplesheet.csv --databases data/database.csv --hostremoval_reference data/genome.fasta --outdir ./results -ansi-log false --publish_dir_mode 'symlink' -resume --save_analysis_ready_fastqs --perform_shortread_qc false --perform_longread_qc false PASSED 10❌ preprocessing Expect: in analysis_ready_fastqs run merged reads (expect only run merged reads, all other are all as user provided) nextflow run ../main.nf -profile singularity,test_noprofiling --input data/samplesheet.csv --databases data/database.csv --hostremoval_reference data/genome.fasta --outdir ./results -ansi-log false --publish_dir_mode 'symlink' -resume --save_analysis_ready_fastqs --perform_shortread_qc false --perform_longread_qc false --perform_shortread_complexityfilter false --longread_qc_skipqualityfilter false --perform_shortread_hostremoval false --perform_longread_hostremoval false PASSED |
Note this is maybe not the most optimal implementation. E.g., one could calculate already at the beginning of a pipeline what is teh 'final' step, and assign a numeric ID in meta that is then evaluated by teh However this wasn't working initially when I tried, and would require having the logic in multiple places. So I opted for this more verbose way. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Since CAT_FASTQ
is removed, shall we update the taxprofiler tube as well?
Co-authored-by: Sofia Stamouli <91951607+sofstam@users.noreply.github.com>
CAT_FASTQ is not removed, just moved (further up in the config file :) ) |
Co-authored-by: Sofia Stamouli <91951607+sofstam@users.noreply.github.com>
Coming in two stages:
modules.conf
logic to select the 'final' FASTQ that goes into profiling for publishing in--outdir
Closes #262
PR checklist
nf-core lint
).nextflow run . -profile test,docker --outdir <OUTDIR>
).docs/usage.md
is updated.docs/output.md
is updated.CHANGELOG.md
is updated.README.md
is updated (including new tool citations and authors/contributors).