-
Notifications
You must be signed in to change notification settings - Fork 24
Release 3.0.0 #277
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Release 3.0.0 #277
Conversation
…ation of mhc binding for peptides without functionality, runs through with stub
New structure of mhc binding subworkflow
Fix stub run in mhc binding subworkflow with new template
Initial setup of netmhcpan module
…n external tools import
…. Breaks the current main workflow structure, which needs to be adjust in future PRs
Add initial draft of netmhciipan module
… ci tests. mhcflurry fetches models from server, and if multiple tests spawn in parallel, the server doesn't respond anymore
I cannot fix the CI tests for nf-core download on Edit: Discussion here: https://nfcore.slack.com/archives/C04QR0T3G3H/p1745924266198209 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Big portion of the changes are coming from template changes and giving a new structuring to the local modules. I have small commends. It seems good to go to me.
assets/supported_alleles.json
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What is this HUGE file?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
These are all the supported mhc alleles of all mhc binding predictors (netmhc(ii)pan, mhcflurry, mhcnuggets(ii)). Unfortunately, non of the tools provide sufficient tooling such that I could check their supported alleles. This file is loaded in the prepare_prediction_input.nf
. To filter out non-supported mhc alleles that the user provided.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
is this specific to a genome in particular?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes. Every human has 6 HLA class I alleles (3 maternal, 3 paternal). Currently afaik over 1k HLA class I allele variants are known. For HLA class II alleles it is even more complex. That results in having such a big amount of HLA alleles in that file
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
https://hla.alleles.org/pages/nomenclature/naming_alleles/ if you want to know more
Co-authored-by: Maxime U Garcia <max.u.garcia@gmail.com>
Co-authored-by: Maxime U Garcia <max.u.garcia@gmail.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
Release PR feedback
Thank you already for the reviews @kubranarci @famosab @maxulysse! 🙏🏼 Not quite sure how to handle the new CodeQL failiing tests, since I included the ci.yml from the dev template. I started a discussion here: https://nfcore.slack.com/archives/CE5LG7WMB/p1746538936390669 Edit: TL;DR: We should ignore the failing tests. |
Would be great to get approves from @famosab and @maxulysse until Friday 🙏🏼 Thank you again |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It looks good to me, just one comment for the next developments :)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Its a nit pick but I would introduce a Dependencies
section where you denote old and new software versions when modules are being updated :) maybe you can start this beginning with pipeline version 3.0.0
Major release: Comprehensive restructure of pipeline #205
Most important changes:
syfpeithi
--tools
are:mhcflurry(default),mhcnuggets,mhcnuggetsii,netmhcpan,netmhciipan
epaa.py
into a subworkflowMHC_BINDING_PREDICTION
, which allows simple addition of future predictorsmhcgnomes
and supported alleles for each predictor are stored inassets/supported_alleles.json
epaa.py
andEPYTOPE_PREDICTION_VAR
to only be used with variants. Output:tsv
of peptides from variants. Use--fasta_output
to generate additional variant-based protein fasta fileFASTA2PEPTIDES
)--wide_format_output
<outdir>/predictions/<meta.id>/<meta.id>.tsv
to<outdir>/predictions/<meta.id>.tsv
PR checklist
nf-core lint
).nextflow run . -profile test,docker --outdir <OUTDIR>
).nextflow run . -profile debug,test,docker --outdir <OUTDIR>
).docs/usage.md
is updated.docs/output.md
is updated.CHANGELOG.md
is updated.README.md
is updated (including new tool citations and authors/contributors).