-
Notifications
You must be signed in to change notification settings - Fork 5
Expose more options in workflow; clean up core Python scripts #10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
34a66ac
to
eb634df
Compare
…cleanup summarize_AAV_alignment: - Use ruff to fix if-else block. - Remove unused imports and dead code. - Add a leading underscore to the names of unused loop variables. - Standardize docstrings, exceptions, string formatting, etc. No functional change.
The default AAV2 f/f info still comes from a global variable, not the new FASTA file, but the contents are identical and it can be used for local testing.
- Expose argument --target-gap-threshold to set what was (and still is) a global constant. - Reduce run time by 25% - Use more precise variable names - Remove some dead code
0b43dc8
to
8362b9d
Compare
test/Makefile: Fix a typo.
- Rename global MAX_DIFF_W_REF to MAX_MISSING_FLANK; settable by the same CLI arg --max-allowed-missing-flanking - Add global MAX_OUTSIDE_VECTOR and CLI arg --max-allowed-outside-vector, similar but referring to the margin just outside the vector region
Added input options: target_gap_threshold, max_allowed_outside_vector, max_allowed_missing_flanking, flipflop_fa
Also optionally take repcap_name (str) to count repcap mappings as before.
Add a hidden parameter "container_version" to streamline testing development builds.
alphabdiallo
approved these changes
Apr 26, 2024
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Add workflow input options to enable some investigations.
max_allowed_missing_flanking
andflipflop_fa
target_gap_threshold
into a script option and workflow inputmax_allowed_outside_vector
, similar tomax_allowed_missing_flanking
but for the regions just outside the vector region (full reads overlapping the backbone a bit), while the latter now only refers to the portion of each ITR allowed to be missing.repcap_fa
andhelper_fa
inputs into a singlepackaging_fa
, and optionally takerepcap_name
(str) to count repcap mappings as such for that mapped reference inpackaging_fa
. For dual constructs, the "repcap" region on the vector backbone can be explicitly annotated in the input BED file.Include type/subtype BAMs in Nextflow output.
Profiled both Python scripts and sped up
summarize_AAV_alignment.py
by about 25%. (The other spends almost all its time inside Parasail.)Cleaned up code with
ruff check
.