-
Notifications
You must be signed in to change notification settings - Fork 1
Handling effects of setting containment_distance to 1 #71
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitH 8000 ub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hi, can you please send me the error message you received? |
Also just to note on the theory of the DCJ-Indel distance between two completely different plasmids, say plasmid A and plasmid B -- their distance would be 2, because you'd basically end up with the following integer sequence representation for the two plasmids:
You can go from A to B then in two operations: delete 1 from A, and then insert 2. Basically the distance is mathematically still defined, just biologically nonsensical, hence the motivation for the containment distance threshold. I suspect I know why the pipeline is failing in this case, but it'll be easier to pin down if you are able to send me whatever error message you received. |
Thank you for the insight on the theory! This is a segment of the error message in the Pling output; the message repeats itself for various batches/jobids.
|
Hi there! Great tool - I set the containment_distance to 1 as I was trying to ensure the least amount of plasmids were filtered off, even highly dissimilar plasmids, since I wanted to see the nuances of the DCJ-Indel distance between such plasmid pairs too. However, when I did so, it resulted in a failure of the Snakemake workflow somewhere during running.
I understand in theory, a containment distance of 1 would result in no possibility of calculating DCJ-Indel distance as the number of operations to transform one to another would not be calculable.
Hence, could it be useful to perhaps provide internal handling of such problematic cases or restrict the maximum allowed containment distance value so that the pipeline would not fail completely?
Thanks so much!
The text was updated successfully, but these errors were encountered: