You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
If the vector plasmid is circular, but the reference sequence is linear, sequencing reads spanning the origin will appear in the SAM/BAM output with two alignments, one on each side of the origin, and a misleading two-part classification such as "backbone|vector+backbone".
Add a workflow input option for whether the vector reference should be treated as circular (boolean). Then, if circular:
Perform read mapping twice -- once with the linear reference as-is, and again with the full reference sequence concatenated end-to-end.
Then, resolve reads spanning the origin in the concatenated reference versus the corresponding reads in the original linear reference. Ensure that the same total number of reads are present in the output, i.e. no reads are duplicated or lost after resolution. Consider adding the "TP" tag to the output BAM header and allowing read coordinates to go beyond the reference end -- see SAM/BAM support for circular reference sequences.
Classify origin-spanning reads (primarily backbone and vector+backbone) using modular arithmetic so that the alignment coordinates used in the report stay within the original reference's boundaries.
The text was updated successfully, but these errors were encountered:
If the vector plasmid is circular, but the reference sequence is linear, sequencing reads spanning the origin will appear in the SAM/BAM output with two alignments, one on each side of the origin, and a misleading two-part classification such as "backbone|vector+backbone".
Add a workflow input option for whether the vector reference should be treated as circular (boolean). Then, if circular:
The text was updated successfully, but these errors were encountered: