8000 Consolidate BED, FASTA file input validation · Issue #20 · formbio/laava · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
Consolidate BED, FASTA file input validation #20
Open
@etal

Description

@etal

Check FASTA sequences:

  • Ensure terminal EOL (so concatenation works) – dos2unix, or append "\n"
  • Ensure sequence uses nucleotide alphabet? – seqkit or biopython; see minimap2 input requirements

Check BED versus FASTA sequences (currently handled but roundabout):

  • "vector" contig name from annotation BED matches contig name in the vector FASTA
  • "repcap" contig name from annotation BED matches one contig name in either the vector FASTA or packaging FASTA
  • "vector" contig name is not used in packaging or host FASTA files
  • Contig names are not reused between packaging and host FASTA files

Metadata

Metadata

3C2D

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions

      0