-
Notifications
You must be signed in to change notification settings - Fork 10
Does longphase support SNVs from illumina/short-reads #88
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hi @eesiribloom The variant calling (e.g., VCF from Illumina) can differ from the phasing (e.g., BAM from ONT), as long as both are using the same reference genome. |
Hi @ythuang0522 , thank you for making this great tool! I wanted to follow-up on this issue since I'm having some difficulty getting LongPhase to work with short-read variant calling and long-read alignment. Both my short-read VCFs (Illumina sequencing/Mutect2 variant caller) and my long-read alignment files (ONT sequencing) are aligned to the same reference genome (hg38). However when I try to run the following:
I get the following stdout:
and it doesn't look like there's any change between the input and output VCFs. Are there additional requirements that LongPhase looks for in the VCF to determine if the VCF is compatible with LongPhase? |
Hi @cteng585 , Mutect2 only outputs somatic SNPs which are usually much less than germline variants. As read-based phasing requires a long read spanning at least two variants, phasing over these small amount of somatic variants is not the right way (you can check the number of somatic variants first). One easy way is calling germline (e.g., via GATK or DeepVariant or Clair3) and somatic variants (Mutect2 or ClairS) separately and then merge the two VCFs for phasing. The LongPhase will have abundant variants for spanning and phasing. |
Hi @ythuang0522 , thank you for the quick response! I was able to get LongPhase to work once I merged my VCF files from Mutect2 and Clair3 together. I had one other question I was hoping you could take the time to answer. In one of the previous tickets #9 , it was suggested that one could theoretically combine the alignment files from ONT and Illumina short read sequencing. I tried to do this by merging an Illumina short-read CRAM and an ONT long-read CRAM, but LongPhase appears to hang in the process. Is this something that further work was done on? Do you know where I might start troubleshooting? Thank you again! |
Hi @cteng585, I tested the version you are using, and it should be able to run properly with CRAM files. If the CRAM index is not provided, there should be a corresponding message. If the reference FASTA file is missing the Currently, the testing methods I can think of are: first, try converting the CRAM file to BAM and test it; or alternatively, run the CRAM or BAM files separately from Illumina and ONT. I hope this helps you.
Thanks |
Given the higher accuracy of illumina short-read SNV/ indel calls compared to ONT long-reads, does longphase support phasing with short-read SNVs and long-read alignments/BAMs?
The text was updated successfully, but these errors were encountered: