8000 How can I use Darts_DNN for data mapped to hg38? · Issue #8 · Xinglab/DARTS · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

How can I use Darts_DNN for data mapped to hg38? #8

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
evachengzi opened this issue Jul 12, 2019 · 7 comments
Open

How can I use Darts_DNN for data mapped to hg38? #8

evachengzi opened this issue Jul 12, 2019 · 7 comments

Comments

@evachengzi
Copy link

No description provided.

@zj-zhang
Copy link
Collaborator

Hi @evachengzi
Because the sequence features are built for hg19, the exon coordinates should map to the hg19 genome assembly. Please convert the hg38 coordinates in DARTS_BHT output file to hg19 using UCSC liftOver:
https://bioconda.github.io/recipes/ucsc-liftover/README.html

The detailed steps are below:

  • run Darts_BHT on the bam files to generate the "darts_bht.flat.out.txt" output.
  • use liftOver to convert coordinates in the above output file to hg19, e.g. the output is "darts_bht.converted_hg19.txt"
  • run Darts_DNN on the converted hg19 file "darts_bht.converted_hg19.txt".

@evachengzi
Copy link
Author
evachengzi commented Jul 22, 2019

Thank you for your detailed explanation!
image
I have converted the exon coordinates in the "ID" column of "darts_bht.flat.out.txt" and run Darts_DNN on the converted hg19 file. But I still get the index error like the picture show. It seems like that there are only a few coordinates not in the resource of Darts_DNN?
And I found some coordinates that not mapped to hg19 when I convert them from hg38 to hg19. Liftover divided these coordinates to another file named "unmapped.bed". How would I deal with these unmapped coordinates in splicing ID ?
Looking forward to your reply!

@YangpingLi
Copy link

Do we have a sequence features built for hg38 now? This should be very useful!

Thanks!

@zj-zhang
Copy link
Collaborator

Ping @zcpan on this issue

@zcpan
Copy link
zcpan commented Oct 23, 2020

Hi @YangpingLi ,
Thanks for your interest in DARTS. We currently do not have a plan to make hg38-version feature sets, but it is definitely on our to-do list. We will let you know once we have the feature sets. Thanks!

@yangjywhu
Copy link

I have converted the exon coordinates in the "ID" column of "darts_bht.flat.out.txt" and run Darts_DNN on the converted hg19 file. But I still get the index error like the picture show. It seems like that there are only a few coordinates not in the resource of Darts_DNN?

@evachengzi I don't know how to do it :( How did you convert the coordinates? Thank you! Look forward to your reply.

@astulaaa
Copy link
astulaaa commented Sep 19, 2022

Hello, since a couple of years passed, are there any updates on accommodating hg38-aligned genomes or we still need to use lift-over to convert flat file to hg19 coordinates? https://darts-dnn.readthedocs.io/en/latest/ still mentions conversion.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants
0