TechEcho

DeepVariant: Highly Accurate Genomes with Deep Neural Networks

69 points by tsaprailis over 7 years ago

3 comments

inciampati over 7 years ago
I implemented a similar model based around the amazing out-of-core linear learner Vowpal Wabbit. It did pretty well in the PrecisionFDA challenges despite being developed in two person-months. It has the benefit of using fantastically less compute to train than something like DeepVariant. (https://github.com/ekg/hhga)

The approach is the right one for small genetic variants. But it will be hard to handle more complex kinds of variation without adapting how training examples are synthesized from the alignments.

I think the field should cool it on calling the results of something like DeepVariant "genomes". These are genotypes, not fully sequenced and reconstructed genomes. The evaluations are typically on easy regions, and we have no reason to believe that those are the only ones that are important. One important tool to dig into this is syndip, a synthetic diploid where the full haplotypes are known. It is a mixture of two haploid human genomes that were de novo sequenced with PacBio technology. (https://www.biorxiv.org/content/early/2017/11/22/223297) For the curious, such haploid human genomes only exist in molar pregnancies, so even this isn't ideal, but it is maybe the best resource we have at present.
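The genotype-versus-genome distinction in the comment above can be illustrated with a minimal Python sketch. Everything in it is a toy assumption for illustration: the three-site haplotype strings, the `unphased_genotypes` helper, and the two "individuals" are hypothetical and do not come from DeepVariant, hhga, or syndip. It only shows that two different haplotype pairs can collapse to identical per-site genotypes, which is why known full haplotypes are useful for evaluation.

```python
from typing import List, Tuple

# Hypothetical toy data: each haplotype is a string of alleles at three
# sites (0 = reference allele, 1 = alternate allele).
Haplotype = str
Diploid = Tuple[Haplotype, Haplotype]


def unphased_genotypes(diploid: Diploid) -> List[Tuple[str, ...]]:
    """Collapse a pair of haplotypes to per-site unordered genotypes."""
    hap_a, hap_b = diploid
    # Sorting each allele pair discards which haplotype carried which allele.
    return [tuple(sorted(pair)) for pair in zip(hap_a, hap_b)]


# Two individuals that differ at the haplotype level...
individual_1: Diploid = ("010", "101")
individual_2: Diploid = ("011", "100")

# ...but are indistinguishable once only genotypes are reported.
same_genotypes = unphased_genotypes(individual_1) == unphased_genotypes(individual_2)
print(same_genotypes)
```

Both toy individuals are heterozygous at every site, so a genotype-only comparison cannot tell them apart even though their haplotypes differ.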
dcdanko over 7 years ago
The figures in this paper use pretty deceptive scales. To be clear, DeepVariant is 0.5% better than a tool built in ~2010 (GATK), on DeepVariant's best test.

GATK is still the standard, not because better variant callers don't exist, but because it's more important that everyone uses the same tool for comparisons between studies.
j7ake over 7 years ago
Very nice, but do you think neural networks will also be able to interpret the function of these genotypes?