TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Show HN: Helix I/O – Ultrafast genomic search

27 点作者 nkrumm大约 11 年前

7 条评论

vsbuffalo大约 11 年前
This looks neat, but is the method open? Is there a command line version?<p>I&#x27;d argue that while open source is a desirable feature in most software, it&#x27;s a <i>necessary</i> feature in scientific software. When our results depend on running through software, that software can&#x27;t be a proprietary, closed-source black box.
评论 #7734616 未加载
nkrumm大约 11 年前
OP here. We&#x27;ve been working on faster bioinformatics algorithms for the past year – this is a demo of one application which is effectively a fast replacement for BLAST. Happy to answer any questions about tech or usage.
评论 #7734768 未加载
评论 #7734498 未加载
aroch大约 11 年前
We help use the NMPDR RAST servers and also run our own for sequence annotation. I&#x27;d have to check but if I&#x27;m remembering correctly our annotations fix errors (~%5) present in the NCBI databases and catches many more (something like ~20% of unknowns) for the organisms we work with. Problem is searching our indexes is super slow (comparatively) and not all in one place -- BLASTn&#x2F;p takes some fiddling.<p>Could we use our own corpus for this?<p>Ah, I see you have some problems with species identification...are you using old NCBI datasets?<p>(ps. If you&#x27;d like to talk, you can email me at the address in my profile)
评论 #7735123 未加载
Jemaclus大约 11 年前
This is very cool. I love data, all kinds of data. The more data we have available, the better.<p>But... what&#x27;s the use case for this? Are there people with genomic sequences just laying around with no idea of what they represent? Or is it a resource for students?<p>(Serious question.)
评论 #7734457 未加载
评论 #7734451 未加载
amuresan大约 11 年前
Hi and congratulations, this looks like an excellent tool for bioinformatics. Just out of curiosity, how can you mathematically guarantee constant time searches independent of the database size?
评论 #7735057 未加载
macarthy12大约 11 年前
What stack are you using? Are you using anything like elasticsearch or non-sql Dbs ?
elwell大约 11 年前
Would be a lot more fun if it had eukaryotes.
评论 #7735994 未加载