Powering CRISPR with AWS Lambda

110 points by sajithw over 9 years ago

12 comments

dankohn1 over 9 years ago
I know it's old hat on HN, but I just wanted to point out how close to science fiction this article is. This technique to edit genomes is only a decade old. This startup is able to run sub-second searches without requiring any of their own infrastructure.

It costs them less than $100 a month.

It was written by an intern.
phkahler over 9 years ago
While reading, I just kept wondering why this search needs to be in the cloud at all. Finding 20-byte strings in 3 GB can be done on a laptop very quickly.
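
For readers who want to picture the local search being described, here is a minimal sketch of a linear scan that reports every window within a given number of mismatches of a query string. The function name and the mismatch parameter are assumptions made for illustration, not anything taken from the article, and pure Python would be far too slow at 3 GB scale; this only shows the logic a compiled implementation would follow.

```python
def find_with_mismatches(genome: str, guide: str, max_mismatches: int = 0):
    """Return (position, mismatch_count) for each window close enough to guide."""
    hits = []
    k = len(guide)
    for start in range(len(genome) - k + 1):
        mismatches = 0
        for a, b in zip(genome[start:start + k], guide):
            if a != b:
                mismatches += 1
                if mismatches > max_mismatches:
                    break  # too many differences, give up on this window
        else:
            # The inner loop finished without breaking, so this window qualifies.
            hits.append((start, mismatches))
    return hits
```

Breaking out of a window as soon as the mismatch budget is exceeded keeps most windows cheap, since random sequence usually diverges within the first few bases.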
aleem over 9 years ago
AWS Lambda is great for inconsistent atomic workloads. However, I had a fairly disappointing experience with Lambda when I tested it just last week.

For example, you cannot send dynamic response headers using the AWS API Gateway (the complementary service to expose HTTP endpoints). In my case I wanted to change the MIME type depending on JSON vs JSONP response.

It's also not possible to connect Lambda directly to ElastiCache, and mostly you are expected to work with S3 or DynamoDB (Amazon's proprietary JSON store and what was mostly responsible for the data outage recently in US East). ElastiCache would allow easy persistence, which is why it's surprising it can't be connected to given that it's an AWS service (you can connect to it by creating an EC2 proxy, but that would defeat the purpose of a serverless architecture).

Some other oddities were sniffing the response body to set HTTP headers as opposed to just allowing your Lambda function to set the HTTP header directly, or parsing the JSON response as opposed to doing a regex match.
ac360 over 9 years ago
Hi Vineet,

I'm a huge fan of CRISPR. I've been following it closely since I heard Radiolab's podcast about it.

I'm also the founder of the JAWS framework, which is an open-source application framework built entirely on AWS Lambda and AWS API Gateway: https://github.com/jaws-framework/JAWS

I would LOVE to grab a coffee with you or anyone on your team some time, and chat about Lambda or CRISPR, or anything really :) I live in Oakland and my email address is austen[at]servant.co

Also, will you be at re:Invent? I'm doing a breakout session on JAWS and I'll be there all week.

Good luck to you!

Austen
taternuts over 9 years ago
I kind of forgot that while Lambda only supports Node, you can use it as a glorified wrapper to call your C++ code.
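
The wrapper pattern mentioned here amounts to a handler that shells out to an executable bundled with the deployment package. Below is a minimal sketch of that idea, written in Python rather than Node purely for illustration. The binary path `./bin/search` and the `query` field on the event are hypothetical names, not details from the article.

```python
import subprocess

# Hypothetical location of a compiled binary shipped alongside the handler.
BINARY_PATH = "./bin/search"


def run_native(args, binary=BINARY_PATH, timeout_s=30):
    """Run the bundled executable with the given arguments and return its stdout."""
    result = subprocess.run(
        [binary, *args],
        capture_output=True,  # collect stdout and stderr instead of inheriting them
        text=True,            # decode output as text
        timeout=timeout_s,    # avoid hanging past the function's time budget
        check=True,           # raise CalledProcessError on a non-zero exit code
    )
    return result.stdout


def handler(event, context):
    """Lambda-style entry point: forward the query to the native binary."""
    output = run_native([event["query"]])
    return {"result": output.strip()}
```

The handler stays thin on purpose: all of the real work happens in the native program, and the runtime language only marshals input and output.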
motoboi over 9 years ago
"Our old server infrastructure cost thousands of dollars each month just for server costs.

Using the new Lambda infrastructure, we pay for the number of Lambda invocations, the total duration of the requests, and the number of S3 requests. This comes out to $60/month for hundreds of thousands of CRISPR searches!"

Well, how much of that money did you spend on EBS storage for your copies of genome data?

EC2 instances could read from S3 directly, as Lambda does; maybe that could reduce the cost a lot.

Using S3-backed AMI instances could save a lot too.

But great work, nonetheless!
Gatsky over 9 years ago
Of note, the latest thing in reference genomes is representing them as a graph data structure, which importantly allows variation to be incorporated. Some of the newest methods for mapping short DNA fragments (that come out of the most common type of sequencers) take this approach. They use a genome index though, which takes a lot of computational effort to build beforehand.

Anyway, Benchling wants to avoid genome indexes from the sounds of it, in case users upload their own genomes. Having said that, if someone is doing multiple searches, it would quickly become more efficient to just index the genome. I would have thought most people seriously concerned about off-target CRISPR hits would be using high-quality reference genomes though.
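
To make the index-versus-scan trade-off concrete, here is a toy sketch of a k-mer index: it is built once by walking the sequence, after which each exact lookup is a dictionary access rather than a full scan. The function names are made up for illustration. Real genome indexes are far more compact and support inexact matching, so this only illustrates why repeated searches favour indexing.

```python
from collections import defaultdict


def build_kmer_index(genome: str, k: int):
    """Map every length-k substring to the list of positions where it starts."""
    index = defaultdict(list)
    for start in range(len(genome) - k + 1):
        index[genome[start:start + k]].append(start)
    return index


def lookup(index, kmer: str):
    """Return start positions of exact matches, or an empty list if none."""
    # .get avoids inserting an empty entry into the defaultdict on a miss.
    return index.get(kmer, [])
```

Building the index costs one pass over the genome plus memory proportional to its length, which is the up-front effort the comment refers to.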
netcraft over 9 years ago
I recently started looking harder at Lambda after realizing that you can use 1M requests / month for free indefinitely. I just worry about vendor lock-in with services like this: if for whatever reason you want to move away, it's a rewrite at best. If Amazon were to open-source the Lambda implementation, allowing me to run my services somewhere else with a config change, I'd probably buy into it completely and never move away...
JulianMorrison over 9 years ago
I wonder if it would be possible to go the other way. How close is CRISPR to a primitive of Turing complete computation?

Take it from s/xxx/yyy/ into being /bin/sed. And then run the search in wetware.
deegles over 9 years ago
They might be able to save a bit on costs by caching locally. Lambda instances can be reused if TPS is high enough. I think the limit is 500MB in the /tmp directory.
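
The local caching idea could look roughly like the sketch below: check the scratch directory first and only fetch when the file is missing, so a reused instance skips the download. The `fetch` callable is a hypothetical stand-in for whatever actually downloads the object (for example from S3), and `get_cached` is an illustrative name. The size limit is the one quoted in the comment and is not enforced here.

```python
import os

# Scratch directory mentioned in the comment; contents survive only while
# the same instance is reused.
CACHE_DIR = "/tmp"


def get_cached(key, fetch, cache_dir=CACHE_DIR):
    """Return the bytes for key, reading a local copy if one already exists."""
    path = os.path.join(cache_dir, key.replace("/", "_"))
    if os.path.exists(path):
        with open(path, "rb") as f:
            return f.read()
    data = fetch(key)  # hypothetical downloader supplied by the caller
    tmp_path = path + ".part"
    with open(tmp_path, "wb") as f:
        f.write(data)
    # Rename only after the write completes so a partial file is never read.
    os.replace(tmp_path, path)
    return data
```

On a cold start the cache is empty, so the first request still pays for the download; the saving only shows up on later requests served by the same instance.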
jewbear48 over 9 years ago
How are you getting such quick responses from S3? In our own testing using Java, it was taking over 500 ms just to initiate the connection with S3 from Lambda.
Sujan over 9 years ago
Did I miss it, or does the article not mention which server does the splitting and combining? Is this also done in Lambda?