Ask HN: Is it a good idea to use AWS Lambda for ETL Pipelines?

4 points by 91dylwyn over 3 years ago
We are working on an open-source ETL tool (Typhoon data orchestrator) and we are thinking about using AWS Lambda as the backend. The benefit is that it has a built-in scheduler, so we don't need to build our own. We can also spawn multiple Lambdas to scale concurrency.

Do you think this would be a sensible option? Would the 15-minute running time limit be a deal breaker?

2 comments

ignorantguy over 3 years ago
We use Lambda in our production environment for ETL, and it works really well. In our case, we modeled the code to account for the 15-minute window. Hope that helps.
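For anyone wondering what "modeling the code for the window" could look like, here is a minimal sketch, not the setup described above. It assumes the work splits into independent items, and it stops early when the invocation is close to its timeout so the leftover items can be handed to a follow-up run. The names `process_within_budget` and `safety_margin_ms` and the `{"items": [...]}` event shape are made up for illustration; `context.get_remaining_time_in_millis()` is the method the Lambda Python runtime exposes on the context object.

```python
from typing import Any, Callable, Iterable, List, Tuple


def process_within_budget(
    items: Iterable[Any],
    handle: Callable[[Any], None],
    remaining_ms: Callable[[], int],
    safety_margin_ms: int = 30_000,
) -> Tuple[int, List[Any]]:
    """Handle items until the time budget is nearly spent.

    Returns (number of items handled, items left over for a later run).
    """
    pending = list(items)
    done = 0
    for item in pending:
        # Stop before the platform kills the invocation mid-item.
        if remaining_ms() <= safety_margin_ms:
            break
        handle(item)
        done += 1
    return done, pending[done:]


def lambda_handler(event, context):
    # Hypothetical event shape: {"items": [...]}.
    done, leftover = process_within_budget(
        event.get("items", []),
        handle=print,  # stand-in for the real transform/load step
        remaining_ms=context.get_remaining_time_in_millis,
    )
    # The caller (or a Step Functions loop) can re-invoke with the leftovers.
    return {"processed": done, "remaining_items": leftover}
```

The safety margin has to be larger than the slowest single item, otherwise an item can still be cut off partway through.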
vlahmot over 3 years ago
You can always use Step Functions or some janky recursive calls to work around the 15-minute timeout.

I would recommend using Docker containers and writing it in a way that lets you swap out Lambda/ECS/k8s/etc. as the backend infrastructure. That way you can punt on doing any fancy work to try and break the jobs up across Lambdas under that 15-minute constraint.
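A rough sketch of the swappable-backend idea, assuming the job takes its parameters as a plain dict: keep the job body free of any platform types and put a thin adapter in front of it for each backend. Everything here (`run_job`, the `rows` and `job` keys, the JSON-argument convention for containers) is a hypothetical layout, not an established pattern from any of these platforms.

```python
import json
import sys
from typing import Any, Dict, List, Optional


def run_job(params: Dict[str, Any]) -> Dict[str, Any]:
    """Backend-agnostic job body. Placeholder logic: count the rows it was given."""
    rows = params.get("rows", [])
    return {"job": params.get("job", "unnamed"), "rows_processed": len(rows)}


def lambda_handler(event, context):
    """Adapter for AWS Lambda: the event dict is passed straight through."""
    return run_job(event)


def main(argv: Optional[List[str]] = None) -> int:
    """Adapter for a container (ECS, k8s): parameters arrive as one JSON argument."""
    args = sys.argv[1:] if argv is None else argv
    params = json.loads(args[0]) if args else {}
    print(json.dumps(run_job(params)))
    return 0
```

With this split, the same image could be pointed at `lambda_handler` on Lambda or have its entrypoint call `main()` on ECS/k8s, so a job that outgrows 15 minutes moves backends without touching `run_job`.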