TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: Recommendations for ETL frameworks that are NOT Python-native

1 点作者 dirtbag__dad超过 2 年前

2 条评论

techdragon超过 2 年前
Not that I can vouch for this as a “good” choice, but you can always use the old school original ETL of any operating system + small tools (awk&#x2F;sed&#x2F;jq&#x2F;grep&#x2F;… etc) + shell scripts (sh&#x2F;bash&#x2F;zsh&#x2F;etc…) and optionally make files to help if the way they work is a good fit for how you process the data.<p>Not being snarky this is genuinely a potential good solution in some situations <a href="https:&#x2F;&#x2F;adamdrake.com&#x2F;command-line-tools-can-be-235x-faster-than-your-hadoop-cluster.html" rel="nofollow">https:&#x2F;&#x2F;adamdrake.com&#x2F;command-line-tools-can-be-235x-faster-...</a>
iknownothow超过 2 年前
Airflow but use KubernetesPodOperator so you can run your ETLs in docker containers or pods within k8s (any language you want). You may need to write a few lines of dumb python code to build a DAG of KubernetesPodOperators but the actual work is done within containers.
评论 #33146713 未加载