Ask HN: Recommendations for ETL frameworks that are NOT Python-native

1 point by dirtbag__dad over 2 years ago

2 comments

techdragon over 2 years ago
I can't vouch for this as a "good" choice, but you can always use the old-school original ETL: any operating system plus small tools (awk/sed/jq/grep, etc.) plus shell scripts (sh/bash/zsh, etc.), and optionally Makefiles if the way they work is a good fit for how you process the data.

Not being snarky; this is genuinely a good solution in some situations: https://adamdrake.com/command-line-tools-can-be-235x-faster-than-your-hadoop-cluster.html
iknownothow over 2 years ago
Airflow, but use KubernetesPodOperator so you can run your ETLs in Docker containers or pods within k8s (any language you want). You may need to write a few lines of dumb Python code to build a DAG of KubernetesPodOperators, but the actual work is done within containers.
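
For illustration, the "few lines of dumb Python" might look roughly like the sketch below. It is not from the thread: the image names, commands, namespace, and DAG id are all hypothetical placeholders, and the import paths and the `schedule` argument assume a reasonably recent Airflow 2.x with the `cncf.kubernetes` provider installed (older versions use different paths).

```python
from datetime import datetime

# Hypothetical pipeline steps: image names, commands, and the namespace are
# placeholders. Each image can be built from any language.
NAMESPACE = "etl"
STEPS = [
    {"task_id": "extract", "image": "registry.example.com/etl-extract:1.0", "cmds": ["/app/extract"]},
    {"task_id": "transform", "image": "registry.example.com/etl-transform:1.0", "cmds": ["/app/transform"]},
    {"task_id": "load", "image": "registry.example.com/etl-load:1.0", "cmds": ["/app/load"]},
]


def build_dag():
    # Airflow is imported here so the step list above stays plain data.
    # Assumed import paths; they differ between Airflow and provider versions.
    from airflow import DAG
    from airflow.providers.cncf.kubernetes.operators.pod import KubernetesPodOperator

    with DAG(
        dag_id="container_etl",
        start_date=datetime(2024, 1, 1),
        schedule=None,  # manual triggers only; assumes Airflow 2.4 or later
        catchup=False,
    ) as dag:
        tasks = [
            KubernetesPodOperator(
                task_id=step["task_id"],
                name=step["task_id"],
                namespace=NAMESPACE,
                image=step["image"],
                cmds=step["cmds"],
                get_logs=True,
            )
            for step in STEPS
        ]
        # Chain the pods in order: extract >> transform >> load.
        for upstream, downstream in zip(tasks, tasks[1:]):
            upstream >> downstream
    return dag
```

In an actual DAG file you would also assign the result at module level (for example `dag = build_dag()`) so the scheduler can discover it. The Python here only wires containers together; the extract, transform, and load logic lives in the images.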