TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Show HN: I made an Open Source Personal Data Pipeline

5 pointsby joshcanhelp10 months ago
Hi there!<p>I&#x27;ve been working with data in one form or another, professionally, for about 5 years. I&#x27;ve been thinking about my own personal data and how it&#x27;s used for at least twice that long. I&#x27;ve been sort of building something in my head for a while that solves my own problem and, in the beginning of this year, I found the opportunity to spend some time building it out.<p>I&#x27;ll leave the detailed explanation to the blog post but, in short, I built what amounts to an API crawler combined with a data processor to help you download your personal data from 3rd party services and work with it using relatively simple YAML recipes. It&#x27;s using DuckDB under the hood, which is quite impressive at turning unstructured JSON into queryable form. The goal is making your personal cloud data available locally for backup, exploration, and combination.<p>The tool works end-to-end for limited APIs and use cases right now. I&#x27;m working on solving a few of my own problems with it (events added to Obsidian daily notes, workout summaries across multiple trackers, mini-CRM linking events and emails to notes) and would be happy to help folks get this up and running. I have links to Discord and Substack (bottom of the post) if you want to lurk and see what happens and I welcome any and all contributions you are motivated to make!<p>Thanks for checking it out!

2 comments

joshcanhelp9 months ago
More discussion here: <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=41183795">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=41183795</a>
justGtheOG10 months ago
Super cool!