TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Oxen.ai: Fast Unstructured Data Version Control

177 pointsby sbt567over 2 years ago

15 comments

Zee2over 2 years ago
Oh man, if this could plug into git and be a LFS replacement, that would be awesome. I work in a field where folks run into situations where they think they need LFS, and rarely does it work out well. If someone can figure out an ergonomic and durable LFS-like blob versioning system that can align with git histories, that would be incredible.
评论 #34876209 未加载
评论 #34884979 未加载
评论 #34875020 未加载
bangaover 2 years ago
How does this compare with other systems, like DVC (<a href="https:&#x2F;&#x2F;dvc.org&#x2F;" rel="nofollow">https:&#x2F;&#x2F;dvc.org&#x2F;</a>) for example?
评论 #34876184 未加载
评论 #34875537 未加载
评论 #34876259 未加载
JamesHinnekover 2 years ago
The comparison with DVC is biased <a href="https:&#x2F;&#x2F;github.com&#x2F;Oxen-AI&#x2F;oxen-release&#x2F;blob&#x2F;main&#x2F;Performance.md">https:&#x2F;&#x2F;github.com&#x2F;Oxen-AI&#x2F;oxen-release&#x2F;blob&#x2F;main&#x2F;Performanc...</a><p>I&#x27;d nowhere near the same performance with oxen. The analysis is very biased to help Oxen. I wish people had more integrity before trying so hard to push a half-baked product into the market.
评论 #34878955 未加载
评论 #34878811 未加载
keyleover 2 years ago
Link to the actual project source <a href="https:&#x2F;&#x2F;github.com&#x2F;Oxen-AI&#x2F;Oxen">https:&#x2F;&#x2F;github.com&#x2F;Oxen-AI&#x2F;Oxen</a>
rajataryaover 2 years ago
Great to see more people in this space! We are the authors of XetHub (posted in Dec ‘22, ShowHN: <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=33969908" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=33969908</a>) and also think a git-like workflow is perfect for ML dataset management, except that we actually integrate with git (like LFS). &lt;A quick benchmark suggests we are 2x your published performance!&gt;
mmqover 2 years ago
On your github org, twitter link is pointing to the wrong handle:<p>@oxen_ao -&gt; @oxen_ai
评论 #34880373 未加载
评论 #34880995 未加载
aldanorover 2 years ago
Being realistic here, 3rd party provider for data handling will be a no-go for many firms, for infosec reasons. Whereas a hub with no ui might also be a no-go for convenience reasons. I understand that oxenhub is a way to monetise the project but is there a self-hosted &#x27;enterprise&#x27; version of that anywhere in the plans?
TnS-hunover 2 years ago
Any plans for adding exclusive locking and option to delete old versions of a file? These are really important if working with unmergeable, large files.
sbt567over 2 years ago
Web hub (similar to GitHub): <a href="https:&#x2F;&#x2F;www.oxen.ai&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.oxen.ai&#x2F;</a>
zachthewfover 2 years ago
What are the differences between this and DVC?
评论 #34876167 未加载
snthpyover 2 years ago
I see there are already a bunch of questions about how this compares to other tools like DVC, dolt, pachyderm.io and LFS? I would just like to add one to that list:<p>How does this compare to lakeFS?
jiangplusover 2 years ago
How does it compare to dolt?<p><a href="https:&#x2F;&#x2F;github.com&#x2F;dolthub&#x2F;dolt">https:&#x2F;&#x2F;github.com&#x2F;dolthub&#x2F;dolt</a>
评论 #34877085 未加载
JamesHinnekover 2 years ago
Please implement account deletion, you are violating people&#x27;s privacy, GDPR and this is a dark pattern.
评论 #34880352 未加载
verdvermover 2 years ago
How does this compare to Pachyderm.io?
stvnbnover 2 years ago
why not just use git?
评论 #34880082 未加载
评论 #34880065 未加载