Show HN: Tile.run – Extract structured data from any document via API

11 points by ntkris 7 months ago
Hey HN,

Today, we're launching tile.run, an API that extracts structured data from unstructured documents (PDF, images, text) with support for custom schemas.

The Problem: Extracting data out of unstructured documents is surprisingly hard. We built tile.run while solving this for our product Kili (automation for invoicing/reconciliation). We found that getting to accuracy that is reliable enough for automation is challenging. Dense documents (e.g., lots of tables or line items) are even harder, and these are the most valuable to automate. After talking to other teams and developers, we found many of them were after similar solutions.

Key Features:

- Multiple formats: PDF, JPEG, PNG, TIFF, plain text
- Custom schema support with nested objects/arrays
- Specialized in dense documents with tables
- Self-serve API - start extracting in minutes

Technical Details:

- REST API with simple JSON responses
- Robust error handling and validation

Coming Soon:

- Improved accuracy
- More file formats
- Self-hosting options
- Zero data retention mode

Links:

- Landing page: https://tile.run
- Documentation: https://tile.run/docs

I appreciate there have been a bunch of launches in this area recently, so wanted to address that head on as well:

- Clearly this problem is very valuable to solve but requires significant effort
- There are many ways to approach the same problem. For example, tile.run targets technical teams whereas other teams are solving this for business teams or specific functions (e.g. ETL).

We're excited to hear your feedback on the product.
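To make "custom schema with nested objects/arrays" concrete, here is a minimal Python sketch of what such a schema and a client-side conformance check could look like. Everything in it is an assumption for illustration: the field names, the dict-based schema notation, and the `conforms` helper are hypothetical and are not taken from the tile.run documentation. It makes no API calls.

```python
# Hypothetical invoice schema. The notation (Python types as leaves, a dict for a
# nested object, a one-element list for an array of items) is an assumption made
# for this sketch, not tile.run's actual schema format.
INVOICE_SCHEMA = {
    "invoice_number": str,
    "total": float,
    "vendor": {"name": str, "tax_id": str},
    "line_items": [{"description": str, "quantity": int, "unit_price": float}],
}


def conforms(value, schema, path="$"):
    """Return a list of mismatch messages; an empty list means value fits schema."""
    errors = []
    if isinstance(schema, dict):
        # Nested object: every schema key must be present and match recursively.
        if not isinstance(value, dict):
            return [f"{path}: expected object"]
        for key, sub_schema in schema.items():
            if key not in value:
                errors.append(f"{path}.{key}: missing")
            else:
                errors.extend(conforms(value[key], sub_schema, f"{path}.{key}"))
    elif isinstance(schema, list):
        # Array: every element must match the single item schema.
        if not isinstance(value, list):
            return [f"{path}: expected array"]
        for i, item in enumerate(value):
            errors.extend(conforms(item, schema[0], f"{path}[{i}]"))
    else:
        # Leaf: JSON numbers without a fraction parse as int, so accept int for float.
        expected = (int, float) if schema is float else schema
        # bool is a subclass of int in Python; reject it unless bool was requested.
        is_stray_bool = isinstance(value, bool) and schema is not bool
        if is_stray_bool or not isinstance(value, expected):
            errors.append(f"{path}: expected {schema.__name__}")
    return errors
```

A check like this would run on the JSON returned by an extraction call before the result is passed on to automation, so that missing or mistyped fields in dense documents (for example, a malformed line item) are caught rather than silently processed.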

2 comments

rco8786 7 months ago
> We found that getting to accuracy that is reliable enough for automation is challenging.

This is in the problem description of your pitch, and leads me to believe that tile.run has been solving this problem. Is that right?

> Coming Soon:
> - Improved accuracy

Can you expand more?

I have a large need for this sort of tooling, but accuracy is my primary concern.
namanyayg 7 months ago
Offtopic but I'm so confused, how and why are there so many players in this space? Who even are the customers?