TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Show HN: Tile.run – Extract structured data from any document via API

11 pointsby ntkris7 months ago
Hey HN,<p>Today, we’re launching tile.run, an API that extracts structured data from unstructured documents (PDF, images, text) with support for custom schemas.<p>The Problem: Extracting data out of unstructured documents is surprisingly hard. We built tile.run while solving this for our product Kili (automation for invoicing&#x2F;reconciliation). We found that getting to accuracy that is reliable enough for automation is challenging. Dense documents (e.g., lots of tables or line items) are even harder, and these are the most valuable to automate. After talking to other teams and developers, we found many other teams were after similar solutions.<p>Key Features:<p>- Multiple formats: PDF, JPEG, PNG, TIFF, plain text<p>- Custom schema support with nested objects&#x2F;arrays<p>- Specialized in dense documents with tables<p>- Self-serve API - start extracting in minutes<p>Technical Details:<p>- REST API with simple JSON responses<p>- Robust error handling and validation<p>Coming Soon:<p>- Improved accuracy<p>- More file formats<p>- Self-hosting options<p>- Zero data retention mode<p>Links:<p>- Landing page: <a href="https:&#x2F;&#x2F;tile.run" rel="nofollow">https:&#x2F;&#x2F;tile.run</a><p>- Documentation: <a href="https:&#x2F;&#x2F;tile.run&#x2F;docs" rel="nofollow">https:&#x2F;&#x2F;tile.run&#x2F;docs</a><p>I appreciate there have been a bunch of launches in this area recently, so wanted to address that head on as well:<p>- Clearly this problem is very valuable to solve but requires significant effort<p>- There are many ways to approach the same problem. For example, tile.run targets technical teams whereas other teams are solving this for business teams or specific functions (e.g. ETL).<p>We&#x27;re excited to hear your feedback on the product.

2 comments

rco87867 months ago
&gt; We found that getting to accuracy that is reliable enough for automation is challenging.<p>This is in the problem description of your pitch, and leads me to believe that tile.run has been solving this problem. Is that right?<p>&gt; Coming Soon:<p>&gt; - Improved accuracy<p>Can you expand more?<p>I have a large need for this sort of tooling, but accuracy is my primary concern.
评论 #42076343 未加载
namanyayg7 months ago
Offtopic but I&#x27;m so confused, how and why are there so many players in this space? Who even are the customers?
评论 #42075857 未加载
评论 #42095996 未加载
评论 #42076105 未加载