Hi! I created a universal data API that uses headless browsers and GPT to extract any data from the web in JSON format. I started this project because I needed some API to do data enrichment to get company data (headcount, investment rounds, etc.). Once I did the first version, I quickly realized that there can be many use cases for such a tool: data enrichment, web scraping, data validation, etc.
This is pretty cool, it is able to parse data out of a random pricing table somewhere in the page.
It does seem to just make up data it if is not found in the page (probably expected with LLM's), I wonder if you can reduce that with some prompting, or maybe verify the data is actually present?
Your schema page docs is broken <a href="https://singleapi.co/docs/schema" rel="nofollow noreferrer">https://singleapi.co/docs/schema</a>
I also published a simplified version on GitHub, so you can try to self-host it. I'm really excited to see all the possible use cases for such a tool besides web scraping or data enrichment.<p><a href="https://github.com/semanser/JsonGenius">https://github.com/semanser/JsonGenius</a>