Hey HN Community!<p>Just built a service I thought y'all might find interesting.<p>Rather than using HTML structure to scrape, I'm using GPT-3 to actually understand the text and pull information out.<p>You just ask for things you want from a site, for example a news article about fundraising like this:
"The name of the company that raised funding"
"Founders or CEO of the company"
"URL of the company that raised funding"
"A short description of the company"
"A list of investors"<p>And get back the needed data in a JSON object! It's been pretty fun to play with and I thought some of you might find interesting use cases for it.<p>I think it would also be very effective at parsing PDFs (same as text).