Show HN: Anton the Search Relevance Evaluator

13 points by pablomendes 9 months ago
Hey there HN,

I'm Pablo from Objective. We're working on an AI-native search platform. Today we're releasing Anton (https://www.objective.inc/anton), an API that you can use to evaluate relevance for any search system. We would love your feedback. Here's a quick demo by my co-founder Lance: https://youtu.be/geX8mwBGqIo

Why we created Anton: at every company where we've worked, it was very slow and costly to spot-check a new model, to thoroughly evaluate a release candidate, or to compare a few competing approaches for improving search going forward. In each of those cases, we would have loved to have Anton!

Today, in order to ship search that doesn't disappoint users, software engineers need to become experts in AI/ML, information retrieval, data engineering, and more. Folks on our team have worked for some of the largest companies out there (Apple, Google, YouTube, Amazon, Twitch, etc.), building products that you use every day. We've all had to build and stitch together similar components everywhere, and we're now building Objective to lift some of that burden for you. We want to make it super easy for every developer to build great search experiences into their app/site.

To build Anton, we spent a lot of time evaluating different LLMs, prompt engineering, fine-tuning, and really throwing the kitchen sink at the LLM-as-a-judge problem. We've discovered many ways in which LLMs make it hard to make systematic progress, and only a few that didn't make us question our sanity. That work culminated in the creation of Anton. We hope this will provide you with a cost-effective way to get near-human-quality relevance judgements, so that you can delight your users every day with search that actually understands them.

Can't wait to see what you'll build with it. And as always, we would love feedback!

1 comment

peekypeeky008 9 months ago
Looks great