Show HN: An open-source ELO benchmark for voice agents

8 points by joshi4, about 1 year ago

1 comment

cryogenicplanet, about 1 year ago
want to thank the op for sharing this; i threw this together in the last couple of days ramping out to the "steamroll" - we think one of the key problems in LLMs in general but esp voice is evals and wanted to have a good place to evaluate voice-to-voice systems. these systems can be end-to-end like openai or (asr+llm)->tts or asr->(llm+tts) or asr->llm->tts

we built an ELO benchmark very much in the style of LMSYS and will be releasing results every two weeks

source code here: https://github.com/thevoicecompany/bench.audio

will be adding proper contributing guide soon
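
For readers unfamiliar with how an ELO-style leaderboard turns head-to-head votes into a ranking, here is a minimal sketch of the standard Elo update. It is not taken from the bench.audio source: the function names, the starting rating of 1000, the K-factor of 32, and the system labels are all assumptions made for illustration, and the actual benchmark may compute ratings differently.

```python
from collections import defaultdict


def expected_score(r_a, r_b):
    """Probability that A beats B under the standard Elo logistic model."""
    return 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400.0))


def update_elo(ratings, a, b, outcome, k=32.0):
    """Update ratings in place after one comparison.

    outcome is 1.0 if a won, 0.0 if b won, 0.5 for a tie.
    k (assumed 32 here) controls how far a single result moves the ratings.
    """
    e_a = expected_score(ratings[a], ratings[b])
    delta = k * (outcome - e_a)
    ratings[a] += delta
    ratings[b] -= delta  # zero-sum: b loses exactly what a gains


def rate(battles, initial=1000.0, k=32.0):
    """Replay (a, b, outcome) tuples in order and return the final ratings."""
    ratings = defaultdict(lambda: initial)  # assumed starting rating
    for a, b, outcome in battles:
        update_elo(ratings, a, b, outcome, k)
    return dict(ratings)


if __name__ == "__main__":
    # Hypothetical labels for two of the pipeline shapes mentioned above.
    battles = [
        ("end_to_end", "asr->llm->tts", 1.0),
        ("asr->llm->tts", "end_to_end", 0.5),
    ]
    for name, rating in sorted(rate(battles).items(), key=lambda x: -x[1]):
        print(f"{name}: {rating:.1f}")
```

Because each update is zero-sum, the total rating across systems stays constant, and an upset (a low-rated system beating a high-rated one) moves ratings more than an expected win does.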