TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Show HN: An open-source ELO benchmark for voice agents

8 pointsby joshi4about 1 year ago

1 comment

cryogenicplanetabout 1 year ago
want to thank the op for sharing this; i threw this together in the last couple of days ramping out to the &quot;steamroll&quot; - we think one of the key problems in LLMs in general but esp voice is evals and wanted to have a good place to evaluate voice-to-voice systems. these systems can be end-to-end like openai or (asr+llm)-&gt;tts or asr-&gt;(llm+tts) or asr-&gt;llm-&gt;tts<p>we built an ELO benchmark very much in the style of LMSYS and will be releasing results every two weeks<p>source code here: <a href="https:&#x2F;&#x2F;github.com&#x2F;thevoicecompany&#x2F;bench.audio">https:&#x2F;&#x2F;github.com&#x2F;thevoicecompany&#x2F;bench.audio</a><p>will be adding proper contributing guide soon