TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Scaling Judge-Time Compute with Leonard Tang – Weaviate Podcast

1 点作者 CShorten6 天前
Scaling Judge-Time Compute!<p>I am SUPER EXCITED to publish the 121st episode of the Weaviate Podcast featuring Leonard Tang, Co-Founder of Haize Labs!<p>Evals are one of the hottest topics out there for people building AI systems. Leonard is absolutely at the cutting edge of this, and I learned so much from our chat!<p>The podcast covers tons of interesting nuggets around how LLM-as-Judge &#x2F; Reward Model systems are evolving. Ideas such as UX for Evals, Contrastive Evaluations, Judge Ensembles, Debate Judges, Curating Eval Sets and Adversarial Testing, and of course... Scaling Judge-Time Compute!! --<p>I highly recommend checking out their new library, `Verdict`, a declarative framework for specifying and executing compound LLM-as-Judge systems.<p>I hope you find the podcast useful! As always, more than happy to discuss these ideas further with you!<p>YouTube: https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=KFrKLkJzNDQ<p>Spotify: https:&#x2F;&#x2F;creators.spotify.com&#x2F;pod&#x2F;show&#x2F;weaviate&#x2F;episodes&#x2F;Haize-Labs-with-Leonard-Tang---Weaviate-Podcast-121-e32mts3

暂无评论

暂无评论