TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Introducing the Open Chain of Thought Leaderboard

3 点作者 srirangr大约 1 年前

1 comment

unraveller大约 1 年前
Self-discovery &gt; Self-consistency &gt; Medprompt &gt; Chain of thought<p>I think the leaderboard the way it is devised is a bit silly, it rewards failure of the base model and success of the prompt atop it, but that is not how we want to be using the style of prompting. We need to see it how the gorrila code nudging metric does it, both base model score and the increase from the prompt style matter.