首页 24小时热榜最新最佳问答展示工作

科技回声

基于 Next.js 构建的科技新闻平台，提供全球科技新闻和讨论内容。

首页

首页最新最佳问答展示工作

资源链接

HackerNews API 原版 HackerNews Next.js

© 2025 科技回声. 版权所有。

Unpacking the HF in RLHF

4 点作者 jbcranshaw大约 2 年前

1 comment

jbcranshaw大约 2 年前

Some observations on a few ways different people actually gather feedback from humans in practice to improve LLMs. Sure I've missed some here, so let me know.

评论 #35087174 未加载