TE
科技回声
首页
24小时热榜
最新
最佳
问答
展示
工作
中文
GitHub
Twitter
首页
LLM Hallucination Benchmark: R1, o1, o3-mini, Gemini 2.0 Flash Think Exp 01-21
17 点
作者
zone411
3 个月前
1 comment
jszymborski
3 个月前
Collapse
Some very odd choices in that first plot. Lower is better, but also the x-axis is inverted such that higher scores go towards the left.
评论 #43004660 未加载