LLM Hallucination Benchmark: R1, o1, o3-mini, Gemini 2.0 Flash Think Exp 01-21
17 points by zone411 3 months ago | 1 comment
jszymborski 3 months ago
Some very odd choices in that first plot. Lower is better, but also the x-axis is inverted such that higher scores go towards the left.