TE
TechEcho
Home
24h Top
Newest
Best
Ask
Show
Jobs
English
GitHub
Twitter
Back to Profile
Submissions by zone411
1
Show HN: LLM Deceptiveness and Gullibility Benchmark
7 points
by
zone411
7 months ago
1 comment
2
LLM Confabulation (Hallucination) Leaderboard
6 points
by
zone411
7 months ago
no comments
3
O1-preview and o1-mini results on NYT Connections
2 points
by
zone411
8 months ago
1 comment
← Previous
Next →