TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: Good up-to-date resource for finding best current LLM for a given task?

1 点作者 monroewalker8 个月前
Is there a good reference for which models should be preferred for which tasks? As I understand it, there&#x27;s a decent amount of overlap in the capability of the top models right now (Claude 3, Claude 3.5, GPT4o, OpenAI o1, Gemini 1.5 Pro, Llama 3.1, etc.) but each have their own strengths and weaknesses as well as differences in the API pricing, context windows, usage privacy policies, etc. It&#x27;d be nice to know how each stack up against each other for specific use cases. Eg. writing -&gt; use Claude 3, coding -&gt; use Claude 3.5, STEM &#x2F; logic questions -&gt; OpenAI o1, huge context -&gt; Gemini 1.5 Pro, self-hosting &#x2F; running locally -&gt; Llama 3.1<p>I do see a lot of articles on Google from people just personally testing and comparing models based on their own criteria. I haven&#x27;t found anything yet though that seems to be both thorough and well maintained (given how often new models are released or updated).

1 comment

deichrenner8 个月前
I have been wondering the same and found <a href="https:&#x2F;&#x2F;www.leewayhertz.com&#x2F;comparison-of-llms&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.leewayhertz.com&#x2F;comparison-of-llms&#x2F;</a> and <a href="https:&#x2F;&#x2F;artificialanalysis.ai&#x2F;" rel="nofollow">https:&#x2F;&#x2F;artificialanalysis.ai&#x2F;</a>