TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: OpenAI models vs. Gemini 2.5 Pro for coding and swe

4 点作者 endorphine18 天前
In your experience, which of the two models (all of OpenAI vs Gemini 2.5 Pro) are better for having as assistants to ask SWE&#x2F;software systems related questions and doing long and complex reasoning?<p>I&#x27;m debating whether there&#x27;s any point in paying for ChatGPT vs. paying (or even using the free version) of Gemini 2.5 Pro.<p>I have the feeling that most HNers prefer the latter, however in livebench I think OpenAI surpasses Gemini for coding.

4 条评论

JeduDev17 天前
I&#x27;ve been using Gemini 2.5 Pro, Claude 3.7 Sonnet, and GPT-4.1 recently and here are my thoughts.<p>Regarding context windows, Gemini currently offers 1M tokens (reportedly increasing to 2M soon), GPT-4.1 also handles a large window of 1m tokens, and Claude provides 200k. In my experience testing them with large code files (around 3-4k lines), I found Gemini 2.5 Pro and Claude 3.7 Sonnet performed quite similarly, both handling the large context well and providing good solutions.<p>However, my impression was that GPT-4.1 didn&#x27;t perform quite as well, While GPT-4.1 is certainly capable, I feel Gemini has a slight edge in this area right now. Based on this, I&#x27;d lean towards using Gemini 2.5 Pro for extremely large contexts needing high-quality results, GPT-4.1 for backend logic, and found Claude 3.7 particularly effective for UI interface tasks.
TheKelsbee18 天前
I&#x27;m not sure its easy to say one is better than the other. I&#x27;ve used ChatGPT pro, it&#x27;s good. I&#x27;ve also use Gemini, and it&#x27;s also good. Claude is surprisingly good as well. And I&#x27;ve recently been using Q-cli, which was extremely easy to get integrated into my Neovim&#x2F;Tmux workflow.<p>Purely from a code quality perspective, they&#x27;re all about the same, and they all generate code that rarely works for the first time. At least from my experience, and highly depending on language. For instance, Q-cli with Rust seems to generate better output for me than Gemini with Rust. And ChatGPT with JS gives me way better code than Claude with JS.<p>I honestly think that currently in the market, it&#x27;s not really a choice of which is better, but which is the right tool for workflow and language.
bn-l18 天前
It’s tricky. o3 is better (usually) but much much lazier IME. You probably have to pay for pro.
codingwagie16 天前
O3 is far ahead of the competition.
评论 #43796123 未加载