Both o1 (mini/preview) and Claude 3.5 Sonnet seem to be popular among devs, but opinions are divided and all over the place. From my experience, both have their strengths and weaknesses, and I find myself switching between them.

If you’ve used either — or ideally both — I’d love to hear your insights. I feel answers to the following questions will provide some useful context when you respond:

- What are the strengths & weaknesses of each, in your experience?

- Any tips/tricks or prompting techniques you use to get the most from these models?

- How do you typically use them? (Native apps like ChatGPT and Claude, or via Cursor, GitHub Copilot, etc.?)

- What programming language(s) do you primarily use them with?

Hopefully this thread provides a useful summary and some additional tips for readers.

(I’ll start with mine in the comments.)
o1 for collaborating on design docs; o1 for the overall structure and for breaking it into tasks, sorted however you prefer; Sonnet or o1 for executing each small task.

o1 is higher quality, more nuanced, and has deeper understanding. The biggest downsides right now are the significantly higher latency (both because of the thinking time and because continue.dev doesn't support o1 streaming currently, so you're waiting until the whole response is done) and the higher cost.

In terms of tools: either VS Code with continue.dev / Cline, or Cursor (rough continue.dev config sketch below).

Languages: Node.js / JavaScript, and lately C# / .NET / Unity.
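For reference, this is roughly what my continue.dev config.json looks like with both models registered. I'm writing it from memory, so treat the exact keys and placeholder values as an approximation and check the continue.dev docs:

    {
      "models": [
        {
          "title": "Claude 3.5 Sonnet",
          "provider": "anthropic",
          "model": "claude-3-5-sonnet-20241022",
          "apiKey": "<ANTHROPIC_API_KEY>"
        },
        {
          "title": "o1-preview",
          "provider": "openai",
          "model": "o1-preview",
          "apiKey": "<OPENAI_API_KEY>"
        }
      ]
    }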
I prefer o1. I mostly use it as a knowledge system. Don't really care for the automatic code generation nonsense. Unless I'm really tired and the task is very simple, in which case I might decide to write a paragraph of text instead of 30 lines of Python.
My experience is that when ChatGPT fails, Claude fails too.
On some advanced coding tasks, I find ChatGPT's depth of reasoning ability to be better.
o1:

- better when the response has to address many subgoals coherently

- usually will not undo bugfix progress made earlier in the conversation, whereas with Claude, in extremely long conversations I have noticed it letting bugs it had already fixed get reintroduced much later

Claude:

- image inputs are genuinely complementary for debugging, especially anything visual (e.g. debugging why a GUI framework rendered your UI in an unexpected way: just include a screenshot)

- surprisingly good at taking descriptions of algorithmic or mathematical procedures and producing captioned SVG illustrations, then taking screenshots of those SVGs plus user feedback to improve the next version

- more recent knowledge cutoff, so it's generally somewhat less likely to deny that newer APIs/things exist (e.g. o1 told me tokenizer.apply_chat_template and meta-llama/Llama-3.2-1B-Instruct both did not exist and removed them from the code I was feeding it; quick sketch of that call below)
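For anyone curious, this is roughly the call o1 kept deleting. A minimal sketch using the standard transformers API; the model name and message contents are just what I happened to be working with:

    from transformers import AutoTokenizer

    # Load the tokenizer for the instruct model o1 insisted didn't exist
    # (gated repo: needs a Hugging Face token with Llama access)
    tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-3.2-1B-Instruct")

    messages = [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Summarize why chat templates matter."},
    ]

    # Render the chat into the model's prompt format without tokenizing
    prompt = tokenizer.apply_chat_template(
        messages,
        tokenize=False,
        add_generation_prompt=True,
    )
    print(prompt)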
My notes:

- Sonnet 3.5 seems good at code generation and o1-preview seems good at debugging

- Sonnet 3.5 struggles with long contexts, whereas o1-preview seems good at identifying interdependencies between files in a code repo when answering complex questions

- Breaking the problem into small steps seems to yield better results with Sonnet

- I'm using them primarily in Cursor/GH Copilot and with Python
I like aider with claude-3-5-sonnet-20241022; haven't tried it with o1, though.

Also, https://aider.chat/docs/scripting.html offers some nice possibilities (rough sketch below).
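Roughly, the Python flavour of that scripting page looks like this. I'm writing it from memory, so double-check the exact names against the docs:

    from aider.coders import Coder
    from aider.models import Model

    # Files aider is allowed to edit
    fnames = ["hello.py"]

    model = Model("claude-3-5-sonnet-20241022")

    # Wire up a coder object to the model and files
    coder = Coder.create(main_model=model, fnames=fnames)

    # Run one instruction; aider edits the listed files
    coder.run("add a docstring to the main function")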
Started a small project to compare AI IDEs:

https://github.com/StephanSchmidt/ai-coding-comparison/

(no comparison there yet, just some code to play around with)
o1 if you're going to write full specs and not provide any context.

Sonnet 3.5 if you can provide context (e.g. with Cursor).

gpt-4o for UI design. Also for solving screenshots of interviews.
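If you want to do the screenshot trick through the API rather than the ChatGPT UI, a minimal sketch with the official openai Python SDK; the file name and prompt are just placeholders:

    import base64
    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment

    # Encode a local screenshot as a data URL
    with open("interview_question.png", "rb") as f:
        b64 = base64.b64encode(f.read()).decode()

    response = client.chat.completions.create(
        model="gpt-4o",
        messages=[
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": "Walk me through how to solve the problem in this screenshot."},
                    {"type": "image_url", "image_url": {"url": f"data:image/png;base64,{b64}"}},
                ],
            }
        ],
    )
    print(response.choices[0].message.content)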