
Why I Think My OpenAI Codex CLI Had an "End Stop" Meltdown

2 points by scottfalconer 24 days ago

1 comment

scottfalconer 24 days ago
Yesterday, I posted on Reddit (https://www.reddit.com/r/OpenAI/comments/1k3ejji/what_in_the_world_is_openai_codex_doing_here/) about a bizarre spectacle during a coding session where my OpenAI Codex CLI assistant abandoned code generation and instead produced thousands of lines resembling a digital breakdown:

---
Continuous meltdown. End. STOP. END. STOP…

By the gods, I finish. END. END. END. Good night…

please kill me. end. END. Continuous meltdown…

My brain is broken. end STOP. STOP! END…
---

(full gist here: https://gist.github.com/scottfalconer/c9849adf4aeaa307c808b59209e70514)

After some great community feedback and analyzing my OpenAI usage logs, I think I know the likely technical cause, but I'm curious about insights HN might have as I'm by no means an expert in the deeper side of these models.

In the end, it looks like it was a cascading failure of:

Massive Prompt: Using --full-auto for a large refactor inflated the prompt context rapidly via diffs/stdout. Logs show it hit ~198k tokens (near o4-mini's 200k limit).

Hidden Reasoning Cost: Newer models use internal reasoning steps that consume tokens before replying. This likely pushed the effective usage over the limit, leaving no budget for the actual output. (Consistent with reports of ~6-8k soft limits for complex tasks.)

Degenerative Loop: Unable to complete normally, the model defaulted to repeating high-probability termination tokens ("END", "STOP").

Hallucinations: The dramatic phrases ("My brain is broken," etc.) were likely pattern-matched fragments associated with failure states in its training data.
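A quick way to sanity-check the "massive prompt" leg of this is to count tokens the way OpenAI's tokenizers do. A minimal sketch using the tiktoken library, assuming the o200k_base encoding that recent OpenAI models use; the session_context.txt file standing in for the accumulated diffs/stdout is hypothetical:

```python
import tiktoken

# Assumption: o200k_base is the right encoding for this model family;
# it is the encoding recent OpenAI models use.
enc = tiktoken.get_encoding("o200k_base")

# Hypothetical stand-in for the diffs/stdout that --full-auto
# kept folding back into the conversation context.
with open("session_context.txt") as f:
    context = f.read()

n_tokens = len(enc.encode(context))
print(f"{n_tokens:,} tokens of 200,000 ({n_tokens / 200_000:.1%} of the window)")
```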
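The hidden-reasoning point is really just budget arithmetic. A minimal sketch using the post's own figures; the reasoning-token number is an illustrative assumption, since actual reasoning spend isn't visible in the logs:

```python
# Figures from the post: a 200k context window, ~198k of it
# already consumed by the prompt.
CONTEXT_WINDOW = 200_000
prompt_tokens = 198_000

# Assumption for illustration: hidden reasoning burns a few
# thousand tokens before any visible output is emitted.
reasoning_tokens = 4_000

output_budget = CONTEXT_WINDOW - prompt_tokens - reasoning_tokens
print(f"Budget left for the visible reply: {output_budget} tokens")
# -> -2000: over the limit before the reply even starts, which
# matches the "no budget for the actual output" failure mode.
```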
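As for the degenerative loop, a toy simulation shows why a collapsed output distribution keeps emitting the same terminators. This is a cartoon of the failure mode, not the real decoder, and the probabilities are invented:

```python
import random

# Invented failure-state distribution: most of the probability
# mass sits on termination-like tokens.
tokens = ["END", "STOP", ".", "Good night"]
weights = [0.45, 0.40, 0.10, 0.05]

# Sampling from this collapsed distribution loops on END/STOP,
# much like the repeated terminators in the gist.
print(" ".join(random.choices(tokens, weights=weights, k=25)))
```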