科技回声

9 条评论

> four key cognitive behaviors -- verification, backtracking, subgoal setting, and backward chaining -- that both expert human problem solvers and successful language models employ.As we make AI better, perhaps we'll inadvertently find ways to make HI (human intelligence) better too.I had a personal experience with this when I was studying for an exam recently. As I read over practice questions, I spoke aloud, replicating the reasoning methods/personality of Deepseek R1. By spending a lot of time reading long verbose R1 outputs, I've essentially fine-tuned my brain for reasoning tasks. I believe this method contributed to my excellent score on that exam.

评论 #43276249 未加载

评论 #43279118 未加载

评论 #43277027 未加载

评论 #43284660 未加载

评论 #43284851 未加载

评论 #43287331 未加载

评论 #43281974 未加载

评论 #43277504 未加载

评论 #43300685 未加载

评论 #43278823 未加载

评论 #43281732 未加载

评论 #43287173 未加载

meindnoch2 个月前

At this point I can't tell from the title whether it's a self-help psychology fad or an LLM paper.

评论 #43278554 未加载

robocat2 个月前

How much has our knowledge of AI training techniques helped to discover how to train people to think better?

评论 #43277360 未加载

评论 #43278053 未加载

评论 #43276705 未加载

nickpsecurity2 个月前

"models primed with incorrect solutions containing proper reasoning patterns achieve comparable performance to those trained on correct solutions"One of the parts most worth a replication study.

idiotsecant2 个月前

I sometimes see these reddit threads of people talking about the experience of having an internal monologue. I have no such monologue, at least not one that is accessible to the part of my mind that calls itself 'me', but I have often wondered if that monologue is something like a 'chain of thought'. I feel like maybe without access to that 'idea feed' maybe my planning and executive functioning is less effective than some other people. I do find myself quite more effective with those sort of tasks when I do a little 'chain of thought' notepad.I also suspect I spend less time ruminating and second-guessing myself and other anxious behaviours that I imagine would come with having someone talking in your ear all day, but that's probably off topic.

评论 #43277741 未加载

评论 #43277155 未加载

评论 #43281285 未加载

评论 #43287959 未加载

评论 #43277391 未加载

spwa42 个月前

True, but a problem is that self-improving AI leads to a somewhat troubling mode of thinking. AIs switch to an internal babbling type language that makes no sense but clearly still conveys meaning to the AIs, then think in that language (if it's a language, though not sure what else it could be) and then produce correct results.Worse, when you use multiple agents to get AI LLMs talking to one another, all AI agents switch to this internal language and they make progress despite no human understanding what hell is happening. This seems very bad.Illustration:> How many r in strawberry?I'm asked how many r in strawberry. I can just spell the word and a;dklsjaw; a;ewjraqwpeouypaads;lq qepwiouryaqeopw qewrpoiuyoiauysdqw145124rfa.nkjlwh ;45a8345a894ya4a q4p58q45jaq;lkjas;dlfkja;j<answer>There are 3 (three) r's in strawberry</answer>

评论 #43281612 未加载

评论 #43281255 未加载

miksik2 个月前

评论 #43284236 未加载

glass_door2 个月前

Does this also mean giving better system prompts that encourage this behaviour also substantially help?

评论 #43284748 未加载

kittikitti2 个月前

``think''In the abstract they use different characters for double quotes here.

评论 #43287948 未加载

9 条评论

owenpalmer2 个月前

评论 #43276249 未加载

评论 #43279118 未加载

评论 #43277027 未加载

评论 #43284660 未加载

评论 #43284851 未加载

评论 #43287331 未加载

评论 #43281974 未加载

评论 #43277504 未加载

评论 #43300685 未加载

评论 #43278823 未加载

评论 #43281732 未加载

评论 #43287173 未加载

meindnoch2 个月前

At this point I can't tell from the title whether it's a self-help psychology fad or an LLM paper.

评论 #43278554 未加载

robocat2 个月前

How much has our knowledge of AI training techniques helped to discover how to train people to think better?

评论 #43277360 未加载

评论 #43278053 未加载

评论 #43276705 未加载

nickpsecurity2 个月前

"models primed with incorrect solutions containing proper reasoning patterns achieve comparable performance to those trained on correct solutions"One of the parts most worth a replication study.

idiotsecant2 个月前

评论 #43277741 未加载

评论 #43277155 未加载

评论 #43281285 未加载

评论 #43287959 未加载

评论 #43277391 未加载

spwa42 个月前

评论 #43281612 未加载

评论 #43281255 未加载

miksik2 个月前

评论 #43284236 未加载

glass_door2 个月前

Does this also mean giving better system prompts that encourage this behaviour also substantially help?

评论 #43284748 未加载

kittikitti2 个月前

``think''In the abstract they use different characters for double quotes here.

评论 #43287948 未加载

Cognitive Behaviors That Enable Self-Improving Reasoners

9 条评论

Cognitive Behaviors That Enable Self-Improving Reasoners

9 条评论