I was very impressed with Anthropic's paper on concept mapping.

Post: https://www.anthropic.com/news/mapping-mind-language-model

Paper: https://transformer-circuits.pub/2024/scaling-monosemanticity/index.html

This seems like a very good starting point for alignment. One could almost see a pathway from here to something like the Laws of Robotics. There's a long way to go, but it's a good first step.