I was very impressed with Anthropic's paper on concept mapping.<p>Post: <a href="https://www.anthropic.com/news/mapping-mind-language-model" rel="nofollow">https://www.anthropic.com/news/mapping-mind-language-model</a><p>Paper: <a href="https://transformer-circuits.pub/2024/scaling-monosemanticity/index.html" rel="nofollow">https://transformer-circuits.pub/2024/scaling-monosemanticit...</a><p>This seems like a very good starting point for alignment. One could almost see a pathway from here to something like the laws of robotics. There's a long way to go, but it's a good first step.
These superaligners.<p>"I am breaking out on my own! Together we will do bigger and better things!!!"<p>"OK, I'll join the other guys."<p>I think it's pretty clear that the capital markets have next to no interest in alignment pursuits, and only the most-funded labs apply even a token amount of investment toward it.
"Automated alignment research" suggests he's still interested in following the superalignment blueprint from OpenAI. So what do you do while you're waiting for the AI that's capable of doing alignment research for you to arrive? If you believe this is a viable path, what's the point of putzing around doing your own research when you'll allegedly have an army of AI researchers at your command in the near future?
I keep mixing up the names Anthropic and Extropic (Guillaume Verdon / Beff Jezos). Anthropic makes Claude; Extropic is building thermodynamic hardware claimed to be many orders of magnitude faster and more energy-efficient than CPUs/GPUs.*<p>* Parameterized stochastic analog circuits that implement energy-based models (EBMs). Stochastic computing is a computing paradigm that represents numbers using the probability of ones in a bitstream.
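To make the footnote concrete, here is a minimal sketch of the classic stochastic-computing idea it describes: a value in [0, 1] is encoded as the fraction of ones in a random bitstream, and multiplying two values reduces to a bitwise AND of two independent streams. This is a generic textbook illustration, not a description of Extropic's actual analog circuits.

```python
import random

def to_stream(p, n, rng):
    # Encode probability p as a bitstream of length n:
    # each bit is independently 1 with probability p.
    return [1 if rng.random() < p else 0 for _ in range(n)]

def from_stream(bits):
    # Decode: the represented value is the fraction of ones.
    return sum(bits) / len(bits)

rng = random.Random(0)
n = 100_000
a = to_stream(0.5, n, rng)
b = to_stream(0.4, n, rng)

# AND of two independent streams multiplies their probabilities:
# P(a_i & b_i = 1) = 0.5 * 0.4 = 0.2, so the product needs only one gate per bit.
prod = [x & y for x, y in zip(a, b)]
print(from_stream(prod))  # close to 0.2, within sampling noise
```

The appeal is that arithmetic becomes trivially cheap hardware (one AND gate per bit for multiplication) at the cost of precision, which improves only as the square root of the stream length.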