TechEcho

Jan Leike joins Anthropic on their superalignment team

99 points by icpmacdo 12 months ago

7 comments

Lerc 12 months ago
I was very impressed with Anthropic's paper on concept mapping.

Post: https://www.anthropic.com/news/mapping-mind-language-model

Paper: https://transformer-circuits.pub/2024/scaling-monosemanticity/index.html

This seems like a very good starting point for alignment. One could almost see a pathway to making something like the laws of robotics from here. It's a long way to go, but a good first step.
mvkel 12 months ago
These superaligners.

"I am breaking out on my own! Together we will do bigger and better things!!!"

"Ok, I'll join the other guys."

I think it's pretty clear that the capital markets have next to no interest in alignment pursuits, and only the best-funded labs put even a token amount of investment towards it.
whimsicalism 12 months ago
@dang - I find topics like these quite interesting. Are they downweighted for being AI-related (or is Twitter?), or are they just being flagged a lot?
Imnimo 12 months ago
"Automated alignment research" suggests he's still interested in following the superalignment blueprint from OpenAI. So what do you do while you're waiting for the AI that's capable of doing alignment research for you to arrive? If you believe this is a viable path, what's the point of putzing around doing your own research when you'll allegedly have an army of AI researchers at your command in the near future?
smountjoy 12 months ago
"Superalignment" is (was?) OpenAI's term, so it might be more accurate to say he is joining Anthropic to work on alignment.
htrp 12 months ago
It's also completely theoretical, until it isn't (ref paperclip maximizers).
andrewfromx 12 months ago
I keep getting the names Anthropic and Extropic (Guillaume Verdon / Beff Jezos) mixed up. Anthropic is Claude, and Extropic is thermodynamic hardware many orders of magnitude faster and more energy efficient than CPUs/GPUs.*

* Parameterized stochastic analog circuits that implement energy-based models (EBMs). Stochastic computing is a computing paradigm that represents numbers using the probability of ones in a bitstream.
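
For anyone unfamiliar with the bitstream idea in that footnote, here is a minimal Python sketch of textbook stochastic computing. It is only an illustration of the general paradigm, not a description of how Extropic's hardware works, and the function names (`encode`, `decode`, `multiply`) are made up for this example. A value in [0, 1] is stored as the fraction of ones in a random bitstream, and multiplying two independent streams reduces to a bitwise AND.

```python
import random

def encode(p, n, rng):
    """Return n random bits, each equal to 1 with probability p."""
    return [1 if rng.random() < p else 0 for _ in range(n)]

def decode(bits):
    """Estimate the encoded value as the fraction of ones in the stream."""
    return sum(bits) / len(bits)

def multiply(a_bits, b_bits):
    """AND two independent streams; the result encodes the product."""
    return [a & b for a, b in zip(a_bits, b_bits)]

rng = random.Random(0)   # fixed seed so the run is repeatable
n = 100_000              # longer streams give more precise estimates
a = encode(0.5, n, rng)
b = encode(0.4, n, rng)

# The decoded product should land close to 0.5 * 0.4 = 0.2.
print(decode(a), decode(b), decode(multiply(a, b)))
```

The trade-off is precision against stream length: the estimate is noisy, and the error shrinks only with the square root of the number of bits.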