
Computer use agent with RL training for day trading?

1 point by iiTsEddy about 5 hours ago
We now know that RL can make models more capable on measurable tasks and is a new dimension of the scaling laws, but is anyone putting these capabilities to more meaningful use beyond Olympiad math problems or 2D game playing?

So far, pretty much all of the computer use agent demos I've seen revolve around some kind of instruction following (book this flight, clean my desktop, etc.). I wonder, is anyone working on putting them into active trading in financial markets and using P&L as a reward / loss function? Or maybe title-agnostic video game playing optimized for Elo, rank, or win rate?

It feels like context length would eventually go from millions of tokens to days or months of an agent's "life span"; inference cost would eventually come down to the time cost of a GPU server, since hybrid models (Mamba + attention) with linear time complexity can perform like regular transformers (whose inference is quadratic). What are the other major technical challenges here?

I think a meaningful metric is crucial, and I took a lot of inspiration from this startup, Chai.ai, a competitor to Character.ai. I went to one of their events and got the sense that they are essentially optimizing chat LLMs for monthly user retention and subscription. Their small team hit 30M ARR with 1.8M DAU, and it happened over the last year or so. Combined with my own experience working at a startup, it seems like the right metric is the money shot.

Am I missing anything fundamental? Is anybody working on this (or interested)?
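To make the "P&L as a reward" idea concrete, here is a minimal sketch of one way a per-step reward could be computed from a price series and the agent's chosen positions. Everything in it is an assumption for illustration: the function name `pnl_rewards`, the `fee_per_unit` parameter, the flat starting position, and the simple mark-to-market accounting are all hypothetical, and a real setup would also have to deal with slippage, market impact, and risk adjustment.

```python
from typing import List, Sequence


def pnl_rewards(
    prices: Sequence[float],
    positions: Sequence[float],
    fee_per_unit: float = 0.0,
) -> List[float]:
    """Per-step reward = mark-to-market P&L minus trading fees.

    positions[t] is the position (in units, negative = short) held
    from prices[t] to prices[t + 1]. The agent is assumed to start flat.
    """
    if len(positions) != len(prices) - 1:
        raise ValueError("need exactly one position per price interval")

    rewards: List[float] = []
    previous_position = 0.0  # assumption: no holdings before the first step
    for t, position in enumerate(positions):
        # Gain or loss from holding the position over one interval.
        mark_to_market = position * (prices[t + 1] - prices[t])
        # Hypothetical fee proportional to the size of the position change.
        fees = fee_per_unit * abs(position - previous_position)
        rewards.append(mark_to_market - fees)
        previous_position = position
    return rewards


# Long 1 unit for the first interval, then flip to short 1 unit.
example = pnl_rewards([100.0, 102.0, 101.0], [1.0, -1.0], fee_per_unit=0.5)
print(example, sum(example))
```

The sum of these rewards over an episode is the net P&L, which is the quantity an RL agent would be trained to maximize under this framing.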

1 comment

falcor84 about 5 hours ago
I'm pretty certain that the likes of Jane Street and SIG aren't leaving any low-hanging-fruit signals for us mortals to benefit from.