22 点作者 ankeshanand超过 3 年前

3 条评论

visarga超过 3 年前

The fruits of massive language modeling are coming to RL. I envision such foundation models becoming cheap and standardized, like an AI operating system. If we could have a cheap, compact, multi-modal GPT-3 chip we could make all sorts of agents run on top. These RL agents would be like the libraries of skills in Matrix, you can load any skill you want on the player.

YetAnotherNick超过 3 年前

In India without VPN:<p>"The website has been blocked as per order of Ministry of Electronics and Information Technology under IT Act, 2000."

评论 #29911728 未加载

armanboyaci超过 3 年前

> Other learning paradigms are about minimization; reinforcement learning is about maximization.<p>I don't see why this is important.

评论 #29911884 未加载

评论 #29904483 未加载

评论 #29911742 未加载

Reinforcement Learning as a fine-tuning paradigm

3 条评论

Reinforcement Learning as a fine-tuning paradigm

3 条评论