It's not clear to me how this is interestingly different from model-based RL, where you learn the transition (dynamics) function and the reward function, and then use various forms of simulation against the learned model to learn a value function. I guess I'll have to read more than the abstract...
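For concreteness, the model-based recipe I have in mind can be sketched in a few lines. This is a toy tabular example of my own (the chain MDP, variable names, and deterministic setup are all made up for illustration, not anything from the paper): fit a model of dynamics and reward from experience, then learn a value function purely by simulating with that model.

```python
import random

# Toy chain MDP: states 0..4, action 0 moves left, action 1 moves right.
# Arriving at state 4 gives reward 1; everything else gives 0.
N_STATES, N_ACTIONS, GAMMA = 5, 2, 0.9

def true_step(s, a):
    """The real environment (only used for data collection)."""
    s2 = max(0, s - 1) if a == 0 else min(N_STATES - 1, s + 1)
    return s2, 1.0 if s2 == N_STATES - 1 else 0.0

# 1) Collect experience and fit a tabular model of dynamics and reward.
trans = {}   # (s, a) -> observed next state
rew = {}     # (s, a) -> observed reward
random.seed(0)
for _ in range(500):
    s, a = random.randrange(N_STATES), random.randrange(N_ACTIONS)
    s2, r = true_step(s, a)
    trans[(s, a)], rew[(s, a)] = s2, r  # deterministic, so one sample suffices

# 2) Learn a value function by simulation: value iteration run entirely
#    inside the learned model, never touching the real environment again.
V = [0.0] * N_STATES
for _ in range(100):
    V = [max(rew[(s, a)] + GAMMA * V[trans[(s, a)]] for a in range(N_ACTIONS))
         for s in range(N_STATES)]
```

In this sketch, `V` converges toward the optimal values of the true MDP because the learned tables happen to be exact; with function approximators for `trans` and `rew`, step 2 becomes rollouts or planning against an imperfect model, which is where the interesting differences between methods usually live.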