Research分享投票

欢迎参与本次投票，现在我们就马上开始吧！

请给你希望听到的Research的分享投票（最多投5个）

Learning from human feedback

Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning

DayDreamer: World Models for Physical Robot Learning

Video Pretraining (VPT) learning to act by watching unlabeled online videos

Towards Applicable Reinforcement Learning: Improving the Generalization and Sample Efficiency with Policy Ensemble

MINEDOJO: Building Open-Ended Embodied Agents with Internet-Scale Knowledge

The Primacy Bias in Deep Reinforcement Learning

Decision Transformer and its variants

1题 | 被引用0次

模板修改

使用此模板创建