Model-Based Reinforcement Learning via Latent-Space Collocation

Our planner, LatCo, solves multi-stage long-horizon tasks much harder than those considered previously. By optimizing a sequence of future latent states instead of optimizing actions directly, it quickly discovers the high-reward region to create effective plans.


© 2021 Chuning Zhu. All rights reserved.

Powered by Jekyll and Hydejack. Hosted on Eniac.