
Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning

TAAC: Temporally Abstract Actor-Critic for Continuous Control

An Empowerment-based Solution to Robotic Manipulation Tasks with Sparse Rewards
Generative Particle Variational Inference via Estimation of Functional Gradients
Siamese Natural Language Tracker: Tracking by Natural Language Descriptions with Siamese Trackers
Mutual Information State Intrinsic Control

Finite-Sample Regret Bound for Distributionally Robust Offline Tabular Reinforcement Learning
