Publications

(2020). Inverse Policy Evaluation for Value-based Sequential Decision-making. Preprint.

PDF

(2020). Training Recurrent Neural Networks Online by Learning Explicit State Variables. In ICLR 2020.

PDF

(2020). Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning. In AAAI 2020.

PDF