Recent news...- Three papers got accepted at ICML-2010. "Bayesian Multi-Task Reinforcement Learning" with Alessandro Lazaric, "Analysis of a Classification-based Policy Iteration Algorithm" and "Finite-Sample Analysis of LSTD" with Alessandro Lazaric & Rémi Munos.
- Our
paper (with Shalabh Bhatnagar and Rich Sutton) on "Natural Actor-Critic
Algorithms" was published at Automatica. Its longer version is available as a UAlberta Tech-Report [pdf].
In this paper, we present four new actor-critic algorithms (three using
natural gradient and one using regular gradient) with convergence proofs.
|