Mohammad Ghavamzadeh                  Google Scholar



  1. Bullet    Four papers on “Active Learning for Accurate Estimation of Linear Models”, “Bottleneck Conditional Density Estimation”, “Diffusion Independent Semi-Bandit Influence Maximization”, and “Online Learning to Rank in Stochastic Click Models” got accepted at ICML-2017.

  1. Bullet    I will serve as a senior program committee member for AAAI-2018.

  1. Bullet    I will serve as an area chair for NIPS-2017. 

  1. Bullet    Together with Marek Petrik, we gave a tutorial on “Risk-averse Decision-making and Control” at AAAI-2017. (tutorial website)

  1. Bullet    Our paper on “Sequential Decision-making with Coherent Risk” will be published at IEEE Transaction on Automatic Control (TAC).

  1. Bullet    Three conference papers got accepted: a paper on “Sequential Multiple Hypothesis Testing with Type I Error Control” at AISTATS-2017 and two papers on “Predictive Off-Policy Evaluation for Nonstationary Decision Problems” and on “Automated Data Cleansing through Meta-Learning” at IAAI-2017.


  1. Bullet    Four journal papers published: “Analysis of Classification-based Policy Iteration Algorithms”, “Bayesian Policy Gradient and Actor-Critic Algorithms”, and “Regularized Policy Iteration for Non-Parametric Function Spaces” at Journal of Machine Learning Research (JMLR), and “Variance-constrained Actor-Critic Algorithms for Discounted and Average Reward MDPs” at Machine Learning Journal (MLJ).

  1. Bullet    Four conference papers published: “Safe Policy Improvement by Minimizing Robust Baseline Regret” at NIPS-2016, “Improved Learning Complexity in Combinatorial Pure Exploration Bandits” at AISTATS-2016, “Proximal Gradient Temporal Difference Learning Algorithms” at the sister conference best paper track at IJCAI-2016, and “Graphical Model Sketch” at ECML-2016. 

  1. Bullet    I gave an invited talk at the 13th European Workshop on Reinforcement Learning (EWRL) in Barcelona on December 3-4, 2016.


  1. Bullet    Three journal papers published: “Approximate Modified Policy Iteration and its Application to the Game of Tetris” at Journal of Machine Learning Research (JMLR), “Classification-based Approximate Policy Iteration” at IEEE Transactions on Automatic Control (TAC), and “Bayesian Reinforcement Learning: A Survey” at Foundation and Trends in Machine Learning. 

  1. Bullet    Five conference papers published: “High Confidence Off-Policy Evaluation” at AAAI-2015, “Maximum Entropy Semi-Supervised Inverse Reinforcement Learning” at IJCAI-2015, “Building Personalized Ad Recommendation Systems for Life-Time Value Optimization with Guarantees” at IJCAI-2015, “High Confidence Policy Improvement” at ICML-2015, and “Policy Gradient for Coherent Risk Measures” at NIPS-2015.

  1. Bullet    Our paper entitled “Finite-Sample Analysis of Proximal Gradient TD Algorithms” won the Facebook best student paper award at UAI-2015

  1. Bullet    I co-chaired two workshops: 12th European Workshop on Reinforcement Learning (EWRL-12) as a workshop at ICML-2015 and “Machine Learning in eCommerce” at NIPS-2015.

  1. Bullet    My student, Victor Gabillon, won the AFIA (French Association for Artificial Intelligence) prize for the 2nd best Ph.D. thesis (completed in 2014) on artificial intelligence in France.


  1. Bullet    A paper published: “Algorithms for CVaR Optimization in MDPs” at NIPS-2014.

  1. Bullet    I co-chaired three workshops: “Sequential Decision-Making with Big Data” at AAAI-2014, Customers Value Optimization in Digital Marketing” at ICML-2014, and Large-scale Reinforcement Learning and Markov Decision Problems” at NIPS-2014.

  1. Bullet    I successfully defended my “Habilitation à Diriger des Recherches” (HDR) thesis and graduated my Ph.D. student Victor Gabillon in June 2014. Victor will be a postdoc with Prof. Peter Bartlett at UC Berkeley starting October 2014.

  1. Bullet    I served as an area chair for NIPS-2014.

Professional Experience

Senior Analytics Researcher (on leave from INRIA)

Adobe Research, San Jose, CA  (Oct. 2013 - present)

Habilitation à Diriger des Recherches (HDR)

Université Lille 1, France  (June 2014)

Chargé de Recherche  CR1 (on leave at Adobe)

INRIA Lille - Team SequeL, France  (2010 - Oct. 2013)

Chargé de recherche  CR2

INRIA Lille - Team SequeL, France  (2008 - 2010)

postdoctoral Fellow

University of Alberta, Canada  (2005 - 2008)

Ph.D. in Computer Science

University of Massachusetts Amherst, USA  (2001 - 2005)

Research Interests

Machine Learning

Artificial Intelligence


Reinforcement Learning

Contact Information


INRIA Lille - Nord Europe, Team SequeL

Parc Scientifique de la Haute-Borne

40 Avenue Halley

59650 Villeneuve d’Ascq, France


#7 INRIA Lille Building - Ground Floor


(+33 3) 59 57 78 21


(+33 3) 59 57 78 50


mohammad dot ghavamzadeh at inria dot fr


Recent News ...