Finite time analysis of potential-based reward shaping
by Zhongtian Dai and Matthew R. Walter
Reference:
Zhongtian Dai and Matthew R. Walter. Finite time analysis of potential-based reward shaping. In Proceedings of the Multi-Disciplinary Conference on Reinforcement Learning and Decision Making (RLDM), 2019.
BibTeX:
@string{rldm="Proceedings of the Multi-Disciplinary Conference on Reinforcement Learning and Decision Making (RLDM)"}
@inproceedings{dai19,
  author = {Zhongtian Dai and Matthew R. Walter},
  title = {Finite time analysis of potential-based reward shaping},
  booktitle = rldm,
  year = {2019},
  month = {7},
  address = {Montréal, Canada},
}