Shi, Chengchun (ORCID: 0000-0001-7773-2099), Luo, Shikai, Le, Yuan, Zhu, Hongtu and Song, Rui (2022) Statistically efficient advantage learning for offline reinforcement learning in infinite horizons. Journal of the American Statistical Association. ISSN 0162-1459