Shi, Chengchun ORCID: 0000-0001-7773-2099, Luo, Shikai, Le, Yuan, Zhu, Hongtu and Song, Rui (2022) Statistically efficient advantage learning for offline reinforcement learning in infinite horizons. Journal of the American Statistical Association. ISSN 0162-1459