Up a level |
Shi, Chengchun ORCID: 0000-0001-7773-2099, Uehara, Masatoshi, Uehara, Masatoshi, Huang, Jiawei and Jiang, Nan (2022) A minimax learning approach to off-policy evaluation in confounded Partially Observable Markov Decision Processes. Proceedings of Machine Learning Research. ISSN 2640-3498