Up a level |
Shi, Chengchun, Uehara, Masatoshi, Uehara, Masatoshi, Huang, Jiawei and Jiang, Nan (2022) A minimax learning approach to off-policy evaluation in confounded Partially Observable Markov Decision Processes. In: Proceedings of the 39th International Conference on Machine Learning. International Conference on Machine Learning.
Uehara, Masatoshi, Kiyohara, Haruka, Bennett, Andrew, Chernozhukov, Victor, Jiang, Nan, Kallus, Nathan, Shi, Chengchun and Sun, Wenguang (2024) Future-dependent value-based off-policy evaluation in POMDPs. In: 37th Conference on Neural Information Processing Systems, 2023-12-10 - 2023-12-16, rnest N. Morial Convention Center , New Orleans, United States. (In Press)