Up a level |
Uehara, Masatoshi, Kiyohara, Haruka, Bennett, Andrew, Chernozhukov, Victor, Jiang, Nan, Kallus, Nathan, Shi, Chengchun and Sun, Wenguang (2024) Future-dependent value-based off-policy evaluation in POMDPs. In: 37th Conference on Neural Information Processing Systems, 2023-12-10 - 2023-12-16, rnest N. Morial Convention Center , New Orleans, United States. (In Press)
Shi, Chengchun, Uehara, Masatoshi, Uehara, Masatoshi, Huang, Jiawei and Jiang, Nan (2022) A minimax learning approach to off-policy evaluation in confounded Partially Observable Markov Decision Processes. In: Proceedings of the 39th International Conference on Machine Learning. International Conference on Machine Learning. (In Press)