![]() | Up a level |
Uehara, Masatoshi, Kiyohara, Haruka, Bennett, Andrew, Chernozhukov, Victor, Jiang, Nan, Kallus, Nathan, Shi, Chengchun ORCID: 0000-0001-7773-2099 and Sun, Wenguang
(2023)
Future-dependent value-based off-policy evaluation in POMDPs.
In: Oh, A., Naumann, T., Globerson, A., Saenko, K., Hardt, M. and Levine, S., (eds.)
Advances in Neural Information Processing Systems 36 (NeurIPS 2023).
Neural Information Processing Systems Foundation.
Shi, Chengchun ORCID: 0000-0001-7773-2099, Wan, Runzhe, Chernozhukov, Victor and Song, Rui
(2021)
Deeply-debiased off-policy interval estimation.
In: International Conference on Machine Learning, 2021-07-18 - 2021-07-24, Online.
(In Press)