Shi, Chengchun, Uehara, Masatoshi, Huang, Jiawei and Jiang, Nan (2022) A minimax learning approach to off-policy evaluation in confounded Partially Observable Markov Decision Processes. In: Proceedings of the 39th International Conference on Machine Learning. International Conference on Machine Learning. (In Press)