Bian, Zeyu, Shi, Chengchun ORCID: 0000-0001-7773-2099, Qi, Zhengling and Wang, Lan
(2024)
Off-policy evaluation in doubly inhomogeneous environments.
Journal of the American Statistical Association.
ISSN 0162-1459
Yu, Shuguang, Fang, Shuxing, Peng, Ruixin, Qi, Zhengling, Zhou, Fan and Shi, Chengchun ORCID: 0000-0001-7773-2099
(2024)
Two-way deconfounder for off-policy evaluation in causal reinforcement learning.
In: 38th Annual Conference on Neural Information Processing Systems, 2024-12-10 - 2024-12-15, Vancouver Convention Center, Vancouver, Canada.
(In Press)
Zhu, Jin ORCID: 0000-0001-8550-5822, Wan, Runzhe, Qi, Zhengling, Luo, Shikai and Shi, Chengchun ORCID: 0000-0001-7773-2099
(2024)
Robust offline reinforcement learning with heavy-tailed rewards.
Proceedings of Machine Learning Research, 238.
pp. 541-549.
ISSN 2640-3498
Zhou, Yunzhe, Qi, Zhengling, Shi, Chengchun ORCID: 0000-0001-7773-2099 and Li, Lexin
(2023)
Optimizing pessimism in dynamic treatment regimes: a Bayesian learning approach.
Proceedings of Machine Learning Research, 206.
ISSN 1938-7228
Shi, Chengchun ORCID: 0000-0001-7773-2099, Qi, Zhengling, Wang, Jianing and Zhou, Fan
(2023)
Value enhancement of reinforcement learning via efficient and robust trust region optimization.
Journal of the American Statistical Association.
pp. 1-15.
ISSN 0162-1459