Cookies?
Library Header Image
LSE Research Online LSE Library Services

Items where Author is "Qi, Zhengling"

Up a level
Export as [feed] Atom [feed] RSS 1.0 [feed] RSS 2.0
Group by: Item Type | No Grouping
Number of items: 5.

Bian, Zeyu, Shi, Chengchun ORCID: 0000-0001-7773-2099, Qi, Zhengling and Wang, Lan (2024) Off-policy evaluation in doubly inhomogeneous environments. Journal of the American Statistical Association. ISSN 0162-1459

Yu, Shuguang, Fang, Shuxing, Peng, Ruixin, Qi, Zhengling, Zhou, Fan and Shi, Chengchun ORCID: 0000-0001-7773-2099 (2024) Two-way deconfounder for off-policy evaluation in causal reinforcement learning. In: 38th Annual Conference on Neural Information Processing Systems, 2024-12-10 - 2024-12-15, Vancouver Convention Center, Vancouver, Canada, CAN. (In Press)

Zhu, Jin ORCID: 0000-0001-8550-5822, Wan, Runzhe, Qi, Zhengling, Luo, Shikai and Shi, Chengchun ORCID: 0000-0001-7773-2099 (2024) Robust offline reinforcement learning with heavy-tailed rewards. In: Dasgupta, Sanjoy, Mandt, Stephan and Li, Yingzhen, (eds.) Proceedings of the 27th International Conference on Artificial Intelligence and Statistics (AISTATS) 2024. International Conference on Machine Learning, Valencia, Spain, 541 - 549.

Shi, Chengchun ORCID: 0000-0001-7773-2099, Qi, Zhengling, Wang, Jianing and Zhou, Fan (2023) Value enhancement of reinforcement learning via efficient and robust trust region optimization. Journal of the American Statistical Association. pp. 1-15. ISSN 0162-1459

Zhou, Yunzhe, Qi, Zhengling, Shi, Chengchun ORCID: 0000-0001-7773-2099 and Li, Lexin (2023) Optimizing pessimism in dynamic treatment regimes: a Bayesian learning approach. Proceedings of Machine Learning Research, 206. ISSN 1938-7228 (In Press)

This list was generated on Sat Dec 21 16:03:20 2024 GMT.