Up a level |
Wang, Xueqin, Zhu, Jin ORCID: 0000-0001-8550-5822, Pan, Wenliang, Zhu, Junhao and Zhang, Heping (2023) Nonparametric statistical inference via metric distribution function in metric spaces. Journal of the American Statistical Association. ISSN 0162-1459
Xu, Yang, Zhu, Jin, Shi, Chengchun ORCID: 0000-0001-7773-2099, Luo, Shikai and Song, Rui (2023) An instrumental variable approach to confounded off-policy evaluation. Proceedings of Machine Learning Research, 202. 38848 - 38880. ISSN 1938-7228
Shi, Chengchun ORCID: 0000-0001-7773-2099, Zhu, Jin, Shen, Ye, Luo, Shikai, Zhu, Hongtu and Song, Rui (2022) Off-policy confidence interval estimation with confounded Markov decision process. Journal of the American Statistical Association. ISSN 0162-1459
Zhu, Jin ORCID: 0000-0001-8550-5822, Wan, Runzhe, Qi, Zhengling, Luo, Shikai and Shi, Chengchun ORCID: 0000-0001-7773-2099 (2024) Robust offline reinforcement learning with heavy-tailed rewards. In: Dasgupta, Sanjoy, Mandt, Stephan and Li, Yingzhen, (eds.) Proceedings of the 27th International Conference on Artificial Intelligence and Statistics (AISTATS) 2024. International Conference on Machine Learning, Valencia, Spain, 541 - 549.