Up a level |
Luo, Shikai, Yang, Ying, Shi, Chengchun ORCID: 0000-0001-7773-2099, Yao, Fang, Ye, Jieping and Zhu, Hongtu (2024) Policy evaluation for temporal and/or spatial dependent experiments. Journal of the Royal Statistical Society. Series B: Statistical Methodology, 86 (3). 623 - 649. ISSN 1369-7412
Shi, Chengchun ORCID: 0000-0001-7773-2099, Wan, Runzhe, Song, Ge, Luo, Shikai, Zhu, Hongtu and Song, Rui (2023) A multiagent reinforcement learning framework for off-policy evaluation in two-sided markets. Annals of Applied Statistics, 17 (4). 2701 - 2722. ISSN 1932-6157
Wu, Guojun, Song, Ge, Lv, Xiaoxiang, Luo, Shikai, Shi, Chengchun ORCID: 0000-0001-7773-2099 and Zhu, Hongtu (2023) DNet: distributional network for distributional individualized treatment effects. Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 5215 - 5224. ISSN 2154-817X
Xu, Yang, Zhu, Jin, Shi, Chengchun ORCID: 0000-0001-7773-2099, Luo, Shikai and Song, Rui (2023) An instrumental variable approach to confounded off-policy evaluation. Proceedings of Machine Learning Research, 202. 38848 - 38880. ISSN 1938-7228
Zhang, Yingying, Shi, Chengchun ORCID: 0000-0001-7773-2099 and Luo, Shikai (2023) Conformal off-policy prediction. Proceedings of Machine Learning Research, 206. pp. 2751-2768. ISSN 2640-3498
Shi, Chengchun ORCID: 0000-0001-7773-2099, Zhu, Jin, Shen, Ye, Luo, Shikai, Zhu, Hongtu and Song, Rui (2022) Off-policy confidence interval estimation with confounded Markov decision process. Journal of the American Statistical Association. ISSN 0162-1459
Shi, Chengchun ORCID: 0000-0001-7773-2099, Luo, Shikai, Le, Yuan, Zhu, Hongtu and Song, Rui (2022) Statistically efficient advantage learning for offline reinforcement learning in infinite horizons. Journal of the American Statistical Association. ISSN 0162-1459
Shi, Chengchun ORCID: 0000-0001-7773-2099, Wang, Xiaoyu, Luo, Shikai, Zhu, Hongtu, Ye, Jieping and Song, Rui (2022) Dynamic causal effects evaluation in A/B testing with a reinforcement learning framework. Journal of the American Statistical Association. 1 - 13. ISSN 0162-1459
Shi, Chengchun ORCID: 0000-0001-7773-2099, Luo, Shikai, Zhu, Hongtu and Song, Rui (2021) An online sequential test for qualitative treatment effects. Journal of Machine Learning Research, 22. ISSN 1532-4435
Zhu, Jin ORCID: 0000-0001-8550-5822, Wan, Runzhe, Qi, Zhengling, Luo, Shikai and Shi, Chengchun ORCID: 0000-0001-7773-2099 (2024) Robust offline reinforcement learning with heavy-tailed rewards. In: Dasgupta, Sanjoy, Mandt, Stephan and Li, Yingzhen, (eds.) Proceedings of the 27th International Conference on Artificial Intelligence and Statistics (AISTATS) 2024. International Conference on Machine Learning, Valencia, Spain, 541 - 549.
Wan, Runzhe, Zhang, Sheng, Shi, Chengchun, Luo, Shikai and Song, Rui (2021) Pattern transfer learning for reinforcement learning in order dispatching. In: International Joint Conference on Artificial Intelligence, 2021-08-19 - 2021-08-26. (In Press)