Luo, Shikai, Yang, Ying, Shi, Chengchun ORCID: 0000-0001-7773-2099, Yao, Fang, Ye, Jieping and Zhu, Hongtu
(2024)
Policy evaluation for temporal and/or spatial dependent experiments.
Journal of the Royal Statistical Society. Series B: Statistical Methodology, 86 (3).
623 - 649.
ISSN 1369-7412
Zhu, Jin ORCID: 0000-0001-8550-5822, Wan, Runzhe, Qi, Zhengling, Luo, Shikai and Shi, Chengchun ORCID: 0000-0001-7773-2099
(2024)
Robust offline reinforcement learning with heavy-tailed rewards.
Proceedings of Machine Learning Research, 238.
541 - 549.
ISSN 2640-3498
Shi, Chengchun ORCID: 0000-0001-7773-2099, Wan, Runzhe, Song, Ge, Luo, Shikai, Zhu, Hongtu and Song, Rui
(2023)
A multiagent reinforcement learning framework for off-policy evaluation in two-sided markets.
Annals of Applied Statistics, 17 (4).
2701 - 2722.
ISSN 1932-6157
Wu, Guojun, Song, Ge, Lv, Xiaoxiang, Luo, Shikai, Shi, Chengchun ORCID: 0000-0001-7773-2099 and Zhu, Hongtu
(2023)
DNet: distributional network for distributional individualized treatment effects.
Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023.
5215 - 5224.
ISSN 2154-817X
Xu, Yang, Zhu, Jin, Shi, Chengchun ORCID: 0000-0001-7773-2099, Luo, Shikai and Song, Rui
(2023)
An instrumental variable approach to confounded off-policy evaluation.
Proceedings of Machine Learning Research, 202.
38848 - 38880.
ISSN 1938-7228
Zhang, Yingying, Shi, Chengchun ORCID: 0000-0001-7773-2099 and Luo, Shikai
(2023)
Conformal off-policy prediction.
Proceedings of Machine Learning Research, 206.
2751 - 2768.
ISSN 2640-3498
Shi, Chengchun ORCID: 0000-0001-7773-2099, Zhu, Jin, Shen, Ye, Luo, Shikai, Zhu, Hongtu and Song, Rui
(2022)
Off-policy confidence interval estimation with confounded Markov decision process.
Journal of the American Statistical Association.
ISSN 0162-1459
Shi, Chengchun ORCID: 0000-0001-7773-2099, Luo, Shikai, Le, Yuan, Zhu, Hongtu and Song, Rui
(2022)
Statistically efficient advantage learning for offline reinforcement learning in infinite horizons.
Journal of the American Statistical Association.
ISSN 0162-1459
Shi, Chengchun ORCID: 0000-0001-7773-2099, Wang, Xiaoyu, Luo, Shikai, Zhu, Hongtu, Ye, Jieping and Song, Rui
(2022)
Dynamic causal effects evaluation in A/B testing with a reinforcement learning framework.
Journal of the American Statistical Association.
1 - 13.
ISSN 0162-1459
Shi, Chengchun ORCID: 0000-0001-7773-2099, Luo, Shikai, Zhu, Hongtu and Song, Rui
(2021)
An online sequential test for qualitative treatment effects.
Journal of Machine Learning Research, 22.
ISSN 1532-4435
Wan, Runzhe, Zhang, Sheng, Shi, Chengchun ORCID: 0000-0001-7773-2099, Luo, Shikai and Song, Rui
(2021)
Pattern transfer learning for reinforcement learning in order dispatching.
In: International Joint Conference on Artificial Intelligence, 2021-08-19 - 2021-08-26.
(In Press)