Behnamnia, Armin, Aminian, Gholamali, Aghaei, Alireza, Shi, Chengchun (ORCID: 0000-0001-7773-2099), Tan, Vincent Y. F. and Rabiee, Hamid R. (2025) Log-sum-exponential estimator for off-policy evaluation and learning. In: Proceedings of the 42nd International Conference on Machine Learning. ACM Press. (In Press)
Wu, Xiangkun, Li, Ting, Aminian, Gholamali, Behnamnia, Armin, Rabiee, Hamid R. and Shi, Chengchun (ORCID: 0000-0001-7773-2099) (2025) Pessimistic data integration for policy evaluation. In: 39th Conference on Neural Information Processing Systems, 2025-11-30 - 2025-12-07. (In Press)