Behnamnia, Armin, Aminian, Gholamali, Aghaei, Alireza, Shi, Chengchun ORCID: 0000-0001-7773-2099, Tan, Vincent Y. F. and Rabiee, Hamid R.
(2025) Log-sum-exponential estimator for off-policy evaluation and learning. In: Proceedings of the 42nd International Conference on Machine Learning. ACM Press. (In Press)