Behnamnia, Armin, Aminian, Gholamali, Aghaei, Alireza, Shi, Chengchun (ORCID: 0000-0001-7773-2099),
Tan, Vincent Y. F. and Rabiee, Hamid R.
(2025)
Log-sum-exponential estimator for off-policy evaluation and learning.
Proceedings of Machine Learning Research, 267.
ISSN 2640-3498
(In Press)