Library Header Image

Login

Items where Author is "Ma, Tao"

Group by: Item Type | No Grouping

Jump to: Monograph

Number of items: 1.

Monograph

Ma, Tao ORCID: 0000-0002-8062-9217, Yang, Xuzhi and Szabo, Zoltan ORCID: 0000-0001-6183-7603 (2024) To switch or not to switch? Balanced policy switching in offline reinforcement learning. . arXiv. (Submitted)

This list was generated on Thu May 29 09:33:29 2025 BST.

Mission Statement & FAQs | Contact us | Takedown Policy | Content Policy | LSE Research Online supports OAI 2.0 with a base URL of /cgi/oai2