Cookies?
Library Header Image
LSE Research Online LSE Library Services

Items where Author is "Yang, Xuzhi"

Up a level
Export as [feed] Atom [feed] RSS 1.0 [feed] RSS 2.0
Group by: Item Type | No Grouping
Number of items: 1.

Ma, Tao ORCID: 0000-0002-8062-9217, Yang, Xuzhi and Szabo, Zoltan (2024) To switch or not to switch? Balanced policy switching in offline reinforcement learning. . arXiv. (Submitted)

This list was generated on Wed Jul 17 20:28:44 2024 BST.