
Pattern transfer learning for reinforcement learning in order dispatching

Wan, Runzhe, Zhang, Sheng, Shi, Chengchun ORCID: 0000-0001-7773-2099, Luo, Shikai and Song, Rui (2021) Pattern transfer learning for reinforcement learning in order dispatching. In: International Joint Conference on Artificial Intelligence, 2021-08-19 - 2021-08-26. (In Press)

Full text: Accepted Version (Shi_pattern-transfer-learning-for-reinforcement-learning--accepted), 959kB

Abstract

Order dispatch is one of the central problems for ridesharing platforms. Recently, value-based reinforcement learning algorithms have shown promising performance on this task. However, in real-world applications, the demand-supply system is typically nonstationary over time, which makes it difficult to reuse data generated in different time periods when learning the value function. In this work, motivated by the fact that the relative relationship between the values of some states is largely stable across environments, we propose a pattern transfer learning framework for value-based reinforcement learning in the order dispatch problem. Our method efficiently captures these value patterns by incorporating a concordance penalty. Experiments support the superior performance of the proposed method.
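
The idea of penalizing disagreement in the relative ordering of state values can be illustrated with a small sketch. The Python below is not the authors' formulation: the function names `concordance_penalty` and `penalized_loss`, the pairwise discordance count, and the `weight` parameter are all assumptions made for illustration. It only shows how a penalty might grow when newly learned values reverse the ordering seen in values from an earlier period.

```python
from itertools import combinations


def concordance_penalty(old_values, new_values):
    """Fraction of state pairs whose value ordering is reversed.

    Illustrative assumption: old_values[i] and new_values[i] are value
    estimates for the same state i, learned in two different periods.
    """
    if len(old_values) != len(new_values):
        raise ValueError("both value lists must cover the same states")
    pairs = list(combinations(range(len(old_values)), 2))
    if not pairs:
        return 0.0
    discordant = 0
    for i, j in pairs:
        old_diff = old_values[i] - old_values[j]
        new_diff = new_values[i] - new_values[j]
        # A negative product means the two estimates rank states i and j
        # in opposite orders; ties contribute nothing.
        if old_diff * new_diff < 0:
            discordant += 1
    return discordant / len(pairs)


def penalized_loss(td_loss, old_values, new_values, weight=1.0):
    """Hypothetical objective: a fitting loss plus a weighted penalty."""
    return td_loss + weight * concordance_penalty(old_values, new_values)
```

For example, `concordance_penalty([1, 2, 3], [10, 20, 30])` is zero because the ordering is preserved even though the scale changed, which matches the abstract's point that relative relationships, not absolute values, are what stay stable.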

Item Type: Conference or Workshop Item (Paper)
Official URL: https://ijcai-21.org
Additional Information: © 2021 The Authors
Divisions: Statistics
Subjects: H Social Sciences > HA Statistics; Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Date Deposited: 24 Jun 2021 07:54
Last Modified: 20 Dec 2024 01:00
URI: http://eprints.lse.ac.uk/id/eprint/110919
