Dispatch of autonomous vehicles for taxi services: A deep reinforcement learning approach

Mao, Chao; Liu, Yulin; Shen, Zuo Jun (Max)

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1016/j.trc.2020.102626
Scopus: eid_2-s2.0-85082970900
WOS: WOS:000535426900018
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
Appears in Collections:
- Industrial & Manufacturing Systems Engineering: Journal/Magazine Articles
- President's Office: Journal/Magazine Articles

Article: Dispatch of autonomous vehicles for taxi services: A deep reinforcement learning approach

Title	Dispatch of autonomous vehicles for taxi services: A deep reinforcement learning approach
Authors	Mao, Chao Liu, Yulin Shen, Zuo Jun (Max)
Keywords	Autonomous vehicles Deep reinforcement learning Actor-critic algorithm Demand rebalancing Ride sharing Policy-gradient Taxi dispatching
Issue Date	2020
Citation	Transportation Research Part C: Emerging Technologies, 2020, v. 115, article no. 102626 How to Cite? DOI: http://dx.doi.org/10.1016/j.trc.2020.102626
Abstract	© 2020 Elsevier Ltd In this paper, we define and investigate a novel model-free deep reinforcement learning framework to solve the taxi dispatch problem. The framework can be used to redistribute vehicles when the travel demand and taxi supply is either spatially or temporally imbalanced in a transportation network. While previous works mostly focus on using model-based methods, the goal of this paper is to explore the policy-based deep reinforcement learning algorithm as a model-free method to optimize the rebalancing strategy. In particular, we propose an actor-critic algorithm with feed-forward neural networks as approximations of both policy and value functions, where the policy function provides the optimal dispatch strategy and the value function estimates the expected costs at each time stamp. Our numerical studies show that the algorithm converges to the theoretical upper bound with less than 4% optimality gap, whether the system dynamics are deterministic or stochastic. We also investigate the scenario where we consider user priority and fairness, and the results indicate that our learned policy is capable of producing a superior strategy that balances equity, cancellation, and level of service when user priority is considered.
Persistent Identifier	http://hdl.handle.net/10722/296212
ISSN	0968-090X 2023 Impact Factor: 7.6 2023 SCImago Journal Rankings: 2.860
ISI Accession Number ID	WOS:000535426900018

DC Field	Value	Language
dc.contributor.author	Mao, Chao	-
dc.contributor.author	Liu, Yulin	-
dc.contributor.author	Shen, Zuo Jun (Max)	-
dc.date.accessioned	2021-02-11T04:53:04Z	-
dc.date.available	2021-02-11T04:53:04Z	-
dc.date.issued	2020	-
dc.identifier.citation	Transportation Research Part C: Emerging Technologies, 2020, v. 115, article no. 102626	-
dc.identifier.issn	0968-090X	-
dc.identifier.uri	http://hdl.handle.net/10722/296212	-
dc.description.abstract	© 2020 Elsevier Ltd In this paper, we define and investigate a novel model-free deep reinforcement learning framework to solve the taxi dispatch problem. The framework can be used to redistribute vehicles when the travel demand and taxi supply is either spatially or temporally imbalanced in a transportation network. While previous works mostly focus on using model-based methods, the goal of this paper is to explore the policy-based deep reinforcement learning algorithm as a model-free method to optimize the rebalancing strategy. In particular, we propose an actor-critic algorithm with feed-forward neural networks as approximations of both policy and value functions, where the policy function provides the optimal dispatch strategy and the value function estimates the expected costs at each time stamp. Our numerical studies show that the algorithm converges to the theoretical upper bound with less than 4% optimality gap, whether the system dynamics are deterministic or stochastic. We also investigate the scenario where we consider user priority and fairness, and the results indicate that our learned policy is capable of producing a superior strategy that balances equity, cancellation, and level of service when user priority is considered.	-
dc.language	eng	-
dc.relation.ispartof	Transportation Research Part C: Emerging Technologies	-
dc.subject	Autonomous vehicles	-
dc.subject	Deep reinforcement learning	-
dc.subject	Actor-critic algorithm	-
dc.subject	Demand rebalancing	-
dc.subject	Ride sharing	-
dc.subject	Policy-gradient	-
dc.subject	Taxi dispatching	-
dc.title	Dispatch of autonomous vehicles for taxi services: A deep reinforcement learning approach	-
dc.type	Article	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1016/j.trc.2020.102626	-
dc.identifier.scopus	eid_2-s2.0-85082970900	-
dc.identifier.volume	115	-
dc.identifier.spage	article no. 102626	-
dc.identifier.epage	article no. 102626	-
dc.identifier.isi	WOS:000535426900018	-
dc.identifier.issnl	0968-090X	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: Dispatch of autonomous vehicles for taxi services: A deep reinforcement learning approach

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats