A reinforcement learning framework for the adaptive routing problem in stochastic time-dependent network

Mao, Chao; Shen, Zuojun

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1016/j.trc.2018.06.001
Scopus: eid_2-s2.0-85048325293
WOS: WOS:000442173400011
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
Appears in Collections:
- Industrial & Manufacturing Systems Engineering: Journal/Magazine Articles
- President's Office: Journal/Magazine Articles

See more details

Article: A reinforcement learning framework for the adaptive routing problem in stochastic time-dependent network

Title	A reinforcement learning framework for the adaptive routing problem in stochastic time-dependent network
Authors	Mao, Chao Shen, Zuojun
Keywords	Tree-based function approximation Adaptive routing Q learning Reinforcement learning Fitted Q iteration
Issue Date	2018
Citation	Transportation Research Part C: Emerging Technologies, 2018, v. 93, p. 179-197 How to Cite? DOI: http://dx.doi.org/10.1016/j.trc.2018.06.001
Abstract	© 2018 Elsevier Ltd Most previous work in addressing the adaptive routing problem in stochastic and time-dependent (STD) network has been focusing on developing parametric models to reflect the network dynamics and designing efficient algorithms to solve these models. However, strong assumptions need to be made in the models and some algorithms also suffer from the curse of dimensionality. In this paper, we examine the application of Reinforcement Learning as a non-parametric model-free method to solve the problem. Both the online Q learning method for discrete state space and the offline fitted Q iteration algorithm for continuous state space are discussed. With a small case study on a mid-sized network, we demonstrate the significant advantages of using Reinforcement Learning to solve for the optimal routing policy over traditional stochastic dynamic programming method. And the fitted Q iteration algorithm combined with tree-based function approximation is shown to outperform other methods especially during peak demand periods.
Persistent Identifier	http://hdl.handle.net/10722/296176
ISSN	0968-090X 2023 Impact Factor: 7.6 2023 SCImago Journal Rankings: 2.860
ISI Accession Number ID	WOS:000442173400011

DC Field	Value	Language
dc.contributor.author	Mao, Chao	-
dc.contributor.author	Shen, Zuojun	-
dc.date.accessioned	2021-02-11T04:53:00Z	-
dc.date.available	2021-02-11T04:53:00Z	-
dc.date.issued	2018	-
dc.identifier.citation	Transportation Research Part C: Emerging Technologies, 2018, v. 93, p. 179-197	-
dc.identifier.issn	0968-090X	-
dc.identifier.uri	http://hdl.handle.net/10722/296176	-
dc.description.abstract	© 2018 Elsevier Ltd Most previous work in addressing the adaptive routing problem in stochastic and time-dependent (STD) network has been focusing on developing parametric models to reflect the network dynamics and designing efficient algorithms to solve these models. However, strong assumptions need to be made in the models and some algorithms also suffer from the curse of dimensionality. In this paper, we examine the application of Reinforcement Learning as a non-parametric model-free method to solve the problem. Both the online Q learning method for discrete state space and the offline fitted Q iteration algorithm for continuous state space are discussed. With a small case study on a mid-sized network, we demonstrate the significant advantages of using Reinforcement Learning to solve for the optimal routing policy over traditional stochastic dynamic programming method. And the fitted Q iteration algorithm combined with tree-based function approximation is shown to outperform other methods especially during peak demand periods.	-
dc.language	eng	-
dc.relation.ispartof	Transportation Research Part C: Emerging Technologies	-
dc.subject	Tree-based function approximation	-
dc.subject	Adaptive routing	-
dc.subject	Q learning	-
dc.subject	Reinforcement learning	-
dc.subject	Fitted Q iteration	-
dc.title	A reinforcement learning framework for the adaptive routing problem in stochastic time-dependent network	-
dc.type	Article	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1016/j.trc.2018.06.001	-
dc.identifier.scopus	eid_2-s2.0-85048325293	-
dc.identifier.volume	93	-
dc.identifier.spage	179	-
dc.identifier.epage	197	-
dc.identifier.isi	WOS:000442173400011	-
dc.identifier.issnl	0968-090X	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: A reinforcement learning framework for the adaptive routing problem in stochastic time-dependent network

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats