A policy gradient approach to solving dynamic assignment problem for on-site service delivery

Yan, Yimo; Deng, Yang; Cui, Songyi; Kuo, Yong Hong; Chow, Andy HF; Ying, Chengshuo

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1016/j.tre.2023.103260
Scopus: eid_2-s2.0-85172321129
WOS: WOS:001078391100001
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
Appears in Collections:
- Industrial & Manufacturing Systems Engineering: Journal/Magazine Articles

Article: A policy gradient approach to solving dynamic assignment problem for on-site service delivery

Title	A policy gradient approach to solving dynamic assignment problem for on-site service delivery
Authors	Yan, Yimo Deng, Yang Cui, Songyi Kuo, Yong Hong Chow, Andy HF Ying, Chengshuo
Keywords	Dynamic assignment problem On-site service delivery Policy gradient Resource allocation Semi-Markov decision process
Issue Date	1-Oct-2023
Publisher	Elsevier
Citation	Transportation Research Part E: Logistics and Transportation Review, 2023, v. 178, p. 103260 How to Cite? DOI: http://dx.doi.org/10.1016/j.tre.2023.103260
Abstract	The paper studies the resource allocation problem for delivering on-site services in urban areas. Requests for services are received spontaneously, with deliveries to be assigned dynamically. Real-life examples of such applications include the dispatch of traffic officers to scenes of accidents and the deployment of mechanics to sites of maintenance works. The dynamic assignment problem is to be solved via a policy gradient approach that dynamically assigns workers to different locations so that each customer involved would experience a minimum delay. Our solution framework adopts the transformer architecture with layers of inter-task and inter-agent communications as the approximator. This approximator is trained with the vanilla policy gradient algorithm. To improve computational effectiveness, we introduce an option of withholding an assignment, where workers may not be assigned at a decision point even if a service request is received, to enhance the flexibility of actions. Extensive computational experiments with a varying number of orders, order frequencies, and spatial sparsity are conducted. Our proposed method is shown to outperform other benchmarking methods, including the genetic algorithm and other online heuristics, in terms of stability of effectiveness, computational efficiency, and solution quality. Our experimental results suggest that the proposed method would have a reduced advantage over other benchmarking algorithms if the on-site service time is long.
Persistent Identifier	http://hdl.handle.net/10722/346446
ISSN	1366-5545 2023 Impact Factor: 8.3 2023 SCImago Journal Rankings: 2.884
ISI Accession Number ID	WOS:001078391100001

DC Field	Value	Language
dc.contributor.author	Yan, Yimo	-
dc.contributor.author	Deng, Yang	-
dc.contributor.author	Cui, Songyi	-
dc.contributor.author	Kuo, Yong Hong	-
dc.contributor.author	Chow, Andy HF	-
dc.contributor.author	Ying, Chengshuo	-
dc.date.accessioned	2024-09-17T00:30:37Z	-
dc.date.available	2024-09-17T00:30:37Z	-
dc.date.issued	2023-10-01	-
dc.identifier.citation	Transportation Research Part E: Logistics and Transportation Review, 2023, v. 178, p. 103260	-
dc.identifier.issn	1366-5545	-
dc.identifier.uri	http://hdl.handle.net/10722/346446	-
dc.description.abstract	The paper studies the resource allocation problem for delivering on-site services in urban areas. Requests for services are received spontaneously, with deliveries to be assigned dynamically. Real-life examples of such applications include the dispatch of traffic officers to scenes of accidents and the deployment of mechanics to sites of maintenance works. The dynamic assignment problem is to be solved via a policy gradient approach that dynamically assigns workers to different locations so that each customer involved would experience a minimum delay. Our solution framework adopts the transformer architecture with layers of inter-task and inter-agent communications as the approximator. This approximator is trained with the vanilla policy gradient algorithm. To improve computational effectiveness, we introduce an option of withholding an assignment, where workers may not be assigned at a decision point even if a service request is received, to enhance the flexibility of actions. Extensive computational experiments with a varying number of orders, order frequencies, and spatial sparsity are conducted. Our proposed method is shown to outperform other benchmarking methods, including the genetic algorithm and other online heuristics, in terms of stability of effectiveness, computational efficiency, and solution quality. Our experimental results suggest that the proposed method would have a reduced advantage over other benchmarking algorithms if the on-site service time is long.	-
dc.language	eng	-
dc.publisher	Elsevier	-
dc.relation.ispartof	Transportation Research Part E: Logistics and Transportation Review	-
dc.subject	Dynamic assignment problem	-
dc.subject	On-site service delivery	-
dc.subject	Policy gradient	-
dc.subject	Resource allocation	-
dc.subject	Semi-Markov decision process	-
dc.title	A policy gradient approach to solving dynamic assignment problem for on-site service delivery	-
dc.type	Article	-
dc.identifier.doi	10.1016/j.tre.2023.103260	-
dc.identifier.scopus	eid_2-s2.0-85172321129	-
dc.identifier.volume	178	-
dc.identifier.spage	103260	-
dc.identifier.eissn	1878-5794	-
dc.identifier.isi	WOS:001078391100001	-
dc.publisher.place	OXFORD	-
dc.identifier.issnl	1366-5545	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: A policy gradient approach to solving dynamic assignment problem for on-site service delivery

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats