Article: A policy gradient approach to solving dynamic assignment problem for on-site service delivery

Title: A policy gradient approach to solving dynamic assignment problem for on-site service delivery
Authors: Yan, Yimo; Deng, Yang; Cui, Songyi; Kuo, Yong Hong; Chow, Andy HF; Ying, Chengshuo
Keywords: Dynamic assignment problem; On-site service delivery; Policy gradient; Resource allocation; Semi-Markov decision process
Issue Date: 1-Oct-2023
Publisher: Elsevier
Citation: Transportation Research Part E: Logistics and Transportation Review, 2023, v. 178, p. 103260
Abstract: The paper studies the resource allocation problem for delivering on-site services in urban areas. Requests for services are received spontaneously, with deliveries to be assigned dynamically. Real-life examples of such applications include the dispatch of traffic officers to scenes of accidents and the deployment of mechanics to sites of maintenance works. The dynamic assignment problem is to be solved via a policy gradient approach that dynamically assigns workers to different locations so that each customer involved would experience a minimum delay. Our solution framework adopts the transformer architecture with layers of inter-task and inter-agent communications as the approximator. This approximator is trained with the vanilla policy gradient algorithm. To improve computational effectiveness, we introduce an option of withholding an assignment, where workers may not be assigned at a decision point even if a service request is received, to enhance the flexibility of actions. Extensive computational experiments with a varying number of orders, order frequencies, and spatial sparsity are conducted. Our proposed method is shown to outperform other benchmarking methods, including the genetic algorithm and other online heuristics, in terms of stability of effectiveness, computational efficiency, and solution quality. Our experimental results suggest that the proposed method would have a reduced advantage over other benchmarking algorithms if the on-site service time is long.
Persistent Identifier: http://hdl.handle.net/10722/346446
ISSN: 1366-5545
2023 Impact Factor: 8.3
2023 SCImago Journal Rankings: 2.884
ISI Accession Number ID: WOS:001078391100001
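
The abstract mentions training with the vanilla policy gradient algorithm and an action that withholds an assignment. As a rough illustration only, the sketch below applies a vanilla policy gradient (REINFORCE-style) update to a softmax policy over a handful of locations plus one withhold action. It is not the authors' method: the tabular logits stand in for the transformer approximator, the single-step setting ignores the semi-Markov structure, and the names `softmax`, `policy_gradient_step`, `train`, the delay values, and the withhold cost are all hypothetical.

```python
import math
import random


def softmax(logits):
    """Convert logits to action probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def policy_gradient_step(logits, action, ret, lr):
    """One vanilla policy gradient step for a softmax policy.

    For a softmax policy, d log pi(action) / d logit_j = 1[j == action] - pi(j),
    so each logit moves by lr * ret * (1[j == action] - pi(j)).
    """
    probs = softmax(logits)
    return [
        theta + lr * ret * ((1.0 if j == action else 0.0) - probs[j])
        for j, theta in enumerate(logits)
    ]


def train(delays, withhold_cost, episodes=2000, lr=0.05, seed=0):
    """Train on a toy one-step problem (all numbers are made up).

    Actions 0..len(delays)-1 send a worker to a location and incur that
    location's delay. The last action withholds the assignment and incurs
    `withhold_cost`. The return is the negative cost.
    """
    rng = random.Random(seed)
    n_actions = len(delays) + 1  # last index is the withhold action
    logits = [0.0] * n_actions
    for _ in range(episodes):
        probs = softmax(logits)
        action = rng.choices(range(n_actions), weights=probs)[0]
        cost = withhold_cost if action == len(delays) else delays[action]
        logits = policy_gradient_step(logits, action, -cost, lr)
    return softmax(logits)


if __name__ == "__main__":
    # Hypothetical delays for three locations and a hypothetical withhold cost.
    final_probs = train(delays=[3.0, 1.0, 2.0], withhold_cost=1.5)
    print([round(p, 3) for p in final_probs])
```

Because the return is a negative cost, a sampled action with a large delay has its probability pushed down more strongly than one with a small delay. No baseline is subtracted here, which keeps the update "vanilla" at the price of higher variance.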

 

DC Field | Value | Language
dc.contributor.author | Yan, Yimo | -
dc.contributor.author | Deng, Yang | -
dc.contributor.author | Cui, Songyi | -
dc.contributor.author | Kuo, Yong Hong | -
dc.contributor.author | Chow, Andy HF | -
dc.contributor.author | Ying, Chengshuo | -
dc.date.accessioned | 2024-09-17T00:30:37Z | -
dc.date.available | 2024-09-17T00:30:37Z | -
dc.date.issued | 2023-10-01 | -
dc.identifier.citation | Transportation Research Part E: Logistics and Transportation Review, 2023, v. 178, p. 103260 | -
dc.identifier.issn | 1366-5545 | -
dc.identifier.uri | http://hdl.handle.net/10722/346446 | -
dc.language | eng | -
dc.publisher | Elsevier | -
dc.relation.ispartof | Transportation Research Part E: Logistics and Transportation Review | -
dc.subject | Dynamic assignment problem | -
dc.subject | On-site service delivery | -
dc.subject | Policy gradient | -
dc.subject | Resource allocation | -
dc.subject | Semi-Markov decision process | -
dc.title | A policy gradient approach to solving dynamic assignment problem for on-site service delivery | -
dc.type | Article | -
dc.identifier.doi | 10.1016/j.tre.2023.103260 | -
dc.identifier.scopus | eid_2-s2.0-85172321129 | -
dc.identifier.volume | 178 | -
dc.identifier.spage | 103260 | -
dc.identifier.eissn | 1878-5794 | -
dc.identifier.isi | WOS:001078391100001 | -
dc.publisher.place | OXFORD | -
dc.identifier.issnl | 1366-5545 | -
