Towards Capturing the Temporal Dynamics for Trajectory Prediction: a Coarse-to-Fine Approach

Jia, Xiaosong; Chen, Li; Wu, Penghao; Zeng, Jia; Yan, Junchi; Li, Hongyang; Qiao, Yu

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Scopus: eid_2-s2.0-85153783223

Supplementary

Citations:
- Scopus: 0
Appears in Collections:
- HKU Musketeers Foundation Institute of Data Science: Conference papers

Conference Paper: Towards Capturing the Temporal Dynamics for Trajectory Prediction: a Coarse-to-Fine Approach

Title	Towards Capturing the Temporal Dynamics for Trajectory Prediction: a Coarse-to-Fine Approach
Authors	Jia, Xiaosong Chen, Li Wu, Penghao Zeng, Jia Yan, Junchi Li, Hongyang Qiao, Yu
Keywords	Autonomous Driving Temporal Correlation Trajectory Prediction
Issue Date	2023
Citation	Proceedings of Machine Learning Research, 2023, v. 205, p. 910-920 How to Cite?
Abstract	Trajectory prediction is one of the basic tasks in the autonomous driving field, which aims to predict the future position of other agents around the ego vehicle so that a safe yet efficient driving plan could be generated in the downstream module. Recently, deep learning based methods dominate the field. State-of-the-art (SOTA) methods usually follow an encoder-decoder paradigm. Specifically, the encoder is responsible for extracting information from agents' history states and HD-Map and providing a representation vector for each agent. Taking these vectors as input, the decoder predicts multi-step future positions for each agent, which is usually accomplished by a single multi-layer perceptron (MLP) to directly output a Tx2 tensor. Though models with adoptation of MLP decoder have dominated the leaderboard of multiple datasets, 'the elephant in the room is that the temporal correlation among future time-steps is ignored since there is no direct relation among output neurons of a MLP. In this work, we examine this design choice and investigate several ways to apply the temporal inductive bias into the generation of future trajectories on top of a SOTA encoder. We find that simply using autoregressive RNN to generate future positions would lead to significant performance drop even with techniques such as history highway and teacher forcing. Instead, taking scratch trajectories generated by MLP as input, an additional refinement module based on structures with temporal prior such as RNN or 1D-CNN could remarkably boost the accuracy. Furthermore, we examine several objective functions to emphasize the temporal priors. By the combination of aforementioned techniques to introduce the temporal prior, we improve the top-ranked method's performance by a large margin and achieve SOTA result on the Waymo Open Motion Challenge.
Persistent Identifier	http://hdl.handle.net/10722/351465

DC Field	Value	Language
dc.contributor.author	Jia, Xiaosong	-
dc.contributor.author	Chen, Li	-
dc.contributor.author	Wu, Penghao	-
dc.contributor.author	Zeng, Jia	-
dc.contributor.author	Yan, Junchi	-
dc.contributor.author	Li, Hongyang	-
dc.contributor.author	Qiao, Yu	-
dc.date.accessioned	2024-11-20T03:56:26Z	-
dc.date.available	2024-11-20T03:56:26Z	-
dc.date.issued	2023	-
dc.identifier.citation	Proceedings of Machine Learning Research, 2023, v. 205, p. 910-920	-
dc.identifier.uri	http://hdl.handle.net/10722/351465	-
dc.description.abstract	Trajectory prediction is one of the basic tasks in the autonomous driving field, which aims to predict the future position of other agents around the ego vehicle so that a safe yet efficient driving plan could be generated in the downstream module. Recently, deep learning based methods dominate the field. State-of-the-art (SOTA) methods usually follow an encoder-decoder paradigm. Specifically, the encoder is responsible for extracting information from agents' history states and HD-Map and providing a representation vector for each agent. Taking these vectors as input, the decoder predicts multi-step future positions for each agent, which is usually accomplished by a single multi-layer perceptron (MLP) to directly output a Tx2 tensor. Though models with adoptation of MLP decoder have dominated the leaderboard of multiple datasets, 'the elephant in the room is that the temporal correlation among future time-steps is ignored since there is no direct relation among output neurons of a MLP. In this work, we examine this design choice and investigate several ways to apply the temporal inductive bias into the generation of future trajectories on top of a SOTA encoder. We find that simply using autoregressive RNN to generate future positions would lead to significant performance drop even with techniques such as history highway and teacher forcing. Instead, taking scratch trajectories generated by MLP as input, an additional refinement module based on structures with temporal prior such as RNN or 1D-CNN could remarkably boost the accuracy. Furthermore, we examine several objective functions to emphasize the temporal priors. By the combination of aforementioned techniques to introduce the temporal prior, we improve the top-ranked method's performance by a large margin and achieve SOTA result on the Waymo Open Motion Challenge.	-
dc.language	eng	-
dc.relation.ispartof	Proceedings of Machine Learning Research	-
dc.subject	Autonomous Driving	-
dc.subject	Temporal Correlation	-
dc.subject	Trajectory Prediction	-
dc.title	Towards Capturing the Temporal Dynamics for Trajectory Prediction: a Coarse-to-Fine Approach	-
dc.type	Conference_Paper	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.scopus	eid_2-s2.0-85153783223	-
dc.identifier.volume	205	-
dc.identifier.spage	910	-
dc.identifier.epage	920	-
dc.identifier.eissn	2640-3498	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Conference Paper: Towards Capturing the Temporal Dynamics for Trajectory Prediction: a Coarse-to-Fine Approach

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats