UrbanGPT: Spatio-Temporal Large Language Models

Li, Zhonghang; Xia, Lianghao; Tang, Jiabin; Xu, Yong; Shi, Lei; Xia, Long; Yin, Dawei; Huang, Chao

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1145/3637528.3671578
Scopus: eid_2-s2.0-85203709974
WOS: WOS:001324524205047

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
Appears in Collections:
- Computer Science: Conference papers

Conference Paper: UrbanGPT: Spatio-Temporal Large Language Models

Title	UrbanGPT: Spatio-Temporal Large Language Models
Authors	Li, Zhonghang; Xia, Lianghao; Tang, Jiabin; Xu, Yong; Shi, Lei; Xia, Long; Yin, Dawei; Huang, Chao
Keywords	generative ai; large language models; smart cities; spatial-temporal data mining; urban computing
Issue Date	2024
Citation	Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2024, p. 5351-5362. DOI: http://dx.doi.org/10.1145/3637528.3671578
Abstract	Spatio-temporal prediction aims to forecast and gain insights into the ever-changing dynamics of urban environments across both time and space. Its purpose is to anticipate future patterns, trends, and events in diverse facets of urban life, including transportation, population movement, and crime rates. Although numerous efforts have been dedicated to developing neural network techniques for accurate predictions on spatio-temporal data, it is important to note that many of these methods heavily depend on having sufficient labeled data to generate precise spatio-temporal representations. Unfortunately, the issue of data scarcity is pervasive in practical urban sensing scenarios. In certain cases, it becomes challenging to collect any labeled data from downstream scenarios, intensifying the problem further. Consequently, it becomes necessary to build a spatio-temporal model that can exhibit strong generalization capabilities across diverse spatio-temporal learning scenarios. Taking inspiration from the remarkable achievements of large language models (LLMs), our objective is to create a spatio-temporal LLM that can exhibit exceptional generalization capabilities across a wide range of downstream urban tasks. To achieve this objective, we present the UrbanGPT, which seamlessly integrates a spatio-temporal dependency encoder with the instruction-tuning paradigm. This integration enables LLMs to comprehend the complex inter-dependencies across time and space, facilitating more comprehensive and accurate predictions under data scarcity. To validate the effectiveness of our approach, we conduct extensive experiments on various public datasets, covering different spatio-temporal prediction tasks. The results consistently demonstrate that our UrbanGPT, with its carefully designed architecture, consistently outperforms state-of-the-art baselines. These findings highlight the potential of building large language models for spatio-temporal learning, particularly in zero-shot scenarios where labeled data is scarce. The code and data are available at: https://github.com/HKUDS/UrbanGPT.
Persistent Identifier	http://hdl.handle.net/10722/355978
ISSN	2154-817X
ISI Accession Number ID	WOS:001324524205047

DC Field	Value	Language
dc.contributor.author	Li, Zhonghang	-
dc.contributor.author	Xia, Lianghao	-
dc.contributor.author	Tang, Jiabin	-
dc.contributor.author	Xu, Yong	-
dc.contributor.author	Shi, Lei	-
dc.contributor.author	Xia, Long	-
dc.contributor.author	Yin, Dawei	-
dc.contributor.author	Huang, Chao	-
dc.date.accessioned	2025-05-19T05:47:02Z	-
dc.date.available	2025-05-19T05:47:02Z	-
dc.date.issued	2024	-
dc.identifier.citation	Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2024, p. 5351-5362	-
dc.identifier.issn	2154-817X	-
dc.identifier.uri	http://hdl.handle.net/10722/355978	-
dc.description.abstract	Spatio-temporal prediction aims to forecast and gain insights into the ever-changing dynamics of urban environments across both time and space. Its purpose is to anticipate future patterns, trends, and events in diverse facets of urban life, including transportation, population movement, and crime rates. Although numerous efforts have been dedicated to developing neural network techniques for accurate predictions on spatio-temporal data, it is important to note that many of these methods heavily depend on having sufficient labeled data to generate precise spatio-temporal representations. Unfortunately, the issue of data scarcity is pervasive in practical urban sensing scenarios. In certain cases, it becomes challenging to collect any labeled data from downstream scenarios, intensifying the problem further. Consequently, it becomes necessary to build a spatio-temporal model that can exhibit strong generalization capabilities across diverse spatio-temporal learning scenarios. Taking inspiration from the remarkable achievements of large language models (LLMs), our objective is to create a spatio-temporal LLM that can exhibit exceptional generalization capabilities across a wide range of downstream urban tasks. To achieve this objective, we present the UrbanGPT, which seamlessly integrates a spatio-temporal dependency encoder with the instruction-tuning paradigm. This integration enables LLMs to comprehend the complex inter-dependencies across time and space, facilitating more comprehensive and accurate predictions under data scarcity. To validate the effectiveness of our approach, we conduct extensive experiments on various public datasets, covering different spatio-temporal prediction tasks. The results consistently demonstrate that our UrbanGPT, with its carefully designed architecture, consistently outperforms state-of-the-art baselines. These findings highlight the potential of building large language models for spatio-temporal learning, particularly in zero-shot scenarios where labeled data is scarce. The code and data are available at: https://github.com/HKUDS/UrbanGPT.	-
dc.language	eng	-
dc.relation.ispartof	Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining	-
dc.subject	generative ai	-
dc.subject	large language models	-
dc.subject	smart cities	-
dc.subject	spatial-temporal data mining	-
dc.subject	urban computing	-
dc.title	UrbanGPT: Spatio-Temporal Large Language Models	-
dc.type	Conference_Paper	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1145/3637528.3671578	-
dc.identifier.scopus	eid_2-s2.0-85203709974	-
dc.identifier.spage	5351	-
dc.identifier.epage	5362	-
dc.identifier.isi	WOS:001324524205047	-
