An unsupervised approach to inferring the localness of people using incomplete geotemporal online check-in data

Huang, Chao; Wang, Dong; Tao, Jun

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1145/3022471
Scopus: eid_2-s2.0-85028594056
WOS: WOS:000414319000009
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
Appears in Collections:
- Computer Science: Journal/Magazine Articles

Article: An unsupervised approach to inferring the localness of people using incomplete geotemporal online check-in data

Title	An unsupervised approach to inferring the localness of people using incomplete geotemporal online check-in data
Authors	Huang, Chao Wang, Dong Tao, Jun
Keywords	Crowdsourcing GPU implementation Localness of people Maximum likelihood estimation Online social networks Unsupervised learning
Issue Date	2017
Citation	ACM Transactions on Intelligent Systems and Technology, 2017, v. 8, n. 6, article no. 80 How to Cite? DOI: http://dx.doi.org/10.1145/3022471
Abstract	Inferring the localness of people is to classify people who are local residents in a city from people who visit the city by analyzing online check-in points that are contributed by online users. This information is critical for the urban planning, user profiling, and localized recommendation systems. Supervised learning approaches have been developed to infer the location of people in a city by assuming the availability of high-quality training datasets with complete geotemporal information. In this article, we develop an unsupervised model to accurately identify local people in a city by using the incomplete online check-in data that are publicly available. In particular, we develop an incomplete geotemporal expectation maximization (IGT-EM) scheme, which incorporates a set of hidden variables to represent the localness of people and a set of estimation parameters to represent the likelihood of venues to attract local and nonlocal people, respectively. Our solution can accurately classify local people from nonlocal nones without requiring any training data. We also implement a parallel IGT-EM algorithm by leveraging the computing power of a graphic processing unit (GPU) that consists of 2,496 cores. In the evaluation, we compare our new approach with the existing solutions through four real-world case studies using data from the New York City, Chicago, Boston, and Washington, DC. The results show that our approach can identify the local people and significantly outperform the compared baselines in estimation accuracy and execution time.
Persistent Identifier	http://hdl.handle.net/10722/308729
ISSN	2157-6904 2023 Impact Factor: 7.2 2023 SCImago Journal Rankings: 1.882
ISI Accession Number ID	WOS:000414319000009

DC Field	Value	Language
dc.contributor.author	Huang, Chao	-
dc.contributor.author	Wang, Dong	-
dc.contributor.author	Tao, Jun	-
dc.date.accessioned	2021-12-08T07:50:00Z	-
dc.date.available	2021-12-08T07:50:00Z	-
dc.date.issued	2017	-
dc.identifier.citation	ACM Transactions on Intelligent Systems and Technology, 2017, v. 8, n. 6, article no. 80	-
dc.identifier.issn	2157-6904	-
dc.identifier.uri	http://hdl.handle.net/10722/308729	-
dc.description.abstract	Inferring the localness of people is to classify people who are local residents in a city from people who visit the city by analyzing online check-in points that are contributed by online users. This information is critical for the urban planning, user profiling, and localized recommendation systems. Supervised learning approaches have been developed to infer the location of people in a city by assuming the availability of high-quality training datasets with complete geotemporal information. In this article, we develop an unsupervised model to accurately identify local people in a city by using the incomplete online check-in data that are publicly available. In particular, we develop an incomplete geotemporal expectation maximization (IGT-EM) scheme, which incorporates a set of hidden variables to represent the localness of people and a set of estimation parameters to represent the likelihood of venues to attract local and nonlocal people, respectively. Our solution can accurately classify local people from nonlocal nones without requiring any training data. We also implement a parallel IGT-EM algorithm by leveraging the computing power of a graphic processing unit (GPU) that consists of 2,496 cores. In the evaluation, we compare our new approach with the existing solutions through four real-world case studies using data from the New York City, Chicago, Boston, and Washington, DC. The results show that our approach can identify the local people and significantly outperform the compared baselines in estimation accuracy and execution time.	-
dc.language	eng	-
dc.relation.ispartof	ACM Transactions on Intelligent Systems and Technology	-
dc.subject	Crowdsourcing	-
dc.subject	GPU implementation	-
dc.subject	Localness of people	-
dc.subject	Maximum likelihood estimation	-
dc.subject	Online social networks	-
dc.subject	Unsupervised learning	-
dc.title	An unsupervised approach to inferring the localness of people using incomplete geotemporal online check-in data	-
dc.type	Article	-
dc.description.nature	link_to_OA_fulltext	-
dc.identifier.doi	10.1145/3022471	-
dc.identifier.scopus	eid_2-s2.0-85028594056	-
dc.identifier.volume	8	-
dc.identifier.issue	6	-
dc.identifier.spage	article no. 80	-
dc.identifier.epage	article no. 80	-
dc.identifier.eissn	2157-6912	-
dc.identifier.isi	WOS:000414319000009	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: An unsupervised approach to inferring the localness of people using incomplete geotemporal online check-in data

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats