File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1145/3022471
- Scopus: eid_2-s2.0-85028594056
- WOS: WOS:000414319000009
- Find via
Supplementary
- Citations:
- Appears in Collections:
Article: An unsupervised approach to inferring the localness of people using incomplete geotemporal online check-in data
Title | An unsupervised approach to inferring the localness of people using incomplete geotemporal online check-in data |
---|---|
Authors | |
Keywords | Crowdsourcing GPU implementation Localness of people Maximum likelihood estimation Online social networks Unsupervised learning |
Issue Date | 2017 |
Citation | ACM Transactions on Intelligent Systems and Technology, 2017, v. 8, n. 6, article no. 80 How to Cite? |
Abstract | Inferring the localness of people is to classify people who are local residents in a city from people who visit the city by analyzing online check-in points that are contributed by online users. This information is critical for the urban planning, user profiling, and localized recommendation systems. Supervised learning approaches have been developed to infer the location of people in a city by assuming the availability of high-quality training datasets with complete geotemporal information. In this article, we develop an unsupervised model to accurately identify local people in a city by using the incomplete online check-in data that are publicly available. In particular, we develop an incomplete geotemporal expectation maximization (IGT-EM) scheme, which incorporates a set of hidden variables to represent the localness of people and a set of estimation parameters to represent the likelihood of venues to attract local and nonlocal people, respectively. Our solution can accurately classify local people from nonlocal nones without requiring any training data. We also implement a parallel IGT-EM algorithm by leveraging the computing power of a graphic processing unit (GPU) that consists of 2,496 cores. In the evaluation, we compare our new approach with the existing solutions through four real-world case studies using data from the New York City, Chicago, Boston, and Washington, DC. The results show that our approach can identify the local people and significantly outperform the compared baselines in estimation accuracy and execution time. |
Persistent Identifier | http://hdl.handle.net/10722/308729 |
ISSN | 2023 Impact Factor: 7.2 2023 SCImago Journal Rankings: 1.882 |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Huang, Chao | - |
dc.contributor.author | Wang, Dong | - |
dc.contributor.author | Tao, Jun | - |
dc.date.accessioned | 2021-12-08T07:50:00Z | - |
dc.date.available | 2021-12-08T07:50:00Z | - |
dc.date.issued | 2017 | - |
dc.identifier.citation | ACM Transactions on Intelligent Systems and Technology, 2017, v. 8, n. 6, article no. 80 | - |
dc.identifier.issn | 2157-6904 | - |
dc.identifier.uri | http://hdl.handle.net/10722/308729 | - |
dc.description.abstract | Inferring the localness of people is to classify people who are local residents in a city from people who visit the city by analyzing online check-in points that are contributed by online users. This information is critical for the urban planning, user profiling, and localized recommendation systems. Supervised learning approaches have been developed to infer the location of people in a city by assuming the availability of high-quality training datasets with complete geotemporal information. In this article, we develop an unsupervised model to accurately identify local people in a city by using the incomplete online check-in data that are publicly available. In particular, we develop an incomplete geotemporal expectation maximization (IGT-EM) scheme, which incorporates a set of hidden variables to represent the localness of people and a set of estimation parameters to represent the likelihood of venues to attract local and nonlocal people, respectively. Our solution can accurately classify local people from nonlocal nones without requiring any training data. We also implement a parallel IGT-EM algorithm by leveraging the computing power of a graphic processing unit (GPU) that consists of 2,496 cores. In the evaluation, we compare our new approach with the existing solutions through four real-world case studies using data from the New York City, Chicago, Boston, and Washington, DC. The results show that our approach can identify the local people and significantly outperform the compared baselines in estimation accuracy and execution time. | - |
dc.language | eng | - |
dc.relation.ispartof | ACM Transactions on Intelligent Systems and Technology | - |
dc.subject | Crowdsourcing | - |
dc.subject | GPU implementation | - |
dc.subject | Localness of people | - |
dc.subject | Maximum likelihood estimation | - |
dc.subject | Online social networks | - |
dc.subject | Unsupervised learning | - |
dc.title | An unsupervised approach to inferring the localness of people using incomplete geotemporal online check-in data | - |
dc.type | Article | - |
dc.description.nature | link_to_OA_fulltext | - |
dc.identifier.doi | 10.1145/3022471 | - |
dc.identifier.scopus | eid_2-s2.0-85028594056 | - |
dc.identifier.volume | 8 | - |
dc.identifier.issue | 6 | - |
dc.identifier.spage | article no. 80 | - |
dc.identifier.epage | article no. 80 | - |
dc.identifier.eissn | 2157-6912 | - |
dc.identifier.isi | WOS:000414319000009 | - |