File Download
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1016/j.eswa.2018.07.005
- Scopus: eid_2-s2.0-85050764666
- WOS: WOS:000446949300016
- Find via
Supplementary
- Citations:
- Appears in Collections:
Article: An unsupervised topic-sentiment joint probabilistic model for detecting deceptive reviews
Title | An unsupervised topic-sentiment joint probabilistic model for detecting deceptive reviews |
---|---|
Authors | |
Keywords | Deceptive review detection Topic-sentiment joint probabilistic model Latent dirichlet allocation Gibbs sampling |
Issue Date | 2018 |
Publisher | Pergamon. The Journal's web site is located at http://www.elsevier.com/locate/eswa |
Citation | Expert Systems with Applications, 2018, v. 114, p. 210-223 How to Cite? |
Abstract | In electronic commerce, online reviews play very important roles in customers’ purchasing decisions. Unfortunately, malicious sellers often hire buyers to fabricate fake reviews to improve their reputation. In order to detect deceptive reviews and mine the topics and sentiments from the reviews, in this paper, we propose an unsupervised topic-sentiment joint probabilistic model (UTSJ) based on Latent Dirichlet Allocation (LDA) model. This model first employs Gibbs sampling algorithm to approximate parameters of maximum likelihood function offline and obtain topic-sentiment joint probabilistic distribution vector for each review. Secondly, a Random Forest classifier and a SVM (Support Vector Machine) classifier are trained offline, respectively. Experimental results on real-life datasets show that our proposed model is better than baseline models such as n-grams, character n-grams in token, POS (part-of-speech), LDA, and JST (Joint Sentiment/Topic). Moreover, our UTSJ model outperforms or performs similarly to benchmark models in detecting deceptive reviews over balanced dataset and unbalanced dataset in different domains. Particularly, our UTSJ model is good at dealing with real-life unbalanced big data, which makes it very suitable for being applied in e-commerce environment. |
Persistent Identifier | http://hdl.handle.net/10722/261432 |
ISSN | 2023 Impact Factor: 7.5 2023 SCImago Journal Rankings: 1.875 |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Dong, LY | - |
dc.contributor.author | Ji, SJ | - |
dc.contributor.author | Zhang, CJ | - |
dc.contributor.author | Zhang, Q | - |
dc.contributor.author | Chiu, DKW | - |
dc.contributor.author | Qiu, LQ | - |
dc.contributor.author | Li, D | - |
dc.date.accessioned | 2018-09-14T08:58:04Z | - |
dc.date.available | 2018-09-14T08:58:04Z | - |
dc.date.issued | 2018 | - |
dc.identifier.citation | Expert Systems with Applications, 2018, v. 114, p. 210-223 | - |
dc.identifier.issn | 0957-4174 | - |
dc.identifier.uri | http://hdl.handle.net/10722/261432 | - |
dc.description.abstract | In electronic commerce, online reviews play very important roles in customers’ purchasing decisions. Unfortunately, malicious sellers often hire buyers to fabricate fake reviews to improve their reputation. In order to detect deceptive reviews and mine the topics and sentiments from the reviews, in this paper, we propose an unsupervised topic-sentiment joint probabilistic model (UTSJ) based on Latent Dirichlet Allocation (LDA) model. This model first employs Gibbs sampling algorithm to approximate parameters of maximum likelihood function offline and obtain topic-sentiment joint probabilistic distribution vector for each review. Secondly, a Random Forest classifier and a SVM (Support Vector Machine) classifier are trained offline, respectively. Experimental results on real-life datasets show that our proposed model is better than baseline models such as n-grams, character n-grams in token, POS (part-of-speech), LDA, and JST (Joint Sentiment/Topic). Moreover, our UTSJ model outperforms or performs similarly to benchmark models in detecting deceptive reviews over balanced dataset and unbalanced dataset in different domains. Particularly, our UTSJ model is good at dealing with real-life unbalanced big data, which makes it very suitable for being applied in e-commerce environment. | - |
dc.language | eng | - |
dc.publisher | Pergamon. The Journal's web site is located at http://www.elsevier.com/locate/eswa | - |
dc.relation.ispartof | Expert Systems with Applications | - |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.subject | Deceptive review detection | - |
dc.subject | Topic-sentiment joint probabilistic model | - |
dc.subject | Latent dirichlet allocation | - |
dc.subject | Gibbs sampling | - |
dc.title | An unsupervised topic-sentiment joint probabilistic model for detecting deceptive reviews | - |
dc.type | Article | - |
dc.identifier.email | Chiu, DKW: dchiu88@hku.hk | - |
dc.description.nature | postprint | - |
dc.identifier.doi | 10.1016/j.eswa.2018.07.005 | - |
dc.identifier.scopus | eid_2-s2.0-85050764666 | - |
dc.identifier.hkuros | 291725 | - |
dc.identifier.volume | 114 | - |
dc.identifier.spage | 210 | - |
dc.identifier.epage | 223 | - |
dc.identifier.isi | WOS:000446949300016 | - |
dc.publisher.place | United Kingdom | - |
dc.identifier.issnl | 0957-4174 | - |