File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: An unsupervised topic-sentiment joint probabilistic model for detecting deceptive reviews

TitleAn unsupervised topic-sentiment joint probabilistic model for detecting deceptive reviews
Authors
Issue Date2018
PublisherElsevier. The Journal's web site is located at http://www.elsevier.com/locate/eswa
Citation
Expert Systems with Applications, 2018, v. 114, p. 210-223 How to Cite?
AbstractIn electronic commerce, online reviews play very important roles in customers’ purchasing decisions. Unfortunately, malicious sellers often hire buyers to fabricate fake reviews to improve their reputation. In order to detect deceptive reviews and mine the topics and sentiments from the reviews, in this paper, we propose an unsupervised topic-sentiment joint probabilistic model (UTSJ) based on Latent Dirichlet Allocation (LDA) model. This model first employs Gibbs sampling algorithm to approximate parameters of maximum likelihood function offline and obtain topic-sentiment joint probabilistic distribution vector for each review. Secondly, a Random Forest classifier and a SVM (Support Vector Machine) classifier are trained offline, respectively. Experimental results on real-life datasets show that our proposed model is better than baseline models such as n-grams, character n-grams in token, POS (part-of-speech), LDA, and JST (Joint Sentiment/Topic). Moreover, our UTSJ model outperforms or performs similarly to benchmark models in detecting deceptive reviews over balanced dataset and unbalanced dataset in different domains. Particularly, our UTSJ model is good at dealing with real-life unbalanced big data, which makes it very suitable for being applied in e-commerce environment.
Persistent Identifierhttp://hdl.handle.net/10722/261432

 

DC FieldValueLanguage
dc.contributor.authorDong, L-
dc.contributor.authorJi, S-
dc.contributor.authorZhang, C-
dc.contributor.authorZhang, Q-
dc.contributor.authorChiu, KWD-
dc.contributor.authorQiu, L-
dc.contributor.authorLi, D-
dc.date.accessioned2018-09-14T08:58:04Z-
dc.date.available2018-09-14T08:58:04Z-
dc.date.issued2018-
dc.identifier.citationExpert Systems with Applications, 2018, v. 114, p. 210-223-
dc.identifier.urihttp://hdl.handle.net/10722/261432-
dc.description.abstractIn electronic commerce, online reviews play very important roles in customers’ purchasing decisions. Unfortunately, malicious sellers often hire buyers to fabricate fake reviews to improve their reputation. In order to detect deceptive reviews and mine the topics and sentiments from the reviews, in this paper, we propose an unsupervised topic-sentiment joint probabilistic model (UTSJ) based on Latent Dirichlet Allocation (LDA) model. This model first employs Gibbs sampling algorithm to approximate parameters of maximum likelihood function offline and obtain topic-sentiment joint probabilistic distribution vector for each review. Secondly, a Random Forest classifier and a SVM (Support Vector Machine) classifier are trained offline, respectively. Experimental results on real-life datasets show that our proposed model is better than baseline models such as n-grams, character n-grams in token, POS (part-of-speech), LDA, and JST (Joint Sentiment/Topic). Moreover, our UTSJ model outperforms or performs similarly to benchmark models in detecting deceptive reviews over balanced dataset and unbalanced dataset in different domains. Particularly, our UTSJ model is good at dealing with real-life unbalanced big data, which makes it very suitable for being applied in e-commerce environment.-
dc.languageeng-
dc.publisherElsevier. The Journal's web site is located at http://www.elsevier.com/locate/eswa-
dc.relation.ispartofExpert Systems with Applications-
dc.titleAn unsupervised topic-sentiment joint probabilistic model for detecting deceptive reviews-
dc.typeArticle-
dc.identifier.emailChiu, KWD: dchiu88@hku.hk-
dc.identifier.doi10.1016/j.eswa.2018.07.005-
dc.identifier.hkuros291725-
dc.identifier.volume114-
dc.identifier.spage210-
dc.identifier.epage223-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats