Efficacy of ChatGPT in Cantonese Sentiment Analysis: Comparative Study

Fu, Ziru; Hsu, Yu Cheng; Chan, Christian S; Lau, Chaak Ming; Liu, Joyce; Yip, Paul Siu Fai

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.2196/51069
Scopus: eid_2-s2.0-85184344413
PMID: 38289662
WOS: WOS:001171070000003
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
- PubMed Central: 0
Appears in Collections:

Article: Efficacy of ChatGPT in Cantonese Sentiment Analysis: Comparative Study

Title	Efficacy of ChatGPT in Cantonese Sentiment Analysis: Comparative Study
Authors	Fu, Ziru Hsu, Yu Cheng Chan, Christian S Lau, Chaak Ming Liu, Joyce Yip, Paul Siu Fai
Keywords	Cantonese ChatGPT counseling natural language processing NLP sentiment analysis
Issue Date	30-Jan-2024
Publisher	JMIR Publications Inc.
Citation	Journal of Medical Internet Research, 2024, v. 26, n. 1 How to Cite? DOI: http://dx.doi.org/10.2196/51069
Abstract	Background: Sentiment analysis is a significant yet difficult task in natural language processing. The linguistic peculiarities of Cantonese, including its high similarity with Standard Chinese, its grammatical and lexical uniqueness, and its colloquialism and multilingualism, make it different from other languages and pose additional challenges to sentiment analysis. Recent advances in models such as ChatGPT offer potential viable solutions. Objective: This study investigated the efficacy of GPT-3.5 and GPT-4 in Cantonese sentiment analysis in the context of web-based counseling and compared their performance with other mainstream methods, including lexicon-based methods and machine learning approaches. Methods: We analyzed transcripts from a web-based, text-based counseling service in Hong Kong, including a total of 131 individual counseling sessions and 6169 messages between counselors and help-seekers. First, a codebook was developed for human annotation. A simple prompt (“Is the sentiment of this Cantonese text positive, neutral, or negative? Respond with the sentiment label only.”) was then given to GPT-3.5 and GPT-4 to label each message’s sentiment. GPT-3.5 and GPT-4’s performance was compared with a lexicon-based method and 3 state-of-the-art models, including linear regression, support vector machines, and long short-term memory neural networks. Results: Our findings revealed ChatGPT’s remarkable accuracy in sentiment classification, with GPT-3.5 and GPT-4, respectively, achieving 92.1% (5682/6169) and 95.3% (5880/6169) accuracy in identifying positive, neutral, and negative sentiment, thereby outperforming the traditional lexicon-based method, which had an accuracy of 37.2% (2295/6169), and the 3 machine learning models, which had accuracies ranging from 66% (4072/6169) to 70.9% (4374/6169). Conclusions: Among many text analysis techniques, ChatGPT demonstrates superior accuracy and emerges as a promising tool for Cantonese sentiment analysis. This study also highlights ChatGPT’s applicability in real-world scenarios, such as monitoring the quality of text-based counseling services and detecting message-level sentiments in vivo. The insights derived from this study pave the way for further exploration into the capabilities of ChatGPT in the context of underresourced languages and specialized domains like psychotherapy and natural language processing.
Persistent Identifier	http://hdl.handle.net/10722/348406
ISSN	1439-4456 2023 SCImago Journal Rankings: 2.020
ISI Accession Number ID	WOS:001171070000003

DC Field	Value	Language
dc.contributor.author	Fu, Ziru	-
dc.contributor.author	Hsu, Yu Cheng	-
dc.contributor.author	Chan, Christian S	-
dc.contributor.author	Lau, Chaak Ming	-
dc.contributor.author	Liu, Joyce	-
dc.contributor.author	Yip, Paul Siu Fai	-
dc.date.accessioned	2024-10-09T00:31:18Z	-
dc.date.available	2024-10-09T00:31:18Z	-
dc.date.issued	2024-01-30	-
dc.identifier.citation	Journal of Medical Internet Research, 2024, v. 26, n. 1	-
dc.identifier.issn	1439-4456	-
dc.identifier.uri	http://hdl.handle.net/10722/348406	-
dc.description.abstract	<p>Background: Sentiment analysis is a significant yet difficult task in natural language processing. The linguistic peculiarities of Cantonese, including its high similarity with Standard Chinese, its grammatical and lexical uniqueness, and its colloquialism and multilingualism, make it different from other languages and pose additional challenges to sentiment analysis. Recent advances in models such as ChatGPT offer potential viable solutions. Objective: This study investigated the efficacy of GPT-3.5 and GPT-4 in Cantonese sentiment analysis in the context of web-based counseling and compared their performance with other mainstream methods, including lexicon-based methods and machine learning approaches. Methods: We analyzed transcripts from a web-based, text-based counseling service in Hong Kong, including a total of 131 individual counseling sessions and 6169 messages between counselors and help-seekers. First, a codebook was developed for human annotation. A simple prompt (“Is the sentiment of this Cantonese text positive, neutral, or negative? Respond with the sentiment label only.”) was then given to GPT-3.5 and GPT-4 to label each message’s sentiment. GPT-3.5 and GPT-4’s performance was compared with a lexicon-based method and 3 state-of-the-art models, including linear regression, support vector machines, and long short-term memory neural networks. Results: Our findings revealed ChatGPT’s remarkable accuracy in sentiment classification, with GPT-3.5 and GPT-4, respectively, achieving 92.1% (5682/6169) and 95.3% (5880/6169) accuracy in identifying positive, neutral, and negative sentiment, thereby outperforming the traditional lexicon-based method, which had an accuracy of 37.2% (2295/6169), and the 3 machine learning models, which had accuracies ranging from 66% (4072/6169) to 70.9% (4374/6169). Conclusions: Among many text analysis techniques, ChatGPT demonstrates superior accuracy and emerges as a promising tool for Cantonese sentiment analysis. This study also highlights ChatGPT’s applicability in real-world scenarios, such as monitoring the quality of text-based counseling services and detecting message-level sentiments in vivo. The insights derived from this study pave the way for further exploration into the capabilities of ChatGPT in the context of underresourced languages and specialized domains like psychotherapy and natural language processing.</p>	-
dc.language	eng	-
dc.publisher	JMIR Publications Inc.	-
dc.relation.ispartof	Journal of Medical Internet Research	-
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.subject	Cantonese	-
dc.subject	ChatGPT	-
dc.subject	counseling	-
dc.subject	natural language processing	-
dc.subject	NLP	-
dc.subject	sentiment analysis	-
dc.title	Efficacy of ChatGPT in Cantonese Sentiment Analysis: Comparative Study	-
dc.type	Article	-
dc.identifier.doi	10.2196/51069	-
dc.identifier.pmid	38289662	-
dc.identifier.scopus	eid_2-s2.0-85184344413	-
dc.identifier.volume	26	-
dc.identifier.issue	1	-
dc.identifier.eissn	1438-8871	-
dc.identifier.isi	WOS:001171070000003	-
dc.identifier.issnl	1438-8871	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: Efficacy of ChatGPT in Cantonese Sentiment Analysis: Comparative Study

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats