Modeling Lexical Tones for Speaker Discrimination

Chan, Ricky K.W.; Wang, Bruce Xiao

File Download

content.pdf

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1177/00238309241261702
Scopus: eid_2-s2.0-85199986235
WOS: WOS:001278062100001
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
Appears in Collections:
- English: Journal/Magazine Articles

Article: Modeling Lexical Tones for Speaker Discrimination

Title	Modeling Lexical Tones for Speaker Discrimination
Authors	Chan, Ricky K.W.Wang, Bruce Xiao
Keywords	Cantonese fundamental frequency lexical tone Mandarin Speaker discrimination
Issue Date	27-Jul-2024
Publisher	SAGE Publications
Citation	Language and Speech, 2024 How to Cite? DOI: http://dx.doi.org/10.1177/00238309241261702
Abstract	Fundamental frequency (F0) has been widely studied and used in the context of speaker discrimination and forensic voice comparison casework, but most previous studies focused on long-term F0 statistics. Lexical tone, the linguistically structured and dynamic aspects of F0, has received much less research attention. A main methodological issue lies on how tonal F0 should be parameterized for the best speaker discrimination performance. This paper compares the speaker discriminatory performance of three approaches with lexical tone modeling: discrete cosine transform (DCT), polynomial curve fitting, and quantitative target approximation (qTA). Results show that using parameters based on DCT and polynomials led to similarly promising performance, whereas those based on qTA generally yielded relatively poor performance. Implications modeling surface tonal F0 and the underlying articulatory processes for speaker discrimination are discussed.
Persistent Identifier	http://hdl.handle.net/10722/345722
ISSN	0023-8309 2023 Impact Factor: 1.1 2023 SCImago Journal Rankings: 0.625
ISI Accession Number ID	WOS:001278062100001

DC Field	Value	Language
dc.contributor.author	Chan, Ricky K.W.	-
dc.contributor.author	Wang, Bruce Xiao	-
dc.date.accessioned	2024-08-27T09:10:44Z	-
dc.date.available	2024-08-27T09:10:44Z	-
dc.date.issued	2024-07-27	-
dc.identifier.citation	Language and Speech, 2024	-
dc.identifier.issn	0023-8309	-
dc.identifier.uri	http://hdl.handle.net/10722/345722	-
dc.description.abstract	<p>Fundamental frequency (F0) has been widely studied and used in the context of speaker discrimination and forensic voice comparison casework, but most previous studies focused on long-term F0 statistics. Lexical tone, the linguistically structured and dynamic aspects of F0, has received much less research attention. A main methodological issue lies on how tonal F0 should be parameterized for the best speaker discrimination performance. This paper compares the speaker discriminatory performance of three approaches with lexical tone modeling: discrete cosine transform (DCT), polynomial curve fitting, and quantitative target approximation (qTA). Results show that using parameters based on DCT and polynomials led to similarly promising performance, whereas those based on qTA generally yielded relatively poor performance. Implications modeling surface tonal F0 and the underlying articulatory processes for speaker discrimination are discussed.</p>	-
dc.language	eng	-
dc.publisher	SAGE Publications	-
dc.relation.ispartof	Language and Speech	-
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.subject	Cantonese	-
dc.subject	fundamental frequency	-
dc.subject	lexical tone	-
dc.subject	Mandarin	-
dc.subject	Speaker discrimination	-
dc.title	Modeling Lexical Tones for Speaker Discrimination	-
dc.type	Article	-
dc.identifier.doi	10.1177/00238309241261702	-
dc.identifier.scopus	eid_2-s2.0-85199986235	-
dc.identifier.eissn	1756-6053	-
dc.identifier.isi	WOS:001278062100001	-
dc.identifier.issnl	0023-8309	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: Modeling Lexical Tones for Speaker Discrimination

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats