Speaker discrimination: citation tones vs. coarticulated tones

Chan, RKW

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1016/j.specom.2019.06.006
Scopus: eid_2-s2.0-85079862158
WOS: WOS:000525305800005
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
Appears in Collections:
- English: Journal/Magazine Articles

Article: Speaker discrimination: citation tones vs. coarticulated tones

Title	Speaker discrimination: citation tones vs. coarticulated tones
Authors	Chan, RKW
Keywords	Speaker Discrimination Coarticulation Tone Cantonese Mandarin
Issue Date	2020
Publisher	Elsevier BV. The Journal's web site is located at http://www.elsevier.com/locate/specom
Citation	Speech Communication, 2020, v. 117, p. 38-50 How to Cite? DOI: http://dx.doi.org/10.1016/j.specom.2019.06.006
Abstract	The task of forensic voice comparison (FVC) often involves the comparison of a voice in an offender recording with that in a suspect recording, with the aim to assist the investigating authority or the court in determining the identity of the speaker. One of the main goals in FVC research is to identify speech variables that are useful for differentiating speakers. While French and Stevens (2013) stated that connected speech processes (CSPs) vary across speakers and thus CSPs may be included in the 'toolbox' for forensic voice comparison casework, little empirical research has been done to test how effective various CSPs are in speaker discrimination. This paper reports an exploratory study comparing the speaker-discriminatory power of lexical tones in their citation forms and coarticulated tones. 20 Cantonese and 20 Mandarin speakers were instructed to produce tones under different speech rates and tonal contexts. Results based on discriminant analysis show that the combination of normal speech rate and compatible tonal context appears to have yielded the best speaker discrimination. On the other hand, the combination of fast speech and a conflicting tonal context, which in principle led to the greatest tonal coarticulatory effects, yielded the worst speaker discrimination. The addition of duration on top of tonal f0 significantly improved the classification rates in both languages. Furthermore, for the same tone categories, the Mandarin ones generally discriminate speakers better than the Cantonese counterparts, suggesting that tone inventory density affects the speaker-discriminatory power of tones. Implications of the findings for forensic speaker comparison are discussed.
Description	Link to Free access
Persistent Identifier	http://hdl.handle.net/10722/272670
ISSN	0167-6393 2021 Impact Factor: 2.723 2020 SCImago Journal Rankings: 0.459
ISI Accession Number ID	WOS:000525305800005

DC Field	Value	Language
dc.contributor.author	Chan, RKW	-
dc.date.accessioned	2019-08-06T09:14:19Z	-
dc.date.available	2019-08-06T09:14:19Z	-
dc.date.issued	2020	-
dc.identifier.citation	Speech Communication, 2020, v. 117, p. 38-50	-
dc.identifier.issn	0167-6393	-
dc.identifier.uri	http://hdl.handle.net/10722/272670	-
dc.description	Link to Free access	-
dc.description.abstract	The task of forensic voice comparison (FVC) often involves the comparison of a voice in an offender recording with that in a suspect recording, with the aim to assist the investigating authority or the court in determining the identity of the speaker. One of the main goals in FVC research is to identify speech variables that are useful for differentiating speakers. While French and Stevens (2013) stated that connected speech processes (CSPs) vary across speakers and thus CSPs may be included in the 'toolbox' for forensic voice comparison casework, little empirical research has been done to test how effective various CSPs are in speaker discrimination. This paper reports an exploratory study comparing the speaker-discriminatory power of lexical tones in their citation forms and coarticulated tones. 20 Cantonese and 20 Mandarin speakers were instructed to produce tones under different speech rates and tonal contexts. Results based on discriminant analysis show that the combination of normal speech rate and compatible tonal context appears to have yielded the best speaker discrimination. On the other hand, the combination of fast speech and a conflicting tonal context, which in principle led to the greatest tonal coarticulatory effects, yielded the worst speaker discrimination. The addition of duration on top of tonal f0 significantly improved the classification rates in both languages. Furthermore, for the same tone categories, the Mandarin ones generally discriminate speakers better than the Cantonese counterparts, suggesting that tone inventory density affects the speaker-discriminatory power of tones. Implications of the findings for forensic speaker comparison are discussed.	-
dc.language	eng	-
dc.publisher	Elsevier BV. The Journal's web site is located at http://www.elsevier.com/locate/specom	-
dc.relation.ispartof	Speech Communication	-
dc.subject	Speaker Discrimination	-
dc.subject	Coarticulation	-
dc.subject	Tone	-
dc.subject	Cantonese	-
dc.subject	Mandarin	-
dc.title	Speaker discrimination: citation tones vs. coarticulated tones	-
dc.type	Article	-
dc.identifier.email	Chan, RKW: rickykwc@hku.hk	-
dc.identifier.authority	Chan, RKW=rp02417	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1016/j.specom.2019.06.006	-
dc.identifier.scopus	eid_2-s2.0-85079862158	-
dc.identifier.hkuros	299892	-
dc.identifier.volume	117	-
dc.identifier.spage	38	-
dc.identifier.epage	50	-
dc.identifier.isi	WOS:000525305800005	-
dc.publisher.place	Netherlands	-
dc.identifier.issnl	0167-6393	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: Speaker discrimination: citation tones vs. coarticulated tones

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats