ShefCE: A Cantonese-English bilingual speech corpus for pronunciation assessment

Ng, RWM; Kwan, ACM; Lee, T; Hain, T

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1109/ICASSP.2017.7953273
Scopus: eid_2-s2.0-85023738602
Find via

Supplementary

Citations:
- Scopus: 0
Appears in Collections:
- Faculty of Education: Conference papers

Conference Paper: ShefCE: A Cantonese-English bilingual speech corpus for pronunciation assessment

Title	ShefCE: A Cantonese-English bilingual speech corpus for pronunciation assessment
Authors	Ng, RWM Kwan, ACM Lee, T Hain, T
Keywords	Bilingual parallel speech corpus Cantonese English pronunciation assessment
Issue Date	2017
Publisher	IEEE. The Proceedings' web site is located at http://ieeexplore.ieee.org/xpl/conhome.jsp?punumber=1000002
Citation	2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, 5-9 March 2017 How to Cite? DOI: http://dx.doi.org/10.1109/ICASSP.2017.7953273
Abstract	This paper introduces the development of ShefCE: a Cantonese-English bilingual speech corpus from L2 English speakers in Hong Kong. Bilingual parallel recording materials were chosen from TED online lectures. Script selection were carried out according to bilingual consistency (evaluated using a machine translation system) and the distribution balance of phonemes. 31 undergraduate to postgraduate students in Hong Kong aged 20-30 were recruited and recorded a 25-hour speech corpus (12 hours in Cantonese and 13 hours in English). Baseline phoneme/syllable recognition systems were trained on background data with and without the ShefCE training data. The final syllable error rate (SER) for Cantonese is 17.3% and final phoneme error rate (PER) for English is 34.5%. The automatic speech recognition performance on English showed a significant mismatch when applying L1 models on L2 data, suggesting the need for explicit accent adaptation. ShefCE and the corresponding baseline models will be made openly available for academic research.
Persistent Identifier	http://hdl.handle.net/10722/248686
ISSN	2379-190X

DC Field	Value	Language
dc.contributor.author	Ng, RWM	-
dc.contributor.author	Kwan, ACM	-
dc.contributor.author	Lee, T	-
dc.contributor.author	Hain, T	-
dc.date.accessioned	2017-10-18T08:47:01Z	-
dc.date.available	2017-10-18T08:47:01Z	-
dc.date.issued	2017	-
dc.identifier.citation	2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, 5-9 March 2017	-
dc.identifier.issn	2379-190X	-
dc.identifier.uri	http://hdl.handle.net/10722/248686	-
dc.description.abstract	This paper introduces the development of ShefCE: a Cantonese-English bilingual speech corpus from L2 English speakers in Hong Kong. Bilingual parallel recording materials were chosen from TED online lectures. Script selection were carried out according to bilingual consistency (evaluated using a machine translation system) and the distribution balance of phonemes. 31 undergraduate to postgraduate students in Hong Kong aged 20-30 were recruited and recorded a 25-hour speech corpus (12 hours in Cantonese and 13 hours in English). Baseline phoneme/syllable recognition systems were trained on background data with and without the ShefCE training data. The final syllable error rate (SER) for Cantonese is 17.3% and final phoneme error rate (PER) for English is 34.5%. The automatic speech recognition performance on English showed a significant mismatch when applying L1 models on L2 data, suggesting the need for explicit accent adaptation. ShefCE and the corresponding baseline models will be made openly available for academic research.	-
dc.language	eng	-
dc.publisher	IEEE. The Proceedings' web site is located at http://ieeexplore.ieee.org/xpl/conhome.jsp?punumber=1000002	-
dc.relation.ispartof	IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)	-
dc.rights	IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Copyright © IEEE.	-
dc.rights	©2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.	-
dc.subject	Bilingual parallel speech corpus	-
dc.subject	Cantonese	-
dc.subject	English pronunciation assessment	-
dc.title	ShefCE: A Cantonese-English bilingual speech corpus for pronunciation assessment	-
dc.type	Conference_Paper	-
dc.identifier.email	Kwan, ACM: cmkwan@hku.hk	-
dc.identifier.doi	10.1109/ICASSP.2017.7953273	-
dc.identifier.scopus	eid_2-s2.0-85023738602	-
dc.identifier.hkuros	280108	-
dc.publisher.place	New Orleans, LA	-
dc.identifier.issnl	1520-6149	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Conference Paper: ShefCE: A Cantonese-English bilingual speech corpus for pronunciation assessment

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats