File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: Automatic Speech Assessment for Aphasic Patients Based on Syllable-Level Embedding and Supra-Segmental Duration Features

TitleAutomatic Speech Assessment for Aphasic Patients Based on Syllable-Level Embedding and Supra-Segmental Duration Features
Authors
KeywordsASR
Word embedding
Cantonese
Speech assessment
Aphasia
Issue Date2018
Citation
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2018, v. 2018-April, p. 5994-5998 How to Cite?
AbstractAphasia is a type of acquired language impairment resulting from brain injury. Speech assessment is an important part of the comprehensive assessment process for aphasic patients. It is based on the acoustical and linguistic analysis of patients' speech elicited through pre-defined story-telling tasks. This type of narrative spontaneous speech embodies multi-fold atypical characteristics related to the underlying language impairment. This paper presents an investigation on automatic speech assessment for Cantonese-speaking aphasic patients using an automatic speech recognition (ASR) system. A novel approach to extracting robust text features from erroneous ASR output is developed based on word embedding methods. The text features can effectively distinguish the stories told by an impaired speaker from those by unimpaired ones. On the other hand, a set of supra-segmental duration features are derived from syllable-level time alignments produced by the ASR system, to characterize the atypical prosody of impaired speech. The proposed text features, duration features and their combination are evaluated in a binary classification experiment as well as in automatic prediction of subjective assessment score. The results clearly show that the text features are very effective in the intended task of aphasia assessment, while using duration features could provide additional benefit.
Persistent Identifierhttp://hdl.handle.net/10722/307247
ISSN
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorQin, Ying-
dc.contributor.authorLee, Tan-
dc.contributor.authorKong, Anthony Pak Hin-
dc.date.accessioned2021-11-03T06:22:13Z-
dc.date.available2021-11-03T06:22:13Z-
dc.date.issued2018-
dc.identifier.citationICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2018, v. 2018-April, p. 5994-5998-
dc.identifier.issn1520-6149-
dc.identifier.urihttp://hdl.handle.net/10722/307247-
dc.description.abstractAphasia is a type of acquired language impairment resulting from brain injury. Speech assessment is an important part of the comprehensive assessment process for aphasic patients. It is based on the acoustical and linguistic analysis of patients' speech elicited through pre-defined story-telling tasks. This type of narrative spontaneous speech embodies multi-fold atypical characteristics related to the underlying language impairment. This paper presents an investigation on automatic speech assessment for Cantonese-speaking aphasic patients using an automatic speech recognition (ASR) system. A novel approach to extracting robust text features from erroneous ASR output is developed based on word embedding methods. The text features can effectively distinguish the stories told by an impaired speaker from those by unimpaired ones. On the other hand, a set of supra-segmental duration features are derived from syllable-level time alignments produced by the ASR system, to characterize the atypical prosody of impaired speech. The proposed text features, duration features and their combination are evaluated in a binary classification experiment as well as in automatic prediction of subjective assessment score. The results clearly show that the text features are very effective in the intended task of aphasia assessment, while using duration features could provide additional benefit.-
dc.languageeng-
dc.relation.ispartofICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings-
dc.subjectASR-
dc.subjectWord embedding-
dc.subjectCantonese-
dc.subjectSpeech assessment-
dc.subjectAphasia-
dc.titleAutomatic Speech Assessment for Aphasic Patients Based on Syllable-Level Embedding and Supra-Segmental Duration Features-
dc.typeConference_Paper-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1109/ICASSP.2018.8461289-
dc.identifier.scopuseid_2-s2.0-85054218163-
dc.identifier.volume2018-April-
dc.identifier.spage5994-
dc.identifier.epage5998-
dc.identifier.isiWOS:000446384606031-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats