File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: An end-to-end approach to automatic speech assessment for people with aphasia

TitleAn end-to-end approach to automatic speech assessment for people with aphasia
Authors
KeywordsCantonese
Pathological speech assessment
End-to-end
Issue Date2018
Citation
2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings, 2018, p. 66-70 How to Cite?
AbstractConventionally, automatic assessment of pathological speech involves two main steps: (1) extraction of pathology-specific features; (2) classification or regression of extracted features. Given the great variety of speech and language disorders, feature design is never a straightforward task, and yet it is most critical to the performance of assessment. This paper presents an end-to-end approach to automatic speech assessment for Cantonese-speaking people with aphasia (PWA). The assessment is formulated as a binary classification problem to differentiate PWA with high scores of subjective assessment from those with low scores. The sequence-to-one GRU-RNN and CNN models are applied to realize the end-to-end mapping from speech signals to the classification result. The speech features used for assessment are learned implicitly by the neural network model. Preliminary experimental results show that the end-to-end approach could reach a performance level comparable to conventional two-step approach. The experimental results also suggest that CNN performs better than sequence-to-one GRU-RNN in this specific task.
Persistent Identifierhttp://hdl.handle.net/10722/307430
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorQin, Ying-
dc.contributor.authorLee, Tan-
dc.contributor.authorWu, Yuzhong-
dc.contributor.authorKong, Anthony Pak Hin-
dc.date.accessioned2021-11-03T06:22:35Z-
dc.date.available2021-11-03T06:22:35Z-
dc.date.issued2018-
dc.identifier.citation2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings, 2018, p. 66-70-
dc.identifier.urihttp://hdl.handle.net/10722/307430-
dc.description.abstractConventionally, automatic assessment of pathological speech involves two main steps: (1) extraction of pathology-specific features; (2) classification or regression of extracted features. Given the great variety of speech and language disorders, feature design is never a straightforward task, and yet it is most critical to the performance of assessment. This paper presents an end-to-end approach to automatic speech assessment for Cantonese-speaking people with aphasia (PWA). The assessment is formulated as a binary classification problem to differentiate PWA with high scores of subjective assessment from those with low scores. The sequence-to-one GRU-RNN and CNN models are applied to realize the end-to-end mapping from speech signals to the classification result. The speech features used for assessment are learned implicitly by the neural network model. Preliminary experimental results show that the end-to-end approach could reach a performance level comparable to conventional two-step approach. The experimental results also suggest that CNN performs better than sequence-to-one GRU-RNN in this specific task.-
dc.languageeng-
dc.relation.ispartof2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings-
dc.subjectCantonese-
dc.subjectPathological speech assessment-
dc.subjectEnd-to-end-
dc.titleAn end-to-end approach to automatic speech assessment for people with aphasia-
dc.typeConference_Paper-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1109/ISCSLP.2018.8706690-
dc.identifier.scopuseid_2-s2.0-85065880011-
dc.identifier.spage66-
dc.identifier.epage70-
dc.identifier.isiWOS:000469313700014-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats