File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: Combining Phone Posteriorgrams from Strong and Weak Recognizers for Automatic Speech Assessment of People with Aphasia

TitleCombining Phone Posteriorgrams from Strong and Weak Recognizers for Automatic Speech Assessment of People with Aphasia
Authors
KeywordsCNN
ASR
phone posteriorgrams
speech assessment
Aphasia
Issue Date2019
Citation
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2019, v. 2019-May, p. 6420-6424 How to Cite?
AbstractThis paper presents an investigation on applying automatic speech recognition (ASR) to speech assessment of people with aphasia (PWA). A distinctive characteristic of PWA speech is paraphasia, which refers to frequent occurrence of phonemic errors, unintended words and non-verbal sounds. In view of the wide variety of paraphasias, we propose to view the ASR errors so caused as out-of-vocabulary (OOV) words. Inspired by previous research on OOV detection, paraphasias in PWA speech are captured by comparing the phone posteriorgrams of a strongly constrained speech recognizer and a weakly constrained one. The posteriorgrams also reveal other characteristics of impaired speech, e.g., change of speaking rate, voice abnormality. Siamese and 2-channel convolutional neural network (CNN) models are used for classifying the posteriorgram pairs and predicting the severity of aphasia. Experimental results on a Cantonese database of PWA speech confirm the effectiveness of the proposed methods. The best F1 score attained on binary classification (severe versus mild aphasia) is 0.891.
Persistent Identifierhttp://hdl.handle.net/10722/307270
ISSN

 

DC FieldValueLanguage
dc.contributor.authorQin, Ying-
dc.contributor.authorLee, Tan-
dc.contributor.authorHin Kong, Anthony Pak-
dc.date.accessioned2021-11-03T06:22:16Z-
dc.date.available2021-11-03T06:22:16Z-
dc.date.issued2019-
dc.identifier.citationICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2019, v. 2019-May, p. 6420-6424-
dc.identifier.issn1520-6149-
dc.identifier.urihttp://hdl.handle.net/10722/307270-
dc.description.abstractThis paper presents an investigation on applying automatic speech recognition (ASR) to speech assessment of people with aphasia (PWA). A distinctive characteristic of PWA speech is paraphasia, which refers to frequent occurrence of phonemic errors, unintended words and non-verbal sounds. In view of the wide variety of paraphasias, we propose to view the ASR errors so caused as out-of-vocabulary (OOV) words. Inspired by previous research on OOV detection, paraphasias in PWA speech are captured by comparing the phone posteriorgrams of a strongly constrained speech recognizer and a weakly constrained one. The posteriorgrams also reveal other characteristics of impaired speech, e.g., change of speaking rate, voice abnormality. Siamese and 2-channel convolutional neural network (CNN) models are used for classifying the posteriorgram pairs and predicting the severity of aphasia. Experimental results on a Cantonese database of PWA speech confirm the effectiveness of the proposed methods. The best F1 score attained on binary classification (severe versus mild aphasia) is 0.891.-
dc.languageeng-
dc.relation.ispartofICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings-
dc.subjectCNN-
dc.subjectASR-
dc.subjectphone posteriorgrams-
dc.subjectspeech assessment-
dc.subjectAphasia-
dc.titleCombining Phone Posteriorgrams from Strong and Weak Recognizers for Automatic Speech Assessment of People with Aphasia-
dc.typeConference_Paper-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1109/ICASSP.2019.8683835-
dc.identifier.scopuseid_2-s2.0-85069439806-
dc.identifier.volume2019-May-
dc.identifier.spage6420-
dc.identifier.epage6424-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats