File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/JSTSP.2019.2956371
- Scopus: eid_2-s2.0-85076135679
- WOS: WOS:000528666500010
- Find via
Supplementary
- Citations:
- Appears in Collections:
Article: Automatic Assessment of Speech Impairment in Cantonese-Speaking People with Aphasia
Title | Automatic Assessment of Speech Impairment in Cantonese-Speaking People with Aphasia |
---|---|
Authors | |
Keywords | Pathological speech assessment aphasia deep neural network (DNN) automatic speech recognition Cantonese |
Issue Date | 2020 |
Citation | IEEE Journal on Selected Topics in Signal Processing, 2020, v. 14, n. 2, p. 331-345 How to Cite? |
Abstract | Aphasia is a common type of acquired language impairment resulting from dysfunction in specific brain regions. Analysis of narrative spontaneous speech, e.g., story-telling, is an essential component of standardized clinical assessment on people with aphasia (PWA). Subjective assessment by trained speech-language pathologists (SLP) have many limitations in efficiency, effectiveness and practicality. This article describes a fully automated system for speech assessment of Cantonese-speaking PWA. A deep neural network (DNN) based automatic speech recognition (ASR) system is developed for aphasic speech by multi-task training with both in-domain and out-of-domain speech data. Story-level embedding and siamese network are applied to derive robust text features, which can be used to quantify the difference between aphasic speech and unimpaired one. The proposed text features are combined with conventional acoustic features to cover different aspects of speech and language impairment in PWA. Experimental results show a high correlation between predicted scores and subject assessment scores. The best correlation value achieved with ASR-generated transcription is.827, as compared with.844 achieved with manual transcription. The siamese network significantly outperforms story-level embedding in generating text features for automatic assessment. |
Persistent Identifier | http://hdl.handle.net/10722/307281 |
ISSN | 2023 Impact Factor: 8.7 2023 SCImago Journal Rankings: 3.818 |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Qin, Ying | - |
dc.contributor.author | Lee, Tan | - |
dc.contributor.author | Kong, Anthony Pak Hin | - |
dc.date.accessioned | 2021-11-03T06:22:17Z | - |
dc.date.available | 2021-11-03T06:22:17Z | - |
dc.date.issued | 2020 | - |
dc.identifier.citation | IEEE Journal on Selected Topics in Signal Processing, 2020, v. 14, n. 2, p. 331-345 | - |
dc.identifier.issn | 1932-4553 | - |
dc.identifier.uri | http://hdl.handle.net/10722/307281 | - |
dc.description.abstract | Aphasia is a common type of acquired language impairment resulting from dysfunction in specific brain regions. Analysis of narrative spontaneous speech, e.g., story-telling, is an essential component of standardized clinical assessment on people with aphasia (PWA). Subjective assessment by trained speech-language pathologists (SLP) have many limitations in efficiency, effectiveness and practicality. This article describes a fully automated system for speech assessment of Cantonese-speaking PWA. A deep neural network (DNN) based automatic speech recognition (ASR) system is developed for aphasic speech by multi-task training with both in-domain and out-of-domain speech data. Story-level embedding and siamese network are applied to derive robust text features, which can be used to quantify the difference between aphasic speech and unimpaired one. The proposed text features are combined with conventional acoustic features to cover different aspects of speech and language impairment in PWA. Experimental results show a high correlation between predicted scores and subject assessment scores. The best correlation value achieved with ASR-generated transcription is.827, as compared with.844 achieved with manual transcription. The siamese network significantly outperforms story-level embedding in generating text features for automatic assessment. | - |
dc.language | eng | - |
dc.relation.ispartof | IEEE Journal on Selected Topics in Signal Processing | - |
dc.subject | Pathological speech assessment | - |
dc.subject | aphasia | - |
dc.subject | deep neural network (DNN) | - |
dc.subject | automatic speech recognition | - |
dc.subject | Cantonese | - |
dc.title | Automatic Assessment of Speech Impairment in Cantonese-Speaking People with Aphasia | - |
dc.type | Article | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1109/JSTSP.2019.2956371 | - |
dc.identifier.scopus | eid_2-s2.0-85076135679 | - |
dc.identifier.volume | 14 | - |
dc.identifier.issue | 2 | - |
dc.identifier.spage | 331 | - |
dc.identifier.epage | 345 | - |
dc.identifier.eissn | 1941-0484 | - |
dc.identifier.isi | WOS:000528666500010 | - |