File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.21437/Interspeech.2019-1688
- Scopus: eid_2-s2.0-85074683688
- WOS: WOS:000831796403044
- Find via
Supplementary
- Citations:
- Appears in Collections:
Conference Paper: Automatic assessment of language impairment based on raw ASR output
Title | Automatic assessment of language impairment based on raw ASR output |
---|---|
Authors | |
Keywords | Speech assessment Language impairment CNN Aphasia |
Issue Date | 2019 |
Citation | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2019, v. 2019-September, p. 3078-3082 How to Cite? |
Abstract | For automatic assessment of language impairment in natural speech, properly designed text-based features are needed. The feature design relies on experts' domain knowledge and the feature extraction process may undesirably involve manual effort on transcribing. This paper describes a novel approach to automatic assessment of language impairment in narrative speech of people with aphasia (PWA), without explicit knowledge-driven feature design. A convolutional neural network (CNN) is used to extract language impairment related text features from the output of an automatic speech recognition (ASR) system or, if available, the manual transcription of input speech. To mitigate the adverse effect of ASR errors, confusion network is adopted to improve the robustness of embedding representation of ASR output. The proposed approach is evaluated on the task of discriminating severe PWA from mild PWA based on Cantonese narrative speech. Experimental results confirm the effectiveness of automatically learned text features. It is also shown that CNN models trained with text input and acoustic features are complementary to each other. |
Persistent Identifier | http://hdl.handle.net/10722/307058 |
ISSN | 2020 SCImago Journal Rankings: 0.689 |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Qin, Ying | - |
dc.contributor.author | Lee, Tan | - |
dc.contributor.author | Kong, Anthony Pak Hin | - |
dc.date.accessioned | 2021-11-03T06:21:51Z | - |
dc.date.available | 2021-11-03T06:21:51Z | - |
dc.date.issued | 2019 | - |
dc.identifier.citation | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2019, v. 2019-September, p. 3078-3082 | - |
dc.identifier.issn | 2308-457X | - |
dc.identifier.uri | http://hdl.handle.net/10722/307058 | - |
dc.description.abstract | For automatic assessment of language impairment in natural speech, properly designed text-based features are needed. The feature design relies on experts' domain knowledge and the feature extraction process may undesirably involve manual effort on transcribing. This paper describes a novel approach to automatic assessment of language impairment in narrative speech of people with aphasia (PWA), without explicit knowledge-driven feature design. A convolutional neural network (CNN) is used to extract language impairment related text features from the output of an automatic speech recognition (ASR) system or, if available, the manual transcription of input speech. To mitigate the adverse effect of ASR errors, confusion network is adopted to improve the robustness of embedding representation of ASR output. The proposed approach is evaluated on the task of discriminating severe PWA from mild PWA based on Cantonese narrative speech. Experimental results confirm the effectiveness of automatically learned text features. It is also shown that CNN models trained with text input and acoustic features are complementary to each other. | - |
dc.language | eng | - |
dc.relation.ispartof | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH | - |
dc.subject | Speech assessment | - |
dc.subject | Language impairment | - |
dc.subject | CNN | - |
dc.subject | Aphasia | - |
dc.title | Automatic assessment of language impairment based on raw ASR output | - |
dc.type | Conference_Paper | - |
dc.description.nature | link_to_OA_fulltext | - |
dc.identifier.doi | 10.21437/Interspeech.2019-1688 | - |
dc.identifier.scopus | eid_2-s2.0-85074683688 | - |
dc.identifier.volume | 2019-September | - |
dc.identifier.spage | 3078 | - |
dc.identifier.epage | 3082 | - |
dc.identifier.eissn | 1990-9772 | - |
dc.identifier.isi | WOS:000831796403044 | - |