File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: Automatic assessment of language impairment based on raw ASR output

TitleAutomatic assessment of language impairment based on raw ASR output
Authors
KeywordsSpeech assessment
Language impairment
CNN
Aphasia
Issue Date2019
Citation
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2019, v. 2019-September, p. 3078-3082 How to Cite?
AbstractFor automatic assessment of language impairment in natural speech, properly designed text-based features are needed. The feature design relies on experts' domain knowledge and the feature extraction process may undesirably involve manual effort on transcribing. This paper describes a novel approach to automatic assessment of language impairment in narrative speech of people with aphasia (PWA), without explicit knowledge-driven feature design. A convolutional neural network (CNN) is used to extract language impairment related text features from the output of an automatic speech recognition (ASR) system or, if available, the manual transcription of input speech. To mitigate the adverse effect of ASR errors, confusion network is adopted to improve the robustness of embedding representation of ASR output. The proposed approach is evaluated on the task of discriminating severe PWA from mild PWA based on Cantonese narrative speech. Experimental results confirm the effectiveness of automatically learned text features. It is also shown that CNN models trained with text input and acoustic features are complementary to each other.
Persistent Identifierhttp://hdl.handle.net/10722/307058
ISSN
2020 SCImago Journal Rankings: 0.689
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorQin, Ying-
dc.contributor.authorLee, Tan-
dc.contributor.authorKong, Anthony Pak Hin-
dc.date.accessioned2021-11-03T06:21:51Z-
dc.date.available2021-11-03T06:21:51Z-
dc.date.issued2019-
dc.identifier.citationProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2019, v. 2019-September, p. 3078-3082-
dc.identifier.issn2308-457X-
dc.identifier.urihttp://hdl.handle.net/10722/307058-
dc.description.abstractFor automatic assessment of language impairment in natural speech, properly designed text-based features are needed. The feature design relies on experts' domain knowledge and the feature extraction process may undesirably involve manual effort on transcribing. This paper describes a novel approach to automatic assessment of language impairment in narrative speech of people with aphasia (PWA), without explicit knowledge-driven feature design. A convolutional neural network (CNN) is used to extract language impairment related text features from the output of an automatic speech recognition (ASR) system or, if available, the manual transcription of input speech. To mitigate the adverse effect of ASR errors, confusion network is adopted to improve the robustness of embedding representation of ASR output. The proposed approach is evaluated on the task of discriminating severe PWA from mild PWA based on Cantonese narrative speech. Experimental results confirm the effectiveness of automatically learned text features. It is also shown that CNN models trained with text input and acoustic features are complementary to each other.-
dc.languageeng-
dc.relation.ispartofProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH-
dc.subjectSpeech assessment-
dc.subjectLanguage impairment-
dc.subjectCNN-
dc.subjectAphasia-
dc.titleAutomatic assessment of language impairment based on raw ASR output-
dc.typeConference_Paper-
dc.description.naturelink_to_OA_fulltext-
dc.identifier.doi10.21437/Interspeech.2019-1688-
dc.identifier.scopuseid_2-s2.0-85074683688-
dc.identifier.volume2019-September-
dc.identifier.spage3078-
dc.identifier.epage3082-
dc.identifier.eissn1990-9772-
dc.identifier.isiWOS:000831796403044-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats