File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: Automatic speech assessment for people with aphasia using TDNN-BLSTM with multi-task learning

TitleAutomatic speech assessment for people with aphasia using TDNN-BLSTM with multi-task learning
Authors
KeywordsSpeech assessment
Aphasia
Multi-task learning
TDNN-BLSTM
Issue Date2018
Citation
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2018, v. 2018-September, p. 3418-3422 How to Cite?
AbstractThis paper describes an investigation on automatic speech assessment for people with aphasia (PWA) using a DNN based automatic speech recognition (ASR) system. The main problems being addressed are the lack of training speech in the intended application domain and the relevant degradation of ASR performance for impaired speech of PWA. We adopt the TDNN-BLSTM structure for acoustic modeling and apply the technique of multi-task learning with large amount of domain-mismatched data. This leads to a significant improvement on the recognition accuracy, as compared with a conventional single-task learning DNN system. To facilitate the extraction of robust text features for quantifying language impairment in PWA speech, we propose to incorporate N-best hypotheses and confusion network representation of the ASR output. The severity of impairment is predicted from text features and supra-segmental duration features using different regression models. Experimental results show a high correlation of 0.842 between the predicted severity level and the subjective Aphasia Quotient score.
Persistent Identifierhttp://hdl.handle.net/10722/307250
ISSN
2020 SCImago Journal Rankings: 0.689
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorQin, Ying-
dc.contributor.authorLee, Tan-
dc.contributor.authorFeng, Siyuan-
dc.contributor.authorKong, Anthony Pak Hin-
dc.date.accessioned2021-11-03T06:22:14Z-
dc.date.available2021-11-03T06:22:14Z-
dc.date.issued2018-
dc.identifier.citationProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2018, v. 2018-September, p. 3418-3422-
dc.identifier.issn2308-457X-
dc.identifier.urihttp://hdl.handle.net/10722/307250-
dc.description.abstractThis paper describes an investigation on automatic speech assessment for people with aphasia (PWA) using a DNN based automatic speech recognition (ASR) system. The main problems being addressed are the lack of training speech in the intended application domain and the relevant degradation of ASR performance for impaired speech of PWA. We adopt the TDNN-BLSTM structure for acoustic modeling and apply the technique of multi-task learning with large amount of domain-mismatched data. This leads to a significant improvement on the recognition accuracy, as compared with a conventional single-task learning DNN system. To facilitate the extraction of robust text features for quantifying language impairment in PWA speech, we propose to incorporate N-best hypotheses and confusion network representation of the ASR output. The severity of impairment is predicted from text features and supra-segmental duration features using different regression models. Experimental results show a high correlation of 0.842 between the predicted severity level and the subjective Aphasia Quotient score.-
dc.languageeng-
dc.relation.ispartofProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH-
dc.subjectSpeech assessment-
dc.subjectAphasia-
dc.subjectMulti-task learning-
dc.subjectTDNN-BLSTM-
dc.titleAutomatic speech assessment for people with aphasia using TDNN-BLSTM with multi-task learning-
dc.typeConference_Paper-
dc.description.naturelink_to_OA_fulltext-
dc.identifier.doi10.21437/Interspeech.2018-1630-
dc.identifier.scopuseid_2-s2.0-85054969032-
dc.identifier.volume2018-September-
dc.identifier.spage3418-
dc.identifier.epage3422-
dc.identifier.eissn1990-9772-
dc.identifier.isiWOS:000465363900713-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats