File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.21437/Interspeech.2018-1630
- Scopus: eid_2-s2.0-85054969032
- WOS: WOS:000465363900713
- Find via
Supplementary
- Citations:
- Appears in Collections:
Conference Paper: Automatic speech assessment for people with aphasia using TDNN-BLSTM with multi-task learning
Title | Automatic speech assessment for people with aphasia using TDNN-BLSTM with multi-task learning |
---|---|
Authors | |
Keywords | Speech assessment Aphasia Multi-task learning TDNN-BLSTM |
Issue Date | 2018 |
Citation | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2018, v. 2018-September, p. 3418-3422 How to Cite? |
Abstract | This paper describes an investigation on automatic speech assessment for people with aphasia (PWA) using a DNN based automatic speech recognition (ASR) system. The main problems being addressed are the lack of training speech in the intended application domain and the relevant degradation of ASR performance for impaired speech of PWA. We adopt the TDNN-BLSTM structure for acoustic modeling and apply the technique of multi-task learning with large amount of domain-mismatched data. This leads to a significant improvement on the recognition accuracy, as compared with a conventional single-task learning DNN system. To facilitate the extraction of robust text features for quantifying language impairment in PWA speech, we propose to incorporate N-best hypotheses and confusion network representation of the ASR output. The severity of impairment is predicted from text features and supra-segmental duration features using different regression models. Experimental results show a high correlation of 0.842 between the predicted severity level and the subjective Aphasia Quotient score. |
Persistent Identifier | http://hdl.handle.net/10722/307250 |
ISSN | 2020 SCImago Journal Rankings: 0.689 |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Qin, Ying | - |
dc.contributor.author | Lee, Tan | - |
dc.contributor.author | Feng, Siyuan | - |
dc.contributor.author | Kong, Anthony Pak Hin | - |
dc.date.accessioned | 2021-11-03T06:22:14Z | - |
dc.date.available | 2021-11-03T06:22:14Z | - |
dc.date.issued | 2018 | - |
dc.identifier.citation | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2018, v. 2018-September, p. 3418-3422 | - |
dc.identifier.issn | 2308-457X | - |
dc.identifier.uri | http://hdl.handle.net/10722/307250 | - |
dc.description.abstract | This paper describes an investigation on automatic speech assessment for people with aphasia (PWA) using a DNN based automatic speech recognition (ASR) system. The main problems being addressed are the lack of training speech in the intended application domain and the relevant degradation of ASR performance for impaired speech of PWA. We adopt the TDNN-BLSTM structure for acoustic modeling and apply the technique of multi-task learning with large amount of domain-mismatched data. This leads to a significant improvement on the recognition accuracy, as compared with a conventional single-task learning DNN system. To facilitate the extraction of robust text features for quantifying language impairment in PWA speech, we propose to incorporate N-best hypotheses and confusion network representation of the ASR output. The severity of impairment is predicted from text features and supra-segmental duration features using different regression models. Experimental results show a high correlation of 0.842 between the predicted severity level and the subjective Aphasia Quotient score. | - |
dc.language | eng | - |
dc.relation.ispartof | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH | - |
dc.subject | Speech assessment | - |
dc.subject | Aphasia | - |
dc.subject | Multi-task learning | - |
dc.subject | TDNN-BLSTM | - |
dc.title | Automatic speech assessment for people with aphasia using TDNN-BLSTM with multi-task learning | - |
dc.type | Conference_Paper | - |
dc.description.nature | link_to_OA_fulltext | - |
dc.identifier.doi | 10.21437/Interspeech.2018-1630 | - |
dc.identifier.scopus | eid_2-s2.0-85054969032 | - |
dc.identifier.volume | 2018-September | - |
dc.identifier.spage | 3418 | - |
dc.identifier.epage | 3422 | - |
dc.identifier.eissn | 1990-9772 | - |
dc.identifier.isi | WOS:000465363900713 | - |