Entropy coding for training deep belief networks with imbalanced and unlabeled data

Berry, J; Fasel, I; Fadiga, L; Archangeli, D

File Download

Abstract.pdf

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1121/1.4708066
Find via

Supplementary

Citations:
Appears in Collections:
- Humanities: Conference papers
- Faculty of Arts: Conference papers

Conference Paper: Entropy coding for training deep belief networks with imbalanced and unlabeled data

Title	Entropy coding for training deep belief networks with imbalanced and unlabeled data
Authors	Berry, J Fasel, I Fadiga, L Archangeli, D
Keywords	Physics Sound
Issue Date	2012
Publisher	Acoustical Society of America. The Journal's web site is located at http://asa.aip.org/jasa.html
Citation	The ACOUSTICS 2012 Hong Kong Conference & Exihibition, Hong Kong, 13-18 May 2012. In Journal of the Acoustical Society of America, 2012, v. 131 n. 4, p. 3235, abstract no. 1aSCb1 How to Cite? DOI: http://dx.doi.org/10.1121/1.4708066
Abstract	Training deep belief networks (DBNs) is normally done with large data sets. In this work, the goal is to predict traces of the surface of the tongue in ultrasoundimages of the mouth during speech. Performance on this task can be dramatically enhanced by pre-training a DBN jointly on human-supplied traces and ultrasoundimages, then training a modified version of the network to predict traces from ultrasound only. However, hand-tracing the entire dataset of ultrasoundimages is extremely labor intensive. Moreover, the dataset is highly imbalanced since many images are extremely similar. This work presents a bootstrapping method which takes advantage of this imbalance, iteratively selecting a small subset of images to be hand-traced, then (re)training the DBN, making use of an entropy-based diversity measure for the initial selection. With this approach, a three-fold reduction in human time required to trace an entire dataset with human-level accuracy was achieved.
Description	Session 1aSCb - Speech Communication: Speech Processing Potpourri (Poster Session): no. 1aSCb1
Persistent Identifier	http://hdl.handle.net/10722/211020
ISSN	0001-4966 2021 Impact Factor: 2.482 2020 SCImago Journal Rankings: 0.619

DC Field	Value	Language
dc.contributor.author	Berry, J	-
dc.contributor.author	Fasel, I	-
dc.contributor.author	Fadiga, L	-
dc.contributor.author	Archangeli, D	-
dc.date.accessioned	2015-06-30T07:55:39Z	-
dc.date.available	2015-06-30T07:55:39Z	-
dc.date.issued	2012	-
dc.identifier.citation	The ACOUSTICS 2012 Hong Kong Conference & Exihibition, Hong Kong, 13-18 May 2012. In Journal of the Acoustical Society of America, 2012, v. 131 n. 4, p. 3235, abstract no. 1aSCb1	-
dc.identifier.issn	0001-4966	-
dc.identifier.uri	http://hdl.handle.net/10722/211020	-
dc.description	Session 1aSCb - Speech Communication: Speech Processing Potpourri (Poster Session): no. 1aSCb1	-
dc.description.abstract	Training deep belief networks (DBNs) is normally done with large data sets. In this work, the goal is to predict traces of the surface of the tongue in ultrasoundimages of the mouth during speech. Performance on this task can be dramatically enhanced by pre-training a DBN jointly on human-supplied traces and ultrasoundimages, then training a modified version of the network to predict traces from ultrasound only. However, hand-tracing the entire dataset of ultrasoundimages is extremely labor intensive. Moreover, the dataset is highly imbalanced since many images are extremely similar. This work presents a bootstrapping method which takes advantage of this imbalance, iteratively selecting a small subset of images to be hand-traced, then (re)training the DBN, making use of an entropy-based diversity measure for the initial selection. With this approach, a three-fold reduction in human time required to trace an entire dataset with human-level accuracy was achieved.	-
dc.language	eng	-
dc.publisher	Acoustical Society of America. The Journal's web site is located at http://asa.aip.org/jasa.html	-
dc.relation.ispartof	Journal of the Acoustical Society of America	-
dc.rights	Copyright 2012 Acoustical Society of America. This article may be downloaded for personal use only. Any other use requires prior permission of the author and the Acoustical Society of America. The following article appeared in Journal of the Acoustical Society of America, 2012, v. 131 n. 4, p. 3235, abstract no. 1aSCb1 and may be found at https://doi.org/10.1121/1.4708066	-
dc.subject	Physics	-
dc.subject	Sound	-
dc.title	Entropy coding for training deep belief networks with imbalanced and unlabeled data	-
dc.type	Conference_Paper	-
dc.identifier.email	Archangeli, D: darchang@hku.hk	-
dc.identifier.authority	Archangeli, D=rp01748	-
dc.description.nature	published_or_final_version	-
dc.identifier.doi	10.1121/1.4708066	-
dc.identifier.volume	131	-
dc.identifier.issue	4	-
dc.identifier.spage	3235, abstract no. 1aSCb1	-
dc.identifier.epage	3235, abstract no. 1aSCb1	-
dc.publisher.place	United States	-
dc.identifier.issnl	0001-4966	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Conference Paper: Entropy coding for training deep belief networks with imbalanced and unlabeled data

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats