File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Classification via minimum incremental coding length

TitleClassification via minimum incremental coding length
Authors
KeywordsClassification
Lossy data coding
Maximum a posteriori
Regularization
Regularized discriminant analysis
Issue Date2009
Citation
SIAM Journal on Imaging Sciences, 2009, v. 2, n. 2, p. 367-395 How to Cite?
AbstractWe present a simple new criterion for classification, based on principles from lossy data compression. The criterion assigns a test sample to the class that uses the minimum number of additional bits to code the test sample, subject to an allowable distortion. We demonstrate the asymptotic optimality of this criterion for Gaussian distributions and analyze its relationships to classical classifiers. The theoretical results clarify the connections between our approach and popular classifiers such as maximum a posteriori (MAP), regularized discriminant analysis (RDA), k-nearest neighbor (k-NN), and support vector machine (SVM), as well as unsupervised methods based on lossy coding. Our formulation induces several good effects on the resulting classifier. First, minimizing the lossy coding length induces a regularization effect which stabilizes the (implicit) density estimate in a small sample setting. Second, compression provides a uniform means of handling classes of varying dimension. The new criterion and its kernel and local versions perform competitively on synthetic examples, as well as on real imagery data such as handwritten digits and face images. On these problems, the performance of our simple classifier approaches the best reported results, without using domainspecific information. All MATLAB code and classification results are publicly available for peer evaluation at http://perception.csl.uiuc.edu/coding/home.htm.
Persistent Identifierhttp://hdl.handle.net/10722/327112
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorWright, John-
dc.contributor.authorMa, Yi-
dc.contributor.authorTao, Yangyu-
dc.contributor.authorLin, Zhouchen-
dc.contributor.authorShum, Heung Yeung-
dc.date.accessioned2023-03-31T05:28:53Z-
dc.date.available2023-03-31T05:28:53Z-
dc.date.issued2009-
dc.identifier.citationSIAM Journal on Imaging Sciences, 2009, v. 2, n. 2, p. 367-395-
dc.identifier.urihttp://hdl.handle.net/10722/327112-
dc.description.abstractWe present a simple new criterion for classification, based on principles from lossy data compression. The criterion assigns a test sample to the class that uses the minimum number of additional bits to code the test sample, subject to an allowable distortion. We demonstrate the asymptotic optimality of this criterion for Gaussian distributions and analyze its relationships to classical classifiers. The theoretical results clarify the connections between our approach and popular classifiers such as maximum a posteriori (MAP), regularized discriminant analysis (RDA), k-nearest neighbor (k-NN), and support vector machine (SVM), as well as unsupervised methods based on lossy coding. Our formulation induces several good effects on the resulting classifier. First, minimizing the lossy coding length induces a regularization effect which stabilizes the (implicit) density estimate in a small sample setting. Second, compression provides a uniform means of handling classes of varying dimension. The new criterion and its kernel and local versions perform competitively on synthetic examples, as well as on real imagery data such as handwritten digits and face images. On these problems, the performance of our simple classifier approaches the best reported results, without using domainspecific information. All MATLAB code and classification results are publicly available for peer evaluation at http://perception.csl.uiuc.edu/coding/home.htm.-
dc.languageeng-
dc.relation.ispartofSIAM Journal on Imaging Sciences-
dc.subjectClassification-
dc.subjectLossy data coding-
dc.subjectMaximum a posteriori-
dc.subjectRegularization-
dc.subjectRegularized discriminant analysis-
dc.titleClassification via minimum incremental coding length-
dc.typeArticle-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1137/070707312-
dc.identifier.scopuseid_2-s2.0-84981228964-
dc.identifier.volume2-
dc.identifier.issue2-
dc.identifier.spage367-
dc.identifier.epage395-
dc.identifier.eissn1936-4954-
dc.identifier.isiWOS:000278101100004-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats