File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Robust and parallel Bayesian model selection

TitleRobust and parallel Bayesian model selection
Authors
KeywordsScalable inference
Bayesian statistics
Model selection
Machine learning
Issue Date2018
Citation
Computational Statistics and Data Analysis, 2018, v. 127, p. 229-247 How to Cite?
Abstract© 2018 Elsevier B.V. Effective and accurate model selection is an important problem in modern data analysis. One of the major challenges is the computational burden required to handle large datasets that cannot be stored or processed on one machine. Another challenge one may encounter is the presence of outliers and contaminations that damage the inference quality. The parallel “divide and conquer” model selection strategy divides the observations of the full dataset into roughly equal subsets and perform inference and model selection independently on each subset. After local subset inference, this method aggregates the posterior model probabilities or other model/variable selection criteria to obtain a final model by using the notion of geometric median. This approach leads to improved concentration in finding the “correct” model and model parameters and also is provably robust to outliers and data contamination.
Persistent Identifierhttp://hdl.handle.net/10722/296174
ISSN
2023 Impact Factor: 1.5
2023 SCImago Journal Rankings: 1.008
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorZhang, Michael Minyi-
dc.contributor.authorLam, Henry-
dc.contributor.authorLin, Lizhen-
dc.date.accessioned2021-02-11T04:52:59Z-
dc.date.available2021-02-11T04:52:59Z-
dc.date.issued2018-
dc.identifier.citationComputational Statistics and Data Analysis, 2018, v. 127, p. 229-247-
dc.identifier.issn0167-9473-
dc.identifier.urihttp://hdl.handle.net/10722/296174-
dc.description.abstract© 2018 Elsevier B.V. Effective and accurate model selection is an important problem in modern data analysis. One of the major challenges is the computational burden required to handle large datasets that cannot be stored or processed on one machine. Another challenge one may encounter is the presence of outliers and contaminations that damage the inference quality. The parallel “divide and conquer” model selection strategy divides the observations of the full dataset into roughly equal subsets and perform inference and model selection independently on each subset. After local subset inference, this method aggregates the posterior model probabilities or other model/variable selection criteria to obtain a final model by using the notion of geometric median. This approach leads to improved concentration in finding the “correct” model and model parameters and also is provably robust to outliers and data contamination.-
dc.languageeng-
dc.relation.ispartofComputational Statistics and Data Analysis-
dc.subjectScalable inference-
dc.subjectBayesian statistics-
dc.subjectModel selection-
dc.subjectMachine learning-
dc.titleRobust and parallel Bayesian model selection-
dc.typeArticle-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1016/j.csda.2018.05.016-
dc.identifier.scopuseid_2-s2.0-85048259822-
dc.identifier.volume127-
dc.identifier.spage229-
dc.identifier.epage247-
dc.identifier.isiWOS:000439748700015-
dc.identifier.issnl0167-9473-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats