File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Corrected generalized cross-validation for finite ensembles of penalized estimators

TitleCorrected generalized cross-validation for finite ensembles of penalized estimators
Authors
Keywordsdegrees of freedom adjustment
ensemble methods
generalized cross-validation
penalized estimators
Issue Date2025
Citation
Journal of the Royal Statistical Society Series B Statistical Methodology, 2025, v. 87, n. 2, p. 289-318 How to Cite?
AbstractGeneralized cross-validation (GCV) is a widely used method for estimating the squared out-of-sample prediction risk that employs scalar degrees of freedom adjustment (in a multiplicative sense) to the squared training error. In this paper, we examine the consistency of GCV for estimating the prediction risk of arbitrary ensembles of penalized least-squares estimators. We show that GCV is inconsistent for any finite ensemble of size greater than one. Towards repairing this shortcoming, we identify a correction that involves an additional scalar correction (in an additive sense) based on degrees of freedom adjusted training errors from each ensemble component. The proposed estimator (termed CGCV) maintains the computational advantages of GCV and requires neither sample splitting, model refitting, or out-of-bag risk estimation. The estimator stems from a finer inspection of the ensemble risk decomposition and two intermediate risk estimators for the components in this decomposition. We provide a non-asymptotic analysis of the CGCV and the two intermediate risk estimators for ensembles of convex penalized estimators under Gaussian features and a linear response model. Furthermore, in the special case of ridge regression, we extend the analysis to general feature and response distributions using random matrix theory, which establishes model-free uniform consistency of CGCV.
Persistent Identifierhttp://hdl.handle.net/10722/365456
ISSN
2023 Impact Factor: 3.1
2023 SCImago Journal Rankings: 4.330

 

DC FieldValueLanguage
dc.contributor.authorBellec, Pierre C.-
dc.contributor.authorDu, Jin Hong-
dc.contributor.authorKoriyama, Takuya-
dc.contributor.authorPatil, Pratik-
dc.contributor.authorTan, Kai-
dc.date.accessioned2025-11-05T09:40:39Z-
dc.date.available2025-11-05T09:40:39Z-
dc.date.issued2025-
dc.identifier.citationJournal of the Royal Statistical Society Series B Statistical Methodology, 2025, v. 87, n. 2, p. 289-318-
dc.identifier.issn1369-7412-
dc.identifier.urihttp://hdl.handle.net/10722/365456-
dc.description.abstractGeneralized cross-validation (GCV) is a widely used method for estimating the squared out-of-sample prediction risk that employs scalar degrees of freedom adjustment (in a multiplicative sense) to the squared training error. In this paper, we examine the consistency of GCV for estimating the prediction risk of arbitrary ensembles of penalized least-squares estimators. We show that GCV is inconsistent for any finite ensemble of size greater than one. Towards repairing this shortcoming, we identify a correction that involves an additional scalar correction (in an additive sense) based on degrees of freedom adjusted training errors from each ensemble component. The proposed estimator (termed CGCV) maintains the computational advantages of GCV and requires neither sample splitting, model refitting, or out-of-bag risk estimation. The estimator stems from a finer inspection of the ensemble risk decomposition and two intermediate risk estimators for the components in this decomposition. We provide a non-asymptotic analysis of the CGCV and the two intermediate risk estimators for ensembles of convex penalized estimators under Gaussian features and a linear response model. Furthermore, in the special case of ridge regression, we extend the analysis to general feature and response distributions using random matrix theory, which establishes model-free uniform consistency of CGCV.-
dc.languageeng-
dc.relation.ispartofJournal of the Royal Statistical Society Series B Statistical Methodology-
dc.subjectdegrees of freedom adjustment-
dc.subjectensemble methods-
dc.subjectgeneralized cross-validation-
dc.subjectpenalized estimators-
dc.titleCorrected generalized cross-validation for finite ensembles of penalized estimators-
dc.typeArticle-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1093/jrsssb/qkae092-
dc.identifier.scopuseid_2-s2.0-105002662998-
dc.identifier.volume87-
dc.identifier.issue2-
dc.identifier.spage289-
dc.identifier.epage318-
dc.identifier.eissn1467-9868-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats