File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: AUGMENTED DOUBLY ROBUST POST-IMPUTATION INFERENCE FOR PROTEOMIC DATA

TitleAUGMENTED DOUBLY ROBUST POST-IMPUTATION INFERENCE FOR PROTEOMIC DATA
Authors
Keywordsdouble robustness
post-imputation inference
Proteomic data
variational autoen-coder
Issue Date2025
Citation
Annals of Applied Statistics, 2025, v. 19, n. 2, p. 1006-1027 How to Cite?
AbstractQuantitative measurements produced by mass spectrometry proteomics experiments offer a direct way to explore the role of proteins in molecular mechanisms. However, analysis of such data is challenging due to the large proportion of missing values. A common strategy to address this issue is to utilize an imputed dataset, which often introduces systematic bias into down-stream analyses if the imputation errors are ignored. In this paper we propose a statistical framework, inspired by doubly robust estimators, that offers valid and efficient inference for proteomic data. Our framework combines powerful machine learning tools, such as variational autoencoders, to augment the imputation quality with high-dimensional peptide data, and a parametric model to estimate the propensity score for debiasing imputed outcomes. Our estimator is compatible with the double machine learning framework and has provable properties. Simulation studies verify its empirical superiority over other existing procedures. In application to both single-cell proteomic data and bulk-cell Alzheimer’s disease data our method utilizes the imputed data to gain additional, meaningful discoveries and yet maintains good control of false positives.
Persistent Identifierhttp://hdl.handle.net/10722/365461
ISSN
2023 Impact Factor: 1.3
2023 SCImago Journal Rankings: 0.954

 

DC FieldValueLanguage
dc.contributor.authorMoon, Haeun-
dc.contributor.authorDu, Jin Hong-
dc.contributor.authorLei, Jing-
dc.contributor.authorRoeder, Kathryn-
dc.date.accessioned2025-11-05T09:40:41Z-
dc.date.available2025-11-05T09:40:41Z-
dc.date.issued2025-
dc.identifier.citationAnnals of Applied Statistics, 2025, v. 19, n. 2, p. 1006-1027-
dc.identifier.issn1932-6157-
dc.identifier.urihttp://hdl.handle.net/10722/365461-
dc.description.abstractQuantitative measurements produced by mass spectrometry proteomics experiments offer a direct way to explore the role of proteins in molecular mechanisms. However, analysis of such data is challenging due to the large proportion of missing values. A common strategy to address this issue is to utilize an imputed dataset, which often introduces systematic bias into down-stream analyses if the imputation errors are ignored. In this paper we propose a statistical framework, inspired by doubly robust estimators, that offers valid and efficient inference for proteomic data. Our framework combines powerful machine learning tools, such as variational autoencoders, to augment the imputation quality with high-dimensional peptide data, and a parametric model to estimate the propensity score for debiasing imputed outcomes. Our estimator is compatible with the double machine learning framework and has provable properties. Simulation studies verify its empirical superiority over other existing procedures. In application to both single-cell proteomic data and bulk-cell Alzheimer’s disease data our method utilizes the imputed data to gain additional, meaningful discoveries and yet maintains good control of false positives.-
dc.languageeng-
dc.relation.ispartofAnnals of Applied Statistics-
dc.subjectdouble robustness-
dc.subjectpost-imputation inference-
dc.subjectProteomic data-
dc.subjectvariational autoen-coder-
dc.titleAUGMENTED DOUBLY ROBUST POST-IMPUTATION INFERENCE FOR PROTEOMIC DATA-
dc.typeArticle-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1214/25-AOAS2012-
dc.identifier.scopuseid_2-s2.0-105007975244-
dc.identifier.volume19-
dc.identifier.issue2-
dc.identifier.spage1006-
dc.identifier.epage1027-
dc.identifier.eissn1941-7330-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats