File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Improved nonparametric penalized maximum likelihood estimation for arbitrarily censored survival data

TitleImproved nonparametric penalized maximum likelihood estimation for arbitrarily censored survival data
Authors
KeywordsBIC
censoring
nonparametric maximum likelihood
smoothing
survival
Issue Date24-Jun-2022
PublisherWiley
Citation
Statistics in Medicine, 2022, v. 41, n. 20, p. 4006-4021 How to Cite?
AbstractNonparametric maximum likelihood estimation encompasses a group of classic methods to estimate distribution-associated functions from potentially censored and truncated data, with extensive applications in survival analysis. These methods, including the Kaplan-Meier estimator and Turnbull's method, often result in overfitting, especially when the sample size is small. We propose an improvement to these methods by applying kernel smoothing to their raw estimates, based on a BIC-type loss function that balances the trade-off between optimizing model fit and controlling model complexity. In the context of a longitudinal study with repeated observations, we detail our proposed smoothing procedure and optimization algorithm. With extensive simulation studies over multiple realistic scenarios, we demonstrate that our smoothing-based procedure provides better overall accuracy in both survival function estimation and individual-level time-to-event prediction (imputation) by reducing overfitting. Our smoothing procedure decreases the bias (discrepancy between the estimated and true simulated survival function) using interval-censored data by up to 48% compared to the raw un-smoothed estimate, with similar improvements of up to 34% and 23% in within-sample and out-of-sample prediction, respectively. Our smoothing algorithm also demonstrates significant overall improvement across all three metrics when compared to a popular semiparametric B-splines estimation method. Finally, we apply our method to real data on censored breast cancer diagnosis, which similarly shows improvement when compared to empirical survival estimates from uncensored data. We provide an R package, SISE, for implementing our penalized likelihood method.
Persistent Identifierhttp://hdl.handle.net/10722/347319
ISSN
2023 Impact Factor: 1.8
2023 SCImago Journal Rankings: 1.348

 

DC FieldValueLanguage
dc.contributor.authorTubbs, Justin D-
dc.contributor.authorChen, Lane G-
dc.contributor.authorThach, Thuan Quoc-
dc.contributor.authorSham, Pak C-
dc.date.accessioned2024-09-21T00:30:55Z-
dc.date.available2024-09-21T00:30:55Z-
dc.date.issued2022-06-24-
dc.identifier.citationStatistics in Medicine, 2022, v. 41, n. 20, p. 4006-4021-
dc.identifier.issn0277-6715-
dc.identifier.urihttp://hdl.handle.net/10722/347319-
dc.description.abstractNonparametric maximum likelihood estimation encompasses a group of classic methods to estimate distribution-associated functions from potentially censored and truncated data, with extensive applications in survival analysis. These methods, including the Kaplan-Meier estimator and Turnbull's method, often result in overfitting, especially when the sample size is small. We propose an improvement to these methods by applying kernel smoothing to their raw estimates, based on a BIC-type loss function that balances the trade-off between optimizing model fit and controlling model complexity. In the context of a longitudinal study with repeated observations, we detail our proposed smoothing procedure and optimization algorithm. With extensive simulation studies over multiple realistic scenarios, we demonstrate that our smoothing-based procedure provides better overall accuracy in both survival function estimation and individual-level time-to-event prediction (imputation) by reducing overfitting. Our smoothing procedure decreases the bias (discrepancy between the estimated and true simulated survival function) using interval-censored data by up to 48% compared to the raw un-smoothed estimate, with similar improvements of up to 34% and 23% in within-sample and out-of-sample prediction, respectively. Our smoothing algorithm also demonstrates significant overall improvement across all three metrics when compared to a popular semiparametric B-splines estimation method. Finally, we apply our method to real data on censored breast cancer diagnosis, which similarly shows improvement when compared to empirical survival estimates from uncensored data. We provide an R package, SISE, for implementing our penalized likelihood method.-
dc.languageeng-
dc.publisherWiley-
dc.relation.ispartofStatistics in Medicine-
dc.subjectBIC-
dc.subjectcensoring-
dc.subjectnonparametric maximum likelihood-
dc.subjectsmoothing-
dc.subjectsurvival-
dc.titleImproved nonparametric penalized maximum likelihood estimation for arbitrarily censored survival data-
dc.typeArticle-
dc.identifier.doi10.1002/sim.9489-
dc.identifier.pmid35750329-
dc.identifier.scopuseid_2-s2.0-85132581289-
dc.identifier.volume41-
dc.identifier.issue20-
dc.identifier.spage4006-
dc.identifier.epage4021-
dc.identifier.eissn1097-0258-
dc.identifier.issnl0277-6715-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats