File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Evaluation of marker selection methods and statistical models for chronological age prediction based on DNA methylation

TitleEvaluation of marker selection methods and statistical models for chronological age prediction based on DNA methylation
Authors
KeywordsDNA methylation
Age prediction
Forward selection
LASSO
Multiple linear regression
Machine learning
Issue Date2020
PublisherElsevier BV. The Journal's web site is located at http://www.elsevier.com/locate/legalmed
Citation
Legal Medicine, 2020, v. 47, article no. 101744 How to Cite?
AbstractIn forensic investigation, retrieving biological information from DNA evidence is a promising field of interest. One of the applications is on the estimation of the age of the donor based on DNA methylation. A large number of studies focused on age prediction using the 450 K Human Methylation Beadchip. Various marker selection methods and prediction models have been considered. However, there is a lack of research evaluating different high-dimensional variable selection methods of CpG sites with various models for age prediction. The aim of this study is to evaluate four variable selection methods (forward selection, LASSO, elastic net and SCAD) combined with a classical statistical model and sophisticated machine learning models based on the mean absolute deviation (MAD) and the root-mean-square error (RMSE). We used publicly available 450 K data set containing 991 whole blood samples (age 19–101 years). We found that the multiple linear regression model with 16 markers selected from the forward selection method performed very well in age prediction (MAD = 3.76 years and RMSE = 5.01 years). On the other hand, the highly advanced ultrahigh dimensional variable selection methods and sophisticated machine learning algorithms appeared unnecessary for age prediction based on DNA methylation.
Persistent Identifierhttp://hdl.handle.net/10722/304015
ISSN
2023 Impact Factor: 1.3
2023 SCImago Journal Rankings: 0.491
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorLau, PY-
dc.contributor.authorFung, WK-
dc.date.accessioned2021-09-23T08:54:03Z-
dc.date.available2021-09-23T08:54:03Z-
dc.date.issued2020-
dc.identifier.citationLegal Medicine, 2020, v. 47, article no. 101744-
dc.identifier.issn1344-6223-
dc.identifier.urihttp://hdl.handle.net/10722/304015-
dc.description.abstractIn forensic investigation, retrieving biological information from DNA evidence is a promising field of interest. One of the applications is on the estimation of the age of the donor based on DNA methylation. A large number of studies focused on age prediction using the 450 K Human Methylation Beadchip. Various marker selection methods and prediction models have been considered. However, there is a lack of research evaluating different high-dimensional variable selection methods of CpG sites with various models for age prediction. The aim of this study is to evaluate four variable selection methods (forward selection, LASSO, elastic net and SCAD) combined with a classical statistical model and sophisticated machine learning models based on the mean absolute deviation (MAD) and the root-mean-square error (RMSE). We used publicly available 450 K data set containing 991 whole blood samples (age 19–101 years). We found that the multiple linear regression model with 16 markers selected from the forward selection method performed very well in age prediction (MAD = 3.76 years and RMSE = 5.01 years). On the other hand, the highly advanced ultrahigh dimensional variable selection methods and sophisticated machine learning algorithms appeared unnecessary for age prediction based on DNA methylation.-
dc.languageeng-
dc.publisherElsevier BV. The Journal's web site is located at http://www.elsevier.com/locate/legalmed-
dc.relation.ispartofLegal Medicine-
dc.subjectDNA methylation-
dc.subjectAge prediction-
dc.subjectForward selection-
dc.subjectLASSO-
dc.subjectMultiple linear regression-
dc.subjectMachine learning-
dc.titleEvaluation of marker selection methods and statistical models for chronological age prediction based on DNA methylation-
dc.typeArticle-
dc.identifier.emailFung, WK: wingfung@hkucc.hku.hk-
dc.identifier.authorityFung, WK=rp00696-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1016/j.legalmed.2020.101744-
dc.identifier.pmid32659707-
dc.identifier.scopuseid_2-s2.0-85087687763-
dc.identifier.hkuros325605-
dc.identifier.volume47-
dc.identifier.spagearticle no. 101744-
dc.identifier.epagearticle no. 101744-
dc.identifier.isiWOS:000579855100009-
dc.publisher.placeNetherlands-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats