File Download
Supplementary

postgraduate thesis: Statistical diagnostics for longitudinal data analysis : forward search of the GEE method

TitleStatistical diagnostics for longitudinal data analysis : forward search of the GEE method
Authors
Issue Date2015
PublisherThe University of Hong Kong (Pokfulam, Hong Kong)
Citation
Li, N. [李乃霖]. (2015). Statistical diagnostics for longitudinal data analysis : forward search of the GEE method. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. Retrieved from http://dx.doi.org/10.5353/th_b5570813
AbstractIn longitudinal data analysis, masking and swamping (MS) are two common effects that can cause severe problems. Successful identification of MS effects is essential to both outlier detection and longitudinal data analysis because ignorance of the MS effects can make the conclusion of analysis totally meaningless and misleading. In this thesis, a statistical method for analyzing and diagnosing longitudinal data sets is proposed as the forward search of the generalized estimating equation (GEE) method (FSGEE). Starting from an outlier-free initial subset of the data selected using a robust method, FSGEE makes its progress to the next subset by expanding the subset according to the distance of the observations to the GEE model fitted from the current subset. Through monitoring statistical diagnostics during the forward search process, the forward plots are produced by plotting the diagnostics against the sizes of the forward search subsets. The MS effects can then be discovered by simply investigating the forward plots of residuals. When the inclusion of an observation affects the model and the diagnostics of other points significantly, the observation is suspected to be an outlier. When necessary, by examining the forward plots of various statistical diagnostics, a deeper understanding of the observation can be acknowledged, for example changes in the values of the coefficients after the observation is included, or changes in the diagnostics of other observations when the suspicious outlier is removed from the data set. The acknowledgement will help in deciding whether the observation is a true outlier, or just a non-outlying observation with relatively high leverage. Through simulation studies and the analysis of seizure data and hormone data, the forward search of the GEE method is shown to be able to provide a wealth of information for guiding both outlier detection and the identification of MS effects.
DegreeMaster of Philosophy
SubjectLongitudinal method - Statistical methods
Dept/ProgramStatistics and Actuarial Science
Persistent Identifierhttp://hdl.handle.net/10722/219998

 

DC FieldValueLanguage
dc.contributor.authorLi, Nailin-
dc.contributor.author李乃霖-
dc.date.accessioned2015-10-08T23:12:19Z-
dc.date.available2015-10-08T23:12:19Z-
dc.date.issued2015-
dc.identifier.citationLi, N. [李乃霖]. (2015). Statistical diagnostics for longitudinal data analysis : forward search of the GEE method. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. Retrieved from http://dx.doi.org/10.5353/th_b5570813-
dc.identifier.urihttp://hdl.handle.net/10722/219998-
dc.description.abstractIn longitudinal data analysis, masking and swamping (MS) are two common effects that can cause severe problems. Successful identification of MS effects is essential to both outlier detection and longitudinal data analysis because ignorance of the MS effects can make the conclusion of analysis totally meaningless and misleading. In this thesis, a statistical method for analyzing and diagnosing longitudinal data sets is proposed as the forward search of the generalized estimating equation (GEE) method (FSGEE). Starting from an outlier-free initial subset of the data selected using a robust method, FSGEE makes its progress to the next subset by expanding the subset according to the distance of the observations to the GEE model fitted from the current subset. Through monitoring statistical diagnostics during the forward search process, the forward plots are produced by plotting the diagnostics against the sizes of the forward search subsets. The MS effects can then be discovered by simply investigating the forward plots of residuals. When the inclusion of an observation affects the model and the diagnostics of other points significantly, the observation is suspected to be an outlier. When necessary, by examining the forward plots of various statistical diagnostics, a deeper understanding of the observation can be acknowledged, for example changes in the values of the coefficients after the observation is included, or changes in the diagnostics of other observations when the suspicious outlier is removed from the data set. The acknowledgement will help in deciding whether the observation is a true outlier, or just a non-outlying observation with relatively high leverage. Through simulation studies and the analysis of seizure data and hormone data, the forward search of the GEE method is shown to be able to provide a wealth of information for guiding both outlier detection and the identification of MS effects.-
dc.languageeng-
dc.publisherThe University of Hong Kong (Pokfulam, Hong Kong)-
dc.relation.ispartofHKU Theses Online (HKUTO)-
dc.rightsThe author retains all proprietary rights, (such as patent rights) and the right to use in future works.-
dc.rightsCreative Commons: Attribution 3.0 Hong Kong License-
dc.subject.lcshLongitudinal method - Statistical methods-
dc.titleStatistical diagnostics for longitudinal data analysis : forward search of the GEE method-
dc.typePG_Thesis-
dc.identifier.hkulb5570813-
dc.description.thesisnameMaster of Philosophy-
dc.description.thesislevelMaster-
dc.description.thesisdisciplineStatistics and Actuarial Science-
dc.description.naturepublished_or_final_version-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats