File Download
Supplementary
-
Citations:
- Appears in Collections:
postgraduate thesis: Bayesian censoring approach to rounded zeros in compositional data
Title | Bayesian censoring approach to rounded zeros in compositional data |
---|---|
Authors | |
Advisors | Advisor(s):Bacon-Shone, J |
Issue Date | 2017 |
Publisher | The University of Hong Kong (Pokfulam, Hong Kong) |
Citation | Leung, T. [梁德貞]. (2017). Bayesian censoring approach to rounded zeros in compositional data. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. |
Abstract | The logratio transformation is commonly used for analyzing compositional data. However, logratio analysis is not possible when the data has any zeros. However, zeros are common in data on proportions, concentrations etc. when the data is small enough to be below the detection limit, i.e. left-censored data.
Various methods to handle zeros such as combining data and replacing the zeros with predicted non-zero values have been proposed, including additive and multiplicative replacements. The replacement value can be fixed or variable, and can be obtained from various iterative means like modified EM algorithm.
In this thesis, Bayesian analysis assuming left, right and interval censoring on the original scale of compositional data is proposed and applied to the 3 different data sets and simulated data. This approach has the advantages of reflecting the data recording process, which necessarily involves censoring, performing well on the simulated data and yielding easy interpretation of results.
The new approach is demonstrated by means of models estimated using the standard Markov Chain Monte Carlo(MCMC) package WinBUGS. Pioneering models were also estimated using the new Hamiltonian Monte Carlo(HMC) package, RStan. Unlike the models in WinBUGS and all previous approaches in the literature, the Rstan models are based on censoring over the correct region in the simplex, rather than an approximate censoring region based on the log-ratio space. Models from both tools showed reasonable and consistent results. The advantages and limitations of them are compared and discussed.
Apart from zeros, which are left-censored, both methods can handle right-censored and interval censored data as well.
The Rstan models are particularly promising. They are built on a simple architecture which uses numerical approximation of the integral of probabilities over the censored region. It is comparatively fast and flexible, allowing for the possibilities of modification with different ways to handle the approximation.
|
Degree | Doctor of Philosophy |
Subject | Multivariate analysis Mathematical statistics |
Dept/Program | Social Sciences |
Persistent Identifier | http://hdl.handle.net/10722/244335 |
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Bacon-Shone, J | - |
dc.contributor.author | Leung, Tak-ching | - |
dc.contributor.author | 梁德貞 | - |
dc.date.accessioned | 2017-09-14T04:42:21Z | - |
dc.date.available | 2017-09-14T04:42:21Z | - |
dc.date.issued | 2017 | - |
dc.identifier.citation | Leung, T. [梁德貞]. (2017). Bayesian censoring approach to rounded zeros in compositional data. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. | - |
dc.identifier.uri | http://hdl.handle.net/10722/244335 | - |
dc.description.abstract | The logratio transformation is commonly used for analyzing compositional data. However, logratio analysis is not possible when the data has any zeros. However, zeros are common in data on proportions, concentrations etc. when the data is small enough to be below the detection limit, i.e. left-censored data. Various methods to handle zeros such as combining data and replacing the zeros with predicted non-zero values have been proposed, including additive and multiplicative replacements. The replacement value can be fixed or variable, and can be obtained from various iterative means like modified EM algorithm. In this thesis, Bayesian analysis assuming left, right and interval censoring on the original scale of compositional data is proposed and applied to the 3 different data sets and simulated data. This approach has the advantages of reflecting the data recording process, which necessarily involves censoring, performing well on the simulated data and yielding easy interpretation of results. The new approach is demonstrated by means of models estimated using the standard Markov Chain Monte Carlo(MCMC) package WinBUGS. Pioneering models were also estimated using the new Hamiltonian Monte Carlo(HMC) package, RStan. Unlike the models in WinBUGS and all previous approaches in the literature, the Rstan models are based on censoring over the correct region in the simplex, rather than an approximate censoring region based on the log-ratio space. Models from both tools showed reasonable and consistent results. The advantages and limitations of them are compared and discussed. Apart from zeros, which are left-censored, both methods can handle right-censored and interval censored data as well. The Rstan models are particularly promising. They are built on a simple architecture which uses numerical approximation of the integral of probabilities over the censored region. It is comparatively fast and flexible, allowing for the possibilities of modification with different ways to handle the approximation. | - |
dc.language | eng | - |
dc.publisher | The University of Hong Kong (Pokfulam, Hong Kong) | - |
dc.relation.ispartof | HKU Theses Online (HKUTO) | - |
dc.rights | The author retains all proprietary rights, (such as patent rights) and the right to use in future works. | - |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.subject.lcsh | Multivariate analysis | - |
dc.subject.lcsh | Mathematical statistics | - |
dc.title | Bayesian censoring approach to rounded zeros in compositional data | - |
dc.type | PG_Thesis | - |
dc.description.thesisname | Doctor of Philosophy | - |
dc.description.thesislevel | Doctoral | - |
dc.description.thesisdiscipline | Social Sciences | - |
dc.description.nature | published_or_final_version | - |
dc.identifier.doi | 10.5353/th_991043953697303414 | - |
dc.date.hkucongregation | 2017 | - |
dc.identifier.mmsid | 991043953697303414 | - |