File Download
  Links for fulltext
     (May Require Subscription)
Supplementary

postgraduate thesis: On incomplete multinomial data modeling and interactive neural and statistical computing with GPU

TitleOn incomplete multinomial data modeling and interactive neural and statistical computing with GPU
Authors
Advisors
Advisor(s):Yin, GTian, G
Issue Date2018
PublisherThe University of Hong Kong (Pokfulam, Hong Kong)
Citation
Dong, F. [東方虎]. (2018). On incomplete multinomial data modeling and interactive neural and statistical computing with GPU. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.
AbstractThis thesis consists of five chapters describing three independent works. Chapter 1 introduces the thesis and fills some necessary background. Chapter 2 describes the first work on incomplete multinomial model for count data sampled on a random partition. It contains a solution to the estimation problem by an iterative algorithm. The incomplete multinomial likelihood is parameterized by the complete-cell probabilities from the most refined partition. Its sufficient statistics include the variable-cell formation observed as an indicator matrix and all cell counts. With externally imposed structures on the cell formation process, it reduces to special models. The weaver algorithm enjoys the ascent property and has a linear rate of convergence. Its steps are short and amenable to a parallel implementation. It is significantly faster than the state-of-the-art EM/MM algorithm when fitting the Plackett--Luce model to a benchmark data set. The chapter also develops an analytic theory to investigate the conditions surrounding the global maximization of the likelihood. Simulation experiments are designed to show the model and algorithm's performance on recovering very weak signals. Asymptotic properties of the estimator are derived and validated with simulations. The next two chapters both design and implement softwares that combine the spreadsheet software's highly interactive user interface with the Graphics Processing Unit's high computing performance. Chapter 3 presents a general design of an interactive neural network trainer. Its main features include the abilities to specify different transfer functions, loss functions, and learning algorithms, facilities for stepping the learning course and tracking user defined variables, and a mechanism to specify constraints for the weights. It also includes a forward selection algorithm for optimizing the network architecture. Chapter 4 implements a dynamic-link library of GPU-executed matrix functions that can be called on the spreadsheet. It then demonstrates the implementation of an interactive software for multivariate statistical analysis utilizing the GPU matrix library. Chapter 5 makes concluding remarks and lists some potential directions for future works.
DegreeDoctor of Philosophy
SubjectParameter estimation
Estimation theory
Sampling (Statistics)
Iterative methods (Mathematics)
Mathematical optimization
Graphics processing units
Dept/ProgramStatistics and Actuarial Science
Persistent Identifierhttp://hdl.handle.net/10722/261493

 

DC FieldValueLanguage
dc.contributor.advisorYin, G-
dc.contributor.advisorTian, G-
dc.contributor.authorDong, Fanghu-
dc.contributor.author東方虎-
dc.date.accessioned2018-09-20T06:43:56Z-
dc.date.available2018-09-20T06:43:56Z-
dc.date.issued2018-
dc.identifier.citationDong, F. [東方虎]. (2018). On incomplete multinomial data modeling and interactive neural and statistical computing with GPU. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.-
dc.identifier.urihttp://hdl.handle.net/10722/261493-
dc.description.abstractThis thesis consists of five chapters describing three independent works. Chapter 1 introduces the thesis and fills some necessary background. Chapter 2 describes the first work on incomplete multinomial model for count data sampled on a random partition. It contains a solution to the estimation problem by an iterative algorithm. The incomplete multinomial likelihood is parameterized by the complete-cell probabilities from the most refined partition. Its sufficient statistics include the variable-cell formation observed as an indicator matrix and all cell counts. With externally imposed structures on the cell formation process, it reduces to special models. The weaver algorithm enjoys the ascent property and has a linear rate of convergence. Its steps are short and amenable to a parallel implementation. It is significantly faster than the state-of-the-art EM/MM algorithm when fitting the Plackett--Luce model to a benchmark data set. The chapter also develops an analytic theory to investigate the conditions surrounding the global maximization of the likelihood. Simulation experiments are designed to show the model and algorithm's performance on recovering very weak signals. Asymptotic properties of the estimator are derived and validated with simulations. The next two chapters both design and implement softwares that combine the spreadsheet software's highly interactive user interface with the Graphics Processing Unit's high computing performance. Chapter 3 presents a general design of an interactive neural network trainer. Its main features include the abilities to specify different transfer functions, loss functions, and learning algorithms, facilities for stepping the learning course and tracking user defined variables, and a mechanism to specify constraints for the weights. It also includes a forward selection algorithm for optimizing the network architecture. Chapter 4 implements a dynamic-link library of GPU-executed matrix functions that can be called on the spreadsheet. It then demonstrates the implementation of an interactive software for multivariate statistical analysis utilizing the GPU matrix library. Chapter 5 makes concluding remarks and lists some potential directions for future works.-
dc.languageeng-
dc.publisherThe University of Hong Kong (Pokfulam, Hong Kong)-
dc.relation.ispartofHKU Theses Online (HKUTO)-
dc.rightsThe author retains all proprietary rights, (such as patent rights) and the right to use in future works.-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subject.lcshParameter estimation-
dc.subject.lcshEstimation theory-
dc.subject.lcshSampling (Statistics)-
dc.subject.lcshIterative methods (Mathematics)-
dc.subject.lcshMathematical optimization-
dc.subject.lcshGraphics processing units-
dc.titleOn incomplete multinomial data modeling and interactive neural and statistical computing with GPU-
dc.typePG_Thesis-
dc.description.thesisnameDoctor of Philosophy-
dc.description.thesislevelDoctoral-
dc.description.thesisdisciplineStatistics and Actuarial Science-
dc.description.naturepublished_or_final_version-
dc.identifier.doi10.5353/th_991044040572603414-
dc.date.hkucongregation2018-
dc.identifier.mmsid991044040572603414-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats