File Download
Supplementary

postgraduate thesis: Nonparametric Bayesian methods for visual data association

TitleNonparametric Bayesian methods for visual data association
Authors
Issue Date2016
PublisherThe University of Hong Kong (Pokfulam, Hong Kong)
Citation
Sun, X. [孙星]. (2016). Nonparametric Bayesian methods for visual data association. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.
AbstractVisual data association, widely used in object modeling among multi-dimensional visual data, proves both useful and challenging. Manual preprocessing of object tracking is difficult and costly, involving procedures such as labeling the object position and setting constraint for the tracking trajectory. Massive computational cost and inconsistency in the number of objects identified also cause troubles in actual use. In this dissertation, emphasis is placed on three aspects, namely object vari- ations among regions in two-dimensional (2D) images, object trajectories among frames in three-dimensional (3D) videos and object relationships among view- points in the four-dimensional (4D) light field, which are addressed with spatial, temporal, and light field nonparametric Bayesian methods respectively. First of all, the Doubly Stochastic Dirichlet Process (DSDP) is proposed in the global topic measurement space modeling, which poses weaker assumption com- pared to the discrete Markov assumption, thus resulting in a lower computational cost than other nonparametric Bayesian models. A mixture model of the spatial DSDP is also presented, thinned from Dirichlet Process Mixture (DPM) with- out considerable auxiliary covariates, where the marked function prior makes the number of land-cover classes consistent while the stochastic process prior modelsthe 2D hyperspectral image (HSI) land cover variation globally. The consistency of the number of land covers is maintained in large-scale geographical areas of the HSI. Experiments show that the model is robust and consistent for the HSI identification problem with weak or even no supervision. Second, research is conducted over unsupervised tracking algorithm for hu- man and car trajectories detection in 3D video clips through the mixture model of temporal Doubly Stochastic Dirichlet Process (TDSDP). The TDSDP cap- tures the varying flow of crowds and vehicles in the temporal domain without the Markov assumption, making it particularly suitable for long-term tracking. Besides, TDSDP prior can estimate the number of trajectories automatically. Experimental results using synthetic and real-world data show that the proposed TDSDP mixture is superior to the DPM and Dependent Dirichlet Process (DDP) concerning topic variation modeling. PETS2001 dataset experiments show that TDSDP has more robust object tracking capability over DDP based on General- ized Polya Urn. Third, a sparse hierarchical nonparametric Bayesian model is used to repre- sent the data captured by 4D light field cameras, in which the light field can be regarded as a set of sub-aperture views. To capture the visual variations of the object in different viewpoints, a concept called “depth flow features” is proposed. These views can then be modeled statistically with a sparse representation in a fully unsupervised manner. While local dictionaries are learned from the corre- sponding sub-aperture view, all the views with different perspectives share one global dictionary. To prove the validity of the proposed model, it is applied to denoise the light field data. It is demonstrated that this method outperforms several state-of-the-art approaches in this application. (460 words)
DegreeDoctor of Philosophy
SubjectImage processing
Dept/ProgramElectrical and Electronic Engineering
Persistent Identifierhttp://hdl.handle.net/10722/238349
HKU Library Item IDb5824360

 

DC FieldValueLanguage
dc.contributor.authorSun, Xing-
dc.contributor.author孙星-
dc.date.accessioned2017-02-10T07:29:34Z-
dc.date.available2017-02-10T07:29:34Z-
dc.date.issued2016-
dc.identifier.citationSun, X. [孙星]. (2016). Nonparametric Bayesian methods for visual data association. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.-
dc.identifier.urihttp://hdl.handle.net/10722/238349-
dc.description.abstractVisual data association, widely used in object modeling among multi-dimensional visual data, proves both useful and challenging. Manual preprocessing of object tracking is difficult and costly, involving procedures such as labeling the object position and setting constraint for the tracking trajectory. Massive computational cost and inconsistency in the number of objects identified also cause troubles in actual use. In this dissertation, emphasis is placed on three aspects, namely object vari- ations among regions in two-dimensional (2D) images, object trajectories among frames in three-dimensional (3D) videos and object relationships among view- points in the four-dimensional (4D) light field, which are addressed with spatial, temporal, and light field nonparametric Bayesian methods respectively. First of all, the Doubly Stochastic Dirichlet Process (DSDP) is proposed in the global topic measurement space modeling, which poses weaker assumption com- pared to the discrete Markov assumption, thus resulting in a lower computational cost than other nonparametric Bayesian models. A mixture model of the spatial DSDP is also presented, thinned from Dirichlet Process Mixture (DPM) with- out considerable auxiliary covariates, where the marked function prior makes the number of land-cover classes consistent while the stochastic process prior modelsthe 2D hyperspectral image (HSI) land cover variation globally. The consistency of the number of land covers is maintained in large-scale geographical areas of the HSI. Experiments show that the model is robust and consistent for the HSI identification problem with weak or even no supervision. Second, research is conducted over unsupervised tracking algorithm for hu- man and car trajectories detection in 3D video clips through the mixture model of temporal Doubly Stochastic Dirichlet Process (TDSDP). The TDSDP cap- tures the varying flow of crowds and vehicles in the temporal domain without the Markov assumption, making it particularly suitable for long-term tracking. Besides, TDSDP prior can estimate the number of trajectories automatically. Experimental results using synthetic and real-world data show that the proposed TDSDP mixture is superior to the DPM and Dependent Dirichlet Process (DDP) concerning topic variation modeling. PETS2001 dataset experiments show that TDSDP has more robust object tracking capability over DDP based on General- ized Polya Urn. Third, a sparse hierarchical nonparametric Bayesian model is used to repre- sent the data captured by 4D light field cameras, in which the light field can be regarded as a set of sub-aperture views. To capture the visual variations of the object in different viewpoints, a concept called “depth flow features” is proposed. These views can then be modeled statistically with a sparse representation in a fully unsupervised manner. While local dictionaries are learned from the corre- sponding sub-aperture view, all the views with different perspectives share one global dictionary. To prove the validity of the proposed model, it is applied to denoise the light field data. It is demonstrated that this method outperforms several state-of-the-art approaches in this application. (460 words) -
dc.languageeng-
dc.publisherThe University of Hong Kong (Pokfulam, Hong Kong)-
dc.relation.ispartofHKU Theses Online (HKUTO)-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.rightsThe author retains all proprietary rights, (such as patent rights) and the right to use in future works.-
dc.subject.lcshImage processing-
dc.titleNonparametric Bayesian methods for visual data association-
dc.typePG_Thesis-
dc.identifier.hkulb5824360-
dc.description.thesisnameDoctor of Philosophy-
dc.description.thesislevelDoctoral-
dc.description.thesisdisciplineElectrical and Electronic Engineering-
dc.description.naturepublished_or_final_version-
dc.identifier.mmsid991021210739703414-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats