File Download
Supplementary
-
Citations:
- Appears in Collections:
postgraduate thesis: Hand detection and hand shape and posture analysis in images
Title | Hand detection and hand shape and posture analysis in images |
---|---|
Authors | |
Issue Date | 2015 |
Publisher | The University of Hong Kong (Pokfulam, Hong Kong) |
Citation | Zhu, X. [朱曉龍]. (2015). Hand detection and hand shape and posture analysis in images. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. Retrieved from http://dx.doi.org/10.5353/th_b5699948 |
Abstract | Hand is an important part of human in communicating with other persons and interacting with objects in the world. In this thesis, we will explore hand analysis, which includes hand detection, hand posture and hand shape analysis.
First I will focus on hand detection. We start by exploring pixel-level hand detection. It is a challenging problem because the appearance of hands can vary greatly in images. We use a random forest framework in order to predict hand mask and hand part labels in an efficient and robust manner. Our approach exploits the inherent contextual information from structured hand labelling for pixel-level hand detection and hand part labelling. Having obtained a pixel-level hand probability map, we present an end-to-end system to detect hands in a bounding box fashion. Our model can not only detect and localize the hands but also provide detailed information within the bounding box by using a multi-task convolutional neural network. Experimental results show that hand detection can be improved by utilizing more information such as hand shape and hand landmark points than only using bounding boxes.
In the second part of my thesis, we will concentrate on two specific problems based on the good hand detection results. The first problem is to recognize the hand postures efficiently. We present a flexible method for hand posture recognition by fusing information from color and depth images using kernel descriptors. Existing methods usually focus on designing intuitive features for color and depth images. On the contrary, our method extracts common patch-level features, and combines them by means of kernel descriptors. Linear SVM is used to predict the class label efficiently.
The second problem is related to hand shape. We present an interesting application of shape analysis to train a hand shape recognizer automatically from simple sketches, such as a “stick-figure” of a hand shape. We introduce the Hand Boltzmann Machine, a generative model built upon unsupervised learning, to represent the hand shape space of a binary image, and formulate the user provided sketches as an initial guidance for sampling to generate realistic hand shape samples. A hand shape recognizer is trained from these samples and used to classify the hand shapes intuitively. |
Degree | Doctor of Philosophy |
Subject | Biometric identification |
Dept/Program | Computer Science |
Persistent Identifier | http://hdl.handle.net/10722/223018 |
HKU Library Item ID | b5699948 |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Zhu, Xiaolong | - |
dc.contributor.author | 朱曉龍 | - |
dc.date.accessioned | 2016-02-17T23:14:32Z | - |
dc.date.available | 2016-02-17T23:14:32Z | - |
dc.date.issued | 2015 | - |
dc.identifier.citation | Zhu, X. [朱曉龍]. (2015). Hand detection and hand shape and posture analysis in images. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. Retrieved from http://dx.doi.org/10.5353/th_b5699948 | - |
dc.identifier.uri | http://hdl.handle.net/10722/223018 | - |
dc.description.abstract | Hand is an important part of human in communicating with other persons and interacting with objects in the world. In this thesis, we will explore hand analysis, which includes hand detection, hand posture and hand shape analysis. First I will focus on hand detection. We start by exploring pixel-level hand detection. It is a challenging problem because the appearance of hands can vary greatly in images. We use a random forest framework in order to predict hand mask and hand part labels in an efficient and robust manner. Our approach exploits the inherent contextual information from structured hand labelling for pixel-level hand detection and hand part labelling. Having obtained a pixel-level hand probability map, we present an end-to-end system to detect hands in a bounding box fashion. Our model can not only detect and localize the hands but also provide detailed information within the bounding box by using a multi-task convolutional neural network. Experimental results show that hand detection can be improved by utilizing more information such as hand shape and hand landmark points than only using bounding boxes. In the second part of my thesis, we will concentrate on two specific problems based on the good hand detection results. The first problem is to recognize the hand postures efficiently. We present a flexible method for hand posture recognition by fusing information from color and depth images using kernel descriptors. Existing methods usually focus on designing intuitive features for color and depth images. On the contrary, our method extracts common patch-level features, and combines them by means of kernel descriptors. Linear SVM is used to predict the class label efficiently. The second problem is related to hand shape. We present an interesting application of shape analysis to train a hand shape recognizer automatically from simple sketches, such as a “stick-figure” of a hand shape. We introduce the Hand Boltzmann Machine, a generative model built upon unsupervised learning, to represent the hand shape space of a binary image, and formulate the user provided sketches as an initial guidance for sampling to generate realistic hand shape samples. A hand shape recognizer is trained from these samples and used to classify the hand shapes intuitively. | - |
dc.language | eng | - |
dc.publisher | The University of Hong Kong (Pokfulam, Hong Kong) | - |
dc.relation.ispartof | HKU Theses Online (HKUTO) | - |
dc.rights | The author retains all proprietary rights, (such as patent rights) and the right to use in future works. | - |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.subject.lcsh | Biometric identification | - |
dc.title | Hand detection and hand shape and posture analysis in images | - |
dc.type | PG_Thesis | - |
dc.identifier.hkul | b5699948 | - |
dc.description.thesisname | Doctor of Philosophy | - |
dc.description.thesislevel | Doctoral | - |
dc.description.thesisdiscipline | Computer Science | - |
dc.description.nature | published_or_final_version | - |
dc.identifier.doi | 10.5353/th_b5699948 | - |
dc.identifier.mmsid | 991018968699703414 | - |