File Download
Supplementary
-
Citations:
- Appears in Collections:
postgraduate thesis: A model-based approach to the recovery of non-rigid shape from an image sequence
Title | A model-based approach to the recovery of non-rigid shape from an image sequence |
---|---|
Authors | |
Issue Date | 2016 |
Publisher | The University of Hong Kong (Pokfulam, Hong Kong) |
Citation | Zhang, B. [张铂翼]. (2016). A model-based approach to the recovery of non-rigid shape from an image sequence. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. Retrieved from http://dx.doi.org/10.5353/th_b5760955 |
Abstract | The Non-Rigid Structure from Motion (NRSFM) problem is a challenging problem in computer vision aiming at recovering the deforming shape of a flexible 3D object from a sequence of 2D image measurements, such as an expressive face, a human body under motion, or a moving robot arm. It is however an ill-posed problem with more unknown variables than inputs whose solution requires further regulation by imposing additional constraints. Two major constraints that have been utilized by most recent works devoted to this problem are the low-rank condition and the articulated model. Similar to all existing works in this area, the camera model is assumed to be orthographic in this thesis.
This thesis introduces the Small Deformation from Average Shape (SDFAS) condition to remove the ill-posedness of the NRSFM problem. This condition is fundamental to the definition and estimation of the camera motion and the average shape. It is shown to be a sufficient but not necessary condition of the low-rank condition. Our analysis indicates that many existing methods that claim to be based on the low-rank condition in fact implicitly rely on the SDFAS condition, and the commonly assumed low-rank condition alone is not sufficient to guarantee these existing low-rank methods to work. We then developed two new approaches to the NRSFM problem, namely the blend shape method and the ellipsoid fitting method, for non-rigid shapes that may or may not satisfy the SDFAS condition.
The blend shape method is proposed to recover non-rigid structures satisfying the SDFAS condition by modeling them as a linear combination of blend shapes. In our blend shape method, a pseudo view is introduced to suppress distortion of the estimated blend shapes in the direction of the camera axis, such that the blend shapes are guaranteed to be 3D shapes with clear physical meaning. This gives the algorithm an advantage of being robust against overfitting compared with other existing low-rank methods.
For non-rigid structures not satisfying the SDFAS condition, the ellipsoid fitting method is proposed for datasets that can be described by an articulated model. We first revealed that points belonging to a rigid subset must satisfy the ellipsoid property, and design an efficient algorithm to apply this property to segment the feature points into different rigid subsets. The recovered rigid subsets are then linked as a kinematic chain to reconstruct the 3D articulated structure.
The blend shape method and the ellipsoid fitting method are combined into a hybrid method to achieve refined results for articulated structures whose individual links are not perfectly rigid but may undergo small deformations such as motion of the human body. The hybrid method is applicable to datasets satisfying the SDFAS condition and those that fit to the articulated model. The effectiveness of all the proposed methods is demonstrated by experiments on both synthetic data and real data, with comparisons with existing methods. |
Degree | Doctor of Philosophy |
Subject | Three-dimensional imaging Computer vision Image reconstruction |
Dept/Program | Electrical and Electronic Engineering |
Persistent Identifier | http://hdl.handle.net/10722/226745 |
HKU Library Item ID | b5760955 |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Zhang, Boyi | - |
dc.contributor.author | 张铂翼 | - |
dc.date.accessioned | 2016-06-30T04:24:02Z | - |
dc.date.available | 2016-06-30T04:24:02Z | - |
dc.date.issued | 2016 | - |
dc.identifier.citation | Zhang, B. [张铂翼]. (2016). A model-based approach to the recovery of non-rigid shape from an image sequence. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. Retrieved from http://dx.doi.org/10.5353/th_b5760955 | - |
dc.identifier.uri | http://hdl.handle.net/10722/226745 | - |
dc.description.abstract | The Non-Rigid Structure from Motion (NRSFM) problem is a challenging problem in computer vision aiming at recovering the deforming shape of a flexible 3D object from a sequence of 2D image measurements, such as an expressive face, a human body under motion, or a moving robot arm. It is however an ill-posed problem with more unknown variables than inputs whose solution requires further regulation by imposing additional constraints. Two major constraints that have been utilized by most recent works devoted to this problem are the low-rank condition and the articulated model. Similar to all existing works in this area, the camera model is assumed to be orthographic in this thesis. This thesis introduces the Small Deformation from Average Shape (SDFAS) condition to remove the ill-posedness of the NRSFM problem. This condition is fundamental to the definition and estimation of the camera motion and the average shape. It is shown to be a sufficient but not necessary condition of the low-rank condition. Our analysis indicates that many existing methods that claim to be based on the low-rank condition in fact implicitly rely on the SDFAS condition, and the commonly assumed low-rank condition alone is not sufficient to guarantee these existing low-rank methods to work. We then developed two new approaches to the NRSFM problem, namely the blend shape method and the ellipsoid fitting method, for non-rigid shapes that may or may not satisfy the SDFAS condition. The blend shape method is proposed to recover non-rigid structures satisfying the SDFAS condition by modeling them as a linear combination of blend shapes. In our blend shape method, a pseudo view is introduced to suppress distortion of the estimated blend shapes in the direction of the camera axis, such that the blend shapes are guaranteed to be 3D shapes with clear physical meaning. This gives the algorithm an advantage of being robust against overfitting compared with other existing low-rank methods. For non-rigid structures not satisfying the SDFAS condition, the ellipsoid fitting method is proposed for datasets that can be described by an articulated model. We first revealed that points belonging to a rigid subset must satisfy the ellipsoid property, and design an efficient algorithm to apply this property to segment the feature points into different rigid subsets. The recovered rigid subsets are then linked as a kinematic chain to reconstruct the 3D articulated structure. The blend shape method and the ellipsoid fitting method are combined into a hybrid method to achieve refined results for articulated structures whose individual links are not perfectly rigid but may undergo small deformations such as motion of the human body. The hybrid method is applicable to datasets satisfying the SDFAS condition and those that fit to the articulated model. The effectiveness of all the proposed methods is demonstrated by experiments on both synthetic data and real data, with comparisons with existing methods. | - |
dc.language | eng | - |
dc.publisher | The University of Hong Kong (Pokfulam, Hong Kong) | - |
dc.relation.ispartof | HKU Theses Online (HKUTO) | - |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.rights | The author retains all proprietary rights, (such as patent rights) and the right to use in future works. | - |
dc.subject.lcsh | Three-dimensional imaging | - |
dc.subject.lcsh | Computer vision | - |
dc.subject.lcsh | Image reconstruction | - |
dc.title | A model-based approach to the recovery of non-rigid shape from an image sequence | - |
dc.type | PG_Thesis | - |
dc.identifier.hkul | b5760955 | - |
dc.description.thesisname | Doctor of Philosophy | - |
dc.description.thesislevel | Doctoral | - |
dc.description.thesisdiscipline | Electrical and Electronic Engineering | - |
dc.description.nature | published_or_final_version | - |
dc.identifier.doi | 10.5353/th_b5760955 | - |
dc.identifier.mmsid | 991019897629703414 | - |