A model-based approach to the recovery of non-rigid shape from an image sequence

Zhang, Boyi; 张铂翼

File Download

FullText.pdf

Links for fulltext

(May Require Subscription)

DOI: 10.5353/th_b5760955

Supplementary

Citations:
Appears in Collections:
- HKU Theses Online
- Electrical & Electronic Engineering: Theses

postgraduate thesis: A model-based approach to the recovery of non-rigid shape from an image sequence

Title	A model-based approach to the recovery of non-rigid shape from an image sequence
Authors	Zhang, Boyi 张铂翼
Issue Date	2016
Publisher	The University of Hong Kong (Pokfulam, Hong Kong)
Citation	Zhang, B. [张铂翼]. (2016). A model-based approach to the recovery of non-rigid shape from an image sequence. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. Retrieved from http://dx.doi.org/10.5353/th_b5760955
Abstract	The Non-Rigid Structure from Motion (NRSFM) problem is a challenging problem in computer vision aiming at recovering the deforming shape of a flexible 3D object from a sequence of 2D image measurements, such as an expressive face, a human body under motion, or a moving robot arm. It is however an ill-posed problem with more unknown variables than inputs whose solution requires further regulation by imposing additional constraints. Two major constraints that have been utilized by most recent works devoted to this problem are the low-rank condition and the articulated model. Similar to all existing works in this area, the camera model is assumed to be orthographic in this thesis. This thesis introduces the Small Deformation from Average Shape (SDFAS) condition to remove the ill-posedness of the NRSFM problem. This condition is fundamental to the definition and estimation of the camera motion and the average shape. It is shown to be a sufficient but not necessary condition of the low-rank condition. Our analysis indicates that many existing methods that claim to be based on the low-rank condition in fact implicitly rely on the SDFAS condition, and the commonly assumed low-rank condition alone is not sufficient to guarantee these existing low-rank methods to work. We then developed two new approaches to the NRSFM problem, namely the blend shape method and the ellipsoid fitting method, for non-rigid shapes that may or may not satisfy the SDFAS condition. The blend shape method is proposed to recover non-rigid structures satisfying the SDFAS condition by modeling them as a linear combination of blend shapes. In our blend shape method, a pseudo view is introduced to suppress distortion of the estimated blend shapes in the direction of the camera axis, such that the blend shapes are guaranteed to be 3D shapes with clear physical meaning. This gives the algorithm an advantage of being robust against overfitting compared with other existing low-rank methods. For non-rigid structures not satisfying the SDFAS condition, the ellipsoid fitting method is proposed for datasets that can be described by an articulated model. We first revealed that points belonging to a rigid subset must satisfy the ellipsoid property, and design an efficient algorithm to apply this property to segment the feature points into different rigid subsets. The recovered rigid subsets are then linked as a kinematic chain to reconstruct the 3D articulated structure. The blend shape method and the ellipsoid fitting method are combined into a hybrid method to achieve refined results for articulated structures whose individual links are not perfectly rigid but may undergo small deformations such as motion of the human body. The hybrid method is applicable to datasets satisfying the SDFAS condition and those that fit to the articulated model. The effectiveness of all the proposed methods is demonstrated by experiments on both synthetic data and real data, with comparisons with existing methods.
Degree	Doctor of Philosophy
Subject	Three-dimensional imaging Computer vision Image reconstruction
Dept/Program	Electrical and Electronic Engineering
Persistent Identifier	http://hdl.handle.net/10722/226745
HKU Library Item ID	b5760955

DC Field	Value	Language
dc.contributor.author	Zhang, Boyi	-
dc.contributor.author	张铂翼	-
dc.date.accessioned	2016-06-30T04:24:02Z	-
dc.date.available	2016-06-30T04:24:02Z	-
dc.date.issued	2016	-
dc.identifier.citation	Zhang, B. [张铂翼]. (2016). A model-based approach to the recovery of non-rigid shape from an image sequence. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. Retrieved from http://dx.doi.org/10.5353/th_b5760955	-
dc.identifier.uri	http://hdl.handle.net/10722/226745	-
dc.description.abstract	The Non-Rigid Structure from Motion (NRSFM) problem is a challenging problem in computer vision aiming at recovering the deforming shape of a flexible 3D object from a sequence of 2D image measurements, such as an expressive face, a human body under motion, or a moving robot arm. It is however an ill-posed problem with more unknown variables than inputs whose solution requires further regulation by imposing additional constraints. Two major constraints that have been utilized by most recent works devoted to this problem are the low-rank condition and the articulated model. Similar to all existing works in this area, the camera model is assumed to be orthographic in this thesis. This thesis introduces the Small Deformation from Average Shape (SDFAS) condition to remove the ill-posedness of the NRSFM problem. This condition is fundamental to the definition and estimation of the camera motion and the average shape. It is shown to be a sufficient but not necessary condition of the low-rank condition. Our analysis indicates that many existing methods that claim to be based on the low-rank condition in fact implicitly rely on the SDFAS condition, and the commonly assumed low-rank condition alone is not sufficient to guarantee these existing low-rank methods to work. We then developed two new approaches to the NRSFM problem, namely the blend shape method and the ellipsoid fitting method, for non-rigid shapes that may or may not satisfy the SDFAS condition. The blend shape method is proposed to recover non-rigid structures satisfying the SDFAS condition by modeling them as a linear combination of blend shapes. In our blend shape method, a pseudo view is introduced to suppress distortion of the estimated blend shapes in the direction of the camera axis, such that the blend shapes are guaranteed to be 3D shapes with clear physical meaning. This gives the algorithm an advantage of being robust against overfitting compared with other existing low-rank methods. For non-rigid structures not satisfying the SDFAS condition, the ellipsoid fitting method is proposed for datasets that can be described by an articulated model. We first revealed that points belonging to a rigid subset must satisfy the ellipsoid property, and design an efficient algorithm to apply this property to segment the feature points into different rigid subsets. The recovered rigid subsets are then linked as a kinematic chain to reconstruct the 3D articulated structure. The blend shape method and the ellipsoid fitting method are combined into a hybrid method to achieve refined results for articulated structures whose individual links are not perfectly rigid but may undergo small deformations such as motion of the human body. The hybrid method is applicable to datasets satisfying the SDFAS condition and those that fit to the articulated model. The effectiveness of all the proposed methods is demonstrated by experiments on both synthetic data and real data, with comparisons with existing methods.	-
dc.language	eng	-
dc.publisher	The University of Hong Kong (Pokfulam, Hong Kong)	-
dc.relation.ispartof	HKU Theses Online (HKUTO)	-
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.rights	The author retains all proprietary rights, (such as patent rights) and the right to use in future works.	-
dc.subject.lcsh	Three-dimensional imaging	-
dc.subject.lcsh	Computer vision	-
dc.subject.lcsh	Image reconstruction	-
dc.title	A model-based approach to the recovery of non-rigid shape from an image sequence	-
dc.type	PG_Thesis	-
dc.identifier.hkul	b5760955	-
dc.description.thesisname	Doctor of Philosophy	-
dc.description.thesislevel	Doctoral	-
dc.description.thesisdiscipline	Electrical and Electronic Engineering	-
dc.description.nature	published_or_final_version	-
dc.identifier.doi	10.5353/th_b5760955	-
dc.identifier.mmsid	991019897629703414	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

postgraduate thesis: A model-based approach to the recovery of non-rigid shape from an image sequence

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats