Generic object locating and segmentation based on deep convolutional neural networks and level-set methods

Wu, Kan; 吴侃

File Download

FullText.pdf

Links for fulltext

(May Require Subscription)

DOI: 10.5353/th_991044058184303414

Supplementary

Citations:
Appears in Collections:
- HKU Theses Online
- Computer Science: Theses

postgraduate thesis: Generic object locating and segmentation based on deep convolutional neural networks and level-set methods

Title	Generic object locating and segmentation based on deep convolutional neural networks and level-set methods
Authors	Wu, Kan 吴侃
Advisors	Advisor(s):Yu, Y
Issue Date	2018
Publisher	The University of Hong Kong (Pokfulam, Hong Kong)
Citation	Wu, K. [吴侃]. (2018). Generic object locating and segmentation based on deep convolutional neural networks and level-set methods. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.
Abstract	The problem of object locating and segmentation within daily images is closely related to many computer vision and image editing applications, such as object detection and tracking, context-based image retrieval, and data-driven scene synthesis, etc. In this thesis, the topic is discussed regarding two important techniques: quality assessment of object proposals and automatic segmentation of foreground objects. The first part of the thesis presents a generic pipeline for high-quality object discovery over internet images. The pipeline is built around dense proposal generation and object quality assessment by deep convolutional neural networks. The proposals are generated using state-of-the-art methods, and are further re-ranked by a quality assessment network. The network takes a given image and its object proposals as the input, and outputs quality scores, with high values indicating good quality and low values for bad quality. In this work, the concepts of completeness and fullness are introduced as major criteria for quality assessment, and are used in training the network. It is shown, through extensive experiments, the performance of existing object proposal generators can be significantly improved by re-ranking their generated proposals. It is also evident that a good combination of both region and edge features extracted from pre-trained deep convolutional neural networks makes quality assessment more reliable, compared to traditional methods using features from a single network. In the second part of the thesis, automatic foreground object segmentation is discussed. Given good-quality object proposal windows, the segmentation of foreground objects within them can be automated with the help of a high-performance saliency detector and carefully designed segmentation procedures. The experiments conducted in this work suggest that saliency maps, as strong clues for the locations of foreground objects, can be used to initialize the segmentation, removing the need for tedious user interaction. A multi-pass level-set method based on multi-scale region and boundary features are proposed for overcoming possible initialization inaccuracy and obtaining acceptable segmentation results.
Degree	Doctor of Philosophy
Subject	Neural networks (Computer science) Level set methods
Dept/Program	Computer Science
Persistent Identifier	http://hdl.handle.net/10722/265339

DC Field	Value	Language
dc.contributor.advisor	Yu, Y	-
dc.contributor.author	Wu, Kan	-
dc.contributor.author	吴侃	-
dc.date.accessioned	2018-11-29T06:22:20Z	-
dc.date.available	2018-11-29T06:22:20Z	-
dc.date.issued	2018	-
dc.identifier.citation	Wu, K. [吴侃]. (2018). Generic object locating and segmentation based on deep convolutional neural networks and level-set methods. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.	-
dc.identifier.uri	http://hdl.handle.net/10722/265339	-
dc.description.abstract	The problem of object locating and segmentation within daily images is closely related to many computer vision and image editing applications, such as object detection and tracking, context-based image retrieval, and data-driven scene synthesis, etc. In this thesis, the topic is discussed regarding two important techniques: quality assessment of object proposals and automatic segmentation of foreground objects. The first part of the thesis presents a generic pipeline for high-quality object discovery over internet images. The pipeline is built around dense proposal generation and object quality assessment by deep convolutional neural networks. The proposals are generated using state-of-the-art methods, and are further re-ranked by a quality assessment network. The network takes a given image and its object proposals as the input, and outputs quality scores, with high values indicating good quality and low values for bad quality. In this work, the concepts of completeness and fullness are introduced as major criteria for quality assessment, and are used in training the network. It is shown, through extensive experiments, the performance of existing object proposal generators can be significantly improved by re-ranking their generated proposals. It is also evident that a good combination of both region and edge features extracted from pre-trained deep convolutional neural networks makes quality assessment more reliable, compared to traditional methods using features from a single network. In the second part of the thesis, automatic foreground object segmentation is discussed. Given good-quality object proposal windows, the segmentation of foreground objects within them can be automated with the help of a high-performance saliency detector and carefully designed segmentation procedures. The experiments conducted in this work suggest that saliency maps, as strong clues for the locations of foreground objects, can be used to initialize the segmentation, removing the need for tedious user interaction. A multi-pass level-set method based on multi-scale region and boundary features are proposed for overcoming possible initialization inaccuracy and obtaining acceptable segmentation results.	-
dc.language	eng	-
dc.publisher	The University of Hong Kong (Pokfulam, Hong Kong)	-
dc.relation.ispartof	HKU Theses Online (HKUTO)	-
dc.rights	The author retains all proprietary rights, (such as patent rights) and the right to use in future works.	-
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.subject.lcsh	Neural networks (Computer science)	-
dc.subject.lcsh	Level set methods	-
dc.title	Generic object locating and segmentation based on deep convolutional neural networks and level-set methods	-
dc.type	PG_Thesis	-
dc.description.thesisname	Doctor of Philosophy	-
dc.description.thesislevel	Doctoral	-
dc.description.thesisdiscipline	Computer Science	-
dc.description.nature	published_or_final_version	-
dc.identifier.doi	10.5353/th_991044058184303414	-
dc.date.hkucongregation	2018	-
dc.identifier.mmsid	991044058184303414	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

postgraduate thesis: Generic object locating and segmentation based on deep convolutional neural networks and level-set methods

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats