File Download
Supplementary
-
Citations:
- Appears in Collections:
postgraduate thesis: Generic object locating and segmentation based on deep convolutional neural networks and level-set methods
Title | Generic object locating and segmentation based on deep convolutional neural networks and level-set methods |
---|---|
Authors | |
Advisors | Advisor(s):Yu, Y |
Issue Date | 2018 |
Publisher | The University of Hong Kong (Pokfulam, Hong Kong) |
Citation | Wu, K. [吴侃]. (2018). Generic object locating and segmentation based on deep convolutional neural networks and level-set methods. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. |
Abstract | The problem of object locating and segmentation within daily images is closely related to many computer vision and image editing applications, such as object detection and tracking, context-based image retrieval, and data-driven scene synthesis, etc. In this thesis, the topic is discussed regarding two important techniques: quality assessment of object proposals and automatic segmentation of foreground objects.
The first part of the thesis presents a generic pipeline for high-quality object discovery over internet images. The pipeline is built around dense proposal generation and object quality assessment by deep convolutional neural networks. The proposals are generated using state-of-the-art methods, and are further re-ranked by a quality assessment network. The network takes a given image and its object proposals as the input, and outputs quality scores, with high values indicating good quality and low values for bad quality. In this work, the concepts of completeness and fullness are introduced as major criteria for quality assessment, and are used in training the network. It is shown, through extensive experiments, the performance of existing object proposal generators can be significantly improved by re-ranking their generated proposals. It is also evident that a good combination of both region and edge features extracted from pre-trained deep convolutional neural networks makes quality assessment more reliable, compared to traditional methods using features from a single network.
In the second part of the thesis, automatic foreground object segmentation is discussed. Given good-quality object proposal windows, the segmentation of foreground objects within them can be automated with the help of a high-performance saliency detector and carefully designed segmentation procedures. The experiments conducted in this work suggest that saliency maps, as strong clues for the locations of foreground objects, can be used to initialize the segmentation, removing the need for tedious user interaction. A multi-pass level-set method based on multi-scale region and boundary features are proposed for overcoming possible initialization inaccuracy and obtaining acceptable segmentation results. |
Degree | Doctor of Philosophy |
Subject | Neural networks (Computer science) Level set methods |
Dept/Program | Computer Science |
Persistent Identifier | http://hdl.handle.net/10722/265339 |
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Yu, Y | - |
dc.contributor.author | Wu, Kan | - |
dc.contributor.author | 吴侃 | - |
dc.date.accessioned | 2018-11-29T06:22:20Z | - |
dc.date.available | 2018-11-29T06:22:20Z | - |
dc.date.issued | 2018 | - |
dc.identifier.citation | Wu, K. [吴侃]. (2018). Generic object locating and segmentation based on deep convolutional neural networks and level-set methods. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. | - |
dc.identifier.uri | http://hdl.handle.net/10722/265339 | - |
dc.description.abstract | The problem of object locating and segmentation within daily images is closely related to many computer vision and image editing applications, such as object detection and tracking, context-based image retrieval, and data-driven scene synthesis, etc. In this thesis, the topic is discussed regarding two important techniques: quality assessment of object proposals and automatic segmentation of foreground objects. The first part of the thesis presents a generic pipeline for high-quality object discovery over internet images. The pipeline is built around dense proposal generation and object quality assessment by deep convolutional neural networks. The proposals are generated using state-of-the-art methods, and are further re-ranked by a quality assessment network. The network takes a given image and its object proposals as the input, and outputs quality scores, with high values indicating good quality and low values for bad quality. In this work, the concepts of completeness and fullness are introduced as major criteria for quality assessment, and are used in training the network. It is shown, through extensive experiments, the performance of existing object proposal generators can be significantly improved by re-ranking their generated proposals. It is also evident that a good combination of both region and edge features extracted from pre-trained deep convolutional neural networks makes quality assessment more reliable, compared to traditional methods using features from a single network. In the second part of the thesis, automatic foreground object segmentation is discussed. Given good-quality object proposal windows, the segmentation of foreground objects within them can be automated with the help of a high-performance saliency detector and carefully designed segmentation procedures. The experiments conducted in this work suggest that saliency maps, as strong clues for the locations of foreground objects, can be used to initialize the segmentation, removing the need for tedious user interaction. A multi-pass level-set method based on multi-scale region and boundary features are proposed for overcoming possible initialization inaccuracy and obtaining acceptable segmentation results. | - |
dc.language | eng | - |
dc.publisher | The University of Hong Kong (Pokfulam, Hong Kong) | - |
dc.relation.ispartof | HKU Theses Online (HKUTO) | - |
dc.rights | The author retains all proprietary rights, (such as patent rights) and the right to use in future works. | - |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.subject.lcsh | Neural networks (Computer science) | - |
dc.subject.lcsh | Level set methods | - |
dc.title | Generic object locating and segmentation based on deep convolutional neural networks and level-set methods | - |
dc.type | PG_Thesis | - |
dc.description.thesisname | Doctor of Philosophy | - |
dc.description.thesislevel | Doctoral | - |
dc.description.thesisdiscipline | Computer Science | - |
dc.description.nature | published_or_final_version | - |
dc.identifier.doi | 10.5353/th_991044058184303414 | - |
dc.date.hkucongregation | 2018 | - |
dc.identifier.mmsid | 991044058184303414 | - |