
postgraduate thesis: Image interpolation and image-based content summary

Title: Image interpolation and image-based content summary
Authors: Jing, Guangmei (井光美)
Issue Date: 2015
Publisher: The University of Hong Kong (Pokfulam, Hong Kong)
Citation: Jing, G. [井光美]. (2015). Image interpolation and image-based content summary. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. Retrieved from http://dx.doi.org/10.5353/th_b5699926
Abstract: With the prevalence of smartphones and modern digital cameras, people have access to digital photo collections and videos more often than before. Organizing and browsing a considerable amount of image data has become a challenging problem. This thesis mainly focuses on gradient-guided image interpolation for single images and on compact summarization of videos and photo collections using geometry-based methods. The thesis first investigates the problem of image enlargement for a single image, which aims to improve image details when zooming. A novel edge prior is proposed for image interpolation, which assumes that the variation of image intensity away from an edge is locally similar along the edge. This new approach performs significantly better than several representative edge-directed image interpolation methods in terms of PSNR and SSIM, as well as subjective visual quality, in most of the test cases. The second part of this thesis studies comic-style presentation for videos. A new approach is introduced that conveniently converts conversational videos into comics with a manga-style layout in a content-driven manner. The main components that constitute a visually pleasing comic page, including panels and word balloons, are intelligently organized. The information contained in a comic page is qualitatively measured, and an efficient Markov chain Monte Carlo sampling algorithm is designed for the proposed optimization. A user study demonstrates that users much prefer the produced manga-style comics to purely Western-style comics. Extensive experiments and comparisons against previous work also verify the effectiveness of the proposed approach. The third part of this thesis presents a new framework for producing a visually appealing photo collage from a photo collection. The core of this framework is a novel divide-and-conquer photo collage approach. While conventional photo collage methods are often built upon complex non-linear optimization systems, the proposed approach divides the canvas from a geometric point of view and assigns each image an independent display area in a purely saliency-driven manner. To seek an optimal partition of the canvas, a new region partition algorithm that accepts irregular salient image regions as input has been developed. Each of the subregions resulting from the partition fits its corresponding salient region well. This leads to better utilization of the limited canvas space, thus yielding a more compact yet informative collage. Moreover, a seam-carving-based scheme is devised to complete the collage result. Extensive experiments and comparisons against state-of-the-art methods are conducted. A user study shows that the generated collages outperform those of competing methods.
Degree: Doctor of Philosophy
Subject: Digital images; Interpolation
Dept/Program: Computer Science
Persistent Identifier: http://hdl.handle.net/10722/223040
HKU Library Item ID: b5699926

 

DC Field | Value | Language
dc.contributor.author | Jing, Guangmei | -
dc.contributor.author | 井光美 | -
dc.date.accessioned | 2016-02-17T23:14:38Z | -
dc.date.available | 2016-02-17T23:14:38Z | -
dc.date.issued | 2015 | -
dc.identifier.citation | Jing, G. [井光美]. (2015). Image interpolation and image-based content summary. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. Retrieved from http://dx.doi.org/10.5353/th_b5699926 | -
dc.identifier.uri | http://hdl.handle.net/10722/223040 | -
dc.language | eng | -
dc.publisher | The University of Hong Kong (Pokfulam, Hong Kong) | -
dc.relation.ispartof | HKU Theses Online (HKUTO) | -
dc.rights | The author retains all proprietary rights, (such as patent rights) and the right to use in future works. | -
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | -
dc.subject.lcsh | Digital images | -
dc.subject.lcsh | Interpolation | -
dc.title | Image interpolation and image-based content summary | -
dc.type | PG_Thesis | -
dc.identifier.hkul | b5699926 | -
dc.description.thesisname | Doctor of Philosophy | -
dc.description.thesislevel | Doctoral | -
dc.description.thesisdiscipline | Computer Science | -
dc.description.nature | published_or_final_version | -
dc.identifier.doi | 10.5353/th_b5699926 | -
dc.identifier.mmsid | 991018966789703414 | -
