File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: Open-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions

TitleOpen-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions
Authors
Issue Date2020
PublisherSpringer
Citation
16th European Conference on Computer Vision (ECCV 2020), Glasgow, UK, August 23-28 2020. In Vedaldi, A, Bischof, H, Brox, T, et al. (Eds), Computer Vision – ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XI, p. 89-106. Cham: Springer, 2020 How to Cite?
AbstractWe propose a novel algorithm, named Open-Edit, which is the first attempt on open-domain image manipulation with open-vocabulary instructions. It is a challenging task considering the large variation of image domains and the lack of training supervision. Our approach takes advantage of the unified visual-semantic embedding space pretrained on a general image-caption dataset, and manipulates the embedded visual features by applying text-guided vector arithmetic on the image feature maps. A structure-preserving image decoder then generates the manipulated images from the manipulated feature maps. We further propose an on-the-fly sample-specific optimization approach with cycle-consistency constraints to regularize the manipulated images and force them to preserve details of the source images. Our approach shows promising results in manipulating open-vocabulary color, texture, and high-level attributes for various scenarios of open-domain images (Code is released at https://github.com/xh-liu/Open-Edit).
Persistent Identifierhttp://hdl.handle.net/10722/316564
ISBN
ISSN
2023 SCImago Journal Rankings: 0.606
Series/Report no.Lecture Notes in Computer Science ; 12356
LNCS Sublibrary. SL 6, Image Processing, Computer Vision, Pattern Recognition, and Graphics

 

DC FieldValueLanguage
dc.contributor.authorLiu, Xihui-
dc.contributor.authorLin, Zhe-
dc.contributor.authorZhang, Jianming-
dc.contributor.authorZhao, Handong-
dc.contributor.authorTran, Quan-
dc.contributor.authorWang, Xiaogang-
dc.contributor.authorLi, Hongsheng-
dc.date.accessioned2022-09-14T11:40:45Z-
dc.date.available2022-09-14T11:40:45Z-
dc.date.issued2020-
dc.identifier.citation16th European Conference on Computer Vision (ECCV 2020), Glasgow, UK, August 23-28 2020. In Vedaldi, A, Bischof, H, Brox, T, et al. (Eds), Computer Vision – ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XI, p. 89-106. Cham: Springer, 2020-
dc.identifier.isbn9783030586201-
dc.identifier.issn0302-9743-
dc.identifier.urihttp://hdl.handle.net/10722/316564-
dc.description.abstractWe propose a novel algorithm, named Open-Edit, which is the first attempt on open-domain image manipulation with open-vocabulary instructions. It is a challenging task considering the large variation of image domains and the lack of training supervision. Our approach takes advantage of the unified visual-semantic embedding space pretrained on a general image-caption dataset, and manipulates the embedded visual features by applying text-guided vector arithmetic on the image feature maps. A structure-preserving image decoder then generates the manipulated images from the manipulated feature maps. We further propose an on-the-fly sample-specific optimization approach with cycle-consistency constraints to regularize the manipulated images and force them to preserve details of the source images. Our approach shows promising results in manipulating open-vocabulary color, texture, and high-level attributes for various scenarios of open-domain images (Code is released at https://github.com/xh-liu/Open-Edit).-
dc.languageeng-
dc.publisherSpringer-
dc.relation.ispartofLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)-
dc.relation.ispartofComputer Vision – ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XI-
dc.relation.ispartofseriesLecture Notes in Computer Science ; 12356-
dc.relation.ispartofseriesLNCS Sublibrary. SL 6, Image Processing, Computer Vision, Pattern Recognition, and Graphics-
dc.titleOpen-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions-
dc.typeConference_Paper-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1007/978-3-030-58621-8_6-
dc.identifier.scopuseid_2-s2.0-85097645652-
dc.identifier.spage89-
dc.identifier.epage106-
dc.identifier.eissn1611-3349-
dc.publisher.placeCham-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats