Image tagging by joint deep visual-semantic propagation

Ma, Yuexin; Zhu, Xinge; Sun, Yujing; Yan, Bingzheng

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1007/978-3-319-77380-3_3
Scopus: eid_2-s2.0-85047464475
WOS: WOS:000460422000003
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
Appears in Collections:
- Computer Science: Conference papers

Conference Paper: Image tagging by joint deep visual-semantic propagation

Title	Image tagging by joint deep visual-semantic propagation
Authors	Ma, Yuexin Zhu, Xinge Sun, Yujing Yan, Bingzheng
Keywords	CNN-LSTM Image tagging Visual-semantic
Issue Date	2018
Publisher	Springer.
Citation	18th Pacific Rim Conference on Multimedia, Harbin, China, 28-29 September 2017. In Advances in Multimedia Information Processing – PCM 2017:18th Pacific-Rim Conference on Multimedia, Harbin, China, September 28-29, 2017, Revised Selected Papers, Part I, p. 25-35. Cham: Springer, 2018 How to Cite? DOI: http://dx.doi.org/10.1007/978-3-319-77380-3_3
Abstract	Image tagging has attracted much research interest due to its wide applications. Many existing methods have gained impressive results, however, they have two main limitations: (1) only focus on tagging images, but ignore the tags’ influences on visual feature modeling. (2) model the tag correlation without considering visual contents of image. In this paper, we propose a joint visual-semantic propagation model (JVSP) to address these two issues. First, we leverage a joint visual-semantic modeling to harvest integrated features which can accurately reflect the relationship between tags and image regions. Second, we introduce a visual-guided LSTM to capture the co-occurrence relation of the tags. Third, we also design a diversity loss to enforce that our model learns to focus on different regions. Experimental results on three challenging datasets demonstrate that our proposed method leads to significant performance gains over existing methods.
Persistent Identifier	http://hdl.handle.net/10722/307235
ISBN	9783319773797
ISSN	0302-9743 2023 SCImago Journal Rankings: 0.606
ISI Accession Number ID	WOS:000460422000003
Series/Report no.	Lecture Notes in Computer Science ; 10735

DC Field	Value	Language
dc.contributor.author	Ma, Yuexin	-
dc.contributor.author	Zhu, Xinge	-
dc.contributor.author	Sun, Yujing	-
dc.contributor.author	Yan, Bingzheng	-
dc.date.accessioned	2021-11-03T06:22:12Z	-
dc.date.available	2021-11-03T06:22:12Z	-
dc.date.issued	2018	-
dc.identifier.citation	18th Pacific Rim Conference on Multimedia, Harbin, China, 28-29 September 2017. In Advances in Multimedia Information Processing – PCM 2017:18th Pacific-Rim Conference on Multimedia, Harbin, China, September 28-29, 2017, Revised Selected Papers, Part I, p. 25-35. Cham: Springer, 2018	-
dc.identifier.isbn	9783319773797	-
dc.identifier.issn	0302-9743	-
dc.identifier.uri	http://hdl.handle.net/10722/307235	-
dc.description.abstract	Image tagging has attracted much research interest due to its wide applications. Many existing methods have gained impressive results, however, they have two main limitations: (1) only focus on tagging images, but ignore the tags’ influences on visual feature modeling. (2) model the tag correlation without considering visual contents of image. In this paper, we propose a joint visual-semantic propagation model (JVSP) to address these two issues. First, we leverage a joint visual-semantic modeling to harvest integrated features which can accurately reflect the relationship between tags and image regions. Second, we introduce a visual-guided LSTM to capture the co-occurrence relation of the tags. Third, we also design a diversity loss to enforce that our model learns to focus on different regions. Experimental results on three challenging datasets demonstrate that our proposed method leads to significant performance gains over existing methods.	-
dc.language	eng	-
dc.publisher	Springer.	-
dc.relation.ispartof	Advances in Multimedia Information Processing – PCM 2017:18th Pacific-Rim Conference on Multimedia, Harbin, China, September 28-29, 2017, Revised Selected Papers, Part I	-
dc.relation.ispartofseries	Lecture Notes in Computer Science ; 10735	-
dc.subject	CNN-LSTM	-
dc.subject	Image tagging	-
dc.subject	Visual-semantic	-
dc.title	Image tagging by joint deep visual-semantic propagation	-
dc.type	Conference_Paper	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1007/978-3-319-77380-3_3	-
dc.identifier.scopus	eid_2-s2.0-85047464475	-
dc.identifier.spage	25	-
dc.identifier.epage	35	-
dc.identifier.eissn	1611-3349	-
dc.identifier.isi	WOS:000460422000003	-
dc.publisher.place	Cham	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Conference Paper: Image tagging by joint deep visual-semantic propagation

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats