File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1007/978-3-319-77380-3_3
- Scopus: eid_2-s2.0-85047464475
- WOS: WOS:000460422000003
- Find via
Supplementary
- Citations:
- Appears in Collections:
Conference Paper: Image tagging by joint deep visual-semantic propagation
Title | Image tagging by joint deep visual-semantic propagation |
---|---|
Authors | |
Keywords | CNN-LSTM Image tagging Visual-semantic |
Issue Date | 2018 |
Publisher | Springer. |
Citation | 18th Pacific Rim Conference on Multimedia, Harbin, China, 28-29 September 2017. In Advances in Multimedia Information Processing – PCM 2017:18th Pacific-Rim Conference on Multimedia, Harbin, China, September 28-29, 2017, Revised Selected Papers, Part I, p. 25-35. Cham: Springer, 2018 How to Cite? |
Abstract | Image tagging has attracted much research interest due to its wide applications. Many existing methods have gained impressive results, however, they have two main limitations: (1) only focus on tagging images, but ignore the tags’ influences on visual feature modeling. (2) model the tag correlation without considering visual contents of image. In this paper, we propose a joint visual-semantic propagation model (JVSP) to address these two issues. First, we leverage a joint visual-semantic modeling to harvest integrated features which can accurately reflect the relationship between tags and image regions. Second, we introduce a visual-guided LSTM to capture the co-occurrence relation of the tags. Third, we also design a diversity loss to enforce that our model learns to focus on different regions. Experimental results on three challenging datasets demonstrate that our proposed method leads to significant performance gains over existing methods. |
Persistent Identifier | http://hdl.handle.net/10722/307235 |
ISBN | |
ISSN | 2023 SCImago Journal Rankings: 0.606 |
ISI Accession Number ID | |
Series/Report no. | Lecture Notes in Computer Science ; 10735 |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Ma, Yuexin | - |
dc.contributor.author | Zhu, Xinge | - |
dc.contributor.author | Sun, Yujing | - |
dc.contributor.author | Yan, Bingzheng | - |
dc.date.accessioned | 2021-11-03T06:22:12Z | - |
dc.date.available | 2021-11-03T06:22:12Z | - |
dc.date.issued | 2018 | - |
dc.identifier.citation | 18th Pacific Rim Conference on Multimedia, Harbin, China, 28-29 September 2017. In Advances in Multimedia Information Processing – PCM 2017:18th Pacific-Rim Conference on Multimedia, Harbin, China, September 28-29, 2017, Revised Selected Papers, Part I, p. 25-35. Cham: Springer, 2018 | - |
dc.identifier.isbn | 9783319773797 | - |
dc.identifier.issn | 0302-9743 | - |
dc.identifier.uri | http://hdl.handle.net/10722/307235 | - |
dc.description.abstract | Image tagging has attracted much research interest due to its wide applications. Many existing methods have gained impressive results, however, they have two main limitations: (1) only focus on tagging images, but ignore the tags’ influences on visual feature modeling. (2) model the tag correlation without considering visual contents of image. In this paper, we propose a joint visual-semantic propagation model (JVSP) to address these two issues. First, we leverage a joint visual-semantic modeling to harvest integrated features which can accurately reflect the relationship between tags and image regions. Second, we introduce a visual-guided LSTM to capture the co-occurrence relation of the tags. Third, we also design a diversity loss to enforce that our model learns to focus on different regions. Experimental results on three challenging datasets demonstrate that our proposed method leads to significant performance gains over existing methods. | - |
dc.language | eng | - |
dc.publisher | Springer. | - |
dc.relation.ispartof | Advances in Multimedia Information Processing – PCM 2017:18th Pacific-Rim Conference on Multimedia, Harbin, China, September 28-29, 2017, Revised Selected Papers, Part I | - |
dc.relation.ispartofseries | Lecture Notes in Computer Science ; 10735 | - |
dc.subject | CNN-LSTM | - |
dc.subject | Image tagging | - |
dc.subject | Visual-semantic | - |
dc.title | Image tagging by joint deep visual-semantic propagation | - |
dc.type | Conference_Paper | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1007/978-3-319-77380-3_3 | - |
dc.identifier.scopus | eid_2-s2.0-85047464475 | - |
dc.identifier.spage | 25 | - |
dc.identifier.epage | 35 | - |
dc.identifier.eissn | 1611-3349 | - |
dc.identifier.isi | WOS:000460422000003 | - |
dc.publisher.place | Cham | - |