File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: RGB-D Object Recognition via Incorporating Latent Data Structure and Prior Knowledge

TitleRGB-D Object Recognition via Incorporating Latent Data Structure and Prior Knowledge
Authors
KeywordsDeep learning
RGB-D object recognition
Transfer learning
Issue Date2015
Citation
IEEE Transactions on Multimedia, 2015, v. 17, n. 11, p. 1899-1908 How to Cite?
AbstractFor the task of RGB-D object recognition, it is important to identify suitable representations of images, which can boost the performance of object recognition. In this work, we propose a novel representation learning method for RGB-D images by jointly incorporating the underlying data structure and the prior knowledge of the data. Specifically, the convolutional neural networks (CNN) are employed to learn image representation by exploiting the underlying data structure. To handle the problem of the limited RGB and depth images for object recognition, the multi-level hierarchies of features trained on ImageNet from the CNN are transferred to learn rich generic feature representation for RGB and depth images while the labeled images are leveraged. On the other hand, we propose a novel deep auto-encoders (DAE) to exploit the prior knowledge, which can overcome the expensive computational cost of optimization in feature encoding. The expected representations of images are obtained by integrating the two types of image representations. To verify the effectiveness of the proposed method, we thoroughly conduct extensive experiments on two publicly available RGB-D datasets. The encouraging experimental results compared with the state-of-the-art approaches demonstrate the advantages of the proposed method.
Persistent Identifierhttp://hdl.handle.net/10722/345082
ISSN
2023 Impact Factor: 8.4
2023 SCImago Journal Rankings: 2.260

 

DC FieldValueLanguage
dc.contributor.authorTang, Jinhui-
dc.contributor.authorJin, Lu-
dc.contributor.authorLi, Zechao-
dc.contributor.authorGao, Shenghua-
dc.date.accessioned2024-08-15T09:25:07Z-
dc.date.available2024-08-15T09:25:07Z-
dc.date.issued2015-
dc.identifier.citationIEEE Transactions on Multimedia, 2015, v. 17, n. 11, p. 1899-1908-
dc.identifier.issn1520-9210-
dc.identifier.urihttp://hdl.handle.net/10722/345082-
dc.description.abstractFor the task of RGB-D object recognition, it is important to identify suitable representations of images, which can boost the performance of object recognition. In this work, we propose a novel representation learning method for RGB-D images by jointly incorporating the underlying data structure and the prior knowledge of the data. Specifically, the convolutional neural networks (CNN) are employed to learn image representation by exploiting the underlying data structure. To handle the problem of the limited RGB and depth images for object recognition, the multi-level hierarchies of features trained on ImageNet from the CNN are transferred to learn rich generic feature representation for RGB and depth images while the labeled images are leveraged. On the other hand, we propose a novel deep auto-encoders (DAE) to exploit the prior knowledge, which can overcome the expensive computational cost of optimization in feature encoding. The expected representations of images are obtained by integrating the two types of image representations. To verify the effectiveness of the proposed method, we thoroughly conduct extensive experiments on two publicly available RGB-D datasets. The encouraging experimental results compared with the state-of-the-art approaches demonstrate the advantages of the proposed method.-
dc.languageeng-
dc.relation.ispartofIEEE Transactions on Multimedia-
dc.subjectDeep learning-
dc.subjectRGB-D object recognition-
dc.subjectTransfer learning-
dc.titleRGB-D Object Recognition via Incorporating Latent Data Structure and Prior Knowledge-
dc.typeArticle-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1109/TMM.2015.2476660-
dc.identifier.scopuseid_2-s2.0-84946732106-
dc.identifier.volume17-
dc.identifier.issue11-
dc.identifier.spage1899-
dc.identifier.epage1908-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats