Weakly Supervised Complementary Parts Models for Fine-Grained Image Classification from the Bottom Up

GE, W; LIN, X; Yu, Y

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1109/CVPR.2019.00315
WOS: WOS:000529484003021

Supplementary

Citations:
- Web of Science: 0
Appears in Collections:
- Computer Science: Conference papers

Conference Paper: Weakly Supervised Complementary Parts Models for Fine-Grained Image Classification from the Bottom Up

Title	Weakly Supervised Complementary Parts Models for Fine-Grained Image Classification from the Bottom Up
Authors	GE, W LIN, X Yu, Y
Issue Date	2019
Publisher	Institute of Electrical and Electronics Engineers, Inc..
Citation	IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15-20 June, 2019. In Proceedings: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition: CVPR 2019 How to Cite? DOI: http://dx.doi.org/10.1109/CVPR.2019.00315
Abstract	Given a training dataset composed of images and corresponding category labels, deep convolutional neural networks show a strong ability in mining discriminative parts for image classification. However, deep convolutional neural networks trained with image level labels only tend to focus on the most discriminative parts while missing other object parts, which could provide complementary information. In this paper, we approach this problem from a different perspective. We build complementary parts models in a weakly supervised manner to retrieve information suppressed by dominant object parts detected by convolutional neural networks. Given image level labels only, we first extract rough object instances by performing weakly supervised object detection and instance segmentation using Mask R-CNN and CRF-based segmentation. Then we estimate and search for the best parts model for each object instance under the principle of preserving as much diversity as possible. In the last stage, we build a bi-directional long short-term memory (LSTM) network to fuze and encode the partial information of these complementary parts into a comprehensive feature for image classification. Experimental results indicate that the proposed method not only achieves significant improvement over our baseline models, but also outperforms state-of-the-art algorithms by a large margin (6.7%, 2.8%, 5.2% respectively) on Stanford Dogs 120, Caltech-UCSD Birds 2011-200 and Caltech 256.
Persistent Identifier	http://hdl.handle.net/10722/316285
ISBN	9781728132938
ISI Accession Number ID	WOS:000529484003021

DC Field	Value	Language
dc.contributor.author	GE, W	-
dc.contributor.author	LIN, X	-
dc.contributor.author	Yu, Y	-
dc.date.accessioned	2022-09-02T06:08:47Z	-
dc.date.available	2022-09-02T06:08:47Z	-
dc.date.issued	2019	-
dc.identifier.citation	IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15-20 June, 2019. In Proceedings: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition: CVPR 2019	-
dc.identifier.isbn	9781728132938	-
dc.identifier.uri	http://hdl.handle.net/10722/316285	-
dc.description.abstract	Given a training dataset composed of images and corresponding category labels, deep convolutional neural networks show a strong ability in mining discriminative parts for image classification. However, deep convolutional neural networks trained with image level labels only tend to focus on the most discriminative parts while missing other object parts, which could provide complementary information. In this paper, we approach this problem from a different perspective. We build complementary parts models in a weakly supervised manner to retrieve information suppressed by dominant object parts detected by convolutional neural networks. Given image level labels only, we first extract rough object instances by performing weakly supervised object detection and instance segmentation using Mask R-CNN and CRF-based segmentation. Then we estimate and search for the best parts model for each object instance under the principle of preserving as much diversity as possible. In the last stage, we build a bi-directional long short-term memory (LSTM) network to fuze and encode the partial information of these complementary parts into a comprehensive feature for image classification. Experimental results indicate that the proposed method not only achieves significant improvement over our baseline models, but also outperforms state-of-the-art algorithms by a large margin (6.7%, 2.8%, 5.2% respectively) on Stanford Dogs 120, Caltech-UCSD Birds 2011-200 and Caltech 256.	-
dc.language	eng	-
dc.publisher	Institute of Electrical and Electronics Engineers, Inc..	-
dc.relation.ispartof	Proceedings: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition: CVPR 2019	-
dc.title	Weakly Supervised Complementary Parts Models for Fine-Grained Image Classification from the Bottom Up	-
dc.type	Conference_Paper	-
dc.identifier.email	Yu, Y: yzyu@cs.hku.hk	-
dc.identifier.authority	Yu, Y=rp01415	-
dc.identifier.doi	10.1109/CVPR.2019.00315	-
dc.identifier.hkuros	336349	-
dc.identifier.spage	760	-
dc.identifier.epage	769	-
dc.identifier.isi	WOS:000529484003021	-
dc.publisher.place	United States	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Conference Paper: Weakly Supervised Complementary Parts Models for Fine-Grained Image Classification from the Bottom Up

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats