Feature intertwiner for object detection

Li, Hongyang; Dai, Bo; Shi, Shaoshuai; Ouyang, Wanli; Wang, Xiaogang

Scopus: eid_2-s2.0-85083952656

Citations:
- Scopus: 0
Appears in Collections:
- HKU Musketeers Foundation Institute of Data Science: Conference papers

Conference Paper: Feature intertwiner for object detection

Title	Feature intertwiner for object detection
Authors	Li, Hongyang; Dai, Bo; Shi, Shaoshuai; Ouyang, Wanli; Wang, Xiaogang
Issue Date	2019
Citation	7th International Conference on Learning Representations, ICLR 2019, 2019
Abstract	A well-trained model should classify objects with a unanimous score for every category. This requires the high-level semantic features should be alike among samples, despite a wide span in resolution, texture, deformation, etc. Previous works focus on re-designing the loss function or proposing new regularization constraints on the loss. In this paper, we address this problem via a new perspective. For each category, it is assumed that there are two sets in the feature space: one with more reliable information and the other with a less reliable source. We argue that the reliable set could guide the feature learning of the less reliable set during training - in the spirit of student mimicking teacher's behavior and thus pushing towards a more compact class centroid in the high-dimensional space. Such a scheme also benefits the reliable set since samples become closer within the same category - implying that it is easier for the classifier to identify. We refer to this mutual learning process as feature intertwiner and embed the spirit into object detection. It is well-known that objects of low resolution are more difficult to detect due to the loss of detailed information during network forward pass. We thus regard objects of high resolution as the reliable set and objects of low resolution as the less reliable set. Specifically, an intertwiner is achieved by minimizing the distribution divergence between two sets. We design a historical buffer to represent all previous samples in the reliable set and utilize them to guide the feature learning of the less reliable set. The design of obtaining an effective feature representation for the reliable set is further investigated, where we introduce the optimal transport (OT) algorithm into the framework. Samples in the less reliable set are better aligned with the reliable set with aid of OT metric. Incorporated with such a plug-and-play intertwiner, we achieve an evident improvement over previous state-of-the-arts on the COCO object detection benchmark.
Persistent Identifier	http://hdl.handle.net/10722/351398
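
The scheme outlined in the abstract (a historical buffer built from the reliable set, used to pull less reliable features toward their class) can be illustrated with a small sketch. This is not the authors' implementation: the names `HistoricalBuffer` and `intertwiner_loss`, the running-mean update, the momentum value, and the squared L2 distance are all assumptions made for brevity. The abstract states that the paper uses an optimal transport metric for alignment, which is not shown here.

```python
import numpy as np


class HistoricalBuffer:
    """Hypothetical per-category summary of reliable-set (high-resolution) features."""

    def __init__(self, num_classes, feat_dim, momentum=0.9):
        self.centers = np.zeros((num_classes, feat_dim))
        self.seen = np.zeros(num_classes, dtype=bool)
        self.momentum = momentum  # assumed value, not taken from the paper

    def update(self, feats, labels):
        """Fold a batch of reliable features into the per-class running mean."""
        for c in np.unique(labels):
            batch_mean = feats[labels == c].mean(axis=0)
            if self.seen[c]:
                self.centers[c] = (self.momentum * self.centers[c]
                                   + (1.0 - self.momentum) * batch_mean)
            else:
                # First batch for this class initialises the buffer entry.
                self.centers[c] = batch_mean
                self.seen[c] = True


def intertwiner_loss(buffer, feats, labels):
    """Mean squared L2 distance from less reliable features to their class entry.

    Squared L2 is an assumed stand-in for the divergence used in the paper.
    Samples whose class has no buffer entry yet are ignored.
    """
    mask = buffer.seen[labels]
    if not mask.any():
        return 0.0
    diff = feats[mask] - buffer.centers[labels[mask]]
    return float((diff ** 2).sum(axis=1).mean())


# Usage with toy 2-D features (illustrative values only).
buf = HistoricalBuffer(num_classes=2, feat_dim=2)
buf.update(np.array([[1.0, 1.0], [3.0, 3.0]]), np.array([0, 0]))
loss_seen = intertwiner_loss(buf, np.array([[2.0, 0.0]]), np.array([0]))
loss_unseen = intertwiner_loss(buf, np.array([[5.0, 5.0]]), np.array([1]))
```

In a real detector the features would come from the network's region-level outputs and the loss would be added to the detection objective; both of those details are outside what this record describes.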

DC Field	Value	Language
dc.contributor.author	Li, Hongyang	-
dc.contributor.author	Dai, Bo	-
dc.contributor.author	Shi, Shaoshuai	-
dc.contributor.author	Ouyang, Wanli	-
dc.contributor.author	Wang, Xiaogang	-
dc.date.accessioned	2024-11-20T03:56:02Z	-
dc.date.available	2024-11-20T03:56:02Z	-
dc.date.issued	2019	-
dc.identifier.citation	7th International Conference on Learning Representations, ICLR 2019, 2019	-
dc.identifier.uri	http://hdl.handle.net/10722/351398	-
dc.description.abstract	A well-trained model should classify objects with a unanimous score for every category. This requires the high-level semantic features should be alike among samples, despite a wide span in resolution, texture, deformation, etc. Previous works focus on re-designing the loss function or proposing new regularization constraints on the loss. In this paper, we address this problem via a new perspective. For each category, it is assumed that there are two sets in the feature space: one with more reliable information and the other with a less reliable source. We argue that the reliable set could guide the feature learning of the less reliable set during training - in the spirit of student mimicking teacher's behavior and thus pushing towards a more compact class centroid in the high-dimensional space. Such a scheme also benefits the reliable set since samples become closer within the same category - implying that it is easier for the classifier to identify. We refer to this mutual learning process as feature intertwiner and embed the spirit into object detection. It is well-known that objects of low resolution are more difficult to detect due to the loss of detailed information during network forward pass. We thus regard objects of high resolution as the reliable set and objects of low resolution as the less reliable set. Specifically, an intertwiner is achieved by minimizing the distribution divergence between two sets. We design a historical buffer to represent all previous samples in the reliable set and utilize them to guide the feature learning of the less reliable set. The design of obtaining an effective feature representation for the reliable set is further investigated, where we introduce the optimal transport (OT) algorithm into the framework. Samples in the less reliable set are better aligned with the reliable set with aid of OT metric. Incorporated with such a plug-and-play intertwiner, we achieve an evident improvement over previous state-of-the-arts on the COCO object detection benchmark.	-
dc.language	eng	-
dc.relation.ispartof	7th International Conference on Learning Representations, ICLR 2019	-
dc.title	Feature intertwiner for object detection	-
dc.type	Conference_Paper	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.scopus	eid_2-s2.0-85083952656	-
