Parametric Classification for Generalized Category Discovery: A Baseline Study

Qi, Xiaojuan

Title	Parametric Classification for Generalized Category Discovery: A Baseline Study
Authors	Qi, Xiaojuan
Issue Date	2-Oct-2023
Abstract	Generalized Category Discovery (GCD) aims to discover novel categories in unlabelled datasets using knowledge learned from labelled samples. Previous studies argued that parametric classifiers are prone to overfitting to seen categories, and endorsed using a non-parametric classifier formed with semi-supervised k-means. However, in this study, we investigate the failure of parametric classifiers, verify the effectiveness of previous design choices when high-quality supervision is available, and identify unreliable pseudo-labels as a key problem. We demonstrate that two prediction biases exist: the classifier tends to predict seen classes more often, and produces an imbalanced distribution across seen and novel categories. Based on these findings, we propose a simple yet effective parametric classification method that benefits from entropy regularisation, achieves state-of-the-art performance on multiple GCD benchmarks and shows strong robustness to unknown class numbers. We hope the investigation and proposed simple framework can serve as a strong baseline to facilitate future studies in this field. Our code is available at: https://github.com/CVMI-Lab/SimGCD.
Persistent Identifier	http://hdl.handle.net/10722/340351
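
The abstract names two prediction biases (over-predicting seen classes and an imbalanced distribution across categories) and credits entropy regularisation for part of the method's benefit. The sketch below illustrates one common form of such a regulariser, assuming it is the entropy of the batch-averaged class prediction; the record itself does not specify the exact formulation, so the function names, the `weight` value, and the toy logits are all hypothetical and are not taken from the SimGCD code.

```python
import math


def softmax(logits):
    """Numerically stable softmax over one list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def mean_prediction_entropy(batch_logits):
    """Entropy of the class distribution averaged over the batch.

    High when predictions are spread across classes, low when the
    classifier assigns most samples to the same few classes.
    """
    probs = [softmax(row) for row in batch_logits]
    n_classes = len(probs[0])
    mean_p = [sum(p[k] for p in probs) / len(probs) for k in range(n_classes)]
    return -sum(p * math.log(p) for p in mean_p if p > 0.0)


def regularised_loss(base_loss, batch_logits, weight=2.0):
    """Subtract the weighted entropy so balanced predictions lower the loss.

    `weight` is an illustrative value, not one reported in this record.
    """
    return base_loss - weight * mean_prediction_entropy(batch_logits)


# Toy batches over four classes (hypothetical logits).
collapsed = [[5.0, 0.0, 0.0, 0.0] for _ in range(4)]  # every sample -> class 0
balanced = [
    [5.0, 0.0, 0.0, 0.0],
    [0.0, 5.0, 0.0, 0.0],
    [0.0, 0.0, 5.0, 0.0],
    [0.0, 0.0, 0.0, 5.0],
]

print("collapsed entropy:", mean_prediction_entropy(collapsed))
print("balanced entropy: ", mean_prediction_entropy(balanced))
```

Under this assumption, the balanced batch has a higher mean-prediction entropy than the collapsed one, so the regularised loss penalises a classifier that funnels unlabelled samples into a small set of (typically seen) classes.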

DC Field	Value	Language
dc.contributor.author	Qi, Xiaojuan	-
dc.date.accessioned	2024-03-11T10:43:31Z	-
dc.date.available	2024-03-11T10:43:31Z	-
dc.date.issued	2023-10-02	-
dc.identifier.uri	http://hdl.handle.net/10722/340351	-
dc.language	eng	-
dc.relation.ispartof	2023 International Conference on Computer Vision (02/10/2023-06/10/2023, Paris Convention Centre)	-
dc.title	Parametric Classification for Generalized Category Discovery: A Baseline Study	-
dc.type	Conference_Paper	-
