README.md

March 17, 2026 · View on GitHub

Category Discovery: An Open-World Perspective

Zhenqi He , Yuanpei Liu , Kai Han

arXiv


This repository serves as a supplementary resource for our survey paper on Category Discovery (CD) methods. It includes a comprehensive collection of key papers, frameworks, and approaches in the field of CD, summarizing the most recent advancements and techniques. The materials here aim to provide researchers with an accessible overview of current trends and methodologies in CD, along with references and additional insights to support further exploration.

We will continue to maintain and update this repository with new papers and resources as the field evolves. Contributions are welcome, and we encourage pull requests (PRs) to help expand and improve the content for the community.

Table of Contents

Introduction

Alt text

Category Discovery (CD) addresses the limitations of the closed-world assumption by embracing an open-world setting. As shown in above figure, CD differs from semi-supervised learning and OSR&OOD by clustering unlabelled data that contains unseen categories. It is motivated by that human beings are capable of discovering unknown species by transferring existing knowledge on explored species. CD proves highly applicable across various real-world scenarios. For example, in autonomous driving, vehicles must continuously detect and classify new objects—such as unfamiliar road signs or obstacles—beyond their initial training to ensure safe navigation. In retail, CD can automatically recognize newly introduced products in supermarkets without the need for manual labeling.

Roadmap

Alt text
In recent years, CD has garnered increasing attention, leading to a proliferation of research exploring various methodologies and settings. It was initially introduced as Novel Category Discovery (NCD) in to cluster unlabelled novel categories by leveraging knowledge from labelled base categories. This concept was later expanded into Generalized Category Discovery (GCD), which relaxed earlier constraints by assuming that the unlabelled data contains both novel and base categories, thereby more closely mirroring real-world scenarios. Further advancing the field, Han *etal.* proposed Semantic Category Discovery (SCD), aiming to assign semantic labels to unlabelled samples from an unconstrained vocabulary space. Additionally, CD has been applied to complex scenarios such as continual learning, where models learn incrementally over time, and federated learning, which focuses on training models across decentralized devices while ensuring data privacy. CD methods have also been explored in challenging settings, including few-shot learning, where limited labelled data is available, and with imbalanced distribution and domain-shifted data, making CD more applicable to real-world problems.

Category Discovery

Alt text

Novel Category Discovery (NCD)

The concept of NCD aims to transfer the knowledge learned from base categories to cluster unlabelled unseen categories, motivated by the observation where a child could easily distinguish novel categories (e.g., birds and elephants) after learning to classify base categories (e.g., dogs and cats).

Formally, given a dataset D=DLDU`\mathcal{D} = \mathcal{D}_L \cup \mathcal{D}_U`, where the labelled portion is DL={(xi,yi)}i=1MX×YL`\mathcal{D}_L = \{(\mathbf{x}_i, y_i)\}_{i=1}^M \subset \mathcal{X} \times \mathcal{Y}_L` and the unlabelled portion is DU={(xi,y^i)}i=1KX×YU`\mathcal{D}_U = \{(\mathbf{x}_i, \hat{y}_i)\}_{i=1}^K \subset \mathcal{X} \times \mathcal{Y}_U` (with the labels y^i`\hat{y}_i` being inaccessible during training), the objective of NCD is to leverage the discriminative information learned from the annotated data to cluster the unlabelled data.

This setting presumes that the label spaces of the labelled and unlabelled data are disjoint, i.e., YLYU=\mathcal{Y}_L \cap \mathcal{Y}_U = \varnothing, implying CN=YU\mathcal{C}_N = \mathcal{Y}_U, while also assuming a high degree of semantic similarity between the base and novel categories.

YearMethodPub.BackboneLabel Assignment# Unlabelled categoriesDataset
2018KCLICLRResNetParametric ClassifierOver-estimateOmniglot, ImageNeg-1K, Office31
2019MCLICLRResNet, VGG, LeNetParametric ClassifierOver-estimateOmniglot, CIFAR-10&100, ImageNet-1K, MNIST
DTCICCVResNet, VGGSoft Assignmentkk-MeansOmniglot, CIFAR-10&100, ImageNet-1K, SVHN
2020RS, RS+ICLRResNetParametric ClassifierKnownOmniglot, CIFAR-10&100, ImageNet-1K, SVHN
2021Qing etal.Neural NetworksResNetParametric ClassifierKnownCIFAR-10&100, SVHN
OpenMixCVPRResNet, VGGParametric ClassifierKnownCIFAR-10&100, ImageNet-1K
NCLCVPRResNetParametric ClassifierKnownCIFAR-10&100, ImageNet-1K
JOINTICCVResNetParametric ClassifierKnownCIFAR-10&100, ImageNet-1K
UNOICCVResNetParametric ClassifierKnownCIFAR-10&100, ImageNet-1K
DualRSICCVResNetParametric ClassifierKnownCIFAR-10&100, ImageNet-100&1K, SSB
2022SMIICASSPVGG-16kk-MeansKnownCIFAR-10&100, ImageNet-1K
PSSCNNCDT'CYBN/ABKBH kk-MeansProgressive label propagationCoil20, Yeast, MSRA25, PalmData25, Abalone, USPS, Letter, MNIST
Li etal.NeurIPSWResNetkk-Meanskk-MeansCIFAR-100, ImageNet-1K
2023ResTuneT'NNLSResNetkk-MeansKnownCIFAR-10&100, TinyImageNet
SK-HurtTMLRResNetkk-Meanskk-MeansCIFAR-100, ImageNet-1K
IICCVPRResNetParametric Classifierkk-MeansCIFAR-10&100, ImageNet-1K
NSCLICMLResNetkk-Meanskk-MeansCIFAR-100, ImageNet-1K
CRKDICCVResNet, ViTParametric ClassifierKnownCIFAR-100, SSB
Feng etal.MICCAIResNetParametric ClassifierKnownISIC2019
2024RAPLCVPRResNetkk-MeansKnownSoyAgeing
SCKDECCVResNet, ViTParametric ClassifierKnownCIFAR-10&100, ImageNet-100, SSB
APLT'PAMIResNetParametric ClassifierKnownCIFAR-10&100, Omniglot, ImageNet-1K
PrePrintHasan etal.ArXivResNetParametric Classifierkk-MeansCIFAR-10&100

Generalized Category Discovery (GCD)

Extending the NCD paradigm, Generalized Category Discovery relaxes the disjointness assumption between the base and novel categories, thereby presenting a more challenging and realistic scenario. In GCD, the labelled and unlabelled datasets may share common categories, i.e., YLYU\mathcal{Y}_L \cap \mathcal{Y}_U \neq \varnothing, and the set of novel categories is defined as a subset of YU\mathcal{Y}_U (i.e., CNYU\mathcal{C}_N \subset \mathcal{Y}_U). This general formulation is particularly pertinent to practical applications such as plant species discovery, where an existing database of known species is augmented with newly observed species, necessitating the clustering of both known and novel instances.

Notably, an equivalent formulation has been introduced by Cao etal. under the designation of Open-World Semi-Supervised Learning. In the following context, we refer to both formulations under the umbrella term Generalized Category Discovery.

YearMethodPub.BackboneLabel Assignment# Unlabelled categoriesDataset
2022GCDCVPRViTSemi-k-Meanskk-MeansCIFAR-10&100, ImageNet-100, SSB, Herb19
ORCACVPRResNetParametric ClassifierKnownCIFAR-10&100, ImageNet-100, Single-Cell
ComExCVPRResNetParametric ClassifierKnownCIFAR-10&100
OpenLDNECCVResNetParametric ClassifierKnownCIFAR-10&100, ImageNet-100, TinyImage, Oxford Pets
TRSSLECCVResNetParametric Classifierkk-MeansCIFAR-10&100, ImageNet-100, TinyImage, Oxford Pets, Scars, Aircrafts
NACHNeurIPSResNetParametric ClassifierKnownCIFAR-10&100, ImageNet-100
XConBMVCViTSemi-k-Meanskk-MeansCIFAR-10&100, ImageNet-100, SSB, Oxford Pets
2023OpenConTMLRResNetPrototype-basedkk-MeansCIFAR-10&100, ImageNet-100
PromptCALCVPRViTSemi-k-MeansKnownCIFAR-10&100, ImageNet-100, SSB
DCCLCVPRViTInfomapInfomapCIFAR-10&100, ImageNet-100, CUB, Scars, Oxford Pets
OpenNCDIJCAIResNetPrototype-basedPrototype GroupingCIFAR-10&100, ImageNet-100
SimGCDICCVViTParametric Classifierkk-MeansCIFAR-10&100, ImageNet-100, SSB, Herb19
GPCICCVViTGMMGMMCIFAR-10&100, ImageNet-100, SSB
PIMICCVViTParametric Classifierkk-MeansCIFAR-10&100, ImageNet-100, CUB, Scars, Herb19
TIDANeurIPSResNetParametric ClassifierKnownCIFAR-10&100, ImageNet-100, TinyImageNet, Scars, Aircraft
μ\muGCDNeurIPSResNet, ViT, ViTkk-MeansKnownClevr-4
InfoSieveNeurIPSViTkk-Meanskk-MeansCIFAR-10&100, ImageNet-100, SSB, Oxford Pets, Herb19
SORLNeurIPSResNetkk-MeansKnownCIFAR-10&100
Yang etal.ICONIPViTLouvainLouvainCIFAR-10&100, ImageNet-100, CUB, Scars, Herb19
2024AMENDWACVViTParametric ClassifierKnownCIFAR-10&100, ImageNet-100, SSB, Herb19
GCAWACVViTGuided Cluster Aggregationkk-MeansCIFAR-10&100, ImageNet-100, SSB
SPT-NetICLRViT, ViTParametric Classifierkk-MeansCIFAR-10&100, ImageNet-100, SSB
LegoGCDCVPRViTParametric ClassifierKnownCIFAR-10&100, ImageNet-100, -1K, SSB, Herb19
CMSCVPRViTAgglomerative ClusteringAgglomerative ClusteringCIFAR-100, ImageNet-100, SSB, Herb19
ActiveGCDCVPRViTParametric ClassifierKnownCIFAR-10&100, ImageNet-100, SSB
TextGCDECCVViTParametric ClassifierKnownCIFAR-10&100, ImageNet-100, -1K, SSB, Oxford Pets, Flowers102
LPSIJCAIResNetParametric ClassifierKnownCIFAR-10&100, ImageNet-100
Contextuality-GCDICIPViTParametric ClassifierKnownCIFAR-10&100, ImageNet-100, -1K, SSB, Herb19
2025MSGCDInformation FusionViTParametric ClassifierKnownCIFAR-100, SSB
CPTIJCVViTSimilarity-Basedkk-MeansCIFAR-10&100, ImageNet-100,CUB, Scars, Herb19
PAL-GCDAAAIViTParametric ClassifierDBSCANCIFAR-100,ImageNet-100,SSB, Herb19
DebGCDICLRViTParametric ClassifierDBSCANCIFAR-10&100,ImageNet-100&1K,SSB, Herb19,Oxford-Pets
ProtoGCDT'PAMIViTParametric Classifierkk-MeansCIFAR-10&100,ImageNet-100&1K,SSB, Herb19
MOSCVPRViTParametric ClassifierKnownSSB, Oxford-Pets
GETCVPRViTParametric ClassifierKnownCIFAR-10&100,ImageNet-100,SSB,Herb19
AptGCDCVPRViTParametric ClassifierKnownCIFAR-10&100,ImageNet-100,SSB,Herb19
Dai et alCVPRViT-KnownSSB, Herb19
HypCDCVPRViT-KnownCIFAR-10&100,ImageNet-100,SSB,Herb19
PrePrintCLIP-GCDArXivViTSemi-k-Meanskk-MeansCIFAR-10&100, ImageNet-100, -1K, SSB, Flowers102, DomainNet
MCDLArXivViTParametric ClassifierKnownCIFAR-10&100, ImageNet-100, -1K, CUB, SCars, Herb19
PNPArXivViTInfomapInfomapCIFAR-10&100, ImageNet-100, -1K, SSB, Herb19
RPIMArXivViTParametric ClassifierKnownCIFAR-10&100, ImageNet-100, -1K, CUB, Scars, Herb19
OpenGCDArXivViTParametric Classifierkk-MeansCIFAR-10&100, CUB
ConceptGCDArXivViT, ViTParametric Classifierkk-MeansCIFAR-100, ImageNet-100, -1K, SSB, Herb19
GETArXivViTParametric ClassifierKnownCIFAR-10&100, ImageNet-100, SSB, Herb19

Continual Category Discovery (CCD)

CCD provides a continual setting of category discovery in which new categories are identified sequentially while retaining previously acquired knowledge. CCD presents several distinct scenarios based on the structure of the incoming data.

In the Class Incremental Scenario, the training set Dtraint`\mathcal{D}_{\mathrm{train}}^t` contains solely unlabelled instances from novel categories. In the Mixed Incremental Scenario, Dtraint`\mathcal{D}_{\mathrm{train}}^t` is composed exclusively of unlabelled data drawn from both novel and base categories. Finally, in the Semi-Supervised Mixed Incremental Scenario, Dtraint`\mathcal{D}_{\mathrm{train}}^t` comprises both labelled and unlabelled samples, which originate from the base as well as the novel categories.

YearMethodPub.BackboneScenarioLabel Assignment# Unlabelled categoriesDataset
2022NCDwFECCVResNetClass IncrementalParametric ClassifierKnownCIFAR-10/100, ImageNet-1K
FRoSTECCVResNetClass IncrementalParametric ClassifierKnownCIFAR-10/100, TinyImageNet
GMNeurIPSResNetAllParametric ClassifierKnownCIFAR-100, ImageNet-100, CUB
2023PA-GCDICCVViT, ResnetMixed IncrementalParametric ClassifierAffinity PropagationCUB, MIT67, Stanford Dogs, Aircraft
MetaGCDICCVViTMixed Incrementalkk-Meanskk-MeansCIFAR-10/100, TinyImageNet
iGCDICCVResNetSelf-Supervised Mixed IncrementalSoft Nearest NeighborDensity PeaksCUB, Aircraft, CIFAR-100
2024Msc-iNCDICPRViTClass IncrementalParametric ClassifierKnownCIFAR-100, ImageNet-100/1K
ADMAAAIResNetClass IncrementalParametric ClassifierKnownCIFAR-10/100, TinyImageNet
PromptCCDECCVViTMixed IncrementalGMMGMPCIFAR-100, ImageNet-100, TinyImageNet
DEANECCVViTMixed IncrementalParametric ClassifierAffinity PropagationCUB, Aircraft, CIFAR-100
CAMPECCVViTSelf-Supervised Mixed IncrementalNearest Centroid ClassifierKnownCUB, Aircraft, SCars, DomainNet, CIFAR-100
HappyNeurIPSViTMixed IncrementalParametric ClassifierSilhouette ScoreCIFAR-100, ImageNet-100, TinyImageNet, CUB
PreprintFEAArXivViTClass IncrementalParametric ClassifierKnownCIFAR-10/100, TinyImageNet

On-the-fly Category Discovery (OCD)

OCD extends conventional category discovery to an inductive learning paradigm with streaming inference. It trains on a labelled support set DSD_S to cluster unlabelled query set DQD_Q where DSD_S is unavailable during training and its samples are individually at test time.

YearMethodPub.BackboneLabel Assignment# Unlabelled categoriesDataset
2023SMILECVPRViTHash-basedHash-codingCIFAR-10&100, ImageNet-100, CUB, Scars, Herb19
2024PHENeurIPSViTHamming Ball-BasedHamming Ball-BasedCUB, Scars, Oxford Pets, Food-101, iNaturalist

Category Discovery with domain shift

This setting relaxes the conventional assumption that both labelled and unlabelled data are drawn from the same semantic domain. Formally, let DL`\mathcal{D}_L` denote the labelled data, assumed to be exclusively drawn from the domain ΩB`\Omega_B`, and let DU`\mathcal{D}_U` denote the unlabelled data, which may include samples originating from both ΩB`\Omega_B` and an additional domain ΩN`\Omega_{N}`. The objective is to accurately classify images drawn from the combined domain Ω=ΩBΩN`\Omega = \Omega_B \cup \Omega_{N}`, under the assumption that the novel domain is disjoint from the base domain (i.e., ΩBΩN=`\Omega_B \cap \Omega_{N} = \varnothing`). In practice, the novel domain ΩN`\Omega_{N}` may encompass multiple subdomains.

YearMethodPub.BackboneΩU\Omega_{\mathcal{U}}Label Assignment# Unlabelled categoriesDatasetYLYU\mathcal{Y_L} \cap \mathcal{Y_U}
2022Yu etal.AAAIResNetSingle New DomainParametric Classifierkk-MeansOffice, OfficeHome, VisDA\varnothing
SCDAICMEResNetMultiple New DomainsParametric Classifierkk-MeansOffice, OfficeHome, DomainNet\varnothing
2023SANICCVResNetSingle New DomainParametric ClassifierN/AOffice, OfficeHome, VisDA, DomainNet\varnothing
2024CDAD-NetCVPRWViTSingle New DomainSemi-k-MeansElbowOfficeHome, PACS, DomainNet, CIFAR-10&100, ImageNet-100\neq \varnothing
2025HiLoICLRViTMultiple new DomainsParametric Classifierkk-MeansDomainNet, SSB-C\neq \varnothing
ArXivWang etal.ArXivViTSingle New DomainParametric ClassifierKnownCIFAR-10, OfficeHome, DomainNet\varnothing

Distribution-Agnostic Category Discovery (DA-CD)

DA-CD eliminates the requirement for a balanced distribution imposed on both labelled and unlabelled data in conventional category discovery. Instead, it acknowledges that the data may follow a skewed distribution, such that for certain categories Yi`\mathcal{Y}_i` and Yj`\mathcal{Y}_j` within the set Y`\mathcal{Y}` it holds that PYx(Yi)>PYx(Yj)`\mathbb{P}_{\mathcal{Y}_x}(\mathcal{Y}_i) > \mathbb{P}_{\mathcal{Y}_x}(\mathcal{Y}_j)`. In this formulation, the set Yx`\mathcal{Y}_x` may refer to either the labelled categories YL`\mathcal{Y}_L` or the unlabelled categories YU`\mathcal{Y}_U`.

YearMethodPub.BackboneScenarioLabel Assignment# Unlabelled categoriesDataset
2023NCDLRTMLRViTLong-tailed DistributionParametric Classifierkk-MeansCIFAR-10, ImageNet-100, Herb19, iNaturalist18
ImbaGCDCVPRWResnetImbalanced DistributionParametric ClassifierKnownCIFAR-10&100, ImageNet-100
GCDLRICCVWResnetImbalanced DistributionParametric ClassifierKnownCIFAR-10&100, ImageNet-100
BYOPCVPRResNetImbalanced DistributionParametric ClassifierKnownCIFAR-10&100, TinyImageNet
BaConNeurIPSViTLong-tailed Distributionkk-MeansKnownCIFAR-10&100-LT, ImageNet-100-LT, Places-LT
2024Fan etalCVPRViTLong-tailed Distributionkk-MeansSpectral graphBioMedical Datasets

Semantic Category Discovery (SCD)

In contrast to NCD and GCD, which focus solely on grouping visually similar images without considering their semantic meaning, SCD extends these paradigms by also assigning a semantic label to each unlabelled instance. Specifically, SCD leverages an open vocabulary label space to achieve this goal. In this context, WordNet, comprising approximately 68,000 labels, is employed as a comprehensive and unconstrained vocabulary, facilitating the assignment of meaningful semantic labels.

YearMethodPub.BackboneWord SpaceLabel Assignment# Unlabelled categoriesDataset
2024SCDCVPRWViT~OpenKMeans+Top-k VotingKnownImageNet-100&1K, SCars, CUB
SNCDAAAIResNetBase + Novel CategoriesParametric ClassifierKnownCIFAR-10&100, ImageNet-100

Few-Shots Category Discovery (FS-CD)

FS-CD addresses the challenge of identifying novel classes when only a very limited amount of labelled data is available. This setting extends traditional category discovery by integrating the principles of few-shot learning. In particular, FSCD adopts an NN-way, kk-shot framework in which the model is required to discriminate among NN distinct classes with merely kk labelled examples per class for base categories.

Chi etal. extend NCD to a few-shot setting by linking it to meta-learning, based on the shared assumption that base and novel categories possess high-level semantic features. By adapting meta-learning techniques such as Model-Agnostic Meta-Learning and Prototypical Networks (ProtoNet), their approach shifts the focus from classification to clustering tasks—a critical adjustment for few-shot category discovery. A key innovation is the introduction of the Clustering-rule-aware Task Sampler, which ensures that training tasks adhere to consistent clustering rules, thereby enabling the model to generalize better to novel categories despite the limited labelled data. However, this method assumes that the number of novel categories is known in advance.

Federated Category Discovery (FCD)

FCD extends Category Discovery in a federated learning setting, facilitating decentralized and collaborative model training among clients while safeguarding data privacy.

YearMethodPub.BackboneLabel Assignment# Unlabelled categoriesDataset
2023FedoSSLICMLResNetParametric ClassifierKnownCIFAR-10/100, CINIC-10
2024FedGCDCVPRViTGMMSemi-FINCHCIFAR-10/100, ImageNet-100, CUB, SCars, Pets
PreprintGALArXivResNet&34Parametric ClassifierPotential Prototype MergeCIFAR-100, TinyImageNet, ImageNet-100
  • Visual-AI/Awesome-Category-Discovery — An awesome-style paper list organized by year. Contributions are welcome—feel free to open issues or submit pull requests to add relevant works.
  • Awesome-Novel-Class-Discovery — We appreciate this excellent paper collection curated by Joseph, which complements our repo with additional coverage of novel/class discovery works and resources.