Awesome Long-Tailed Learning (TPAMI 2023)

November 25, 2025 · View on GitHub

Awesome PRs Welcome

We released Deep Long-Tailed Learning: A Survey and our codebase to the community. In this survey, we reviewed recent advances in long-tailed learning based on deep neural networks. Existing long-tailed learning studies can be grouped into three main categories (i.e., class re-balancing, information augmentation and module improvement), which can be further classified into nine sub-categories (as shown in the below figure). We also provided empirical analysis for several state-of-the-art methods by evaluating to what extent they address the issue of class imbalance. We concluded the survey by highlighting important applications of deep long-tailed learning and identifying several promising directions for future research.

After completing this survey, we decided to release our long-tailed learning resources and codebase, hoping to push the development of the community. If you have any questions or suggestions, please feel free to contact us.

1. Type of Long-tailed Learning

SymbolSamplingCSLLATLAug
TypeRe-samplingClass-sensitive LearningLogit AdjustmentTransfer LearningData Augmentation
SymbolRLCDDTEnsembleother
TypeRepresentation LearningClassifier DesignDecoupled TrainingEnsemble LearningOther Types

2. Top-tier Conference Papers (Updated on 2025 June)

2025

TitleVenueYearTypeCode
Supervised Exploratory Learning for Long-Tailed Visual RecognitionICCV2025Sampling,RL
You Are Your Own Best Teacher: Achieving Centralized-level Performance in Federated Learning under Heterogeneous and Long-tailed DataICCV2025LA,TL,RLOfficial
Boosting Class Representation via Semantically Related Instances for Robust Long-Tailed Learning with Noisy LabelsICCV2025TL,Ensemble,otherOfficial
Long-Tailed Classification with Multi-Granularity SemanticsICCV2025Aug,RL
AMD: Adaptive Momentum and Decoupled Contrastive Learning Framework for Robust Long-Tail Trajectory PredictionICCV2025Aug,RL
Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion ModelICCV2025Aug,other
Category-Specific Selective Feature Enhancement for Long-Tailed Multi-Label Image ClassificationICCV2025RL
Overcoming Dual Drift for Continual Long-Tailed Visual Question AnsweringICCV2025RL,CD
A Tiny Change, A Giant Leap: Long-Tailed Class-Incremental Learning via Geometric Prototype AlignmentICCV2025RL,CDOfficial
Toward Long-Tailed Online Anomaly Detection through Class-Agnostic ConceptsICCV2025otherOfficial
Rethinking the Bias of Foundation Model under Long-tailed DistributionICML2025LA
Advancing Personalized Learning with Neural Collapse for Long-Tail ChallengeICML2025RLOfficial
Focal-SAM: Focal Sharpness-Aware Minimization for Long-Tailed ClassificationICML2025RL
A Square Peg in a Square Hole: Meta-Expert for Long-Tailed Semi-Supervised LearningICML2025RL,Ensemble
Balancing Model Efficiency and Performance: Adaptive Pruner for Long-tailed DataICML2025otherOfficial
TailedCore : Few-Shot Sampling for Unsupervised Long-Tail Noisy Anomaly DetectionCVPR2025SamplingOfficial
Fractal Calibration for Long-Tailed Object DetectionCVPR2025LA,otherOfficial
SimLTD: Simple Supervised and Semi-Supervised Long-Tailed Object DetectionCVPR2025TL
Learning from Neighbors: Category Extrapolation for Long-Tail LearningCVPR2025TL,Aug,RL
Distilling Long-tailed DatasetsCVPR2025TL,DTOfficial
Search and Detect: Training-Free Long Tail Object Detection via Web-Image RetrievalCVPR2025otherOfficial
TAET: Two-Stage Adversarial Equalization Training on Long-Tailed DistributionsCVPR2025otherOfficial
Pursuing Better Decision Boundaries for Long-Tailed Object Detection via Category Information AmountICLR2025CSL
Rethinking Classifier Re-Training in Long-Tailed Recognition: Label Over-Smooth Can BalanceICLR2025LA,CD
Long-tailed Adversarial Training with Self-DistillationICLR2025TL,other
ConMix: Contrastive Mixup at Representation Level for Long-tailed Deep ClusteringICLR2025Aug,RLOfficial
Geometry of Long-Tailed Representation Learning: Rebalancing Features for Skewed DistributionsICLR2025RL

2024

TitleVenueYearTypeCode
Taming the long tail in human mobility predictionNeurIPS2024Sampling,LA
Long-tailed object detection pre-training: dynamic rebalancing contrastive learning with dual reconstructionNeurIPS2024Sampling,RL
AUCSeg: AUC-oriented pixel-level long-tail semantic segmentationNeurIPS2024Sampling,otherOfficial
Continuous contrastive learning for long-tailed semi-supervised recognitionNeurIPS2024CSL,LA,RLOfficial
Long-tailed out-of-distribution detection via normalized outlier distribution adaptationNeurIPS2024LAOfficial
LLM-ESR: Large language models enhancement for long-tailed sequential recommendationNeurIPS2024TL,EnsembleOfficial
DiffuLT: Diffusion for long-tail recognition without external knowledgeNeurIPS2024Aug
LLM-AutoDA: Large language model-driven automatic data augmentation for long-tailed problemsNeurIPS2024AugOfficial
Breaking long-tailed learning bottlenecks: A controllable paradigm with hypernetwork-generated diverse expertsNeurIPS2024EnsembleOfficial
Once Read is Enough: Domain-specific pretraining-free language models with cluster-guided sparse experts for long-tail domain knowledgeNeurIPS2024Ensemble
Improving visual prompt tuning by Gaussian neighborhood minimization for long-tailed visual recognitionNeurIPS2024otherOfficial
Towards heterogeneous long-tailed learning: Benchmarking, Metrics, and ToolboxNeurIPS2024otherOfficial
What makes CLIP more robust to long-tailed pre-training data? A controlled study for transferable insightsNeurIPS2024otherOfficial
Flexible distribution alignment: Towards long-tailed semi-supervised learning with proper calibrationECCV2024LAOfficial
Long-tail temporal action segmentation with group-wise temporal logit adjustmentECCV2024LAOfficial
Distribution-aware robust learning from long-tailed data with noisy labelsECCV2024Aug,RLOfficial
Distributionally robust loss for long-tailed multi-label image classificationECCV2024RLOfficial
Rectify the regression bias in long-tailed object detectionECCV2024RL,CD
LTRL: Boosting long-tail recognition via reflective learningECCV2024otherOfficial
Learning label shift correction for test-agnostic long-tailed recognitionICML2024LAOfficial
Generative active learning for long-tailed instance segmentationICML2024Aug
ELTA: An enhancer against long-tail for aesthetics-oriented modelsICML2024AugOfficial
Distribution alignment optimization through neural collapse for long-tailed classificationICML2024RLOfficial
Long-tail learning with foundation model: Heavy fine-tuning hurtsICML2024CDOfficial
SimPro: A simple probabilistic framework towards realistic long-tailed semi-supervised learningICML2024CDOfficial
Harnessing hierarchical label distribution variations in test agnostic long-tail recognitionICML2024EnsembleOfficial
Two Fists, One Heart: Multi-objective optimization based strategy fusion for long-tailed learningICML2024otherOfficial
BEM: Balanced and entropy-based mix for long-tailed semi-supervised learningCVPR2024CSL,TL,CDOfficial
DeiT-LT: Distillation strikes back for vision transformer training on long-tailed datasetsCVPR2024CSL,TL,CDOfficial
Revisiting adversarial training under long-tailed distributionsCVPR2024CSL,AugOfficial
Long-tailed anomaly detection with learnable class namesCVPR2024TL,AugOfficial
LTGC: Long-tail recognition via leveraging LLMs-driven generated contentCVPR2024AugOfficial
Delving into the trajectory long-tail distribution for multi-object trackingCVPR2024Aug,CDOfficial
Long-tail class incremental learning via independent sub-prototype constructionCVPR2024RL
Long-tailed diffusion models with oriented calibrationICLR2024Sampling,CSL,TLOfficial
Kill two birds with one stone: Rethinking data augmentation for deep long-tailed learningICLR2024AugOfficial
FedLoGe: Joint local and generic federated learning under long-tailed dataICLR2024RLOfficial
Learning to reject meets long-tail learningICLR2024CD
Exploring weight balancing on long-tailed recognition problemICLR2024DTOfficial
Pareto deep long-tailed recognition: A conflict-averse solutionICLR2024otherOfficial

2023

TitleVenueYearTypeCode
How re-sampling helps for long-tail learning?NeurIPS2023Sampling,AugOfficial
Fed-GraB: Federated long-tailed learning with self-adjusting gradient balancerNeurIPS2023CSLOfficial
Enhancing minority classes by mixing: an adaptative optimal transport approach for long-tailed classificationNeurIPS2023AugOfficial
Learning from rich semantics and coarse locations for long-tailed object detectionNeurIPS2023RLOfficial
Generalized test utilities for long-tail performance in extreme multi-label classificationNeurIPS2023other
Label-noise learning with intrinsically long-tailed dataICCV2023SamplingOfficial
MDCS: More diverse experts with consistency self-distillation for long-tailed recognitionICCV2023Sampling,TL,EnsembleOfficial
Subclass-balancing Contrastive Learning for Long-tailed RecognitionICCV2023Sampling,RLOfficial
When Noisy Labels Meet Long Tail Dilemmas: A Representation Calibration MethodICCV2023Sampling,RL,DT
AREA: Adaptive reweighting via effective area for long-tailed classificationICCV2023CSLOfficial
Reconciling object-level and global-level objectives for long-tail detectionICCV2023CSLOfficial
Local and global logit adjustments for long-tailed learningICCV2023CSL,LA,Ensemble
Learning in imperfect environment: Multi-label classification with long-tailed distribution and partial labelsICCV2023CSL,TLOfficial
Global balanced experts for federated long-tailed learningICCV2023CSL, EnsembleOfficial
Boosting long-tailed object detection via step-wise learning on smooth-tail dataICCV2023Ensemble
Long-tailed recognition by mutual information maximization between latent features and ground-truth labelsICML2023CSL,RLOfficial
Large language models struggle to learn long-tail knowledgeICML2023Aug
Feature directions matter: Long-tailed learning via rotated balanced representationICML2023RL
Wrapped Cauchy distributed angular softmax for long-tailed visual recognitionICML2023RL,CDOfficial
Rethinking image super resolution from long-tailed distribution learning perspectiveCVPR2023CSL
Transfer knowledge from head to tail: Uncertainty calibration under long-tailed distributionCVPR2023CSL,TLOfficial
Towards realistic long-tailed semi-supervised learning: Consistency is all you needCVPR2023CSL,TL,EnsembleOfficial
Global and local mixture consistency cumulative learning for long-tailed visual recognitionsCVPR2023CSL,RLOfficial
Long-tailed visual recognition via self-heterogeneous integration with knowledge excavationCVPR2023TL,EnsembleOfficial
Balancing logit variation for long-tailed semantic segmentationCVPR2023AugOfficial
Use your head: Improving long-tail video recognitionCVPR2023AugOfficial
FCC: Feature clusters compression for long-tailed visual recognitionCVPR2023RLOfficial
FEND: A future enhanced distribution-aware contrastive learning framework for long-tail trajectory predictionCVPR2023RL
SuperDisco: Super-class discovery improves visual recognition for the long-tailCVPR2023RL
Class-conditional sharpness-aware minimization for deep long-tailed recognitionCVPR2023DTOfficial
Balanced product of calibrated experts for long-tailed recognitionCVPR2023EnsembleOfficial
No one left behind: Improving the worst categories in long-tailed learningCVPR2023Ensemble
On the effectiveness of out-of-distribution data in self-supervised long-tail learningICLR2023Sampling,TL,AugOfficial
LPT: Long-tailed prompt tuning for image classificationICLR2023Sampling,TL,OtherOfficial
Long-tailed partial label learning via dynamic rebalancingICLR2023CSLOfficial
Delving into semantic scale imbalanceICLR2023CSL,RL
INPL: Pseudo-labeling the inliers first for imbalanced semi-supervised learningICLR2023TL
CUDA: Curriculum of data augmentation for long-tailed recognitionICLR2023AugOfficial
Long-tailed learning requires feature learningICLR2023RL
Decoupled training for long-tailed classification with stochastic representationsICLR2023RL,DT

2022

TitleVenueYearTypeCode
Self-supervised aggregation of diverse experts for test-agnostic long-tailed recognitionNeurIPS2022CSL,EnsembleOfficial
SoLar: Sinkhorn label refinery for imbalanced partial-label learningNeurIPS2022CSLOfficial
Do we really need a learnable classifier at the end of deep neural network?NeurIPS2022RL,CD
Maximum class separation as inductive bias in one matrixNeurIPS2022CDOfficial
Escaping saddle points for effective generalization on class-imbalanced dataNeurIPS2022otherOfficial
Breadcrumbs: Adversarial class-balanced sampling for long-tailed recognitionECCV2022Sampling,Aug,DTOfficial
Constructing balance from imbalance for long-tailed image recognitionECCV2022Sampling,RLOfficial
Tackling long-tailed category distribution under domain shiftsECCV2022CSL,Aug,RLOfficial
Improving GANs for long-tailed data through group spectral regularizationECCV2022CSL,OtherOfficial
Learning class-wise visual-linguistic representation for long-tailed visual recognitionECCV2022TL,RLOfficial
Learning with free object segments for long-tailed instance segmentationECCV2022Aug
SAFA: Sample-adaptive feature augmentation for long-tailed image classificationECCV2022Aug,RL
On multi-domain long-tailed recognition, imbalanced domain generalization, and beyondECCV2022RLOfficial
Invariant feature learning for generalized long-tailed classificationECCV2022RLOfficial
Towards calibrated hyper-sphere representation via distribution overlap coefficient for long-tailed learningECCV2022RL,CDOfficial
Long-tailed instance segmentation using Gumbel optimized lossECCV2022CDOfficial
Long-tailed class incremental learningECCV2022DTOfficial
Identifying hard noise in long-tailed sample distributionECCV2022OtherOfficial
Relieving long-tailed instance segmentation via pairwise class balanceCVPR2022CSLOfficial
The majority can help the minority: Context-rich minority oversampling for long-tailed classificationCVPR2022TL,AugOfficial
Long-tail recognition via compositional knowledge transferCVPR2022TL,RL
BatchFormer: Learning to explore sample relationships for robust representation learningCVPR2022TL,RLOfficial
Nested collaborative learning for long-tailed visual recognitionCVPR2022RL,EnsembleOfficial
Long-tailed recognition via weight balancingCVPR2022DTOfficial
Class-balanced pixel-level self-labeling for domain adaptive semantic segmentationCVPR2022otherOfficial
Killing two birds with one stone: Efficient and robust training of face recognition CNNs by partial FCCVPR2022otherOfficial
Optimal transport for long-tailed recognition with learnable cost matrixICLR2022LA
Do deep networks transfer invariances across classes?ICLR2022TL,AugOfficial
Self-supervised learning is more robust to dataset imbalanceICLR2022RL

2021

TitleVenueYearTypeCode
Improving contrastive learning on imbalanced seed data via open-world samplingNeurIPS2021Sampling,TL, DCOfficial
Semi-supervised semantic segmentation via adaptive equalization learningNeurIPS2021Sampling,CSL,TL, AugOfficial
On model calibration for long-tailed object detection and instance segmentationNeurIPS2021LAOfficial
Label-imbalanced and group-sensitive classification under overparameterizationNeurIPS2021LA
Towards calibrated model for long-tailed visual recognition from prior perspectiveNeurIPS2021Aug, RLOfficial
Supercharging imbalanced data learning with energy-based contrastive representation transferNeurIPS2021Aug, TL, RLOfficial
VideoLT: Large-scale long-tailed video recognitionICCV2021SamplingOfficial
Exploring classification equilibrium in long-tailed object detectionICCV2021Sampling,CSLOfficial
GistNet: a geometric structure transfer network for long-tailed recognitionICCV2021Sampling,TL, DC
FASA: Feature augmentation and sampling adaptation for long-tailed instance segmentationICCV2021Sampling,CSL
ACE: Ally complementary experts for solving long-tailed recognition in one-shotICCV2021Sampling,EnsembleOfficial
Influence-Balanced Loss for Imbalanced Visual ClassificationICCV2021CSLOfficial
Re-distributing biased pseudo labels for semi-supervised semantic segmentation: A baseline investigationICCV2021TLOfficial
Self supervision to distillation for long-tailed visual recognitionICCV2021TLOfficial
Distilling virtual examples for long-tailed recognitionICCV2021TL
MosaicOS: A simple and effective use of object-centric images for long-tailed object detectionICCV2021TLOfficial
Parametric contrastive learningICCV2021RLOfficial
Distributional robustness loss for long-tail learningICCV2021RLOfficial
Learning of visual relations: The devil is in the tailsICCV2021DT
Image-Level or Object-Level? A Tale of Two Resampling Strategies for Long-Tailed DetectionICML2021SamplingOfficial
Self-Damaging Contrastive LearningICML2021TL,RLOfficial
Delving into deep imbalanced regressionICML2021OtherOfficial
Long-tailed multi-label visual recognition by collaborative training on uniform and re-balanced samplingsCVPR2021Sampling,Ensemble
Equalization loss v2: A new gradient balance approach for long-tailed object detectionCVPR2021CSLOfficial
Seesaw loss for long-tailed instance segmentationCVPR2021CSLOfficial
Adaptive class suppression loss for long-tail object detectionCVPR2021CSLOfficial
PML: Progressive margin loss for long-tailed age classificationCVPR2021CSL
Disentangling label distribution for long-tailed visual recognitionCVPR2021CSL,LAOfficial
Adversarial robustness under long-tailed distributionCVPR2021CSL,LA,CDOfficial
Distribution alignment: A unified framework for long-tail visual recognitionCVPR2021CSL,LA,DTOfficial
Improving calibration for long-tailed recognitionCVPR2021CSL,Aug,DTOfficial
CReST: A class-rebalancing self-training framework for imbalanced semi-supervised learningCVPR2021TLOfficial
Conceptual 12M: Pushing web-scale image-text pre-training to recognize long-tail visual conceptsCVPR2021TLOfficial
RSG: A simple but effective module for learning imbalanced datasetsCVPR2021TL,AugOfficial
MetaSAug: Meta semantic augmentation for long-tailed visual recognitionCVPR2021AugOfficial
Contrastive learning based hybrid networks for long-tailed image classificationCVPR2021RL
Unsupervised discovery of the long-tail in instance segmentation using hierarchical self-supervisionCVPR2021RL
Long-tail learning via logit adjustmentICLR2021LAOfficial
Long-tailed recognition by routing diverse distribution-aware expertsICLR2021TL,EnsembleOfficial
Exploring balanced feature spaces for representation learningICLR2021RL,DT

2020

TitleVenueYearTypeCode
Balanced meta-softmax for long-taield visual recognitionNeurIPS2020Sampling,CSLOfficial
Posterior recalibration for imbalanced datasetsNeurIPS2020LAOfficial
Long-tailed classification by keeping the good and removing the bad momentum causal effectNeurIPS2020LA,CDOfficial
Rethinking the value of labels for improving classimbalanced learningNeurIPS2020TL,RLOfficial
The devil is in classification: A simple framework for long-tail instance segmentationECCV2020Sampling,DT,EnsembleOfficial
Imbalanced continual learning with partitioning reservoir samplingECCV2020SamplingOfficial
Distribution-balanced loss for multi-label classification in long-tailed datasetsECCV2020CSLOfficial
Feature space augmentation for long-tailed dataECCV2020TL,Aug,DT
Learning from multiple experts: Self-paced knowledge distillation for long-tailed classificationECCV2020TL,EnsembleOfficial
Solving long-tailed recognition with deep realistic taxonomic classifierECCV2020CDOfficial
Learning to segment the tailCVPR2020Sampling,TLOfficial
BBN: Bilateral-branch network with cumulative learning for long-tailed visual recognitionCVPR2020Sampling,EnsembleOfficial
Overcoming classifier imbalance for long-tail object detection with balanced group softmaxCVPR2020Sampling,EnsembleOfficial
Rethinking class-balanced methods for long-tailed visual recognition from a domain adaptation perspectiveCVPR2020CSLOfficial
Equalization loss for long-tailed object recognitionCVPR2020CSLOfficial
Domain balancing: Face recognition on long-tailed domainsCVPR2020CSL
M2m: Imbalanced classification via majorto-minor translationCVPR2020TL,AugOfficial
Deep representation learning on long-tailed data: A learnable embedding augmentation perspectiveCVPR2020TL,Aug,RL
Inflated episodic memory with region self-attention for long-tailed visual recognitionCVPR2020RL
Decoupling representation and classifier for long-tailed recognitionICLR2020Sampling,CSL,RL,CD,DTOfficial

2019

TitleVenueYearTypeCode
Meta-weight-net: Learning an explicit mapping for sample weightingNeurIPS2019CSLOfficial
Learning imbalanced datasets with label-distribution-aware margin lossNeurIPS2019CSLOfficial
Dynamic curriculum learning for imbalanced data classificationICCV2019Sampling
Class-balanced loss based on effective number of samplesCVPR2019CSLOfficial
Striking the right balance with uncertaintyCVPR2019CSL
Feature transfer learning for face recognition with under-represented dataCVPR2019TL,Aug
Unequal-training for deep face recognition with long-tailed noisy dataCVPR2019RLOfficial
Large-scale long-tailed recognition in an open worldCVPR2019RLOfficial

2018

TitleVenueYearTypeCode
Large scale fine-grained categorization and domain-specific transfer learningCVPR2018TLOfficial

2017

TitleVenueYearTypeCode
Learning to model the tailNeurIPS2017CSL
Focal loss for dense object detectionICCV2017CSL
Range loss for deep face recognition with long-tailed training dataICCV2017RL
Class rectification hard mining for imbalanced deep learningICCV2017RL

2016

TitleVenueYearTypeCode
Learning deep representation for imbalanced classificationCVPR2016Sampling,RL
Factors in finetuning deep model for object detection with long-tail distributionCVPR2016CSL,RL

3. Benchmark Datasets

DatasetLong-tailed Task# Class# Training data# Test data
ImageNet-LTClassification1,000115,84650,000
CIFAR100-LTClassification10050,00010,000
Places-LTClassification36562,50036,500
iNaturalist 2018Classification8,142437,51324,426
LVIS v0.5Detection and Segmentation1,23057,00020,000
LVIS v1Detection and Segmentation1,203100,00019,800
VOC-LTMulti-label Classification201,1424,952
COCO-LTMulti-label Classification801,9095,000
VideoLTVideo Classification1,004179,35225,622

4. Our codebase

  • To use our codebase, please install requirements:
    pip install -r requirements.txt
    
  • Hardware requirements: 4 GPUs with >= 23G GPU RAM are recommended.
  • ImageNet-LT dataset: please download ImageNet-1K dataset, and put it to the ./data file.
    data
    └──ImageNet
        ├── train
        └── val
    
  • Softmax:
    cd ./Main-codebase 
    Training: python3 main.py --seed 1 --cfg config/ImageNet_LT/ce.yaml  --exp_name imagenet/CE  --gpu 0,1,2,3 
    
  • Weighted Softmax:
    cd ./Main-codebase 
    Training: python3 main.py --seed 1 --cfg config/ImageNet_LT/weighted_ce.yaml  --exp_name imagenet/weighted_ce  --gpu 0,1,2,3
    
  • ESQL (Equalization loss):
    cd ./Main-codebase 
    Training: python3 main.py --seed 1 --cfg config/ImageNet_LT/seql.yaml  --exp_name imagenet/seql  --gpu 0,1,2,3
    
  • Balanced Softmax:
    cd ./Main-codebase 
    Training: python3 main.py --seed 1 --cfg config/ImageNet_LT/balanced_softmax.yaml  --exp_name imagenet/BS  --gpu 0,1,2,3
    
  • LADE:
    cd ./Main-codebase 
    Training: python3 main.py --seed 1 --cfg config/ImageNet_LT/lade.yaml  --exp_name imagenet/LADE  --gpu 0,1,2,3
    
  • De-confound (Casual):
    cd ./Main-codebase 
    Training: python3 main.py --seed 1 --cfg config/ImageNet_LT/causal.yaml  --exp_name imagenet/causal --remine_lambda 0.1 --alpha 0.005 --gpu 0,1,2,3
    
  • Decouple (IB-CRT):
    cd ./Main-codebase 
    Training stage 1: python3 main.py --seed 1 --cfg config/ImageNet_LT/ce.yaml  --exp_name imagenet/CE  --gpu 0,1,2,3 
    Training stage 2: python3  main.py --cfg ./config/ImageNet_LT/cls_crt.yaml --model_dir exp_results/imagenet/CE/final_model_checkpoint.pth  --gpu 0,1,2,3 
    
  • MiSLAS:
    cd ./MiSLAS-codebase
    Training stage 1: CUDA_VISIBLE_DEVICES=0,1,2,3 python3 train_stage1.py --cfg config/imagenet/imagenet_resnext50_stage1_mixup.yaml
    Training stage 2: CUDA_VISIBLE_DEVICES=0,1,2,3 python3 train_stage2.py --cfg config/imagenet/imagenet_resnext50_stage2_mislas.yaml resume checkpoint_path
    Evalutation: CUDA_VISIBLE_DEVICES=0  python3 eval.py --cfg ./config/imagenet/imagenet_resnext50_stage2_mislas.yaml  resume checkpoint_path_stage2
    
  • RSG:
    cd ./RSG-codebase
    Training: python3 imagenet_lt_train.py 
    Evalutation: python3 imagenet_lt_test.py 
    
  • ResLT:
    cd ./ResLT-codebase
    Training: CUDA_VISIBLE_DEVICES=0,1,2,3 bash sh/X50.sh
    Evalutation: CUDA_VISIBLE_DEVICES=0 bash sh/X50_eval.sh
    # The test performance can be found in the log file.
    
  • PaCo:
    cd ./PaCo-codebase
    Training: CUDA_VISIBLE_DEVICES=0,1,2,3 bash sh/ImageNetLT_train_X50.sh
    Evalutation: CUDA_VISIBLE_DEVICES=0 bash sh/ImageNetLT_eval_X50.sh
    # The test performance can be found in the log file.
    
  • LDAM:
    cd ./Ensemble-codebase 
    Training: CUDA_VISIBLE_DEVICES=0,1,2,3 python3 train.py -c ./configs/config_imagenet_lt_resnext50_ldam.json
    Evalutation: CUDA_VISIBLE_DEVICES=0 python3 test.py -r checkpoint_path
    
  • RIDE:
    cd ./Ensemble-codebase 
    Training: CUDA_VISIBLE_DEVICES=0,1,2,3 python3 train.py -c ./configs/config_imagenet_lt_resnext50_ride.json
    Evalutation: CUDA_VISIBLE_DEVICES=0 python3 test.py -r checkpoint_path
    
  • SADE:
    cd ./Ensemble-codebase 
    Training: CUDA_VISIBLE_DEVICES=0,1,2,3 python3 train.py -c ./configs/config_imagenet_lt_resnext50_sade.json
    Evalutation: CUDA_VISIBLE_DEVICES=0 python3 test.py -r checkpoint_path
    

5. Empirical Studies

(1) Long-tailed benchmarking performance

  • We evaluate several state-of-the-art methods on ImageNet-LT to see to what extent they handle class imbalance via new evaluation metrics, i.e., UA (upper bound accuracy) and RA (relative accuracy). We categorize these methods based on class re-balancing (CR), information augmentation (IA) and module improvement (MI).

  • Almost all long-tailed methods perform better than the Softmax baseline in terms of accuracy, which demonstrates the effectiveness of long-tailed learning.
  • Training with 200 epochs leads to better performance for most long-tailed methods, since sufficient training enables deep models to fit data better and learn better image representations.
  • In addition to accuracy, we also evaluate long-tailed methods based on UA and RA. For the methods that have higher UA, the performance gain comes not only from the alleviation of class imbalance, but also from other factors, like data augmentation or better network architectures. Therefore, simply using accuracy for evaluation is not accurate enough, while our proposed RA metric provides a good complement, since it alleviates the influences of factors apart from class imbalance.
  • For example, MiSLAS, based on data mixup, has higher accuracy than Balanced Sofmtax under 90 training epochs, but it also has higher UA. As a result, the relative accuracy of MiSLAS is lower than Balanced Sofmtax, which means that Balanced Sofmtax alleviates class imbalance better than MiSLAS under 90 training epochs.
  • Although some recent high-accuracy methods have lower RA, the overall development trend of long-tailed learning is still positive, as shown in the below figure.

  • The current state-of-the-art long-tailed method in terms of both accuracy and RA is SADE (ensemble-based method).

(2) More discussions on cost-sensitive losses

  • We further evaluate the performance of different cost-sensitive learning losses based on the decoupled training scheme.
  • Decoupled training, compared to joint training, can further improve the overall performance of most cost-sensitive learning methods apart from balanced softmax (BS).
  • Although BS outperofmrs other cost-sensitive losses under one-stage training, they perform comparably under decoupled training. This implies that although these cost-sensitive losses perform differently under joint training, they essentially learn similar quality of feature representations.

5. Citation

If this repository is helpful to you, please cite our survey.

@article{zhang2023deep,
      title={Deep long-tailed learning: A survey},
      author={Zhang, Yifan and Kang, Bingyi and Hooi, Bryan and Yan, Shuicheng and Feng, Jiashi},
      journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
      year={2023},
      publisher={IEEE}
}

5. Other Resources