Computer Science > Computer Vision and Pattern Recognition

arXiv:2207.13378 (cs)

[Submitted on 27 Jul 2022 (v1), last revised 31 Mar 2023 (this version, v2)]

Title:Identifying Hard Noise in Long-Tailed Sample Distribution

Authors:Xuanyu Yi, Kaihua Tang, Xian-Sheng Hua, Joo-Hwee Lim, Hanwang Zhang

Abstract: Conventional de-noising methods rely on the assumption that all samples are independent and identically distributed, so the resultant classifier, though disturbed by noise, can still easily identify the noises as outliers of the training distribution. However, this assumption is unrealistic for large-scale data, which is inevitably long-tailed. Such imbalanced training data makes a classifier less discriminative for the tail classes, whose previously "easy" noises are now turned into "hard" ones: they are almost as much outliers as the clean tail samples. We introduce this new challenge as Noisy Long-Tailed Classification (NLT). Not surprisingly, we find that most de-noising methods fail to identify the hard noises, resulting in a significant performance drop on the three proposed NLT benchmarks: ImageNet-NLT, Animal10-NLT, and Food101-NLT. To this end, we design an iterative noisy learning framework called Hard-to-Easy (H2E). Our bootstrapping philosophy is to first learn a classifier that serves as a noise identifier invariant to class and context distributional changes, reducing "hard" noises to "easy" ones, whose removal further improves the invariance. Experimental results show that H2E outperforms state-of-the-art de-noising methods and their ablations in long-tailed settings while maintaining stable performance in conventional balanced settings. Datasets and code are available at this https URL
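To make the NLT setting concrete, the following is a minimal sketch of how a noisy long-tailed label set could be simulated. It is not the authors' benchmark construction or the H2E method: the exponential class-size profile, the symmetric (uniform) label flips, and all function names and parameters (`long_tailed_counts`, `make_noisy_long_tailed_labels`, `noise_share_per_observed_class`, `imbalance_ratio`, `noise_rate`) are assumptions chosen purely for illustration.

```python
import random
from collections import Counter


def long_tailed_counts(num_classes, max_count, imbalance_ratio):
    """Hypothetical profile: class sizes decay exponentially from
    max_count (head) down to max_count / imbalance_ratio (tail)."""
    if num_classes == 1:
        return [max_count]
    counts = []
    for c in range(num_classes):
        frac = c / (num_classes - 1)
        counts.append(max(1, round(max_count * imbalance_ratio ** (-frac))))
    return counts


def make_noisy_long_tailed_labels(counts, noise_rate, seed=0):
    """Return (true_labels, observed_labels). Each label is flipped with
    probability noise_rate to a different class chosen uniformly
    (symmetric noise, an assumption of this sketch)."""
    rng = random.Random(seed)
    num_classes = len(counts)
    true_labels = [c for c, n in enumerate(counts) for _ in range(n)]
    observed = []
    for y in true_labels:
        if rng.random() < noise_rate:
            wrong = rng.randrange(num_classes - 1)
            # Skip over the true class so the flipped label always differs.
            observed.append(wrong if wrong < y else wrong + 1)
        else:
            observed.append(y)
    return true_labels, observed


def noise_share_per_observed_class(true_labels, observed, num_classes):
    """Fraction of samples carrying each observed label whose true label differs."""
    total = Counter(observed)
    wrong = Counter(o for t, o in zip(true_labels, observed) if t != o)
    return [wrong[c] / total[c] if total[c] else 0.0 for c in range(num_classes)]


if __name__ == "__main__":
    counts = long_tailed_counts(num_classes=10, max_count=1000, imbalance_ratio=100)
    true_labels, observed = make_noisy_long_tailed_labels(counts, noise_rate=0.2)
    shares = noise_share_per_observed_class(true_labels, observed, len(counts))
    for c, (n, s) in enumerate(zip(counts, shares)):
        print(f"class {c}: clean size {n}, noisy share among observed labels {s:.2f}")
```

Under these assumptions, the tail classes end up with a much larger share of mislabeled samples than the head classes, because flips from the large head classes land on the small tail classes. This gives a rough intuition for why clean tail samples and noisy ones become difficult to tell apart, although the abstract attributes the difficulty to the classifier being less discriminative on tail classes.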

Comments:	Accepted to ECCV 2022 (Oral); datasets and code are available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2207.13378 [cs.CV]
	(or arXiv:2207.13378v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2207.13378

Submission history

From: Xuanyu Yi
[v1] Wed, 27 Jul 2022 09:03:03 UTC (1,661 KB)
[v2] Fri, 31 Mar 2023 07:03:13 UTC (1,661 KB)
