Tackling the Problem of Data Imbalancing for Melanoma Classification

Malignant melanoma is the most dangerous type of skin cancer, yet melanoma is the most treatable kind of cancer when diagnosed at an early stage. In this regard, Computer-Aided Diagnosis systems based on machine learning have been developed to discern melanoma lesions from benign and dysplastic nevi in dermoscopic images. Similar to a large range of real world applications encountered in machine learning, melanoma classification faces the challenge of imbalanced data, where the percentage of melanoma cases in comparison with benign and dysplastic cases is far less. This article analyzes the impact of data balancing strategies at the training step. Subsequently, Over-Sampling (OS) and Under-Sampling (US) are extensively compared in both feature and data space, revealing that NearMiss-2 (NM2) outperform other methods achieving Sensitivity (SE) and Specificity (SP) of 91.2% and 81.7%, respectively. More generally, the reported results highlight that methods based on US or combination of OS and US in feature space outperform the others.

Mots clés

IMBALANCED CLASSIFICATION MELANOMA DERMOSCOPY

Domaines

Traitement du signal et de l'image [eess.SP]

Fichier principal

master(5).pdf (1.78 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Guillaume Lemaitre : Connectez-vous pour contacter le contributeur

https://u-bourgogne.hal.science/hal-01250949

Soumis le : mardi 5 janvier 2016-14:29:26

Dernière modification le : jeudi 7 septembre 2023-16:08:10

Archivage à long terme le : jeudi 7 avril 2016-15:23:42

Dates et versions

hal-01250949 , version 1 (05-01-2016)

Identifiants

HAL Id : hal-01250949 , version 1

Citer

Mojdeh Rastgoo, Guillaume Lemaître, Joan Massich, Olivier Morel, Franck Marzani, et al.. Tackling the Problem of Data Imbalancing for Melanoma Classification. Bioimaging, Feb 2016, Rome, Italy. ⟨hal-01250949⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-BOURGOGNE CNRS LE2I IMVIA CORES VIBOT HESAM IRENAV LAMPA LCPI LABOMAP LISPEN MSMP

286 Consultations

449 Téléchargements