Medical image classification and diagnosis based on machine learning has made significant achievements and gradually penetrated the healthcare industry. However, medical data characteristics such as relatively small datasets for rare diseases or imbalance in class distribution for rare conditions significantly restrains their adoption and reuse. Imbalanced datasets lead to difficulties in learning and obtaining accurate predictive models. This paper follows the FAIR paradigm and proposes a technique for the alignment of class distribution, which enables improving image classification performance in imbalanced data and ensuring data reuse. The experiments on the acne disease dataset support that the proposed framework outperforms the baselines and enable to achieve up to 5% improvement in image classification.
Biloborodova, TetianaSkarga-Bandurova, Inna Koverha, MarkSkarha-Bandurov, IlliaYevsieieva, Yelyzaveta
School of Engineering, Computing and Mathematics
Year of publication: 2021Date of RADAR deposit: 2022-07-01