Skip to main content
Have a personal or library account? Click to login
imbalanced-learn-extra: A Python Package for Novel Oversampling Algorithms Cover

imbalanced-learn-extra: A Python Package for Novel Oversampling Algorithms

Open Access
|Jan 2026

Abstract

Learning from imbalanced data is a common challenge in supervised learning, as most classifiers assume balanced class distributions. Among the strategies to mitigate this issue, oversampling algorithms offer a flexible and model-agnostic solution by generating synthetic samples for minority classes. In this paper, we introduce the imbalanced-learn-extra Python library, an open-source extension of the imbalanced-learn ecosystem that provides additional oversampling techniques for research and practical use. The library integrates seamlessly with Scikit-Learn, allowing users to easily incorporate it into existing workflows. It implements Geometric SMOTE, a geometrically enhanced drop-in replacement for the original SMOTE algorithm, and clustering-based oversampling methods such as KMeans-SMOTE and G-SOMO, which combine existing imbalanced-learn oversamplers with Scikit-Learn clustering algorithms to address within-class imbalances. Rather than re-assessing the performance of these algorithms, which has already been thoroughly evaluated in prior studies, this paper focuses on their software design, implementation, and practical use within a unified framework.

DOI: https://doi.org/10.5334/jors.459 | Journal eISSN: 2049-9647
Language: English
Submitted on: Feb 26, 2023
Accepted on: Nov 24, 2025
Published on: Jan 19, 2026
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2026 Georgios Douzas, Fernando Bacao, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.