Comparative Analysis of SMOTE and ROSE Oversampling Techniques for kNN-Based Autonomous Vehicle Behavior Modeling
Abstract
In this paper, we present a comparative analysis of Synthetic Minority Oversampling TEchnique (SMOTE) and Random OverSampling Examples (ROSE) oversampling techniques for K Nearest Neigbhors KNN-based autonomous vehicle behavior modeling. We address the challenges posed by imbalanced and mixed datasets in the context of autonomous vehicle testing, where the majority of test outcomes are classified as ”OK” (safe) and fewer as ”KO” (unsafe). We propose an enhanced approach that extends our previous work by incorporating ROSE as an alternative to SMOTE for generating synthetic samples. We integrate these resampling techniques with Leave- One-Out Cross-Validation (LOO-CV), applying resampling at each iteration to ensure data balancing is tailored to each training set. Additionally, we investigate the impact of different encoding strategies for categorical variables, including OneHot, binary encoding, and Factor Analysis of Mixed Data (FAMD). Our research aims to develop a robust classification model capable of accurately predicting autonomous vehicle behavior while effectively managing class imbalance and mixed data types, despite the limited availability of data due to costly and timeconsuming testing procedures.
Keywords
IA, Autonomous Vehicles, SMOTE, ROSE
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
C. Serbouh Touazi, I. Ahriz, N. Niang and A. Piperno, "Comparative Analysis of SMOTE and ROSE Oversampling Techniques for kNN-Based Autonomous Vehicle Behavior Modeling," in Journal of Communications Software and Systems, vol. 21, no. 2, pp. 132-143, April 2025, doi: https://doi.org/10.24138/jcomss-2024-0121
@article{serbouh-touazi2025comparativeanalysis, author = {Celine Serbouh Touazi and Iness Ahriz and Ndeye Niang and Alain Piperno}, title = {Comparative Analysis of SMOTE and ROSE Oversampling Techniques for kNN-Based Autonomous Vehicle Behavior Modeling}, journal = {Journal of Communications Software and Systems}, month = {4}, year = {2025}, volume = {21}, number = {2}, pages = {132--143}, doi = {https://doi.org/10.24138/jcomss-2024-0121}, url = {https://doi.org/https://doi.org/10.24138/jcomss-2024-0121} }