Martín, José DavidPontes Balanza, BeatrizRiquelme Santos, José Cristóbal2025-03-192025-03-192023-07-24Martín, J.D., Pontes Balanza, B. y Riquelme Santos, J.C. (2023). Simultaneous Evolutionary Optimization of Features Subset and Clusters Number. En GECCO '23 Companion: Proceedings of the Companion Conference on Genetic and Evolutionary Computation (307-310), Lisboa, Portugal: ACM.https://hdl.handle.net/11441/170555Cluster analysis is a popular technique used to identify patterns in data mining. However, evaluating the accuracy of a clustering task is a challenging process which remains to be an open issue. In this work, we focus on two factors that significantly influence clustering performance: the optimal number of clusters and the subset of relevant attributes. While the former has been extensively studied, the latter has received comparatively less attention, especially in relation to its equivalent in supervised learning. Despite their clear interdependence, these factors have rarely been studied together. In this context, we propose an evolutionary algorithm that simultaneously optimizes both factors using ad-hoc variations of internal validation indices as a fitness function.application/pdf4 p.engAttribution-NonCommercial-NoDerivatives 4.0 Internationalhttp://creativecommons.org/licenses/by-nc-nd/4.0/Feature SelectionClusteringGenetic AlgorithmSimultaneous Evolutionary Optimization of Features Subset and Clusters Numberinfo:eu-repo/semantics/conferenceObjectinfo:eu-repo/semantics/openAccesshttps://doi.org/10.1145/3583133.3590603