Mostrar el registro sencillo del ítem

Artículo

dc.creatorBenati, Stefanoes
dc.creatorGarcía Quiles, Sergioes
dc.creatorPuerto Albandoz, Justoes
dc.date.accessioned2018-01-31T11:20:53Z
dc.date.available2018-01-31T11:20:53Z
dc.date.issued2018
dc.identifier.citationBenati, S., García Quiles, S. y Puerto Albandoz, J. (2018). Mixed integer linear programming and heuristic methods for feature selection in clustering. Journal of the Operational Research Society, 1-17.
dc.identifier.issn0160-5682es
dc.identifier.issn1476-9360es
dc.identifier.urihttps://hdl.handle.net/11441/69808
dc.description.abstractThis paper studies the problem of selecting relevant features in clustering problems, out of a data set in which many features are useless, or masking. The data set comprises a set U of units, a set V of features, a set R of (tentative) cluster centres and distances dijk for every i ∈ U, k ∈ R, j ∈ V . The feature selection problem consists of finding a subset of features Q ⊆ V such that the total sum of the distances from the units to the closest centre is minimized. This is a combinatorial optimization problem that we show to be NP-complete, and we propose two mixed integer linear programming formulations to calculate the solution. Some computational experiments show that if clusters are well separated and the relevant features are easy to detect, then both formulations can solve problems with many integer variables. Conversely, if clusters overlap and relevant features are ambiguous, then even small problems are unsolved. To overcome this difficulty, we propose two heuristic methods to find that, most of the time, one of them, called q-vars, calculates the optimal solution quickly. Then, the q-vars heuristic is combined with the k-means algorithm to cluster some simulated data. We conclude that this approach outperforms other methods for clustering with variable selection that were proposed in the literature.es
dc.description.sponsorshipMinisterio de Economía y Competitividades
dc.description.sponsorshipFundación Sénecaes
dc.description.sponsorshipMinistero dell’Istruzione, dell’Universitá e della Ricercaes
dc.formatapplication/pdfes
dc.language.isoenges
dc.publisherTaylor & Francises
dc.relation.ispartofJournal of the Operational Research Society, 1-17.
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internacional*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subjectInteger linear programminges
dc.subjectHeuristicses
dc.subjectq-varses
dc.subjectCluster analysises
dc.subjectp-median problemes
dc.titleMixed integer linear programming and heuristic methods for feature selection in clusteringes
dc.typeinfo:eu-repo/semantics/articlees
dcterms.identifierhttps://ror.org/03yxnpp24
dc.type.versioninfo:eu-repo/semantics/submittedVersiones
dc.rights.accessRightsinfo:eu-repo/semantics/openAccesses
dc.contributor.affiliationUniversidad de Sevilla. Departamento de Estadística e Investigación Operativaes
dc.relation.projectIDMTM2013-46962-C02-01es
dc.relation.projectIDMTM2016-74983-C02-01es
dc.relation.projectID19320/PI/14es
dc.relation.publisherversionhttp://www.tandfonline.com/doi/pdf/10.1080/01605682.2017.1398206?needAccess=truees
dc.identifier.doi10.1080/01605682.2017.1398206es
dc.contributor.groupUniversidad de Sevilla. FQM331: Métodos y Modelos de la Estadística y la Investigación Operativaes
idus.format.extent34 p.es
dc.journaltitleJournal of the Operational Research Societyes
dc.publication.initialPage1es
dc.publication.endPage17es

FicherosTamañoFormatoVerDescripción
Mixed integer linear programming ...274.4KbIcon   [PDF] Ver/Abrir  

Este registro aparece en las siguientes colecciones

Mostrar el registro sencillo del ítem

Attribution-NonCommercial-NoDerivatives 4.0 Internacional
Excepto si se señala otra cosa, la licencia del ítem se describe como: Attribution-NonCommercial-NoDerivatives 4.0 Internacional