dc.creator | Luna Romera, José María | es |
dc.creator | Martínez Ballesteros, María del Mar | es |
dc.creator | García Gutiérrez, Jorge | es |
dc.creator | Riquelme Santos, José Cristóbal | es |
dc.date.accessioned | 2022-04-13T07:23:32Z | |
dc.date.available | 2022-04-13T07:23:32Z | |
dc.date.issued | 2019 | |
dc.identifier.citation | Luna Romera, J.M., Martínez Ballesteros, M.d.M., García Gutiérrez, J. y Riquelme Santos, J.C. (2019). External clustering validity index based on chi-squared statistical test. Information Sciences, 487 (June 2019), 1-17. | |
dc.identifier.issn | 0020-0255 | es |
dc.identifier.uri | https://hdl.handle.net/11441/132081 | |
dc.description.abstract | Clustering is one of the most commonly used techniques in data mining. Its main goal is
to group objects into clusters so that each group contains objects that are more similar to
each other than to objects in other clusters. The evaluation of a clustering solution is a task
carried out through the application of validity indices. These indices measure the quality
of the solution and can be classified as either internal that calculate the quality of the
solution through the data of the clusters, or as external indices that measure the quality
by means of external information such as the class. Generally, indices from the literature
determine their optimal result through graphical representation, whose results could be
imprecisely interpreted. The aim of this paper is to present a new external validity index
based on the chi-squared statistical test named Chi Index, which presents accurate results
that require no further interpretation. Chi Index was analyzed using the clustering results
of 3 clustering methods in 47 public datasets. Results indicate a better hit rate and a lower
percentage of error against 15 external validity indices from the literature. | es |
dc.description.sponsorship | Ministerio de Economía y Competitividad TIN2014-55894-C2-R | es |
dc.description.sponsorship | Ministerio de Economía y Competitividad TIN2017-88209-C2-2-R | es |
dc.format | application/pdf | es |
dc.format.extent | 17 | es |
dc.language.iso | eng | es |
dc.publisher | Elsevier | es |
dc.relation.ispartof | Information Sciences, 487 (June 2019), 1-17. | |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 Internacional | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.subject | Clustering Analysis | es |
dc.subject | External validity indices | es |
dc.subject | Comparing cluster | es |
dc.subject | Big Data | es |
dc.title | External clustering validity index based on chi-squared statistical test | es |
dc.type | info:eu-repo/semantics/article | es |
dcterms.identifier | https://ror.org/03yxnpp24 | |
dc.type.version | info:eu-repo/semantics/publishedVersion | es |
dc.rights.accessRights | info:eu-repo/semantics/openAccess | es |
dc.contributor.affiliation | Universidad de Sevilla. Departamento de Lenguajes y Sistemas Informáticos | es |
dc.relation.projectID | TIN2014-55894-C2-R | es |
dc.relation.projectID | TIN2017-88209-C2-2-R | es |
dc.relation.publisherversion | https://www.sciencedirect.com/science/article/pii/S0020025519301550 | es |
dc.identifier.doi | 10.1016/j.ins.2019.02.046 | es |
dc.contributor.group | Universidad de Sevilla. TIC-254: Data Science and Big Data Lab | es |
dc.journaltitle | Information Sciences | es |
dc.publication.volumen | 487 | es |
dc.publication.issue | June 2019 | es |
dc.publication.initialPage | 1 | es |
dc.publication.endPage | 17 | es |
dc.identifier.sisius | 21826603 | es |
dc.contributor.funder | Ministerio de Economía y Competitividad (MINECO). España | es |
dc.description.awardwinning | Premio Mensual Publicación Científica Destacada de la US. Escuela Técnica Superior de Ingeniería Informática | |