dc.creator | Rodríguez, Daniel | es |
dc.creator | Herraiz, Israel | es |
dc.creator | Harrison, Rachel | es |
dc.creator | Dolado, Javier | es |
dc.creator | Riquelme Santos, José Cristóbal | es |
dc.date.accessioned | 2016-06-24T10:52:23Z | |
dc.date.available | 2016-06-24T10:52:23Z | |
dc.date.issued | 2014 | |
dc.identifier.citation | Rodríguez, D., Herraiz, I., Harrison, R., Dolado, J. y Riquelme Santos, J.C. (2014). Preliminary Comparison of Techniques for Dealing with Imbalance in Software Defect Prediction. En 18th International Conference on Evaluation and Assessment in Software Engineering, EASE'14 (43-1-43-10), London: ACM. | |
dc.identifier.isbn | 978-1-4503-2476-2 | es |
dc.identifier.uri | http://hdl.handle.net/11441/42731 | |
dc.description.abstract | Imbalanced data is a common problem in data mining when
dealing with classi cation problems, where samples of a class
vastly outnumber other classes. In this situation, many data
mining algorithms generate poor models as they try to opti-
mize the overall accuracy and perform badly in classes with
very few samples. Software Engineering data in general and
defect prediction datasets are not an exception and in this
paper, we compare different approaches, namely sampling,
cost-sensitive, ensemble and hybrid approaches to the prob-
lem of defect prediction with different datasets preprocessed
differently. We have used the well-known NASA datasets
curated by Shepperd et al. There are differences in the re-
sults depending on the characteristics of the dataset and the
evaluation metrics, especially if duplicates and inconsisten-
cies are removed as a preprocessing step. | es |
dc.description.sponsorship | Unión Europea ICEBERG 324356 | es |
dc.description.sponsorship | MICYT TIN2007- 68084-C02-02 | es |
dc.description.sponsorship | MICYT TIN2013-46928-C3-2-R | es |
dc.format | application/pdf | es |
dc.language.iso | eng | es |
dc.publisher | ACM | es |
dc.relation.ispartof | 18th International Conference on Evaluation and Assessment in Software Engineering, EASE'14 (2014), 43-1-43-10 | |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 Internacional | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.subject | Defect Prediction | es |
dc.subject | Imbalanced data | es |
dc.subject | Data Quality | es |
dc.title | Preliminary Comparison of Techniques for Dealing with Imbalance in Software Defect Prediction | es |
dc.type | info:eu-repo/semantics/conferenceObject | es |
dcterms.identifier | https://ror.org/03yxnpp24 | |
dc.type.version | info:eu-repo/semantics/acceptedVersion | es |
dc.rights.accessRights | info:eu-repo/semantics/openAccess | es |
dc.contributor.affiliation | Universidad de Sevilla. Departamento de Lenguajes y Sistemas Informáticos | es |
dc.relation.projectID | ICEBERG 324356 | es |
dc.relation.projectID | TIN2007- 68084-C02-02 | es |
dc.relation.projectID | TIN2013-46928-C3-2-R | es |
dc.identifier.doi | http://dx.doi.org/10.1145/2601248.2601294 | es |
idus.format.extent | 10 | es |
dc.publication.initialPage | 43-1 | es |
dc.publication.endPage | 43-10 | es |
dc.eventtitle | 18th International Conference on Evaluation and Assessment in Software Engineering, EASE'14 | es |
dc.eventinstitution | London | es |
dc.relation.publicationplace | New York | es |
dc.identifier.idus | https://idus.us.es/xmlui/handle/11441/42731 | |