Mostrar el registro sencillo del ítem

Artículo

dc.creatorCarrizosa Priego, Emilio Josées
dc.creatorOlivares Nadal, Alba Victoriaes
dc.creatorRamírez Cobo, Josefaes
dc.date.accessioned2021-02-01T12:31:23Z
dc.date.available2021-02-01T12:31:23Z
dc.date.issued2020-04-01
dc.identifier.citationCarrizosa Priego, E.J., Olivares Nadal, A.V. y Ramírez Cobo, J. (2020). Integer constraints for enhancing interpretability in linear regression. SORT. Statistics and Operations Research Transactions, 44 (1), 67-98.
dc.identifier.issn2013-8830es
dc.identifier.urihttps://hdl.handle.net/11441/104390
dc.description.abstractOne of the main challenges researchers face is to identify the most relevant features in a prediction model. As a consequence, many regularized methods seeking sparsity have flourished. Although sparse, their solutions may not be interpretable in the presence of spurious coefficients and correlated features. In this paper we aim to enhance interpretability in linear regression in presence of multicollinearity by: (i) forcing the sign of the estimated coefficients to be consistent with the sign of the correlations between predictors, and (ii) avoiding spurious coefficients so that only significant features are represented in the model. This will be addressed by modelling constraints and adding them to an optimization problem expressing some estimation procedure such as ordinary least squares or the lasso. The so-obtained constrained regression models will become Mixed Integer Quadratic Problems. The numerical experiments carried out on real and simulated datasets show that tightening the search space of some standard linear regression models by adding the constraints modelling (i) and/or (ii) help to improve the sparsity and interpretability of the solutions with competitive predictive quality.es
dc.formatapplication/pdfes
dc.format.extent28 p.es
dc.language.isoenges
dc.publisherInstitut d´Estadística de Catalunya (Idescat)es
dc.relation.ispartofSORT. Statistics and Operations Research Transactions, 44 (1), 67-98.
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internacional*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subjectLinear regressiones
dc.subjectMulticollinearityes
dc.subjectSparsityes
dc.subjectCardinality constraintes
dc.subjectMixed Integer Non Linear Programminges
dc.titleInteger constraints for enhancing interpretability in linear regressiones
dc.typeinfo:eu-repo/semantics/articlees
dcterms.identifierhttps://ror.org/03yxnpp24
dc.type.versioninfo:eu-repo/semantics/publishedVersiones
dc.rights.accessRightsinfo:eu-repo/semantics/openAccesses
dc.contributor.affiliationUniversidad de Sevilla. Departamento de Estadística e Investigación Operativaes
dc.relation.publisherversionhttps://doi.org/10.2436/20.8080.02.95es
dc.identifier.doi10.2436/20.8080.02.95es
dc.journaltitleSORT. Statistics and Operations Research Transactionses
dc.publication.volumen44es
dc.publication.issue1es
dc.publication.initialPage67es
dc.publication.endPage98es

FicherosTamañoFormatoVerDescripción
Integer constraints for enhancing ...463.1KbIcon   [PDF] Ver/Abrir  

Este registro aparece en las siguientes colecciones

Mostrar el registro sencillo del ítem

Attribution-NonCommercial-NoDerivatives 4.0 Internacional
Excepto si se señala otra cosa, la licencia del ítem se describe como: Attribution-NonCommercial-NoDerivatives 4.0 Internacional