Article

dc.creator: Jiménez Aguirre, Patricia [es]
dc.creator: Corchuelo Gil, Rafael [es]
dc.date.accessioned: 2022-04-11T07:41:48Z
dc.date.available: 2022-04-11T07:41:48Z
dc.date.issued: 2022
dc.identifier.citation: Jiménez Aguirre, P. and Corchuelo Gil, R. (2022). On validating web information extraction proposals. Expert Systems with Applications, 199 (August 2022, art. no. 116700)
dc.identifier.issn: 0957-4174 [es]
dc.identifier.uri: https://hdl.handle.net/11441/131997
dc.description.abstract: Many people who have to make informed decisions in today’s always-on culture use information extractors to feed their systems with information that comes from human-friendly documents. Unfortunately, many proposals that validate information extractors have deficiencies that make it difficult to perform homogeneous comparisons, confirm or refute performance hypotheses, or draw unbiased conclusions. Consequently, it is very difficult to select the best-performing proposal on a sound basis. The state-of-the-art validation method overcomes many deficiencies in the previous proposals, but still overlooks the following issues: completeness of the validation datasets, that is, whether they provide a complete set of annotations or not; structure of the information, that is, whether they check the structure of the record instances extracted or just the attribute instances; and, finally, how extractions and annotations are matched. The decisions made regarding the previous issues have an impact on the effectiveness results. In this article, we have exhaustively analysed the literature and we have also highlighted the main weaknesses to tackle. We present a guideline and a method to compute the effectiveness, which complements and enhances the state-of-the-art validation method. [es]
dc.description.sponsorship: Ministerio de Economía y Competitividad TIN2016-75394-R [es]
dc.description.sponsorship: Ministerio de Ciencia e Innovación PID2020-112540RB-C44 [es]
dc.description.sponsorship: Junta de Andalucía P18-RT-1060 [es]
dc.description.sponsorship: Junta de Andalucía US-1381375 [es]
dc.format: application/pdf [es]
dc.format.extent: 9 [es]
dc.language.iso: eng [es]
dc.publisher: Elsevier [es]
dc.relation.ispartof: Expert Systems with Applications, 199 (August 2022, art. no. 116700)
dc.rights: Attribution-NonCommercial-NoDerivatives 4.0 International [*]
dc.rights.uri: http://creativecommons.org/licenses/by-nc-nd/4.0/ [*]
dc.subject: Web information extraction [es]
dc.subject: Validation method [es]
dc.title: On validating web information extraction proposals [es]
dc.type: info:eu-repo/semantics/article [es]
dcterms.identifier: https://ror.org/03yxnpp24
dc.type.version: info:eu-repo/semantics/publishedVersion [es]
dc.rights.accessRights: info:eu-repo/semantics/openAccess [es]
dc.contributor.affiliation: Universidad de Sevilla. Departamento de Lenguajes y Sistemas Informáticos [es]
dc.relation.projectID: TIN2016-75394-R [es]
dc.relation.projectID: PID2020-112540RB-C44 [es]
dc.relation.projectID: P18-RT-1060 [es]
dc.relation.projectID: US-1381375 [es]
dc.relation.publisherversion: https://www.sciencedirect.com/science/article/pii/S0957417422001798?via%3Dihub [es]
dc.identifier.doi: 10.1016/j.eswa.2022.116700 [es]
dc.contributor.group: Universidad de Sevilla. TIC258: Data-centric Computing Research Hub [es]
dc.journaltitle: Expert Systems with Applications [es]
dc.publication.volumen: 199 [es]
dc.publication.issue: August 2022, art. no. 116700 [es]
dc.contributor.funder: Ministerio de Economía y Competitividad (MINECO). Spain [es]
dc.contributor.funder: Ministerio de Ciencia e Innovación (MICIN). Spain [es]
dc.contributor.funder: Junta de Andalucía [es]

Files
Name: 1-s2.0-S0957417422001798-main.pdf
Size: 570.6 Kb
Format: PDF



Except where otherwise noted, this item's license is described as: Attribution-NonCommercial-NoDerivatives 4.0 International