dc.creator | Sleiman, Hassan A. | es |
dc.creator | Corchuelo Gil, Rafael | es |
dc.date.accessioned | 2023-03-15T08:30:51Z | |
dc.date.available | 2023-03-15T08:30:51Z | |
dc.date.issued | 2013-09 | |
dc.identifier.citation | Sleiman, H.A. y Corchuelo Gil, R. (2013). A Survey on Region Extractors from Web Documents. IEEE Transactions on Knowledge and Data Engineering, 25 (9), 1960-1981. https://doi.org/10.1109/TKDE.2012.135. | |
dc.identifier.issn | 1041-4347 (impreso) | es |
dc.identifier.issn | 1558-2191 (online) | es |
dc.identifier.uri | https://hdl.handle.net/11441/143378 | |
dc.description.abstract | Extracting information from web documents has become a research area in which new proposals sprout out year after year. This has motivated several researchers to work on surveys that attempt to provide an overall picture of the many existing proposals. Unfortunately, none of these surveys provide a complete picture, because they do not take region extractors into account. These tools are kind of preprocessors, because they help information extractors focus on the regions of a web document that contain relevant information. With the increasing complexity of web documents, region extractors are becoming a must to extract information from many websites. Beyond information extraction, region extractors have also found their way into information retrieval, focused web crawling, topic distillation, adaptive content delivery, mashups, and metasearch engines. In this paper, we survey the existing proposals regarding region extractors and compare them side by side. | es |
dc.description.sponsorship | Ministerio de Educación y Ciencia TIN2007-64119 | es |
dc.description.sponsorship | Junta de Andalucía P07-TIC-2602 | es |
dc.description.sponsorship | Junta de Andalucía P08- TIC-4100 | es |
dc.description.sponsorship | Ministerio de Ciencia e Innovación TIN2008-04718-E | es |
dc.description.sponsorship | Ministerio de Ciencia e Innovación TIN2010-21744 | es |
dc.description.sponsorship | Ministerio de Economía, Industria y Competitividad TIN2010-09809-E | es |
dc.description.sponsorship | Ministerio de Ciencia e Innovación TIN2010-10811-E | es |
dc.description.sponsorship | Ministerio de Ciencia e Innovación TIN2010-09988-E | es |
dc.format | application/pdf | es |
dc.format.extent | 22 | es |
dc.language.iso | eng | es |
dc.publisher | IEEE | es |
dc.relation.ispartof | IEEE Transactions on Knowledge and Data Engineering, 25 (9), 1960-1981. | |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 Internacional | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.subject | Information extractors | es |
dc.subject | wrappers | es |
dc.subject | web documents | es |
dc.subject | region extractors | es |
dc.subject | enterprise information integration | es |
dc.title | A Survey on Region Extractors from Web Documents | es |
dc.type | info:eu-repo/semantics/article | es |
dcterms.identifier | https://ror.org/03yxnpp24 | |
dc.type.version | info:eu-repo/semantics/publishedVersion | es |
dc.rights.accessRights | info:eu-repo/semantics/openAccess | es |
dc.contributor.affiliation | Universidad de Sevilla. Departamento de Lenguajes y Sistemas Informáticos | es |
dc.relation.projectID | TIN2007-64119 | es |
dc.relation.projectID | P07-TIC-2602 | es |
dc.relation.projectID | P08- TIC-4100 | es |
dc.relation.projectID | TIN2008-04718-E | es |
dc.relation.projectID | TIN2010-21744 | es |
dc.relation.projectID | TIN2010-09809-E | es |
dc.relation.projectID | TIN2010-10811-E | es |
dc.relation.projectID | TIN2010-09988-E | es |
dc.relation.publisherversion | https://ieeexplore.ieee.org/abstract/document/6231632 | es |
dc.identifier.doi | 10.1109/TKDE.2012.135 | es |
dc.journaltitle | IEEE Transactions on Knowledge and Data Engineering | es |
dc.publication.volumen | 25 | es |
dc.publication.issue | 9 | es |
dc.publication.initialPage | 1960 | es |
dc.publication.endPage | 1981 | es |
dc.contributor.funder | Ministerio de Educación y Ciencia (MEC). España | es |
dc.contributor.funder | Junta de Andalucía | es |
dc.contributor.funder | Ministerio de Ciencia e Innovación (MICIN). España | es |
dc.contributor.funder | Ministerio de Economía, Industria y Competitividad | es |