Mostrar el registro sencillo del ítem

Artículo

dc.creatorHernández Salmerón, Inmaculada Concepciónes
dc.creatorRivero, Carlos R.es
dc.creatorRuiz Cortés, Davides
dc.date.accessioned2021-02-16T12:21:52Z
dc.date.available2021-02-16T12:21:52Z
dc.date.issued2019
dc.identifier.citationHernández Salmerón, I.C., Rivero, C.R. y Ruiz Cortés, D. (2019). Deep Web crawling: a survey. World Wide Web, 22, 1577-1610.
dc.identifier.issn1386-145Xes
dc.identifier.urihttps://hdl.handle.net/11441/105031
dc.description.abstractDeep Web crawling refers to the problem of traversing the collection of pages in a deep Web site, which are dynamically generated in response to a particular query that is submitted using a search form. To achieve this, crawlers need to be endowed with some features that go beyond merely following links, such as the ability to automatically discover search forms that are entry points to the deep Web, fill in such forms, and follow certain paths to reach the deep Web pages with relevant information. Current surveys that analyse the state of the art in deep Web crawling do not provide a framework that allows comparing the most up-to-date proposals regarding all the different aspects involved in the deep Web crawling process. In this article, we propose a framework that analyses the main features of existing deep Web crawling-related techniques, including the most recent proposals, and provides an overall picture regarding deep Web crawling, including novel features that to the present day had not been analysed by previous surveys. Our main conclusion is that crawler evaluation is an immature research area due to the lack of a standard set of performance measures, or a benchmark or publicly available dataset to evaluate the crawlers. In addition, we conclude that the future work in this area should be focused on devising crawlers to deal with ever-evolving Web technologies and improving the crawling efficiency and scalability, in order to create effective crawlers that can operate in real-world contexts.es
dc.description.sponsorshipMinisterio de Economía y Competitividad TIN2016-75394-Res
dc.description.sponsorshipMinisterio de Economía y Competitividad TIN2013-40848-Res
dc.formatapplication/pdfes
dc.format.extent34es
dc.language.isoenges
dc.publisherSpringeres
dc.relation.ispartofWorld Wide Web, 22, 1577-1610.
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internacional*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subjectDeep Webes
dc.subjectWeb Crawlinges
dc.subjectForm fillinges
dc.subjectQuery selectiones
dc.subjectSurveyes
dc.titleDeep Web crawling: a surveyes
dc.typeinfo:eu-repo/semantics/articlees
dcterms.identifierhttps://ror.org/03yxnpp24
dc.type.versioninfo:eu-repo/semantics/submittedVersiones
dc.rights.accessRightsinfo:eu-repo/semantics/openAccesses
dc.contributor.affiliationUniversidad de Sevilla. Departamento de Lenguajes y Sistemas Informáticoses
dc.relation.projectIDTIN2016-75394-Res
dc.relation.projectIDTIN2013-40848-Res
dc.relation.publisherversionhttps://link.springer.com/article/10.1007/s11280-018-0602-1es
dc.identifier.doi10.1007/s11280-018-0602-1es
dc.journaltitleWorld Wide Webes
dc.publication.issue22es
dc.publication.initialPage1577es
dc.publication.endPage1610es
dc.identifier.sisius21582564es
dc.contributor.funderMinisterio de Economía y Competitividad (MINECO). Españaes
dc.contributor.funderMinisterio de Economía y Competitividad (MINECO). Españaes

FicherosTamañoFormatoVerDescripción
Deep Web crawling a survey.pdf1.890MbIcon   [PDF] Ver/Abrir  

Este registro aparece en las siguientes colecciones

Mostrar el registro sencillo del ítem

Attribution-NonCommercial-NoDerivatives 4.0 Internacional
Excepto si se señala otra cosa, la licencia del ítem se describe como: Attribution-NonCommercial-NoDerivatives 4.0 Internacional