Conference paper

dc.creator | Hernández Salmerón, Inmaculada Concepción | es
dc.creator | Sleiman, Hassan A. | es
dc.creator | Ruiz Cortés, David | es
dc.creator | Corchuelo Gil, Rafael | es
dc.date.accessioned | 2017-11-09T10:22:31Z
dc.date.available | 2017-11-09T10:22:31Z
dc.date.issued | 2011
dc.identifier.citation | Hernández Salmerón, I.C., Sleiman, H.A., Ruiz Cortés, D. and Corchuelo Gil, R. (2011). A Conceptual Framework for Efficient Web Crawling in Virtual Integration Contexts. In WISM 2011: International Conference on Web Information Systems and Mining (282-291), Taiyuan, China: Springer.
dc.identifier.isbn | 978-3-642-23981-6 | es
dc.identifier.issn | 0302-9743 | es
dc.identifier.uri | http://hdl.handle.net/11441/65832
dc.description.abstract | Virtual Integration systems require a crawling tool able to navigate and reach relevant pages in the Web in an efficient way. Existing proposals in the crawling area are aware of the efficiency problem, but still most of them need to download pages in order to classify them as relevant or not. In this paper, we present a conceptual framework for designing crawlers supported by a web page classifier that relies solely on URLs to determine page relevance. Such a crawler is able to choose in each step only the URLs that lead to relevant pages, and therefore reduces the number of unnecessary pages downloaded, optimising bandwidth and making it efficient and suitable for virtual integration systems. Our preliminary experiments show that such a classifier is able to distinguish between links leading to different kinds of pages, without previous intervention from the user. | es
dc.description.sponsorship | Ministerio de Educación y Ciencia TIN2007-64119 | es
dc.description.sponsorship | Junta de Andalucía P07-TIC-2602 | es
dc.description.sponsorship | Junta de Andalucía P08-TIC-4100 | es
dc.description.sponsorship | Ministerio de Ciencia e Innovación TIN2008-04718-E | es
dc.description.sponsorship | Ministerio de Ciencia e Innovación TIN2010-21744 | es
dc.description.sponsorship | Ministerio de Economía, Industria y Competitividad TIN2010-09809-E | es
dc.description.sponsorship | Ministerio de Ciencia e Innovación TIN2010-10811-E | es
dc.description.sponsorship | Ministerio de Ciencia e Innovación TIN2010-09988-E | es
dc.format | application/pdf | es
dc.language.iso | eng | es
dc.publisher | Springer | es
dc.relation.ispartof | WISM 2011: International Conference on Web Information Systems and Mining (2011), p 282-291
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | *
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | *
dc.subject | Crawlers | es
dc.subject | Web Navigation | es
dc.subject | Virtual Integration | es
dc.title | A Conceptual Framework for Efficient Web Crawling in Virtual Integration Contexts | es
dc.type | info:eu-repo/semantics/conferenceObject | es
dcterms.identifier | https://ror.org/03yxnpp24
dc.type.version | info:eu-repo/semantics/submittedVersion | es
dc.rights.accessRights | info:eu-repo/semantics/openAccess | es
dc.contributor.affiliation | Universidad de Sevilla. Departamento de Lenguajes y Sistemas Informáticos | es
dc.relation.projectID | TIN2007-64119 | es
dc.relation.projectID | P07-TIC-2602 | es
dc.relation.projectID | P08-TIC-4100 | es
dc.relation.projectID | TIN2008-04718-E | es
dc.relation.projectID | TIN2010-21744 | es
dc.relation.projectID | TIN2010-09809-E | es
dc.relation.projectID | TIN2010-10811-E | es
dc.relation.projectID | TIN2010-09988-E | es
dc.relation.publisherversion | https://link.springer.com/chapter/10.1007/978-3-642-23982-3_35 | es
dc.identifier.doi | 10.1007/978-3-642-23982-3_35 | es
idus.format.extent | 10 | es
dc.publication.initialPage | 282 | es
dc.publication.endPage | 291 | es
dc.eventtitle | WISM 2011: International Conference on Web Information Systems and Mining | es
dc.eventinstitution | Taiyuan, China | es
dc.relation.publicationplace | Berlin | es

Files | Size | Format
A Conceptual Framework.pdf | 211.6Kb | PDF

Except where otherwise noted, this item's license is described as: Attribution-NonCommercial-NoDerivatives 4.0 International