dc.creator | Hernández Salmerón, Inmaculada Concepción | es |
dc.creator | Sleiman, Hassan A. | es |
dc.creator | Ruiz Cortés, David | es |
dc.creator | Corchuelo Gil, Rafael | es |
dc.date.accessioned | 2021-02-16T09:42:04Z | |
dc.date.available | 2021-02-16T09:42:04Z | |
dc.date.issued | 2011 | |
dc.identifier.citation | Hernández Salmerón, I.C., Sleiman, H.A., Ruiz Cortés, D. and Corchuelo Gil, R. (2011). A Tool for Web Links Prototyping. In ICAI 2011: International Conference on Artificial Intelligence. Las Vegas, Nevada, USA: CSREA Press. | |
dc.identifier.isbn | 9781601321831 | es |
dc.identifier.isbn | 9781601321848 | es |
dc.identifier.uri | https://hdl.handle.net/11441/105016 | |
dc.description.abstract | Crawlers for Virtual Integration processes must be
efficient, given that the VI process is online, which means that while
the system is looking for the required information, the user
is waiting for a response. Therefore, downloading a minimum
number of irrelevant pages is mandatory in order to improve
crawler efficiency. Most crawlers need to download a page
in order to determine its relevance, which results in a high
number of irrelevant pages downloaded. We propose a tool
that builds a set of prototype links for a given site, where
each prototype represents links leading to pages containing a
certain concept. These prototypes can then be used to classify
pages before downloading them, just by analysing their URL.
Therefore, they are the support for crawlers to navigate through
sites downloading a minimum number of irrelevant pages while
reducing bandwidth, making them suitable for VI systems. | es |
dc.description.sponsorship | Ministerio de Educación y Ciencia TIN2007-64119 | es |
dc.description.sponsorship | Junta de Andalucía P07-TIC-2602 | es |
dc.description.sponsorship | Junta de Andalucía P08-TIC-4100 | es |
dc.description.sponsorship | Ministerio de Ciencia e Innovación TIN2008-04718-E | es |
dc.description.sponsorship | Ministerio de Ciencia e Innovación TIN2010-21744 | es |
dc.description.sponsorship | Ministerio de Ciencia e Innovación TIN2010-09809-E | es |
dc.description.sponsorship | Ministerio de Ciencia e Innovación TIN2010-10811-E | es |
dc.description.sponsorship | Ministerio de Ciencia e Innovación TIN2010-09988-E | es |
dc.format | application/pdf | es |
dc.format.extent | 7 | es |
dc.language.iso | eng | es |
dc.publisher | CSREA Press | es |
dc.relation.ispartof | ICAI 2011: International Conference on Artificial Intelligence (2011). | |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.subject | Web Crawling | es |
dc.subject | Web Page Classification | es |
dc.subject | Virtual Integration | es |
dc.subject | Prototype-based Classification | es |
dc.title | A Tool for Web Links Prototyping | es |
dc.type | info:eu-repo/semantics/conferenceObject | es |
dcterms.identifier | https://ror.org/03yxnpp24 | |
dc.type.version | info:eu-repo/semantics/publishedVersion | es |
dc.rights.accessRights | info:eu-repo/semantics/openAccess | es |
dc.contributor.affiliation | Universidad de Sevilla. Departamento de Lenguajes y Sistemas Informáticos | es |
dc.relation.projectID | TIN2007-64119 | es |
dc.relation.projectID | P07-TIC-2602 | es |
dc.relation.projectID | P08-TIC-4100 | es |
dc.relation.projectID | TIN2008-04718-E | es |
dc.relation.projectID | TIN2010-21744 | es |
dc.relation.projectID | TIN2010-09809-E | es |
dc.relation.projectID | TIN2010-10811-E | es |
dc.relation.projectID | TIN2010-09988-E | es |
dc.eventtitle | ICAI 2011: International Conference on Artificial Intelligence | es |
dc.eventinstitution | Las Vegas, Nevada, USA | es |
dc.relation.publicationplace | Las Vegas, Nevada, USA | es |
dc.contributor.funder | Ministerio de Educación y Ciencia (MEC). España | es |
dc.contributor.funder | Junta de Andalucía | es |
dc.contributor.funder | Ministerio de Ciencia e Innovación (MICIN). España | es |