Mostrar el registro sencillo del ítem

Artículo

dc.creatorHernández Salmerón, Inmaculada Concepciónes
dc.creatorRivero, Carlos R.es
dc.creatorRuiz Cortés, Davides
dc.creatorCorchuelo Gil, Rafaeles
dc.date.accessioned2017-11-20T11:11:00Z
dc.date.available2017-11-20T11:11:00Z
dc.date.issued2016
dc.identifier.citationHernández Salmerón, I.C., Rivero, C.R., Ruiz Cortés, D. y Corchuelo Gil, R. (2016). CALA: Classifying Links Automatically based on their URL. Journal of Systems and Software, 115 (may 2016), 130-143.
dc.identifier.issn0164-1212es
dc.identifier.urihttp://hdl.handle.net/11441/66257
dc.description.abstractWeb page classification refers to the problem of automatically assigning a web page to one or moreclasses after analysing its features. Automated web page classifiers have many applications, and many re- searchers have proposed techniques and tools to perform web page classification. Unfortunately, the ex- isting tools have a number of drawbacks that makes them unappealing for real-world scenarios, namely:they require a previous extensive crawling, they are supervised, they need to download a page beforeclassifying it, or they are site-, language-, or domain-dependent. In this article, we propose CALA, a toolfor URL-based web page classification. The strongest features of our tool are that it does not require aprevious extensive crawling to achieve good classification results, it is unsupervised, it is based exclu- sively on URL features, which means that pages can be classified without downloading them, and it issite-, language-, and domain-independent, which makes it generally applicable. We have validated ourtool with 22 real-world web sites from multiple domains and languages, and our conclusion is that CALAis very effective and efficient in practice.es
dc.description.sponsorshipMinisterio de Educación y Ciencia TIN2007-64119es
dc.description.sponsorshipJunta de Andalucía P07-TIC-2602es
dc.description.sponsorshipJunta de Andalucía P08-TIC-4100es
dc.description.sponsorshipMinisterio de Ciencia e Innovación TIN2008-04718-Ees
dc.description.sponsorshipMinisterio de Ciencia e Innovación TIN2010-21744es
dc.description.sponsorshipMinisterio de Ciencia e Innovación TIN2010-09809-Ees
dc.description.sponsorshipMinisterio de Ciencia e Innovación TIN2010-10811-Ees
dc.description.sponsorshipMinisterio de Ciencia e Innovación TIN2010-09988-Ees
dc.description.sponsorshipMinisterio de Economía y Competitividad TIN2011-15497-Ees
dc.description.sponsorshipMinisterio de Economía y Competitividad TIN2013-40848-Res
dc.formatapplication/pdfes
dc.language.isoenges
dc.publisherElsevieres
dc.relation.ispartofJournal of Systems and Software, 115 (may 2016), 130-143.
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internacional*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subjectWeb Page Classificationes
dc.subjectURL Patternses
dc.titleCALA: Classifying Links Automatically based on their URLes
dc.typeinfo:eu-repo/semantics/articlees
dcterms.identifierhttps://ror.org/03yxnpp24
dc.type.versioninfo:eu-repo/semantics/submittedVersiones
dc.rights.accessRightsinfo:eu-repo/semantics/openAccesses
dc.contributor.affiliationUniversidad de Sevilla. Departamento de Lenguajes y Sistemas Informáticoses
dc.relation.projectIDTIN2007-64119es
dc.relation.projectIDP07-TIC-2602es
dc.relation.projectIDP08-TIC-4100es
dc.relation.projectIDTIN2008-04718-Ees
dc.relation.projectIDTIN2010-21744es
dc.relation.projectIDTIN2010-09809-Ees
dc.relation.projectIDTIN2010-10811-Ees
dc.relation.projectIDTIN2010-09988-Ees
dc.relation.projectIDTIN2011-15497-Ees
dc.relation.projectIDTIN2013-40848-Res
dc.relation.publisherversionhttp://www.sciencedirect.com/science/article/pii/S016412121600042Xes
dc.identifier.doi10.1016/j.jss.2016.02.006es
dc.contributor.groupUniversidad de Sevilla. TIC134: Sistemas Informáticoses
idus.format.extent14es
dc.journaltitleJournal of Systems and Softwarees
dc.publication.volumen115es
dc.publication.issuemay 2016es
dc.publication.initialPage130es
dc.publication.endPage143es
dc.identifier.sisius20926048es

FicherosTamañoFormatoVerDescripción
CALA ClAssifying.pdf3.610MbIcon   [PDF] Ver/Abrir  

Este registro aparece en las siguientes colecciones

Mostrar el registro sencillo del ítem

Attribution-NonCommercial-NoDerivatives 4.0 Internacional
Excepto si se señala otra cosa, la licencia del ítem se describe como: Attribution-NonCommercial-NoDerivatives 4.0 Internacional