Mostrar el registro sencillo del ítem

Artículo

dc.creatorOrtega Rodríguez, Francisco Javieres
dc.creatorTroyano Jiménez, José Antonioes
dc.creatorCruz Mata, Fermínes
dc.creatorGarcía Vallejo, Carlos Antonioes
dc.date.accessioned2022-03-11T08:04:56Z
dc.date.available2022-03-11T08:04:56Z
dc.date.issued2012
dc.identifier.citationOrtega Rodríguez, F.J., Troyano Jiménez, J.A., Cruz Mata, F. y García Vallejo, C.A. (2012). PolaritySpam: Propagating Content-based Information Through a Web-Graph to Detect Web Spam. International Journal of Innovative Computing, Information and Control, 8 (4), 2915-2928.
dc.identifier.issn1349-4198es
dc.identifier.urihttps://hdl.handle.net/11441/130681
dc.description.abstractSpam web pages have become a problem for Information Retrieval systems due to the negative effects that this phenomenon can cause in their results. In this work we tackle the problem of detecting these pages with a propagation algorithm that, taking as input a web graph, chooses a set of spam and not-spam web pages in order to spread their spam likelihood over the rest of the network. Thus we take advantage of the links between pages to obtain a ranking of pages according to their relevance and their spam likelihood. Our intuition consists in giving a high reputation to those pages related to relevant ones, and giving a high spam likelihood to the pages linked to spam web pages. We introduce the novelty of including the content of the web pages in the computation of an a priori estimation of the spam likelihood of the pages, and propagate this information. Our graph-based algorithm computes two scores for each node in the graph. Intuitively, these values represent how bad or good (spam-like or not) is a web page, according to its textual content and its relations in the graph. The experimental results show that our method outperforms other techniques for spam detectiones
dc.description.sponsorshipMinisterio de Educación y Ciencia HUM2007-66607-C04-04es
dc.formatapplication/pdfes
dc.format.extent14es
dc.language.isoenges
dc.publisherICIC Internationales
dc.relation.ispartofInternational Journal of Innovative Computing, Information and Control, 8 (4), 2915-2928.
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internacional*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subjectInformation retrievales
dc.subjectWeb spam detectiones
dc.subjectGraph algorithmses
dc.subjectPageRankes
dc.subjectWeb searches
dc.titlePolaritySpam: Propagating Content-based Information Through a Web-Graph to Detect Web Spames
dc.typeinfo:eu-repo/semantics/articlees
dc.type.versioninfo:eu-repo/semantics/submittedVersiones
dc.rights.accessRightsinfo:eu-repo/semantics/openAccesses
dc.contributor.affiliationUniversidad de Sevilla. Departamento de Lenguajes y Sistemas Informáticoses
dc.relation.projectIDHUM2007-66607-C04-04es
dc.relation.publisherversionhttp://www.ijicic.org/contents.htmes
dc.contributor.groupUniversidad de Sevilla. TIC134: Sistemas Informáticoses
dc.journaltitleInternational Journal of Innovative Computing, Information and Controles
dc.publication.volumen8es
dc.publication.issue4es
dc.publication.initialPage2915es
dc.publication.endPage2928es
dc.identifier.sisius20031169es
dc.contributor.funderMinisterio de Educación y Ciencia (MEC). Españaes

FicherosTamañoFormatoVerDescripción
POLARITYSPAM PROPAGATING CONTE ...262.3KbIcon   [PDF] Ver/Abrir  

Este registro aparece en las siguientes colecciones

Mostrar el registro sencillo del ítem

Attribution-NonCommercial-NoDerivatives 4.0 Internacional
Excepto si se señala otra cosa, la licencia del ítem se describe como: Attribution-NonCommercial-NoDerivatives 4.0 Internacional