
Conference paper

dc.creator: Ortega Rodríguez, Francisco Javier [es]
dc.creator: MacDonald, Craig [es]
dc.creator: Troyano Jiménez, José Antonio [es]
dc.creator: Cruz Mata, Fermín [es]
dc.date.accessioned: 2020-08-05T09:22:03Z
dc.date.available: 2020-08-05T09:22:03Z
dc.date.issued: 2010
dc.identifier.citation: Ortega Rodríguez, F.J., MacDonald, C., Troyano Jiménez, J.A. and Cruz Mata, F. (2010). Spam detection with a content-based random-walk algorithm. In SMUC 2010: 2nd international workshop on Search and mining user-generated contents (45-52), Toronto, ON, Canada: ACM Digital Library.
dc.identifier.isbn: 978-1-4503-0386-6 [es]
dc.identifier.uri: https://hdl.handle.net/11441/100111
dc.description.abstract: In this work we tackle the problem of spam detection on the Web. Spam web pages have become a problem for Web search engines because of the negative effects this phenomenon can have on their retrieval results. Our approach is based on a random-walk algorithm that obtains a ranking of pages according to their relevance and their spam likelihood. We introduce the novelty of taking into account the content of the web pages to characterize the web graph and to obtain an a priori estimation of the spam likelihood of the web pages. Our graph-based algorithm computes two scores for each node in the graph. Intuitively, these values represent how bad or good (spam-like or not) a web page is, according to its textual content and the relations in the graph. Our experiments show that our proposed technique outperforms other link-based techniques for spam detection. [es]
dc.description.sponsorship: Ministerio de Educación y Ciencia HUM2007-66607-C04-04 [es]
dc.format: application/pdf [es]
dc.format.extent: 7 [es]
dc.language.iso: eng [es]
dc.publisher: ACM Digital Library [es]
dc.relation.ispartof: SMUC 2010: 2nd international workshop on Search and mining user-generated contents (2010), p 45-52
dc.rights: Attribution-NonCommercial-NoDerivatives 4.0 International [*]
dc.rights.uri: http://creativecommons.org/licenses/by-nc-nd/4.0/ [*]
dc.subject: Information Retrieval [es]
dc.subject: Web spam detection [es]
dc.subject: Graph algorithms [es]
dc.subject: PageRank [es]
dc.subject: Web search [es]
dc.title: Spam detection with a content-based random-walk algorithm [es]
dc.type: info:eu-repo/semantics/conferenceObject [es]
dcterms.identifier: https://ror.org/03yxnpp24
dc.type.version: info:eu-repo/semantics/submittedVersion [es]
dc.rights.accessRights: info:eu-repo/semantics/openAccess [es]
dc.contributor.affiliation: Universidad de Sevilla. Departamento de Lenguajes y Sistemas Informáticos [es]
dc.relation.projectID: HUM2007-66607-C04-04 [es]
dc.relation.publisherversion: https://dl.acm.org/doi/10.1145/1871985.1871994 [es]
dc.identifier.doi: 10.1145/1871985.1871994 [es]
dc.publication.initialPage: 45 [es]
dc.publication.endPage: 52 [es]
dc.eventtitle: SMUC 2010: 2nd international workshop on Search and mining user-generated contents [es]
dc.eventinstitution: Toronto, ON, Canada [es]
dc.relation.publicationplace: New York, USA [es]
dc.contributor.funder: Ministerio de Educación y Ciencia (MEC). España [es]

File: Spam detection with a content- ...
Size: 496.0Kb
Format: PDF



Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivatives 4.0 International