Repositorio de producción científica de la Universidad de Sevilla

An approach for discovering keywords from Spanish tweets using Wikipedia

 

Advanced Search
 
Opened Access An approach for discovering keywords from Spanish tweets using Wikipedia
Cites

Show item statistics
Icon
Export to
Author: Ayala Hernández, Daniel
Roldán Salvador, Juan Carlos
Ruiz Cortés, David
Ortega Gallego, Fernando
Department: Universidad de Sevilla. Departamento de Lenguajes y Sistemas Informáticos
Date: 2015
Published in: ADCAIJ: Advances in distributed computing and artificial intelligence journal, 4 (2), 73-87.
Document type: Article
Abstract: Most approaches to keywords discovery when analyzing microblogging messages (among them those from Twitter) are based on statistical and lexical information about the words that compose the text. The lack of context in the short messages can be problematic due to the low co-occurrence of words. In this paper, we present a new approach for keywords discovering from Spanish tweets based on the addition of context information using Wikipedia as a knowledge base. We present four different ways to use Wikipedia and two ways to rank the new keywords. We have tested these strategies using more than 60000 Spanish tweets, measuring performance and analyzing particularities of each strategy.
Cite: Ayala Hernández, D., Roldán Salvador, J.C., Ruiz Cortés, D. y Ortega Gallego, F. (2015). An approach for discovering keywords from Spanish tweets using Wikipedia. ADCAIJ: Advances in distributed computing and artificial intelligence journal, 4 (2), 73-87.
Size: 448.9Kb
Format: PDF

URI: http://hdl.handle.net/11441/49141

DOI: 10.14201/ADCAIJ2015427388

See editor´s version

This work is under a Creative Commons License: 
Attribution-NonCommercial-NoDerivatives 4.0 Internacional

This item appears in the following Collection(s)