Ponencia
ITALICA at PAN 2013: An Ensemble Learning Approach to Author Profiling Notebook for PAN at CLEF 2013
Autor/es | Cruz Mata, Fermín
Haro R. Rafa Ortega Rodríguez, Francisco Javier |
Departamento | Universidad de Sevilla. Departamento de Lenguajes y Sistemas Informáticos |
Fecha de publicación | 2013 |
Fecha de depósito | 2022-03-10 |
Publicado en |
|
ISBN/ISSN | 1613-0073 |
Resumen | This notebook discusses the approach to the Author Profiling task developed by the Italica group for PAN 2013. This system implements two different
sets of classifiers which are combined later in order to build a final ... This notebook discusses the approach to the Author Profiling task developed by the Italica group for PAN 2013. This system implements two different sets of classifiers which are combined later in order to build a final classifier that takes into account the decisions of the previous ones. The initial classifiers are focused on vector space representations of the documents as a bag of words and n-grams of POS tags and also on a set of stylistic features of the texts. The final classifier consists of a stacking schema that combines the other ones. This approach has obtained better results for the Spanish dataset than for the English dataset, probably due to the use of more detailed POS tagset in the former |
Cita | Cruz Mata, F., Haro R. Rafa, y Ortega Rodríguez, F.J. (2013). ITALICA at PAN 2013: An Ensemble Learning Approach to Author Profiling Notebook for PAN at CLEF 2013. En CLEF 2013 Working Notes Valencia, España: CEUR Workshop Proceedings (CEUR-WS.org). |
Ficheros | Tamaño | Formato | Ver | Descripción |
---|---|---|---|---|
ITALICA at PAN 2013 an ensemble ... | 99.57Kb | [PDF] | Ver/ | |