Ponencia
Improving the Performance of a Named Entity Extractor by Applying a Stacking Scheme
Autor/es | Troyano Jiménez, José Antonio
Díaz Madrigal, Víctor Jesús Enríquez de Salamanca Ros, Fernando Romero Moreno, Luisa María |
Departamento | Universidad de Sevilla. Departamento de Lenguajes y Sistemas Informáticos |
Fecha de publicación | 2004 |
Fecha de depósito | 2020-08-03 |
Publicado en |
|
ISBN/ISSN | 978-3-540-23806-5 0302-9743 |
Resumen | In this paper we investigate the way of improving the performance
of a Named Entity Extraction (NEE) system by applying machine
learning techniques and corpus transformation. The main resources used
in our experiments ... In this paper we investigate the way of improving the performance of a Named Entity Extraction (NEE) system by applying machine learning techniques and corpus transformation. The main resources used in our experiments are the publicly available tagger TnT and a corpus of Spanish texts in which named entities occurrences are tagged with BIO tags. We split the NEE task into two subtasks 1) Named Entity Recognition (NER) that involves the identification of the group of words that make up the name of an entity and 2) Named Entity Classification (NEC) that determines the category of a named entity. We have focused our work on the improvement of the NER task, generating four different taggers with the same training corpus and combining them using a stacking scheme. We improve the baseline of the NER task (Fβ=1 value of 81.84) up to a value of 88.37. When a NEC module is added to the NER system the performance of the whole NEE task is also improved. A value of 70.47 is achieved from a baseline of 66.07. |
Cita | Troyano Jiménez, J.A., Díaz Madrigal, V.J., Enríquez de Salamanca Ros, F. y Romero Moreno, L.M. (2004). Improving the Performance of a Named Entity Extractor by Applying a Stacking Scheme. En IBERAMIA 2004: 9th Ibero-American Conference on Artificial Intelligence (295-304), Puebla, México: Springer. |
Ficheros | Tamaño | Formato | Ver | Descripción |
---|---|---|---|---|
Improving the Performance of a ... | 128.1Kb | [PDF] | Ver/ | |