Mostrar el registro sencillo del ítem

Artículo

dc.creatorTroyano Jiménez, José Antonioes
dc.creatorEnríquez de Salamanca Ros, Fernandoes
dc.creatorCruz Mata, Fermínes
dc.creatorCañete Valdeón, José Migueles
dc.creatorOrtega Rodríguez, Francisco Javieres
dc.date.accessioned2020-08-03T08:06:38Z
dc.date.available2020-08-03T08:06:38Z
dc.date.issued2007
dc.identifier.citationTroyano Jiménez, J.A., Enríquez de Salamanca Ros, F., Cruz Mata, F., Cañete Valdeón, J.M. y Ortega Rodríguez, F.J. (2007). Improving the Performance of a Tagger Generator in an Information Extraction Application. Journal of Universal Computer Science, 13 (9), 1287-1299.
dc.identifier.issn0948-695Xes
dc.identifier.urihttps://hdl.handle.net/11441/100068
dc.description.abstractIn this paper we present an experience in the extraction of named entities from Spanish texts using stacking. Named Entity Extraction (NEE) is a subtask of Information Extraction that involves the identification of groups of words that make up the name of an entity, and the classification of these names into a set of predefined categories. Our approach is corpus-based, we use a re-trainable tagger generator to obtain a named entity extractor from a set of tagged examples. The main contribution of our work is that we obtain the systems needed in a stacking scheme without making use of any additional training material or tagger generators. Instead of it, we have generated the variability needed in stacking by applying corpus transformation to the original training corpus. Once we have several versions of the training corpus we generate several extractors and combine them by means of a machine learning algorithm. Experiments show that the combination of corpus transformation and stacking improve the performance of the tagger generator in this kind of natural language processing applications. The best of our experiments achieves an improvement of more than six percentual points respect to the predefined baseline.es
dc.formatapplication/pdfes
dc.format.extent13es
dc.language.isoenges
dc.publisherGraz University of Technology, Institut für Informations systeme und Computer Medien (IICM)es
dc.relation.ispartofJournal of Universal Computer Science, 13 (9), 1287-1299.
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internacional*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subjectNamed Entity Extractiones
dc.subjectCorpus Transformationes
dc.subjectSystem Combinationes
dc.subjectStackinges
dc.titleImproving the Performance of a Tagger Generator in an Information Extraction Applicationes
dc.typeinfo:eu-repo/semantics/articlees
dcterms.identifierhttps://ror.org/03yxnpp24
dc.type.versioninfo:eu-repo/semantics/publishedVersiones
dc.rights.accessRightsinfo:eu-repo/semantics/openAccesses
dc.contributor.affiliationUniversidad de Sevilla. Departamento de Lenguajes y Sistemas Informáticoses
dc.relation.publisherversionhttp://www.jucs.org/jucs_13_9/improving_the_performance_ofes
dc.journaltitleJournal of Universal Computer Sciencees
dc.publication.volumen13es
dc.publication.issue9es
dc.publication.initialPage1287es
dc.publication.endPage1299es
dc.identifier.sisius6634177es

FicherosTamañoFormatoVerDescripción
Improving the Performance of a ...150.3KbIcon   [PDF] Ver/Abrir  

Este registro aparece en las siguientes colecciones

Mostrar el registro sencillo del ítem

Attribution-NonCommercial-NoDerivatives 4.0 Internacional
Excepto si se señala otra cosa, la licencia del ítem se describe como: Attribution-NonCommercial-NoDerivatives 4.0 Internacional