Ponencia
AYNEC-DataGen: a tool for generating evaluation datasets for Knowledge Graphs completion
Autor/es | Ayala Hernández, Daniel
Borrego Díaz, Agustín Hernández Salmerón, Inmaculada Concepción Rivero, Carlos R. Ruiz Cortés, David |
Departamento | Universidad de Sevilla. Departamento de Lenguajes y Sistemas Informáticos |
Fecha de publicación | 2019 |
Fecha de depósito | 2021-02-16 |
Publicado en |
|
Resumen | In the context of knowledge graphs, the task of completion of
relations consists in adding missing triples to a knowledge graph, usually
by classifying potential candidates as true of false. Creating an evalu-
ation ... In the context of knowledge graphs, the task of completion of relations consists in adding missing triples to a knowledge graph, usually by classifying potential candidates as true of false. Creating an evalu- ation dataset for these techniques is not trivial, since there is a large amount of variables to consider which, if not taken into account, may cause misleading results. So far, there is not a well de ned work ow that identi es the variation points when creating a dataset, and what are the possible strategies that can be followed in each step. Furthermore, there are no tools that help create such datasets in an easy way. To address this need, we have created AYNEC-DataGen, a customisable tool for the generation of datasets with multiple variation points related to the pre- processing of the original knowledge graph, the splitting of triples into training and testing sets, and the generation of negative examples. The output of our tool includes the evaluation dataset, an optional export in an open format for its visualisation, and additional files with metadata. Our tool is freely available online. |
Agencias financiadoras | Ministerio de Economía y Competitividad (MINECO). España |
Identificador del proyecto | TIN2016-75394-R |
Cita | Ayala Hernández, D., Borrego Díaz, A., Hernández Salmerón, I.C., Rivero, C.R. y Ruiz Cortés, D. (2019). AYNEC-DataGen: a tool for generating evaluation datasets for Knowledge Graphs completion. En JISBD 2019: XXIV Jornadas de Ingeniería del Software y Bases de Datos Cáceres, España: SISTEDES: Ingeniería de Software y las Tecnologías de Desarrollo de Software. |
Ficheros | Tamaño | Formato | Ver | Descripción |
---|---|---|---|---|
AYNEC DataGen a tool for generating ... | 121.8Kb | [PDF] | Ver/ | |