Ponencia
Benchmarking the Performance of Linked Data Translation Systems
Autor/es | Rivero, Carlos R.
Schultz, Andreas Bizer, Christian Ruiz Cortés, David |
Departamento | Universidad de Sevilla. Departamento de Lenguajes y Sistemas Informáticos |
Fecha de publicación | 2012 |
Fecha de depósito | 2017-11-20 |
Publicado en |
|
ISBN/ISSN | 1613-0073 |
Resumen | Linked Data sources on the Web use a wide range of different
vocabularies to represent data describing the same type
of entity. For some types of entities, like people or bibliographic
record, common vocabularies have ... Linked Data sources on the Web use a wide range of different vocabularies to represent data describing the same type of entity. For some types of entities, like people or bibliographic record, common vocabularies have emerged that are used by multiple data sources. But even for representing data of these common types, different user communities use different competing common vocabularies. Linked Data applications that want to understand as much data from the Web as possible, thus need to overcome vocabulary heterogeneity and translate the original data into a single target vocabulary. To support application developers with this integration task, several Linked Data translation systems have been developed. These systems provide languages to express declarative mappings that are used to translate heterogeneous Web data into a single target vocabulary. In this paper, we present a benchmark for comparing the expressivity as well as the runtime performance of data translation systems. Based on a set of examples from the LOD Cloud, we developed a catalog of fifteen data translation patterns and survey how often these patterns occur in the example set. Based on these statistics, we designed the LODIB (Linked Open Data Integration Benchmark) that aims to reflect the real-world heterogeneities that exist on the Web of Data. We apply the benchmark to test the performance of two data translation systems, Mosto and LDIF, and compare the performance of the systems with the SPARQL 1.1 CONSTRUCT query performance of the Jena TDB RDF store. |
Identificador del proyecto | P07-TIC-2602
P08- TIC-4100 TIN2008-04718-E TIN2010-21744 TIN2010-10811-E TIN2010-09988-E FP7-256975 (LATC) |
Cita | Rivero, C.R., Schultz, A., Bizer, C. y Ruiz Cortés, D. (2012). Benchmarking the Performance of Linked Data Translation Systems. En LDOW 2012: WWW2012 Workshop on Linked Data on the Web Lyon, France: CEUR-WS. |
Ficheros | Tamaño | Formato | Ver | Descripción |
---|---|---|---|---|
Benchmarking the Performance.pdf | 123.2Kb | [PDF] | Ver/ | |