학술논문

Home

자료검색

학술논문

검색결과 돌아가기

검색화면

내보내기 프린트

RDFINT: A Benchmark for Comparing Data Warehouse with Virtual Integration Approaches for Integration of RDF Data

Resource Type: Conference
Authors: Oni, Samson; Pansare, Kajal; Arneja, Sukrit Singh; Chen, Zhiyuan; Crainiceanu, Adina; Needham, Don
Source: 2020 IEEE International Conference on Big Data (Big Data) Big Data (Big Data), 2020 IEEE International Conference on. :2820-2826 Dec, 2020
Subject: Communication, Networking and Broadcast Technologies
Computing and Processing
Engineering Profession
Geoscience
Signal Processing and Analysis
Measurement
Data integration
Benchmark testing
Data warehouses
Big Data
Resource description framework
Generators
Language

Online Access

Full Text (IEEE)

초록

Users often need to integrate large amounts of RDF data from multiple sources. Although there has been a lot of work on data cleaning and integration for structured data, relatively little work has been done for RDF data. We consider two different approaches to data integration: 1) a traditional data warehouse approach where relevant RDF data is extracted from different sources and then integrated in a data warehouse; 2) a virtual integration approach where RDF data still resides at each source and data integration happens when the data is queried, through a mediator that coordinates with wrappers at each source. It is often unclear how to choose the appropriate approach given an application scenario. This paper proposes RDFINT, a benchmark to compare these two approaches for integrating RDF data. We describe typical data integration operations, metrics that can be used to compare these two approaches, and factors that affect these metrics. We also report preliminary results of an implementation of these two approaches using the Apache Jena Fuseki framework.

공지

DAU Library

학술논문

요약정보

RDFINT: A Benchmark for Comparing Data Warehouse with Virtual Integration Approaches for Integration of RDF Data

Online Access

초록