eArticles

Home

eArticles

검색결과 돌아가기

검색화면

Export 프린트

In-Memory Join Algorithms on GPUs for Large-Data

Resource Type: Conference
Authors: Guo, Chengxin; Chen, Hong
Source: 2019 IEEE 21st International Conference on High Performance Computing and Communications; IEEE 17th International Conference on Smart City; IEEE 5th International Conference on Data Science and Systems (HPCC/SmartCity/DSS) High Performance Computing and Communications; IEEE 17th International Conference on Smart City; IEEE 5th International Conference on Data Science and Systems (HPCC/SmartCity/DSS), 2019 IEEE 21st International Conference on. :1060-1067 Aug, 2019
Subject: Communication, Networking and Broadcast Technologies
Computing and Processing
Graphics processing units
Instruction sets
Partitioning algorithms
Memory management
Data communication
Pipelines
Hash Join, Sort-Merge Join, Graphics Processing Unit, Large Dataset, Database
Language

Online Access

Full Text (IEEE)

초록

In traditional databases, join is one of the most computationally expensive operations in query processing. During the past years, GPU has been adopted to improve the performance of join processing because of the features of massive parallelism and high memory bandwidth. Limited by the capacity of GPU memory and the absence of virtual memory management, however, handling the relations that exceed the capacity of the GPU memory is a challenge for GPU-based join algorithms. Because of the high computing throughput provided by GPUs and the low bandwidth of data communication between the CPUs and the GPUs, data have to be partitioned to fit the features of GPUs and to reduce the cost of data transmission. Furthermore, a series of novel techniques have been developed on the GPUs, which can benefit the join algorithms. In this work, we focus on the optimizing of processing join operator on large relations and propose the designs of in-memory hash join and sort-merge join on GPUs. We present the data partition method on the GPUs implemented with a pipeline mechanism. Furthermore, the shuffle instructions and the CUDA streams are applied in our algorithms to best utilize the GPUs. Experimental results indicate that our hash join algorithm delivers up to 1.51X and 1.24X speedup over the state-of-the-art hash join algorithm on CPUs on NVIDIA GTX1080ti-Pascal GPU and TitanV-Volta GPU respectively. For sort-merge join, our algorithm achieves up to 3.52X and 2.21X improvements on the same GPUs respectively compared to the baselines on CPUs.

공지

DAU Library

eArticles

요약정보

In-Memory Join Algorithms on GPUs for Large-Data

Online Access

초록