학술논문

Home

자료검색

학술논문

검색결과 돌아가기

검색화면

내보내기 프린트

LWSDP: Locality-Aware Warp Scheduling and Dynamic Data Prefetching Co-design in the Per-SM Private Cache of GPGPUs

Resource Type: Conference
Authors: Wang, Wangguang; Wang, Mingyu; Zhang, Yicong; Wei, Yukun; Yu, Zhiyi
Source: 2023 IEEE 29th International Conference on Parallel and Distributed Systems (ICPADS) ICPADS Parallel and Distributed Systems (ICPADS), 2023 IEEE 29th International Conference on. :1230-1237 Dec, 2023
Subject: Communication, Networking and Broadcast Technologies
Computing and Processing
Processor scheduling
Prefetching
Graphics processing units
Distributed databases
Switches
Parallel processing
Dynamic scheduling
GPGPUs
Data Locality
Warp Scheduler
Dynamic Prefetching
Load/Store Unit
Language
ISSN: 2690-5965

Online Access

Full Text (IEEE)

초록

General Purpose Graphics Processing Units (GPG-PUs) employ frequent context switching to mask the long-latency of memory operations. However, GPGPUs still suffer from stagnation due to the incomplete overlapping of memory operations. To alleviate this stagnation and enhance Memory-Level Parallelism (MLP), it is crucial to overlap and minimize memory operations. This paper conducts a comprehensive analysis of data locality in GPGPUs and proposes an approach called Locality-Aware Warp Scheduling and Dynamic Data Prefetching (LWSDP) Co-design in the Per-SM Private Cache of GPGPUs, which effectively utilizes data locality to improve MLP. In addition to employing a coordinated scheduler and dynamic data prefetching, we incorporate Prefetching Requests Admitted Cache Access Re-execution (PRA-CAR) to mitigate the adverse impact of excessive prefetching memory requests on memory saturation. Experimental results demonstrate that LWSDP achieves an average 33.02% performance improvement and an average 28.16% miss rate reduction compared to the previous schedulers on data locality-sensitive kernels.

공지

DAU Library

학술논문

요약정보

LWSDP: Locality-Aware Warp Scheduling and Dynamic Data Prefetching Co-design in the Per-SM Private Cache of GPGPUs

Online Access

초록