EagleRec: Edge-Scale Recommendation System Acceleration with Inter-Stage Parallelism Optimization on GPUs
- Resource Type
- Conference
- Authors
- Yu, Yongbo; Yu, Fuxun; Sheng, Xiang; Liu, Chenchen; Chen, Xiang
- Source
- 2023 60th ACM/IEEE Design Automation Conference (DAC) Design Automation Conference (DAC), 2023 60th ACM/IEEE. :1-6 Jul, 2023
- Subject
- Components, Circuits, Devices and Systems
Computing and Processing
Engineering Profession
Performance evaluation
Industries
Design automation
Graphics processing units
Optimization methods
Parallel processing
Throughput
- Language
Recommendation systems suggest items to users by predicting their preferences based on historical data. The industry traditionally handles large-scale recommendation requests by scaling the number of devices without much concern for a single device’s performance. However, there is a trend for recommendation systems to gradually move from a centralized service to an edge device. The edge-scale recommendation systems have distinct features that are different from traditional large-scale deployments, which poses different challenges to the acceleration of the recommendation system. In this paper, we focus on the edge-scale recommendation system and propose an inter-stage parallelism optimization method deployed on a single GPU. Experiments show that our framework could improve recommendation system throughput by 1.89×~2.2× for different datasets on the GPU.