Offloading techniques are considered one of the key enablers of deep neural network (DNN)-based artificial intelligence (AI) services on end devices with limited computing resources. However, offloading DNN layers involves a hard combinatorial optimization problem. To this end, we develop a deep reinforcement learning (DRL)-based offloading algorithm that computes DNN layers with minimum end-to-end inference latency. We combine a long short-term memory (LSTM) network and a graph neural network (GNN) for state embedding, which exploits spatial correlation across the network to accelerate training and temporal correlation over time to reduce the overhead of state monitoring. With this embedding, our DRL algorithm can draw multiple actions from a single state observation and adapt, without retraining, to new environments unseen during the training phase. We show through extensive simulations that our algorithm outperforms existing ones in terms of both latency and robustness to feedback delay, which is inevitable in practice, achieving a performance improvement of up to 29.6% in some scenarios.