Training deep neural networks is a costly procedure, often performed via sophisticated deep learning frameworks on clusters of computers. As faster processor technologies are integrated into these cluster facilities (e.g., NVIDIA's graphics accelerators or Google's tensor processing units), the communication component of the training process rapidly becomes a performance bottleneck. In this paper, we offer a complete analysis of the key collective communication primitive for the distributed data-parallel training of convolutional neural networks (CNNs), focused on three relevant instances of the Message Passing Interface (MPI): MPICH, OpenMPI, and IntelMPI. In addition, our experimental evaluation is extended to expose the practical impact of this collective primitive when the training is performed using TensorFlow+Horovod on a 16-node cluster. Finally, the theoretical analysis is further refined to cover a number of accelerated cluster configurations, which are emulated by adjusting the communication-to-arithmetic ratio of the training process.
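In Horovod-style data-parallel training, the collective at the heart of each training step is an Allreduce that aggregates the per-worker gradients. As a point of reference only (not code from the paper), the following minimal C/MPI sketch illustrates that operation; the buffer name `grad` and the parameter count `NPARAMS` are illustrative assumptions.

```c
/* Minimal sketch of gradient averaging in data-parallel training.
 * Each rank holds a local gradient buffer; MPI_Allreduce sums the
 * buffers across all ranks, and every rank then divides by the
 * number of workers to obtain the averaged gradient. */
#include <mpi.h>
#include <stdlib.h>

#define NPARAMS 1000000  /* hypothetical model size (number of weights) */

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);

    int nranks;
    MPI_Comm_size(MPI_COMM_WORLD, &nranks);

    float *grad = malloc(NPARAMS * sizeof(float));
    /* ... local forward/backward pass fills grad ... */

    /* Sum the local gradients of all workers; every rank receives
     * the result (in place, so no second buffer is needed). */
    MPI_Allreduce(MPI_IN_PLACE, grad, NPARAMS, MPI_FLOAT,
                  MPI_SUM, MPI_COMM_WORLD);

    /* Average: divide the global sum by the number of workers. */
    for (long i = 0; i < NPARAMS; ++i)
        grad[i] /= (float) nranks;

    /* ... weight update with the averaged gradient ... */

    free(grad);
    MPI_Finalize();
    return 0;
}
```

Because the volume of data exchanged per step is fixed by the model size while the arithmetic per step shrinks as processors get faster, this single collective increasingly dominates the step time, which is the communication-to-arithmetic trade-off the analysis above emulates.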