Federated Learning (FL) is a technique for training models on distributed edge devices using local data samples. Differential Privacy (DP) can be combined with FL to provide a formal privacy guarantee for sensitive on-device data. Our goal is to train a large neural network language model (NNLM) on compute-constrained devices while preserving privacy using FL and DP. However, the noise required to guarantee differential privacy increases with model size, which often prevents convergence. We propose Partial Embedding Updates (PEU), a novel technique that reduces the impact of DP noise by decreasing payload size. Furthermore, we adopt Low Rank Adaptation (LoRA) and Noise Contrastive Estimation (NCE) to reduce the memory demands of large models on compute-constrained devices. We demonstrate, both in simulation and on real devices, that this combination of techniques makes it possible to train large-vocabulary language models while preserving accuracy and privacy.
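The following is a minimal sketch (not the paper's implementation) of why DP noise grows more harmful as the payload gets larger, which is the motivation for reducing payload size. It assumes the standard Gaussian mechanism with per-update norm clipping; the function names (`privatize`, `snr`) and the specific dimensions are hypothetical choices for illustration only.

```python
import numpy as np

rng = np.random.default_rng(0)

def privatize(update, clip_norm=1.0, noise_multiplier=1.0):
    """Gaussian mechanism sketch: clip the client update to clip_norm,
    then add i.i.d. Gaussian noise with sigma = noise_multiplier * clip_norm
    to every coordinate."""
    norm = np.linalg.norm(update)
    clipped = update * min(1.0, clip_norm / norm)
    noise = rng.normal(0.0, noise_multiplier * clip_norm, size=update.shape)
    return clipped + noise

def snr(dim, clip_norm=1.0, noise_multiplier=1.0):
    """Per-coordinate signal after clipping is on the order of C / sqrt(d),
    while per-coordinate noise std is sigma = z * C, independent of d.
    So signal-to-noise per coordinate shrinks as the payload dimension grows."""
    signal = clip_norm / np.sqrt(dim)
    return signal / (noise_multiplier * clip_norm)

small = snr(dim=10_000)      # hypothetical reduced payload
large = snr(dim=1_000_000)   # hypothetical full large-vocabulary payload
print(small / large)         # roughly 10: 100x smaller payload, ~10x better SNR
```

Under this model, shrinking the transmitted payload by a factor of 100 improves per-coordinate signal-to-noise by a factor of about 10, which is the kind of effect a payload-reduction technique like PEU exploits.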