Natural Language Processing (NLP) research has seen significant success with unsupervised language pre-training techniques. However, modern self-attention models demand far more computational and memory resources than conventional NLP models, making pre-training, or even fine-tuning, quite costly. This drastically restricts their adoption in a variety of fields. To improve efficiency, we propose the Device-Cloud Collaborative Transformer, an efficient language-model framework that spans cloud and device and is designed to encourage the learning of representations that generalize better to many different tasks. Specifically, we design a Device-Cloud Collaborative Transformer architecture for large language models that benefits both cloud-side and device-side modeling. Experimental results demonstrate the effectiveness of our proposed method.