eArticles

Home

eArticles

검색결과 돌아가기

검색화면

Export 프린트

Dynamic Transformer for Efficient Machine Translation on Embedded Devices

Resource Type: Conference
Authors: Parry, Hishan; Xun, Lei; Sabet, Amin; Bi, Jia; Hare, Jonathon; Merrett, Geoff V.
Source: 2021 ACM/IEEE 3rd Workshop on Machine Learning for CAD (MLCAD) Machine Learning for CAD (MLCAD), 2021 ACM/IEEE 3rd Workshop on. :1-6 Aug, 2021
Subject: Components, Circuits, Devices and Systems
Performance evaluation
Solid modeling
Graphics processing units
Switches
Machine learning
Dynamic scheduling
Hardware
Dynamic DNNs for NLP
Efficient Transformer
Embedded platform
Runtime Resource Management
Language

Online Access

Full Text (IEEE)

초록

The Transformer architecture is widely used for machine translation tasks. However, its resource-intensive nature makes it challenging to implement on constrained embedded devices, particularly where available hardware resources can vary at run-time. We propose a dynamic machine translation model that scales the Transformer architecture based on the available resources at any particular time. The proposed approach, ‘Dynamic-HAT’, uses a HAT SuperTransformer as the backbone to search for SubTransformers with different accuracy-latency trade-offs at design time. The optimal SubTransformers are sampled from the SuperTransformer at run-time, depending on latency constraints. The Dynamic-HAT is tested on the Jetson Nano and the approach uses inherited SubTransformers sampled directly from the SuperTransformer with a switching time of

공지

DAU Library

eArticles

요약정보

Dynamic Transformer for Efficient Machine Translation on Embedded Devices

Online Access

초록