This project developed a novel attention algorithm for multi-stack bidirectional encoder-decoder RNN (GRU and LSTM) sequence-to-sequence models, aimed particularly at language translation tasks. The attention mechanism rearranges and multiplies matrices to score how significant each vector in the encoder output is to the vectors in the current decoder hidden states when predicting each word. Our approach achieved 98% of the performance of a fine-tuned pretrained T5-small model with 30% to 50% fewer parameters, depending on vocabulary size, making our model well suited to single-processor training, limited compute or memory, small datasets, or tasks not covered by pretrained transformers.
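To make the scoring step concrete, below is a minimal sketch in PyTorch of the general pattern described above: the encoder outputs are rearranged (transposed) and batch-multiplied against the decoder hidden states to produce per-position significance scores, which are then normalized and used to form context vectors. The function name, tensor layout, and dimensions are illustrative assumptions, not the project's actual implementation.

```python
import torch
import torch.nn.functional as F

def attention_scores(decoder_hidden, encoder_outputs):
    """Score each encoder output vector against the decoder hidden states.

    decoder_hidden:  (batch, tgt_len, hidden)  current decoder hidden states
    encoder_outputs: (batch, src_len, hidden)  encoder output vectors
    Returns context vectors (batch, tgt_len, hidden) and
    attention weights (batch, tgt_len, src_len).

    Note: a bidirectional encoder typically yields outputs of size
    2 * hidden; a projection back to the decoder's hidden size is
    assumed to have happened already (omitted here).
    """
    # Rearrange (transpose) the encoder outputs, then batch-multiply:
    # entry (i, j) scores how significant encoder vector j is to
    # decoder state i when predicting the next word.
    scores = torch.bmm(decoder_hidden, encoder_outputs.transpose(1, 2))
    weights = F.softmax(scores, dim=-1)            # normalize over source positions
    context = torch.bmm(weights, encoder_outputs)  # weighted sum of encoder outputs
    return context, weights

# Example: batch of 2, source length 7, target length 5, hidden size 256.
ctx, attn = attention_scores(torch.randn(2, 5, 256), torch.randn(2, 7, 256))
```

Expressing the scoring as a single batched matrix multiplication is what lets the significance of every encoder vector to every decoder state be computed in one pass, rather than looping over positions.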