Transformers, originally introduced for machine translation and built upon the self-attention mechanism, have undergone a remarkable evolution, establishing themselves as the bedrock of large language models (LLMs). Their capacity to model intricate relationships and capture long-range dependencies within sequences has propelled their prominence. This article, written in a popular-science style, serves as an introduction to the transformer architecture, explaining the innovations that enable it to process long sequences efficiently and to capture dependencies over extended distances. We believe this resource will prove valuable to college students and young researchers aspiring to study and conduct research in modern Artificial Intelligence (AI).