Semantic segmentation empowers various real-world applications. However, the substantial computational cost, such as the O(k²) time complexity of multi-headed self-attention with respect to the number of tokens k, poses challenges for deploying these models on edge devices with constrained hardware resources. This paper introduces a novel family of backbones designed for real-time semantic segmentation, referred to as the Linear and Re-parameter Vision Transformer (LARFormer). In particular, we introduce a Re-parameter Mobile Block (RMB), which employs three branches during training that are merged into a single branch at inference. Furthermore, we introduce Linear Separable Self-Attention (LSSA), which reduces the attention complexity from O(k²) to O(k). Extensive experiments on the ADE20K and Pascal VOC 2012 datasets demonstrate the effectiveness of the proposed LARFormer, which achieves a promising trade-off between segmentation accuracy and inference speed.
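The abstract does not give the exact formulation of LSSA, but the O(k²)-to-O(k) reduction it claims is characteristic of separable attention schemes, in which per-token scores produce a single global context vector instead of a full k×k attention map. The sketch below is a minimal, hypothetical illustration of that general idea in NumPy; the function name, weight matrices (Wi, Wk, Wv, Wo), and ReLU choice are assumptions, not the paper's actual design.

```python
import numpy as np

def separable_self_attention(x, Wi, Wk, Wv, Wo):
    """Hypothetical O(k) separable attention over k tokens of dimension d.

    x: (k, d) token matrix; Wi: (d, 1); Wk, Wv, Wo: (d, d).
    No k x k attention matrix is ever formed, so cost is O(k * d) in k.
    """
    scores = x @ Wi                          # (k, 1) one scalar score per token
    scores = np.exp(scores - scores.max())   # numerically stable softmax
    scores = scores / scores.sum()           # softmax over the k tokens
    keys = x @ Wk                            # (k, d) key projections
    context = (scores * keys).sum(axis=0)    # (d,) single global context vector
    values = np.maximum(x @ Wv, 0.0)         # (k, d) value projections with ReLU
    out = values * context                   # broadcast: each token gated by context
    return out @ Wo                          # (k, d) output projection
```

Because the interaction between tokens is mediated only by the (d,)-sized context vector, the cost grows linearly in the token count k rather than quadratically, which is the trade-off such designs make: global information is compressed into one vector instead of pairwise token-to-token scores.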