Deep convolutional neural networks have been increasingly applied to image segmentation because of their ability to extract detailed features from images. Among them, the encoder-decoder architecture is one of the most successful network designs for segmentation: U-Net combines an encoder and a decoder to segment images at the pixel level. U-Net extracts visual information with multi-scale convolutional layers, but these layers cannot capture long-distance dependencies. Inspired by the Transformer, this work proposes a bidirectional Transformer U-Net (BTU-Net) model to capture both local and global information in an image. The BTU-Net consists of an encoder with five down-sampling layers and a decoder with five up-sampling layers; the first two layers use multi-scale convolution modules, while the final three layers use bidirectional Transformer hybrid convolution modules. By introducing convolution layers and bidirectional convolution layers, the quadratic complexity of the traditional self-attention mechanism is reduced to linear. Experiments show that the proposed model achieves IoU, F1-score, accuracy, recall, and precision of 61.9%, 67.2%, 83.9%, 63.3%, and 84.3%, respectively, which is comparable to other network models.
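As a minimal sketch of the complexity claim (not the paper's exact hybrid module): standard self-attention materialises an n×n score matrix, so its cost grows quadratically with the number of tokens n, while a kernelised linear-attention variant reorders the matrix products so that no n×n matrix is ever formed. The feature map (ReLU(x)+1) and the tensor shapes below are illustrative assumptions.

```python
import numpy as np

def softmax_attention(Q, K, V):
    # Standard self-attention: builds an (n x n) score matrix,
    # so time and memory scale quadratically with sequence length n.
    scores = Q @ K.T / np.sqrt(Q.shape[-1])
    scores -= scores.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V

def linear_attention(Q, K, V, eps=1e-6):
    # Kernelised linear attention: phi(Q) @ (phi(K).T @ V) reorders the
    # matmuls so only (d x d) intermediates appear -- cost linear in n.
    phi = lambda x: np.maximum(x, 0.0) + 1.0  # ReLU(x)+1, a positive feature map
    Qp, Kp = phi(Q), phi(K)
    KV = Kp.T @ V                    # (d x d), independent of n
    Z = Qp @ Kp.sum(axis=0)          # per-query normaliser, shape (n,)
    return (Qp @ KV) / (Z[:, None] + eps)

n, d = 64, 16                        # n tokens (flattened feature-map pixels)
rng = np.random.default_rng(0)
Q, K, V = (rng.standard_normal((n, d)) for _ in range(3))
print(softmax_attention(Q, K, V).shape)  # (64, 16)
print(linear_attention(Q, K, V).shape)   # (64, 16)
```

The two variants are not numerically identical; the sketch only shows how the reordering removes the quadratic intermediate while preserving the output shape.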