학술논문

Home

자료검색

학술논문

검색결과 돌아가기

검색화면

내보내기 프린트

Explainable Encoder-Decoder Crack Segmentation: Convolutional Network Vs. Transformer

Resource Type: Conference
Authors: Al-Huda, Zaid; Al-antari, Mugahed A.; Peng, Bo; Saleh, Radhwan A.A.
Source: 2023 3rd International Conference on Emerging Smart Technologies and Applications (eSmarTA) Emerging Smart Technologies and Applications (eSmarTA), 2023 3rd International Conference on. :1-7 Oct, 2023
Subject: Bioengineering
Communication, Networking and Broadcast Technologies
Computing and Processing
Engineered Materials, Dielectrics and Plasmas
Fields, Waves and Electromagnetics
General Topics for Engineers
Photonics and Electrooptics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Transportation
Training
Image segmentation
Visualization
Roads
Predictive models
Transformers
Robustness
Crack Segmentation
Convolutional Encoder Decoder
Vison Transformer
Attention Mechanism
Explainable Saliency maps
Language

Online Access

Full Text (IEEE)

초록

Crack segmentation is a crucial task in various domains, particularly in infrastructure inspection, civil engineering, and road maintenance. To accurately detect various cracks from the input RGB images, two artificial intelligence (AI) approaches have been presented for segmentation purpose: Convolutional Neural Network (CNN) and transformer-based techniques (ViT). In this study, we present a comparative analysis to evaluate the robustness of convolutional networks against ViT. CNNs are built upon a series of convolutional and pooling layers, designed to capture local patterns and features in an image. On the other hand, ViTs utilize self-attention mechanisms to capture global relationships within a sequence of input patches from the image. In addition to quantitative evaluation comparison, qualitative visual explainable heat saliency maps are derived. We use two crack datasets for comparison evaluation purposes: Crack500 and DeepCrack. We compare the evaluation results among six XAI models (three models for each; CNN and ViT). The segmentation results show the commutative measurements among the CNN and ViT models. Such a comprehensive comparison study could be helpful to assist the researchers in the domain for the best model selection.

공지

DAU Library

학술논문

요약정보

Explainable Encoder-Decoder Crack Segmentation: Convolutional Network Vs. Transformer

Online Access

초록