학술논문

Home

자료검색

학술논문

검색결과 돌아가기

검색화면

내보내기 프린트

HCTNet: Hybrid CNN-Transformer Architecture Network for Self-Supervised Monocular Depth Estimation

Resource Type: Conference
Authors: Ma, Li; Fu, Yonghui; Lu, Xinhua; Xue, Qingji; Miao, Jingui
Source: 2023 International Conference on Computer Science and Automation Technology (CSAT) CSAT Computer Science and Automation Technology (CSAT), 2023 International Conference on. :353-357 Oct, 2023
Subject: Computing and Processing
Solid modeling
Pedestrians
Computational modeling
Estimation
Computer architecture
Virtual reality
Predictive models
monocular depth estimation
hybrid model
convolutional neural network
self-supervised learning
Transformer
Language

Online Access

Full Text (IEEE)

초록

Estimating depth from images is a crucial computer vision task with wide-ranging applications in fields such as autonomous driving, drones, and virtual reality. Self- supervised monocular depth estimation utilizes image sequences to achieve semi-supervised learning and has shown promising application prospects. However, current self-supervised methods still suffer from deficiencies in enhancing feature dependencies and properly handling local information, resulting in limited performance and low prediction accuracy. In this work, we propose a novel network architecture, HCTNet, based on the U- Net framework, aimed at further improving prediction accuracy. The network utilizes a Hybrid CNN- Transformer as the depth encoder to capture and convey contextual information, demonstrating competitive results on the KITTI dataset.

공지

DAU Library

학술논문

요약정보

HCTNet: Hybrid CNN-Transformer Architecture Network for Self-Supervised Monocular Depth Estimation

Online Access

초록