eArticles

Home

eArticles

검색결과 돌아가기

검색화면

Export 프린트

자기지도학습을 통한 수술 영상에서의 분할 성능 향상: 복강경 간 절제술 영상 내 담관에 대한 실시간 분할 / Improving Segmentation Performance in Surgical Videos through Self-Supervised Learning: Real-time Segmentation of Bile Ducts in aparoscopic Hepatectomy Videos

Resource Type: Dissertation/ Thesis
Authors: 김보근 / Kim, Bogeun
Source
Subject: Real-time segmentation
Bile duct
Laparoscopic hepatectomy
Self-supervised learning
Surgical video guidance
Language: Korean

Online Access

Full Text (dCollection)

초록

살아있는 기증자 간 이식을 위한 복강경 간 절제술은 그 자체로 수술적 난이도가 높고, 특히 담관에 대한 식별이 세심하게 요구된다. 본 연구는 실시간 영상 유도기술을 통해 담관을 정확하게 식별하고 최적의 절개 부위를 결정할 수 있도록 도움을 주는 딥러닝 기반의 분할 모델을 제시한다. 더불어 수술 영상 분야에서 빈번하게 나타나는 충분하지 않은 labeled data의 상황과 라벨이 없는 수많은 데이터의 잠재력을 고려하여 자기지도학습의 필요성에 주목하였고, 수술 영상에서 자기지도학습의 영향력을 다방면으로 확인하였다. 데이터와 라벨링을 위해 삼성서울병원에서 수행된 복강경 간 절제술 영상 30개 중 담도가 나타나는 구간에서 프레임을 추출하였고, 총 300장의 이미지에 대해 외과전문의가 라벨링하였다. 자기지도학습을 위한 라벨이 필요없는 데이터는 10개의 수술 영상 중 전체 구간에 대하여 임의의 프레임을 1,000장씩 무작위로 샘플링하여 구성하였다. 분할 모델은 ResNet-50 인코더를 사용한 U-Net 기반의 아키텍처로 구성하였고, 자기지도학습의 최신 방법론인 MoCo v2를 인코더에 대해 적용하였다. 특히 unlabeled data와 labeled data의 공급을 조절하여 자기지도학습이 분할 성능에 미치는 영향을 정량적으로 확인하여 다양한 상황에서 자기지도학습의 잠재력을 확인하였다. 그 결과, 분할 모델은 테스트셋에 대한 평가에서 64.1% DSC(Dice Similarity Coefficient)를 나타냈고, 일반적인 학습 방법 대비 최소 +2.5%p에서 최대 +9.7%p의 성능차이를 보였다. 더불어 GPU 환경에서 79.8fps의 추론 속도를 기록하여 실제 임상에 적용할 경우 실시간으로 담관 영역에 대한 가이드를 제공할 수 있을 것으로 기대된다. 본 연구는 최근 몇 년 동안 빠르게 진전된 자기지도학습 방법론을 수술 분야에 적용하였으며, 특히 수술용 네비게이션의 기초가 되는 분할 작업에 대한 자기지도학습의 적용과 300건이라는 작은 데이터셋으로 그 효과성을 입증하였다. 본 연구의 결과로 말미암아 labeled data가 충분하지 않은 수술 분야에서 자기지도학습은 주목할 만한 잠재력을 나타내며, 자기지도학습은 분할 작업 외 다른 다양한 작업에서 향상된 일반화 성능을 이끌어갈 가능성이 있다는 것을 시사하고 있다.
Laparoscopic hepatectomy for living donor liver transplantation is inherently challenging, particularly requiring the meticulous identification of the bile duct. This study proposes a deep learning-based segmentation model that aids in accurately identifying the bile duct and determining the optimal incision site through real-time video guidance technology. In addition, considering the limitations of labeled data and the potential of unlabeled data, the study emphasizes the need for self-supervised learning (SSL). The segmentation model was constructed using a U-Net architecture with a ResNet-50 encoder, and state-of-the-art SSL method (MoCo v2) was applied to the encoder. Particularly, by adjusting the supply of unlabeled and labeled data, we quantitatively assessed the impact of SSL on the segmentation performance. This allowed us to explore the potential of SSL in various scenarios, providing insights into its effectiveness under different circumstances. As a result, the segmentation model exhibited a Dice Similarity Coefficient(DSC) of 64.1% on the test set, indicating up to +9.7%p DSC compared to conventional methods. This highlights the noteworthy potential of self-supervised learning in the field of surgery, where labeled data is often insufficient, and cases applying SSL are not as prevalent. Moreover, with an inference speed of 79.8 fps in a GPU, the proposed model is expected to provide real-time guidance for the bile duct area when applied in actual clinical settings.

공지

DAU Library

eArticles

요약정보

Online Access

초록