학술논문

Home

자료검색

학술논문

검색결과 돌아가기

검색화면

내보내기 프린트

동작 인식을 위한 교사-학생 구조 기반 CNN
Teacher-Student Architecture Based CNN for Action Recognition

Resource Type: Article
Authors: Yulan Zhao; 이효종 ( Hyo Jong Lee
Source: 정보처리학회논문지. 컴퓨터 및 통신시스템 / KIPS Transactions on Computer and Communication Systems. Mar 31, 2022 11(3):99
Subject: Two-Stream Network
Teacher-Student Architecture
CNN
Optical Flow
Action Recognition
양 스트림
교사-학생 아키텍처
광학 흐름
동작 인식
Language: Korean
English
ISSN: 2287-5891

Online Access

Full Text (KISS)

초록

대부분 첨단 동작 인식 컨볼루션 네트워크는 RGB 스트림과 광학 흐름 스트림, 양 스트림 아키텍처를 기반으로 하고 있다. RGB 프레임 스트림은 모양 특성을 나타내고 광학 흐름 스트림은 동작 특성을 해석한다. 그러나 광학 흐름은 계산 비용이 매우 높기 때문에 동작 인식 시간에 지연을 초래한다. 이에 양 스트림 네트워크와 교사-학생 아키텍처에서 영감을 받아 행동 인식을 위한 새로운 네트워크 디자인을 개발하였다. 제안 신경망은 두 개의 하위 네트워크로 구성되어있다. 즉, 교사 역할을 하는 광학 흐름 하위 네트워크와 학생 역할을 하는 RGB 프레임 하위 네트워크를 연결하였다. 훈련 단계에서 광학 흐름의 특징을 추출하고 교사 서브 네트워크를 훈련시킨 다음 그 특징을 학생 서브 네트워크를 훈련시키기 위한 기준선으로 지정하여 학생 서브 네트워크에 전송한다. 테스트 단계에서는 광학 흐름을 계산하지 않고 대기 시간이 줄어들도록 학생 네트워크만 사용한다. 제안 네트워크는 실험을 통하여 정확도 면에서 일반 이중 스트림 아키텍처에 비해 높은 정확도를 보여주는 것을 확인하였다.
Convolutional neural network (CNN) generally uses two-stream architecture RGB and optical flow stream for its action recognition function. RGB frames stream display appearance and optical flow stream interprets its action. However, the standard method of using optical flow is costly in its computational time and latency associated with increased action recognition. The purpose of the study was to evaluate a novel way to create a two sub-networks in neural networks. The optical flow sub-network was assigned as a teacher and the RGB frames as a student. In the training stage, the optical flow sub-network extracts features through the teacher sub-network and transmits the information to student sub-network for baseline training. In the test stage, only student sub-network was operational with decreased in latency without computing optical flow. Experimental results shows that our network fed only by RGB stream gets a competitive accuracy of 54.5% on HMDB51, which is 1.5 times better than that on R3D-18.

공지

DAU Library

학술논문

요약정보

동작 인식을 위한 교사-학생 구조 기반 CNN
Teacher-Student Architecture Based CNN for Action Recognition

Online Access

초록

학술논문

동작 인식을 위한 교사-학생 구조 기반 CNNTeacher-Student Architecture Based CNN for Action Recognition

동작 인식을 위한 교사-학생 구조 기반 CNN
Teacher-Student Architecture Based CNN for Action Recognition