OctShuffleMLT: A Compact Octave Based Neural Network for End-to-End Multilingual Text Detection and Recognition

Resource Type: Conference
Authors: Lundgren, Antonio; Castro, Dayvid; Lima, Estanislau; Bezerra, Byron
Source: 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW), 4:37-42, Sep. 2019
Subject: Computing and Processing
Feature extraction
Text recognition
Adaptation models
Computational modeling
Computer architecture
Task analysis
Optical character recognition software
scene text
text detection
text recognition
Octave Convolutions
deep learning

Abstract

In recent years, scene text detection has progressed rapidly, especially with the development of convolutional neural networks. However, many challenges remain in applying very deep networks to real-world applications with hardware limitations, such as robots and smartphones. To address these challenges, we propose OctShuffleMLT, an effective fully convolutional neural network with fewer layers and parameters that can precisely detect multilingual scene text. The proposed model is based on Octave Convolutions and uses compact blocks, which reduces memory usage at inference by 13.16%, FLOPs by 71.86%, and the number of parameters by 34.04% compared to the baseline system. Extensive experiments were conducted on the ICDAR 2015 and ICDAR 2017 datasets. Experimental results show that our model produces accurate detection and recognition results on both datasets. The code for the paper is available in the GitHub repository https://github.com/victoic/OctShuffle-MLT.
