Review of Semi-Structured Document Information Extraction Techniques Based on Deep Learning

Resource Type: Conference
Authors: Li, Yangchun; Jiang, Wei; Song, Shouyou
Source: 2023 2nd International Conference on Machine Learning, Cloud Computing and Intelligent Mining (MLCCIM), pp. 112-119, Jul 2023
Subject: Computing and Processing
Deep learning
Systematics
Optical character recognition
Finance
Computer architecture
Medical services
Information retrieval
semi-structured documents
deep learning
detection and recognition
information extraction

Abstract

With the advent of global digital transformation, using deep learning-based methods to extract crucial information from semi-structured documents, such as the many types of receipts and invoices, has become essential for ensuring business stability, data security, and improved work efficiency. This paper provides a detailed review of deep learning-based techniques for information extraction, comprising a systematic introduction, hierarchical analysis, method comparison, and a summary with expectations for future development. The review begins by explaining the defining characteristics of semi-structured documents and then introduces the research background, application areas, and technical challenges of extracting information from them. It then gives an overview of two developmental stages, i.e. the shift from traditional information extraction to deep learning-based information extraction, followed by a discussion of technical architecture and method classification, which elaborates on key technologies in terms of typical datasets, detection and recognition, and information reduction. Lastly, the paper summarizes the prospects and development of the field. Future research will focus on making algorithms more general and lightweight, as well as on improving information protection capabilities and the diversity of datasets.
