Document image analysis and recognition: a survey

Nikolaev, D.P.; Slavin, O.A.; Bulatov, K.B.; Arlazarov, V.V.; Petrova, O.O.; Savelev, B.I.

Отрывок: At the same time, the use of large corpora, such as Google Web 1T, Google Book, and others, for the construction of language models remains relevant. Due to the widespread use of automatic text recognition technologies and, as a result, the high variabil- ity of the data, approaches that have the property of a...

Полная запись метаданных

Поле DC	Значение	Язык
dc.contributor.author	Arlazarov, V.V.	-
dc.contributor.author	Bulatov, K.B.	-
dc.contributor.author	Nikolaev, D.P.	-
dc.contributor.author	Petrova, O.O.	-
dc.contributor.author	Savelev, B.I.	-
dc.contributor.author	Slavin, O.A.	-
dc.date.accessioned	2023-06-21 14:07:02	-
dc.date.available	2023-06-21 14:07:02	-
dc.date.issued	2022-08	-
dc.identifier	Dspace\SGAU\20230601\104023	ru
dc.identifier.citation	Arlazarov VV, Andreeva EI, Bulatov KB, Nikolaev DP, Petrova OO, Savelev BI, Slavin OA. Document image analysis and recognition: a survey. Computer Optics 2022; 46(4): 567-589. DOI: 10.18287/2412-6179-CO-1020.	ru
dc.identifier.uri	https://dx.doi.org/10.18287/2412-6179-CO-1020	-
dc.identifier.uri	http://repo.ssau.ru/handle/Zhurnal-Komputernaya-optika/Document-image-analysis-and-recognition-a-survey-104023	-
dc.description.abstract	This paper analyzes the problems of document image recognition and the existing solutions. Document recognition algorithms have been studied for quite a long time, but despite this, currently, the topic is relevant and research continues, as evidenced by a large number of associated publications and reviews. However, most of these works and reviews are devoted to individual recognition tasks. In this review, the entire set of methods, approaches, and algorithms necessary for document recognition is considered. A preliminary systematization allowed us to distinguish groups of methods for extracting information from documents of different types: single-page and multi-page, with text and handwritten contents, with a fixed template and flexible structure, and digitalized via different ways: scanning, photographing, video recording. Here, we consider methods of document recognition and analysis applied to a wide range of tasks: identification and verification of identity, due diligence, machine learning algorithms, questionnaires, and audits. The groups of methods necessary for the recognition of a single page image are examined: the classical computer vision algorithms, i.e., keypoints, local feature descriptors, Fast Hough Transforms, image binarization, and modern neural network models for document boundary detection, document classification, document structure analysis, i.e., text blocks and tables localization, extraction and recognition of the details, post-processing of recognition results. The review provides a description of publicly available experimental data packages for training and testing recognition algorithms. Methods for optimizing the performance of document image analysis and recognition methods are described.	ru
dc.description.sponsorship	The reported study was funded by RFBR, project number 20-17-50177. The authors thank Sc. D. Vladimir L. Arlazarov (FRC CSC RAS), Pavel Bezmaternykh (FRC CSC RAS), Elena Limonova (FRC CSC RAS), Ph. D. Dmitry Polevoy (FRC CSC RAS), Daniil Tropin (LLC “Smart Engines Service”), Yuliya Chernysheva (LLC “Smart Engines Service”), Yuliya Shemyakina (LLC “Smart Engines Service”) for valuable comments and suggestions.	ru
dc.language.iso	en	ru
dc.publisher	Самарский национальный исследовательский университет	ru
dc.relation.ispartofseries	46;4	-
dc.subject	document recognition	ru
dc.subject	image normalization	ru
dc.subject	binarization	ru
dc.subject	local features	ru
dc.subject	segmentation	ru
dc.subject	document boundary detection	ru
dc.subject	artificial neural network	ru
dc.subject	information extraction	ru
dc.subject	document sorting	ru
dc.subject	document comparison	ru
dc.subject	video sequence recognition	ru
dc.title	Document image analysis and recognition: a survey	ru
dc.type	Article	ru
dc.textpart	At the same time, the use of large corpora, such as Google Web 1T, Google Book, and others, for the construction of language models remains relevant. Due to the widespread use of automatic text recognition technologies and, as a result, the high variabil- ity of the data, approaches that have the property of a...	-
Располагается в коллекциях:	Журнал "Компьютерная оптика"

Файлы этого ресурса:

Файл	Описание	Размер	Формат
2412-6179_2022_46-4_567-589.pdf		1.29 MB	Adobe PDF	Просмотреть/Открыть

Показать базовое описание ресурса Просмотр статистики
Поделиться:

Все ресурсы в архиве электронных ресурсов защищены авторским правом, все права сохранены.

Репозиторий Самарского университета