Weighted combination of per-frame recognition results for text recognition in a video stream

Arlazarov, V.V.; Arlazarov, V.L.; Bulatov, K.; Petrova, O.

Отрывок: Weighted combination of per-frame recognition results for text recognition… Petrova O., Bulatov K., Arlazarov V.V., Arlazarov V.L. Компьютерная оптика, 2021, том 45, №1 DOI: 10.18287/2412-6179-CO-795 83 Previous work [55] described experiments performed on the MIDV-500 [16] dataset. This dataset contains 500 video clips of identity documents captured with mobile cameras without strong distortion. However, it seems im- portant to evaluate the quality of the proposed method a...

Полная запись метаданных

Поле DC	Значение	Язык
dc.contributor.author	Petrova, O.	-
dc.contributor.author	Bulatov, K.	-
dc.contributor.author	Arlazarov, V.V.	-
dc.contributor.author	Arlazarov, V.L.	-
dc.date.accessioned	2021-03-01 10:20:06	-
dc.date.available	2021-03-01 10:20:06	-
dc.date.issued	2021-02	-
dc.identifier	Dspace\SGAU\20210228\87755	ru
dc.identifier.citation	Petrova O, Bulatov K, Arlazarov VV, Arlazarov VL. Weighted combination of per-frame recognition results for text recognition in a video stream. Computer Optics 2021, 45(1): 77-89. DOI: 10.18287/2412-6179-CO-795.	ru
dc.identifier.uri	https://dx.doi.org/10.18287/2412-6179-CO-795	-
dc.identifier.uri	http://repo.ssau.ru/handle/Zhurnal-Komputernaya-optika/Weighted-combination-of-perframe-recognition-results-for-text-recognition-in-a-video-stream-87755	-
dc.description.abstract	The scope of uses of automated document recognition has extended and as a result, recognition techniques that do not require specialized equipment have become more relevant. Among such techniques, document recognition using mobile devices is of interest. However, it is not always possible to ensure controlled capturing conditions and, consequentially, high quality of input images. Unlike specialized scanners, mobile cameras allow using a video stream as an input, thus obtaining several images of the recognized object, captured with various characteristics. In this case, a problem of combining the information from multiple input frames arises. In this paper, we propose a weighing model for the process of combining the per-frame recognition results, two approaches to the weighted combination of the text recognition results, and two weighing criteria. The effectiveness of the proposed approaches is tested using datasets of identity documents captured with a mobile device camera in different conditions, including perspective distortion of the document image and low lighting conditions. The experimental results show that the weighting combination can improve the text recognition result quality in the video stream, and the per-character weighting method with input image focus estimation as a base criterion allows one to achieve the best results on the datasets analyzed.	ru
dc.description.sponsorship	This work is partially supported by the Russian Foundation for Basic Research (projects 17-29-03236 and 18-07-01387).	ru
dc.language.iso	en	ru
dc.publisher	Самарский национальный исследовательский университет	ru
dc.relation.ispartofseries	45;1	-
dc.subject	mobile OCR	ru
dc.subject	video stream	ru
dc.subject	anytime algorithms	ru
dc.subject	weighted combination	ru
dc.subject	ensemble methods	ru
dc.title	Weighted combination of per-frame recognition results for text recognition in a video stream	ru
dc.type	Article	ru
dc.textpart	Weighted combination of per-frame recognition results for text recognition… Petrova O., Bulatov K., Arlazarov V.V., Arlazarov V.L. Компьютерная оптика, 2021, том 45, №1 DOI: 10.18287/2412-6179-CO-795 83 Previous work [55] described experiments performed on the MIDV-500 [16] dataset. This dataset contains 500 video clips of identity documents captured with mobile cameras without strong distortion. However, it seems im- portant to evaluate the quality of the proposed method a...	-
Располагается в коллекциях:	Журнал "Компьютерная оптика"

Файлы этого ресурса:

Файл	Описание	Размер	Формат
450110.pdf	Основная статья	2.67 MB	Adobe PDF	Просмотреть/Открыть

Показать базовое описание ресурса Просмотр статистики
Поделиться:

Все ресурсы в архиве электронных ресурсов защищены авторским правом, все права сохранены.

Репозиторий Самарского университета