Tiny CNN for feature point description for document analysis: approach and dataset

Arlazarov, V.L.; Chirvonaya, A.; Sheshkus, A.

Excerpt: Fig. 5. Neural network convergence plot and training statistics. In fig. 6 we show some images from the used datasets. While HPatches is a dataset of general image patches mostly containing outdoor images, the MIDV-500 and MIDV-2019 datasets contain document images. The second one introduces heavier projective distortions and is considered to be harder. Both datasets have various complex backgrounds and are challenging for the task. 3. Results In tab. 4, 5, and ...

Full metadata record

DC Field	Value	Language
dc.contributor.author	Sheshkus, A.	-
dc.contributor.author	Chirvonaya, A.	-
dc.contributor.author	Arlazarov, V.L.	-
dc.date.accessioned	2023-05-04 11:01:26	-
dc.date.available	2023-05-04 11:01:26	-
dc.date.issued	2022-06	-
dc.identifier	Dspace\SGAU\20230413\103041	ru
dc.identifier	Dspace\SGAU\20230426\103041	ru
dc.identifier	Dspace\SGAU\20230503\103041	ru
dc.identifier.citation	Sheshkus A, Chirvonaya A, Arlazarov VL. Tiny CNN for feature point description for document analysis: approach and dataset. Computer Optics 2022; 46(3): 429-435. DOI: 10.18287/2412-6179-CO-1016.	ru
dc.identifier.uri	https://dx.doi.org/10.18287/2412-6179-CO-1016	-
dc.identifier.uri	http://repo.ssau.ru/handle/Zhurnal-Komputernaya-optika/Tiny-CNN-for-feature-point-description-for-document-analysis-approach-and-dataset-103041	-
dc.description.abstract	In this paper, we study the problem of feature points description in the context of document analysis and template matching. Our study shows that specific training data is required for the task especially if we are to train a lightweight neural network that will be usable on devices with limited computational resources. In this paper, we construct and provide a dataset of photo and synthetically generated images and a method of training patches generation from it. We prove the effectiveness of this data by training a lightweight neural network and show how it performs in both general and documents patches matching. The training was done on the provided dataset in comparison with HPatches training dataset and for the testing, we solve HPatches testing framework tasks and template matching task on two publicly available datasets with various documents pictured on complex backgrounds: MIDV-500 and MIDV-2019.	ru
dc.description.sponsorship	This work was supported by the Russian Foundation for Basic Research (projects 18-29-26033 and 19-29-09064).	ru
dc.language.iso	en	ru
dc.publisher	Samara National Research University	ru
dc.relation.ispartofseries	46;3	-
dc.subject	feature points description	ru
dc.subject	training dataset	ru
dc.subject	metrics learning	ru
dc.title	Tiny CNN for feature point description for document analysis: approach and dataset	ru
dc.type	Article	ru
dc.textpart	Fig. 5. Neural network convergence plot and training statistics. In fig. 6 we show some images from the used datasets. While HPatches is a dataset of general image patches mostly containing outdoor images, the MIDV-500 and MIDV-2019 datasets contain document images. The second one introduces heavier projective distortions and is considered to be harder. Both datasets have various complex backgrounds and are challenging for the task. 3. Results In tab. 4, 5, and ...	-
Appears in collections:	Journal "Computer Optics"

Files in this item:

File	Description	Size	Format
2412-6179_2022_46-3_429-435.pdf	Main article	1.46 MB	Adobe PDF

All items in the electronic resource archive are protected by copyright, with all rights reserved.

Samara University Repository