Отрывок: Fig. 5. Neural network convergence plot and training statistics In fig. 6 we show some images from the used da- tasets. While HPatches is a dataset of the general image patches mostly containing outdoors images the MIDV- 500 and MIDV-2019 datasets contain document images. The second one introduces heavier projective distortions and is considered to be harder. Both datasets have various complex backgrounds and are challenging for the task. 3. Results In tab. 4, 5, and ...
Полная запись метаданных
Поле DC Значение Язык
dc.contributor.authorSheshkus, A.-
dc.contributor.authorChirvonaya, A.-
dc.contributor.authorArlazarov, V.L.-
dc.date.accessioned2023-05-04 11:01:26-
dc.date.available2023-05-04 11:01:26-
dc.date.issued2022-06-
dc.identifierDspace\SGAU\20230413\103041ru
dc.identifierDspace\SGAU\20230426\103041ru
dc.identifierDspace\SGAU\20230503\103041ru
dc.identifier.citationSheshkus A, Chirvonaya A, Arlazarov VL. Tiny CNN for feature point description for document analysis: approach and dataset. Computer Optics 2022; 46(3): 429-435. DOI: 10.18287/2412-6179-CO-1016.ru
dc.identifier.urihttps://dx.doi.org/10.18287/2412-6179-CO-1016-
dc.identifier.urihttp://repo.ssau.ru/handle/Zhurnal-Komputernaya-optika/Tiny-CNN-for-feature-point-description-for-document-analysis-approach-and-dataset-103041-
dc.description.abstractIn this paper, we study the problem of feature points description in the context of document analysis and template matching. Our study shows that specific training data is required for the task especially if we are to train a lightweight neural network that will be usable on devices with limited computational resources. In this paper, we construct and provide a dataset of photo and synthetically generated images and a method of training patches generation from it. We prove the effectiveness of this data by training a lightweight neural network and show how it performs in both general and documents patches matching. The training was done on the provided dataset in comparison with HPatches training dataset and for the testing, we solve HPatches testing framework tasks and template matching task on two publicly available datasets with various documents pictured on complex backgrounds: MIDV-500 and MIDV-2019.ru
dc.description.sponsorshipThis work was supported by the Russian Foundation for Basic Research (projects 18-29-26033 and 19-29-09064).ru
dc.language.isoenru
dc.publisherСамарский национальный исследовательский университетru
dc.relation.ispartofseries46;3-
dc.subjectfeature points descriptionru
dc.subjecttraining datasetru
dc.subjectmetrics learningru
dc.titleTiny CNN for feature point description for document analysis: approach and datasetru
dc.typeArticleru
dc.textpartFig. 5. Neural network convergence plot and training statistics In fig. 6 we show some images from the used da- tasets. While HPatches is a dataset of the general image patches mostly containing outdoors images the MIDV- 500 and MIDV-2019 datasets contain document images. The second one introduces heavier projective distortions and is considered to be harder. Both datasets have various complex backgrounds and are challenging for the task. 3. Results In tab. 4, 5, and ...-
Располагается в коллекциях: Журнал "Компьютерная оптика"

Файлы этого ресурса:
Файл Описание Размер Формат  
2412-6179_2022_46-3_429-435.pdfОсновная статья1.46 MBAdobe PDFПросмотреть/Открыть



Все ресурсы в архиве электронных ресурсов защищены авторским правом, все права сохранены.