Отрывок: Here, BU is original U-Net-bin method and RUB – its retrained version which used some domain data from the newly annotated PWGT for MIDV-500. The choice of Otsu stems from its results in the first experiment and the fact that it is still a common baseline for the task of binarization. Unfortunately, the size of T2020 is too big, so PWGT preparation is too resource consuming. Thus, Bgt is not available for this experiment. The s...
Название : A joint study of deep learning-based methods for identity document image binarization and its influence on attribute recognition
Авторы/Редакторы : Sánchez-Rivero, R.
Bezmaternykh, P.V.
Gayer, A.V.
Morales-González, A.
José Silva-Mata, F.
Bulatov, K.B.
Ключевые слова : document image binarization
identity document recognition
optical character recognition
deep learning
U-Net architecture
Дата публикации : Авг-2023
Издательство : Самарский национальный исследовательский университет
Библиографическое описание : Sánchez-Rivero R, Bezmaternykh P, Gayer A, Morales-González A, José Silva-Mata F, Bulatov K. A joint study of deep learning-based methods for identity document image binarization and its influence on attribute recognition. Computer Optics 2023; 47(4): 627-636. DOI: 10.18287/2412-6179-CO-1207.
Серия/номер : 47;4
Аннотация : Text recognition has benefited considerably from deep learning research, as well as the preprocessing methods included in its workflow. Identity documents are critical in the field of document analysis and should be thoroughly researched in relation to this workflow. We propose to examine the link between deep learning-based binarization and recognition algorithms for this sort of documents on the MIDV-500 and MIDV-2020 datasets. We provide a series of experiments to illustrate the relation between the quality of the collected images with respect to the binarization results, as well as the influence of its output on final recognition performance. We show that deep learning-based binarization solutions are affected by the capture quality, which implies that they still need significant improvements. We also show that proper binarization results can improve the performance for many recognition methods. Our retrained U-Net-bin outperformed all other binarization methods, and the best result in recognition was obtained by Paddle Paddle OCR v2.
URI (Унифицированный идентификатор ресурса) : https://dx.doi.org/10.18287/2412-6179-CO-1207
http://repo.ssau.ru/handle/Zhurnal-Komputernaya-optika/A-joint-study-of-deep-learningbased-methods-for-identity-document-image-binarization-and-its-influence-on-attribute-recognition-107771
Другие идентификаторы : Dspace\SGAU\20231228\107771
Располагается в коллекциях: Журнал "Компьютерная оптика"

Файлы этого ресурса:
Файл Описание Размер Формат  
2412-6179_2023_47_4_627-636.pdf1.12 MBAdobe PDFПросмотреть/Открыть



Все ресурсы в архиве электронных ресурсов защищены авторским правом, все права сохранены.