Excerpt: It contains manuscripts written in modern German. The training sample consists of 353 lines, the validation sample of 29 lines, and the test sample of 87 lines. Schiller contains handwritten texts written in modern German. The training sample consists of 244 lines, the validation sample of 21 lines, and the test sample of 63 lines. Ricordi contains handwritten texts written in Italian. The training sample consists of 295 lines, the validation sample of 19 lines, and the test sample of 69 lines...
Full metadata record
| DC Field | Value | Language |
| --- | --- | --- |
| dc.contributor.author | Shonenkov, A.V. | - |
| dc.contributor.author | Karachev, D.K. | - |
| dc.contributor.author | Novopoltsev, M.Y. | - |
| dc.contributor.author | Potanin, M.S. | - |
| dc.contributor.author | Dimitrov, D.V. | - |
| dc.contributor.author | Chertok, A.V. | - |
| dc.date.accessioned | 2023-05-04 11:01:13 | - |
| dc.date.available | 2023-05-04 11:01:13 | - |
| dc.date.issued | 2022-06 | - |
| dc.identifier | Dspace\SGAU\20230413\103043 | ru |
| dc.identifier | Dspace\SGAU\20230426\103043 | ru |
| dc.identifier | Dspace\SGAU\20230503\103043 | ru |
| dc.identifier.citation | Shonenkov AV, Karachev DK, Novopoltsev MY, Potanin MS, Dimitrov DV, Chertok AV. Handwritten text generation and strikethrough characters augmentation. Computer Optics 2022; 46(3): 455-464. DOI: 10.18287/2412-6179-CO-1049. | ru |
| dc.identifier.uri | https://dx.doi.org/10.18287/2412-6179-CO-1049 | - |
| dc.identifier.uri | http://repo.ssau.ru/handle/Zhurnal-Komputernaya-optika/Handwritten-text-generation-and-strikethrough-characters-augmentation-103043 | - |
| dc.description.abstract | We introduce two data augmentation techniques, which, used with a Resnet-BiLSTM-CTC network, significantly reduce Word Error Rate and Character Error Rate beyond best-reported results on handwriting text recognition tasks. We apply a novel augmentation that simulates strikethrough text (HandWritten Blots) and a handwritten text generation method based on printed text (StackMix), which proved to be very effective in handwriting text recognition tasks. StackMix uses a weakly supervised framework to get character boundaries. Because these data augmentation techniques are independent of the network used, they could also be applied to enhance the performance of other networks and approaches to handwriting text recognition. Extensive experiments on ten handwritten text datasets show that HandWritten Blots augmentation and StackMix significantly improve the quality of handwriting text recognition models. | ru |
| dc.language.iso | en | ru |
| dc.publisher | Samara National Research University | ru |
| dc.relation.ispartofseries | 46;3 | - |
| dc.subject | data augmentation | ru |
| dc.subject | handwritten text recognition | ru |
| dc.subject | strikethrough text | ru |
| dc.subject | computer vision | ru |
| dc.subject | StackMix | ru |
| dc.subject | handwritten blots | ru |
| dc.title | Handwritten text generation and strikethrough characters augmentation | ru |
| dc.type | Article | ru |
Appears in collections: Journal "Computer Optics"

Files in this item:

| File | Description | Size | Format |
| --- | --- | --- | --- |
| 2412-6179_2022_46-3_455-464.pdf | Main article | 1.32 MB | Adobe PDF |



All items in the electronic resource archive are protected by copyright, with all rights reserved.