Full metadata record
| DC Field | Value | Language |
|---|---|---|
| dc.date | 2023-02 | |
| dc.date.accessioned | 2025-08-27T05:20:59Z | - |
| dc.date.available | 2025-08-27T05:20:59Z | - |
| dc.date.issued | 2023-02 | |
| dc.identifier.identifier | Dspace\SGAU\20230216\102049 | |
| dc.identifier.citation | Bakshandaeva D, Dimitrov D, Arkhipkin V, Shonenkov A, Potanin M, Karachev D, Kuznetsov A, Voronov A, Petiushko A, Davydova V, Tutubalina E. Many heads but one brain: FusionBrain – a single multimodal multitask architecture and a competition. Computer Optics 2023; 47(1): 185-195. DOI: 10.18287/ 2412-6179-CO-1220. | |
| dc.identifier.uri | 10.18287/2412-6179-CO-1220 | |
| dc.identifier.uri | http://repo.ssau.ru/jspui/handle/123456789/22868 | - |
| dc.description.abstract | Supporting the current trend in the AI community, we present the AI Journey 2021 Challenge called FusionBrain, the first competition which is targeted to make a universal architecture which could process different modalities (in this case, images, texts, and code) and solve multiple tasks for vision and language. The FusionBrain Challenge combines the following specific tasks: Code2code Translation, Handwritten Text recognition, Zero-shot Object Detection, and Visual Question Answering. We have created datasets for each task to test the participants’ submissions on it. Moreover, we have collected and made publicly available a new handwritten dataset in both English and Russian, which consists of 94,128 pairs of images and texts. We also propose a multimodal and multitask architecture – a baseline solution, in the centre of which is a frozen foundation model and which has been trained in Fusion mode along with Single-task mode. The proposed Fusion approach proves to be competitive and more energy-efficient compared to the task-specific one. | |
| dc.description.sponsorship | We would like to thank Sber and SberCloud for granting the GPU-resources to us to experiment with different architectures and also to the participants to train their models, and for supporting the FusionBrain Challenge in general. | |
| dc.language | en | |
| dc.publisher | Самарский национальный исследовательский университет | |
| dc.relation.ispartofseries | 47;1 | |
| dc.title | Many heads but one brain: FusionBrain – a single multimodal multitask architecture and a competition | |
| dc.type | Article | |
| local.identifier.olduri | http://repo.ssau.ru/handle/Zhurnal-Komputernaya-optika/Many-heads-but-one-brain-FusionBrain-–-a-single-multimodal-multitask-architecture-and-a-competition-102049 | |
| local.identifier.olduri | http://repo.ssau.ru/handle/Zhurnal-Komputernaya-optika/Many-heads-but-one-brain-FusionBrain-–-a-single-multimodal-multitask-architecture-and-a-competition-102049 | |
| Appears in Collections: | Журнал "Компьютерная оптика" | |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| 21_Bakshandaeva_Dimitrov_Arkhipkin_Shonenkov_Potanin_Karachev_Kuznetsov-aut-MA-L-JuN2-gr.pdf | Основная статья | 1.49 MB | Adobe PDF | View/Open |
Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.