Отрывок: 2). Our FaceDetectNet architecture is based on Google- Net CNN [21] implemented in Caffe / DIGITS [22] framework and pretrained on the ILSVRC 2012 image base [23]. This basic CNN is transformed to FCN archi- tecture via excluding of full-connected layers. Finally, it contains 2 convolution layers, 9 inception modules, and 4 pooling layers. The output grid cell of FaceDetectNet is 8×8 pixels of size. We select the GoogleNet as a basic CNN model due to ...
Название : FaceDetectNet: face detection via fully-convolutional network
Авторы/Редакторы : Gorbatsevich, V.S.
Moiseenko, A.S.
Vizilter, Y.V.
Ключевые слова : CNN
face detection
DetectNet
YOLO
Дата публикации : Фев-2019
Издательство : Самарский национальный исследовательский университет им. акакдемика С.П. Королева, Институт систем обработки изображений РАН - филиал ФНИЦ «Кристаллография и фотоника» РАН
Библиографическое описание : Gorbatsevich VS, Moiseenko AS, Vizilter YV. FaceDetectNet: Face detection via fully-convolutional network. Computer Optics 2019; 43(1): 63-71. DOI: 10.18287/2412-6179-2019-43-1-63-71.
Серия/номер : 43;1
Аннотация : Face detection is one of the most popular computer vision tasks. There are a lot of face detection approaches proposed including different CNN-based techniques, but the problem of optimal balancing between detection quality and computational speed is still relevant. In this paper we propose new CNN-based solution for face detection called FaceDetectNet. Our CNN architecture is based on ideas of YOLO/DetectNet and GoogleNet architecture supported with some new tools and implementation details created especially for our face detection application. We propose: original iterative proposal clustering (IPC) algorithm for aggregation of output face proposals formed by CNN and the 2-level “weak pyramid” providing better detection quality on the testing sets containing both small and huge images. Our face detection approach is close to previously proposed SSD-based face detection, but the principal difference is that we use the deep features of top hidden CNN layer for forming the face proposals of any size. Thus we utilize the global semantic and context information for improving the detection quality for small faces. Our FaceDetectNet is trained and tested on the most challenging WIDER FACE detection benchmark. Our algorithm achieves the average precision (AP) 0.69 on the WIDER FACE hard level, and thus outperforms all competitive detectors on the Hard level besides the HR state-of-the-art solution. Note that HR solution is based on essentially deeper and slower CNN, while our FaceDetectNet can work in real-time on the NVIDIA GeForce 1080 GPU. On the other hand, SSD-based face detector with comparable CNN parameters provides AP 0.625 only on the WIDER FACE hard level. So, our approach provides the best quality with reasonable computational speed.
URI (Унифицированный идентификатор ресурса) : https://dx.doi.org/10.18287/2412-6179-2019-43-1-63-71
http://repo.ssau.ru/handle/Zhurnal-Komputernaya-optika/FaceDetectNet-face-detection-via-fullyconvolutional-network-74813
Другие идентификаторы : Dspace\SGAU\20190324\74813
ГРНТИ: 28.23.15
Располагается в коллекциях: Журнал "Компьютерная оптика"

Файлы этого ресурса:
Файл Описание Размер Формат  
430107.pdfОсновная статья1.76 MBAdobe PDFПросмотреть/Открыть



Все ресурсы в архиве электронных ресурсов защищены авторским правом, все права сохранены.