Periocular recognition has attracted attention in recent times. The advent of the COVID-19 pandemic and the consequent obligation to wear facial masks made face recognition problematic due to the important occlusion of the lower part of the face. In this work, a dual-input Neural Network architecture is proposed. The structure is a Siamese-like model, with two identical parallel streams (called base models) that process the two inputs separately. The input is represented by RGB images of the right eye and the left eye belonging to the same subject. The outputs of the two base models are merged through a fusion layer. The aim is to investigate how deep feature aggregation affects periocular recognition. The experimentation is performed on the Masked Face Recognition Database ((MFRED)-F-2) which includes videos of 46 participants with and without masks. Three different fusion layers are applied to understand which type of merging technique is most suitable for data aggregation. Experimental results show promising performance for almost all experimental configurations with a worst-case accuracy of 90% and a best-case accuracy of 97%.
Fusion of Periocular Deep Features in a Dual-Input CNN for Biometric Recognition
Abate, A;Cimmino, L;Nappi, M;Narducci, F
2022-01-01
Abstract
Periocular recognition has attracted attention in recent times. The advent of the COVID-19 pandemic and the consequent obligation to wear facial masks made face recognition problematic due to the important occlusion of the lower part of the face. In this work, a dual-input Neural Network architecture is proposed. The structure is a Siamese-like model, with two identical parallel streams (called base models) that process the two inputs separately. The input is represented by RGB images of the right eye and the left eye belonging to the same subject. The outputs of the two base models are merged through a fusion layer. The aim is to investigate how deep feature aggregation affects periocular recognition. The experimentation is performed on the Masked Face Recognition Database ((MFRED)-F-2) which includes videos of 46 participants with and without masks. Three different fusion layers are applied to understand which type of merging technique is most suitable for data aggregation. Experimental results show promising performance for almost all experimental configurations with a worst-case accuracy of 90% and a best-case accuracy of 97%.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.