Projects per year
Abstract
Face presentation attacks, also known as spoofing attacks, pose a substantial threat to biometric systems that rely on facial recognition systems, such as access control systems, mobile payments, and identity verification systems. To mitigate the spoofing risk, several video-based methods have been presented in the literature that analyze facial motion in successive video
frames. However, estimating the motion between adjacent frames is a challenging task and requires high computational cost. In this paper, we rephrase the face anti-spoofing task as a motion prediction problem and introduce a deep ensemble learning model with a frame skipping mechanism. In particular, the proposed frame skipping adopts a uniform sampling approach by dividing the original video into video clips of fixed size. By doing so, every nth frame of the clip is selected to ensure that the temporal patterns can easily be perceived during the training of three different recurrent neural networks (RNNs). Motivated by the performance of individual RNNs, a meta-model is developed to improve the overall detection performance by combining
the prediction of individual RNNs. Extensive experiments were performed on four datasets, and state-of-the-art performance is reported on MSU-MFSD (3.12%), Replay-Attack (11.19%), and OULU-NPU (12.23%) databases by using half total error rates (HTERs) in the most challenging cross-dataset testing scenario.
frames. However, estimating the motion between adjacent frames is a challenging task and requires high computational cost. In this paper, we rephrase the face anti-spoofing task as a motion prediction problem and introduce a deep ensemble learning model with a frame skipping mechanism. In particular, the proposed frame skipping adopts a uniform sampling approach by dividing the original video into video clips of fixed size. By doing so, every nth frame of the clip is selected to ensure that the temporal patterns can easily be perceived during the training of three different recurrent neural networks (RNNs). Motivated by the performance of individual RNNs, a meta-model is developed to improve the overall detection performance by combining
the prediction of individual RNNs. Extensive experiments were performed on four datasets, and state-of-the-art performance is reported on MSU-MFSD (3.12%), Replay-Attack (11.19%), and OULU-NPU (12.23%) databases by using half total error rates (HTERs) in the most challenging cross-dataset testing scenario.
Original language | English |
---|---|
Title of host publication | 2023 Twelfth International Conference on Image Processing Theory, Tools and Applications (IPTA) |
Publisher | IEEE |
Number of pages | 6 |
ISBN (Print) | 979-8-3503-2541-6 |
DOIs | |
Publication status | Published - 2023 |
MoE publication type | A4 Conference publication |
Event | International Conference on Image Processing Theory, Tools and Applications - Paris, France Duration: 16 Oct 2023 → 19 Oct 2023 Conference number: 12 |
Publication series
Name | International workshops on image processing theory, tools, and applications |
---|---|
ISSN (Electronic) | 2154-512X |
Conference
Conference | International Conference on Image Processing Theory, Tools and Applications |
---|---|
Abbreviated title | IPTA |
Country/Territory | France |
City | Paris |
Period | 16/10/2023 → 19/10/2023 |
Fingerprint
Dive into the research topics of 'Deep Ensemble Learning with Frame Skipping for Face Anti-Spoofing'. Together they form a unique fingerprint.Projects
- 1 Active
-
USSEE: Understanding speech and scene with ears and eyes (USSEE)
Laaksonen, J. (Principal investigator), Pehlivan Tort, S. (Project Member), Wang, T.-J. (Project Member), Guo, Z. (Project Member), Saif, A. (Project Member) & Riahi, I. (Project Member)
01/01/2022 → 31/12/2024
Project: Academy of Finland: Other research funding