Abstract
Previous face forgery detection methods mainly focus on appearance features, which may be easily attacked by sophisticated manipulation. Considering the majority of current face manipulation methods generate fake faces based on a single frame, which do not take frame consistency and coordination into consideration, artifacts on frame sequences are more effective for face forgery detection. However, current sequence-based face forgery detection methods use general video classification networks directly, which discard the special and discriminative motion information for face manipulation detection. To this end, we propose an effective sequence-based forgery detection framework based on an existing video classification method. To make the motion features more expressive for manipulation detection, we propose an alternative motion consistency block instead of the original motion features module. To make the learned features more generalizable, we propose an auxiliary anomaly detection block. With these two specially designed improvements, we make a general video classification network achieve promising results on three popular face forgery datasets.
Abstract (translated)
过去的面部伪造检测方法主要关注外观特征,这可能很容易被高级操纵所攻击。考虑到当前大多数面部 manipulation 方法都是基于单个帧生成的假脸,没有考虑帧的一致性和协调性,序列中的伪影对于面部伪造检测来说更为有效。然而,现有的序列基于面部伪造检测的方法直接使用通用视频分类网络,这忽略了面部操纵检测的特殊和鉴别信息。为此,我们提出了一个基于现有视频分类方法的序列基于伪造检测框架。为了使操纵检测更具有表现力,我们提出了一个替代的动态一致性模块,而不是原始动态特征模块。为了使学习到的特征更具通用性,我们提出了一个辅助异常检测模块。通过这两个特别设计的改进,我们使一般视频分类网络在三个流行的面部伪造数据集上取得了良好的结果。
URL
https://arxiv.org/abs/2403.05172