Bojia Zi, Minghao Chang ,Jingjing Chen, Yu-Gang Jiang, Fudan University.
Xingjun Ma, Deakin University.
WildDeepfake הוא מאגר המכיל 7314 רצפי פנים (face sequences) שמקורם ב-707 סרטוני DEEPFAKE שנאספו בשיתוף פעולה על ידי חוקרים מאוניברסיטת פודן שבסין, ואוניברסיטת דיקין שבאוסטרליה. הסרטונים נאספו מרחבי האינטרנט במטרה ליצור מאגר DEEPFAKE מגוון שיתרום לפיתוח אמצעים טובים יותר לזיהוי חומרי DEEPFAKE, מתוקף התבססותו על חומרים מהאינטרנט ולא על סרטוני DEEPFAKE שצולמו ונערכו במיוחד לטובת הפיתוח של אמצעים אלה.
Abstract:
In recent years, the abuse of a face swap technique called Deepfake has raised enormous public concerns. Deepfake manipulates deep learning techniques to replace one person's face in a video to someone else's without leaving obvious traces. So far, a large number of deepfake videos (also known as "deepfakes") have been crafted and uploaded to the internet, which calls for the development of effective countermeasures.One promising countermeasure against deepfakes is deepfake detection. Several deepfake datasets have been released to support the training and testing of deepfake detectors, such as DeepfakeDetection and FaceForensics++.While this has greatly advanced deepfake detection, most of the real videos in these datasets are filmed with a few volunteer actors in limited scenes, and the fake videos are crafted by researchers using a few popular deepfake softwares. Detectors developed on these datasets may lose effectiveness when applied to detect the vast variety of deepfake videos in the wild (those uploaded to varies video-sharing websites). To better support detection against real-world deepfakes, in this paper, we introduce a new dataset WildDeepfake, which consists of 7,314 face sequences extracted from 707 deepfake videos that are collected completely from the internet. WildDeepfake is a small dataset that can be used, in addition to existing datasets, to develop more effective detectors against real-world deepfakes. We conduct a systematic evaluation of a set of baseline detection networks on both existing and our WildDeepfake datasets, and show that WildDeepfake is indeed a more challenging dataset, where the detection performance can decrease drastically. We also propose two (eg. 2D and 3D) Attention-based Deepfake Detection Networks (ADDNets) to leverage the attention masks on real/fake faces for improved detection. We empirically verify the effectiveness of ADDNets on both existing and WildDeepfake.