dfdc_1st | dfdc_2nd | local_relation | xception | f3net | com_evaluate | |
---|---|---|---|---|---|---|
ours_output_ofd | 0.0182 | 0.2515 | 0.0026 | 0.3132 | 0.0133 | 0.1843 |
groundtruth | 0.0033 | 0.2464 | 0.0193 | 0.2999 | 0.0127 | 0.179 |
TK_AVS_G_Pose_Driven | 0.0044 | 0.1258 | 0 | 0.3072 | 0.0182 | 0.1402 |
TK_AVS_audio_part | 0.3745 | 0.2132 | 0 | 0.2951 | 0.0228 | 0.2787 |
LIipGAN_audio_part | 0.2216 | 0.2386 | 0.2517 | 0.3093 | 0.3475 | 0.4213 |
Wav2Lip_demo_00 | 0.4307 | 0.252 | 0.5872 | 0.3025 | 0.6765 | 0.6922 |
Wav2Lip_audio_part | 0.4018 | 0.2551 | 0.594 | 0.3076 | 0.6765 | 0.6879 |
LipGAN_demo_00 | 0.1972 | 0.2343 | 0.3083 | 0.301 | 0.3969 | 0.4425 |
use
dfdc_1st | dfdc_2nd | local_relation | xception | f3net | com_evaluate | |
---|---|---|---|---|---|---|
ours_output_ofd | 0.0149 | 0.0051 | -0.0167 | 0.0133 | 0.0006 | 0.0053 |
groundtruth | 0 | 0 | 0 | 0 | 0 | 0 |
TK_AVS_G_Pose_Driven_ | 0.0011 | -0.1206 | -0.0193 | 0.0073 | 0.0055 | -0.0388 |
TK_AVS_audio_part | 0.3712 | -0.0332 | -0.0193 | -0.0048 | 0.0101 | 0.0997 |
LIipGAN_audio_part | 0.2183 | -0.0078 | 0.2324 | 0.0094 | 0.3348 | 0.2423 |
Wav2Lip_demo_00 | 0.4274 | 0.0056 | 0.5679 | 0.0026 | 0.6638 | 0.5132 |
Wav2Lip_audio_part | 0.3985 | 0.0087 | 0.5747 | 0.0077 | 0.6638 | 0.5089 |
LipGAN_demo_00 | 0.1939 | -0.0121 | 0.289 | 0.0011 | 0.3842 | 0.2635 |
Size | PSNR | SSIM | MSE | |
---|---|---|---|---|
LipGAN | (512, 512) | 20.3095 | 0.7690 | 0.1859 |
Wav2Lip | (512, 512) | 30.4198 | 0.8917 | 0.05296 |
TK_AVS | (224, 224) | 13.8606 | 0.4022 | 0.3583 |
ours | (256, 256) | 34.300 | 0.9562 | 0.03580 |
Confidence | Min_Dist | Confidence_res | Min_Dist_res | |
---|---|---|---|---|
ours_output_demo | 1.4733 | 10.2248 | -1.0943 | -0.3341 |
gd | 2.5676 | 10.5589 | 0 | 0 |
wav2lip_demo_00 | 5.5541 | 7.0789 | 2.9865 | -3.48 |
lip_gandemo_00 | 4.6427 | 7.889 | 2.0751 | -2.6699 |
tk_avs_audio_part | 8.161 | 5.9864 | 5.5934 | -4.5725 |
wav2lip_audio_part | 3.8961 | 8.7756 | 1.3285 | -1.7833 |
lip_gan_audio_part | 3.3709 | 9.5071 | 0.8033 | -1.0518 |
ours_output_audio_part | 1.1985 | 10.8441 | -1.3691 | 0.2852 |