Towards Realistic Landmark-Guided Facial Video Inpainting Based on GANs

IS&T Electronic Imaging, Image Processing: Algorithms and Systems XXII

TL;DR

Our study introduces a network designed for expression-based video inpainting, employing generative adversarial networks (GANs) to handle static and moving occlusions across video frames. By utilizing facial landmarks and an occlusion-free reference image, our model maintains the user's identity consistently across frames.

Abstract

Facial video inpainting plays a crucial role in a wide range of applications, including but not limited to the removal of obstructions in video conferencing and telemedicine, enhancement of facial expression analysis, privacy protection, integration of graphical overlays, and virtual makeup. This domain presents serious challenges due to the intricate nature of facial features and the inherent human familiarity with faces, heightening the need for accurate and persuasive completions. In addressing challenges specifically related to occlusion removal in this context, our focus is on the progressive task of generating complete images from facial data covered by masks, ensuring both spatial and temporal coherence. Our study introduces a network designed for expression-based video inpainting, employing generative adversarial networks (GANs) to handle static and moving occlusions across all frames. By utilizing facial landmarks and an occlusion-free reference image, our model maintains the user's identity consistently across frames. We further enhance emotional preservation through a customized facial expression recognition (FER) loss function, ensuring detailed inpainted outputs. Our proposed framework exhibits proficiency in eliminating occlusions from facial videos in an adaptive form, whether appearing static or dynamic on the frames, while providing realistic and coherent results.

BibTeX

If you use our work in your research, please cite our publication:

@inproceedings{EI.2024.36.10.IPAS-246,
author = {Ghorbani Lohesera, Fatemeh and Egiazarian, Karen and Knorr, Sebastian},
title = {Towards Realistic Landmark-Guided Facial Video Inpainting Based on GANs},
booktitle = {IS\&T Electronic Imaging, Image Processing: Algorithms and Systems XXII},
address = {Burlingame, California, USA},
year = {2024},
pages = {1--6},
doi = {10.2352/EI.2024.36.10.IPAS-246}
}