EP4283577A3 - Text and audio-based real-time face reenactment

Info

Publication number: EP4283577A3
Application number: EP23201194.0A
Authority: EP
Inventors: Pavel Savchenkov; Maxim Lukin; Aleksandr Mashrabov
Original assignee: Snap Inc
Current assignee: Snap Inc
Priority date: 2019-01-18
Filing date: 2020-01-18
Publication date: 2024-02-14
Also published as: EP4283578A3; KR20210119441A; WO2020150688A1; EP3912159B1; KR102509666B1; EP4283578A2; EP3912159A1; CN113228163A; EP4283577A2

Abstract

A computer-implemented method is disclosed. The method receives (1005) an input text and a target image. The target image includes a target face. The method generates (1010), based on the input text, a sequence of sets of acoustic features representing the input text. The method generates (1015), based on the sequence of sets of acoustic features, a sequence of sets of mouth key points. The method generates, based on the sequence of sets of mouth key points, a sequence of sets of facial key points. The method generates (1020), based on the sequence of sets of the facial key points and the target image, a sequence of frames. The frames include the target face modified based on at least one set of mouth key points of the sequence of sets of mouth key points. The method generates (1025), based on the sequence of frames, an output video.
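
The data flow described in the abstract can be illustrated with a minimal sketch. It shows only the order of the stages (input text, acoustic features, mouth key points, facial key points, frames, output video). Every function name, the `Frame` type, and the toy computations inside each stage are hypothetical placeholders chosen for illustration; the abstract does not specify the models used at any step, and a real system would presumably use trained models where this sketch uses arithmetic.

```python
from dataclasses import dataclass
from typing import List, Tuple

Point = Tuple[float, float]


@dataclass
class Frame:
    """One output frame: the target image plus the key points applied to it."""
    target_image: str               # hypothetical identifier of the target image
    facial_key_points: List[Point]


def text_to_acoustic_features(text: str) -> List[List[float]]:
    # Placeholder (assumption): one single-value feature set per character.
    return [[(ord(ch) % 16) / 15.0] for ch in text]


def acoustic_to_mouth_key_points(features: List[List[float]]) -> List[List[Point]]:
    # Placeholder (assumption): mouth opening scales with the first feature value.
    sequence = []
    for feature_set in features:
        opening = feature_set[0]
        sequence.append([(-1.0, 0.0), (0.0, opening), (1.0, 0.0), (0.0, -opening)])
    return sequence


def mouth_to_facial_key_points(mouth_seq: List[List[Point]]) -> List[List[Point]]:
    # Placeholder (assumption): prepend two fixed non-mouth points to each set.
    static_points = [(-1.0, 2.0), (1.0, 2.0)]
    return [static_points + mouth for mouth in mouth_seq]


def render_frames(facial_seq: List[List[Point]], target_image: str) -> List[Frame]:
    # Placeholder (assumption): a frame just records the image and its key points.
    return [Frame(target_image, key_points) for key_points in facial_seq]


def reenact(text: str, target_image: str) -> List[Frame]:
    """Chain the stages in the order given in the abstract."""
    features = text_to_acoustic_features(text)
    mouth_seq = acoustic_to_mouth_key_points(features)
    facial_seq = mouth_to_facial_key_points(mouth_seq)
    frames = render_frames(facial_seq, target_image)
    return frames  # the ordered frame list stands in for the output video


if __name__ == "__main__":
    video = reenact("hi", "face.png")
    print(len(video), "frames")
```

In this sketch each stage produces one set per input character, so the sequences stay aligned from text through to frames; that one-to-one alignment is a simplification made for the example, not something stated in the abstract.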