WO2014001095A1 - Method for audiovisual content dubbing - Google Patents
- Publication number
- WO2014001095A1 (PCT/EP2013/062243)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- zones
- traceable
- objects
- video shot
- traceable objects
- Prior art date
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/168—Feature extraction; Face representation
- G06V40/171—Local features and components; Facial parts; Occluding parts, e.g. glasses; Geometrical relationships
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/02—Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
- G11B27/031—Electronic editing of digitised analogue information signals, e.g. audio or video signals
- G11B27/034—Electronic editing of digitised analogue information signals, e.g. audio or video signals on discs
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/431—Generation of visual interfaces for content selection or interaction; Content or additional data rendering
- H04N21/4318—Generation of visual interfaces for content selection or interaction; Content or additional data rendering by altering the content in the rendering process, e.g. blanking, blurring or masking an image region
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/44008—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N23/00—Cameras or camera modules comprising electronic image sensors; Control thereof
- H04N23/60—Control of cameras or camera modules
- H04N23/61—Control of cameras or camera modules based on recognised objects
- H04N23/611—Control of cameras or camera modules based on recognised objects where the recognised objects include parts of the human body
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Oral & Maxillofacial Surgery (AREA)
- General Health & Medical Sciences (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Image Processing (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
Abstract
A method for processing a video shot comprising a set of traceable objects is described. One or more traceable objects of the set of traceable objects are selected. One or more zones respectively encompassing the one or more selected traceable objects are positioned such that the respective zones are kept at the same respective position between subsequent frames of the video shot. Content of the video shot outside the positioned zones is blurred.
Description
METHOD FOR AUDIOVISUAL CONTENT DUBBING
FIELD OF THE INVENTION
The present invention relates to a solution for processing a video shot. In particular, the invention relates to a method for ergonomic and secure dubbing of audiovisual content.
BACKGROUND OF THE INVENTION
When a movie meets success in a given country, it is generally exported to other countries, which do not necessarily speak the same language. A first solution to this issue is to add subtitles in the language of the destination country. Another, sometimes preferred, solution is to replace the original audio track in the original language with an audio track in the language of the destination country. In this second solution, a particular step of replacing the voices in the original language with the voices in the destination language, called dubbing, is required. In order to perform this dubbing in good conditions, the dubbers advantageously follow with their eyes the faces on a screen, and especially the lips, related to the dubbed voices, such that the added voices are well synchronized with the actions occurring in the movie.
A problem is that it is cumbersome for the dubber to follow with his eyes the face related to the voice he wants to dub.
SUMMARY OF THE INVENTION
It is an object of the present invention to solve the aforementioned problem and to propose an improved solution for the dubbing of audiovisual content.
According to the invention, a method for processing a video shot comprising a set of traceable objects comprises the steps of:
- selecting one or more traceable objects of the set of traceable objects;
- positioning one or more zones respectively encompassing the one or more selected traceable objects such that the respective zones are kept at the same respective position between subsequent frames of the video shot; and
- blurring content of the video shot outside the positioned zones.
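The three claimed steps can be sketched in code. This is a hedged, minimal illustration, not the patent's implementation: the grayscale list-of-rows frame format, the inclusive rectangle coordinates and the all-black treatment outside the zones are assumptions chosen for clarity.

```python
# Hypothetical sketch of the claimed steps. The zone coordinates, the tiny
# frame size and the "outside pixels set to black" choice are illustrative
# assumptions, not taken from the patent text.

def mask_outside_zones(frame, zones):
    """Return a copy of a grayscale frame (list of rows) with every pixel
    outside the given zones set to 0 (black)."""
    h, w = len(frame), len(frame[0])
    out = [[0] * w for _ in range(h)]
    for (x0, y0, x1, y1) in zones:          # zone = inclusive rectangle
        for y in range(max(0, y0), min(h, y1 + 1)):
            for x in range(max(0, x0), min(w, x1 + 1)):
                out[y][x] = frame[y][x]     # keep original pixels inside zones
    return out

# The zones are kept at the same position for every frame of the shot:
zones = [(1, 1, 2, 2)]                      # one fixed zone around a "face"
shot = [[[9] * 4 for _ in range(4)] for _ in range(2)]   # two 4x4 frames
processed = [mask_outside_zones(f, zones) for f in shot]
print(processed[0][1][1], processed[0][0][0])  # inside stays 9, outside is 0
```

Because the same `zones` list is applied to every frame, the displayed object never moves on screen, which is the ergonomic point made above.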
In this way, when displayed on a screen, the objects of interest for the dubber do not move, making it easier for him to follow, for example, a face with his eyes. In addition, blurring the video content outside the positioned zones, i.e. the geographical complement of the positioned zones, keeps the visual attention of the dubber centered on the objects of interest displayed on the screen.
Another benefit is that the processed video is lighter in terms of memory size. Therefore, the processed video can be transmitted more easily, for example via a network. The video quality of the processed video may be kept maximal in the selected zones, allowing the dubbers to better follow, in particular, the movement of the lips when human faces appear.
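The memory-size benefit can be illustrated with standard lossless compression: a frame that is uniform (here, black) outside small zones compresses far better than a frame full of varied detail. The frame size, zone position and the use of `zlib` are illustrative assumptions; the patent does not specify a codec.

```python
import random
import zlib

# Illustrative only: compare the compressed size of a "busy" frame with the
# same frame masked to black outside one small zone.
random.seed(0)

w = h = 64
detailed = bytes(random.randrange(256) for _ in range(w * h))   # busy original
masked = bytearray(w * h)                                       # all black
for y in range(8, 24):                                          # one 16x16 zone
    for x in range(8, 24):
        masked[y * w + x] = detailed[y * w + x]                 # keep zone pixels

print(len(zlib.compress(detailed)), len(zlib.compress(bytes(masked))))
# the masked frame yields a much smaller compressed size
```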
Another benefit of the blurring is that, if the processed video is intercepted by movie pirates, this processed video has no commercial value in comparison to the original video to be processed. As a result, the dubbing can, thanks to the described method, be performed in both a secure and quick manner.
Advantageously, the method also allows inputting coordinates for defining the positioning of the one or more positioned zones, e.g. rectangles. In this way an overlap of zones is avoidable.

Advantageously, the method also comprises a step of resizing the one or more selected traceable objects. In this way, when a traceable object, for example a face, is tracked, resizing by zooming makes it easier to follow the lips of a talking face. Reducing the size may also avoid the overlap of two talking faces.
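The coordinate-input refinement implies a validation step so that user-entered zones do not overlap. A hedged sketch follows; the `(x0, y0, x1, y1)` rectangle format and the rejection-by-exception behaviour are assumptions, not part of the claims.

```python
# Hypothetical overlap check for user-entered zone coordinates.

def zones_overlap(a, b):
    """True if two rectangles (x0, y0, x1, y1) share any area."""
    ax0, ay0, ax1, ay1 = a
    bx0, by0, bx1, by1 = b
    return ax0 < bx1 and bx0 < ax1 and ay0 < by1 and by0 < ay1

def validate_zones(zones):
    """Reject coordinate input that would make any two zones overlap."""
    for i in range(len(zones)):
        for j in range(i + 1, len(zones)):
            if zones_overlap(zones[i], zones[j]):
                raise ValueError(f"zones {i} and {j} overlap")
    return zones

validate_zones([(0, 0, 10, 10), (12, 0, 20, 10)])   # disjoint: accepted
```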
BRIEF DESCRIPTION OF THE DRAWINGS
For a better understanding the invention shall now be explained in more detail in the following description with reference to the figures. It is understood that the invention is not limited to this exemplary embodiment and that specified features can also expediently be combined and/or modified without departing from the scope of the present invention as defined in the appended claims. In the figures:
Fig. 1 illustrates a step of selecting a traceable object according to the present invention; and
Fig. 2 illustrates the processing of a video shot according to the present invention.
DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS
A video shot comprising a set of traceable objects is provided. An example of such traceable objects is human faces appearing in videos. Their detection is well documented in the state of the art, for example in international application PCT/GB2003/005186.
For the sake of clarity, in the following it will be considered that traceable objects are human faces. As illustrated in Fig. 1, the user is provided with a set of traceable objects A, B, C and D, from which the user, thanks to a dedicated user interface, selects the traceable objects he wants to track. He selects A, B and C for example. The set of traceable objects may be extracted from the first frame of the video shot to process.
Then, once the objects have been selected by the user, zones encompassing the selected objects are positioned at the same position between frame t, appearing at time t, and frame t+dt, appearing at time t+dt, of the video shot to process, as illustrated in Fig. 2. The zones are rectangles in the provided example. Everything outside the zones is displayed with a blur on it; for example, it is made totally black. Optionally, a user may define the positioning of one or more selected traceable objects with the aid of the dedicated user interface.
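Keeping a zone at the same position between frame t and frame t+dt, even though the tracked face drifts, can be done by choosing a static rectangle that covers the tracked bounding box in every frame of the shot. The union-of-boxes rule and the sample tracker output below are illustrative assumptions.

```python
# Hedged sketch: derive one static zone from per-frame tracked boxes, so the
# displayed face never leaves the zone. The box values are invented data.

def fixed_zone(tracked_boxes):
    """Union of per-frame (x0, y0, x1, y1) boxes -> one static zone."""
    x0 = min(b[0] for b in tracked_boxes)
    y0 = min(b[1] for b in tracked_boxes)
    x1 = max(b[2] for b in tracked_boxes)
    y1 = max(b[3] for b in tracked_boxes)
    return (x0, y0, x1, y1)

# Face A drifts slightly between frame t and frame t+dt:
boxes_a = [(10, 10, 30, 30), (12, 11, 32, 31), (11, 12, 31, 32)]
print(fixed_zone(boxes_a))   # (10, 10, 32, 32) covers the face in all frames
```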
Advantageously, the selected traceable objects are resized. This makes it easier to follow the lips of a talking face and to avoid the overlap of two talking faces.
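The resizing-by-zooming step can be sketched as a nearest-neighbour enlargement of the pixels inside a zone. The 2x factor, the crop semantics and the tiny frame are illustrative assumptions, not details from the description.

```python
# Hypothetical zoom of a zone, so the lips of a tracked face appear larger.

def zoom_zone(frame, zone, factor):
    """Crop a (x0, y0, x1, y1) zone from a grayscale frame (list of rows)
    and enlarge it by an integer factor using nearest-neighbour sampling."""
    x0, y0, x1, y1 = zone
    crop = [row[x0:x1] for row in frame[y0:y1]]
    return [[crop[y // factor][x // factor]
             for x in range(len(crop[0]) * factor)]
            for y in range(len(crop) * factor)]

face = [[1, 2],
        [3, 4]]
print(zoom_zone(face, (0, 0, 2, 2), 2))
# each source pixel becomes a 2x2 block
```

A factor below one (implemented analogously by subsampling) would shrink a zone instead, which is how the overlap of two talking faces could be avoided.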
Claims
1. A method for processing a video shot comprising a set of traceable objects, the method comprising the steps of:
- selecting (101) one or more traceable objects of the set of traceable objects;
- positioning one or more zones respectively encompassing the one or more selected traceable objects such that the respective zones are kept at the same respective position (102) between subsequent frames of the video shot; and
- blurring content of the video shot outside the positioned zones.
2. The method according to claim 1, further comprising the step of inputting coordinates for defining the positioning of the one or more positioned zones.
3. The method according to claim 1 or 2, further comprising the step of resizing the one or more selected traceable objects.
4. The method according to one of claims 1 to 3, wherein the one or more positioned zones are rectangles.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP12305743.2 | 2012-06-26 | ||
EP12305743 | 2012-06-26 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2014001095A1 true WO2014001095A1 (en) | 2014-01-03 |
Family
ID=48607288
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP2013/062243 WO2014001095A1 (en) | 2012-06-26 | 2013-06-13 | Method for audiovisual content dubbing |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2014001095A1 (en) |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6297846B1 (en) * | 1996-05-30 | 2001-10-02 | Fujitsu Limited | Display control system for videoconference terminals |
US20050265603A1 (en) * | 2004-05-28 | 2005-12-01 | Porter Robert M S | Image processing |
US20080240563A1 (en) * | 2007-03-30 | 2008-10-02 | Casio Computer Co., Ltd. | Image pickup apparatus equipped with face-recognition function |
US20080259154A1 (en) * | 2007-04-20 | 2008-10-23 | General Instrument Corporation | Simulating Short Depth of Field to Maximize Privacy in Videotelephony |
JP2009288945A (en) * | 2008-05-28 | 2009-12-10 | Canon Inc | Image display unit and image display method |
FR2951605A1 (en) * | 2009-10-15 | 2011-04-22 | Thomson Licensing | METHOD FOR ADDING SOUND CONTENT TO VIDEO CONTENT AND DEVICE USING THE METHOD |
US20110216158A1 (en) * | 2010-03-05 | 2011-09-08 | Tessera Technologies Ireland Limited | Object Detection and Rendering for Wide Field of View (WFOV) Image Acquisition Systems |
US20120051658A1 (en) * | 2010-08-30 | 2012-03-01 | Xin Tong | Multi-image face-based image processing |
- 2013-06-13: WO PCT/EP2013/062243 patent/WO2014001095A1/en, active Application Filing
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109819313A (en) * | 2019-01-10 | 2019-05-28 | 腾讯科技(深圳)有限公司 | Method for processing video frequency, device and storage medium |
CN109819313B (en) * | 2019-01-10 | 2021-01-08 | 腾讯科技(深圳)有限公司 | Video processing method, device and storage medium |
CN111666831A (en) * | 2020-05-18 | 2020-09-15 | 武汉理工大学 | Decoupling representation learning-based speaking face video generation method |
CN111666831B (en) * | 2020-05-18 | 2023-06-20 | 武汉理工大学 | Method for generating face video of speaker based on decoupling expression learning |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR101527672B1 (en) | System and method for video caption re-overlaying for video adaptation and retargeting | |
EP2893700B1 (en) | Generating and rendering synthesized views with multiple video streams in telepresence video conference sessions | |
CN109074404B (en) | Method and apparatus for providing content navigation | |
US20090189912A1 (en) | Animation judder compensation | |
US20140240472A1 (en) | 3d subtitle process device and 3d subtitle process method | |
EP2404452A1 (en) | 3d video processing | |
US10021433B1 (en) | Video-production system with social-media features | |
US20200112712A1 (en) | Placement And Dynamic Rendering Of Caption Information In Virtual Reality Video | |
WO2013088688A1 (en) | Image processing device and image processing method | |
JP2011097470A (en) | Stereoscopic image reproduction apparatus, stereoscopic image reproduction method, and stereoscopic image reproduction system | |
JP2011216937A (en) | Stereoscopic image display device | |
US9111363B2 (en) | Video playback apparatus and video playback method | |
JP5599063B2 (en) | Display control apparatus, display control method, and program | |
US9426445B2 (en) | Image processing apparatus and image processing method and program using super-resolution and sharpening | |
WO2014001095A1 (en) | Method for audiovisual content dubbing | |
US20120008692A1 (en) | Image processing device and image processing method | |
US8169541B2 (en) | Method of converting frame rate of video signal | |
JP2011244328A (en) | Video reproduction apparatus and video reproduction apparatus control method | |
JP5066041B2 (en) | Image signal processing apparatus and image signal processing method | |
US20120127265A1 (en) | Apparatus and method for stereoscopic effect adjustment on video display | |
US10764655B2 (en) | Main and immersive video coordination system and method | |
KR20110018523A (en) | Apparatus and method for compensating of image in image display device | |
JP2012004653A (en) | Image processing system and its control method | |
CN110121032B (en) | Method, device and equipment for displaying animation special effect and storage medium | |
KR20100065318A (en) | Method and device for creating a modified video from an input video |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 13728213; Country of ref document: EP; Kind code of ref document: A1 |
| NENP | Non-entry into the national phase | Ref country code: DE |
| 122 | Ep: pct application non-entry in european phase | Ref document number: 13728213; Country of ref document: EP; Kind code of ref document: A1 |