WO2014147081A1

WO2014147081A1 - Method for adjusting the sound level or loudness of an audio stream

Info

Publication number: WO2014147081A1
Application number: PCT/EP2014/055432
Authority: WO
Inventors: Samuel Vermeulen
Original assignee: France Brevets
Priority date: 2013-03-18
Filing date: 2014-03-18
Publication date: 2014-09-25
Also published as: FR3003386A1

Abstract

The invention relates to a method and system for adjusting the sound level of an audio stream during the step of transitioning from at least one first stream F₁ to a second stream F₂, characterized in that it comprises at least the following steps: determining at least one audio parameter Nm, D representative of the first decoded stream F_1d, in order to form a sound template G to be applied to the second decoded stream F_2d; and, at the moment the transition from the first stream F₁ to the second stream F₂ is carried out, applying the sound template G to the decoded audio samples of the second stream F_2d in order to adjust the sound level of the second decoded stream F_2d, and encoding the samples of the audio signal of F_2d before it is broadcast. The invention can be used for adjusting the loudness of an advertising stream.

Description

METHOD FOR ADAPTING SOUND LEVEL OR LOUDNESS OF AN AUDIO STREAM

The invention relates to a method and a system for continuously evaluating the perceived sound level, or "loudness", of a nationally broadcast program, in order to change the sound level of a local or advertising program when splicing from the national program to the advertisement. The invention relates to a mechanism for ensuring consistent loudness across different audio streams. In particular, the sound volume must comply with legal requirements in the United States.

For example, when switching from a national stream to an advertising file common to several channels, a problem arises: the national streams do not necessarily all have the same sound level, which can lead to jumps in sound level. Each channel has its own loudness level, and the local customer is not allowed to change the audio level of the national stream while it is being broadcast.

Because the data streams are in a compressed format, splicing in the compressed domain does not usually make it possible to change the sound level of the advertisement on the fly, as the data is being broadcast.

In the prior art, one proposed solution consists in adjusting the sound volume at the output of the stream switch. The adjustment is therefore performed a posteriori. This solution is not suitable when, for example, the operator does not have the rights to modify the national stream.

It is also known to pre-process the audio of an advertisement file in order to adapt it to a standard ITU (International Telecommunication Union) level upstream of the splicing of the streams. This method, however, has the disadvantage of not being able to adapt the audio level of the advertising file to the audio level of the national channel into which the advertisement is inserted.

Another approach known from the prior art is to apply loudness adaptation processing at the output of the splicing device, or "splicer". As shown in the example of FIG. 1, this method uses an audio decoder 1, which decodes the compressed audio data, and a sound level analyzer 2 operating on the decoded audio data. The loudness level measured on the decoded audio data is then transmitted to an on-the-fly sound level adaptation module 3, comprising an automatic gain control device 4 which adapts the sound level of the samples of the decoded data. The data then passes through a module 5 and a peak limiter 6. The data samples are then encoded 7 to be broadcast in the form of frames conforming to a format, for example that of the digital compression system known to those skilled in the art under the name Dolby Digital (DD) or the Dolby Digital Plus (DD+) system. This method allows continuous adaptation of the audio stream to a given template, but involves re-encoding the stream and often a loss of performance.

The idea of the present invention is based on a new approach which consists in particular in evaluating the audio loudness level at the heart of a splicing device, so that the latter can continuously evaluate the loudness level of the national stream and change the sound level of the advertisement on the fly during the splicing operation. Instead of correcting the sound level a posteriori, the invention proposes to observe the sound level of the national stream F₁ in order to adjust the sound level of a local stream F₂, live or on the fly.

The invention relates to a method for adapting the sound level of an audio stream during the step of splicing from at least a first stream F₁ to a second stream F₂, characterized in that it comprises at least the following steps:

Determining at least one audio parameter Nm, D representative of the first decoded stream F_1d, in order to constitute a sound template G to be applied to the second stream F₂,

At the instant when the splicing from the first stream F₁ to the second stream F₂ is carried out, applying the sound template G to the decoded audio samples F_2d of the second stream F₂ in order to adapt the sound level of the second stream F_2d, and encoding the samples of the audio signal of F_2d before it is broadcast.

In order to determine the audio parameter or parameters representative of the first stream F_1d, the method determines the average sound level Nm and the dynamic range D of the audio signal of the first decoded stream F_1d. The method may also determine the peak value of the decoded audio stream F_1d.

According to an implementation variant, the first stream F₁ is decoded in order to determine the audio parameter or parameters representative of the first stream F₁.

The second stream F₂ is, for example, a stream containing advertising, the second stream being common to several channels.

The invention also relates to a system for adapting the sound level of an audio stream F₂ from parameters measured on a first audio stream F₁, characterized in that it comprises at least the following elements:

A splicer adapted to manage the broadcasting of the first audio stream F₁ and the second audio stream F₂,

A module adapted to determine at least one audio parameter Nm, D of the first decoded stream F_1d and to construct a sound template G,

A decoding module for the second stream F₂ and a sound level adaptation module for the second decoded stream F_2d, receiving as input parameter the determined sound template G,

An encoding module for the second decoded stream with adapted sound level.

The module for determining the audio parameter of the first decoded stream F_1d is, for example, adapted to measure the average sound level Nm and the dynamic range D of the first audio stream F₁, and the adaptation module comprises a compressor and a limiter.

The module for determining the audio parameter of the first decoded audio stream F_1d is, for example, adapted to measure the level of the peak values of the first decoded audio stream F_1d.

The system may include a module for decoding the first audio stream F₁, disposed downstream of the module for determining an audio parameter.

The system according to the invention is for example used to adapt the sound level of a stream containing advertising, the stream being common to several channels.

Other features and advantages of the device according to the invention will become more apparent on reading the following description of an exemplary embodiment, given by way of illustration and in no way limiting, with reference to the appended figures, which show:

• FIG. 1, an example of a volume processing chain according to the prior art,

• FIG. 2, an example of a diagram for the processing of audio signals according to the invention.

FIG. 2 shows an exemplary diagram of a system and a method according to the invention whose particular function is to adapt, live or on the fly, the sound level of a local stream F₂, which corresponds, for example, to an advertising stream common to several national channels. The national stream F₁ and the local stream F₂ are transmitted to a splicing module 20, or stream splicer, whose function is to manage the instants at which the national stream F₁ and the local stream F₂ are broadcast over time. The splicer 20 comprises a controller 21 which receives at least the two streams F₁, F₂ and which knows the broadcast times of the various programs, stored in a table T, for example. A splicing algorithm in the compressed domain, known to those skilled in the art, for switching from one program to another with frame accuracy, is also loaded into the controller 21 for execution.
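By way of illustration only, the lookup performed by the controller 21 in the table T can be sketched as follows. The layout of the table (a sorted list of start times paired with a stream label), the example times and the function name `active_stream` are assumptions made for this sketch; the description does not specify how the table T is organized.

```python
from bisect import bisect_right

# Hypothetical table T: (start time in seconds, stream to broadcast),
# sorted by start time. The values are illustrative only.
TABLE_T = [(0.0, "F1"), (600.0, "F2"), (630.0, "F1")]


def active_stream(table, t):
    """Return the label of the stream scheduled at time t according to table T."""
    starts = [start for start, _ in table]
    # Index of the last entry whose start time is <= t.
    index = bisect_right(starts, t) - 1
    if index < 0:
        raise ValueError("time precedes the first scheduled program")
    return table[index][1]
```

In this sketch, a change in the value returned by `active_stream` between two consecutive frames marks a splice instant, at which the splicing algorithm of the controller would be invoked.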

The national stream F₁ is broadcast through a normal broadcast channel, along path I of FIG. 2. During this broadcast, the method continuously performs, along path II, a sound level measurement of the national stream F₁. For this, the national stream F₁ is decoded by means of a decoder 22, then the decoded stream F_1d is transmitted to a module 23 adapted to determine a mean value Nm of the sound level over a given time interval, using, for example, an analysis algorithm known to those skilled in the art. The algorithm used also determines the dynamic range D of the audio signal F_1d. At least these two parameters, mean sound level Nm and dynamic range D, are stored in a sound template 24. The analysis algorithm can also record the peak values Np of the audio signal F_1d, which are then also stored in the sound template. These values Nm, D and, in some cases, Np are then used to adapt the sound level of the advertising stream F₂ by means of a module 26 comprising a compressor 27 and an audio limiter 28. The dynamic range D serves to configure the audio compressor 27 so as to reproduce the same dynamics on the stream F₂; the mean sound level Nm is used to configure the automatic gain control algorithm and the output of the audio limiter 28; the peak values are used for the output limiter.
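The continuous measurement along path II can be sketched as follows, as a simplified illustration rather than the analysis algorithm of the invention, which the description leaves to the skilled person. The sketch assumes decoded samples normalized to [-1.0, 1.0], takes Nm as the RMS level in dBFS over a sliding window, Np as the sample peak in dBFS, and D as the difference Np - Nm. These definitions, and the names `SoundTemplate` and `TemplateAnalyzer`, are assumptions made for the example.

```python
import math
from collections import deque
from dataclasses import dataclass


@dataclass
class SoundTemplate:
    nm: float        # mean sound level Nm, in dBFS (assumed: RMS level)
    d: float         # dynamic range D, in dB (assumed: peak minus mean)
    np_level: float  # peak level Np, in dBFS (assumed: sample peak)


class TemplateAnalyzer:
    """Hypothetical module 23: analyses the last `window` samples of F_1d."""

    def __init__(self, window):
        # The deque discards the oldest samples once `window` is reached.
        self.buffer = deque(maxlen=window)

    def push(self, samples):
        """Feed newly decoded samples of the national stream."""
        self.buffer.extend(samples)

    def template(self):
        """Build the sound template G from the current window."""
        if not self.buffer:
            raise ValueError("no samples analysed yet")
        floor = 1e-10  # avoids log10(0) on digital silence
        rms = math.sqrt(sum(s * s for s in self.buffer) / len(self.buffer))
        peak = max(abs(s) for s in self.buffer)
        nm = 20.0 * math.log10(max(rms, floor))
        np_level = 20.0 * math.log10(max(peak, floor))
        return SoundTemplate(nm=nm, d=np_level - nm, np_level=np_level)


# Usage: analyse ten cycles of a sine wave at half of full scale.
analyzer = TemplateAnalyzer(window=1000)
analyzer.push(0.5 * math.sin(2 * math.pi * k / 100) for k in range(1000))
g = analyzer.template()
```

Because the window has a fixed length, the template always reflects the most recent portion of the national stream, so it is available whenever a splice occurs. A perceptual loudness measure could replace the RMS level without changing this structure.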

Without departing from the scope of the invention, one solution consists, for example, in continuously measuring the sound level of the stream F₁ over a large number of compressed audio samples from Dolby metadata, such as the dialog normalization (dialnorm) metadata or the dynamic range control (DRC) metadata, and from MPEG-1 Layer 2 scale factors. These formats are known to those skilled in the art and will not be detailed. When an operator wishes to insert an advertising program, for example, the operator performs a local splice to the advertising file, stream F₂. At the splice, the method applies the determined sound template to the decoded audio signal F_2d of the advertising stream F₂. To benefit from this function, the operator activates an option 32 authorizing modification of the sound in the advertising file F₂. For example, the operator ticks a box in the human-machine interface to activate this option.
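The metadata-based variant mentioned above can be illustrated with the dialog normalization field. The sketch below assumes that the 5-bit dialnorm codes have already been extracted from successive frames by a parser that is not shown; it relies on the AC-3 convention that codes 1 to 31 denote a dialogue level of -1 to -31 dB relative to full scale, with the reserved code 0 read as -31 dB. Averaging the levels arithmetically in dB is a simplification chosen for this example, and the function names are hypothetical.

```python
def dialnorm_to_db(code):
    """Map a 5-bit dialnorm code to a dialogue level in dB relative to full scale."""
    if not 0 <= code <= 31:
        raise ValueError("dialnorm is a 5-bit field (0 to 31)")
    # Code 0 is reserved and is read here as -31 dB.
    return -31.0 if code == 0 else -float(code)


def mean_dialogue_level(codes):
    """Estimate a mean level from the dialnorm codes of successive frames."""
    if not codes:
        raise ValueError("at least one frame is required")
    levels = [dialnorm_to_db(code) for code in codes]
    return sum(levels) / len(levels)
```

Such an estimate avoids decoding the national stream, at the cost of trusting the metadata written by the upstream encoder.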

With option 32 activated, the sound level of the local stream F₂ can be adapted. For this, the local stream F₂, along path III, is decoded by a suitable decoder 25 known to those skilled in the art, yielding the decoded stream F_2d. The sound template G obtained by analyzing the stream F₁ is transmitted to the module 26 for adjusting the audio level of the data samples F_2d obtained by decoding the stream F₂. The sound template G can include information from an ITU-R BS.1770 analysis, the loudness level in LUFS, the loudness range in LU (Loudness Units), and the true-peak value in dB (decibels). The compressor 27 reduces the dynamic range of the stream F_2d by taking into account the value of the dynamic range D of the sound template G. The limiter 28 present in the module 26 adjusts the maximum sound level of the audio stream F_2d by taking into account the mean sound level Nm of the template G.
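The processing performed by the module 26 can be sketched as follows, under simplifying assumptions that are not taken from the description: the compressor is memoryless (no attack or release time) and acts only on excursions above the mean level of F_2d, the gain stage matches the RMS level to Nm, and the limiter is a hard clip at the peak level Np. Levels are in dBFS for samples normalized to [-1.0, 1.0], and the function names are hypothetical.

```python
import math


def to_db(x, floor=1e-10):
    """Convert a linear amplitude to dB, guarding against log10(0)."""
    return 20.0 * math.log10(max(x, floor))


def rms_db(samples):
    """RMS level of a block of samples, in dBFS."""
    return to_db(math.sqrt(sum(s * s for s in samples) / len(samples)))


def apply_template(samples, nm, d, np_level):
    """Adapt decoded samples of F_2d to a template G = (Nm, D, Np), in dB."""
    if not samples:
        return []
    # Compressor 27 (memoryless sketch): scale down excursions above the
    # mean level so that the peak-to-mean distance does not exceed D.
    mean_db = rms_db(samples)
    own_d = to_db(max(abs(s) for s in samples)) - mean_db
    ratio = max(1.0, own_d / d) if d > 0 else 1.0
    compressed = []
    for s in samples:
        level = to_db(abs(s))
        if level > mean_db:
            level = mean_db + (level - mean_db) / ratio
            s = math.copysign(10.0 ** (level / 20.0), s)
        compressed.append(s)
    # Gain stage: bring the mean level of the compressed signal to Nm.
    gain = 10.0 ** ((nm - rms_db(compressed)) / 20.0)
    # Limiter 28 (hard-clipping sketch): never exceed the peak level Np.
    ceiling = 10.0 ** (np_level / 20.0)
    return [max(-ceiling, min(ceiling, s * gain)) for s in compressed]


# Usage: a loud advertisement adapted to a quieter national program.
advert = [0.9 * math.sin(2 * math.pi * k / 100) for k in range(1000)]
adapted = apply_template(advert, nm=-23.0, d=6.0, np_level=-17.0)
```

A production compressor and limiter would use time constants and look-ahead to avoid audible distortion; the hard clip here only illustrates how Np bounds the output.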

The level-controlled stream F₂ is then encoded in an encoding module 29 and then transmitted to a FIFO queue 30 for broadcast in the order provided in the programming.

The second audio stream F₂ may also include alert messages or local information.

The method and system according to the invention can be used in the context of mobile phones, at the level of the player, to play an advertisement. It is also possible to apply this processing in the splicer of each phone, at the end subscriber. The system and the method according to the invention propose to perform loudness processing not at the level of each subscriber of a service, but upstream, in the splicing device or splicer. This has the particular advantage of requiring no configuration by the user of the splicer/player. The system works regardless of the audio formats used. The method does not require unnecessary processing such as sound level adjustment or re-encoding of the national stream. It offers better performance and preserves the audio quality of the national stream.

Claims

1 - Method for adapting the sound level of an audio stream during the step of splicing from at least a first stream F₁ to a second stream F₂, characterized in that it comprises at least the following steps:

Determining at least one audio parameter Nm, D representative of the first decoded stream F_1d, in order to constitute a sound template G to be applied to the second decoded stream F_2d,

At the instant when the splicing from the first stream F₁ to the second stream F₂ is carried out, applying the sound template G to the decoded audio samples of the second stream F_2d in order to adapt the sound level of the second decoded stream F_2d, and encoding the samples of the audio signal of F_2d before it is broadcast.

2 - Method according to claim 1, characterized in that at least the average sound level Nm and the dynamic range D of the audio signal of the first decoded stream F_1d are determined.

3 - Method according to one of claims 1 or 2, characterized in that the peak value of the decoded audio stream F_1d is determined.

4 - Method according to one of claims 1 to 3, characterized in that the stream F₂ is a stream containing advertising, the stream being common to several channels.

5 - System for adapting the sound level of an audio stream F₂ from parameters measured on a first audio stream F₁, characterized in that it comprises at least the following elements:

A splicer adapted to manage the broadcasting of the first audio stream F₁ and the second audio stream F₂,

A module (23) adapted to determine at least one audio parameter Nm, D of the first decoded stream F_1d and to construct a sound template G (24),

A decoding module (25) for the second stream F₂ and an adaptation module (26) for the sound level of the second decoded stream F_2d, receiving as input parameter the determined sound template G,

An encoding module (29) for the second decoded stream with adapted sound level.

6 - System according to claim 5, characterized in that the module for determining the audio parameter of the first stream F₁ is adapted to measure the average sound level Nm of the first decoded audio stream F_1d and the dynamic range D of the first audio stream F₁, and in that the adaptation module (26) comprises a compressor (27) and a limiter (28).

7 - System according to claim 6, characterized in that the module for determining the audio parameter of the first decoded audio stream F_1d is adapted to measure the level of the peak values of the first decoded audio stream.

8 - System according to claim 5, characterized in that it comprises a decoding module (22) for the first audio stream F₁, disposed downstream of the determination module (23) of the audio parameter.

9 - Use of the system according to one of claims 5 to 7 for adapting the sound level of a stream containing advertising, the stream being common to several channels.