WO2014147081A1 - Method for adjusting the sound level or loudness of an audio stream - Google Patents
Method for adjusting the sound level or loudness of an audio stream
- Publication number
- WO2014147081A1 (PCT/EP2014/055432)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- stream
- audio
- decoded
- sound level
- sound
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/233—Processing of audio elementary streams
- H04N21/2335—Processing of audio elementary streams involving reformatting operations of audio signals, e.g. by converting from one coding standard to another
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
- H04N21/23424—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving splicing one content stream with another content stream, e.g. for inserting or substituting an advertisement
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/81—Monomedia components thereof
- H04N21/812—Monomedia components thereof involving advertisement data
Definitions
- The invention relates to a method and a system for continuously evaluating the perceived sound level, or "loudness", of a nationally broadcast program, in order to modify the sound level of a local or advertising program when splicing from the national program to the advertisement.
- The invention concerns a mechanism for guaranteeing consistent loudness across different audio streams. In particular, the loudness must comply with legal requirements in the United States.
- Because the data streams are in a compressed format, splicing in the compressed domain does not usually allow the sound level of the advertisement to be modified on the fly as the data is broadcast.
- In the prior art, one proposed solution consists in adjusting the sound volume at the output of the stream switch. The adjustment is therefore performed a posteriori. This solution is unsuitable when, for example, the operator does not have the right to modify the national stream.
- Another approach known from the prior art applies loudness adaptation processing at the output of the splicing device, or "splicer".
- This method uses, as shown in the example of FIG. 1, an audio decoder 1, which decodes the compressed audio data, and a sound level analyzer 2 for the decoded audio data.
- The loudness level measured on the decoded audio data is then passed to an on-the-fly sound level adaptation module 3, comprising an automatic gain control device 4 that adapts the sound level of the decoded data samples.
- The data then passes through a module 5 and a peak limiter 6.
- The data samples are then encoded 7 for broadcast as frames conforming to a format, for example that of the digital compression system known to the person skilled in the art as Dolby Digital (DD) or Dolby Digital Plus (DD+).
- This method allows the audio stream to be adapted continuously to a given template, but it involves encoding the stream and often a loss of performance.
- The idea of the present invention rests on a new approach, which consists in particular in evaluating the audio loudness level at the heart of a splicing device, so that the device can continuously evaluate the loudness level of the national stream and modify the sound level of the advertisement on the fly during the splicing operation. Instead of correcting the sound level a posteriori, the invention proposes observing the sound level of the national stream F1 in order to adjust the sound level of a local stream F2, live or on the fly.
- The invention relates to a method for adapting the sound level of an audio stream during the step of splicing from at least a first stream F1 to a second stream F2, characterized in that it comprises at least the following steps:
- determining at least one audio parameter Nm, D representative of the first decoded stream F1d in order to form a sound template G to be applied to the second stream F2.
- The method determines the mean sound level Nm and the dynamic range D of the audio signal of the first decoded stream F1d. The method may also determine the peak value of the decoded audio stream F1d.
- The first stream F1 is decoded in order to determine the audio parameter or parameters representative of the first stream F1.
- The second stream F2 is, for example, a stream containing advertising, the second stream being common to several channels.
- The invention also relates to a system for adapting the sound level of an audio stream F2 from parameters measured on a first audio stream F1, characterized in that it comprises at least the following elements:
- a splicer adapted to manage the broadcasting of the first audio stream F1 and the second audio stream F2,
- a module adapted to determine at least one audio parameter Nm, D of the first decoded stream F1d and to build a sound template G.
- The module for determining the audio parameter of the first decoded stream F1d is, for example, adapted to measure the mean sound level Nm and the dynamic range D of the first audio stream F1, and the adaptation module comprises a compressor and a limiter.
- The module for determining the audio parameter of the first decoded audio stream F1d is, for example, adapted to measure the level of the peak values of the first decoded audio stream F1d.
- The system may include a module for decoding the first decoded audio stream F1d, arranged downstream of the module for determining an audio parameter.
- The system according to the invention is used, for example, to adapt the sound level of a stream containing advertising, the stream being common to several channels.
- FIG. 1 shows an example of a volume processing chain according to the prior art.
- FIG. 2 shows an example diagram of a system and a method according to the invention, whose function is in particular to adapt, live or on the fly, the sound level of a local stream F2 corresponding, for example, to an advertising stream common to several national channels.
- The national stream F1 and the local stream F2 are sent to a stream splicing module 20, or splicer, whose function is to manage when the national stream F1 and the local stream F2 are broadcast over time.
- The splicer 20 comprises a controller 21, which receives at least the two streams F1 and F2 and which knows the broadcast times of the various programs, stored in a table T, for example.
- A compressed-domain splicing algorithm known to the person skilled in the art, which switches from one program to another with frame accuracy, is also loaded into the controller 21 for execution.
- The national stream F1 is broadcast through a normal broadcast chain, along path I of FIG. 2. During this broadcast, the method continuously measures, along path II, the sound level of the national stream F1.
- The national stream F1 is decoded by a decoder 22, and the decoded stream F1d is sent to a module 23 adapted to determine a mean sound level value Nm over a given time interval, using, for example, an analysis algorithm known to the person skilled in the art.
- The algorithm used also determines the dynamic range D of the audio signal F1d. At least these two parameters, the mean sound level Nm and the dynamic range D, are stored in a sound template 24.
- The analysis algorithm can also record the peak values Np of the audio signal F1d, which are likewise stored in the sound template 24.
- The values Nm, D and, in some cases, Np are then used to adapt the sound level of the advertising stream F2 by means of a module 26 comprising a compressor 27 and an audio limiter 28.
- The dynamic range D is used to configure the audio compressor 27 so as to reproduce the same dynamic range in the stream F2; the mean sound level Nm is used to configure the automatic gain control algorithm and the output audio limiter 28; the peak values are used by the output limiter.
- One solution consists, for example, in continuously measuring the sound level of the stream F1 over a large number of compressed audio samples, using Dolby metadata such as the dialnorm dialogue normalization metadata or the DRC dynamic range, and MPEG-1 Layer 2 scale factors. These formats are known to the person skilled in the art and will not be detailed.
- With option 32 enabled, the sound level of the advertising file can be adapted.
- The local stream F2, path III, is decoded by a suitable decoder 25 known to the person skilled in the art, yielding the decoded stream F2d.
- The sound template G obtained by analyzing the stream F1 is sent to the module 26, which adjusts the audio level of the data samples F2d obtained by decoding the stream F2.
- The sound template G may include information from an ITU-R BS.1770 analysis: the loudness level in LUFS (Loudness Unit Full Scale), the loudness range expressed in LUFS, and the true-peak value in dB (decibels).
- The compressor 27 reduces the dynamic range of the stream F2d, taking into account the value of the dynamic range D of the sound template G.
- The limiter 28 in the module 26 adapts the maximum sound level of the audio stream F2d, taking into account the mean sound level value Nm of the template G.
- The level-adjusted stream F2 is then encoded in an encoding module and sent to a FIFO queue 30, to be broadcast in the order set by the programming schedule.
- The second audio stream F2 may also include alert messages or local information.
- The method and system according to the invention can be used in the context of mobile phones, at the level of the player, to broadcast an advertisement. It is also possible to apply this processing in the splicer of each phone, at the end subscriber.
- The system and method according to the invention propose performing the loudness processing not at each subscriber of a service but upstream, in the splicing device or splicer. This has the particular advantage of requiring no configuration by the user of the splicer or player.
- The system works regardless of the audio formats used.
- The method does not require unnecessary processing, such as sound level adjustment or re-encoding of the national stream. It offers better performance and preserves the audio quality of the national stream.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Business, Economics & Management (AREA)
- Marketing (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
The invention relates to a method and system for adjusting the sound level of an audio stream during the step for transitioning from at least one first stream F1 to a second stream F2, characterized in that it comprises at least the following steps: determining at least one audio parameter Nm, D representative of the first decoded stream F1d, in order to form a sound template G to be applied to the second decoded stream F2d at the moment the transition of first stream F1 to the second stream F2 is carried out; and applying the sound template G onto the decoded audio samples of the second stream F2d in order to adjust the sound level of the second decoded stream F2d and encode the samples of the audio signal of F2d before it is broadcast. The invention can be used for adjusting the loudness of an advertising stream.
Description
METHOD FOR ADAPTING THE SOUND LEVEL OR LOUDNESS OF AN AUDIO STREAM
The invention relates to a method and a system for continuously evaluating the perceived sound level, or "loudness", of a nationally broadcast program, in order to modify the sound level of a local or advertising program when splicing from the national program to the advertisement. The invention concerns a mechanism for guaranteeing consistent loudness across different audio streams. In particular, the loudness must comply with legal requirements in the United States.
When one wishes to splice locally from a national stream to an advertisement file common to several channels, for example, a problem arises: national streams do not necessarily all have the same sound level, which can cause jumps in sound level. Each channel has its own sound level, or loudness, and the local customer generally does not have the right to modify the audio level of the national stream while it is being broadcast.
Because the data streams are in a compressed format, splicing in the compressed domain does not usually allow the sound level of the advertisement to be modified on the fly as the data is broadcast.
In the prior art, one proposed solution consists in adjusting the sound volume at the output of the stream switch. The adjustment is therefore performed a posteriori. This solution is unsuitable when, for example, the operator does not have the right to modify the national stream.
It is also known to pre-process the audio of an advertisement file in order to adapt it to an ITU (International Telecommunication Union) standard level upstream of the stream splice. However, this method has the drawback that it cannot adapt the audio level of the advertisement file to the audio level of the national channel into which the advertisement is inserted.
Another approach known from the prior art applies loudness adaptation processing at the output of the splicing device, or "splicer". As shown in the example of FIG. 1, this method uses an audio decoder 1, which decodes the compressed audio data, and a sound level analyzer 2 for the decoded audio data. The loudness level measured on the decoded audio data is then passed to an on-the-fly sound level adaptation module 3, comprising an automatic gain control device 4 that adapts the sound level of the decoded data samples. The data then passes through a module 5 and a peak limiter 6. The data samples are then encoded 7 for broadcast as frames conforming to a format, for example that of the digital compression system known to the person skilled in the art as Dolby Digital (DD) or Dolby Digital Plus (DD+). This method allows the audio stream to be adapted continuously to a given template, but it involves encoding the stream and often a loss of performance.
The idea of the present invention rests on a new approach, which consists in particular in evaluating the audio loudness level at the heart of a splicing device, so that the device can continuously evaluate the loudness level of the national stream and modify the sound level of the advertisement on the fly during the splicing operation. Instead of correcting the sound level a posteriori, the invention proposes observing the sound level of the national stream F1 in order to adjust the sound level of a local stream F2, live or on the fly.
The invention relates to a method for adapting the sound level of an audio stream during the step of splicing from at least a first stream F1 to a second stream F2, characterized in that it comprises at least the following steps:
• determining at least one audio parameter Nm, D representative of the first decoded stream F1d, in order to form a sound template G to be applied to the second stream F2,
• at the instant when the splice from the first stream F1 to the second stream F2 is carried out, applying the sound template G to the decoded audio samples F2d of the second stream F2 in order to adapt the sound level of the second stream F2d, and encoding the samples of the audio signal of F2d before it is broadcast.
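The two steps above can be sketched as follows. This is only a minimal illustration under stated assumptions: the class and function names are hypothetical, samples are floats in [-1, 1], and only the mean level Nm is matched (as an unweighted RMS level in dBFS); the dynamic range D and the encoding step are left out for brevity.

```python
import math


def level_db(samples):
    """Mean level of a block in dBFS (unweighted RMS), floored at -120 dB."""
    if not samples:
        return -120.0
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    return 20.0 * math.log10(max(rms, 1e-6))


class SpliceLoudnessMatcher:
    """Hypothetical helper: tracks a template G on F1d, applies it to F2d."""

    def __init__(self):
        self.template = None  # G, reduced here to {"Nm": mean level in dBFS}

    def observe_f1d(self, block):
        # Step 1: keep measuring the first stream while it is on air.
        self.template = {"Nm": level_db(block)}

    def process_f2d(self, block):
        # Step 2: at the splice, scale F2d so that its mean level matches Nm.
        if self.template is None:
            return list(block)  # nothing measured yet: pass through
        gain_db = self.template["Nm"] - level_db(block)
        gain = 10.0 ** (gain_db / 20.0)
        return [s * gain for s in block]
```

In this sketch the national stream is never modified: it is only observed, and all gain changes are applied to the decoded samples of the second stream.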
To determine the audio parameter or parameters representative of the first stream F1d, the method determines the mean sound level Nm and the dynamic range D of the audio signal of the first decoded stream F1d. The method may also determine the peak value of the decoded audio stream F1d.
According to one implementation variant, the first stream F1 is decoded in order to determine the audio parameter or parameters representative of the first stream F1.
The second stream F2 is, for example, a stream containing advertising, the second stream being common to several channels.
The invention also relates to a system for adapting the sound level of an audio stream F2 from parameters measured on a first audio stream F1, characterized in that it comprises at least the following elements:
• a splicer adapted to manage the broadcasting of the first audio stream F1 and the second audio stream F2,
• a module adapted to determine at least one audio parameter Nm, D of the first decoded stream F1d and to build a sound template G,
• a module for decoding the second stream F2 and a module for adapting the sound level of the second decoded stream F2d, receiving the determined sound template G as an input parameter,
• a module for encoding the second decoded stream with its adapted sound level.
The module for determining the audio parameter of the first decoded stream F1d is, for example, adapted to measure the mean sound level Nm and the dynamic range D of the first audio stream F1, and the adaptation module comprises a compressor and a limiter.
The module for determining the audio parameter of the first decoded audio stream F1d is, for example, adapted to measure the level of the peak values of the first decoded audio stream F1d.
The system may include a module for decoding the first decoded audio stream F1d, arranged downstream of the module for determining an audio parameter.
The system according to the invention is used, for example, to adapt the sound level of a stream containing advertising, the stream being common to several channels.
Other features and advantages of the device according to the invention will become clearer on reading the following description of an exemplary embodiment, given by way of illustration and in no way limiting, together with the appended figures, which show:
• FIG. 1, an example of a volume processing chain according to the prior art,
• FIG. 2, an example of a diagram for the processing of audio signals according to the invention.
FIG. 2 shows an example diagram of a system and a method according to the invention, whose function is in particular to adapt, live or on the fly, the sound level of a local stream F2 corresponding, for example, to an advertising stream common to several national channels. The national stream F1 and the local stream F2 are sent to a stream splicing module 20, or splicer, whose function is to manage when the national stream F1 and the local stream F2 are broadcast over time. The splicer 20 comprises a controller 21, which receives at least the two streams F1 and F2 and which knows the broadcast times of the various programs, stored in a table T, for example. A compressed-domain splicing algorithm known to the person skilled in the art, which switches from one program to another with frame accuracy, is also loaded into the controller 21 for execution.
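As a rough illustration of how a controller might consult the table T, the sketch below looks up which stream is scheduled at a given time and rounds a splice point up to a whole frame. The table layout, the function names and the numeric values are assumptions made for the example; the source does not specify the format of T or the splicing algorithm.

```python
from bisect import bisect_right

# Hypothetical table T: (start time in seconds, stream on air from then on),
# sorted by start time.
TABLE_T = [(0.0, "F1"), (600.0, "F2"), (630.0, "F1")]


def stream_on_air(table, t):
    """Return the stream scheduled at time t according to the table."""
    starts = [start for start, _ in table]
    i = bisect_right(starts, t) - 1
    if i < 0:
        raise ValueError("time precedes the first table entry")
    return table[i][1]


def splice_frame_index(splice_sample, frame_size):
    """Index of the first whole frame starting at or after the splice sample.

    Integer arithmetic keeps the switch frame-accurate without rounding error.
    """
    return (splice_sample + frame_size - 1) // frame_size
```

For instance, with frames of 1536 samples (the Dolby Digital frame length) at 48 kHz, a splice at 600 s falls on sample 28,800,000, which is exactly the start of frame 18750.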
The national stream F1 is broadcast through a normal broadcast chain, along path I of FIG. 2. During this broadcast, the method continuously measures, along path II, the sound level of the national stream F1. To do so, the national stream F1 is decoded by a decoder 22, and the decoded stream F1d is sent to a module 23 adapted to determine a mean sound level value Nm over a given time interval, using, for example, an analysis algorithm known to the person skilled in the art. The algorithm used also determines the dynamic range D of the audio signal F1d. At least these two parameters, the mean sound level Nm and the dynamic range D, are stored in a sound template 24. The analysis algorithm can also record the peak values Np of the audio signal F1d, which are likewise stored in the sound template 24. The values Nm, D and, in some cases, Np are then used to adapt the sound level of the advertising stream F2 by means of a module 26 comprising a compressor 27 and an audio limiter 28. The dynamic range D is used to configure the audio compressor 27 so as to reproduce the same dynamic range in the stream F2; the mean sound level Nm is used to configure the automatic gain control algorithm and the output audio limiter 28; the peak values are used by the output limiter.
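The measurement performed by module 23 could look roughly like the sketch below. It is a simplification, not the analysis algorithm of the source: levels are unweighted RMS values in dBFS rather than a standardized loudness measure, D is taken as the spread between the loudest and quietest block, and the function name and the 4800-sample block size (100 ms at 48 kHz) are assumptions.

```python
import math


def db(x):
    """Convert a linear amplitude to dB, floored at -120 dB."""
    return 20.0 * math.log10(max(x, 1e-6))


def analyse_f1d(samples, block_size=4800):
    """Build a template G = {"Nm", "D", "Np"} from decoded samples F1d."""
    if not samples:
        raise ValueError("no samples to analyse")
    blocks = [samples[i:i + block_size]
              for i in range(0, len(samples), block_size)]
    # Short-term level of each block, used to estimate the dynamic range.
    levels = [db(math.sqrt(sum(s * s for s in b) / len(b))) for b in blocks]
    total_rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    return {
        "Nm": db(total_rms),               # mean level over the interval
        "D": max(levels) - min(levels),    # loudest minus quietest block, dB
        "Np": db(max(abs(s) for s in samples)),  # sample peak, dBFS
    }
```

A production system would more likely use a gated, frequency-weighted loudness measure and a true-peak estimate; the structure of the template would stay the same.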
Without departing from the scope of the invention, one solution is, for example, to measure the sound level of stream F1 continuously over a large number of compressed audio samples, using Dolby metadata, such as the dialogue normalization metadata dialnorm or the dynamic range control (DRC) data, and the scale factors in MPEG-1 Layer 2. These formats are known to those skilled in the art and are not detailed here.
When an operator wishes to insert an advertising program, for example, the operator performs a local switchover (opt-out) to the advertising file, stream F2. At the switchover, the method applies the determined sound template to the decoded audio signal F2d of the advertising stream F2. To use this feature, the operator activates a function 32 that authorizes modification of the sound in the advertising file F2. For example, the operator ticks a checkbox in the human-machine interface to activate this function.
With option 32 activated, the sound level of stream F2 can be adapted. To this end, the local stream F2, on path III, is decoded by a suitable decoder 25 known to those skilled in the art, yielding the decoded stream F2d. The sound template G obtained from the analysis of stream F1 is passed to the module 26, which adjusts the audio level of the data samples F2d obtained by decoding stream F2. The sound template G may include information from an analysis according to the ITU-R BS.1770 standard: the loudness level in LUFS (Loudness Unit Full Scale), the loudness range expressed in LUFS, and the true peak value in dB (decibels). The compressor 27 reduces the dynamic range of stream F2d according to the dynamics value D of the sound template G. The limiter 28 in module 26 adapts the maximum sound level of the audio stream F2d according to the mean sound level value Nm of the template G.
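The role of module 26 can be sketched as follows. This is an illustration under stated assumptions, not the method of the invention: `SoundTemplate`, `compress` and `adapt_level` are hypothetical names, the compressor is a static, memoryless curve with no attack or release smoothing, its ratio is derived from the measured dynamics of F2d divided by the template dynamics D, the gain stage matches the RMS level to Nm, and the limiter is a hard clamp at the template peak Np.

```python
import math
from dataclasses import dataclass


@dataclass
class SoundTemplate:
    """Hypothetical sound template G measured on the first stream."""
    mean_level_db: float  # Nm
    dynamics_db: float    # D
    peak_db: float        # Np


def rms_db(samples, floor=1e-9):
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    return 20.0 * math.log10(max(rms, floor))


def compress(samples, threshold_db, ratio):
    """Static compression curve: levels above the threshold are divided by ratio."""
    out = []
    for s in samples:
        mag = abs(s)
        if mag > 0.0:
            level = 20.0 * math.log10(mag)
            if level > threshold_db:
                level = threshold_db + (level - threshold_db) / ratio
                mag = 10.0 ** (level / 20.0)
        out.append(math.copysign(mag, s))
    return out


def adapt_level(samples, template, source_dynamics_db, threshold_db=-20.0):
    """Adapt decoded samples of F2d (floats in [-1, 1]) to the template G."""
    if not samples:
        return []
    # Compressor 27: never expand, only reduce dynamics towards D (assumed rule).
    ratio = max(1.0, source_dynamics_db / max(template.dynamics_db, 1e-6))
    compressed = compress(samples, threshold_db, ratio)
    # Gain stage: bring the mean level of F2d to Nm.
    gain = 10.0 ** ((template.mean_level_db - rms_db(compressed)) / 20.0)
    # Limiter 28: hard clamp at the template peak Np.
    ceiling = 10.0 ** (template.peak_db / 20.0)
    return [max(-ceiling, min(ceiling, s * gain)) for s in compressed]
```

A hard clamp is the simplest possible limiter and distorts on overshoot; a real limiter would use look-ahead and smoothed gain reduction. The sketch only shows how Nm, D and Np could each drive one stage.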
The volume-controlled stream F2 is then encoded in an encoding module 29 and passed to a FIFO queue 30, to be broadcast in the order set by the programming schedule.
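Path III as a whole (decoder 25, module 26, encoder 29, FIFO 30) can be sketched as a small orchestration function. All names below are hypothetical, the codec and adaptation steps are passed in as callables because the text does not fix them, and the behaviour when option 32 is off (the file is queued unmodified) is an assumption.

```python
from collections import deque


def prepare_local_stream(f2, template, decode, adapt, encode, fifo, allow_modification):
    """Queue the local stream F2 for broadcast, adapting its level if allowed.

    decode, adapt and encode are placeholders for decoder 25, module 26 and
    encoder 29; fifo stands for FIFO 30.
    """
    if allow_modification:  # option 32 ticked by the operator
        f2d = decode(f2)                 # F2 -> F2d
        f2 = encode(adapt(f2d, template))  # apply template G, then re-encode
    fifo.append(f2)  # items are later taken from the left, in arrival order
    return f2
```

For example, with stub callables:

```python
fifo = deque()
prepare_local_stream(
    b"\x04\x08", None,
    decode=list,
    adapt=lambda samples, tpl: [s // 2 for s in samples],
    encode=bytes,
    fifo=fifo,
    allow_modification=True,
)
next_item = fifo.popleft()  # the adapted, re-encoded stream
```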
The second audio stream F2 may also carry alert messages or local information.
The method and system according to the invention can be used in mobile phones, in the player that is to play an advertisement. The processing can also be applied in the splicer of each phone, at the end subscriber.
The system and method according to the invention perform the loudness processing not at each subscriber of a service but upstream, in the switchover device or splicer. One advantage is that the user of the splicer/player has no configuration to carry out. The system works regardless of the audio formats used. The method does not require unnecessary processing, such as sound level adjustment or re-encoding of the national stream. It offers better performance and preserves the audio quality of the national stream.
Claims
1 - Method for adapting the sound level of an audio stream during the step of switching over from at least a first stream F1 to a second stream F2, characterized in that it comprises at least the following steps:
• determining at least one audio parameter Nm, D representative of the first decoded stream F1d, in order to build a sound template G to be applied to the second decoded stream F2d,
• at the moment the switchover from the first stream F1 to the second stream F2 is performed, applying the sound template G to the decoded audio samples of the second stream F2d in order to adapt the sound level of the second decoded stream F2d, and encoding the samples of the audio signal of F2d before it is broadcast.
2 - Method according to claim 1, characterized in that at least the mean sound level Nm and the dynamics D of the audio signal of the first decoded stream F1d are determined.
3 - Method according to either of claims 1 and 2, characterized in that the peak value of the audio stream F1d is determined.
4 - Method according to one of claims 1 to 3, characterized in that the stream F2 is a stream containing advertising, the stream being common to several channels.
5 - System for adapting the sound level of an audio stream F2 from parameters measured on a first audio stream F1, characterized in that it comprises at least the following elements:
• a splicer adapted to manage the broadcasting of the first audio stream F1 and of the second audio stream F2,
• a module (23) adapted to determine at least one audio parameter Nm, D of the first decoded stream F1d and to build a sound template G (24),
• a decoding module (25) for the second stream F2 and an adaptation module (26) for the sound level of the second decoded stream F2d, receiving the determined sound template G as an input parameter,
• an encoding module (29) for the second decoded stream with adapted sound level.
6 - System according to claim 5, characterized in that the module for determining the audio parameter of the first stream F1 is adapted to measure the mean sound level Nm of the first decoded audio stream F1d and the dynamics D of the first audio stream F1, and in that the adaptation module (26) comprises a compressor (27) and a limiter (28).
7 - System according to claim 6, characterized in that the module for determining the audio parameter of the first decoded audio stream F1d is adapted to measure the level of the peak values of the first decoded audio stream.
8 - System according to claim 5, characterized in that it comprises a decoding module (22) for the first audio stream F1 arranged downstream of the module (23) for determining the audio parameter.
9 - Use of the system according to one of claims 5 to 7 for adapting the sound level of a stream containing advertising, the stream being common to several channels.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR1352390 | 2013-03-18 | ||
FR1352390A FR3003386A1 (en) | 2013-03-18 | 2013-03-18 | METHOD FOR ADAPTING THE SOUND OR LOUDNESS LEVEL OF AN AUDIO STREAM |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2014147081A1 (en) | 2014-09-25 |
Family
ID=48771617
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/EP2014/055432 WO2014147081A1 (en) | 2013-03-18 | 2014-03-18 | Method for adjusting the sound level or loudness of an audio stream |
Country Status (2)
Country | Link |
---|---|
FR (1) | FR3003386A1 (en) |
WO (1) | WO2014147081A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10389323B2 (en) | 2017-12-18 | 2019-08-20 | Tls Corp. | Context-aware loudness control |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5822018A (en) * | 1996-04-02 | 1998-10-13 | Farmer; James O. | Method and apparatus for normalizing signal levels in a signal processing system |
US20050078840A1 (en) * | 2003-08-25 | 2005-04-14 | Riedl Steven E. | Methods and systems for determining audio loudness levels in programming |
US20100109926A1 (en) * | 2008-10-31 | 2010-05-06 | At&T Intellectual Property I, L.P. | System and Method to Modify a Metadata Parameter |
US20100272290A1 (en) * | 2009-04-17 | 2010-10-28 | Carroll Timothy J | Loudness consistency at program boundaries |
- 2013-03-18: FR application FR1352390A, published as FR3003386A1 (status: not active, withdrawn)
- 2014-03-18: WO application PCT/EP2014/055432, published as WO2014147081A1 (status: active, application filing)
Also Published As
Publication number | Publication date |
---|---|
FR3003386A1 (en) | 2014-09-19 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | 121 | Ep: the EPO has been informed by WIPO that EP was designated in this application | Ref document number: 14713404; Country of ref document: EP; Kind code of ref document: A1 |
| | NENP | Non-entry into the national phase | Ref country code: DE |
| | 122 | Ep: PCT application non-entry in European phase | Ref document number: 14713404; Country of ref document: EP; Kind code of ref document: A1 |