EP1382038A2 - Dispositif et procede pour l'insertion d'un filigrane dans un signal audio - Google Patents

Dispositif et procede pour l'insertion d'un filigrane dans un signal audio

Info

Publication number
EP1382038A2
EP1382038A2 EP02740586A EP02740586A EP1382038A2 EP 1382038 A2 EP1382038 A2 EP 1382038A2 EP 02740586 A EP02740586 A EP 02740586A EP 02740586 A EP02740586 A EP 02740586A EP 1382038 A2 EP1382038 A2 EP 1382038A2
Authority
EP
European Patent Office
Prior art keywords
watermark
spectral
signal
audio signal
audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP02740586A
Other languages
German (de)
English (en)
Other versions
EP1382038B1 (fr
Inventor
Jürgen HERRE
Ralph Kulessa
Christian Neubauer
Frank Siebenhaar
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fraunhofer Gesellschaft zur Forderung der Angewandten Forschung eV
Original Assignee
Fraunhofer Gesellschaft zur Forderung der Angewandten Forschung eV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fraunhofer Gesellschaft zur Forderung der Angewandten Forschung eV filed Critical Fraunhofer Gesellschaft zur Forderung der Angewandten Forschung eV
Publication of EP1382038A2 publication Critical patent/EP1382038A2/fr
Application granted granted Critical
Publication of EP1382038B1 publication Critical patent/EP1382038B1/fr
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/018Audio watermarking, i.e. embedding inaudible data in the audio signal
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders

Definitions

  • the present invention relates to the field of audio coding and in particular to methods and devices for embedding a watermark in an audio signal.
  • Modern audio coding methods process discrete-time audio samples to provide a bit stream that is compressed relative to the original audio signal.
  • the stream of discrete-time audio samples is first windowed to generate successive blocks of windowed audio samples from the stream of audio samples. Further processing takes place in blocks.
  • a block of audio samples generated by windowing is typically converted into a spectral representation by means of an analysis filter bank.
  • the spectral representation comprises spectral values lying next to one another from frequency 0 to the maximum audio frequency, which can be, for example, 16 kHz.
  • the audio spectral values are grouped and quantized in scale factor bands. The quantization takes place in such a way that the quantization noise introduced by the quantization is dimensioned such that it is masked by the audio signal.
  • a psychoacoustic model which, on the basis of the audio signal, supplies an energy value for each scale factor band, which indicates the energy level up to which quantization noise is masked, ie it will not be audible in the decoded audio signal. If, on the other hand, the quantization noise introduced by the quantizer lies above the psychoacoustic masking threshold, the re-decoded audio signal will contain audible interference.
  • the quantizer levels of the quantizer are dependent on the masking threshold calculated. When the quantization levels are calculated, the audio spectral values are quantized using these quantization levels to obtain quantized audio spectral values. For data efficiency reasons, the quantized audio spectral values of an entropy coding, such as B.
  • Huffman coding to provide a bit stream with code words that represent the audio spectral values.
  • page information is added to the stream of code words, which includes, among other things, the scale factors on the basis of which an audio decoder can determine the quantization levels that have been used in the encoder.
  • the bit stream including side information is split up into code words on the one hand and side information on the other using a bit stream demultiplexer.
  • the entropy coding is first undone. Then the entropy decoded, values quantized audio spectral values, an inverse quantization that is subjected to inversely quantized spectral values ⁇ to get. These are then converted from the frequency domain to the time domain using a synthesis filter bank. The decoded audio signal is present at the output of the synthesis filter bank.
  • the psychoacoustic masking threshold of an audio signal section depends on the actual input audio signal. If the audio signal changes over time, the masking properties also change over time. For reasons of data efficiency, it is preferred to always introduce as much quantization noise into the audio signal as possible, ie the quantization noise should correspond as closely as possible to the psychoacoustic masking threshold. Audio signal sections with good masking properties can therefore be encoded with a relatively low bit cost, while on the other hand audio signal sections with relatively poor masking properties, such as. B. tonal audio signal sections, must be quantized very finely, which in turn means that a large number of bits must be used to encode these audio signal sections.
  • bit savings bank function is usually used.
  • the bit savings bank is filled when easy-to-encode audio sections are encoded such that bits that are not needed to encode these easy-to-encode sections are not simply "wasted” by more subtle than necessary quantization, but nevertheless a coarser quantization is used and the surplus bits are "inserted" into the bit savings bank.
  • bit savings bank function thus acts as a buffer
  • This method also has the advantage of high audio quality, since the quantization noise and the watermark noise can be matched to one another if the energy introduced into the audio signal by the watermark is below the psychoacoustic masking threshold lies.
  • the method is also characterized by a high level of robustness, since the watermark cannot be removed from the re-decoded audio signal, for example by an illegal distributor of the audio signal, without impairing the audio quality.
  • a disadvantage of the described method is the fact that the watermark may be quantized or weakened under certain circumstances by the quantization of the signal applied to the watermark. This is due to the fact that the energy of the watermark signal is sometimes in the range of the quantization interval. Furthermore, there is only limited control over the interference introduced by the watermark. B. can affect an audio quality loss.
  • Another watermarking method is to embed the watermark during the compression of the audio signal. This concept is described in the specialist publication "Combined Compression / Watermarking for Audio Signals", Frank Siebenhaar, Christian Neubauer and Jürgen Herre, 110th AES Convention, May 12-15, 2001, Amsterdam, Preprint 5344.
  • First, an uncompressed audio signal is fed to a psychoacoustic model in order to determine the masking threshold.
  • the audio signal is then transformed into the frequency domain.
  • the spread, spectrally represented watermark signal is weighted based on the masking threshold in the frequency domain and added to the spectrum of the input audio signal.
  • the parameters for the quantization are determined on the basis of the masking threshold, whereupon the watermarked signal is quantized and encoded.
  • This method is also characterized by a low level of computational complexity, since certain operations, such as e.g. B. the calculation of the masking model and the conversion of the audio signal into a spectral representation only have to be carried out once.
  • the method usually also provides one good audio quality, since quantization noise and watermark noise can be coordinated.
  • a disadvantage of this method is also the fact that the watermark may be quantized or weakened under certain circumstances by the quantization of the signal applied to the watermark. This is again due to the fact that the energy of the watermark signal is sometimes in the range of the quantization interval. Furthermore, there is only limited control over the interference introduced by the watermark. B. can affect an audio quality loss.
  • the spread watermark signal is also characterized by a large number of spectral lines. However, so that the watermark does not lead to audible interference in the decoded audio signal, the height of the watermark spectral lines is significantly less than the height of the audio signal spectral lines.
  • the combined spectrum is only slightly changed compared to the original spectrum. The subsequent quantization of the combined spectrum will always remove the watermark without replacement if the quantization step size is greater than the height of the watermark spectral lines that are quantized with this quantizer step size. If too many watermark spectral lines are "quantized away" by the subsequent quantization, the watermark detector can no longer extract a clear watermark.
  • the object of the present invention is to provide an improved concept for embedding a watermark in an audio signal which on the one hand provides good audio quality and on the other hand also ensures good watermark detectability.
  • This object is achieved by a method for embedding a watermark in an audio signal according to claim 1 or by a device for embedding a watermark in an audio signal according to claim 16.
  • the present invention is based on the finding that better watermark detectability is achieved if the fact that the audio signal including the watermark is subjected to quantization is taken into account in the watermark embedding.
  • a watermark will only be detectable if a spectral line, which represents the watermark and audio signal, falls through the watermark into a different quantization level than if no watermark is embedded. Only in this case will a watermark detector, which only receives quantized information, be able to detect a watermark.
  • the spectral representation of the watermark signal is therefore processed in such a way that it is ensured that the watermark signal processed by the processing step is designed in such a way that it will still be present after quantization.
  • a predetermined watermark starting value is selected, which depends on the spectral representation of the watermark signal.
  • the watermark must not lead to any or only a very slight interference in the audio signal.
  • a disturbance introduced by the predetermined watermark start value into the spectral representation of the audio signal is determined, but in which the conditions after a quantization of the spectral representation of the audio signal can be used. On the one hand, this makes it possible to see whether some of the watermark remains after quantization.
  • the watermark initial value error is greater than a predetermined Storungsschwelle
  • the watermark initial value is changed until the system introduced by a modified watermark initial value in the spectral representation after quantization error less than or equal to the Restaurant ⁇ voted fault threshold is.
  • the resulting changed watermark starting value is then combined with the audio signal in order to obtain the watermarked audio signal in which the watermark is embedded.
  • An advantage of the present invention is that ratios that ultimately do not correspond to the initial ratios are no longer taken into account, namely the audio signal / watermark ratios before the quantization, but that the watermark z. B. iteratively changed until a desired watermark "interference energy" is found.
  • the conditions after the quantizer are now taken into account, i. H. the ratios that are relevant for the audio signal decoder and for the watermark extractor.
  • the watermark energy has usually been set in the prior art in such a way that the watermark energy is less than or equal to the psychoacoustic masking threshold, the unpredictability of what happens to the watermark signal during quantization still remains.
  • the case could occur that the watermark is quantized away, which means that no watermark or only a very weak watermark could be extracted in the decoded signal.
  • the case could also arise that, although the watermark What has been important is that it is below the masking threshold, yet noise has been introduced that was audible in the decoded signal.
  • the method according to the invention has the advantage that for cases in which good detectability is particularly important, certain - tolerable - disturbances are deliberately introduced into the audio signal in favor of a higher watermark detectability, while in other cases in which the watermarks -Detectability does not have to be ensured at all times under all circumstances, compromises with regard to watermark detectability can be made in order to meet the highest audio quality requirements.
  • the watermark signal is added to the audio signal prior to quantization to obtain a combined signal.
  • the combined signal is then quantized and again inversely quantized and compared with the original audio signal. The comparison determines whether the interference introduced by the watermark is tolerable. If it is determined that the disturbance is not tolerable, the spectrum of the watermark signal is weighted iteratively using certain strategies in order then to carry out quantization and inverse quantization again until it is determined that the disturbance is now tolerable.
  • the watermark spectrum obtained by this processing is then added to the original audio spectrum.
  • the added or combined signal is then quantized, entropy-encoded and provided with side information to obtain an audio bit stream in which the watermark is present.
  • the original audio signal is quantized.
  • a quantized watermark is added to the audio signal to obtain the combined signal.
  • the combined signal is then no longer quantized again, as in the first exemplary embodiment, but is directly entropy-coded.
  • the "quantized" watermark signal added to the quantized audio signal is set in such a way that on the one hand the requirement for tolerable interference is met and on the other hand a desired watermark detectability is achieved.
  • FIG. 1 shows a block diagram of an inventive device for embedding a watermark in an audio signal
  • FIG. 2 shows a block diagram of a device according to the invention for introducing a watermark into an audio signal according to a first exemplary embodiment
  • FIG. 3 shows an inventive device for embedding a watermark in an audio signal according to a second exemplary embodiment; and 4a to 4d a schematic explanation of the line selection algorithm in the second exemplary embodiment of the present invention.
  • the device according to the invention shown in FIG. 1 comprises an audio input 10 and a watermark input 12. Both the audio signal at the audio input 10 and the watermark signal at the watermark input 12 are converted into a spectral representation by means of a device 14 or 16.
  • the spectral representation of the audio signal comprises audio spectral values, while the spectral representation of the watermark signal has watermark spectral values.
  • a device 18 for combining the audio spectral values are combined with changed watermark spectral values in order to obtain the combined audio signal in an output 20, in which the watermark is embedded.
  • a device 22 for processing the spectral representation of the watermark signal is provided depending on a psychoacoustic masking threshold supplied via an input 24.
  • the spectral representation of the watermark signal is processed depending on the psychoacoustic masking threshold obtained via the input 24 in order to obtain a processed watermark signal so that a disturbance introduced into the audio signal by the processed watermark signal is below a predetermined disturbance threshold which is different from that psychoacoustic masking threshold depends.
  • the device 22 for processing the spectral representation of the watermark signal comprises a device 26 for selecting a predetermined watermark starting value, which depends on the spectral representation of the watermark signal.
  • a disturbance introduced by the predetermined watermark start value into the spectral representation of the audio signal after quantization of the spectral representation of the audio signal is detected.
  • quantization information is supplied by a device 30 for supplying quantization information.
  • the device 30 provides quantization information which depends on the original audio signal, that is to say the audio signal without a watermark.
  • a device 32 examines whether the ascertained disturbance is greater than the predetermined disturbance threshold. If not, i. H. if the disturbance is acceptable, the watermark start value is fed directly to the device 18 for combining. If this is the case, however. H. if the introduced interference is too great or different than desired, a device 34 for changing the watermark start value is activated until the interference introduced by a changed watermark start value in the spectral representation of the audio signal after quantization is less than or equal to the predetermined one Interference threshold is.
  • the loop sketched in the processing device 22 may have to be iterated several times in order to at some point obtain a changed watermark starting value at the output of the device 32, which is used as a processed watermark signal and fed to the combining device 18 in order to to receive the audio signal at the output 20 in which the watermark is embedded.
  • the combination is carried out by means of an addition 18 before the quantization.
  • the device 28 for determining the initial value introduced into the audio signal by the block watermark weighting 26 is determined by first quantizing and inversely quantizing the combined signal in a quantizer / inverse quantizer device 28a.
  • the disturbance introduced by the watermark is then calculated in a device 28b, for example by forming the difference and squaring the difference values, and then in the device 32 with the psy- choacoustic masking threshold 24 compared. If the disturbance is too great, the device 34, which is labeled "weight control" in FIG. 2, is actuated in order to supply changed weighting factors to the block 26, in order then to change the weighted spectrum of watermarks in the device 18 with the original audio signal in to combine spectral representation and to go through the iteration loop again.
  • the watermark spectrum with a watermark spectrum weighted equally for all spectral lines as the initial watermark value.
  • the weighting factor for each spectral line is therefore equal to a constant for all spectral lines, which is chosen such that the watermark energy lies above the masking threshold. Then the watermark energy is gradually reduced in order to then "iterate" the energy of the watermark below the masking threshold.
  • the device 34 is designed to control the weighting factors in order to reduce all weighting factors, e.g. B. cut in half. If the disturbance is then still too large, all current weighting factors could be halved again in a next iteration step, etc. This can continue until the device 32 determines that the disturbance is now OK.
  • the spectrally represented watermark signal is thus spectrally weighted with the current weighting factors provided by block 34 by means of a weighting filter bank, which can be contained in block 26, as has been explained with reference to FIG. 2.
  • the resulting signal is added to the original audio signal.
  • the combined signal at the output of device 18 is quantized and inversely quantized and results in the signal present at the output of device 28a which is fed into device 28b in the same way as the original audio signal.
  • the device 28b now compares the original signal with the quantized and again inversely quantized signal and uses this to determine the quantization error signal which is supplied to the device 32.
  • the weighting control in block 34 is activated in order to determine new, better weighting factors.
  • the masking threshold determined by the masking model is available, which indicates how much interference in the signal is "allowed" at a certain point in the signal spectrum. If the weight control block 34 has determined optimal weighting factors with regard to the desired audio signal interference and the desired watermark detectability, ie watermark energy, the method terminates.
  • the quantized spectral values of the combination signal last determined by block 28a are then passed on to the bitstream multiplexer as a result, in order to be formed there together with the side information into an audio bitstream. 3 is discussed below in order to illustrate a device for embedding a watermark in an audio signal according to a second exemplary embodiment of the present invention. In contrast to the first exemplary embodiment shown in FIG.
  • this combination 18 in FIG. 3 takes place in the "quantization range", ie it becomes a quantized audio signal with a combined quantized watermark.
  • This can be achieved either by using a quantizer 42 to calculate the quantizer stages by quantizing the original audio signal, or by extracting the quantizer stages from a coded audio signal.
  • means 40a for calculating the quantized audio signal minus a predetermined number of n quantization levels and means 40b for calculating the quantized audio signal plus a predetermined number of n quantization levels are operated.
  • a so-called “maximum” watermark is first calculated as a predetermined watermark starting value by means of a device 36. To calculate the predetermined maximum watermark, only the signs of the watermark spectrum are used. If the watermark spectrum has a positive sign, the corresponding spectral value of the original quantized audio signal is increased by n quantization levels, n being an integer greater than or is 1.
  • the sign of a watermark spectral value is negative
  • the corresponding quantized spectral value ie the spectral value of the audio signal at the same frequency as the spectral value of the watermark signal whose sign is currently being viewed
  • n quantization levels are reduced by n quantization levels.
  • a device 38 that implements a line selection algorithm is provided.
  • the device 38 determines the interference introduced into the audio signal by the maximum watermark provided by the device 36. If the disturbance is greater than the predetermined disturbance threshold, the device 38 changes the "maximum" watermark by selecting individual lines until the disturbance introduced by the watermark is less than or equal to the predetermined disturbance threshold. If this condition is fulfilled, the watermark, which is already in quantized form, and the quantized original audio signal are fed to the adder 18 in order to obtain the quantized watermarked audio signal on the output side.
  • 4a shows, by way of example, a quantized audio signal which, because of the clarity of the illustration, only shows three spectral values 50a-50c.
  • an audio spectrum has e.g. B. 1024 spectral values.
  • the number of non-zero quantized spectral values depends on how many audio spectral values have been quantized to zero. Of course, the quantized audio spectral values have different heights in the real case.
  • 4b shows an audio spectrum with plus or minus n quantization levels (depending on the sign of the watermark spectral values).
  • the spectral component of the watermark corresponding to the audio spectral value 50a of FIG. 4a has a negative sign for the example shown in FIG. 4b.
  • the spectral component of the watermark, which corresponds to the audio spectral value 50b of FIG. 4a, has a positive sign in the example shown in FIG. 4b, while the third spectral component of the watermark again had a negative sign.
  • the amount of the watermark spectral components is initially irrelevant, since it is assumed that watermark detection is already possible if the quantized audio spectral values 50a-50c are changed by the watermark.
  • the maximum watermark which is determined by the device 36 from FIG. 3, is shown in FIG. 4c for the case shown in FIG. 4b. It has a spectrum that is characterized in that each quantized original audio spectral value is changed by a quantization level, either enlarged if the watermark has a positive sign, or decreased if the watermark had a negative sign.
  • the amount of a watermark spectral line could be taken into account in such a way that not only is incremented or decremented by one quantization level, but that is incremented or decremented by several quantization levels if the amount of Watermark spectral line is correspondingly large.
  • the function of the device 38 from FIG. 3 will now be described with reference to FIG. 4d.
  • the facility 38 If the left quantized audio spectral component 50a is reduced by a quantization level, as is represented by the spectral component 50a ', the situation is such that the situation is such that the interference introduced by the watermark is too large, as is represented by the spectral component 50a', this becomes the spectral component not selected by the device 38, which is so noticeable in the changed watermark spectral values after the line selection that the changed watermark has a spectral line of 0 at this point.
  • the quantized audio spectral values are now based on the watermark signal, ie using the sign of the watermark signal by z. B. plus or minus changed a quantization level.
  • This procedure has advantages in that computing time can be saved since the quantization and inverse quantization (device 28a of FIG. 2) and the weighting of the watermark (device 26 of FIG. 2) can be omitted without replacement.
  • the maximum watermark (FIG. 4c) is determined line by line on the basis of the audio spectra already precalculated, ie the original spectrum and the original spectrum minus n quantization levels or the original spectrum plus n quantization levels. This results as the difference between the original spectrum (FIG. 4a) and the audio spectrum changed by a number of n quantization levels (FIG. 4b), the difference having the same sign as the unweighted watermark.
  • the line selection algorithm which is executed in the device 38, takes into account the amount of the unweighted watermark spectral lines, the masking threshold 24 and possibly a bit savings bank function 44 of the audio encoder.
  • the lines of the maximum watermark in such a way that the watermark spreading band signal is embedded in a broadband manner, i. H. that as many lines of the quantized audio signal as possible are changed.
  • the structure of the watermark should be changed as little as possible within a frequency band.
  • bit savings bank function which can provide additional bits to later signal blocks, as has been carried out.
  • the line selection strategy is preferably adapted to the fill level of the bit savings bank, so as to allow, for example, when the bit savings bank is filled, that even quantized audio spectral values of the original audio signal that have the value 0 are watermarked, which would normally not be permitted due to the bit requirement , This can noticeably improve watermark detection.
  • the original values are also available after the transformation into the frequency range.
  • the quantization of the original audio spectral values can also be seen as a kind of watermark embedding, since a certain disturbance of the audio spectrum results in both the quantization and the addition of a watermark signal. Due to its random nature, the disturbance introduced by the quantization should not be regarded as a watermark. However, if the introduced disturbance is correct with the watermark due to the quantization, the quantization noise supports the detectability of the watermark. The following cases result from this.
  • the watermark is introduced with the correct sign.
  • the device 38 of FIG. 3 is preferably arranged such that, due to the fact that the quantization is already in phase with the watermark spectral value for one certain frequency a disturbance has been introduced, arranged to dispense with the introduction of another watermark disturbance.
  • a quantization level could be added to further improve the detectability.
  • the psychoacoustic masking threshold is not calculated line by line, but by scale factor band. This means that energies of individual spectral lines are not considered, but the total energies z. B. 20 spectral lines in a scale factor band. However, in a scale factor band in which many watermark spectral lines are tolerable, a few lines in the sense of good audio quality can be dispensed with without the watermark detectability suffering significantly.
  • This functionality can also be achieved in the exemplary embodiment shown in FIG. 2 in that the weighting control 34 of FIG.
  • the concept according to the invention is such that a spectrally represented watermark signal is first generated. This is weighted using weighting factors. The weighted signal is added to the original audio signal, which is available in a spectral representation. Alternatively, a change in the lines of the original audio signal, which is available in a spectral representation, is carried out on the basis of the watermark signal. The disturbance introduced after the quantization is then determined, the disturbance being determined by quantizing, inverse quantizing and forming the difference from the original, or the disturbance being pre-calculated.
  • new weighting factors are determined, using the masking threshold, using a line selection strategy, or using a line selection strategy in particular in such a way that the sign and amount of the spectral lines of the unweighted watermark are used, and that the sum of the watermark line and the original spectral line is such it is determined that this new spectral line falls within a different quantization interval than the original spectral line.
  • the concept according to the invention is advantageous in that it can be used both for bitstream watermarking methods and for methods which carry out audio coding and watermark embedding in one step. Another advantage of the concept according to the invention is that full control over the introduced disturbance can be achieved. This makes it possible to specifically set the method in favor of optimal watermark detection or optimal audio quality.
  • Another advantage of the concept according to the invention is full control over the frequency distribution of the watermark spreading band signal into the audio signal.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Signal Processing For Digital Recording And Reproducing (AREA)
  • Reverberation, Karaoke And Other Acoustics (AREA)
EP02740586A 2001-06-18 2002-05-10 Dispositif et procede pour l'insertion d'un filigrane dans un signal audio Expired - Lifetime EP1382038B1 (fr)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
DE10129239A DE10129239C1 (de) 2001-06-18 2001-06-18 Vorrichtung und Verfahren zum Einbetten eines Wasserzeichens in ein Audiosignal
DE10129239 2001-06-18
PCT/EP2002/005173 WO2002103695A2 (fr) 2001-06-18 2002-05-10 Dispositif et procede pour l'insertion d'un filigrane dans un signal audio

Publications (2)

Publication Number Publication Date
EP1382038A2 true EP1382038A2 (fr) 2004-01-21
EP1382038B1 EP1382038B1 (fr) 2005-06-29

Family

ID=7688519

Family Applications (1)

Application Number Title Priority Date Filing Date
EP02740586A Expired - Lifetime EP1382038B1 (fr) 2001-06-18 2002-05-10 Dispositif et procede pour l'insertion d'un filigrane dans un signal audio

Country Status (5)

Country Link
US (1) US7346514B2 (fr)
EP (1) EP1382038B1 (fr)
AT (1) ATE298921T1 (fr)
DE (2) DE10129239C1 (fr)
WO (1) WO2002103695A2 (fr)

Families Citing this family (62)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030133592A1 (en) 1996-05-07 2003-07-17 Rhoads Geoffrey B. Content objects with computer instructions steganographically encoded therein, and associated methods
US6738495B2 (en) * 1995-05-08 2004-05-18 Digimarc Corporation Watermarking enhanced to withstand anticipated corruptions
US8306811B2 (en) * 1996-08-30 2012-11-06 Digimarc Corporation Embedding data in audio and detecting embedded data in audio
US7602940B2 (en) 1998-04-16 2009-10-13 Digimarc Corporation Steganographic data hiding using a device clock
US7644282B2 (en) 1998-05-28 2010-01-05 Verance Corporation Pre-processed information embedding system
DE19947877C2 (de) * 1999-10-05 2001-09-13 Fraunhofer Ges Forschung Verfahren und Vorrichtung zum Einbringen von Informationen in einen Datenstrom sowie Verfahren und Vorrichtung zum Codieren eines Audiosignals
US6737957B1 (en) 2000-02-16 2004-05-18 Verance Corporation Remote control signaling using audio watermarks
US7239981B2 (en) 2002-07-26 2007-07-03 Arbitron Inc. Systems and methods for gathering audience measurement data
US9711153B2 (en) 2002-09-27 2017-07-18 The Nielsen Company (Us), Llc Activating functions in processing devices using encoded audio and detecting audio signatures
US8959016B2 (en) 2002-09-27 2015-02-17 The Nielsen Company (Us), Llc Activating functions in processing devices using start codes embedded in audio
CA2499967A1 (fr) 2002-10-15 2004-04-29 Verance Corporation Systeme de suivi de media, de gestion et d'information
US7352878B2 (en) * 2003-04-15 2008-04-01 Digimarc Corporation Human perceptual model applied to rendering of watermarked signals
DE10321983A1 (de) * 2003-05-15 2004-12-09 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und Verfahren zum Einbetten einer binären Nutzinformation in ein Trägersignal
JP2005084625A (ja) * 2003-09-11 2005-03-31 Music Gate Inc 電子透かし合成方法及びプログラム
US7616776B2 (en) * 2005-04-26 2009-11-10 Verance Corproation Methods and apparatus for enhancing the robustness of watermark extraction from digital host content
US7369677B2 (en) 2005-04-26 2008-05-06 Verance Corporation System reactions to the detection of embedded watermarks in a digital host content
US9055239B2 (en) * 2003-10-08 2015-06-09 Verance Corporation Signal continuity assessment using embedded watermarks
US20060239501A1 (en) 2005-04-26 2006-10-26 Verance Corporation Security enhancements of digital watermarks for multi-media content
DE102004009955B3 (de) * 2004-03-01 2005-08-11 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und Verfahren zum Ermitteln einer Quantisierer-Schrittweite
DE102004021403A1 (de) 2004-04-30 2005-11-24 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Informationssignalverarbeitung durch Modifikation in der Spektral-/Modulationsspektralbereichsdarstellung
DE102004021404B4 (de) 2004-04-30 2007-05-10 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Wasserzeicheneinbettung
US8020004B2 (en) 2005-07-01 2011-09-13 Verance Corporation Forensic marking using a common customization function
US8781967B2 (en) 2005-07-07 2014-07-15 Verance Corporation Watermarking in an encrypted domain
US7809018B2 (en) * 2005-12-16 2010-10-05 Coding Technologies Ab Apparatus for generating and interpreting a data stream with segments having specified entry points
DE602006006346D1 (de) * 2005-12-16 2009-05-28 Dolby Sweden Ab Vorrichtung zum erzeugen und interpretieren eines datenstroms mit einer reihe von segmenten unter verwendung von daten in nachfolgenden datenrahmen
ATE532350T1 (de) 2006-03-24 2011-11-15 Dolby Sweden Ab Erzeugung räumlicher heruntermischungen aus parametrischen darstellungen mehrkanaliger signale
US8032371B2 (en) * 2006-07-28 2011-10-04 Apple Inc. Determining scale factor values in encoding audio data with AAC
US8010370B2 (en) * 2006-07-28 2011-08-30 Apple Inc. Bitrate control for perceptual coding
CA2682926C (fr) * 2006-10-18 2015-10-13 Destiny Software Productions Inc. Filigranage de donnees de media
CA2705549C (fr) 2007-11-12 2015-12-01 The Nielsen Company (Us), Llc Procedes et dispositifs pour effectuer le tatouage audio et la detection et l'extraction de tatouage
US8457951B2 (en) * 2008-01-29 2013-06-04 The Nielsen Company (Us), Llc Methods and apparatus for performing variable black length watermarking of media
US8259938B2 (en) 2008-06-24 2012-09-04 Verance Corporation Efficient and secure forensic marking in compressed
US8359205B2 (en) * 2008-10-24 2013-01-22 The Nielsen Company (Us), Llc Methods and apparatus to perform audio watermarking and watermark detection and extraction
US9667365B2 (en) 2008-10-24 2017-05-30 The Nielsen Company (Us), Llc Methods and apparatus to perform audio watermarking and watermark detection and extraction
JP2012525655A (ja) 2009-05-01 2012-10-22 ザ ニールセン カンパニー (ユー エス) エルエルシー 一次ブロードキャストメディアコンテンツに関連する二次コンテンツを提供するための方法、機器、及び製造品
US8838977B2 (en) 2010-09-16 2014-09-16 Verance Corporation Watermark extraction and content screening in a networked environment
EP2544179A1 (fr) * 2011-07-08 2013-01-09 Thomson Licensing Procédé et appareil pour quantifier la modulation d'index pour tatouer un signal d'entrée
US8923548B2 (en) 2011-11-03 2014-12-30 Verance Corporation Extraction of embedded watermarks from a host content using a plurality of tentative watermarks
US8533481B2 (en) 2011-11-03 2013-09-10 Verance Corporation Extraction of embedded watermarks from a host content based on extrapolation techniques
US8682026B2 (en) 2011-11-03 2014-03-25 Verance Corporation Efficient extraction of embedded watermarks in the presence of host content distortions
US8615104B2 (en) 2011-11-03 2013-12-24 Verance Corporation Watermark extraction based on tentative watermarks
US8745403B2 (en) 2011-11-23 2014-06-03 Verance Corporation Enhanced content management based on watermark extraction records
US9323902B2 (en) 2011-12-13 2016-04-26 Verance Corporation Conditional access using embedded watermarks
US9547753B2 (en) 2011-12-13 2017-01-17 Verance Corporation Coordinated watermarking
US9571606B2 (en) 2012-08-31 2017-02-14 Verance Corporation Social media viewing system
US8726304B2 (en) 2012-09-13 2014-05-13 Verance Corporation Time varying evaluation of multimedia content
US8869222B2 (en) 2012-09-13 2014-10-21 Verance Corporation Second screen content
US20140075469A1 (en) 2012-09-13 2014-03-13 Verance Corporation Content distribution including advertisements
US9401153B2 (en) 2012-10-15 2016-07-26 Digimarc Corporation Multi-mode audio recognition and auxiliary data encoding and decoding
US9305559B2 (en) 2012-10-15 2016-04-05 Digimarc Corporation Audio watermark encoding with reversing polarity and pairwise embedding
US9225310B1 (en) * 2012-11-08 2015-12-29 iZotope, Inc. Audio limiter system and method
US9262793B2 (en) 2013-03-14 2016-02-16 Verance Corporation Transactional video marking system
US9251549B2 (en) 2013-07-23 2016-02-02 Verance Corporation Watermark extractor enhancements based on payload ranking
US9711152B2 (en) 2013-07-31 2017-07-18 The Nielsen Company (Us), Llc Systems apparatus and methods for encoding/decoding persistent universal media codes to encoded audio
US20150039321A1 (en) 2013-07-31 2015-02-05 Arbitron Inc. Apparatus, System and Method for Reading Codes From Digital Audio on a Processing Device
US9208334B2 (en) 2013-10-25 2015-12-08 Verance Corporation Content management using multiple abstraction layers
EP2881941A1 (fr) * 2013-12-09 2015-06-10 Thomson Licensing Procédé et appareil pour filigranage d'un signal audio
CN106170988A (zh) 2014-03-13 2016-11-30 凡瑞斯公司 使用嵌入式代码的交互式内容获取
CN105244033B (zh) * 2014-07-09 2019-07-16 意法半导体亚太私人有限公司 用于数字水印的系统和方法
US10043527B1 (en) 2015-07-17 2018-08-07 Digimarc Corporation Human auditory system modeling with masking energy adaptation
KR102021739B1 (ko) * 2018-06-04 2019-11-05 채령 양자암호키로 생성한 암호화 제품정보 데이터와 해시함수 2차 행렬 (x,y) 좌표 값으로 마킹하는 복제 위변조 방지 양자암호마킹기 및 양자암호마킹 물품관리 시스템
CN113035213B (zh) * 2020-12-24 2022-07-22 中国电影科学技术研究所 数字音频水印检测方法及装置

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5319735A (en) * 1991-12-17 1994-06-07 Bolt Beranek And Newman Inc. Embedded signalling
EP0753226B1 (fr) * 1994-03-31 2008-07-30 THE ARBITRON COMPANY, a division of CERIDIAN CORPORATION Dispositifs et methodes d'incorporation et de decodage de codes dans des signaux audio
US5822360A (en) * 1995-09-06 1998-10-13 Solana Technology Development Corporation Method and apparatus for transporting auxiliary data in audio signals
WO1997033391A1 (fr) * 1996-03-07 1997-09-12 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Procede de codage pour introduire un signal de donnees non audible dans un signal audio, procede de decodage, codeur et decodeur correspondants
US6061793A (en) * 1996-08-30 2000-05-09 Regents Of The University Of Minnesota Method and apparatus for embedding data, including watermarks, in human perceptible sounds
JP4226687B2 (ja) 1998-05-01 2009-02-18 ユナイテッド・モジュール・コーポレーション 電子透かし埋め込み装置及びオーディオ符号化装置
JP2000209097A (ja) 1999-01-14 2000-07-28 Sony Corp 信号処理装置とその方法、信号記録装置、信号再生装置および記録媒体
DE19906512C2 (de) * 1999-02-17 2001-01-25 Frank Kurth Verfahren zum unbemerkten Übertragen und/oder Speichern von Zusatzinformationen innerhalb eines Signals, insbesondere Audiosignals
DE19938095A1 (de) 1999-08-12 2001-03-01 Fraunhofer Ges Forschung Verfahren und Vorrichtung zum Einbringen von Informationen in ein Audiosignal und Verfahren und Vorrichtung zum Ermitteln von in einem Audiosignal eingebrachten Informationen
DE19947877C2 (de) * 1999-10-05 2001-09-13 Fraunhofer Ges Forschung Verfahren und Vorrichtung zum Einbringen von Informationen in einen Datenstrom sowie Verfahren und Vorrichtung zum Codieren eines Audiosignals
AU2001264351A1 (en) 2000-06-08 2002-01-02 Markany Inc. Watermark embedding and extracting method for protecting digital audio contents copyright and preventing duplication and apparatus using thereof
KR100898879B1 (ko) * 2000-08-16 2009-05-25 돌비 레버러토리즈 라이쎈싱 코오포레이션 부수 정보에 응답하여 하나 또는 그 이상의 파라메터를변조하는 오디오 또는 비디오 지각 코딩 시스템

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO02103695A2 *

Also Published As

Publication number Publication date
WO2002103695A3 (fr) 2003-05-22
US20040184369A1 (en) 2004-09-23
ATE298921T1 (de) 2005-07-15
US7346514B2 (en) 2008-03-18
WO2002103695A2 (fr) 2002-12-27
DE50203511D1 (de) 2005-08-04
EP1382038B1 (fr) 2005-06-29
DE10129239C1 (de) 2002-10-31

Similar Documents

Publication Publication Date Title
EP1382038B1 (fr) Dispositif et procede pour l'insertion d'un filigrane dans un signal audio
DE60214027T2 (de) Kodiervorrichtung und dekodiervorrichtung
DE602004005197T2 (de) Vorrichtung und verfahren zum kodieren eines audiosignals und vorrichtung und verfahren zum dekodieren eines kodierten audiosignals
EP0954909B1 (fr) Procede de codage d'un signal audio
DE60313332T2 (de) Audiocodierung mit niedriger bitrate
DE19921122C1 (de) Verfahren und Vorrichtung zum Verschleiern eines Fehlers in einem codierten Audiosignal und Verfahren und Vorrichtung zum Decodieren eines codierten Audiosignals
DE60209888T2 (de) Kodieren eines audiosignals
EP1687810B1 (fr) Dispositif et procede pour determiner un pas de quantification
EP0931386B1 (fr) Procede de signalisation d'une substitution de bruit lors du codage d'un signal audio
EP1112621B1 (fr) Dispositif et procede pour effectuer un codage entropique de mots d'information et dispositif et procede pour decoder des mots d'information ayant subi un codage entropique
EP0910928A1 (fr) Codage et decodage de signaux audio au moyen d'un procede stereo en intensite et de prediction
EP0910927A1 (fr) Procede de codage et de decodage de valeurs spectrales stereophoniques
EP1212857B1 (fr) Procede et dispositif pour l'introduction d'informations dans un flux de donnees, ainsi que procede et dispositif pour le codage d'un signal audio
WO2005083678A1 (fr) Dispositif et procede pour traiter un signal multicanal
WO2002063611A1 (fr) Procede et dispositif pour produire ou decoder un flux de donnees modulable compte tenu d'une banque de bits, codeur et codeur modulable
DE10200653A1 (de) Skalierbarer Codierer und Decodierer für einen skalierten Datenstrom
WO2001043503A2 (fr) Procede et dispositif pour traiter un signal audio stereo
EP1016319A1 (fr) Procede et dispositif pour coder un signal stereo temporellement discret
EP1023777B1 (fr) Procede et dispositif pour limiter un courant de donnees audio dont le debit binaire peut etre mis a l'echelle
DE602004009926T2 (de) Vorrichtung und verfahren zum einbetten eines wasserzeichens unter verwendung von subbandfilterung
WO2002058054A1 (fr) Procede et dispositif pour produire un flux de donnees modulable et procede et dispositif pour decoder un flux de donnees modulable
DE69823458T2 (de) Mehrband-Sprachdecoder
DE69931932T2 (de) Verfahren und Vorrichtung zum Dekodieren und Einfügen eines Wasserzeichens in einen Datenstrom
EP1277346B1 (fr) Dispositif et procédé pour l'analyse de la représentation spectrale d'un signal temporel décodé
DE19742201C1 (de) Verfahren und Vorrichtung zum Codieren von Audiosignalen

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20031118

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

AX Request for extension of the european patent

Extension state: AL LT LV MK RO SI

RIN1 Information on inventor provided before grant (corrected)

Inventor name: SIEBENHAAR, FRANK

Inventor name: KULESSA, RALPH

Inventor name: HERRE, JUERGEN

Inventor name: NEUBAUER, CHRISTIAN

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT;WARNING: LAPSES OF ITALIAN PATENTS WITH EFFECTIVE DATE BEFORE 2007 MAY HAVE OCCURRED AT ANY TIME BEFORE 2007. THE CORRECT EFFECTIVE DATE MAY BE DIFFERENT FROM THE ONE RECORDED.

Effective date: 20050629

Ref country code: IE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20050629

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20050629

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

Free format text: NOT ENGLISH

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

REF Corresponds to:

Ref document number: 50203511

Country of ref document: DE

Date of ref document: 20050804

Kind code of ref document: P

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

Free format text: LANGUAGE OF EP DOCUMENT: GERMAN

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20050929

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20050929

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: ES

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20051010

GBT Gb: translation of ep patent filed (gb section 77(6)(a)/1977)

Effective date: 20050919

REG Reference to a national code

Ref country code: SE

Ref legal event code: TRGR

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: PT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20051207

REG Reference to a national code

Ref country code: IE

Ref legal event code: FD4D

ET Fr: translation filed
PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MC

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20060531

26N No opposition filed

Effective date: 20060330

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: TR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20050629

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20060510

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: CY

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20050629

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 15

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 16

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 17

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: NL

Payment date: 20210519

Year of fee payment: 20

Ref country code: FR

Payment date: 20210521

Year of fee payment: 20

Ref country code: DE

Payment date: 20210525

Year of fee payment: 20

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: BE

Payment date: 20210519

Year of fee payment: 20

Ref country code: AT

Payment date: 20210518

Year of fee payment: 20

Ref country code: SE

Payment date: 20210521

Year of fee payment: 20

Ref country code: CH

Payment date: 20210525

Year of fee payment: 20

Ref country code: GB

Payment date: 20210526

Year of fee payment: 20

REG Reference to a national code

Ref country code: DE

Ref legal event code: R071

Ref document number: 50203511

Country of ref document: DE

REG Reference to a national code

Ref country code: NL

Ref legal event code: MK

Effective date: 20220509

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

REG Reference to a national code

Ref country code: BE

Ref legal event code: MK

Effective date: 20220510

REG Reference to a national code

Ref country code: GB

Ref legal event code: PE20

Expiry date: 20220509

REG Reference to a national code

Ref country code: AT

Ref legal event code: MK07

Ref document number: 298921

Country of ref document: AT

Kind code of ref document: T

Effective date: 20220510

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20220509