EP2439735A1 - Procédé et appareil pour générer des motifs de phase de référence - Google Patents
Procédé et appareil pour générer des motifs de phase de référence Download PDFInfo
- Publication number
- EP2439735A1 EP2439735A1 EP10306091A EP10306091A EP2439735A1 EP 2439735 A1 EP2439735 A1 EP 2439735A1 EP 10306091 A EP10306091 A EP 10306091A EP 10306091 A EP10306091 A EP 10306091A EP 2439735 A1 EP2439735 A1 EP 2439735A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- blocks
- frequency domain
- audio signal
- windowing
- subblocks
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 238000000034 method Methods 0.000 title claims description 17
- 238000001228 spectrum Methods 0.000 claims abstract description 53
- 230000005236 sound signal Effects 0.000 claims abstract description 31
- 238000001914 filtration Methods 0.000 claims description 16
- 238000012545 processing Methods 0.000 claims description 15
- 230000000694 effects Effects 0.000 abstract description 2
- 238000003775 Density Functional Theory Methods 0.000 description 16
- 238000011156 evaluation Methods 0.000 description 6
- 239000013598 vector Substances 0.000 description 4
- 238000007792 addition Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 230000003595 spectral effect Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000006855 networking Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/018—Audio watermarking, i.e. embedding inaudible data in the audio signal
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/022—Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
Definitions
- the invention relates to a method and to an apparatus for generating reference phase patterns to be used for controlling watermarking by frequency domain phase modulation of spectra of an audio signal.
- watermark embedding occurs in the phase domain by modifying phases of the host signal spectrum according to phase reference patterns. Specifically, the host audio signal is split into blocks of length L B and each block is used for embedding a watermark symbol. The watermark message is represented by a sequence of symbols. For embedding, each block can be further partitioned into N SB overlapping subblocks of L SB samples length with an overlap length of L SB /2.
- phase of the weighted subblock spectrum are modified according to generated reference phases and according to results from a psycho-acoustic analysis of the host audio signal.
- reference phases for each watermark message symbol are generated by the following steps:
- Step B3) mimics the windowing-overlap process that is commonly used in the analysis stage of digital audio signal processing.
- a windowing function is selected such that an ideal reconstruction is guaranteed following ideal analysis (windowing-overlap) and synthesis (windowing-overlap-add) processes.
- generated reference angles in step B4) are employed to modify host audio spectrum phases after the host signal has undergone the windowing-overlap processing.
- phase generation processing is that many IDFTs/DFTs are required, which imposes constraints on the achievable speed of hardware/software implementations, and the deployment of related audio watermarking schemes is therefore limited for real-time applications.
- IDFTs/DFTs are required, which imposes constraints on the achievable speed of hardware/software implementations, and the deployment of related audio watermarking schemes is therefore limited for real-time applications.
- the audio content is watermarked with a label that identifies the customer.
- a copyright owner will license multiple different works to the customer.
- different keys for individual copyrighted works are used. Therefore efficient reference phase generation is a necessary prerequisite for deploying an audio watermarking scheme in such application scenario.
- a problem to be solved by the invention is to provide more efficient generation of the reference phase patterns. This problem is solved by the method disclosed in claim 1. An apparatus that utilises this method is disclosed in claim 6.
- the efficient generation of random reference phase patterns is carried out by approximating the inverse transform of the phase reference patterns, and the corresponding forward transform and the windowing and windowing-overlapping effects using two different short-length frequency-domain filters for even and odd blocks or subblocks.
- the computational complexity for generating the reference phase pattern blocks or subblocks for frequency domain phase modulation of the audio signal blocks or subblocks is significantly reduced, which in turn reduces the complexity of both watermark embedder at sender side and watermark detector at receiver side.
- the inventive reference phase generation fulfils critical real-time requirements and enables a wide deployment of corresponding audio watermarking processings.
- the flexibility of controlling the generated phases is maintained, e.g. in case structured phases are used instead of random ones.
- the inventive method is suited for generating reference phase patterns to be used for controlling watermarking by frequency domain phase modulation of spectra of an audio signal, said method including the steps:
- the inventive apparatus is suited for generating reference phase patterns to be used for controlling watermarking by frequency domain phase modulation of spectra of an audio signal, said apparatus including:
- an audio signal block a is partitioned into N SB overlapped (50% overlapping) subblocks of length L SB denoted as ⁇ a m , 0 ⁇ m ⁇ N SB ⁇ . Accordingly, the even subblocks are non-overlapping and each odd subblock can be obtained by its two adjacent even subblocks. Specifically, the (2 i +1)-th (i.e.
- the phases of spectra of windowed subblocks a w,m serve as reference phases for the watermark embedding, followed by DFT for the weighted subblocks to get reference spectra.
- ⁇ A m [ k ], 0 ⁇ k ⁇ L SB -1 ⁇ and ⁇ W [ k ], 0 ⁇ k ⁇ L SB -1 ⁇ shall denote DFTs for the m-th subblock a m and for the windowing function, respectively.
- the first term in Eq.(4) is the contribution from two spectral lines of the same index within two adjacent subblocks and the second term is the contribution from other spectral lines.
- the reference spectrum for an odd subblock is obtained by inserting Eq.(4) into Eq.(2). Therefore, for the evaluation of reference spectra, two frequency-domain filters ⁇ F [ k ] ⁇ and ⁇ W [ k ] ⁇ are involved. In case they can be well-approximated by a few filter taps, an efficient evaluation of reference spectra is feasible.
- the spectrum of the windowing function can also be approximated by a few dominant components.
- the second term in Eq.(9) can be interpreted as the spectrum difference that passes through two filters, which can be regarded as representing in total a single filter:
- L SB ⁇ q - L w L w - 1 k + q 2 ⁇ A 2 ⁇ i ⁇ ⁇ k + q > + A 2 ⁇ i + 2 ⁇ ⁇ k + q > ⁇ W ⁇ q > + 1
- L SB 2 ⁇ q - L g L g - 1 k + q ⁇ A 2 ⁇ i ⁇ ⁇ k + q > - A 2 ⁇ i + 2 ⁇ ⁇ k + q > ⁇ G ⁇ q >
- L g L w + L 1 .
- Eq.(8) and Eq.(10) provide an efficient evaluation of the reference spectra, whereby only two frequency-domain filters ⁇ W [ ⁇ q >] ⁇ and ⁇ G [ ⁇ q >] ⁇ with a few dominant taps are required.
- the corresponding filter coefficients can be pre-calculated, and the approximation accuracy can be controlled by filter length parameters L w and L g . That is, the trade-off between robustness and computational complexity can be adjusted by choosing different L w and L g values.
- the original processing employing FFTs/IFFTs requires on average 62 real-value multiplications and 62 real-value additions for the evaluation of each DFT coefficient of the reference spectra.
- a received audio signal AS is sampled in an acquisition or receiving section step or stage 51, and thereafter passes through a segmentation, windowing and DFT step or stage 52 to a phase modulation step or stage 53, in which the phases are modulated within the frequency domain using references patterns or phases from a block 56.
- a corresponding IDFT, windowing and overlap-add step or stage 54 is passed, which outputs the watermarked audio signal WAS.
- the phase value modification can be controlled according to results from a psycho-acoustic analysis of the audio signal AS.
- a secret key SK is used to generate pseudo-random phases in a step or stage 55, which controls a reference pattern or phase generation step or stage 56 that operates using the above-described inventive processing.
- stage 56 does not include IDFT, windowing, DFT and overlap+add, but instead approximates these functions by a first frequency domain filtering by W for even subblocks and a different second frequency domain filtering by W and G for odd subblocks, thereby generating approximated versions of reference phase patterns for the audio signal subblocks.
- step 60 nSymbols, nSubBlocks and the secret key SK are input.
- running variable 'j' is set '0' and in step 612 running variable 'i' is set '0'.
- step 62 (which is included in step/stage 55 in Fig. 5 ) a random phase is generated for subblocks 2i and 2i+2.
- step 63 the spectra are constructed from the phases, whereby the magnitude equals '1'.
- step 64 the WOL subblock 2i+1 and the WIN subblocks 2i and 2i+2 are calculated. Steps 63 and 64 are included in step/stage 56 in Fig. 5 .
- step 65 'i' is incremented by '1'.
- the running variable 'j' is incremented by '1' in step 67. If 'j' has reached value (nSymbols-1) in step 68, nSubBlocks reference spectra for nSymbols are output in step 69.
- the order of even and odd subblocks is exchanged.
- a combination of frequency-domain convolution and IDFT-windowing-DFT can be employed for the phase generation.
- Eq.(8) is used for generating the reference spectra for an even subblock Eq.(8).
- Eq.(10) is realised as a frequency-domain convolution
- IDFT-windowing-DFT a time-domain subblock corresponding to ⁇ A 2i [ k ] - A 2i+2 [ k ] ⁇ is obtained via IDFT, and is thereafter weighted by a time-domain filter corresponding to ⁇ G [ k ] ⁇ .
- the reference spectrum of the odd subblock is obtained by a DFT of the weighted subblock.
- Such alternative processing typically has a higher computational complexity than the inventive processing described before but a lower complexity than the processing described in the background section, because the IDFT-windowing-DFT is applied to odd blocks only.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP10306091A EP2439735A1 (fr) | 2010-10-06 | 2010-10-06 | Procédé et appareil pour générer des motifs de phase de référence |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP10306091A EP2439735A1 (fr) | 2010-10-06 | 2010-10-06 | Procédé et appareil pour générer des motifs de phase de référence |
Publications (1)
Publication Number | Publication Date |
---|---|
EP2439735A1 true EP2439735A1 (fr) | 2012-04-11 |
Family
ID=43607892
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP10306091A Withdrawn EP2439735A1 (fr) | 2010-10-06 | 2010-10-06 | Procédé et appareil pour générer des motifs de phase de référence |
Country Status (1)
Country | Link |
---|---|
EP (1) | EP2439735A1 (fr) |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014062332A1 (fr) * | 2012-10-16 | 2014-04-24 | The Nielsen Company (Us), Llc | Procédés et appareils pour exécuter une détection et une extraction d'un tatouage numérique audio |
US9596521B2 (en) | 2014-03-13 | 2017-03-14 | Verance Corporation | Interactive content acquisition using embedded codes |
US9639911B2 (en) | 2014-08-20 | 2017-05-02 | Verance Corporation | Watermark detection using a multiplicity of predicted patterns |
US9769543B2 (en) | 2014-11-25 | 2017-09-19 | Verance Corporation | Enhanced metadata and content delivery using watermarks |
US9942602B2 (en) | 2014-11-25 | 2018-04-10 | Verance Corporation | Watermark detection and metadata delivery associated with a primary content |
CN108417199A (zh) * | 2013-01-18 | 2018-08-17 | 株式会社东芝 | 音频水印信息检测装置及音频水印信息检测方法 |
US10277959B2 (en) | 2014-12-18 | 2019-04-30 | Verance Corporation | Service signaling recovery for multimedia content using embedded watermarks |
US10504200B2 (en) | 2014-03-13 | 2019-12-10 | Verance Corporation | Metadata acquisition using embedded watermarks |
US11722741B2 (en) | 2021-02-08 | 2023-08-08 | Verance Corporation | System and method for tracking content timeline in the presence of playback rate changes |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007031423A1 (fr) | 2005-09-16 | 2007-03-22 | Thomson Licensing | Filigranage inaudible de signaux audio faisant appel a des modifications de phase |
-
2010
- 2010-10-06 EP EP10306091A patent/EP2439735A1/fr not_active Withdrawn
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007031423A1 (fr) | 2005-09-16 | 2007-03-22 | Thomson Licensing | Filigranage inaudible de signaux audio faisant appel a des modifications de phase |
Non-Patent Citations (3)
Title |
---|
FERRIS T L J ET AL: "Frequency domain method for windowing in Fourier analysis", ELECTRONICS LETTERS, vol. 28, no. 15, 16 July 1992 (1992-07-16), pages 1440, XP002626131, ISSN: 0013-5194, DOI: 10.1049/el:19920916 * |
M.ARNOLD; P.G.BAUM; W.VOESSING: "A phase modulation audio watermarking technique", LLTH INFORMATION HIDING WORKSHOP, 2009, pages 102 - 116 |
MICHAEL ARNOLD ET AL: "A Phase Modulation Audio Watermarking Technique", 8 June 2009, INFORMATION HIDING, SPRINGER BERLIN HEIDELBERG, BERLIN, HEIDELBERG, PAGE(S) 102 - 116, ISBN: 978-3-642-04430-4, XP019129276 * |
Cited By (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9368123B2 (en) | 2012-10-16 | 2016-06-14 | The Nielsen Company (Us), Llc | Methods and apparatus to perform audio watermark detection and extraction |
WO2014062332A1 (fr) * | 2012-10-16 | 2014-04-24 | The Nielsen Company (Us), Llc | Procédés et appareils pour exécuter une détection et une extraction d'un tatouage numérique audio |
CN108417199B (zh) * | 2013-01-18 | 2022-11-22 | 株式会社东芝 | 音频水印信息检测装置及音频水印信息检测方法 |
CN108417199A (zh) * | 2013-01-18 | 2018-08-17 | 株式会社东芝 | 音频水印信息检测装置及音频水印信息检测方法 |
US10110971B2 (en) | 2014-03-13 | 2018-10-23 | Verance Corporation | Interactive content acquisition using embedded codes |
US9596521B2 (en) | 2014-03-13 | 2017-03-14 | Verance Corporation | Interactive content acquisition using embedded codes |
US9681203B2 (en) | 2014-03-13 | 2017-06-13 | Verance Corporation | Interactive content acquisition using embedded codes |
US10504200B2 (en) | 2014-03-13 | 2019-12-10 | Verance Corporation | Metadata acquisition using embedded watermarks |
US10499120B2 (en) | 2014-03-13 | 2019-12-03 | Verance Corporation | Interactive content acquisition using embedded codes |
US9854332B2 (en) | 2014-03-13 | 2017-12-26 | Verance Corporation | Interactive content acquisition using embedded codes |
US9854331B2 (en) | 2014-03-13 | 2017-12-26 | Verance Corporation | Interactive content acquisition using embedded codes |
US10445848B2 (en) | 2014-08-20 | 2019-10-15 | Verance Corporation | Content management based on dither-like watermark embedding |
US10354354B2 (en) | 2014-08-20 | 2019-07-16 | Verance Corporation | Content synchronization using watermark timecodes |
US9805434B2 (en) | 2014-08-20 | 2017-10-31 | Verance Corporation | Content management based on dither-like watermark embedding |
US9639911B2 (en) | 2014-08-20 | 2017-05-02 | Verance Corporation | Watermark detection using a multiplicity of predicted patterns |
US10178443B2 (en) | 2014-11-25 | 2019-01-08 | Verance Corporation | Enhanced metadata and content delivery using watermarks |
US9942602B2 (en) | 2014-11-25 | 2018-04-10 | Verance Corporation | Watermark detection and metadata delivery associated with a primary content |
US9769543B2 (en) | 2014-11-25 | 2017-09-19 | Verance Corporation | Enhanced metadata and content delivery using watermarks |
US10277959B2 (en) | 2014-12-18 | 2019-04-30 | Verance Corporation | Service signaling recovery for multimedia content using embedded watermarks |
US11722741B2 (en) | 2021-02-08 | 2023-08-08 | Verance Corporation | System and method for tracking content timeline in the presence of playback rate changes |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP2439735A1 (fr) | Procédé et appareil pour générer des motifs de phase de référence | |
CA2897276C (fr) | Dispositif et procede pour manipuler un signal audio comportant un evenement transitoire | |
CN102741921B (zh) | 改进的基于子带块的谐波换位 | |
CN102985970B (zh) | 在用于音频信号的基于相角声码器的带宽扩展中改善的幅值响应和时间对准 | |
EP1686566A2 (fr) | Traitenment de son avec transposition de fréquence | |
Pourhashemi et al. | Audio watermarking based on synergy between Lucas regular sequence and Fast Fourier Transform | |
Unoki et al. | Robust, blindly-detectable, and semi-reversible technique of audio watermarking based on cochlear delay characteristics | |
Islam et al. | Robust dual domain twofold encrypted image-in-audio watermarking based on SVD | |
Moorer | A note on the implementation of audio processing by short-term fourier transform | |
Hemis et al. | Adjustable audio watermarking algorithm based on DWPT and psychoacoustic modeling | |
Nouri et al. | Conceptual authentication speech hashing base upon hypotrochoid graph | |
Zhang | Audio dual watermarking scheme for copyright protection and content authentication | |
EP2532002B1 (fr) | Appareil, procédé et programme informatique pour le traitement de signaux audio | |
Singh et al. | Multiplicative watermarking of audio in DFT magnitude | |
Salcic | Audio watermarking | |
Lin et al. | Audio watermarking techniques | |
Subhashini et al. | Robust audio watermarking for monitoring and information embedding | |
Kumar et al. | Increased watermark-to-host correlation of uniform random phase watermarks in audio signals | |
Fu et al. | Empirical mode decomposition based blind audio watermarking | |
Dutta et al. | An adaptive robust watermarking algorithm for audio signals using SVD | |
Nouri et al. | Analysis of a novel audio hash function based upon stationary wavelet transform | |
Naqash et al. | Robust Audio Watermarking Based on Iterative Filtering | |
Nishimura | Reversible and robust audio watermarking based on spread spectrum and amplitude expansion | |
Safitri et al. | Audio Watermarking Combined with Compressive Sampling Based on QIM and DST-QR Techniques | |
Garlapati et al. | A symbol based watermarking approach for spread spectrum audio watermarking methods |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
AX | Request for extension of the european patent |
Extension state: BA ME |
|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
18D | Application deemed to be withdrawn |
Effective date: 20121012 |