KR20050023454A

KR20050023454A - Watermark detection

Info

Publication number: KR20050023454A
Application number: KR10-2005-7001183A
Authority: KR
Inventors: 알폰스 에이. 엠. 엘. 브루커스; 제프 에이. 하이츠마; 미네 반데르빈; 안토니우스 에이. 씨. 엠. 칼커
Original assignee: 코닌클리케 필립스 일렉트로닉스 엔.브이.
Priority date: 2002-07-22
Filing date: 2003-07-07
Publication date: 2005-03-09

Abstract

추측 신호(suspect signal)와 워터마크 사이의 크로스-상관(cross-correlation)의 계산에 기초하는 워터마크 검출 방법이 개시된다. 상기 상관에 반하게(adversely) 영향을 미치는 연장 기본 신호 컴포넌트들(prolonged dominant signal components)에 대해 더욱 강해지도록(robust), 상기 워터마크와 상관되는 상기 신호 샘플들의 시퀀스(61)는 하위-시퀀스들(A(k))로 분할된다. 상기 하위-시퀀스들은, 개별적으로 초기 신호 변화들을 나타내지만, 집합적으로는(62) 샘플 값들의 더욱 평탄한 분포를 나타내는 수정된 하위-시퀀스들(B(k))을 획득하도록 가중 함수(weighting function)에 의해 처리된다. 그에 따라, 신호 내의 기본 피크들은 실질적으로 감소된다.A watermark detection method is disclosed that is based on the calculation of cross-correlation between a guess signal and a watermark. The sequence 61 of signal samples correlated with the watermark is sub-sequences to be stronger for prolonged dominant signal components that adversely affect the correlation. Divided by (A (k)). The sub-sequences individually represent initial signal changes, but collectively (62) a weighting function to obtain modified sub-sequences B (k) which represent a flatter distribution of sample values. Is handled by As such, the fundamental peaks in the signal are substantially reduced.

Description

Watermark Detection {WATERMARK DETECTION}

본 발명은 신호 내의 워터마크를 검출하는 방법 및 장치에 관한 것으로, 상기 방법은 신호 샘플들의 시퀀스와 미리결정된 워터마크 사이의 상관을 계산하는 단계 및 상기 상관이 주어진 임계치를 초과하는 지의 여부를 검출하는 단계를 포함한다.The present invention relates to a method and apparatus for detecting a watermark in a signal, the method comprising calculating a correlation between a sequence of signal samples and a predetermined watermark and detecting whether the correlation exceeds a given threshold Steps.

워터마크들은 오디오 또는 비디오와 같은 정보 신호들의 컨텐트에 임베딩된(embedded) 감지할 수 없는 메시지들이다. 워터마크들은 모니터링 및 복제 제어와 같은 다양한 어플리케이션들을 지원한다. 일반적으로 워터마크는 워터마크 각각의 샘플들에 따라 신호 샘플들을 수정함으로써 신호에 임베딩된다. 용어 "샘플들"은 워터마크가 임베딩된 영역 내의 신호값들을 나타낸다.Watermarks are undetectable messages embedded in the content of information signals such as audio or video. Watermarks support various applications such as monitoring and copy control. In general, a watermark is embedded in the signal by modifying the signal samples according to each sample of the watermark. The term "samples" refers to signal values in the region where the watermark is embedded.

오디오에 대한 종래 워터마크 임베딩 및 검출 시스템은, 2002년 10월 30일 내지 2002년 11월 4일, ACM Multimedia Conference의 199 내지 122 페이지에 있는 Jaap Haitsma, Michiel van der Veen, Ton Kalker 및 Fons Bruekers의 제목 "Audio Watermarking for Monitoring and Copy Protection"에 개시되었다. 오디오 신호는 프레임들로 분할되고, 주파수 영역으로 변환된다. 워터마크 시퀀스는 각 프레임의 푸리에 계수들(Fourier coefficients)의 크기들(magnitudes)로 임베딩된다. 검출기는 워터마킹된 오디오 신호의 시간 영역 버전을 수신한다. 상기 수신된 신호는 프레임들로 분할되고, 주파수 영역으로 변환된다. 푸리에 계수들의 크기들은 워터마크 시퀀스와 크로스-상관(cross-correlated)된다. 상관이 주어진 임계치를 초과하는 경우, 워터마크가 존재하는 것이다. 서두에서 규정된 표현 "신호 샘플들의 시퀀스"는 상기 경우에서 오디오 프레임의 푸리에 계수들의 크기들을 나타낸다.Conventional watermark embedding and detection systems for audio are described by Jaap Haitsma, Michiel van der Veen, Ton Kalker and Fons Bruekers, October 30, 2002-November 4, 2002, pages 199-122 of the ACM Multimedia Conference. Title "Audio Watermarking for Monitoring and Copy Protection". The audio signal is divided into frames and converted into a frequency domain. The watermark sequence is embedded with the magnitudes of the Fourier coefficients of each frame. The detector receives a time domain version of the watermarked audio signal. The received signal is divided into frames and converted into a frequency domain. The magnitudes of the Fourier coefficients are cross-correlated with the watermark sequence. If the correlation exceeds a given threshold, then there is a watermark. The expression "sequence of signal samples" defined at the outset represents the magnitudes of the Fourier coefficients of the audio frame in this case.

비디오에 대한 종래의 워터마크 임베딩 및 검출 시스템은, 1999년 1월, SPIE의 Proceedings의 Vol.3657, 103 내지 112 페이지에 있는 Ton Kalker, Geert Depovere, Jaap Haitsma and Maurice Maes의 제목 "A Video watermarking System for Broadcast Monitoring"에 개시되었다. 상기 시스템에서, 워터마크는 픽셀 영역에 임베딩된다. 워터마크 시퀀스는 이미지를 통해 타일링된 128x128의 워터마크 패턴이다. 워터마크 검출기는 128x128 이미지 블록들을 워터마크 패턴과 상관시킨다. 상기 상관이 충분히 큰 경우에, 워터마크가 존재하는 것이다. 서두에서 규정한 표현 "신호 샘플들의 시퀀스"는 상기 경우에서 128x128 픽셀들의 이미지 블록들을 나타낸다.A conventional watermark embedding and detection system for video is entitled, “A Video watermarking System” by Ton Kalker, Geert Depovere, Jaap Haitsma and Maurice Maes, January 1999, Vol.3657, Proceedings of SPIE, pages 103-112. for Broadcast Monitoring ". In the system, the watermark is embedded in the pixel area. The watermark sequence is a 128x128 watermark pattern tiled through the image. The watermark detector correlates 128x128 image blocks with the watermark pattern. If the correlation is large enough, there is a watermark. The expression "sequence of signal samples" defined at the outset represents the image blocks of 128x128 pixels in this case.

워터마크 검출 알고리즘들은, 오디오 신호 내에 존재하거나 오디오 신호에 부가된 강한 싱글톤(strong single tone), 또는 모든 비디오 프레임마다 고정된 위치 상에 존재하는 강한 로고 표시(strong logo present)나 매 프레임의 기저부에 있는 백색 자막 문자들과 같은 특정 신호 상태들 또는 공격들(attacks)에 대해 민감하다.Watermark detection algorithms can be based on a strong single tone that is present in or appended to an audio signal, or a strong logo present that is on a fixed position every frame of video, or the base of every frame. It is sensitive to certain signal conditions or attacks, such as the white captioned characters in.

도 1은 워터마크 임베딩 처리에 대한 배경 정보를 제공하기 위해 워터마크를 임베딩하는 종래 장치를 개략적으로 도시하는 도면.1 is a schematic illustration of a conventional apparatus for embedding a watermark to provide background information for watermark embedding processing;

도 2는 본 발명에 따라 워터마크를 검출하는 장치의 바람직한 실시예를 개략적으로 도시하는 도면.2 schematically shows a preferred embodiment of an apparatus for detecting a watermark according to the present invention;

도 3은 본 발명에 따른 방법의 수행을 도시하는 오디오 신호의 상관 피크값들을 도시하는 그래프.3 is a graph showing correlation peak values of an audio signal illustrating the performance of the method according to the invention.

도 4 내지 도 6은 도 2에 도시된 워터마크 검출 장치의 동작을 도시하는 도면들.4 to 6 are views showing the operation of the watermark detection apparatus shown in FIG.

도 7은 본 발명에 따른 워터마크 검출 방법의 수행을 나타내는 상관 피크값들을 도시하는 다른 그래프.Fig. 7 is another graph showing correlation peak values indicating performance of the watermark detection method according to the present invention.

본 발명의 목적은 종래 워터마크 검출 방법의 성능을 개선하기 위한 것이다.An object of the present invention is to improve the performance of the conventional watermark detection method.

따라서, 본 발명에 따른 방법은 신호 샘플들의 상기 시퀀스를 전처리하는 단계를 포함하며, 상기 전처리는:Thus, the method according to the invention comprises the step of preprocessing said sequence of signal samples, said preprocessing:

- 신호 샘플들의 시퀀스를 하위-시퀀스들로 분할하는 단계;Dividing the sequence of signal samples into sub-sequences;

- 하위-시퀀스의 모든 신호 샘플들을 동일 가중치로 가중시키고, 시퀀스에 걸쳐 신호 샘플들의 실질적으로 균일한 분포를 획득하도록 하위-시퀀스들간의 상기 가중치를 변화시키는 단계; 및 Weighting all signal samples of the sub-sequences with equal weights and changing the weights between the sub-sequences to obtain a substantially uniform distribution of signal samples over the sequence; And

- 신호 샘플들의 전처리된 시퀀스를 획득하도록 가중된 하위-시퀀스들을 연쇄(concatenating) 시키는 단계를 포함하는 것을 특징으로 한다.Concatenating the weighted sub-sequences to obtain a preprocessed sequence of signal samples.

본 발명에 따른 방법은 워터마크를 나타내는 작은 신호 변화들을 유지하면서, 큰 신호 피크들을 효율적으로 억제한다. 이것은 신호 내의 교란 성분(disturbing component)의 위치를 인지 또는 검출하지 않고도 달성된다.The method according to the invention effectively suppresses large signal peaks while maintaining small signal changes indicative of a watermark. This is accomplished without knowing or detecting the location of the disturbing component in the signal.

본 발명은 워터마크 검출 방법이 다수의 신호 시퀀스들의 축적(accumulation)을 포함하는 경우에 특히 효율적이다. 통상적으로 상기와 같은 축적은 검출 신뢰도를 향상시키지만(워터마크 시퀀스들은 합당하지만 신호는 평균화된다), 신호가 실질적으로 축적된 모든 시퀀스들 내에 동일한 교란 성분을 포함하는 경우에서는 그렇지 않다. 본 발명에 따른 방법의 바람직한 실시예에서, 전처리 단계는 상기 축적된 시퀀스들에 적용된다. 그에 따라, 교란 성분은 축적된 시퀀스들로부터 효과적으로 제거될 수 있음이 달성되었다.The present invention is particularly efficient when the watermark detection method involves the accumulation of multiple signal sequences. Typically such accumulation improves detection reliability (watermark sequences are reasonable but the signal is averaged), but not where the signal contains the same disturbing component in all the accumulated sequences. In a preferred embodiment of the method according to the invention, the pretreatment step is applied to the accumulated sequences. Thus, it has been achieved that the disturbing component can be effectively removed from the accumulated sequences.

본 발명에 따른 유리한 실시예에서, 신호 샘플들의 시퀀스는 중첩하는, 바람직하게는 윈도우된(windowed), 하위-시퀀스들로 분할된다. 적절한 윈도우는 널리 공지된 Hanning window 또는 Hanning window의 제곱근(square root)이다. 50%의 중첩이 좋은 결과들을 가져오는 것으로 확인되었다. 워터마크와 상관될 연쇄 시퀀스는 가중된(weighted) 하위-시퀀스들을 추가함으로써 획득된다.In an advantageous embodiment according to the invention, the sequence of signal samples is divided into overlapping, preferably windowed, sub-sequences. Suitable windows are the well-known Hanning window or the square root of the Hanning window. 50% overlap was found to produce good results. The concatenation sequence to be correlated with the watermark is obtained by adding weighted sub-sequences.

바람직하게는, 가중 단계는 신호 샘플들의 하위-시퀀스를 푸리에 변환하고, 푸리에 계수들을 크기를 정규화하고, 정규화된 계수들을 역변환하는 단계를 포함한다. 대안적으로, 가중 단계는 하위-시퀀스의 모든 신호 샘플들을 상기 하위-시퀀스의 최대 신호 샘플로 분할하는 것을 포함한다. 제 2 옵션, 즉 스케일링은, 주파수 영역으로 크기들을 정규화함으로써 가중치가 획득되는 제 1 옵션보다 낮은 계산 복잡성을 갖는다. 실시예들에서, 시퀀스는 신호 특성들에 기초하여 적응적으로 가중된다.Preferably, the weighting step comprises Fourier transforming a sub-sequence of signal samples, normalizing the Fourier coefficients, and inversely transforming the normalized coefficients. Alternatively, the weighting step includes dividing all signal samples of the sub-sequence into the maximum signal samples of the sub-sequence. The second option, ie scaling, has a lower computational complexity than the first option, which is weighted by normalizing the magnitudes in the frequency domain. In embodiments, the sequence is adaptively weighted based on the signal characteristics.

본 발명의 이들 및 그 밖의 양상들은 첨부 도면들을 참조하여 분명해지고, 명료해질 것이다.These and other aspects of the invention will be apparent from and elucidated with reference to the accompanying drawings.

이제 본 발명은 오디오 신호에 임베딩된 워터마크의 검출에 대해 설명한다. 우선 배경 정보를 제공하기 위해 임베딩 장치를 설명한다. 도 1은 상기 장치를 개략적으로 도시한다. 장치는 오디오 샘플들(x(n))의 형태로 오디오 신호를 수신하고, 상기 신호에 워터마크(w(n))를 추가하기 위한 가산기(101)를 포함한다. 워터마크(w(n))의 주요부(dominant part)는 푸리에 영역(Fourier domain)에서 도출된다. 장치는 오디오 신호를 2048 샘플들의 시퀀스들 또는 프레임들로 분할하는 분할 유닛(102)을 포함한다. 시퀀스들은 푸리에 변환(103)을 이용하여 변환된다. 주파수 영역의 랜덤 워터마크(W(k))는 평균 및 표준 편차(mean and standard deviation) 0 및 1과 함께 정규분포(normal distribution)로부터 개별적으로 얻어진다. 워터마크(W(k))는 10비트 페이로드(d)를 나타내는 양만큼 시프팅 회로(104)에서 주기적으로 시프팅된다. 푸리에 계수들의 크기들은 다음에 따라 곱셈기(multiplier; 105)에 의해 수정된다.The present invention now describes the detection of a watermark embedded in an audio signal. First, the embedding apparatus will be described to provide background information. 1 schematically shows the device. The apparatus includes an adder 101 for receiving an audio signal in the form of audio samples x (n) and adding a watermark w (n) to the signal. The dominant part of the watermark w (n) is derived from the Fourier domain. The apparatus includes a splitting unit 102 that splits an audio signal into sequences or frames of 2048 samples. The sequences are transformed using Fourier transform 103. The random watermark W (k) in the frequency domain is obtained separately from the normal distribution with mean and standard deviations 0 and 1. The watermark W (k) is periodically shifted in the shifting circuit 104 by an amount representing the 10-bit payload d. The magnitudes of the Fourier coefficients are modified by a multiplier 105 as follows.

W_i(k) = W_s(k)X_i(k)W _i (k) = W _s (k) X _i (k)

여기서, i는 프레임 또는 시퀀스 번호를 나타내고, X_i(k)는 프레임(x_i(n))의 스펙트럼 표현, W_s(k)는 W(k)의 주기적으로 시프팅된 버전, 및 Wi(k)는 주파수 영역 워터마크의 결과를 나타낸다. 역 푸리에 변환(106)은 시간 영역 워터마크 표현( w(n))을 획득하기 위해 사용된다.Where i represents a frame or sequence number, X _i (k) is a spectral representation of the frame (x _i (n)), W _s (k) is a periodically shifted version of W (k), and Wi ( k) shows the result of the frequency domain watermark. Inverse Fourier transform 106 is used to obtain a time domain watermark representation w (n).

도 2는 본 발명에 따라 워터마크를 검출하는 장치의 바람직한 실시예를 개략적으로 도시한다. 본 도면에서 도시한 바와 같이, 장치는 축적(accumulation; 1), 전처리(pre-processing; 2), 및 상관(correlation; 3)의 주요 세 단계들을 포함한다.2 schematically shows a preferred embodiment of an apparatus for detecting a watermark according to the present invention. As shown in the figure, the apparatus comprises three main steps: accumulation (1), pre-processing (2), and correlation (3).

축적 단계의 분할 유닛(11)에서, 장치는 추측 오디오 신호(y(n))를 2048 오디오 샘플들의 프레임들 또는 시퀀스들(y_i(n))로 분할한다. 각각의 시퀀스는 푸리에 변환되고(12), 푸리에 계수들(Y_i(k))의 크기들이 계산된다(13). 프레임(i)의 푸리에 계수들의 크기들은 워터마크 정보가 임베딩된 1024 리얼 번호들의 시퀀스(｜Y｜_i(k))를 구성한다. 장치의 바람직한 실시예에서, 다수의 상기 시퀀스들(｜Y｜_i(k))은 축적된 시퀀스(Y(k))를 획득하도록 축적기(14)에 의해 축적된다. 축적되는 시퀀스들의 수는 주기 말하자면, 2초의 오디오 신호 주기를 나타내도록 선택된다.In the dividing unit 11 of the accumulating step, the apparatus divides the speculative audio signal y (n) into frames or sequences y _i (n) of 2048 audio samples. Each sequence is Fourier transformed 12 and the magnitudes of the Fourier coefficients Y _i (k) are calculated 13. The magnitudes of the Fourier coefficients of frame i constitute a sequence of 1024 real numbers (| Y | _i (k)) in which watermark information is embedded. In a preferred embodiment of the apparatus, a plurality of said sequences | Y | _i (k) are accumulated by accumulator 14 to obtain an accumulated sequence Y (k). The number of accumulating sequences is chosen to represent a period, that is, an audio signal period of two seconds.

이제 상관 단계(3)가 간단히 개시된다. 상관을 이용하는 워터마크 검출의 상세 설명에 대해, 국제 특허 출원 제 WO 99/45707을 참조한다. 상관 단계는 신호 샘플들의 축적된 시퀀스와 워터마크 시퀀스(W(k))의 모든 가능한 시프팅된 버전 사이의 상관(C)을 계산한다(본 예의 "신호 샘플들"은 푸리에 계수들의 크기들을 나타냄을 유의). 상관 단계는 시퀀스(Z(k))를 수신한다. 상관 단계는 축적 단계(1)로부터 축적된 시퀀스(즉, Z(k)=Y(k))를 직접적으로 수신하는 것으로 초기에 가정한다.The correlation step 3 is now briefly initiated. For a detailed description of watermark detection using correlation, see international patent application WO 99/45707. The correlation step calculates a correlation C between the accumulated sequence of signal samples and all possible shifted versions of the watermark sequence W (k) ("signal samples" in this example represent magnitudes of Fourier coefficients). Note). The correlation step receives the sequence Z (k). The correlation step is initially assumed to directly receive the accumulated sequence from the accumulation step 1 (ie, Z (k) = Y (k)).

W(k)의 가능한 모든 시프팅된 버전에 대한 크로스-상관은 푸리에 변환을 이용하여 가장 효율적으로 계산된다. 종래 크로스-상관은 다음과 같이 기록될 수 있다.The cross-correlation for all possible shifted versions of W (k) is most efficiently calculated using the Fourier transform. The conventional cross-correlation can be recorded as follows.

C=F^-1(F(Z(k))×F^*(W(k)))C = F ^-1 (F (Z (k)) × F ^* (W (k)))

여기서, F(.)는 푸리에 변환을, F^*(.)는 복합 푸리에 계수들의 활용을 포함하는 푸리에 변환을, 그리고 F^-1(.)은 역 푸리에 변환을 나타낸다. 각각의 변환들은 도 2의 푸리에 변환 회로들(31, 32, 및 33)에 의해 수행된다. 곱셈은 곱셈기(34)에 의해 수행된다.Where F (.) Denotes a Fourier transform, F ^* (.) Denotes a Fourier transform that includes the use of complex Fourier coefficients, and F- ¹ (.) Denotes an inverse Fourier transform. Respective transformations are performed by the Fourier transform circuits 31, 32, and 33 of FIG. 2. Multiplication is performed by multiplier 34.

검출 수행은 SPOMF(Symmetrical Phase Only Filtering)에 의해 향상된다. 상기 크로스-상관 절차에서는, 신호들(F(Z(k)) 및 F^*(W(k)))의 위상 정보만이 사용된다. 위상-전용 동작(phase-only operation)은 다음과 같이 규정되고, 도 2의 각 위상 추출 회로들(35 및 36)에 의해 수행된다.Detection performance is enhanced by symmetrical phase only filtering (SPOMF). In the cross-correlation procedure, only phase information of signals F (Z (k)) and F ^* (W (k)) is used. Phase-only operation is defined as follows and performed by the respective phase extraction circuits 35 and 36 in FIG.

피크 검출기(4)는 크로스-상관 함수(C)가 주어진 검출 임계치보다 큰 피크값()(예를 들어, σ가 상관 함수의 표준 편차인 가운데 5σ)을 나타내는지 여부를 결정한다. 이러한 경우, 워터마크(W(k))가 존재하는 것이다. 피크 검출기는 또한 워터마크(W(k))에 적용되는 시프트 양에 대응하는 상기 피크 값의 위치를 검색(retrieve)하고, 그에 따라 10비트 페이로드(d)를 나타낸다. 그러나, 상기 양상은 본 발명에 관련하지 않는다.The peak detector 4 has a peak value where the cross-correlation function C is greater than a given detection threshold. ), E.g., 5 is the standard deviation of the correlation function. In this case, the watermark W (k) exists. The peak detector also retrieves the position of the peak value corresponding to the shift amount applied to the watermark W (k), thus representing a 10 bit payload d. However, this aspect is not relevant to the present invention.

도 3은 오디오 신호의 1초 간격들로 측정된 상관 피크값들()의 그래프를 도시한다. 짙은 선(31)은 음악의 정규 피스에 대한 결과를 나타낸다. 쉽게 확인할 수 있는 것과 같이, 각각의 피크값은 임계치(5)를 명백하게 초과하는 것으로 즉, 신호는 임베딩된 워터마크를 갖는다. 파선(32)은 동일한 음악의 피스들에 대한 피크값들을 나타내고, 강한 15kHz 사인파로 분포되고 있다. 어떤 피크값들도 임계치(5)를 초과하지 않고 있다. 이제 검출기는, 상기 신호가 임베딩된 워터마크를 갖지 않는 것으로 잘못된 결정을 내린다. 이러한 문제는 도 4 및 도 5를 참조하여 기술된다. 도 4에서, 번호(41)는 음악의 정규 피스로부터 도출된 일반적 축적 시퀀스(Y(k))를 나타낸다. 도 5에서, 번호(51)는 동일하지만 교란된 음악의 피스로부터 도출된 대응하는 시퀀스(Y(k))를 나타낸다. 15kHz 톤이 신호를 조절(dominate)하여, 워터마크 정보를 나르는 시퀀스(51) 내 푸리에 성분들의 크기 변화들이 시퀀스(41) 내 변화들과 비교해 무의미하게 축소된다.3 shows correlation peak values (measured at one second intervals of an audio signal, Shows a graph. Dark line 31 represents the result for a regular piece of music. As you can easily see, each peak value is a threshold (5 E.g., the signal has an embedded watermark. The dashed lines 32 represent peak values for the same pieces of music and are distributed with a strong 15 kHz sine wave. None of the peaks have a threshold (5 ) Is not exceeded. The detector now makes a false decision that the signal does not have an embedded watermark. This problem is described with reference to FIGS. 4 and 5. In Fig. 4, the number 41 represents the general accumulation sequence Y (k) derived from the normal piece of music. In Fig. 5, the number 51 represents the corresponding sequence Y (k) derived from the same but disturbing piece of music. As the 15 kHz tone dominates the signal, the magnitude changes of the Fourier components in the sequence 51 carrying the watermark information are insignificantly reduced compared to the changes in the sequence 41.

상기 문제를 극복하기 위한 가능한 솔루션은, 예컨대 교란 성분들이 존재하는 비디오 프레임 부분들 또는 오디오 스펙트럼 부분들의 신호 부분들을 무시하는 것이다. 예를 들면, 비디오 신호에서 로고의 위치는 대응하는 픽셀들이 무시될 수 있도록, 미리 공지되는 것이다. 또는, 오디오 워터마크 검출기가 FM 라디오국을 관측하는 경우, 반송파에 근접한 주파수들은 무시될 수 있다. 신호의 부분들을 무시하는 것은 신호에 갑작스럽게 가중 함수를 더 많거나 더 적게 공급하는 것으로 보일 수 있다. 그러나, 일반적으로 교란 성분들의 위치는 공지되지 않는다. 몇 종류의 메카니즘은 신호에 가중 함수를 적응하기를 원하는 것이다.A possible solution to overcome this problem is to ignore the signal parts of the video frame parts or the audio spectral parts, for example where there are disturbing components. For example, the position of the logo in the video signal is known in advance so that the corresponding pixels can be ignored. Or, if the audio watermark detector observes an FM radio station, frequencies close to the carrier can be ignored. Ignoring parts of the signal may appear to supply more or less weight functions suddenly. In general, however, the location of disturbing components is not known. Some kind of mechanism wants to adapt the weighting function to the signal.

따라서, 본 발명에 따라 워터마크를 검출하는 장치는 축적 단계(1)와 상관 단계(3) 사이에 전처리 단계(2)를 포함한다(도 2 참조). 전처리 단계는 하위-분할 유닛(21), 가중 회로(22), 및 연쇄 회로(23)를 포함한다. Thus, the apparatus for detecting a watermark according to the present invention includes a preprocessing step 2 between the accumulation step 1 and the correlation step 3 (see Fig. 2). The pretreatment step comprises a sub-division unit 21, a weighting circuit 22, and a chain circuit 23.

하위-분할 유닛(21)은 축적된 시퀀스(Y(k))를 가능한 다수의 중첩 및 윈도우된 하위-시퀀스들(A(k))로 분할한다. 시퀀스(Y(k))가 1024 신호 샘플들을 포함하는 오디오 신호들에 대해, 16 샘플들의 하위-시퀀스 길이는 좋은 선택으로 확인되었다.The sub-division unit 21 divides the accumulated sequence Y (k) into as many overlapping and windowed sub-sequences A (k) as possible. For audio signals where the sequence Y (k) contains 1024 signal samples, the sub-sequence length of 16 samples has been found to be a good choice.

가중 회로(22)는 각각의 개별 하위-시퀀스를 가중 함수로 가중시킨다. 각각의 하위-시퀀스 내에서의 신호 샘플들의 원래 변화들은 유지되는 반면, 가중 함수는 신호 샘플들의 분포가 전체 시퀀스에 걸쳐 실질적으로 균일해지도록 선택된다. 표현"실질적으로 균일함"은 예를 들어, 하위-시퀀스들의 신호 샘플들의 평균값이 전체 하위-시퀀스들에 대해 동일한 것을 의미할 수 있다.The weighting circuit 22 weights each individual sub-sequence as a weighting function. The original changes of the signal samples within each sub-sequence are maintained, while the weighting function is chosen such that the distribution of the signal samples is substantially uniform throughout the entire sequence. The expression “substantially uniform” can mean, for example, that the mean value of the signal samples of the sub-sequences is the same for the entire sub-sequences.

일 실시예에서, 이것은 주파수 영역의 각각의 하위-시퀀스의 크기들을 정규화함으로써 획득된다. 따라서, 가중 회로는 다음의 동작을 수행한다.In one embodiment, this is obtained by normalizing the magnitudes of each sub-sequence of the frequency domain. Thus, the weighting circuit performs the following operations.

B(k)=F^-1(P(F(A(k))) (1)B (k) = F ^-1 (P (F (A (k))) (1)

여기서 F(.)는 푸리에 변환을 나타내고, P(.)는 상기와 같은 위상 전용 동작을 나타내며, F^-1(.)은 역 푸리에 변환을 나타낸다.Where F (.) Represents the Fourier transform, P (.) Represents the phase-only operation as described above, and F- ¹ (.) Represents the Inverse Fourier transform.

다른 실시예에서, 가중치는 다음의 스케일링 동작에 의해 수행된다.In another embodiment, the weights are performed by the following scaling operation.

여기서, A_k 및 B_k는 개별적으로 가중된 하위-시퀀스(B(k)) 및 원래의 하위-시퀀스(A(k))의 샘플들을 나타내고, ｜A_k｜는 하위-시퀀스(A(k))의 신호 샘플들의 최대 절대값(absolute value)이다.Where A _k and B _k represent the samples of the individually weighted sub-sequence B (k) and the original sub-sequence A (k), where A _k | is the sub-sequence A (k Is the maximum absolute value of the signal samples.

실질적으로, 가중된 하위-시퀀스들(B(k))은 전처리된 시퀀스(Z(k))를 획득하도록 연쇄 회로(23)에 의해 연쇄된다. 하위-시퀀스들이 서로를 중첩하는 경우, 적절한 윈도우들(예컨대, Hanning windows)이 바람직하게 B(k)에 적용된다. 그것은 연쇄 단계(2)에 입력되는 전처리된 시퀀스(Z(k))이다.Substantially, the weighted sub-sequences B (k) are concatenated by chain circuit 23 to obtain a preprocessed sequence Z (k). If the sub-sequences overlap each other, appropriate windows (eg Hanning windows) are preferably applied to B (k). It is a preprocessed sequence Z (k) that is input to the chaining step 2.

도 6은 전처리 동작을 개략적으로 나타내는 도면들을 도시한다. 참조번호(61)는 하위-시퀀스들(A(k))로 분할되는 축적된 시퀀스(Y(k))를 나타낸다. 참조번호(62)는 가중된 하위-시퀀스들(B(k))을 연쇄함으로써 시퀀스(Z(k))가 획득됨을 나타낸다. 도시한 바와 같이, 각각의 하위-시퀀스(A(k))는 가중되어왔다. 동일한 가중치가 하위-시퀀스의 모든 신호 샘플들에 적용되어왔지만, 다른 가중치들은 다른 하위-시퀀스들에 적용된다. 신호 샘플들의 변화가 국소적으로 유지되는 반면, 신호 샘플들의 더욱 균일한 분포가 그 결과이다.6 shows diagrams schematically illustrating a preprocessing operation. Reference numeral 61 denotes an accumulated sequence Y (k) which is divided into sub-sequences A (k). Reference numeral 62 indicates that the sequence Z (k) is obtained by concatenating the weighted sub-sequences B (k). As shown, each sub-sequence A (k) has been weighted. The same weight has been applied to all signal samples of the sub-sequence, but different weights are applied to the other sub-sequences. While the change in the signal samples remains local, a more uniform distribution of the signal samples is the result.

도 4 및 도 5는 음악의 특정 피스에 대해 전처리 단계(2)의 효과를 실제적으로 도시한다. 상기한 바와 같이, 도 4의 번호(41)는 정규 음악 피스로부터 도출된 축적된 시퀀스(Y(k))를 나타낸다. 도 5의 번호(51)는 강한 15kHz의 톤으로 분포되는 동일한 음악의 피스로부터 도출된 축적된 시퀀스(Y(k))를 나타낸다. 시퀀스들은 1024 축적된 신호 샘플들을 포함한다. 참조 번호들(42 및 52)은 식(1)에 의해 규정된 바와 같이 주파수 영역에서 각각의 하위-시퀀스의 크기들을 정규화함으로써 획득된 대응하는 가중 시퀀스들(Z(k))을 나타낸다. 참조 번호들(43 및 53)은 식(2)에 의해 규정된 바와 같이 스케일링함으로써 획득된 대응하는 가중 시퀀스들(Z(k))을 나타낸다. 양측 음악 피스들에 대해, 특히 분포된 음악의 피스들을 제외하고, 도면들은 상당히 더 큰 상관 피크가 상관 단계에서 검출될 것으로 예측될 수 있음을 나타낸다.4 and 5 actually illustrate the effect of the preprocessing step 2 on a particular piece of music. As mentioned above, the number 41 in Fig. 4 represents the accumulated sequence Y (k) derived from the normal musical piece. Number 51 in Fig. 5 represents the accumulated sequence Y (k) derived from the same piece of music distributed in a strong 15 kHz tone. The sequences include 1024 accumulated signal samples. Reference numerals 42 and 52 denote corresponding weighting sequences Z (k) obtained by normalizing the magnitudes of each sub-sequence in the frequency domain as defined by equation (1). Reference numerals 43 and 53 denote corresponding weighting sequences Z (k) obtained by scaling as defined by equation (2). For both music pieces, especially except for pieces of distributed music, the figures show that a significantly larger correlation peak can be expected to be detected in the correlation step.

본 발명에 따라 워터마크 검출 방법으로 획득된 개선사항이 도 3에 도시된다. 본 도면에서, 짙은 선들은 음악의 정규 피스를 나타내고, 파선들은 음악의 분포된 피스들을 나타낸다. 짙은 선(31) 및 파선(32)는 이미 상술되었다. 짙은 선들(33 및 35)은 식(1)에 따른 가중 동작의 수행을 도시한다. 파선들(34 및 36)은 식(2)에 따른 가중 동작의 수행을 도시한다. 쉽게 확인할 수 있는 바와 같이, 모든 피크 상관값들은 피크 검출기(4)에 의해 이용되는 임계치(5σ)보다 위에 있다. 완전을 기하기 위해, 도 7은 동일 범례들(identical legends) 및 현재는 인코딩되고 그 후에 디코딩되는 mp3인 동일한 음악 피스들에 대한 참조 번호들과 함께 동일한 그래프를 도시한다.The improvement obtained with the watermark detection method according to the invention is shown in FIG. 3. In this figure, the dark lines represent normal pieces of music, and the dashed lines represent distributed pieces of music. The dark line 31 and the broken line 32 have already been described above. Dark lines 33 and 35 illustrate the performance of the weighting operation according to equation (1). The dashed lines 34 and 36 illustrate the performance of the weighting operation according to equation (2). As can be readily seen, all peak correlation values are above the threshold 5σ used by the peak detector 4. For the sake of completeness, FIG. 7 shows the same graph with the same legends and reference numbers for the same musical pieces that are mp3 that are now encoded and then decoded.

상기한 실시예들에서, 워터마크는 푸리에 계수들의 크기들의 극소 수정들에 의해, 즉 주파수 영역에서 나타난다. 그러나, 본 발명은 시간적 또는 공간적 (비디오) 영역에 임베딩된 워터마크의 검출에 동일하게 적용될 수 있음을 이해한다.In the above embodiments, the watermark is represented by the micro modifications of the magnitudes of the Fourier coefficients, ie in the frequency domain. However, it is understood that the invention is equally applicable to the detection of watermarks embedded in temporal or spatial (video) regions.

워터마크 검출 방법은 추측 신호와 워터마크 사이의 크로스-상관의 계산에 기초하여 개시된다. 상관에 반하는 연장된 기본 신호 컴포넌트들에 대해 더욱 강해지도록, 워터마크와 상관되는 신호 샘플들의 시퀀스(61)는 하위-시퀀스들(A(k))로 분할된다. 하위-시퀀스들은 수정된 하위-시퀀스들(B(k))을 획득하도록 가중 함수에 의해 처리되고, 개별적으로 원래의 신호 변화들을 나타내지만, 샘플값들의 더욱 균일한 분포를 집합적으로 나타낸다(62). 그에 따라, 신호의 기본 피크들은 실질적으로 감소된다.The watermark detection method is initiated based on the calculation of the cross-correlation between the speculative signal and the watermark. To be stronger for the extended basic signal components against the correlation, the sequence 61 of signal samples correlated with the watermark is divided into sub-sequences A (k). The sub-sequences are processed by a weighting function to obtain modified sub-sequences B (k), individually representing the original signal changes, but collectively representing a more uniform distribution of sample values (62) ). As such, the fundamental peaks of the signal are substantially reduced.

Claims

A method of detecting a watermark in a signal comprising computing a correlation between a sequence of signal samples and a predetermined watermark, and detecting whether the correlation exceeds a given threshold;

The method includes pre-processing the sequence of signal samples,

The pretreatment step is:

Dividing the sequence of signal samples into sub-sequences;

Weighting all signal samples of the sub-sequences with equal weights and varying the weights by sub-sequences to obtain a substantially uniform distribution of signal samples over the sequence; And

Concatenating the weighted sub-sequences to obtain a preprocessed sequence of signal samples.

2. The method of claim 1, further comprising accumulating a plurality of sequences of signal samples prior to correlation, wherein said preprocessing is applied to said cumulative sequences.

2. The method of claim 1, wherein dividing the sequence of signal samples into sub-sequences comprises dividing into overlapping sub-sequences.

4. The method of claim 3, wherein the overlap is 50%.

4. The method of claim 3, wherein dividing into overlapping sub-sequences applies a window function to the overlapping sub-sequences.

2. The method of claim 1, wherein the weighting step comprises Fourier transforming a sub-sequence of the signal samples, normalizing the magnitudes of Fourier coefficients, and inversely transforming the normalized coefficients.

2. The method of claim 1, wherein the weighting step comprises dividing all signal samples of a sub-sequence into a maximum signal sample of the sub-sequence.

11. An apparatus for detecting a watermark in a signal comprising computing means for calculating a correlation between a sequence of signal samples and a predetermined watermark, and threshold means for detecting whether the correlation exceeds a given threshold;

The apparatus comprises preprocessing means for preprocessing the sequence of signal samples,

The pretreatment means is:

Means for dividing the sequence of signal samples into sub-sequences;

Weighting means for weighting all signal samples of a sub-sequence with equal weights and changing said weights between sub-sequences to obtain a substantially uniform distribution of signal samples over said sequence; And

-Concatenating means for concatenating said weighted sub-sequences to obtain a preprocessed sequence of said signal samples.

A computer program product arranged to cause a computer executing a computer program to perform the method claimed in any one of claims 1 to 7.