KR20020031654A

KR20020031654A - Method and apparatus for embedding watermarks using fast fourier transformed data

Info

Publication number: KR20020031654A
Application number: KR1020000062195A
Authority: KR
Inventors: 황준성
Original assignee: 황준성; 토탈테크날리지 (주)한국지점
Priority date: 2000-10-23
Filing date: 2000-10-23
Publication date: 2002-05-03

Abstract

PURPOSE: A method and a system for inserting and extracting a watermark using Fourier transform are provided to additionally insert lots of information into a digital audio signal to offer various information items to a listener. CONSTITUTION: A digital audio watermark inserting system includes a watermark generator, a frequency converter(712), a critical frequency measuring unit(713), and a watermark summing unit(714). The watermark generator converts a watermark to be inserted into a digital audio signal into a bit stream. The frequency converter Fourier-transforms the digital audio signal. The critical frequency measuring unit calculates a critical frequency of the digital audio spectrum transformed by the frequency converter. The watermark summing unit calculates an intermediate value for a frequency band higher than the critical frequency, and controls the spectrum value upward or downward on the basis of the intermediate value according as the data value of the watermark bit stream is "0" or "1".

Description

Watermark Insertion and Extraction Method using Fourier Transform {METHOD AND APPARATUS FOR EMBEDDING WATERMARKS USING FAST FOURIER TRANSFORMED DATA}

본 발명은 디지털 오디오 신호에 워터마크를 삽입하고 추출하는 장치 및 방법에 관한 것으로, 특히 디지털 오디오 데이터를 주파수 영역(frequency domain)으로 푸리에 변환(Fourier Transform)을 통한 주파수 변환을 수행한 후에 워터마크를 삽입하는 기술에 관한 것이다.BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an apparatus and method for embedding and extracting a watermark into a digital audio signal. In particular, the present invention relates to a watermark after performing a frequency transformation through Fourier transform of digital audio data into a frequency domain. It is about a technique to insert.

최근 디지털 신호 기술과 네트워크 기술이 발달함에 따라 멀티미디어 정보를 손쉽게 받아보는 세계가 도래하고 있다. 특히, 디지털 오디오는 엠펙(MPEG) 등의 기술로 압축되어 인터넷을 통해 쉽게 전파되는 특성이 있으므로, 네트워크 상에서 불법적으로 유통되는 사례가 빈번히 발생할 수 있다.Recently, with the development of digital signal technology and network technology, a world that easily receives multimedia information is coming. In particular, since digital audio is compressed by a technology such as MPEG and easily propagated through the Internet, cases of illegal distribution on a network may frequently occur.

이와 같이, 디지털 오디오에 대한 저작권을 보호하기 위하여 데이터에 워터마크(watermark)를 삽입하거나, 공개키 알고리즘 등을 사용하여 데이터를 암호화하는 기술이 이용되고 있다. 디지털 오디오의 워터마킹 방법은 하위 비트의 부호화 방법, 위상 부호화 방법, 확산 스펙트럼(spread spectrum) 방법 및 반향 은폐 방법 등의 네 가지로 분류할 수 있다.As described above, in order to protect copyright of digital audio, a technique of embedding a watermark in the data or encrypting the data using a public key algorithm or the like is used. The watermarking method of digital audio can be classified into four types such as a lower bit encoding method, a phase encoding method, a spread spectrum method, and an echo concealment method.

종래 기술에 따른 하위 비트 조작 기법은 디지털 오디오의 샘플링 지점 (sampling point)에서, 비교적 중요도가 떨어지는 비트 스트림(bit stream)을 워터마크로 대치하는 기술이다. 예를 들어, 샘플링 비율이 8 KHz인 경우, 워터마크 정보는 최대 8 Kbit/sec가 될 수 있다.The lower bit manipulation technique according to the prior art replaces a relatively insignificant bit stream with a watermark at a sampling point of digital audio. For example, when the sampling rate is 8 KHz, the watermark information may be up to 8 Kbit / sec.

또한, 위상 부호화 기법은 디지털 오디오 데이터의 위상을 변화시키는 기법으로서, 통상적으로 음의 크기에는 민감하지만 위상의 변화에는 둔감하다는 점을 이용하여 위상 데이터에 워터마킹을 삽입하는 기술이다. 즉, 위상 부호화 기법은 시간 영역의 음악 데이터를 주파수 영역의 데이터로 변환을 한 후에, 워터마킹을 삽입하여 변화시킨 후에 다시 시간 영역으로 변화시키는 기술이다.In addition, the phase encoding technique is a technique for changing the phase of digital audio data, and is a technique for inserting watermarking into the phase data by using a sensitivity that is generally sensitive to sound volume but insensitive to a change in phase. That is, the phase coding technique is a technique of converting music data in the time domain into data in the frequency domain, inserting and changing the watermarking, and then changing the time data back to the time domain.

그런데, 위상의 변화는 매우 민감하므로 손실 압축에 쉽게 상실되어질 수 있다. 한편, 종래 기술에 따른 확산 스펙트럼 기법은 워터마크 정보를 어느 한 곳에 치우치지 않도록 광역화하여 확산시키는 기술로서, 종래의 주파수 영역법에 비하여 손실 압축과 재표본화 및 다른 데이터 조작에 대하여 내성을 지닌다.However, phase changes are very sensitive and can easily be lost to lossy compression. On the other hand, the spread spectrum technique according to the prior art is a technique for widening and spreading the watermark information so as not to be biased anywhere, and is more resistant to lossy compression, resampling, and other data manipulation than the conventional frequency domain method.

그러나, 확산 스펙트럼 기법은 노이즈 레벨을 유지하고 들리지 않도록 오디오 신호의 0.5% 범위 내의 크기로 워터마크를 약하게 하여야 하는 제한이 있다. 한편, 종래 기술에 따른 반향 은폐 기법은 디지털 오디오의 신호를 손실시키지 않으면서 강인한 워터마크를 삽입할 수 있는 기술로서, 메아리를 이용하여 정보를 숨길 수 있다. 그러나, 반향 은폐 기법은 반향 첨가 또는 선형 예측 켑스트럼과 같이 제3 영역에서 쉽게 마크가 깨질 수 있다는 단점을 지니고 있다.However, the spread spectrum technique has a limitation in that the watermark must be weakened to a size within the 0.5% range of the audio signal so as to maintain the noise level and not be heard. Meanwhile, the echo concealment technique according to the related art is a technique capable of inserting a strong watermark without losing a signal of digital audio, and may hide information using an echo. However, the echo concealment technique has the disadvantage that the mark can be easily broken in the third region, such as echo addition or linear prediction cepstrum.

따라서, 본 발명의 제1 목적은 디지털 오디오에 적용할 수 있는 워터마킹 방법과 장치를 제공하는데 있다.Accordingly, it is a first object of the present invention to provide a watermarking method and apparatus applicable to digital audio.

본 발명의 제2 목적은 상기 제1 목적에 부가하여, 디지털 오디오에 많은 정보를 추가로 삽입함으로써 청취자에게 다양한 정보를 제공할 수 있는 워터마킹 방법 및 장치를 제공하는데 있다.It is a second object of the present invention to provide a watermarking method and apparatus which can provide various information to a listener by additionally inserting a lot of information into digital audio in addition to the first object.

본 발명의 제3 목적은 상기 제1 목적에 부가하여, 디지털 오디오 원본 없이 워터마킹을 수행할 수 있는 방법 및 장치를 제공하는데 있다.It is a third object of the present invention to provide a method and apparatus capable of performing watermarking without a digital audio source in addition to the first object.

본 발명의 제4 목적은 상기 제1 목적에 부가하여, 주파수 영역에서 워터마킹을 수행할 수 있는 방법 및 장치를 제공하는데 있다.A fourth object of the present invention is to provide a method and apparatus capable of performing watermarking in the frequency domain in addition to the first object.

도1은 본 발명에 따른 디지털 오디오 워터마크 삽입 및 추출 시스템의 구성을 나타낸 도면.1 is a diagram showing the configuration of a digital audio watermark embedding and extraction system according to the present invention;

도2는 본 발명에 따른 워터마크 생성 장치의 구성을 나타낸 도면.2 is a diagram showing the configuration of a watermark generating apparatus according to the present invention;

도3은 본 발명에 따른 워터마크 삽입 장치의 구성을 나타낸 도면.3 is a diagram showing the configuration of a watermark embedding apparatus according to the present invention;

도4a 및 도4b는 본 발명에 따라 디지털 오디오 신호를 주파수 변환기를 통해 주파수 영역의 스펙트럼으로 고속 푸리에 변환을 수행한 결과를 나타낸 도면.4A and 4B show the results of performing a Fast Fourier Transform of a digital audio signal into a spectrum in the frequency domain through a frequency converter in accordance with the present invention.

도5는 쿠토시스 값이 큰 경우와 작은 경우의 주파수 스펙트럼의 분포를 비교하여 나타낸 도면.Fig. 5 is a diagram comparing the distribution of the frequency spectrum in the case of large and small kutosis values.

도6은 본 발명에 따라 주파수 영역에 워터마크가 삽입된 음악을 나타내는 도면.6 is a view showing music in which a watermark is inserted in a frequency domain according to the present invention;

도7은 본 발명에 따른 워터마크 추출 장치의 구성을 나타낸 도면.7 is a diagram showing the configuration of a watermark extraction apparatus according to the present invention;

<도면의 주요 부분에 대한 부호의 설명><Explanation of symbols for the main parts of the drawings>

50 :워터마크 정보50: Watermark information

51 :워터마크 데이터51: Watermark data

100 :워터마크 생성 장치100: watermark generator

200 :워터마크 삽입 장치200: watermark insertion device

400 :워터마크 추출 장치400: watermark extraction device

710 :비트 변환부710: bit conversion unit

711 :강도 조절부711: intensity control unit

712 :주파수 변환부712: frequency converter

713 :임계 주파수 측정부713: critical frequency measurement unit

714 :워터마크 합산부714: Watermark totaling department

715 :주파수 역변환부715: inverse frequency converter

716,716' :고속 푸리에 변환된 디지털 오디오의 스펙트럼의 예716,716 ': An example of the spectrum of a fast Fourier transformed digital audio

717,717' :고속 푸리에 변환된 디지털 오디오의 스펙트럼의 예717,717 ': Example spectrum of fast Fourier transformed digital audio

718, 719 :워터마크를 삽입할 영역718, 719: Area to insert watermark

720 :중간값 측정부720: middle value measuring unit

721 :비트 정보 변환부721: bit information converter

780 :중간값780: Middle value

783 :워터마크 삽입 영역783: Watermark insertion area

상기 목적을 달성하기 위하여, 본 발명은 디지털 오디오 데이터를 푸리에 변환(Fourier Transform)을 통한 주파수 변환을 수행한 후 워터마크를 삽입하는 기술로서, 고속 푸리에 변환(Fast Fourier Transform; FFT)을 이용하기 때문에 워터마킹 수행 속도가 신속하고, 임계 주파수 측정기와 청각 심리 모델을 이용함으로써 많은 양의 정보를 디지털 오디오 데이터에 삽입할 수 있다.In order to achieve the above object, the present invention uses a fast Fourier transform (FFT) as a technique for inserting a watermark after performing a frequency conversion through the Fourier Transform (Digital Audio data) The speed of watermarking is fast and a large amount of information can be inserted into the digital audio data by using a critical frequency meter and an auditory psychological model.

본 발명은 많은 양의 정보를 삽입할 수 있도록 함으로써, 저작권자에 대한 상세한 설명은 물론 다양한 부가 정보를 삽입하는 것을 가능하게 한다. 즉, 보통 사람이 민감하게 청취할 수 있는 최대 주파수는 약 6 KHz로서, 보통 디지털 오디오가 44.1 KHz의 주파수 대역폭을 지니는 것을 감안하면, 1초에 약 38.1 Kbit의 데이터를 워터마킹할 수 있게 된다.The present invention makes it possible to insert a large amount of information, thereby enabling the insertion of various additional information as well as a detailed description of the copyright holder. In other words, the maximum frequency that an average person can listen sensitively is about 6 KHz. Given that digital audio has a frequency bandwidth of 44.1 KHz, it is possible to watermark about 38.1 Kbits of data per second.

본 발명에서는 워터마크를 삽입하기 위하여 임계 주파수 측정기를 사용하고 있으며, 음악 저작권에 대한 정보 뿐 아니라 노래 가사 정보를 삽입하거나, 데이터 통신에도 이용될 수 있다.In the present invention, a threshold frequency meter is used to insert a watermark, and it can be used not only for music copyright information but also for song lyrics information or for data communication.

이하에서는, 첨부 도면 도1 내지 도7을 참조하여 본 발명에 따른 워터마킹 기술을 상세히 설명한다.Hereinafter, a watermarking technique according to the present invention will be described in detail with reference to FIGS. 1 to 7.

도1은 본 발명에 따른 디지털 오디오 워터마크 삽입 및 추출 시스템의 구성을 나타낸 도면이다. 도1을 참조하면, 워터마크 정보(50)가 워터마크 생성 장치 (100)를 통해서 비트 스트림(bit stream) 신호(W; 51)로 변환된다. 변환된 워터마크 비트 신호(W; 51)는 워터마크 삽입 장치(200)를 통하여 원음 OM(x) 디지털 오디오 신호(40)에 삽입되고, 워터마크가 삽입된 음악(210) WM(x)은 워터마크 추출 장치(400)가 내장된 오디오 플레이어에서 실시간으로 워터마크 인증을 하게 된다. 이 때에, 불법 사용자의 오디오 플레이어에서는 음악이 연주되지 않으며, 정당한 사용자 인증 정보(51)가 추출된 경우에만 음악이 연주되게 할 수 있다.1 is a diagram showing the configuration of a digital audio watermark embedding and extraction system according to the present invention. Referring to FIG. 1, the watermark information 50 is converted into a bit stream signal W 51 through the watermark generating apparatus 100. The converted watermark bit signal (W) 51 is inserted into the original audio OM (x) digital audio signal 40 through the watermark embedding apparatus 200, and the music 210 WM (x) having the watermark embedded therein is The watermark extraction apparatus 400 performs watermark authentication in real time on the embedded audio player. At this time, music is not played in the audio player of the illegal user, and the music can be played only when legitimate user authentication information 51 is extracted.

도2는 본 발명에 따른 워터마크 생성 장치(100)의 구성을 나타낸 도면이다. 도2를 참조하면, 워터마크 정보(50)는 비트 변환부(710)를 통해, 예를 들어 10101110… 등의 바이너리 비트 스트림으로 변환되고 강도 조절부(711)로 입력된다. 본 발명에 따른 강도 조절부(711)는 임계 주파수 측정부(713)에서 생성된 값(S)에 따라 바이너리 비트 스트림 워터마크 정보의 강도를 조절한다.2 is a diagram showing the configuration of a watermark generating apparatus 100 according to the present invention. Referring to FIG. 2, the watermark information 50 is transmitted through the bit converter 710, for example, 10101110. It is converted into a binary bit stream and the like, and is input to the intensity controller 711. The strength adjusting unit 711 according to the present invention adjusts the strength of the binary bit stream watermark information according to the value S generated by the threshold frequency measuring unit 713.

도3은 본 발명에 따른 워터마크 삽입 장치(200)의 구성을 나타낸 도면이다. 도3을 참조하면, 워터마크 삽입 장치(200)는 주파수 변환부(712), 임계 주파수 측정부(713), 워터마크 합산부(714), 주파수 역변환부(715) 등을 포함할 수 있다. 본 발명에 따른 주파수 변환부(712)는 원본 음악(40) OM(x)를 입력받고, 푸리에 변환(Fourier Transform)을 통해 주파수 성분으로 변환된다.3 is a diagram showing the configuration of a watermark embedding apparatus 200 according to the present invention. Referring to FIG. 3, the watermark embedding apparatus 200 may include a frequency converter 712, a threshold frequency measurer 713, a watermark adder 714, a frequency inverse converter 715, and the like. The frequency converter 712 according to the present invention receives the original music 40 OM (x), and is converted into frequency components through a Fourier transform.

본 발명의 양호한 실시예로서, 주파수 변환부(712)는 고속 푸리에 변환 회로 (Fast Fourier Transform; FFT) 또는 고속 푸리에 변환 프로그램 모듈로써 구현할 수 있다. 예를 들어서, 44.1 KHz의 밴드 대역폭을 지니는 CD 음악의 경우, 2048 비트 데이터에 대하여 고속 푸리에 변환을 수행하면 좌우 대칭인 주파수 스펙트럼을 얻게된다.In a preferred embodiment of the present invention, the frequency converter 712 may be implemented as a fast Fourier transform (FFT) or a fast Fourier transform program module. For example, for CD music with a band bandwidth of 44.1 KHz, fast Fourier transforms on 2048-bit data results in symmetric frequency spectrum.

도4a 및 도4b는 본 발명에 따라 디지털 오디오 신호를 주파수 변환기를 통해주파수 영역의 스펙트럼으로 고속 푸리에 변환을 수행한 결과를 나타낸 도면이다. 도4a를 참조하면, 원본 음악(40) 데이터가 넓은 주파수 대역(716, 716')에 대해 분포하는 것으로서, 이 경우 도4a에 도시된 사각형 영역(718)에 워터마크를 삽입할 때 삽입되는 워터마크 양은 적은 반면에 워터마크 강도가 크게 된다.4A and 4B illustrate a result of performing fast Fourier transform of a digital audio signal into a spectrum of a frequency domain through a frequency converter according to the present invention. Referring to FIG. 4A, the original music 40 data is distributed over a wide frequency band 716 and 716 ', in which case the water inserted when the watermark is inserted into the rectangular area 718 shown in FIG. 4A. The mark amount is small while the watermark intensity is large.

한편, 도4b를 참조하면, 원본 음악의 데이터가 비교적 좁은 주파수 대역 (717, 717')에 대해 분포하고 있으므로, 주파수 성분이 낮은 대역(719)에 워터마크를 삽입하더라도, 청취자가 워터마크가 삽입되었음을 인식할 가능성이 높게 된다. 따라서, 이 경우에는 삽입되는 워터마크 양은 크지만 워터마크의 강도를 강도 조절부(711)를 통해 작게 하여 삽입하게 된다. 이 때에, 강도 조절부(711)가 조절하는 강도 조절은 아래의 식에 의해 결정될 수 있다.On the other hand, referring to Fig. 4B, since the original music data is distributed over a relatively narrow frequency band 717, 717 ', even if the watermark is inserted into the band 719 with a low frequency component, the listener inserts the watermark. It is more likely to recognize that it is. Therefore, in this case, although the amount of watermark to be inserted is large, the intensity of the watermark is inserted through the intensity adjusting unit 711 to be inserted. At this time, the intensity control adjusted by the intensity control unit 711 may be determined by the following equation.

if F(t) ≤ Threshold_Freq 1, Scale = α1if F (t) ≤ Threshold_Freq 1, Scale = α1

if Threshold_Freq 1 < F(t) ≤ Threshold_Freq 2, Scale = α2if Threshold_Freq 1 <F (t) ≤ Threshold_Freq 2, Scale = α2

if Threshold_Freq 2 < F(t), Scale = α3if Threshold_Freq 2 <F (t), Scale = α3

여기서, F(t)는 임계 주파수 측정부(713)에서 측정된 주파수를 의미하고, 스케일(Scale)은 워터마크의 강도를 의미한다. 본 발명의 양호한 실시예로서, 스케일 강도는 임의 값을 설정할 수 있다.Here, F (t) refers to the frequency measured by the threshold frequency measuring unit 713, and Scale refers to the intensity of the watermark. As a preferred embodiment of the present invention, the scale intensity can set any value.

F(t)는 도4a 및 도4b에 도시된 사각형 영역(718, 719)을 정의하기 위하여 주파수 성분 스펙트럼 값이 현저히 작아지기 시작하는 부분의 코너 주파수(corner frequency)를 지정하게 된다. 이 때에, 임계 주파수 측정부(713)는 주파수 크기가작아지는 부분의 검지를 위하여 청각 심리 모델을 이용할 수 있다. 본 발명에 따른 양호한 실시예로서, 사람의 인식이 용이하지 않은 10 KHz 이상을 기준으로 F(t)를 산출할 수 있다.F (t) designates the corner frequency of the portion where the frequency component spectral values start to become significantly smaller in order to define the rectangular regions 718 and 719 shown in Figs. 4A and 4B. At this time, the threshold frequency measuring unit 713 may use the auditory psychological model for detecting the portion where the frequency magnitude is small. As a preferred embodiment according to the present invention, it is possible to calculate F (t) on the basis of 10 KHz or more, which is not easy for human recognition.

본 발명에 따른 임계 주파수 측정부(713)는 워터마크 삽입 영역(718, 719)의 산출을 위하여 쿠토시스(kurtosis)를 측정할 수 있다. 수학적으로 쿠토시스 값이 클수록 데이터 분포는 중앙에 집중하게 되고, 그 값이 영(zero) 이하인 경우에는 평활한 분포를 지니게 된다.The threshold frequency measuring unit 713 according to the present invention may measure kutosis to calculate the watermark embedding regions 718 and 719. Mathematically, the larger the value of the Kutosis, the more the data distribution is concentrated in the center. If the value is zero or less, the distribution is smooth.

도5는 쿠토시스 값이 큰 경우와 작은 경우의 주파수 스펙트럼의 분포를 비교하여 나타낸 도면이다. 도5를 참조하면, 쿠토시스 값이 커서 데이터 분포가 중앙 집중적일수록, 고주파 값들이 저주파 값들보다 작다는 것을 의미하므로 작은 변화에 대해서도 청취자들은 쉽게 인식할 수 있게 된다. 따라서, 이 경우 워터마크를 삽입할 영역(718, 719)을 좀 더 고주파 쪽으로 이동하여야만, 청취자가 인식할 수 없는 영역에 워터마크를 삽입할 수 있게 된다.FIG. 5 is a diagram showing a comparison of distributions of frequency spectra in the case of large and small kutosis values. FIG. Referring to FIG. 5, since the larger the data distribution is centralized, the higher the values of the kuthosis, the higher the frequency values are smaller than the low frequency values, so that the listeners can easily recognize the small changes. Therefore, in this case, the watermarks 718 and 719 need to be moved to a higher frequency side, so that the watermark can be inserted into an area that the listener cannot recognize.

이와 반대로, 쿠토시스 값이 작으면 작을수록 고주파 성분과 저주파 성분의 값의 크기가 크지 않게 됨을 의미하므로, 고주파 성분의 작은 변화에를 청취자가 감지하는 것이 용이하지 않게 된다. 따라서, 이 경우에는 앞의 경우보다 좀 더 저주파 쪽으로 임계 주파수를 설정함으로써 방대한 양의 워터마크 정보를 삽입할 수 있다.On the contrary, since the smaller the kutosis value means that the value of the high frequency component and the low frequency component are not large, it is not easy for the listener to sense a small change in the high frequency component. Therefore, in this case, a large amount of watermark information can be inserted by setting a threshold frequency toward a lower frequency side than in the previous case.

한편, 쿠토시스를 구하는 산출식은 다음과 같다.On the other hand, the calculation formula for obtaining the Kutosis is as follows.

여기서,이고 N은 데이터의 개수를 나타낸다.here, And N represents the number of data.

본 발명에 따른 일 실시예로서, 44.1 KHz의 대역폭 음질을 지니는 오디오 데이터에 대하여 2,048 비트로 샘플링을 한 경우, 쿠토시스 식으로부터 산출된 임계 주파수가 7 KHz부터 22.05 KHz에 해당하는 총 1,024 비트의 약 2/3 영역(대략 650 비트)에 대하여 워터마크 데이터를 삽입할 수 있게 된다.According to an embodiment of the present invention, when sampling 2,048 bits of audio data having a bandwidth of 44.1 KHz, a total of about 1,024 bits of the total frequency corresponding to a frequency of 7 KHz to 22.05 KHz, which is calculated from the Kutosis equation, is approximately 2 bits. Watermark data can be inserted in the / 3 area (approximately 650 bits).

본 발명에 따른 워터마크 삽입 기술은 단순히 저작권자의 저작권 보호를 위한 단순 정보뿐 아니라, 노래 가사를 비롯한 다양한 형태의 기타 정보를 방대한 양으로 주파수 도메인에서 청취자가 감지 못하는 수준 하에서 디지털 오디오에 삽입하는 것을 가능하게 한다.The watermark embedding technology according to the present invention enables to insert not only simple information for copyright protection of copyright holders, but also various kinds of other information including song lyrics in digital audio under the level that the listener cannot detect in the frequency domain. Let's do it.

다시 도3을 참조하면, 임계 주파수 측정기(713)로부터 출력된 신호는 워터마크 비트 스트림과 합산하는 워터마크 합산부(714)로 입력되고, 주파수 역변환부 (715)에 의해 시간 영역(time domain)의 디지털 오디오로 변환된다. 본 발명에 따른 워터마크 삽입 장치의 특징은 워터마크가 원본 신호에 합산될 때에, 워터마크 삽입 영역(718, 719)의 주파수 성분 스펙트럼 값을 중간값(average)으로 모두 선 처리된 후에 삽입이 되어진다는 점이다. 이와 같이, 중간값으로 선 처리함으로써 워터마크 비트 스트림을 합산하는 과정에서, 그 값이 "0" 또는 "1" 인가에 따라 스펙트럼 값을 상향 또는 하향함으로써 정보를 싣게 된다.Referring back to FIG. 3, the signal output from the threshold frequency measuring device 713 is input to the watermark adding unit 714, which is added with the watermark bit stream, and is time domaind by the frequency inverse transform unit 715. Is converted to digital audio. The feature of the watermark embedding apparatus according to the present invention is that when the watermark is added to the original signal, the watermark embedding apparatus is inserted after preprocessing all the frequency component spectrum values of the watermark embedding regions 718 and 719 into an average value. Is that you lose. As described above, in the process of summing the watermark bit streams by preprocessing the intermediate values, information is loaded by raising or lowering the spectral value according to whether the value is "0" or "1".

도6은 본 발명에 따라 주파수 영역에 워터마크가 삽입된 음악을 나타내는 도면이다. 도6을 참조하면, 워터마크 삽입 영역(783)을 전술한 쿠토시스 산출을 통하여 정의하고, 선 처리 과정을 통해 중간값(780)으로 평활화시키는 작업을 수행한다. 이어서, 비트 스트림으로 코딩(coding)된 워터마크 데이터(51)의 데이터 값에 따라, 그 값이 "1"인 경우에는 스펙트럼 값을 상향(781)하고, "0"인 경우에는 하향(782)함으로써 바이너리 정보를 싣게 된다.6 is a view showing music in which a watermark is inserted in a frequency domain according to the present invention. Referring to Fig. 6, the watermark embedding area 783 is defined through the above-described kuthosis calculation and smoothed to the intermediate value 780 through the line processing process. Subsequently, according to the data value of the watermark data 51 coded into the bit stream, if the value is "1", the spectral value is raised 781, and if it is "0", it is downward 778. By doing so, binary information is loaded.

도7은 본 발명에 따른 워터마크 추출 장치(400)의 구성을 나타낸 도면이다. 도7을 참조하면, 워터마크 추출기(400)는 주파수 변환부(712), 임계 주파수 측정부 (713), 중간값 측정부(720), 비트 정보 변환부(721) 등을 포함할 수 있다. 워터마크가 삽입된 디지털 오디오(210) WM(x)가 입력되면, 주파수 변환부(712)를 통하여 주파수 성분 스펙트럼으로 변환시킨 후 임계 주파수 측정부(713)를 통해 워터마크가 삽입되어 있는 위치를 파악한다.7 is a diagram showing the configuration of a watermark extraction apparatus 400 according to the present invention. Referring to FIG. 7, the watermark extractor 400 may include a frequency converter 712, a threshold frequency measurer 713, an intermediate value measurer 720, a bit information converter 721, and the like. When the digital audio 210 WM (x) having the watermark inserted therein is inputted, it is converted into a frequency component spectrum through the frequency converter 712 and then the position where the watermark is inserted through the threshold frequency measurement unit 713. Figure out.

이어서, 중간값 측정부(720)는 워터마크가 삽입되어 있는 영역(783)에 대해 중간값(780)을 계산하고, 각각의 데이터 포인트에 대하여 그 값보다 크면 신호 "1"(781)로 간주하고 작으면 "0"의 비트 신호(782)를 추출한다. 추출된 비트 신호는 비트 정보 변환기를 통해 저작권 정보로 변환 출력된다.Subsequently, the median measuring unit 720 calculates the median value 780 for the area 783 in which the watermark is inserted, and considers the signal "1" 781 if it is larger than the value for each data point. If small, the bit signal 782 of "0" is extracted. The extracted bit signal is converted into copyright information through a bit information converter.

전술한 내용은 후술할 발명의 특허 청구 범위를 보다 잘 이해할 수 있도록 본 발명의 특징과 기술적 장점을 다소 폭넓게 개설하였다. 본 발명의 특허 청구 범위를 구성하는 부가적인 특징과 장점들이 이하에서 상술될 것이다. 개시된 본발명의 개념과 특정 실시예는 본 발명과 유사 목적을 수행하기 위한 다른 구조의 설계나 수정의 기본으로서 즉시 사용될 수 있음이 당해 기술 분야의 숙련된 사람들에 의해 인식되어야 한다.The foregoing has outlined rather broadly the features and technical advantages of the present invention to better understand the claims of the invention which will be described later. Additional features and advantages that make up the claims of the present invention will be described below. It should be appreciated by those skilled in the art that the conception and specific embodiments of the disclosed subject matter can be used immediately as a basis for designing or modifying other structures for carrying out similar purposes to the present invention.

또한, 본 발명에서 개시된 발명 개념과 실시예가 본 발명의 동일 목적을 수행하기 위하여 다른 구조로 수정하거나 설계하기 위한 기초로서 당해 기술 분야의 숙련된 사람들에 의해 사용되어질 수 있을 것이다. 또한, 당해 기술 분야의 숙련된 사람에 의한 그와 같은 수정 또는 변경된 등가 구조는 특허 청구 범위에서 기술한 발명의 사상이나 범위를 벗어나지 않는 한도 내에서 다양한 변화, 치환 및 변경이 가능하다.In addition, the inventive concepts and embodiments disclosed herein may be used by those skilled in the art as a basis for modifying or designing other structures for carrying out the same purposes of the present invention. In addition, such modifications or altered equivalent structures by those skilled in the art may be variously changed, substituted, and changed without departing from the spirit or scope of the invention described in the claims.

이상과 같이, 본 발명은 디지털 오디오 신호의 FFT 변환 주파수 스펙트럼에 대하여, 청취자가 청취에 둔감한 주파수 영역에 워터마크를 삽입함으로써 방대한 양의 워터마크 정보를 삽입할 수 있게 된다.As described above, according to the present invention, a large amount of watermark information can be inserted by inserting a watermark into a frequency region insensitive to listening to the FFT converted frequency spectrum of the digital audio signal.

그 결과, 불법으로 유통되는 디지털 컨텐츠의 재생을 방지하고, 저작권 정보뿐 아니라 노래 가사와 같은 부가 정보를 함께 삽입할 수 있는 효과가 있다.As a result, it is possible to prevent the reproduction of illegally distributed digital contents and to insert not only copyright information but also additional information such as song lyrics.

Claims

Generating a watermark to be inserted into the digital audio signal as a bit stream;

Fourier transforming the digital audio signal into a frequency domain;

Calculating a threshold frequency with respect to a frequency domain spectrum of the Fourier transformed digital audio signal; And

Inserting the watermark bit stream in a frequency domain for a band having a frequency greater than the threshold frequency

Digital audio watermark embedding method comprising a.

The digital audio watermark embedding method of claim 1, wherein the measuring of the threshold frequency sets a threshold frequency by calculating kutosis for a Fourier transform spectrum of the digital audio signal.

The method of claim 1, wherein the embedding of the watermark bit stream in a frequency band greater than the threshold frequency is performed.

Adjusting the strength of the watermark bit stream

Digital audio watermark embedding method further comprising.

The method of claim 1, wherein the inserting of the watermark bit stream in a frequency region greater than the threshold frequency is performed.

Measuring an intermediate value for a frequency region greater than the threshold frequency;

For a frequency domain larger than the threshold frequency, the Fourier transform value at each data point is greater or less than the median, depending on whether the data value of the watermark bit stream is "1" or "0". Inserting the watermark bit stream by setting to a value

Digital audio watermark embedding method comprising a.

The method of claim 1, wherein the digital audio watermark embedding method

Fourier inverse transforming the watermarked frequency spectrum subsequently after inserting the watermark bit stream in a frequency domain

Digital audio watermark embedding method comprising a.

Fourier transforming the watermark-embedded digital audio signal into a frequency domain;

Calculating a threshold frequency with respect to a frequency domain spectrum of the Fourier transformed digital audio signal;

Measuring an intermediate value for a frequency band greater than the threshold frequency;

Extracting watermark bitstream information by determining a zero crossing based on the intermediate value at each data pointer for a frequency band larger than the threshold frequency;

Digital audio watermark extraction method comprising a.

A watermark generator for converting the watermark to be inserted into the digital audio signal into a bit stream in binary form;

A frequency converter for Fourier transforming the digital audio signal;

A threshold frequency measuring unit for calculating a threshold frequency for the spectrum of digital audio converted by the frequency converter; And

The watermark is calculated by calculating an intermediate value for a frequency band greater than the threshold frequency and adjusting the spectrum value up or down based on the intermediate value according to whether the data value of the watermark bit stream is "0" or "1". Watermark adder to insert

Digital audio watermark insertion device comprising a.

The digital audio watermark embedding apparatus according to claim 7, wherein the watermark generating unit further comprises a strength adjusting unit for adjusting the strength of the watermark bit stream.

8. The digital audio watermark embedding apparatus of claim 7, wherein the digital audio watermark embedding apparatus comprises an inverse frequency converting unit for performing Fourier inverse transform of the digital audio spectrum into which the watermark is inserted following the watermark adding unit.

A frequency converter for performing Fourier transform on the watermark-embedded digital audio signal;

A threshold frequency measuring unit for calculating a threshold frequency for the spectrum of digital audio converted by the frequency converter;

An intermediate value measuring unit for calculating an intermediate value for a frequency region greater than the threshold frequency; And

A bit information converter configured to extract watermark bitstream information by determining zero crossing for a frequency band greater than the threshold frequency based on the intermediate value calculated by the intermediate value measurer.

Digital audio watermark extraction apparatus comprising a.