JPH1168573A

JPH1168573A - Sound data processing method, sound data processor and record medium, recorded with sound data processing program

Info

Publication number: JPH1168573A
Application number: JP9217225A
Authority: JP
Inventors: Toshimichi Urata; 敏道浦田; Naoya Saito; 直哉斉藤; Akira Inoue; 彰井上
Original assignee: M KEN KK
Current assignee: M KEN KK
Priority date: 1997-08-12
Filing date: 1997-08-12
Publication date: 1999-03-09

Abstract

PROBLEM TO BE SOLVED: To easily confirm and grasp the existence of sound data which has been copied illegally by embedding one of bit data of security data into plural Fourier transform coefficient units respectively. SOLUTION: A blocked Fourier transform coefficient generating means 5 executes Fourier transform of blocked sound data in each block and calculates a Fourier transform coefficient. A unit-forming means 6 configures plural units, which are constituted of two Fourier transform coefficients that become a pair from plural Fourier transform coefficients that constitute a block. A security data embedding means 7 allocates one of bit data of security data to plural units respectively that have been configured by the unit-forming means. An inverse Fourier transform processing means 8 performs inverse Fourier transform processing of a blocked Fourier transform coefficient group, in which the security data has been embedded in each block and restores them to sound data based on the result.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、音声データの処理
方法及び音声データの処理装置に関するものであり、更
に詳しくは、音声データの不正なコピーを有効に防止す
る為の音声データの処理方法及び音声データの処理装置
に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an audio data processing method and an audio data processing apparatus, and more particularly, to an audio data processing method and an audio data processing method for effectively preventing unauthorized copying of audio data. The present invention relates to an audio data processing device.

【０００２】[0002]

【従来の技術】従来より、テープ、ディスク等の任意の
記録媒体に記録されているデジタル音声データ、楽音デ
ータ等の音声データが、当該記録媒体が一旦市場で販売
された後は、同音声データの著作権者の同意を得ること
なく、勝手にコピーされ、係る不正コピーが、大量に安
価に販売されていることから、著作権者の権利が不当に
侵害され、当著作権者が大幅な損害を被っているのが現
状である。2. Description of the Related Art Conventionally, audio data such as digital audio data and musical sound data recorded on an arbitrary recording medium such as a tape or a disk is transmitted once the recording medium is sold in the market. Without the consent of the copyright owner, the unauthorized copy is sold in large quantities at a low price, so the rights of the copyright owner are unjustly infringed and the copyright owner It is currently suffering damage.

【０００３】然も、現在では、係る不正なコピーを有効
に防止する手段は、実質的にはなく、法律上からも証拠
の確認が難しいので、係る不正なコピー、海賊版を訴追
する事が難し状態にある。係る、音声データの不正なコ
ピーを防止する方法には多くの方法が、提案されている
が、再生音の音質が低下する等の問題があり、実用には
なっていないのが現状である。[0003] At present, there is practically no means for effectively preventing such illegal copying, and it is difficult to confirm the evidence from the law, so it is difficult to prosecute such illegal copying or pirated copies. In state. Many methods have been proposed for preventing such unauthorized copying of audio data. However, there are problems such as deterioration in sound quality of reproduced sound, and at present it is not practically used.

【０００４】そこで、この問題を改善する為に、デジタ
ル音声データに、その音声データの著作権者、または、
その実施権者が、自らの意思によって、販売を許可した
真正な音声データである事を示す、デジタルビットデー
タから構成された暗号、署名データ等のセキュリティデ
ータを、当該音声データそのものに聴覚的に影響のない
ように埋め込み、その音声データの著作権、所有権など
の侵害を防ぐ様にする「データハイディング」の研究
が、最近活発に行われているが、これまでの処、効果的
なデータハイディング技術は開発されていない。Therefore, in order to improve this problem, the copyright holder of the digital audio data, or
The licensee audibly adds security data, such as encryption and signature data composed of digital bit data, indicating that the audio data is genuine audio data that has been authorized for sale, to the audio data itself. Research on "data hiding", which embeds data so that it has no influence and prevents infringement of copyright and ownership of the audio data, has been actively conducted recently. Data hiding technology has not been developed.

【０００５】[0005]

【発明が解決しようとする課題】本発明の目的は、上記
した従来技術の欠点を改良し、簡易な技術構成に基づ
き、予め所定の音声データに著作権を有している者、又
はそのライセンンスを得ている者が、自己の製品である
事を後でチェック出来るセキュリティデータを埋め込
み、それによって、自己の製品か否かの判断、不正にコ
ピーされたものであるか否かの判断等が容易に行う事の
出来る音声データの処理方法及び音声データの処理装置
を提供するものである。SUMMARY OF THE INVENTION It is an object of the present invention to improve the above-mentioned drawbacks of the prior art, and to provide a person who has a copyright on predetermined audio data in advance based on a simple technical configuration or a license thereof. Embeds security data that can be checked later to confirm that the product is its own, so that it is possible to determine whether or not it is its own product, whether or not it has been copied illegally, etc. An object of the present invention is to provide an audio data processing method and an audio data processing device that can be easily performed.

【０００６】[0006]

【課題を解決するための手段】本発明は上記した目的を
達成するため、基本的には以下に記載されたような技術
構成を採用するものである。即ち、本発明にかかる第１
の態様としては、特定の音声データにセキュリティデー
タを埋め込むに際し、当該特定の音声データを、当該セ
キュリティデータのデジタルビット数の少なくとも２倍
の数となる連続する複数個のブロックに分割した後、当
該各ブロック毎に当該ブロックの音声データをフーリエ
変換を実施して、各ブロック毎にフーリエ変換係数を求
めると共に、当該ブロック内に於ける当該複数個の各フ
ーリエ変換係数から、互いに最も遠距離に離れて位置す
る２つのフーリエ変換係数同志を第１の組とする第１ユ
ニットを形成し、次で、第１のユニットを構成した２つ
のフーリエ変換係数を除いて互いに最も遠距離に離れて
位置する２つのフーリエ変換係数同志を第２の組とする
第２ユニットを形成し、以下同様にして順次ユニットを
形成して、当該セキュリティデータのデジタルビット数
に対応する数のフーリエ変換係数ユニットを形成し、該
複数個のフーリエ変換係数ユニットのそれぞれに当該セ
キュリティデータのビットデータの一つを埋め込む様に
構成された音声データの処理方法であり、又、本発明
に於ける第２の態様としては、音声データ記憶手段、セ
キュリティデータ記憶手段、音声データブロック化手
段、ブロック化された音声データに対してブロック毎に
フーリエ変換を実行してフーリエ変換係数を求めるブロ
ック化フーリエ変換係数生成手段、当該ブロック内の当
該複数個のフーリエ変換係数から、対となる２つのフー
リエ変換係数で構成される複数個のユニットを構成する
ユニット形成手段、当該ユニット形成手段で形成された
複数個のユニットのそれぞれに対して、該セキュリティ
データのビットデータの一つを割り当てるセキュリティ
データ埋め込み手段、当該セキュリティデータを埋め込
まれたブロック化フーリエ変換係数群にブロック毎に逆
フーリエ変換処理を行いその結果を元の音声データに戻
す逆フーリエ変換処理手段、及び上記各手段を総合的に
制御する制御手段とから構成されている音声データの処
理装置である。The present invention basically employs the following technical configuration in order to achieve the above object. That is, the first of the present invention
As a mode of embedding, when embedding the security data in the specific audio data, the specific audio data is divided into a plurality of continuous blocks that are at least twice the number of digital bits of the security data, and Fourier transform is performed on the audio data of the block for each block to obtain a Fourier transform coefficient for each block, and the furthest away from the plurality of Fourier transform coefficients in the block. A first unit is formed with two Fourier transform coefficients located at the same distance as a first set, and then located farthest apart from each other except for the two Fourier transform coefficients that constituted the first unit. A second unit is formed in which two Fourier transform coefficients are used as a second set, and a unit is sequentially formed in the same manner. Audio data processing method configured to form a number of Fourier transform coefficient units corresponding to the number of digital bits of security data, and to embed one of the bit data of the security data in each of the plurality of Fourier transform coefficient units. Also, as a second aspect of the present invention, a voice data storage means, a security data storage means, a voice data blocking means, and a Fourier transform is executed for each block of the blocked voice data. Block Fourier transform coefficient generating means for obtaining a Fourier transform coefficient by means of: a unit forming means for forming a plurality of units composed of a pair of two Fourier transform coefficients from the plurality of Fourier transform coefficients in the block; For each of the plurality of units formed by the unit forming means, Security data embedding means for allocating one of bit data of security data, inverse Fourier transform for performing inverse Fourier transform processing for each block on a block of Fourier transform coefficients in which the security data is embedded, and returning the result to original voice data The audio data processing device includes a processing unit and a control unit that comprehensively controls the above units.

【０００７】[0007]

【実施の形態】本発明に係る音声データの処理方法及び
音声データの処理装置は、上記したような技術構成を採
用しているので、所定の音声データに対して、人間の聴
覚に影響の無い形式で特定のセキュリティデータ、例え
ば暗号データ、署名データ等を埋め込む事が出来、又容
易にそれを再生する事が可能となるのである。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS The audio data processing method and audio data processing apparatus according to the present invention employ the above-described technical configuration, so that predetermined audio data does not affect human hearing. It is possible to embed specific security data, for example, encryption data, signature data, and the like in a format, and it is possible to easily reproduce it.

【０００８】つまり、本発明に於いては、人間の聴覚特
性の一つとして挙げられる、「音声信号に於ける絶対位
相については鈍感であるが、相対位相に対しては、敏感
である。」という特性をうまく利用したものである。即
ち、一般的に、一次元音声データは、周期の異なる三角
関数を、基底関数群として、それらの一次結合の形で表
される。それらの、基底関数群からいくつか基底関数を
取り出し、それらの位相をいくらかずらした後、再びそ
れらの一次結合の形で表した音声データは、元の音声デ
ータと人間の耳では、全く聞き比べることができないと
言う関係にある。That is, in the present invention, one of the human auditory characteristics is cited as follows: "It is insensitive to the absolute phase of the audio signal, but sensitive to the relative phase." This is a good use of the characteristic. That is, in general, one-dimensional audio data is expressed in the form of a linear combination of trigonometric functions having different periods as a basis function group. After taking out some basis functions from the basis functions and shifting their phases by some degree, the voice data expressed in the form of their linear combination again is completely comparable to the original voice data and the human ear. There is a relationship that can not do.

【０００９】従って、本発明に於いては、このような、
人間の聴覚特性を用いて音声データに対して、適宜の署
名データ、暗号データ等のセキュリティデータを、各音
声信号の相対的な位相を変えないで絶対的位相を一部変
化させる事によって、当該セキュリティデータの挿入と
音質の低下を防止出来る効果とを実現させる事が可能と
なったものである。Therefore, in the present invention,
For audio data using human auditory characteristics, security data such as appropriate signature data and encryption data are partially changed in absolute phase without changing the relative phase of each audio signal. This makes it possible to realize the effect of inserting security data and preventing the deterioration of sound quality.

【００１０】[0010]

【実施例】以下に、本発明に係る音声データの処理方法
及び音声データの処理装置の具体例を図面を参照しなが
ら詳細に説明する。即ち、図１は、本発明に係る音声デ
ータの処理装置の１具体例の構成を示すブロックダイア
グラムであり、図中、音声データ記憶手段２、セキュリ
ティデータ記憶手段３、音声データブロック化手段４、
ブロック化された音声データに対してブロック毎にフー
リエ変換を実行してフーリエ変換係数を求めるブロック
化フーリエ変換係数生成手段５、当該ブロックを構成す
る複数個のフーリエ変換係数から、対となる２つのフー
リエ変換係数で構成される複数個のユニットを構成する
ユニット形成手段６、当該ユニット形成手段で形成され
た複数個のユニットのそれぞれに対して、該セキュリテ
ィデータのビットデータの一つを割り当てるセキュリテ
ィデータ埋め込み手段７、当該セキュリティデータを埋
め込まれたブロック化フーリエ変換係数群にブロック毎
に逆フーリエ変換処理を行いその結果を元の音声データ
に戻す逆フーリエ変換処理手段８、及び上記各手段を総
合的に制御する制御手段９とから構成されている音声デ
ータの処理装置１が示されている。BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 is a block diagram showing an embodiment of a voice data processing method and a voice data processing apparatus according to the present invention. That is, FIG. 1 is a block diagram showing the configuration of one specific example of the audio data processing apparatus according to the present invention. In FIG. 1, audio data storage means 2, security data storage means 3, audio data blocking means 4,
Blocked Fourier transform coefficient generation means 5 for performing a Fourier transform on each block of the speech data for each block to obtain a Fourier transform coefficient, a pair of two Fourier transform coefficients constituting the block, Unit forming means 6 forming a plurality of units formed by Fourier transform coefficients; security data for allocating one of the bit data of the security data to each of the plurality of units formed by the unit forming means; Embedding means 7, inverse Fourier transform processing means 8 for performing an inverse Fourier transform process for each block on the block of Fourier transform coefficients in which the security data is embedded, and returning the result to the original audio data, and the above means. Processing device 1 for audio data, comprising It is shown.

【００１１】更に、本発明に係る音声データの処理装置
１に於いては、上記の説明に於て明らかな様に、一旦セ
キュリティデータが埋め込まれた当該元の音声データに
対してフーリエ変換を施す手段１１と、セキュリティデ
ータを埋め込んでいるか否かが不明の被検査対象音声デ
ータを一旦記憶しておく被検査対象音声データ記憶手段
１２、該被検査対象音声データに同様のフーリエ変換を
施す手段１３、双方の音声データに関するフーリエ変換
処理後のデータを比較する比較手段１４及び被検査対象
音声データが、所定のセキュリティデータを含んでいる
か否かを判断する判別手段１５とを更に含んでいるもの
である。Further, in the audio data processing apparatus 1 according to the present invention, as is apparent from the above description, the original audio data in which the security data is embedded is Fourier-transformed. Means 11; voice data storage means 12 for temporarily storing voice data to be tested for which it is unknown whether or not security data is embedded; means 13 for applying the same Fourier transform to the voice data to be tested 13 And comparing means 14 for comparing the data after the Fourier transform processing with respect to both voice data, and determining means 15 for determining whether or not the voice data to be inspected includes predetermined security data. is there.

【００１２】本発明に係る音声データの処理方法を上記
図１の音声データの処理装置を使用して説明するなら
ば、本発明に於いては、先ず、所定の所定のセキュリテ
ィデータを埋め込む必要のある音声データを選択して、
所定の音声データ記録手段２に格納しておく。一方、当
該音声データに埋め込むべきセキュリティデータをセキ
ュリティデータ記憶手段３に格納する。If the audio data processing method according to the present invention is described using the audio data processing apparatus shown in FIG. 1, in the present invention, first, it is necessary to embed predetermined security data. Select some audio data,
It is stored in a predetermined voice data recording means 2. On the other hand, security data to be embedded in the audio data is stored in the security data storage unit 3.

【００１３】本発明に使用される当該セキュリティデー
タは、例えば１、０の何れかを取る２値ビットデータが
複数個直列に配列されたデータ列で構成されるものが使
用しえる。つまり、当該セキュリティデータをＡＢＣと
する場合には、当該各符号を例えば“１１１０１”、
“１００１１”、“００１１０”と言うように決めてお
く。As the security data used in the present invention, there can be used security data composed of a data string in which a plurality of binary bit data taking either 1 or 0 are serially arranged. That is, when the security data is set to ABC, each code is set to “11101”, for example.
It is decided to say “10011” or “00110”.

【００１４】勿論、本発明に於て使用される当該セキュ
リティデータは、上記の様な文字列のみではなく、画像
データでも使用可能である。次いで、上記で選択された
セキュリティデータを埋め込むべき音声データを当該音
声データ記録手段２から読み出し、音声データブロック
化手段４に於て複数個のブロックに分割するブロック化
処理を行うものである。Of course, the security data used in the present invention can be used not only for the above-described character strings but also for image data. Next, the audio data in which the security data selected as described above is to be embedded is read from the audio data recording means 2, and the audio data blocking means 4 performs block processing for dividing the data into a plurality of blocks.

【００１５】係るブロック化に於けるブロックの数は、
後述する様に、少なくとも該セキュリティデータを構成
するビットデータのビット数ｎの２倍である必要があ
り、好ましくは、２倍以上、より好ましくは２ｎ＋２で
ある。本発明に於て、該ブロックは、図２に示す様に、
上記の数のブロックが、複数個連続して配置される様に
構成する事が望ましい。The number of blocks in such blocking is as follows:
As will be described later, the number of bits of the bit data constituting the security data must be at least twice, preferably twice or more, and more preferably 2n + 2. In the present invention, the block is, as shown in FIG.
It is desirable that the above-mentioned number of blocks be configured so as to be continuously arranged.

【００１６】更に、本発明に於て、当該音声データの中
で上記連続する複数個のブロックが形成される位置は、
特に限定されるものではなく、任意の位置に配置形成さ
せる事が出来、又その複数個の当該ブロック群は、一個
に限ることはなく、繰り返して当該音声データの中に配
置しておく事が可能である。それによって、第３者が、
当該音声データの一部のみを切り出して使用した場合で
も、その不正なコピーを確認する事が可能となる。Further, in the present invention, the position where the plurality of continuous blocks are formed in the audio data is
It is not particularly limited, and can be arranged and formed at an arbitrary position. The plurality of block groups are not limited to one, and can be repeatedly arranged in the audio data. It is possible. Thereby, the third party
Even if only a part of the audio data is cut out and used, it is possible to confirm the illegal copy.

【００１７】本発明に於ける当該各ブロックの大きさは
特に限定されるものでは無いが、好ましくは２のべき乗
の数（２ⁿ) である事が望ましく、例えば、各ブロック
とも１ブロックのデータ数が２５６となる様にブロック
化する事が望ましい。図２は、上記の音声データのブロ
ック化の原理を説明する図であり、具体的には、埋め込
み対象となる音声データを一ブロックのサンプル数が２
のべき乗であるブロックの集まりとして表す事を示して
いる。In the present invention, the size of each block is not particularly limited, but is preferably a power of 2 (2 ⁿ ). For example, each block has one block of data. It is desirable to block the number so that it becomes 256. FIG. 2 is a diagram for explaining the principle of the above-described audio data blocking. More specifically, audio data to be embedded has two samples per block.
It is shown as a set of blocks that are powers of.

【００１８】つまり、今、一次元の実数値関数Sound( i
) ( iは0 以上の整数値 )を埋め込み対象音声データと
する。また、ブロックのサンプル数を2 個、 k、 j ∈
Ｚでi = 2k + j ( 0 ≦ j < 2ⁿ) である時、一次元の
実数値関数Ｂ_k( j) を、Ｂ_k( j) = Sound(i ) （式１）で表す。但し、Ｚは整数全体を表す。That is, now, a one-dimensional real-valued function Sound (i
) (i is an integer greater than or equal to 0) is the audio data to be embedded. Also, the number of samples in the block is 2, k, j ∈
When Z = i = 2k + j (0 ≦ j < ²ⁿ ), the one-dimensional real-valued function B _k (j) is expressed by B _k (j) = Sound (i) (Equation 1). However, Z represents the whole integer.

【００１９】次に、本発明に於いては、ブロック化フー
リエ変換係数生成手段５に於て、上記ブロック化された
それぞれの音声データの当該各ブロック毎に当該ブロッ
クの音声データをフーリエ変換を実施して、各ブロック
毎に複数個のフーリエ変換係数Ｆ(k) を求めるものであ
る。本発明に於て、音声データをブロック化する目的
は、このように、音声データに対してフーリエ変換を行
う為であり、当該フーリエ変換処理操作に要する時間を
短縮する為に採用するものである。Next, in the present invention, the Fourier transform of the audio data of the block is performed for each of the blocks of the audio data in the block by the Fourier transform coefficient generation means 5 for blocking. Then, a plurality of Fourier transform coefficients F (k) are obtained for each block. In the present invention, the purpose of blocking audio data is to perform a Fourier transform on the audio data as described above, and to reduce the time required for the Fourier transform processing operation. .

【００２０】係る音声データに対して、フーリエ変換を
行いフーリエ変換係数を求める事は一般的となっている
ので、ここでは詳細に説明する事は省略するが、基本的
概念に関してのみ、触れておく事にする。即ち、音声デ
ータはデジタルデータであるので、ここでは、離散フー
リエ変換について説明する。然しながら、本発明に於い
ては、高速フーリエ変換を採用しているが、この高速フ
ーリエ変換は、離散フーリエ変換を高速化したもので、
計算結果は離散フーリエ変換を施したものと全く同じで
ある。Since it is common to perform Fourier transform on such audio data to obtain Fourier transform coefficients, detailed description thereof will be omitted here, but only the basic concept will be described. I will do it. That is, since the audio data is digital data, the discrete Fourier transform will be described here. However, in the present invention, the fast Fourier transform is adopted, and this fast Fourier transform is a speedup of the discrete Fourier transform.
The calculation result is exactly the same as that obtained by performing the discrete Fourier transform.

【００２１】定義１( 離散フーリエ変換 ) 離散一次元実数値関数f(n) ( n ∈ Ｚ、0 ≦ n < N )
が与えられた時、（式２）で示す離散一次元複素数値関
数F(k) (k ∈ Ｚ、0 ≦ k < N )を、f(n)の離散フーリ
エ変換と呼ぶ。Definition 1 (Discrete Fourier Transform) Discrete one-dimensional real-valued function f (n) (n∈Z, 0 ≦ n <N)
Is given, a discrete one-dimensional complex-valued function F (k) (k∈Z, 0 ≦ k <N) represented by (Equation 2) is called a discrete Fourier transform of f (n).

【００２２】[0022]

【数１】 (Equation 1)

【００２３】定義１の離散フーリエ変換を全てのブロッ
クに対して施し、そのフーリエ変換係数のデータに対し
て、セキュリティデータビット列を埋め込む。次に、本
発明に於いては、上記の操作によって、各ブロック毎に
複数個のフーリエ変換係数F(k)が求まった後に、ユニッ
ト形成手段６に於て、上記ブロックの該複数個のフーリ
エ変換係数群の中から、図２に示す様に、先ず互いに最
も遠距離に離れて位置する２つのフーリエ変換係数F(0)
とフーリエ変換係数F(n)同志を第１の組とする第１ユニ
ットＵ１を形成する。The discrete Fourier transform defined in Definition 1 is applied to all blocks, and a security data bit string is embedded in the data of the Fourier transform coefficients. Next, in the present invention, after a plurality of Fourier transform coefficients F (k) are obtained for each block by the above operation, the unit forming means 6 sets the plurality of Fourier transform coefficients F (k) of the block. From the group of transform coefficients, as shown in FIG. 2, first, two Fourier transform coefficients F (0) located farthest apart from each other
And a Fourier transform coefficient F (n) as a first unit to form a first unit U1.

【００２４】つまり、図２に示すブロックＢ０に於いて
は、フーリエ変換係数F(0)とフーリエ変換係数F(n)とが
上記条件を満足するので、フーリエ変換係数F(0)とフー
リエ変換係数F(n)とで第１ユニットＵ１が形成される。
次で、第１のユニットを構成した２つのブロック、即ち
フーリエ変換係数F(0)とフーリエ変換係数F(n)を除いた
残りのフーリエ変換係数群の中で互いに最も遠距離に離
れて位置する２つのフーリエ変換係数同志を第２の組と
する第２ユニットを形成する。That is, in the block B0 shown in FIG. 2, since the Fourier transform coefficient F (0) and the Fourier transform coefficient F (n) satisfy the above condition, the Fourier transform coefficient F (0) and the Fourier transform The first unit U1 is formed with the coefficient F (n).
Next, in the two blocks constituting the first unit, that is, in the remaining Fourier transform coefficient group except for the Fourier transform coefficient F (0) and the Fourier transform coefficient F (n), the positions farthest apart from each other are set. A second unit is formed in which the two Fourier transform coefficients are used as a second set.

【００２５】つまり、図２に示すブロックＢ０に於いて
は、フーリエ変換係数F(1)とフーリエ変換係数F(n-1)と
が上記条件を満足するので、フーリエ変換係数F(1)とフ
ーリエ変換係数F(n-1)とで第２のユニットＵ２が形成さ
れる。次で、第１のユニットと第２のユニットを構成す
る４個のフーリエ変換係数、即ちフーリエ変換係数F
(0)、F(1)、F(n-1)及びF(n)を除いた残りのフーリエ変
換係数群の中で互いに最も遠距離に離れて位置する２つ
のフーリエ変換係数同志を第３の組とする第３ユニット
Ｕ３を形成する。That is, in the block B0 shown in FIG. 2, since the Fourier transform coefficient F (1) and the Fourier transform coefficient F (n-1) satisfy the above condition, the Fourier transform coefficient F (1) A second unit U2 is formed with the Fourier transform coefficient F (n-1). Next, four Fourier transform coefficients constituting the first unit and the second unit, that is, Fourier transform coefficients F
(0), F (1), F (n-1) and F (n), except for the remaining Fourier transform coefficient groups apart from each other, the two Fourier transform coefficients located farthest apart from each other are set to the third To form a third unit U3.

【００２６】つまり、図２に示すフーリエ変換係数群に
於いては、フーリエ変換係数F(2)とフーリエ変換係数F
(n-2)とが上記条件を満足するので、ブロックＢ２とブ
ロックＢ_N-2とが上記条件を満足するので、フーリエ変
換係数F(2)とフーリエ変換係数F(n-2)とで第３ユニット
Ｕ３が形成される。以下同様にして、２ｎ個のブロック
から順次ユニットを形成して、当該セキュリティデータ
のデジタルビット数ｎに対応する数であるｎ個のフーリ
エ変換係数ユニットを形成する。That is, in the Fourier transform coefficient group shown in FIG. 2, the Fourier transform coefficient F (2) and the Fourier transform coefficient F
Since (n-2) satisfies the above condition, since the block B2 and the block B _N-2 satisfy the above condition, the Fourier transform coefficient F (2) and the Fourier transform coefficient F (n-2) A third unit U3 is formed. Similarly, units are sequentially formed from 2n blocks to form n Fourier transform coefficient units corresponding to the digital bit number n of the security data.

【００２７】本発明に於いては、上記ブロック操作を実
行するに際し、当該ブロック群の最初と最後のブロック
のデータは使用しない方がフーリエ変換操作上から望ま
しい場合があるので、実際には、上記音声データから該
複数個のブロック群を形成するに際して、当該セキュリ
ティデータの構成ビット数ｎに対して２ｎ＋２の数のブ
ロックを形成し、係るブロック群の両端にある各ブロッ
クは、フーリエ変換には使用しない事が望ましい。In the present invention, when performing the above-mentioned block operation, it is sometimes desirable from the Fourier transformation operation not to use the data of the first and last blocks of the block group. In forming the plurality of block groups from the audio data, 2n + 2 blocks are formed with respect to the number n of constituent bits of the security data, and each block at both ends of the block group is used for Fourier transform. It is desirable not to do it.

【００２８】係る操作を実行した後、本発明に於いて
は、セキュリティデータ埋め込み手段７に於て、当該ユ
ニット形成手段６で形成された複数個のユニットＵ１〜
Ｕｎのそれぞれに対して、該セキュリティデータのビッ
トデータの一つを割り当てる操作が実行される。本発明
に於ける当該各ユニットに該セキュリティデータのビッ
トデータの一つを割り当てる操作としては、例えば、上
記した様に、埋め込むべき当該セキュリティデータがＡ
の符号であり、そのデジタルビットデータが“１１１０
１”となっていたとすると、第１のユニットＵ１には、
“１”のデータが埋め込まれ、第２のユニットＵ２に
は、“１”のデータが埋め込まれ、第３のユニットＵ３
には、“１”のデータが埋め込まれ、第４のユニットＵ
４には、“０”のデータが埋め込まれ、第５のユニット
Ｕ５には、“１”のデータが埋め込まれる様に操作が行
われる。After performing such an operation, in the present invention, in the security data embedding means 7, a plurality of units U1 to U1 formed by the unit forming means 6 are formed.
An operation of assigning one of the bit data of the security data to each of Un is performed. As an operation of assigning one of the bit data of the security data to each unit in the present invention, for example, as described above, the security data to be embedded is A
And the digital bit data is “1110”
1 ", the first unit U1 has
The data of “1” is embedded, the data of “1” is embedded in the second unit U2, and the third unit U3 is embedded in the second unit U2.
Is embedded with data of “1”, and the fourth unit U
The operation is performed such that the data of “0” is embedded in 4 and the data of “1” is embedded in the fifth unit U5.

【００２９】本発明に於ける当該セキュリティデータの
埋め込み方法の一例としては、例えば、当該セキュリテ
ィデータのビットデータが“１”である場合には、当該
ブロックのフーリエ変換係数の位相をπだけ移動させる
様に処理し、又当該セキュリティデータのビットデータ
が“０”である場合には、当該ブロックのフーリエ変換
係数の位相をπだけ移動させない様に処理するものであ
る。As an example of the security data embedding method in the present invention, for example, when the bit data of the security data is “1”, the phase of the Fourier transform coefficient of the block is shifted by π. When the bit data of the security data is “0”, the phase of the Fourier transform coefficient of the block is not shifted by π.

【００３０】より具体的には、当該セキュリティデータ
のビットデータが“１”である場合に、当該ブロックの
フーリエ変換係数の位相をπだけ移動させる様に処理す
る事は、当該ブロックのフーリエ変換係数に於ける実数
部分を虚数部に変換してその符号を逆につけ、一方当該
当該ブロックのフーリエ変換係数に於ける虚数部分は、
符号はそのままにして実数部分に変換する事を意味し、
又当該セキュリティデータのビットデータが“０”であ
る場合には、当該ブロックのフーリエ変換係数に於ける
実数部分も虚数部も何等変化させない様にする事を意味
する。More specifically, when the bit data of the security data is “1”, processing to shift the phase of the Fourier transform coefficient of the block by π is performed by changing the Fourier transform coefficient of the block. Is converted to an imaginary part and its sign is reversed, while the imaginary part of the Fourier transform coefficients of the block is
It means to convert the sign to the real part without changing the sign.
If the bit data of the security data is “0”, it means that neither the real part nor the imaginary part of the Fourier transform coefficient of the block is changed.

【００３１】係るセキュリティデータの埋め込み処理操
作の原理を簡単に以下に示しておく。任意のk( k ∈
Ｚ,k ∈ [0,全ブロック数] ) に対し、Ｂ_kを離散フー
リエ変換したものをF _kとする。また、埋め込みデータ
ビットを一次元の離散整数値関数Ｕ（d) (但し、 d,d
_n∈ Ｚ, d _n< 2 ^n-1, U(d) = 1 or 0 , d ∈ [1,d
_n] でU(d)=0) とし、F_k(m) ( m ∈ Ｚ, m ∈ [1, 2
ⁿ ] )に、このビットデータU(d)を埋め込んだ結果、得
られる関数をF'_k(m) とすると、F'_k(m) は、The principle of the security data embedding processing operation will be briefly described below. Any k (k ∈
Z, k ∈ [0, the total number of blocks]), and F _k is the result of discrete Fourier transform of B _k . Also, the embedded data bits are converted to a one-dimensional discrete integer value function U (d) (where d, d
_n Ｚ Z, d _n <2 ^n-1 , U (d) = 1 or 0, d ∈ [1, d
_n ], U (d) = 0), and F _k (m) (m∈Z, m∈ [1, 2
ⁿ ]), ^assuming that a function obtained as a result of embedding the bit data U (d) is F ′ _k (m), F ′ _k (m) becomes

【００３２】[0032]

【数２】 (Equation 2)

【００３３】と書ける。本発明に於いては、上記の方法
によって、所定のセキュリティデータを、所望の音声デ
ータに埋め込んだ後、逆フーリエ変換処理手段８に於
て、当該セキュリティデータを埋め込まれたブロック化
フーリエ変換係数群にブロック毎に逆フーリエ変換処理
を行い、その埋め込み変換結果を元の音声データに組み
込む操作が実行される。[0033] In the present invention, after the predetermined security data is embedded in the desired audio data by the above-described method, the inverse Fourier transform processing means 8 sets the blocked Fourier transform coefficient group in which the security data is embedded. Then, an operation of performing an inverse Fourier transform process for each block and incorporating the embedded conversion result into the original audio data is performed.

【００３４】此処で、本発明に係る逆フーリエ変換処理
の原理を簡単に説明しておくと、定義１( 離散フーリエ逆変換 ) 離散一次元実数値関数f(n) ( n ∈ Ｚ、0 ≦ n < N )
が与えられ、離散一次元複素数値関数Ｆ（k) (k ∈
Ｚ、0 ≦ k < N )が、f(n)の離散フーリエ変換であれ
ば、式７が成り立つ。Here, the principle of the inverse Fourier transform processing according to the present invention will be briefly described. Definition 1 (Discrete Fourier inverse transform) Discrete one-dimensional real-valued function f (n) (n∈Z, 0 ≦ n <N)
And a discrete one-dimensional complex-valued function F (k) (k ∈
If Z, 0 ≦ k <N) is a discrete Fourier transform of f (n), Equation 7 holds.

【００３５】[0035]

【数３】 (Equation 3)

【００３６】上記のF'_kにフーリエ逆変換を施して得ら
れた離散データ値をf'_kとすると、このf'_kがデジタル
音声データB _kにセキュリティデータビット列Ｕを埋め
込んだものに相当する。以下に、本発明に於けるセキュ
リティデータ埋め込み処理操作の１具体例を簡単なモデ
ル例について以下に説明する。[0036] When 'discrete data values obtained by performing inverse Fourier transform on the _k f' above F and _k, corresponding to the f _'k is by embedding the security data bit sequence U to the digital audio data B _k . Hereinafter, a specific example of the security data embedding processing operation according to the present invention will be described with reference to a simple model example.

【００３７】即ち、セキュリティデータ埋め込み対象音
声データをSound(k),埋め込みセキュリティビット列をs
yomei[j],埋め込み音声データSound を離散フーリエ変
換したデータをF[Sound](p),F[Sound](p) にセキュリテ
ィデータビット列を埋め込んだ結果を、F'[Sound](p)と
する。That is, the sound data to be embedded with security data is Sound (k), and the embedded security bit string is s.
yomei [j], the result of embedding the security data bit string in F [Sound] (p), F [Sound] (p) of the data obtained by performing a discrete Fourier transform of the embedded sound data Sound, and F '[Sound] (p) I do.

【００３８】此処で、Sound(k)は、整数空間上で定義さ
れている整数値を返す関数で、k=0,1,...,N とする。ま
た、syomei[j] も整数空間上で定義された関数で、０，
１のみを返す関数とし、j=０，１であるとする。ここで
は、次式でsyomei[j] を定義する。Here, Sound (k) is a function that returns an integer value defined on the integer space, and k = 0, 1,..., N. Syomei [j] is a function defined on the integer space, and 0,
A function that returns only 1 and j = 0,1. Here, syomei [j] is defined by the following equation.

【００３９】[0039]

【数４】 (Equation 4)

【００４０】このとき、F[Sound](p) は、整数空間上で
定義される関数で、虚数を返す関数として、式２により
次の式８で表される。但し、p=0,1,....,Nである。F は
虚数で、その実数部分をRe[F](p), 虚数部分をIm[F](p) 式８とすると、F'[Sound](p)は、式３、４，５，６、を用い
て、次式で表す。Syomei[0]=1 より、At this time, F [Sound] (p) is a function defined on an integer space, and is a function that returns an imaginary number, and is expressed by the following equation 8 using equation 2. Here, p = 0, 1,..., N. F is an imaginary number, and its real part is represented by Re [F] (p) and its imaginary part is represented by Im [F] (p). Equation 8 If F '[Sound] (p) is represented by Equations 3, 4, 5, 6 , And is represented by the following equation. From Syomei [0] = 1,

【００４１】[0041]

【数５】 (Equation 5)

【００４２】ここでも、Re,Im は、それぞれ｛｝内の
実数部と虚数部を表す。また、syomei[1]=0 より、Here, Re and Im respectively represent a real part and an imaginary part in {}. Also, from syomei [1] = 0,

【００４３】[0043]

【数６】 (Equation 6)

【００４４】この、F'を離散フーリエ逆変換することに
より、埋め込み音声データSound'(k)が得られる。ま
た、セキュリティデータを取り出すには、By performing an inverse discrete Fourier transform on F ′, embedded audio data Sound ′ (k) is obtained. Also, to retrieve security data,

【００４５】[0045]

【数７】 (Equation 7)

【００４６】Sound(k)とSound'(k) をそれぞれフーリエ
変換し、それぞれのフーリエ変換データを比較し、値が
違っていれば、署名ビット1を取り出し、値が同じであ
れば、セキュリティデータビット0 を取り出す。以下
に、そのアルゴリズムを簡単に示す。｛｝内はn
は1 から順番にNまで動く。Sound (k) and Sound '(k) are Fourier-transformed, respectively, and their Fourier-transformed data are compared. If the values are different, signature bit 1 is taken out. Extract bit 0. The algorithm is briefly described below. ｛N is n
Moves from 1 to N in order.

【００４７】｛ F[Sound](n) = F[Sound'](n)でなけれ
ば、syomei[n-1]=1 ；F[Sound](n) = F[Sound'](n)であ
れば、 syomei[n-1]=0｝上記した本発明に係る音声処理方法の工程手順の一例を
図３のフローチャートを参照しながら説明する。即ち、
本発明に係る特定の音声データに特定のセキュリティデ
ータを埋め込む方法の手順としては、先ずステップ
（１）に於いて、当該特定の音声データを選択し、ステ
ップ（２）に於いて当該セキュリティデータのデジタル
ビット数ｎを確認する。If F [Sound] (n) = F [Sound '] (n), syomei [n-1] = 1; F [Sound] (n) = F [Sound'] (n) If so, syomei [n-1] = 0} An example of the procedure of the above-described audio processing method according to the present invention will be described with reference to the flowchart of FIG. That is,
As a procedure of a method of embedding specific security data in specific audio data according to the present invention, first, in step (1), the specific audio data is selected, and in step (2), the security data of the security data is selected. Check the digital bit number n.

【００４８】その後ステップ（３）に進んで、当該選択
された特定の音声データを少なくとも当該セキュリティ
データのデジタルビット数ｎの２倍の数となる連続する
複数個のブロックに分割する処理が実行され、ステップ
（４）に於いて、当該各ブロック毎に当該ブロック中に
含まれる音声データに対してフーリエ変換を実行し、各
ブロック毎にフーリエ変換係数を求める。Thereafter, the process proceeds to step (3) to execute a process of dividing the selected specific audio data into a plurality of continuous blocks having at least twice the number of digital bits n of the security data. In step (4), a Fourier transform is performed on the audio data included in the block for each block, and a Fourier transform coefficient is obtained for each block.

【００４９】次いでステップ（５）に進み、当該一つの
ブロックに於ける当該複数個の各フーリエ変換係数の内
から、先ず互いに最も遠距離に離れて位置する２つのフ
ーリエ変換係数同志を第１の組と選択して第１のフーリ
エ変換係数ユニットを形成する工程と、第１のユニット
を構成した２つのフーリエ変換係数を除いた各フーリエ
変換係数の中で、互いに最も遠距離に離れて位置する２
つのフーリエ変換係数同志を第２の組と選択して第２の
フーリエ変換係数ユニットを形成する工程と、以下同様
にして順次ユニットを形成して、第３、第４、・・・第
ｎの組を形成し、当該セキュリティデータのデジタルビ
ット数に対応する数のフーリエ変換係数ユニットを形成
する工程とが連続的に実行され、ｎ組込のフーリエ変換
係数ユニットを形成する。Next, proceeding to step (5), from among the plurality of Fourier transform coefficients in the one block, first, two Fourier transform coefficients located farthest from each other are combined with each other as a first one. Selecting a set to form a first Fourier transform coefficient unit, and wherein the four Fourier transform coefficients except for the two Fourier transform coefficients constituting the first unit are located farthest apart from each other 2
Selecting two Fourier transform coefficients as a second set to form a second Fourier transform coefficient unit, and then sequentially forming units in the same manner to form a third, fourth,. Forming a set and forming a number of Fourier transform coefficient units corresponding to the number of digital bits of the security data is continuously performed to form n built-in Fourier transform coefficient units.

【００５０】その後、ステップ（６）に於いて、該複数
個のフーリエ変換係数ユニットのそれぞれに、当該セキ
ュリティデータのビットデータの一つを当てはめ当該セ
キュリティデータを該音声データに埋め込む操作が実行
され、ステップ（７）に於いて、当該セキュリティデー
タを埋め込んだ音声データのブロック化フーリエ変換係
数群を元の音声データに組み込む操作が実行される。Thereafter, in step (6), an operation of applying one of the bit data of the security data to each of the plurality of Fourier transform coefficient units and embedding the security data in the voice data is performed, In step (7), an operation of incorporating the group of blocked Fourier transform coefficients of the audio data in which the security data is embedded into the original audio data is performed.

【００５１】本具体例に於ける当該ステップ（６）の音
声データに埋め込む操作の一例は、図４に示すサブルー
チンで示される様な方法がある。即ち、ステップ（６）
に於いては、スタート後ステップ（２０）に於いて、所
定の位置、例えば先頭つまりユニットＵi （i=1)を選択
し、ステップ（２１）に於いて、該セキュリティデータ
の１番目のビットデータの値が１か否かが判断される。As an example of the operation of embedding the voice data in the step (6) in this specific example, there is a method shown by a subroutine shown in FIG. That is, step (6)
In step (20) after the start, a predetermined position, for example, the head, that is, unit Ui (i = 1) is selected, and in step (21), the first bit data of the security data is selected. Is determined to be 1 or not.

【００５２】当該ステップ（２１）でＮＯであれば、当
該ユニットＵ１に於けるフーリエ変換係数のＦ（１）と
Ｆ（ｎ）は、ステップ（２２）に於いて、πの位相変更
を実行せず、又当該ステップ（２１）でＹＥＳであれ
ば、当該ユニットＵ１に於けるフーリエ変換係数のＦ
（１）とＦ（ｎ）は、ステップ（２３）に於いて、πの
位相変更を実行する。If NO in step (21), the Fourier transform coefficients F (1) and F (n) in the unit U1 execute the phase change of π in step (22). If the answer is YES in step (21), the Fourier transform coefficient F in the unit U1 is used.
(1) and F (n) execute a phase change of π in step (23).

【００５３】その後、ステップ（２４）に進んで、ユニ
ットＵi のｉがｉ＝ｎかが判断され、ＹＥＳであれば、
当該操作はＥＮＤとなり、又該ステップ（２４）で、Ｎ
Ｏであれば、ステップ（２５）に進んで、ｉの値を１だ
け歩進させてステップ（２１）に戻に、上記ステップ
（２１）、ステップ（２２）及びステップ（２３）の各
操作が繰り返される。Thereafter, the process proceeds to a step (24), where it is determined whether or not i of the unit Ui is i = n.
The operation becomes END, and in step (24), N
If O, the process proceeds to step (25), in which the value of i is incremented by 1 and the process returns to step (21), and the operations of step (21), step (22), and step (23) are performed. Repeated.

【００５４】そして、ｎ個の全ユニットＵi にセキュリ
ティデータの各ビットデータが埋め込まれたと判断され
た場合に、当該サブルーチンが終了し、上記ステップ
（６）に戻る。又、上記ステップ（７）以降に於ける、
被検査対象音声データに対する検査処理操作或いは埋め
込まれたセキュリティデータの取り出し処理操作の一例
としては、図５に示すフローチャートに示されている様
に、スタート後、ステップ（３０）で当該セキュリティ
データが埋め込まれた音声データをブロック化して、ス
テップ（３１）に於いて当該各ブロックに対してフーリ
エ変換を施す操作が実行されると共に、ステップ（３
２）で当該セキュリティデータを埋め込む前の音声デー
タをブロック化して、ステップ（３３）に於いて、当該
各ブロックに対してフーリエ変換を施す操作が実行され
る。When it is determined that each bit data of the security data is embedded in all the n units Ui, the subroutine ends, and the process returns to the step (6). Also, after the above step (7),
As an example of the inspection processing operation for the voice data to be inspected or the extraction processing operation of the embedded security data, as shown in the flowchart of FIG. 5, after the start, the security data is embedded in step (30). An operation of dividing the obtained audio data into blocks and performing a Fourier transform on each of the blocks in step (31) is executed.
In step 2), the voice data before embedding the security data is divided into blocks, and in step (33), an operation of performing a Fourier transform on each block is performed.

【００５５】その後ステップ（３４）に進んで、両者の
各ブロックに於けるフーリエ変換係数同士が比較され、
つまり両者の係数が同一か否かが判断され、ＹＥＳであ
れば、ステップ（３５）に於いて、埋め込みによるフー
リエ変換係数の変更操作は存在しなかったと判断し、当
該セキュリティデータの当該ビットは０であると判断
し、ステップ（３４）でＮＯであれば、ステップ（３
６）に於いて、当該セキュリティデータの当該ビットは
１であると判断する。Thereafter, the flow advances to step (34) to compare the Fourier transform coefficients of the respective blocks with each other.
That is, it is determined whether or not the two coefficients are the same. If YES, it is determined in step (35) that the operation of changing the Fourier transform coefficient by embedding did not exist, and the corresponding bit of the security data is set to 0. And if NO in step (34), step (3)
In 6), it is determined that the bit of the security data is 1.

【００５６】本発明に於ける他の態様としては、特定の
音声データにセキュリティデータを埋め込む処理操作を
コンピュータに実行させるものであって、当該特定の音
声データを選択する工程、当該セキュリティデータのデ
ジタルビット数ｎを確認する工程、当該選択された特定
の音声データを少なくとも当該セキュリティデータのデ
ジタルビット数ｎの２倍の数となる連続する複数個のブ
ロックに分割する工程、当該各ブロック毎に当該ブロッ
ク中に含まれる音声データに対してフーリエ変換を実行
し各ブロック毎にフーリエ変換係数を求める工程、当該
ブロック内に於ける当該複数個の各フーリエ変換係数の
内から、先ず互いに最も遠距離に離れて位置する２つの
フーリエ変換係数同志を第１の組と選択して第１のフー
リエ変換係数ユニットを形成する工程、次で、第１のユ
ニットを構成した２つのフーリエ変換係数を除いた各フ
ーリエ変換係数の中で、互いに最も遠距離に離れて位置
する２つのフーリエ変換係数同志を第２の組と選択して
第２のフーリエ変換係数ユニットを形成する工程、以下
同様にして順次ユニットを形成して、第３、第４、・・
・第ｎの組を形成し、当該セキュリティデータのデジタ
ルビット数に対応する数のフーリエ変換係数ユニットを
形成する工程、該複数個のフーリエ変換係数ユニットの
それぞれに当該セキュリティデータのビットデータの一
つを当てはめ当該セキュリティデータを該音声データに
埋め込む工程、当該セキュリティデータを埋め込まれた
ブロック化フーリエ変換係数群にブロック毎に逆フーリ
エ変換処理を行い、その結果を元の音声データに戻す工
程、とから構成されている音声データの処理方法をコン
ピュータに実行させるプログラムを記録した記録媒体で
あり、更には、当該記録媒体は、当該セキュリティデー
タが埋め込まれた当該元の音声データに対してフーリエ
変換を施す工程、当該セキュリティデータを埋め込んで
いるか否かが不明の被検査対象音声データに同様のフー
リエ変換を施す工程、双方の音声データに関するフーリ
エ変換処理後のデータを比較する事により、被検査対象
音声データが、所定のセキュリティデータを含んでいる
か否かを判断する判別工程とを更に含んでいるものであ
る。In another aspect of the present invention, a process for embedding security data in specific audio data is performed by a computer, and the step of selecting the specific audio data, the digitalization of the security data, Checking the number of bits n, dividing the selected specific audio data into a plurality of continuous blocks at least twice the number of digital bits n of the security data, A step of performing a Fourier transform on the audio data contained in the block to obtain a Fourier transform coefficient for each block; first, from among the plurality of Fourier transform coefficients in the block, The two Fourier transform coefficients located apart from each other are selected as a first set, and the first Fourier transform coefficient unit is selected. Forming a first unit, and then, among the respective Fourier transform coefficients excluding the two Fourier transform coefficients constituting the first unit, the two Fourier transform coefficients located farthest from each other are referred to as a second Fourier transform coefficient. To form a second Fourier transform coefficient unit by selecting the set of .times.
Forming an n-th set and forming a number of Fourier transform coefficient units corresponding to the number of digital bits of the security data, wherein each of the plurality of Fourier transform coefficient units is one of bit data of the security data; Applying the security data to the audio data, performing an inverse Fourier transform process for each block on the block of Fourier transform coefficients in which the security data is embedded, and returning the result to the original audio data. A recording medium on which a program for causing a computer to execute a configured audio data processing method is recorded, and further, the recording medium performs a Fourier transform on the original audio data in which the security data is embedded. It is unknown whether the process or the security data is embedded A step of performing the same Fourier transform on the audio data to be inspected, and comparing the data after the Fourier transform processing for both audio data to determine whether or not the audio data to be inspected includes predetermined security data. And a discriminating step.

【００５７】[0057]

【発明の効果】本発明に係る該音声処理方法は、上記の
様な構成を有していることから、音声データを複数のブ
ロックに分割し、それぞれのブロック毎にフーリエ変換
を施し、フーリエ変換係数を求め、係るフーリエ変換係
数にセキュリティデータのそれぞれのビットデータの値
を埋め込む様にしたものであるから、人間の聴覚に関知
されない条件下で容易に所定のセキュリティデータを音
声データに埋め込み、又容易にそれを取り出せ、又不正
にコピーされた音声データの有無を容易に確認把握する
事が可能となる音声処理方法及び音声処理装置が提供さ
れるのである。Since the audio processing method according to the present invention has the above-described configuration, the audio data is divided into a plurality of blocks, and a Fourier transform is performed for each block. Since the coefficient is obtained and the value of each bit data of the security data is embedded in the Fourier transform coefficient, predetermined security data can be easily embedded in the voice data under a condition unrelated to human hearing. An audio processing method and an audio processing apparatus which can easily extract the audio data and can easily confirm and grasp the presence or absence of illegally copied audio data are provided.

[Brief description of the drawings]

【図１】図１は、本発明に係る音声処理装置の一具体例
の構成を示すブロックダイアグラムである。FIG. 1 is a block diagram showing a configuration of a specific example of an audio processing device according to the present invention.

【図２】図２は、本発明に於いて使用される音声データ
のブロック化の例を説明する図である。FIG. 2 is a diagram illustrating an example of blocking audio data used in the present invention.

【図３】図３は、本発明に係る音声データ処理方法の一
具体例の手順を説明するフローチャートである。FIG. 3 is a flowchart illustrating a procedure of a specific example of an audio data processing method according to the present invention.

【図４】図４は、本発明に係る音声データ処理方法の一
部に於いて使用されるサブルーチンをしめすフローチャ
ートである。FIG. 4 is a flowchart showing a subroutine used in a part of the audio data processing method according to the present invention.

【図５】図５は、本発明に係る音声データ処理方法の他
の具体例の手順を説明するフローチャートである。FIG. 5 is a flowchart illustrating a procedure of another specific example of the audio data processing method according to the present invention.

[Explanation of symbols]

１…音声データ装置２…音声データ記憶手段３…セキュリティデータ記憶手段４…音声データブロック化手段５…ブロック化フーリエ変換係数生成手段６…ユニット形成手段７…セキュリティデータ埋め込み手段８…逆フーリエ変換処理手段９…制御手段１０…セキュリティデータ埋め込み音声データ記憶手段１１…セキュリティデータ埋込み音声データのフーリエ
変換手段１２…元の音声データ若しくは被検査対象音声データ記
憶手段１３…フーリエ変換手段１４…比較手段１５…判別手段DESCRIPTION OF SYMBOLS 1 ... Audio data apparatus 2 ... Audio data storage means 3 ... Security data storage means 4 ... Audio data blocking means 5 ... Blocked Fourier transform coefficient generation means 6 ... Unit formation means 7 ... Security data embedding means 8 ... Inverse Fourier transform processing Means 9 ... Control means 10 ... Security data embedded voice data storage means 11 ... Security data embedded voice data Fourier transform means 12 ... Original voice data or test object voice data storage means 13 ... Fourier transform means 14 ... Comparison means 15 ... Judgment means

Claims

[Claims]

When embedding security data in specific audio data, the specific audio data is divided into a plurality of continuous blocks each having at least twice the number of digital bits of the security data. For each block, the audio data of the block is subjected to a Fourier transform to obtain a Fourier transform coefficient for each block, and the Fourier transform coefficients within the block are located at the farthest distance from each other. To form a first unit having two Fourier transform coefficients as a first set, and then the two units located farthest apart from each other except for the two Fourier transform coefficients that constituted the first unit. Forming a second unit having the Fourier transform coefficients as a second set, forming units sequentially in the same manner, and A method of processing voice data, comprising forming a number of Fourier transform coefficient units corresponding to the number of digital bits of data, and embedding one of the bit data of the security data in each of the plurality of Fourier transform coefficient units. .

2. When embedding one of the bit data of the security data in each of the plurality of Fourier transform coefficient units, if the bit data of the security data is 1, the embedding of the Fourier transform coefficient unit is performed. 2. The audio data processing method according to claim 1, wherein the phase of the coefficient value is shifted by π.

3. Embedding one of the bit data of the security data in each of the plurality of Fourier transform coefficient units, if the bit data of the security data is 1, 2. The audio data processing method according to claim 1, wherein the imaginary part and the real part in the coefficient value are exchanged, and the sign of the real part is inverted.

4. A security data-embedded data obtained by performing an inverse Fourier transform process on a series of blocked Fourier transform coefficients including a plurality of Fourier transform coefficients in which the security data is embedded is converted into original voice data. 4. The method for processing audio data according to claim 1, wherein the audio data is incorporated.

5. An object to be inspected which performs a Fourier transform for each block of audio data including a Fourier transform coefficient in which the security data is embedded and which is blocked, and in which whether or not the security data is embedded is unknown. A method for processing audio data, wherein the audio data is subjected to a similar Fourier transform, and the two are compared to inspect the audio data to be inspected.

6. An audio data storage unit, a security data storage unit, an audio data blocking unit, and a blocked Fourier transform coefficient generation unit for performing a Fourier transform on each of the blocked audio data to obtain a Fourier transform coefficient. Means, a plurality of Fourier transform coefficients in the one block, a unit forming means constituting a plurality of units composed of a pair of two Fourier transform coefficients, a plurality of units formed by the unit forming means Security data embedding means for allocating one of the bit data of the security data to each of the units; performing an inverse Fourier transform process for each block on a group of blocked Fourier transform coefficients in which the security data is embedded; Inverse Fourier transform processing means incorporated in the audio data of And a control means for comprehensively controlling each of the above means.

7. A means for performing a Fourier transform on the original voice data in which security data is embedded, wherein the voice data to be inspected is unknown whether or not the security data is embedded. Means for performing a similar Fourier transform to the voice data, and discriminating means for determining whether or not the voice data to be inspected includes predetermined security data by comparing the data after the Fourier transform processing for both voice data. 7. The audio data processing device according to claim 6, further comprising:

8. When embedding the security data in the specific audio data, selecting the specific audio data, confirming the number of digital bits n of the security data, at least transmitting the selected specific audio data to the specific audio data. Dividing the security data into a plurality of continuous blocks each having twice the number of digital bits n; performing a Fourier transform on the audio data included in each block for each block; Obtaining a Fourier transform coefficient; from among the plurality of Fourier transform coefficients in the block, first, two Fourier transform coefficients located farthest apart from each other are selected as a first set and a first set is selected. Forming a Fourier transform coefficient unit of the following: Next, the two Fourier transform coefficients constituting the first unit are removed. Of the Fourier transform coefficients that were located, the two Fourier transform coefficients located farthest apart from each other are referred to as the second.
Forming a second Fourier transform coefficient unit by selecting a set of
... forming an n-th set and forming a number of Fourier transform coefficient units corresponding to the number of digital bits of the security data; Embedding the security data in the audio data by applying one, performing an inverse Fourier transform process for each block on the block of Fourier transform coefficients in which the security data is embedded, and incorporating the result in the original audio data; and And a method for processing audio data.

9. performing a Fourier transform on the original voice data in which the security data is embedded;
A step of performing the same Fourier transform on the audio data to be inspected for which it is unknown whether or not the security data is embedded.By comparing the data after the Fourier transform processing for both the audio data, the audio data to be inspected is 9. The method for processing audio data according to claim 8, further comprising a determining step of determining whether the security data includes predetermined security data.

10. When embedding security data in specific audio data, a step of selecting the specific audio data, a step of confirming a digital bit number n of the security data, Dividing the security data into a plurality of continuous blocks each having twice the number of digital bits n; performing a Fourier transform on the audio data included in each block for each block; Obtaining a Fourier transform coefficient; from among the plurality of Fourier transform coefficients in the block, first, two Fourier transform coefficients located farthest apart from each other are selected as a first set and a first set is selected. Forming a Fourier transform coefficient unit of the following, where the two Fourier transform coefficients forming the first unit are: Among the Fourier transform coefficients except for the two, the two Fourier transform coefficients located farthest apart from each other are defined as the second
Forming a second Fourier transform coefficient unit by selecting a set of
... forming an n-th set and forming a number of Fourier transform coefficient units corresponding to the number of digital bits of the security data; Applying one of the security data to the audio data, performing an inverse Fourier transform process for each block on the block of Fourier transform coefficients in which the security data is embedded, and returning the result to the original audio data; The recording medium which recorded the program which makes a computer perform the processing method of the audio | voice data comprised from this.

11. A step of performing a Fourier transform on the original voice data in which the security data is embedded, and performing a similar Fourier transform on the voice data to be inspected for which it is unknown whether or not the security data is embedded. And a discriminating step of judging whether or not the voice data to be inspected includes predetermined security data by comparing the data after the Fourier transform processing for both voice data. A recording medium storing a program for causing a computer to execute the method for processing audio data according to claim 10.