JP2000330579A

JP2000330579A - Method and device for inserting watermark into music information and its program recording medium

Info

Publication number: JP2000330579A
Application number: JP11144274A
Authority: JP
Inventors: Yumiko Matsuura; 由美子松浦; Kenichi Minami; 憲一南; Atsuki Tomioka; 淳樹富岡; Kazuhiro Sugiyama; 和弘杉山
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1999-05-25
Filing date: 1999-05-25
Publication date: 2000-11-30

Abstract

PROBLEM TO BE SOLVED: To make noise caused by inserting a watermark less conspicuous. SOLUTION: In the watermark insertion method, a watermark inserting position is randomly determined with respect to musical sound signals. If one of the binary watermark information is '1', the musical sounds from an insertion position Pn to Xmsec are duplicated and made into as insertion signals. If the information is '0', the musical sounds from Pn+X to Xmsec are duplicated and inserted to an insertion position Pn as insertion signals. Note that before inserting the signals, the musical sounds from the position Pn to Xmsec are duplicated, the sound pressure is increased for 5 dB to produce relative signals, the signals are delayed for Ymsec, that is an allowable delay for the perception characteristic, and the signals are inserted into the musical sounds. Thus, the insertion signals are suppressed hearingwise by the relative signals.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】この発明は、広域電子計算機
網（インターネット）等の通信網を介し、音楽情報の配
信を行うシステムにおいて、例えば音楽情報の著作権を
保護するための情報、いわゆる電子透かしを音楽情報に
重畳する方法、装置及びプログラム記録媒体に関するも
のである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a system for distributing music information via a communication network such as a wide area computer network (Internet), for example, information for protecting the copyright of music information, a so-called digital watermark. And an apparatus for superimposing music information on music information.

【０００２】[0002]

【従来の技術】従来はインターネット等の通信網を介し
て音楽情報を配信するシステムにおいて、音楽情報を無
断で容易に複製されてしまうという問題があり、数々の
セキュリティ情報の埋め込み手法が考え出されている。
符号化された音楽情報に関しては、セキュリティデータ
をデジタルデータとして付加しておくことによってセキ
ュリティデータの読み込みを行い、複製を防止するなど
の手法が取られていた。しかし、符号化された音楽情報
から音声波形データに復号化されてしまうと、付加され
ていた情報が消滅し、セキュリティデータのない情報の
複製が可能になってしまう。そこで、復号化されてもセ
キュリティデータを失わないよう、音声波形データに関
してはＳ／Ｎ比を劣化させることなくセキュリティデー
タを埋め込む手法について検討されてきた。2. Description of the Related Art Conventionally, in a system for distributing music information via a communication network such as the Internet, there is a problem that music information is easily copied without permission, and various security information embedding methods have been devised. ing.
With respect to encoded music information, security data has been added as digital data to read the security data to prevent duplication and the like. However, if the encoded music information is decoded into audio waveform data, the added information disappears, and information without security data can be copied. Therefore, a method of embedding security data in audio waveform data without deteriorating the S / N ratio has been studied so that the security data is not lost even if the data is decoded.

【０００３】[0003]

【発明が解決しようとする課題】音声波形データに多く
の情報を含めたセキュリティデータを埋め込もうとする
際、雑音を少なくするために人間の耳で聴取することの
できない周波数領域にデータを埋め込むという手法が取
られている。しかし、ほとんどの符号化では、人間の耳
で聴取できない部分を削ることによってデータ量を削減
するという手法をとっているため、埋め込みデータが消
えてしまうということが生じる。When embedding security data including a great deal of information in audio waveform data, the data is embedded in a frequency region that cannot be heard by human ears in order to reduce noise. That technique is taken. However, in most encodings, a method of reducing the amount of data by removing portions that cannot be heard by the human ear is employed, and thus embedded data may disappear.

【０００４】[0004]

【課題を解決するための手段】そこで、この発明では埋
め込む情報は雑音として埋め込んでしまい、その雑音を
音として抑圧する信号を挿入することによって、埋め込
みデータの存在を知覚不可能にし、複製を防止するため
音楽情報に変更を行うと雑音を残すことを可能にし、さ
らに左右のチャネルに埋め込みデータを分散させること
によって、データ埋め込み位置の相関の解読が困難であ
るようにデータを埋め込む。Therefore, according to the present invention, information to be embedded is embedded as noise, and a signal for suppressing the noise as sound is inserted to make the presence of the embedded data inaudible and prevent duplication. Therefore, when the music information is changed, it is possible to leave noise when the data is changed, and furthermore, by embedding the data into the left and right channels, the data is embedded so that it is difficult to decipher the correlation between the data embedding positions.

【０００５】[0005]

【発明の実施の形態】この発明の多チャネル楽音への透
かし挿入法とその装置の構成は以下のようになってい
る。図１に示すように挿入情報生成部１で、デジタルデ
ータに変換された楽音データとその楽音データの属性を
入力とし、その楽音データに挿入する情報を２値の信号
として生成する。透かし情報挿入部２では、その楽音デ
ータ内の挿入情報を埋め込む位置を決定し、前記２値信
号の挿入情報を透かしとして挿入する。相対信号挿入部
３ではその挿入された信号により生じる雑音を音として
抑圧する信号を生成して楽音データに挿入する。透かし
情報解読部４で挿入信号を読み取り、その読み取った信
号を属性情報再生部５で属性情報に復元する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS The method of inserting a watermark into a multi-channel tone according to the present invention and the configuration of the apparatus are as follows. As shown in FIG. 1, the insertion information generation unit 1 receives as input the tone data converted into digital data and the attributes of the tone data, and generates information to be inserted into the tone data as a binary signal. The watermark information insertion unit 2 determines a position where the insertion information in the musical sound data is to be embedded, and inserts the insertion information of the binary signal as a watermark. The relative signal insertion unit 3 generates a signal that suppresses noise generated by the inserted signal as a sound, and inserts the signal into the musical sound data. The insertion signal is read by the watermark information decoding unit 4, and the read signal is restored to the attribute information by the attribute information reproducing unit 5.

【０００６】さらに具体的に述べると、挿入情報生成部
１内の属性情報生成部１１では、入力された属性情報か
ら、例えば権利情報等の変化することがまれで、重要度
が高く改ざんを最大限に防ぐ必要のある作曲者、著作権
者など楽音データに固有で普遍な情報Ｉｅと、例えば配
信元、利用者の利用状況、配信経路等の更新頻度が高く
高速な読み出し処理が必要とされる配信者、利用者など
の利用状況によって変動する情報Ｉｍとを生成する。More specifically, the attribute information generation unit 11 in the insertion information generation unit 1 rarely changes, for example, the right information, etc. from the input attribute information. The information Ie that is unique to music data, such as a composer and a copyright holder, that needs to be prevented as much as possible, and a high-speed readout process that requires a high update frequency, such as a distribution source, a user's usage status, and a distribution route, are required. And information Im that fluctuates depending on the use situation of the distributor, the user, and the like.

【０００７】挿入コード生成部１２では、属性情報生成
部１１で生成されたこれら挿入情報を、楽音データに挿
入可能な２値へ変換する。普遍な情報Ｉｅは情報生成に
時間を要しても解読が困難な形にする必要があるため、
例えば固有の２値化辞書を用いる、固有の符号化方法を
用いるなどして２値化を行う。一方、変動する情報Ｉｍ
については、経路（どのような経路で複製がなされたか
を示す経路）を追加していくこと、経路を迅速に特定す
ることを考量し、情報量を少なく、かつ高速に読み出し
可能である必要があるため、例えばＨuffman符号化など
広く一般的で高圧縮率、処理が少ないものを用い２値を
行う。The insert code generator 12 converts the insert information generated by the attribute information generator 11 into binary data that can be inserted into musical tone data. Since the universal information Ie needs to be in a form that is difficult to decipher even if it takes time to generate information,
For example, binarization is performed using a unique binarization dictionary, a unique encoding method, or the like. On the other hand, the changing information Im
As for (2), it is necessary to reduce the amount of information and to be able to read data at high speed in consideration of adding a path (a path indicating the type of duplication performed) and quickly specifying the path. For this reason, for example, binarization is performed using a widely-common, high-compression rate, and low-processing amount such as Huffman coding.

【０００８】透かし情報挿入部２は図２に示すように、
情報の挿入位置に規則性があると情報の取り出し、解読
が容易になるため、その困難性を求めるために、挿入位
置を分散させる必要があり、その元となる乱数を乱数発
生部２１で発生させる。乱数はこの楽音データに対して
は一意性を保つ必要があり、さらに解読の困難性をも保
つため、乱数の発生元となるシード値や幅、乱数発生ア
ルゴリズムを、情報提供者、情報利用者両者が共有す
る。[0008] As shown in FIG.
If the information insertion position has regularity, it is easy to extract and decode the information. Therefore, in order to find the difficulty, it is necessary to disperse the insertion position. Let it. Random numbers must maintain uniqueness with respect to this musical tone data, and in order to maintain the difficulty of decoding, the seed value and width from which random numbers are generated, the random number generation algorithm, and the information provider and information user Both share.

【０００９】時系列位置決定部２２では、乱数発生部２
１で発生した乱数の第１番目の値から、該当乱数を昇順
に並べるか降順に並べるかを決定する。挿入する時系列
点を決定するためには、第２番目の以後の発生乱数を、
挿入データの総ビット数分使用し、その乱数を第１番目
の値から決定された順に並べたリストを生成する。各値
はその音楽データを秒単位で表した各時系列点を表して
おり、該当時間に情報が挿入されることになる。The time-series position determining unit 22 includes a random number generating unit 2
From the first value of the random numbers generated in step 1, it is determined whether the random numbers are arranged in ascending or descending order. To determine the time series point to insert, the second and subsequent generated random numbers are
A list is generated by using the total number of bits of the insertion data and arranging the random numbers in the order determined from the first value. Each value represents each time-series point representing the music data in units of seconds, and information is inserted at the corresponding time.

【００１０】挿入チャネル決定部２３では、乱数発生部
２１で発生した乱数の第１番目の値が偶数か奇数かによ
り、左（右）のチャネルに偶数、奇数のどちらを割り当
てるかを決定する。挿入するチャネルを決定する第２番
目以後の乱数は、挿入データの総ビット数分について、
発生順にその値の偶数、奇数を判別し、挿入するチャネ
ルを決定する。The insertion channel determining unit 23 determines whether to assign an even number or an odd number to the left (right) channel depending on whether the first value of the random number generated by the random number generating unit 21 is an even number or an odd number. The second and subsequent random numbers for determining the channel to be inserted are as follows for the total number of bits of the insertion data.
The even and odd numbers of the values are determined in the order of occurrence, and the channel to be inserted is determined.

【００１１】挿入周波数決定部２４では、符号化により
削除されないと予測される部分で、かつ知覚的に判別が
困難な場所を選択する。一例として、符号化により削除
されない部分としては、聴覚心理モデルのマスクに入ら
ない部分が考えられることから、マスキングを受けない
部分を選択する。さらに、継続的に音が持続している部
分では多少の変化も知覚してしまうため、持続の短い周
波数を選択する、雑音を打ち消した後の少々の残響音が
残っても知覚しにくいよう近い周波数に音が集中してい
るところを選択するなどの処理を行い、これら３つの条
件を満たす周波数帯を情報挿入周波数と決定する。The insertion frequency determination unit 24 selects a part which is predicted not to be deleted by encoding and which is difficult to perceptually determine. As an example, a part that does not enter the mask of the psychoacoustic model is considered as a part that is not deleted by encoding, and therefore a part that is not subjected to masking is selected. Furthermore, since a slight change is perceived in the part where the sound is continuously sustained, a frequency with a short duration is selected, and even if a small amount of reverberation remains after canceling noise, it is hard to perceive Processing such as selecting a portion where sound is concentrated on the frequency is performed, and a frequency band satisfying these three conditions is determined as the information insertion frequency.

【００１２】挿入信号生成部２５では、例えばジャンル
による違いなど全く種類の異なる信号を透かしとして挿
入してしまうと、十分に抑圧することが困難になるた
め、挿入信号ＩＳｎとして時系列位置決定部２２と挿入
チャネル決定部２３で決定された挿入位置Ｐｎ付近の楽
音信号ＭＳ（ｎ）の複製を用いる。挿入コード生成部１
２で２値化された挿入情報Ｉｎの片方の値、例えば１と
０に２値化している場合は、その挿入位置に１を挿入す
る場合は、図４に示すように挿入位置Ｐｎから打ち消し
難いクリック音にならないような十分な長さＸ（単位ms
ec）前の楽音信号を複製する。クリック音として知覚さ
れない最低限の複製長Ｘは、挿入位置Ｐｎから初期値と
して与えられる参考信号長Ｓ（単位msec）前までの楽音
信号ＭＳ（Ｐｎ−Ｓ），…，ＭＳ（Ｐｎ）の平均周波数
Ｓp を算出し（２５１，図３）、以下の式（１）で近似
する。In the insertion signal generation unit 25, if completely different types of signals such as genres are inserted as watermarks, it is difficult to sufficiently suppress the signals. Therefore, the time series position determination unit 22 is used as the insertion signal ISn. And a copy of the tone signal MS (n) near the insertion position Pn determined by the insertion channel determination unit 23. Insertion code generator 1
When one value of the insertion information In binarized by 2 is binarized to, for example, 1 and 0, and 1 is inserted at the insertion position, the insertion is canceled from the insertion position Pn as shown in FIG. A length X (unit: ms) that does not cause a difficult click sound
ec) Duplicate the previous tone signal. The minimum copy length X that is not perceived as a click sound is the average of the tone signals MS (Pn-S),..., MS (Pn) from the insertion position Pn to the reference signal length S (unit: msec) given as an initial value. The frequency Sp is calculated (251, FIG. 3) and approximated by the following equation (1).

【００１３】つまり、参考信号長Ｓは楽音信号の周波数を分析できる
に必要な長さであり、例えば２００msecとし、その分析
周波数の平均周波数Ｓp が０以上１ＫHz以下であるか、
否かを判定（２５２），０＜Ｓp ＜１ＫHzであればＸ＝
１８７５／Ｓp＋１０msecとし（２５３），Ｓp が１ＫH
zを超えていればＸ＝１２msecとする（２５４）。この
挿入位置Ｐｎに挿入する挿入情報Ｉｎが１であるか否か
を調べ（２５５）、この挿入位置Ｐｎに０を挿入する場
合は、図４に示すように挿入位置Ｐｎから挿入信号が埋
め込まれる時間Ｘ分をあけ、その位置Ｐｎ＋Ｘからその
先の位置Ｐｎ＋２Ｘまでの時間Ｘの分の信号ＩＳｎ＝Ｍ
Ｓ（Ｐｎ＋Ｘ），…，ＭＳ（Ｐｎ＋２Ｘ）を複製する
（２５６，図３参照）。[0013] In other words, the reference signal length S is a length necessary for analyzing the frequency of the musical tone signal. For example, the reference signal length S is set to 200 msec.
(252), if 0 < Sp < 1 KHz, X =
1875 / Sp + 10 msec (253), where Sp is 1 KH
If it exceeds z, X = 12 msec (254). It is checked whether or not the insertion information In to be inserted into the insertion position Pn is 1 (255). When 0 is inserted into the insertion position Pn, an insertion signal is embedded from the insertion position Pn as shown in FIG. After a time X, the signal ISn = M for the time X from the position Pn + X to the position Pn + 2X after that.
S (Pn + X),..., MS (Pn + 2X) are duplicated (256, see FIG. 3).

【００１４】挿入情報Ｉｎが１の場合は図４に示すよう
に挿入位置Ｐｎに対し、時間Ｘ前の位置Ｐｎ−Ｘから時
間Ｘの分の信号ＩＳｎ＝ＭＳ（Ｐｎ−Ｘ），…，ＭＳ
（Ｐｎ）を複製する（２５７）。なお、前記式（１）
は、例えば１ＫHz以上では一方の耳に１２ミリ秒早く音
が到達すると、それより遅延してきた音はその耳に聞き
とれないという公知の聴覚特性により数値が選ばれたも
のである。When the insertion information In is 1, as shown in FIG. 4, the signal ISn = MS (Pn-X),..., MS for the time X from the position Pn-X before the time X with respect to the insertion position Pn.
(Pn) is copied (257). Note that the above equation (1)
Is a numerical value selected according to the well-known auditory characteristic that, for example, at 1 kHz or more, when a sound arrives at one ear earlier by 12 milliseconds, the sound delayed later cannot be heard by the ear.

【００１５】このように挿入信号の１，０に対応した複
製信号を得て、これを後述するように楽音信号の挿入位
置Ｐｎに透かし情報として挿入するが、その挿入部分は
楽音信号と透かし情報との合成信号となり、つまり雑音
となる。この雑音を抑圧するため、以下に述べるように
相対信号を作成して楽音信号に挿入する。左、右チャネ
ルの片方のチャネルからの音が１〜３０msecほど先行す
ると、後続の対応している他方のチャネルの音を知覚す
ることができず、後続する他方のチャネルの音圧を５dB
程度大きくすることにより知覚可能になり、同位相に聞
こえるという人間の左右の耳の知覚特性を用い、相対信
号挿入部では挿入信号が挿入されたチャネルの挿入位置
の楽音を、同じチャネルの後続データに音圧を上げ合成
したものから知覚するように変更するという手法で打ち
消しを行う。つまり、図５Ａに示すように、“１”を挿
入する場合は挿入位置Ｐｎから直前のＸmsecの楽音を複
製して矢印のように挿入位置ＰｎからＸmsecの部分に
挿入するが、その挿入前のＰｎからＸmsecの部分の楽音
を複製し（矢印のように取り出し）、これの音圧を上
げて矢印のようにＰｎ＋ＹからＸmsecの部分に挿入し
て、前記の挿入にもとづく雑音（合成）を抑圧する。
同様に図５Ｂに示すように、“０”を挿入する場合は、
Ｐｎ＋ＸからＰｎ＋２Ｘの楽音の複製を矢印のように
ＰｎからＸmsecの部分に挿入し、その挿入前のＰｎから
Ｘmsecの部分を矢印のように取り出し、音圧を上げて
矢印のようにＰｎ＋ＹからＸmsecの部分に挿入する。As described above, a duplicate signal corresponding to the inserted signal 1, 0 is obtained, and this is inserted as watermark information into the insertion position Pn of the tone signal as described later. , Ie, noise. In order to suppress this noise, a relative signal is created and inserted into a tone signal as described below. If the sound from one of the left and right channels precedes by about 1 to 30 msec, the sound of the other corresponding subsequent channel cannot be perceived, and the sound pressure of the other following channel becomes 5 dB.
The relative signal insertion unit uses the perceptual characteristics of the left and right ears of the human to hear the same phase, and the tone at the insertion position of the channel in which the insertion signal is inserted is converted to the subsequent data of the same channel. The noise is canceled by a method in which the sound pressure is increased and the synthesized sound is changed so as to be perceived. That is, as shown in FIG. 5A, when "1" is inserted, the musical tone of Xmsec immediately before the insertion position Pn is copied and inserted into the portion of Xmsec from the insertion position Pn as indicated by an arrow, but before the insertion. Duplicate the tone from Pn to Xmsec (taken out as indicated by the arrow), increase the sound pressure and insert it into the portion from Pn + Y to Xmsec as indicated by the arrow to suppress noise (synthesis) based on the insertion. I do.
Similarly, as shown in FIG. 5B, when inserting “0”,
A copy of the musical tone from Pn + X to Pn + 2X is inserted into the part from Pn to Xmsec as shown by the arrow, the part from Pn to Xmsec before the insertion is taken out as shown by the arrow, and the sound pressure is increased to increase the sound pressure from Pn + Y to Xmsec as shown by the arrow. Insert into the part.

【００１６】逆位相位置決定部３１では、図６に示すよ
うに透かし情報挿入部２で決定された挿入位置の時系列
位置を取得し、挿入チャネルのリストを生成し、各その
挿入位置から知覚特性により許容される遅延時間Ｙ（単
位 msec)後ろへずらした位置Ｙｎを、挿入された複製情
報にもとづく雑音を抑圧するための相対信号を挿入する
位置に決定する。この時、楽音信号が様々な周波数を持
つ多彩な音源を用いている場合と、音源数が少ない場合
とでは、後者の方をより遅延時間を短くして知覚困難に
する必要があるため、遅延時間Ｙを以下の式で求める。
あらかじめ周波数帯域を入力値Ｈ個のサブバンドＳbnに
分けておき、挿入位置Ｐｎから初期値として与えられる
参考信号長Ｓまでの楽音信号ＭＳ（Ｐｎ），…，ＭＳ
（Ｐｎ＋Ｓ）に対し、次数Ｎのフーリエ変換を行い（３
１−１）、その結果をサブバンドＳＢｎに対応させたと
き、対応する値の存在しているサブバンド数をＨｓとす
る。つまり、ｉ＝１，２，…Ｎ（Ｎ次数）を０に、また
スペクトルが存在するサブバンドの個数Ｈｓを０にそれ
ぞれ初期化し（３１−２），ｉ＜Ｎかを判定し（３１−
３），ｉ＜Ｎであれば、ｉのスペクトルＦｎ（ｉ）がサ
ブバンドの何れかにあるかを判定し（３１−４）であれ
ばＨｓを＋１し（３１−５），さらにｉを＋１してステ
ップ３１−３に戻り、ステップ３１−４でサブバンドに
なければステップ３１−６に移り、ステップ３１−３で
ｉがＮと等しくなれば、Ｈｓ／Ｈが１／１０以下である
かの判定を行う（３１−７）。The antiphase position determining section 31 acquires the time-series positions of the insertion positions determined by the watermark information inserting section 2 as shown in FIG. 6, generates a list of insertion channels, and perceives from the insertion positions. The position Yn shifted backward by the delay time Y (unit: msec) allowed by the characteristic is determined as a position where a relative signal for suppressing noise based on the inserted copy information is inserted. At this time, when the sound signal uses various sound sources having various frequencies and when the number of sound sources is small, the latter needs to have a shorter delay time to make it difficult to perceive. The time Y is obtained by the following equation.
The frequency band is divided in advance into H sub-bands Sbn having input values, and tone signals MS (Pn),..., MS from the insertion position Pn to the reference signal length S given as an initial value.
Fourier transform of order N is performed on (Pn + S) (3
1-1), when the result is made to correspond to the sub-band SBn, the number of sub-bands having the corresponding value is set to Hs. That is, i = 1, 2,..., N (Nth order) is initialized to 0, and the number Hs of subbands in which the spectrum exists is initialized to 0 (31-2), and it is determined whether i <N (31-
3) If i <N, it is determined whether the spectrum Fn (i) of i is in any of the subbands. If (31-4), Hs is incremented by 1 (31-5), and i is further increased. The value returns to step 31-3, and if it is not in the sub-band in step 31-4, the process proceeds to step 31-6. If i is equal to N in step 31-3, Hs / H is 1/10 or less. Is determined (31-7).

【００１７】最小遅延Ｌmin を１≦Ｌmin ，例えば２ms
ec，最大許容遅延Ｌmax をＬmax ≦３０かつＬmin ＜Ｌ
max ，例えば３０msecで初期値として設定しておき、Ｈ
ｓ／Ｈが１／１０以下ならＹ＝Ｌmin とし（３１−
８），Ｈｓ／Ｈが１／１０以下でなければＹ＝Ｌmax ×
（Ｈｓ／Ｈ）とする（３１−９）。Ｐｎ＋ＹをＹｎとす
る（３１−１０）。つまり、となる。The minimum delay Lmin is 1 ≦ Lmin, for example, 2 ms
ec, the maximum allowable delay Lmax is Lmax ≦ 30 and Lmin <L
max, for example, set as an initial value at 30 msec.
If s / H is 1/10 or less, Y = Lmin (31−
8) If Hs / H is not less than 1/10, Y = Lmax ×
(Hs / H) (31-9). Let Pn + Y be Yn (31-10). That is, Becomes

【００１８】相対信号生成部３２では、まず時系列位置
決定部２２と挿入チャネル決定部２３で決定された位置
の楽音信号を情報が挿入される長さＸの分ＭＳ（Ｐ
ｎ），…，ＭＳ（Ｐｎ＋Ｘ）を複製する。さらに、遅延
信号が先行信号と同じ位相に知覚されるに十分な音圧α
（単位dB）を決定し、複製した信号の音圧をα上げてお
く。知覚特性から、音圧レベルαは遅延時間Ｙと相関を
持っているが、この実施例では、遅延時間Ｙは前式
（２）より３０msec以下になるため、αは定数５とな
る。このようにして相対信号ＤＳ′が生成される。The relative signal generator 32 first converts the tone signal at the position determined by the time-sequence position determiner 22 and the insertion channel determiner 23 into an MS (P
n),..., MS (Pn + X). Furthermore, sound pressure α sufficient for the delayed signal to be perceived in the same phase as the preceding signal
(Unit: dB), and increase the sound pressure of the duplicated signal by α. From the perceptual characteristics, the sound pressure level α has a correlation with the delay time Y, but in this embodiment, the delay time Y is 30 msec or less from the equation (2), so α is a constant 5. Thus, the relative signal DS 'is generated.

【００１９】挿入信号挿入部３３では、挿入信号生成部
２５で生成された信号ＩＳｎを、時系列位置決定部２２
と挿入チャネル決定部２３で決定された位置の楽音信号
ＭＳ（Ｐｎ），…，ＭＳ（Ｐｎ＋Ｘ）に合成する。相対
信号挿入部３４では、相対信号生成部３２で生成された
相対信号ＤＳ′を逆位相位置決定部３１で決定された位
置Ｙｎの楽音信号ＭＳ（Ｙｎ），…，ＭＳ（Ｙｎ＋Ｘ）
に合成する。The insertion signal insertion unit 33 converts the signal ISn generated by the insertion signal generation unit 25 into the time series position determination unit 22.
, And MS (Pn + X) at the positions determined by the insertion channel determination unit 23. The relative signal insertion unit 34 converts the relative signal DS ′ generated by the relative signal generation unit 32 into the tone signals MS (Yn),..., MS (Yn + X) at the position Yn determined by the antiphase position determination unit 31.
To be synthesized.

【００２０】透かし情報解読部４は図７に示すように、
乱数発生部４１で、透かし情報挿入時に乱数を発生した
乱数発生部２１と同じ手法を用いて乱数の発生を行う。
時系列読出位置検出部４２，読出チャネル検出部４３，
読出周波数検出部４４については、それぞれ同様に透か
し情報挿入時の時系列位置決定部２２，挿入チャネル決
定部２３，挿入周波数決定部２４と同じ処理を行い、情
報の挿入位置を特定する。情報読出部４５では、読出周
波数検出部４４までの処理により特定された情報の挿入
位置から挿入情報の読み出しを行う。読み出された挿入
情報は２値情報再生部４６で２値情報に変換される。例
えば、前記特定された情報の挿入位置Ｐｎに対し、前記
例では情報“１”の場合にＰｎ−Ｘ〜Ｐｎの間の楽音信
号が複製されて挿入されたから、このＰｎ−Ｘ〜Ｐｎの
楽音信号と、前記読み出された挿入情報との相関をとる
と、情報“１”の場合には、大きな相関値が得られ、情
報“０”の場合は、小さな相関値となる。このことを利
用して読み出した挿入情報を２値情報に変換する。As shown in FIG. 7, the watermark information decrypting section 4
The random number generation unit 41 generates a random number using the same method as the random number generation unit 21 that generates a random number when inserting watermark information.
A time-series read position detector 42, a read channel detector 43,
The read frequency detector 44 similarly performs the same processing as the time series position determiner 22, the insertion channel determiner 23, and the insert frequency determiner 24 when watermark information is inserted, and specifies the information insertion position. The information reading unit 45 reads the insertion information from the insertion position of the information specified by the processing up to the reading frequency detection unit 44. The read insertion information is converted by the binary information reproducing unit 46 into binary information. For example, in the above example, when the information is "1", the tone signal between Pn-X and Pn is duplicated and inserted into the specified information insertion position Pn. When a correlation between the signal and the read insertion information is obtained, a large correlation value is obtained when the information is "1", and a small correlation value is obtained when the information is "0". Using this, the read insertion information is converted into binary information.

【００２１】属性情報再生部５は図８に示すように、要
求情報選択部５１では普遍情報Ｉｅと、変動情報Ｉｍ，
どちらの情報の要求がされているのかを判別し、属性情
報復号部５２に再生指示を出す。属性情報復号部５２で
は、要求情報選択部５１から要求された情報の復号化を
行う。普遍情報Ｉｅが要求されると固有の辞書を用いて
符号化をした場合は、復号辞書により復号化する。固有
の符号化を用いた場合は、該符号化に対する復号化を行
い属性情報の再生を行う。As shown in FIG. 8, the attribute information reproducing unit 5 uses the universal information Ie and the variation information Im,
It determines which information is requested, and issues a reproduction instruction to the attribute information decoding unit 52. The attribute information decoding unit 52 decodes the information requested by the request information selection unit 51. When the universal information Ie is requested and the encoding is performed using the unique dictionary, the encoding is performed using the decoding dictionary. When the unique encoding is used, the encoding is decoded and the attribute information is reproduced.

【００２２】上述では、この発明をステレオの楽音信号
に透かし情報を挿入する場合に適用したが、モノラルの
音楽信号に透かし情報を挿入する場合にも適用でき、同
様に３チャネル以上の楽音信号に挿入する場合にも適用
できる。要はこの発明は透かし情報の挿入にもとづく雑
音を、相対信号を挿入することにより音として抑圧され
たものとなるようにすることにある。In the above description, the present invention is applied to a case where watermark information is inserted into a stereo tone signal. However, the present invention can also be applied to a case where watermark information is inserted into a monaural music signal. Similarly, the present invention is applied to a tone signal having three or more channels. It is also applicable when inserting. In short, the present invention is to make noise based on the insertion of watermark information suppressed as sound by inserting a relative signal.

【００２３】上述した各部の機能はコンピュータにプロ
グラムを解読実行させて作用させることもできる。The functions of the respective parts described above can also be applied by causing a computer to decode and execute a program.

【００２４】[0024]

【発明の効果】この発明により、例えば不法な複製など
を防ぐための権利情報などを音楽情報に埋め込むことが
可能で、埋め込み情報による雑音を抑圧する相対信号を
挿入することにより、音楽情報の劣化を防ぐことがで
き、音楽情報に変更を加えると抑圧するために挿入した
相対信号により雑音が生じることから複製を防ぐこと、
複製しても埋め込みデータを残すことが可能になるた
め、音楽情報の配信に安全性を確保することができる。According to the present invention, for example, it is possible to embed right information for preventing illegal duplication or the like into music information, and to insert a relative signal for suppressing noise due to the embedded information, thereby deteriorating music information. To prevent duplication due to noise caused by the relative signal inserted to suppress when music information is changed,
Since the embedded data can be left even after the copy, the security of the distribution of the music information can be ensured.

【００２５】さらに、データを埋め込むことにより情報
量に変化が生じないこと、改ざんし難いことから、同じ
ダウンロード時間での様々な情報の埋め込み配信が可能
で、利用者に付加情報を提供することが可能となる。Furthermore, since the amount of information does not change by embedding data and it is difficult to falsify, it is possible to embed and distribute various information at the same download time, and to provide additional information to the user. It becomes possible.

[Brief description of the drawings]

【図１】Ａはこの発明の楽音情報への透かし挿入法の全
体の構成を表すブロック図、Ｂはその挿入情報生成部１
の流れを示す図である。FIG. 1A is a block diagram showing the overall configuration of a method for inserting a watermark into musical sound information according to the present invention, and FIG.
It is a figure showing the flow of.

【図２】図１Ａの透かし情報挿入部２の処理の流れを示
す図。FIG. 2 is a diagram showing a flow of processing of a watermark information insertion unit 2 of FIG. 1A.

【図３】図２中の挿入信号生成部２５の処理の流れを示
す図。FIG. 3 is a view showing a flow of processing of an insertion signal generation unit 25 in FIG. 2;

【図４】挿入信号の生成の様子を示す図。FIG. 4 is a diagram showing how an insertion signal is generated.

【図５】挿入信号と相対信号の各挿入の様子を示す図。FIG. 5 is a diagram showing a state of each insertion of an insertion signal and a relative signal.

【図６】図１Ａ中の相対信号挿入部３の処理の流れを示
す図。FIG. 6 is a diagram showing a flow of processing of a relative signal insertion unit 3 in FIG. 1A.

【図７】図１Ａ中の透かし情報解読部４の処理の流れを
示す図。FIG. 7 is a diagram showing a flow of processing of a watermark information decryption unit 4 in FIG. 1A.

【図８】図１Ａ中の属性情報再生部５の処理の流れを示
す図。FIG. 8 is a diagram showing a flow of processing of an attribute information reproducing unit 5 in FIG. 1A.

フロントページの続き (72)発明者富岡淳樹東京都新宿区西新宿三丁目19番２号日本電信電話株式会社内 (72)発明者杉山和弘東京都新宿区西新宿三丁目19番２号日本電信電話株式会社内Ｆターム(参考） 5B017 AA06 BA07 BB03 CA16 5B082 GA01 GA02 GC05 5D044 AB05 BC01 BC04 CC04 DE50 GK17 5J064 AA01 CA01 CC07 Continuing on the front page (72) Inventor Junki Tomioka 3-19-2 Nishishinjuku, Shinjuku-ku, Tokyo Japan Telegraph and Telephone Corporation (72) Inventor Kazuhiro Sugiyama 3-192-2 Nishishinjuku, Shinjuku-ku, Tokyo Japan F-term in Telegraph and Telephone Corporation (reference) 5B017 AA06 BA07 BB03 CA16 5B082 GA01 GA02 GC05 5D044 AB05 BC01 BC04 CC04 DE50 GK17 5J064 AA01 CA01 CC07

Claims

[Claims]

A step of generating watermark information from information to be inserted; a step of determining a watermark information insertion position for music information; a step of inserting the watermark information at the watermark information insertion position; Generating a relative signal that suppresses noise generated in the music information as sound based on the insertion of the watermark information; determining a relative signal insertion position at which the relative signal is inserted into the music information; Inserting the relative signal into a position.

2. The step of generating the relative signal includes the step of extracting the music information at the watermark information insertion position, and the step of obtaining the relative signal by increasing the sound pressure level of the extracted music information. The method of determining the relative signal insertion position is a process of finding a position delayed by a delay time allowed by perceptual characteristics with respect to the watermark information insertion position and setting the position as the relative signal insertion position. 1. A method for inserting a watermark into the music information described in 1.

3. The step of obtaining the position delayed by the delay time includes the step of frequency-analyzing the music information at the watermark information insertion position, and the ratio of the frequency present as a result of the analysis to the entire frequency band of the music information. 3. A method for inserting a watermark into music information according to claim 2, comprising the step of determining the delay time according to the following.

4. The step of generating the watermark information includes the step of converting the information to be inserted into binary information, and the step of converting the music immediately before the watermark information insertion position into one of the binary information. 4. The method according to claim 1, further comprising extracting information, extracting the music information slightly after the watermark information insertion position as the other of the binary information, and using the music information as the watermark information. Watermark insertion method to music information described in.

5. The music information includes a plurality of channels, and the watermark information insertion position determining step is a step of determining a watermark information insertion position by randomly distributing the watermark information insertion positions to the plurality of channels. Item 6. A method for inserting a watermark into music information according to any one of Items 1 to 4.

6. A means for generating watermark information from information to be inserted, a means for determining a watermark information insertion position for music information, a means for inserting the watermark information at the watermark information insertion position, Means for generating a relative signal for suppressing noise generated in the music information as sound based on the insertion of the watermark information; means for determining a relative signal insertion position for inserting the relative signal with respect to the music information; Means for inserting the relative signal at the insertion position.

7. The means for generating the relative signal comprises means for extracting the music information at the watermark information insertion position, and means for obtaining the relative signal by increasing the sound pressure level of the extracted music information. The relative signal insertion position determining means is a means for obtaining a position delayed by a delay time allowed by perceptual characteristics with respect to the watermark information insertion position and setting the position as the relative signal insertion position. 6. A watermark insertion device for music information described in 6.

8. The means for obtaining the position delayed by the delay time comprises: means for frequency-analyzing the music information at the watermark information insertion position; and the ratio of the frequency present as a result of the analysis to the entire frequency band of the music information. 8. The apparatus for inserting a watermark into music information according to claim 7, further comprising means for determining the delay time according to the following.

9. The watermark information generating means includes means for converting the information to be inserted into binary information, and as one of the binary information, the music information immediately before the watermark information insertion position. 9. A means for extracting information and extracting the music information slightly after the watermark information insertion position as the other of the binary information and using the extracted music information as the watermark information. A watermark insertion device for music information described in 1.

10. The music information comprises a plurality of channels, and the watermark information insertion position determining means is means for randomly determining the watermark information insertion position among the plurality of channels. Item 10. An apparatus for inserting a watermark into music information according to any one of Items 6 to 9.

11. A process for generating watermark information from information to be inserted, a process for determining a watermark information insertion position for music information, a process for inserting the watermark information at the watermark information insertion position, A process of generating a relative signal for suppressing noise generated in the music information as a sound based on the insertion of the watermark information, a process of determining a relative signal insertion position at which the relative signal is inserted into the music information, and a process of inserting the relative signal A recording medium recording a program for causing a computer of a device for inserting a watermark into music information to perform a process of inserting the relative signal into a position.

12. The relative signal generation process includes a process of extracting the music information at the watermark information insertion position, and a process of obtaining the relative signal by increasing the extracted music information by a predetermined amount. 12. The relative signal insertion position determining process is a process of obtaining a position delayed by a delay time allowed by perceptual characteristics with respect to the watermark information insertion position and setting the position as the relative signal insertion position. Recording medium.

13. A process for obtaining the position delayed by the delay time includes a process for frequency-analyzing the music information at the watermark information insertion position, and a ratio of a frequency present as a result of the analysis to the entire frequency band of the music information. 13. The recording medium according to claim 12, further comprising a process of determining the delay time according to the following.

14. A process for generating the watermark information, the process of converting the information to be inserted into binary information, and the process of converting the music immediately before the watermark information insertion position into one of the binary information. 14. A process according to claim 11, further comprising extracting information, extracting the music information just after the watermark information insertion position as the other of the binary information, and setting the music information as the watermark information. A recording medium according to claim 1.

15. The music information includes a plurality of channels, and the watermark information insertion position determination process is a process of randomly distributing and determining the watermark information insertion position among the plurality of channels. Items 11 to 14
The recording medium according to any one of the above.