JP6056260B2

JP6056260B2 - Sound processing apparatus, sound processing method and program

Info

Publication number: JP6056260B2
Application number: JP2012183408A
Authority: JP
Inventors: 武史城
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2012-08-22
Filing date: 2012-08-22
Publication date: 2017-01-11
Anticipated expiration: 2032-08-22
Also published as: JP2014041255A

Description

本発明は、透かし信号が埋め込まれた音響信号を処理する音処理装置、音処理方法及びプログラムに関する。 The present invention relates to a sound processing apparatus, a sound processing method, and a program for processing an acoustic signal in which a watermark signal is embedded.

音響信号に対して、人間には知覚できない領域に情報を埋め込む情報透かし技術が知られている。音響信号への情報透かし技術は著作権情報やデータ配信などへの応用が期待されており、様々な手法が提案されている。例えば、エコーハイディング（Echo-Hiding）が広く知られている（例えば特許文献１参照）。 An information watermarking technique for embedding information in an area that cannot be perceived by humans with respect to an acoustic signal is known. Information watermarking technology for acoustic signals is expected to be applied to copyright information and data distribution, and various methods have been proposed. For example, Echo-Hiding is widely known (see, for example, Patent Document 1).

エコーハイディングは、音響信号を単位時間ごとに区切り、各区間に人間には知覚困難な人工的なエコーを重畳し、そのエコーの遅延時間を用いて情報を表現する技術である。図１は、エコーハイディングによる情報透かしの一例を示す図である。 Echo hiding is a technique that divides an acoustic signal into unit time, superimposes an artificial echo that is difficult for humans to perceive in each section, and expresses information using the delay time of the echo. FIG. 1 is a diagram illustrating an example of an information watermark by echo hiding.

例えば、図１に示すように、'００'，'０１'，'１０'，'１１'というシンボルにそれぞれ対応する遅延時間を０．１ｍｓｅｃ，０．２ｍｓｅｃ，０．３ｍｓｅｃ，０．４ｍｓｅｃとする。このとき、各区間に４つのエコーの中から１つを重畳した場合、人工エコーが埋め込まれた各区間は、それぞれ２ｂｉｔの情報を表現することになる。 For example, as shown in FIG. 1, the delay times corresponding to the symbols “00”, “01”, “10”, and “11” are 0.1 msec, 0.2 msec, 0.3 msec, and 0.4 msec, respectively. . At this time, when one of the four echoes is superimposed on each section, each section in which the artificial echo is embedded expresses 2-bit information.

音響信号からデータを抽出する際には、ケプストラム係数列と言われる特徴量を用いる。ケプストラム係数列は、以下の式（１）で与えられる。
Ｃｅｐｓｔｒｕｍ＝ＩＦＦＴ（ｌｏｇ（ａｂｓ（ＦＦＴ（Ｘ））））・・・式（１）
Ｘ：デジタル録音されたオーディオサンプルのベクトル
ＦＦＴ：フーリエ変換を実行する関数
ａｂｓ：絶対値取得
ｌｏｇ：対数（対数の底は任意）
ＩＦＦＴ：逆フーリエ変換を実行する関数
式（１）により得られたベクトル列は、ケプストラム係数列と呼ばれ、入力音源に重畳音が含まれているかを解析するために用いられる。 When extracting data from an acoustic signal, a feature value called a cepstrum coefficient sequence is used. The cepstrum coefficient sequence is given by the following equation (1).
Cepstrum = IFFT (log (abs (FFT (X)))) (1)
X: vector of digitally recorded audio samples FFT: function for performing Fourier transform abs: absolute value acquisition log: logarithm (base of logarithm is arbitrary)
IFFT: Function for performing inverse Fourier transform A vector sequence obtained by equation (1) is called a cepstrum coefficient sequence, and is used to analyze whether an input sound source includes superimposed sound.

図２は、ケプストラム解析結果の一例を示す図である。入力音源に対して時間Ｔだけ遅延させたエコー音を重畳したとする。この場合、ケプストラム解析した結果では、ケプストラム空間上のＴの位置に鋭いピークを観測することができる。図２に示すグラフの横軸を、時間軸に対してケフレンシと称する。 FIG. 2 is a diagram illustrating an example of a cepstrum analysis result. It is assumed that an echo sound delayed by time T is superimposed on the input sound source. In this case, as a result of the cepstrum analysis, a sharp peak can be observed at the position T in the cepstrum space. The horizontal axis of the graph shown in FIG. 2 is referred to as quefrency with respect to the time axis.

受信側の抽出処理では、予め埋め込まれ得る人工エコーの遅延時間は既知であるという前提を用いる。人工エコー群に対応する位置のケプストラム係数の中から最大の値を持つケフレンシが選択され、そのケフレンシ（遅延時間）に対応するシンボルが埋め込まれているものと判定される。人口エコー群に対応する位置とは、図１に示す場合、０．１ｍｓｅｃ，０．２ｍｓｅｃ，０．３ｍｓｅｃ，０．４ｍｓｅｃである。 The extraction process on the reception side uses the premise that the delay time of an artificial echo that can be embedded in advance is known. The quefrency having the maximum value is selected from the cepstrum coefficients at the position corresponding to the artificial echo group, and it is determined that the symbol corresponding to the quefrency (delay time) is embedded. In the case shown in FIG. 1, the positions corresponding to the artificial echo group are 0.1 msec, 0.2 msec, 0.3 msec, and 0.4 msec.

近年、ＭＥＭＳ（Micro Electro Mechanical Systems）センサや低電力マイクロコントローラによって、携帯電話やタブレットＰＣ（Personal Computer）、ＰＣなどの電力はますます低くなっている。一方で、音響信号のデジタルデータは、加速度センサや温度センサなどの他のセンサと比較してサンプリングレートが高いため、相対的に処理負荷が高い。ここで、受信信号電力と、サンプリング毎の閾値とを比較してサンプリングレートを制御する技術がある（例えば特許文献２参照）。 In recent years, the power of mobile phones, tablet PCs (Personal Computers), PCs, and the like has become lower and lower due to MEMS (Micro Electro Mechanical Systems) sensors and low-power microcontrollers. On the other hand, since the digital data of the acoustic signal has a higher sampling rate than other sensors such as an acceleration sensor and a temperature sensor, the processing load is relatively high. Here, there is a technique for controlling the sampling rate by comparing the received signal power and the threshold for each sampling (see, for example, Patent Document 2).

特開２０１２−３７７０１号公報JP 2012-37701 A 特開２００４−１６５９２９号公報JP 2004-165929 A

透かし抽出のためのケプストラムを演算するためには、ＦＦＴ等の負荷の高い処理を繰り返し実行する。そのため、抽出処理を実行する装置の消費電力は増大してしまい、装置の動作時間が減少してしまうという問題がある。 In order to calculate a cepstrum for watermark extraction, high load processing such as FFT is repeatedly executed. Therefore, there is a problem that the power consumption of the apparatus that executes the extraction process increases, and the operation time of the apparatus decreases.

よって、エコーハイディングなどのリアルタイムで行う透かし抽出処理を、携帯端末装置などの低消費電力が要求される装置で実行するためには、抽出精度を下げることなく処理量を削減し、省電力を図るとよい。一般的に、サンプリングレートを上げると抽出精度は上がるが消費電力が大きくなり、サンプリングレートを下げると消費電力は小さくなるが抽出精度が下がる。 Therefore, in order to perform watermark extraction processing such as echo hiding in real time on devices that require low power consumption such as mobile terminal devices, the processing amount can be reduced without reducing the extraction accuracy, and power can be saved. It is good to plan. Generally, when the sampling rate is increased, the extraction accuracy is increased but the power consumption is increased. When the sampling rate is decreased, the power consumption is decreased but the extraction accuracy is decreased.

しかし、従来技術では、透かし信号の抽出精度を下げることなく、省電力を図ることができなかった。 However, in the prior art, it was not possible to save power without lowering the accuracy of watermark signal extraction.

そこで、開示の技術は、透かし信号の抽出精度の劣化を防ぎつつ、電力消費を抑えることができる音処理装置、音処理方法及びプログラムを提供することを目的とする Therefore, the disclosed technique aims to provide a sound processing device, a sound processing method, and a program capable of suppressing power consumption while preventing deterioration of watermark signal extraction accuracy.

開示の一態様における音処理装置は、透かし信号が重畳された音響信号を、所定のサンプリングレートでアナログ信号からデジタル信号に変換するＡＤ変換部と、分割された前記デジタル信号の所定区間毎に、前記透かし信号の抽出の成否を判定する判定部と、前記判定部による判定結果を記憶する記憶部と、前記記憶部に記憶された判定結果に基づいて、前記所定のサンプリングレートを制御するレート制御部と、を備える。 The sound processing device according to one aspect of the disclosure includes an AD conversion unit that converts an acoustic signal on which a watermark signal is superimposed from an analog signal into a digital signal at a predetermined sampling rate, and a predetermined section of the divided digital signal, A determination unit for determining success or failure of extraction of the watermark signal, a storage unit for storing a determination result by the determination unit, and a rate control for controlling the predetermined sampling rate based on the determination result stored in the storage unit A section.

開示の技術によれば、透かし信号の抽出精度の劣化を防ぎつつ、電力消費を抑えることができる。 According to the disclosed technology, it is possible to suppress power consumption while preventing deterioration of the watermark signal extraction accuracy.

エコーハイディングによる情報透かしの一例を示す図。The figure which shows an example of the information watermark by echo hiding. ケプストラム解析結果の一例を示す図。The figure which shows an example of a cepstrum analysis result. 実施例における音処理装置のハードウェアの一例を示すブロック図。The block diagram which shows an example of the hardware of the sound processing apparatus in an Example. 実施例におけるプロセッサの機能の一例を示すブロック図。The block diagram which shows an example of the function of the processor in an Example. 透かし信号の抽出の成否判定を説明する図。The figure explaining the success or failure determination of the extraction of a watermark signal. サンプリングレートの状態遷移の一例を示す図。The figure which shows an example of the state transition of a sampling rate. 同期処理を説明する図。The figure explaining a synchronous process. サンプリングレートの変更タイミングの一例を示す図。The figure which shows an example of the change timing of a sampling rate. 実施例における透かし抽出処理の一例を示すフローチャート。6 is a flowchart illustrating an example of a watermark extraction process in the embodiment. 実施例における同期処理の一例を示すフローチャート。The flowchart which shows an example of the synchronous process in an Example. 変形例における音処理装置の概略構成の一例を示す図。The figure which shows an example of schematic structure of the sound processing apparatus in a modification.

以下、添付図面を参照して各実施例について説明する。 Embodiments will be described below with reference to the accompanying drawings.

［実施例］
＜構成＞
図３は、実施例における音処理装置１０のハードウェアの一例を示すブロック図である。実施例における音処理装置１０は、例えば、音響透かし抽出処理のためのマイクロコントローラである。なお、音処理装置１０は、図３に示すマイクロコントローラを含む携帯端末装置や、タブレット端末、ＰＣ、ゲーム機器などの情報処理装置を音処理装置と称してもよい。 [Example]
<Configuration>
FIG. 3 is a block diagram illustrating an example of hardware of the sound processing device 10 according to the embodiment. The sound processing apparatus 10 in the embodiment is, for example, a microcontroller for acoustic watermark extraction processing. Note that in the sound processing device 10, an information processing device such as a mobile terminal device including a microcontroller illustrated in FIG. 3, a tablet terminal, a PC, or a game device may be referred to as a sound processing device.

図３に示す音処理装置１０は、マイクセンサ１０２と、ＡＤＣ（アナログデジタルコンバータ）１０４と、バス１０６と、タイマー１０８と、プロセッサ１１０と、メモリ１１２とを有する。なお、メモリ１１２は、プロセッサ１１０に直接接続されているが、バス１０６を介して接続されてもよい。 The sound processing apparatus 10 illustrated in FIG. 3 includes a microphone sensor 102, an ADC (analog / digital converter) 104, a bus 106, a timer 108, a processor 110, and a memory 112. The memory 112 is directly connected to the processor 110, but may be connected via the bus 106.

図３に示すように、プロセッサ１１０は、バス１０６に接続される。バス１０６には、ペリフェラル・デバイスとしてタイマー１０８、ＡＤＣ１０４が接続される。また、ＡＤＣ１０４には、マイクセンサ１０２が接続される。 As shown in FIG. 3, the processor 110 is connected to the bus 106. A timer 108 and an ADC 104 are connected to the bus 106 as peripheral devices. The microphone sensor 102 is connected to the ADC 104.

マイクセンサ１０２は、受信部として機能し、透かし信号が重畳された音響信号のアナログ信号を受信する。音響信号は、音声や楽曲などの信号であり、リアルタイムで透かし信号が抽出可能なデータが重畳されている。リアルタイムで透かし信号が抽出可能な技術として、以下では、エコーハイディングを用いる。マイクセンサ１０２は、受信した音響信号をＡＤＣ１０４に出力する。 The microphone sensor 102 functions as a receiving unit and receives an analog signal of an acoustic signal on which a watermark signal is superimposed. The acoustic signal is a signal such as voice or music, and data on which a watermark signal can be extracted in real time is superimposed. In the following, echo hiding is used as a technique capable of extracting a watermark signal in real time. The microphone sensor 102 outputs the received acoustic signal to the ADC 104.

ＡＤＣ１０４は、例えばＡＤ（アナログデジタル）変換部として機能する。ＡＤＣ１０４は、マイクセンサ１０２から取得した音響信号を、プロセッサ１１０からの制御により所定のサンプリングレートでアナログ信号からデジタル信号に変換する。 The ADC 104 functions as an AD (analog / digital) converter, for example. The ADC 104 converts the acoustic signal acquired from the microphone sensor 102 from an analog signal to a digital signal at a predetermined sampling rate under the control of the processor 110.

タイマー１０８は、サンプリングのタイミングを割り込みによってプロセッサ１１０に指示する。サンプリングレートは、タイマー１０８に設定されている。例えば、タイマー１０８に設定された所定のサンプリングレートは、プロセッサ１１０により書き換えられる。 The timer 108 instructs the processor 110 by sampling the sampling timing. The sampling rate is set in the timer 108. For example, the predetermined sampling rate set in the timer 108 is rewritten by the processor 110.

プロセッサ１１０は、音響信号のデジタル信号を取得し、タイマー１０８による割り込みタイミングでＡＤＣ１０４に変換指示を出し、透かし抽出処理を実行する。透かし抽出処理は、メモリ１１２の命令メモリ１１２ａに記憶された透かし抽出処理プログラムをロードし、実行することで実現できる。 The processor 110 acquires the digital signal of the acoustic signal, issues a conversion instruction to the ADC 104 at the interrupt timing by the timer 108, and executes watermark extraction processing. The watermark extraction process can be realized by loading and executing a watermark extraction process program stored in the instruction memory 112a of the memory 112.

プロセッサ１１０は、透かし抽出処理の成否の結果をメモリ１１２のデータメモリ１１２ｂに記憶する。プロセッサ１１０の詳細な処理については、図４を用いて後述する。 The processor 110 stores the result of the success or failure of the watermark extraction process in the data memory 112b of the memory 112. Detailed processing of the processor 110 will be described later with reference to FIG.

メモリ１１２は、例えばＲＯＭ（Read Only Memory）やＲＡＭ（Random Access Memory）などの記憶部であり、命令メモリ１１２ａと、データメモリ１１２ｂとを有する。命令メモリ１１２ａ及びデータメモリ１１２ｂは、例えばメモリ１１２における記憶領域がそれぞれ異なる。 The memory 112 is a storage unit such as a read only memory (ROM) or a random access memory (RAM), and includes an instruction memory 112a and a data memory 112b. The instruction memory 112a and the data memory 112b have different storage areas in the memory 112, for example.

命令メモリ１１２ａは、プロセッサ１１０により実行される透かし抽出処理プログラムを記憶する。 The instruction memory 112 a stores a watermark extraction processing program executed by the processor 110.

データメモリ１１２ｂは、音響信号のデジタルデータ列を格納するためのサンプルバッファと、透かしの抽出処理の成否情報を示すデータを格納するための成否バッファとを有する。 The data memory 112b includes a sample buffer for storing a digital data string of the acoustic signal and a success / failure buffer for storing data indicating success / failure information of the watermark extraction process.

サンプルバッファは、例えば最大で５１２サンプルが記憶できる領域である。成否バッファは、例えば６区間の抽出処理の成否を示す１ビットの情報を記憶する領域である。 The sample buffer is an area in which, for example, a maximum of 512 samples can be stored. The success / failure buffer is an area for storing, for example, 1-bit information indicating success / failure of extraction processing of 6 sections.

《プロセッサ》
ここで、プロセッサの機能について説明する。図４は、実施例におけるプロセッサ１１０の機能の一例を示すブロック図である。図４に示すプロセッサ１１０は、透かし抽出処理プログラムを実行することで、取得部２０２と、抽出部２０４と、レート制御部２０６として機能する。《Processor》
Here, the function of the processor will be described. FIG. 4 is a block diagram illustrating an example of functions of the processor 110 in the embodiment. The processor 110 illustrated in FIG. 4 functions as the acquisition unit 202, the extraction unit 204, and the rate control unit 206 by executing the watermark extraction processing program.

取得部２０２は、バス１０６を介してＡＤＣ１０４から音響信号のデジタル信号を所定のサンプリングレートで取得する。このとき、取得部２０２は、タイマー１０８による割り込みタイミングによりデジタル信号をサンプリングする。取得部２０２は、取得したデジタル信号を抽出部２０４に出力する。 The acquisition unit 202 acquires a digital signal of an acoustic signal from the ADC 104 via the bus 106 at a predetermined sampling rate. At this time, the acquisition unit 202 samples the digital signal at the interrupt timing by the timer 108. The acquisition unit 202 outputs the acquired digital signal to the extraction unit 204.

抽出部２０４は、所定のサンプリングレートで取得された音響信号のデジタル信号に対し、このデジタル信号を所定区間に分割し、所定区間毎に、透かし抽出処理を行う。抽出部２０４は、図２に示すように、所定区間毎にケプストラム解析を行って遅延時間に対応する透かし信号を抽出する。 The extraction unit 204 divides this digital signal into predetermined intervals for the digital signal of the acoustic signal acquired at a predetermined sampling rate, and performs watermark extraction processing for each predetermined interval. As illustrated in FIG. 2, the extraction unit 204 performs cepstrum analysis for each predetermined section and extracts a watermark signal corresponding to the delay time.

このとき、抽出部２０４は、判定部２４２を有し、判定部２４２は、所定区間毎に、ケプストラム解析結果を用いて透かし信号が抽出されたか否かを判定する。 At this time, the extraction unit 204 includes a determination unit 242, and the determination unit 242 determines whether a watermark signal is extracted using a cepstrum analysis result for each predetermined section.

図５は、透かし信号の抽出の成否判定を説明する図である。図５に示すように、遅延時間に対応するケフレンシでの最大のケプストラム係数と、閾値とを比較することで、抽出の成否を判定する。 FIG. 5 is a diagram for explaining success / failure determination of watermark signal extraction. As shown in FIG. 5, the success or failure of the extraction is determined by comparing the maximum cepstrum coefficient at the quefrency corresponding to the delay time with a threshold value.

判定部２４２は、例えば、最大のケプストラム係数が閾値以上であれば、抽出成功として「１」で表し、最大のケプストラム係数が閾値未満であれば、抽出失敗として「０」で表す。また、閾値は、予め設定された固定値でもよいし、区間内のケプストラム係数の平均値と標準偏差とに基づく値、例えば平均値＋標準偏差でもよい。 For example, if the maximum cepstrum coefficient is equal to or greater than the threshold, the determination unit 242 indicates “1” as extraction success, and indicates “0” as extraction failure if the maximum cepstrum coefficient is less than the threshold. The threshold value may be a fixed value set in advance, or may be a value based on the average value and standard deviation of cepstrum coefficients in the section, for example, average value + standard deviation.

図５（Ａ）では、遅延時間に対応するケフレンシであり、ケプストラム係数が最大となるケフレンシは０．２ｍｓｅｃである。このケフレンシのケプストラム係数が閾値以上であるので、判定部２４２は、抽出成功と判定する。 In FIG. 5A, the quefrency corresponding to the delay time, and the quefrency with the maximum cepstrum coefficient is 0.2 msec. Since the cepstrum coefficient of the quefrency is equal to or greater than the threshold value, the determination unit 242 determines that extraction is successful.

図５（Ｂ）では、遅延時間に対応するケフレンシであり、ケプストラム係数が最大となるケフレンシは０．３ｍｓｅｃである。このケフレンシのケプストラム係数が閾値未満であるので、判定部２４２は、抽出失敗と判定する。 In FIG. 5B, the quefrency corresponding to the delay time, and the quefrency with the maximum cepstrum coefficient is 0.3 msec. Since the cepstrum coefficient of the quefrency is less than the threshold value, the determination unit 242 determines that the extraction has failed.

判定部２４２は、図５に示すように、区間毎に抽出の成否の判定を行い、この判定結果を成否バッファに記憶する。 As illustrated in FIG. 5, the determination unit 242 determines whether the extraction is successful for each section, and stores the determination result in the success / failure buffer.

図４に戻り、レート制御部２０６は、メモリ１１２に記憶された判定結果に基づいて、音響信号のデジタル信号をサンプリングするサンプリングレートを制御する。例えば、レート制御部２０６は、複数のサンプリングレートを用意しておき、過去の所定数の区間の判定結果に基づいていずれかのサンプリングレートを選択し、選択したサンプリングレートを用いてタイマー１０８のサンプリングレートを書き換える。 Returning to FIG. 4, the rate control unit 206 controls the sampling rate for sampling the digital signal of the acoustic signal based on the determination result stored in the memory 112. For example, the rate control unit 206 prepares a plurality of sampling rates, selects one of the sampling rates based on the determination results of a predetermined number of sections in the past, and uses the selected sampling rate to sample the timer 108. Rewrite the rate.

また、レート制御部２０６は、メモリ１１２に記憶された第１所定数の区間の成否の個数をカウントし、成功の個数が閾値以上であれば所定のサンプリングレートを下げ、失敗の個数が閾値以上であれば所定のサンプリングレートを上げるようにしてもよい。第１所定数は、例えば６であるが適宜設定変更可能であり、また、閾値も適宜設定変更可能である。 Further, the rate control unit 206 counts the number of successes / failures of the first predetermined number of sections stored in the memory 112. If the number of successes is equal to or greater than the threshold, the rate control unit 206 decreases the predetermined sampling rate, and the number of failures exceeds the threshold. If so, the predetermined sampling rate may be increased. The first predetermined number is, for example, 6, but can be appropriately changed, and the threshold can also be appropriately changed.

また、レート制御部２０６は、メモリ１１２に記憶された直近の第２所定数の連続区間の判定結果が、全て成功であれば所定のサンプリングレートを下げ、全て失敗であれば所定のサンプリングレートを上げるようにしてもよい。第２所定数は、例えば３であるが適宜設定変更可能である。 Further, the rate control unit 206 lowers the predetermined sampling rate if the determination results of the most recent second predetermined number of consecutive sections stored in the memory 112 are all successful, and sets the predetermined sampling rate if all are unsuccessful. You may make it raise. The second predetermined number is, for example, 3, but can be appropriately changed.

なお、レート制御部２０６は、サンプリングレートが上限に達した場合は上限値以上に上げることはしない。同様に、レート制御部２０６は、サンプリングレートが下限に達した場合は下限値に以下に下げることはしない。 Note that the rate control unit 206 does not increase the sampling rate beyond the upper limit when the sampling rate reaches the upper limit. Similarly, when the sampling rate reaches the lower limit, the rate control unit 206 does not lower the lower limit value.

これにより、レート制御部２０６は、最近の透かし信号の抽出結果に基づいて適応的にサンプリングレートを制御することができる。 Accordingly, the rate control unit 206 can adaptively control the sampling rate based on the latest watermark signal extraction result.

（具体例）
次に、実施例における透かし抽出処理の具体例について説明する。レート制御部２０６は、ＡＤＣ１０４のサンプリングレートとして例えば３２ｋＨｚ、１６ｋＨｚ、８ｋＨｚを用意する。 (Concrete example)
Next, a specific example of the watermark extraction process in the embodiment will be described. The rate control unit 206 prepares, for example, 32 kHz, 16 kHz, and 8 kHz as the sampling rate of the ADC 104.

また、抽出部２０４は、透かし信号を抽出する区間を１６ｍｓｅｃと設定する。ＡＤＣ１０４のサンプリングレートを上記の通り３２ｋＨｚ、１６ｋＨｚ、８ｋＨｚとする。各区間に含まれるディジタルサンプル数は、それぞれ５１２個、２５６個、１２８個となる。 Further, the extraction unit 204 sets a section for extracting a watermark signal as 16 msec. The sampling rate of the ADC 104 is set to 32 kHz, 16 kHz, and 8 kHz as described above. The number of digital samples included in each section is 512, 256, and 128, respectively.

また、実施例における判定部２４２は、各区間の抽出成否判定のための閾値を、区間単位で動的に決定する。判定部２４２は、各区間のケプストラム係数列の平均値ａｖｇと標準偏差ｓｄを計算し、閾値をａｖｇ＋ｓｄと設定する。 In addition, the determination unit 242 according to the embodiment dynamically determines a threshold value for the extraction success / failure determination of each section for each section. The determination unit 242 calculates the average value avg and standard deviation sd of the cepstrum coefficient sequence in each section, and sets the threshold value as avg + sd.

人工エコーの遅延時間に対応するケプストラム係数の値が、この閾値以上の場合には、判定部２４２は、判定対象区間を抽出成功と判定する。遅延時間に対応するケプストラム係数の値が閾値よりも低い場合には、判定部２４２は、抽出失敗と判定する。 When the value of the cepstrum coefficient corresponding to the delay time of the artificial echo is equal to or greater than this threshold, the determination unit 242 determines that the determination target section is successfully extracted. When the value of the cepstrum coefficient corresponding to the delay time is lower than the threshold value, the determination unit 242 determines that extraction has failed.

また、実施例においては、成否バッファに格納する成否情報は６区間分とし、後述する図６に示すように過去６区間の抽出成否をもってＡＤＣ１０４のサンプリングレートを決定するものとする。 In the embodiment, the success / failure information stored in the success / failure buffer is for six sections, and the sampling rate of the ADC 104 is determined based on the success / failure of extraction of the past six sections as shown in FIG.

また、レート制御部２０６は、連続３個の区間で成功、連続３個の区間で失敗となれば、６区間分の成否結果の蓄積を待たずに、ＡＤＣ１０４のサンプリングレートを決定してもよい。 Further, the rate control unit 206 may determine the sampling rate of the ADC 104 without waiting for the accumulation of the success / failure results for six sections if the success is made in three consecutive sections and the failure is made in three consecutive sections. .

図６は、サンプリングレートの状態遷移の一例を示す図である。図６に示すように、サンプリングレートは、例えば３つ（３２ｋｈｚ、１６ｋＨｚ、８ｋｈｚ）であり、３２ｋＨｚでスタートする。なお、サンプリングレートの数は３つに限定されず、複数あればよい。 FIG. 6 is a diagram illustrating an example of state transition of the sampling rate. As shown in FIG. 6, the sampling rate is three (32 kHz, 16 kHz, 8 kHz), for example, and starts at 32 kHz. Note that the number of sampling rates is not limited to three but may be plural.

（現在のサンプリングレートが３２ｋＨｚの場合）
レート制御部２０６は、成否バッファに記憶された６個の判定結果のうち４個以上が失敗を示す場合、又は直近の連続する３個の判定結果が失敗を示す場合、又は変更条件に達していない場合、サンプリングレートをそのままにする。変更条件に達していないとは、成否バッファに３個又は６個の判定結果が記憶されていない場合や、連続３個や６個の判定結果の上記条件を満たさないことである。 (When the current sampling rate is 32 kHz)
The rate control unit 206 determines that the change condition has been reached when four or more of the six determination results stored in the success / failure buffer indicate failure, or when the last three consecutive determination results indicate failure. If not, leave the sampling rate as is. The fact that the change condition has not been reached means that 3 or 6 determination results are not stored in the success / failure buffer, or that the above conditions for 3 or 6 consecutive determination results are not satisfied.

また、レート制御部２０６は、成否バッファに記憶された６個の判定結果のうち４個以上が成功を示す場合、又は直近の連続する３個の判定結果が成功を示す場合、サンプリングレートを１６ｋＨｚに決定する。 Further, the rate control unit 206 sets the sampling rate to 16 kHz when four or more of the six determination results stored in the success / failure buffer indicate success, or when the latest three consecutive determination results indicate success. To decide.

（現在のサンプリングレートが１６ｋＨｚの場合）
レート制御部２０６は、成否バッファに記憶された６個の判定結果のうち４個以上が失敗を示す場合、又は直近の連続する３個の判定結果が失敗を示す場合、サンプリングレートを３２ｋＨｚに決定する。 (When the current sampling rate is 16 kHz)
The rate control unit 206 determines the sampling rate to be 32 kHz when four or more of the six determination results stored in the success / failure buffer indicate failure, or when the latest three consecutive determination results indicate failure. To do.

また、レート制御部２０６は、変更条件に達していない場合、サンプリングレートをそのままにする。 Further, the rate control unit 206 keeps the sampling rate as it is when the change condition is not reached.

また、レート制御部２０６は、成否バッファに記憶された６個の判定結果のうち４個以上が成功を示す場合、又は直近の連続する３個の判定結果が成功を示す場合、サンプリングレートを８ｋＨｚに決定する。 Further, the rate control unit 206 sets the sampling rate to 8 kHz when four or more of the six determination results stored in the success / failure buffer indicate success, or when the latest three consecutive determination results indicate success. To decide.

（現在のサンプリングレートが８ｋＨｚの場合）
レート制御部２０６は、成否バッファに記憶された６個の判定結果のうち４個以上が失敗を示す場合、又は直近の連続する３個の判定結果が失敗を示す場合サンプリングレートを１６ｋＨｚに決定する。 (When the current sampling rate is 8 kHz)
The rate control unit 206 determines the sampling rate to be 16 kHz when four or more of the six determination results stored in the success / failure buffer indicate failure, or when the latest three consecutive determination results indicate failure. .

また、レート制御部２０６は、成否バッファに記憶された６個の判定結果のうち４個以上が成功を示す場合、又は直近の連続する３個の判定結果が成功を示す場合、又は変更条件に達していない場合、サンプリングレートをそのままにする。 In addition, the rate control unit 206 determines that when four or more of the six determination results stored in the success / failure buffer indicate success, or when the latest three consecutive determination results indicate success, or the change condition If not, leave the sampling rate as it is.

これにより、良好に透かし信号を抽出できている場合は、サンプリングレートを下げて電力消費を抑え、透かし信号の抽出が適切にできていない場合は、サンプリングレートを上げて抽出精度を上げることができる。よって、上記構成を有することで、開示の技術は省電力を達成することができる。 As a result, if the watermark signal can be extracted well, the sampling rate can be reduced to reduce power consumption. If the watermark signal cannot be extracted properly, the sampling rate can be increased to increase the extraction accuracy. . Thus, with the above-described configuration, the disclosed technology can achieve power saving.

＜同期処理＞
透かし抽出処理では、抽出精度を上げるため、プロセッサ１１０の同期処理を行う。図７は、同期処理を説明する図である。図７（Ａ）は、原信号と減衰させた人工エコーを示す。図７（Ａ）に示すように、送信側で、透かし信号（００，０１，１０，１１）を示すエコー（０．１ｍｓｅｃ，０．２ｍｓｅｃ，０．３ｍｓｅｃ，０．４ｍｓｅｃ）以外に、同期用の専用シンボル'Ｓ'に対応させた遅延時間（０．５ｍｓｅｃ）のエコーを重畳する。同期用の専用シンボルは、同期シンボルとも称す。送信側は、この同期用の専用シンボル'Ｓ'を重畳した区間を、定期的に挿入する。 <Synchronous processing>
In the watermark extraction process, the processor 110 performs a synchronization process in order to increase the extraction accuracy. FIG. 7 is a diagram for explaining the synchronization processing. FIG. 7A shows the original signal and the attenuated artificial echo. As shown in FIG. 7A, on the transmitting side, in addition to echoes (0.1 msec, 0.2 msec, 0.3 msec, 0.4 msec) indicating the watermark signal (00, 01, 10, 11), An echo having a delay time (0.5 msec) corresponding to the dedicated symbol 'S' is superimposed. The dedicated symbol for synchronization is also referred to as a synchronization symbol. The transmitting side periodically inserts a section in which the synchronization dedicated symbol 'S' is superimposed.

図７（Ｂ）は、入力音源を示す。図７（Ｂ）に示す入力音源を受信側（音処理装置１０）で受信したとする。 FIG. 7B shows an input sound source. Assume that the input sound source shown in FIG. 7B is received by the receiving side (sound processing apparatus 10).

図７（Ｃ）は、プロセッサの同期処理を示す。受信側は、例えば、図７（Ｃ）に示すように、各１６ｍｓｅｃの区間の幅よりも狭い幅（例えば４ｍｓｅｃ）の小区間での透かし抽出処理を実行する。受信側は、同期シンボル'Ｓ'の小区間と、データシンボル（'００'，'０１'，'１０'，'１１'）の小区間の変化点を検知し、区間の境界である同機ポイントと認識する。 FIG. 7C shows processor synchronization processing. For example, as shown in FIG. 7C, the receiving side executes watermark extraction processing in a small section having a width (for example, 4 msec) narrower than the width of each 16 msec section. The receiving side detects a change point between the small interval of the synchronization symbol 'S' and the small interval of the data symbol ('00', '01', '10', '11'), and the aircraft point that is the boundary of the interval Recognize.

図７（Ｄ）は、プロセッサ透かし抽出処理の開始点を示す。図７（Ｄ）に示すように、受信側では、同期ポイントを基準として、１６ｍｓｅｃ単位で受信した音響信号を分割する。 FIG. 7D shows the starting point of the processor watermark extraction process. As shown in FIG. 7D, the reception side divides the received acoustic signal in units of 16 msec with reference to the synchronization point.

これにより、プロセッサ１１０の透かし抽出処理を音源の取得と同期させることができる。なお、同期処理は、定期的に行ってもよいし、最初の一回だけ行うようにしてもよい。 Thereby, the watermark extraction process of the processor 110 can be synchronized with the acquisition of the sound source. The synchronization process may be performed periodically or may be performed only once.

＜変更タイミング＞
プロセッサ１１０は、ＡＤＣ１０４のサンプリングレートを変更する必要がある場合は、現在格納中の区間のサンプルデータを全て取得してからＡＤＣ１０４のレートを変更する。 <Change timing>
When it is necessary to change the sampling rate of the ADC 104, the processor 110 acquires all the sample data of the section currently being stored and then changes the rate of the ADC 104.

それは、ケプストラム抽出処理中もタイマー１０８から割込みが発生するため、ＡＤＣ１０４のサンプリングレートをすぐに変更すると、１区間内で異なるサンプリングレートで取得されたサンプルデータが混在してしまう。よって、正常な透かし抽出ができなくなる。 This is because an interrupt is generated from the timer 108 even during the cepstrum extraction process, so if the sampling rate of the ADC 104 is changed immediately, sample data acquired at different sampling rates within one section will be mixed. Therefore, normal watermark extraction cannot be performed.

図８は、サンプリングレートの変更タイミングの一例を示す図である。図８に示すように、ＡＤＣ１０４のサンプリングレートを変更するタイミングは、成否判定済みの区間５の直後に続く区間（現在サンプリング中の区間６）の次の区間７の先頭となる。 FIG. 8 is a diagram illustrating an example of the change timing of the sampling rate. As shown in FIG. 8, the timing for changing the sampling rate of the ADC 104 is the head of the next section 7 of the section immediately following the section 5 for which success / failure has been determined (the section 6 currently being sampled).

図８に示す例では、区間０〜５のうち、４個が抽出成功となっているので、レート制御部２０６は、ＡＤＣ１０４のサンプリングレートをダウンサンプリングするよう制御する。このとき、区間６は既にサンプリング中であるので、レート制御部２０６は、区間７からサンプリングレートが変更されるよう、タイマー１０８のサンプリングレートを書き換える。 In the example illustrated in FIG. 8, since four of the sections 0 to 5 have been successfully extracted, the rate control unit 206 controls to downsample the sampling rate of the ADC 104. At this time, since the section 6 is already being sampled, the rate control unit 206 rewrites the sampling rate of the timer 108 so that the sampling rate is changed from the section 7.

これにより、レート制御部２０６は、適切なタイミングでサンプリングレートを制御すすることができる。 Thereby, the rate control unit 206 can control the sampling rate at an appropriate timing.

＜透かし信号の復元＞
実際の処理では、成否判定においてＮＧと判定されたフレームからは正確なデータが抽出されていない可能性が高い。そのため、透かし抽出結果から、重畳されたデータ列の復元の仕組みが導入されるとよい。 <Restoring the watermark signal>
In actual processing, there is a high possibility that accurate data is not extracted from the frame determined as NG in the success / failure determination. Therefore, it is preferable to introduce a mechanism for restoring the superimposed data string from the watermark extraction result.

１フレームのサンプル数を多く取ることによって、周波数分解能を高めることで透かし抽出精度を向上させてもよいが、データの転送レートが１フレームのサンプル数に対して逆比例の関係になってしまう。 The watermark extraction accuracy may be improved by increasing the frequency resolution by taking a large number of samples in one frame, but the data transfer rate is inversely proportional to the number of samples in one frame.

よって、重畳対象の透かし信号列そのものに誤り訂正処理を施した後に、誤り訂正信号が付加された透かし信号を音響信号に埋め込むことが望ましい。受信側では、抽出した透かし信号列の誤り訂正信号を処理することで、あるレベルまでのビット欠落に対して元の透かし信号を完全に復元することが可能となる。 Therefore, it is desirable to embed a watermark signal to which an error correction signal is added in an acoustic signal after performing an error correction process on the watermark signal sequence itself to be superimposed. On the receiving side, by processing the error correction signal of the extracted watermark signal sequence, it is possible to completely restore the original watermark signal with respect to missing bits up to a certain level.

ＬＤＰＣ（低密度パリティ検査符号）などのシャノン限界に近い技術を用いることで、データ転送レートの低下を可能な限り抑えることが可能である。一方で、数ｍＷレベルで動作する低電力性を求められるマイクロコントローラにおいては、訂正能力はＬＤＰＣよりも劣るが、高速かつ処理量の少ない畳み込み符号などの誤り訂正アルゴリズムを適用することが望ましい。 By using a technique close to the Shannon limit, such as LDPC (low density parity check code), it is possible to suppress a decrease in data transfer rate as much as possible. On the other hand, in a microcontroller that operates at a level of several mW and requires low power, the correction capability is inferior to that of LDPC, but it is desirable to apply an error correction algorithm such as a convolutional code with high speed and a small amount of processing.

＜動作＞
次に、実施例における音処理装置１０の動作について説明する。図９は、実施例における透かし抽出処理の一例を示すフローチャートである。ステップＳ１０１で、プロセッサ１１０は、同期処理を行う。同期処理については、図１０を用いて後述する。 <Operation>
Next, the operation of the sound processing apparatus 10 in the embodiment will be described. FIG. 9 is a flowchart illustrating an example of a watermark extraction process in the embodiment. In step S101, the processor 110 performs a synchronization process. The synchronization process will be described later with reference to FIG.

ステップＳ１０２で、プロセッサ１１０は、初期化処理を行う。プロセッサ１１０は、例えば、タイマーレートの変数であるＲａｔｅ（サンプリングレート）を３２ｋＨｚに設定し、１フレームサンプル数の変数であるＳａｍｐｌｅを５１２に設定する。 In step S102, the processor 110 performs an initialization process. For example, the processor 110 sets Rate (sampling rate), which is a variable of the timer rate, to 32 kHz, and sets Sample, which is a variable of the number of one-frame samples, to 512.

ステップＳ１０３で、プロセッサ１１０は、タイマー１０８を動作させる。ステップＳ１０４で、プロセッサ１１０は、タイマー１０８に設定されたＲａｔｅ（３２ｋＨｚ）に応じて、定期的に割り込みが発生すると、ＡＤＣ１０４へＡＤ変換リスクエストを発行する。プロセッサ１１０は、変換結果をＡＤＣ１０４から読み出して取得し、取得したデータをサンプルバッファに格納する。 In step S103, the processor 110 operates the timer 108. In step S <b> 104, the processor 110 issues an AD conversion request to the ADC 104 when an interrupt occurs periodically according to the Rate (32 kHz) set in the timer 108. The processor 110 reads out and acquires the conversion result from the ADC 104, and stores the acquired data in the sample buffer.

プロセッサ１１０は、割り込みの回数をカウントし、１フレーム分のサンプルデータが格納されたか否かを判定する。例えば１フレーム分を、１区間分とも称す。１フレーム分のサンプルデータが格納された場合（ステップＳ１０４−ＹＥＳ）ステップＳ１０５に進み、１フレーム分のサンプルデータが格納されていない場合（ステップＳ１０４−ＮＯ）ステップＳ１０４に戻る。 The processor 110 counts the number of interruptions and determines whether or not one frame of sample data has been stored. For example, one frame is also referred to as one section. When sample data for one frame is stored (step S104—YES), the process proceeds to step S105. When sample data for one frame is not stored (step S104—NO), the process returns to step S104.

ステップＳ１０５で、プロセッサ１１０（抽出部２０４）は、Ｒａｔｅに設定されたサンプリングレートに従って１フレーム分のサンプルデータに対して透かし抽出処理を行う。プロセッサ１１０は、同期処理で最後に抽出したフレーム（'Ｓ'以外のシンボルが抽出された最初のフレーム）の最初のサンプル点の時間を同期開始点として、データの透かし抽出処理を行う。 In step S105, the processor 110 (extraction unit 204) performs watermark extraction processing on the sample data for one frame in accordance with the sampling rate set in Rate. The processor 110 performs data watermark extraction processing using the time of the first sample point of the last frame extracted in the synchronization processing (the first frame from which symbols other than 'S' are extracted) as the synchronization start point.

ステップＳ１０６で、プロセッサ１１０は、１フレーム（１区間）の透かし抽出が終了すると、サンプルバッファのサンプルデータ列を破棄し、バッファを空にする。 In step S106, when the watermark extraction for one frame (one section) is completed, the processor 110 discards the sample data string in the sample buffer and empties the buffer.

ステップＳ１０７で、プロセッサ１１０は、透かし抽出の終了指示があるか否かを判定する。終了指示はユーザから与えられたり、透かし信号として埋め込まれたりしてもよい。終了指示があれば（ステップＳ１０７−ＹＥＳ）透かし抽出処理を終了し、終了指示がなければ（ステップＳ１０７−ＮＯ）ステップＳ１０８に進む。 In step S107, the processor 110 determines whether there is an instruction to end watermark extraction. The end instruction may be given by the user or embedded as a watermark signal. If there is an end instruction (step S107—YES), the watermark extraction process is ended, and if there is no end instruction (step S107—NO), the process proceeds to step S108.

ステップＳ１０８で、プロセッサ１１０（判定部２４２）は、抽出されたデータに対し、遅延時間に対応するケプストラム係数の最大ピークと、閾値とを比較する。 In step S108, the processor 110 (determination unit 242) compares the maximum peak of the cepstrum coefficient corresponding to the delay time with the threshold value for the extracted data.

ステップＳ１０９で、プロセッサ１１０（判定部２４２）は、最大ピークが閾値以上であれば成功、最大ピークが閾値以下であれば失敗とする成否結果を成否バッファに格納する。閾値は、例えば判定対象の区間毎に変更される値である。 In step S109, the processor 110 (determination unit 242) stores a success / failure result in the success / failure buffer indicating success if the maximum peak is equal to or greater than the threshold and failure if the maximum peak is equal to or less than the threshold. The threshold is a value that is changed for each section to be determined, for example.

ステップＳ１１０で、プロセッサ１１０（レート制御部２０６）は、成否結果が３個格納済みであるか否かを判定する。３個格納済みであれば（ステップＳ１１０−ＹＥＳ）ステップＳ１１１に進み、３個格納済みでなければ（ステップＳ１１０−ＮＯ）ステップＳ１０４に戻る。 In step S110, the processor 110 (rate control unit 206) determines whether or not three success / failure results have been stored. If three are stored (step S110-YES), the process proceeds to step S111. If three are not stored (step S110-NO), the process returns to step S104.

ステップＳ１１１で、プロセッサ１１０（レート制御部２０６）は、成否バッファに格納された直近の連続３個の成否結果が全て成功であるか否かを判定する。全て成功であれば（ステップＳ１１１−ＹＥＳ）ステップＳ１１７に進み、全て成功でなければ（ステップＳ１１１−ＮＯ）ステップＳ１１２に進む。 In step S111, the processor 110 (rate control unit 206) determines whether or not all three consecutive success / failure results stored in the success / failure buffer are successful. If all are successful (step S111-YES), the process proceeds to step S117, and if all is not successful (step S111-NO), the process proceeds to step S112.

ステップＳ１１２で、プロセッサ１１０（レート制御部２０６）は、成否バッファに格納された直近の連続３個の成否結果が全て失敗であるか否かを判定する。全て失敗であれば（ステップＳ１１２−ＹＥＳ）ステップＳ１２０に進み、全て失敗でなければ（ステップＳ１１２−ＮＯ）ステップＳ１１３に進む。 In step S112, the processor 110 (rate control unit 206) determines whether or not all three consecutive success / failure results stored in the success / failure buffer have failed. If all are unsuccessful (step S112—YES), the process proceeds to step S120. If all are unsuccessful (step S112—NO), the process proceeds to step S113.

ステップＳ１１３で、プロセッサ１１０（レート制御部２０６）は、成否結果が６個格納済みであるか否かを判定する。６個格納済みであれば（ステップＳ１１３−ＹＥＳ）ステップＳ１１４に進み、６個格納済みでなければ（ステップＳ１１３−ＮＯ）ステップＳ１０４に戻る。 In step S113, the processor 110 (rate control unit 206) determines whether or not six success / failure results have been stored. If six have been stored (step S113—YES), the process proceeds to step S114, and if not six (step S113—NO), the process returns to step S104.

ステップＳ１１４で、プロセッサ１１０（レート制御部２０６）は、成否バッファに格納された６個の成否結果のうち４個以上が成功であるか否かを判定する。４個以上が成功であれば（ステップＳ１１４−ＹＥＳ）ステップＳ１１７に進み、４個以上が成功でなければ（ステップＳ１１４−ＮＯ）ステップＳ１１５に進む。 In step S114, the processor 110 (rate control unit 206) determines whether or not four or more of the six success / failure results stored in the success / failure buffer are successful. If four or more are successful (step S114-YES), the process proceeds to step S117, and if four or more is not successful (step S114-NO), the process proceeds to step S115.

ステップＳ１１５で、プロセッサ１１０（レート制御部２０６）は、成否バッファに格納された６個の成否結果のうち４個以上が失敗であるか否かを判定する。４個以上が失敗であれば（ステップＳ１１５−ＹＥＳ）ステップＳ１２０に進み、４個以上が失敗でなければ（ステップＳ１１５−ＮＯ）ステップＳ１１６に進む。 In step S115, the processor 110 (rate control unit 206) determines whether or not four or more of the six success / failure results stored in the success / failure buffer have failed. If four or more pieces have failed (step S115—YES), the process proceeds to step S120, and if four or more pieces have not failed (step S115—NO), the process proceeds to step S116.

ステップＳ１１６で、プロセッサ１１０は、成否バッファから最も古いデータ（成否結果）を破棄する。 In step S116, the processor 110 discards the oldest data (success / failure result) from the success / failure buffer.

ステップＳ１１７で、プロセッサ１１０（レート制御部２０６）は、Ｒａｔｅが８ｋＨｚであるか否かを判定する。Ｒａｔｅ＝＝８ｋＨｚであれば（ステップＳ１１７−ＹＥＳ）ステップＳ１１６に進み、Ｒａｔｅ＝＝８ｋＨｚでなければ（ステップＳ１１７−ＮＯ）ステップＳ１１８に進む。 In step S117, the processor 110 (rate control unit 206) determines whether or not Rate is 8 kHz. If Rate == 8 kHz (step S117—YES), the process proceeds to step S116, and if Rate == 8 kHz is not satisfied (step S117—NO), the process proceeds to step S118.

ステップＳ１１８で、プロセッサ１１０（レート制御部２０６）は、Ｒａｔｅ＝Ｒａｔｅ／２、Ｓａｍｐｌｅ＝Ｓａｍｐｌｅ／２に設定する。 In step S118, the processor 110 (rate control unit 206) sets Rate = Rate / 2 and Sample = Sample / 2.

ステップＳ１１９で、プロセッサ１１０（レート制御部２０６）は、成否バッファのデータを全て破棄する。 In step S119, the processor 110 (rate control unit 206) discards all data in the success / failure buffer.

ステップＳ１２０で、プロセッサ１１０（レート制御部２０６）は、Ｒａｔｅが３２ｋＨｚであるか否かを判定する。Ｒａｔｅ＝＝３２ｋＨｚであれば（ステップＳ１２０−ＹＥＳ）ステップＳ１１６に進み、Ｒａｔｅ＝＝３２ｋＨｚでなければ（ステップＳ１２０−ＮＯ）ステップＳ１２１に進む。 In step S120, the processor 110 (rate control unit 206) determines whether or not Rate is 32 kHz. If Rate == 32 kHz (step S120—YES), the process proceeds to step S116, and if Rate == 32 kHz is not satisfied (step S120—NO), the process proceeds to step S121.

ステップＳ１２１で、プロセッサ１１０（レート制御部２０６）は、Ｒａｔｅ＝Ｒａｔｅ×２、Ｓａｍｐｌｅ＝Ｓａｍｐｌｅ×２に設定する。ステップＳ１２１の後は、ステップＳ１１９に進む。 In step S121, the processor 110 (rate control unit 206) sets Rate = Rate × 2 and Sample = Sample × 2. After step S121, the process proceeds to step S119.

これにより、透かし抽出の成否の状況をみながら、効率的に電力消費を抑えることができる。 As a result, power consumption can be efficiently suppressed while observing the success or failure of watermark extraction.

図１０は、実施例における同期処理の一例を示すフローチャートである。図１０に示すステップＳ２０１で、プロセッサ１１０は、同期処理の設定を行う。プロセッサ１１０は、例えば、Ｒａｔｅを３２ｋＨｚに設定し、また、Ｓａｍｐｌｅを１２８に設定する。同期処理の１フレームは、図７（Ｃ）に示すように、透かし抽出処理の例えば１／４とする。 FIG. 10 is a flowchart illustrating an example of the synchronization process in the embodiment. In step S201 illustrated in FIG. 10, the processor 110 sets synchronization processing. For example, the processor 110 sets Rate to 32 kHz and sets Sample to 128. As shown in FIG. 7C, one frame of the synchronization process is set to, for example, 1/4 of the watermark extraction process.

ステップＳ２０２で、プロセッサ１１０は、タイマー１０８を動作させる。ステップＳ２０３で、プロセッサ１１０は、タイマー１０８に設定されたサンプリングレート（３２ｋＨｚ）に応じて、定期的に割り込みが発生すると、ＡＤＣ１０４へＡＤ変換リスクエストを発行する。プロセッサ１１０は、変換結果をＡＤＣ１０４から読み出して取得し、取得したデータをサンプルバッファに格納する。 In step S202, the processor 110 operates the timer 108. In step S <b> 203, the processor 110 issues an AD conversion request to the ADC 104 when an interruption occurs periodically according to the sampling rate (32 kHz) set in the timer 108. The processor 110 reads out and acquires the conversion result from the ADC 104, and stores the acquired data in the sample buffer.

プロセッサ１１０は、割り込みの回数をカウントし、１フレーム分のサンプルデータが格納されたか否かを判定する。１フレーム分のサンプルデータが格納された場合（ステップＳ２０３−ＹＥＳ）ステップＳ２０４に進み、１フレーム分のサンプルデータが格納されていない場合（ステップＳ２０３−ＮＯ）ステップＳ２０３に戻る。 The processor 110 counts the number of interruptions and determines whether or not one frame of sample data has been stored. When sample data for one frame is stored (step S203—YES), the process proceeds to step S204, and when sample data for one frame is not stored (step S203—NO), the process returns to step S203.

ステップＳ２０４で、プロセッサ１１０（抽出部２０４）は、１フレーム分のサンプルデータに対して透かし抽出処理を行う。 In step S204, the processor 110 (extraction unit 204) performs a watermark extraction process on the sample data for one frame.

ステップＳ２０５で、プロセッサ１１０は、サンプルバッファから１フレーム分のサンプルデータを破棄する。 In step S205, the processor 110 discards one frame of sample data from the sample buffer.

ステップＳ２０６で、プロセッサ１１０は、透かし抽出の結果が、同期シンボルである'Ｓ'か否かを判定する。同期シンボルが抽出されれば（ステップＳ２０６−ＹＥＳ）ステップＳ２０７に進み、同期シンボルが抽出されなければ（ステップＳ２０６−ＮＯ）ステップＳ２０３に戻る。 In step S206, the processor 110 determines whether or not the result of watermark extraction is 'S', which is a synchronization symbol. If a synchronization symbol is extracted (step S206—YES), the process proceeds to step S207. If a synchronization symbol is not extracted (step S206—NO), the process returns to step S203.

ステップＳ２０７で、プロセッサ１１０は、タイマー１０８に設定されたサンプリングレート（３２ｋＨｚ）に応じて取得したサンプリングデータをサンプルバッファに格納し、１フレーム分のサンプルデータが格納されたか否かを判定する。１フレーム分のサンプルデータが格納された場合（ステップＳ２０７−ＹＥＳ）ステップＳ２０８に進み、１フレーム分のサンプルデータが格納されていない場合（ステップＳ２０７−ＮＯ）ステップＳ２０７に戻る。 In step S207, the processor 110 stores the sampling data acquired according to the sampling rate (32 kHz) set in the timer 108 in the sample buffer, and determines whether or not sample data for one frame has been stored. If sample data for one frame is stored (step S207—YES), the process proceeds to step S208. If sample data for one frame is not stored (step S207—NO), the process returns to step S207.

ステップＳ２０８で、プロセッサ１１０（抽出部２０４）は、１フレーム分のサンプルデータに対して透かし抽出処理を行う。 In step S208, the processor 110 (extraction unit 204) performs a watermark extraction process on the sample data for one frame.

ステップＳ２０９で、プロセッサ１１０は、透かし抽出の結果が、同期シンボルである'Ｓ'か否かを判定する。同期シンボルが抽出されれば（ステップＳ２０９−ＹＥＳ）ステップＳ２１０に進み、同期シンボルが抽出されなければ（ステップＳ２０９−ＮＯ）同期処理を終了する。 In step S209, the processor 110 determines whether or not the result of watermark extraction is 'S', which is a synchronization symbol. If a synchronization symbol is extracted (step S209—YES), the process proceeds to step S210. If a synchronization symbol is not extracted (step S209—NO), the synchronization process is terminated.

ステップＳ２１０で、プロセッサ１１０は、サンプルバッファから１フレーム分のサンプルデータを破棄する。 In step S210, the processor 110 discards one frame of sample data from the sample buffer.

つまり、プロセッサ１１０は、同期シンボルを１度抽出した場合は、同期フレームで有ることを意味するので、同期シンボル'Ｓ'の抽出が終了するまで抽出処理を実行する。同期シンボル'Ｓ'以外のシンボルが抽出された場合、同期フレームが終了し、データフレームが開始されたことを意味する。よって、同期処理が終了した後に、図９のステップＳ１０２に進む。 That is, when the synchronization symbol is extracted once, it means that it is a synchronization frame, and therefore the processor 110 executes the extraction process until the extraction of the synchronization symbol 'S' is completed. If a symbol other than the synchronization symbol 'S' is extracted, it means that the synchronization frame has ended and the data frame has started. Therefore, after the synchronization process is completed, the process proceeds to step S102 in FIG.

以上のように、過去の抽出結果の成否判定によって音処理装置１０は動的にＡＤＣ１０４のサンプリングレートを変更しながら透かし抽出を行うことができる。これにより、透かし抽出精度の劣化を抑えながらもサンプリングレートとプロセッサの動作率を削減することで、ＡＤＣ１０４とプロセッサ１１０の電力削減を実現する。 As described above, the sound processing apparatus 10 can perform watermark extraction while dynamically changing the sampling rate of the ADC 104 based on the success / failure determination of the past extraction results. Thus, the power consumption of the ADC 104 and the processor 110 can be reduced by reducing the sampling rate and the operation rate of the processor while suppressing the deterioration of the watermark extraction accuracy.

＜効果＞
音響処理においては、周波数分解能が高いほど高精度な処理が可能となるため、透かし抽出処理においてもＡＤＣのサンプリングレートが抽出精度に大きく影響する。しかしながら、一般的にＡＤＣの消費電力は、サンプリングレートに対して比例するため、高い抽出精度を要求するほど消費電力も高くなってしまう。 <Effect>
In the acoustic processing, the higher the frequency resolution, the higher the accuracy of processing is possible. Therefore, in the watermark extraction processing, the ADC sampling rate greatly affects the extraction accuracy. However, since the power consumption of the ADC is generally proportional to the sampling rate, the power consumption increases as the high extraction accuracy is required.

また、ケプストラム演算の負荷において、ＦＦＴは、サンプル数が多いほど計算時間が増大してしまう。例えば、音響信号の各区間におけるサンプル数が１０２４サンプルであった場合に対して、サンプル数が２０４８サンプルである場合を比較する。この場合、ＦＦＴの計算時間は、単純に見積もって２０４８×ｌｏｇ２（２０４８）／１０２４×ｌｏｇ２（１０２４）＝２．２倍となってしまう。 Further, in the load of cepstrum calculation, the calculation time of FFT increases as the number of samples increases. For example, the case where the number of samples is 2048 samples is compared with the case where the number of samples in each section of the acoustic signal is 1024 samples. In this case, the FFT calculation time is simply estimated to be 2048 × log 2 (2048) / 1024 × log 2 (1024) = 2.2 times.

音処理装置の処理性能は、動作周波数に対して比例するため、ＦＦＴの計算時間が支配的なケプストラム演算処理においても、２．２倍の時間が必要となる。例えば、１６ｋＨｚの音響信号において、区間のサンプル数が１０２４サンプルの時に、音処理装置のケプストラム抽出時間が２０ｍｓｅｃであると仮定する。この仮定において、抽出精度を上げるためにＡＤＣのサンプリングレートを２倍の３２ｋＨｚにしたとすると、概算で４４ｍｓｅｃの抽出時間が必要になる。 Since the processing performance of the sound processing device is proportional to the operating frequency, 2.2 times more time is required even in the cepstrum calculation processing in which the FFT calculation time is dominant. For example, it is assumed that the cepstrum extraction time of the sound processing apparatus is 20 msec when the number of samples in the section is 1024 samples in a 16 kHz acoustic signal. Under this assumption, if the ADC sampling rate is doubled to 32 kHz in order to increase the extraction accuracy, an approximate extraction time of 44 msec is required.

１６ｋＨｚ／１０２４サンプル、あるいは３２ｋＨｚ／２０４８サンプルの場合、各区間の幅が６４ｍｓｅｃであるため、１６ｋＨｚでは約６８（４４／６４）％、３２ｋＨｚでは約３１（２０／６４）％の時間だけ、音処理装置はアイドル状態になる。 In the case of 16 kHz / 1024 samples or 32 kHz / 2048 samples, since the width of each section is 64 msec, sound processing is performed for approximately 68 (44/64)% at 16 kHz and approximately 31 (20/64)% at 32 kHz. The device is idle.

音処理装置がアイドル状態の場合、音処理装置のクロック供給や電源供給を遮断することによって動作電力を極めて小さくすることができる。そのため、動作電力を可能な限り小さくするためには抽出精度を下げない範囲で、可能な限りＡＤＣのサンプリングレートを小さくすることが望ましい。 When the sound processing apparatus is in an idle state, the operating power can be extremely reduced by cutting off the clock supply and power supply of the sound processing apparatus. Therefore, in order to reduce the operating power as much as possible, it is desirable to reduce the ADC sampling rate as much as possible without reducing the extraction accuracy.

しかしながら、入力の音響信号から実際に透かしを抽出するまでは現在のＡＤＣのサンプリングレートが妥当であるかは判定できない。そのため、開示の技術では、時間的に近接した過去の抽出結果から動的にＡＤＣを変更することで抽出精度を保ったまま低電力化を図ることができる。 However, it cannot be determined whether the current ADC sampling rate is valid until the watermark is actually extracted from the input acoustic signal. Therefore, in the disclosed technique, it is possible to reduce power consumption while maintaining the extraction accuracy by dynamically changing the ADC from past extraction results that are close in time.

特に、透かしの抽出成功／失敗は曲調によって変化し、曲調は導入、間奏など、ある程度長い期間同じ傾向が連続する性質があるため、時間的に近い過去の区間の抽出結果を参照することで、適切なＡＤＣのサンプリングレートを決定することができる。 In particular, the success / failure of watermark extraction varies depending on the tune, and the tune has the property that the same tendency continues for a certain length of time, such as introduction and interlude, so by referring to the extraction results of past sections that are close in time, An appropriate ADC sampling rate can be determined.

また、録音した音響信号には、自然反響音や騒音など様々なノイズ成分が重畳されてしまう。そのため、ノイズ成分を含む状態での抽出結果は、信頼性が低いといえる。よって、判定部２４２は、人工エコーの位置に対応するケプストラムピークの値が閾値よりも小さかった場合は、ノイズの影響を受けたことによって人工エコーの強度が相対的に小さくなってしまったと判断する。この場合、判定部２４２は、抽出結果を信頼できないものと判断し、失敗と判定することができる。 In addition, various noise components such as natural reverberation and noise are superimposed on the recorded acoustic signal. Therefore, it can be said that the extraction result in a state including a noise component has low reliability. Therefore, when the value of the cepstrum peak corresponding to the position of the artificial echo is smaller than the threshold value, the determination unit 242 determines that the strength of the artificial echo has become relatively small due to the influence of noise. . In this case, the determination unit 242 determines that the extraction result is unreliable and can determine failure.

透かしの抽出精度は、曲調によって変化する場合がある。曲調は、導入、サビ、間奏などある程度長い期間連続する傾向がある。そのため、時間的に近い過去の区間の抽出結果を参照することで、適切なＡＤＣのサンプリングレートを決定することができる。 The extraction accuracy of the watermark may change depending on the tune. The tunes tend to continue for a long period of time, such as introduction, rust, and interlude. Therefore, an appropriate ADC sampling rate can be determined by referring to the extraction results of past intervals that are close in time.

よって、音処理装置は、時間的に近い過去Ｎ個の区間からの抽出結果の成否の値を記憶しておき、その中に含まれる失敗と成功の区間の個数から現在のＡＤＣのレートが所望の抽出結果に対して妥当か否かを判定することができる。 Therefore, the sound processing device stores the success / failure values of the extraction results from the past N intervals that are close in time, and the current ADC rate is desired from the number of failure and success intervals included therein. It can be determined whether or not the extraction result is valid.

もし、成功の区間が多かった場合、ＡＤＣの現在のサンプリングレートは高すぎる可能性があるため、ＡＤＣのサンプリングレートを下げることによって音処理装置の処理時間が削減され、動作電力が低減される。 If there are many successful sections, the current sampling rate of the ADC may be too high, so reducing the sampling rate of the ADC reduces the processing time of the sound processing device and reduces the operating power.

また、失敗の区間が多かった場合、ＡＤＣの現在のサンプリングレートが低すぎるので所望の抽出精度に達していない可能性があるため、音処理装置１０は、ＡＤＣのサンプリングレートを上げることによって抽出精度を向上させる。 When there are many failed sections, the current sampling rate of the ADC is too low and the desired extraction accuracy may not be reached. Therefore, the sound processing apparatus 10 increases the sampling accuracy by increasing the sampling rate of the ADC. To improve.

このように、音処理装置１０は、動的にＡＤＣのサンプリングレートを変更することで、抽出精度目標という制約の範囲内で、音処理装置の計算時間と動作電力とを最適化し、不要に高いサンプリングレートでの処理を抑制することができる。 In this way, the sound processing apparatus 10 dynamically changes the sampling rate of the ADC to optimize the calculation time and operating power of the sound processing apparatus within the limit of the extraction accuracy target, and is unnecessarily high. Processing at the sampling rate can be suppressed.

また、過去Ｎ個の区間の情報を用いる場合、ＡＤＣのサンプリングレートは、抽出精度の変化に可能な限り早く対応して変化させることが望ましい。開示の技術では、成功、あるいは失敗のフレームがＮよりも小さいＬ個連続した場合、無駄な動作電力の削減、あるいは抽出精度の向上のために、ＡＤＣのサンプリングレートを直ちに変更する。よって、Ｎ個の区間の成否情報が蓄積されるのを待たずに、より短い時間でＡＤＣのサンプリングレートを変更させることができる。上記の例では、Ｎは６であり、Ｌは３であるが、この例に限られない。 Further, when using information on the past N sections, it is desirable to change the ADC sampling rate in response to changes in extraction accuracy as soon as possible. In the disclosed technique, when L success or failure frames are consecutively smaller than N, the ADC sampling rate is immediately changed in order to reduce unnecessary operating power or improve extraction accuracy. Therefore, the ADC sampling rate can be changed in a shorter time without waiting for the success / failure information of N sections to be accumulated. In the above example, N is 6 and L is 3. However, the present invention is not limited to this example.

なお、Ｎを１とし、区間毎にＡＤＣのサンプリングレートを変更することも考えられる。この場合、データメモリ１１２ｂは必須の構成ではなく、レート制御部２０６は、判定部２４２による判定の都度、レート制御を行ってもよい。 It is also conceivable to set N to 1 and change the ADC sampling rate for each section. In this case, the data memory 112b is not an essential configuration, and the rate control unit 206 may perform rate control each time the determination unit 242 determines.

＜変形例＞
次に、変形例における音処理装置について説明する。 <Modification>
Next, a sound processing apparatus according to a modification will be described.

＜構成＞
図１１は、変形例における音処理装置３０の概略構成の一例を示す図である。図１１に示す音処理装置３０は、制御部３０２と、主記憶部３０４と、補助記憶部３０６と、通信部３０８と、記録媒体Ｉ／Ｆ部３１０とを少なくとも有する。各部は、バスを介して相互にデータ送受信可能に接続されている。 <Configuration>
FIG. 11 is a diagram illustrating an example of a schematic configuration of a sound processing device 30 according to a modification. The sound processing device 30 illustrated in FIG. 11 includes at least a control unit 302, a main storage unit 304, an auxiliary storage unit 306, a communication unit 308, and a recording medium I / F unit 310. Each unit is connected via a bus so that data can be transmitted / received to / from each other.

制御部３０２は、コンピュータの中で、各装置の制御やデータの演算、加工を行うＣＰＵ（Central Processing Unit）である。また、制御部３０２は、主記憶部３０４や補助記憶部３０６に記憶されたプログラムを実行する演算装置であり、通信部３０８や各記憶部からデータを受け取り、演算、加工した上で、出力部や各記憶部に出力する。 The control unit 302 is a CPU (Central Processing Unit) that performs control of each device, calculation of data, and processing in the computer. The control unit 302 is an arithmetic unit that executes a program stored in the main storage unit 304 or the auxiliary storage unit 306. The control unit 302 receives data from the communication unit 308 or each storage unit, calculates, processes, and outputs the data. And output to each storage unit.

また、制御部３０２は、例えば補助記憶部３０６に記憶される上記実施例のいずれかの処理を行うプログラムを実行することで、上述した処理を実行することができる。 In addition, the control unit 302 can execute the above-described processing by executing a program for performing any of the above-described processes stored in the auxiliary storage unit 306, for example.

主記憶部３０４は、ＲＯＭ（Read Only Memory）やＲＡＭ（Random Access Memory）などであり、制御部３０２が実行する基本ソフトウェアであるＯＳやアプリケーションソフトウェアなどのプログラムやデータを記憶又は一時保存する記憶装置である。 The main storage unit 304 is a ROM (Read Only Memory), a RAM (Random Access Memory), or the like, and a storage device that stores or temporarily stores programs and data such as an OS and application software that are basic software executed by the control unit 302 It is.

補助記憶部３０８は、ＨＤＤ（Hard Disk Drive）などであり、アプリケーションソフトウェアなどに関連するデータを記憶する記憶装置である。また、補助記憶部３０６は、記録媒体３１２などから取得されたプログラムを記憶しておいてもよい。 The auxiliary storage unit 308 is an HDD (Hard Disk Drive) or the like, and is a storage device that stores data related to application software or the like. The auxiliary storage unit 306 may store a program acquired from the recording medium 312 or the like.

通信部３０８は、有線又は無線で通信を行う。通信部３０８は、例えばサーバなどから送信されたプログラムを補助記憶部３０６に記憶するようにしてもよい。 The communication unit 308 performs wired or wireless communication. The communication unit 308 may store, for example, a program transmitted from a server or the like in the auxiliary storage unit 306.

記録媒体Ｉ／Ｆ（インターフェース）部３１０は、ＵＳＢ（Universal Serial Bus）などのデータ伝送路を介して接続された記録媒体３１２（例えば、フラッシュメモリなど）と音処理装置３０とのインターフェースである。 A recording medium I / F (interface) unit 310 is an interface between the sound processing apparatus 30 and a recording medium 312 (for example, a flash memory) connected via a data transmission path such as a USB (Universal Serial Bus).

また、記録媒体３１２に、所定のプログラムを格納し、この記録媒体３１２に格納されたプログラムは記録媒体Ｉ／Ｆ部３１０を介して音処理装置３０にインストールされる。インストールされた所定のプログラムは、音処理装置３０により実行可能となる。 In addition, a predetermined program is stored in the recording medium 312, and the program stored in the recording medium 312 is installed in the sound processing apparatus 30 via the recording medium I / F unit 310. The installed predetermined program can be executed by the sound processing device 30.

図３に示すプロセッサ１１０は、例えば制御部３０２により実現され、図３に示すメモリ１１２は、例えば主記憶部３０４や補助記憶部３０６などにより実現されうる。 The processor 110 illustrated in FIG. 3 is realized by, for example, the control unit 302, and the memory 112 illustrated in FIG. 3 can be realized by, for example, the main storage unit 304, the auxiliary storage unit 306, and the like.

また、透かし抽出プログラムを記録媒体３１２に記録し、このプログラムが記録された記録媒体３１２をコンピュータや携帯端末に読み取らせて、前述した処理を実現させることも可能である。 It is also possible to record the watermark extraction program on the recording medium 312 and cause the computer or portable terminal to read the recording medium 312 on which the program is recorded, thereby realizing the processing described above.

なお、記録媒体３１２は、ＣＤ−ＲＯＭ、フレキシブルディスク、光磁気ディスク等の様に情報を光学的，電気的或いは磁気的に記録する記録媒体、ＲＯＭ、フラッシュメモリ等の様に情報を電気的に記録する半導体メモリ等、様々なタイプの記録媒体を用いることができる。記録媒体３１２は、搬送波を含まない。 The recording medium 312 is a recording medium that records information optically, electrically, or magnetically, such as a CD-ROM, a flexible disk, or a magneto-optical disk, and information is electrically stored such as a ROM or flash memory. Various types of recording media such as a semiconductor memory for recording can be used. The recording medium 312 does not include a carrier wave.

なお、上記実施例や変形例で説明した音処理装置は、例えば、電車の車内アナウンスに、アナウンスと同じ内容を透かし信号で重畳し、耳の不自由な人が、透かし信号が示すアナウンスを表示部に表示する際に用いられる。これにより、音処理装置は、音に埋め込まれたアナウンスの内容や店舗情報などを知ることなどに適用することができる。 In addition, the sound processing apparatus described in the above-described embodiments and modifications, for example, superimposes the same content as the announcement on the in-car announcement of a train with a watermark signal, and a person with hearing disabilities displays the announcement indicated by the watermark signal. It is used when displaying on the screen. Thereby, the sound processing device can be applied to knowing the contents of the announcement embedded in the sound, store information, and the like.

以上、開示の音処理装置、音処理方法及びプログラムについて詳述したが、上記の実施例や変形例に限定されるものではなく、特許請求の範囲に記載された範囲内において、種々の変形及び変更が可能である。また、前述した実施例や変形例の構成要素を全部又は複数を組み合わせることも可能である。 The sound processing device, the sound processing method, and the program disclosed above have been described in detail. However, the present invention is not limited to the above-described embodiments and modification examples. It can be changed. It is also possible to combine all or a plurality of the constituent elements of the above-described embodiments and modifications.

なお、以上の実施例に関し、さらに以下の付記を開示する。
（付記）
透かし信号が重畳された音響信号を、所定のサンプリングレートでアナログ信号からデジタル信号に変換するＡＤ変換部と、
分割された前記デジタル信号の所定区間毎に、前記透かし信号の抽出の成否を判定する判定部と、
前記判定部による判定結果を記憶する記憶部と、
前記記憶部に記憶された判定結果に基づいて、前記所定のサンプリングレートを制御するレート制御部と、
を備える音処理装置。
（付記２）
前記レート制御部は、
前記記憶部に記憶された第１所定数の区間の成否の個数をカウントし、成功の個数が閾値以上であれば前記所定のサンプリングレートを下げ、失敗の個数が前記閾値以上であれば前記所定のサンプリングレートを上げる付記１記載の音処理装置。
（付記３）
前記レート制御部は、
前記記憶部に記憶された直近の第２所定数の連続区間の判定結果が、全て成功であれば前記所定のサンプリングレートを下げ、全て失敗であれば前記所定のサンプリングレートを上げる付記１又は２記載の音処理装置。
（付記４）
前記音響信号は、重畳されるエコーの遅延時間によって前記透かし信号が埋め込まれ、
前記判定部は、
前記所定区間のケプストラム解析結果により、前記遅延時間に対応するケプストラム係数と、該所定区間のケプストラム係数列の平均値と標準偏差とに基づく閾値とを比較することで、前記抽出の成否を判定する付記１乃至３いずれか一項に記載の音処理装置。
（付記５）
透かし信号が重畳された音響信号を、所定のサンプリングレートでＡＤ変換部によりアナログ信号からデジタル信号に変換し、
分割された前記デジタル信号の所定区間毎に、前記透かし信号の抽出の成否を判定し、
前記所定区間毎の判定結果を記憶部に記憶し、
前記記憶部に記憶された判定結果に基づいて、前記所定のサンプリングレートを制御する処理をコンピュータが実行する音処理方法。
（付記６）
透かし信号が重畳されたアナログ信号の音響信号に対し、所定のサンプリングレートで変換されたデジタル信号を取得し、
分割された前記デジタル信号の所定区間毎に、前記透かし信号の抽出の成否を判定し、
前記所定区間毎の判定結果を記憶部に記憶し、
前記記憶部に記憶された判定結果に基づいて、前記所定のサンプリングレートを制御する処理をコンピュータに実行させるプログラム。 In addition, the following additional remarks are disclosed regarding the above Example.
(Appendix)
An AD converter that converts the audio signal on which the watermark signal is superimposed, from an analog signal to a digital signal at a predetermined sampling rate;
A determination unit that determines success or failure of extraction of the watermark signal for each predetermined section of the divided digital signal;
A storage unit for storing a determination result by the determination unit;
A rate control unit that controls the predetermined sampling rate based on the determination result stored in the storage unit;
A sound processing apparatus comprising:
(Appendix 2)
The rate control unit
The number of successes / failures in the first predetermined number of sections stored in the storage unit is counted. If the number of successes is equal to or greater than a threshold value, the predetermined sampling rate is decreased, and if the number of failures is equal to or greater than the threshold value, the predetermined number is determined. The sound processing apparatus according to appendix 1, wherein the sampling rate is increased.
(Appendix 3)
The rate control unit
Additional remark 1 or 2 if the determination results of the second predetermined number of the most recent consecutive sections stored in the storage unit are all successful, the predetermined sampling rate is lowered, and if all are unsuccessful, the predetermined sampling rate is increased. The sound processing apparatus as described.
(Appendix 4)
The acoustic signal is embedded with the watermark signal by the delay time of the echo to be superimposed,
The determination unit
The success or failure of the extraction is determined by comparing a cepstrum coefficient corresponding to the delay time with a threshold value based on an average value and a standard deviation of the cepstrum coefficient sequence of the predetermined section based on the cepstrum analysis result of the predetermined section. The sound processing apparatus according to any one of appendices 1 to 3.
(Appendix 5)
The audio signal on which the watermark signal is superimposed is converted from an analog signal to a digital signal by an AD converter at a predetermined sampling rate,
For each predetermined section of the divided digital signal, determine the success or failure of extraction of the watermark signal,
The determination result for each predetermined section is stored in a storage unit,
A sound processing method in which a computer executes a process of controlling the predetermined sampling rate based on a determination result stored in the storage unit.
(Appendix 6)
Acquire a digital signal converted at a predetermined sampling rate for an analog acoustic signal with a watermark signal superimposed on it,
For each predetermined section of the divided digital signal, determine the success or failure of extraction of the watermark signal,
The determination result for each predetermined section is stored in a storage unit,
A program for causing a computer to execute processing for controlling the predetermined sampling rate based on a determination result stored in the storage unit.

１０音処理装置
１０２マイクセンサ
１０４ＡＤＣ
１０６バス
１０８タイマー
１１０プロセッサ
１１２メモリ
２０２取得部
２０４抽出部
２０６レート制御部
２４２判定部 10 sound processing apparatus 102 microphone sensor 104 ADC
106 Bus 108 Timer 110 Processor 112 Memory 202 Acquisition Unit 204 Extraction Unit 206 Rate Control Unit 242 Determination Unit

Claims

An AD converter that converts the audio signal on which the watermark signal is superimposed, from an analog signal to a digital signal at a predetermined sampling rate;
A determination unit that determines success or failure of extraction of the watermark signal for each predetermined section of the divided digital signal;
A storage unit for storing a determination result by the determination unit;
A rate control unit that controls the predetermined sampling rate based on the determination result stored in the storage unit;
Equipped with a,
The rate control unit
The number of successes / failures in the first predetermined number of sections stored in the storage unit is counted. If the number of successes is equal to or greater than a threshold value, the predetermined sampling rate is decreased, and if the number of failures is equal to or greater than the threshold value, the predetermined number is determined. A sound processing device that increases the sampling rate .

An AD converter that converts the audio signal on which the watermark signal is superimposed, from an analog signal to a digital signal at a predetermined sampling rate;
A determination unit that determines success or failure of extraction of the watermark signal for each predetermined section of the divided digital signal;
A storage unit for storing a determination result by the determination unit;
A rate control unit that controls the predetermined sampling rate based on the determination result stored in the storage unit;
With
The rate control unit
The determination result of the continuous interval of the most recent second predetermined number stored in the storage unit, if all successful lowering the predetermined sampling rate, if all failed to increase the predetermined sampling rate Ruoto processor .

The acoustic signal is embedded with the watermark signal by the delay time of the echo to be superimposed,
The determination unit
The success or failure of the extraction is determined based on a cepstrum coefficient corresponding to the delay time and a threshold value based on an average value and a standard deviation of the cepstrum coefficient sequence of the predetermined section based on a cepstrum analysis result of the predetermined section. Item 3. The sound processing apparatus according to Item 1 or 2 .

The audio signal on which the watermark signal is superimposed is converted from an analog signal to a digital signal by an AD converter at a predetermined sampling rate,
For each predetermined section of the divided digital signal, determine the success or failure of extraction of the watermark signal,
The determination result for each predetermined section is stored in a storage unit,
Based on the determination result stored in the storage unit, the predetermined sampling rate is controlled ,
The number of successes / failures in the first predetermined number of sections stored in the storage unit is counted. If the number of successes is equal to or greater than a threshold value, the predetermined sampling rate is decreased, and if the number of failures is equal to or greater than the threshold value, the predetermined number is determined. A sound processing method in which the computer executes a process of increasing the sampling rate of the sound.

The audio signal on which the watermark signal is superimposed is converted from an analog signal to a digital signal by an AD converter at a predetermined sampling rate,
For each predetermined section of the divided digital signal, determine the success or failure of extraction of the watermark signal,
The determination result for each predetermined section is stored in a storage unit,
Based on the determination result stored in the storage unit, the predetermined sampling rate is controlled,
If the determination results of the most recent second predetermined number of consecutive sections stored in the storage unit are all successful, the computer lowers the predetermined sampling rate, and if all the determinations fail, the computer increases the predetermined sampling rate. The sound processing method to perform.

Acquire a digital signal converted at a predetermined sampling rate for an analog acoustic signal with a watermark signal superimposed on it,
For each predetermined section of the divided digital signal, determine the success or failure of extraction of the watermark signal,
The determination result for each predetermined section is stored in a storage unit,
Based on the determination result stored in the storage unit, the predetermined sampling rate is controlled ,
The number of successes / failures in the first predetermined number of sections stored in the storage unit is counted. If the number of successes is equal to or greater than a threshold value, the predetermined sampling rate is decreased, and if the number of failures is equal to or greater than the threshold value, the predetermined number is determined. A program that causes a computer to execute a process to increase the sampling rate of a computer.

Acquire a digital signal converted at a predetermined sampling rate for an analog acoustic signal with a watermark signal superimposed on it,
For each predetermined section of the divided digital signal, determine the success or failure of extraction of the watermark signal,
The determination result for each predetermined section is stored in a storage unit,
Based on the determination result stored in the storage unit, the predetermined sampling rate is controlled,
If the determination results of the most recent second predetermined number of consecutive sections stored in the storage unit are all successful, the predetermined sampling rate is lowered, and if all are unsuccessful, the predetermined sampling rate is increased. The program to be executed.