JP2003263155A

JP2003263155A - Frequency analyzer and acoustic signal encoding device

Info

Publication number: JP2003263155A
Application number: JP2002064309A
Authority: JP
Inventors: Toshio Motegi; 敏雄茂出木
Original assignee: Dai Nippon Printing Co Ltd
Current assignee: Dai Nippon Printing Co Ltd
Priority date: 2002-03-08
Filing date: 2002-03-08
Publication date: 2003-09-19

Abstract

<P>PROBLEM TO BE SOLVED: To provide a frequency analyzer and an acoustic signal encoding device, capable of reducing a total period of time required for correlation calculation by removing overlapped correlation calculation in an overlapped area of unit sections. <P>SOLUTION: Unit sections each of which a reference unit for analysis are set up so that adjacent unit sections (dI, dI<SB>+1</SB>) on time series are mutually overlapped and the time series signals of respective unit sections are successively extracted as section signals. The correlation of the unit section dI<SB>+1</SB>is calculated by calculating the correlation of a signal located on the head part dh of the preceding section signal with a prescribed harmonic function, calculating the correlation of a signal located on the end part db of the current section signal with the prescribed harmonic signal, subtracting the correlation value calculated for the head part dh from the whole correlation value of the preceding section dI, and adding the correlation value calculated for the end part db to the whole correlation value of the preceding section dI. <P>COPYRIGHT: (C)2003,JPO

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、放送メディア（ラジ
オ、テレビ）、通信メディア（ＣＳ映像・音声配信、イ
ンターネット音楽配信、通信カラオケ）、パッケージメ
ディア（ＣＤ、ＭＤ、カセット、ビデオ、ＬＤ、ＣＤ−
ＲＯＭ、ゲームカセット、携帯音楽プレーヤ向け固体メ
モリ媒体）などで提供する各種オーディオコンテンツの
制作、並びに、音楽演奏録音信号から楽譜出版、通信カ
ラオケ配信用ＭＩＤＩデータ、演奏ガイド機能付き電子
楽器向け自動演奏データ、携帯電話・ＰＨＳ・ポケベル
などの着信メロディデータを自動的に作成する自動採譜
技術に関する。The present invention relates to broadcast media (radio, television), communication media (CS video / audio distribution, Internet music distribution, communication karaoke), package media (CD, MD, cassette, video, LD, CD). −
Production of various audio contents provided in ROM, game cassettes, solid-state memory media for portable music players, etc., as well as music performance recording signals, musical score publishing, MIDI data for communication karaoke distribution, and automatic performance data for electronic musical instruments with performance guide function. , Automatic music transcription technology for automatically creating ringing melody data for mobile phones, PHS, pagers, etc.

【０００２】[0002]

【従来の技術】音響信号に代表される時系列信号には、
その構成要素として複数の周期信号が含まれている。こ
のため、与えられた時系列信号にどのような周期信号が
含まれているかを解析する手法は、古くから知られてい
る。例えば、フーリエ解析は、与えられた時系列信号に
含まれる周波数成分を解析するための方法として広く利
用されている。2. Description of the Related Art A time series signal represented by an acoustic signal is
A plurality of periodic signals are included as its constituent elements. Therefore, a method of analyzing what kind of periodic signal is included in a given time series signal has been known for a long time. For example, Fourier analysis is widely used as a method for analyzing frequency components included in a given time series signal.

【０００３】このような時系列信号の周波数解析方法を
利用すれば、音響信号を符号化することも可能である。
コンピュータの普及により、原音となるアナログ音響信
号を所定のサンプリング周波数でサンプリングし、各サ
ンプリング時の信号強度を量子化してデジタルデータと
して取り込むことが容易にできるようになってきてお
り、こうして取り込んだデジタルデータに対してフーリ
エ解析などの手法を適用し、原音信号に含まれていた周
波数成分を抽出すれば、各周波数成分を示す符号によっ
て原音信号の符号化が可能になる。By using such a frequency analysis method for time series signals, it is possible to encode acoustic signals.
With the spread of computers, it has become easy to sample the analog sound signal that is the original sound at a predetermined sampling frequency, quantize the signal strength at each sampling, and capture it as digital data. If a method such as Fourier analysis is applied to the data and the frequency components included in the original sound signal are extracted, the original sound signal can be encoded by the code indicating each frequency component.

【０００４】また、電子楽器による楽器音を符号化しよ
うという発想から生まれたＭＩＤＩ（Musical Instrume
nt Digital Interface）規格も、パーソナルコンピュー
タの普及とともに盛んに利用されるようになってきてい
る。このＭＩＤＩ規格による符号データ（以下、ＭＩＤ
Ｉデータという）は、基本的には、楽器のどの鍵盤キー
を、どの程度の強さで弾いたか、という楽器演奏の操作
を記述したデータであり、このＭＩＤＩデータ自身に
は、実際の音の波形は含まれていない。そのため、実際
の音を再生する場合には、楽器音の波形を記憶したＭＩ
ＤＩ音源が別途必要になるが、その符号化効率の高さが
注目を集めており、ＭＩＤＩ規格による符号化および復
号化の技術は、現在、パーソナルコンピュータを用いて
楽器演奏、楽器練習、作曲などを行うソフトウェアに広
く採り入れられている。MIDI (Musical Instrume) was born from the idea of encoding musical instrument sounds by electronic musical instruments.
The nt Digital Interface) standard has also been actively used with the spread of personal computers. Code data according to this MIDI standard (hereinafter referred to as MID
Basically, the I data) is data that describes the operation of the musical instrument playing, such as which keyboard key of the musical instrument was played and with what strength. The MIDI data itself contains the actual sound. Waveform not included. Therefore, when reproducing the actual sound, the MI that stores the waveform of the instrument sound is stored.
Although a DI sound source is required separately, its high coding efficiency has been attracting attention, and the MIDI coding and decoding technology is currently used for musical instrument performance, musical instrument practice, composition, etc. using a personal computer. It is widely adopted in software that does.

【０００５】そこで、音響信号に代表される時系列信号
に対して、所定の手法で解析を行うことにより、その構
成要素となる周期信号を抽出し、抽出した周期信号をＭ
ＩＤＩデータを用いて符号化しようとする提案がなされ
ている。例えば、特開平１０−２４７０９９号公報、特
開平１１−７３１９９号公報、特開平１１−７３２００
号公報、特開平１１−９５７５３号公報、特開２０００
−９９００９号公報、特開２０００−９９０９２号公
報、特開２０００−９９０９３号公報、特開２０００−
２６１３２２号公報、特開２００１−５４５０号公報、
特開２００１−１４８６３３号公報には、任意の時系列
信号について、構成要素となる周波数を解析し、その解
析結果からＭＩＤＩデータを作成することができる種々
の方法が提案されている。Therefore, a time-series signal typified by an acoustic signal is analyzed by a predetermined method to extract a periodic signal which is a constituent element thereof, and the extracted periodic signal is M
Proposals have been made to encode using IDI data. For example, JP-A-10-247099, JP-A-11-73199, and JP-A-11-73200.
JP, JP-A-11-95753, JP, 2000
-99009, JP 2000-99092 A, JP 2000-99093 A, JP 2000-
261322, JP 2001-5450 A,
Japanese Unexamined Patent Application Publication No. 2001-148633 proposes various methods capable of analyzing a frequency as a constituent element of an arbitrary time series signal and creating MIDI data from the analysis result.

【０００６】上記公報に記載された発明では、短時間フ
ーリエ変換法もしくは一般化調和解析の手法を用いて時
系列信号の周波数解析を行ってきた。短時間フーリエ変
換法では、計算負荷は少ないが、周波数分解能が比較的
低く、一般化調和解析の手法では、周波数分解能は高い
が、計算負荷も大きいという問題があった。そこで、本
出願人は、特願２００２−９２２３号において、相関計
算テーブルを利用することにより、周波数分解能は高
く、計算負荷も少ない手法を提案した。In the invention described in the above publication, the frequency analysis of the time series signal has been performed by using the short-time Fourier transform method or the generalized harmonic analysis method. The short-time Fourier transform method has a small calculation load, but has a relatively low frequency resolution, and the generalized harmonic analysis method has a problem that the frequency resolution is high but the calculation load is large. Therefore, the applicant of the present application has proposed in Japanese Patent Application No. 2002-9223 a method that uses a correlation calculation table to have a high frequency resolution and a small calculation load.

【０００７】[0007]

【発明が解決しようとする課題】しかしながら、上記相
関計算テーブルを用いた手法では、隣接する単位区間を
重複させて設定しているため、時間分解能を高めるため
に、重複領域を大きくすると、相関計算を行う回数も増
大するという問題が生じる。However, in the method using the above correlation calculation table, the adjacent unit sections are set so as to overlap each other. Therefore, if the overlap region is increased to increase the time resolution, the correlation calculation is performed. There is a problem that the number of times of performing is also increased.

【０００８】上記のような点に鑑み、本発明は、単位区
間の重複領域における相関計算の重複をなくし、相関計
算にかかる総所要時間を削減することが可能な周波数解
析装置および音響信号の符号化装置を提供することを課
題とする。In view of the above points, the present invention eliminates the overlap of the correlation calculation in the overlapping area of the unit section and reduces the total time required for the correlation calculation, and the code of the acoustic signal. An object of the present invention is to provide an activation device.

【０００９】[0009]

【課題を解決するための手段】上記課題を解決するた
め、本発明では、与えられた時系列信号を時系列のスペ
クトルデータに変換する周波数解析装置として、解析を
行う基本単位である単位区間を、時系列上において隣接
する単位区間が互いに重複するように設定し、各単位区
間の時系列信号を順次抽出する区間信号抽出手段と、前
記区間信号抽出手段により直前に抽出された前区間信号
と新たに抽出された現区間信号との間で合致しない前区
間信号の先頭部に位置する信号に対して、所定の調和関
数との相関を算出する前区間先頭の相関算出手段と、前
区間信号と新たに抽出された現区間信号との間で合致し
ない現区間信号の後尾部に位置する信号に対して、所定
の調和関数との相関を算出する現区間後尾の相関算出手
段と、前区間全体の相関値に対して、前記前区間先頭の
相関算出手段により算出した値を減算し、前記現区間後
尾の相関算出手段により算出した値を加算することによ
り現区間全体の相関値を算出する相関合算手段と、前記
相関合算手段で得られた現区間全体の相関値を保持する
ための前区間相関値の記憶手段と、前記相関合算手段で
得られた現区間全体の相関値に基づいて所定の変換を行
い、各単位区間に対応するスペクトルデータを算出する
スペクトル算出手段により構成するようにしたことを特
徴とする。In order to solve the above-mentioned problems, in the present invention, as a frequency analysis device for converting a given time-series signal into time-series spectrum data, a unit section which is a basic unit for analysis is used. , Section signal extraction means for setting adjacent unit sections in time series so as to overlap each other and sequentially extracting time series signals of each unit section, and a previous section signal extracted immediately before by the section signal extraction means. Correlation calculation means at the beginning of the previous section for calculating the correlation with a predetermined harmonic function for the signal located at the beginning of the previous section signal that does not match the newly extracted current section signal, and the previous section signal And a newly-extracted current section signal, the current section tail correlation calculation means for calculating a correlation with a predetermined harmonic function for a signal located at the tail of the current section signal that does not match, and a previous section overall The correlation sum for calculating the correlation value of the entire current section by subtracting the value calculated by the correlation calculation section at the beginning of the preceding section and adding the value calculated by the correlation calculation section at the end of the current section to the function value Means, storage means for storing the correlation value of the previous section for holding the correlation value of the entire current section obtained by the correlation summing means, and a predetermined value based on the correlation value of the entire current section obtained by the correlation summing means It is characterized in that it is configured by a spectrum calculation means for performing conversion and calculating spectrum data corresponding to each unit section.

【００１０】本発明によれば、複数の単位区間を、時系
列に隣接する単位区間が互いに重複するように設定し
て、各単位区間と調和関数との相関値を算出することに
より、時系列信号の周波数解析を行う際に、各単位区間
の相関値の算出を、単位区間内の全区間信号と調和関数
との相関を求めるのでなく、直前の単位区間の相関値を
利用して、直前の単位区間の現単位区間と重複しない部
分の相関値を算出して減算すると共に現単位区間の直前
の単位区間と重複しない部分の相関値を算出して加算す
ることにより算出するようにしたので、単位区間の重複
領域における相関計算の重複をなくし、相関計算にかか
る総所要時間を削減することが可能となる。According to the present invention, a plurality of unit sections are set so that adjacent unit sections in the time series overlap with each other, and the correlation value between each unit section and the harmonic function is calculated to obtain the time series. When performing the frequency analysis of a signal, the correlation value of each unit section is calculated by using the correlation value of the immediately preceding unit section instead of calculating the correlation between all section signals in the unit section and the harmonic function. Since the correlation value of the part of the unit section that does not overlap with the current unit section is calculated and subtracted, and the correlation value of the part that does not overlap with the unit section immediately before the current unit section is calculated and added, the calculation is performed. It is possible to eliminate the overlap of the correlation calculation in the overlapping area of the unit section and reduce the total time required for the correlation calculation.

【００１１】[0011]

【発明の実施の形態】以下、本発明の実施形態について
図面を参照して詳細に説明する。（1.基本原理）はじめに、本発明に係る周波数の解析お
よび音響信号の符号化の基本原理を述べておく。この基
本原理は、前掲の各公報もしくは明細書に開示されてい
るので、ここではその概要のみを簡単に述べることにす
る。BEST MODE FOR CARRYING OUT THE INVENTION Embodiments of the present invention will be described below in detail with reference to the drawings. (1. Basic Principle) First, the basic principle of frequency analysis and audio signal coding according to the present invention will be described. Since this basic principle is disclosed in the above-mentioned publications or specifications, only the outline thereof will be briefly described here.

【００１２】図１（ａ）に示すように、時系列信号とし
てアナログ音響信号が与えられたものとする。図１の例
では、横軸に時間ｔ、縦軸に振幅（強度）をとって、こ
の音響信号を示している。ここでは、まずこのアナログ
音響信号を、デジタルの音響データとして取り込む処理
を行う。これは、従来の一般的なＰＣＭの手法を用い、
所定のサンプリング周波数でこのアナログ音響信号をサ
ンプリングし、振幅を所定の量子化ビット数を用いてデ
ジタルデータに変換する処理を行えば良い。As shown in FIG. 1A, it is assumed that an analog acoustic signal is given as a time series signal. In the example of FIG. 1, the horizontal axis represents time t, and the vertical axis represents amplitude (intensity) to show this acoustic signal. Here, first, a process of taking in the analog acoustic signal as digital acoustic data is performed. This uses the conventional general PCM method,
The analog acoustic signal may be sampled at a predetermined sampling frequency, and the amplitude may be converted into digital data by using a predetermined number of quantization bits.

【００１３】続いて、この解析対象となる音響信号の時
間軸上に、複数の単位区間を設定する。図１（ａ）に示
す例では、時間軸ｔ上に等間隔に６つの時刻ｔ１〜ｔ６
が定義され、これら各時刻を始点および終点とする５つ
の単位区間ｄ１〜ｄ５が設定されている。図１の例で
は、全て同一の区間長をもった単位区間が時間軸上で重
複せずに設定されているが、隣接する単位区間が時間軸
上で部分的に重なり合うような区間設定を行ってもかま
わない。特に、本発明においては、単位区間を重複する
ことを必須要件としている。Then, a plurality of unit sections are set on the time axis of the acoustic signal to be analyzed. In the example shown in FIG. 1A, six times t1 to t6 are equally spaced on the time axis t.
Is defined, and five unit sections d1 to d5 whose start point and end point are the respective time points are set. In the example of FIG. 1, all unit sections having the same section length are set without overlapping on the time axis, but the section setting is performed so that adjacent unit sections partially overlap on the time axis. It doesn't matter. Particularly, in the present invention, it is an essential requirement that the unit sections overlap.

【００１４】こうして単位区間が設定されたら、各単位
区間ごとの音響信号（以下、区間信号と呼ぶことにす
る）について、それぞれ代表周波数を選出する。各区間
信号には、通常、様々な周波数成分が含まれているが、
例えば、その中で成分の強度割合の大きな周波数成分を
代表周波数として選出すれば良い。ここで、代表周波数
とはいわゆる基本周波数が一般的であるが、音声のフォ
ルマント周波数などの倍音周波数や、ノイズ音源のピー
ク周波数も代表周波数として扱うことがある。代表周波
数は１つだけ選出しても良いが、音響信号によっては複
数の代表周波数を選出した方が、より精度の高い符号化
が可能になる。図１（ｂ）には、個々の単位区間ごとに
それぞれ３つの代表周波数を選出し、１つの代表周波数
を１つの代表符号（図では便宜上、音符として示してあ
る）として符号化した例が示されている。ここでは、代
表符号（音符）を収容するために３つのトラックＴ１，
Ｔ２，Ｔ３が設けられているが、これは個々の単位区間
ごとに選出された３つずつの代表符号を、それぞれ異な
るトラックに収容するためである。When the unit section is set in this way, a representative frequency is selected for each acoustic signal (hereinafter referred to as section signal) for each unit section. Each section signal usually contains various frequency components,
For example, a frequency component having a large intensity ratio of the components may be selected as the representative frequency. Here, the representative frequency is generally a so-called fundamental frequency, but a harmonic frequency such as a formant frequency of voice or a peak frequency of a noise sound source may be treated as a representative frequency. Although only one representative frequency may be selected, more accurate encoding becomes possible if a plurality of representative frequencies are selected depending on the acoustic signal. FIG. 1B shows an example in which three representative frequencies are selected for each unit section and one representative frequency is encoded as one representative code (in the figure, it is shown as a note for convenience). Has been done. Here, three tracks T1 for accommodating a representative code (note) are provided.
T2 and T3 are provided so that the three representative codes selected for each unit section are accommodated in different tracks.

【００１５】例えば、単位区間ｄ１について選出された
代表符号ｎ（ｄ１，１），ｎ（ｄ１，２），ｎ（ｄ１，
３）は、それぞれトラックＴ１，Ｔ２，Ｔ３に収容され
ている。ここで、各符号ｎ（ｄ１，１），ｎ（ｄ１，
２），ｎ（ｄ１，３）は、ＭＩＤＩ符号におけるノート
ナンバーを示す符号である。ＭＩＤＩ符号におけるノー
トナンバーは、０〜１２７までの１２８通りの値をと
り、それぞれピアノの鍵盤の１つのキーを示すことにな
る。具体的には、例えば、代表周波数として４４０Ｈｚ
が選出された場合、この周波数はノートナンバーｎ＝６
９（ピアノの鍵盤中央の「ラ音（Ａ３音）」に対応）に
相当するので、代表符号としては、ｎ＝６９が選出され
ることになる。もっとも、図１（ｂ）は、上述の方法に
よって得られる代表符号を音符の形式で示した概念図で
あり、実際には、各音符にはそれぞれ強度に関するデー
タも付加されている。例えば、トラックＴ１には、ノー
トナンバーｎ（ｄ１，１），ｎ（ｄ２，１）・・・とい
う音高を示すデータとともに、ｅ（ｄ１，１），ｅ（ｄ
２，１）・・・という強度を示すデータが収容されるこ
とになる。この強度を示すデータは、各代表周波数の成
分が、元の区間信号にどの程度の度合いで含まれていた
かによって決定される。具体的には、各代表周波数をも
った周期関数の区間信号に対する相関値に基づいて強度
を示すデータが決定されることになる。また、図１
（ｂ）に示す概念図では、音符の横方向の位置によっ
て、個々の単位区間の時間軸上での位置が示されている
が、実際には、この時間軸上での位置を正確に数値とし
て示すデータが各音符に付加されていることになる。For example, the representative codes n (d1,1), n (d1,2), n (d1,) selected for the unit section d1.
3) are housed in the tracks T1, T2, T3, respectively. Here, each code n (d1,1), n (d1,
2) and n (d1,3) are codes indicating note numbers in the MIDI code. The note number in the MIDI code takes 128 values from 0 to 127, and each indicates one key on the keyboard of the piano. Specifically, for example, the representative frequency is 440 Hz
Is selected, the frequency is note number n = 6
Since it corresponds to 9 (corresponding to "Ra sound (A3 sound)" at the center of the keyboard of the piano), n = 69 is selected as the representative code. However, FIG. 1B is a conceptual diagram showing the representative code obtained by the above-described method in the form of a musical note, and in fact, each musical note is also provided with data relating to its strength. For example, the track T1 includes note numbers n (d1,1), n (d2,1) ... Pitch data and e (d1,1), e (d
Data indicating the strength of 2, 1, 1 ... The data indicating this intensity is determined by the degree to which the component of each representative frequency is included in the original section signal. Specifically, the data indicating the intensity is determined based on the correlation value of the section signal of the periodic function having each representative frequency. Also, FIG.
In the conceptual diagram shown in (b), the position of each unit section on the time axis is shown by the position of the note in the lateral direction. The data shown as is added to each note.

【００１６】音響信号を符号化する形式としては、必ず
しもＭＩＤＩ形式を採用する必要はないが、この種の符
号化形式としてはＭＩＤＩ形式が最も普及しているた
め、実用上はＭＩＤＩ形式の符号データを用いるのが好
ましい。ＭＩＤＩ形式では、「ノートオン」データもし
くは「ノートオフ」データが、「デルタタイム」データ
を介在させながら存在する。「ノートオン」データは、
特定のノートナンバーＮとベロシティーＶを指定して特
定の音の演奏開始を指示するデータであり、「ノートオ
フ」データは、特定のノートナンバーＮとベロシティー
Ｖを指定して特定の音の演奏終了を指示するデータであ
る。また、「デルタタイム」データは、所定の時間間隔
を示すデータである。ベロシティーＶは、例えば、ピア
ノの鍵盤などを押し下げる速度（ノートオン時のベロシ
ティー）および鍵盤から指を離す速度（ノートオフ時の
ベロシティー）を示すパラメータであり、特定の音の演
奏開始操作もしくは演奏終了操作の強さを示すことにな
る。It is not always necessary to adopt the MIDI format as the format for encoding the acoustic signal, but since the MIDI format is the most popular as this type of encoding format, the MIDI format code data is practically used. Is preferably used. In the MIDI format, “note on” data or “note off” data exists with “delta time” data interposed. The "Note On" data is
The "note-off" data is data for instructing the start of playing a specific sound by designating a specific note number N and velocity V. The "note-off" data is a data of a specific note designated by a specific note number N and velocity V. This is data for instructing the end of performance. The "delta time" data is data indicating a predetermined time interval. Velocity V is a parameter indicating, for example, the speed at which the piano keyboard is pushed down (velocity at note-on) and the speed at which the finger is released from the keyboard (velocity at note-off), and operation to start playing a specific sound. Alternatively, it indicates the strength of the performance ending operation.

【００１７】前述の方法では、第ｉ番目の単位区間ｄｉ
について、代表符号としてＪ個のノートナンバーｎ（ｄ
ｉ，１），ｎ（ｄｉ，２），・・・，ｎ（ｄｉ，Ｊ）が
得られ、このそれぞれについて強度ｅ（ｄｉ，１），ｅ
（ｄｉ，２），・・・，ｅ（ｄｉ，Ｊ）が得られる。そ
こで、次のような手法により、ＭＩＤＩ形式の符号デー
タを作成することができる。まず、「ノートオン」デー
タもしくは「ノートオフ」データの中で記述するノート
ナンバーＮとしては、得られたノートナンバーｎ（ｄ
ｉ，１），ｎ（ｄｉ，２），・・・，ｎ（ｄｉ，Ｊ）を
そのまま用いれば良い。一方、「ノートオン」データも
しくは「ノートオフ」データの中で記述するベロシティ
ーＶとしては、得られた強度ｅ（ｄｉ，１），ｅ（ｄ
ｉ，２），・・・，ｅ（ｄｉ，Ｊ）を所定の方法で規格
化した値を用いれば良い。また、「デルタタイム」デー
タは、各単位区間の長さに応じて設定すれば良い。In the above method, the i-th unit section di
About J note numbers n (d
i, 1), n (di, 2), ..., N (di, J) are obtained for each of these intensities e (di, 1), e
(Di, 2), ..., E (di, J) are obtained. Therefore, the code data in the MIDI format can be created by the following method. First, as the note number N described in the “note on” data or the “note off” data, the obtained note number n (d
i, 1), n (di, 2), ..., N (di, J) may be used as they are. On the other hand, as the velocity V described in the “note-on” data or the “note-off” data, the obtained intensities e (di, 1), e (d
i, 2), ..., E (di, J) may be standardized by a predetermined method. The “delta time” data may be set according to the length of each unit section.

【００１８】（2.周期関数との相関を求める具体的な方
法）上述した基本原理に基づく方法では、区間信号に対
して、１つまたは複数の代表周波数が選出され、この代
表周波数をもった周期信号によって、当該区間信号が表
現されることになる。ここで、選出される代表周波数
は、文字どおり、当該単位区間内の信号成分を代表する
周波数である。この代表周波数を選出する具体的な方法
としては、短時間フーリエ変換を利用する方法、一般化
調和解析を利用する方法、相関計算テーブルを利用する
方法がある。このうち相関計算テーブルを利用する方法
は、本出願人が特願２００２−９２２３号において提案
した方法である。本発明は、相関計算テーブルを利用す
る方法を用いた場合に、特に効果が高いため、相関計算
テーブルを用いて周期関数との相関を求める具体的な方
法を述べておく。(2. Concrete Method for Obtaining Correlation with Periodic Function) In the method based on the above-mentioned basic principle, one or a plurality of representative frequencies are selected for the section signal and have the representative frequency. The section signal is represented by the periodic signal. Here, the selected representative frequency is literally a frequency representing the signal component in the unit section. As a specific method for selecting the representative frequency, there are a method using a short-time Fourier transform, a method using a generalized harmonic analysis, and a method using a correlation calculation table. Among these, the method of using the correlation calculation table is the method proposed by the applicant in Japanese Patent Application No. 2002-9223. Since the present invention is particularly effective when a method using a correlation calculation table is used, a specific method for obtaining a correlation with a periodic function using the correlation calculation table will be described.

【００１９】複数の周期関数として、図２に示すような
三角関数が用意されているものとする。これらの三角関
数は、同一周波数をもった正弦関数と余弦関数との対か
ら構成されており、１２８通りの標準周波数ｆ（０）〜
ｆ（１２７）のそれぞれについて、正弦関数および余弦
関数の対が定義されていることになる。ここでは、同一
の周波数をもった正弦関数および余弦関数からなる一対
の関数を、当該周波数についての周期関数として定義す
ることにする。すなわち、ある特定の周波数についての
周期関数は、一対の正弦関数および余弦関数によって構
成されることになる。このように、一対の正弦関数と余
弦関数とにより周期関数を定義するのは、信号に対する
周期関数の相関値を求める際に、相関値が位相の影響を
受ける事を考慮するためである。なお、図２に示す各三
角関数内の変数Ｆおよびｋは、区間信号Ｘについてのサ
ンプリング周波数Ｆおよびサンプル番号ｋに相当する変
数である。例えば、周波数ｆ（０）についての正弦波
は、ｓｉｎ（２πｆ（０）ｋ／Ｆ）で示され、任意のサ
ンプル番号ｋを与えると、区間信号を構成する第ｋ番目
のサンプルと同一時間位置における周期関数の振幅値が
得られる。ここでは、１２８通りの標準周波数ｆ（０）
〜ｆ（１２７）を以下に示す〔数式１〕で定義する。It is assumed that a trigonometric function as shown in FIG. 2 is prepared as a plurality of periodic functions. These trigonometric functions are composed of a pair of a sine function and a cosine function having the same frequency, and 128 standard frequencies f (0) to
For each of f (127), a pair of sine and cosine functions will be defined. Here, a pair of functions including a sine function and a cosine function having the same frequency will be defined as a periodic function for the frequency. That is, the periodic function for a specific frequency is composed of a pair of sine function and cosine function. Thus, the reason why the periodic function is defined by a pair of sine function and cosine function is to consider that the correlation value is influenced by the phase when the correlation value of the periodic function with respect to the signal is obtained. The variables F and k in each trigonometric function shown in FIG. 2 are variables corresponding to the sampling frequency F and the sample number k for the interval signal X. For example, a sine wave for the frequency f (0) is represented by sin (2πf (0) k / F), and given an arbitrary sample number k, the same time position as the kth sample forming the interval signal is given. The amplitude value of the periodic function at is obtained. Here, 128 standard frequencies f (0)
~ F (127) is defined by the following [Formula 1].

【００２０】〔数式１〕ｆ（ｎ）＝４４０×２^γ ⁽ⁿ⁾ γ（ｎ）＝（ｎ−６９）／１２ただし、ｎ＝０，１，２，・・・，１２７[Formula 1] f (n) = 440 × 2 ^γ ⁽ⁿ⁾ γ (n) = (n−69) / 12 where n = 0, 1, 2, ..., 127

【００２１】このような式によって標準周波数を定義し
ておくと、最終的にＭＩＤＩデータを用いた符号化を行
う際に便利である。なぜなら、このような定義によって
設定される１２８通りの標準周波数ｆ（０）〜ｆ（１２
７）は、等比級数をなす周波数値をとることになり、Ｍ
ＩＤＩデータで利用されるノートナンバーに対応した周
波数になるからである。したがって、図２に示す１２８
通りの標準周波数ｆ（０）〜ｆ（１２７）は、対数尺度
で示した周波数軸上に等間隔（ＭＩＤＩにおける半音単
位）に設定した周波数ということになる。Defining the standard frequency by such an equation is convenient when finally performing encoding using MIDI data. This is because there are 128 standard frequencies f (0) to f (12) set by such a definition.
7) is to take frequency values forming a geometric series, and M
This is because the frequency corresponds to the note number used in the IDI data. Therefore, 128 shown in FIG.
The standard frequencies f (0) to f (127) are the frequencies set at equal intervals (semitone unit in MIDI) on the frequency axis shown by the logarithmic scale.

【００２２】続いて、任意の区間の区間信号に対する各
周期関数の相関の求め方について、具体的な説明を行
う。例えば、図３に示すように、ある単位区間ｄについ
て区間信号Ｘが与えられていたとする。ここでは、区間
長Ｌをもった単位区間ｄについて、サンプリング周波数
Ｆでサンプリングが行なわれており、全部でｗ個のサン
プル値が得られているものとし、サンプル番号を図示の
ように、０，１，２，３，・・・，ｋ，・・・，ｗ−
２，ｗ−１とする（白丸で示す第ｗ番目のサンプルは、
右に隣接する次の単位区間の先頭に含まれるサンプルと
する）。この場合、任意のサンプル番号ｋについては、
Ｘ（ｋ）なる振幅値がデジタルデータとして与えられて
いることになる。ここで、ｗは以下の記述においても定
数のような記載をしているが、一般にはｎの値に応じて
変化させ、区間長Ｌを超えない範囲で最大となるＦ／ｆ
（ｎ）の整数倍の値に設定することが望ましい。通常、
図１に示したように所定の単位区間、すなわち短時間に
おいて解析する場合は、Ｘ（ｋ）に対して各サンプルご
とに中央の重みが１に近く、両端の重みが０に近くなる
ような窓関数Ｗ（ｋ）を乗じることが行われており、こ
れが短時間フーリエ変換法である。ただし、本発明で
は、後述するように、ある単位区間の解析を行う場合
に、直接その単位区間の相関値を計算せずに、その直前
の単位区間の相関値を基本にして、基本の相関値から、
直前の単位区間に特有の部分の相関値を減算し、重複し
ていない現単位区間の後尾部分の相関値を加算する処理
を行っている。このためには、窓関数Ｗ（ｋ）を乗じる
ことは好ましくなく、これが、本発明において、短時間
フーリエ変換法および一般化調和解析法を利用しない理
由である。Next, a concrete description will be given of how to obtain the correlation of each periodic function with respect to a section signal of an arbitrary section. For example, as shown in FIG. 3, it is assumed that the section signal X is given to a certain unit section d. Here, it is assumed that the unit section d having the section length L is sampled at the sampling frequency F and w sample values are obtained in total, and the sample number is 0, as shown in the figure. 1,2,3, ..., k, ..., w-
2, w-1 (the w-th sample shown by the white circle is
It shall be the sample included at the beginning of the next unit section adjacent to the right). In this case, for any sample number k,
This means that the amplitude value X (k) is given as digital data. Here, w is also described as a constant in the following description, but in general, it is changed according to the value of n and becomes maximum within a range not exceeding the section length L.
It is desirable to set the value to an integral multiple of (n). Normal,
As shown in FIG. 1, when the analysis is performed in a predetermined unit section, that is, in a short time, the central weight is close to 1 and the weights at both ends are close to 0 for each sample with respect to X (k). The window function W (k) is multiplied, which is the short-time Fourier transform method. However, in the present invention, as will be described later, in the case of analyzing a certain unit section, the correlation value of the unit section immediately before that is not directly calculated, but the basic correlation is calculated. From the value,
The correlation value of the portion peculiar to the immediately preceding unit section is subtracted, and the correlation value of the tail portion of the current unit section that does not overlap is added. For this purpose, it is not preferable to multiply by the window function W (k), which is the reason why the short time Fourier transform method and the generalized harmonic analysis method are not used in the present invention.

【００２３】このような区間信号Ｘに対して、第ｎ番目
の標準周波数ｆ（ｎ）をもった正弦関数Ｒｎとの相関値
を求める原理を示す。両者の相関値Ａ（ｎ）は、以下の
〔数式２〕によって定義することができる。The principle of obtaining the correlation value with the sine function Rn having the nth standard frequency f (n) for such a section signal X will be described. The correlation value A (n) between the two can be defined by the following [Formula 2].

【００２４】〔数式２〕Ａ(ｎ)＝(２／ｗ)Σ_k=0,w-1ｘ(ｋ) sin(２πｆ_nｋ／Ｆ) Ｂ(ｎ)＝(２／ｗ)Σ_k=0,w-1ｘ(ｋ) cos(２πｆ_nｋ／Ｆ) Ｅ(ｎ)＝｛Ａ(ｎ)²＋Ｂ(ｎ)²｝^1/2 [Formula 2] A (n) = (2 / w) Σ _{k = 0, w-1} x (k) sin (2πf _n k / F) B (n) = (2 / w) Σ _{k = 0, w-1} x (k) cos (2πf _n k / F) E (n) = {A (n) ² + B (n) ² } ^1/2

【００２５】上記〔数式２〕において、Ｘ（ｋ）は、図
３に示すように、区間信号Ｘにおけるサンプル番号ｋの
振幅値であり、ｓｉｎ（２πｆ_nｋ／Ｆ）は、時間軸上
での同位置における正弦関数Ｒｎの振幅値である。な
お、数式が繁雑になるのを避けるため、数式内ではｆ
（ｎ）をｆ_nと表現している。〔数式２〕の第１の演算
式は、単位区間ｄ内の全サンプル番号ｋ＝０〜ｗ−１の
次元について、それぞれ区間信号Ｘの振幅値と正弦関数
Ｒｎの振幅ベクトルの内積を求める式ということができ
る。In the above [Formula 2], X (k) is the amplitude value of the sample number k in the interval signal X, as shown in FIG. 3, and sin (2πf _n k / F) is on the time axis. It is the amplitude value of the sine function Rn at the same position of. In order to avoid complicated expressions, f
(N) is expressed as f _n . The first arithmetic expression of [Equation 2] is an expression for obtaining the inner product of the amplitude value of the interval signal X and the amplitude vector of the sine function Rn for the dimensions of all sample numbers k = 0 to w−1 in the unit interval d. Can be said.

【００２６】同様に、上記〔数式２〕の第２の演算式
は、区間信号Ｘと、第ｎ番目の標準周波数ｆ（ｎ）をも
った余弦関数との相関値を求める式であり、両者の相関
値はＢ（ｎ）で与えられる。なお、相関値Ａ（ｎ）を求
めるための第１の演算式も、相関値Ｂ（ｎ）を求めるた
めの第２の演算式も、最終的に２／ｗが乗ぜられている
が、これは相関値を規格化するためのものでり、前述の
とおりｗはｎに依存して変化させるのが一般的であるた
め、この係数もｎに依存する変数である。Similarly, the second arithmetic expression of the above [Equation 2] is an expression for obtaining the correlation value between the interval signal X and the cosine function having the nth standard frequency f (n). The correlation value of is given by B (n). Note that both the first arithmetic expression for obtaining the correlation value A (n) and the second arithmetic expression for obtaining the correlation value B (n) are finally multiplied by 2 / w. Is for normalizing the correlation value, and since w is generally changed depending on n as described above, this coefficient is also a variable depending on n.

【００２７】区間信号Ｘと標準周波数ｆ（ｎ）をもった
標準周期関数との相関実効値は、上記〔数式２〕の第３
の演算式に示すように、正弦関数との相関値Ａ（ｎ）と
余弦関数との相関値Ｂ（ｎ）との二乗和平方根のうち、
正の値であるＥ（ｎ）によって示すことができる。この
相関実効値の大きな標準周期関数の周波数を代表周波数
として選出すれば、この代表周波数を用いて区間信号Ｘ
を符号化することができる。The effective value of the correlation between the interval signal X and the standard periodic function having the standard frequency f (n) is the third value of the above [Formula 2].
As shown in the following equation, of the square root of the sum of squares of the correlation value A (n) with the sine function and the correlation value B (n) with the cosine function,
It can be indicated by a positive value, E (n). If the frequency of the standard periodic function having a large effective value of the correlation is selected as the representative frequency, the interval signal X
Can be encoded.

【００２８】すなわち、この相関値Ｅ（ｎ）が所定の基
準以上の大きさとなる１つまたは複数の標準周波数を代
表周波数として選出すれば良い。なお、ここで「相関値
Ｅ（ｎ）が所定の基準以上の大きさとなる」という選出
条件は、例えば、何らかの閾値を設定しておき、相関値
Ｅ（ｎ）がこの閾値を超えるような標準周波数ｆ（ｎ）
をすべて代表周波数として選出する、という絶対的な選
出条件を設定しても良いが、例えば、相関値Ｅ（ｎ）の
大きさの順にＱ番目までを選出する、というような相対
的な選出条件を設定しても良い。That is, one or a plurality of standard frequencies whose correlation value E (n) is greater than a predetermined standard may be selected as the representative frequency. The selection condition that “the correlation value E (n) is greater than or equal to a predetermined reference” is, for example, a threshold that is set in advance and the correlation value E (n) exceeds the threshold. Frequency f (n)
May be set as a representative frequency, but an absolute selection condition may be set. For example, relative selection conditions such as selecting up to the Qth in the order of the magnitude of the correlation value E (n). May be set.

【００２９】（2.1.相互相関テーブルを利用した手法）
設定された単位区間における区間信号と調和関数との相
関計算を行う手法としては、短時間フーリエ変換法と、
一般化調和解析を利用した手法が有名である。しかし、
短時間フーリエ変換法では周波数分解能が充分でなく、
短時間フーリエ変換法の問題点をこれを解決するための
一般化調和解析を利用した手法では、短時間フーリエ変
換法に比べて、周期関数である調和関数との相関演算回
数が桁違いに多いため、計算負荷が大きいという問題が
あった。そこで、本出願人は、特願２００２−９２２３
号において、相互相関テーブルを利用して周波数解析を
行う手法を提案した。この手法により、短時間フーリエ
変換法と同等な計算負荷で一般化調和解析と同等な周波
数分解能を実現することが可能であると共に、一般化調
和解析で問題になっていた、抽出される信号成分の精度
の向上を図ることが可能となる。この相互相関テーブル
を利用した手法を次に説明する。(2.1. Method using cross-correlation table)
As a method of calculating the correlation between the section signal and the harmonic function in the set unit section, a short-time Fourier transform method,
The method using generalized harmonic analysis is famous. But,
The short-time Fourier transform method does not have sufficient frequency resolution,
In the method using generalized harmonic analysis to solve the problems of the short-time Fourier transform method, the number of correlation operations with the harmonic function, which is a periodic function, is orders of magnitude higher than that of the short-time Fourier transform method. Therefore, there is a problem that the calculation load is large. Therefore, the present applicant has filed Japanese Patent Application No. 2002-9223.
In this issue, we proposed a method for frequency analysis using a cross-correlation table. With this method, it is possible to realize a frequency resolution equivalent to that of generalized harmonic analysis with a computational load equivalent to that of the short-time Fourier transform method, and the extracted signal components that have been a problem in generalized harmonic analysis. It is possible to improve the accuracy of. A method using this cross-correlation table will be described below.

【００３０】まず、上述のように、複数の標準周波数を
設定し、各標準周波数に対応する標準周期関数を調和関
数として準備する。このとき設定される標準周波数とし
ては、周波数解析の特性に合わせて任意に設定すること
ができるが、音響信号の符号化に利用するためには、図
２および〔数式１〕に示したように、ＭＩＤＩ規格のノ
ートナンバーｎに対応させて設定することが好ましい。First, as described above, a plurality of standard frequencies are set, and a standard periodic function corresponding to each standard frequency is prepared as a harmonic function. The standard frequency set at this time can be arbitrarily set according to the characteristics of the frequency analysis, but in order to use it for encoding the acoustic signal, as shown in FIG. 2 and [Equation 1], , MIDI standard note number n is preferably set.

【００３１】続いて、各調和関数同士の相関である相互
相関を全ての組合せに対して算出し、相互相関テーブル
を作成する。この際、周波数ｆ（ｍ）の調和関数の周波
数ｆ（ｎ）の調和関数に対する相互相関Ｒ(ｆ_m,ｆ_n)
は、以下の〔数式３〕により算出する。Then, the cross-correlation, which is the correlation between the harmonic functions, is calculated for all the combinations, and the cross-correlation table is created. At this time, the cross-correlation R (f _m , f _n ) of the harmonic function of the frequency f (m) with respect to the harmonic function of the frequency f ( _n )
Is calculated by the following [Formula 3].

【００３２】〔数式３〕Ａ(ｆ_m,ｆ_n)＝(２／Ｔ(ｎ))Σ_k=0,T(n)-1sin(２πｆ_mｋ
／Ｆ) sin(２πｆ_nｋ／Ｆ) Ｂ(ｆ_m,ｆ_n)＝(２／Ｔ(ｎ))Σ_k=0,T(n)-1sin(２πｆ_mｋ
／Ｆ) cos(２πｆ_nｋ／Ｆ) Ｒ(ｆ_m,ｆ_n)＝｛Ａ(ｆ_m,ｆ_n)²＋Ｂ(ｆ_m,ｆ_n)²｝^1/2 [Formula 3] A (f _m , f _n ) = (2 / T (n)) Σ _{k = 0, T (n) -1} sin (2πf _m k
_{/ F) sin (2πf n k} / F) B (f m, f n) = (2 / T (n)) Σ k = 0, T (n) -1 sin (2πf m k
/ F) cos (2πf _n k / F) R (f _m , f _n ) = {A (f _m , f _n ) ² + B (f _m , f _n ) ² } ^1/2

【００３３】上記〔数式３〕の第３式で算出される相互
相関Ｒ(ｆ_m,ｆ_n)は２次元の相互相関テーブルの１要素
を示す。図２に示したようにｍ、ｎがノートナンバーに
対応している場合、相互相関テーブルには、各ノートナ
ンバーｍに対応する１２８個のノートナンバーの相関値
が記録され、全部で１２８×１２８個の相関値が記録さ
れることになる。The cross-correlation R (f _m , f _n ) calculated by the third expression of the above [Expression 3] represents one element of the two-dimensional cross-correlation table. As shown in FIG. 2, when m and n correspond to note numbers, the correlation value of 128 note numbers corresponding to each note number m is recorded in the cross-correlation table, which is 128 × 128 in total. Correlation values will be recorded.

【００３４】相互相関テーブルの準備ができたら、解析
対象となる時系列信号の全区間に渡って単位区間を設定
し、設定された単位区間の時系列信号を区間信号として
抽出する。単位区間の設定は、図１（ａ）に示したのと
は異なり、隣接する単位区間が互いに重複するように設
定する。When the cross-correlation table is prepared, a unit section is set over the entire section of the time series signal to be analyzed, and the time series signal of the set unit section is extracted as a section signal. Differently from the case shown in FIG. 1A, the unit sections are set such that adjacent unit sections overlap each other.

【００３５】続いて、抽出した区間信号に対して、全調
和関数との相関計算を行う。例えば、図２に示したよう
なノートナンバーに対応して標準周波数を設定した場合
には、１２８個の調和関数との相関計算が行われる。こ
の段階での調和関数との相関計算は、下記の〔数式４〕
により行われる。すなわち、区間信号のうち、先頭か
ら、相関計算を行う調和関数の周期の整数倍で単位区間
長を超えない部分と、調和関数との相関を算出する。算
出された相関値は、各単位区間ごとに用意される信号相
関配列に格納される。ここでは、１つの区間信号に対し
ては、各調和関数との相関計算が行われるのは、この１
回だけとなり、相関計算の回数を抑えるのに貢献してい
る。この段階での標準周波数ｆ（ｎ）の調和関数と、区
間信号ｘ（ｋ）との相関値Ｅ(ｆ_n)は、以下の〔数式
４〕により算出される。Subsequently, the correlation calculation with the total harmonic function is performed on the extracted section signal. For example, when the standard frequency is set corresponding to the note number as shown in FIG. 2, correlation calculation with 128 harmonic functions is performed. The correlation calculation with the harmonic function at this stage is performed by the following [Formula 4].
Done by. That is, the correlation between the harmonic function and the portion of the interval signal from the beginning that is an integral multiple of the cycle of the harmonic function for which correlation calculation is performed and does not exceed the unit interval length is calculated. The calculated correlation value is stored in the signal correlation array prepared for each unit section. Here, the correlation calculation with each harmonic function is performed for one section signal is
Since it is only once, it contributes to suppressing the number of correlation calculations. The correlation value E (f _n ) between the harmonic function of the standard frequency f (n) and the interval signal x (k) at this stage is calculated by the following [Formula 4].

【００３６】〔数式４〕Ａ(ｆ_n)＝(２／Ｔ(ｎ))Σ_k=0,T(n)-1ｘ(ｋ) sin(２πｆ
_nｋ／Ｆ) Ｂ(ｆ_n)＝(２／Ｔ(ｎ))Σ_k=0,T(n)-1ｘ(ｋ) cos(２πｆ
_nｋ／Ｆ) Ｅ(ｆ_n)＝｛Ａ(ｆ_n)²＋Ｂ(ｆ_n)²｝^1/2 [Formula 4] A (f _n ) = (2 / T (n)) Σ _{k = 0, T (n) -1} x (k) sin (2πf
_n k / F) B (f _n ) = (2 / T (n)) Σ _{k = 0, T (n) -1} x (k) cos (2πf
_n k / F) E (f _n ) = {A (f _n ) ² + B (f _n ) ² } ^1/2

【００３７】この〔数式４〕は、〔数式２〕の相関計算
サンプル数ｗを相関計算時間Ｔ(ｎ)に置き替えただけ
で、実質的には上記と同等の式である。This [Equation 4] is substantially the same as the above, except that the correlation calculation sample number w in [Equation 2] is replaced by the correlation calculation time T (n).

【００３８】信号相関配列が得られたら、配列中の各要
素である相関値を、相互相関テーブルを利用して補正す
る。具体的には、標準周波数ｆ（ｎ）との相関値Ｅ
(ｆ_n)の補正値Ｅ´(ｆ_n)は、標準周波数ｆ（ｍ）との相
関値Ｅ(ｆ_m)、標準周波数ｆ（ｍ）の標準周波数ｆ
（ｎ）に対する相互相関Ｒ(ｆ_m,ｆ_n)、標準周波数ｆ
（ｍ）の自己相関Ｒ(ｆ_m,ｆ_m)を用いて、以下の〔数式
５〕により算出される。When the signal correlation array is obtained, the correlation value which is each element in the array is corrected using the cross correlation table. Specifically, the correlation value E with the standard frequency f (n)
correction value _{_{(f n) E'(f n}} ) , the correlation value between the standard frequency _{f (m) E (f m} ), the standard frequency f of the normal frequency f (m)
The cross-correlation R for _{(n) (f m, f} n), the standard frequency f
It is calculated by the following [Equation 5] using the autocorrelation R (f _m , f _m ) of (m).

【００３９】〔数式５〕Ｅ´(ｆ_n)＝Ｅ(ｆ_n)−Σ_m=0,N-1Ｅ(ｆ_m) Ｒ(ｆ_m,ｆ_n)
／Ｒ(ｆ_m,ｆ_m)[Equation 5] E ′ (f _n ) = E (f _n ) −Σ _{m = 0, N−1} E (f _m ) R (f _m , f _n ).
_{_{/ R (f m, f m}} )

【００４０】上記〔数式５〕により算出された補正値Ｅ
´(ｆ_n)は、相関配列中の標準周波数ｆ（ｎ）に対応す
る位置に格納され、以降は相関値Ｅ(ｆ_m)として他の補
正値Ｅ´(ｆ_n)の算出に利用される。このようにして、
設定された全標準周波数に対応する補正値Ｅ´(ｆ_n)を
算出する。このとき、ｎ＝０〜Ｎ−１のうち、どの相関
値Ｅ(ｆ_n)から補正していくかについては、基本的に
は、信号相関配列における相関値の初期値の大きさの順
に従う。こうしてＮ個の相関値が補正された信号相関配
列が得られる。ただし、この時点では配列内の要素のう
ち、負の値になっているものがある場合がある。その場
合は、その値を０にすることにより、信号相関配列の値
が全て０または正の値となるようにし、これを補正相関
配列とする。このように補正相関配列の値を０以上にす
るのは、相関値が負の値ということは基本的に有り得な
いので、現実的でない値を削除するためである。また、
負の値の要素を０にする処理を、信号相関配列中の全て
の要素が補正された後で行うのは、補正値Ｅ´(ｆ_n)が
負であった場合に、この補正値Ｅ´(ｆ_n)を〔数式５〕
に示したＥ(ｆ_m)として、他の補正値の算出に利用する
ためである。これにより、補正値が負であった場合は、
〔数式５〕の右辺のΣによる総和が減少し、結果として
補正前の相関値Ｅ(ｆ_n)に増加されるようになる。本発
明では、このようにして補正値が負であったとしても、
その値を変更せずにそのまま利用して他の要素の補正値
を求めるため、一般化調和解析のように、減算する含有
信号の順番により差分信号が変化し、得られる相関値が
異なるということがない。そのため、信号相関配列にお
ける相関値の初期値の大きさの順番に依存することな
く、補正を行うことが可能となる。Correction value E calculated by the above [Formula 5]
'(F _n) is stored in a position corresponding to the normal frequency f in the correlation sequence (n), and later is used to calculate the correlation value E (f _m) as another correction value E'(f _n) It In this way
A correction value E ′ (f _n ) corresponding to all set standard frequencies is calculated. At this time, which correlation value E (f _n ) is corrected from n = 0 to N−1 basically follows the order of the magnitude of the initial value of the correlation value in the signal correlation array. . In this way, a signal correlation array in which N correlation values are corrected is obtained. However, at this point, some of the elements in the array may have negative values. In that case, the value is set to 0 so that all the values in the signal correlation array become 0 or a positive value, and this is set as the correction correlation array. The reason why the value of the corrected correlation array is set to 0 or more in this way is to delete an unrealistic value, since a negative correlation value is basically impossible. Also,
The process of setting a negative value element to 0 is performed after all the elements in the signal correlation array are corrected, when the correction value E ′ (f _n ) is negative. ′ (F _n ) is [Formula 5]
This is because it is used for the calculation of other correction values as E (f _m ). As a result, if the correction value is negative,
The sum total of Σ on the right side of [Equation 5] decreases, and as a result, the correlation value E (f _n ) before correction increases. In the present invention, even if the correction value is negative in this way,
Since the correction values of other elements are obtained by directly using the values without changing them, the difference signal changes depending on the order of the included signals to be subtracted, and the obtained correlation value is different, as in generalized harmonic analysis. There is no. Therefore, the correction can be performed without depending on the order of the magnitude of the initial value of the correlation value in the signal correlation array.

【００４１】上記相関演算、および相関補正を設定され
た全単位区間に対して行うことにより、全単位区間にお
けるＮ個の周波数成分が得られる。By performing the above-mentioned correlation calculation and correlation correction for all the set unit intervals, N frequency components in all the unit intervals can be obtained.

【００４２】ここで、相互相関テーブルを用いた相関値
の補正の効果を図４を用いて概念的に説明する。図４に
おいて、横軸はノートナンバー（周波数）に対応してお
り、縦軸は信号強度あるいは相関強度に対応している。
ここで、単一音の音源の周波数解析を行う場合を考えて
みる。単一音の音源の原信号スペクトルは、図４（ａ）
に示すように、１つの周波数で表現される。この単一音
の周波数解析を行った場合、図４（ａ）に示すように１
つの周波数だけ抽出されれば、最も精度の高い周波数解
析が行われたことになる。ところが、この単一音に対し
て短時間フーリエ解析による周波数解析を行うと、図４
（ｂ）に示すように多数の周波数成分が抽出されること
になる。また、図４（ａ）に示した単一音に対して一般
化調和解析による周波数解析を行うと、図４（ｃ）に示
すように多数の周波数成分が抽出される。ただ、図４
（ｂ）と図４（ｃ）を比較するとわかるように、一般化
調和解析を利用した方が、抽出すべき周波数の相関強度
が他の周波数より大きな値となり、短時間フーリエ変換
を利用するよりも精度は高くなる。Here, the effect of correction of the correlation value using the cross-correlation table will be conceptually described with reference to FIG. In FIG. 4, the horizontal axis corresponds to the note number (frequency), and the vertical axis corresponds to the signal strength or the correlation strength.
Now, consider the case of performing frequency analysis of a single sound source. The original signal spectrum of a single-tone sound source is shown in FIG.
It is represented by one frequency as shown in. When frequency analysis of this single sound is performed, as shown in FIG.
If only one frequency is extracted, the most accurate frequency analysis has been performed. However, when frequency analysis by short-time Fourier analysis is performed on this single sound, the result shown in FIG.
As shown in (b), many frequency components will be extracted. Further, when frequency analysis by generalized harmonic analysis is performed on the single tone shown in FIG. 4A, a large number of frequency components are extracted as shown in FIG. 4C. However, Figure 4
As can be seen from a comparison between (b) and FIG. 4 (c), using the generalized harmonic analysis makes the correlation strength of the frequency to be extracted larger than the other frequencies, rather than using the short-time Fourier transform. Is also more accurate.

【００４３】この手法では、あらかじめ準備した相互相
関テーブルを利用して各ノートナンバーについて補正値
を求める。この補正値を図４（ｄ）に示す。図４（ｄ）
に示す補正値は、上記〔数式５〕の右辺の−Σ以降に対
応している。図４（ｄ）に示した補正値により図４
（ｂ）に示した相関強度を補正することにより、図４
（ｅ）に示すような目的とすべき、単一の周波数成分が
抽出される。In this method, a correction value is obtained for each note number using a cross-correlation table prepared in advance. This correction value is shown in FIG. Figure 4 (d)
The correction value shown in (4) corresponds to -Σ and after on the right side of the above [Formula 5]. The correction values shown in FIG.
By correcting the correlation strength shown in FIG.
A single frequency component to be the target as shown in (e) is extracted.

【００４４】以上のような処理により、各単位区間につ
いて、各周波数に対する強度値の集合である周波数群
（スペクトルデータ）が得られることになる。このよう
にして所定数の周波数群が選出されたら、この周波数群
の各周波数に対応する「音の高さを示す情報」、選出さ
れた各周波数の信号強度に対応する「音の強さを示す情
報」、当該単位区間の始点に対応する「音の発音開始時
刻を示す情報」、当該単位区間に後続する単位区間の始
点に対応する「音の発音終了時刻を示す情報」、の４つ
の情報を含む符号データ（これを音素データと呼ぶこと
にする）を作成すれば、当該単位区間内の区間信号Ｘを
所定数の符号データにより符号化することができる。符
号データとして、ＭＩＤＩデータを作成するのであれ
ば、「音の高さを示す情報」としてノートナンバーを用
い、「音の強さを示す情報」としてベロシティーを用
い、「音の発音開始時刻を示す情報」としてノートオン
時刻を用い、「音の発音終了時刻を示す情報」としてノ
ートオフ時刻を用いるようにすれば良い。By the above processing, a frequency group (spectral data), which is a set of intensity values for each frequency, is obtained for each unit section. When a predetermined number of frequency groups are selected in this way, "information indicating the pitch of the sound" corresponding to each frequency of this frequency group, "sound intensity corresponding to the signal strength of each selected frequency""Informationindicating","information indicating the sound production start time" corresponding to the start point of the unit section, and "information indicating sound production end time" corresponding to the start point of the unit section subsequent to the unit section. If code data including information (which will be referred to as phoneme data) is created, the section signal X in the unit section can be coded by a predetermined number of code data. If MIDI data is created as code data, note number is used as "information indicating pitch of tone", velocity is used as "information indicating intensity of tone", and "start time of sound generation" The note-on time may be used as the “information indicating” and the note-off time may be used as the “information indicating the sound production end time”.

【００４５】（3.1.本発明に係る周波数解析装置および
音響信号の符号化装置）以下、本発明に係る周波数解析
装置および音響信号の符号化装置について説明してい
く。図５は、本発明に係る周波数解析の概要を示すフロ
ーチャートである。まず、複数の標準周波数を設定し、
各標準周波数に対応する標準周期関数を調和関数として
準備する（ステップＳ１）。このとき設定される標準周
波数としては、周波数解析の特性に合わせて任意に設定
することができるが、音響信号の符号化に利用するため
には、図２および〔数式１〕に示したように、ＭＩＤＩ
規格のノートナンバーｎに対応させて設定することが好
ましい。(3.1. Frequency Analysis Apparatus and Acoustic Signal Coding Apparatus According to the Present Invention) The frequency analysis apparatus and acoustic signal coding apparatus according to the present invention will be described below. FIG. 5 is a flowchart showing an outline of frequency analysis according to the present invention. First, set multiple standard frequencies,
A standard periodic function corresponding to each standard frequency is prepared as a harmonic function (step S1). The standard frequency set at this time can be arbitrarily set according to the characteristics of the frequency analysis, but in order to use it for encoding the acoustic signal, as shown in FIG. 2 and [Equation 1], , MIDI
It is preferable to set in correspondence with the standard note number n.

【００４６】続いて、解析対象となる時系列信号の全区
間に渡って単位区間を設定し、設定された単位区間の時
系列信号を区間信号として抽出する（ステップＳ２）。
単位区間の設定は、図１（ａ）に示したのとは異なり、
隣接する単位区間が互いに重複するように行う。Subsequently, a unit section is set over the entire section of the time series signal to be analyzed, and the time series signal of the set unit section is extracted as a section signal (step S2).
The setting of the unit section is different from that shown in FIG.
The unit sections adjacent to each other are overlapped with each other.

【００４７】続いて、抽出した区間信号に対して、全調
和関数との相関計算を行う（ステップＳ３）。例えば、
図２に示したようなノートナンバーに対応して標準周波
数を設定した場合には、１２８個の調和関数との相関計
算が行われる。このステップＳ３における調和関数との
相関計算は、上記〔数式４〕を用いた手法で行われる。
すなわち、区間信号のうち、先頭から、相関計算を行う
調和関数の周期の整数倍で単位区間長を超えない部分
と、調和関数との相関を算出する。ただし、先頭の単位
区間と２番目以降の単位区間では、相関計算の方法が異
なる。先頭の単位区間については、上記〔数式４〕に従
って、単位区間全体に渡って相関計算を行うが、２番目
以降の単位区間では、既に計算した直前の単位区間の相
関値から重複部分の相関値を減じ、その単位区間内で直
前の単位区間と重複していない部分の相関値だけを新た
に計算して加算する処理を行っていく。Then, the correlation calculation with the total harmonic function is performed on the extracted section signal (step S3). For example,
When the standard frequency is set corresponding to the note number as shown in FIG. 2, correlation calculation with 128 harmonic functions is performed. The correlation calculation with the harmonic function in step S3 is performed by the method using the above [Formula 4].
That is, the correlation between the harmonic function and the portion of the interval signal from the beginning that is an integral multiple of the cycle of the harmonic function for which correlation calculation is performed and does not exceed the unit interval length is calculated. However, the correlation calculation method is different between the first unit section and the second and subsequent unit sections. For the first unit section, the correlation calculation is performed over the entire unit section according to the above [Formula 4], but in the second and subsequent unit sections, the correlation value of the overlapping portion is calculated from the correlation value of the immediately preceding unit section that has already been calculated. Is subtracted, and only the correlation value of a portion that does not overlap with the immediately preceding unit section in the unit section is newly calculated and added.

【００４８】図６を用いて直前の単位区間の相関値を用
いた、目的とする現単位区間の相関値の計算の考え方に
ついて説明する。ここでは、直前の単位区間を前区間、
目的とする現単位区間を、現区間もしくは後区間と呼ぶ
ことにする。図６において、図６（ａ）は、音響信号の
波形を示すものであり、図６（ｂ）は時系列の音響信号
に設定された固定長Ｌの単位区間ｄ_iおよび単位区間ｄ
_i+1の様子を示す図である。なお、図６（ｂ）におい
て、Ｄは、単位区間ｄ_iの開始時刻と単位区間ｄ_i+1の開
始時刻との差であり、全単位区間が固定長であるため、
単位区間ｄ_iの終了時刻と単位区間ｄ_i+1の終了時刻の差
もＤとなる。本明細書では、この時間的長さＤを更新区
間と呼ぶことにする。図６に示すように、固定長Ｌの単
位区間ｄ_i+ ₁の相関値を計算する場合、既に計算が終了
している固定長Ｌの単位区間ｄ_iの相関値を利用する。
具体的には、単位区間ｄ_i内の単位区間ｄ_i+1と重複して
いない先頭領域ｄｈにおける区間信号と調和関数との相
関値と、単位区間ｄ_i+1内の単位区間ｄ_iと重複していな
い後尾領域ｄｂにおける区間信号と調和関数との相関値
をそれぞれ計算し、単位区間ｄ_iの相関値から先頭領域
ｄｈの相関値を減算すると共に後尾領域ｄｂの相関値を
加算する。これにより単位区間ｄ_iと単位区間ｄ_i+1の重
複領域の相関値の計算をしなくて済むので、時系列信号
全体における相関計算の総負荷が少なくなる。The concept of calculation of the target correlation value of the current unit section using the correlation value of the immediately preceding unit section will be described with reference to FIG. Here, the immediately preceding unit section is the previous section,
The target current unit section is called a current section or a subsequent section. In FIG. 6, FIG. 6A shows the waveform of the acoustic signal, and FIG. 6B shows the unit section d _i and the unit section d of the fixed length L set in the time-series acoustic signal.
It is a figure which shows the mode of _{i + 1} . Since in FIG. 6 (b), D is the difference between the start time and the unit interval d _{i + 1} of the start time of the unit section d _i, all the unit interval is a fixed length,
The difference between the end time of the unit section d _{i and} the end time of the unit section d _{i + 1} is also D. In this specification, this temporal length D will be referred to as an update interval. As shown in FIG. 6, when the correlation value of the unit section d _{i +} ₁ of the fixed length L is calculated, the correlation value of the unit section d _i of the fixed length L for which the calculation has already been completed is used.
Specifically, the correlation value between the harmonic function and the interval signal in the head area dh does not overlap with the unit section d _{i + 1} in the unit section d _i, the unit section d _i of the unit section in d _{i + 1} The correlation values of the section signals and the harmonic functions in the non-overlapping tail region db are calculated, the correlation value of the head region dh is subtracted from the correlation value of the unit segment d _i , and the correlation value of the tail region db is added. As a result, it is not necessary to calculate the correlation value of the overlapping region of the unit section d _i and the unit section d _{i + 1} , and the total load of the correlation calculation for the entire time series signal is reduced.

【００４９】ここで、具体的に相関値を求める計算式を
用いて計算負荷の軽減について説明する。第ｉ番目の単
位区間における周波数ｆ（ｎ）に対する相関値Ｅｎ
（ｉ）は、上記〔数式２〕を基に、以下の〔数式６〕の
ように表現することができる。なお、相関値は、準備し
た調和関数の数だけ算出される。例えば、図２に示した
ように１２８個の調和関数を準備した場合には、１２８
個の相関値が算出されるが、ここでは、ある１つの標準
周波数による相関値Ｅｎ（ｉ）を代表して示す。Here, the reduction of the calculation load will be specifically described by using the calculation formula for obtaining the correlation value. Correlation value En for frequency f (n) in the i-th unit section
(I) can be expressed as the following [Equation 6] based on the above [Equation 2]. The correlation value is calculated by the number of prepared harmonic functions. For example, if 128 harmonic functions are prepared as shown in FIG.
Although the correlation value is calculated individually, here, the correlation value En (i) at a certain standard frequency is shown as a representative.

【００５０】〔数式６〕Ｅｎ（ｉ）＝｛（(２／Ｌ)Σ_k=1,Lｘ(ｋ) sin(２πｆ_n
ｋ／Ｆ)）²＋（(２／Ｌ)Σ_k=1,Lｘ(ｋ) cos(２πｆ_nｋ
／Ｆ)）²｝^1/2 [Equation 6] En (i) = {((2 / L) Σ _{k = 1, L} x (k) sin (2πf _n
k / F)) ² + ((2 / L) Σ _{k = 1, L} x (k) cos (2πf _n k
/ F)) ² } ^1/2

【００５１】上記〔数式６〕においては、説明の便宜
上、サンプル番号を図３に示した、０，１，２，３，・
・・，ｋ，・・・，ｗ−２，ｗ−１から１，２，３，・
・・，Ｌ−１，Ｌと置き換えている。すなわち、計算負
荷の削減量をわかりやすくするため、サンプル点の数を
固定長Ｌと同じ値としている。そして、Ｌは周期Ｆ／ｆ
（ｎ）の整数倍になるように与え、実際には周波数ｆ
（ｎ）に依存して変化する。In the above [Formula 6], for convenience of explanation, sample numbers 0, 1, 2, 3, ...
.., k, ..., w-2, w-1 to 1, 2, 3, ...
.., L-1 and L are replaced. That is, in order to make it easy to understand the reduction amount of the calculation load, the number of sample points is set to the same value as the fixed length L. And L is the period F / f
It is given as an integer multiple of (n), and the frequency f is actually
It changes depending on (n).

【００５２】ここで、計算量の変化をわかりやすくする
ために、上記〔数式６〕において、調和関数の正弦関数
と区間信号の各サンプル点における振幅値の乗算値の総
和である「(２／Ｌ)Σ_k=1,Lｘ(ｋ) sin(２πｆ_nｋ／
Ｆ)」をＳｎ（ｉ，１，Ｌ）、「(２／Ｌ)Σ_k=1,Lｘ(ｋ)
cos(２πｆ_nｋ／Ｆ)」をＣｎ（ｉ，１，Ｌ）と置き換
える。これにより、〔数式６〕は以下の〔数式７〕のよ
うに変形できる。Here, in order to make it easy to understand the change in the calculation amount, in the above [Equation 6], it is the sum of the product of the sine function of the harmonic function and the amplitude value at each sample point of the interval signal, "(2 / L) Σ _{k = 1, L} x (k) sin (2πf _n k /
F) ”is Sn (i, 1, L),“ (2 / L) Σ _{k = 1, L} x (k)
Replace “cos (2πf _n k / F)” with Cn (i, 1, L). Thereby, [Formula 6] can be transformed into [Formula 7] below.

【００５３】〔数式７〕Ｅｎ（ｉ）＝｛Ｓｎ（ｉ，１，Ｌ）²＋Ｃｎ（ｉ，１，
Ｌ）²｝^1/2 [Equation 7] En (i) = {Sn (i, 1, L) ² + Cn (i, 1,)
L) ² } ^1/2

【００５４】上記〔数式７〕において、Ｓｎ（ｉ，１，
Ｌ）は、調和関数の正弦関数と区間信号の相関をサンプ
ル点１からサンプル点Ｌまで計算したものであり、Ｃｎ
（ｉ，１，Ｌ）は、調和関数の余弦関数と区間信号の相
関をサンプル点１からサンプル点Ｌまで計算したもので
ある。すなわち、Ｓｎ（ｉ，１，Ｌ）、Ｃｎ（ｉ，１，
Ｌ）共にＬ点分の計算を行っている。これに対して、単
位区間ｄ_i+1の相関値Ｅｎ（ｉ＋１）は、以下の〔数式
８〕のように表現できる。In the above [Formula 7], Sn (i, 1,
L) is the correlation between the sine function of the harmonic function and the interval signal calculated from sample point 1 to sample point L, and Cn
(I, 1, L) is the correlation between the cosine function of the harmonic function and the interval signal calculated from sample point 1 to sample point L. That is, Sn (i, 1, L), Cn (i, 1, L)
L) Both are calculating for L points. On the other hand, the correlation value En (i + 1) of the unit section d _{i + 1} can be expressed as the following [Equation 8].

【００５５】〔数式８〕Ｅｎ（ｉ＋１）＝｛Ｓｎ（ｉ＋１，１，Ｌ）²＋Ｃｎ
（ｉ＋１，１，Ｌ）²｝^1/2 [Equation 8] En (i + 1) = {Sn (i + 1,1, L) ² + Cn
(I + 1,1, L) ² } ^1/2

【００５６】単位区間ｄ_i+1も区間長は単位区間ｄ_iと同
じＬであるため、サンプル点は単位区間の先頭をサンプ
ル点１、最後尾をサンプル点Ｌとしている。この式を詳
細に書き直すと、以下の〔数式９〕ようになる。Since the unit section d _{i + 1} has the same section length L as that of the unit section d _i , the sample points are the sample point 1 at the beginning of the unit section and the sample point L at the end. If this equation is rewritten in detail, the following [Equation 9] is obtained.

【００５７】〔数式９〕Ｅｎ（ｉ＋１）＝｛（(２／Ｌ)Σ_k=1,Lｘ(ｋ＋Ｄ) sin
(２πｆ_n(ｋ＋Ｄ)／Ｆ)） ²＋（(２／Ｌ)Σ_k=1,Lｘ(ｋ＋
Ｄ) cos(２πｆ_n(ｋ＋Ｄ)／Ｆ)）²｝^1/2 [Equation 9] En (i + 1) = {((2 / L) Σ_{k = 1, L}x (k + D) sin
(2πf_n(k + D) / F)) ²+ ((2 / L) Σ_{k = 1, L}x (k +
D) cos (2πf_n(k + D) / F))²}^1/2

【００５８】〔数式９〕は、〔数式６〕の「ｋ」を「ｋ
＋Ｄ」で置き換えたものとなっている。これは、実際に
は単位区間ｄ_iと単位区間ｄ_i+1では、その更新区間であ
るＤだけサンプル点が異なるので、区間信号ｘの値もそ
れに伴って変化するためである。区間信号と相関計算を
行う調和関数については、更新区間Ｄのオフセットを加
えないのが通常であるが、本実施形態では、前述の通り
Ｌを周期Ｆ／ｆ（ｎ）の整数倍に設定しているため、更
新区間Ｄのオフセットを加えても理論上はＥｎ（ｉ＋
１）の値は変化しない。本願では発明の構成上、単位区
間の更新区間に合わせて位相を変化させてやる必要があ
るため、更新区間Ｄのオフセットを加え、sin(２πｆ_n
ｋ／Ｆ)、 cos(２πｆ_nｋ／Ｆ)は、それぞれsin(２πｆ
_n(ｋ＋Ｄ)／Ｆ)、cos(２πｆ_n(ｋ＋Ｄ)／Ｆ)と置き換え
る。そうすると、〔数式９〕は、以下の〔数式１０〕の
ように書き換えることができる。In [Equation 9], "k" in [Equation 6] is replaced with "k".
It is replaced with "+ D". This is because the unit section d _i and the unit section d _{i + 1} actually have different sampling points by the update section D, so that the value of the section signal x also changes accordingly. Regarding the harmonic function for performing the correlation calculation with the interval signal, it is usual not to add the offset of the update interval D, but in this embodiment, L is set to an integral multiple of the period F / f (n) as described above. Therefore, theoretically, En (i +
The value of 1) does not change. In the present application, because of the configuration of the invention, it is necessary to change the phase in accordance with the update section of the unit section. Therefore, the offset of the update section D is added and sin (2πf _n
k / F) and cos (2πf _n k / F) are sin (2πf _n )
_n (k + D) / F) and cos (2πf _n (k + D) / F). Then, [Formula 9] can be rewritten as [Formula 10] below.

【００５９】〔数式１０〕Ｅｎ（ｉ＋１）＝［｛(２／Ｌ)Σ_k=1,Lｘ(ｋ) sin(２π
ｆ_nｋ／Ｆ)−(２／Ｄ)Σ _k=1,Dｘ(ｋ) sin(２πｆ_nｋ／
Ｆ)＋(２／Ｄ)Σ_k=L-D+1,Lｘ(ｋ＋Ｄ) sin(２πｆ _n(ｋ
＋Ｄ)／Ｆ)｝²＋｛(２／Ｌ)Σ_k=1,Lｘ(ｋ) cos(２πｆ_n
ｋ／Ｆ)−(２／Ｄ)Σ_k=1,Dｘ(ｋ) cos(２πｆ_nｋ／Ｆ)
＋(２／Ｄ)Σ_k=L-D+1,Lｘ(ｋ＋Ｄ) cos(２πｆ_n(ｋ＋
Ｄ)／Ｆ)｝²］^1/2 [Formula 10] En (i + 1) = [{(2 / L) Σ_{k = 1, L}x (k) sin (2π
f_nk / F)-(2 / D) Σ _{k = 1, D}x (k) sin (2πf_nk /
F) + (2 / D) Σ_{k = L-D + 1, L}x (k + D) sin (2πf _n(k
+ D) / F)}²+ {(2 / L) Σ_{k = 1, L}x (k) cos (2πf_n
k / F)-(2 / D) Σ_{k = 1, D}x (k) cos (2πf_nk / F)
+ (2 / D) Σ_{k = L-D + 1, L}x (k + D) cos (2πf_n(k +
D) / F)}²]^1/2

【００６０】〔数式９〕から〔数式１０〕への変換は、
単位区間ｄ_i+1の先頭を基準として割り当てたサンプル
点１からサンプル点Ｌを、単位区間ｄ_iの先頭を基準と
して置き換え、その際、単位区間ｄ_iの後尾領域ｄｂに
おける相関値を位相Ｄだけずらして行っていることを示
す。さらに、〔数式１０〕は、〔数式６〕から〔数式
７〕への変換と同様に、以下の〔数式１１〕に示すよう
に書き換えることができる。The conversion from [Equation 9] into [Equation 10] is
The sample points 1 to L assigned with the head of the unit section d _{i + 1} as a reference are replaced with the head of the unit section d _i as a reference, and the correlation value in the tail region db of the unit section d _i is phase D Show that you are just staggering. Further, [Formula 10] can be rewritten as shown in [Formula 11] below, similarly to the conversion from [Formula 6] to [Formula 7].

【００６１】〔数式１１〕Ｅｎ（ｉ＋１）＝［｛Ｓｎ（ｉ，１，Ｌ）−Ｓｎ（ｉ，
１，Ｄ）＋Ｓｎ（ｉ＋１，Ｌ−Ｄ＋１，Ｌ）｝²＋｛Ｃ
ｎ（ｉ，１，Ｌ）−Ｃｎ（ｉ，１，Ｄ）＋Ｃｎ（ｉ＋
１，Ｌ−Ｄ＋１，Ｌ）｝²］^1/2 [Equation 11] En (i + 1) = [{Sn (i, 1, L) -Sn (i,
1, D) + Sn (i + 1, L-D + 1, L)} ² + {C
n (i, 1, L) -Cn (i, 1, D) + Cn (i +
1, L-D + 1, L)} ² ] ^1/2

【００６２】上記〔数式１１〕のうち、Ｓｎ（ｉ，１，
Ｌ）、Ｃｎ（ｉ，１，Ｌ）については、〔数式７〕の算
出の際に既に求められているため、計算する必要はな
く、新たに算出が必要なのは、Ｓｎ（ｉ，１，Ｄ）、Ｓ
ｎ（ｉ＋１，Ｌ−Ｄ＋１，Ｌ）、Ｃｎ（ｉ，１，Ｄ）、
Ｃｎ（ｉ＋１，Ｌ−Ｄ＋１，Ｌ）となる。Ｓｎ（ｉ，
１，Ｌ）、Ｃｎ（ｉ，１，Ｌ）はサンプル点１からサン
プル点ＬまでのＬ個のサンプル点における区間信号と調
和関数の振幅値の乗算を行っているが、Ｓｎ（ｉ，１，
Ｄ）、Ｃｎ（ｉ，１，Ｄ）はサンプル点１からサンプル
点ＤまでのＤ個のサンプル点、Ｓｎ（ｉ＋１，Ｌ−Ｄ＋
１，Ｌ）、Ｃｎ（ｉ＋１，Ｌ−Ｄ＋１，Ｌ）も、サンプ
ル点Ｌ−Ｄ＋１からサンプル点ＬまでのＤ個のサンプル
点における区間信号と調和関数の振幅値の乗算を行うこ
とになる。In the above [Formula 11], Sn (i, 1,
L) and Cn (i, 1, L) have already been obtained when calculating [Equation 7], and therefore need not be calculated. Sn (i, 1, D) is newly calculated. ), S
n (i + 1, L-D + 1, L), Cn (i, 1, D),
It becomes Cn (i + 1, L-D + 1, L). Sn (i,
1, L) and Cn (i, 1, L) multiply the interval signal at the L sample points from sample point 1 to sample point L by the amplitude value of the harmonic function, but Sn (i, 1) ，
D) and Cn (i, 1, D) are D sample points from sample point 1 to sample point D, and Sn (i + 1, L-D +).
1, L) and Cn (i + 1, L-D + 1, L), the section signals at the D sample points from the sample point L-D + 1 to the sample point L are also multiplied by the amplitude value of the harmonic function.

【００６３】このことからわかるように単位区間ｄ_i+1
における計算量は、そのまま計算を行った場合は、Ｌ個
のサンプル点における計算をしなければならないが、直
前の単位区間ｄ_iで求めた相関値を利用した場合、２Ｄ
個のサンプル点における計算をすれば良いことになる。
すなわち、計算量は２Ｄ／Ｌとなる。これは、時間分解
能を高めるために隣接する単位区間の重複領域を大きく
すればするほど、すなわち更新区間Ｄを小さくすればす
るほど、かかっていた計算量を大幅に削減することが可
能になることを示している。As can be seen from this, the unit section d _{i + 1}
When the calculation is performed as it is, the calculation at L sample points must be performed. However, when the correlation value obtained in the immediately preceding unit section d _i is used, 2D
It suffices to perform the calculation at each sample point.
That is, the calculation amount is 2D / L. This means that the larger the overlapping area of the adjacent unit sections in order to improve the time resolution, that is, the smaller the update section D, the more the amount of calculation required can be significantly reduced. Is shown.

【００６４】単位区間ごとに算出された相関値は、各単
位区間ごとに用意された信号相関配列に格納されること
になる。同様にして、全単位区間について相関計算を行
い、各単位区間ごとの相関値が得られたら、上記2.1.の
項で説明したように、相互相関テーブルを利用して信号
相関配列の値を補正する（ステップＳ４）。The correlation value calculated for each unit section is stored in the signal correlation array prepared for each unit section. Similarly, perform correlation calculation for all unit intervals, and when the correlation value for each unit interval is obtained, correct the value of the signal correlation array using the cross-correlation table as described in section 2.1. Yes (step S4).

【００６５】ステップＳ３における相関計算、ステップ
Ｓ４における相関補正をステップＳ３において設定され
た全単位区間に対して行うことにより、全単位区間にお
けるＮ個の周波数成分が得られる。すなわち、全単位区
間における周波数のスペクトルデータが得られることに
なる。By performing the correlation calculation in step S3 and the correlation correction in step S4 for all the unit intervals set in step S3, N frequency components in all the unit intervals can be obtained. That is, the spectrum data of the frequencies in all the unit sections can be obtained.

【００６６】（3.2.音響信号の符号化）以上のようにし
て時系列信号の周波数解析が行われ、各単位区間につい
て含有信号がＮ個抽出される。時系列信号として音響信
号を採用し、音響信号の符号化を行う場合には、標準周
波数ｆ（ｎ）を図２に示したようにＭＩＤＩのノートナ
ンバー、すなわち半音単位の音高の間隔で設定し、各ノ
ートナンバーに対応するＮ（＝１２８）個の周波数成分
が得られる。そして、上述のように周波数成分の周波数
をノートナンバー、相関値をベロシティ、単位区間の始
点をノートオン時刻、後続する単位区間の始点をノート
オフ時刻とするＭＩＤＩデータへの変換を行うことによ
り、音響信号が符号化される。(3.2. Coding of acoustic signal) The frequency analysis of the time-series signal is performed as described above, and N contained signals are extracted for each unit section. When an acoustic signal is adopted as the time-series signal and the acoustic signal is encoded, the standard frequency f (n) is set at MIDI note numbers, that is, pitch intervals in semitone units, as shown in FIG. Then, N (= 128) frequency components corresponding to each note number are obtained. Then, as described above, by performing conversion into MIDI data in which the frequency of the frequency component is the note number, the correlation value is the velocity, the start point of the unit section is the note-on time, and the start point of the following unit section is the note-off time, The audio signal is encoded.

【００６７】（3.3.周波数の設定について）上記実施形
態においては、抽出すべき周波数を、ＭＩＤＩ規格のノ
ートナンバーｎに対応させた標準周波数として〔数式
１〕のように設定したが、実際には、さらに細かい間隔
で設定しないと、精度の高い検出を行うことができな
い。その理由は、周波数の設定が、上記〔数式１〕のよ
うに、ノートナンバーに比例した対数的な間隔になって
いるため、周波数が高くなるほど計算間隔が粗くなり、
周波数成分が見落とされる確率が増大するためである。
時系列信号として音響信号を解析し、符号化を行う場合
には、〔数式１〕のようにノートナンバーに対応した間
隔で設定する必要がある。そして、隣接する標準周波数
間に設定する細かい周波数は〔数式１〕のように対数的
な間隔にとることが望ましい。例えば、各ノートナンバ
ー間に１２個の周波数を、それぞれノートナンバーの１
／１３間隔となるように設定する。すなわち、各標準周
波数間の等比級数的な間隔で周波数を設定することにな
る。具体的には、標準周波数ｆ（ｎ）と標準周波数ｆ
（ｎ＋１）の間には、周波数ｆ（ｎ＋ｍ／Ｍ）が設定さ
れることになる。ここで、Ｍは０以上の整数、ｍは０〜
Ｍ−１の値をとる整数である。この場合、等比級数的な
間隔で周波数を設定すると、各周波数は、以下の〔数式
１２〕で表現される。(3.3. Regarding Frequency Setting) In the above embodiment, the frequency to be extracted is set as the standard frequency corresponding to the MIDI standard note number n as in [Equation 1]. If the intervals are not set finer, highly accurate detection cannot be performed. The reason is that the frequency setting is a logarithmic interval proportional to the note number, as in the above [Formula 1]. Therefore, the higher the frequency, the coarser the calculation interval,
This is because the probability of missing a frequency component increases.
When an acoustic signal is analyzed as a time-series signal and encoded, it is necessary to set at intervals corresponding to note numbers as in [Equation 1]. Then, it is desirable that the fine frequencies set between the adjacent standard frequencies have logarithmic intervals as in [Equation 1]. For example, 12 frequencies between each note number, 1 for each note number
/ 13 interval is set. That is, the frequencies are set at geometric intervals between the standard frequencies. Specifically, the standard frequency f (n) and the standard frequency f
The frequency f (n + m / M) is set between (n + 1). Here, M is an integer of 0 or more, m is 0
It is an integer that takes the value of M-1. In this case, if frequencies are set at geometrical intervals, each frequency is expressed by the following [Equation 12].

【００６８】〔数式１２〕ｆ（ｎ）＝４４０×２^(n-69)/12 ｆ（ｎ＋ｍ／Ｍ）＝４４０×２^{(n+m/M-69)/12} ｆ（ｎ＋１）＝４４０×２^(n+1-69)/12 ただし、ｎ＝０，１，２，・・・，１２７、ｍ＝０，１，
２，・・・，Ｍ−１[Equation 12] f (n) = 440 × 2 ^{(n-69) / 12} f (n + m / M) = 440 × 2 ^{(n + m / M-69) / 12} f (n + 1) = 440 × 2 ^{(n + 1-69) / 12} However, n = 0,1,2, ..., 127, m = 0,1,
2, ..., M-1

【００６９】例えば、上述のように各ノートナンバー間
に１２個の周波数を、それぞれノートナンバーの１／１
３間隔となるように設定した場合は、〔数式１２〕にお
いてＭ＝１３とした場合に該当する。このような設定を
行った場合、周波数は全部で１２８×１３個設定される
ことになり、相関値の算出精度は向上されることにな
る。音響信号への符号化の際には、このように細かい周
波数設定に対応する周波数が存在しないため、最大とな
る相関値を選別して最も近いノートナンバーに対応する
強度（ベロシティ）成分として与える。For example, as described above, twelve frequencies are provided between note numbers, each of which is 1/1 of the note number.
The setting of 3 intervals corresponds to the case where M = 13 in [Equation 12]. When such a setting is made, the frequency is set to 128 × 13 in total, and the calculation accuracy of the correlation value is improved. Since there is no frequency corresponding to such a fine frequency setting when encoding into an audio signal, the maximum correlation value is selected and given as an intensity (velocity) component corresponding to the closest note number.

【００７０】しかし、上述のように細かい間隔で周波数
を設定すると当然のことながら、相関計算にかかる計算
負荷は膨大なものになる。そして、計算時間を費やして
算出した個々の細かい周波数に対応する相関値は選別さ
れて、そのほとんどが使用されない。すなわち、個々の
細かい周波数に対応する相関値の大小関係は重要である
が、絶対的な精度は必要としない。そこで、本発明にお
いては、標準周波数間に実際に細かい間隔で周波数を設
定して、設定した各周波数と区間信号との相関計算を一
々行うのでなく、隣接する標準周波数の相関値を基に各
標準周波数間における細かい周波数に対応する相関値を
推定し、各標準周波数に対応する最大となる細かい周波
数の相関値を探索する処理を行う。例えば、周波数ｆ
（ｎ＋ｍ／Ｍ）における相関値Ｅ_n+m/Mは、以下の〔数
式１３〕により算出される。However, if the frequencies are set at fine intervals as described above, it goes without saying that the calculation load for the correlation calculation becomes enormous. Then, the correlation values corresponding to the individual fine frequencies calculated by spending the calculation time are selected, and most of them are not used. That is, the magnitude relation of the correlation values corresponding to each fine frequency is important, but absolute accuracy is not required. Therefore, in the present invention, the frequencies are actually set at fine intervals between the standard frequencies, and the correlation calculation between each set frequency and the section signal is not performed one by one, but each frequency is set based on the correlation value of the adjacent standard frequency. A process of estimating a correlation value corresponding to a fine frequency between standard frequencies and searching for a correlation value of a maximum fine frequency corresponding to each standard frequency is performed. For example, frequency f
(N + m / M) correlation value E _{n + m / M} in is calculated by the following [Equation 13].

【００７１】〔数式１３〕Ｅ_n+m/M ＝｛（(２／Ｌ)Σｘ(ｋ) sin(２πｆ_n+m/Mｋ／
Ｆ)）²＋（(２／Ｌ)Σｘ(ｋ) cos(２πｆ_n+m/Mｋ／
Ｆ)）²｝^1/2＝｛Ｓ_n+m/M ²＋Ｃ_n+m/M ²｝^1/2 [Equation 13] E _{n + m / M} = {((2 / L) Σx (k) sin (2πf _{n + m / M} k /
F)) ² + ((2 / L) Σx (k) cos (2πf _{n + m / M} k /
F)) ² } ^1/2 = {S _{n + m / M} ² + C _{n + m / M} ² } ^1/2

【００７２】なお、数式が繁雑になるのを避けるため、
この〔数式１３〕および下記の〔数式１４〕内ではｆ
（ｎ）をｆ_n、ｆ（ｎ＋ｍ／Ｍ）をｆ_n+m/M、ｆ（ｎ＋
１）をｆ_n+1と表現している。ここで、標準周波数ｆ
（ｎ）とｆ（ｎ＋１）との間における細かい周波数の調
和関数sin(２πｆ_n+m/Mｋ／Ｆ)を２つの隣接する標準周
波数の調和関数を基に線形近似すると、すなわちsin(２
πｆ_n+m/Mｋ／Ｆ)≒sin(２πｆ_nｋ／Ｆ)＋｛sin(２πｆ
_n+1ｋ／Ｆ)−sin(２πｆ_nｋ／Ｆ)｝ｍ／Ｍとおくと、上
記〔数式１３〕は、以下の〔数式１４〕のように変形さ
れる。In order to avoid complicated formulas,
In this [Formula 13] and the following [Formula 14], f
(N) is f _n , f (n + m / M) is f _{n + m / M} , f (n +
1) is expressed as f _{n + 1} . Here, the standard frequency f
Linearly approximating a fine frequency harmonic function sin (2πf _{n + m / M} k / F) between (n) and f (n + 1) based on two adjacent standard frequency harmonic functions, that is, sin (2
πf _{n + m / M} k / F) ≈sin (2πf _n k / F) + {sin (2πf
_{If n + 1} k / F) -sin (2πf _n k / F)} m / M is set, the above [Formula 13] is transformed into the following [Formula 14].

【００７３】〔数式１４〕Ｅ_n+m/M ＝［｛Ｓ_n＋（Ｓ_n+1−Ｓ_n）ｍ／Ｍ｝²＋｛Ｃ_n
＋（Ｃ_n+1−Ｃ_n）ｍ／Ｍ｝²］^1/2 [Equation 14] E _{n + m / M} = [{S _n + (S _{n + 1} −S _n ) m / M} ² + {C _n
+ (C _{n + 1} −C _n ) m / M} ² ] ^1/2

【００７４】このことは、標準周波数間に等比級数間隔
で周波数を設定した場合には、設定した数分の相関計算
を行わなければならないのに対し、相関計算は標準周波
数に対して行うだけで良く、求めた相関値を利用した加
減算を行えば良いことを示している。相関計算の負荷に
比べると、加減算にかかる負荷はほとんど無視できるほ
ど小さいため、例えば、上述のように標準周波数間に１
２個の周波数を設定して相関計算を行った場合に比べ
て、相関値算出のための総計算負荷は約１／１３にな
る。This means that, when frequencies are set at geometric series intervals between standard frequencies, correlation calculation for the set number must be performed, whereas correlation calculation is performed only for standard frequencies. It means that the addition and subtraction using the calculated correlation value should be performed. Compared with the load of correlation calculation, the load of addition and subtraction is almost negligible.
The total calculation load for calculating the correlation value is about 1/13 as compared with the case where the correlation calculation is performed by setting two frequencies.

【００７５】（4.装置構成）続いて、本発明に係る周波
数解析装置および音響信号符号化装置の装置構成につい
て説明する。図７は、本発明に係るＭＩＤＩ符号に変換
するための音響信号符号化装置の機能ブロック図であ
る。図７において、１は音響信号入力手段、２は周波数
解析装置、３は音素連結手段、４はＭＩＤＩ符号変換手
段である。(4. Device Configuration) Next, the device configurations of the frequency analysis device and the acoustic signal coding device according to the present invention will be described. FIG. 7 is a functional block diagram of an audio signal encoding apparatus for converting to a MIDI code according to the present invention. In FIG. 7, 1 is an acoustic signal input means, 2 is a frequency analysis device, 3 is a phoneme connection means, and 4 is a MIDI code conversion means.

【００７６】音響信号入力手段１は、時系列信号である
音響信号を入力するためのものである。図７に示す装置
を周波数解析装置として用いる場合は、音響信号に限ら
ず時系列信号を入力することができる。周波数解析装置
２は、上記ステップＳ２〜ステップＳ４の処理を実行す
るものであり、時系列信号の周波数解析を行って単位区
間ごとの周波数成分（周波数および相関値、すなわちス
ペクトルデータ）を抽出する機能を有する。The acoustic signal input means 1 is for inputting an acoustic signal which is a time series signal. When the device shown in FIG. 7 is used as a frequency analysis device, not only acoustic signals but also time series signals can be input. The frequency analysis device 2 executes the processing of steps S2 to S4, and has a function of performing frequency analysis of a time-series signal to extract frequency components (frequency and correlation values, that is, spectrum data) for each unit section. Have.

【００７７】音素連結手段３は、音響信号に対して周波
数解析を行った結果得られる単位区間の開始時刻、後続
する単位区間の開始時刻、周波数、強度値の４つの情報
からなる音素データを、隣接する音素データの類似性に
基づいて互いに連結して連結音素データとする処理を行
う。ＭＩＤＩ符号変換手段４は、符号化された符号デー
タをＭＩＤＩ形式に変換するものであり、上述のように
連結音素データの発音開始時刻にノートオン、発音終了
時刻のノートオフイベントを発生させると共に、ノート
オンイベント発生時に、周波数に対応したノートナンバ
ー、強度値に対応したベロシティを設定する機能を有す
る。The phoneme connecting means 3 obtains phoneme data consisting of four pieces of information, which are the start time of a unit section, the start time of the following unit section, the frequency, and the intensity value, which are obtained as a result of frequency analysis of the acoustic signal. Based on the similarity of adjacent phoneme data, the phoneme data is connected to each other to form connected phoneme data. The MIDI code conversion means 4 converts the encoded code data into a MIDI format, and generates a note-on event at the sounding start time and a note-off event at the sounding end time of the concatenated phoneme data as described above. It has the function of setting the note number corresponding to the frequency and the velocity corresponding to the intensity value when a note-on event occurs.

【００７８】ここで、周波数解析装置２の詳細について
説明する。本発明においては、周波数解析装置２として
３つの構成パターンがあるので、それぞれ説明する。ま
ず、第１の構成パターンは図８（ａ）に示すような構成
となる。区間信号抽出手段１１は、上記ステップＳ２の
処理を行う機能を有している。前区間先頭の相関算出手
段１２は、上記ステップＳ３において、目的とする現単
位区間の相関値を算出する際に、その直前の前単位区間
の先頭領域ｄｈと区間信号の相関を算出する機能を有し
ている。後区間後尾の相関算出手段１３は、上記ステッ
プＳ３において、目的とする現単位区間の相関値を算出
する際に、その現単位区間の後尾領域ｄｂと区間信号の
相関を算出する機能を有している。前区間相関値の記憶
手段１４は、前区間の相関値を記憶するための記憶手段
である。相関合算手段１５は、上記ステップＳ３におい
て、目的とする現単位区間の相関値を算出する際に、前
区間相関値の記憶手段１４に記憶された前区間の相関値
を抽出し、この相関値に対して、前区間先頭の相関算出
手段１２で算出した相関値を減算すると共に、後区間後
尾の相関算出手段１３で算出した相関値を加算する処理
を行う機能を有している。スペクトル算出手段１６は、
上記相互相関テーブルを利用してスペクトルデータを出
力する機能を有している。Here, the details of the frequency analysis device 2 will be described. In the present invention, the frequency analysis device 2 has three configuration patterns, which will be described respectively. First, the first configuration pattern has a configuration as shown in FIG. The section signal extraction means 11 has a function of performing the process of step S2. In step S3, the correlation calculating means 12 at the beginning of the preceding section has a function of calculating the correlation between the leading area dh of the preceding preceding unit section and the section signal when calculating the target correlation value of the current unit section. Have The correlation calculating means 13 of the tail of the rear section has a function of calculating the correlation between the tail region db of the current unit section and the section signal when calculating the correlation value of the target current unit section in step S3. ing. The previous section correlation value storage unit 14 is a storage unit for storing the previous section correlation value. In step S3, the correlation summing means 15 extracts the correlation value of the previous section stored in the storage section 14 of the previous section correlation value when calculating the correlation value of the target current unit section, and this correlation value On the other hand, it has a function of subtracting the correlation value calculated by the correlation calculating means 12 at the beginning of the preceding section and adding the correlation value calculated by the correlation calculating means 13 at the end of the succeeding section. The spectrum calculation means 16 is
It has a function of outputting spectrum data by using the cross-correlation table.

【００７９】また、第２の構成パターンは図８（ｂ）に
示すような構成となる。図８（ｂ）において、図８
（ａ）と同一の機能を有するものについては、同一の符
号を付して説明を省略し、第１の構成パターンと異なる
ものについて説明する。基本調和関数との相関算出手段
２１は、ある基本とする標準周波数をｆ（ｎ）としたと
きに、標準周波数ｆ（ｎ）の調和関数と区間信号との相
関を算出する機能を有している。差分調和関数との相関
算出手段２２は、ある基本とする標準周波数をｆ（ｎ）
としたときに、隣接する標準周波数ｆ（ｎ＋１）の調和
関数とｆ（ｎ）の調和関数との差分信号sin(２πｆ_n+1
ｋ／Ｆ)−sin(２πｆ_nｋ／Ｆ)およびcos(２πｆ_n+1ｋ／
Ｆ)−cos(２πｆ_nｋ／Ｆ)と区間信号との相関を算出す
る機能を有している。相関調整手段２３は、上記〔数式
１３〕および〔数式１４〕に従って、標準周波数ｆ
（ｎ）と標準周波数ｆ（ｎ＋１）の間に存在する周波数
に対応する相関値を、標準周波数ｆ（ｎ）の調和関数と
の相関値および標準周波数ｆ（ｎ＋１）の調和関数とｆ
（ｎ）の調和関数との差分信号との相関値を用いて近似
的に算出する機能を有している。The second configuration pattern has a configuration as shown in FIG. 8 (b). In FIG.
For those having the same function as (a), the same reference numerals are given and the description thereof is omitted, and only those different from the first configuration pattern will be described. The correlation calculating means 21 with the basic harmonic function has a function of calculating the correlation between the harmonic function of the standard frequency f (n) and the section signal, where f (n) is a certain standard frequency. There is. The correlation calculating means 22 with the difference harmonic function determines a certain standard frequency as f (n).
, The difference signal sin (2πf _{n + 1} ) between the adjacent harmonic function of the standard frequency f (n + 1) and the adjacent harmonic function of f (n).
k / F) -sin (2πf _n k / F) and cos (2πf _{n + 1} k /
It has a function of calculating the correlation between F) -cos (2πf _n k / F) and the interval signal. The correlation adjusting means 23 uses the standard frequency f according to the above [Formula 13] and [Formula 14]
The correlation value corresponding to the frequency existing between (n) and the standard frequency f (n + 1) is the correlation value with the harmonic function of the standard frequency f (n) and the harmonic function of the standard frequency f (n + 1) and f.
It has a function of approximately calculating using the correlation value between the harmonic function of (n) and the difference signal.

【００８０】また、第３の構成パターンは図９に示すよ
うな構成となる。第３の構成パターンは、第１の構成パ
ターンと第２の構成パターンを組み合わせたものになっ
ている。図９においても、図８（ａ）、図８（ｂ）と同
一の機能を有するものについては、同一の符号を付して
説明を省略し、第１、第２の構成パターンと異なるもの
について説明する。図９において、前区間先頭の相関算
出手段１２ａ、後区間後尾の相関算出手段１３ａ、前区
間相関値の記憶手段１４ａ、相関合算手段１５ａは、図
８（ｂ）に示した基本調和関数との相関算出手段２１の
処理を、図８（ａ）に示した前区間先頭の相関算出手段
１２、後区間後尾の相関算出手段１３、前区間相関値の
記憶手段１４、相関合算手段１５で実行するためのもの
であり、前区間先頭の相関算出手段１２ｂ、後区間後尾
の相関算出手段１３ｂ、前区間相関値の記憶手段１４
ｂ、相関合算手段１５ｂは、図８（ｂ）に示した差分調
和関数との相関算出手段２２の処理を、図８（ａ）に示
した前区間先頭の相関算出手段１２、後区間後尾の相関
算出手段１３、前区間相関値の記憶手段１４、相関合算
手段１５で実行するためのものである。The third configuration pattern has a configuration as shown in FIG. The third configuration pattern is a combination of the first configuration pattern and the second configuration pattern. In FIG. 9 as well, those having the same functions as those in FIGS. 8A and 8B are denoted by the same reference numerals and description thereof is omitted, and those different from the first and second configuration patterns explain. In FIG. 9, the correlation calculating means 12a at the beginning of the preceding section, the correlation calculating means 13a at the tail of the following section, the storing means 14a of the correlation value of the preceding section, and the correlation summing means 15a are the same as those of the basic harmonic function shown in FIG. The processing of the correlation calculation unit 21 is executed by the correlation calculation unit 12 at the beginning of the preceding section, the correlation calculation unit 13 at the end of the succeeding section, the storage unit 14 of the correlation value of the preceding section, and the correlation summing unit 15 shown in FIG. 8A. This is for the purpose of the correlation calculation means 12b at the beginning of the preceding section, the correlation calculation means 13b at the tail of the following section, and the storage section 14 for the correlation value of the preceding section.
b, the correlation summing means 15b performs the processing of the correlation calculation means 22 with the differential harmonic function shown in FIG. 8B by the correlation calculation means 12 at the beginning of the preceding section and the processing at the tail of the succeeding section shown in FIG. 8A. This is to be executed by the correlation calculation unit 13, the previous section correlation value storage unit 14, and the correlation summation unit 15.

【００８１】なお、図７〜図９に示した周波数解析装置
および音響信号符号化装置は、実際には、コンピュータ
等の演算処理装置に専用のソフトウェアを搭載すること
により実現される。具体的には、図５のフローチャート
に示したようなステップを上記手順で実行するためのプ
ログラムをコンピュータに搭載しておく。そして、音響
信号等の時系列信号をＰＣＭ方式等でデジタル化した
後、コンピュータで実現される音響信号符号化装置に取
り込み、ステップＳ２〜ステップＳ４の処理を行った
後、抽出したスペクトルデータを出力する。音響信号符
号化装置においては、さらに、ＭＩＤＩ形式等の符号デ
ータに変換して出力する。出力された符号データは、例
えば、ＭＩＤＩデータの場合、ＭＩＤＩシーケンサ、Ｍ
ＩＤＩ音源を用いて音響信号として再生される。The frequency analysis device and the acoustic signal coding device shown in FIGS. 7 to 9 are actually realized by installing dedicated software in an arithmetic processing device such as a computer. Specifically, a program for executing the steps shown in the flowchart of FIG. 5 in the above procedure is installed in the computer. Then, after time-series signals such as acoustic signals are digitized by the PCM method or the like, they are taken into an acoustic signal encoding device realized by a computer, the processes of steps S2 to S4 are performed, and then the extracted spectrum data is output. To do. The acoustic signal encoding device further converts the encoded data into MIDI format data and outputs the encoded data. If the output code data is MIDI data, for example, a MIDI sequencer, M
It is reproduced as an acoustic signal using an IDI sound source.

【００８２】[0082]

【発明の効果】以上、説明したように本発明によれば、
与えられた時系列信号を時系列のスペクトルデータに変
換する周波数解析装置として、解析を行う基本単位であ
る単位区間を、時系列上において隣接する単位区間が互
いに重複するように設定し、各単位区間の時系列信号を
順次抽出する区間信号抽出手段と、前記区間信号抽出手
段により直前に抽出された前区間信号と新たに抽出され
た現区間信号との間で合致しない前区間信号の先頭部に
位置する信号に対して、所定の調和関数との相関を算出
する前区間先頭の相関算出手段と、前区間信号と新たに
抽出された現区間信号との間で合致しない現区間信号の
後尾部に位置する信号に対して、所定の調和関数との相
関を算出する現区間後尾の相関算出手段と、前区間全体
の相関値に対して、前記前区間先頭の相関算出手段によ
り算出した値を減算し、前記現区間後尾の相関算出手段
により算出した値を加算することにより現区間全体の相
関値を算出する相関合算手段と、前記相関合算手段で得
られた現区間全体の相関値を保持するための前区間相関
値の記憶手段と、前記相関合算手段で得られた現区間全
体の相関値に基づいて所定の変換を行い、各単位区間に
対応するスペクトルデータを算出するスペクトル算出手
段により構成するようにしたので、単位区間の重複領域
における相関計算の重複をなくし、相関計算にかかる総
所要時間を削減することが可能となるという効果を奏す
る。As described above, according to the present invention,
As a frequency analysis device for converting a given time-series signal into time-series spectrum data, a unit section that is the basic unit for analysis is set so that adjacent unit sections on the time series overlap each other, and each unit Section signal extracting means for sequentially extracting time-series signals of a section, and a leading portion of the previous section signal that does not match between the previous section signal extracted immediately before by the section signal extracting section and the newly extracted current section signal For the signal located at, the correlation calculation means at the beginning of the previous section for calculating the correlation with the predetermined harmonic function, and the current section signal that does not match between the previous section signal and the newly extracted current section signal For the signal located at the tail, the correlation calculating means for calculating the correlation with a predetermined harmonic function, and the correlation value for the entire previous interval, the value calculated by the correlation calculating means for the beginning of the previous interval Reduced Then, the correlation summing means for calculating the correlation value of the entire current section by adding the values calculated by the correlation calculation means at the end of the current section, and the correlation value of the whole current section obtained by the correlation summing means are held. And a spectrum calculating means for performing a predetermined conversion based on the correlation value of the entire current section obtained by the correlation summing means, and calculating spectrum data corresponding to each unit section. Thus, there is an effect that it is possible to eliminate the overlap of the correlation calculation in the overlapping region of the unit section and reduce the total time required for the correlation calculation.

[Brief description of drawings]

【図１】本発明に係る周波数解析装置および音響信号符
号化装置の基本原理を示す図である。FIG. 1 is a diagram showing a basic principle of a frequency analysis device and an acoustic signal encoding device according to the present invention.

【図２】本発明で利用される周期関数の一例を示す図で
ある。FIG. 2 is a diagram showing an example of a periodic function used in the present invention.

【図３】解析対象となる信号と周期信号との相関計算の
手法を示す図である。FIG. 3 is a diagram showing a method of calculating a correlation between a signal to be analyzed and a periodic signal.

【図４】相関補正テーブルを利用した効果を他の手法と
比較した場合の概念図である。FIG. 4 is a conceptual diagram when the effect of using the correlation correction table is compared with other methods.

【図５】本発明に係る周波数解析装置および音響信号符
号化装置の処理動作を示すフローチャートである。FIG. 5 is a flowchart showing processing operations of the frequency analysis device and the acoustic signal encoding device according to the present invention.

【図６】時系列信号に対して隣接する単位区間を互いに
重複して設定した状態を示す図である。FIG. 6 is a diagram showing a state in which adjacent unit sections are set to overlap each other with respect to a time-series signal.

【図７】本発明に係る音響信号符号化装置の装置構成を
示す図である。FIG. 7 is a diagram showing a device configuration of an audio signal encoding device according to the present invention.

【図８】本発明に係る周波数解析装置の第１、第２の構
成パターンを示す図である。FIG. 8 is a diagram showing first and second configuration patterns of the frequency analysis device according to the present invention.

【図９】本発明に係る周波数解析装置の第３の構成パタ
ーンを示す図である。FIG. 9 is a diagram showing a third configuration pattern of the frequency analysis device according to the present invention.

[Explanation of symbols]

ｄ，ｄ１〜ｄ５，ｄ_i，ｄ_i+1・・・単位区間Ｄ・・・更新区間Ｌ・・・単位区間長ｎ・・・ノートナンバーＥｎ（ｉ），Ｅｎ（ｉ＋１），Ｅ_n+m/M・・・相関値Ｘ，Ｘ（ｋ）・・・区間信号d, d1 to d5, d _i , d _{i + 1} ... Unit section D ... Update section L ... Unit section length n ... Note number En (i), En (i + 1), En _{+ m / M:} Correlation value X, X (k): Section signal

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩテーマコート゛(参考）Ｈ０３Ｍ 7/30 Ｇ１０Ｌ 7/02 ＡＦターム(参考） 5D045 AC10 DA20 5D082 BB01 5D108 BA39 5D378 MM41 5J064 AA02 AA03 BA16 BC01 BC27 BD02 BD03 ─────────────────────────────────────────────────── ─── Continuation of front page (51) Int.Cl. ⁷ Identification code FI theme code (reference) H03M 7/30 G10L 7/02 A F term (reference) 5D045 AC10 DA20 5D082 BB01 5D108 BA39 5D378 MM41 5J064 AA02 AA03 BA16 BC01 BC27 BD02 BD03

Claims

[Claims]

1. A frequency analysis device for converting a given time-series signal into time-series spectrum data, wherein unit sections that are basic units for performing analysis overlap adjacent unit sections on the time series. Thus, the section signal extracting means for sequentially extracting the time-series signal of each unit section, and the previous section signal extracted immediately before by the section signal extracting means and the newly extracted current section signal are matched. The correlation calculation means at the beginning of the previous section for calculating the correlation with the predetermined harmonic function for the signal located at the beginning of the previous section signal, and the previous section signal and the newly extracted current section signal For the signal located at the tail of the current section signal that does not match, the correlation calculating means of the tail of the current section for calculating the correlation with a predetermined harmonic function, and the correlation value of the entire previous section, Correlation calculation means Correlation summing means for calculating the correlation value of the entire current section by adding the values calculated by the correlation calculation means at the end of the current section, and the entire current section obtained by the correlation summing means A storage unit for storing the correlation value of the previous section for holding the correlation value of, and a predetermined conversion is performed based on the correlation value of the entire current section obtained by the correlation summing unit, and spectrum data corresponding to each unit section is calculated. And a spectrum calculating means for performing the frequency analysis.

2. A frequency analysis device for converting a given time-series signal into time-series spectrum data, wherein unit sections which are basic units for performing analysis overlap adjacent unit sections on the time series. Set as such, the interval signal extraction means for sequentially extracting the time-series signal of each unit interval, and the basic harmonic function for calculating the correlation between the interval signal extracted by the interval signal extraction means and a predetermined basic harmonic function Correlation calculation means, the section signal, the correlation calculation means of the difference harmonic function to calculate the correlation with the difference function of the other harmonic function frequency adjacent to the predetermined basic harmonic function, of the basic harmonic function The correlation value calculated by the correlation calculation unit is multiplied by the correlation value obtained by the correlation calculation unit with the difference harmonic function and added, and the weight is changed at regular intervals to obtain one value. Correlation adjusting means for calculating a plurality of correlation values in which the frequency slightly changes with respect to the frequency of the fundamental harmonic function, and based on the correlation value obtained by the correlation adjusting means, a predetermined conversion is performed, and each unit interval And a spectrum calculation means for calculating spectrum data corresponding to the frequency analysis device.

3. The correlation calculating means with the basic harmonic function and the correlation calculating means with the differential harmonic function are respectively the correlation calculating means at the beginning of the preceding section, the correlation calculating means at the tail of the current section, and the correlation. The frequency analysis device according to claim 2, wherein the frequency analysis device is configured by a summing unit and a storage unit of the previous section correlation value.

4. The harmonic function, the fundamental harmonic function and the differential harmonic function are composed of a sine wave function and a cosine wave function, and each of the correlation calculating means is composed of a two-dimensional vector composed of a sine wave component and a cosine wave component. A correlation value is output, the correlation summing means and the correlation adjusting means perform vector addition and subtraction on each of the correlation values to output a correlation value composed of a two-dimensional vector, and the spectrum calculation means is composed of a two-dimensional vector. 2. A process of converting a correlation value according to the above into spectral data composed of a scalar value is performed.
Alternatively, the frequency analysis device according to claim 2.

5. The spectrum calculating means is provided with spectrum data for each harmonic function as a cross-correlation table, and the calculated spectrum data is corrected based on the cross-correlation table. The frequency analysis device according to claim 1 or 2.

6. An acoustic signal input means for inputting a given acoustic signal to obtain a time-series signal, a frequency analysis device having the configuration according to any one of claims 1 to 5, and A phoneme linking unit that creates phoneme data by connecting the spectrum data of the series in time series, and an MI that converts the obtained phoneme data into a MIDI format code.
An audio signal encoding apparatus, comprising: DI code conversion means.