JP2003084799A

JP2003084799A - Frequency analysis method and sound signal encoding method

Info

Publication number: JP2003084799A
Application number: JP2001281034A
Authority: JP
Inventors: Toshio Motegi; 敏雄茂出木
Original assignee: Dai Nippon Printing Co Ltd
Current assignee: Dai Nippon Printing Co Ltd
Priority date: 2001-09-17
Filing date: 2001-09-17
Publication date: 2003-03-19

Abstract

PROBLEM TO BE SOLVED: To provide a frequency analysis method and a sound signal encoding method which improve the time resolution without degrading the frequency resolution neither increasing the operation load. SOLUTION: With respect to a method which sets unit sections to a time series signal and analyzes each unit section to separate a signal component corresponding to a prescribed standard frequency, the unit sections are so set that they may overlap by a prescribed length, and a section signal itself in an objective unit section dt is not analyzed but the section signal corrected by the final residual signal in the just preceding unit section dt-τ is analyzed by the generalized harmonic analysis method in the case of analysis of the objective unit section dt as the analysis object.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、放送メディア（ラジ
オ、テレビ）、通信メディア（ＣＳ映像・音声配信、イ
ンターネット音楽配信、通信カラオケ）、パッケージメ
ディア（ＣＤ、ＭＤ、カセット、ビデオ、ＬＤ、ＣＤ−
ＲＯＭ、ゲームカセット、携帯音楽プレーヤ向け固体メ
モリ媒体）などで提供する各種オーディオコンテンツの
制作、並びに、音楽演奏録音信号から楽譜出版、通信カ
ラオケ配信用ＭＩＤＩデータ、演奏ガイド機能付き電子
楽器向け自動演奏データ、携帯電話・ＰＨＳ・ポケベル
などの着信メロディデータを自動的に作成する自動採譜
技術に関する。The present invention relates to broadcast media (radio, television), communication media (CS video / audio distribution, Internet music distribution, communication karaoke), package media (CD, MD, cassette, video, LD, CD). −
Production of various audio contents provided in ROM, game cassettes, solid-state memory media for portable music players, etc., as well as music performance recording signals, musical score publishing, MIDI data for communication karaoke distribution, and automatic performance data for electronic musical instruments with performance guide function. , Automatic music transcription technology for automatically creating ringing melody data for mobile phones, PHS, pagers, etc.

【０００２】[0002]

【従来の技術】音響信号に代表される時系列信号には、
その構成要素として複数の周期信号が含まれている。こ
のため、与えられた時系列信号にどのような周期信号が
含まれているかを解析する手法は、古くから知られてい
る。例えば、フーリエ解析は、与えられた時系列信号に
含まれる周波数成分を解析するための方法として広く利
用されている。2. Description of the Related Art A time series signal represented by an acoustic signal is
A plurality of periodic signals are included as its constituent elements. Therefore, a method of analyzing what kind of periodic signal is included in a given time series signal has been known for a long time. For example, Fourier analysis is widely used as a method for analyzing frequency components included in a given time series signal.

【０００３】このような時系列信号の解析方法を利用す
れば、音響信号を符号化することも可能である。コンピ
ュータの普及により、原音となるアナログ音響信号を所
定のサンプリング周波数でサンプリングし、各サンプリ
ング時の信号強度を量子化してデジタルデータとして取
り込むことが容易にできるようになってきており、こう
して取り込んだデジタルデータに対してフーリエ解析な
どの手法を適用し、原音信号に含まれていた周波数成分
を抽出すれば、各周波数成分を示す符号によって原音信
号の符号化が可能になる。By using such a time-series signal analysis method, it is possible to encode an acoustic signal. With the spread of computers, it has become easy to sample the analog sound signal that is the original sound at a predetermined sampling frequency, quantize the signal strength at each sampling, and capture it as digital data. If a method such as Fourier analysis is applied to the data and the frequency components included in the original sound signal are extracted, the original sound signal can be encoded by the code indicating each frequency component.

【０００４】また、電子楽器による楽器音を符号化しよ
うという発想から生まれたＭＩＤＩ（Musical Instrume
nt Digital Interface）規格も、パーソナルコンピュー
タの普及とともに盛んに利用されるようになってきてい
る。このＭＩＤＩ規格による符号データ（以下、ＭＩＤ
Ｉデータという）は、基本的には、楽器のどの鍵盤キー
を、どの程度の強さで弾いたか、という楽器演奏の操作
を記述したデータであり、このＭＩＤＩデータ自身に
は、実際の音の波形は含まれていない。そのため、実際
の音を再生する場合には、楽器音の波形を記憶したＭＩ
ＤＩ音源が別途必要になるが、その符号化効率の高さが
注目を集めており、ＭＩＤＩ規格による符号化および復
号化の技術は、現在、パーソナルコンピュータを用いて
楽器演奏、楽器練習、作曲などを行うソフトウェアに広
く採り入れられている。MIDI (Musical Instrume) was born from the idea of encoding musical instrument sounds by electronic musical instruments.
The nt Digital Interface) standard has also been actively used with the spread of personal computers. Code data according to this MIDI standard (hereinafter referred to as MID
Basically, the I data) is data that describes the operation of the musical instrument playing, such as which keyboard key of the musical instrument was played and with what strength. The MIDI data itself contains the actual sound. Waveform not included. Therefore, when reproducing the actual sound, the MI that stores the waveform of the instrument sound is stored.
Although a DI sound source is required separately, its high coding efficiency has been attracting attention, and the MIDI coding and decoding technology is currently used for musical instrument performance, musical instrument practice, composition, etc. using a personal computer. It is widely adopted in software that does.

【０００５】そこで、音響信号に代表される時系列信号
に対して、所定の手法で解析を行うことにより、その構
成要素となる周期信号を抽出し、抽出した周期信号をＭ
ＩＤＩデータを用いて符号化しようとする提案がなされ
ている。例えば、特開平１０−２４７０９９号公報、特
開平１１−７３１９９号公報、特開平１１−７３２００
号公報、特開平１１−９５７５３号公報、特開２０００
−９９００９号公報、特開２０００−９９０９２号公
報、特開２０００−９９０９３号公報、特開２０００−
２６１３２２号公報、特開２００１−５４５０号公報、
特開２００１−１４８６３３号公報には、任意の時系列
信号について、構成要素となる周波数を解析し、その解
析結果からＭＩＤＩデータを作成することができる種々
の方法が提案されている。Therefore, a time-series signal typified by an acoustic signal is analyzed by a predetermined method to extract a periodic signal which is a constituent element thereof, and the extracted periodic signal is M
Proposals have been made to encode using IDI data. For example, JP-A-10-247099, JP-A-11-73199, and JP-A-11-73200.
JP, JP-A-11-95753, JP, 2000
-99009, JP 2000-99092 A, JP 2000-99093 A, JP 2000-
261322, JP 2001-5450 A,
Japanese Unexamined Patent Application Publication No. 2001-148633 proposes various methods capable of analyzing a frequency as a constituent element of an arbitrary time series signal and creating MIDI data from the analysis result.

【０００６】[0006]

【発明が解決しようとする課題】上記公報に記載された
発明により、時系列信号に対する周波数の解析精度およ
び音響信号の符号化精度は大きく向上し、信号波形を直
接計測するゼロクロス法を利用した場合よりも高い精度
が得られる。しかしながら、一般化調和解析の手法を用
いる上記発明では、解析単位である単位区間を設定し、
隣接する単位区間とオーバーラップさせるため、ゼロク
ロス法に比べ時間分解能に問題がある。一般化調和解析
を用いた手法において、時間分解能を改善させる方法と
して、解析の単位である単位区間の長さを短くする方
法、単位区間の送り長を短くする方法、窓関数を用いる
方法があるが、いずれも限界がある。According to the invention described in the above publication, the frequency analysis accuracy for time series signals and the audio signal encoding accuracy are greatly improved, and the case where the zero cross method for directly measuring the signal waveform is used. Higher accuracy can be obtained. However, in the above invention using the method of generalized harmonic analysis, a unit section that is an analysis unit is set,
Since it overlaps with the adjacent unit section, it has a problem in time resolution as compared with the zero-cross method. In the method using generalized harmonic analysis, as a method of improving the time resolution, there are a method of shortening the length of a unit section which is a unit of analysis, a method of shortening the feed length of the unit section, and a method of using a window function. However, there are limits.

【０００７】単位区間の長さを短くする方法を用いた場
合、時間分解能は改善するが、符号化精度として重要な
周波数分解能が低下するという問題がある。また、単位
区間より周期が長い周波数成分は検出できなくなり、検
出可能な周波数範囲も狭くなる。単位区間の送り長を短
くする方法を用いた場合、周波数分解能は変化しない
が、単位区間同士のオーバーラップが大きくなるので、
時間分解能の向上が小さく、演算量も大きくなる。窓関
数を用いる方法を用いた場合、窓形状を工夫することに
より若干時間分解能を改善できるが、窓形状により周波
数分解能の低下を招く場合もあり、顕著な時間分解能の
改善は期待できない。When the method of shortening the length of the unit section is used, the time resolution is improved, but there is a problem that the frequency resolution, which is important as coding accuracy, is lowered. In addition, the frequency component whose period is longer than the unit section cannot be detected, and the detectable frequency range becomes narrow. When the method of shortening the feed length of the unit section is used, the frequency resolution does not change, but the overlap between unit sections becomes large, so
The improvement in time resolution is small and the amount of calculation is large. When the method using the window function is used, the time resolution can be slightly improved by devising the window shape, but the frequency resolution may be lowered due to the window shape, and a remarkable improvement in time resolution cannot be expected.

【０００８】上記のような点に鑑み、本発明は、周波数
分解能を低下させず、さらに演算負荷の増大を招かずに
時間分解能を向上させることが可能な周波数解析方法お
よび音響信号の符号化方法を提供することを課題とす
る。In view of the above points, the present invention makes it possible to improve the frequency resolution without lowering the frequency resolution and further increasing the time resolution without increasing the calculation load, and the audio signal encoding method. The challenge is to provide.

【０００９】[0009]

【課題を解決するための手段】上記課題を解決するた
め、本発明では、時系列信号から所定の標準周波数に対
応した信号成分を分離するための周波数解析方法とし
て、前記時系列信号に所定の長さの単位区間を、各単位
区間が互いに所定の長さだけ重複するように、設定する
段階と、解析対象となる対象単位区間において、当該対
象単位区間と所定の長さだけ重複する直前単位区間との
重複区間における信号を、直前単位区間において得られ
る最終残差信号に置き換えた修正区間信号を作成する区
間信号修正段階と、前記修正区間信号に対して、一般化
調和解析を利用することにより、前記対象単位区間にお
ける周波数および対応する強度の組を算出するスペクト
ル算出段階を有し、当該対象単位区間を新たな直前単位
区間とし、一般化調和解析を利用して前記修正区間信号
から複数の含有信号を減じることにより最終的に得られ
る信号を最終残差信号として、前記区間信号修正段階お
よび前記スペクトル算出段階の処理を、設定された全単
位区間について行うことにより、全区間に渡って周波数
および強度の組を算出するようにしたことを特徴とす
る。In order to solve the above problems, the present invention provides a frequency analysis method for separating a signal component corresponding to a predetermined standard frequency from a time series signal. A step of setting unit sections of length so that each unit section overlaps with each other by a predetermined length, and a unit immediately before the target unit section to be analyzed overlaps with the target unit section by a predetermined length. A section signal correction step of creating a modified section signal by replacing a signal in an overlapping section with a section with a final residual signal obtained in the immediately preceding unit section, and using a generalized harmonic analysis for the modified section signal According to the generalized harmony, there is a spectrum calculation step for calculating a set of frequencies and corresponding intensities in the target unit section, and the target unit section is a new immediately preceding unit section. Using the analysis to subtract the plurality of contained signals from the corrected section signal as the final residual signal, the section signal correction step and the spectrum calculation step are performed for all set units. It is characterized in that a set of frequencies and intensities is calculated over the entire section by performing the processing for each section.

【００１０】本発明によれば、解析対象とする単位区間
の解析にあたって、直前の単位区間と重複する部分の信
号については、直前の単位区間の区間信号から含有信号
成分を減じた最終残差信号に置きかえることにより区間
信号を修正し、修正された区間信号に対して解析を行な
うようにしたので、重複する区間の信号成分による影響
を抑えた解析を行なうことが可能となる。According to the present invention, in the analysis of the unit section to be analyzed, regarding the signal of the portion overlapping with the immediately preceding unit section, the final residual signal obtained by subtracting the contained signal component from the section signal of the immediately preceding unit section. Since the section signal is corrected by replacing the section signal with, the analysis is performed on the corrected section signal. Therefore, it is possible to perform the analysis while suppressing the influence of the signal component of the overlapping section.

【００１１】[0011]

【発明の実施の形態】以下、本発明の実施形態について
図面を参照して詳細に説明する。BEST MODE FOR CARRYING OUT THE INVENTION Embodiments of the present invention will be described below in detail with reference to the drawings.

【００１２】（1.基本原理）はじめに、本発明に係る周
波数解析方法および音響信号符号化方法の基本原理を述
べておく。この基本原理は、前掲の各公報に開示されて
いるので、ここではその概要のみを簡単に述べることに
する。(1. Basic Principle) First, the basic principle of the frequency analysis method and the acoustic signal coding method according to the present invention will be described. Since this basic principle is disclosed in each of the above-mentioned publications, only its outline will be briefly described here.

【００１３】図１（ａ）に示すように、時系列信号とし
てアナログ音響信号が与えられたものとする。図１の例
では、横軸に時間ｔ、縦軸に振幅（強度）をとって、こ
の音響信号を示している。ここでは、まずこのアナログ
音響信号を、デジタルの音響データとして取り込む処理
を行う。これは、従来の一般的なＰＣＭの手法を用い、
所定のサンプリング周波数でこのアナログ音響信号をサ
ンプリングし、振幅を所定の量子化ビット数を用いてデ
ジタルデータに変換する処理を行えば良い。ここでは、
説明の便宜上、ＰＣＭの手法でデジタル化した音響デー
タの波形も図１（ａ）のアナログ音響信号と同一の波形
で示すことにする。As shown in FIG. 1A, it is assumed that an analog acoustic signal is given as a time series signal. In the example of FIG. 1, the horizontal axis represents time t, and the vertical axis represents amplitude (intensity) to show this acoustic signal. Here, first, a process of taking in the analog acoustic signal as digital acoustic data is performed. This uses the conventional general PCM method,
The analog acoustic signal may be sampled at a predetermined sampling frequency, and the amplitude may be converted into digital data by using a predetermined number of quantization bits. here,
For convenience of explanation, the waveform of the acoustic data digitized by the PCM method is also shown as the same waveform as the analog acoustic signal of FIG.

【００１４】続いて、この解析対象となる音響信号の時
間軸上に、複数の単位区間を設定する。図１（ａ）に示
す例では、時間軸ｔ上に等間隔に６つの時刻ｔ１〜ｔ６
が定義され、これら各時刻を始点および終点とする５つ
の単位区間ｄ１〜ｄ５が設定されている。図１の例で
は、全て同一の区間長をもった単位区間が時間軸上で重
複せずに設定されているが、隣接する単位区間が時間軸
上で部分的に重なり合うような区間設定を行ってもかま
わない。特に、後述する本発明の具体的処理において
は、各単位区間が重なり合うように設定している。Then, a plurality of unit sections are set on the time axis of the acoustic signal to be analyzed. In the example shown in FIG. 1A, six times t1 to t6 are equally spaced on the time axis t.
Is defined, and five unit sections d1 to d5 whose start point and end point are the respective time points are set. In the example of FIG. 1, all unit sections having the same section length are set without overlapping on the time axis, but the section setting is performed so that adjacent unit sections partially overlap on the time axis. It doesn't matter. Particularly, in the specific processing of the present invention described later, the unit sections are set to overlap each other.

【００１５】こうして単位区間が設定されたら、各単位
区間ごとの音響信号（以下、区間信号と呼ぶことにす
る）について、それぞれ代表周波数を選出する。各区間
信号には、通常、様々な周波数成分が含まれているが、
例えば、その中で成分の強度割合の大きな周波数成分を
代表周波数として選出すれば良い。ここで、代表周波数
とはいわゆる基本周波数が一般的であるが、音声のフォ
ルマント周波数などの倍音周波数や、ノイズ音源のピー
ク周波数も代表周波数として扱うことがある。代表周波
数は１つだけ選出しても良いが、音響信号によっては複
数の代表周波数を選出した方が、より精度の高い符号化
が可能になる。図１（ｂ）には、個々の単位区間ごとに
それぞれ３つの代表周波数を選出し、１つの代表周波数
を１つの代表符号（図では便宜上、音符として示してあ
る）として符号化した例が示されている。ここでは、代
表符号（音符）を収容するために３つのトラックＴ１，
Ｔ２，Ｔ３が設けられているが、これは個々の単位区間
ごとに選出された３つずつの代表符号を、それぞれ異な
るトラックに収容するためである。When the unit section is set in this way, a representative frequency is selected for each acoustic signal (hereinafter referred to as section signal) for each unit section. Each section signal usually contains various frequency components,
For example, a frequency component having a large intensity ratio of the components may be selected as the representative frequency. Here, the representative frequency is generally a so-called fundamental frequency, but a harmonic frequency such as a formant frequency of voice or a peak frequency of a noise sound source may be treated as a representative frequency. Although only one representative frequency may be selected, more accurate encoding becomes possible if a plurality of representative frequencies are selected depending on the acoustic signal. FIG. 1B shows an example in which three representative frequencies are selected for each unit section and one representative frequency is encoded as one representative code (in the figure, it is shown as a note for convenience). Has been done. Here, three tracks T1 for accommodating a representative code (note) are provided.
T2 and T3 are provided so that the three representative codes selected for each unit section are accommodated in different tracks.

【００１６】例えば、単位区間ｄ１について選出された
代表符号ｎ（ｄ１，１），ｎ（ｄ１，２），ｎ（ｄ１，
３）は、それぞれトラックＴ１，Ｔ２，Ｔ３に収容され
ている。ここで、各符号ｎ（ｄ１，１），ｎ（ｄ１，
２），ｎ（ｄ１，３）は、ＭＩＤＩ符号におけるノート
ナンバーを示す符号である。ＭＩＤＩ符号におけるノー
トナンバーは、０〜１２７までの１２８通りの値をと
り、それぞれピアノの鍵盤の１つのキーを示すことにな
る。具体的には、例えば、代表周波数として４４０Ｈｚ
が選出された場合、この周波数はノートナンバーｎ＝６
９（ピアノの鍵盤中央の「ラ音（Ａ３音）」に対応）に
相当するので、代表符号としては、ｎ＝６９が選出され
ることになる。もっとも、図１（ｂ）は、上述の方法に
よって得られる代表符号を音符の形式で示した概念図で
あり、実際には、各音符にはそれぞれ強度に関するデー
タも付加されている。例えば、トラックＴ１には、ノー
トナンバーｎ（ｄ１，１），ｎ（ｄ２，１）・・・なる
音高を示すデータとともに、ｅ（ｄ１，１），ｅ（ｄ
２，１）・・・なる強度を示すデータが収容されること
になる。この強度を示すデータは、各代表周波数の成分
が、元の区間信号にどの程度の度合いで含まれていたか
によって決定される。具体的には、各代表周波数をもっ
た周期関数の区間信号に対する相関値に基づいて強度を
示すデータが決定されることになる。また、図１（ｂ）
に示す概念図では、音符の横方向の位置によって、個々
の単位区間の時間軸上での位置が示されているが、実際
には、この時間軸上での位置を正確に数値として示すデ
ータが各音符に付加されていることになる。For example, the representative codes n (d1,1), n (d1,2), n (d1, selected for the unit section d1.
3) are housed in the tracks T1, T2, T3, respectively. Here, each code n (d1,1), n (d1,
2) and n (d1,3) are codes indicating note numbers in the MIDI code. The note number in the MIDI code takes 128 values from 0 to 127, and each indicates one key on the keyboard of the piano. Specifically, for example, the representative frequency is 440 Hz
Is selected, the frequency is note number n = 6
Since it corresponds to 9 (corresponding to "Ra sound (A3 sound)" at the center of the keyboard of the piano), n = 69 is selected as the representative code. However, FIG. 1B is a conceptual diagram showing the representative code obtained by the above-described method in the form of a musical note, and in fact, each musical note is also provided with data relating to its strength. For example, the track T1 includes data indicating pitches of note numbers n (d1,1), n (d2,1), ..., E (d1,1), e (d
2, 1) ... The data indicating the intensity will be stored. The data indicating this intensity is determined by the degree to which the component of each representative frequency is included in the original section signal. Specifically, the data indicating the intensity is determined based on the correlation value of the section signal of the periodic function having each representative frequency. In addition, FIG.
In the conceptual diagram shown in, the position of each note on the time axis is shown by the horizontal position of the note. Actually, however, data indicating the position on this time axis as a numerical value is shown exactly. Will be added to each note.

【００１７】音響信号を符号化する形式としては、必ず
しもＭＩＤＩ形式を採用する必要はないが、この種の符
号化形式としてはＭＩＤＩ形式が最も普及しているた
め、実用上はＭＩＤＩ形式の符号データを用いるのが好
ましい。ＭＩＤＩ形式では、「ノートオン」データもし
くは「ノートオフ」データが、「デルタタイム」データ
を介在させながら存在する。「ノートオン」データは、
特定のノートナンバーＮとベロシティーＶを指定して特
定の音の演奏開始を指示するデータであり、「ノートオ
フ」データは、特定のノートナンバーＮとベロシティー
Ｖを指定して特定の音の演奏終了を指示するデータであ
る。また、「デルタタイム」データは、所定の時間間隔
を示すデータである。ベロシティーＶは、例えば、ピア
ノの鍵盤などを押し下げる速度（ノートオン時のベロシ
ティー）および鍵盤から指を離す速度（ノートオフ時の
ベロシティー）を示すパラメータであり、特定の音の演
奏開始操作もしくは演奏終了操作の強さを示すことにな
る。The MIDI format does not necessarily have to be adopted as the format for encoding the audio signal, but since the MIDI format is the most popular as this type of encoding format, the MIDI format code data is practically used. Is preferably used. In the MIDI format, “note on” data or “note off” data exists with “delta time” data interposed. The "Note On" data is
The "note-off" data is data for instructing the start of playing a specific sound by designating a specific note number N and velocity V. The "note-off" data is a data of a specific note designated by a specific note number N and velocity V. This is data for instructing the end of performance. The "delta time" data is data indicating a predetermined time interval. Velocity V is a parameter indicating, for example, the speed at which the piano keyboard is pushed down (velocity at note-on) and the speed at which the finger is released from the keyboard (velocity at note-off), and operation to start playing a specific sound. Alternatively, it indicates the strength of the performance ending operation.

【００１８】前述の方法では、第ｉ番目の単位区間ｄｉ
について、代表符号としてＪ個のノートナンバーｎ（ｄ
ｉ，１），ｎ（ｄｉ，２），・・・，ｎ（ｄｉ，Ｊ）が
得られ、このそれぞれについて強度ｅ（ｄｉ，１），ｅ
（ｄｉ，２），・・・，ｅ（ｄｉ，Ｊ）が得られる。そ
こで、次のような手法により、ＭＩＤＩ形式の符号デー
タを作成することができる。まず、「ノートオン」デー
タもしくは「ノートオフ」データの中で記述するノート
ナンバーＮとしては、得られたノートナンバーｎ（ｄ
ｉ，１），ｎ（ｄｉ，２），・・・，ｎ（ｄｉ，Ｊ）を
そのまま用いれば良い。一方、「ノートオン」データも
しくは「ノートオフ」データの中で記述するベロシティ
ーＶとしては、得られた強度ｅ（ｄｉ，１），ｅ（ｄ
ｉ，２），・・・，ｅ（ｄｉ，Ｊ）を所定の方法で規格
化した値を用いれば良い。また、「デルタタイム」デー
タは、各単位区間の長さに応じて設定すれば良い。In the above method, the i-th unit section di
About J note numbers n (d
i, 1), n (di, 2), ..., N (di, J) are obtained for each of these intensities e (di, 1), e
(Di, 2), ..., E (di, J) are obtained. Therefore, the code data in the MIDI format can be created by the following method. First, as the note number N described in the “note on” data or the “note off” data, the obtained note number n (d
i, 1), n (di, 2), ..., N (di, J) may be used as they are. On the other hand, as the velocity V described in the “note-on” data or the “note-off” data, the obtained intensities e (di, 1), e (d
i, 2), ..., E (di, J) may be standardized by a predetermined method. The “delta time” data may be set according to the length of each unit section.

【００１９】（2.周期関数との相関を求める具体的な方
法）上述した基本原理の基づく方法では、区間信号に対
して、１つまたは複数の代表周波数が選出され、この代
表周波数をもった周期信号によって、当該区間信号が表
現されることになる。ここで、選出される代表周波数
は、文字どおり、当該単位区間内の信号成分を代表する
周波数である。この代表周波数を選出する具体的な方法
には、後述するように、短時間フーリエ変換を利用する
方法と、一般化調和解析の手法を利用する方法とがあ
る。いずれの方法も、基本的な考え方は同じであり、あ
らかじめ周波数の異なる複数の周期関数を用意してお
き、これら複数の周期関数の中から、当該単位区間内の
区間信号に対する相関が高い周期関数を見つけ出し、こ
の相関の高い周期関数の周波数を代表周波数として選出
する、という手法を採ることになる。すなわち、代表周
波数を選出する際には、あらかじめ用意された複数の周
期関数と、単位区間内の区間信号との相関を求める演算
を行うことになる。そこで、ここでは、周期関数との相
関を求める具体的な方法を述べておく。(2. Specific Method for Obtaining Correlation with Periodic Function) In the method based on the above-mentioned basic principle, one or a plurality of representative frequencies are selected for the section signal and have the representative frequency. The section signal is represented by the periodic signal. Here, the selected representative frequency is literally a frequency representing the signal component in the unit section. As a specific method of selecting the representative frequency, there are a method using a short-time Fourier transform and a method using a generalized harmonic analysis method, as described later. In either method, the basic idea is the same, and a plurality of periodic functions with different frequencies are prepared in advance, and a periodic function having a high correlation with the section signal in the unit section is selected from among the plurality of periodic functions. Is found and the frequency of this highly correlated periodic function is selected as the representative frequency. That is, when the representative frequency is selected, the calculation for obtaining the correlation between the plurality of periodic functions prepared in advance and the section signal in the unit section is performed. Therefore, here, a specific method for obtaining the correlation with the periodic function will be described.

【００２０】複数の周期関数として、図２に示すような
三角関数が用意されているものとする。これらの三角関
数は、同一周波数をもった正弦関数と余弦関数との対か
ら構成されており、１２８通りの標準周波数ｆ（０）〜
ｆ（１２７）のそれぞれについて、正弦関数および余弦
関数の対が定義されていることになる。ここでは、同一
の周波数をもった正弦関数および余弦関数からなる一対
の関数を、当該周波数についての周期関数として定義す
ることにする。すなわち、ある特定の周波数についての
周期関数は、一対の正弦関数および余弦関数によって構
成されることになる。このように、一対の正弦関数と余
弦関数とにより周期関数を定義するのは、信号に対する
周期関数の相関値を求める際に、相関値が位相の影響を
受ける事を考慮するためである。なお、図２に示す各三
角関数内の変数Ｆおよびｋは、区間信号Ｘについてのサ
ンプリング周波数Ｆおよびサンプル番号ｋに相当する変
数である。例えば、周波数ｆ（０）についての正弦波
は、ｓｉｎ（２πｆ（０）ｋ／Ｆ）で示され、任意のサ
ンプル番号ｋを与えると、区間信号を構成する第ｋ番目
のサンプルと同一時間位置における周期関数の振幅値が
得られる。It is assumed that a trigonometric function as shown in FIG. 2 is prepared as a plurality of periodic functions. These trigonometric functions are composed of a pair of a sine function and a cosine function having the same frequency, and 128 standard frequencies f (0) to
For each of f (127), a pair of sine and cosine functions will be defined. Here, a pair of functions including a sine function and a cosine function having the same frequency will be defined as a periodic function for the frequency. That is, the periodic function for a specific frequency is composed of a pair of sine function and cosine function. Thus, the reason why the periodic function is defined by a pair of sine function and cosine function is to consider that the correlation value is influenced by the phase when the correlation value of the periodic function with respect to the signal is obtained. The variables F and k in each trigonometric function shown in FIG. 2 are variables corresponding to the sampling frequency F and the sample number k for the interval signal X. For example, a sine wave for the frequency f (0) is represented by sin (2πf (0) k / F), and given an arbitrary sample number k, the same time position as the kth sample forming the interval signal is given. The amplitude value of the periodic function at is obtained.

【００２１】ここでは、１２８通りの標準周波数ｆ
（０）〜ｆ（１２７）を図３に示すような式で定義した
例を示すことにする。すなわち、第ｎ番目（０≦ｎ≦１
２７）の標準周波数ｆ（ｎ）は、以下に示す〔数式１〕
で定義されることになる。Here, 128 standard frequencies f
An example in which (0) to f (127) are defined by equations shown in FIG. 3 will be shown. That is, the nth (0 ≦ n ≦ 1
The standard frequency f (n) of 27) is shown below [Formula 1].
Will be defined in.

【００２２】〔数式１〕ｆ（ｎ）＝４４０×２^γ(n) γ（ｎ）＝（ｎ−６９）／１２[Formula 1] f (n) = 440 × 2 ^{γ (n)} γ (n) = (n−69) / 12

【００２３】このような式によって標準周波数を定義し
ておくと、最終的にＭＩＤＩデータを用いた符号化を行
う際に便利である。なぜなら、このような定義によって
設定される１２８通りの標準周波数ｆ（０）〜ｆ（１２
７）は、等比級数をなす周波数値をとることになり、Ｍ
ＩＤＩデータで利用されるノートナンバーに対応した周
波数になるからである。したがって、図２に示す１２８
通りの標準周波数ｆ（０）〜ｆ（１２７）は、対数尺度
で示した周波数軸上に等間隔（ＭＩＤＩにおける半音単
位）に設定した周波数ということになる。Defining the standard frequency by such a formula is convenient when finally performing coding using MIDI data. This is because there are 128 standard frequencies f (0) to f (12) set by such a definition.
7) is to take frequency values forming a geometric series, and M
This is because the frequency corresponds to the note number used in the IDI data. Therefore, 128 shown in FIG.
The standard frequencies f (0) to f (127) are the frequencies set at equal intervals (semitone unit in MIDI) on the frequency axis shown by the logarithmic scale.

【００２４】（2.1.短時間フーリエ変換の手法）続い
て、任意の区間の区間信号に対する各周期関数の相関の
求め方について、具体的な説明を行う。例えば、図４に
示すように、ある単位区間ｄについて区間信号Ｘが与え
られていたとする。ここでは、区間長Ｌをもった単位区
間ｄについて、サンプリング周波数Ｆでサンプリングが
行なわれており、全部でｗ個のサンプル値が得られてい
るものとし、サンプル番号を図示のように、０，１，
２，３，・・・，ｋ，・・・，ｗ−２，ｗ−１とする
（白丸で示す第ｗ番目のサンプルは、右に隣接する次の
単位区間の先頭に含まれるサンプルとする）。この場
合、任意のサンプル番号ｋについては、Ｘ（ｋ）なる振
幅値がデジタルデータとして与えられていることにな
る。短時間フーリエ変換においては、Ｘ（ｋ）に対して
各サンプルごとに中央の重みが１に近く、両端の重みが
０に近くなるような窓関数Ｗ（ｋ）を乗ずることが通常
である。すなわち、Ｘ（ｋ）×Ｗ（ｋ）をＸ（ｋ）と扱
って以下のような相関計算を行うもので、窓関数の形状
としては余弦波形状のハミング窓が一般に用いられてい
る。ここで、ｗは以下の記述においても定数のような記
載をしているが、一般にはｎの値に応じて変化させ、区
間長Ｌを超えない範囲で最大となるＦ／ｆ（ｎ）の整数
倍の値に設定することが望ましい。(2.1. Method of Short-Time Fourier Transform) Next, a concrete description will be given of how to obtain the correlation of each periodic function with respect to a section signal of an arbitrary section. For example, as shown in FIG. 4, it is assumed that the section signal X is given for a certain unit section d. Here, it is assumed that the unit section d having the section length L is sampled at the sampling frequency F and w sample values are obtained in total, and the sample number is 0, as shown in the figure. 1,
2, 3, ..., K, ..., W-2, w-1 (the w-th sample indicated by a white circle is a sample included at the beginning of the next unit section adjacent to the right. ). In this case, for any sample number k, the amplitude value X (k) is given as digital data. In the short-time Fourier transform, it is usual to multiply X (k) by a window function W (k) such that the center weight is close to 1 and the weights at both ends are close to 0 for each sample. That is, the following correlation calculation is performed by treating X (k) × W (k) as X (k), and a cosine wave-shaped Hamming window is generally used as the shape of the window function. Here, w is also described as a constant in the following description, but it is generally changed according to the value of n, and the maximum F / f (n) within the range not exceeding the section length L is obtained. It is desirable to set the value to an integral multiple.

【００２５】このような区間信号Ｘに対して、第ｎ番目
の標準周波数ｆ（ｎ）をもった正弦関数Ｒｎとの相関値
を求める原理を示す。両者の相関値Ａ（ｎ）は、図５の
第１の演算式によって定義することができる。ここで、
Ｘ（ｋ）は、図４に示すように、区間信号Ｘにおけるサ
ンプル番号ｋの振幅値であり、ｓｉｎ（２πｆ（ｎ）ｋ
／Ｆ）は、時間軸上での同位置における正弦関数Ｒｎの
振幅値である。この第１の演算式は、単位区間ｄ内の全
サンプル番号ｋ＝０〜ｗ−１の次元について、それぞれ
区間信号Ｘの振幅値と正弦関数Ｒｎの振幅ベクトルの内
積を求める式ということができる。The principle of obtaining the correlation value with the sine function Rn having the nth standard frequency f (n) for such a section signal X will be described. The correlation value A (n) between the two can be defined by the first arithmetic expression in FIG. here,
X (k) is the amplitude value of the sample number k in the interval signal X, as shown in FIG. 4, and sin (2πf (n) k
/ F) is the amplitude value of the sine function Rn at the same position on the time axis. This first arithmetic expression can be said to be an expression for obtaining the inner product of the amplitude value of the interval signal X and the amplitude vector of the sine function Rn for each dimension of all sample numbers k = 0 to w−1 in the unit interval d. .

【００２６】同様に、図５の第２の演算式は、区間信号
Ｘと、第ｎ番目の標準周波数ｆ（ｎ）をもった余弦関数
との相関値を求める式であり、両者の相関値はＢ（ｎ）
で与えられる。なお、相関値Ａ（ｎ）を求めるための第
１の演算式も、相関値Ｂ（ｎ）を求めるための第２の演
算式も、最終的に２／ｗが乗ぜられているが、これは相
関値を規格化するためのものでり、前述のとおりｗはｎ
に依存して変化させるのが一般的であるため、この係数
もｎに依存する変数である。Similarly, the second arithmetic expression in FIG. 5 is an expression for obtaining the correlation value between the interval signal X and the cosine function having the nth standard frequency f (n). Is B (n)
Given in. Note that both the first arithmetic expression for obtaining the correlation value A (n) and the second arithmetic expression for obtaining the correlation value B (n) are finally multiplied by 2 / w. Is for normalizing the correlation value, and w is n as described above.
This coefficient is also a variable dependent on n, since it is generally changed depending on.

【００２７】区間信号Ｘと標準周波数ｆ（ｎ）をもった
標準周期関数との相関実効値は、図５の第３の演算式に
示すように、正弦関数との相関値Ａ（ｎ）と余弦関数と
の相関値Ｂ（ｎ）との二乗和平方根値Ｅ（ｎ）によって
示すことができる。この相関実効値の大きな標準周期関
数の周波数を代表周波数として選出すれば、この代表周
波数を用いて区間信号Ｘを符号化することができる。The effective value of the correlation between the interval signal X and the standard periodic function having the standard frequency f (n) is the correlation value A (n) with the sine function as shown in the third arithmetic expression of FIG. It can be represented by the square root sum of squares value E (n) with the correlation value B (n) with the cosine function. If the frequency of the standard periodic function with a large correlation effective value is selected as the representative frequency, the section signal X can be encoded using this representative frequency.

【００２８】すなわち、この相関値Ｅ（ｎ）が所定の基
準以上の大きさとなる１つまたは複数の標準周波数を代
表周波数として選出すれば良い。なお、ここで「相関値
Ｅ（ｎ）が所定の基準以上の大きさとなる」という選出
条件は、例えば、何らかの閾値を設定しておき、相関値
Ｅ（ｎ）がこの閾値を超えるような標準周波数ｆ（ｎ）
をすべて代表周波数として選出する、という絶対的な選
出条件を設定しても良いが、例えば、相関値Ｅ（ｎ）の
大きさの順にＱ番目までを選出する、というような相対
的な選出条件を設定しても良い。That is, one or a plurality of standard frequencies whose correlation value E (n) is greater than a predetermined standard may be selected as the representative frequency. The selection condition that “the correlation value E (n) is greater than or equal to a predetermined reference” is, for example, a threshold that is set in advance and the correlation value E (n) exceeds the threshold. Frequency f (n)
May be set as a representative frequency, but an absolute selection condition may be set. For example, relative selection conditions such as selecting up to the Qth in the order of the magnitude of the correlation value E (n). May be set.

【００２９】（2.2.一般化調和解析の手法）ここでは、
本発明に係る音響信号の符号化を行う際に有用な一般化
調和解析の手法について説明する。既に説明したよう
に、音響信号を符号化する場合、個々の単位区間内の区
間信号について、相関値の高いいくつかの代表周波数を
選出することになる。一般化調和解析は、より高い精度
で代表周波数の選出を可能にする手法であり、その基本
原理は次の通りである。(2.2. Generalized Harmonic Analysis Method) Here,
A method of generalized harmonic analysis useful for encoding an acoustic signal according to the present invention will be described. As already described, when the acoustic signal is encoded, some representative frequencies having a high correlation value are selected for the section signals in each unit section. Generalized harmonic analysis is a method that enables selection of representative frequencies with higher accuracy, and its basic principle is as follows.

【００３０】図６（ａ）に示すような単位区間ｄについ
て、信号Ｓ（ｊ）なるものが存在するとする。ここで、
ｊは後述するように、繰り返し処理のためのパラメータ
である（ｊ＝１〜Ｊ）。まず、この信号Ｓ（ｊ）に対し
て、図２に示すような１２８通りの周期関数すべてにつ
いての相関値を求める。そして、最大の相関値が得られ
た１つの周期関数の周波数を代表周波数として選出し、
当該代表周波数をもった周期関数を要素関数として抽出
する。続いて、図６（ｂ）に示すような含有信号Ｇ
（ｊ）を定義する。この含有信号Ｇ（ｊ）は、抽出され
た要素関数に、その振幅として、当該要素関数の信号Ｓ
（ｊ）に対する相関値を乗じることにより得られる信号
である。例えば、周期関数として図２に示すように、一
対の正弦関数と余弦関数とを用い、周波数ｆ（ｎ）が代
表周波数として選出された場合、振幅Ａ（ｎ）をもった
正弦関数Ａ（ｎ）ｓｉｎ（２πｆ（ｎ）ｋ／Ｆ）と、振
幅Ｂ（ｎ）をもった余弦関数Ｂ（ｎ）ｃｏｓ（２πｆ
（ｎ）ｋ／Ｆ）との和からなる信号が含有信号Ｇ（ｊ）
ということになる（図６（ｂ）では、図示の便宜上、一
方の関数しか示していない）。ここで、Ａ（ｎ），Ｂ
（ｎ）は、図５の式で得られる規格化された相関値であ
るから、結局、含有信号Ｇ（ｊ）は、信号Ｓ（ｊ）内に
含まれている周波数ｆ（ｎ）をもった信号成分というこ
とができる。It is assumed that the signal S (j) exists in the unit section d as shown in FIG. 6 (a). here,
As will be described later, j is a parameter for iterative processing (j = 1 to J). First, for this signal S (j), correlation values for all 128 periodic functions as shown in FIG. 2 are obtained. Then, the frequency of one periodic function for which the maximum correlation value is obtained is selected as the representative frequency,
A periodic function having the representative frequency is extracted as an element function. Then, the inclusion signal G as shown in FIG.
Define (j). The content signal G (j) is added to the extracted element function as its amplitude and the signal S of the element function.
It is a signal obtained by multiplying the correlation value for (j). For example, as shown in FIG. 2 as a periodic function, when a pair of sine function and cosine function are used and the frequency f (n) is selected as the representative frequency, the sine function A (n) having the amplitude A (n) is used. ) Sin (2πf (n) k / F) and a cosine function B (n) cos (2πf) having an amplitude B (n)
(N) k / F) is the content signal G (j)
That is, only one function is shown in FIG. 6B for convenience of illustration. Where A (n), B
Since (n) is the normalized correlation value obtained by the equation of FIG. 5, the contained signal G (j) eventually has the frequency f (n) contained in the signal S (j). It can be called a signal component.

【００３１】こうして、含有信号Ｇ（ｊ）が求まった
ら、信号Ｓ（ｊ）から含有信号Ｇ（ｊ）を減じることに
より、差分信号Ｓ（ｊ＋１）を求める。図６（ｃ）は、
このようにして求まった差分信号Ｓ（ｊ＋１）を示して
いる。この差分信号Ｓ（ｊ＋１）は、もとの信号Ｓ
（ｊ）の中から、周波数ｆ（ｎ）をもった信号成分を取
り去った残りの信号成分からなる信号ということができ
る。そこで、パラメータｊを１だけ増加させることによ
り、この差分信号Ｓ（ｊ＋１）を新たな信号Ｓ（ｊ）と
して取り扱い、同様の処理を、パラメータｊをｊ＝１〜
Ｊまで１ずつ増やしながらＪ回繰り返し実行すれば、Ｊ
個の代表周波数を選出することができる。When the content signal G (j) is obtained in this way, the difference signal S (j + 1) is obtained by subtracting the content signal G (j) from the signal S (j). FIG.6 (c) shows
The difference signal S (j + 1) thus obtained is shown. This difference signal S (j + 1) is the original signal S
It can be said that the signal is composed of the remaining signal components obtained by removing the signal component having the frequency f (n) from (j). Therefore, by increasing the parameter j by 1, the difference signal S (j + 1) is treated as a new signal S (j), and the same processing is performed for the parameter j from j = 1 to 1.
If you repeat J times while incrementing by 1 to J, J
Individual representative frequencies can be selected.

【００３２】このような相関計算の結果として出力され
るＪ個の含有信号Ｇ（１）〜Ｇ（Ｊ）は、もとの区間信
号Ｘの構成要素となる信号であり、もとの区間信号Ｘを
符号化する場合には、これらＪ個の含有信号の周波数を
示す情報および振幅（強度）を示す情報を符号データと
して用いるようにすれば良い。尚、Ｊは代表周波数の個
数であると説明してきたが、標準周波数ｆ（ｎ）の個数
と同一すなわちＪ＝１２８であってもよく、周波数スペ
クトルを求める目的においてはそのように行うのが通例
である。The J contained signals G (1) to G (J) output as a result of such a correlation calculation are signals which are constituent elements of the original section signal X, and are the original section signals. When X is coded, information indicating the frequency and amplitude (intensity) of these J contained signals may be used as code data. Although it has been described that J is the number of representative frequencies, it may be the same as the number of standard frequencies f (n), that is, J = 128, and it is customary to do so for the purpose of obtaining the frequency spectrum. Is.

【００３３】以上のような処理により、各単位区間につ
いて、各周波数に対する強度値の集合である周波数群が
得られることになる。このようにして所定数の周波数群
が選出されたら、この周波数群の各周波数に対応する
「音の高さを示す情報」、選出された各周波数の信号強
度に対応する「音の強さを示す情報」、当該単位区間の
始点に対応する「音の発音開始時刻を示す情報」、当該
単位区間に後続する単位区間の始点に対応する「音の発
音終了時刻を示す情報」、の４つの情報を含む所定数の
符号データを作成すれば、当該単位区間内の区間信号Ｘ
を所定数の符号データにより符号化することができる。
符号データとして、ＭＩＤＩデータを作成するのであれ
ば、「音の高さを示す情報」としてノートナンバーを用
い、「音の強さを示す情報」としてベロシティーを用
い、「音の発音開始時刻を示す情報」としてノートオン
時刻を用い、「音の発音終了時刻を示す情報」としてノ
ートオフ時刻を用いるようにすれば良い。By the above processing, a frequency group, which is a set of intensity values for each frequency, is obtained for each unit section. When a predetermined number of frequency groups are selected in this way, "information indicating the pitch of the sound" corresponding to each frequency of this frequency group, "sound intensity corresponding to the signal strength of each selected frequency""Informationindicating","information indicating the sound production start time" corresponding to the start point of the unit section, and "information indicating sound production end time" corresponding to the start point of the unit section subsequent to the unit section. If a predetermined number of code data including information is created, the section signal X in the unit section is generated.
Can be encoded by a predetermined number of encoded data.
If MIDI data is created as code data, note number is used as "information indicating pitch of tone", velocity is used as "information indicating intensity of tone", and "start time of sound generation" The note-on time may be used as the “information indicating” and the note-off time may be used as the “information indicating the sound production end time”.

【００３４】（3.本発明に係る周波数解析方法の理論的
根拠）以下、本発明に係る周波数解析方法および音響信
号の符号化方法について説明していく。まず、本発明に
係る周波数解析方法の理論について説明する。まず、時
系列信号のある時刻ｔにおける周波数成分を抽出するこ
とを考えてみる。現実的な計算のため、微小時間δを用
いると、時刻ｔにおける時系列信号ｇ（ｔ）は、以下の
〔数式２〕で近似できる。(3. Theoretical Basis of the Frequency Analysis Method According to the Present Invention) The frequency analysis method and the acoustic signal encoding method according to the present invention will be described below. First, the theory of the frequency analysis method according to the present invention will be described. First, consider extraction of frequency components of a time-series signal at a certain time t. If a small time δ is used for realistic calculation, the time series signal g (t) at time t can be approximated by the following [Formula 2].

【００３５】〔数式２〕ｇ（ｔ）≒Σ_δ［Σ_n｛Ａ_n(δ) sin(２πf_nt)＋Ｂ_n(δ)
cos(２πf_nt)｝］[Formula 2] g (t) ≅Σ _δ [Σ _n {A _n (δ) sin (2πf _n t) + B _n (δ)
cos (2πf _n t)}]

【００３６】上記〔数式２〕において、Σ_δはδ＝−τ
からδ＝＋τまでの総和を示し、Σ _nはｎ＝０からｎ＝
１２７までの総和を示す。なお、このｎは、上記ノート
ナンバーに対応している。さらに、上記〔数式２〕にお
けるＡ_n(δ)、Ｂ_n(δ)は以下の〔数式３〕で表現でき
る。In the above [Formula 2], Σ_δIs δ = −τ
To δ = + τ, and Σ _nIs from n = 0 to n =
The total sum up to 127 is shown. Note that this n is the above note
It corresponds to the number. Furthermore, in the above [Formula 2],
Kick A_n(δ), B_n(δ) can be expressed by the following [Equation 3].
It

【００３７】〔数式３〕Ａ_n(δ)＝（２/Ｔ_n）Σ_t｛ｇ（ｔ＋δ）sin(２πf_nt)｝Ｂ_n(δ)＝（２/Ｔ_n）Σ_t｛ｇ（ｔ＋δ）cos(２πf_nt)｝[0037] [Equation 3] _{A n (δ) = (2} / T n) Σ t {g (t + δ) sin (2πf n t)} B n (δ) = (2 / T n) Σ t {g ( t + δ) cos (2πf _n t)}

【００３８】上記〔数式３〕において、Ｔ_nは周期１/ f
_nの整数倍で単位区間長を超えない最大のものである。
Σ_tはｔ＝０からｔ＝Ｔ_n−１までの総和を示す。さて、
ここで求めたいのはｇ（ｔ）であり、これは、δ＝０の
ときの〔数式２〕の右辺により近似できる。これは、
〔数式３〕を用いてＡ_n(０)、Ｂ_n(０)を求めることによ
り得られる。周期関数に単位区間後方に重みが大きくな
るような窓関数W(t)を乗じるとｎの範囲を直前単位区間
−τだけを考慮するように近似させることができる。す
なわち、上記〔数式２〕は、以下の〔数式４〕のように
変形できる。In the above [Formula 3], T _n is the period 1 / f
It is the maximum value that is an integer multiple of _n and does not exceed the unit interval length.
Σ _t represents the total sum from t = 0 to t = T _n −1. Now,
What is desired here is g (t), which can be approximated by the right side of [Equation 2] when δ = 0. this is,
It is obtained by calculating A _n (0) and B _n (0) using [Equation 3]. When the periodic function is multiplied by the window function W (t) that increases the weight behind the unit section, the range of n can be approximated so that only the immediately preceding unit section −τ is considered. That is, the above [Formula 2] can be transformed into the following [Formula 4].

【００３９】〔数式４〕ｇ（ｔ）≒Σ_n｛Ａ_n(０) W(t) sin(２πf_nt)＋Ｂ_n(０)
W(t) cos(２πf_nt)｝＋Σ_n｛Ａ_n(-τ) W(t) sin(２πf_n
t)＋Ｂ_n(-τ) W(t) cos(２πf_nt)｝[Formula 4] g (t) ≈Σ _n {A _n (0) W (t) sin (2πf _n t) + B _n (0)
W (t) cos (2πf _n t)} ＋ Σ _n {A _n (-τ) W (t) sin (2πf _n
t) ＋ B _n (-τ) W (t) cos (2πf _n t)}

【００４０】上記〔数式４〕の右辺第１項が算出したい
パラメータであるため、右辺第１項をｇ'（ｔ）とする
と、〔数式４〕は、ｇ'(ｔ)＝ｇ(ｔ)−Σ_n｛Ａ_n(-τ) W(t) sin(２πf_nt)＋
Ｂ_n(-τ) W(t) cos(２πf_nt)｝と変形され、左辺に±ｇ(ｔ−τ)を追加することによ
り、以下の〔数式５〕のように変形される。Since the first term on the right side of the above [Equation 4] is a parameter to be calculated, if the first term on the right side is g ′ (t), then [Equation 4] becomes g ′ (t) = g (t). −Σ _n {A _n (-τ) W (t) sin (2πf _n t) +
B _n (−τ) W (t) cos (2πf _n t)}, and by adding ± g (t−τ) to the left side, it is transformed as shown in [Equation 5] below.

【００４１】〔数式５〕ｇ'(ｔ)＝ｇ(ｔ)−ｇ(ｔ−τ)＋ｇ(ｔ−τ)−Σ_n｛Ａ
_n(-τ) W(t) sin(２πf_nt)＋Ｂ_n(-τ) W(t) cos(２πf_n
t)｝[Formula 5] g ′ (t) = g (t) −g (t−τ) + g (t−τ) −Σ _n {A
_n (-τ) W (t) sin (2πf _n t) ＋ B _n (-τ) W (t) cos (2πf _n
t)}

【００４２】終了時刻がｔである単位区間を単位区間ｄ
_tと呼ぶことにすると、上記〔数式５〕の右辺における
「ｇ(ｔ)−ｇ(ｔ−τ)」の部分は、単位区間ｄ_tにおけ
る信号ｇ（ｔ）から単位区間ｄ_t-τにおける信号を除い
たものとなり、上記〔数式５〕の左辺における「ｇ(ｔ
−τ)−Σ_n｛Ａ_n(-τ) W(t) sin(２πf_nt)＋Ｂ_n(-τ)W
(t) cos(２πf_nt)｝」の部分は、単位区間ｄ_t-τにおい
て原信号から含有信号成分を除いた差分となる。これを
具体的に示すと図７に示すようになる。結局、上記〔数
式２〕〜〔数式５〕に示した変形は、単位区間ｄ_tにつ
いて周波数解析を行なおうとする場合に、単位区間ｄ_t
における原信号ｇ(ｔ)に対して解析を行なうのではな
く、図７の最下段に示すｇ'(ｔ)に対して行うべきであ
ることを示している。したがって、前の区間の信号の値
が小さいｇ'(ｔ)に対して行った方が、前の区間に影響
されることなく、精度の高い解析が可能となる。なお、
計算上、Ａ_n(０)、Ｂ_n(０)は以下の〔数式６〕により求
められることになる。A unit section whose end time is t is a unit section d.
If will be referred to as _t, the portion of the "g (t) -g (t- τ) " in the right side of the [Equation 5], in the unit interval d _t-tau from a signal g (t) in the unit interval d _t The signal is excluded, and "g (t
−τ) −Σ _n {A _n (−τ) W (t) sin (2πf _n t) + B _n (−τ) W
(t) cos (2πf _n t)} ”is a difference obtained by removing the contained signal component from the original signal in the unit section d _t− τ. This is specifically shown in FIG. 7. After all, the modifications shown in [Equation 2] to [Equation 5] have the unit interval d _{t when the} frequency analysis is performed on the unit interval d _t.
It is shown that the original signal g (t) in FIG. 7 should not be analyzed but g ′ (t) shown at the bottom of FIG. 7 should be analyzed. Therefore, it is possible to perform a highly accurate analysis without being influenced by the previous section by performing g '(t) in which the signal value in the previous section is small. In addition,
From the calculation, A _n (0) and B _n (0) are obtained by the following [Equation 6].

【００４３】〔数式６〕Ａ_n(０)＝（Ｓ_n/Ｔ_n）Σ_t｛ｇ'(ｔ) W(t) sin(２πf_nt)｝Ｂ_n(０)＝（Ｃ_n/Ｔ_n）Σ_t｛ｇ'(ｔ) W(t) cos(２πf_nt)｝[Equation 6] A _n (0) = (S _n / T _n ) Σ _t {g ′ (t) W (t) sin (2πf _n t)} B _n (0) = (C _n / T _n ) Σ _t {g '(t) W (t) cos (2πf _n t)}

【００４４】上記〔数式６〕におけるＳ_n、Ｃ_nは窓関数
に依存した補正係数で、Ｗ（ｔ）がフラットな矩形なら
ば、２．０の値をとる。具体的には、Ｓ_n、Ｃ_nは以下
の〔数式７〕で表現される。S _n and C _n in the above [Formula 6] are correction coefficients depending on the window function, and take a value of 2.0 if W (t) is a flat rectangle. Specifically, S _n and C _n are represented by the following [Formula 7].

【００４５】〔数式７〕Ｓ_n＝１／［Σ_t｛W(t) sin(２πf_nt)｝²］Ｃ_n＝１／［Σ_t｛W(t) cos(２πf_nt)｝²］[Formula 7] S _n = 1 / [Σ _t {W (t) sin (2πf _n t)} ² ] C _n = 1 / [Σ _t {W (t) cos (2πf _n t)} ² ]

【００４６】（3.1.本発明に係る周波数解析方法の具体
的処理）上記理論を踏まえて以下、本発明の周波数解析
方法について、音響信号を符号化する場合を例にとっ
て、図８に示すフローチャートを用いて説明する。ま
ず、時系列の音響信号に対して単位区間を設定して区間
信号の抽出を行う（ステップＳ１）。これは、上記（基
本原理）の項において、図１を用いて説明したように、
所定の長さの単位区間が設定されて抽出が行われる。た
だし、本発明においては、図７に示したように単位区間
同士を所定の量だけ重複させて設定する。(3.1. Specific processing of frequency analysis method according to the present invention) Based on the above theory, the frequency analysis method of the present invention will be described below with reference to the flowchart shown in FIG. It demonstrates using. First, a unit section is set for a time-series acoustic signal, and a section signal is extracted (step S1). This is as explained using FIG. 1 in the above (basic principle) section.
A unit section having a predetermined length is set and extraction is performed. However, in the present invention, as shown in FIG. 7, the unit sections are set to overlap each other by a predetermined amount.

【００４７】次に、抽出された区間信号の修正を行う
（ステップＳ２）。これは、上記（3.理論的根拠）の項
において説明したように、直前単位区間と解析対象単位
区間の重複部分を、直前単位区間において解析した最終
残差信号に置き換えることにより行われる。ただし、こ
のステップＳ２の処理は、直前の単位区間が存在しない
先頭の単位区間については行われない。先頭の単位区間
については、抽出された区間信号に対してステップＳ３
以降の処理が行われることになる。Next, the section signal thus extracted is corrected (step S2). This is performed by replacing the overlapping portion of the immediately preceding unit section and the unit section to be analyzed with the final residual signal analyzed in the immediately preceding unit section, as described in the section (3. Theoretical basis) above. However, the process of step S2 is not performed for the first unit section in which the immediately preceding unit section does not exist. For the first unit section, step S3 is performed on the extracted section signal.
Subsequent processing will be performed.

【００４８】次に、修正された区間信号とあらかじめ準
備された調和信号との相関を算出する（ステップＳ
３）。この調和信号とは、上記（周期関数との相関を求
める具体的な方法）の項で説明した周期関数と同じもの
である。周期関数の周波数としては、図２の例に示した
ように、ノートナンバーに対応した１２８個の標準周波
数を設定しても良いが、本実施形態では、各標準周波数
間に等比級数の間隔で１２個の微細周波数を設定し、全
部で１２８×１３個の調和信号を用意している。例え
ば、隣接する標準周波数ｆ（ｎ）とｆ（ｎ＋１）間で
は、ｆ（ｎ＋１／１３）〜ｆ（ｎ＋１２／１３）の１２
個の微細周波数が設定されることになる。このステップ
Ｓ３においては、区間信号と１２８×１３個の調和信号
との相関を短時間フーリエ変換の手法により求める。こ
の結果、１２８×１３個の各微細周波数に対応する相関
値が得られる。この相関値は、あらかじめ準備された微
細相関配列に格納される。Next, the correlation between the corrected section signal and the harmonic signal prepared in advance is calculated (step S).
3). This harmonic signal is the same as the periodic function described in the above section (Specific method for obtaining correlation with periodic function). As the frequency of the periodic function, 128 standard frequencies corresponding to the note numbers may be set as shown in the example of FIG. 2, but in the present embodiment, intervals of geometric series are provided between the standard frequencies. , 12 fine frequencies are set, and a total of 128 × 13 harmonic signals are prepared. For example, between the adjacent standard frequencies f (n) and f (n + 1), 12 of f (n + 1/13) to f (n + 12/13)
Individual fine frequencies will be set. In step S3, the correlation between the interval signal and 128 × 13 harmonic signals is obtained by the short-time Fourier transform method. As a result, 128 × 13 correlation values corresponding to the respective fine frequencies are obtained. This correlation value is stored in the fine correlation array prepared in advance.

【００４９】続いて、算出された相関値に基づいてピー
クとなる微細周波数（以下、ピーク周波数という）を選
出する（ステップＳ４）。具体的には、算出された１２
８×１３個の相関値を基に、隣接する両側の微細周波数
の相関値よりも、自身の相関値が大きくなっている微細
周波数を選出する。Then, a fine frequency having a peak (hereinafter referred to as a peak frequency) is selected based on the calculated correlation value (step S4). Specifically, the calculated 12
Based on the 8 × 13 correlation values, the fine frequency whose self correlation value is larger than the correlation values of the fine frequencies on both sides adjacent to each other is selected.

【００５０】次に、選出された各ピーク周波数に対応す
る調和信号と、修正区間信号との相関を一般化調和解析
の手法により求める（ステップＳ５）。具体的には、上
記（一般化調和解析の手法）の項で説明したのと同様
に、まず、選出されたピーク周波数のうち、相関値が最
大のものに対応する調和信号と区間信号Ｓ（１）（＝
Ｘ）との相関を求める。続いて、このときの調和信号
に、その振幅として算出された相関値を乗じることによ
り得られる信号を含有信号Ｇ（１）とし、この含有信号
Ｇ（１）を区間信号Ｓ（１）から減じることにより差分
信号Ｓ（２）を得る。次に、ステップＳ３において選出
されたピーク周波数のうち、区間信号Ｘとの相関値が２
番目に大きいピーク周波数の調和信号と、差分信号Ｓ
（２）との相関を求め、含有信号Ｇ（２）を区間信号か
ら減じることにより差分信号Ｓ（３）を得る。このよう
にして、一般化調和解析の手法により、選出された全て
のピーク周波数の相関値を求めていく。ここで、所定数
の周波数成分を抽出した時点で、残った差分信号が最終
残差信号となる。Next, the correlation between the harmonic signal corresponding to each selected peak frequency and the corrected section signal is obtained by the generalized harmonic analysis method (step S5). Specifically, as described in the section (Generalized Harmonic Analysis Method) above, first, among the selected peak frequencies, the harmonic signal and the section signal S (corresponding to the one having the largest correlation value) are selected. 1) (=
X) is calculated. Subsequently, a signal obtained by multiplying the harmonic signal at this time by a correlation value calculated as its amplitude is set as a contained signal G (1), and this contained signal G (1) is subtracted from the section signal S (1). Thus, the difference signal S (2) is obtained. Next, of the peak frequencies selected in step S3, the correlation value with the interval signal X is 2
The harmonic signal with the second largest peak frequency and the difference signal S
The difference signal S (3) is obtained by obtaining the correlation with (2) and subtracting the contained signal G (2) from the section signal. In this way, the correlation values of all the selected peak frequencies are obtained by the method of generalized harmonic analysis. Here, when the predetermined number of frequency components are extracted, the remaining difference signal becomes the final residual signal.

【００５１】選出されたピーク周波数は、微細周波数に
対応しているため、符号化の際には、ノートナンバーに
対応した標準周波数に戻す必要がある。そのため、算出
された相関値を、ピーク周波数に最も近い標準周波数に
対応する相関値として、主相関配列に設定する。これに
より、標準周波数とその相関値の組が複数得られること
になり、この標準周波数をノートナンバーとし、相関値
をベロシティとすることにより、ＭＩＤＩ規格の符号化
データが得られる（ステップＳ６）。以上の処理を全単
位区間により行うことにより、音響信号全体の符号化が
可能となる。Since the selected peak frequency corresponds to the fine frequency, it is necessary to restore the standard frequency corresponding to the note number at the time of encoding. Therefore, the calculated correlation value is set in the main correlation array as the correlation value corresponding to the standard frequency closest to the peak frequency. As a result, a plurality of sets of standard frequencies and their correlation values are obtained. By using this standard frequency as the note number and the correlation value as the velocity, encoded data of the MIDI standard is obtained (step S6). By performing the above processing for all unit sections, it is possible to code the entire acoustic signal.

【００５２】（3.2.抽出周波数成分決定の２段階処理）
上記ステップＳ４・Ｓ５においては、１回だけピーク周
波数を選出し、選出された各ピーク周波数について一般
化調和解析の手法により相関値を算出したが、同様の処
理を再度行うことにより解析精度および符号化精度を高
めることができる。このような場合について、以下説明
する。上記ステップＳ５のように区間信号Ｘから含有信
号を減じる処理を繰り返し行っていくことにより、異な
った区間信号Ｘ’が得られることになる。この区間信号
Ｘ’に対してステップＳ３〜ステップＳ５の処理を繰り
返す。(3.2. Two-step processing for determining extraction frequency component)
In the above steps S4 and S5, the peak frequency is selected only once, and the correlation value is calculated for each of the selected peak frequencies by the method of generalized harmonic analysis. The accuracy of conversion can be increased. Such a case will be described below. By repeating the process of subtracting the contained signal from the section signal X as in step S5, different section signals X ′ are obtained. The processing of steps S3 to S5 is repeated for this section signal X '.

【００５３】区間信号Ｘ’と相関をとる微細周波数とし
ては、既に主相関配列に相関値が格納された標準周波数
に該当するものは除かれる。例えば、標準周波数ｆ
（ｎ）に対応する相関値が主相関配列に設定されている
場合には、微細周波数ｆ（ｎ−６／１３）〜ｆ（ｎ＋６
／１３）は、区間信号Ｘ’との相関算出の対象から外さ
れる。残った微細周波数に対応する調和信号と区間信号
Ｘ’との相関を短時間フーリエ変換の手法により求めら
れ、算出された相関値は微細相関配列に格納される。As the fine frequencies correlated with the interval signal X ', those corresponding to the standard frequency whose correlation value is already stored in the main correlation array are excluded. For example, the standard frequency f
When the correlation value corresponding to (n) is set in the main correlation array, fine frequencies f (n-6 / 13) to f (n + 6)
/ 13) is excluded from the target of the correlation calculation with the section signal X ′. The correlation between the harmonic signal corresponding to the remaining fine frequency and the section signal X ′ is obtained by the short-time Fourier transform method, and the calculated correlation value is stored in the fine correlation array.

【００５４】続いて、ステップＳ４において、算出され
た相関値に基づいて再びピーク周波数を選出した後、ス
テップＳ５において、選出された各ピーク周波数と区間
信号Ｘ’との相関を求めた後、算出された相関値を主相
関配列に設定する。Subsequently, in step S4, the peak frequency is selected again based on the calculated correlation value, and in step S5, the correlation between each selected peak frequency and the section signal X'is obtained, and then calculated. The calculated correlation value is set in the main correlation array.

【００５５】ステップＳ３〜ステップＳ５の処理は、何
回でも繰り返すことが可能であるが、音響信号の符号化
処理を行う場合には、２回程度で十分である。２回行う
と、ＭＩＤＩ規格で同時発音可能な６４音程度は抽出で
きるためである。The processes of steps S3 to S5 can be repeated any number of times, but when the encoding process of the acoustic signal is performed, about twice is sufficient. This is because if performed twice, about 64 tones that can be simultaneously pronounced according to the MIDI standard can be extracted.

【００５６】（3.3.同一標準周波数内における抽出周波
数成分の決定）上記ステップＳ５における標準周波数の
決定および相関値の算出は、ピーク周波数の近傍にある
標準周波数を、算出された相関値に対応するものとして
決定した。これは、１つの標準周波数の範囲に含まれる
ピーク周波数が１つだけである場合には問題ないが、１
つの標準周波数の範囲に含まれるピーク周波数が複数あ
る場合には、そのどちらを選ぶかが問題となる。そのた
め、後者の場合には、１つの標準周波数に含まれるピー
ク周波数のうち、短時間フーリエ変換による相関値が最
大のものだけをピーク周波数として選出し、一般化調和
解析の手法により区間信号Ｘとの相関を求めるようにす
る。(3.3. Determination of Extraction Frequency Component in Same Standard Frequency) The determination of the standard frequency and the calculation of the correlation value in the step S5 correspond to the calculated correlation value for the standard frequency in the vicinity of the peak frequency. Decided as one. This is not a problem when only one peak frequency is included in one standard frequency range.
When there are a plurality of peak frequencies included in one standard frequency range, it becomes a problem which one is selected. Therefore, in the latter case, among the peak frequencies included in one standard frequency, only the one having the maximum correlation value by the short-time Fourier transform is selected as the peak frequency, and the section signal X is obtained by the generalized harmonic analysis method. Try to find the correlation of.

【００５７】（3.4.区間更新幅の決定処理）以上のよう
な周波数成分の抽出処理および抽出した周波数成分の符
号化は各単位区間ごとに行われるが、上述のように単位
区間の設定は固定ではなく、状況に応じて変更すること
が可能である。このとき、１つの単位区間の長さをどの
ように設定するかは、解析結果に大きな影響を与える重
要な問題である。すなわち、区間長を短く設定すればす
るほど、時間分解能が向上するため、音声信号（ボーカ
ル信号）などの周波数変化を解析するには好都合である
が、逆に、周波数分解能は低下するため、解析精度は低
下するという問題が生じることになる。そこで従来は、
１つの単位区間の中に、解析に最も都合が良いと思われ
るサンプル数（例えば、１０２４サンプル）が含まれる
ように、標準的な区間長をあらかじめ定めておき、この
固定の区間長に基づいて時間軸を機械的に区切ってゆく
という方法を採るのが一般的であった。(3.4. Interval Update Width Determination Processing) The frequency component extraction processing and the extracted frequency component encoding as described above are performed for each unit interval, but the unit interval settings are fixed as described above. Rather, it can be changed according to the situation. At this time, how to set the length of one unit section is an important problem that greatly affects the analysis result. That is, the shorter the section length is set, the more the time resolution is improved, which is convenient for analyzing the frequency change of a voice signal (vocal signal) and the like. The problem is that the accuracy is reduced. So conventionally,
A standard interval length is set in advance so that the number of samples (for example, 1024 samples) that seems to be most convenient for analysis is included in one unit interval, and based on this fixed interval length It was common to adopt a method of mechanically dividing the time axis.

【００５８】しかしながら、固定長の単位区間を機械的
に設定するという方法では、解析対象となる時系列信号
がどのような信号であるかに関わらず、常に一義的な区
間設定が行われることになるため、単位区間の境界部分
において不連続性が生じるおそれがある。すなわち、隣
接する単位区間についての周波数解析の結果に大きな差
が生じてしまうことがある。一般に、得られた周波数解
析の結果を、スペクトルのグラフとして観察するために
利用するような場合には、単位区間の境界において解析
結果に差が生じていても、大きな問題にはならない。と
ころが、音響信号に対する解析結果を利用して、原音響
信号に対応する符号データを作成するような場合、境界
部分における不連続性は大きな問題となる。このような
方法で符号化されたデータに基づいて、原音響信号を再
生しようとすると、単位区間の境界において、音程の急
激な変動が生じたり、音とびが生じたりすることにな
る。However, in the method of mechanically setting a fixed-length unit section, a unique section is always set regardless of what kind of signal the time-series signal to be analyzed is. Therefore, discontinuity may occur at the boundary of the unit section. That is, a large difference may occur in the frequency analysis results of the adjacent unit sections. Generally, when the obtained frequency analysis result is used for observing it as a spectrum graph, even if there is a difference in the analysis result at the boundary of the unit section, it does not cause a big problem. However, when the coded data corresponding to the original acoustic signal is created by using the analysis result of the acoustic signal, the discontinuity at the boundary becomes a big problem. When an original audio signal is reproduced based on the data encoded by such a method, a sharp pitch change or a pitch skip occurs at the boundary of the unit section.

【００５９】このような境界における不連続性を避ける
ために、時間軸上において単位区間を部分的に重複させ
て設定する方法も提案されているが、この場合も重複さ
せる量が固定であるため、本来重複させる必要のない箇
所まで重複して処理することになり、無駄な演算処理が
増加するという問題が生じることになる。本発明では、
時系列信号を解析する際に、単位区間の境界において、
音の連続性を保つと共に、単位区間の重複を最小限に抑
えるようにした単位区間の設定についても工夫を行って
いるので、以下に説明する。In order to avoid such discontinuity at the boundary, a method of partially overlapping the unit intervals on the time axis has been proposed, but in this case also, the overlapping amount is fixed. As a result, redundant processing is performed up to a portion that originally does not need to be duplicated, which causes a problem that unnecessary calculation processing is increased. In the present invention,
When analyzing a time series signal, at the boundary of the unit interval,
Since the unit continuity is set so that the continuity of the sound is maintained and the duplication of the unit spans is minimized, it will be described below.

【００６０】特に本発明においては、上記ステップＳ３
における短時間フーリエ変換により得られた相関値を利
用することにより単位区間の設定を行うようにしたこと
を特徴としている。具体的には、ある単位区間において
周波数成分の抽出を行う際に、ステップＳ３において最
初に得られる微細相関配列の値を、区間決定のために用
いる直前相関配列にコピーしておく。そして、ステップ
Ｓ２〜ステップＳ４の処理を所定の回数だけ行って、１
つの単位区間において抽出すべき周波数成分を決定した
後、次の単位区間の位置を設定する。本発明において
は、単位区間は固定長であるため、単位区間の開始位置
が定まれば、単位区間が決定する。従って本発明におい
ては、前の単位区間の開始位置から後の単位区間の開始
位置までの送り幅を定めることにより、後の単位区間が
決定されることになる。Particularly in the present invention, the above step S3
It is characterized in that the unit section is set by using the correlation value obtained by the short-time Fourier transform in. Specifically, when the frequency component is extracted in a certain unit section, the value of the fine correlation array first obtained in step S3 is copied to the immediately preceding correlation array used for section determination. Then, the processes of steps S2 to S4 are performed a predetermined number of times to
After determining the frequency component to be extracted in one unit section, the position of the next unit section is set. In the present invention, since the unit section has a fixed length, the unit section is determined when the start position of the unit section is determined. Therefore, in the present invention, the subsequent unit section is determined by determining the feed width from the start position of the previous unit section to the start position of the subsequent unit section.

【００６１】まず、既に確定している前の単位区間と、
これから位置を決定する単位区間の類似度を求める（ス
テップＳ７）。具体的には、ピーク周波数選出のため
に、ステップＳ２で行った短時間フーリエ変換の結果得
られた微細相関配列の値を利用して以下の〔数式８〕に
より算出する。First, the previously determined unit section,
From this, the degree of similarity of the unit section for determining the position is obtained (step S7). Specifically, for the peak frequency selection, the value of the fine correlation array obtained as a result of the short-time Fourier transform performed in step S2 is used to calculate by the following [Equation 8].

【００６２】〔数式８〕ＳＭ＝100−100×［Σ_l｛V_dm-1(l)−V_dm(l)｝²］^1/2 ／
｛Σ_lV_dm-1(l)²｝^1/2 [Equation 8] SM = 100−100 × [Σ _l {V _dm−1 (l) −V _dm (l)} ² ] ^1/2 /
{Σ _l V _dm-1 (l) ² } ^1/2

【００６３】上記〔数式８〕において、V_dm(l)は、決定
しようとしている単位区間ｄ_mの区間信号に対する微細
周波数ｋの相関値であり、V_dm-1(l)は、直前の単位区間
ｄ_m- ₁の区間信号に対する微細周波数ｋの相関値であ
る。ただし、ｌは周波数そのものを示すものではなく、
ノートナンバーｎに対応する標準周波数を１３分割した
値で表現され、ｌ＝−６〜１２７×１３＋６の値を
とる。類似度ＳＭの値が大きい程両区間信号は類似して
いることになる。In the above [Equation 8], V _dm (l) is the correlation value of the fine frequency k with respect to the section signal of the unit section d _m to be determined, and V _dm-1 (l) is the immediately preceding unit. It is a correlation value of the fine frequency k with respect to the section signal in the section d _m- ₁ . However, l does not indicate the frequency itself,
It is expressed by a value obtained by dividing the standard frequency corresponding to the note number n into 13, and takes values of l = -6 to 127 × 13 + 6. The larger the value of the similarity SM, the more similar the two section signals are.

【００６４】次に、この類似度ＳＭの値と閾値を比較す
る。類似度ＳＭが閾値２より小さい場合は（ステップＳ
８）、設定されている送り幅を半分にする（ステップＳ
９）。送り幅を半分にした位置に仮単位区間の設定をし
直し（ステップＳ１）、新たな仮単位区間の区間信号と
調和信号との相関を求める（ステップＳ３）。ステップ
Ｓ１〜Ｓ３およびステップＳ７〜Ｓ９の処理は、類似度
ＳＭが閾値２以上となるまで繰り返される。すなわち、
閾値２以上となるまで、後の単位区間を前の単位区間と
重複させる範囲を大きくしていくことになる。Next, the value of this similarity SM and the threshold value are compared. When the similarity SM is smaller than the threshold value 2 (step S
8), halve the set feed width (step S
9). The provisional unit section is reset at the position where the feed width is halved (step S1), and the correlation between the section signal of the new provisional unit section and the harmonic signal is obtained (step S3). The processes of steps S1 to S3 and steps S7 to S9 are repeated until the similarity SM becomes equal to or greater than the threshold value 2. That is,
Until the threshold value becomes 2 or more, the range in which the subsequent unit section overlaps with the previous unit section is increased.

【００６５】類似度ＳＭが閾値２以上となったら、類似
度ＳＭと閾値１との比較を行う。類似度が閾値１未満で
ある場合は、仮単位区間を単位区間として確定し、ピー
ク位置の選出処理へ進む（ステップＳ４）。類似度が閾
値１以上である場合は、送り幅を２倍に設定する（ステ
ップＳ１１）と共に、仮単位区間を単位区間として確定
し、ピーク位置の選出処理へ進む（ステップＳ４）。こ
こで、２倍に変更された送り幅は、次の単位区間の送り
幅として利用されることになる。単位区間の確定後は、
ステップＳ４、Ｓ５の処理は同様に行われ、抽出周波数
成分が決定される。なお、当然のことながら、上記２つ
の閾値は、閾値１＞閾値２の関係を満たすものとなって
いる。When the similarity SM exceeds the threshold 2, the similarity SM and the threshold 1 are compared. If the degree of similarity is less than the threshold value 1, the provisional unit section is determined as a unit section, and the process proceeds to peak position selection processing (step S4). When the similarity is equal to or more than the threshold value 1, the feed width is set to double (step S11), the provisional unit section is confirmed as a unit section, and the peak position selecting process is performed (step S4). Here, the doubled feed width is used as the feed width of the next unit section. After confirming the unit section,
The processes of steps S4 and S5 are performed in the same manner to determine the extraction frequency component. In addition, as a matter of course, the two threshold values satisfy the relationship of threshold value 1> threshold value 2.

【００６６】単位区間についての抽出周波数が決定した
ら、この時点で設定されている送り幅だけ移動した位置
を開始位置として（ステップＳ１２）、仮単位区間を設
定する（ステップＳ１）。このとき、通常はあらかじめ
設定されている標準送り幅が利用され、ステップＳ１１
により送り幅が２倍に設定されている場合には、標準送
り幅の２倍を送り幅とする。When the extraction frequency for the unit section is determined, the position moved by the feed width set at this time is set as the start position (step S12), and the temporary unit section is set (step S1). At this time, normally, the preset standard feed width is used, and step S11
If the feed width is set to 2 by, the feed width is set to twice the standard feed width.

【００６７】ここで、設定される単位区間を、図９に示
す具体的な例を用いて説明する。図９（ａ）は、時系列
の音響信号を示す図である。図９（ｂ）に示すように音
響信号の先頭位置から固定長の単位区間ｄ１が設定さ
れ、ステップＳ３からステップＳ５の処理が行われるこ
とにより、抽出周波数が決定される。次の単位区間ｄ２
の設定にあたっては、ステップＳ１２において、あらか
じめ設定されている標準送り幅だけ後に送られて仮設定
される。本実施形態では、標準送り幅を固定長Ｌの１／
２としている。よって、単位区間ｄ１の先頭からＬ／２
だけずれた位置が単位区間ｄ２の先頭となるように単位
区間ｄ２が仮設定される。このとき仮設定された単位区
間ｄ２の状態を図９（ｃ）に示す。Here, the unit section to be set will be described using a concrete example shown in FIG. FIG. 9A is a diagram showing a time-series acoustic signal. As shown in FIG. 9B, the fixed-length unit section d1 is set from the head position of the acoustic signal, and the processes of steps S3 to S5 are performed to determine the extraction frequency. Next unit section d2
In setting S, in step S12, the standard feed width set in advance is sent afterward and provisionally set. In the present embodiment, the standard feed width is 1 / the fixed length L.
2 Therefore, L / 2 from the beginning of the unit section d1
The unit section d2 is tentatively set so that the position shifted by just the position becomes the head of the unit section d2. FIG. 9C shows the state of the unit section d2 temporarily set at this time.

【００６８】そして、ステップＳ２において、仮設定さ
れた単位区間ｄ２の位置の音響信号を区間信号として抽
出すると共に、直前単位区間の最終残差信号により修正
し、修正区間信号との相関をステップＳ３の手法により
求める。続いてステップＳ７において上記〔数式２〕に
示した類似度を算出する。算出した類似度が閾値２未満
である場合、すなわち、両区間信号があまり似ていない
場合は、ステップＳ９に進んで送り幅が１／２にされ
る。ここでは、Ｌ／２に設定されているので、Ｌ／４に
設定されるということになる。この場合、ステップＳ１
に戻って、単位区間ｄ１の先頭からＬ／４だけずれた位
置が単位区間ｄ２の先頭となるように単位区間ｄ２が仮
設定し直される。このとき仮設定された単位区間ｄ２の
状態を図９（ｄ）に示す。単位区間ｄ２が仮設定し直さ
れると、再びステップＳ２、Ｓ３Ｓ７の処理が繰り返さ
れる。さらに、ステップＳ８において、類似度が閾値２
未満である場合は、ステップＳ９において送り幅がＬ／
８に設定され、ステップＳ１に戻って単位区間ｄ１の先
頭からＬ／８だけずれた位置が単位区間ｄ２の先頭とな
るように単位区間ｄ２が仮設定し直される。このとき仮
設定された単位区間ｄ２の状態を図９（ｅ）に示す。Then, in step S2, the acoustic signal at the position of the temporarily set unit section d2 is extracted as a section signal and corrected by the final residual signal of the immediately preceding unit section, and the correlation with the corrected section signal is correlated with step S3. It is obtained by the method of. Then, in step S7, the degree of similarity shown in [Equation 2] is calculated. If the calculated similarity is less than the threshold value 2, that is, if the two section signals are not very similar, the process proceeds to step S9 and the feed width is halved. Here, since it is set to L / 2, it means that it is set to L / 4. In this case, step S1
Returning to step 1, the unit section d2 is tentatively set again so that the position displaced by L / 4 from the head of the unit section d1 becomes the head of the unit section d2. The state of the unit section d2 temporarily set at this time is shown in FIG. When the unit section d2 is temporarily set again, the processes of steps S2 and S3S7 are repeated again. Further, in step S8, the degree of similarity is threshold value 2
If it is less than, the feed width is L /
8, the process returns to step S1 and the unit section d2 is provisionally set again so that the position displaced by L / 8 from the start of the unit section d1 becomes the start of the unit section d2. The state of the unit section d2 temporarily set at this time is shown in FIG.

【００６９】すなわち、図９（ｃ）〜図９（ｅ）に示す
ように、先行する単位区間ｄ１と後続の単位区間ｄ２に
おける区間信号の類似度が閾値２以上となるまで、単位
区間ｄ２は単位区間ｄ１との重複区間を増やしていくこ
とになる。ただし、このままだと、単位区間ｄ１と単位
区間ｄ２が果てしなく近付いてしまう可能性があるの
で、送り幅の最小距離をあらかじめ設定しておき、送り
幅が最小距離以下となった場合は、その時点で仮設定さ
れている単位区間を正式な単位区間として確定し、ステ
ップＳ４の処理に進むようにする。That is, as shown in FIGS. 9 (c) to 9 (e), the unit section d2 is maintained until the similarity between the section signals in the preceding unit section d1 and the succeeding unit section d2 becomes the threshold value 2 or more. The number of overlapping sections with the unit section d1 will be increased. However, if it is left as it is, the unit section d1 and the unit section d2 may approach each other endlessly. Therefore, if the minimum distance of the feed width is set in advance and the feed width becomes equal to or less than the minimum distance, the point Then, the unit section temporarily set is confirmed as a formal unit section, and the process proceeds to step S4.

【００７０】一方、単位区間ｄ１の区間信号と、仮設定
された単位区間ｄ２の区間信号の類似度が閾値２以上で
閾値１未満となった場合には、仮設定されている単位区
間ｄ２を確定して、ステップＳ４以降の処理を行う。ま
た、類似度が閾値１以上となった場合も、仮設定されて
いる単位区間ｄ２が確定されて、ステップＳ４以降の処
理が行われることになるが、この場合、送り幅を２倍に
設定する処理も平行して行う。そして、単位区間ｄ２に
ついての抽出周波数成分が決定された後、ステップＳ１
２において、単位区間の先頭から２倍に設定された送り
幅だけ移動した位置を単位区間ｄ３の先頭位置とする。
例えば、単位区間ｄ２が図９（ｃ）に示すような位置で
確定された場合、単位区間ｄ３は、図９（ｆ）に示す位
置に仮設定されることになる。On the other hand, if the similarity between the section signal of the unit section d1 and the section signal of the temporarily set unit section d2 is greater than or equal to the threshold value 2 and less than the threshold value 1, the temporarily set unit section d2 is set. After the confirmation, the processing from step S4 is performed. Further, even when the similarity becomes equal to or more than the threshold value 1, the provisionally set unit section d2 is determined and the processing from step S4 is performed, but in this case, the feed width is set to double. The processing is also performed in parallel. Then, after the extraction frequency component for the unit section d2 is determined, step S1
In 2, the position moved from the head of the unit section by the doubled feed width is set as the head position of the unit section d3.
For example, when the unit section d2 is determined at the position shown in FIG. 9C, the unit section d3 is provisionally set at the position shown in FIG. 9F.

【００７１】（4.周波数解析の変形例）次に、変形例に
ついて説明する。図１０に本発明に係る周波数解析方法
の変形例を示すフローチャートである。まず、単位区間
の設定を行う（ステップＳ２１）。ここでも、上記図７
に示したのと同様に各単位区間が相互に重複するように
設定する。ここでは、解析対象とする単位区間を対象単
位区間と呼び、対象単位区間の直前の単位区間、直後の
単位区間をそれぞれ直前単位区間、直後単位区間と呼ぶ
ことにする。(4. Modified Example of Frequency Analysis) Next, a modified example will be described. FIG. 10 is a flowchart showing a modified example of the frequency analysis method according to the present invention. First, the unit section is set (step S21). Again, in FIG.
The unit sections are set so that they overlap each other as shown in FIG. Here, the unit section to be analyzed is referred to as a target unit section, and the unit section immediately before the target unit section and the unit section immediately after the target unit section are respectively referred to as an immediately preceding unit section and an immediately following unit section.

【００７２】続いて、対象単位区間、直前単位区間およ
び直後単位区間それぞれについて、短時間フーリエ変換
の手法により、標準周波数に対応する相関値を算出する
（ステップＳ２２）。短時間フーリエ変換による各標準
周波数に対応する相関値の算出は、上記（2.1.短時間フ
ーリエ変換の手法）の項で説明した処理と同様であるの
で、詳細な説明は省略する。ここでは、図２に示したよ
うな１２８通りの標準周期関数を利用するので、対象単
位区間、直前単位区間、直後単位区間それぞれについて
１２８の標準周波数とその相関値の組が合計１２８×３
個得られることになる。Then, the correlation value corresponding to the standard frequency is calculated for each of the target unit section, the immediately preceding unit section, and the immediately following unit section by the short-time Fourier transform method (step S22). The calculation of the correlation value corresponding to each standard frequency by the short-time Fourier transform is the same as the process described in the above section (2.1. Method of short-time Fourier transform), and thus detailed description thereof will be omitted. Here, since 128 standard periodic functions as shown in FIG. 2 are used, a total of 128 × 3 sets of 128 standard frequencies and their correlation values are set for each of the target unit section, the immediately preceding unit section, and the immediately following unit section.
You will get one.

【００７３】次に、一般化調和解析の手法を用いて抽出
する周波数成分を決定する（ステップＳ２３）。従来の
一般化調和解析の手法では、解析対象となる対象単位区
間において、短時間フーリエ変換の結果の相関値が大き
い順に、対応する標準周波数で合成される含有信号を用
いて区間信号を更新すると共に、更新された相関値を標
準周波数の強度成分としていたが、本発明では、直前単
位区間・対象単位区間・直後単位区間のいずれかにおい
て、短時間フーリエ変換の結果の相関値が大きい順に、
対応する標準周波数で合成される含有信号を用いて区間
信号を更新し、注目している単位区間が対象単位区間で
ある場合のみ相関値を標準周波数の強度成分とする。具
体的には、直前単位区間、対象単位区間、直後単位区間
の標準周波数の中で、上記短時間フーリエ変換による解
析の結果、最大の相関値を有する標準周波数に注目す
る。つまり、１２８×３個の相関値の中から相関値最大
となる標準周波数に注目するのである。そして、その相
関値を、所定の単位区間で再計算し、注目する標準周波
数と更新された相関値で合成される含有信号を注目した
標準周波数の注対象単位区間における区間信号から減じ
て差分信号を得て区間信号を更新するのである。このと
き、注目する単位区間が直前単位区間または直後単位区
間の場合には、含有信号の位相を対象単位区間との時間
差に対応する分だけずらし、対象単位区間の範囲内の区
間信号から減算して区間信号を更新する。注目する単位
区間が対象単位区間であった場合は、更新された相関値
を注目した標準周波数の強度成分とする。一方、注目す
る単位区間が直前単位区間または直後単位区間の場合に
は、その更新された相関値は前述のように区間信号を更
新する処理に使われるだけである。Next, the frequency component to be extracted is determined using the method of generalized harmonic analysis (step S23). In the conventional generalized harmonic analysis method, in the target unit section to be analyzed, the section signal is updated by using the contained signal synthesized at the corresponding standard frequency in descending order of the correlation value of the result of the short-time Fourier transform. Along with this, the updated correlation value was used as the strength component of the standard frequency, but in the present invention, in any of the immediately preceding unit section, the target unit section, and the immediately following unit section, the correlation value as a result of the short-time Fourier transform is in descending order,
The section signal is updated using the contained signal that is synthesized at the corresponding standard frequency, and the correlation value is used as the intensity component of the standard frequency only when the unit section of interest is the target unit section. Specifically, among the standard frequencies of the immediately preceding unit section, the target unit section, and the immediately following unit section, attention is paid to the standard frequency having the maximum correlation value as a result of the analysis by the short-time Fourier transform. In other words, attention is paid to the standard frequency having the maximum correlation value from the 128 × 3 correlation values. Then, the correlation value is recalculated in a predetermined unit section, and the content signal synthesized with the correlation value updated with the standard frequency of interest is subtracted from the section signal in the target unit section of the standard frequency of interest to obtain a difference signal. Is obtained and the section signal is updated. At this time, if the unit section of interest is the immediately preceding unit section or the immediately following unit section, the phase of the contained signal is shifted by an amount corresponding to the time difference from the target unit section and subtracted from the section signal within the range of the target unit section. To update the section signal. If the unit section of interest is the target unit section, the updated correlation value is set as the intensity component of the standard frequency of interest. On the other hand, when the unit section of interest is the immediately preceding unit section or the immediately following unit section, the updated correlation value is only used for the processing of updating the section signal as described above.

【００７４】例えば、直前単位区間・対象単位区間・直
後単位区間の短時間フーリエ変換による各標準周波数に
対応する相関値が図１１（ｃ）に示すような値であった
とし、各々の相関値を順に更新した結果を同図（ｄ）に
示す。なお、図１１（ｃ）においては、標準周波数を、
対応するノートナンバーで示しており、相関値は、所定
の基準に従って数値化したものを示している。図１１
（ｄ）の相関値の中で括弧無しの数値は強度成分に採用
されたもので、括弧付きの数値は区間信号の更新にのみ
使われたことを示す。図１１（ｃ）の例では、ノートナ
ンバー９８に対応する直前単位区間の相関値が「９２」
で、全相関値中最大であるので、直前単位区間におい
て、相関値を再計算する。その結果は、図１１（ｄ）に
示すように「９２」となる（初回は区間信号が元のまま
であるため、相関値は更新前と同一になる）。そして、
ノートナンバー９８に対応する標準周波数および直前単
位区間の相関値「９２」に基づく含有信号成分を、対象
単位区間の区間信号から減じて差分信号を得て、この信
号を新たな区間信号とする。ただし、相関値「９２」
は、対象単位区間でないため強度成分にはならない。次
に、残った相関値の中から最大の相関値を探すと、同様
にノートナンバー９８に対応する直後単位区間の相関値
「８６」となり、相関を再計算した結果は「６０」にな
る。そのため、ノートナンバー９８に対応する標準周波
数および直後単位区間の相関値「６０」に基づく含有信
号成分を、対象単位区間の区間信号から減じて区間信号
を更新する。同様に、相関値「６０」は対象単位区間で
はないため強度成分にはならない。続いて、残った相関
値の中から最大の相関値を探すと、ノートナンバー９９
に対応する直後単位区間の相関値「７８」となり、相関
を再計算した結果は「７０」になる。そのため、ノート
ナンバー９９に対応する標準周波数および直後単位区間
の相関値「７０」に基づく含有信号成分を、対象単位区
間の区間信号から減じて区間信号を更新する。同様に、
相関値「７０」は対象単位区間ではないため強度成分に
はならない。更に、残った相関値の中から最大の相関値
を探すと、ノートナンバー９７に対応する対象単位区間
の相関値「７０」となり、再計算した結果は「６０」に
なる。そのため、ノートナンバー９７に対応する標準周
波数および対象単位区間の相関値「６０」に基づく含有
信号成分を、対象単位区間の区間信号から減じて区間信
号を更新する。このとき、相関値「６０」は対象単位区
間であるためノートナンバー９７に対応する標準周波数
の強度成分になる。同様の処理を繰り返し行っていき、
１２８×３個の相関値に基づく含有信号成分が区間信号
から初期の相関値が大きい順に削除されてゆくととも
に、１２８個の相関値に基づく強度成分が、図１１
（ｄ）の対象単位区間の相関値の列に示されるように決
定されることになる。注目すべき点は、図１１（ｃ）で
示される短時間フーリエ変換の結果によると、相関値が
最大となるノートナンバーは９８であるが、同図（ｄ）
で示される最終結果ではノートナンバー９７の強度値が
最大となり、直前・直後の単位区間における信号による
影響を削除した、精度の高い抽出成分を得ることができ
る。For example, assuming that the correlation value corresponding to each standard frequency by the short-time Fourier transform of the immediately preceding unit section / target unit section / immediately following unit section is a value as shown in FIG. 11C, each correlation value The result of updating in sequence is shown in FIG. In addition, in FIG. 11C, the standard frequency is
Corresponding note numbers are shown, and the correlation value is shown as a numerical value according to a predetermined standard. Figure 11
Among the correlation values in (d), the values without brackets are adopted for the intensity component, and the bracketed values are used only for updating the interval signal. In the example of FIG. 11C, the correlation value of the immediately preceding unit section corresponding to the note number 98 is “92”.
Since it is the maximum among all the correlation values, the correlation value is recalculated in the immediately preceding unit section. As a result, as shown in FIG. 11D, the value becomes “92” (in the first time, the interval signal remains unchanged, so the correlation value becomes the same as before the update). And
The standard frequency corresponding to the note number 98 and the contained signal component based on the correlation value “92” of the immediately preceding unit section are subtracted from the section signal of the target unit section to obtain a difference signal, and this signal is set as a new section signal. However, the correlation value "92"
Is not an intensity component because it is not the target unit section. Next, when the maximum correlation value is searched from the remaining correlation values, the correlation value "86" in the immediately following unit section corresponding to the note number 98 is obtained, and the result of recalculating the correlation is "60". Therefore, the contained signal component based on the standard frequency corresponding to the note number 98 and the correlation value “60” of the immediately subsequent unit section is subtracted from the section signal of the target unit section to update the section signal. Similarly, since the correlation value “60” is not the target unit section, it does not become an intensity component. Then, when the maximum correlation value is searched from the remaining correlation values, the note number 99
Correlation value "78" in the unit section immediately after that corresponds to, and the result of recalculating the correlation becomes "70". Therefore, the contained signal component based on the standard frequency corresponding to the note number 99 and the correlation value “70” of the immediately subsequent unit section is subtracted from the section signal of the target unit section to update the section signal. Similarly,
The correlation value “70” is not a target component section and thus does not become an intensity component. Furthermore, when the maximum correlation value is searched from the remaining correlation values, the correlation value of the target unit section corresponding to the note number 97 becomes "70", and the recalculated result becomes "60". Therefore, the section signal is updated by subtracting the contained signal component based on the standard frequency corresponding to the note number 97 and the correlation value “60” of the target unit section from the section signal of the target unit section. At this time, since the correlation value “60” is the target unit section, it becomes the intensity component of the standard frequency corresponding to the note number 97. Repeat the same process,
The contained signal components based on 128 × 3 correlation values are deleted from the interval signal in the order of increasing initial correlation values, and the intensity components based on 128 correlation values are
It is decided as shown in the column of the correlation value of the target unit section of (d). According to the result of the short-time Fourier transform shown in FIG. 11C, noteworthy is that the note number having the maximum correlation value is 98, but FIG.
In the final result shown by, the intensity value of the note number 97 becomes maximum, and it is possible to obtain a highly accurate extracted component in which the influence of the signal in the immediately preceding and following unit sections is deleted.

【００７５】抽出された周波数成分については、上記基
本原理等と同様にノートナンバー、ベロシティ、デルタ
タイムに変換することによりＭＩＤＩ符号データが得ら
れる（ステップＳ２４）。MIDI code data is obtained by converting the extracted frequency components into note numbers, velocities, and delta times in the same manner as the above basic principle (step S24).

【００７６】上記変形例についても、各単位区間につい
て短時間フーリエ変換による相関値を有しているため、
これらに対して図８のステップＳ７〜Ｓ１２に示す処理
を行うことにより、単位区間の設定位置の変更を行うよ
うにすることも可能である。Also in the above modification, since each unit section has the correlation value by the short-time Fourier transform,
It is also possible to change the set position of the unit section by performing the processing shown in steps S7 to S12 of FIG. 8 on these.

【００７７】以上、本発明の好適な実施形態について説
明したが、上記周波数解析方法および音響信号符号化方
法は、コンピュータ等で実行されることは当然である。
具体的には、図８および図１０のフローチャートに示し
たようなステップを上記手順で実行するためのプログラ
ムをコンピュータに搭載しておく。そして、音響信号等
の時系列信号をＰＣＭ方式等でデジタル化した後、コン
ピュータに取り込み、ステップＳ１〜ステップＳ１２、
もしくはステップＳ２１〜ステップＳ２４の処理を行っ
た後、抽出した周波数成分もしくはＭＩＤＩ形式等の符
号データをデジタルデータとしてコンピュータより出力
する。出力された符号データは、例えば、ＭＩＤＩデー
タの場合、ＭＩＤＩシーケンサ、ＭＩＤＩ音源を用いて
音声として再生される。Although the preferred embodiment of the present invention has been described above, it goes without saying that the frequency analysis method and the acoustic signal encoding method are executed by a computer or the like.
Specifically, a program for executing the steps shown in the flowcharts of FIGS. 8 and 10 in the above procedure is installed in the computer. Then, after time-series signals such as acoustic signals are digitized by the PCM method or the like, they are loaded into a computer, and steps S1 to S12 are performed.
Alternatively, after performing the processing of steps S21 to S24, the extracted frequency component or code data in the MIDI format or the like is output from the computer as digital data. For example, in the case of MIDI data, the output code data is reproduced as sound using a MIDI sequencer and MIDI sound source.

【００７８】[0078]

【発明の効果】以上、説明したように本発明によれば、
時系列信号から所定の標準周波数に対応した信号成分を
分離するための周波数解析方法として、時系列信号に所
定の長さの単位区間を、各単位区間が互いに所定の長さ
だけ重複するように、設定する段階と、解析対象となる
対象単位区間において、この対象単位区間と所定の長さ
だけ重複する直前単位区間との重複区間における信号
を、直前単位区間において得られる最終残差信号に置き
換えた修正区間信号を作成する区間信号修正段階と、修
正区間信号に対して、一般化調和解析を利用することに
より、対象単位区間における周波数および対応する強度
の組を算出するスペクトル算出段階を有し、この対象単
位区間を新たな直前単位区間とし、一般化調和解析を利
用して修正区間信号から複数の含有信号を減じることに
より最終的に得られる信号を最終残差信号として、区間
信号修正段階およびスペクトル算出段階の処理を、設定
された全単位区間について行うことにより、全区間に渡
って周波数および強度の組を算出するようにしたので、
重複する区間の信号成分による影響を抑えた解析を行な
うことが可能となるという効果を奏する。As described above, according to the present invention,
As a frequency analysis method for separating a signal component corresponding to a predetermined standard frequency from a time series signal, a unit section of a predetermined length is added to the time series signal so that each unit section overlaps each other by a predetermined length. , In the setting unit and in the target unit section to be analyzed, the signal in the overlapping section of the immediately preceding unit section that overlaps the target unit section by a predetermined length is replaced with the final residual signal obtained in the immediately preceding unit section. And a spectrum calculation step for calculating a set of frequencies and corresponding intensities in the target unit section by using the generalized harmonic analysis for the modified section signal. , This target unit section is set as a new immediately preceding unit section, and finally obtained by subtracting multiple contained signals from the corrected section signal using generalized harmonic analysis. Signal as the final residual signal, the processing of the interval signal modification step and the spectrum calculation step, by performing for all units sections set, because to calculate the set of frequencies and intensity over the entire interval,
It is possible to perform an analysis that suppresses the influence of signal components in the overlapping section.

[Brief description of drawings]

【図１】本発明に係る周波数解析方法および音響信号符
号化方法の基本原理を示す図である。FIG. 1 is a diagram showing a basic principle of a frequency analysis method and an acoustic signal coding method according to the present invention.

【図２】本発明で利用される周期関数の一例を示す図で
ある。FIG. 2 is a diagram showing an example of a periodic function used in the present invention.

【図３】図２に示す各周期関数の周波数とＭＩＤＩノー
トナンバーｎとの関係式を示す図である。FIG. 3 is a diagram showing a relational expression between a frequency of each periodic function shown in FIG. 2 and a MIDI note number n.

【図４】解析対象となる信号と周期信号との相関計算の
手法を示す図である。FIG. 4 is a diagram showing a method of calculating a correlation between a signal to be analyzed and a periodic signal.

【図５】図４に示す相関計算を行うための計算式を示す
図である。5 is a diagram showing a calculation formula for performing the correlation calculation shown in FIG.

【図６】一般化調和解析の基本的な手法を示す図であ
る。FIG. 6 is a diagram showing a basic method of generalized harmonic analysis.

【図７】単位区間における区間信号の修正の様子を示す
図である。FIG. 7 is a diagram showing how a section signal is modified in a unit section.

【図８】本発明に係る周波数解析方法および音響信号符
号化方法のフローチャートである。FIG. 8 is a flowchart of a frequency analysis method and an acoustic signal coding method according to the present invention.

【図９】単位区間の設定手順を示す図である。FIG. 9 is a diagram showing a procedure for setting a unit section.

【図１０】本発明に係る周波数解析方法および音響信号
符号化方法の変形例の概要を示すフローチャートであ
る。FIG. 10 is a flowchart showing an outline of a modified example of the frequency analysis method and the acoustic signal coding method according to the present invention.

【図１１】本発明の変形例を説明するための図である。FIG. 11 is a diagram for explaining a modified example of the present invention.

【符号の説明】ｄ・・・単位区間Ｇ・・・含有信号Ｌ・・・単位区間長Ｓ・・・差分信号Ｘ・・・初期区間信号[Explanation of symbols] d: Unit section G: content signal L: Unit section length S ・・・ Differential signal X: Initial section signal

Claims

[Claims]

1. A frequency analysis method for separating a signal component corresponding to a predetermined standard frequency from a time-series signal, wherein the time-series signal has unit sections of a predetermined length, and each unit section has a predetermined section. And the signal in the overlapping part of the target unit section that is the analysis target and the immediately preceding unit section that overlaps the target unit section by a predetermined length, in the immediately preceding unit section. A section signal correction step of creating a modified section signal replaced with the final residual signal obtained, and a frequency and corresponding intensity value in the target unit section by using a generalized harmonic analysis for the modified section signal. And a spectrum calculation step of calculating a set of a., A target unit section as a new immediately preceding unit section, and using generalized harmonic analysis, a plurality of sections are included from the corrected section signal. The signal finally obtained by subtracting the existing signal is used as the final residual signal, and the processing of the section signal correction step and the spectrum calculation step is performed for all the set unit sections, thereby providing the frequency over the entire section. And a frequency analysis method comprising extracting a set of intensity values.

2. A frequency analysis method for separating a signal component corresponding to a predetermined standard frequency from a time-series signal, wherein the time-series signal has unit sections of a predetermined length, and each unit section has a predetermined section. The short-time Fourier transform is applied to the setting step, the target unit section to be analyzed, the immediately preceding unit section that is the immediately preceding unit section, and the immediately following unit section that is the immediately following unit section so that they overlap each other by the length of A correlation calculation step of calculating a correlation value corresponding to each standard frequency by using, of the correlation values in the immediately preceding unit section, the target unit section, the immediately after unit section, the maximum correlation value is selected,
Along with recalculating that value, the content signal obtained by the correlation value recalculated with the standard frequency is subtracted from the section signal of the target unit section, and the difference signal obtained by subtracting is a new section signal, When the selected correlation value is based on the target unit section, the recalculated correlation value is determined as the intensity value corresponding to the standard frequency, and the unit section of the standard frequency corresponding to the selected correlation value is set to the unit section. A corresponding spectrum is excluded from the subsequent selection targets, and a spectrum calculation step in which these processes are repeated until a predetermined number of sets of the standard frequency and intensity value is determined, and, after the spectrum calculation step, the target unit The section is set as a new immediately preceding unit section, and the immediately following unit section is set as a new target unit section, and the correlation calculation step and the spectrum calculation step are performed for all unit sections. A frequency analysis method, characterized in that a set of a plurality of standard frequencies and intensity values is determined for all unit intervals by repeatedly performing the above.

3. The time-series signal is an acoustic signal, and pitch information of a sound corresponding to the standard frequency selected in the spectrum calculating step, sound intensity information corresponding to an intensity value, and each unit. An encoding step of generating coded data by performing a predetermined conversion on four pieces of information consisting of a sounding start time of a sound corresponding to a start point of a section and a sounding end time of a sound corresponding to a start point of a subsequent unit section. The audio signal encoding method according to claim 1 or 2, further comprising: