JPH02137896A

JPH02137896A - Generating method for sound source data

Info

Publication number: JPH02137896A
Application number: JP63292931A
Authority: JP
Inventors: Makoto Furuhashi; 古橋　真
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1988-11-19
Filing date: 1988-11-19
Publication date: 1990-05-28

Abstract

PURPOSE:To enable excellent repetitive reproduction by interpolating waveform data consisting of existent samples and generating interpolated samples, and using the interpolated sample which is closest to the existent sample as the connection sample of repetitive waveform. CONSTITUTION:The specific-cycle waveform data consisting of the existent samples (marked with 'o' in figure) with a sampling period T3 is interpolated to generate the interpolated samples ('x' in figure) and the point where the existent sample and interpolated sample are closest in value (crest value) is regarded as the connection sample. Thus, the connection point (looping start point and looping end point) of the repetitive waveform are obtained including the interpolated samples obtained by interpolating the existent samples of the waveform data, so the continuity at the connection point of the waveform is improved, so no looping noises are generated.

Description

【発明の詳細な説明】〔産業上の利用分野］本発明は、例えば電子楽器等の音源データの生成方法に
関すものであり、特に、楽音信号等をディジタル処理し
て音源データを生成する方法に関するものである。[Detailed Description of the Invention] [Industrial Application Field] The present invention relates to a method of generating sound source data such as, for example, an electronic musical instrument, and in particular, a method of generating sound source data by digitally processing a musical tone signal or the like. It is related to.

[Summary of the invention]

本発明は、実在サンプルよりなる波形データを補間して
複数の補間サンプルを生成し、実在サンプルに最も近い
補間サンプルの点で繰り返し波形の接続サンプルとする
ことにより、繰り返し点における非連続性がなくなり良
好な繰り返し再生を可能とした音源データの生成方法を
提供するものである。The present invention generates a plurality of interpolated samples by interpolating waveform data consisting of real samples, and connects the repeating waveform at the point of the interpolated sample closest to the real sample, thereby eliminating discontinuity at the repeating points. The present invention provides a method of generating sound source data that enables good repeated playback.

[Conventional technology]

−ｉに、電子楽器やＴＶゲーム器等に用いられる音源は
、ＶＣＯ，ＶＣＡ、ＶＣＦ等から成ルアナログ音源と、
ＰＳＧ（プログラマブル・サウンド・ジェネレータ）や
波形ＲＯＭ読み出しタイプ等のディジタル音源とに大別
される。このディジタル音源の一種として、近年におい
ては、生の楽器音等をサンプリングしてディジタル処理
した音源データをメモリ等に記憶させて用いるようなサ
ンプラー音源も広く知られるようになってきている（例
えば特開昭６２−２６４０９９号公報、特開昭６２−２
６７７９８号公報参照）。-i, the sound sources used in electronic musical instruments, TV game machines, etc. are analog sound sources such as VCO, VCA, VCF, etc.
It is broadly classified into digital sound sources such as PSG (programmable sound generator) and waveform ROM readout type. In recent years, as a type of digital sound source, sampler sound sources have become widely known, in which sound source data that is sampled from live musical instruments and digitally processed is stored in a memory or the like (for example, Publication No. 62-264099, JP-A No. 62-2
(See Publication No. 67798).

このサンプラー音源においては、−船釣に音源データ記
憶用のメモリに大容量を要することから、メモリ節約の
ための手法が各種提案されており、例えば、楽音波形の
周期性を利用したルーピング処理や、非線形量子化等に
よるビット圧縮処理がその代表的なものとして挙げられ
る。なお上記ルーピング処理は、サンプリングされた楽
音の元の持続時間よりも長い時間音を出し続けるための
一手法でもある。すなわち、例えば楽音信号波形を考え
るとき、一般に発音開始直後においてはピアノの打鍵ノ
イズや管楽器のプレスノイズ等のいわゆる非音程成分を
含む波形の周期性が不明瞭なフォルマント部分が生じて
おり、その後、楽音の音程（ピッチ、音高）に対応する
基本周期で同じ波形が繰り返し現れるようになる。この
繰り返し波形のｎ周期分（ｎは整数）をルーピング区間
とし、必要に応して繰り返し再生することにより、少な
いメモリ容量で長時間の持続音を得ることができるわけ
である。This sampler sound source requires a large capacity of memory for storing sound source data when fishing on a boat, so various methods have been proposed to save memory, such as looping processing that uses the periodicity of musical sound waves A typical example of this is bit compression processing using nonlinear quantization. Note that the looping process described above is also a method for continuing to output a sound for a longer time than the original duration of the sampled musical tone. In other words, when considering a musical tone signal waveform, for example, immediately after the start of sound generation, a formant portion containing so-called non-pitch components such as piano key-press noise or wind instrument press noise occurs, and the periodicity of the waveform is unclear, and then, The same waveform appears repeatedly with a fundamental period corresponding to the musical interval (pitch, pitch) of the musical tone. By setting n cycles (n is an integer) of this repetitive waveform as a looping section and repeatedly reproducing the sound as needed, a long-lasting sound can be obtained with a small memory capacity.

上述のように楽音をサンプリングしてその音程成分をル
ーピング処理するということは、ルーピング区間のルー
ピング開始点とルーピング終端点を接続して繰り返しル
ープすることであり、これらのルーピング開始点とルー
ピング終端点の値は略等しいことが要求される。なぜな
らば、その２点の値が略等しい値でない場合、つまりそ
の接続点が非連続であるとノイズ発生の原因となるため
である。As mentioned above, sampling a musical tone and looping its pitch components means connecting the looping start point and looping end point of the looping section and looping repeatedly. It is required that the values of are approximately equal. This is because if the values at the two points are not approximately equal, that is, if the connection points are discontinuous, noise will occur.

しかし、上記ルーピング開始点とルーピング終端点の値
を略等しい値に選ぶことはサンプリング周期等の関係か
ら難しく、従来用いられているルーピング処理方法では
有効な解決方法が見出されていない。However, it is difficult to select substantially equal values for the looping start point and the looping end point due to the sampling period, etc., and no effective solution has been found in conventional looping processing methods.

本発明は、上述のような実情に鑑みて提案されたもので
あり、繰り返し点の非連続性を無くし、ルーピングノイ
ズが発生しない音源データの生成方法を提供することを
目的とするものである。The present invention has been proposed in view of the above-mentioned circumstances, and aims to provide a method for generating sound source data that eliminates discontinuity in repeating points and does not generate looping noise.

（発明が解決しようとする課題）〔課題を解決するための手段〕本発明に係る音源データの生成方法は、上述の目的を達
成するために提案されたものであって、第１図に示すよ
うに、サンプリング周期Ｔ、の複数の実在サンプル（図
中○印）よりなる所定周期の波形データを補間して複数
の補間サンプルも生成（図中の×印等）し、実在サンプ
ルと補間サンプルのうち最も値（波高値）の近い点を繰
り返し波形の接続サンプルとしたことを特徴とするもの
であり、例えば繰り返し波形に対応するルーピング区間
ＬＰにおけるルーピング開始点ＬＰ、が実在サンプルで
、この開始点ＬＰ、の波高値に最も近いルーピング終端
点ＬＰＥが補間サンプルとなっている。これは、従来の
実在サンプルのみから構成されるルーピング区間ＬＰ’
のルーピング終端点Ｌ　Ｐ　Ｅ　　“の実在サンプルの
波高値よりも上記補間サンプルの波高値の方が開始点Ｌ
Ｐ３の波高値に近いことを考慮したものである。(Problem to be Solved by the Invention) [Means for Solving the Problem] A sound source data generation method according to the present invention has been proposed to achieve the above-mentioned object, and is shown in FIG. As shown in FIG. 2, the waveform data of a predetermined period consisting of multiple real samples (marked with ○ in the figure) with sampling period T is interpolated to generate multiple interpolated samples (marked with x in the figure, etc.), and the real samples and interpolated samples are It is characterized in that the point with the closest value (peak value) among them is used as a connected sample of the repeating waveform. For example, the looping start point LP in the looping section LP corresponding to the repeating waveform is an actual sample, and this start The looping end point LPE closest to the peak value of point LP is the interpolation sample. This is the conventional looping section LP' consisting of only real samples.
The peak value of the above interpolated sample is higher than the peak value of the actual sample at the looping end point L P E " of the starting point L
This is done in consideration of the fact that it is close to the peak value of P3.

このような開始点ＬＰ、と終端点ＬＰ、とで接続するた
めには、ルーピング処理時の終端点から開始点に戻ると
きにサンプリング周期が乱れないように、補間処理によ
って第２図に示すように各補間サンプル（図中の×印）
を求め、破線に示すような信号波形を形成する。In order to connect the starting point LP and the ending point LP, interpolation processing is used to prevent the sampling period from being disrupted when returning from the ending point to the starting point during looping processing, as shown in Figure 2. each interpolation sample (x mark in the figure)
, and form a signal waveform as shown by the broken line.

[For production]

本発明によれば、波形データの複数の実在サンプルを補
間して得られる補間サンプルも含めて繰り返し波形の接
続点（ルーピング開始点とルーピング終端点）としてい
るため、第２図の破線に示すように波形の接続点での波
形の連続性を高めることができる。According to the present invention, interpolated samples obtained by interpolating a plurality of real samples of waveform data are also used as connecting points (looping start point and looping end point) of repeated waveforms, so as shown in the broken line in FIG. It is possible to improve the continuity of the waveform at the connection point of the waveform.

〔Example〕

以下、本発明を適用した一実施例について図面を参照し
ながら説明する。なお、本発明は以下の実施例に限定さ
れるものでないことは言うまでもない。An embodiment to which the present invention is applied will be described below with reference to the drawings. It goes without saying that the present invention is not limited to the following examples.

先ず、上記第１図のルーピング区間ＬＰの波形図につい
て更に詳しく説明する。First, the waveform diagram of the looping section LP shown in FIG. 1 will be explained in more detail.

第１図において、一定のサンプリング周ＭＴ　ｓで所定
周期の信号波形をサンプリングすることにより実際に得
られた各サンプル（実在サンプル、図中○印）のみを用
いて得られる繰り返し波形に対応するルーピング区間Ｌ
Ｐ’は、ルーピング開始点ＬＰｆｆとルーピング終端点
ＬＰ、’にそれぞれ実在サンプルを有するものであるが
、上記開始点ＬＰ３と終端点［、Ｐｉ　　’を接続する
場合、開始点ＬＰｓと終端点しＰｉ　　’の波高値が近
い値になることは少なく、第２図の実線で示す接続波形
のように開始点ＬＰｓと終端点ＬＰ！　’が非連続にな
ることが非常に多い。そこで、上記実在サンプルより成
る波形データを補間して補間サンプルを生成し、その補
間サンプルをも含めて互いに最も近い波高値となるルー
ピング開始点ＬＰｉと終端点ＬＰ、とを求め、波形の接
続点として用いるものである。In Fig. 1, looping corresponds to a repetitive waveform obtained using only each sample (actual sample, marked with ○ in the figure) actually obtained by sampling a signal waveform of a predetermined period at a constant sampling frequency MT s. Section L
P' has actual samples at the looping start point LPff and the looping end point LP,', but when connecting the start point LP3 and the end point [, Pi', the start point LPs and the end point Pi' are connected. The peak values of ' are rarely close to each other, and as shown in the connection waveform shown by the solid line in Figure 2, the starting point LPs and the ending point LP! ' are very often discontinuous. Therefore, interpolation samples are generated by interpolating the waveform data made up of the actual samples, and the looping start point LPi and end point LP, which have the closest peak values to each other including the interpolated samples, are determined, and the connection point of the waveforms is determined. It is used as a.

ここで通常のサンプラー音源装置等において、メモリ等
に記憶された一定のピッチの音源データから打鍵された
キーのピッチの楽音波形を得るためのピッチ変換方式と
して、オーバーサンプリング等による補間方式を用いて
いる場合には、上記補間サンプルを得るためにハードウ
ェアを増加させる必要がなく、容易に本発明を適用する
ことができる。第３図はこの補間によるピッチ変換方式
を説明するための図であり、例えば４倍オーバーサンプ
リングされて得られた各補間サンプルの取り出し間隔を
変えることによりピッチ変換を行っている。この第３図
の例では、原波形Ａの各実在サンプル（Ｏ印）を４倍オ
ーバーサンプリングすることにより得られた補間サンプ
ル（×印）も含めた全てのサンプルから、例えば５個周
期でサンプルを取り出して元のサンプリング周期Ｔ、で
配列することによりピッチ変換波形Ｂを得ている。Here, in a normal sampler sound source device, etc., an interpolation method such as oversampling is used as a pitch conversion method to obtain a musical sound waveform of the pitch of a pressed key from sound source data of a constant pitch stored in a memory etc. In this case, there is no need to increase hardware to obtain the interpolated samples, and the present invention can be easily applied. FIG. 3 is a diagram for explaining a pitch conversion method using interpolation. For example, pitch conversion is performed by changing the sampling interval of each interpolated sample obtained by four times oversampling. In the example shown in Fig. 3, samples are sampled at a period of, for example, 5 samples from all samples, including interpolated samples (x marks) obtained by oversampling each real sample (O mark) of the original waveform A by 4 times. The pitch-converted waveform B is obtained by extracting and arranging it at the original sampling period T.

この場合には、周波数を５／４倍に変換したものである
。なお、第３図に示すピッチ変換はピッチを上げた例で
あるが、サンプル取り出し周期を３個以下とすることに
よりピッチを下げることもできる。また、オーバーサン
プリングの倍数を高めることにより精度を高めることが
でき、実用的には例えば２５６倍程度のオーバーサンプ
リングが考えられる。In this case, the frequency is converted to 5/4 times. Although the pitch conversion shown in FIG. 3 is an example in which the pitch is raised, the pitch can also be lowered by setting the sampling period to three or less. Furthermore, accuracy can be improved by increasing the oversampling multiple, and in practical terms, oversampling of about 256 times is considered, for example.

次に、第４図は上記第３図の補間処理によるピッチ変換
を行うための回路構成を示している。Next, FIG. 4 shows a circuit configuration for performing pitch conversion by the interpolation process shown in FIG. 3 above.

この第４図において、音源データが記憶されているＲＯ
Ｍ等のメモリ１は、アドレス発生回路２のアドレス・デ
ータに基づいて音源データを出力する。アドレス発生回
路２には、上記ピッチ変換処理するときのピッチ変換比
を決定するピッチ情報がピッチ情報発生回路３から送ら
れており、さらに付属情報レジスタ７ａ、ループスター
トアドレス・レジスタ７ｂ、ループエンドアドレス・レ
ジスタ７Ｃからの各データも供給されている。これらの
データに基づいて、メモリｌをアクセスするための上記
アドレス・データを発生するものである。また、上記付
属情報レジスタ７ａ、ループスタートアドレス・レジス
タ７ｂ、ループエンドアドレス・レジスタ７Ｃには、上
記音源データからのそれぞれの情報が供給されている。In this FIG. 4, the RO where the sound source data is stored is
The memory 1 such as M outputs sound source data based on the address data of the address generation circuit 2. Pitch information that determines the pitch conversion ratio when performing the pitch conversion process is sent to the address generation circuit 2 from the pitch information generation circuit 3, and further includes an attached information register 7a, a loop start address register 7b, and a loop end address. - Each data from register 7C is also supplied. Based on these data, the address data for accessing the memory I is generated. Further, the attached information register 7a, the loop start address register 7b, and the loop end address register 7C are supplied with respective information from the sound source data.

上記付属情報レジスタ７ａは、例えば後述の第２３図に
示すブロック単位のヘッダ情報ＦＲを取り込むものであ
り、また、ループスタートアドレス・レジスタ７ｂ及び
ループエンドアドレス・レジスタ７Ｃは、上記ルーピン
グ開始点ＬＰ、とルーピング終端点ＬＰｔの各アドレス
を取り込むものである。The attached information register 7a is for taking in block-by-block header information FR shown, for example, in FIG. and each address of the looping end point LPt.

これら付属情報レジスタ及びループスタートアドレス・
レジスタ７ｂとループエンドアドレス・レジスタ７Ｃの
出力データは上記アドレス発生回路２へ送られると共に
、係数ＲＯＭアドレス変換回路６へも供給される。当該
係数ＲＯＭアドレス変換回路６には、上記ピッチ情報発
生回路３からのピッチ情報も入力され、これらの出力デ
ータに基づいて係数ＲＯＭ５は予め記憶されている係数
を補間フィルタ４に送り、該補間フィルタ４のフィルタ
特性を決定する。ここで、上記補間フィルタ４は、例え
ばｎ個の遅延器ＤＬＬ＝ＤＬｎと乗算器Ｍ１〜Ｍｎと加
算器Ｐ１〜Ｐｎで構成され、上記ＲＯＭＩから音源デー
タが入力されている。上述のように補間フィルタ４に入
力された音源データは該補間フィルタ４でピッチ変換が
行われ、Ｄ／Ａコンバータ８でＤ／Ａ変換されて出力端
子９から音源信号として出力されるものである。These attached information registers and loop start address
The output data of the register 7b and the loop end address register 7C are sent to the address generation circuit 2 and also to the coefficient ROM address conversion circuit 6. The pitch information from the pitch information generation circuit 3 is also input to the coefficient ROM address conversion circuit 6, and based on these output data, the coefficient ROM 5 sends pre-stored coefficients to the interpolation filter 4. Determine the filter characteristics of 4. Here, the interpolation filter 4 includes, for example, n delay devices DLL=DLn, multipliers M1 to Mn, and adders P1 to Pn, and receives sound source data from the ROMI. As mentioned above, the sound source data input to the interpolation filter 4 undergoes pitch conversion by the interpolation filter 4, is D/A converted by the D/A converter 8, and is output as a sound source signal from the output terminal 9. .

第５図は上記各レジスタ７ｂ及び７ｃにそれぞれ取り込
まれるループスタートアドレス及びループエンドアドレ
スの一例を示しており、複数サンプル毎にブロックを構
成する場合を想定している。FIG. 5 shows an example of the loop start address and loop end address taken into each of the registers 7b and 7c, assuming a case where a block is formed for each of a plurality of samples.

ここで、ループスタート点（上記ルーピング開始点Ｌ　
Ｐ　ｓ　）のサンプルを必ず上記ブロックの先頭位置に
配置する場合には、ループスタートアドレスはブロック
番号のみで十分であるが、ブロック内のどのサンプルか
らでもルーピング開始されるときには、ブロック内サン
プル番号（図中破線参照）が必要となる。またループエ
ンドアドレスには、ブロック内サンプル番号のみならず
、サンプル間の補間ポイントを指示する情報が含まれる
。Here, the loop start point (the above looping start point L
If the sample of P s ) is always placed at the beginning of the block, the block number alone is sufficient as the loop start address, but when looping starts from any sample within the block, the sample number within the block ( (see broken line in the figure) is required. Furthermore, the loop end address includes not only the sample number within the block but also information indicating the interpolation point between samples.

これによって上記ピッチ変換と同様な方法により補間サ
ンプルも含めた高精度のルーピング処理が行える。例え
ば２５６倍オーバーサンプリングする場合には、上記ル
ーピング区間ＬＰをサンプリング周期Ｔ、の１／２５６
の精度で設定することができ、波形接続の精度も１／２
５６程度にまで高めることができる。しかも、オーバー
サンプリング方式のピッチ変換を採用している音源装置
等にあっては、殆どハードウェアを追加することなく本
発明を適用でき、円滑な波形の接続がされることにより
、ルーピングノイズを大幅に低減することができる。As a result, high-precision looping processing including interpolation samples can be performed using a method similar to the pitch conversion described above. For example, in the case of 256 times oversampling, the looping section LP is set to 1/256 of the sampling period T.
It can be set with an accuracy of 1/2, and the accuracy of waveform connection is also 1/2
It can be increased to about 56. Furthermore, the present invention can be applied to sound source devices that employ pitch conversion using an oversampling method, with almost no additional hardware, and looping noise can be significantly reduced by smooth waveform connections. can be reduced to

ここで、通常の楽音信号波形をサンプリングして伝送、
あるいは記録再生する際におけるルーピング処理につい
て、第６図を参照しながら説明する。Here, the normal musical tone signal waveform is sampled and transmitted,
Alternatively, looping processing during recording and reproduction will be explained with reference to FIG.

一般に発音開始直後においてはピアノの打鍵ノイズや管
楽器のプレスノイズ等の非音程成分が含まれることによ
り、波形の周期性が不明瞭な部分であるフォルマント部
分ＦＲが生じており、その後、楽音の音程（ピッチ、音
高）に対応する基本周期で同じ波形が繰り返し現れるよ
うになる。この繰り返し波形のｎ周期分（ｎは整数）を
ルーピング区間ＬＰとし、このルーピング区間ＬＰはル
ーピング開始点ＬＰ、とルーピング終端点ＬＰｔのルー
ピングポイント間で表されるものである。Generally, immediately after the start of sound production, a formant part FR, which is a part where the periodicity of the waveform is unclear, occurs due to the inclusion of non-pitch components such as piano key tapping noise and press noise of wind instruments, and after that, the pitch of the musical note The same waveform will appear repeatedly with a fundamental period corresponding to (pitch, pitch). N cycles (n is an integer) of this repetitive waveform are defined as a looping section LP, and this looping section LP is expressed between the looping points of the looping start point LP and the looping end point LPt.

そして上記フォルマント部分ＦＲとルーピング区間ＬＰ
とを記憶媒体に記録し、再生時にはフォルマント部分Ｆ
Ｒの再生に続いてルーピング区間ｔ。And the above formant part FR and looping section LP
is recorded on a storage medium, and during playback, the formant part F
Following the reproduction of R, a looping section t.

Ｐを繰り返し再生することにより、任意の長時間に亘っ
て楽音を発生させることができる。By repeatedly reproducing P, musical tones can be generated over an arbitrary long period of time.

第７図は、本発明に係る音源データの生成方法の他の具
体的な実施例を説明するために、人力楽音信号をサンプ
リングして記憶媒体に記録するまでの各機能の具体例を
示す機能ブロック図である。FIG. 7 shows a specific example of each function from sampling a human-powered musical tone signal to recording it on a storage medium, in order to explain another specific embodiment of the sound source data generation method according to the present invention. It is a block diagram.

この場合の入力端子１０に供給される入力楽音信号とし
ては、例えばマイクロフォンで直接収音した信号、ある
いはディジタル・オーディオ信号記録媒体等を再生して
得られた信号を、アナログ信号あるいはディジタル信号
の形態で用いることができる。In this case, the input musical tone signal supplied to the input terminal 10 may be a signal directly picked up by a microphone, or a signal obtained by reproducing a digital audio signal recording medium, etc., in the form of an analog signal or a digital signal. It can be used in

先ず、第７図のサンプリング処理機能ブロック１１にお
いては、上記入力楽音信号を例えば周波数３８ｋＨｚで
サンプリングし、■サンプル１６ビツトのディジタルデ
ータとして取り出している。First, in the sampling processing function block 11 of FIG. 7, the input musical tone signal is sampled at a frequency of 38 kHz, for example, and extracted as 16-bit sample digital data.

このサンプリング処理とは、上記入力楽音信号がアナロ
グ信号の場合のＡ／Ｄ変換処理に対応するものであり、
また入力信号がディジタル信号の場合にはサンプリング
レート変換及びビット数変換の処理に対応するものであ
る。This sampling process corresponds to A/D conversion process when the input musical tone signal is an analog signal,
Furthermore, when the input signal is a digital signal, it corresponds to processing of sampling rate conversion and bit number conversion.

次に、ピッチ検出機能ブロック１２において、上述のサ
ンプリング処理により得られたディジタル楽音信号につ
いての楽音の音程（ピンチ）を決定する基音の周波数（
基本周波数）ｆ、、すなわちピッチ情報が検出される。Next, in the pitch detection function block 12, the frequency (
fundamental frequency) f, that is, pitch information is detected.

このピ・ンチ検出機能ブロック１２における検出原理を
説明する。ここで、サンプリング音源となる楽音信号は
、その基音となる周波数がサンプリング周波数ｆｓに比
べてかなり低い場合が多く、周波数軸で楽音のピークを
検出するだけでは高い精度での音程の同定が難しい。し
たがって、何らかの手段を用いて、楽音の倍音成分のス
ペクトルを利用する必要がある。The detection principle in this pinch detection function block 12 will be explained. Here, the frequency of the fundamental tone of the musical tone signal serving as the sampling sound source is often considerably lower than the sampling frequency fs, and it is difficult to identify the pitch with high accuracy just by detecting the peak of the musical tone on the frequency axis. Therefore, it is necessary to use some means to utilize the spectrum of overtone components of musical tones.

先ず、音程を検出したい楽音信号の波形をｆ　（ｔ）と
すれば、この楽音波形ｆｃｔ＞を各倍音成分の振幅ａ（
ω）および位相φ（ω）で表せば、該楽音波形ｆ（ｔ）
はフーリエ展開した式、ｆ（ｔ）＝　　Σａ　（６））ＣＯ３（ωｔ　＋φ（ω
）　　）　　・・■で表せる。ここで、各倍音の位相の
ずれφ（ω）を全てゼロにすると、ｆ（ｔ）＝　　Σａ（ω）ｃｏｓωｔ　・　・　・　・
　・　・　・■の式で表せるものとなる。このように位
相の揃えの持つ全ての倍音の周期の整数倍の点およびｔ
・０の点である。これは基音の周期にほかならない。First, let f (t) be the waveform of a musical tone signal whose pitch is to be detected, then convert this musical waveform fct> to the amplitude a(
ω) and phase φ(ω), the musical sound waveform f(t)
is a Fourier-expanded formula, f(t)=Σa (6))CO3(ωt +φ(ω
) ) ... can be expressed as ■. Here, if the phase shift φ(ω) of each harmonic is set to zero, f(t) = Σa(ω)cosωt ・・・・
・・・It can be expressed by the formula of ■. In this way, the points that are integral multiples of the period of all harmonics that the phase alignment has and t
・It is a point of 0. This is nothing but the period of the fundamental tone.

この原理をふまえて、ピッチ検出の手順を第８図に示す
機能ブロック図を用いて説明する。Based on this principle, the pitch detection procedure will be explained using the functional block diagram shown in FIG.

第８図において、実部データ入力端子３１より楽音デー
タを、また虚部データ入力端子３２より“０゛°を、高
速フーリエ変換（ＦＦＴ）機能ブロック３３に供給する
。In FIG. 8, musical tone data is supplied from a real part data input terminal 31 and "0°" is supplied from an imaginary part data input terminal 32 to a fast Fourier transform (FFT) functional block 33.

ここで、上記高速フーリエ変換機能ブロック３３で行わ
れる高速フーリエ変換において、ピッチを推定する楽音
信号をｘ（ｔ）とし、また、上記楽音信号ｘ（ｔ）に含
まれる倍音成分をａａｃｏｓ（２πｆ１％Ｌ＋θ）・・・・・―■とすれ
ば、ｘ（ｔ）はＯｏこれを複素表示で書き直して、０゜Ｘ　（ｔ）　＝　（１／２）Σａ、ｅｘｐ（ｊθ、、）
ｅｘｐ（ｊωｎｔ）　・・■ｎ＝−ω ただし、ｃｏｓθ＝（ｅｘｐ（ｊθ）十ｅｘｐ（−ｊθ））／２
−　　・■を用いた。この式をフーリエ変換すると、ＯＸ　（（１））＝　Ｓ　　ｘ（ｔ）ｅｘｐ（−３（１）
　Ｌ）　ｄｔ−Ｏｏ・Σａイｅｘｐ（ｊθ、、）δ（ω−ω＋１）　　・　
・■ｎ；−のここで、δ（ω−ω、１）はデルタ関数である。Here, in the fast Fourier transform performed by the fast Fourier transform function block 33, the musical tone signal whose pitch is estimated is x(t), and the overtone component included in the musical tone signal x(t) is aacos(2πf1% L+θ)・・・・・・■, then x(t) is Oo Rewriting this in complex representation, 0゜X (t) = (1/2)Σa, exp(jθ,,)
exp(jωnt) ・・■n=-ω However, cosθ=(exp(jθ) + exp(-jθ))/2
- ・■ was used. When this formula is Fourier transformed, O x ((1)) = S x (t)exp(-3(1)
L) dt-Oo ・Σa exp(jθ,,)δ(ω-ω+1) ・
・■n;- where δ(ω-ω, 1) is a delta function.

次の機能ブロック３４で該高速フーリエ変換後のデータ
のノルム（絶対値、すなわち実部と虚部をそれぞれ２乗
したものの和の平方根）を算出する。In the next functional block 34, the norm (absolute value, that is, the square root of the sum of the squares of the real part and the imaginary part) of the data after the fast Fourier transform is calculated.

すなわち、Ｘ（ω）の絶対値Ｙ（ω）を取ると、位相成
分がキャンセルされて、Ｙ（ω）＝［Ｘ（ω）Ｘ（ω）］ｌ／！＝（１／２）ａ
、１δ（ω−ω７）　・　・　・　・■これは、上記楽
音データの高周波成分の全ての位相を合わせるために成
されるものであり、上記虚部をゼロにすることにより、
位相成分を揃えることができる。That is, when we take the absolute value Y(ω) of X(ω), the phase component is canceled, and Y(ω)=[X(ω)X(ω)]l/! =(1/2)a
, 1δ(ω-ω7) ・・・・■ This is done to match the phases of all the high frequency components of the musical tone data, and by setting the imaginary part to zero,
Phase components can be aligned.

次に、この算出されたノルムを高速フーリエ変換（この
場合は逆ＦＦＴに相当）機能ブロック３６に実部データ
として供給し、虚部データ入力端子３５には“°０”を
供給して逆ＦＦＴをかけ楽音データを復元する。すなわ
ち、上記逆フーリエ変換は、０コｙ（ｔ）＝（１／２πＮ　　Ｙ（ω）ｅｘｐ（−ｊωｔ
）　ｄｔ−Ｏｏ＝５　　ａ１％ｃｏ！（ｄｎｊ　　＋　ＨＨＨＨ−［相
］である。この逆フーリエ変換後の復元された楽音デー
タは、全ての高周波成分の位相が揃ったコサイン波の合
成で表せる波形として取り出されるものである。Next, this calculated norm is supplied as real part data to the fast Fourier transform (corresponding to inverse FFT in this case) function block 36, and "°0" is supplied to the imaginary part data input terminal 35 to perform inverse FFT. and restore the musical sound data. In other words, the above inverse Fourier transform is 0koy(t)=(1/2πN Y(ω)exp(-jωt
) dt-Oo = 5 a1%co! (dnj + HHHH - [phase]. The restored musical tone data after this inverse Fourier transform is extracted as a waveform that can be expressed by a combination of cosine waves in which all high frequency components are in phase.

その後、ピーク検出機能ブロック３７で上記復元された
音源データのピークを検出する。ここで、上記ピークは
上記楽音データの全ての高周波成分の極値（ピーク）が
一致した点であり、次の機能ブロック３８において上記
検出されたピーク値を値の大きい方から分１１１４（ソ
ート）する。上記検出されたピークの周期を計測するこ
とにより、楽音信号の音程を知ることができる。Thereafter, a peak detection function block 37 detects the peak of the restored sound source data. Here, the peak is the point where the extreme values (peaks) of all the high frequency components of the musical tone data coincide, and in the next function block 38, the detected peak values are sorted by 1114 (sorting) from the largest value. do. By measuring the period of the detected peak, the pitch of the musical tone signal can be determined.

第９図は、第８図のピーク検出機能ブロック３７におけ
る楽音データの極大値（ピーク）を検出するための構成
について説明するためのものである。FIG. 9 is for explaining the configuration for detecting the local maximum value (peak) of musical tone data in the peak detection function block 37 of FIG. 8.

この場合上記楽音データは、値の異なったピーク（極値
）が多数存在するものであり、上記楽音データの最大値
を求めてその周期を検出することで楽音の音程を知るこ
とができる。In this case, the musical tone data has many peaks (extreme values) of different values, and by finding the maximum value of the musical tone data and detecting its period, the pitch of the musical tone can be determined.

すなわち第９図において、逆フーリエ変換後の楽音デー
タ列は、入力端子４１を介しＮ＋１段のシフトレジスタ
４２に供給され、このシフトレジスタ４２の各段のレジ
スタａ−Ｎ／２・・・ａｏ・・・ａ　Ｍ／！を順次介し
て出力端子４３に送られている。このＮ＋１段のシフト
レジスタ４２は上記楽音データ列に対して幅がＮ＋１サ
ンプル分のウィンドウとして作用し、該楽音データ列の
Ｎ＋１サンプルが上記ウィンドウを介して最大値検出回
路４４に送られる。すなわち、上記楽音データは最初に
レジスタａ−Ｎ／□に入力した後レジスタａＮ／□まで
順次伝送され、各々のレジスタａ□７．・・・ａｏ・・
・ａＮ／□からのＮ＋１サンプルの上記各楽音データが
最大値検出回路４４に送られる。That is, in FIG. 9, the musical tone data string after the inverse Fourier transform is supplied to the N+1 stage shift register 42 via the input terminal 41, and each stage of the shift register 42 has registers a-N/2...ao.・・a M/! are sent to the output terminal 43 via the following sequentially. This N+1 stage shift register 42 acts as a window with a width of N+1 samples for the musical tone data string, and N+1 samples of the musical tone data string are sent to the maximum value detection circuit 44 via the window. That is, the above musical tone data is first input to registers a-N/□, and then sequentially transmitted to registers aN/□, and is then input to each register a□7. ...ao...
- The above-mentioned musical tone data of N+1 samples from aN/□ are sent to the maximum value detection circuit 44.

この最大値検出回路４４は、上記シフトレジスタ４２内
の例えば中央のレジスタａ０の値が上記Ｎ＋１サンプル
のデータの各個の内で最大となったとき、そのレジスタ
ａ。のデータをピーク値として検出して、出力端子４５
より出力するものである。なお、上記ウィンドウの幅Ｎ
＋１は任意に設定可能である。This maximum value detection circuit 44 detects, for example, the value of the central register a0 in the shift register 42 when the value of the register a0 becomes the maximum among the N+1 samples of data. data is detected as a peak value, and output terminal 45
It outputs more. In addition, the width N of the above window
+1 can be set arbitrarily.

第７図に戻って、エンベロープ検出機能ブロック１３に
おいては、上述のサンプリング処理後のディジタル楽音
信号に対して、上記ピッチ情報を用いたエンベロープ検
出処理を施すことにより、楽音信号のいわゆるエンベロ
ープ波形を得ている。Returning to FIG. 7, the envelope detection function block 13 obtains the so-called envelope waveform of the musical tone signal by performing envelope detection processing using the pitch information on the digital musical tone signal after the sampling process described above. ing.

これは、例えば第１０図Ａに示すような楽音信号波形の
ピーク点を順次結んで得られる第１０図Ｂに示すような
波形であり、発音直後からの時間経過に伴うレベル（あ
るいは音量）の変化を表している。このエンベロープ波
形は、一般にＡＤＳＲ（アタックタイム／デイティクイ
ム／サスティンレベル／リリースタイム）のような各パ
ラメータにより表されることが多い。ここで楽音信号の
一具体例として、打鍵操作に応して発音されるピアノ音
等を考えるとき、上記アタックタイムＴＡは鍵盤の鍵が
押され（キー・オン）徐々に音量が上がり目標とする音
量に達するまでの時間を表し、上記デイケイタイムＴ、
は上記アタックタイムＴＳで達した音量から次の音量（
例えば楽器の持続音の音量）に達するまでの時間を表し
、上記サスティンレベルＬ、は鍵の押圧を解除してキー
・オフするまで保たれる持続音の音量を表し、上記リリ
ースタイムＴＲは上記キー・オフしてから音が消えるま
での時間を表している。なお上記各時間ＴＡ　、Ｔｏ　
、Ｔｏは、音量変化の傾きあるいはレートを示すことも
ある。また、これらの４つのパラメータの他にさらに多
くのエンヘローブパラメータを用いるようにしてもよい
。For example, the waveform shown in Figure 10B is obtained by sequentially connecting the peak points of the musical tone signal waveform shown in Figure 10A, and the level (or volume) changes over time from immediately after the sound is produced. It represents change. This envelope waveform is generally expressed by parameters such as ADSR (attack time/date time/sustain level/release time). Here, as a specific example of a musical sound signal, when considering a piano sound etc. that is produced in response to a keystroke operation, the above attack time TA is the target for the volume to gradually increase when a key is pressed (key-on) on the keyboard. It represents the time it takes to reach the volume, and the above Decay Time T,
is the next volume from the volume reached at the above attack time TS (
For example, the sustain level L represents the volume of the sustained sound that is maintained until the key is released and the key is turned off, and the release time TR is This represents the time from when the key is turned off until the sound disappears. Note that each of the above times TA, To
, To may also indicate the slope or rate of volume change. Furthermore, more enherobe parameters may be used in addition to these four parameters.

ここで、エンベロープ検出機能ブロック１３においては
、上述したようなＡＤＳＲ（アタックタイムＴＡ／デイ
ケイタイムＴ。／サスティンレベルＬ、／リリースタイ
ムＴえ）等の各パラメータにより表されるエンベロープ
波形情報と同時に、前述したフォルマント部分をアタッ
ク波形の残った状態で取り出すために、信号波形の全体
的なデイケイレートを示す情報を得るようにしている。Here, in the envelope detection function block 13, envelope waveform information represented by each parameter such as ADSR (attack time TA/decay time T./sustain level L, release time T.) as described above is simultaneously detected. In order to extract the aforementioned formant portion with the attack waveform remaining, information indicating the overall decay rate of the signal waveform is obtained.

このデイケイレート情報は、例えば第１１図に示すよう
に、発音時（キー・オン時）から上記アタックタイムＴ
、の間は基準の値“１″をとり、その後単調減衰する波
形を表すものである。This decay rate information is, for example, as shown in FIG.
The waveform takes a reference value "1" between , and then monotonically decays.

ここで、第７図のエンベロープ検出機能ブロック１３の
構成例について、第１２図の機能ブロンク図を参照しな
がら説明する。Here, an example of the configuration of the envelope detection function block 13 shown in FIG. 7 will be explained with reference to the functional block diagram shown in FIG. 12.

当３亥エンベロープ検出の原理は、いわゆるＡＭ（振幅
変調）信号のエンベロープ検波と同様なものである。す
なわち、上記楽音信号のピッチを上記ＡＭ信号のキャリ
アの周波数として考えることによりエンベロープを検出
するものである。上記エンベロープ情報は楽音を再生す
る際に用いるものであり、当該楽音は上記エンベロープ
情報とピッチ情報に基づいて形成されるものである。The principle of envelope detection is similar to envelope detection of so-called AM (amplitude modulation) signals. That is, the envelope is detected by considering the pitch of the musical tone signal as the frequency of the carrier of the AM signal. The envelope information is used when reproducing a musical tone, and the musical tone is formed based on the envelope information and pitch information.

第１２図の入力端子５１に供給された楽音データは、絶
対値出力機能ブロック５２において、上記楽音の波高値
データの絶対値が求められる。この絶対値データをＦＩ
Ｒ（有限インパルス応答）型ディジタルフィルタの機能
ブロック５５に送る。For the musical tone data supplied to the input terminal 51 in FIG. 12, an absolute value of the peak value data of the musical tone is determined in an absolute value output function block 52. FI this absolute value data
It is sent to a functional block 55 of an R (finite impulse response) type digital filter.

ここで、上記ＦＩＲフィルタ機能機能ブランクはローパ
スフィルタとして作用するものであり、予め、入力端子
５３に供給されたピッチ情報に基づいて機能ブロック５
４において形成しておいたフィルタ係数をＦＩＲフィル
タ機能ブロック５５に供給することにより、そのローパ
スフィルタのカットオフ特性を決定するものである。Here, the FIR filter function function blank acts as a low-pass filter, and the function block 5 is activated based on the pitch information supplied to the input terminal 53 in advance.
By supplying the filter coefficients formed in step 4 to the FIR filter function block 55, the cutoff characteristic of the low-pass filter is determined.

ここで、上記フィルタ特性は、例えば第１３図に示す特
性となっており、上記楽音信号の基音（周波数ｒ０）や
その倍音の周波数に零点を有するものである。例えば、
上記第１０図Ａに示す楽音信号からは、上記ＦＩＲフィ
ルタで基音１倍音の周波数を減衰させることにより第１
０図Ｂに示すようなエンベロープ情報が検出される。な
お上記フィルタ係数の特性は、次式で示されるものであ
る。Here, the filter characteristic is, for example, the characteristic shown in FIG. 13, and has zero points at the fundamental tone (frequency r0) of the musical tone signal and the frequencies of its overtones. for example,
From the musical tone signal shown in FIG.
Envelope information as shown in Figure 0B is detected. Note that the characteristics of the filter coefficients described above are shown by the following equation.

Ｈ（ｆ）　　＝　　ｋ　・（ｓｉｎ（πｆ／ｆｏ））／
ｆ　・・・・■この■式中のｆｏは楽音信号の基本周波
数（ピッチ）を示す。H(f) = k ・(sin(πf/fo))/
f...■fo in this formula represents the fundamental frequency (pitch) of the musical tone signal.

次に、上述のサンプリング処理された楽音信号の波高値
データ（サンプリングデータ）から、前述の第６図に示
すフォルマント部分ＦＨの信号の波高値データと、ルー
ピング区間ＬＰの信号の波高値データ（ループデータ）
とを生成する処理について説明する。Next, from the peak value data (sampling data) of the sampled musical tone signal mentioned above, the peak value data of the formant part FH signal shown in FIG. 6 and the peak value data of the signal in the looping section LP (loop data)
The process of generating .

上記ループデータ生成のための最初の機能ブロク１４に
おいて、上記サンプリングされた楽音信号の波高値デー
タを、先に検出したエンベロープ波形〔第１Ｏ図Ｂ〕の
データで割算（又は逆数を乗算）してエンベロープ補正
を行うことにより、第１４図に示すような振幅一定の波
形の信号の波高値データを得ている。このエンベロープ
補正された信号（の波高値データ）をフィルタ処理する
ことにより、音程成分以外が減衰された、あるいは相対
的に音程成分が強調された信号（の波高値データ）を得
ている。ここで、音程成分とは、基本周波数ｆ０の整数
倍の周波数成分のことである。In the first function block 14 for generating the loop data, the peak value data of the sampled musical tone signal is divided (or multiplied by the reciprocal) by the data of the previously detected envelope waveform [Fig. 10B]. By performing envelope correction, peak value data of a signal with a constant amplitude waveform as shown in FIG. 14 is obtained. By filtering (the peak value data of) this envelope-corrected signal, (the peak value data of) a signal in which components other than pitch components are attenuated or pitch components are relatively emphasized is obtained. Here, the pitch component is a frequency component that is an integral multiple of the fundamental frequency f0.

具体的には、上記エンベロープ補正された信号に含まれ
るビブラート等の低周波成分を除去するためにＨＰＦ　
（バイパスフィルタ）を介し、次に、第１５図の一点鎖
線に示すような周波数特性、すなわち基本周波数ｒ０の
整数倍の周波数帯域が通過帯域の周波数特性、を有する
櫛形フィルタを介すことにより、上記ＨＰＦ出力信号に
含まれる音程成分のみを通過させてこれら以外の非音程
成分やノイズ成分を減衰させ、さらに必要に応じてＬＰ
Ｆ（ローパスフィルタ）を介すことにより、上記櫛形フ
ィルタ通過後の信号に重畳しているノイズ成分を除去す
る。Specifically, an HPF is used to remove low frequency components such as vibrato included in the envelope-corrected signal.
(bypass filter), and then through a comb filter having frequency characteristics as shown in the dashed line in FIG. Only the pitch components included in the above HPF output signal are passed through, other non-pitched components and noise components are attenuated, and if necessary, the LP
By passing through F (low pass filter), noise components superimposed on the signal after passing through the comb filter are removed.

すなわち、前記入力信号として楽器の音等の楽音信号を
考えるとき、この楽音信号は通常一定の音程（ピッチ、
音高）を有していることから、その周波数スペクトラム
には、第１５図の実線に示すように、上記楽音自体の音
程に対応する基本周波数ｆ０の近傍とその整数倍の周波
数の近傍にエネルギが集中するような分布が得られる。That is, when considering a musical sound signal such as the sound of an instrument as the input signal, this musical sound signal usually has a constant pitch (pitch,
As shown by the solid line in Fig. 15, the frequency spectrum contains energy in the vicinity of the fundamental frequency f0 corresponding to the pitch of the musical note itself and in the vicinity of frequencies that are integral multiples thereof. A distribution is obtained in which .

これに対して一般のノイズ成分は−様な周波数分布を持
っていることが知られている。従って、上記入力楽音信
号を第１５図の一点鎖線に示すような周波数特性の櫛形
フィルタを通すことにより、楽音信号の基本周波数ｆ０
の整数倍の周波数成分（いわゆる音程成分）のみがその
まま通過あるいは強調されて他の成分（非音程成分及び
ノイズの一部）が減衰され、結果としてＳＮ比を改善す
ることができる。ここで、上記第１５図中の一点鎖線に
示す櫛形フィルタの周波数特性は、次式％式％で表されるものである。この０式中のｆｏは上記入力信
号の基本周波数（音程に対応する基音の周波数）、Ｎは
櫛形フィルタの段数である。On the other hand, it is known that general noise components have a −-like frequency distribution. Therefore, by passing the input musical tone signal through a comb-shaped filter having frequency characteristics as shown in the dashed line in FIG. 15, the fundamental frequency f0 of the musical tone signal is
Only frequency components (so-called pitch components) that are integral multiples of are passed through or emphasized, while other components (non-pitch components and part of noise) are attenuated, and as a result, the S/N ratio can be improved. Here, the frequency characteristic of the comb-shaped filter shown by the dashed line in FIG. 15 is expressed by the following formula. In this equation, fo is the fundamental frequency of the input signal (the frequency of the fundamental tone corresponding to the pitch), and N is the number of stages of the comb filter.

このようにしてノイズ成分が低減された楽音信号は、前
記繰り返し波形抽出回路に送られ、この繰り返し波形抽
出回路により前述した第６図のルーピング区間ＬＰのよ
うな適当な操り返し波形区間が抽出された後、半導体メ
モリ等の記憶媒体に送られて記録される。この記憶媒体
に記録された楽音信号データは、非音程成分や一部のノ
イズ成分が減衰されたものであるため、上記繰り返し波
形区間を繰り返し再生する際のノイズ、いわゆるルーピ
ングノイズを低減することができる。The musical tone signal whose noise components have been reduced in this way is sent to the repetitive waveform extraction circuit, and this repetitive waveform extraction circuit extracts an appropriate repeated waveform section such as the looping section LP in FIG. 6 described above. After that, it is sent to a storage medium such as a semiconductor memory and recorded. Since the musical tone signal data recorded on this storage medium has non-pitch components and some noise components attenuated, it is possible to reduce the so-called looping noise, which is the noise that occurs when the above-mentioned repetitive waveform section is repeatedly reproduced. can.

なお上記ＨＰＦＳ櫛形フィルタ、ＬＰＦの周波数特性は
、先にピッチ検出機能ブロック１２にて検出されたピッ
チ情報である上記基本周波数ｆ０に基づいて設定される
ようになっている。Note that the frequency characteristics of the HPFS comb filter and LPF are set based on the fundamental frequency f0, which is pitch information detected by the pitch detection function block 12 in advance.

次に第７図のループ区間検出機能ブロック１６において
、上記フィルタ処理によって音程成分以外が減衰された
楽音信号に対して、適当な繰り返し波形区間を検出する
ことにより、ルーピング開始点ＬＰ、とルーピング終端
点ＬＰ、とのルーピングポイントを設定する。Next, the loop section detection function block 16 in FIG. 7 detects an appropriate repeating waveform section for the musical tone signal whose components other than pitch components have been attenuated by the above-described filtering process, thereby determining the looping starting point LP and the looping ending point. Set a looping point with point LP.

すなわち、ループ区間検出機能ブロック１６では、上記
楽音信号のピッチ（音程）に対応する繰り返し周期（の
整数倍）だけ相対的に離れた２点であるルーピングポイ
ントを選定するが、以下にその選定原理を説明する。That is, the loop section detection function block 16 selects two looping points that are relatively separated by (an integral multiple of) the repetition period corresponding to the pitch (interval) of the musical tone signal, and the selection principle is explained below. Explain.

楽音データをルーピング処理する場合、ルーピングの間
隔は、楽音信号の基本周期（基音の周波数の逆数）の整
数倍でなければならない。したがって、その楽音の音程
を正確に同定すれば、容易に決定することが可能となる
。When performing looping processing on musical tone data, the looping interval must be an integral multiple of the fundamental period (the reciprocal of the frequency of the fundamental tone) of the musical tone signal. Therefore, if the pitch of the musical note is accurately identified, it can be determined easily.

つまり、予めルーピング間隔を決定しておき、その間隔
分だけ離れた２点を取り出し、その２点の近傍の信号波
形の相関性あるいは類似性を評価することでルーピング
ポイントを設定する。この評価関数の一例として、上記
２点の各近傍の信号波形のサンプルについてのたたみ込
み（合成積、コンポリューシロン）を用いるものについ
て説明する。すなわち、上記コンボリューションの操作
を全ての点の組みについて順次施すことで信号波形の相
関性あるいは類（以外を評価する。ここで、上述のコン
ボリューションによる評価は、例えば上記楽音データを
シフトレジスタに順次入力してゆき、それぞれ各レジス
タで取り込まれた楽音データを、例えば後述するＤＳＰ
　（ディジタル信号処理装置）で構成された積和器にそ
れぞれ入力し、該積和器で上記コンボリューションを計
算し出力するものである。このようにして得られたコン
ボリューションが最大となる２点の組みをルーピング開
始点ＬＰ、およびルーピング終端点ＬＰＥとする。That is, a looping interval is determined in advance, two points separated by the interval are extracted, and a looping point is set by evaluating the correlation or similarity of signal waveforms in the vicinity of the two points. As an example of this evaluation function, one that uses convolution (synthetic product, convolution) of samples of signal waveforms near each of the two points will be described. In other words, the above convolution operation is sequentially applied to all sets of points to evaluate the correlation or type (other than) of the signal waveform. The musical tone data inputted sequentially and taken in by each register are sent to a DSP, which will be described later, for example.
(digital signal processing device), which calculates and outputs the above convolution. The set of two points resulting in the maximum convolution thus obtained is defined as a looping start point LP and a looping end point LPE.

すなわち、第１６回において、ルーピング開始点ＬＰ、
の候補点をａｏとし、ルーピング終端点ＬＰ、の候補点
をｂｏとして、上記ルーピング開始点ＬＰ、の候補点ａ
０の前後近傍の複数個の点、例えば２Ｎ＋１個の点の各
波高値データを、それぞれａ−Ｎ”　＋ａ−２＋ａ−１
＋ａ６＋ａｌ＋　ａｌｌ　　”　ａｌｌ　ｓルーピング
終端点ＬＰＥの候補点ｂ０の前後近傍の同し個数（２Ｎ
＋１個）の点の各波高値データを、ｂｌ・　・、ｂ−ｚ
、ｂ−１，ｂｏ＋ｂｌ　、ｂｚ、・・ｂ、とすると、こ
のときの評価関数Ｅ（ａｎ、ｂｏ）は、次式で定めるこ
とができる。この第［相］式はａｏ、ｂｅの点を中心と
したコンボリューションを求めるための式である。そし
て上記候補点ａｏ、ｂｏの組を順次変更して、全てのル
ーピングポイントの候補となる点についての上記評価関
数Ｅの値を求め、得られた全ての評価関数Ｅの内でその
値が最大となる点をルーピングポイントとする。That is, in the 16th time, the looping starting point LP,
Let the candidate point of the looping end point LP be ao, and let the candidate point of the looping end point LP be bo, then the candidate point a of the looping start point LP,
Each wave height value data of a plurality of points around 0, for example 2N+1 points, is a-N"+a-2+a-1, respectively.
+a6+al+ all ” all sThe same number of points (2N
+1 point) each wave height value data is bl・・・, b−z
, b-1, bo+bl, bz, . . . b, then the evaluation function E(an, bo) can be determined by the following equation. This phase [phase] equation is an equation for finding convolution centered on the points ao and be. Then, the set of candidate points ao and bo is sequentially changed to find the value of the evaluation function E for all looping point candidates, and the value is the largest among all the evaluation functions E obtained. The point where , is the looping point.

また、ルーピングポイントは上述のようにコンボリュー
ションから求める方法の他に、誤差の最小２乗法から求
めることも可能である。すなわち、最小２乗法によるル
ーピングポイントの候補点ａｏ。Further, in addition to the method of finding the looping point from convolution as described above, it is also possible to find the looping point from the method of least squares of errors. That is, the candidate point ao of the looping point is determined by the least squares method.

ｂｏは、ｔ　（ａｏ、ｂｏ）　　＝Σ（ａｋ−ｂよ）ｚ・・・・
［相］の弐で表すことができる。この場合には、評価関
数εの値が最小となるａＯＪ６を求めればよい。bo is t (ao, bo) = Σ (ak - b) z...
It can be expressed by the second part of [phase]. In this case, aOJ6 with the minimum value of the evaluation function ε may be found.

また、上述のループ区間検出機能ブロック１６では、必
要に応して上記ルーピング開始点ＬＰ。Further, in the loop section detection function block 16 described above, the looping start point LP is determined as necessary.

とルーピング終端点ＬＰ、とに基づいてピッチ変換比を
算出する。このピッチ変換比は、次の機能ブロック１７
における時間軸補正処理の際の時間軸補正値データとし
て用いられる。この時間軸補正処理は、実際に各種音源
データをメモリ等の記憶手段に記録する際の各種音源デ
ータの各ピッチを揃えておくために行われるものであり
、上記ピッチ変換比の代わりにピッチ検出機能ブロック
１２において検出された上記ピッチ情報を用いるように
してもよい。The pitch conversion ratio is calculated based on the looping end point LP, and the looping end point LP. This pitch conversion ratio is determined by the following function block 17.
It is used as time axis correction value data during the time axis correction process. This time axis correction process is performed in order to align the pitches of various sound source data when actually recording the various sound source data in a storage means such as memory, and uses pitch detection instead of the above pitch conversion ratio. The pitch information detected in the functional block 12 may be used.

この時間軸補正機能ブロック１７におけるピッチの正規
化動作について第１７図を参照しながら説明する。The pitch normalization operation in the time axis correction function block 17 will be explained with reference to FIG. 17.

第１７図Ａは時間軸補正処理（主として時間軸圧伸処理
）を施す前の楽音信号波形を示し、第１７図Ｂは上記圧
伸後の補正波形を示している。これらの第１７図Ａ、Ｂ
の時間軸には、後述する準瞬時ビット圧縮符号化処理の
際のブロック単位で目盛りを付している。FIG. 17A shows a tone signal waveform before time axis correction processing (mainly time axis companding process), and FIG. 17B shows a corrected waveform after the companding process. These Figures 17A and B
The time axis of is marked with a scale in units of blocks during quasi-instantaneous bit compression encoding processing, which will be described later.

時間軸補正前の波形Ａにおいては、通常の場合ルーピン
グ区間ＬＰと上記ブロックとは無関係となるが、第１７
図Ｂに示すように、上記ルーピング区間ＬＰがブロック
の長さ（ブロック周期）の整数倍（ｍ倍）となるように
時間軸圧伸処理し、さらにブロックの境界位置が上記ル
ーピング開始点ＬＰ、及びルーピング終端点ＬＰｔに一
致するように時間軸方向にシフトする。すなわち、ルー
ピング区間ＬＰの開始点ＬＰｓ及び終端点ＬＰ。In the waveform A before time axis correction, the looping section LP is normally unrelated to the above block, but the 17th
As shown in FIG. B, the time axis companding process is performed so that the looping section LP is an integral multiple (m times) of the block length (block period), and the block boundary position is the looping start point LP, and shifted in the time axis direction to match the looping end point LPt. That is, the starting point LPs and the ending point LP of the looping section LP.

が所定のブロックの境界位置となるように時間軸補正（
時間軸圧伸及びシフト）することによって、整数個（ｍ
個）のブロック単位でルーピング処理を行うことができ
、記録時の音源データのピッチの正規化が実現できる。Time axis correction (
time axis companding and shifting), an integer number (m
The looping process can be performed on a block-by-block basis, making it possible to normalize the pitch of sound source data during recording.

ここで、上記時間シフトによって楽音信号波形の先頭に
生ずるブロックの境界からのずれ分ΔＴの間には、波高
値データとして°“０″を詰めるようにすればよい。Here, the deviation ΔT from the block boundary that occurs at the beginning of the musical tone signal waveform due to the above-mentioned time shift may be filled with 0 as the peak value data.

第１８図は、上記時間軸補正後の波形の波高値データを
後述のビット圧縮符号化処理するためにブロック化する
際のブロック構造を表すものであり、１ブロツクの波高
値データの個数（サンプル数、ワード数）をｈとしてい
る。この場合、上記ピッチの正規化とは、−１的に第６
図に示す楽音信号波形の一定周期Ｔｗの波形のｎ周期分
すなわちルーピング区間ＬＰ内のワード数を、上記ブロ
ック内のワード数りの整数倍（ｍ倍）とするように時間
軸圧伸処理することであり、さらに好ましくは、ルーピ
ング区間ＬＰの開始点ＬＰｓ及び終端点ＬＰ、を時間軸
上のブロック境界位置に一致させるように時間軸処理（
シフト処理）させることである。このように各点ＬＰｓ
　、ｔ、ｐ、がブロック境界位置に一致していると、ビ
ット圧縮符号化システムでのデコードの際のブロック切
替えによって生じる誤差を減少させることができる。FIG. 18 shows the block structure when the peak value data of the waveform after time axis correction is divided into blocks for the bit compression encoding process described below. (number of words, number of words) is set to h. In this case, the above pitch normalization means -1
Time axis companding processing is performed so that the number of words in the looping section LP for n cycles of the musical tone signal waveform with a constant period Tw shown in the figure is an integral multiple (m times) of the number of words in the block. More preferably, time axis processing (
shift processing). In this way, each point LPs
, t, p coincide with the block boundary position, it is possible to reduce errors caused by block switching during decoding in the bit compression encoding system.

ここで、第１８図Ａの１ブロツク内の図中斜線で示す部
分のワードｗｔ、ｐｓとＷＬＰ、は、図中補正波形のル
ーピング開始点ＬＰｓ　とルーピング終端点ＬＰ！　　
（正確には点ＬＰＩの直前の点）のサンプルを示すワー
ドである。なお上記シフト処理を行わない場合には、ル
ーピング開始点ＬＰ。Here, the words wt, ps, and WLP in the shaded portion in one block of FIG. 18A are the looping start point LPs and the looping end point LP! of the corrected waveform in the figure.
(To be exact, the point immediately before point LPI) is a word indicating a sample. Note that if the above shift processing is not performed, the looping start point LP.

及び終端点ＬＰＥがブロック境界に必ずしも一致しない
ため、第１８図Ｂに示すように、上記ワードｗＬｐｓ　
、ＷＬＰ、の設定位置は、ブロック内の任意の位置に設
定される。ただし、上記ワードＷＬ　Ｐ、からワードＷ
ＬＰ！までの間のワード数はｌブロック内のワード数り
の整数倍（ｍ倍）となっており、ピッチは正規化される
。Since the terminal point LPE and the terminal point LPE do not necessarily coincide with the block boundary, as shown in FIG. 18B, the word wLps
, WLP, are set at any position within the block. However, from the above word WL P, to the word W
LP! The number of words up to this point is an integral multiple (m times) of the number of words in l block, and the pitch is normalized.

ここで、上述のようにルーピング区間ＬＰ内のワード数
を１ブロツクのワード数りの整数倍とするための楽音信
号波形の時間軸圧伸処理には各種方法が考えられるが、
例えばサンプリングされた波形の波高値データを補間処
理することにより実現でき、その−具体例としては、オ
ーバーサンプリング処理用のフィルタ構成等を利用する
ことができる。Here, as mentioned above, various methods can be considered for time-axis companding processing of the musical tone signal waveform in order to make the number of words in the looping section LP an integral multiple of the number of words in one block.
For example, this can be realized by interpolating the peak value data of a sampled waveform, and as a specific example, a filter configuration for oversampling processing, etc. can be used.

（以下余白）ところで、現実の楽音波形のルーピング周期がサンプリ
ング周期単位に対して端数を持ち、ルーピング開始点Ｌ
Ｐ、でのサンプリング波高値とルーピング終端点ＬＰＥ
でのサンプリング波高値とにずれが生じている場合に、
オーバサンプリング等を利用した補間処理により、ルー
ピング終端点ＬＰえの近傍位置（サンプリング周期より
も短い距離の位置）でルーピング開始点ＬＰ、のサンプ
リング波高値に一致するような波高値を求める等して、
補間サンプルも含めたサンプリング周期の非整数倍の（
端数を持つ）ルーピング周期を実現することが考えられ
る。このようなサンプリング周期の非整数倍のルーピン
グ周期も、上記時間軸補正処理により上記ブロック周期
の整数倍とすることができ、例えば２５６倍オーバサン
プリング周期の非整数倍のルーピング周期も、上記時間
軸補正処理により上記ブロック周期の整数倍とすること
ができ、例えば２５６倍オーバサンプリングを利用して
時間軸圧伸処理する場合には、ルーピング開始点ＬＰ、
と終端点ＬＰＩとの間の波高値の誤差を１／２５６に低
減して、より円滑なルーピング再生を実現できる。(Left below) By the way, the looping period of an actual musical sound waveform has a fraction of the sampling period unit, and the looping start point L
The sampling peak value at P and the looping termination point LPE
If there is a deviation from the sampling peak value at
By interpolation processing using oversampling, etc., a peak value that matches the sampling peak value of the looping start point LP is obtained at a position near the looping end point LP (a position at a distance shorter than the sampling period). ,
Non-integer multiple of the sampling period including interpolated samples (
It is conceivable to realize a looping period (with a fraction). Such a looping period that is a non-integer multiple of the sampling period can also be made an integer multiple of the block period by the above-mentioned time axis correction processing. By correction processing, the block period can be set to an integral multiple of the above block period. For example, when performing time axis companding processing using 256 times oversampling, the looping start point LP,
By reducing the error in the peak value between the LPI and the end point LPI to 1/256, smoother looping reproduction can be realized.

上述のようにしてルーピング区間ＬＰが決められ時間軸
補正（圧伸）処理が施された波形は、次の機能ブロック
２１において、第１９図に示すようにルーピング区間Ｌ
Ｐを前後に接続してループデータの生成が行われる。す
なわち第１９図は、上記時間軸補正後の楽音波形（第１
７図Ｂ）からルーピング区間ＬＰのみを切り取り、この
ルーピング区間ＬＰを複数個並べたループデータ波形を
示しており、このループデータ波形は、複数個のルーピ
ング区間ＬＰのそれぞれ一方のルーピング終端点ＬＰＥ
と他方のルーピング開始点ＬＰｓとを順次接続して並べ
たものである。このループデータ波形がループデータ生
成機能ブロック２１にて生成される。In the next functional block 21, the waveform whose looping section LP is determined and subjected to time axis correction (companding) processing as described above is converted into a looping section L as shown in FIG.
Loop data is generated by connecting P in front and behind each other. In other words, FIG. 19 shows the musical sound waveform (the first
A loop data waveform is shown in which only the looping section LP is cut out from Figure 7B) and a plurality of these looping sections LP are arranged.
and the other looping start point LPs are connected and arranged in sequence. This loop data waveform is generated by the loop data generation function block 21.

このループデータは、ルーピング区間ＬＰを多数回接続
して形成されるため、該接続形成されたループデータ波
形の各ルーピング開始点ＬＰ、に対応するワードＷＬＰ
ｌを含む開始ブロックの直前には、ルーピング終端点Ｌ
ＰＥ　　（正確には点ＬＰｔの直前の点）に対応するワ
ードＷＬＰ、を含む終了ブロックのデータがそのまま配
置されることになる。原理的には、ビット圧縮符号化の
エンコード処理をするすに、記憶しようとするルーピン
グ区間ＬＰ、の上記開始ブロックの直前位置に、少なく
とも上記終了ブロックが存在していればよい。さらに−
最北して述べるならば、上記ブロック単位のビット圧縮
エンコード時に、上記開始ブロックのパラメータ（圧縮
ブロック毎のピント圧縮符号化の情報、例えば後述する
レンジ情報やフィルタ選択情報）は、上記開始ブロック
と終了ブロックのデータに基づいて形成されるようにす
ればよい。これは、後述するフォルマント部分を持たな
いループデータのみの楽音信号を音源とする場合にも適
用可能な技術である。Since this loop data is formed by connecting looping sections LP many times, a word WLP corresponding to each looping start point LP of the connected loop data waveform is
Immediately before the start block containing l, the looping end point L
The data of the end block including the word WLP corresponding to PE (more precisely, the point immediately before point LPt) will be placed as is. In principle, in order to perform bit compression encoding processing, it is sufficient that at least the end block exists immediately before the start block of the looping section LP to be stored. Furthermore-
To put it simply, at the time of bit compression encoding on a block-by-block basis, the parameters of the starting block (information on focus compression encoding for each compressed block, such as range information and filter selection information to be described later) are the same as the starting block. It may be formed based on the data of the end block. This is a technique that can also be applied when the sound source is a musical tone signal consisting only of loop data without a formant part, which will be described later.

こうすれば、上記エンコード時に、ルーピング開始点Ｌ
　Ｐ　ｓと終端点ＬＰｔとについては、それぞれの前後
複数サンプルに亘って、それぞれ同じデータが並ぶこと
になる。従って、これらの各点ＬＰｓとＬ　Ｐ　Ｅの直
前のそれぞれのブロックについてのビット圧縮符号化の
際のパラメータは同じものとなり、デコード処理の際の
ルーピング再生時のエラー（ノイズ）を減少することが
できる。By doing this, during the above encoding, the looping start point L
Regarding Ps and the terminal point LPt, the same data is lined up over a plurality of samples before and after each of them. Therefore, the parameters for bit compression encoding for each block immediately before each point LPs and LPE are the same, and errors (noise) during looping playback during decoding processing can be reduced. can.

すなわち、ルーピング再生される楽音データは接続ノイ
ズの無い安定したものとなる。なお、本実施例において
は、上記開始ブロックの直前に配置する上記ルーピング
区間ＬＰのデータのサンプル数を約５００サンプルとし
ている。That is, the musical tone data that is looped and reproduced becomes stable without any connection noise. In this embodiment, the number of samples of data in the looping section LP placed immediately before the start block is approximately 500 samples.

次に上記フォルマント部分ＦＲの信号のデータ生成工程
においては、先ず、上記ループデータ生成の際の機能ブ
ロック１４と同様に、機能ブロック１８おいてエンベロ
ープ補正処理が施される。Next, in the process of generating data for the signal of the formant portion FR, envelope correction processing is first performed in the functional block 18, similar to the functional block 14 in generating the loop data.

ただしこの場合のエンベロープ補正は、上記サンプリン
グ処理された楽音信号に対して、前述したデイケイレー
ト情報のみのエンベロープ補正（第１１図）で割算する
ことにより、第２０図に示すような波形の信号（の波高
値データ）を得ている。However, in this case, the envelope correction is performed by dividing the sampled musical tone signal by the envelope correction of only the decay rate information (Fig. 11) described above, resulting in a waveform as shown in Fig. 20. The signal (peak value data) is obtained.

すなわちこの第２０図の出力信号においては、上記アタ
ック部分（時間ＴＡＯ間）のエンベロープが残され、そ
れ以外の部分は一定振幅となっている。That is, in the output signal of FIG. 20, the envelope of the attack portion (between time TAO) remains, and the other portions have a constant amplitude.

このエンベロープ補正された信号は、必要に応じて機能
ブロック１９でのフィルタ処理が施されることにより、
楽音信号の非音程成分が相対的に強調された信号となる
。この機能ブロック１９でのフィルタ処理には、上記機
能ブロック１５とは対称的な例えば第１５図の破線に示
すような周波数特性の櫛形フィルタが用いられる。すな
わちこの櫛形フィルタは、上記音程に対応する基本周波
数ｆ０の整数倍の周波数帯域成分を減衰させて相対的に
非音程成分を強調するような周波数特性を有しており、
この櫛形フィルタも上記ピッチ検出機能ブロック１２で
検出されたピッチ情報（基本周波数ｒｅ）に基づいて周
波数特性が設定されるものである。このような非音程成
分が相対的に強調された信号は、最終的にメモリ等の記
憶媒体に記録される音源データにおけるフォルマント部
分の信号のデータを生成するために用いられる。This envelope-corrected signal is subjected to filter processing in the functional block 19 as necessary, so that
The non-pitch component of the musical tone signal becomes a signal that is relatively emphasized. For the filter processing in the functional block 19, a comb-shaped filter having a frequency characteristic as shown by the broken line in FIG. 15, for example, which is symmetrical to the functional block 15, is used. That is, this comb-shaped filter has a frequency characteristic that attenuates frequency band components that are integral multiples of the fundamental frequency f0 corresponding to the pitch and relatively emphasizes non-pitch components,
The frequency characteristics of this comb filter are also set based on the pitch information (fundamental frequency re) detected by the pitch detection function block 12. Such a signal in which non-pitch components are relatively emphasized is used to generate signal data of a formant part in sound source data that is finally recorded in a storage medium such as a memory.

次の機能ブロック２０においては、上記機能プロック１
７゛と同様な時間軸補正が上記フォルマント部分生成用
信号に対しても行われる。これは、上記機能ブロック１
６で求められたピッチ変換比あるいは上記機能ブロック
１２で検出されたピッチ情報に基づいて時間軸の圧縮伸
長を行うことにより、各音源毎のピッチを揃える（正規
化する）ためのものである。In the next functional block 20, the above functional block 1
The same time axis correction as in step 7 is also performed on the formant part generation signal. This is the function block 1 above.
This is to equalize (normalize) the pitches of each sound source by compressing and expanding the time axis based on the pitch conversion ratio obtained in step 6 or the pitch information detected in the functional block 12.

次に、機能ブロック２２において、上記共に同じピッチ
変換比あるいはピッチ情報を用いて時間軸補正されたル
ープデータとフォルマント部分生成用データとが混合さ
れる。このときの混合は、上記機能ブロック２０からの
フォルマント部分生成用信号に対してハミング窓をかけ
、ループデータと混合しようとする部分で時間に伴って
減衰するフェイドアウト型の信号を形成し、これに対し
て上記機能ブロック２０からのループデータに対しても
同様なハミング窓をかけ、この場合にはフォルマント信
号と混合しようとする部分で時間に伴って増大するフェ
イドイン型の信号を形成し、これらの信号を混合する（
クロスフエイドする）ことにより、最終的に音源データ
となる楽音信号を得ている。ここで、メモリ等の記憶媒
体に記録するループデータとしては、上記クロスフエイ
ド部分からある程度刈れた１つのルーピング区間のデー
タを取り出氷ことにより、ルーピング再生時のノイズ（
ルーピングノイズ）を低減することができる。このよう
にして、発音時からの非音程成分を含む波形部分である
フォルマント部分ＦＲと、音程成分のみの繰り返し波形
部分であるルーピング区間ＬＰとから成る音源信号の波
高値データが得られる。Next, in a functional block 22, the loop data and formant part generation data, both of which have been time-base corrected using the same pitch conversion ratio or pitch information, are mixed. In this mixing, a Hamming window is applied to the formant part generation signal from the functional block 20 to form a fade-out signal that attenuates over time in the part to be mixed with the loop data. On the other hand, a similar Hamming window is applied to the loop data from the functional block 20, and in this case, a fade-in type signal that increases with time is formed in the portion to be mixed with the formant signal, and these Mix the signals of (
(crossfading) to obtain musical tone signals that ultimately become sound source data. Here, as the loop data to be recorded in a storage medium such as a memory, the data of one looping section that has been cut to a certain extent from the crossfade section is taken out and the noise during looping playback is eliminated.
looping noise) can be reduced. In this way, peak value data of the sound source signal is obtained, which consists of the formant portion FR, which is a waveform portion including non-pitch components from the time of sound generation, and the looping section LP, which is a repeating waveform portion containing only pitch components.

この他、上記フォルマント部分生成用信号における上記
ルーピング開始点の位置にループデータの信号の開始点
を接続するように各部分を切り繋ぐ処理等も考えられる
。In addition, a process of cutting and connecting each part such that the start point of the loop data signal is connected to the position of the looping start point in the formant part generation signal may also be considered.

ところで、現実にループ区間検出やルーピング処理、さ
らにはループデータとフォルマント部分との混合を行う
際には、人間の手操作により試行錯誤的に試聴を繰り返
しながら大まかな混合をしておき、このときのループポ
イント（ルーピング開始点ＬＰ、とルーピング終端点Ｌ
Ｐｔ）情報等に基づいてより高精度の処理を行っている
。By the way, when actually performing loop section detection, looping processing, and even mixing loop data and formant parts, a rough mix is performed by repeating trial and error trial listening manually, and at this time, loop points (looping start point LP, looping end point L
Pt) Processing with higher precision is performed based on information etc.

すなわち、上記機能ブロック１６での高精度のループ区
間検出に先立って、第２１図のフローチャートに示すよ
うな手順でループ区間検出や上記混合等を試聴を繰り返
しながら手操作で行い、その後、上述したような高精度
の処理（ステップ３２６以降）を行わせる。That is, prior to detecting a loop section with high accuracy in the function block 16, the loop section detection and the above-mentioned mixing etc. are manually performed while repeatedly listening to the sample in the procedure shown in the flowchart of FIG. 21, and then the above-mentioned Highly accurate processing (from step 326 onward) is performed.

この第２１図において、最初のステップ３２１において
は、例えば信号波形のゼロクロス点を利用したり、信号
波形の表示を目視確認しながら、比較的粗い精度で上記
ループポイントを検出し、ステップＳ２２でルーピング
処理して上記ループポイント間の波形を繰り返し再生し
、次のステップＳ２３で人間が試聴して良好か否かを判
別する。In FIG. 21, in the first step 321, the loop point is detected with relatively rough accuracy by using, for example, the zero crossing point of the signal waveform or visually checking the display of the signal waveform, and in step S22, the loop point is detected. The waveform between the loop points is processed and reproduced repeatedly, and in the next step S23, a human listens to the sample and determines whether it is good or not.

不良の場合には上記最初のステップ３２１に戻ってルー
プポイントを再度検出する。これを繰り返して良好な試
聴結果が得られれば、次のステップＳ２４に進み、上記
フォルマント部用信号とクロスフェード等により混合し
、次のステップＳ２３で人間が試聴してフォルマント部
からルーピング部への移行が良好か否かを判別する。不
良の場合にはステップＳ２４に戻って上記混合をやり直
す。If it is defective, the process returns to the first step 321 and the loop point is detected again. If a good listening result is obtained by repeating this process, the process proceeds to the next step S24, where it is mixed with the above-mentioned formant part signal by cross-fading, etc. In the next step S23, a human listens to the signal from the formant part to the looping part. Determine whether the transition is good or not. If the mixture is defective, the process returns to step S24 and the above mixing is repeated.

その後、ステップＳ２６に進んで、上記ループ区間検出
機能ブロック１６における高精度のループ区間検出を行
う。具体的には上記補間サンプルも含むループ区間検出
、例えば２５６倍オーバサンプリング時にはサンプリン
グ周期の１／２５６の精度でのループ区間検出を行い、
次のステ、ブＳ２７で上記ピッチ正規化のためのピッチ
変換比を算出する。このピッチ変換比に基づいて、次の
ステップ３２Ｂで上記機能ブロック１７．２ｏにおける
時間軸補正処理を行い、次のステップＳ２９にて上記機
能ブロック２１でのループデーク生成を行う、そして、
ステップ３３０において、上記機能ブロック２２での混
合処理を行う。これらのステップ３２６以降の処理にお
いては、ステップＳ２１からＳ２５までで得られたルー
プポイント情報等を利用するものである。なお、上記ス
テップＳ２１から３２５までを省略して、ルーピング処
理等の全自動化を図ってもよい。Thereafter, the process proceeds to step S26, where the loop section detection function block 16 performs highly accurate loop section detection. Specifically, loop section detection including the above-mentioned interpolated samples is performed, for example, in the case of 256 times oversampling, loop section detection is performed with an accuracy of 1/256 of the sampling period.
In the next step, step S27, a pitch conversion ratio for the pitch normalization is calculated. Based on this pitch conversion ratio, time axis correction processing is performed in the functional block 17.2o in the next step 32B, loop data generation is performed in the functional block 21 in the next step S29, and
In step 330, the mixing process in the functional block 22 is performed. In the processing after step 326, the loop point information etc. obtained in steps S21 to S25 are used. Note that steps S21 to 325 may be omitted to fully automate the looping process and the like.

このような混合処理により得られたフォルマント部分Ｆ
Ｒとルーピング区間ＬＰとから成る信号の波高値データ
は、次の機能ブロック２３においてビット圧縮符号化処
理が施される。The formant part F obtained by such a mixing process
The peak value data of the signal consisting of R and the looping section LP is subjected to bit compression encoding processing in the next functional block 23.

上述のビット圧縮符号化方式としては種々のものが考え
られるが、ここでは、本件出願人が先に特開昭６１−０
０８６２９号公報や特開昭６２００３５１６号公報等に
おいて提案している準瞬時圧伸型、すなわち波高値デー
タの一定ワード数（ｈサンプル）毎にブロック化しこの
ブロック単位でピント圧縮を施すような高能率符号化方
式を用いるものとし、この高能率ビット圧縮符号化方式
について、第２２図を参照しながら概略的に説明する。Various bit compression encoding methods can be considered as the above-mentioned bit compression encoding method, but here, the present applicant first proposed Japanese Patent Application Laid-Open No. 61-0
The quasi-instantaneous companding type proposed in Publication No. 08629 and Japanese Patent Application Laid-open No. 62003516, etc. is highly efficient, in which the peak value data is divided into blocks for each fixed number of words (h samples) and focus compression is performed in units of blocks. This high-efficiency bit compression encoding method will be briefly described with reference to FIG. 22.

この第２２図において、上記高能率ビット圧縮符号化シ
ステムは、記録側のエンコーダ７０と、再生側のデコー
ダ９０々により構成されており、エンコーダ７０の入力
端子７１には、上記音源信号の波高値データｘ　（ｎ）
が供給されている。In FIG. 22, the high-efficiency bit compression encoding system is composed of an encoder 70 on the recording side and a decoder 90 on the reproduction side. data x (n)
is supplied.

この入力信号（の波高値データ）ｘ（ｎ）は、予測器７
２及び加算器７３で構成されたＦＩＲ（有限インパルス
応答型）ディジタルフィルタ７４に供給され、上記予測
器７２からの予測信号（の波高値データ）７（ｎ）は上
記加算器７３に減算信号として送られている。上記加算
器７３においては、上記入力信号ｘ　（ｎ）から上記予
測信号ｘ（ｎ）が減算されることによって、予測誤差信
号あるいは広義の差分出力ｄ（ｎ）が出力される。予測
器７２は、一般に過去の２個の入力ｘ（ｎ−ｐ）、ｘ（
ｎ−ｐ＋１）、・・ｘ（ｎ−１）の１成語合により予測
値マ（ｎ）を算出するものである。なお、上記ＦＩＲフ
ィルタ７４を、以下エンコード・フィルタと称す。This input signal (peak value data) x(n) is obtained by the predictor 7
The predicted signal (peak value data) 7(n) from the predictor 72 is supplied to the FIR (finite impulse response) digital filter 74, which is composed of the adder 2 and the adder 73, and the predicted signal (peak value data) 7(n) from the predictor 72 is sent to the adder 73 as a subtraction signal. being sent. The adder 73 subtracts the prediction signal x(n) from the input signal x(n), thereby outputting a prediction error signal or a broadly defined difference output d(n). Predictor 72 generally uses two past inputs x(n-p), x(
The predicted value Ma(n) is calculated by a one-component combination of n-p+1), . . . x(n-1). Note that the FIR filter 74 is hereinafter referred to as an encode filter.

上記高能率ビット圧縮符号システムにおいては、上記音
源データの一定時間内のデータ、すなわち、一定ワード
数りの入力データ毎にブロック化して、各ブロック毎に
量適の特性の上記エンコード・フィルタ７４を選択する
ようにしている。これは、互いに異なる特性を有する複
数の（例えば４個の）エンコード・フィルタを予め設け
ておき、これらのフィルタのうち最適の特性の、すなわ
ち最も高い圧縮率を得ることのできるようなフィルタを
選択することで実現できるものである。ただし、一般の
ディジタル・フィルタの構成上は、第２２図に示す１個
のエンコード・フィルタ７４の予測器７２の係数の組を
複数Ｍｉ（例えば４組）係数メモリ等に記憶させておき
、これらの係数の組を時分割的に切り換え選択すること
で、実質的に上記複数のエンコード・フィルタのうちの
１つを選択するのと等価な動作を行わせることが多い。In the above-mentioned high-efficiency bit compression code system, data within a certain period of time of the above-mentioned sound source data, that is, input data of a certain number of words, are divided into blocks, and the above-mentioned encoding filter 74 having characteristics suitable for each block is applied. I try to choose. In this method, multiple (for example, four) encoding filters with different characteristics are provided in advance, and the filter with the optimal characteristics, that is, the one that can obtain the highest compression rate, is selected. This can be achieved by doing so. However, in the configuration of a general digital filter, a plurality of sets of coefficients Mi (for example, 4 sets) of the coefficients of the predictor 72 of one encode filter 74 shown in FIG. 22 are stored in a coefficient memory or the like. By time-divisionally switching and selecting a set of coefficients, an operation substantially equivalent to selecting one of the plurality of encode filters described above is often performed.

次に、上記予測誤差としての差分出力ｄ　（ｎ）は、加
算器８１を介し、利得Ｇのシフタ７５と量子化器７６と
よりなるビット圧縮器に送られ、例えば浮動小数点（フ
ローティング・ポイン日表示形態における指数部が上記
利得Ｇに、仮数部が量子化器７６からの出力にそれぞれ
対応するような圧縮処理あるいはレンジング処理が施さ
れる。すなわち、シフタ７５により入力データを上記利
得Ｇに応じたビット数だけシフトしてレンジを切り替え
、量子化器７６により該ビット・シフトされたデータの
一定ビット数を取り出すような再量子化を行っている。Next, the difference output d (n) as the prediction error is sent via an adder 81 to a bit compressor consisting of a shifter 75 with a gain G and a quantizer 76. Compression processing or ranging processing is performed such that the exponent part in the display form corresponds to the gain G and the mantissa part corresponds to the output from the quantizer 76. That is, the shifter 75 converts the input data according to the gain G. The quantizer 76 performs requantization to extract a fixed number of bits from the bit-shifted data.

ここで、ノイズ・シェイピング回路（ノイズ・シェイパ
）７７は、量子化器７６の出力と入力との誤差分いわゆ
る量子化誤差を加算器７８で得て、この量子化誤差を利
得Ｑ　−１のシフタ７９を介し予測器８０に送って、量
子化誤差の予測信号を加算器８１に減算信号として帰還
するようないわゆるエラー・フィードバックを行う。Here, the noise shaping circuit (noise shaper) 77 uses an adder 78 to obtain a so-called quantization error corresponding to the error between the output and the input of the quantizer 76, and applies this quantization error to a shifter with a gain of Q −1. 79 to the predictor 80, so-called error feedback is performed in which the predicted signal of the quantization error is fed back to the adder 81 as a subtraction signal.

このように量子化器７６による再量子化とノイズ・シェ
イピング回路７７によるエラー・フィードバックとが施
され、出力端子８２より出力２（ｎ）が取り出される。In this way, requantization by the quantizer 76 and error feedback by the noise shaping circuit 77 are performed, and an output 2(n) is taken out from the output terminal 82.

ところで、上記加算器８１からの出力ｄ“（ｎ）は上記
差分出力ｄ　（ｎ）より上記ノイズ・シェイパ７７から
の量子化誤差の予測信号τ（ｎ＞を減算したものであり
、上記利得Ｇのシフタ７５からの出力ｄ　”　（ｎ）は
利得Ｇと上記出力加算器８１からの出力ｄ　’　（ｎ）
を乗算したものである。また、上記量子化器１６からの
出力？（ｎ）は、量子化の過程における量子化誤差ｅ　
（ｎ）と上記シフタ７５からの出力ｄ　”　（ｎ）を加
算したものとなり、上記ノイズ・シェイパ７７の上記加
算器７８において上記量子化誤差ｅ　（ｎ）が取り出さ
れる。この量子化誤差ｅ　（ｎ＞は、上記利得Ｑ　−１
のシック７９を介し、過去の１個の入力の１凍結合をと
る予測器８０を介することにより量子化誤差の予測信号
７（ｎ）となる。By the way, the output d''(n) from the adder 81 is obtained by subtracting the predicted signal τ(n> of the quantization error from the noise shaper 77) from the difference output d(n), and the gain G The output d'' (n) from the shifter 75 is the gain G and the output d' (n) from the output adder 81.
is multiplied by Also, the output from the quantizer 16? (n) is the quantization error e in the quantization process
(n) and the output d '' (n) from the shifter 75, and the quantization error e (n) is extracted in the adder 78 of the noise shaper 77. This quantization error e ( n> is the above gain Q −1
The predicted signal 7(n) of the quantization error is obtained by passing through the chic 79 and the predictor 80 which takes a 1-free sum of one past input.

上記音源データは、以上のようなエンコード処理が施さ
れ、上記量子化器７６からの出力↑（ｎ）となって出力
端子８２を介して取り出される。The sound source data is subjected to the encoding process as described above, and is outputted from the quantizer 76 as an output ↑(n) through the output terminal 82.

次に予測・レンジ適応回路８４からは、最適フィルタ選
択情報としてのモード選択情報が出力されて、上記エン
コード・フィルタ７４の例えば予測器７２および出力端
子８７に送られ、また、上記利得Ｇおよび利得Ｇ−１あ
るいは上記ビット・シフト量を決定するためのレンジ情
報が出力されて、各シフタ７５．７９および出力端子８
６に送られている。Next, the prediction/range adaptation circuit 84 outputs mode selection information as optimal filter selection information and sends it to the predictor 72 and output terminal 87 of the encoding filter 74, and also outputs the gain G and the gain G-1 or the range information for determining the above bit shift amount is outputted to each shifter 75, 79 and output terminal 8.
It has been sent to 6.

次に、再生側のデコーダ９０の入力端子９１には、上記
エンコーダ７０の出力端子８２からの出力↑（ｎ）が４
Ｆ、送され、あるいは記録、再生されることによって得
られた信号、ａ−１（ｎ）が供給されている。この入力
信号７１　（ｎ）は利得Ｑ−１のシフタ９２を介し加算
器９３に送られている。加算器９３からの出力ｘ　’　
（ｎ）は予測器９４に送られて予測信号？　’　（ｎ）
となり、この予測信号？　’　（ｎ）は上記加算器９３
に送られて上記シフタ９２からの出カフ　９１　（ｎ）
と加算される。この加算出力がデコード出力↑°（ｎ）
　として出力端子９５より出力される。Next, the output ↑(n) from the output terminal 82 of the encoder 70 is input to the input terminal 91 of the decoder 90 on the playback side.
F, a signal obtained by being sent, recorded, or reproduced, a-1(n), is supplied. This input signal 71 (n) is sent to an adder 93 via a shifter 92 with a gain of Q-1. Output x' from adder 93
(n) is sent to the predictor 94 and is the predicted signal? '(n)
So, this predicted signal? '(n) is the adder 93
Output from the shifter 92 91 (n)
is added. This addition output is the decoded output ↑°(n)
The signal is outputted from the output terminal 95 as a signal.

また、上記エンコーダ７０の各出力端子８６および８７
より出力され、伝送あるいは記録、再生された上記レン
ジ情報およびモード選択信号は、上記デコーダ９０の各
入力端子９６および９７にそれぞれ入力されている。そ
して、入力端子９６からのレンジ情報は上記シフタ９２
に送られて利得Ｇ−１を決定し、入力端子９７からのモ
ード選択情報は上記予測器９４に送られて予測特性を決
定する。この予測器９４の予測特性は、上記エンコーダ
７０の予測器７２の特性に等しいものが選択される。Further, each output terminal 86 and 87 of the encoder 70
The range information and mode selection signal output, transmitted, recorded, and reproduced are input to input terminals 96 and 97 of the decoder 90, respectively. The range information from the input terminal 96 is sent to the shifter 92.
The mode selection information from the input terminal 97 is sent to the predictor 94 to determine the prediction characteristic. The prediction characteristic of this predictor 94 is selected to be equal to the characteristic of the predictor 72 of the encoder 70.

このような構成のデコーダ９０において、上記シック９
２からの出力↑”（ｎ）は、上記入力信号↑“（ｎ）と
利得Ｇ−１を乗算したものである。また、上記加算器９
３の出力９　’　（ｎ）は、上記シフタ９２からの出力
↑”（ｎ）と予測信号？　’　（ｎ）を加算したもので
ある。In the decoder 90 having such a configuration, the thick 9
The output ↑''(n) from 2 is the product of the input signal ↑''(n) and the gain G-1. In addition, the adder 9
The output 9'(n) of 3 is the sum of the output ↑''(n) from the shifter 92 and the prediction signal ?'(n).

次に、第２３図は、上記ビット圧縮符号化エンコーダ７
０からの上記１ブロック分の出力データの一例を示して
おり、この１ブロック分のデータは、１バイトのヘッダ
情報（パラメータ情報）ＲＦと８バイトのサンプル用デ
ータＤＡＯ〜Ｄ１で構成されている。上記ヘッダ情報Ｒ
Ｆは、４ビツトの上記レンジ情報と、２ビツトの上記フ
ィルタ選択情報と、それぞれ１ビツトの２つのフラグ情
報、例えばループの有無を示す情報Ｌｌ及び波形の終端
ブロック（エンドブロック）が否かを示す情報Ｅｌとで
構成されている。ここで１サンプルの波高値データは、
ビット圧縮されて４ビツトで表されており、上記データ
ＤＡ、〜Ｄ０中には１６サンブル分の４ビツト・データ
ＤＡＯＨ−Ｄ１３１が含まれている。Next, FIG. 23 shows the bit compression encoding encoder 7.
This shows an example of the above one block of output data from 0. This one block of data consists of 1 byte of header information (parameter information) RF and 8 bytes of sample data DAO to D1. . Above header information R
F includes the above-mentioned range information of 4 bits, the above-mentioned filter selection information of 2 bits, and two flag information of 1 bit each, for example, information Ll indicating the presence or absence of a loop, and information indicating whether there is a terminal block (end block) of the waveform. It is made up of the information El shown. Here, the peak value data for one sample is
The data is compressed and expressed in 4 bits, and the data DA to D0 includes 4-bit data DAOH-D131 for 16 samples.

次に第２４図は、第６図に示すような楽音信号波形の先
頭部分に対応する上記準瞬時（ブロック化）ビット圧縮
符号化された波高値データの各ブロックを示している。Next, FIG. 24 shows each block of the quasi-instantaneous (blocked) bit compression encoded peak value data corresponding to the leading portion of the musical tone signal waveform as shown in FIG.

この第２４図においては、上記ヘッダを省略して波高値
データのみを示しており、図示の都合上１ブロツクを８
サンプルとしているが、１ブロツク１６サンプル等のよ
うに任意に設定可能であることは勿論である。これは、
前記第１８図の場合も同様である。In this Fig. 24, the header is omitted and only the peak value data is shown, and for convenience of illustration, one block is divided into 8
Although this is a sample, it goes without saying that it can be set arbitrarily, such as 16 samples per block. this is,
The same applies to the case shown in FIG. 18 above.

ここで、上記準瞬時ビット圧縮符号システムは、上記入
力楽音信号を直接出力するモードすなわちストレートＰ
ＣＭモードと、楽音信号をフィルタを介して出力するモ
ードすなわち１次または２次差分フィルタモードのうち
、最も高い圧縮率を有する信号が得られるモードを選択
して、出力信号である楽音データを伝送するようにした
ものである。Here, the above-mentioned quasi-instantaneous bit compression code system has a mode in which the above-mentioned input musical tone signal is directly output, that is, a straight P
Select the mode in which a signal with the highest compression rate is obtained from the CM mode and the mode in which the musical tone signal is output through a filter, that is, the primary or secondary differential filter mode, and transmit the musical tone data that is the output signal. It was designed to do so.

楽音をサンプリングしてメモリ等の記憶媒体に記録する
する場合、上記楽音の楽音信号波形は発音開始点ＫＳで
波形取り込みが開始されるものであるが、この発音開始
点ＫＳからの最初のブロックにて１次または２次差分フ
ィルタモード等のように初期値が必要なフィルタモード
が選択されると、この初期値を予め用意しておく必要が
生じるため、このような初期値の必要のない形態とする
ことが望まれる。このため、上記発音開始点ＫＳに先行
する期間に、上記ストレートＰＣＭモード（入力楽音信
号を直接出力するモード）が選択されるような疑似入力
信号を付加した後、その入力信号を含めて信号処理する
ようにしている。When sampling a musical tone and recording it in a storage medium such as a memory, the musical tone signal waveform of the musical tone starts to be captured at the sound generation start point KS. When a filter mode that requires an initial value, such as the 1st or 2nd order differential filter mode, is selected, it is necessary to prepare this initial value in advance. It is desirable that Therefore, after adding a pseudo input signal that selects the straight PCM mode (a mode in which the input musical tone signal is directly output) in the period preceding the sound generation start point KS, signal processing including that input signal is performed. I try to do that.

すなわち具体的には、第２４図において、上記発音開始
点ＫＳに先行して、上記疑似入力信号としてデータを全
て“０”としたブロックを配置し、このブロックの先頭
からの全データ“０”をサンプリング波高値データとし
てビット圧縮処理して取り込むようにしている。これは
、例えば、予めｌブロックのデータが全て“０”のブロ
ック作成しておきこれをメモリ等にストアしておいて用
いるか、または、楽音をサンプリングする際に上記発音
開始点ＫＳの前にデータが全て′０゛の部分（すなわち
発音開始前の無音部分）の入力信号からサンプリングを
開始する等により得ることができる。なお、上記擬似入
力信号のブロックは最低ｌブロック以上である。Specifically, in FIG. 24, a block in which all data is "0" is placed as the pseudo input signal prior to the sound generation start point KS, and all data from the beginning of this block is "0". The data is bit-compressed and imported as sampled peak value data. This can be done, for example, by creating a block whose data is all "0" in advance and storing it in a memory etc., or by creating a block in which the data of the l block is all "0" and storing it in a memory etc. This can be obtained by starting sampling from an input signal where all data are '0' (that is, a silent part before the start of sound generation). Note that the number of blocks of the pseudo input signal is at least l blocks.

上述のようにして形成された擬似入力信号を含んだ楽音
データを、前述の第２２図に示すような高能率ビット圧
縮符号化システムにより信号圧縮処理し、メモリ等の記
憶媒体に記録させておき、この圧縮処理された信号を再
生する。The musical tone data including the pseudo input signal formed as described above is subjected to signal compression processing using a high-efficiency bit compression encoding system as shown in FIG. , reproduces this compressed signal.

したがって、上記擬似入力信号を含んだ楽音データを再
生する場合、再生開始時（擬似入力信号のブロック部分
）のフィルタにストレートＰＣＭモードが選択されるた
め、１次または２次差分フィルタの初期値をあらかじめ
設定しておく必要がなくなる。Therefore, when playing musical tone data including the above pseudo input signal, the straight PCM mode is selected for the filter at the start of playback (block portion of the pseudo input signal), so the initial value of the first or second difference filter is There is no need to configure settings in advance.

ここで、再生開始時に上記擬似入力信号（データが全て
Ｏｎであるため無音である。）による発音開始時間の遅
れについての懸念がある。しかし、例えば、サンプリン
グ周波数３２ｋＨｚで１ブロツク１６サンプルとした場
合、上記発音時間の遅れは約０．５　ｍ５ｅｃとなり聴
覚上で識別できる遅れではなく問題にならない。Here, there is a concern about a delay in the sound generation start time due to the pseudo input signal (there is no sound because all the data is on) at the start of playback. However, for example, when the sampling frequency is 32 kHz and one block has 16 samples, the delay in the sound generation time is approximately 0.5 m5ec, which is not a delay that can be discerned audibly and does not pose a problem.

とごろで、上記ビット圧縮符号化処理やその他の音源デ
ータ生成のためのディジタル信号処理については、ディ
ジタル信号処理袋ｚ（ｏｓｐ）を用いてソフトウェア的
に実現することが多く行われており、また記録された音
源データの再生にもＤＳＰを用いたソフトウェア的な構
成が採用されることが多い。第２５図はその一例として
、音源データを取り扱う音源ユニットとしてのオーディ
オ・プロセッシング・ユニット（ＡＰＵ）１０７及びそ
の周辺を含むシステムの全体構成例を示している。Nowadays, the above-mentioned bit compression encoding processing and other digital signal processing for generating sound source data are often implemented in software using a digital signal processing bag Z (OSP). A software configuration using a DSP is often employed to reproduce recorded sound source data as well. As an example, FIG. 25 shows an example of the overall configuration of a system including an audio processing unit (APU) 107 as a sound source unit that handles sound source data and its surroundings.

この第２５回において、例えば一般のパーソナルコンピ
ュータ装置や、ディジタル電子楽器、ＴＶゲーム機等に
設けられているホストコンピュータ１０４は、上記音源
ユニットとしてのＡＰＵ　１０７と接続されており、該
ホストコンピュータ１０４からは音源データ等がＡＰＵ
１０７にロードされるようになっている。このＡＰＵ　
Ｉ　０７は、マイクロプロセッサ等のＣＰＵ　（中央処
理装置）１０３と、ＤＳＰ　（ディジタル信号処理装置
）１０１と、上述したような音源データ等が記憶された
メモリ１０２とを少なくとも有して構成されるものであ
る。すなわち、このメモリ１０２には少なくとも音源デ
ータが記憶されており、上記ＤＳＰ１０１により該音源
データの読み出し制御を含む各種処理、例えばルーピン
グ処理、ビット伸長（復元）処理、ピッチ変換処理、エ
ンベロープの付加、エコー（リバーブ）処理等が施され
る。メモリ１０２は、これらの各種処理のためのバッフ
ァメモリとしても用いられる。ＣＰＵ１０３は、ＤＳＰ
ＩＯＩのこれらの各種処理の動作や内容等についての制
御を行うものである。In this 25th episode, a host computer 104 installed in, for example, a general personal computer device, a digital electronic musical instrument, a TV game machine, etc. is connected to the APU 107 as the sound source unit, and The sound source data etc. are APU
107. This APU
I07 is configured to include at least a CPU (Central Processing Unit) 103 such as a microprocessor, a DSP (Digital Signal Processing Unit) 101, and a memory 102 in which the above-mentioned sound source data etc. are stored. It is. That is, this memory 102 stores at least sound source data, and the DSP 101 performs various processing including readout control of the sound source data, such as looping processing, bit expansion (restoration) processing, pitch conversion processing, addition of an envelope, and echo processing. (Reverberation) processing etc. are applied. The memory 102 is also used as a buffer memory for these various processes. The CPU 103 is a DSP
It controls the operations and contents of these various processes of the IOI.

さらに、メモリ＋０２からの上記音源データに対してＤ
ＳＰ　１０１により上記各種処理を施して最終的に得ら
れたディジタル楽音データは、ディジタル／アナログＣ
Ｄ／Ａ）コンバータ１０５によりアナログ信号に変換さ
れてスピーカ１０６に供給されるようになっている。Furthermore, D for the above sound source data from memory +02.
The digital musical tone data finally obtained by performing the various processing described above by SP 101 is converted into digital/analog C
The signal is converted into an analog signal by a D/A converter 105 and supplied to a speaker 106.

なお、本発明は上述した実施例のみに限定されるもので
はなく、例えば、上述の実施例においてはフォルマント
部分とルーピング区間とを接続して音源データを形成し
ていたが、ルーピング区間のみから成る音源データを形
成する場合にも容易に適用可能である。また、音源デー
タのみならず、音声合成等の他の用途に本発明を適用す
ることもできる。また、本発明における音源データの生
成とは、再生側でのデータの生成と記録側でのデータの
生成のそれぞれの可能性を含んでいるものである。Note that the present invention is not limited to the embodiments described above; for example, in the embodiments described above, the formant part and the looping section were connected to form the sound source data, but the sound source data consists only of the looping section. It can also be easily applied to the case of forming sound source data. Further, the present invention can be applied not only to sound source data but also to other uses such as speech synthesis. Furthermore, the generation of sound source data in the present invention includes the possibilities of data generation on the reproduction side and data generation on the recording side.

〔Effect of the invention〕

本発明によれば、繰り返し波形を補間してその補間サン
プルを含めて接続サンプルとして用いているため、接続
点での波形の連続性が改善される。According to the present invention, since the waveform is repeatedly interpolated and the interpolated samples are used as connection samples, the continuity of the waveform at the connection point is improved.

したがって、繰り返し点の非連続性からくるルーピング
ノイズが発生しない音源データの生成方法を得ることが
できる。また、音源データのピッチ変換に補間方式を用
いている場合には、ハードウェアの増加なく本発明を適
用できる。Therefore, it is possible to obtain a method of generating sound source data that does not generate looping noise due to discontinuity of repetition points. Further, when an interpolation method is used for pitch conversion of sound source data, the present invention can be applied without increasing hardware.

[Brief explanation of the drawing]

第１図は本発明の音源データの生成方法の原理を説明す
るための波形図、第２図は波形の接続状態を示す波形図
、第３図はピッチ変換を説明するための波形図、第４図
は補間処理の具体例を説明するためのブロック回路図、
第５図はルーピング処理の接続点とブロックとの関係を
説明するための模式図、第６図は楽音信号波形図、第７
図は本発明の他の具体例を説明するための機能ブロック
図、第８図はピッチ検出動作を説明するための機能ブロ
ック図、第９図はピーク検出動作を説明するためのブロ
ック図、第１０図は楽音信号及びエンベロープの波形図
、第１１図は楽音信号のデイケイレート情報の波形図、
第１２図はエンベロープ検出動作を説明するための機能
ブロック図、第１３図はＦ！Ｒフィルタの特性図、第１
４図は楽音信号のエンベロープ補正された後の波高値デ
ータを示す波形図、第１５図は櫛形フィルタの特性図、
第１６図は最適ルーピングポイントの設定動作を説明す
るための波形図、第１７図は時間軸補正の前後の楽音信
号を示す波形図、第１８図は時間軸補正後の波高値デー
タについて準瞬時ビット圧縮用のブロックの構造を示す
模式図、第１９図はルーピング区間の波形を繰り返し接
続されて得られるループデータを示す波形図、第２０図
はデイケイレート情報に基づくエンベロープ補正後のフ
ォルマント部分生成用データを示す波形図、第２１図は
現実のルーピング処理前後の動作を説明するためのフロ
ーチャート、第２２図は準瞬時ビット圧縮符号化システ
ムの概略構成を示すブロック回路図、第２３図は準瞬時
ビット圧縮符号化されて得られたデータの１ブロツクの
具体例を示す模式図、第２４図は楽音信号の先頭部分の
プロ・ツクの内容を示す模式図、第２５図はオーディオ
・プロセッシング・ユニット及びその周辺を含むシステ
ムの構成例を示すブロック図である。ピッチ１２侯を吉も君するための収形図第３図＋Ｌ−７”スタート？Ｆレヌ第５図ピー／＋梗七創作第８図！キーオン一■橢− ティフイし一ト斗」収第１１図７オ！しマレヒ部分住族用ザータ第２０図エンベローフ→出１作第１２図ＦＩＲ’フィルタの特性第１３図第１４図＠ｌ収叡− 箱監吟フイルダ／ｌ乍弔肱第１５図硯尖のＩＬ−ピニグデび１四乍等の誰Ｂ目第２１図FIG. 1 is a waveform diagram for explaining the principle of the sound source data generation method of the present invention, FIG. 2 is a waveform diagram showing the connection state of waveforms, FIG. 3 is a waveform diagram for explaining pitch conversion, and FIG. Figure 4 is a block circuit diagram for explaining a specific example of interpolation processing,
FIG. 5 is a schematic diagram for explaining the relationship between connection points and blocks in looping processing, FIG. 6 is a musical tone signal waveform diagram, and FIG.
8 is a functional block diagram for explaining another specific example of the present invention, FIG. 8 is a functional block diagram for explaining pitch detection operation, FIG. 9 is a block diagram for explaining peak detection operation, and FIG. Figure 10 is a waveform diagram of the musical tone signal and envelope, Figure 11 is a waveform diagram of decay rate information of the musical tone signal,
FIG. 12 is a functional block diagram for explaining envelope detection operation, and FIG. 13 is F! Characteristic diagram of R filter, 1st
Figure 4 is a waveform diagram showing peak value data after envelope correction of the musical tone signal, Figure 15 is a characteristic diagram of the comb filter,
Figure 16 is a waveform diagram for explaining the operation of setting the optimal looping point, Figure 17 is a waveform diagram showing musical tone signals before and after time axis correction, and Figure 18 is a quasi-instantaneous waveform diagram for wave height data after time axis correction. A schematic diagram showing the structure of a block for bit compression. Figure 19 is a waveform diagram showing loop data obtained by repeatedly connecting waveforms in the looping section. Figure 20 is a formant part after envelope correction based on decay rate information. FIG. 21 is a flowchart for explaining operations before and after actual looping processing, FIG. 22 is a block circuit diagram showing the schematic configuration of a quasi-instantaneous bit compression encoding system, and FIG. 23 is a waveform diagram showing generation data. A schematic diagram showing a specific example of one block of data obtained by quasi-instantaneous bit compression encoding. Figure 24 is a schematic diagram showing the contents of the processing at the beginning of a musical tone signal. Figure 25 is an audio processing diagram. - It is a block diagram showing a configuration example of a system including a unit and its surroundings. Pitch 12 Comprehensive drawings to rule the 12 marquis 3rd figure + L-7" start? F Renu 5th figure P/+ Kyoshichi creation 8th figure! 11 Figure 7 Oh! Zata for Shimarehi partial tribes Figure 20 Envelope → Exit 1 Figure 12 Characteristics of the FIR filter Figure 13 Figure 14 -Pinigudebi 145 etc. Who B's Figure 21

Claims

[Claims]

A sound source data generation method in which a plurality of interpolated samples are also generated by interpolating waveform data of a predetermined period consisting of a plurality of real samples, and a point with the closest value between the real sample and the interpolated sample is repeatedly used as a connected sample of the waveform.