JP2751262B2

JP2751262B2 - Signal recording method and apparatus

Info

Publication number: JP2751262B2
Application number: JP63292937A
Authority: JP
Inventors: 雅一鈴置
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1988-11-19
Filing date: 1988-11-19
Publication date: 1998-05-18
Anticipated expiration: 2013-05-18
Also published as: JPH02137889A

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は、例えば楽音信号等のアナログ信号又はその
アナログ信号に対応するディジタル信号を記憶媒体に記
録するための信号記録方法及び装置に関するものであ
る。Description: BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a signal recording method and apparatus for recording an analog signal such as a tone signal or a digital signal corresponding to the analog signal on a storage medium. is there.

[Summary of the Invention]

本発明は、楽音信号等のアナログ信号又は該信号に対
応するディジタル信号のような入力信号を櫛形フィルタ
に供給し、入力信号の基本周波数及びその整数倍の周波
数の近傍のみを通過させ、その出力信号の適当な繰り返
し波形区間を抽出して記憶媒体に記録することにより、
入力信号に含まれるノイズを低減し、記録された波形の
繰り返し再生に伴うノイズ発生を抑え得るような信号記
録方法を提案するものである。The present invention provides an input signal such as an analog signal such as a musical tone signal or a digital signal corresponding to the signal to a comb filter, passes only a fundamental frequency of the input signal and a frequency near an integral multiple thereof, and outputs the signal. By extracting an appropriate repetitive waveform section of the signal and recording it on a storage medium,
It is an object of the present invention to propose a signal recording method capable of reducing noise included in an input signal and suppressing occurrence of noise due to repeated reproduction of a recorded waveform.

[Conventional technology]

一般に、電子楽器やTVゲーム器等に用いられる音源
は、VCO、VCA、VCF等から成るアナログ音源と、PSG（プ
ログラマブル・サウンド・ジェネレータ）や波形ROM読
み出しタイプ等のディジタル音源とに大別される。この
ディジタル音源の一種として、近年においては、生の楽
器音等をサンプリングしてディジタル処理した音源デー
タをメモリ等に記憶させて用いるようなサンプラー音源
も広く知られるようになってきている（例えば、特開昭
62-264099号公報、特開昭62-267798号公報参照）。In general, sound sources used in electronic musical instruments and video game consoles are roughly classified into analog sound sources such as VCO, VCA, VCF and the like, and digital sound sources such as PSG (programmable sound generator) and waveform ROM reading type. . In recent years, as one type of the digital sound source, a sampler sound source in which a raw musical instrument sound or the like is sampled and digitally processed sound source data is stored in a memory or the like and used has been widely known (for example, JP
62-264099 and JP-A-62-267798).

このサンプラー音源においては、一般的に音源データ
記憶用のメモリに大容量を要することから、メモリ節約
のための手法が各種提案されており、例えば、楽音波形
の周期性を利用したルーピング処理や、非線形量子化等
によるビット圧縮処理がその代表的なものとして挙げら
れる。In this sampler sound source, since a memory for sound source data storage generally requires a large capacity, various methods for saving the memory have been proposed, for example, a looping process using periodicity of a musical sound waveform, A typical example thereof is a bit compression process using nonlinear quantization or the like.

上記ルーピング処理は、サンプリングされた楽音の元
の持続時間よりも長い時間音を出し続けるための一手法
でもある。すなわち、例えば楽音信号波形を考えると
き、一般に発音開始直後においてはピアノの打鍵ノイズ
や管楽器のブレスノイズ等の非音程成分が含まれること
により、波形の周期性が不明瞭なフォルマント部分が生
じており、その後、楽音の音程（ピッチ、音高）に対応
する基本周期で同じ波形が繰り返し現れるようになる。
この繰り返し波形のｎ周期分（ｎは整数）をルーピング
区間とし、必要に応じて繰り返し再生することにより、
少ないメモリ容量で長時間の持続音を得ることができる
わけである。The looping process is also a technique for continuously outputting a sound longer than the original duration of the sampled musical sound. That is, for example, when considering a musical tone signal waveform, a formant portion in which the periodicity of the waveform is unclear occurs immediately after the start of sounding because non-pitch components such as a tapping noise of a piano and a breath noise of a wind instrument are included. After that, the same waveform repeatedly appears in the basic period corresponding to the musical pitch (pitch, pitch).
By setting n cycles (n is an integer) of this repetitive waveform as a looping section and repeating the reproduction as necessary,
That is, a long-lasting sound can be obtained with a small memory capacity.

[Problems to be solved by the invention]

しかしながら、上述のようにサンプリング音源にルー
ピングの手法を用いる場合、ルーピング処理を行った波
形は完全に周期関数となり、ループの周波数に対して、
非整数次の倍音成分を全く持つことができない。そし
て、音源が元々持っていたこの非整数次倍音はルーピン
グノイズとなってルーピング波形に表れる。However, when the looping method is used for the sampled sound source as described above, the looped waveform is completely a periodic function, and the frequency of the loop is
It cannot have any non-integer harmonic components. Then, this non-integer harmonic that the sound source originally had appears as a looping waveform as looping noise.

このルーピングノイズは、スペクトル上の周波数成分
を持ち聴感上好ましくないため、なるべく除去しなけれ
ばならない。Since this looping noise has frequency components on the spectrum and is not preferable in terms of audibility, it must be removed as much as possible.

また、サンプリングされメモリに記憶された楽音デー
タは、実際の楽音をそのままディジタル処理して記憶媒
体に記録したものであるため、サンプリング時の音質で
再生時の音質も決定されてしまう。例えばサンプリング
時の音質がノイズ成分の多いものであった場合には、記
憶媒体から読み出されて再生された楽音信号もノイズ成
分をそのまま含んだものとなる。また、サンプリングす
る楽音にいわゆるビブラートがかかっている場合には、
微小なFM変調がかかっていることになるため、上記ルー
ピング処理の際に、このFM変調によって生じた側波成分
が非整数倍音成分となり、ルーピングノイズとなって再
生されてしまう。Further, since the musical sound data sampled and stored in the memory is obtained by digitally processing the actual musical sound as it is and recording it on a storage medium, the sound quality at the time of reproduction is determined by the sound quality at the time of sampling. For example, if the sound quality at the time of sampling has a lot of noise components, the tone signal read out from the storage medium and reproduced will also contain the noise components as they are. Also, if the sampled sound has so-called vibrato,
Since the minute FM modulation is applied, the side wave component generated by the FM modulation becomes a non-integer harmonic component at the time of the looping processing, and is reproduced as looping noise.

そこで、本発明は上述のような欠点を解決するために
提案されたものであって、楽音データを櫛形フィルタを
介して記録媒体に記録することにより、非整数倍音成分
を除去した楽音データを得ることができ、再生時にルー
ピングノイズを減少させルーピングを円滑に行えるよう
にする信号記録方法及び装置を提供することを目的とす
るものである。Therefore, the present invention has been proposed in order to solve the above-mentioned drawbacks, and obtains tone data from which non-integer harmonic components have been removed by recording the tone data on a recording medium via a comb filter. It is an object of the present invention to provide a signal recording method and apparatus capable of reducing looping noise at the time of reproduction and smoothing looping.

[Means for solving the problem]

本発明に係る信号記録方法は、上述の目的を達成する
ために、入力アナログ信号又はそのアナログ信号に対応
する入力ディジタル信号を上記入力アナログ信号の基本
周波数及びその高調波成分の周波数帯域のみを通過帯域
とする櫛形フィルタに供給して出力アナログ信号又はデ
ィジタル信号を得る工程と、上記櫛形フィルタからの出
力アナログ又はディジタル信号の適当な繰り返し波形区
間を抽出する工程と、この抽出された繰り返し波形区間
を記憶媒体に記録する工程とを有することを特徴として
いる。In order to achieve the above object, a signal recording method according to the present invention passes an input analog signal or an input digital signal corresponding to the analog signal only through a fundamental frequency of the input analog signal and a frequency band of a harmonic component thereof. A step of supplying an output analog signal or a digital signal to the comb filter as a band, a step of extracting an appropriate repeating waveform section of the output analog or digital signal from the comb filter, and a step of extracting the extracted repeating waveform section. Recording on a storage medium.

また、本発明に係る信号記録装置は、上述の目的を達
成するために、入力アナログ信号又はそのアナログ信号
に対応する入力ディジタル信号が供給され、上記入力ア
ナログ信号の基本周波数及びその高調波成分の周波数帯
域のみを通過帯域とする櫛形フィルタと、この櫛形フィ
ルタからの出力アナログ又はディジタル信号の適当な繰
り返し波形区間を抽出する抽出手段と、この抽出手段か
らの繰り返し波形区間を記憶媒体に記録する手段とを有
することを特徴とするものである。In order to achieve the above object, the signal recording device according to the present invention is supplied with an input analog signal or an input digital signal corresponding to the analog signal, and outputs a fundamental frequency of the input analog signal and a harmonic component thereof. Comb filter having only a frequency band as a pass band, extraction means for extracting an appropriate repetitive waveform section of an analog or digital signal output from the comb filter, and means for recording the repetition waveform section from the extraction means on a storage medium And characterized in that:

[Action]

楽音をその楽音の基音とその倍音のみを通す櫛型フィ
ルタに通すことによって、楽音信号中の音程成分以外の
成分（非音程成分及び一部のノイズ成分）を減衰させて
SN比を改善することができる。またルーピング処理する
場合、ノイズ成分等が減衰された楽音データをルーピン
グするため、ルーピングノイズを抑えることができる。By passing the musical tone through a comb filter that passes only the fundamental tone of the musical tone and its harmonics, components other than the pitch component (non-pitched components and some noise components) in the musical tone signal are attenuated.
The SN ratio can be improved. Further, in the case of performing the looping processing, since the tone data in which the noise component and the like are attenuated is looped, looping noise can be suppressed.

〔Example〕

先ず、本発明に係る信号記録方法の基本的な実施例に
ついて、第１図のフローチャートを参照しながら説明す
る。First, a basic embodiment of the signal recording method according to the present invention will be described with reference to the flowchart of FIG.

この第１図の最初のステップS1において、楽音信号等
の入力アナログ信号又はその入力信号の基本周波数f
₀（ピッチ情報）を検出し、次に、ステップS2におい
て、上記入力アナログ信号の基本周波数帯域及びその高
調波成分の周波数帯域のみを通過帯域とする櫛形フィル
タでフィルタリングして出力アナログ信号又はディジタ
ル信号を得ると共に、ステップS3で上記出力アナログ又
はディジタル信号の基本周波数帯域及びその高調波成分
の周波数帯域のみが通過帯域となるように（抽出される
ように）制御し、ステップS4で記憶媒体に記録する。In the first step S1 of FIG. 1, an input analog signal such as a tone signal or a fundamental frequency f
₀ (pitch information) is detected, and then, in step S2, the output analog signal or digital signal is filtered by a comb filter having only the fundamental frequency band of the input analog signal and the frequency band of its harmonic components as pass bands. In step S3, control is performed so that only the fundamental frequency band of the output analog or digital signal and the frequency band of its harmonic components are set as pass bands (extracted), and recorded in the storage medium in step S4. I do.

次に、より具体的な実施例の説明に先立って、第２図
に示す楽音信号波形を参照しながら、前述したルーピン
グ処理について簡単に説明する。一般に発音開始直後に
おいてはピアノの打鍵ノイズや管楽器のブレスノイズ等
の非音程成分が含まれることにより、波形の周期性が不
明瞭な部分であるフォルマント部分FRが生じており、そ
の後、楽音の音程（ピッチ、音高）に対応する基本周期
で同じ波形が繰り返し現れるようになる。この繰り返し
波形のｎ周期分（ｎは整数）をルーピング区間LPとし、
このルーピング区間LPはルーピング開始点LP_Sとルーピ
ング終端点LP_Eのルーピングポイント間で表されるもの
である。そして上記フォルマント部分FRとルーピング区
間LPとを記憶媒体に記録し、再生時にはフォルマント部
分FRの再生に続いてルーピング区間LPを繰り返し再生す
ることにより、任意の長時間に亘って楽音を発生させる
ことができる。Next, prior to the description of a more specific embodiment, the above-described looping processing will be briefly described with reference to a tone signal waveform shown in FIG. In general, immediately after the start of sound production, non-pitch components such as piano tapping noise and wind instrument breath noise are included, so that a formant part FR, in which the periodicity of the waveform is unclear, occurs. The same waveform repeatedly appears in the basic cycle corresponding to (pitch, pitch). N cycles (n is an integer) of this repetitive waveform are defined as a looping section LP,
The looping section LP is represented by the inter-looping points looping start point LP _S and the looping end point LP _E. Then, the formant part FR and the looping section LP are recorded on a storage medium, and at the time of reproduction, the tone is generated for an arbitrary long time by repeatedly reproducing the looping section LP subsequent to the reproduction of the formant part FR. it can.

以下、本発明の一実施例について図面を参照しながら
説明する。なお、本発明は以下の実施例に限定されるも
のでないことは言うまでもない。Hereinafter, an embodiment of the present invention will be described with reference to the drawings. It goes without saying that the present invention is not limited to the following examples.

第３図は、本発明実施例の音源データ圧縮符号化方法
を音源データ形成装置に適用する際に、入力楽音信号を
サンプリングして記憶媒体に記録するまでの各機能の具
体例を示す機能ブロック図である。この場合の入力端子
10に供給される入力楽音信号としては、例えばマイクロ
フォンで直接収音した信号、あるいはディジタル・オー
ディオ信号記録媒体等を再生して得られた信号を、アナ
ログ信号あるいはディジタル信号の形態で用いることが
できる。FIG. 3 is a functional block diagram showing specific examples of functions from sampling an input tone signal to recording it on a storage medium when the sound source data compression encoding method according to the embodiment of the present invention is applied to a sound source data forming apparatus. FIG. Input terminal in this case
As the input tone signal supplied to 10, for example, a signal directly collected by a microphone or a signal obtained by reproducing a digital audio signal recording medium or the like can be used in the form of an analog signal or a digital signal. .

先ず、第３図のサンプリング処理機能ブロック11にお
いては、上記入力楽音信号を例えば周波数38kHzでサン
プリングし、１サンプル16ビットのディジタルデ、タと
して取り出している。このサンプリング処理とは、上記
入力楽音信号がアナログ信号の場合のA/D変換処理に対
応するものであり、また入力信号がディジタル信号の場
合にはサンプリングレート変換及びビット数変換の処理
に対応するものである。First, in the sampling processing function block 11 shown in FIG. 3, the input tone signal is sampled at a frequency of, for example, 38 kHz, and extracted as 16-bit digital data per sample. This sampling processing corresponds to A / D conversion processing when the input tone signal is an analog signal, and corresponds to sampling rate conversion and bit number conversion processing when the input signal is a digital signal. Things.

次に、ピッチ検出機能ブロック12において、上述のサ
ンプリング処理により得られたディジタル楽音信号につ
いての楽音の音程（ピッチ）を決定する基音の周波数
（基本周波数）f₀、すなわちピッチ情報が検出される。Next, in the pitch detection function block 12, a fundamental tone frequency (fundamental frequency) f ₀ for determining a musical tone pitch (pitch) of the digital tone signal obtained by the above-described sampling processing, that is, pitch information is detected.

このピッチ検出機能ブロック12における検出原理を説
明する。ここで、サンプリング音源となる楽音信号は、
その基音となる周波数がサンプリング周波数f_Sに比べて
かなり低い場合が多く、周波数軸で楽音のピークを検出
するだけでは高い精度での音程の同定が難しい。したが
って、何らかの手段を用いて、楽音の倍音成分のスペク
トルを利用する必要がある。The principle of detection in the pitch detection function block 12 will be described. Here, the musical tone signal as the sampling sound source is
Its fundamental become frequency may considerably lower number than the sampling frequency f _S, just the identification of pitch with high accuracy is difficult to detect the peak of a musical tone at the frequency axis. Therefore, it is necessary to use some means to use the spectrum of the overtone component of the musical tone.

先ず、音程を検出したい楽音信号の波形をｆ（ｔ）と
すれば、この楽音波形ｆ（ｔ）を各倍音成分の振幅ａ
（ω）および位相φ（ω）で表せば、該楽音波形ｆ
（ｔ）はフーリエ展開した式、で表せる。ここで、各倍音の位相のずれφ（ω）を全
てゼロにすると、の式で表せるものとなる。このように位相の揃えられ
た楽音波形（ｔ）のピークは楽音波形（ｔ）の持つ
全ての倍音の周期の整数倍の点およびｔ＝０の点であ
る。これは基音の周期にほかならない。First, assuming that the waveform of a tone signal whose pitch is to be detected is f (t), the tone waveform f (t) is represented by the amplitude a of each harmonic component.
(Ω) and phase φ (ω), the tone waveform f
(T) is a Fourier-expanded equation, Can be represented by Here, if the phase shift φ (ω) of each overtone is all zero, It can be expressed by the following equation. The peaks of the musical tone waveform (t) whose phases are aligned in this manner are points at integer multiples of the period of all overtones of the musical tone waveform (t) and at t = 0. This is nothing but the period of the fundamental tone.

この原理をふまえて、ピッチ検出の手順を第４図に示
す機能ブロック図を用いて説明する。Based on this principle, the procedure of pitch detection will be described with reference to a functional block diagram shown in FIG.

第４図において、実部データ入力端子31より楽音デー
タを、また虚部データ入力端子32より“0"を、高速フー
リエ変換（FFT）機能ブロック33に供給する。In FIG. 4, tone data is supplied from a real part data input terminal 31 and “0” is supplied from an imaginary part data input terminal 32 to a fast Fourier transform (FFT) function block 33.

ここで、上記高速フーリエ変換機能ブロック33で行わ
れる高速フーリエ変換において、ピッチを推定する楽音
信号をｘ（ｔ）とし、また、上記楽音信号ｘ（ｔ）に含
まれる倍音成分を a_ncos（２πｆ_ｎｔ＋θ）・・・・・・とすれば、ｘ（ｔ）はこれを複素表示で書き直して、ただし、 cosθ＝（exp（ｊθ）＋exp（−ｊθ）/2 ・・を用いた。この式をフーリエ変換すると、ここで、δ（ω−ω_ｎ）はデルタ関数である。Here, in a fast Fourier transformation performed by the fast Fourier transform function block 33, the musical tone signal to estimate the pitch is x (t), also a harmonic component included in the sound signal x (t) a _n cos ( 2πf _n t + θ)..., X (t) becomes Rewrite this in complex notation, Here, cosθ = (exp (jθ) + exp (−jθ) / 2... Is used. Here, δ (ω−ω _n ) is a delta function.

次の機能ブロック34で該高速フーリエ変換後のデータ
のノルム（絶対値、すなわち実部と虚部をそれぞれ２乗
したものの和の平方根）を算出する。In the next function block 34, the norm (absolute value, that is, the square root of the sum of the squares of the real part and the imaginary part) of the data after the fast Fourier transform is calculated.

すなわち、Ｘ（ω）の絶対値Ｙ（ω）を取ると、位相
成分がキャンセルされて、Ｙ（ω）＝［Ｘ（ω）▲▼］^1/2＝（1/2）a_nδ
（ω−ω_ｎ）・・・・これは、上記楽音データの高周波成分の全ての位相を
合わせるために成されるものであり、上記虚部をゼロに
することにより、位相成分を揃えることができる。That is, taking the X (omega) of the absolute value Y (omega), the phase component is canceled, Y (ω) = [X (ω) ▲ ▼] 1/2 = (1/2) a n δ
(.Omega .-. _Omega..sub.n ) This is performed to match the phases of all the high-frequency components of the musical tone data. By setting the imaginary part to zero, the phase components can be aligned. it can.

次に、この算出されたノルムを高速フーリエ変換（こ
の場合は逆FFTに相当）機能ブロック36に実部データと
して供給し、虚部データ入力端子35には“0"を供給して
逆FFTをかけ楽音データを復元する。すなわち、上記逆
フーリエ変換は、である。この逆フーリエ変換後の復元された楽音デー
タは、全ての高周波成分の位相が揃ったコサイン波の合
成で表せる波形として取り出されるものである。Next, the calculated norm is supplied to the fast Fourier transform (corresponding to the inverse FFT in this case) function block 36 as the real part data, and “0” is supplied to the imaginary part data input terminal 35 to perform the inverse FFT. Restore the tone data. That is, the inverse Fourier transform is It is. The restored tone data after the inverse Fourier transform is extracted as a waveform that can be expressed by synthesizing a cosine wave in which all high-frequency components have the same phase.

その後、ピーク検出機能ブロック37で上記復元された
音源データのピークを検出する。ここで、上記ピークは
上記楽音データの全ての高周波成分の極値（ピーク）が
一致した点であり、次の機能ブロック38において上記検
出されたピーク値を値の大きい方から分類（ソート）す
る。上記検出されたピークの周期を計測することによ
り、楽音信号の音程を知ることができる。Thereafter, the peak of the restored sound source data is detected by the peak detection function block 37. Here, the peak is a point where the extreme values (peaks) of all the high frequency components of the musical tone data coincide with each other, and in the next function block 38, the detected peak values are classified (sorted) from the larger value. . By measuring the period of the detected peak, the pitch of the tone signal can be known.

第５図は、第４図のピーク検出機能ブロック37におけ
る楽音データの極大値（ピーク）を検出するための構成
について説明するためのものである。FIG. 5 is a diagram for explaining a configuration for detecting the maximum value (peak) of the musical sound data in the peak detection function block 37 of FIG.

この場合上記楽音データは、値の異なったピーク（極
値）が多数存在するものであり、上記楽音データの最大
値を求めてその周期を検出することで楽音の音程を知る
ことができる。In this case, the musical tone data has many peaks (extreme values) having different values, and the pitch of the musical tone can be known by finding the maximum value of the musical tone data and detecting its cycle.

すなわち第５図において、逆フーリエ変換後の楽音デ
ータ列は、入力端子41を介しＮ＋１段のシフトレジスタ
42に供給され、このシフトレジスタ42の各段のレジスタ
a_-N/2…a₀…a_N/2を順次介して出力端子43に送られてい
る。このＮ＋１段のシフトレジスタ42は上記楽音データ
列に対して幅がＮ＋１サンプル分のウィンドウとして作
用し、該楽音データ列のＮ＋１サンプルが上記ウィンド
ウを介して最大値検出回路44に送られる。すなわち、上
記楽音データは最初にレジスタa_-N/2に入力した後レジ
スタa_N/2まで順次伝送され、各々のレジスタa_-N/2…a₀
…a_N/2からのＮ＋１サンプルの上記各楽音データが最大
値検出回路44に送られる。That is, in FIG. 5, the tone data string after the inverse Fourier transform is transferred to an N + 1-stage shift register via an input terminal 41.
42, and the register of each stage of the shift register 42
a _{-N / 2} ... a ₀ ... a _{N / 2} are sequentially sent to the output terminal 43. The N + 1-stage shift register 42 acts as a window having a width of N + 1 samples for the tone data string, and N + 1 samples of the tone data string are sent to the maximum value detection circuit 44 via the window. That is, the music data is first sequentially transmitted to the register a _{N / 2} after the input to the register a _{-N / 2,} each register a _{-N / 2} ... a ₀
.. A The N + 1 samples of tone data from _{N / 2} are sent to the maximum value detection circuit 44.

この最大値検出回路44は、上記シフトレジスタ42内の
例えば中央のレジスタa₀の値が上記Ｎ＋１サンプルのデ
ータの各値の内で最大となったとき、そのレジスタa₀の
データをピーク値として検出して、出力端子45より出力
するものである。なお、上記ウィンドウの幅Ｎ＋１は任
意に設定可能である。The maximum value detection circuit 44 sets the data of the register a ₀ as a peak value when, for example, the value of the central register a ₀ in the shift register 42 becomes the maximum among the values of the data of the N + 1 samples. This is detected and output from the output terminal 45. The window width N + 1 can be set arbitrarily.

第３図に戻って、エンベロープ検出機能ブロック13に
おいては、上述のサンプリング処理後のディジタル楽音
信号に対して、上記ピッチ情報を用いたエンベロープ検
出処理を施すことにより、楽音信号のいわゆるエンベロ
ープ波形を得ている。これは、例えば第６図Ａに示すよ
うな楽音信号波形のピーク点を順次結んで得られる第６
図Ｂに示すような波形であり、発音直後からの時間経過
に伴うレベル（あるいは音量）の変化を表している。こ
のエンベロープ波形は、一般にADSR（アタックタイム／
ディケイタイム／サスティンレベル／リリースタイム）
のような各パラメータにより表されることが多い。ここ
で楽音信号の一具体例として、打鍵操作に応じて発音さ
れるピアノ音等を考えるとき、上記アタックタイムT_Aは
鍵盤の鍵が押され（キー・オン）徐々に音量が上がり目
標とする音量に達するまでの時間を表し、上記ディケイ
タイムT_Dは上記アタックタイムT_Sで達した音量から次の
音量（例えば楽器の持続音の音量）に達するまでの時間
を表し、上記サスティンレベルL_Sは鍵の押圧を解除して
キー・オフするまで保たれる持続音の音量を表し、上記
リリースタイムT_Rは上記キー・オフしてから音が消える
までの時間を表している。なお上記各時間T_A、T_D、T
_Rは、音量変化の傾きあるいはレートを示すこともあ
る。また、これらの４つのパラメータの他にさらに多く
のエンベロープパラメータを用いるようにしてもよい。Returning to FIG. 3, the envelope detection function block 13 performs an envelope detection process using the pitch information on the digital tone signal after the sampling process to obtain a so-called envelope waveform of the tone signal. ing. This is achieved by sequentially connecting peak points of the tone signal waveform as shown in FIG. 6A, for example.
The waveform is as shown in FIG. B, and represents a change in level (or volume) over time immediately after sound generation. This envelope waveform generally has the ADSR (attack time /
Decay time / sustain level / release time)
In many cases. As a specific example of where tone signal, when considering the piano sound like be pronounced according to keying operation, the attack time T _A is a target increases the volume gradually keys of the keyboard is pressed (key on) The decay time T _D represents the time required to reach the volume, and the decay time T _D represents the time required to reach the next volume (for example, the volume of the continuous sound of the instrument) from the volume reached by the attack time T _S , and the sustain level L _S represents the volume of the sustained sound is maintained until the key-off by releasing the pressing of the key, the release time T _R represents the time until the sound from the above-mentioned key-off disappears. Each of the above times T _A , T _D , T
_R may also indicate the slope or rate of volume change. Further, in addition to these four parameters, more envelope parameters may be used.

ここで、エンベロープ検出機能ブロック13において
は、上述したようなADSR（アタックタイムT_A／ディケイ
タイムT_D／サスティンレベルL_S／リリースタイムT_R）等
の各パラメータにより表されるエンベロープ波形情報と
同時に、前述したフォルマント部分をアタック波形の残
った状態で取り出すために、信号波形の全体的なディケ
イレートを示す情報を得るようにしている。このディケ
イレート情報は、例えば第７図に示すように、発音時
（キー・オン時）から上記アタックタイムT_Aの間は基準
の値“1"をとり、その後単調減衰する波形を表すもので
ある。Here, in the envelope detection function block 13, simultaneously with the envelope waveform information represented by each parameter such as ADSR (attack time T _A / decay time T _D / sustain level L _S / release time T _R ) as described above. In order to extract the above-mentioned formant portion with the attack waveform remaining, information indicating the entire decay rate of the signal waveform is obtained. The decay rate information, for example, as shown in FIG. 7, between time pronunciation (when a key on) of the attack time T _A has a value "1" of the reference, which represents the subsequent waveform monotonously attenuated is there.

ここで、第３図のエンベロープ検出機能ブロック13の
構成例について、第８図の機能ブロック図を参照しなが
ら説明する。Here, an example of the configuration of the envelope detection function block 13 in FIG. 3 will be described with reference to the functional block diagram in FIG.

当該エンベロープ検出の原理は、いわゆるAM（振幅変
調）信号のエンベロープ検波と同様なものである。すな
わち、上記楽音信号のピッチを上記AM信号のキャリアの
周波数として考えることによりエンベロープを検出する
ものである。上記エンベロープ情報は楽音を再生する際
に用いるものであり、当該楽音は上記エンベロープ情報
とピッチ情報に基づいて形成されるものである。The principle of the envelope detection is similar to the envelope detection of a so-called AM (amplitude modulation) signal. That is, the envelope is detected by considering the pitch of the tone signal as the frequency of the carrier of the AM signal. The envelope information is used for reproducing a musical sound, and the musical sound is formed based on the envelope information and the pitch information.

第８図の入力端子51に供給された楽音データは、絶対
値出力機能ブロック52において、上記楽音の波高値デー
タの絶対値が求められる。この絶対値データをFIR（有
限インパルス応答）型ディジタルフィルタの機能ブロッ
ク55に送る。ここで、上記FIRフィルタ機能ブロック55
はローパスフィルタとして作用するものであり、予め、
入力端子53に供給されたピッチ情報に基づいて機能ブロ
ック54において形成しておいたフィルタ係数をFIRフィ
ルタ機能ブロック55に供給することにより、そのローパ
スフィルタのカットオフ特性を決定するものである。From the musical tone data supplied to the input terminal 51 in FIG. 8, the absolute value of the peak value data of the musical tone is obtained in the absolute value output function block 52. The absolute value data is sent to a functional block 55 of a FIR (finite impulse response) type digital filter. Here, the above FIR filter function block 55
Acts as a low-pass filter.
By supplying the filter coefficient formed in the function block 54 to the FIR filter function block 55 based on the pitch information supplied to the input terminal 53, the cutoff characteristic of the low-pass filter is determined.

ここで、上記フィルタ特性は、例えば第９図に示す特
性となっており、上記楽音信号の基音（周波数f_o）やそ
の倍音の周波数に零点を有するものである。例えば、上
記第６図Ａに示す楽音信号からは、上記FIRフィルタで
基音，倍音の周波数を減衰させることにより第６図Ｂに
示すようなエンベロープ情報が検出される。なお上記フ
ィルタ係数の特性は、次式で示されるものである。Here, the filter characteristic is, for example, the characteristic shown in FIG. 9, and has a zero point in the fundamental tone (frequency f _o ) of the tone signal and its harmonic frequency. For example, from the tone signal shown in FIG. 6A, envelope information as shown in FIG. 6B is detected by attenuating the fundamental and harmonic frequencies by the FIR filter. Note that the characteristics of the filter coefficient are represented by the following equations.

Ｈ（ｆ）＝ｋ・（sin（πf/f_o））/f ・・・・この式中のf_oは楽音信号の基本周波数（ピッチ）を
示す。H (f) = k · (sin (πf / f _o )) / f In this equation, f _o indicates the fundamental frequency (pitch) of the tone signal.

次に、上述のサンプリング処理された楽音信号の波高
値データ（サンプリングデータ）から、前述の第２図に
示すフォルマント部分FRの信号の波高値データと、ルー
ピング区間LPの信号の波高値データ（ループデータ）と
を生成する処理について説明する。Next, the peak value data of the signal of the formant part FR shown in FIG. 2 and the peak value data of the signal of the looping section LP (loop ) Will be described.

上記ループデータ生成のための最初の機能ブロック14
において、上記サンプリングされた楽音信号の波高値デ
ータを、先に検出したエンベロープ波形（第６図Ｂ）の
データで割算（又は逆数を乗算）してエンベロープ補正
を行うことにより、第10図に示すような振幅一定の波形
の信号の波高値データを得ている。このエンベロープ補
正された信号（の波高値データ）をフィルタ処理するこ
とにより、音程成分以外が減衰された、あるいは相対的
に音程成分が強調された信号（の波高値データ）を得て
いる。ここで音程成分とは、基本周波数f_oの整数倍の周
波数成分のことである。具体的には、上記エンベロープ
補正された信号に含まれるビブラート等の低周波成分を
除去するためにHPF（ハイパスフィルタ）を介し、次
に、第11図の一点鎖線に示すような周波数特性、すなわ
ち基本周波数f_oの整数倍の周波数帯域が通過帯域の周波
数特性、を有する櫛形フィルタを介すことにより、上記
HPF出力信号に含まれる音程成分のみを通過させてこれ
ら以外の非音程成分やノイズ成分を減衰させ、さらに必
要に応じてLPF（ローパスフィルタ）を介すことによ
り、上記櫛形フィルタ通過後の信号に重畳しているノイ
ズ成分を除去する。First functional block 14 for generating the above loop data
In FIG. 10, the peak value data of the sampled tone signal is divided (or multiplied by the reciprocal) by the data of the previously detected envelope waveform (FIG. 6B) to perform envelope correction. Crest value data of a signal having a constant amplitude waveform as shown is obtained. By subjecting this envelope-corrected signal (peak value data) to filtering processing, a signal (peak value data) in which components other than the pitch components are attenuated or the pitch components are relatively emphasized is obtained. Here, the pitch component is that an integer multiple of the frequency component of the fundamental frequency f _o. Specifically, in order to remove low-frequency components such as vibrato contained in the envelope-corrected signal, the signal passes through an HPF (high-pass filter), and then has a frequency characteristic as shown by a one-dot chain line in FIG. frequency characteristics of the integral multiples of the frequency band is the pass band of the fundamental frequency f _o, by the intervention of the comb filter having the above
By passing only the pitch components included in the HPF output signal to attenuate other non-pitch components and noise components, and passing through an LPF (low-pass filter) as necessary, the signal after passing through the comb filter The superimposed noise component is removed.

すなわち、前記入力信号として楽器の音等の楽音信号
を考えるとき、この楽音信号は通常一定の音程（ピッ
チ、音高）を有していることから、その周波数スペクト
ラムには、第11図の実線に示すように、上記楽音自体の
音程に対応する基本周波数f_oの近傍とその整数倍の周波
数の近傍にエネルギが集中するような分布が得られる。
これに対して一般のノイズ成分は一様な周波数分布を持
っていることが知られている。従って、上記入力楽音信
号を第11図の一点鎖線に示すような周波数特性の櫛形フ
ィルタを通すことにより、楽音信号の基本周波数f_oの整
数倍の周波数成分（いわゆる音程成分）のみがそのまま
通過あるいは強調されて他の成分（非音程成分及びノイ
ズの一部）が減衰され、結果としてSN比を改善すること
ができる。ここで、上記第11図中の一点鎖線に示す櫛形
フィルタの周波数特性は、次式Ｈ（ｆ）＝［（cos（２πf/f_o＋１）/2］^N ・・・で表されるものである。この式中のf_oは上記入力信
号の基本周波数（音程に対応する基音の周波数）、Ｎは
櫛形フィルタの段階である。That is, when considering a tone signal such as the sound of a musical instrument as the input signal, since the tone signal usually has a fixed pitch (pitch, pitch), its frequency spectrum has a solid line in FIG. as shown in, distributed as near the energy in the vicinity of the frequency of an integer multiple of the fundamental frequency f _o corresponding to the pitch of the musical tone itself is concentrated is obtained.
On the other hand, it is known that general noise components have a uniform frequency distribution. Therefore, by passing the comb filter in the frequency characteristic as shown the input musical tone signal to the one-dot chain line in FIG. 11, the fundamental frequency f integral multiple of the frequency components (so-called pitch component) _o tone signals only pass intact or The other components (non-pitched components and part of noise) are emphasized and attenuated, and as a result, the S / N ratio can be improved. Here, the frequency characteristic of the comb filter shown in dashed line in the FIG. 11, the following formula H (f) = [(cos (2πf / f o +1) / 2] those represented by ^N · · · there. (frequency of the fundamental tone corresponding to the musical interval) f _o in this equation the fundamental frequency of the input signal, N represents a stage of the comb filter.

このようにしてノイズ成分が低減された楽音信号は、
前記繰り返し波形抽出回路に送られ、この繰り返し波形
抽出回路により前述した第２図のルーピング区間LPのよ
うな適当な繰り返し波形区間が抽出された後、半導体メ
モリ等の記憶媒体に送られて記録される。この記憶媒体
に記録された楽音信号データは、非音程成分や一部のノ
イズ成分が減衰されたものであるため、上記繰り返し波
形区間を繰り返し再生する際のノイズ、いわゆるルーピ
ングノイズを低減することができる。The tone signal with the noise component reduced in this way is
It is sent to the repetitive waveform extraction circuit, and after the repetition waveform extraction circuit extracts an appropriate repetition waveform section such as the looping section LP in FIG. 2 described above, it is sent to a storage medium such as a semiconductor memory and recorded. You. Since the tone signal data recorded on this storage medium has attenuated non-pitch components and some noise components, it is possible to reduce noise when repeatedly playing back the repetitive waveform section, so-called looping noise. it can.

なお上記HPF、櫛形フィルタ、LPFの周波数特性は、先
にピッチ検出機能ブロック12にて検出されたピッチ情報
である上記基本周波数f_oに基づいて設定されるようにな
っている。Note the HPF, comb filter, the frequency characteristic of the LPF is adapted to be set based on the fundamental frequency f _o is the pitch information detected by the pitch detection function block 12 first.

次に第３図のループ区間検出機能ブロック16におい
て、上記フィルタ処理によって音程成分以外が減衰され
た楽音信号に対して、適当な繰り返し波形区間を検出す
ることにより、ルーピング開始点LP_Sとルーピング終端
点LP_Eとのルーピングポイントを設定する。Then the loop interval detection block 16 of FIG. 3, with respect to the musical tone signal other than pitch component is attenuated by the filtering process, by detecting a suitable repetitive waveform sections, looping start point LP _S and looping end to set the looping point between the point LP _E.

すなわち、ループ区間検出機能ブロック16では、上記
楽音信号のピッチ（音程）に対応する繰り返し周期（の
整数倍）だけ相対的に離れた２点であるルーピングポイ
ントを選定するが、以下にその選定原理を説明する。That is, the loop section detection function block 16 selects two looping points that are relatively separated by a repetition period (an integer multiple) corresponding to the pitch (pitch) of the musical tone signal. Will be described.

楽音データをルーピング処理する場合、ルーピングの
間隔は、楽音信号の基本周期（基音の周波数の逆数）の
整数倍でなければならない。したがって、その楽音の音
程を正確に同定すれば、容易に決定することが可能とな
る。When performing looping processing on musical tone data, the looping interval must be an integral multiple of the fundamental period of the musical tone signal (the reciprocal of the fundamental tone frequency). Therefore, if the pitch of the musical tone is accurately identified, it can be easily determined.

つまり、予めルーピング間隔を決定しておき、その間
隔分だけ離れた２点を取り出し、その２点の近傍の信号
波形の相関性あるいは類似性を評価することでルーピン
グポイントを設定する。この評価関数の一例として、上
記２点の各近傍の信号波形のサンプルについてのたたみ
込み（合成積、コンボリューション）を用いるものにつ
いて説明する。すなわち、上記コンボリューションの操
作を全ての点の組みについて順次施すことで信号波形の
相関成あるいは類似性を評価する。ここで、上述のコン
ボリューションによる評価は、例えば上記楽音データを
シフトレジスタに順次入力してゆき、それぞれ各レジス
タで取り込まれた楽音データを、例えば後述するDSP
（ディジタル信号処理装置）で構成された積和器にそれ
ぞれ入力し、該積和器で上記コンボリューションを計算
し出力するものである。このようにして得られたコンボ
リューションが最大となる２点の組みをルーピング開始
点LP_Sおよびルーピング終端点LP_Eとする。In other words, a looping interval is determined in advance, two points separated by the interval are extracted, and a looping point is set by evaluating the correlation or similarity of signal waveforms near the two points. As an example of the evaluation function, a description will be given of a function using convolution (synthesis product, convolution) for a sample of a signal waveform near each of the two points. That is, the correlation operation or similarity of the signal waveforms is evaluated by sequentially performing the convolution operation on all sets of points. Here, in the evaluation by the convolution described above, for example, the tone data is sequentially input to the shift register, and the tone data captured by each register is converted into, for example, a DSP described later.
(Digital signal processing device), each of which is input to a product-sum device, and the product-sum device calculates and outputs the convolution. Thus convolution obtained is to set the looping start point LP _S and looping end point LP _E of two points becomes maximum.

すなわち、第12図において、ルーピング開始点LP_Sの
候補点をa₀とし、ルーピング終端点LP_Eの候補点をb₀と
して、上記ルーピング開始点LP_Sの候補点a₀の前後近傍
の複数個の点、例えば2N＋１個の点の各波高値データ
を、それぞれa_-N・・，a_-2,a_-1,a₀,a₁,a₂,・・ a_N、ルー
ピング終端点LP_Eの候補点b₀の前後近傍の同じ個数（2N
＋１個）の点の各波高値データを、b_-N・・，b_-2,b_-1,b
₀,b₁,b₂,・・ b_Nとすると、このときの評価関数Ｅ（a₀,
b₀）は、次式で定めることができる。この第式はa₀,b₀の点を中
心としたコンボリューションを求めるための式である。
そして上記候補点a₀,b₀の組を順次変更して、全てのル
ーピングポイントの候補となる点についての上記評価関
数Ｅの値を求め、得られた全ての評価関数Ｅの内でその
値が最大となる点をルーピングポイントとする。That is, in Figure 12, the candidate points of the looping start point LP _S and a _0, the candidate points of the looping end point LP _E as b _0, a plurality of front and rear vicinity of the candidate point a ₀ of the looping start point LP _S points, for example, the respective amplitude data of 2N + 1 single point, respectively _{_{a -N ··, a -2, a}} -1, a 0, a 1, a 2, ·· a N, looping end point LP _E same number before and after the vicinity of the candidate point b ₀ (2N
+1) points are calculated as b _-N .., b _-2 , b _-1 , b
_0, b _1, b _2, when a · · b _N, the evaluation function E (a ₀ at this time,
b ₀ ) is Can be determined. This equation is an equation for obtaining a convolution centering on the points a ₀ and b ₀ .
Then, the set of the candidate points a ₀ and b ₀ is sequentially changed to obtain the value of the evaluation function E for all the looping point candidate points. The point at which is maximized is defined as the looping point.

また、ルーピングポイントは上述のようにコンボリュ
ーションから求める方法の他に、誤差の最小２乗法から
求めることも可能である。すなわち、最小２乗法による
ルーピングポイントの候補点a₀,b₀は、の式で表すことができる。この場合には、評価関数ε
の値が最小となるa₀,b₀を求めればよい。Further, the looping point can be obtained by the least square method of the error in addition to the method of obtaining the looping point from the convolution as described above. That is, the candidate points a ₀ and b ₀ of the looping point by the least square method are Can be represented by the following equation. In this case, the evaluation function ε
A ₀ , b ₀ that minimizes the value of may be obtained.

また、上述のループ区間検出機能ブロック16では、必
要に応じて上記ルーピング開始点LP_Sとルーピング終端
点LP_Eとに基づいてピッチ変換比を算出する。このピッ
チ変換比は、次の機能ブロック17における時間軸補正処
理の際の時間軸補正値データとして用いられる。この時
間軸補正処理は、実際に各種音源データをメモリ等の記
憶手段に記録する際の各種音源データの各ピッチを揃え
ておくために行われるものであり、上記ピッチ変換比の
代わりにピッチ検出機能ブロック12において検出された
上記ピッチ情報を用いるようにしてもよい。Further, the loop interval detection block 16 described above, calculates the pitch conversion ratio based on the above looping start point LP _S and the looping end point LP _E as required. This pitch conversion ratio is used as time axis correction value data in the time axis correction processing in the next function block 17. This time axis correction process is performed to make the pitches of the various sound source data uniform when actually recording the various sound source data in a storage unit such as a memory, and the pitch detection is performed instead of the pitch conversion ratio. The pitch information detected in the function block 12 may be used.

この時間軸補正機能ブロック17におけるピッチの正規
化動作について第13図を参照しながら説明する。The pitch normalization operation in the time axis correction function block 17 will be described with reference to FIG.

第13図Ａは時間軸補正処理（主として時間軸圧伸処
理）を施す前の楽音信号波形を示し、第13図Ｂは上記圧
伸後の補正波形を示している。これらの第13図Ａ、Ｂの
時間軸には、後述する準瞬時ビット圧縮符号化処理の際
のブロック単位で目盛りを付している。FIG. 13A shows a tone signal waveform before time-base correction processing (mainly, time-base expansion / compression processing), and FIG. 13B shows a corrected waveform after the above-mentioned expansion. The time axis in FIGS. 13A and 13B is marked on a block-by-block basis in the quasi-instantaneous bit compression encoding process described later.

時間軸補正前の波形Ａにおいては、通常の場合ルーピ
ング区間LPと上記ブロックとは無関係となるが、第13図
Ｂに示すように、上記ルーピング区間LPがブロックの長
さ（ブロック周期）の整数倍（ｎ倍）となるように時間
軸圧押処理し、さらにブロックの境界位置が上記ルーピ
ング開始点LP_S及びルーピング終端点LP_Eに一致するよう
に時間軸方向にシフトする。すなわち、ルーピング区間
LPの開始点LP_S及び終端点LP_Eが所定のブロックの境界位
置となるように時間軸補正（時間軸圧伸及びシフト）す
ることによって、整数個（ｍ個）のブロック単位でルー
ピング処理を行うことができ、記録時の音源データのピ
ッチの正規化が実現できる。ここで、上記時間シフトに
よって楽音信号波形の先頭に生ずるブロックの境界から
のずれ分ΔＴの間には、波高値データとして“0"を詰め
るようにすればよい。In the waveform A before the time axis correction, the looping section LP and the block are normally unrelated in the normal case. However, as shown in FIG. 13B, the looping section LP is an integer of the block length (block cycle). times (n times) and a way to time axis圧押process further boundary position of the block is shifted in the time axis direction to coincide with the looping start point LP _S and looping end point LP _E. That is, the looping section
By starting point of the LP LP _S and end point LP _E is time base correction so that the boundary position of a predetermined block (time scale modification and shift), the looping process in blocks of an integer number (m pieces) And normalization of the pitch of the sound source data at the time of recording can be realized. Here, "0" may be filled as the peak value data during the deviation ΔT from the block boundary generated at the head of the tone signal waveform due to the time shift.

第14図は、上記時間軸補正後の波形の波高値データを
後述のビット圧伸符号化処理するためにブロック化する
際のブロック構造を表すものであり、１ブロックの波高
値データの個数（サンプル数、ワード数）をｈとしてい
る。この場合、上記ピッチの正規化とは、一般的に第２
図に示す楽音信号波形の一定周期T_Wの波形のｎ周期分す
なわちルーピング区間LP内のワード数を、上記ブロック
内ののワード数ｈの整数倍（ｍ倍）とするように時間軸
圧伸処理することであり、さらに好ましくは、ルーピン
グ区間LPの開始点LP_S及び終端点LP_Eを時間軸上のブロッ
ク境界位置に一致させるように時間軸処理（シフト処
理）させることである。このように各点LP_S、LP_Eがブロ
ック境界位置に一致していると、ビット圧縮符号化シス
テムでのデコードの際のブロック切替えによって生じる
誤差を減少させることができる。FIG. 14 shows a block structure when the peak value data of the waveform after the time axis correction is divided into blocks for bit companding encoding processing to be described later, and the number of peak value data of one block ( The number of samples, the number of words) is h. In this case, the pitch normalization generally means the second
The number of words within a fixed period T _W n cycles i.e. looping segment of the waveform of the LP tone signal waveform shown in FIG., An integer multiple of the number of words h of in the block (m times) to as time scale modification it is to process, more preferably, is to time-axis processing to match the start point LP _S and end point LP _E looping section LP to block border position on the time axis (shift). When the points LP _S and LP _E coincide with the block boundary position as described above, it is possible to reduce an error caused by block switching at the time of decoding in the bit compression encoding system.

ここで、第14図Ａの１ブロック内の図中斜線で示す部
分のワードWLP_SとWLP_Eは、図中補正波形のルーピング開
始点LP_Sとルーピング終端点LP_E（正確には点LP_Eの直前
の点）のサンプルを示すワードである。なお上記シフト
処理を行わない場合には、ルーピング開始点LP_S及び終
端点LP_Eがブロック境界に必ずしも一致しないため、第1
4図Ｂに示すように、上記ワードWLP_S、WLP_Eの設定位置
は、ブロック内の任意の位置に設定される。ただし、上
記ワードWLP_SからワードWLP_Eまでの間のワード数は１ブ
ロック内のワード数ｈの整数倍（ｍ倍）となっており、
ピッチは正規化される。Here, the word WLP _S and WLP _E of the portion indicated by oblique lines in FIG within 1 block of Figure 14 A is a looping start point of the figure correction waveform LP _S and the looping end point LP _E (exact points in LP _E Is a word indicating the sample at the point immediately before the. Note that the case of not performing the shift process, since the looping start point LP _S and end point LP _E do not necessarily coincide with the block boundary, the first
4 As shown in FIG. B, set position of the word WLP _S, WLP _E is set at an arbitrary position in the block. However, the number of words between the above word WLP _S to word WLP _E is an integral multiple of the number of words h in one block (m times),
The pitch is normalized.

ここで、上述のようにルーピング区間LP内のワード数
を１ブロックのワード数ｈの整数倍とするための楽音信
号波形の時間軸圧伸処理には各種方法が考えられるが、
例えばサンプリングされた波形の波高値データを補間処
理するとにより実現でき、その一具体例としては、オー
バーサンプリング処理用のフィルタ構成等を利用するこ
とができる。Here, as described above, various methods can be considered for the time axis companding process of the tone signal waveform for making the number of words in the looping section LP an integral multiple of the number h of words in one block.
For example, this can be realized by interpolating the peak value data of the sampled waveform, and as a specific example, a filter configuration for oversampling can be used.

ところで、現実の楽音波形のルーピング周期がサンプ
リング周期単位に対して端数を持ち、ルーピング開始点
LP_Sでのサンプリング波高値とルーピング終端点LP_Eでの
サンプリング波高値とにずれが生じている場合に、オー
バーサンプリング等を利用した補間処理により、ルーピ
ング終端点LP_Eの近傍位置（サンプリング周期よりも短
い距離の位置）でルーピング開始点LP_Sのサンプリング
波高値に一致するような波高値を求める等して、補間サ
ンプルも含めたサンプリング周期の非整数倍の（端数を
持つ）ルーピング周期を実現することが考えられる。こ
のようなサンプリング周期の非整数倍のルーピング周期
も、上記時間軸補正処理により上記ブロック周期の整数
倍とすることができ、例えば256倍オーバサンプリング
を利用して時間軸圧伸処理する場合には、ルーピング開
始点LP_Sと終端点LP_Eとの間の波高値の誤差を1/256に低
減して、より円滑なルーピング再生を実現できる。By the way, the looping cycle of the actual tone waveform has a fraction with respect to the sampling cycle unit, and the looping start point
If the deviation in the sampling peak value at the sampling peak value and looping end point LP _E in LP _S occurs, by interpolation processing using oversampling like, from a position near (sampling period of looping end point LP _E even if such finding the peak value to conform to the sampling the peak value of the short distance looping start point LP _S at position) of, with non-integer multiple of the (fractional sampling period interpolated samples were also included) realized looping cycle It is possible to do. Such a looping cycle of a non-integer multiple of the sampling cycle can also be set to an integer multiple of the block cycle by the time axis correction processing. For example, when performing the time axis companding processing using 256 times oversampling, , to reduce the error in the wave height value between the looping start point LP _S and the end point LP _E 1/256 can be realized more smoothly looping playback.

上述のようにしてルーピング区間LPが決められ時間軸
補正（圧伸）処理が施された波形は、次の機能ブロック
21において、第15図に示すようにルーピング区間LPを前
後に接続してループデータの生成が行われる。すなわち
第15図は、上記時間軸補正後の楽音波形（第13図Ｂ）か
らルーピング区間LPのみを切り取り、このルーピング区
間LPを複数個並べたループデータ波形を示しており、こ
のループデータ波形は、複数個のルーピング区間LPのそ
れぞれ一方のルーピング終端点LP_Eと他方のルーピング
開始点LP_Sとを順次接続して並べたものである。このル
ープデータ波形がループデータ生成機能ブロック21にて
生成される。The waveform for which the looping section LP has been determined as described above and the time axis correction (compression expansion) processing has been performed is performed by the following functional block.
In FIG. 21, loop data is generated by connecting the looping sections LP back and forth as shown in FIG. That is, FIG. 15 shows a loop data waveform obtained by cutting out only the looping section LP from the tone waveform after the time axis correction (FIG. 13B) and arranging a plurality of looping sections LP. it is obtained by arranging the respective one of the looping end point LP _E and the other looping start point LP _S of a plurality of looping sections LP and sequentially connected. This loop data waveform is generated by the loop data generation function block 21.

このループデータは、ルーピング区間LPを多数回接続
して形成されてるため、該接続形成されたループデータ
波形の各ルーピング開始点LP_Sに対応するワードWLP_Sを
含む開始ブロックの直前には、ルーピング終端点LP
_E（正確には点LP_Eの直前の点）に対応するワードWLP_Sを
含む終了ブロックのデータがそのまま配置されることに
なる。原理的には、ビット圧縮符号化のエンコード処理
をする際に、記憶しようとするルーピング区間LP_Oの上
記開始ブロックの直前位置に、少なくとも上記終了ブロ
ックが存在していればよい。さらに一般化して述べるな
らば、上記ブロック単位のビット圧縮エンコード時に、
上記開始ブロックのパラメータ（圧縮ブロック毎のビッ
ト圧縮符号化の情報、例えば後述するレンジ情報やフィ
ルタ選択情報）は、上記開始ブロックと終了ブロックの
データに基づいて形成されるようにすればよい。これ
は、後述するフォルマント部分を持たないループデータ
のみの楽音信号を音源とする場合にも適用可能な技術で
ある。The loop data, because it is formed by connecting multiple looping section LP, just before the start block containing the word WLP _S corresponding to each looping start point LP _S of the connection formed loop data waveform, looping End point LP
_E so that the data of the end block containing the word WLP _S corresponding to (exactly to a point immediately before the point LP _E) is to be placed as it is. In principle, when the encoding process of bit compression encoding, just before the position of the start block of the looping section LP _O to be stored as long as the at least the end block is present. To be more generalized, at the time of the above-mentioned block unit bit compression encoding,
The parameters of the start block (information of bit compression encoding for each compressed block, for example, range information and filter selection information described later) may be formed based on the data of the start block and the end block. This is a technique applicable also to a case where a tone signal of only loop data having no formant part described later is used as a sound source.

こうすれば、上記エンコード時に、ルーピング開始点
LP_Sと終端点LP_Eとについては、それぞれの前後複数のサ
ンプルに亘って、それぞれ同じデータが並ぶことにな
る。従って、これらの各点LP_SとLP_Eの直前のそれぞれの
ブロックについてのビット圧縮符号化の際のパラメータ
は同じものとなり、デコード処理の際のルーピング再生
時のエラー（ノイズ）を減少することができる。すなわ
ち、ルーピング再生される楽音データは接続ノイズの無
い安定したものとなる。なお、本実施例においては、上
記開始ブロックの直前に配置する上記ルーピング区間LP
のデータのサンプル数を約500サンプルとしている。In this way, at the time of the above encoding, the looping start point
For the LP _S and the end point LP _E, over each of the front and rear multiple samples, so that each same data lined. Therefore, that the parameters of the time of bit compression encoding for each of the blocks immediately preceding each of these points LP _S and LP _E becomes the same as, reducing the looping playback error upon decoding (noise) it can. In other words, the tone data reproduced in a loop is stable without connection noise. In this embodiment, the looping section LP arranged immediately before the start block is used.
The number of data samples is about 500 samples.

次に上記フォルマント部分FRの信号のデータ生成工程
においては、先ず、上記ループデータ生成の際の機能ブ
ロック14と同様に、機能ブロック18おいてエンベロープ
補正処理が施される。ただしこの場合のエンベロープ補
正は、上記サンプリング処理された楽音信号に対して、
前述したディケイレート情報のみのエンベロープ波形
（第７図）で割算することにより、第16図に示すような
波形の信号（の波高値データ）を得ている。すなわちこ
の第16図の出力信号においては、上記アタック部分（時
間T_Aの間）のエンベロープが残され、それ以外の部分は
一定振幅となっている。Next, in the data generation process of the signal of the formant part FR, first, similarly to the functional block 14 at the time of generating the loop data, an envelope correction process is performed in the functional block 18. However, in this case, the envelope correction is performed on the sampled tone signal.
By dividing by the envelope waveform of only the decay rate information (FIG. 7), a signal having a waveform as shown in FIG. 16 (peak value data) is obtained. That is, in the output signal of FIG. 16, the envelope of the above-mentioned attack portion (during the time T _A ) is left, and the other portions have a constant amplitude.

このエンベロープ補正された信号は、必要に応じて機
能ブロック19でのフィルタ処理が施される。この機能ブ
ロック19でのフィルタ処理には、上記機能ブロック15と
同様な例えば第11図の一点鎖線に示すような周波数特性
の櫛形フィルタが用いられる。すなわちこの櫛形フィル
タは、上記音程に対応する基本周波数f₀の整数倍の周波
数帯域成分を強調して相対的に非音程成分を減衰するよ
うな周波数特性を有しており、この櫛形フィルタも上記
ピッチ検出機能ブロック12で検出されたピッチ情報（基
本周波数f₀）に基づいて周波数特性が設定されるもので
ある。このような信号は、最終的にメモリ等の記憶媒体
に記録される音源データにおけるフォルマント部分の信
号のデータを生成するために用いられる。This envelope-corrected signal is subjected to a filtering process in a functional block 19 as necessary. For the filter processing in the functional block 19, for example, a comb filter having a frequency characteristic as shown by a dashed line in FIG. 11 similar to the functional block 15 is used. That this comb filter has a frequency characteristic as to attenuate the relatively non-pitch component emphasizes the integral multiple of the frequency band component of the fundamental frequency f ₀ corresponding to the pitch, also the comb filter described above The frequency characteristic is set based on the pitch information (basic frequency f ₀ ) detected by the pitch detection function block 12. Such a signal is used to generate data of a signal of a formant part in sound source data finally recorded on a storage medium such as a memory.

次の機能ブロック20においては、上記機能ブロック17
と同様な時間軸補正が上記フォルマント部分生成用信号
に対しても行われる。これは、上記機能ブロック16で求
められたピッチ変換比あるいは上記機能ブロック12で検
出されたピッチ情報に基づいて時間軸の圧縮伸長を行う
ことにより、各音源毎のピッチを揃える（正規化する）
ためのものである。In the next function block 20, the above function block 17
The same time axis correction is performed on the formant part generation signal. This is because the pitch of each sound source is made uniform (normalized) by performing compression and expansion on the time axis based on the pitch conversion ratio obtained in the function block 16 or the pitch information detected in the function block 12.
It is for.

次に、機能ブロック22において、上記共に同じピッチ
変換比あるいはピッチ情報を用いて時間軸補正されたル
ープデータとフォルマント部分生成用データとが混合さ
れる。このときの混合は、上記機能ブロック20からのフ
ォルマント部分生成用信号に対してハミング窓をかけ、
ループデータと混合しようとする部分で時間に伴った減
衰するフェイドアウト型の信号を形成し、これに対し上
記機能ブロック20からのループデータに対しても同様な
ハミング窓をかけ、この場合にはフォルマント信号と混
合しようとする部分で時間に伴って増大するフェイドイ
ン型の信号を形成し、これらの信号を混合する（クロス
フェイドする）ことにより、最終的に音源データとなる
楽音信号を得ている。ここで、メモリ等の記憶媒体に記
録するループデータとしては、上記クロスフェイド部分
からある程度離れた１つのルーピング区間のデータを取
り出すことにより、ルーピング再生時のノイズ（ルーピ
ングノイズ）を低減することができる。このようにし
て、発音時からの非音程成分を含む波形部分であるフォ
ルマント部分FRと、音程成分のみの繰り返し波形部分で
あるルーピング区間LPとから成る音源信号の波高値デー
タが得られる。Next, in the function block 22, the loop data and the formant part generation data that have been time-axis corrected using the same pitch conversion ratio or pitch information are mixed. At this time, a hamming window is applied to the formant part generation signal from the functional block 20,
A fade-out type signal that attenuates with time is formed at a portion to be mixed with the loop data, and a similar hamming window is applied to the loop data from the functional block 20 in this case. A part to be mixed with the signal forms a fade-in type signal which increases with time, and these signals are mixed (cross-fade) to finally obtain a tone signal which is sound source data. . Here, as loop data to be recorded on a storage medium such as a memory, noise in a looping reproduction (looping noise) can be reduced by extracting data of one looping section that is separated from the cross-fade part to some extent. . In this manner, the peak value data of the sound source signal including the formant portion FR, which is a waveform portion including a non-pitch component from the time of sound generation, and the looping section LP, which is a repetitive waveform portion including only the pitch component, is obtained.

この他、上記フォルマント部分生成用信号における上
記ルーピング開始点の位置にループデータの信号の開始
点を接続するように各部分を切り繋ぐ処理等も考えられ
る。In addition, a process of connecting each part such that the start point of the loop data signal is connected to the position of the looping start point in the formant part generation signal may be considered.

ところで、現実にループ区間検出やルーピング処理、
さらにはループデータとフォルマント部分との混合を行
う際には、人間の手操作により試行錯誤的に試聴を繰り
返しながら大まかな混合をしておき、このときのループ
ポイント（ルーピング開始点LP_Sとルーピング終端点L
P_E）情報等に基づいてより高精度の処理を行っている。By the way, actually, loop section detection and looping processing,
When further performing mixing of the loop data and formant portion, by the hand of man operations leave the rough mixed with trial and error repeated Listen, loop points (looping start point LP _S and the looping of the time End point L
_PE ) Higher precision processing is performed based on information.

すなわち、上記機能ブロック16での高精度のループ区
間検出に先立って、第17図のフローチャートに示すよう
な手順でループ区間検出や上記混合等を試聴を繰り返し
ながら手操作で行い、その後、上述したような高精度の
処理（ステップS26以降）を行わせる。That is, prior to the high-precision loop section detection in the functional block 16, the loop section detection and the mixing and the like are manually performed while repeating the audition according to the procedure shown in the flowchart of FIG. Such high-precision processing (step S26 and subsequent steps) is performed.

この第17図において、最初のステップS21において
は、例えば信号波形のゼロクロス点を利用したり、信号
波形の表示を目視確認しながら、比較的粗い精度で上記
ループポイントを検出し、ステップS22でルーピング処
理して上記ループポイント間の波形を繰り返して再生
し、次のステップS23で人間が試聴して良好か否かを判
別する。不良の場合には上記最初のステップS21で戻っ
てループポイントを再度検出する。これを繰り返して良
好な試聴結果が得られれば、次のステップS24に進み、
上記フォルマント部用信号とクロスフェード等により混
合し、次のステップS23で人間が試聴してフォルマント
部からルーピング部への移行が良好か否かを判別する。
不良の場合にはステップS24に戻って上記混合をやり直
す。その後、ステップS26に進んで、上記ループ区間検
出機能ブロック16における高精度のループ区間検出を行
う。具体的には上記補間サンプルも含むループ区間検
出、例えば256倍オーバサンプリング時にはサンプリン
グ周期の1/256の精度でのループ区間検出を行い、次の
ステップS27で上記ピッチ正規化のためのピッチ変換比
を算出する。このピッチ変換比に基づいて、次のステッ
プS28で上記機能ブロック17、20における時間軸補正処
理を行い、次のステップS29にて上記機能ブロック21で
のループデータ生成を行う。そして、ステップS30にお
いて、上記機能ブロック22での混合処理を行う。これら
のステップS26以降の処理においては、ステップS21から
S25までで得られたループポイント情報等を利用するも
のである。なお、上記ステップS21からS25までを省略し
て、ルーピング処理等の全自動化を図ってもよい。In FIG. 17, in the first step S21, the loop point is detected with relatively coarse accuracy while using, for example, the zero-cross point of the signal waveform or visually confirming the display of the signal waveform, and looping is performed in step S22. The processing is repeated to reproduce the waveform between the loop points, and in the next step S23, it is determined whether or not a human listens to the sample by listening. In the case of a failure, the process returns to the first step S21 to detect the loop point again. If a good audition result is obtained by repeating this, proceed to the next step S24,
The signal is mixed with the signal for the formant part by cross-fade or the like, and in the next step S23, a human listens to the sound to determine whether the transition from the formant part to the looping part is good.
If defective, the process returns to step S24 to repeat the mixing. Thereafter, the process proceeds to step S26, in which the loop section detection function block 16 performs high-accuracy loop section detection. Specifically, the loop section detection including the interpolation sample is performed, for example, at the time of 256 times oversampling, the loop section detection is performed with an accuracy of 1/256 of the sampling period, and the pitch conversion ratio for the pitch normalization is performed in the next step S27. Is calculated. Based on the pitch conversion ratio, the time axis correction processing in the functional blocks 17 and 20 is performed in the next step S28, and the loop data is generated in the functional block 21 in the next step S29. Then, in step S30, the mixing process in the functional block 22 is performed. In the processing after step S26, the processing from step S21
The loop point information and the like obtained up to S25 are used. Note that steps S21 to S25 may be omitted and full automation such as looping processing may be achieved.

このような混合処理により得られたフォルマント部分
FRとルーピング区間LPとから成る信号の波高値データ
は、次の機能ブロック23においてビット圧縮符号化処理
が施される。Formant part obtained by such a mixing process
The peak value data of the signal composed of the FR and the looping section LP is subjected to a bit compression encoding process in the next functional block 23.

上述のビット圧縮符号化方式としては種々のものが考
えられるが、ここでは、本件出願人が先に特開昭62-008
629号公報や特開昭62-003516号公報等において提案して
いる準瞬時圧伸型、すなわち波高値データの一定ワード
数（ｈサンプル）毎にブロック化しこのブロック単位で
ビット圧縮を施すような高能率符号化方式を用いるもの
とし、この高能率ビット圧縮符号化方式について、第18
図を参照しながら概略的に説明する。Various bit compression coding schemes are conceivable, but here, the applicant of the present application has disclosed in Japanese Patent Laid-Open No. 62-008 / 1987.
A quasi-instantaneous companding type proposed in Japanese Patent Application Laid-Open No. 629 and Japanese Patent Application Laid-Open No. 62-003516, that is, a method in which a block is formed for every fixed number of words (h samples) of peak value data and bit compression is performed in block units The high-efficiency coding method shall be used.
This will be schematically described with reference to the drawings.

この第18図において、上記高能率ビット圧縮符号化シ
ステムは、記録側のエンコーダ70と、再生側のデコーダ
90とにより構成されており、エンコーダ70の入力端子71
には、上記音源信号の波高値データｘ（ｎ）が供給され
ている。In FIG. 18, the high-efficiency bit compression encoding system comprises a recording-side encoder 70 and a reproduction-side decoder.
90 and the input terminal 71 of the encoder 70.
Is supplied with peak value data x (n) of the sound source signal.

この入力信号（の波高値データ）ｘ（ｎ）は、予測器
72及び加算器73で構成されたFIR（有限インパルス応答
型）ディジタルフィルタ74に供給され、上記予測器72か
らの予測信号（の波高値データ）（ｎ）は上記加算器
73に減算信号として送られている。上記加算器73におい
ては、上記入力信号ｘ（ｎ）から上記予測信号（ｎ）
が減算されることによって、予測誤差信号あるいは広義
の差分出力ｄ（ｎ）が出力される。予測器72は、一般に
過去のｐ個の入力ｘ（ｎ−ｐ）,x（ｎ−ｐ＋１），…,x
（ｎ−１）の１次結合により予測値（ｎ）を算出する
ものである。なお、上記FIRフィルタ74を、以下エンコ
ード・フィルタと称す。This input signal (peak value data) x (n) is calculated by a predictor
The prediction signal (the peak value data) (n) of the prediction signal from the predictor 72 is supplied to an FIR (finite impulse response type) digital filter 74 comprising an adder 72 and an adder 73.
73 is sent as a subtraction signal. In the adder 73, the prediction signal (n) is obtained from the input signal x (n).
Is subtracted to output a prediction error signal or a difference output d (n) in a broad sense. The predictor 72 generally includes p past inputs x (n-p), x (n-p + 1), ..., x
The prediction value (n) is calculated by the linear combination of (n-1). Note that the FIR filter 74 is hereinafter referred to as an encoding filter.

上記高能率ビット圧縮符号システムにおいては、上記
音源データの一定時間内のデータ、すなわち、一定ワー
ド数ｈの入力データ毎にブロック化して、各ブロック毎
に最適の特性の上記エンコード・フィルタ74を選択する
ようにしている。これは、互いに異なる特性を有する複
数の（例えば４個の）エンコード・フィルタを予め設け
ておき、これらのフィルタのうち最適の特性の、すなわ
ち最も高い圧縮率を得ることのできるようなフィルタを
選択することで実現できるものである。ただし、一般の
ディジタル・フィルタの構成上は、第18図に示す１個の
エンコード・フィルタ74の予測器72の係数の組を複数組
（例えば４組）係数メモリ等に記憶させておき、これら
の係数の組を時分割的に切り換え選択することで、実質
的に上記複数のエンコード・フィルタのうちの１つを選
択するのと等価な動作を行わせることが多い。In the high-efficiency bit compression encoding system, data of the excitation data within a certain time, that is, input data of a certain number of words h is divided into blocks, and the encoding filter 74 having the optimum characteristic is selected for each block. I am trying to do it. This is because a plurality of (for example, four) encoding filters having different characteristics are provided in advance, and a filter having an optimum characteristic, that is, a filter capable of obtaining the highest compression ratio is selected from these filters. It can be realized by doing. However, in the structure of a general digital filter, a plurality of sets (for example, four sets) of coefficients of the predictor 72 of one encoding filter 74 shown in FIG. In many cases, an operation substantially equivalent to selecting one of the plurality of encoding filters is performed by switching and selecting the set of coefficients in a time-division manner.

次に、上記予測誤差としての差分出力ｄ（ｎ）は、加
算器81を介し、利得Ｇのシフタ75と量子化器76とよりな
るビット圧縮器に送られ、例えば浮動小数点（フローテ
ィング・ポイント）表示形態における指数部が上記利得
Ｇに、仮数部が量子化器76からの出力にそれぞれ対応す
るような圧縮処理あるいはレンジング処理が施される。
すなわち、シフタ75により入力データを上記利得Ｇに応
じたビット数だけシフトしてレンジを切り替え、量子化
器76により該ビット・シフトされたデータの一定ビット
数を取り出すような再量子化を行っている。ここで、ノ
イズ・シェイピング回路（ノイズ・シェイパ）77は、量
子化器76の出力と入力との誤差分いわゆる量子化誤差を
加算器78で得て、この量子化誤差を利得G^-1のシフタ79
を介し予測器80に送って、量子化誤差の予測信号を加算
器81に減算信号として帰還するようないわゆるエラー・
フィードバックを行う。このように量子化器76による再
量子化とノイズ・シェイピング回路77によるエラー・フ
ィードバックとが施され、出力端子82より出力（ｎ）
が取り出される。Next, the difference output d (n) as the prediction error is sent to a bit compressor composed of a shifter 75 for gain G and a quantizer 76 via an adder 81, for example, a floating point. A compression process or a ranging process is performed so that the exponent part in the display form corresponds to the gain G and the mantissa part corresponds to the output from the quantizer 76.
That is, the range is switched by shifting the input data by the number of bits according to the gain G by the shifter 75, and requantization is performed by the quantizer 76 to extract a certain number of bits of the bit-shifted data. I have. Here, a noise shaping circuit (noise shaper) 77 obtains a so-called quantization error corresponding to an error between an output and an input of the quantizer 76 by an adder 78, and converts the quantization error into a shifter having a gain G- ¹ . 79
, And a prediction signal of the quantization error is fed back to the adder 81 as a subtraction signal.
Give feedback. In this way, requantization by the quantizer 76 and error feedback by the noise shaping circuit 77 are performed, and the output (n) is output from the output terminal 82.
Is taken out.

ところで、上記加算器81からの出力ｄ′（ｎ）は上記
差分出力ｄ（ｎ）より上記ノイズ・シェイパ77からの量
子化誤差の予測信号（ｎ）を減算したものであり、上
記利得Ｇのシフタ75からの出力ｄ″（ｎ）は利得Ｇと上
記出力加算器81からの出力ｄ′（ｎ）を乗算したもので
ある。また、上記量子化器76からの出力（ｎ）は、量
子化の過程における量子化誤差ｅ（ｎ）と上記シフタ75
からの出力ｄ″（ｎ）を加算したものとなり、上記ノイ
ズ・シェイパ77の上記加算器78において上記量子化誤差
ｅ（ｎ）が取り出される。この量子化誤差ｅ（ｎ）は、
上記利得G^-1のシフタ79を介し、過去のｒ個の入力の１
次を結合をとる予測器80を介することにより量子化誤差
の予測信号（ｎ）とする。The output d '(n) from the adder 81 is obtained by subtracting the prediction signal (n) of the quantization error from the noise shaper 77 from the difference output d (n). The output d ″ (n) from the shifter 75 is obtained by multiplying the gain G by the output d ′ (n) from the output adder 81. The output (n) from the quantizer 76 is the quantum Error e (n) in the process of quantization and the shifter 75
Is added to the output d ″ (n), and the quantization error e (n) is extracted by the adder 78 of the noise shaper 77. The quantization error e (n) is
Through the shifter 79 of the gain G ^-1 , one of the past r inputs
The next is through a predictor 80 which takes the combination into a prediction signal (n) of the quantization error.

上記音源データは、以上のようなエンコード処理が施
され、上記量子化器76からの出力（ｎ）となって出力
端子82を介して取り出される。The above-mentioned sound source data is subjected to the above-described encoding processing, output as the output (n) from the quantizer 76, and taken out through the output terminal 82.

次に予測・レンジ適応回路84からは、最適フィルタ選
択情報としてのモード選択情報が出力されて、上記エン
コード・フィルタ74の例えば予測器72および出力端子87
に送られ、また、上記利得Ｇおよび利得G^-1あるいは上
記ビット・シフト量を決定するためのレンジ情報が出力
されて、各シフタ75,79および出力端子86に送られてい
る。Next, the mode selection information as the optimum filter selection information is output from the prediction / range adaptation circuit 84, for example, the predictor 72 and the output terminal 87 of the encoding filter 74.
The range information for determining the gain G and the gain G- ¹ or the bit shift amount is output to the shifters 75 and 79 and the output terminal 86.

次に、再生側のデコーダ90の入力端子91には、上記エ
ンコーダ70の出力端子82からの出力（ｎ）が伝送さ
れ、あるいは記録，再生されることによって得られた信
号′（ｎ）が供給されている。この入力信号′
（ｎ）は利得G^-1のシフタ92を介し加算器93に送られて
いる。加算器93からの出力ｘ′（ｎ）は予測器94に送ら
れて予測信号′（ｎ）となり、この予測信号′
（ｎ）は上記加算器93に送られて上記シフタ92からの出
力″（ｎ）と加算される。この加算出力がデコード出
力′（ｎ）として出力端子95より出力される。Next, the output (n) from the output terminal 82 of the encoder 70 is transmitted to the input terminal 91 of the decoder 90 on the reproduction side, or the signal '(n) obtained by recording and reproduction is supplied. Have been. This input signal
(N) is sent to the adder 93 via the shifter 92 having the gain G ^-1 . The output x '(n) from the adder 93 is sent to the predictor 94 to become a predicted signal' (n),
(N) is sent to the adder 93 and added to the output "(n) from the shifter 92. The added output is output from the output terminal 95 as a decoded output '(n).

また、上記エンコーダ70の各出力端子86および87より
出力され、伝送あるいは記録，再生された上記レンジ情
報およびモード選択信号は、上記デコーダ90の各入力端
子96および97にそれぞれ入力されている。そして、入力
端子96からのレンジ情報は上記シフタ92に送られて利得
G^-1を決定し、入力端子97からのモード選択情報は上記
予測器94に送られて予測特性を決定する。この予測器94
の予測特性は、上記エンコーダ70の予測器72の特性に等
しいものが選択される。The range information and the mode selection signal output from the output terminals 86 and 87 of the encoder 70 and transmitted, recorded, or reproduced are input to the input terminals 96 and 97 of the decoder 90, respectively. The range information from the input terminal 96 is sent to the shifter 92 and gain
G- ¹ is determined, and the mode selection information from the input terminal 97 is sent to the predictor 94 to determine a prediction characteristic. This predictor 94
Are selected as those of the predictor 72 of the encoder 70.

このような構成のデコーダ90において、上記シフタ92
からの出力″（ｎ）は、上記入力信号′（ｎ）と利
得G^-1を乗算したものである。また、上記加算器93の出
力′（ｎ）は、上記シフタ92からの出力″（ｎ）と
予測信号′（ｎ）を加算したものである。In the decoder 90 having such a configuration, the shifter 92
Is the product of the input signal '(n) and the gain G- ¹ . The output' (n) of the adder 93 is the output '(n) of the shifter 92. n) and the prediction signal '(n).

次に、第19図には、上記ビット圧縮符号化エンコーダ
70からの上記１ブロック分の出力データの一例を示して
おり、この１ブロック分のデータは、１バイトのヘッダ
情報（圧縮に関するパラメータ情報、あるいは付属情
報）RFと８バイトのサンプル用データD_A0〜D_B3で構成さ
れている。上記ヘッダ情報RFは、４ビットの上記レンジ
情報と、２ビットの上記モード選択情報、あるいはフィ
ルタ選択情報と、それぞれ１ビットの２つのフラグ情
報、例えばループの有無を示す情報LI及び波形の終端ブ
ロック（エンドブロック）が否かを示す情報EIとで構成
されている。ここで１サンプルの波高値データは、ビッ
ト圧縮されて４ビットと表されており、上記データD_A0
〜D_B3中には16サンプル分の４ビット・データD_A0H〜D
_B3Lが含まれている。Next, FIG. 19 shows the above-mentioned bit compression encoding encoder.
An example of the output data of one block from 70 is shown. This one block of data is composed of 1-byte header information (parameter information on compression or additional information) RF and 8-byte sample data _DA0. ~ _B3 . The header information RF includes the 4-bit range information, the 2-bit mode selection information, or the filter selection information, two 1-bit flag information, for example, information LI indicating the presence or absence of a loop, and a waveform end block. (End block) is composed of information EI indicating whether or not (end block). Here, the peak value data of one sample is bit-compressed and represented as 4 bits, and the data D _A0
4-bit data D _A0H to D of in to D _B3 16 samples
_B3L is included.

次に第20図は、第２図に示すような楽音信号波形の先
頭部分に対応する上記準瞬時（ブロック化）ビット圧縮
符号化された波高値データの各ブロックを示している。
この第20図においては、上記ヘッダを省略して波高値デ
ータのみを示しており、図示の都合上１ブロックを８サ
ンプルとしているが、１ブロック16サンプル等のように
任意に設定可能であることは勿論である。これは、前記
第14図の場合も同様である。Next, FIG. 20 shows each block of the peak value data which has been subjected to the quasi-instantaneous (blocking) bit compression encoding corresponding to the head portion of the tone signal waveform as shown in FIG.
In FIG. 20, only the peak value data is shown omitting the above-mentioned header, and one block is set to 8 samples for the sake of illustration. However, it can be set arbitrarily such as 16 samples per block. Of course. This is the same in the case of FIG.

ここで、上記準瞬時ビット圧縮符号システムは、上記
入力楽音信号を直接出力するモードすなわちストレート
PCMモードと、楽音信号をフィルタを介して出力するモ
ードすなわち１次または２次差分フィルタモードのう
ち、最も高い圧縮率を有する信号が得られるモードを選
択して、出力信号である楽音データを伝送するようにし
たものである。Here, the quasi-instantaneous bit compression code system is a mode for directly outputting the input tone signal, that is, a straight mode.
Selects a mode in which a signal having the highest compression rate is obtained from the PCM mode and a mode in which a tone signal is output through a filter, that is, a primary or secondary difference filter mode, and transmits tone data as an output signal. It is something to do.

楽音をサンプリングしてメモリ等の記憶媒体に記録す
る場合、上記楽音の楽音信号波形は発音開始点KSで波形
取り込みが開始されるものであるが、この発音開始点KS
からの最初のブロックにて１次または２次差分フィルタ
モード等のように初期値が必要なフィルタモードを選択
されると、この初期値を予め用意しておく必要が生じる
ため、このような初期値の必要のない形態とすることが
望まれる。このため、上記発音開始点KSに先行する期間
に、上記ストレートPCMモード（入力楽音信号を直接出
力するモード）が選択されるような擬似入力信号を付加
した後、その入力信号を含めて信号処理するようにして
いる。When a musical tone is sampled and recorded in a storage medium such as a memory, the tone signal waveform of the musical tone starts to be captured at the tone generation start point KS.
If a filter mode requiring an initial value, such as a primary or secondary difference filter mode, is selected in the first block from, it is necessary to prepare this initial value in advance. It is desirable to have a form that does not require a value. For this reason, during a period preceding the tone generation start point KS, a pseudo input signal for selecting the straight PCM mode (a mode for directly outputting an input tone signal) is added, and then signal processing including the input signal is performed. I am trying to do it.

すなわち具体的には、第20図において、上記発音開始
点KSに先行して、上記疑似入力信号としてデータを全て
“0"としてブロックを配置し、このブロックの先頭から
全データ“0"をサンプリング波高値データとしてビット
圧縮処理して取り込むようにしている。これは、例え
ば、予め１ブロックのデータが全て“0"のブロック作成
しておきこれをメモリ等にストアしておいて用いるか、
または、楽音をサンプリングする際に上記発音開始点KS
の前にデータが全て“0"の部分（すなわち発音開始前の
無音部分）の入力信号からサンプリングを開始する等に
より得ることができる。なお、上記擬似入力信号のブロ
ックは最低１ブロック以上である。That is, specifically, in FIG. 20, prior to the sound generation start point KS, a block is arranged with all data “0” as the pseudo input signal, and all data “0” are sampled from the head of this block. Bit-compression processing is performed as peak value data. This is, for example, to create a block in which one block of data is all “0” and store it in a memory or the like before use,
Alternatively, when sampling a tone,
, Data can be obtained by starting sampling from an input signal in which all data is "0" (that is, a silent portion before the start of sound generation). The number of blocks of the pseudo input signal is at least one.

上述のようにして形成された擬似入力信号を含んだ楽
音データを、前述の第18図に示すような高能率ビット圧
縮符号化システムにより信号圧縮処理し、メモリ等の記
憶媒体に記録させておき、この圧縮処理された信号を再
生する。The tone data including the pseudo input signal formed as described above is subjected to signal compression processing by the high-efficiency bit compression encoding system as shown in FIG. 18 and recorded in a storage medium such as a memory. The compressed signal is reproduced.

したがって、上記擬似入力信号を含んだ楽音データを
再生する場合、再生開始時（擬似入力信号のブロック部
分）のフィルタにストレートPCMモードが選択されるた
め、１次または２次差分フィルタの初期値をあらかじめ
設定しておく必要がなくなる。Therefore, when the tone data including the pseudo input signal is reproduced, the straight PCM mode is selected as the filter at the start of reproduction (the block portion of the pseudo input signal). There is no need to set in advance.

ここで、再生開始時に上記擬似入力信号（データが全
て“0"であるため無音である。）による発音開始時の遅
れについての懸念がある。しかし、例えば、サンプリン
グ周波数32kHzで１ブロック16サンプルとした場合、上
記発音時間の遅れは約0.5msecとなり聴覚上で識別でき
る遅れではなく問題にならない。Here, there is a concern about a delay at the start of sound generation due to the pseudo input signal (since data is all "0" and there is no sound) at the start of reproduction. However, for example, when the sampling frequency is 32 kHz and 16 samples are included in one block, the delay of the sound generation time is about 0.5 msec, which is not a delay that can be discerned from the auditory sense, and is not a problem.

ところで、上記ビット圧縮符号処理やその他の音源デ
ータ生成のためのディジタル信号処理については、ディ
ジタル信号処理装置（DSP）を用いてソフトウェア的に
実現することが多く行われており、また記録された音源
データの再生にもDSPを用いたソフトウェア的な構成が
採用されることが多い。第21図はその一例として、音源
データを取り扱う音源ユニットとしてのオーディオ・プ
ロセッシング・ユニット（APU）107及びその周辺を含む
システムの全体構成例を示している。By the way, the above-mentioned bit compression coding processing and other digital signal processing for generating sound source data are often implemented by software using a digital signal processing device (DSP), and the recorded sound source is often used. A software-like configuration using a DSP is often used for data reproduction. FIG. 21 shows an example of the overall configuration of a system including an audio processing unit (APU) 107 as a sound source unit for handling sound source data and its periphery as an example.

この第21図において、例えば一般のパーソナルコンピ
ュータ装置や、ディジタル電子楽器、TVゲーム機等に設
けられているホストコンピュータ104は、上記音源ユニ
ットとしてのAPU107と接続されており、該ホストコンピ
ュータ104からは音源データ等がAPU107にロードされる
ようになっている。このAPU107は、マイクロプロセッサ
等のCPU（中央処理装置）103と、DSP（ディジタル信号
処理装置）101と、上述したような音源データ等が記憶
されたメモリ102とを少なくとも有して構成されるもの
である。すなわち、このメモリ102には少なくとも音源
データが記憶されており、上記DSP101により該音源デー
タの読み出し制御を含む各種処理、例えばルーピング処
理、ビット伸長（復元）処理、ピッチ変換処理、エンベ
ロープの付加、エコー（リバーブ）処理等が施される。
メモリ102は、これらの各種処理のためのバッファメモ
リとしても用いられる。CPU103は、DSP101のこれらの各
種処理の動作や内容等についての制御を行うものであ
る。In FIG. 21, for example, a host computer 104 provided in a general personal computer device, a digital electronic musical instrument, a video game machine, or the like is connected to an APU 107 as the sound source unit. Sound source data and the like are loaded into the APU 107. The APU 107 includes at least a CPU (central processing unit) 103 such as a microprocessor, a DSP (digital signal processing unit) 101, and a memory 102 in which sound source data and the like are stored as described above. It is. That is, at least sound source data is stored in the memory 102, and various kinds of processing including reading control of the sound source data by the DSP 101, such as looping processing, bit decompression (decompression) processing, pitch conversion processing, envelope addition, echo (Reverb) processing or the like is performed.
The memory 102 is also used as a buffer memory for these various processes. The CPU 103 controls operations and contents of these various processes of the DSP 101.

さらに、メモリ102からの上記音源データに対してDSP
101により上記各種処理を施して最終的に得られたディ
ジタル楽音データは、ディジタル／アナログ（D/A）コ
ンバータ105によりアナログ信号に変換されてスピーカ1
06に供給されるようになっている。Furthermore, the above sound source data from the memory 102 is
Digital tone data finally obtained by performing the above-described various processings at 101 is converted into an analog signal by a digital / analog (D / A) converter 105, and
06 will be supplied.

なお、本発明は上述した実施例のみに限定されるもの
ではなく、例えば、上述の実施例においてはフォルマン
ト部分とルーピング区間とを接続して音源データを形成
していたが、ルーピング区間のみから成る音源データを
形成する場合にも容易に適用可能である。また、上記デ
コーダ側構成や音源データ用外部メモリは、ROMカート
リッジやアダプタとして供給してもよい。また、楽音信
号の音源のみならず音声合成にも適用可能である。It should be noted that the present invention is not limited to only the above-described embodiment. For example, in the above-described embodiment, the sound source data is formed by connecting the formant part and the looping section, but only the looping section is used. It can be easily applied to the case where sound source data is formed. Further, the decoder side configuration and the external memory for sound source data may be supplied as a ROM cartridge or an adapter. In addition, the present invention can be applied not only to the sound source of a tone signal but also to speech synthesis.

〔The invention's effect〕

本発明によれば、ノイズ成分を含んだ音源データを櫛
型のフィルタに通すことにより、楽音の基音とその整数
倍の周波数のみを取り出すことができ、ノイズ成分をカ
ットすることができる。同様に、音源にビブラート等の
微小なFM変調がかかっていても櫛型フィルタに通すこと
によりルーピングノイズを除去することができる。従っ
て、ルーピング再生に適した繰り返し波形区間を抽出し
て記録することができ、、ルーピングノイズが少なく、
円滑なルーピング再生が行える。According to the present invention, by passing sound source data containing a noise component through a comb filter, only the fundamental tone of a musical tone and a frequency that is an integral multiple of the fundamental tone can be extracted, and the noise component can be cut. Similarly, even if a sound source is subjected to minute FM modulation such as vibrato, looping noise can be removed by passing the sound through a comb filter. Therefore, a repetitive waveform section suitable for looping reproduction can be extracted and recorded, and there is little looping noise.
Smooth looping playback can be performed.

[Brief description of the drawings]

第１図は本発明の信号記録方法の原理を示すフローチャ
ート、第２図は楽音信号波形図、第３図は本発明の信号
記録方法の具体例を説明するための機能ブロック図、第
４図はピッチ検出動作を説明するための機能ブロック
図、第５図はピーク検出動作を説明するためのブロック
図、第６図は楽音信号及びエンベロープの波形図、第７
図は楽音信号のディケイレート情報の波形図、第８図は
エンベロープ検出動作を説明するための機能ブロック
図、第９図はFIRフィルタの特性図、第10図は楽音信号
のエンベロープ補正された後の波高値データを示す波形
図、第11図は櫛形のフィルタの特性図、第12図は最適ル
ーピングポイントの設定動作を説明するための波形図、
第13図は時間軸補正の前後の楽音信号を示す波形図、第
14図は時間軸補正後の波高値データについて準瞬時ビッ
ト圧縮用のブロックの構成を示す模式図、第15図はルー
ピング区間の波形を繰り返し接続されて得られるループ
データを示す波形図、第16図はディケイレート情報に基
づくエンベロープ補正後のフォルマント部分生成用デー
タを示す波形図、第17図は現実のルーピング処理前後の
動作を説明するためのフローチャート、第18図は準瞬時
ビット圧縮符号化システムの概略構成を示すブロック回
路図、第19図は準瞬時ビット圧縮符号化されて得られた
データの１ブロックの具体例を示す模式図、第20図は楽
音信号の先頭部分のブロックの内容を示す模式図、第21
図はオーディオ・プロセッシング・ユニット（APU）及
びその周辺を含むシステムの構成例を示すブロック図で
ある。FIG. 1 is a flowchart showing the principle of the signal recording method of the present invention, FIG. 2 is a tone signal waveform diagram, FIG. 3 is a functional block diagram for explaining a specific example of the signal recording method of the present invention, and FIG. Is a functional block diagram for explaining a pitch detecting operation, FIG. 5 is a block diagram for explaining a peak detecting operation, FIG. 6 is a waveform diagram of a tone signal and an envelope, FIG.
FIG. 8 is a waveform diagram of the decay rate information of the tone signal, FIG. 8 is a functional block diagram for explaining the envelope detection operation, FIG. 9 is a characteristic diagram of the FIR filter, and FIG. 10 is a diagram of the tone signal after envelope correction. FIG. 11 is a characteristic diagram of a comb-shaped filter, FIG. 12 is a waveform diagram for explaining an operation of setting an optimal looping point,
FIG. 13 is a waveform diagram showing the tone signal before and after the time axis correction.
FIG. 14 is a schematic diagram showing a configuration of a block for quasi-instantaneous bit compression for peak value data after time axis correction, FIG. 15 is a waveform diagram showing loop data obtained by repeatedly connecting looping section waveforms, and FIG. FIG. 17 is a waveform diagram showing formant part generation data after envelope correction based on decay rate information. FIG. 17 is a flowchart for explaining operations before and after actual looping processing. FIG. 18 is a quasi-instantaneous bit compression encoding system. FIG. 19 is a schematic diagram showing a specific example of one block of data obtained by quasi-instantaneous bit compression encoding, and FIG. 20 is a block diagram of a head portion of a tone signal. Schematic diagram showing the twenty-first
FIG. 1 is a block diagram showing a configuration example of a system including an audio processing unit (APU) and its periphery.

Claims

(57) [Claims]

An input analog signal or an input digital signal corresponding to the analog signal is supplied to a comb filter having only a fundamental frequency of the input analog signal and a frequency band of a harmonic component thereof as a pass band, and an output analog signal or Obtaining a digital signal; extracting an appropriate repetitive waveform section of the analog or digital signal output from the comb filter; and recording the extracted repetitive waveform section on a storage medium. Signal recording method to be used.

2. A comb filter to which an input analog signal or an input digital signal corresponding to the input analog signal is supplied, and wherein only a fundamental frequency of the input analog signal and a frequency band of a harmonic component thereof are pass bands, A signal recording apparatus comprising: an extracting unit for extracting an appropriate repetitive waveform section of an analog or digital signal output from the apparatus; and a unit for recording the repetitive waveform section from the extracting unit on a storage medium.