JPS61252600A

JPS61252600A - Lsp type pattern matching vocoder

Info

Publication number: JPS61252600A
Application number: JP60094924A
Authority: JP
Inventors: 哲田口
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1985-05-02
Filing date: 1985-05-02
Publication date: 1986-11-10
Anticipated expiration: 2009-06-29
Also published as: JPH0650440B2

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】（産業上の利用分野）本発明は音声信号を低速度の符号列に変換するＬＳＰ型
パタンマツチングボコーダに関する。DETAILED DESCRIPTION OF THE INVENTION (Field of Industrial Application) The present invention relates to an LSP pattern matching vocoder that converts an audio signal into a low-speed code string.

（従来の技術）入力音声信号のスペクトル包絡に最近似するスペクトル
包絡を、予め音声資料を分析して得られた標準パタンと
照合して選択し、これを入力音声信号に関する有声およ
び無声ならびに無声に関する情報のほか、ピッチ周期お
よび音の強さ等の音源情報とともに分析側から合成側に
伝送して入力音声信号の波形を再生するパタンマツチン
グボコーダは近時よく知られており、またこのようなパ
タンマツチングボコーダの分析側と合成側とにおける分
析および合成パラメータとしてＬＳＰ係数を利用するＬ
ｆＭＰ型バタソパタンマツチングボコーダよく知られて
いる。(Prior art) A spectral envelope that most closely resembles the spectral envelope of an input audio signal is selected by comparing it with a standard pattern obtained by analyzing audio materials in advance, and this is selected as a spectral envelope that is most similar to the spectral envelope of an input audio signal. Pattern matching vocoders, which reproduce the waveform of an input audio signal by transmitting information as well as sound source information such as pitch period and sound intensity, from the analysis side to the synthesis side are well known these days. LSP coefficients are used as analysis and synthesis parameters on the analysis and synthesis sides of a pattern matching vocoder.
The fMP type bataso pattern matching vocoder is well known.

このＬＳＰ係数は巌形予測係数、ＰＡ凡Ｃ０Ｒ（偏自己
相関）係数等とともに声道の共振特性を表わすパラメー
タとして利用されるものでるり、声門を仮想的に完全開
放および完全閉塞した場合の一道伝達胸数の巌スペクト
ル周波数によるパラメータであることはよく知られてい
る。This LSP coefficient is used as a parameter representing the resonance characteristics of the vocal tract, along with the rock shape prediction coefficient and the PA C0R (partial autocorrelation) coefficient. It is well known that the number of transmitted breasts is a parameter depending on the spectral frequency.

このよりなＬＳＰ係数は周波数領域で表わされるパラメ
ータであシ、αパラメータ等が時間領域で表わされるパ
ラメータであるのに対してよシ直観的に扱い得る量であ
るうえ、少ない情報量でしかも合成すべき入力音声信号
の音質も高い精度のものが得られるといったさまざまな
特徴を有し、従ってこのＬＳＦ係数を声道フィルタの伝
達関数を決定する分析および合成パラメータとして利用
し入力音声信号の分析、合成を行なうＬＳＰ型ボコーダ
も上述したような特徴を有するものとして構成される。This more precise LSP coefficient is a parameter expressed in the frequency domain, and unlike the α parameter, etc., which is expressed in the time domain, it is a quantity that can be handled more intuitively, and can be synthesized with a small amount of information. The LSF coefficients are used as analysis and synthesis parameters to determine the transfer function of the vocal tract filter, and are used to analyze input speech signals. The LSP type vocoder that performs synthesis is also configured to have the above-mentioned characteristics.

このＬＳＰ型ボコーダを利用するＬＳＰ型パターンマツ
チングボコーダは、ＬＳＰ分析器で分析されたＬＳＰ係
数と、予め音声賃料をＬＳＰ分析して得られる音声の標
準的なＬＳＦ係数の分布内容に関する標準パタンとを照
合することによって両者の類似度が最大となる最近似標
準パタンを選択し、これを合成側に音源情報とともに伝
送して入力音声信号の合成を図るものであシ、スペクト
ル包絡を１０ビット前後の低情報量で分析、合成しうる
方法として近時よく知られつつあ、ｊ５、ＬＰＣボコー
ダにパタン照合、復号を行なう機能を付加することによ
って容易ｌＩＣ４１ｇ成しうるものでるる。The LSP type pattern matching vocoder that uses this LSP type vocoder uses the LSP coefficients analyzed by an LSP analyzer and the standard pattern regarding the distribution content of the standard LSF coefficients of the voice obtained by LSP analysis of the voice rent in advance. The system selects the closest standard pattern that maximizes the similarity between the two, and transmits this to the synthesis side along with the sound source information to synthesize the input audio signal.The spectral envelope is approximately 10 bits. Recently, it has become well known as a method that allows analysis and synthesis with a low amount of information.It can be easily achieved by adding pattern matching and decoding functions to the j5 and LPC vocoder.

このようなＬＳＰｇパタンマツチングボコーダにおける
ＬＳＰボコーダは、通常Ｌ　Ｐ　Ｃ（ＬｉｎｅａｒＰｒ
ｅｄｉｃｔｉｏｎ　Ｃｏｅｆｆｉｃｉｅｎｔ、　線形予
測係数）分析器によって得られたＬＰＧ係数からＬＳＰ
係数を誘導するという手段によってＬＳＦ係数を得てい
る。The LSP vocoder in such an LSPg pattern matching vocoder is usually LPC (LinearPr
LSP from the LPG coefficient obtained by the LPG coefficient (Linear prediction coefficient) analyzer
The LSF coefficients are obtained by inducing the coefficients.

さて、パタンマツチングの単位としては入力音声のスペ
クトル包絡の如く音−の物理的特徴に着目した物理単位
と、音声の言甜的竹倣に着目した言語単位とがあシ、い
ずれを利用するかはパタンマツチングボコーダの構成内
谷寺に対応して効率のいいものが選択され、またこれら
の単位をマツチングの尺度として行なうパタン照合によ
る標準パタンの選択にはパラメータの見間距離による方
法と言賭的な要素との対応による方法とがある。Now, as a unit for pattern matching, either a physical unit that focuses on the physical characteristics of the sound, such as the spectral envelope of the input voice, or a linguistic unit that focuses on the linguistic imitation of the voice, can be used. The most efficient pattern matching vocoder configuration is selected according to the Uchidera, and standard patterns are selected by pattern matching using these units as a matching measure. There is a method that deals with gambling elements.

従って、たとえはＬａＰ型パタンマツチングボコーダの
如（、ＬＰＣボコーダの機能を内戚するものＫあっては
、ＬＰＣボコーダの機能との親和性を考慮し、マツチン
グ単位には物理単位、選択方法にはパラメータ空間距離
を利用することが望ましいと言える。Therefore, for example, a LaP pattern matching vocoder (or one that has the functions of an LPC vocoder), the matching unit is a physical unit, and the selection method is It can be said that it is desirable to use the parameter spatial distance.

ＬＳＰ型パタンマツチングボコーダにおけるパタンマツ
チング尺度として利用されるパラメータ空１’１４ｍ１
ｍハ、ＬＳＰ　係数４ＬＰＣ，ＰＡＲＣＯＲ４ｆｉと同
様に空間ベクトルと見做すことができ、この空間ベクト
ル間の距離を尺度としてその大小比較によって入力音声
信号のＬＳＰ係数に最も近い標準パタンを選択するため
に利用される。このような空間ベクトルであるＬＳＰ係
数間の距離は次の（１）弐に示すスペクトル距離Ｄｆｆ
ｉｊによりて示される。Parameter space 1'14m1 used as a pattern matching measure in LSP type pattern matching vocoder
mc, LSP coefficient 4LPC, PARCOR4fi can be regarded as a space vector, and in order to select the standard pattern closest to the LSP coefficient of the input audio signal by comparing the magnitude using the distance between these space vectors as a measure. used. The distance between LSP coefficients, which are space vectors, is the spectral distance Dff shown in (1) 2 below.
It is indicated by ij.

ｌ　　π Ｄｉｊ＝　−／　　（８ｉ（−−８３（（４）”　ｄω
−・−−−−−（１）π　　　・（１）式社また次の（２）式の如く近似等式に変換しう
る。l π Dij= −/ (8i(−−83((4)” dω
−・−−−−(1)π ・(1) Equation can also be converted into an approximate equation as shown in Equation (2) below.

（１）および（２）式において、ｉ、ｊｉｊＬ８Ｆ分析
および合成における処理単位区間でるる７レーム（ブロ
ック）の番号ｓ　Ｓ　ｌ　（’４−８　Ｊ　（−は周波
数ωの関数としてのフレームｉおよびｊの対数スペクト
ル、ｐｋ（１）、　ｐａｌはフレームｉおよびｊにおけ
るＮ次のＬＳＰ係数％ＷｋはＮ次のＩ、８Ｐスペクトル
径度である。In equations (1) and (2), the number s S l ('4-8 J (- is the frame i and The logarithmic spectrum of j, pk(1), pal is the Nth-order LSP coefficient in frames i and j, %Wk is the Nth-order I, 8P spectral diameter.

Ｌ８Ｆ係数の次数は、Ｌ８Ｆ係数によって実現すべき声
道フィルタを構成するための全極型デジタルフィルタの
次数と対応し、Ｎ次の全極型デジタルフィルタにあって
は、通常ＬＳＰ周波数と呼ばれるＮ個の巌スペクトルω
１．ω鵞、ω３・・・・・・ωＮを示す。またＮ次のＬ
ＳＰスペクトル感度ＷｋはＮ次のＬ８Ｆ係数の微少変化
によって起るスペクトル変化の程度を示すものであって
、通常ＬＳＰ周波数に対応して決定されるＬ８Ｆ周波数
スペクトル感度が用いられる。The order of the L8F coefficients corresponds to the order of an all-pole digital filter for configuring the vocal tract filter to be realized by the L8F coefficients, and in an N-order all-pole digital filter, the order of N individual rock spectrum ω
1. ω goose, ω3...shows ωN. Also, the Nth L
The SP spectral sensitivity Wk indicates the degree of spectral change caused by a minute change in the N-th order L8F coefficient, and the L8F frequency spectral sensitivity determined in accordance with the LSP frequency is usually used.

さて、人力背戸信号のスペクトル色釉に最も近似した標
準パタンを、予め登録された橡準パタン鮮から選択する
には（１）式によるスペクトル距離の計算を入力音声信
号の全フレームにわたって全標準パタンとの間で実行す
ればよいことＫなるが、この演算量は極めて膨大なもの
となるため、一般的には（２）式の近似等式を利用して
いわゆる簡易スペクトル距離を計測する。これは、分析
された入力音声信号の空間特徴ベクトルであるＮ次のＬ
ＳＰ係数ｐＪＵと、標準パタンに登録されている空間特
徴ベクトルｐＨ１ｌとの内積を各次数のＬＳＦ係数ごと
に求めたうえ、ＬＳＰ係数の次数に対応するＬＳＦ周波
数ごとに予め設定する重みづけ％数をしてのＷｋを乗じ
た簡易スペクトル距離計測を行なうものである。Now, in order to select the standard pattern that is most similar to the spectral color glaze of the human-powered back door signal from among the pre-registered semi-patterns, the spectral distance calculation using equation (1) is performed using all the standard patterns over all frames of the input audio signal. However, since the amount of calculation is extremely large, the approximate equation (2) is generally used to measure the so-called simple spectral distance. This is the spatial feature vector of the analyzed input audio signal of order N
The inner product of the SP coefficient pJU and the spatial feature vector pH1l registered in the standard pattern is calculated for each order of LSF coefficient, and the weighting percentage set in advance for each LSF frequency corresponding to the order of the LSP coefficient is calculated. Simple spectral distance measurement is performed by multiplying by Wk.

（発明が解決しようとする問題点）従来のこの種のＬＳＰｆｉパタンマツチングボコーダは
、（２）式に示す重みづけ係数ＷｋにＩ８Ｐ８ｒ数に対
応するＬＳＰ周技数スペクトル感度を利用しているが、
とのＬＳＰ周波数スペクトル感度はり、８ｒ周ｒｆ！１
．数間隔によって異なるため、単純にこのようなスペク
トルｍ度を用いて計測したスペクトル距離をパタンマツ
チングの尺にとして標準パタンを選択した場合には合成
すべき音７！！Ｉヲ大きく劣化することが多いという欠
点がある。(Problems to be Solved by the Invention) This type of conventional LSPfi pattern matching vocoder uses the LSP frequency spectrum sensitivity corresponding to the I8P8r number as the weighting coefficient Wk shown in equation (2). ,
LSP frequency spectrum sensitivity with 8r frequency rf! 1
．． Since it differs depending on the number interval, if a standard pattern is selected using the spectral distance measured using such a spectrum m degree as the measure for pattern matching, the sound to be synthesized is 7! ! The disadvantage is that I often deteriorate significantly.

本発明の目的は上述した欠点を除去し、少数の標準パタ
ンを予備選択し、前記選択された標準パタンと入力音声
信号とのスペクトル包絡との差を直接比較する手質を備
えることによシ、音質の劣化を大幅に改善し得るＬＳＰ
ｆｉパタンマツチングボコーダを提供することにある。The object of the present invention is to eliminate the above-mentioned drawbacks by providing a system with the ability to preselect a small number of standard patterns and directly compare the difference between the selected standard patterns and the spectral envelope of the input audio signal. , LSP that can significantly improve sound quality deterioration
An object of the present invention is to provide a fi pattern matching vocoder.

（問題点を解決するだめの手段）本発明のボコーダは、音声資料のＬ　８　Ｐ　（Ｌｉｎ
ｅ８ｐｅｃｔｒｕｍ　Ｐａ１ｒ　）係数の分布を考慮し
て作成された標準パタンと入力音声信号をＬＳＰ分析し
で得られるＬＳＰ係数に関するパタンとを照合して入力
音声信号の合成を行なうＬＳＰ型パタンマツチングボコ
ーダにおいて、前記標準パタンの１．ＳＰ係数と前記入
力音声信号のＬ８Ｆ係数との重みづけ内積によるスペク
トル距離を計測して少数の標準パタンを予備選択し、前
記選択されたｌｉ！準パタンよシスベクトル包絡を算出
し、算出されたスペクトル包絡と入力音／！１信号を分
析して求められたスペクトル包絡との差を計測し、前記
計測された差が最小となる＠準パタンを代表パタンとし
て選゛択する手段を備えて構成される。(Means for Solving the Problems) The vocoder of the present invention is capable of processing L8P (Lin
e8pectrum Pa1r) In an LSP type pattern matching vocoder that synthesizes an input audio signal by comparing a standard pattern created in consideration of the coefficient distribution with a pattern related to LSP coefficients obtained by LSP analysis of the input audio signal, 1. of the standard pattern. A small number of standard patterns are preliminarily selected by measuring the spectral distance based on the weighted inner product of the SP coefficient and the L8F coefficient of the input audio signal, and the selected li! Calculate the cisvector envelope of the quasi-pattern, and use the calculated spectral envelope and input sound /! The present invention is configured to include means for measuring a difference from a spectrum envelope obtained by analyzing one signal, and selecting a quasi-pattern for which the measured difference is the minimum as a representative pattern.

（実施例）次に図面を参照して本発明の詳細な説明する。(Example) Next, the present invention will be described in detail with reference to the drawings.

第１図四、郵）は本発明の第一の実施例を示すブロック
図であシ第１図（５）は分析側、第１図（ロ）は合成側
の構成を示すブロック図である。Fig. 1 (4) is a block diagram showing the first embodiment of the present invention, Fig. 1 (5) is a block diagram showing the structure of the analysis side, and Fig. 1 (b) is a block diagram showing the structure of the synthesis side. .

第１図（５）に示す分析図１は、Ｌ　Ｐ　Ｆ　（Ｌｏｗ
　Ｐａ５ｓＦｉｌｔｅｒ）　ｌ　ｌ　、　Ａ／Ｄコｙパ
ータ１２．窓関数処理器１３．自己相関係数計側器１４
　、ＬＰＣ分析器１５．有声／無声／無音判別器１６．
ピッチ抽出器１７　、ＬＳＰ分析器１８．スペクトル距
離計測器１９．標準パタンメモリ２０．周波数スペクト
ル感度メモ’Ｊ２１　、標準パタン選択器２２および符
号化＠２３を備えて構成され、また第１図（Ｂ）に示す
合成１１１！１２は、復号器２４．パタン復号器２５゜
標準パタンメモ９２６．ＬｂＰ合成器２７．可変利得増
１Ｍ器２８．切替器２９．パルス発生器３０゜雑音発生
器３１．Ｄ／ムコンバータ３２およ０ＬＰＰ３３を備え
て構成される。The analytical diagram 1 shown in FIG. 1 (5) shows that L P F (Low
Pa5sFilter) l l, A/D Coy Parter 12. Window function processor 13. Autocorrelation coefficient meter 14
, LPC analyzer 15. Voiced/unvoiced/silent discriminator 16.
Pitch extractor 17, LSP analyzer 18. Spectral distance measuring instrument 19. Standard pattern memory 20. The synthesis 111!12 shown in FIG. Pattern decoder 25° standard pattern memo 926. LbP synthesizer 27. Variable gain multiplier 1M unit 28. Switcher 29. Pulse generator 30° noise generator 31. It is configured to include a D/mu converter 32 and an 0LPP 33.

第１図（５）において、入力ライン１１１を介して入力
する人力音声信号はＬＰＦＩＩによって所定の分析帯域
の周波数成分がフィルタリングされ、出力２イン１１２
を介してＡ／Ｄコンバータ１２に送出されて所定のビッ
ト数でデジタル化されたのち量子化音声信号として出力
ライン１２１を介して窓関数処理器１３に送出される。In FIG. 1 (5), the human voice signal input via the input line 111 is filtered for frequency components in a predetermined analysis band by the LPFII, and the output 2-in 112
The signal is sent to the A/D converter 12 via the A/D converter 12, where it is digitized with a predetermined number of bits, and then sent to the window function processor 13 via the output line 121 as a quantized audio signal.

窓関数処理器１３は、入力した音声信号の３０ｍ５ＥＣ
ずつにハミング関数を乗算する窓関数処理を行なうがこ
の窓関数処理は１０ｍ５ＥＣ周期で繰返されこれを基本
フレーム周期としている。The window function processor 13 processes 30m5EC of the input audio signal.
Window function processing is performed in which each frame is multiplied by a Hamming function, and this window function processing is repeated at a cycle of 10m5EC, which is defined as the basic frame cycle.

こうして窓関数処理された入力音声信号の音声波形デー
タは基本フレームごとに出力ライン１３１を介して自己
相関係数計測器１４に送出される。The audio waveform data of the input audio signal subjected to the window function processing is sent to the autocorrelation coefficient measuring device 14 via the output line 131 for each basic frame.

自己相関係数計６１ｊ器１４社、入力した音声波形デー
タを乗算回路等を利用して各遅れ時間における自己相関
係数を必貴な遅れ時間内で計測し、この自己相関９１．
数データを出力２イン１５１を介してＬＰＣ分析器１５
に、また出力ライン１５２を介して有声／無声／無音判
別器１６およびピッチ抽出器１７に送出するとともに、
遅れ時間苓にお−ける自己相関係数をとシこれを基本フ
レームあたシの短時間音声電力データとして出力ライン
１５３を介して符号化器２３に送出する。An autocorrelation coefficient meter 61j from 14 manufacturers measures the autocorrelation coefficient at each delay time within the required delay time using a multiplication circuit or the like using the input audio waveform data, and calculates the autocorrelation coefficient 91.
LPC analyzer 15 outputs numerical data via 2-in 151
In addition, it is sent to the voiced/unvoiced/silent discriminator 16 and the pitch extractor 17 via the output line 152, and
The autocorrelation coefficient at the delay time is determined and sent to the encoder 23 via the output line 153 as short-time audio power data for each basic frame.

有声／無声／無音判別器１６は入力した自己相関係数デ
ータを利用し、各基本フレームごとに含まれる音声信号
の有声あるいは無声、もしく紘無音状態を判別しこれを
有声／無声／無音判別データとして出力ライン１６１を
介して符号化器２３に送出、またピッチ抽出器１７は入
力した自己相関係数データを利用して各基本フレームご
とに含まれる音声信号のピッチデータを抽出、これを出
力２イン１７１を介して符号化器２３に送出する。The voiced/unvoiced/silent discriminator 16 uses the input autocorrelation coefficient data to determine whether the audio signal included in each basic frame is voiced, unvoiced, or completely silent, and determines whether it is voiced/unvoiced/silent. It is sent as data to the encoder 23 via the output line 161, and the pitch extractor 17 uses the input autocorrelation coefficient data to extract the pitch data of the audio signal included in each basic frame and outputs it. It is sent to the encoder 23 via the 2-in 171.

ＬＰＣ分析器１５は、後述するＬＳＰ分析器１８ととも
に可変長フレームＬＳＰ分析回路を構成するものであシ
、本実施例においてはＬＳＰ分析器１８において、有声
／°無戸／無音判別器１６から出力ライン１６２を介し
て受ける有声／無声／無音判別データにもとづきフレー
ムを、有声および無声に対応する有音区間と、それ以外
の無音区間とに分けこれら２つの区間にそれぞれ予め設
定する可変長伝送フレームを設定している。この場合、
ＬＰＧ分析器１５はよく知られたレビンソン法によって
、入力したフレームごとの自己相関係数を利用して線形
予測係数を予め定める次数、本実施例の場合は１０次ま
で算出し、これを出力ライン１５４を介してＬＳＰ分析
器１８に送出し、ＬＳＰ分析器１８紘この線形予測係数
をＮｅｗｔｏｎの反復法を利用する高次方程式法によっ
て１０次のＬ８Ｆ係数に変′換し、さらに基本フレーム
ごとの一定周期をもったこのＬＳＰ８Ｐ係数、出力ライ
ン１６２を介して入力する有声／無声／無音判別データ
による情報を利用しながら予め設定する近似関数による
最適近似法によって可変長周期化した可変フレーム長に
変換する。The LPC analyzer 15 constitutes a variable-length frame LSP analysis circuit together with an LSP analyzer 18 (to be described later). Based on the voiced/unvoiced/silent discrimination data received via the line 162, the frame is divided into voiced sections corresponding to voiced and unvoiced, and other silent sections, and a variable length transmission frame is set in advance for each of these two sections. is set. in this case,
The LPG analyzer 15 uses the autocorrelation coefficient of each input frame to calculate linear prediction coefficients up to a predetermined order (in the case of this embodiment, up to the 10th order) using the well-known Levinson method, and outputs the linear prediction coefficients to the output line. 154 to the LSP analyzer 18, and the LSP analyzer 18 converts the linear prediction coefficients into 10th-order L8F coefficients by a higher-order equation method using Newton's iterative method, and further converts the linear prediction coefficients into 10th-order L8F coefficients for each basic frame. This LSP8P coefficient with a constant period is converted into a variable frame length with a variable period by an optimal approximation method using an approximation function set in advance while using information from voiced/unvoiced/silent discrimination data input via the output line 162. do.

また、このようなＬＳＰ分析の前処理として、入力音声
データの高域強調を行なうために波形の１次差分を利用
し波形領域における高域成分の事前強調を行なうプリエ
ンファシス（Ｐｒｅ　−Ｅｍｐｈａｓｉｓ）処理、およ
び自己相関係数領域におけるＬａｇ＠数によるＬａｇウ
ィンド処理が行なわれるが、これらの前処理はＬＳＰ８
Ｐ係数最小間隔を広げ、後述する合成１１１２における
ＬＡＦ合成−２７０全極形デジタルフイルタの安定性全
増大させるためＬＳＰ量子化感脱の低域を図って行表わ
れるものである。In addition, as pre-processing for such LSP analysis, pre-emphasis processing is performed to pre-emphasize high-frequency components in the waveform region using the first-order difference of the waveform in order to emphasize the high-frequency components of the input audio data. , and Lag window processing using Lag@ numbers in the autocorrelation coefficient region, but these preprocessings are performed using LSP8
This is done by widening the minimum interval of the P coefficients and increasing the stability of the LAF synthesis-270 all-pole digital filter in synthesis 1112, which will be described later, by reducing the LSP quantization sensitivity in the low range.

さて、このように得られた１０次のＬＳＦ係数は出力ラ
イン１８１を介してスペクトル距離計側器１９に送出さ
れる。またＬＳＰ分析器１８からは可変長フレームを形
成する際に基本フレーム長を伸縮したフレーム変化率情
報、いわゆるレピートビットデータを出力ライン１８２
を介して符号化器２３に送出する。Now, the 10th-order LSF coefficient obtained in this way is sent to the spectral distance meter side unit 19 via the output line 181. In addition, from the LSP analyzer 18, frame change rate information obtained by expanding or contracting the basic frame length when forming a variable length frame, so-called repeat bit data, is output to an output line 182.
The data is sent to the encoder 23 via the encoder 23.

ＬＡＦ分析器１８から出力ライン１８１を介してスペク
トル距離計測器１９に送出されたｌＯ次ＬＳＰ係数は、
スペクトル距離計測器１９において（２）式の近似等式
によ〕、いわゆる簡易スペクトル距離を演算する〇（２）式による（資）易スペクトル演算における入力音
声慎号の特徴ベクトル、すなわちｐｋ（ｉ）Ｋ相当する
ｌＯ次ＬＳＰ係数と、標準パタンメモリ２０に登録され
た標準パタンの％倣ベクトル、すなわちｐｋ（ｊ）　Ｋ
相当する標準１０次ＬＳＰ係数との内積が（２）式の如
くまずｙＬ算され、この内積に対して周波数スペクトル
感度Ｗｋが重みづけ係数として乗算されたものが１次の
ＬＳＰ係数から１０次のＬＳＰ係数まで、入力電声信号
の可変長フレームのおのおのについて標準パターンメモ
！７２０に登録されたＬＳＰ係数の☆パターンとの間で
実行され、スペクトル距離Ｄｉｊが決定し、可変長フレ
ームのおのおのについてこのスペクトル距離Ｄｉｊが最
も小さいものがそれぞれ標準パターンとして選択される
。このような標準パターンは、標準パタンメモリ２０に
おける標準パタン登録アドレスコードを指定する標準パ
タン指定コードデータとして次次に出力ライン１９１を
介して符号化２３に送出される。The lO-order LSP coefficients sent from the LAF analyzer 18 to the spectral distance measuring device 19 via the output line 181 are:
The spectral distance measuring device 19 calculates the so-called simple spectral distance using the approximate equation of equation (2). ) K corresponding lO-order LSP coefficients and the % imitation vector of the standard pattern registered in the standard pattern memory 20, that is, pk(j) K
The inner product with the corresponding standard 10th-order LSP coefficient is first calculated as yL as shown in equation (2), and this inner product is multiplied by the frequency spectral sensitivity Wk as a weighting factor to calculate the 10th-order LSP coefficient from the 1st-order LSP coefficient. Standard pattern memo for each variable length frame of the input voice signal, up to the LSP coefficient! The spectral distance Dij is determined, and the pattern with the smallest spectral distance Dij is selected as the standard pattern for each variable length frame. Such standard patterns are sequentially sent to the encoder 23 via the output line 191 as standard pattern designation code data that designates the standard pattern registration address code in the standard pattern memory 20.

糠早パタンメモリ２０に登録され、ストアされている標
準パタンは、本実施例の場合、次のようにして予め別な
シンピ為−夕によるオフ：）イン処理で作成されるが、
これを本実施例によるボコーダを利用して予め作成して
おいても一向に差支えない。In the case of this embodiment, the standard pattern registered and stored in the Nukahaya pattern memory 20 is created in advance by a separate input process as follows.
There is no problem even if this is created in advance using the vocoder according to this embodiment.

まず、予め設定した音声資料を利用しＬＰＣ分析等の手
法によって無音区間の除去、不要な近接フレームの除去
、有声、無音、無音による介抱等の前飽理を実施する。First, pre-saturation such as removal of silent sections, removal of unnecessary adjacent frames, and voiced, silent, and silent assistance is performed using a method such as LPC analysis using preset audio data.

この場合、フレーム周期は１０ｍ８ＥＣとし、この各フ
レームごとに有声、無声、無音および有声の無声との境
界音いずれに属するかのタグコードを与える。次に無音
フレームを除去し残シのフレームを有声と無声とに分離
し、このとき境界音は有声と無声とのいずれか又は双方
に含ませるものとする。In this case, the frame period is 10 m8 EC, and a tag code is given to each frame to indicate whether it belongs to voiced, unvoiced, unvoiced, or voiced/unvoiced boundary sound. Next, silent frames are removed and the remaining frames are separated into voiced and unvoiced frames, and at this time, boundary sounds are included in either or both of the voiced and unvoiced frames.

さらに、時間的に接近しスペクトル距離の小さいフレー
ムを除去し、このようにして必要とするサンプル数の削
減を図ったうえこれらを従来から知られている標準パタ
ン選択手法によって、予め設定する各スペクトル距離ご
とに分類して標準パタンとして登録、ストアしておくも
のでおる。Furthermore, frames that are close in time and have a small spectral distance are removed, thus reducing the number of required samples. The patterns are classified by distance and registered and stored as standard patterns.

上述した標準パタン手法は、本実施例の場合１０次元Ｌ
８Ｆ係数の空間υがＮ（−のパタンから成るものとし、
このＮ個のパタンのおのおのについて（り式によってス
ペクトル距離を計測し、これが予じめ設定するスペクト
ル距離域値＃ｄＢｊ”をもつものをＮ個のパタンすべて
について求め、このパタン数Ｍｉ＝（ｉ＝１．２．・・
・・・・Ｎ）のうち最大のＭｉをもつパタンＰＬを決定
したうえ、パタンＰＬにおけるスペクトル距離が、予め
設定する値＃ｄＢ”以下のパタンを１０次元ＬＳＰ係数
の空間Ｕから除去したのちＰＬを標準パタンとして登録
し、このような操作を空間ＵＫ含まれるパタンかなくな
るまで繰返して実施して標準パタンとして登録するもの
である。In the case of this example, the standard pattern method described above is 10-dimensional L
Assume that the space υ of 8F coefficients consists of N(- patterns,
For each of these N patterns, the spectral distance is measured using the following formula, and the number of N patterns that has a preset spectral distance range value #dBj is determined, and the number of patterns Mi = (i =1.2...
After determining the pattern PL with the maximum Mi among . is registered as a standard pattern, and this operation is repeated until there are no more patterns including the space UK, and the pattern is registered as a standard pattern.

また、周波数スペクトル感度メモリ２１にそれぞれスト
アされている内容は次のようにして決定される。Further, the contents stored in each frequency spectrum sensitivity memory 21 are determined as follows.

音声資料を（１）式によりて実測して得られるＬＳＰの
に番目（Ｋ次）の要素Ｐｋのスペクトル感度は、次の（
３）弐によって求められる。The spectral sensitivity of the second (Kth) element Pk of the LSP obtained by actually measuring the audio material using equation (1) is as follows (
3) Required by Ni.

／（ΔＰｋ）”ｄＢ”／？ジアン・・・・・・・・・（
３）（２）式においてΔＰｋはＰｋの微少変化であ１．
８ｉ（→はこの場合Ｐ、、Ｐ、、・・・・・・Ｐｋ・・
・・・・ＰＬ等から求めたスペクトル包Ｗｔｓ　８ｊ（
ロ）はＰ、　、Ｐ、　、・・・・・・Ｐｋ＋ΔＰｋ・・
・・・・ＰＬから求めたスペクトル包絡を用いている。/(ΔPk)”dB”/? Jian......(
3) In equation (2), ΔPk is a minute change in Pk, and 1.
8i (→ in this case P,,P,,...Pk...
...spectral envelope Wts 8j (
b) is P, ,P, ,...Pk+ΔPk...
...The spectrum envelope obtained from PL is used.

従って（３）式によって、ΔＰｋを予め設定する値θラ
ジアンとした場合、１０次のＬＳＦ係数の各周波数に関
するＬＳＰ周波数スペクトル感度が得られる。Therefore, according to equation (3), when ΔPk is a preset value θ radian, the LSP frequency spectrum sensitivity for each frequency of the 10th-order LSF coefficient can be obtained.

パタン照合においては、こうして得られた周波数スペク
トル感度を重みづけ係数として入力音声信号のＬＡＦ分
析データと標準パタンとのスペクトル距離を（２）式に
よって演算し、スペクトル距離が最小となるものから小
いさい順に所望の数の標準パタンを可変長フレーム毎に
検索し、こ五らの＊準パタンデータ（１０次１．８Ｐ）
と標準パタン指定コードデータとを出力ライン１９１を
介して標準パタン選択器２２へ出力する。In pattern matching, the spectral distance between the LAF analysis data of the input audio signal and the standard pattern is calculated using equation (2) using the frequency spectral sensitivity obtained as a weighting factor, and the spectral distance is calculated from the one with the minimum spectral distance to the smallest one. A desired number of standard patterns are searched for each variable length frame in order of size, and these five * quasi-pattern data (10th order 1.8P) are
and standard pattern designation code data are output to the standard pattern selector 22 via the output line 191.

標準パタン選択６２２は本発明の最も重要な部分で６４
）、その詳細な動作は後述するが、概略、以下の機能を
有する。ｍｓパタン選択器２２はスペクトル距離計測器
１９によシ予備選択された所望の数の標準パタンからス
ペクトル包絡を算出し、これとスペクトル距離計測器、
Ｌ８Ｆ８Ｐ器を介してＬＰＣ分析器よシ供給される縁形
予測係数から算出されるスペクトル包絡の差を算出し、
前記差が最小となる標準パタンに対応する標準パタン指
定コードデータを符号化器２３へ出力する。Standard pattern selection 622 is the most important part of the invention 64
), its detailed operation will be described later, but generally it has the following functions. The ms pattern selector 22 calculates a spectral envelope from a desired number of standard patterns preselected by the spectral distance measuring device 19, and calculates the spectral envelope from the spectral distance measuring device 19,
Calculating the difference in spectral envelope calculated from the edge shape prediction coefficients supplied by the LPC analyzer via the L8F8P device,
Standard pattern designation code data corresponding to the standard pattern with the minimum difference is output to the encoder 23.

符号化器２３は、このようにして供給された各データを
予め設定する符号形式によって符号化しこれを伝送路２
３１を介して合成側２に伝送する。The encoder 23 encodes each data thus supplied in a preset code format and transmits it to the transmission path 2.
31 to the combining side 2.

合成１１４１１２では伝送路２３１を介して入力した各
極符号化情報の復号化を行ない、標準パタン指定。In the synthesis 114112, each pole encoded information input via the transmission line 231 is decoded and a standard pattern is specified.

コードデータは入力ライン２５１を介してパタン復号器
２５、レピートビットデータは入力ライン２７１を介し
てＬ８Ｆ８Ｐ器２７、短時間音声電力データは入力ライ
ン２８１をプｒして可変利得増幅器２８、有声／無声／
無音判別データおよびピッチデータはそれぞれ人力ライ
ン２９１および３０１を介して切替器２９およびパルス
発生器３０に供給する。Code data is input to the pattern decoder 25 via the input line 251, repeat bit data is input to the L8F8P unit 27 via the input line 271, and short-time audio power data is input to the input line 281 to the variable gain amplifier 28, voiced/unvoiced. /
Silence discrimination data and pitch data are supplied to switch 29 and pulse generator 30 via human power lines 291 and 301, respectively.

パタン復号器２５は、入力した標準パタン指定コードデ
ータによって指定される標準パタンを標準パタンメモリ
２６から出力ライン２６１を介して耽出し、これを出力
ライン２５２を介してＬＳＦ合成器２７に送出する。Ｉ
Ｉ準パタンメモリ２６は分析＄１１１における標準パタ
ンメモリ２ｏと＃１ぼ同一のものであ）、パタン復号器
２５によってＬＳＰ合成器２７に供給されるデータは分
析−のパタン照合の結果入力音声信号の内容に対応して
可変長フレームごとに選択された標準パタンによるＬ８
Ｆ８Ｆ係数すなわちＬＳＦ周波数列である。The pattern decoder 25 extracts the standard pattern specified by the input standard pattern designation code data from the standard pattern memory 26 via the output line 261 and sends it to the LSF synthesizer 27 via the output line 252. I
The I semi-pattern memory 26 is almost the same as the standard pattern memory 2o in the analysis $111), and the data supplied to the LSP synthesizer 27 by the pattern decoder 25 is the input audio signal as a result of pattern matching in the analysis. L8 according to a standard pattern selected for each variable length frame corresponding to the content of
This is an F8F coefficient, that is, an LSF frequency sequence.

ＬＳＰ合成器２７は、こうして入力したＬＳＦ係数列を
含む可変長フレームを、入力ライン２７１を介して受け
るレピートビットデータによってもとの基本７レームご
とに復元し、これを予め設定する近似関数を利用して入
力音声信号の標本化間隔、すなわち合成＠１の窓＠数処
理器１４における標本化周期でＬＳＰ係数を補間する。The LSP synthesizer 27 restores the variable length frame containing the LSF coefficient sequence inputted in this manner into the original basic 7 frame units using the repeat bit data received via the input line 271, and uses an approximation function to set this in advance. Then, the LSP coefficients are interpolated at the sampling interval of the input audio signal, that is, at the sampling period in the synthesis@1 window@number processor 14.

こうして補間処理を受けた基本フレームごとのＬ８Ｆ係
数は全極形モデルによる１０次のＬＳＰ音声合成デジタ
ルフィルタのフィルタ係数として供給される。The L8F coefficients for each basic frame subjected to interpolation processing in this manner are supplied as filter coefficients of a 10th-order LSP speech synthesis digital filter based on an all-pole model.

ＬＳＰ音声合成デジタルフィルタはこのようにして入力
するフィルタ係数と、可変利得増幅器２８から出力ライ
ン２８２を介して入力する音源励振電力とによって音声
合成デジタルフィルタとしての演算を行ない、デジタル
形式の合成音声出力を得てこれを出力ライン２７２ｔ−
介してＬ）／Ａコンバータ３２に送信する〇上述した音源励振電力は、入力音声信号からスペクトル
包絡成分を除いたいわゆる残差電力に対応するものであ
）、入力音声信号を再現する場合にスペクトル包絡成分
としてのＬ８Ｆ係数とともに必要な音源情報を付与する
ものでこれは次のようにして発生する。The LSP voice synthesis digital filter performs calculations as a voice synthesis digital filter using the input filter coefficients and the sound source excitation power input from the variable gain amplifier 28 via the output line 282, and outputs synthesized voice in digital format. and output this to the output line 272t-
The above-mentioned sound source excitation power, which is transmitted to the L)/A converter 32 via the input audio signal, corresponds to so-called residual power obtained by removing the spectral envelope component from the input audio signal. Necessary sound source information is added together with the L8F coefficient as an envelope component, and this is generated as follows.

入力ライン２８１を介して入力した各基本フレームごと
の短時間音声電力データは可変利得増幅器２８に供給さ
れる。Short-term audio power data for each basic frame input via input line 281 is supplied to variable gain amplifier 28 .

一方、パルス発生器３０は入力ライン３０１を介してピ
ッチデータを受け、このピッチデータに対応し予め設定
された周波数のパルスをピッチパルスとして発生しこれ
を出力ライン３０２を介して切替器２９に送出する。On the other hand, the pulse generator 30 receives pitch data via an input line 301, generates a pulse with a preset frequency corresponding to the pitch data as a pitch pulse, and sends this to the switch 29 via an output line 302. do.

切替器２９は、入力ライン２９１を介して受ける有声／
無声／無音判別データが有声を指定するときは上述した
ピッチパルスを選択し、また無声もしくは無音を指定す
るときには雑音発生器３１の出力する白色雑音を出力ラ
イン３１１を介して入力するように切替える動作を行な
う。切替器２９によって選択出力されるパルス発生器３
０もしくは雑音発生ｆＦ３１の出力は、出力ライン２９
２を介して可変利得増幅器２８に供給され、入力ライン
２８１を介して入力した短時間音声電力データの大きさ
に対応する重みづけを受けるように可変増幅されて音源
励振電力として出力ライン２８２に送出される。The switch 29 receives voiced/
When the unvoiced/silent discrimination data specifies voiced, the pitch pulse described above is selected, and when unvoiced or silent, the white noise output from the noise generator 31 is switched to be input via the output line 311. Do the following. Pulse generator 3 selectively output by switch 29
0 or the output of the noise generating fF31 is output on the output line 29.
2 to the variable gain amplifier 28, where it is variably amplified so as to receive weighting corresponding to the magnitude of the short-time voice power data input through the input line 281, and is sent to the output line 282 as sound source excitation power. be done.

ζうしてＬＳＰ合成器２７から出力したデジタル形式の
合成音声信号は次にＤ／Ａコンバータ３２によってアナ
ログ化され、ＬＰＦ３３によりて所要の帯域をフィルタ
リングして合成音声信号として出力ライン３３１に送出
される。ζThe digital synthesized voice signal outputted from the LSP synthesizer 27 is then converted into an analog signal by the D/A converter 32, filtered for a required band by the LPF 33, and sent to the output line 331 as a synthesized voice signal. .

このようにしてＬＳＰ周波数間隔スペクトル感度を重み
づけ係数として計測したスペクトル距離によるパタン照
合を介して行なう入力音声信号の分析１合成が容易に実
施できる。In this way, analysis 1 synthesis of input audio signals can be easily performed through pattern matching using spectral distances measured using LSP frequency interval spectral sensitivities as weighting factors.

次に標準パタン選択器２２の動作を詳細に説明する。第
２図は標準パタン選択器２２の動作を詳細に説明するた
めのブロック図である。Next, the operation of the standard pattern selector 22 will be explained in detail. FIG. 2 is a block diagram for explaining the operation of the standard pattern selector 22 in detail.

スペクトル距離計測器１９によシ予備選択された所望の
数の標準パタンデータは出力２イン１９１を介してω／
α変換器４０へ、標準パタン指定コードデータは２ベル
メモリ４１へ各々供給される。The desired number of standard pattern data preselected by the spectral distance measuring device 19 is sent to ω/
The standard pattern designation code data is supplied to the α converter 40 and the 2-bell memory 41, respectively.

尚、標準パタンデータはスペクトル距離計測器１９での
計測結果に基づき、前記距離の最小のものよシ順々に、
前記距離を昇べきに出力される。ω／α変換器４０マイ
クロプロセッサであ）％スペクトル距離が昇べきとなる
順序で入力される標準パタンデータを所定の番地に記録
する。ω／α変換器は更に記録した標準パタンデータ（
１０次ＬＳＰ）を１０次のαパラメータに変換し、前記
スペクトル距離が昇べきとなる順序で変換結果をαバラ
メ−タメモリ４２へ出力する。尚、Ｌ８Ｆ係数をαパラ
メータへ変換する方法は次の通シである。ＬＳＰ係数は
下記（３）弐におけるωｉであることが板倉氏らによシ
示されている。（音声研究会資料８７９−４６第１０式
）％式％従がって下記■〜■の手順に従りてＬＳＰよシーパラメ
ータへ変換される。In addition, the standard pattern data is based on the measurement results with the spectral distance measuring device 19, and in order from the one with the minimum distance,
The distance is output as the distance increases. Standard pattern data input to the ω/α converter 40 microprocessor in the order in which the % spectral distance increases is recorded at a predetermined address. The ω/α converter also uses the recorded standard pattern data (
The 10th order LSP) is converted into a 10th order α parameter, and the conversion results are output to the α parameter memory 42 in the order in which the spectral distance increases. Note that the method for converting the L8F coefficient into the α parameter is as follows. Itakura et al. have shown that the LSP coefficient is ωi in (3) 2 below. (Voice Study Group Material 879-46, Formula 10) % Formula % Therefore, the LSP is converted into a sea parameter according to the steps ① to ② below.

■Ｐｐ（Ｚ）＝（１−Ｚ−”Ｘｌ　２ｃｏｓ　ａ＋１Ｚ
−’＋Ｚ−”）（１−２ｃｏｓｍ４Ｚ−１＋Ｚ−”　）
　−−−−（１−２ｃｏｓ　０）１６ｚ−”＋Ｚ−”　
）＝（１−Ｚ−凰）（１＋ＰｔＺ−’＋Ｐ１Ｚ−”−１
”・・＋Ｐ１＠Ｚ−”）　　　　（６）ただしｐｔ＝ｐ
ｔ・Ｐ雪＝Ｐ・・・・Ｐ、＝　Ｐ。■Pp(Z)=(1-Z-”Xl 2cos a+1Z
-'+Z-") (1-2cosm4Z-1+Z-")
-----(1-2cos 0)16z-"+Z-"
)=(1-Z-凰)(1+PtZ-'+P1Z-''-1
"...+P1@Z-") (6) However, pt=p
t・P snow=P...P,=P.

テｈ　”）　Ｐ　ｉハＰｐｌＺ）／　（１−Ｚ−”　）
　を展開したときの係数Ｑｐ（Ｚ）＝　（１＋Ｚ−’　）　（１−２ｃｏｓ　ω
、Ｚ−”＋Ｚ−”　）　（１−２ｃｏｓ＃５ｚ−１＋ｚ
＝）、−−−−−（１−２ｃ０８　ｍ、ｚ−ｔ＋ｚ−ｓ
）＝（１＋Ｚ−”）（１＋ｑｌＺ−”＋ｑ雪Ｚ−”４”
＋ｑｔｏ＋Ｚ−”）　　　（７）ただしｑｔ＝Ｑｔ・　
Ｑｓ＝ｑ・・・・ｑｓ＝（１・であシ　ｑｉはＱｔ）／
（１＋Ｚ−１）を展開したときの係数 ”　（２＋（Ｐｔ＋ｑｔ）Ｚ−’＋（Ｐ＊＋ｑｌ）Ｚ−
”＋・・・＋＝丁（Ｐ、。＋ｑｔ。）Ｚ−”　）　−’　Ｚ−”　（（Ｐ
ｔ　−ｑｔ　）Ｚ−”＋（Ｐｇ−ｑｓ）Ｚ−”＋”・＋
（Ｐｌｏ　　　Ｑｔｏ）Ｚ−”）＝１＋４（Ｐｔ＋ｑｔ
）Ｚ−”＋　２　（（Ｐｓ＋ｑｓ）−（Ｐｔ　Ｑｔ）　
）Ｚ−Ｓ　＋　−・・　　　　　（８）ことＫｚ−ｉの係数がαパラメータαｉである。Teh ”) PihaPplZ)/ (1-Z-”)
Coefficient Qp(Z) when expanded = (1+Z-') (1-2cos ω
,Z-"+Z-") (1-2cos#5z-1+z
=), -----(1-2c08 m, z-t+z-s
)=(1+Z-”)(1+qlZ-”+qSnow Z-”4”
+qto+Z-”) (7) However, qt=Qt・
Qs=q...qs=(1・adashi qi is Qt)/
Coefficient when expanding (1+Z-1)"(2+(Pt+qt)Z-'+(P*+ql)Z-
"+...+=Ding(P,.+qt.)Z-" ) -'Z-" ((P
t −qt )Z−”+(Pg−qs)Z−”+”・+
(Plo Qto)Z-”)=1+4(Pt+qt
)Z-”+ 2 ((Ps+qs)-(Pt Qt)
) Z-S + -... (8) The coefficient of Kz-i is the α parameter αi.

再び第２図に於いてスペクトル距離計測器１９゜ＬＳＰ
分析器１８を介してＬＰＣ分析器１５よ）供給される線
形予測係数（ａｉ、ｉ＝１．２・・・１０）はスペクト
ル包絡算出器４３へ入力される。スペクトル包絡算出器
４３はマイクロプルセッサであシ公知の方法によシ離散
的スペク°トル包絡データＦｉ（Ｎ）（Ｎ＝ｏ、１．・
・・、１０１）を算出する。なお、この手法は斉藤、中
田両氏の共著１音声情報処理の基礎”オーム社、昭和５
６年１１月の第７章１スペクトル推定”ページ９６に述
べられている。算出されたＦｉ（Ｎ）は出力ライン４３
１ｔ−介してスペクトル包絡メモリ４４へ供給される。Again in Figure 2, the spectral distance measuring device 19°LSP
The linear prediction coefficients (ai, i=1.2...10) supplied to the LPC analyzer 15 via the analyzer 18 are input to the spectral envelope calculator 43. The spectral envelope calculator 43 is a microprocessor that calculates discrete spectral envelope data Fi(N) (N=o, 1..
..., 101) is calculated. This method was co-authored by Messrs. Saito and Nakata, 1. Fundamentals of Speech Information Processing, Ohmsha, 1930.
Chapter 7 1 Spectrum Estimation” page 96 of November 2006. The calculated Fi(N) is the output line 43.
1t- to the spectral envelope memory 44.

スペクトル包絡メモリ４４は前記Ｂｉ（Ｎ）を記録し必
要に応じてスペクトル包絡差算出器４５へ出力する。α
パラメータメモリはスペクトル距離計測器１９に於ける
スペクトル距−〇昇べき釦αパラメータを頃々にスペク
トル包絡算出器４３へ出力する。スペクトル包絡算出器
４３は離散的スペクトル包絡データ町（’）（Ｎ）　（
Ｎ＝ｏ　、　ｌ、・・・１０１　）　（ただしｊ＝１　
、２・・・、であシーパラメータの供給９番と一致する
）を算出し出力ライン４３２を介してスペクトル包格差
算出器４５へ出力する。スペクトル包絡差算出器４５は
］Ｆｉ（Ｎ）とＰ　／’２Ｎ）とから下記スペクトル距
離ＤＩをを算出し最小距離パタン検索器４６へ出力する。The spectral envelope memory 44 records the Bi(N) and outputs it to the spectral envelope difference calculator 45 as required. α
The parameter memory outputs the spectral distance minus 0 button α parameter in the spectral distance measuring device 19 to the spectral envelope calculator 43 from time to time. The spectral envelope calculator 43 calculates discrete spectral envelope data (') (N) (
N=o, l,...101) (however, j=1
, 2, . The spectral envelope difference calculator 45 calculates the following spectral distance DI from ]Fi(N) and P/'2N) and outputs it to the minimum distance pattern searcher 46.

最小距離パタン検索器４６はｍ１ｎ（ＤＩ）となる！（
αパラメータの供給順序）を決定し、データｊをラベル
メモリ４１へ出力する。ラベルメモリ４１はデータｊＫ
よシスベクトル距離計測器１９にょ一〕予備選択された
標準パタンのうちスペクトル距離が１番目に小いさい標
準パタンの標準パタン指定コードデータを符号化器２３
へ出力する。The minimum distance pattern searcher 46 becomes m1n(DI)! (
The order in which the α parameters are supplied is determined, and the data j is output to the label memory 41. Label memory 41 has data jK
System vector distance measuring device 19] Encoder 23 converts the standard pattern designation code data of the standard pattern with the smallest spectral distance among the preselected standard patterns.
Output to.

尚、予備選択するパタン数を所望の数として説明したが
、これは測定数でも可変数でも差しつかえない。可変数
とする場合には入力音声信号を分析して得られるＬＳＰ
パラメータの最小間隔、予備選択でのスペクトル距離勢
を利用して予備選択パタン数を決定できる。Although the number of patterns to be preselected has been described as a desired number, it may be a measured number or a variable number. If the number is variable, the LSP obtained by analyzing the input audio signal
The number of preliminary selection patterns can be determined using the minimum interval of parameters and the spectral distance trend in preliminary selection.

上述した各実施例における分析側で、ＬＳＦ分析器１Ｂ
によって得られるＬＳＰ係数は高次方程式法によって演
算しているが、これは高次方程式法とともＫよ〈知られ
た零点探索法によりて実施してもよく、またこのＬＡＦ
係数は可変長フレームととに分析抽出しているが、この
可変長フレームは所望に応じ固定長フレームとしても差
支えない。On the analysis side in each of the above embodiments, LSF analyzer 1B
The LSP coefficients obtained by
Although the coefficients are analyzed and extracted in a variable length frame, the variable length frame may be a fixed length frame if desired.

また、Ｌ８Ｆ８Ｆ係数の前処理として行なわれるプリエ
ンファシス処理およびＬａｇ関数処理は分析および合成
すべき入力音声信号の特徴、音声合成デジタルフィルタ
の内容、データビット数の配分勢を勘案し所望に応じて
実施の有無を選択しうろことは明らかである。In addition, the pre-emphasis processing and Lag function processing performed as pre-processing of the L8F8F coefficients are performed as desired, taking into account the characteristics of the input audio signal to be analyzed and synthesized, the contents of the speech synthesis digital filter, and the distribution of the number of data bits. It is clear that there is a choice to be made with or without.

さらに、上述した各実施例においては１０次のＬ８Ｆ係
数を利用して分析および合成を実施しているが、Ｌ８Ｆ
係数の次数を他の次数としても何様に実施しうろことは
明らかである。Furthermore, in each of the above-mentioned examples, analysis and synthesis are performed using the 10th order L8F coefficient, but L8F
It is obvious that the coefficients can be implemented in other orders as well.

（発明の効果）以上説明したように本発明によれば％ＬａＰ型パタンマ
ツチングボコーダにおいて、標準パタンのＬＳＰ係数と
入力音声信号のＬＡＦ係数とのスペクトル距離をＬＳＰ
係数のスペクトル感度を介して算出し、複数の標準パタ
ン候補を予備選択し、更に予備選択された標準パタン候
補から、実際のスペクトル包絡データを介して算出され
るスペクトル距離に基づいて最良の標準パタンを選択す
ることにより、ＬＳＰ周波数間隔によりＬＳＰ周波数ス
ペクトル感度が異なるために必ずしも最適な標準パタン
か選択されない欠点を解決し、且つ、予備選択によシパ
タン候補を限定するととくよシ演算量の増加を最小限に
とどめるという効果がある。(Effects of the Invention) As explained above, according to the present invention, in the %LaP pattern matching vocoder, the spectral distance between the LSP coefficient of the standard pattern and the LAF coefficient of the input audio signal is
The spectral sensitivity of the coefficient is calculated, multiple standard pattern candidates are preselected, and from the preselected standard pattern candidates, the best standard pattern is calculated based on the spectral distance calculated through the actual spectral envelope data. By selecting , it solves the problem that the optimal standard pattern is not necessarily selected because the LSP frequency spectrum sensitivity differs depending on the LSP frequency interval, and also minimizes the increase in the amount of computation by limiting the pattern candidates by preliminary selection. It has the effect of limiting

[Brief explanation of the drawing]

第１図囚、＠は本発明の第一の実施例によるＬａＰ型パ
タンマツチングボコーダの分析側（５）および合成側但
）の構成を示すブロック図、第２図は本発明に於いて特
に重要な標準パタン選択器２２を詳細に説明するための
ブロック図である。ｌ・・・・・・分析側、２・・・・・・合成側、１１・
・・・・・ＬＰＦ。１２・・・・・・ム／Ｄコンバータ、１３・・・・・・
窓関数処理器、１４・・・・・・自己相関係数計測器、
１５・・・・・・ＬＰＣ分析器、１６・・・・・・有声
／無声／焦音判別器、１７・・・・・・ピッチ抽出器、
１８・・・・・・ＬＳＰ分析器、１９・・・・・・スペ
クトル距離計測器、２０・・・・・・標準パ／ｙメモリ
、２１・・・・・・周波数間隔スペクトル感度メモリ、
２２・・・・・・標準パタン選択器、２３・・・・・・
符号化器、２４・・・・・・復号器、２５・・・・・・
パタン復号器、２６・・・・・・標準パタンメモリ、２
７・・・・・・ＬＳＰ合成器、２８・・・・・・可変利
得増暢器、２９・・・・・・切替器、３０・・・・・・
パルス発生器、３１・・・・・・雑音発生器、３２・・
・・・・Ｄ／Ａコンバータ、３３・・・・・・ＬＰＦ、
４０・・・・・・ω／α変換器、４１・・・・・・ラベ
ルメモリ、４２・・・・・・パラメータメモリ、４３・
・・・・・スペクトル包絡算出器、４４・・・・・・ス
ペクトル包絡メモリ、４５・・・・・・スペクトル包絡
差算出器、４６・・・・・・最小距離パタン検索器。代理人　弁理士　　内　原　　　晋１、−、’、：、ｙ− ゝ−一一、、、−Figure 1 is a block diagram showing the structure of the analysis side (5) and the synthesis side of the LaP pattern matching vocoder according to the first embodiment of the present invention, and Figure 2 is a block diagram showing the configuration of the analysis side (5) and synthesis side of the LaP pattern matching vocoder according to the first embodiment of the present invention. FIG. 2 is a block diagram for explaining in detail the important standard pattern selector 22. FIG. l... Analysis side, 2... Synthesis side, 11.
...LPF. 12...Mu/D converter, 13...
Window function processor, 14...Autocorrelation coefficient measuring device,
15... LPC analyzer, 16... Voiced/unvoiced/focused discriminator, 17... Pitch extractor,
18... LSP analyzer, 19... Spectral distance measuring device, 20... Standard PA/Y memory, 21... Frequency interval spectral sensitivity memory,
22...Standard pattern selector, 23...
Encoder, 24... Decoder, 25...
Pattern decoder, 26...Standard pattern memory, 2
7...LSP synthesizer, 28...Variable gain amplifier, 29...Switcher, 30...
Pulse generator, 31...Noise generator, 32...
...D/A converter, 33...LPF,
40...ω/α converter, 41... Label memory, 42... Parameter memory, 43.
... Spectrum envelope calculator, 44 ... Spectrum envelope memory, 45 ... Spectrum envelope difference calculator, 46 ... Minimum distance pattern search device. Agent Patent Attorney Susumu Uchihara1,-,',:,y- ゝ-11,,-

Claims

[Claims]

LSP (Line Spectrum Pa) of audio materials
ir) In an LSP amount pattern matching vocoder that synthesizes an input audio signal by comparing a standard pattern regarding coefficient distribution with a pattern regarding LSP coefficients obtained by LSP analysis of an input audio signal, the LSP coefficients of the standard pattern and the means for preliminarily selecting a small number of standard patterns by measuring a spectral distance by a weighted inner product with an LSP coefficient of an input audio signal; and analyzing a spectral envelope signal calculated from the preselected standard patterns and the input audio signal. and means for selecting a standard pattern using a spectral distance calculated from the spectral envelope signal obtained by the LSP pattern matching vocoder.