JP3164808B2

JP3164808B2 - Variable frame vocoder

Info

Publication number: JP3164808B2
Application number: JP01519190A
Authority: JP
Inventors: 哲田口
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1989-01-25
Filing date: 1990-01-24
Publication date: 2001-05-14
Anticipated expiration: 2016-05-14
Also published as: JPH02282300A

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は可変長フレーム型ボコーダに関する。Description: TECHNICAL FIELD The present invention relates to a variable-length frame vocoder.

[Conventional technology]

入力音声信号を一定長の分析フレームごとに分析して
得たスペクトル包絡パラメータを分析フレームに対応し
てすべて送出する代りに、連続するＭ（例えば20）個の
分析フレームから成る区分ごとにＮ（例えば５）個の代
表分析フレームを選択し、各代表分析フレームのスペク
トル包絡パラメータと共にこれら代表分析フレームによ
って代表される分析フレームの数、いわゆるリピートビ
ット（repeat bit）を送出することによって、必要とす
る伝送情報量を低減する可変長フレーム型ボコーダが知
られている。Instead of sending out all the spectral envelope parameters obtained by analyzing the input speech signal for each analysis frame of a certain length in accordance with the analysis frame, N (for each 20 (for example, 20) analysis frames) For example, by selecting 5) representative analysis frames and transmitting the number of analysis frames represented by these representative analysis frames, the so-called repeat bit, together with the spectral envelope parameter of each representative analysis frame. 2. Description of the Related Art Variable-length frame vocoders that reduce the amount of transmission information are known.

区分ごとに選択される代表分析フレームは区分ごとの
全分析フレームによって示されるスペクトル包絡パラメ
ータの分布を最適近似関数で近似せしめることによって
得られ、この最適近似関数は矩形、台形もしくは直線等
の近似関数がそれぞれの適用目的によって使いわけら
れ、かつ、通常は動的計画法（Dynamic Programming;D
P）を介して関数設定がなされている。このDP手法を介
した区分的最適関数近似の一例が特開昭62−998号公報
に詳細に記述されている。The representative analysis frame selected for each section is obtained by approximating the distribution of the spectral envelope parameters indicated by all the analysis frames for each section with an optimal approximation function, and this optimal approximation function is an approximate function such as a rectangle, a trapezoid, or a straight line. Are used for different purposes, and are usually called Dynamic Programming (D
Function settings have been made via P). An example of the piecewise optimal function approximation through the DP method is described in detail in JP-A-62-998.

[Problems to be solved by the invention]

可変長フレーム型ボコーダにおける歪は、分析次数や
量子化数が有限であることに起因して代表分析フレーム
そのものの表現に関して発生する歪と各分析フレームを
代表分析フレームで代替することにより発生する代替歪
とがある。音声の性質は時変であり、特に、スペクトル
包絡の性質の時変性により、分析次数，量子化数，代表
分析フレーム数等が一定であれば、総歪におけるこれら
各歪の割合は時々刻々変化する。Distortion in the variable-length frame type vocoder is caused by the representation of the representative analysis frame itself due to the finite number of analysis orders and quantization, and the substitution caused by substituting each analysis frame with the representative analysis frame. There is distortion. The nature of speech is time-varying, especially if the order of analysis, the number of quantizations, the number of representative analysis frames, etc. are constant due to the time-varying nature of the spectral envelope, the proportion of each of these distortions in the total distortion changes every moment I do.

ところが、上述した従来の可変長フレーム型ボコーダ
は、区分内の代表分析フレーム数を固定としているの
で、ほとんどの区分では総歪の見地からの最適化が望め
ない欠点がある。However, the above-mentioned conventional variable-length frame vocoder has a drawback that optimization from the viewpoint of total distortion cannot be expected in most of the sections because the number of representative analysis frames in the section is fixed.

本発明の目的は、総歪の見地から、従来におけるより
最適化を計ることができる可変長フレーム型ボコーダを
提供することにある。An object of the present invention is to provide a variable-length frame type vocoder that can be more optimized than before in terms of total distortion.

[Means for solving the problem]

本発明の可変長フレーム型ボコーダは、区分的最適関
数近似を用いた可変長フレーム型ボコーダにおいて、区
分内の代表分析フレーム数をあらかじめ定めた範囲内で
最適化する手段を備えている。A variable-length frame vocoder according to the present invention includes means for optimizing the number of representative analysis frames in a section within a predetermined range in a variable-length frame vocoder using piecewise optimal function approximation.

〔Example〕

次に、本発明について図面を参照して説明する。 Next, the present invention will be described with reference to the drawings.

第１図は本発明の第１の実施例における分析側を示す
ブロック図、第２図は同じく合成側を示すブロック図で
ある。FIG. 1 is a block diagram showing the analysis side in the first embodiment of the present invention, and FIG. 2 is a block diagram showing the synthesis side in the same manner.

第１図に示す分析側は、A/D変換器１と、LPC分析器２
と、音源分析器３と、量子化器４と、フレーム選択器５
〜７と、代表フレーム数決定器８と、多重化回路９とを
具備して構成されている。The analysis side shown in FIG. 1 includes an A / D converter 1 and an LPC analyzer 2
, Sound source analyzer 3, quantizer 4, frame selector 5
7, a representative frame number determiner 8, and a multiplexing circuit 9.

フレーム選択器５は、量子化器51と、複号化器52と、
歪算出器53と、バッファメモリ54〜57と、区分的最適近
似器58とを備えて構成されている。フレーム選択器６及
び７も、各構成要素が行う処理の数値的条件が必ずしも
等しくない点を除いては、フレーム選択器５と同じに構
成されている。The frame selector 5 includes a quantizer 51, a decoder 52,
It comprises a distortion calculator 53, buffer memories 54 to 57, and a piecewise optimum approximator 58. The frame selectors 6 and 7 are also configured in the same manner as the frame selector 5 except that the numerical conditions of the processes performed by the respective components are not necessarily equal.

遮断周波数3,4kHzの低域フィルタで帯域制限されてい
る入力音声信号は、A/D変換器１において、標本化周波
数8kHzで標本化され、所定のビット数に量子化される。An input audio signal whose band is limited by a low-pass filter having a cutoff frequency of 3 and 4 kHz is sampled by the A / D converter 1 at a sampling frequency of 8 kHz and quantized to a predetermined number of bits.

LPC分析器２はA/D変換器１から入力した量子化音声信
号の窓処理及び分析を行う。量子化音声信号の30ms分ず
つが内蔵メモリにストアされつつ10msの周期で読出され
ハミング関数による窓処理が行われて10msの分析フレー
ムとして出力される。分析フレームごとの量子化音声信
号は線形予測分析され、本実施例では線形予測係数（Li
near Prediction Coefficient;LPC）の１つである８次
のＫパラメータが抽出される。The LPC analyzer 2 performs window processing and analysis on the quantized audio signal input from the A / D converter 1. The quantized audio signal is read out at a period of 10 ms while being stored in the built-in memory at intervals of 30 ms, subjected to window processing by the Hamming function, and output as a 10 ms analysis frame. The quantized speech signal for each analysis frame is subjected to linear prediction analysis, and in this embodiment, the linear prediction coefficient (Li
An eighth-order K parameter, which is one of the near prediction coefficients (LPC), is extracted.

フレーム選択器５は、分析フレームの連続する20個
分、すなわち200msを１区分とし、区分内の代表分析フ
レーム数を４とし、量子化歪とフレーム代替歪との総量
が最小になるように区分的最適関数近似を行う。The frame selector 5 divides 20 consecutive analysis frames, that is, 200 ms, into one section, sets the number of representative analysis frames in the section to 4, and divides the quantization distortion and the frame substitution distortion so that the total amount thereof is minimized. Approximate optimal function.

量子化器51はLPC分析器２から分析フレームごとに入
力するＫパラメータを、代表分析フレーム数が４である
場合にとり得る量子化数で、量子化し、量子化したＫパ
ラメータの20フレーム分をバッファメモリ54に書込む。
複号化器52は量子化器51から入力した量子化されたＫパ
ラメータを複号化し、複号したＫパラメータの20フレー
ム分をバッファメモリ55に書込む。歪算出器53は、LPC
分析器２から入力したＫパラメータ及び複号化器52から
入力したＫパラメータを用いて分析フレームごとの量子
化歪を算出し、算出した量子化歪の20フレーム分をバッ
ファメモリ56に書込む。LPC分析器２が出力した、量子
化前の、Ｋパラメータも20フレーム分がバッファメモリ
57に書込まれる。区分的最適近似器58は、バッファメモ
リ54〜57の記憶内容を用い、量子化歪とフレーム代替歪
との総量が最小になるようにDP手法を介した区分的最適
関数近似を行い、４個の代表分析フレームを抽出し、抽
出した各代表分析フレームの量子化されたＫパラメータ
を量子化パラメータ102として出力し、各代表分析フレ
ームが代表する分析フレーム数をリピートビット103と
して出力し、又、各代表分析フレームの量子化歪及びフ
レーム代替歪の総量を総歪101として出力する。The quantizer 51 quantizes the K parameters input from the LPC analyzer 2 for each analysis frame by the number of quantizations that can be taken when the number of representative analysis frames is 4, and buffers 20 frames of the quantized K parameters. Write to memory 54.
The decoder 52 decodes the quantized K parameter input from the quantizer 51 and writes the decoded K parameter for 20 frames into the buffer memory 55. The distortion calculator 53 is an LPC
The K-parameter input from the analyzer 2 and the K-parameter input from the decoder 52 are used to calculate the quantization distortion for each analysis frame, and the calculated quantization distortion for 20 frames is written into the buffer memory 56. 20 frames of K parameters output from the LPC analyzer 2 before quantization are also stored in the buffer memory.
Written in 57. The piecewise optimal approximator 58 uses the stored contents of the buffer memories 54 to 57 to perform a piecewise optimal function approximation via the DP method so that the total amount of the quantization distortion and the frame substitution distortion is minimized. The representative analysis frame is extracted, the K parameter quantized of each extracted representative analysis frame is output as a quantization parameter 102, the number of analysis frames represented by each representative analysis frame is output as a repeat bit 103, The total amount of quantization distortion and frame substitution distortion of each representative analysis frame is output as total distortion 101.

なお、区分的最適近似器58は先に述べた特開昭62−99
8号公報に記載されているフレーム選択器15の（量子化
をパターンマッチングによって行う場合の）処理アルゴ
リズムを一般化した処理アルゴリズムで動作すればよい
ので、同公報の記述から区分的最適近似器58の具体的実
現は容易である。Incidentally, the piecewise optimal approximator 58 is disclosed in Japanese Patent Laid-Open No. 62-99
Since the operation of the frame selector 15 described in Japanese Patent Publication No. 8 (in the case where quantization is performed by pattern matching) is generalized, the processing algorithm may be used. Is easy to realize.

フレーム選択器６は、代表分析フレーム数を５とし、
それにあわせてＫパラメータの量子化数を低減すること
を除いてはフレーム選択器５と同じ動作をする。フレー
ム選択器７も、代表分析フレーム数を６とし、それにあ
わせて量子化数を更に低減することを除いてはフレーム
分析器５と同じ動作をする。The frame selector 6 sets the number of representative analysis frames to 5,
The operation is the same as that of the frame selector 5 except that the number of quantizations of the K parameter is reduced accordingly. The frame selector 7 operates in the same manner as the frame analyzer 5 except that the number of representative analysis frames is 6, and the number of quantizations is further reduced accordingly.

代表フレーム数決定器８は、フレーム選択器５〜７か
ら入力した総歪を比較し、総歪が最小であるフレーム選
択器から入力した量子化パラメータ及びリピートビット
を選択し、量子化パラメータ202及びリピートビット203
として出力する。又、選択したフレーム選択器の代表分
析フレーム数を代表フレーム数204として出力する。The representative frame number determiner 8 compares the total distortions input from the frame selectors 5 to 7, selects the quantization parameter and the repeat bit input from the frame selector having the minimum total distortion, and selects the quantization parameters 202 and Repeat bit 203
Output as Also, the number of representative analysis frames of the selected frame selector is output as the number of representative frames 204.

音源分析器３は、A/D変換器１から入力した量子化音
声信号を10msの分析フレーム周期で矩形関数により窓処
理し、公知の手法により分析フレームごとに有声／無声
の区別及びピッチ周期のデータを抽出する。これらデー
タは量子化器４で量子化され、音源データ205となる。The sound source analyzer 3 performs window processing on the quantized audio signal input from the A / D converter 1 with a rectangular function at an analysis frame period of 10 ms, and performs a voiced / unvoiced discrimination and pitch period analysis for each analysis frame by a known method. Extract data. These data are quantized by the quantizer 4 to become sound source data 205.

多重化回路９は、入力した量子化パラメータ202,リピ
ートビット203,代表フレーム数204及び音源データ205を
代表フレーム数204により定まる所定のフォーマットで
多重化して合成側に送出する。The multiplexing circuit 9 multiplexes the input quantization parameter 202, repeat bit 203, representative frame number 204, and sound source data 205 in a predetermined format determined by the representative frame number 204, and transmits the multiplexed data to the combining side.

以上が第１図に示す分析側の動作の全体的説明であ
る。次に第１図に示す分析側の主要な構成部分につい
て、本発明を1200BPSボコーダに適用した場合の動作を
詳細に説明する。The above is the overall description of the operation on the analysis side shown in FIG. Next, the operation of the main components on the analysis side shown in FIG. 1 when the present invention is applied to a 1200 BPS vocoder will be described in detail.

第５図（Ａ），（Ｂ），（Ｃ）はＭ＝20で構成される
区分内の代表分析フレーム数Ｎとして各々4,5,6が選択
された場合に於ける多重化回路９の１区分分の出力デー
タのフレーム構成を示す。第５図（ａ）に示す通りＮ＝
４の場合、第１〜第４代表フレームのＫパラメータには
各々35bitsが割当てられる。又、各代表フレームのＫパ
ラメータk₁,k₂,…,k₈には各々8,6,5,4,3,3,3,3bitsが割
当てられる。LPC分析器２で10mSEC毎に分析された20組
のＫパラメータベクトルは各々所望のbit数に量子化され、量子化Ｋパラメータ
ベクトルとしてバッファメモリ54と複号化器52とへ送出される。
バッファメモリ54は量子化Ｋパラメータベクトルを記憶
する。複号化器52は量子化Ｋパラメータベクトルを複号
化し、これを複号化Ｋパラメータベクトルとしてバッファメモリ55と歪算出器53とへ出力する。バ
ッファメモリ55は複号化Ｋパラメータベクトルを記憶す
る。歪算出器53は下記（１）式により20ケのフレームの
量子化歪δ^(j),（ｊ＝1,…,20）を算出する。FIGS. 5A, 5B, and 5C show the multiplexing circuit 9 when 4, 5, and 6 are selected as the number N of representative analysis frames in a section having M = 20. 3 shows a frame configuration of output data for one section. As shown in FIG. 5 (a), N =
In the case of 35, 35 bits are assigned to each of the K parameters of the first to fourth representative frames. ₈ , 6, ₅ , 4, ₃ , ₃ , ₃ , and 3 bits are assigned to the K parameters k ₁ , k ₂ ,..., K ₈ of each representative frame, respectively. 20 sets of K parameter vectors analyzed every 10mSEC by LPC analyzer 2 Are each quantized to a desired number of bits, and a quantized K parameter vector Is sent to the buffer memory 54 and the decoder 52.
The buffer memory 54 stores the quantization K parameter vector. The decryptor 52 decrypts the quantized K parameter vector, and converts this to the decrypted K parameter vector. To the buffer memory 55 and the distortion calculator 53. The buffer memory 55 stores the decrypted K parameter vector. The distortion calculator 53 calculates the quantization distortion δ ^(j) , (j = 1,..., 20) for 20 frames by the following equation (1).

ここでw_i（ｉ＝1,…８）はＫパラメータのスペクトル
感度である。算出された20ケの量子化歪δ^(j),（ｊ＝1,
…,20）はバッファメモリ56へ書込まれる。 Here, w _i (i = 1,..., 8) is the spectral sensitivity of the K parameter. The calculated 20 quantization distortions δ ^(j) , (j = 1,
.., 20) are written to the buffer memory 56.

区分的最適近似器58はバッファメモリ57に書込まれて
いるＫパラメータベクトルとバッファメモリ55に書込まれている複号化Ｋパラメー
タベクトルとバッファメモリ56に書込まれている量子化歪δ^(j),
（ｊ＝1,…,20）を用いて、特開昭62−998号公報に開示
されたアルゴリズムを利用して代表フレーム数Ｎ＝４の
場合に可能な最小の総歪▲Ｄ⁽⁴⁾ _min▼を得る事のできる
代表フレームと、これら代表フレームに代表される区間
を決定する。尚、本実施例に於いては各代表フレームに
代表される最大区間長を16フレームに限定している。こ
の限定処理はリピートビットの不必要な割当てを防ぐた
めに実施されるものであり、又、この程度の限定は音質
に殆んど影響を与えない。因に、最大区間長を限定しな
い場合、取り得る最大区間長は17である。総歪▲Ｄ⁽⁴⁾
_min▼は出力ライン101を介して、各代表フレームの量子
化Ｋパラメータはバッファメモリ54より読出され、出力
ライン102を介して、各代表フレームに代表される区間
長はリピートビットとして出力ライン103を介して各々
代表フレーム数決定器８へ出力される。代表フレーム数
決定器８はとなる。▲Ｄ⁽⁾ _min▼を検索し、この検索結果に基づ
き最適代表フレーム数を決定する。尚▲Ｄ⁽⁵⁾ _min▼，▲
Ｄ⁽⁶⁾ _min▼は各々フレーム選択器6,フレーム選択器７で
算出された総歪である。＝４が最適の場合、フレーム
選択器５で量子化及び選択され決定された量子化Ｋパラ
メータ，リピートビットが多重化回路９へ出力される。
多重化回路９は＝４の場合、第５図（Ａ）に示す構成
のフレームを組立て合成側へ出力する。音源パラメータ
の構成はに無関係に固定であり第５図（Ｃ）と同一で
ある。無論Ｍ＝20は固定であり、リピートビットは任意
の３ケを伝送すれば他の１ケは伝送する必要がない。The piecewise optimal approximator 58 is a K-parameter vector written in the buffer memory 57. And the decrypted K parameter vector written in the buffer memory 55 And the quantization distortion δ ^(j) written in the buffer memory 56,
(J = 1, ..., 20 ) with the total distortion ▲ D of the smallest possible in the case of using the algorithm disclosed in JP-A-62-998 representative frame number N = ^{4 (4)} _The representative frames from which _min ▼ can be obtained and the sections represented by these representative frames are determined. In this embodiment, the maximum section length represented by each representative frame is limited to 16 frames. This limiting process is performed to prevent unnecessary assignment of repeat bits, and this limit has almost no effect on sound quality. If the maximum section length is not limited, the maximum section length that can be taken is 17. Total strain D ⁽⁴⁾
_min ▼ is output via the output line 101, the quantization K parameter of each representative frame is read from the buffer memory 54, and the section length represented by each representative frame is output via the output line 102 as a repeat bit. Each of them is output to the representative frame number determiner 8. The representative frame number determiner 8 Becomes ▲ D ⁽⁾ _min ▼ is searched, and the optimum number of representative frames is determined based on the search result. ▲ D ⁽⁵⁾ _min ▼ 、 ▲
D ⁽⁶⁾ _min ▼ is the total distortion calculated by the frame selector 6 and the frame selector 7, respectively. When = 4 is optimal, the quantization K parameter and repeat bit determined and quantized and selected by the frame selector 5 are output to the multiplexing circuit 9.
When = 4, the multiplexing circuit 9 assembles the frame having the configuration shown in FIG. 5A and outputs the frame to the combining side. The configuration of the sound source parameters is fixed irrespective of the parameters, and is the same as that in FIG. 5 (C). Of course, M = 20 is fixed, and if any three repeat bits are transmitted, there is no need to transmit the other one.

フレーム選択器６は＝５の場合の処理を司どるもの
であり最大区間長は９に設定されている。フレーム選択
器７は＝６の場合の処理を司どるものであり、最大区
間長は９に設定されている。尚、第５図（Ａ），
（Ｂ），（Ｃ）に於いて第１ビット“F"はフレーム同期
用のビット，第５図（Ａ），（Ｂ）の第2,第３ビット
“00",及び“01",並びに第５図（Ｃ）の第２ビット“1"
は代表フレーム数を指定するビットである。The frame selector 6 controls the processing in the case of = 5, and the maximum section length is set to 9. The frame selector 7 controls the processing in the case of = 6, and the maximum section length is set to 9. FIG. 5 (A),
In (B) and (C), the first bit "F" is a bit for frame synchronization, and the second and third bits "00" and "01" in FIGS. 5A and 5B, and Second bit "1" in FIG. 5 (C)
Is a bit that specifies the number of representative frames.

次に合成側の動作の全体的説明を図を利用して行う。 Next, the overall operation of the synthesizing side will be described with reference to the drawings.

第２図に示す合成側は、分離回路11と、複号化器12
と、パラメータ補間器13と、バッファメモリ14と、複号
化器15と、パルス発生器16と、雑音発生器17と、切替器
18と、LPC合成フィルタ19と、D/A変換器20とを備えて構
成されている。The combining side shown in FIG. 2 includes a separating circuit 11 and a demultiplexer 12.
, A parameter interpolator 13, a buffer memory 14, a decoder 15, a pulse generator 16, a noise generator 17, and a switch
18, an LPC synthesis filter 19, and a D / A converter 20.

分離回路11は多重化回路９から伝送されてきた多重化
信号を分離し、量子化パラメータ202を複号化器12へ、
リピートビット203をパラメータ補間器13へ、代表フレ
ー数204を複号化器12及びパラメータ補間器13へ、又、
音源データ205を複号化器15へ出力する。The separation circuit 11 separates the multiplexed signal transmitted from the multiplexing circuit 9 and sends the quantization parameter 202 to the demultiplexer 12.
Repeat bit 203 to parameter interpolator 13, representative frame number 204 to decryptor 12 and parameter interpolator 13, and
The sound source data 205 is output to the decoder 15.

複号化器12は、代表フレーム数204を参照して量子化
パラメータ202を複号し、複号したＫパラメータをαパ
ラメータに変換してパラメータ補間器13へ出力する。パ
ラメータ補間器13は、代表フレーム数204を参照し、入
力したαパラメータをリピートビット203で指定された
分析フレームの回数分繰返してバッファメモリ14に書込
む。The decryptor 12 decrypts the quantization parameter 202 with reference to the representative frame number 204, converts the decrypted K parameter into an α parameter, and outputs it to the parameter interpolator 13. The parameter interpolator 13 refers to the representative frame number 204, and repeats the input α parameter as many times as the number of analysis frames specified by the repeat bit 203, and writes the same into the buffer memory 14.

複号化器15は、音源データ205を複号し、ピッチ周期
のデータをパルス発生器16へ出力し、又、有声／無声の
区別によって切替器18を制御する。有声のときは、入力
したピッチ周期データで指定されたピッチ周期でパルス
発生器16がパルス列を発生し、このパルス列は切替器18
により選択出力される。無声のときは雑音発生器17の出
力した白色雑音が切替器18によって選択出力される。The decoder 15 decodes the sound source data 205, outputs pitch cycle data to the pulse generator 16, and controls the switch 18 by voiced / unvoiced discrimination. When voiced, the pulse generator 16 generates a pulse train at the pitch cycle specified by the input pitch cycle data.
Is selected and output. When there is no voice, the white noise output from the noise generator 17 is selectively output by the switch 18.

LPC合成フィルタ19は、分析フレームごとにバッファ
メモリ14から読出したαパラメータをフィルタ係数とし
て用い、切替器18から入力する音源により駆動されてデ
ィジタル音声信号を合成する。このディジタル音声信号
がD/A変換器20によりアナログ化されて出送音声信号と
なる。The LPC synthesis filter 19 is driven by a sound source input from the switch 18 and synthesizes a digital audio signal using the α parameter read from the buffer memory 14 for each analysis frame as a filter coefficient. This digital audio signal is converted into an analog signal by the D / A converter 20 to be an outgoing / transmitting audio signal.

以上説明したように第１図及び第２図に示す実施例は
区分内の代表フレーム数を（Ｋパラメータの量子化数と
共に）４個〜６個の範囲内で、量子化歪とフレーム代替
歪との総量が最小となるように、適応制御している。As described above, in the embodiment shown in FIGS. 1 and 2, the number of representative frames in a section is set within the range of 4 to 6 (along with the quantization number of the K parameter), and the quantization distortion and the frame replacement distortion are reduced. And the adaptive control is performed so that the total amount becomes minimum.

なお、第１図及び第２図に示す実施例はLPCとしてＫ
パラメータを用いているが、αパラメータ，線スペクト
ル対（Line Spectrum Pairs;LSP）等、その他のLPCを用
いることもできる。The embodiment shown in FIG. 1 and FIG.
Although a parameter is used, other LPCs such as an α parameter and a line spectrum pair (LSP) can also be used.

以上、第１図及び第２図に示す実施例について説明し
た。The embodiment shown in FIGS. 1 and 2 has been described above.

第３図は本発明の第２の実施例における分析側を示す
ブロック図、第４図は同じく合成側を示すブロック図で
ある。FIG. 3 is a block diagram showing the analysis side in the second embodiment of the present invention, and FIG. 4 is a block diagram showing the synthesis side in the same manner.

第３図に示す分析側は、第１図に示す分析側のLPC分
析器2,フレーム選択器５〜7,代表フレーム数決定器8,多
重化回路９をLPC分析器31,フレーム選択器32〜34,LPC次
数決定器35,多重化回路36で置換えて構成されている。
フレーム選択器32〜34は、各構成要素が行う処理の数値
的条件が必ずしも等しくない点を除いては、第１図に示
す分析側におけるフレーム選択器５と同じに構成されて
おり、その各構成要素には、フレーム選択器５の各構成
要素に付けた参照番号51〜58と対応するように、参照番
号61〜68を付けてある。第４図に示す合成側は、第２図
に示す合成側の分離回路11,複号化器12,パラメータ補間
器13,バッファメモリ14,LPC合成フィルタ19を分離回路4
1,複号化器42,パラメータ補間器43,バッファメモリ44,L
PC合成フィルタ45で置換えて構成される。The analysis side shown in FIG. 3 includes the analysis side LPC analyzer 2, the frame selectors 5 to 7, the representative frame number determiner 8, and the multiplexing circuit 9 which are the LPC analyzer 31, the frame selector 32 shown in FIG. , An LPC order determiner 35, and a multiplexing circuit 36.
The frame selectors 32 to 34 have the same configuration as the frame selector 5 on the analysis side shown in FIG. 1 except that the numerical conditions of the processing performed by the respective components are not always equal. The components are assigned reference numerals 61 to 68 so as to correspond to the reference numerals 51 to 58 assigned to the respective components of the frame selector 5. The combining side shown in FIG. 4 includes a combining circuit 11, a decoding unit 12, a parameter interpolator 13, a buffer memory 14, and an LPC combining filter 19 on the combining side shown in FIG.
1, Decoder 42, Parameter interpolator 43, Buffer memory 44, L
It is configured by replacing with a PC synthesis filter 45.

LPC分析器31の動作は、分析次数が12次である点を除
いては、LPC分析器２の動作と同じである。The operation of the LPC analyzer 31 is the same as the operation of the LPC analyzer 2 except that the analysis order is twelfth.

フレーム選択器32は、LPC分析器31から入力した12次
のＫパラメータを８次のＫパラメータとして扱い、200m
sの区分内の代表分析フレーム数を６とし、量子化歪と
フレーム代替歪との総量が最小になるように区分的最適
関数近似を行う。The frame selector 32 treats the 12th-order K parameter input from the LPC analyzer 31 as an 8th-order K parameter, and
The representative analysis frame number in the section of s is set to 6, and the piecewise optimal function approximation is performed so that the total amount of the quantization distortion and the frame substitution distortion is minimized.

量子化器61はLPC分析器31から分析フレームごとに入
力する12次のＫパラメータのうち９次以降を無視して８
次のＫパラメータとして扱い量子化を行う。歪算出器63
は量子化されていない12次のＫパラメータを基準として
量子化器61における量子化の歪を算出する。区分的最適
近似器68は量子化されていない12次のＫパラメータを
（代表分析フレームの）８次の複号化Ｋパラメータで代
替する場合に発生する歪をフレーム代替歪とする。以上
述べた点を除き、フレーム選択器32の動作はフレーム選
択器５の動作と同じである。The quantizer 61 ignores the ninth and subsequent K parameters of the 12th K parameter input for each analysis frame from the LPC
It is treated as the next K parameter and quantization is performed. Strain calculator 63
Calculates the distortion of quantization in the quantizer 61 on the basis of the unquantized 12-order K parameter. The piecewise optimal approximator 68 uses the distortion generated when substituting the unquantized 12th order K parameter with the 8th order decoding K parameter (of the representative analysis frame) as the frame substitution distortion. Except as described above, the operation of the frame selector 32 is the same as the operation of the frame selector 5.

フレーム分析器33は、入力する12次のＫパラメータを
10次のＫパラメータとして扱い、代表分析フレーム数を
５とすることを除いてはフレーム選択器32と同じ動作を
する。フレーム選択器34も、入力する12次のＫパラメー
タをそのまま12次として扱い、代表分析フレーム数を４
とすることを除いてはフレーム選択器32と同じ動作をす
る。The frame analyzer 33 calculates the 12th order K parameter to be input.
The operation is the same as that of the frame selector 32 except that the number of representative analysis frames is set to 5 by treating it as a 10th-order K parameter. The frame selector 34 also treats the input 12th-order K parameter as it is as 12th order, and reduces the number of representative analysis frames to 4
The operation is the same as that of the frame selector 32 except that

LPC次数決定器35は、代表フレーム数204のかわりに選
択したフレーム選択器の扱うＫパラメータの次数をLPC
次数206として出力することを除き代表フレーム数決定
器８と同じ動作をする。The LPC order determiner 35 converts the order of the K parameter handled by the selected frame selector in place of the representative frame number 204 into an LPC order.
The operation is the same as that of the representative frame number determiner 8 except that it is output as the degree 206.

多重化回路36,分離回路41,複号化器42,パラメータ補
間器43,バッファメモリ44の多重化回路9,分離回路11,複
号化器12,パラメータ補間器13,バッファメモリ14との相
異は、これらが扱うパラメータのフォーマットの相異に
基づく変更によるものである。LPC合成フィルタ19が８
次の全極型フィルタとして構成されるのに対し、LPC合
成フィルタ45は12次の全極型フィルタとして構成され
る。Multiplexing circuit 36, demultiplexing circuit 41, demultiplexer 42, parameter interpolator 43, phase of buffer memory 44 with multiplexing circuit 9, demultiplexing circuit 11, demultiplexer 12, parameter interpolator 13, buffer memory 14 The differences are due to changes based on differences in the format of the parameters handled by these. LPC synthesis filter 19 is 8
The LPC synthesis filter 45 is configured as a twelfth-order all-pole filter, while being configured as the next all-pole filter.

第１図及び第２図に示す実施例との以上説明した相異
からわかるように、第３図及び第４図に示す実施例はLP
Cの次数と共に区分内の代表分析フレーム数を４個〜６
個の範囲で適応制御している。As can be seen from the differences described above with respect to the embodiment shown in FIGS. 1 and 2, the embodiment shown in FIGS.
The number of representative analysis frames in the section along with the order of C is 4 to 6
Adaptive control is performed in a range of

なお、第３図及び第４図に示す実施例はLPCとしてＫ
パラメータを用いているが、その他のLPCでも、12次のL
PCを８次,10次のLPCに変換する変換器をフレーム選択器
32,33の入力端に設ければ、利用可能である。The embodiment shown in FIG. 3 and FIG.
Parameters are used, but other LPCs also use the 12th order L
A frame selector that converts a PC into an 8th or 10th order LPC
It can be used if provided at the input terminals of 32 and 33.

以上、第３図及び第４図に示す実施例について説明し
た。The embodiment shown in FIGS. 3 and 4 has been described above.

以上説明した２つの実施例は区分内の代表分析フレー
ム数と量子化数又はLPC次数のいずれか一方との最適化
を行っているが、代表分析フレーム数，量子化数及びLP
C次数の全てを考慮して最適化することも可能である。Although the two embodiments described above optimize the number of representative analysis frames in a section and either the number of quantizations or the LPC order, the number of representative analysis frames, the number of quantizations, and the LP
It is also possible to perform the optimization in consideration of all the C orders.

〔The invention's effect〕

以上説明したように本発明の可変長フレーム型ボコー
ダは、区分内の代表分析フレーム数を適応制御すること
により、総歪の見地から最適化を計ることができ、総歪
を減少できる効果がある。As described above, the variable-length frame vocoder of the present invention can optimize from the viewpoint of total distortion by adaptively controlling the number of representative analysis frames in a section, and has the effect of reducing total distortion. .

[Brief description of the drawings]

第１図は本発明の第１の実施例における分析側を示すブ
ロック図、第２図は同じく合成側を示すブロック図、第
３図は本発明の第２の実施例における分析側を示すブロ
ック図、第４図は同じく合成側を示すブロック図、第５
図はフレームの構成を示す図である。１……A/D変換器、2,31……LPC分析器、３……音源分析
器、４……量子化器、５〜7,32〜34……フレーム分析
器、８……代表フレーム数決定器、9,36……多重化回
路、11,41……分離回路、12,42……複号化器、13,43…
…パラメータ補間器、14,44……バッファメモリ、15…
…複号化器、16……パルス発生器、17……雑音発生器、
18……切替器、19,45……LPC合成フィルタ、20……D/A
変換器、35……LPC次数決定器、51,61……量子化器、5
2,62……複号化器、53,63……歪算出器、54〜57,64〜67
……バッファメモリ、58,68……区分的最適近似器。FIG. 1 is a block diagram showing an analysis side in the first embodiment of the present invention, FIG. 2 is a block diagram showing the synthesis side similarly, and FIG. 3 is a block diagram showing an analysis side in the second embodiment of the present invention. FIG. 4 is a block diagram showing the combining side, and FIG.
The figure shows the structure of the frame. 1 ... A / D converter, 2,31 ... LPC analyzer, 3 ... Sound source analyzer, 4 ... Quantizer, 5-7,32-34 ... Frame analyzer, 8 ... Representative frame Number determiner, 9,36 …… Multiplexer, 11,41 …… Separator, 12,42 …… Decoder, 13,43…
... parameter interpolator, 14,44 ... buffer memory, 15 ...
... Decoder, 16 ... Pulse generator, 17 ... Noise generator,
18… Switch, 19,45… LPC synthesis filter, 20… D / A
Transformer, 35 …… LPC order determiner, 51,61 …… Quantizer, 5
2,62 …… Decoder, 53,63 …… Strain calculator, 54-57,64-67
…… Buffer memory, 58,68 …… Piecewise optimal approximator.

Claims

(57) [Claims]

1. A variable length frame type vocoder using piecewise optimal function approximation, an analyzer for analyzing an input speech signal at predetermined intervals to extract a characteristic parameter, and receiving an output of the analyzer for quantization. A plurality of frame selectors that perform a piecewise optimal function approximation so that the total distortion that is the sum of the distortion and the frame substitution distortion is minimized, wherein the number of representative analysis frames in the division is different from each other. A variable length frame type vocoder comprising: a frame selector; and a representative frame number selector which receives outputs of the plurality of frame selectors and selects an output of the frame selector having the smallest total distortion.

2. The variable-length frame vocoder according to claim 1, wherein said plurality of frame selectors perform quantization with different numbers of quantization bits according to the number of representative analysis frames.

3. A variable-length frame-type vocoder using piecewise optimal function approximation, wherein an analyzer extracts an characteristic parameter by LPC-analyzing an input speech signal at a predetermined analysis order every predetermined period. A plurality of frame selectors that receive the output and perform a piecewise optimal function approximation so that the total distortion, which is the sum of the quantization distortion and the frame substitution distortion, is minimized. The plurality of frame selectors that perform the piecewise optimal function approximation with different representative analysis frames according to the analysis order to be considered while differentiating the analysis orders, and receiving the outputs of the plurality of frame selectors, A variable-length frame type vocoder comprising: an LPC order selector for selecting an output of a frame selector having the smallest distortion.