JPH02282300A

JPH02282300A - Variable length frame type vocoder

Info

Publication number: JPH02282300A
Application number: JP2015191A
Authority: JP
Inventors: Satoru Taguchi; 哲田口
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1989-01-25
Filing date: 1990-01-24
Publication date: 1990-11-19
Anticipated expiration: 2016-05-14
Also published as: JP3164808B2

Abstract

PURPOSE:To execute optimization with total distortions and to decrease the total distortions by subjection the representative analysis frame number in a segment to an adaptive control. CONSTITUTION:A frame selector 5 segments the analysis frame from a linear prediction coefft. LPC analyzer 2 and executes the segmental optimum function approximation so as to minimize the total quantity of quantization distortions and frame substitutive distortions. Quantization parameters 102, repeat bits 103 and the total distortions 101 are transmitted with the respective representative analysis frames. A representative frame number determining device 8 compares the total distortions from selectors 5 to 7 and transmits the parameter 202, pit 203 and representative frame number 204 of the min. total distortions. A sound source analyzer 3 executes the window processing of quantization voice signals with the rectangular functions and extracts the data on voiced/ voiceless discrimination and pitch period for each of the fames. The data are passed through a quantizer 4 and is thereby made into sound source data 205. These sets of the data are multiplexed in prescribed format by a multiplexing circuit 9.

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は可変長フレーム型ボコーダに関する。[Detailed description of the invention] [Industrial application field] The present invention relates to a variable length frame type vocoder.

[Conventional technology]

入力音声信号を一定長の分析フレームごとに分析して得
たスペクトル包絡パラメータを分析フレームに対応して
すべて送出する代りに、連続するＭ（例えば２０）個の
分析フレームから成る区選択し、各代表分析フレームの
スペクトル包絡パラメータと共にこれら代表分析フレー
ムによって代表される分析フレームの数、いわゆるリピ
ートビット　（ｒｅｐｅａｔ　ｂｉｔ）を送出すること
によって、必要とする伝送情報量を低減する可変長フレ
ーム型ボコーダが知られている。Instead of transmitting all the spectral envelope parameters obtained by analyzing the input audio signal for each analysis frame of a certain length, each of the spectral envelope parameters is selected from M (for example, 20) consecutive analysis frames, and each A variable-length frame type vocoder is known that reduces the amount of transmission information required by transmitting the number of analysis frames represented by these representative analysis frames, so-called repeat bits, along with the spectral envelope parameters of the representative analysis frames. It is being

区分ごとに選択される代表分析フレームは区分ごとの全
分析フレームによって示されるスペクトル包絡パラメー
タの分布を最適近似関数で近似せしめることによって得
られ、この最適近似関数は矩形、台形もしくは直線等の
近似関数がそれぞれの適用目的によって使いわけられ、
かつ、通常は動的計画法（Ｄｙｎａｍｉｃ　Ｐｒｏｇｒ
ａｍｍｉｎｇ　：　Ｄ　Ｐ）を介して関数設定がなされ
ている。このＤＰ千手法介した区分的最適関数近似の一
例が特開昭６２９９８号公報に詳細に記述されている。The representative analysis frame selected for each section is obtained by approximating the distribution of spectral envelope parameters shown by all analysis frames for each section with an optimal approximation function, and this optimal approximation function is an approximation function such as a rectangle, trapezoid, or straight line. are used depending on the purpose of application,
In addition, it is usually a dynamic programming method.
Function settings are made via amming: DP). An example of piecewise optimal function approximation using this DP method is described in detail in Japanese Patent Application Laid-Open No. 62998.

[Problem to be solved by the invention]

可変長フレーム型ボコーダにおける歪は、分析次数や量
子化数が有限であることに起因して代表分析フレームそ
のものの表現に関して発生する歪と各分析フレームを代
表分析フレームで代替することにより発生する代替歪と
がある。音声の性質は時変であり、特に、スペクトル包
絡の性質の時変性により、分析次数、量子化数２代表分
析フレーム数等が一定であれば、総量におけるこれら各
歪の割合は時々刻々変化する。Distortion in a variable length frame type vocoder consists of distortion that occurs in the representation of the representative analysis frame itself due to the finite analysis order and quantization number, and distortion that occurs when each analysis frame is replaced with a representative analysis frame. There is distortion. The nature of speech is time-varying, and in particular, due to the time-varying nature of the spectral envelope, if the analysis order, quantization number 2, number of representative analysis frames, etc. are constant, the proportion of each of these distortions in the total amount changes from moment to moment. .

ところが、上述した従来の可変長フレーム型ボコーダは
、区分内の代表分析フＩ／−人数を固定としているので
、はとんどの区分では総量の見地からの最適化が望めな
い欠点がある。However, in the conventional variable length frame type vocoder described above, since the number of representative analysis filters in a section is fixed, there is a drawback that optimization from the viewpoint of the total amount cannot be expected in most sections.

本発明の目的は、総量の見地から、従来におけるより最
適化を計ることができる可変長フレーム型ボコーダを提
供することにある。SUMMARY OF THE INVENTION An object of the present invention is to provide a variable length frame type vocoder that can achieve better optimization than conventional vocoders from the standpoint of total amount.

[Means to solve the problem]

本発明の可変長フレーム型ボコーダは、区分的最適関数
近似を用いた可変長フレーム型ボコーグにおいて、区分
内の代表分析フレーム数をあらかじめ定めた範囲内で最
適化する手段を備えている。The variable length frame type vocoder of the present invention is equipped with a means for optimizing the number of representative analysis frames within a section within a predetermined range in variable length frame type vocoding using piecewise optimal function approximation.

〔Example〕

次に、本発明について図面を参照して説明する。 Next, the present invention will be explained with reference to the drawings.

第１図は本発明の第１の実施例における分析側を示すブ
ロック図、第２図は同じく合成側を示すブロック図であ
る。FIG. 1 is a block diagram showing the analysis side in the first embodiment of the present invention, and FIG. 2 is a block diagram showing the synthesis side.

第１図に示す分析側は、Ａ／Ｄ変換器１と、ＬＰＣ分析
器２と、音源分析器３と、量子化器４と、フレーム選択
器５〜７と、代表フレーム数決定器８と、多重化回路９
とを具備して構成されている。The analysis side shown in FIG. 1 includes an A/D converter 1, an LPC analyzer 2, a sound source analyzer 3, a quantizer 4, frame selectors 5-7, and a representative frame number determiner 8. , multiplexing circuit 9
It is configured with the following.

フレーム選択器５は、量子化器５１と、復号化器５２と
、歪算出器５３と、バッファメモリ５４〜５７と、区分
的最適近似器５８とを備えて構成されている。フレーム
選択器６及び７も、各構成要素が行う処理の数値的条件
が必ずしも等しくない点を除いては、フレーム選択器５
と同じに構成されている。The frame selector 5 includes a quantizer 51, a decoder 52, a distortion calculator 53, buffer memories 54 to 57, and a piecewise optimal approximator 58. The frame selectors 6 and 7 are also similar to each other, except that the numerical conditions of the processing performed by each component are not necessarily the same.
is configured the same as.

遮断周波数３．４ｋＨｚの低域フィルタで帯域制限され
ている入力音声信号は、Ａ／Ｄ変換器１において、標本
化周波数８　ｋ　Ｈｚで標本化され、所定のビット数に
量子化される。An input audio signal whose band is limited by a low-pass filter with a cutoff frequency of 3.4 kHz is sampled at a sampling frequency of 8 kHz in the A/D converter 1 and quantized into a predetermined number of bits.

Ｔ、ＰＣ分析器２ばＡ、／Ｄ変換器１から入力した量子
化音声信号の窓処理及び分析を行う。量子化音声信号の
３０ｍ５分ずつが内蔵メモリにストアされつつ１０ｍ５
の周期で読呂されハミング関数による窓処理が行われて
］、Ｏｍｓの分析フレームとして出力される。分析フレ
ームごとの量子化音声信号は線形予測分析され、本実施
例では線形予測係数ぐＬｉｎｅａｒ　Ｐｒｅｃｌｉｃｔ
ｉｏｎ　Ｃｏｅｆｆｉｃｉｅｎｔ；ＬＰＣ）の１つであ
る８次のにパラメータが抽出される。T, PC analyzer 2 performs window processing and analysis of the quantized audio signal input from A, /D converter 1. Each 30m5 minute of quantized audio signal is stored in the built-in memory while 10m5
window processing using a Hamming function] and output as an Oms analysis frame. The quantized audio signal for each analysis frame is subjected to linear predictive analysis, and in this example, the linear predictive coefficient is
ion coefficient (LPC)) is extracted.

フレーム選択器５は、分析フ！・・−ムの連続する２０
個分、すなわち２００ｍ５を１区分とし、区分内の代表
分析フレーム数を４と１７、量子化歪とフレーム代替歪
との総計が最小になるように区分的最適関数近似を行う
。The frame selector 5 selects an analysis frame!・・・-20 consecutive words
A piecewise optimal function approximation is performed so that the total number of quantization distortion and frame substitution distortion is minimized, with 200 m5 as one segment and the number of representative analysis frames in the segment being 4 and 17.

量子化器５１はＬＰＣ分析器２から分析フレムごとに入
力するにパラメータを、代表分析フレーム数が４である
場合にとり得る量子化数で、量子化し、量子化したにパ
ラメータの２０フレム分をバッファメモリ５４に書込む
。復号化器５２は量子化器５１から入力した月：子化さ
れたにパラメータな復号化し、復号したにパラメータの
２０フレーム分をバッファメモリ５５に書込む。歪算出
器５３は、ＬＰＧ分析器２から入力したにパラメータ及
び復号化器５２から入力したにパラメタを用いて分析フ
レームごとの量子化歪を算出し、算出した量子化歪の２
０フレ一ム分をバッフアメそり５６に書込む。ＬＰＧ分
析器２が出力した、量子化前の、Ｋパラメータも２０フ
レ一ム分がバッファメモリ５７に書込まれる。区分的最
適近似器５８は、バッファメモリ５４〜５７の記憶内容
を用い、量子化歪とフレーム代替歪との総量が最小にな
るようにＤＰ平手法介した区分的最適関数近似を行い、
４個の代表分析フレームを抽出し、抽出した各代表分析
フレームの量子化されたにパラメータを量子化パラメー
ター０２として出力し、各代表分析フ１／−ムが代表す
る分析フレーム数をリピー　トビッｌ−１０３として出
力し、又、各代表分析フレームの量子化歪及びフレーム
代替歪の総量を総量１０１として出力する。The quantizer 51 quantizes the parameters input for each analysis frame from the LPC analyzer 2 using a possible quantization number when the number of representative analysis frames is 4, and buffers the quantized parameters for 20 frames. Write to memory 54. The decoder 52 decodes the child parameters input from the quantizer 51 and writes 20 frames of the decoded parameters to the buffer memory 55. The distortion calculator 53 calculates quantization distortion for each analysis frame using the parameters input from the LPG analyzer 2 and the parameters input from the decoder 52, and calculates 2 of the calculated quantization distortion.
Write one frame of 0 to the buffer memory 56. The K parameters output by the LPG analyzer 2 and before quantization are also written into the buffer memory 57 for 20 frames. The piecewise optimal approximator 58 uses the stored contents of the buffer memories 54 to 57 to perform piecewise optimal function approximation via the DP square method so that the total amount of quantization distortion and frame replacement distortion is minimized.
Extract four representative analysis frames, output the quantized parameters of each extracted representative analysis frame as quantization parameter 02, and repeat the number of analysis frames represented by each representative analysis frame. -103, and the total amount of quantization distortion and frame replacement distortion of each representative analysis frame is output as a total amount 101.

なお、区分的最適近似器５８は先に述べた特開昭・６：
２−９９８号公報に記載されているフレーム選択器１５
の（量子化をパターンマツチングによって行う場合の）
処理アルゴリズムを一般化した処理アルゴリズムで動作
すればよいので、同公報の記述から区分的最適近似器５
８の具体的実現は容易である。Note that the piecewise optimal approximator 58 is based on the above-mentioned Japanese Patent Application Laid-Open No. 1996-6:
Frame selector 15 described in Publication No. 2-998
(when quantization is performed by pattern matching)
Since it is sufficient to operate with a processing algorithm that is a generalized processing algorithm, the piecewise optimal approximator 5 is
8 is easy to concretely realize.

フレーム選択器６は、代表分析フレーム数を５とし、そ
れにあわせてにパラメータの量子化数を低減することを
除いてはフレーム選択器５と同じ動作をする。フレーム
選択器７も、代表分析フレーム数を６とし、それにあわ
せて量子化数を更に低減することを除いてはフレーム分
析器５と同じ動作をする。The frame selector 6 operates in the same way as the frame selector 5 except that the number of representative analysis frames is 5 and the number of quantization parameters is reduced accordingly. The frame selector 7 also operates in the same way as the frame analyzer 5, except that the number of representative analysis frames is six, and the number of quantizations is further reduced accordingly.

代表フｌ／−ム数決定器８は、フレーム選択器５〜７か
ら入力した総量を比較し、総量が最小であるフレーム選
択器から入力した量子化パラメタ及びリピートビットを
選択し、量子化パラメータ２０２及びリピートビット２
０３として出力する。又、選択したフレーム選択器の代
表分析フレーム数を代表フレーム数２０４として出力す
る。The representative frame number determiner 8 compares the total amount input from the frame selectors 5 to 7, selects the quantization parameter and repeat bit input from the frame selector with the minimum total amount, and selects the quantization parameter and repeat bit input from the frame selector with the minimum total amount. 202 and repeat bit 2
Output as 03. Further, the number of representative analysis frames of the selected frame selector is output as the number of representative frames 204.

音源分析器３は、Ａ、　／　Ｄ変換器１から入力した量
子化音声信号を１０ｍ５の分析フレーム周期で矩形関数
により窓処理し、公知の手法により分析フレームごとに
有声／無声の区別及びピッチ周期のデータを抽出する。The sound source analyzer 3 performs window processing on the quantized audio signal input from the A/D converter 1 using a rectangular function at an analysis frame period of 10 m5, and distinguishes between voiced/unvoiced and pitch period for each analysis frame using a known method. Extract the data.

これらデータは量子化器４で量子化され、音源データ２
０５となる。These data are quantized by a quantizer 4, and the sound source data 2
It becomes 05.

多重化回路９は、入力した量子化パラメータ２０２、リ
ピートビット２０３１代表フレーム数２０４及び音源デ
ータ２０５を代表フレーム数２０４により定まる所定の
フォーマットで多重化して合成側に送出する。The multiplexing circuit 9 multiplexes the input quantization parameter 202, repeat bits 2031 representative frame number 204, and sound source data 205 in a predetermined format determined by the representative frame number 204, and sends the multiplexed data to the synthesis side.

以上が第１図に示す分析側の動作の全体的説明である。The above is an overall explanation of the operation on the analysis side shown in FIG.

次に第１図に示す分析側の主要な構成部分について、本
発明を１２００ＢＰＳボコーダに適用した場合の動作を
詳細に説明する。Next, the operation of the main components on the analysis side shown in FIG. 1 when the present invention is applied to a 1200 BPS vocoder will be explained in detail.

第５図（Ａ）　、　（Ｂ）　、　（Ｃ）はＭ二２０で構
成される区分内の代表分析フレーム数Ｎとして各々４゜
５．６が選択された場合に於ける多重化回路９の１区分
分の出力データのフレーム構成を示す。第５図（ａ）に
示す通’ｊＮ＝４の場合、第１〜第４代表フレームのに
パラメータには各々３５ｂｉｔｓが割当てられる。又、
各代表フレームのにパラメータ１（＋、に２＋・・・、
ｌ（８には各々８，６，５，４゜３．３，３，３ｂｉｔ
ｓが割当てられる。ＬＰＧ分析器２で１０ｍ５ＥＣ：毎
に分析された２０組のにパラメータベクトル［（ｋ甲：
　ｉ＝１．　　・・、２０）、ｊｌ、・・・、８〕は各
々所望のｂｉｔ数に量子化され、量子化にパラメータベ
クトル［（ｋ甲；１１、・・・、２０）、ｊ＝Ｌ・・・
、８〕としてバッファメモリ５４と復号化器５２とへ送
出される。バッファメモリ５４は量子化にパラメータベ
クトルを記憶する。復号化器５２は量子化にパラメータ
ベクトルを復号化し、これを復号化にパラメータベクト
ルＣ（ｋ甲；ｉ二１．・・・、’８）、ｊ＝］、。FIGS. 5(A), (B), and (C) show the results of the multiplexing circuit 9 when 4°5.6 is selected as the number N of representative analysis frames in each section consisting of M220. The frame structure of one segment of output data is shown. In the case of jN=4 shown in FIG. 5(a), 35 bits are allocated to each parameter of the first to fourth representative frames. or,
Parameter 1 (+, 2+...,
l (8 has 8, 6, 5, 4 degrees 3.3, 3, 3 bits respectively)
s is assigned. 20 sets of parameter vectors [(kA:
i=1. ..., 20), jl, ..., 8] are each quantized to a desired number of bits, and a parameter vector [(kA; 11, ..., 20), j=L...
, 8] to the buffer memory 54 and decoder 52. A buffer memory 54 stores the parameter vector for quantization. The decoder 52 decodes the quantized parameter vector, and decodes the parameter vector C(kA;i21...,'8),j=],.

２０〕としてバッファメモリ５５と歪算出器５３とへ出
力する。バッファメモリ５５は復号化にパラメータベク
トルを記憶する。歪算出器５３は下記（１）式により２
０ケのフレームの量子化歪δ１°９（ｊ−１・・　２０
）を算出する。20] to the buffer memory 55 and distortion calculator 53. Buffer memory 55 stores parameter vectors for decoding. The distortion calculator 53 calculates 2 by the following equation (1).
Quantization distortion of 0 frames δ1°9(j-1...20
) is calculated.

δ３°ゝ＝Σｗ、（ｌｃＣｉゝ−ｋ　甲）２（ｊ　＝　
１．　、　＝−、２０）　　（１）ここでｗｔ（ｉ＝４
．・・８）はにパラメータのスペクトル感度である。算
出された２０ケの量子化歪６３°ゝ、（ｊ−１，，２０
）はバッファメモリ５６へ書込まれる。δ3°ゝ=Σw, (lcCiゝ-k A)2(j =
1. , =-, 20) (1) Here wt(i=4
．． ...8) is the spectral sensitivity of the parameter. The calculated 20 quantization distortions are 63°ゝ, (j-1,,20
) is written to the buffer memory 56.

区分的最適近似器５８はバッファメモリ５７に書込まれ
ているにパラメータベクトル［（ｋ！’；ｉ−１，・・
・、８）、ｊ＝１．・・・、２ｏ〕とバッファメモリ５
５に書込まれている復号化にパラメータベクトル〔（ｋ
（１）；１−１．・・・、８Ｌ　　ｊ＝１゜２０〕とバ
ッファメモリ５６に書込まれている量子化歪δｏ＞、（
ｊ＝１．・・・、２ｏ）を用いて、特開昭６２−９９８
号公報に開示されたアルゴリズムを利用して代表フレー
ム数Ｎ＝４の場合に可能な最小の総量Ｄ　、５ｊ’、を
得る事のできる代表フレームと、これら代表フレームに
代表される区間を決定する。尚、本実施例に於いては各
代表フレームに代表される最大区間長を１６フレームに
限定している。この限定処理はリピートビットの不必要
な割当てを妨ぐために実施されるものであり、又、この
程度の限定は音質に殆んど影響を与えない。因に、最大
区間長を限定しない場合、取り得る最大区間長は１７で
ある。総歪り、：ｊ玉は出力ライン１０１を介して、各
代表フレームの量子化にパラメータはバッファメモリ５
４より読出され、出力ライン１０２を介して、各代表フ
レームに代表される区間長はリピートビットとして出力
ライン１０３を介して各々代表フレーム数決定器８へ出
力される。代表フレーム数決定器８はｍｉｎ　（Ｄ、、
！七：　、Ｃ＝４．５．６）となる。Ｄ４ｊ）を検索し
、この検索結果に基づき最適代表フレーム数を決定する
。尚り譜？’ｎｌ　Ｄｍｉ）。は各々フレーム選択器６
．フレーム選択器７で算出された総歪である。The piecewise optimal approximator 58 uses the parameter vector [(k!';i-1,...
・, 8), j=1. ..., 2o] and buffer memory 5
Parameter vector [(k
(1);1-1. ..., 8L j=1°20] and the quantization distortion δo>, (
j=1. ..., 2o), JP-A-62-998
Using the algorithm disclosed in the publication, when the number of representative frames N=4, the representative frames that can obtain the minimum possible total amount D, 5j', and the sections represented by these representative frames are determined. . Note that in this embodiment, the maximum section length represented by each representative frame is limited to 16 frames. This limiting process is performed to prevent unnecessary allocation of repeat bits, and this level of limiting has little effect on sound quality. Incidentally, if the maximum section length is not limited, the maximum possible section length is 17. The total distortion: j-ball is sent via the output line 101, and the parameters for quantization of each representative frame are sent to the buffer memory 5.
4, and the section length represented by each representative frame is read out via an output line 102 and output as a repeat bit to the representative frame number determiner 8 via an output line 103. The representative frame number determiner 8 calculates min (D, ,
! 7: , C=4.5.6). D4j) and determine the optimal number of representative frames based on the search results. Naori music? 'nl Dmi). are each frame selector 6
．． This is the total distortion calculated by the frame selector 7.

ｆｆ１＝４が最適の場合、フレーム選択器５で量子化及
び選択され決定された量子化にパラメータ、リピートビ
ットが多重化回路９へ出力される。多重化回路９はＡ＝
４の場合、第５図（Ａ）に示す構成のフレームを組立て
合成側へ出力する。音源パラメータの構成はρに無関係
に固定であり第５図（Ｃ）と同一である。熱論Ｍ−２０
は固定であり、リピートビットは任意の３ケを伝送すれ
ば他の１ケは伝送する必要がない。When ff1=4 is optimal, the frame selector 5 performs quantization and selects the determined quantization parameters and repeat bits, which are output to the multiplexing circuit 9. The multiplexing circuit 9 has A=
In the case of 4, a frame having the configuration shown in FIG. 5(A) is output to the assembly and synthesis side. The configuration of the sound source parameters is fixed regardless of ρ and is the same as that in FIG. 5(C). Netsuron M-20
is fixed, and if any three repeat bits are transmitted, there is no need to transmit the other one.

フレーム選択器６はρ＝５の場合の処理を司どるもので
あり最大区間長は９に設定されている。The frame selector 6 is responsible for processing when ρ=5, and the maximum section length is set to 9.

フレーム選択器７はｐ＝６の場合の処理を司どるもので
あり、最大区間長は９に設定されている。The frame selector 7 is responsible for processing when p=6, and the maximum section length is set to 9.

尚、第５図（Ａ）　、　（Ｂ）　、　（Ｃ）に於いて第
１ヒツト“′Ｆ′”はフレーム同期用のビット、第５図
（Ａ）。In FIGS. 5(A), 5(B), and 5(C), the first hit "'F'" is a bit for frame synchronization, FIG. 5(A).

（Ｂ）の第２．第３ピッドパ００“′、及び“′０１′
並びに第５図（Ｃ）の第２ピツ）”１””は代表フレー
ム数を指定するビットである。(B) 2nd. 3rd pit pad 00"' and "'01'
Also, the second bit "1" in FIG. 5(C) is a bit that specifies the number of representative frames.

次に合成側の動作の全体的説明を図を利用して行う。Next, an overall explanation of the operation on the synthesis side will be given using diagrams.

第２図に示す合成側は、分離回路１１と、復号化器１２
と、パラメータ補間器１３と、バッファメモリ１４と、
復号化器１５と、パルス発生器１６と、雑音発生器１７
と、切替器１８と、ＬＰＧ合成フィルタ１９と、Ｄ／Ａ
変換器２０とを備えて構成されている。The synthesis side shown in FIG. 2 includes a separation circuit 11 and a decoder 12.
, a parameter interpolator 13, a buffer memory 14,
Decoder 15, pulse generator 16, and noise generator 17
, the switch 18, the LPG synthesis filter 19, and the D/A
The converter 20 is configured to include a converter 20.

分離回路１１は多重化回路９から伝送されてきた多重化
信号を分離し、量子化パラメータ２０２ヲ復号化器１２
へ、リピートピッ）　２０３ヲパラメ一タ補間器１３へ
、代表フレーム数２０４を復号化器１２及びパラメータ
補間器１３へ、又、音源データ２０５を復号化器１５へ
出力する。The separation circuit 11 separates the multiplexed signal transmitted from the multiplexing circuit 9, and converts the quantization parameter 202 into the decoder 12.
203 to the parameter interpolator 13, the representative frame number 204 to the decoder 12 and the parameter interpolator 13, and the sound source data 205 to the decoder 15.

復号化器１２は、代表フレーム数２０４を参照して量子
化パラメータ２０２を復号し、復号したにパラメータを
αパラメータに変換してパラメータ補間器１３へ出力す
る。パラメータ補間器１３は、代表フレーム数２０４を
参照し、入力したαパラメータをリピートビット２０３
で指定された分析フレームの回数分繰返してバッファメ
モリ１４に書込む。The decoder 12 decodes the quantization parameter 202 with reference to the number of representative frames 204, converts the decoded parameter into an α parameter, and outputs the α parameter to the parameter interpolator 13. The parameter interpolator 13 refers to the representative frame number 204 and sets the input α parameter to the repeat bit 203.
It is written into the buffer memory 14 repeatedly for the number of analysis frames specified in .

復号化器１５は、音源データ２０５を復号し、ピッチ周
期のデータをパルス発生器１６へ出力し、又、有声／無
声の区別によって切替器１８を制御する。有声のときは
、入力したピッチ周期データで指定されたピッチ周期で
パルス発生器１６がパルス列を発生し、このパルス列は
切替器１８により選択出力される。無声のときは雑音発
生器１７の出力した白色雑音が切替器１８によって選択
出力される。The decoder 15 decodes the sound source data 205, outputs pitch period data to the pulse generator 16, and also controls the switch 18 based on the voiced/unvoiced distinction. When voiced, the pulse generator 16 generates a pulse train at the pitch period specified by the input pitch period data, and this pulse train is selectively output by the switch 18. When there is no voice, the white noise output from the noise generator 17 is selectively output by the switch 18.

ＬＰＧ合成フィルタ１９は、分析フレームごとにバッフ
ァメモリ１４から読出したαパラメータをフィルタ係数
として用い、切替器１８から入力する音源により駆動さ
れてディジタル音声信号を合成する。このディジタル音
声信号がＤ／Ａ変換器２０によりアナログ化されて出力
音声信号となる。The LPG synthesis filter 19 uses the α parameter read from the buffer memory 14 for each analysis frame as a filter coefficient, and is driven by the sound source input from the switch 18 to synthesize a digital audio signal. This digital audio signal is converted into an analog signal by the D/A converter 20 and becomes an output audio signal.

以上説明したように第１図及び第２図に示す実施例は区
分内の代表フレーム数を（Ｋパラメータの量子化数と共
に）４個〜６個の範囲内で、量子化歪とフレーム代替部
との総量が最小となるように、適応制御している。As explained above, in the embodiments shown in FIGS. 1 and 2, the number of representative frames in a section is within the range of 4 to 6 (together with the number of quantization of the K parameter), and the quantization distortion and frame replacement part are Adaptive control is performed so that the total amount of

なお、第１図及び第２図に示す実施例はＬＰＣとしてに
パラメータを用いているが、αパラメタ、線スペクトル
対（Ｌｉｎｅ　Ｓｐｅｃｔｒｕｍ　Ｐａ１ｒｓ；ＬＳＰ
）等、その他のＬＰＣを用いることもできる。Note that the embodiment shown in FIGS. 1 and 2 uses parameters as LPC, but α parameter, line spectrum pairs (Line Spectrum Pa1rs; LSP
), etc., can also be used.

以上、第１図及び第２図に示す実施例について説明した
。The embodiment shown in FIGS. 1 and 2 has been described above.

第３図は本発明の第２の実施例における分析側を示すブ
ロック図、第４図は同じく合成側を示すフロック図であ
る。FIG. 3 is a block diagram showing the analysis side in the second embodiment of the present invention, and FIG. 4 is a block diagram showing the synthesis side.

第３図に示す分析側は、第１図に示す分析側のＬＰＧ分
析器２．フレーム選択器５〜７１代表フレーム数決定器
８．多重化回路９をＬＰＣ分析器３１、フレーム選択器
３２〜３４　　ＬＰＣ次数決定器３５．多重化回路３６
で置換えて構成されている。フレーム選択器３２〜３４
は、各構成要素が行う処理の数値的条件が必ずしも等し
くない点を除いては、第１図に示す分析側におけるフレ
ーム選択器５と同じに構成されており、その各構成要素
には、フレーム選択器５の各構成要素に付けた参照番号
５１〜５８と対応するように、参照番号６１〜６８を付
けである。第４図に示す合成側は、第２図に示す合成側
の分離回路１１．復号化ＲＦＳ、　１２　、　、＜ラメ
ータ補間器１３．バッファメモリ１４、ＬＰＧ合成フィ
ルタ１９を分兜ＩＬ回路４１゜復号化器４２．パラメー
タ補間器４３．バッファメモ！Ｊ４４．．ＬＰＧ合成フ
ィルタ４５で置換えて構成される。The analysis side shown in FIG. 3 includes the LPG analyzer 2.2 on the analysis side shown in FIG. Frame selectors 5-71 Representative frame number determiner 8. The multiplexing circuit 9 includes an LPC analyzer 31, frame selectors 32 to 34, and an LPC order determiner 35. Multiplexing circuit 36
It is configured by replacing it with . Frame selectors 32-34
has the same structure as the frame selector 5 on the analysis side shown in FIG. 1, except that the numerical conditions of the processing performed by each component are not necessarily the same, and Reference numbers 61 to 68 are assigned to correspond to reference numbers 51 to 58 assigned to each component of the selector 5. On the synthesis side shown in FIG. 4, the separation circuit 11 on the synthesis side shown in FIG. Decoding RFS, 12, , <Rameter interpolator 13. A buffer memory 14, an LPG synthesis filter 19 are divided into an IL circuit 41 and a decoder 42. Parameter interpolator 43. Buffer memo! J44. ．． It is configured by replacing it with an LPG synthesis filter 45.

ＬＰＣ分析器３１の動作は、分析次数が１２次である点
を除いては、ＬＰＣ分析器２の動作と同じである。The operation of the LPC analyzer 31 is the same as that of the LPC analyzer 2 except that the order of analysis is the 12th order.

フレーム選択器３２は、ＬＰＣ分析器３１から入力した
１２次のにパラメータを８次のにパラメータとして扱い
、２００ｍ５の区分内の代表分析フレーム数を６とし、
量子化歪とフレーム代替子との総量が最小になるように
区分的最適関数近似を行う。The frame selector 32 treats the 12th-order parameters input from the LPC analyzer 31 as 8th-order parameters, and sets the number of representative analysis frames within the 200 m5 section to 6.
Piecewise optimal function approximation is performed so that the total amount of quantization distortion and frame substitute is minimized.

量子化器６１はＬＰＣ分析器３１から分析フレームごと
に入力する１２次のにパラメータのうち９次以降を無視
して８次のにパラメータとして扱い量子化を行う。歪算
出器６３は量子化されていない１２次のにパラメータを
基準として量子化器６１における量子化の歪を算出する
。区分的最適近似器６８は量子化されていない１２次の
にパラメータを（代表分析フレームの）８次の復号化に
パラメータで代替する場合に発生する歪をフレーム代替
子とする。以上述べた点を除き、フレーム選択器３２の
動作はフレーム選択器５の動作と同じである。The quantizer 61 ignores the 9th and subsequent 12th order parameters input from the LPC analyzer 31 for each analysis frame and treats them as 8th order parameters for quantization. The distortion calculator 63 calculates the quantization distortion in the quantizer 61 using the unquantized 12th order parameter as a reference. The piecewise optimal approximator 68 uses the distortion that occurs when replacing a non-quantized 12th-order parameter with a parameter for 8th-order decoding (of a representative analysis frame) as a frame substitute. Except for the points mentioned above, the operation of the frame selector 32 is the same as the operation of the frame selector 5.

フレーム分析器３３は、入力する１２次のにパラメータ
を１０次のにパラメータとして扱い、代表分析フレーム
数を５とすることを除いてはフレーム選択器３２と同じ
動作をする。フレーム選択器３４も、入力する１２次の
にパラメータをそのまま１２次として扱い、代表分析フ
レーム数を４とすることを除いてはフレーム選択器３２
と同じ動作をする。The frame analyzer 33 operates in the same way as the frame selector 32, except that it treats the input 12th-order parameters as 10th-order parameters and sets the number of representative analysis frames to 5. The frame selector 34 also treats the input 12-order parameters as 12-order as is, and sets the number of representative analysis frames to 4.
does the same thing.

ＬＰＧ次数決定器３５は、代表フレーム数２０４のかわ
りに選択したフレーム選択器の扱うにパラメータの次数
をＬＰＣ次数２０６として出力することを除き代表フレ
ーム数決定器８と同じ動作をする。The LPG order determiner 35 operates in the same manner as the representative frame number determiner 8 except that instead of the representative frame number 204, the order of the parameter handled by the selected frame selector is output as the LPC order 206.

多重化回路３６１分離回路４１．復号化器４２゜パラメ
ータ補間器４３．バッファメモリ４４の多重化回路９１
分離回路１１．複号化器１２．パラメータ補間器１３．
バッファメモリ１４との相異は、これらが扱うパラメー
タのフォーマットの相異に基づく変更によるものである
。ＬＰＣ合成フィルタ１９が８次の全極型フィルタとし
て構成されるのに対し、ＬＰＣ合成フィルタ４５は１２
次の全極型フィルタとして構成される。Multiplexing circuit 361 Separation circuit 41. Decoder 42° parameter interpolator 43. Multiplexing circuit 91 of buffer memory 44
Separation circuit 11. Decoder 12. Parameter interpolator 13.
The difference from the buffer memory 14 is due to changes based on differences in the formats of parameters handled by these. While the LPC synthesis filter 19 is configured as an 8th order all-pole filter, the LPC synthesis filter 45 is configured as an 8th order all-pole filter.
It is configured as the following all-pole filter.

第１図及び第２図に示す実施例との以上説明した相異か
られかるように、第３図及び第４図に示す実施例はＬＰ
Ｃの次数と共に区分内の代表分析フレーム数を４個〜６
個の範囲で適応制御している。As can be seen from the above-described differences from the embodiment shown in FIGS. 1 and 2, the embodiment shown in FIGS.
The number of representative analysis frames within the classification is 4 to 6 along with the degree of C.
It is adaptively controlled within the range of individuals.

なお、第３図及び第４図に示す実施例はＬＰＧ、！：’
してにパラメータを用いているが、その他のＬＰＧでも
、１２次のＬＰＣを８次、１０次のＬＰＣに変換する変
換器をフレーム選択器３２゜３３の入力端に設ければ、
利用可能である。Note that the embodiments shown in FIGS. 3 and 4 are LPG,! :'
However, for other LPGs, if a converter for converting 12th-order LPC to 8th-order and 10th-order LPC is installed at the input end of the frame selector 32 and 33,
Available.

以上、第３図及び第４図に示す実施例について説明した
。The embodiment shown in FIGS. 3 and 4 has been described above.

以上説明した２つの実施例は区分内の代表分析フレーム
数と量子化数又はＬＰＧ次数のいずれか一方との最適化
を行っているが、代表分析フレーム数、量子化数及びＬ
ＰＧ次数の全てを考慮して最適化することも可能である
。The two embodiments described above optimize the number of representative analysis frames within a section and either the quantization number or the LPG order.
It is also possible to perform optimization by considering all PG orders.

〔Effect of the invention〕

＝１８以上説明したように本発明の可変長フレーム型ボコーダ
は、区分内の代表分析フレーム数を適応制御することに
より、総歪の見地から最適化を計ることができ、総歪を
減少できる効果がある。=18 As explained above, the variable length frame type vocoder of the present invention can perform optimization from the viewpoint of total distortion by adaptively controlling the number of representative analysis frames within a section, and has the effect of reducing total distortion. There is.

[Brief explanation of drawings]

第１図は本発明の第１の実施例における分析側を示すブ
ロック図、第２図は同じく合成側を示すブロック図、第
３図は本発明の第２の実施例における分析側を示すブロ
ック図、第４図は同じく合成側を示すブロック図、第５
図はフレームの構成を示す図である。１・・・・Ａ／Ｄ変換器、２，３１・・・・ＬＰＣ分析
器、３・・・・・・音源分析器、４・・・・・量子化器
、５〜７．３２〜３４・・・・・・フレーム分析器、訃
・・・・・代表フレーム数決定器、９，３６・・・・・
・多重化回路、１１．４１・・・・・分離回路、１２．
４２・・・・・復号化器、１３．４３・・・・・・パラ
メータ補間器、１４，７１４・・・・・・バッファメモ
リ、１５・・・・・・復号化器、１６・・パルス発生器
、１７・・・・・・雑音発生器、１８・・・・・・切替
器、１９．４５・・・・ＬＰＣ合成フィルタ、２０・・
・・・・Ｄ／Ａ変換器、３５・・・・・ＬＰＣ次数決定
器、５１．６１・・・・・・量子化器、５２．６２・・
・・・・復号化器、５３．６３・・・・・歪算出器、５
４〜５７．６４〜６７・・・・バッファメモリ、５８．
６８・・　・区分的最適近似器。代理人　弁理士　　内　原　　　晋FIG. 1 is a block diagram showing the analysis side in the first embodiment of the present invention, FIG. 2 is a block diagram also showing the synthesis side, and FIG. 3 is a block diagram showing the analysis side in the second embodiment of the invention. Figure 4 is a block diagram also showing the synthesis side, and Figure 5 is a block diagram showing the synthesis side.
The figure shows the structure of a frame. 1... A/D converter, 2, 31... LPC analyzer, 3... Sound source analyzer, 4... Quantizer, 5~7.32~34 ...Frame analyzer, death...Representative frame number determiner, 9,36...
- Multiplexing circuit, 11.41... Separation circuit, 12.
42...Decoder, 13.43...Parameter interpolator, 14,714...Buffer memory, 15...Decoder, 16...Pulse Generator, 17...Noise generator, 18...Switcher, 19.45...LPC synthesis filter, 20...
...D/A converter, 35...LPC order determiner, 51.61...Quantizer, 52.62...
... Decoder, 53.63 ... Distortion calculator, 5
4-57. 64-67...buffer memory, 58.
68... Piecewise optimal approximator. Agent Patent Attorney Susumu Uchihara

Claims

[Claims]

A variable length frame type vocoder using piecewise optimal function approximation, characterized by comprising means for optimizing the number of representative analysis frames within a partition within a predetermined range.