JPH02500620A

JPH02500620A - coded communication system

Info

Publication number: JPH02500620A
Application number: JP50722288A
Authority: JP
Inventors: キシデイア、コスタス; ゴウビイアナキス，ニコラオス
Original assignee: ブリテツシユ・テレコミユニケイシヨンズ・パブリツク・リミテツド・カンパニー
Priority date: 1987-08-28
Filing date: 1988-08-26
Publication date: 1990-03-01
Also published as: WO1989002148A1; EP0341271A1

Abstract

In an LPC type coded communications system the excitation source is derived from previous filter outputs at the decoder; in one embodiment the speech output is used, in other embodiments, intermediate excitation outputs are used. To enable tracking the coder derives the filter parameters by using the same excitations, supplied by a local decoder, to synthesise locally the actual error produced at the decoder; the parameters are optimised iteratively by varying the delay of an FIR stage and deriving the actual error in a loop, and selecting the delay for minimum actual error. The IIR parameters may be calculated jointly with the other FIR parameters inside the loop, either for minimum prediction error or minimum actual error. The FIR stage may comprise several parallel FIRs, separately excited.

Description

【発明の詳細な説明】発明の名称コード化通信システムこの発明は、コード化された音声（スピーチ）信号を伝送するシステムに関する。この発明はまた、このような音声信号をコード化およびデコードする送受ｔ＝機に関する。[Detailed description of the invention] name of invention coded communication system The present invention relates to a system for transmitting coded audio (speech) signals. . The present invention also provides a transmitter/receiver t= Regarding machines.

コード化された音声信号を伝送する多くの従来システムでは、コード化部における入力音声サンプルからフィルターの特性を引出し、これをデコーダ一部へ送っているロブコーダ一部では、送られてきたフィルター特性を、デコードフィルターを構成するのに用いている。その後、このデコードフィルターは、適当な励起信号源により励起されて、入力音声信号に対する合成再生信号を生成する。In many conventional systems that transmit coded audio signals, the coding section extracts the characteristics of the filter from the input audio sample and sends it to part of the decoder. Some rob coders use the filter characteristics sent to them as decoding filters. It is used to configure the This decoding filter then uses the appropriate excitation It is excited by a signal source to generate a synthetic reproduction signal for the input audio signal.

線形予ｅｌ符号化（ＬＰＧ）では、音声信号の短周期なスペクトラルエンベローブをモデル化するのに、周期的に更新される有限個の係数を用いた全極型リカーシブフィルタ−が採用されている。これらの係数は、通常、「予測誤差」を最小化するように工夫された線形方程式の組を解くことによって、直接計算される。In linear pre-el coding (LPG), the short-period spectral envelope of the audio signal is An all-pole liquor model using a finite number of periodically updated coefficients to model A sibu filter is used. These coefficients typically minimize the “prediction error” It is computed directly by solving a set of linear equations devised to give

（なお上記予測誤差とは、入力された音声信号と予測された音声信号との差分の計測値をいう。）ある種のＬＰＧシステムでは、上記差分の信号から話し手のピッチに対応した長い周期性を取り除くために、コード化側に予測ステージを含ませている。この予測ステージはフィルターの一種と考えることができ、対応するフィルターパラメータもデコーダ一部に送られる。(The above prediction error is the difference between the input audio signal and the predicted audio signal. Refers to measured value. ) In some LPG systems, the speaker's pitch can be determined from the above difference signal. Including a prediction stage on the coding side to remove the long periodicity corresponding to It's set. This prediction stage can be thought of as a type of filter, corresponding to Filter parameters are also sent to part of the decoder.

前記予測誤差は、予測された音声信号にピッチの予測が含まれているか否かに関係なく、現実の誤差とは異なったものになる。というのも、この予測は励起モデルに基づくものであって、デコーダ一部における実際の励起に基づくものではないからである。The prediction error is related to whether or not the predicted speech signal includes a pitch prediction. Regardless, the error will be different from the actual error. This is because this prediction is based on the excitation model. based on the actual excitation in some part of the decoder. It is the body.

最近のＬＰＧシステムには、例えば多重励起（ＭＰ）、レギュラーパルス励起’ （ＲＰＥ）、およびコードブック励起（ＣＥ）といったタイプのＬＰＧシステムがある。デコーダ一部で用いられる励起は、コード化部において選択もしくは銹導される。このようなシステムでは、デコーダ一部が制御可能な励起信号発生器を含んでおり、その場合のコード化部はデコーダ一部へ制御信号を伝送するものでなければならない。Modern LPG systems include, for example, multiple pumping (MP), regular pulse pumping’ (RPE), and codebook excitation (CE) types of LPG systems. There is. The excitation used in part of the decoder is selected or be guided. In such systems, part of the decoder is a controllable excitation signal generator. In that case, the encoding section is the one that transmits the control signal to the decoder part. Must.

デコーダ一部は、それ故、２ステージフイルター（比較的遅延の短い全極フィルターおよび比較的遅延の長いフィルター）と励起発生器とによって構成できる。Some decoders are therefore equipped with two-stage filters (relatively short delay all-pole filters). (a filter with a relatively long delay) and an excitation generator.

励起制御信号は、それ自身がコード化部で生成される。The excitation control signal itself is generated in the coding section.

ＭＰ型ＬＰＧでは、これを「合成による解析」により達成している。すなわち、非修正の励起パルスシーケンスを合成音声で局部的に用い、実際の誤差信号を生成するために実際の入力信号から合成音声シーケンスを減算し、誤差信号に聴感上の重み付けを行い、その後、重み付けされた誤差信号を最適化閉ループ内の制御に用いて、（重み付けされた）実際の誤差を最小化する励起シーケンスを選択する。In MP type LPG, this is achieved through "analysis by synthesis." That is, The unmodified excitation pulse sequence is used locally in the synthesized speech to generate the actual error signal. Subtract the synthesized speech sequence from the actual input signal to create an audible error signal. Then, the weighted error signal is used as a constraint in an optimization closed loop. control to select the excitation sequence that minimizes the (weighted) actual error. do.

この発明は、音声のコード化法を提供する。このコード化法では、フィルターパラメータがコード化部の入力音声信号から周期的に取り出される。導出されたフィルターパラメータは、デコーダ一部のフィルターの応答を更新するために、デコーダ一部へ送られる。なお、デコーダ一部フィルターに入力される励起入力は、デコーダ一部において、デコーダ一部フィルターの音声出力から取り出される。The present invention provides a method for encoding speech. This encoding method uses filter parameters. parameters are periodically extracted from the input audio signal of the encoder. The derived f Filter parameters are used to update the response of some filters in the decoder. Sent to part of the coder. In addition, the excitation input to the decoder part filter is , in the decoder part, is extracted from the audio output of the decoder part filter. .

つまり、励起シーケンスに関連したデータの伝送をやめ、デコーダ一部においてそれ以前の出力音声から励起を単純に引き出すことで、ビットレートを下げることができる。好ましくは、フィルターパラメータのうちの少なくともいくつかは、予測誤差ではなくデコーダ一部で生成されることになる実際の誤差を減らすように取り出される。なお、合成フィルターはコード化部に設けられていてもよく、また実際の誤差は局部的に合成された音声を入力音声信号から減算することで取り出してもよい。This means that the data associated with the excitation sequence is no longer transmitted and some parts of the decoder You can reduce the bitrate by simply pulling the excitation from the previous output audio. I can do it. Preferably, at least some of the filter parameters are , to reduce the actual error that would be generated by the decoder part rather than the prediction error. The sea urchin is taken out. Note that the synthesis filter may be provided in the encoding section. , and the actual error can be determined by subtracting the locally synthesized audio from the input audio signal. You can take it out.

好ましくは、パラメータの生成は、実際の誤差を最小化してフィルターの全ゼロステージの遅延パラメータを反復して取り出すことにより達成される。また好ましくは、その他のパラメータのうち少なくともいくつかは前記ループ内で算出される。また好ましくは、残り全てのパラメータを遅延パラメータ反復ループ内で計算することにより、これらをまとめて最適化する。他の実施例では、これらの最適化法は、フィルターの複数並列フィードフォワードステージの励起として、以前にデコードされた音声でなく以前の励起を用いたシステムに適用することができる。本願発明の採用によって、大幅な信号対雑音比の改善や装置の小型化が効果的に行い得、これにより所定の信号対雑音比に対してコード化の遅れをより短くできる。Preferably, the generation of parameters minimizes the actual error and reduces the total zeros of the filter. This is achieved by iteratively retrieving the stage's delay parameters. I also like it Alternatively, at least some of the other parameters are calculated within said loop. It will be done. Also preferably, all remaining parameters are These are optimized together by calculation. In other embodiments, these The optimization method uses the excitation of multiple parallel feedforward stages of the filter as Can be applied to systems using previous excitation rather than previously decoded speech. can. By adopting the claimed invention, it is possible to significantly improve the signal-to-noise ratio and downsize the device. This reduces the coding delay for a given signal-to-noise ratio. It can be made shorter.

この発明の他の形態は特許請求の範囲おいて詳述されている。これよりこの発明は添付図面を参照し、実施例を用いて説明される。Other forms of the invention are set out in the claims. From now on, this invention will be described by way of example with reference to the accompanying drawings, in which:

図面の簡単な説明第１図はこの発明によるコーダーををする一般化した送信器の略図、第１ａ図はこの発明によるコーダーを有する送信器の一実施例を示し、第２図はこの発明によるデコーダーを有する一般化した受信器の略図、第２ａ図はこの発明によるデコーダーを有する受信器の一実施例を示し、第３図は、第１図又は第２図の単一励起合成フィルターの構成要素を示す略図、第４ａ図及び第４ｂ図は、この発明によるコーダーの合成フィルターのパラメータを最も効果的に活用する方法に対応する６つのアルゴリズムを提供し、第５図はこの発明の他の形態によるコーダーの複合の励起合成フィルターの構成要素を示す略図、第６図は複合入力合成フィルターのパラメータを鼓も効果的に活用する方法の例を示し、第７ａ図及び第７ｂ図は、この発明の実施例において発生する励起波の波形を示す図である。Brief description of the drawing FIG. 1 is a schematic diagram of a generalized transmitter with a coder according to the invention; FIG. 1a shows an embodiment of a transmitter with a coder according to the invention, FIG. 2 is a schematic diagram of a generalized receiver with a decoder according to the invention; FIG. 2a shows an embodiment of a receiver with a decoder according to the invention, FIG. 3 is a schematic diagram showing the components of the single excitation synthesis filter of FIG. 1 or 2; Figures 4a and 4b show the parameters of the synthesis filter of the coder according to the invention. We provide six algorithms that address how to use data most effectively. FIG. 5 shows the configuration of a composite excitation synthesis filter of a coder according to another embodiment of the present invention. A schematic diagram showing the elements, Figure 6 shows the parameters of the composite input synthesis filter effectively. Show examples of how to utilize Figures 7a and 7b show waveforms of excitation waves generated in an embodiment of the invention. This is a diagram.

以下の説明は、第１図の送信器を参照して、フレームにサンプルのＣＹ−：］　（］ｉ−０．１，２．．．．ｎ−１を与えるための実例を示している。各フレームから、フィルターの最適利用ステージ１は、次に示す゛合成分析°技術を用い、フィルターパラメータを引き出す。励起シーケンス源２は最を発生し・この最初の励起シーケンスは、Ｈ（ｚ）の応答を有する所定のインパルス応答フィルターＢ　（ｚ）と・無限インパルス応答（全ての極）フィルター１／Ａ（ｚ）に一致して、２つのステージの合成フィルター３を駆動する：に＝１折フレームのサイズである。The following description refers to the transmitter of FIG. (]i-0.1, 2...n-1 is shown. Each frame Based on the system, stage 1 of optimal use of filters uses the following ``synthetic analysis'' technology. , pull out the filter parameters. Excitation sequence source 2 generates the maximum The initial excitation sequence is a predetermined impulse response filter with a response of H(z). - B (z) and infinite impulse response (all poles) filter 1/A (z) Accordingly, the two-stage synthesis filter 3 is driven: to=1 This is the size of the folding frame.

誤差のより良い測定は、誤差スペクトラムにおけるフォーマット領域を強調しない方法で、誤差信号を周波数重み付けを摂関数で、重み付はフィルター５によって望ましくフィルターリングされる。この信号は反復閉ループにおいて、最適計算器６によって合成フィルター３のパラメータを最適にするために最小にされる。合成フィルター３は［ａｋコ、［ｂ　コ、ｄｌに対する値を取得する。これらの値は量子化に器７によって量子化され、コーダー８を通過する。コーダー８はデコードステーションへの伝送に関するパラメータをコード化する。また、コーダー８は受信ステーションでのデコーダーと機能的に等しいローカルデコーダー１０への伝送に関するパラメータもコード化する。A better measure of error should not emphasize the format region in the error spectrum. The error signal is frequency-weighted by a centrifugal function, and the weighting is performed by filter 5. filtered as desired. In an iterative closed loop, this signal is is minimized by calculator 6 to optimize the parameters of synthesis filter 3. . The synthesis filter 3 obtains values for [ak, [b], and dl. these The value of is quantized The signal is quantized by a unit 7 and passed through a coder 8. Coder 8 is a decoding station. code the parameters for transmission to the application. Also, the coder 8 for transmission to a local decoder 10 which is functionally equivalent to the decoder at the station. Related parameters are also coded.

ローカルデコーダー１０において、パラメータはローカルの出力フレームを生成するために、パラメータが最適化された同じ励起シーケンス［Ｘ　、］によって励起シーケンス発発生器器２から駆動される。In the local decoder 10, the parameters generate a local output frame. By the same excitation sequence [X,] the parameters were optimized to excitation sequence generator It is driven from the device 2.

この合成出力は、励起適応計算器１２によって必要に応じて受信され、処理される。この励起適応計算器１２はデコーダー合成フィルター１１の出力から新しい励起シーケンスｒ　ｘ　−ｓ　］を構築する。新しい励起シーケンスは、先にデコードされた音声（ＰＤＳ）信号の一部を形成する次の入力音声フレームの使用に対して励起シーケンス発生器２を通過させる。This combined output is received and processed as required by the excitation adaptation calculator 12. Ru. This excitation adaptation calculator 12 receives a new value from the output of the decoder synthesis filter 11. An excitation sequence rx-s] is constructed. The new excitation sequence is first Use of the next input audio frame forming part of the coded speech (PDS) signal is passed through the excitation sequence generator 2.

励起適応計算器１２およびローカルデコーダー１０は第１ａ図に示すように本発明のすべての実施例において必要ではない。この簡単な実施例において、入力音声［ｙ−］のフル −ムは受信され、最適化されたフィルターパラメータは励起シーケンス発生器２によって供給される励起シーケンスを用いることにより閉ループにおいて得られる。これが行われた場合、これらの最適化されたパラメータを用いることによて前述したようにローカルデコーダー合成フィルター１１よりはむしろ）合成フィルター３によって合成され、合成フレ発生器２に供給され、これを次の励起シーケンス［Ｘ　、〕として与えている。（入力音声の第１フレームがコード化されている場合、必要とされる初期の励起シーケンスも記憶している。）第２図において、遠方デコーダー２０内のデコーダーユニットはフィルターパラメータ［ａ　］、［ｂｋ〕、ｄｌを元に戻すためにコーダー８と反対の動作を行う。このフィルターパラメータは送信器において先に最適化された配置を生成するために、合成フィルター２１を通過する。励起シーケンス発生器２３は励起シーケンス［Ｘ　、］でフィルターを駆動する。この励起シーケンス［Ｘ　、］は送信器で用いられてい−するのと同じであり、音声の第１のフレームに対して、送信器で用いられているのと等しい初期の励起シーケンスである。The excitation adaptation calculator 12 and the local decoder 10 are connected to the main source as shown in FIG. 1a. It is not necessary in all embodiments of the present invention. In this simple example, the input sound full voice [y-] - the optimized filter parameters are received by the excitation sequence generator 2. obtained in a closed loop by using the excitation sequence supplied by Ru. If this is done, by using these optimized parameters (rather than the local decoder synthesis filter 11 as described above) It is synthesized by the router 3 and is supplied to the synthesized flare generator 2, which uses it as the next excitation sheet. It is given as kens [X,]. (The first frame of the input audio is encoded If so, it also remembers the required initial excitation sequence. ) In FIG. 2, the decoder unit in the far decoder 20 has a filter parameter. Perform the opposite operation to coder 8 to restore meters [a], [bk], and dl. cormorant. This filter parameter generates a pre-optimized placement at the transmitter. It passes through a synthesis filter 21 in order to The excitation sequence generator 23 generates an excitation sequence. The filter is driven by the sequence [X,]. This excitation sequence [X,] is The materials used in the transmitter is the same as the one used by the transmitter for the first frame of audio. is the initial excitation sequence equal to .

この出力もローカルデコーダー１０と同様な方法で、励起シーケンス発生器２３において新しい励起シーケンス［Ｘ　、］″−１を作るために、励起適応計算器２２に供給される。This output is also sent to the excitation sequence generator 23 in the same way as the local decoder 10. The new excitation sequence [X, ]″−1 is fed to the excitation adaptation calculator 22 in order to create .

第２ａ図に示すように、単一人力の実施例では、励起適応計算器２２は必要ない。また、励起シーケンス発生器２３は、上述したように最後にデコードしたフレーム（および初期動イルター及び同じ初期励起を用いて全く同じ合成を行なうので、この結果作られる更新された励起シーケンスもまた同じであり、励起シーケンスデータを伝送する必要はない。As shown in FIG. 2a, in a single-person implementation, the excitation adaptation calculator 22 is not required. . In addition, the excitation sequence generator 23 also outputs the last decoded frame as described above. If we perform exactly the same synthesis using , the resulting updated excitation sequence is also the same, and the excitation sequence There is no need to transmit any performance data.

誤って受信したフィルターパラメータは“不正”の励起シーケンスを生じ、この励起シーケンスは、次のパラメータ群を送信機側で最適化するためのシーケンスではないので、伝−力ルデコーダーを一致させることが望ましい。これを実現するには制御ビットを周期的に前記両デコーダーに送り、各励起シーケンス発生器２３に、あらかじめプログラムされた初期励起シーケンスをリスタートさせるよう指示すればよい。Incorrectly received filter parameters will result in an “incorrect” excitation sequence and this The excitation sequence is a sequence for optimizing the following parameter groups on the transmitter side. Therefore, it is desirable to match the transmission decoders. make this happen To do this, control bits are periodically sent to both decoders and each excitation sequence generator 23, restart the pre-programmed initial excitation sequence. Just give instructions.

上述した機能の殆どあるいは全部は、単一デジタルプロセッサで実現可能であり、余分の物理的回路を必要としない。Most or all of the functions described above can be achieved with a single digital processor. , does not require extra physical circuitry.

第１図及び第１ａ図の送信機のフィルター最適化ステージ１について詳細に述べる。ＬＰＣコーディング技術を用いた多くの従来システムでは、最適化プロセスは、予／ｌ１ｌｌ　（有意の）誤差の値を最小にすることにより行なわれる。この有意の誤差は音声信号の現在値と過去のサンプル値にもとすく予測値とを比較することにより得られる。We now describe in detail the filter optimization stage 1 of the transmitter of Figures 1 and 1a. Ru. In many conventional systems using LPC coding techniques, the optimization process is performed by minimizing the value of the pre/l1ll (significant) error. child The significant error is the comparison between the current value of the audio signal and the predicted value based on past sample values. It can be obtained by

但し、ｅ、はｉｔｈサンプルの予測誤差である。However, e is the prediction error of the ith sample.

この技術はこの発明を具現化する場合にも使用しようと思えばできるが、合成フィルター３の合成音声出力間の実加圧みすけフィルター５を通すことにより、得られる重みず１つのフィルター特性を最適化することが望ましい。Although this technique can be used to embody this invention, it is By passing the actual pressurized misuke filter 5 between the synthesized voice outputs of the filter 3, the obtained It is desirable to optimize the filter characteristics for each weight.

重みすけフィルター５はＭＰＬＰＧあるいはＲＰＥ−ＬＰＧと同様に定義されるので、その動作説明を省略する。The weighted filter 5 is defined similarly to MPLPG or RPE-LPG. Therefore, the explanation of its operation will be omitted.

“実際の誤差”とは認知的に重みすけされた実際の２１差を含むものとする。"Actual error" shall include the actual cognitively weighted difference.

に複雑な計算を伴う非線形の問題であり、この発明の好適実施例では、フィルター特性は、直通信号である最小予測誤差エネルギーｅ”ｉに対するパラメータを（平均最小二乗法を用いて）初めに計算することにより、問題を部分的に線形化することにより選択される。これらの値を用いて、残りのバより得られる。is a nonlinear problem that involves complex calculations, and in the preferred embodiment of this invention, the filter - Characteristics are the parameters for the minimum prediction error energy e”i, which is a direct signal. Partially linearize the problem by first computing (using mean least squares) Selected by Using these values, the remaining values are obtained.

他のパラメータの値はループ内あるいはループ外で下記の如くに計算することができる。The values of other parameters can be calculated inside or outside the loop as follows: can.

第３図において、フィルター最適化ステージ１の合成フィルター３は直列接続された２ステージで構成される。すなｎ−ｄｉわち、伝送特性Ｂ　（ｚ）ｚ　を有する無限インパルス応答フィルターから成る第１フィルター回路３１と、伝送特性１　／　Ａ　（ｚ）を有する無限インパルス応答フィルターから成る第２フィルター回路３２で構成される。従って、全体のフィルターインパルス応答Ｈ（Ｚ）は式Ａにおいて必要となるである。第１のフィルター回路３１は、ｑ１＋１係数回路［ｂ、］　（ｉ−０，１１，、、ｑ　）及びｑ　１＋ｄ　を遅延回路で構成されており、（ｄ　１＋ｎ−１）の長さの遅延を有し、その後にｑ１＋１ステージの非リカーシブフィルタ−が結合されている。実際には、少数の係数回路（ｑ　１−０．１ｏ「２）が用いられる。In Figure 3, the synthesis filter 3 of filter optimization stage 1 is connected in series. It consists of two stages. Suna n-di That is, it consists of an infinite impulse response filter with transmission characteristic B(z)z The first filter circuit 31 and an infinite impulse having a transmission characteristic of 1/A(z) The second filter circuit 32 includes a second filter circuit 32 consisting of a frequency response filter. Therefore, the whole The filter impulse response H(Z) of is required in equation A. first The filter circuit 31 is a q1+1 coefficient circuit [b,] (i-0,11,,,q ) and q1+d are constructed with delay circuits, and the length of (d1+n-1) is delay, followed by a q1+1 stage non-recursive filter. There is. In reality, a small number of coefficient circuits (q1-0.1o "2") are used.

使用される遅延回路（それゆえ、遅延長）の数は実際の動作で所定の最大数Ｎ− １（この値はフレームサイズｎより小さくでも大きくても良い）まで変わる。The number of delay circuits (and therefore delay lengths) used is limited to a predetermined maximum number N- in practical operation. 1 (this value may be smaller or larger than the frame size n).

それゆえ、この第１のフィルター回路３１は励起信号ＥＸ　、３を受取、［Ｘ　、］　シーケンスの最も最近のｑ　ｌ＋ｄ　１＋ｎサンプル値を含んでいる。Therefore, this first filter circuit 31 receives the excitation signal EX,3 and [X ,] contains the most recent ql+d1+n sample values of the sequence.

第２フィルター回路はフィルター係数　［ａｋコ　（ｋ−１，２１，、、ｐ）により定義される全極応答を有するリカーシブフィルタ−から成る。The second filter circuit has filter coefficients [ak (k-1, 21,,,p) consists of a recursive filter with an all-pole response defined by

以下の、記述では、ベクトル標示を用いる。ベクトルａは［ａｋ］係数群を示し、ベクトルｂは［ｂ、］係数群を示す。The following description uses vector markings. Vector a indicates the [ak] coefficient group , vector b indicates a group of coefficients [b,].

入力音用の各受信したフレームに対して、フィルター最適化計算器６は最初に、平均予測誤差エネルギーｅＴｅの［ａ、〕係数を計算する。For each received frame for the input sound, the filter optimization calculator 6 first: Calculate the [a,] coefficient of the average prediction error energy eTe.

好適実施例では、（第４図及び第５図の方法１参照）０とＮ−１の間の各ｄ１の値に対してループ内のｂベクトルと結合してａベクトルを計算し、上述したように最小のｅ　に対警最小にするａベクトルとｂベクトルは次式で得られる。In the preferred embodiment (see method 1 of FIGS. 4 and 5), each d1 between 0 and N-1 Compute the a vector by combining it with the b vector in the loop for the value, as described above. against the police to the minimum e The a vector and b vector to be minimized are obtained by the following equations.

但し、Ｘは　−（ｎ＋ｄ　十ｑ　）から　−（ｄｌ＋１）までの励起サンプルＸｋのｎｘ　（ｑｌ　＋１）マトリクスであり、Ｙはｙ　からｙ　までのｙｋサンプルの（ｎｘｐ）マトリ−ｐ　ｎ−２クスである。However, X is the excitation sample X from -(n+d 1q) to -(dl+1) nx (ql + 1) matrix of k, and Y is yk sun from y to y Pull's (nxp) matrix-p n-2 It's Kusu.

実際の誤差エネルギーｉＴ″ｉ！は次ぎの３つの方法のいずれかを用いて最小化される。The actual error energy iT″i! is minimized using one of the following three methods: be done.

実際の誤差エネルギー否Ｔ古はニーｙ−’９を用いてもめられる。それゆえ、）Ｉ　（Ｚ）は°合成解析°ループ内で決定される。このループ内ではｄｌはＮの値の範囲で変化する。The actual error energy is determined using y-'9. therefore,) I(Z) is determined within the synthesis analysis loop. In this loop, dl is N Varies within a range of values.

−に対して最適化されるのに対し、ｉと５は最小予１１ＦＩ誤差エネルギー１１１に対して選択される。これについては、スキーム１として、第４図で述べる。−, while i and 5 are optimized for the minimum predetermined 11 FI error energy 11 1. This will be described in FIG. 4 as Scheme 1.

第２実施例においては、Ｅを評価するためのアプローチは上述のスキーム１の変形である。この場合、ｉと６は予測誤差；Ｔ古を最小にするようなｄ　の値に対する方程式（Ｄ）により定義される。一度、ｄｌが最適化されると、ベクトルｂは（Ｆ）式を用いて実際の誤差を最小にするように（最適化ループの外で）再評価される。In a second embodiment, the approach for evaluating E is a variation of Scheme 1 above. It is the shape. In this case, i and 6 are prediction errors; for the value of d that minimizes T is defined by equation (D). Once dl is optimized, vector b is re-evaluated (outside the optimization loop) using equation (F) to minimize the actual error. valued.

ここで、ｍはその出力がゼロの時の、１／Ａ（ｚ）フィルターの出力を示している。そのメモリは従前の合成された音声フレームの最も最近のサンプルから構成される。Ｑは（ｎ　ｘ　ｎ）の渦状のマトリクスを意味し、ｑｋは合成検索ステップによる従前の分析から得られたｉとｄｌを用いた１／Ａ（ｚ）フィルターのインパルス応答のに番目の値である。これは、第４図に第２スキームとして示される。Here, m indicates the output of the 1/A(z) filter when its output is zero. Ru. Its memory consists of the most recent sample of the previous synthesized audio frame. be done. Q means a (n x n) spiral matrix, and qk is a synthetic search step. of the 1/A(z) filter using i and dl obtained from the previous analysis by is the second value of the impulse response. This is shown as the second scheme in Figure 4. It will be done.

第３の実施例において、ｄｌの各与えられた値に対し、方程式（Ｄ）を用いて、予ΔＩＩＪ誤差を最小にするように、ａ（及びｂ）が計算される。石の値は実際の誤差を最小にするためのループ内で、（Ｆ）を用いて、（再）計算される。In a third example, for each given value of dl, using equation (D): a (and b) are calculated to minimize the pre-ΔIIJ error. Stone value is actual is (re)calculated using (F) in a loop to minimize the error of .

従ッテ、各ｄ　［（０（ｄ、（Ｎ−１）に対して定義された合成ループによる分析内で、ｉが式（Ｄ）を用いて、最初古Ｔ古は（Ｇ）式から直接水める事もできる。Therefore, each d [(0(d,(N-1)) is divided by a composition loop defined for In the analysis, if i uses equation (D), the first ancient T ancient can also be calculated directly from equation (G). Ru.

実際の誤差エネルギーを最小にするｄ１値、対応するａ。The d1 value that minimizes the actual error energy, corresponding to a.

５値は最適となるようにそれから選択される。これは、第４図にスキーム３として示される。Five values are then selected to be optimal. This is shown in Figure 4 as Scheme 3. is shown.

上記のスキーム２と３は精度上の理由で、予測誤差エネルギーｉよりも実際の誤差エネルギーｉを最小にする事により５が（再）定義されるように選択される。For accuracy reasons, the above schemes 2 and 3 are based on the actual error rather than the prediction error energy i. 5 is chosen to be (re)defined by minimizing the difference energy i.

他の実施例において、第４．５図に方法ＩＩとして示されるように、ｉは、６がゼロに等しいとの仮定のちとに、６とは独立に、（Ｈ）式に従ってループの外で、予測エネルギーを最小にするように、はじめに計算される。In another embodiment, i is 6, as shown in Figure 4.5 as Method II. After the assumption that it is equal to zero, outside the loop according to equation (H), independently of 6, , is first calculated to minimize the predicted energy.

Ｅ　＝　（ＹＴＹ）″ｌ　ＹＴＹ　’　（Ｈ）次に、ｉが与えられ、６とｄ１パラメータが上述のスキーム１．２．３により、最適化できる。特に、最小予測エネルギーをｅＴｅを生成するｂの値は（Ｊ）式により与えられる。E = (YTY)″l YTY’ (H) Next, i is given, and 6 and d1 parameter parameter can be optimized according to scheme 1.2.3 above. In particular, the minimum predicted The value of b for generating energy eTe is given by equation (J).

６　＝　（ｘＴｘ）−１ｘ”　（ｙ　−ｙＭ）　（Ｊ）スキーム１．２．３への対応するアプローチは、それぞれスキーム４，５．６として、それぞれ￥Ｓ４図に示され、この発明の第４．５．６の実施例を構成する。6 = (xTx) - 1x" (y - yM) (J) to Scheme 1.2.3 The corresponding approaches are shown as Schemes 4 and 5.6, respectively, in Figure S4. , which constitutes the 4.5.6 embodiment of the present invention.

生成されたパラメータはそれから量子化器７により量子化される。この量子化器はパラメータ間に利用できるビットを割り付ける。The generated parameters are then quantized by a quantizer 7. This quantizer allocates available bits between parameters.

合成フィルターに対する励起は単一の励起シーケンス［Ｘ、］を用いて説明したが、数個の励起シーケンスを用いることが望ましい。従って、第１図において、励起シーケンス源２は複数の異なるシーケンス［Ｘ　、１．　［ｕ　、］　。The excitation for the synthesis filter was explained using a single excitation sequence [X,] However, it is desirable to use several excitation sequences. Therefore, in Figure 1, The excitation sequence source 2 generates a plurality of different sequences [X, 1. [u,].

［Ｖ　、］を合成フィルター３に供給するように適合させられ−する。[V, ] is adapted to supply the synthesis filter 3. Ru.

力される過去にデコードされた音声ｙ、でよく、他のシーケンスは、合成フィルター３の中間出力から励起適合計算器１２によって得られる。（あるいは、そのようなシーケンスの全てが以下に述べるようにして得てもよい。）第５図において、フィルター３は（当然、第１．２．２８図のフィルター１１．２１も）直列に配置された２つのステージから構成される。第１のフィルター素子５１はｊ個のフィルター５１ａ、５１ｂ、、、５１ｊから構成され、各フィルターは励起シーケンス［Ｘ、］。The previously decoded audio y, input to is obtained by the excitation adaptation calculator 12 from the intermediate output of the controller 3. (or that All such sequences may be obtained as described below. ) Figure 5 Smell Therefore, filter 3 (of course, filter 11.21 in Fig. 1.2.28) is connected in series. It consists of two stages located at The number of first filter elements 51 is j. It consists of filters 51a, 51b, . . . , 51j, each filter having an excitation series. - Kens [X,].

［ｕ　］、ＥＶ−１］等を受け、出力を生成する。図示され−するように、フィルター５１ａ、５１ｂ等は、それぞれ応答（１）　Ｚ−ｎ−ｄｌ　（２）　Ｚ−ｎ−ｄ２Ｂ　（ｚ）　、Ｂ　（ｚ）（３）Ｚ−ｎ−ｄ３、を有する。そして、それらのＢ　（２）組み合わされた出力は第２フイルター素子３２に供給される。[u　], EV-1], etc., and generates an output. Illustrated As shown in FIG. (2) Z-n-d2B (z), B (z) (3) It has Z-n-d3. And those B (2) The combined output is provided to a second filter element 32.

第２フイルター素子３２は、前述のように、応答１／Ａ　（ｚ）を有する反復フィルターである。The second filter element 32 is a repeating filter with a response 1/A(z), as described above. It is a filter.

図示されるように、各フィルター５１ａ、５１ｂ等はフィ単一の励起フィルターを用いる場合において、パラメータ間に対し、実際の誤差エネルギーを最小にする事により最適化される。この最適化は第１から第６の実施例において、説明されたそれらに基づいた方法を用いて、部分的に線形化することにより（すなわち、最小子Ａｌ１１エネルギーをめる事によりいくつかのパラメータを計算する事により）、計算上の複雑さを減少して行われる。As shown, each filter 51a, 51b, etc. is a single excitation filter. is used, the actual error energy is minimized between the parameters. Optimized by This optimization is explained in the first to sixth embodiments. By partially linearizing (i.e. , some parameters can be calculated by taking the minimum Al11 energy. ), with reduced computational complexity.

一般的には、フィルターパラメータは、フィルターが成長その精度を改善するように、順番に最適化される。In general, the filter parameters will allow the filter to grow and improve its accuracy. are optimized in order.

これは伝送スキームに役に立ち、もしビットの割合の減少か要求されるのであれば、より早い係数のみ（例えば（１）、ｄｌ）か伝送される必要がある。This is useful in transmission schemes, if a reduction in the proportion of bits is required. For example, only the earlier coefficients (eg (1), dl) need to be transmitted.

Ｓ　ｂ第６図に示すように、一実施例におけるフィルター最適化ステージ６は単一の励起ケースの場合の第１の実施例（方法ｌ）に基づく計算を採用するために用いられる。プロセスの第１ステージにおいて、ａ、ｂ　及びｄｌは零に見せかけた他のパラメータによって第４図のスキーム１．２、若しくは３に従う実際の誤差エネルギーを最小にすることによって見付けられる。S b As shown in FIG. 6, filter optimization stage 6 in one embodiment used to adopt the calculation based on the first embodiment (method l) for the It will be done. In the first stage of the process, a, b, and dl were made to look like zero, and The actual error error according to scheme 1.2 or 3 in Fig. 4 can be calculated depending on the parameters of It is found by minimizing the energy.

好ましい実施例においては、種々の方法で、それら自身でａ及びｂ　を再限定するために使用されることができ、以下のように説明される。In preferred embodiments, a and b may themselves be redefined in various ways. It can be used for

第１のアプローチはスキーム１（若しくはスキーム２）に基づく。もしｂ　が、ｉおよびｂ　と独立して計算されるときは最小の予測エネルギーｅ”ｅ解法は以下の式で与えられる。　。The first approach is based on Scheme 1 (or Scheme 2). If b is When computed independently of i and b, the minimum predicted energy e”e solution is: It is given by the formula below. .

ｂ　（２）−（ｕＴｕ）−’ｕＴ（ｙ−Ｙａ−Ｘｂ　（１））まで、励起サンプルｕｋのｎｘ　（ｑｌ＋１）マトリクスを示し、第１ステージの等式Ｆに相当する。b (2)-(uTu)-'uT(y-Ya-Xb(1)), excitation sample nx(ql+1) matrix of Luke, which corresponds to the equation F of the first stage. Ru.

−（２）は交互にｉと結合されたり、ｉ及びｂ　と結合されて計算され、この場合、第１ステージにおける等式（Ｄ）に相当する簡単なマトリクス表示が用いられる。−(2) is computed by being alternately combined with i or with i and b, and in this case In this case, a simple matrix representation corresponding to equation (D) in the first stage is used. It will be done.

ルギーに対して計算することができる。しかしながら、これるフィルターが選択される。can be calculated for Rugi. However, Koruru filter is selected be done.

フィルターパラメータのいくつかは、前記プロセスによって既に決定された特別な値ｄ□−ｄ１′及びｄ２−ｄ、、’に対して（ループを外れて）再評価される。特に、合成プロセスによる第２の解析が最小予測エネルギーに対するフィルタ、＜ラメータを評価するとき、フィルターパラメータのいくつか又はすべては、最小予測誤差エネルギーと最小の実際の誤差エネルギーのいずれかに対して、以下のように再限定さ計算される。Some of the filter parameters are special is reevaluated (out of the loop) for the values d□−d1′ and d2−d,,’ . In particular, the second analysis by the synthesis process is the filter for the minimum predicted energy. , < When evaluating the parameter, some or all of the filter parameters are For either the minimum predicted error energy or the minimum actual error energy, The re-limited calculation is as below.

交互に、スキーム３のアルゴリズムに基づくアプローチが採用される。例えば、（ｉ）　ｂ　はａ及びｂ　と独立して、ｄ２のＮ異なる値に対して実行される合成ループによっての解析範囲−Ｔ二は最小のエネルギーｅｅであるｄ　２−　ｄ　２　’の値に対して選択される。Alternately, an approach based on the algorithm of Scheme 3 is adopted. for example, (i) b is a combination performed independently of a and b for N different values of d2. Analysis range by forming loop-T2 is selected for the value of d2-d2' which is the minimum energy ee.

（ｉｉ）　ｂ　は最小予測誤差エネルギーｅＴｅに対続され、６（２）は、最小予測誤差に対して再評価される。(ii) b is concatenated with the minimum prediction error energy eTe, and 6(2) is the minimum Reevaluated for prediction errors.

（ｉｉｉ）　ｂ（２）は最小予測誤差に対してｉ及び−（１）と結合して計算される。新しいｉベクトルは存続され、一方、ｂ　及びｂ　は等式（Ｄ３）からの最小予測誤差に対して再評価される。(iii) b(2) is calculated by combining i and −(1) for the minimum prediction error. It will be done. The new i vector is preserved, while b and b are from equation (D3) Reevaluated for minimum prediction error.

最適化値ｄ１及びｄ２、即ちｄユ′及びｄ２′が与えられると、フィルターパラメータのいくつかは等式（Ｄ３）を用いて（合成ループによる解析によらない）単一のステ・ノブにおいて、再最適化が行なわれ、その結果、ｂ　及び−（２）を再限定する。Given the optimized values d1 and d2, i.e. du' and d2', the filter parameters Some of the meters are calculated using equation (D3) (not analyzed by the synthesis loop) On a single Ste-knob, re-optimization is performed, resulting in b and -(2) relimit.

−（３）及びｄ３を計算するため、及び先のフィルター特す性を再計算するための第３のステージに対する上述した計算の延長は、もし等式がより高い項を含むように適当に修正さ合計項数があるステージから次のステージに展開する最適化プロセスのように順次増加する。- (3) and to calculate d3 and specify the previous filter An extension of the calculations described above to a third stage for recalculating the equations from one stage to the next with a total number of terms modified appropriately to include higher terms. It increases sequentially like an optimization process that unfolds over multiple pages.

この発明の他の実施例においては、第４ｂ図のスキームの４．５又は６のアプローチは以下のように行なわれ、係数［ａｋコは、通常のＬＰＧ解法を用いて最小予測誤差エネルギーｉＴｉに対して最初に最適化される。In other embodiments of the invention, approaches 4.5 or 6 of the scheme of FIG. The search is performed as follows, where the coefficient [ak is the minimum value using the usual LPG solution method. It is first optimized for the prediction error energy iTi.

フィルタ係数の計算において用いられている方法は、ステージ１で用いられ得る６つのスキームのうちの１つ、或いはそれ以上のスキームに基づく広範なアルゴリズムから選択される。第２のステージの式を検討すると、ステージ１の式と形式的に類似していることが判る。フィルタ最適化計算器６はステージ２と、それに続くステージのためにステージ１と同じ方法を、必要に応じて拡張して用いる事ができる。繰り返しプログラミング技術を簡単にでき、ひいてはシステムの簡略化につながるからである。The method used in calculating the filter coefficients can be used in stage 1 A wide range of algorithms based on one or more of six schemes Selected from rhythm. Considering the second stage equation, the stage 1 equation and form It can be seen that the formulas are similar. The filter optimization calculator 6 includes stage 2 and its For subsequent stages, use the same method as for stage 1, extending as necessary. I can do things. Easily repeat programming techniques and, in turn, simplify systems. This is because it leads to simplification.

フィルタパラメータのいくつかを選択することは、最小予測誤差エネルギｅに対してそれらを計算することによって実行されるが、他の（例えば逆フイルタ動作［ｙｉ］によって）計算され、算出され得ることは理解されよう。Choosing some of the filter parameters affects the minimum prediction error energy e. but other (e.g. inverse filter operations) [yi]) and can be calculated.

上述のＨ（Ｚ）フィルタを適応することは、係数の組みの値に限定されるのみならず、フィルタの°構造°、すなわちに再定義されるということを実現することが重要である。Adapting the H(Z) filter described above is only limited to the values of the set of coefficients. However, the structure of the filter is redefined, i.e. is important.

励起適応計算器］２の動作を次に述べる。前述の適用から、合成フィルタ２１を駆動するために受信ステーションで用いられる励起は、そのフィルタが最適化されるものでなければならないということは明らかである。受信フィルタ２１が、送信器中のローカルデコーダフィルタ１１と最適化フィルタ３と同一である場合、励起は、送信されたパラメータが最適化されるためのそれと同じでなければならない。The operation of Excitation Adaptive Calculator] 2 will be described next. From the above application, the synthesis filter 21 is The excitation used at the receiving station to drive the It is clear that it must be possible. The reception filter 21 is When the local decoder filter 11 and optimization filter 3 in the transmitter are the same , the excitation must be the same as that for the transmitted parameters to be optimized. No.

予めプログラム化されている初期シーケンスでもって送信器２と受信器２３における励起シーケンス発生器を提供することにより、上記のように、これは構成される。シーケンスが有する正確な特性はクリティカルなものではなく、例えば零平均ガウシアン（Ｇａｕｓｓｉａｎ）ランダムシーケンスのようなものである。transmitter 2 and receiver 23 with a pre-programmed initial sequence. This can be configured as described above by providing an excitation sequence generator that It will be done. The exact properties the sequence has are not critical, e.g. It is like an average Gaussian random sequence.

この単一シーケンスは、第５図に示したような構成の複合入力合成フィルタ３の各入力を駆動するのに用いることができる。上述したような３つの応答、を待った最初の３個のフィルタ５１ａｓ　５１ｂｓ　５１ｃについて考える０ここで、ｎは分析フレームのフレームサイズ、パラメータｄ１、ｄ２、ｄ３の値は、シーケンス［Ｘ、］中の特定のセグメントを示し、これらはＨ（Ｚ）合成フィルタ３のＦＩＲ要索５１ａ、５１ｂ、５１ｃ中に各々に対する励起信号として供給される。This single sequence is applied to a composite input synthesis filter 3 configured as shown in FIG. It can be used to drive each input. Three responses as mentioned above, Considering the first three filters 51as, 51bs, and 51c that waited for Here, n is the frame size of the analysis frame, and the values of parameters d1, d2, and d3 are , denote certain segments in the sequence [X,], which are H(Z) synthetic fi Provided as an excitation signal for each of the FIR signals 51a, 51b, and 51c of the router 3. be provided.

この発明のＰＤＳの実施例において、付加信号演算回路］２は単に励起ｆＪ号格納部と音声出力部との間のリンクである。ＰＤＳ信号を励起シーケンスとして用いることは、与えられた伝送速度における信号対雑音比（ＳＮＲ）を上げる点、およびコード化の遅れを少なくする点のいずれか一方、もしくは双方の点からみて極めて有効である。In the embodiment of the PDS of this invention, the additional signal calculation circuit]2 is simply the excitation fJ number. This is a link between the storage section and the audio output section. Use PDS signal as excitation sequence This increases the signal-to-noise ratio (SNR) at a given transmission rate, and/or reduce coding delays. It is extremely effective.

以上述べたように、Ｘ　（Ｚ）の代わりに他の入力を用いることももちろん可能である。特に、Ｘ　（Ｚ）は最初のフィルタ要素３１または５１の出力から（即ち、全体のフィルタＨ（Ｚ）の中間出力として）取り出すことができる。この場合、このフィルタ要素は、その出力と入力との間にフィードバック回路があるので、無限応答フィルタとして考えることができる。ここで、励起信号はこの中間出力からのみ取り出される。また、Ａ　（Ｚ）フィルタへの入力は、例えば、ローズ（Ｒｏｓｅ）およびバーンウェル（Ｂａｒｎｗｅｌ　Ｉ）による、°自己励起ボコーダ：　４８００ボーを用いた長距離通話に対する他のアプローチ”、（ＩＣＡＳＳＰのＩＥＥＥ　ＰＲＯＣ，における１９８６年４月号の４５３〜４５６頁）に示された“自己励起ボコーダ′の中で生成されたデータと同等のものである。As mentioned above, it is of course possible to use other inputs instead of X (Z). It is. In particular, X (Z) is derived from the output of the first filter element 31 or 51 (immediately In other words, it can be taken out (as an intermediate output of the entire filter H(Z)). this place In this case, this filter element has a feedback circuit between its output and input. It can be thought of as an infinite response filter. Here, the excitation signal is Extracted only from output. Also, the input to the A(Z) filter is, for example, °Self-excitation by Rose and Barnwell I ``Vocoder: Another approach to long distance calls using 4800 baud'', ( 453-45 of ICASSP's IEEE PROC, April 1986 issue. This is equivalent to the data generated in the “self-exciting vocoder” shown in page 6). be.

上述したようなパラメータ最適化法は、並行フィルタ構造を有する場合も同様に適用可能である。The parameter optimization method described above can be applied similarly to cases with parallel filter structures. Applicable.

第７ａ図に示したグラフは、全極フィルタ３２に対する入力として用いられる代表的な信号波形を示す。この信号はＰＤＳ信号であって、全零フィルタ３１によってろ波されたものであり、音声波形に極めて類似したものである。二のＰＤＳ励起の音声近似性は、第７ｂ図に示したＰＥＳ励起波形とはまったく異なるものである。これらのＰＥＳ励起波形は、Ｍ　Ｐ　Ｌ　Ｐ　Ｃのような従来のシステムの中で用いられる励起波形に極めて似たものである。The graph shown in FIG. A typical signal waveform is shown. This signal is a PDS signal and is passed through the all-zero filter 31. It is filtered and has a very similar waveform to the voice waveform. Second PDS The audio approximation of the excitation is completely different from the PES excitation waveform shown in Figure 7b. It is. These PES excitation waveforms are compatible with conventional systems such as MPL This is very similar to the excitation waveform used in the system.

これらの励起は、人間のボーカルコードによって形成された声門パルス列に極めて似ているパルス列である。These excitations are unique to the glottal pulse train formed by the human vocal chord. This is a similar pulse train.

このように、この発明によるＰＤＳ励起は、従来のＰＥＳ励起を用いる場合よりもずっと良好な結果が得られる。従って、ＰＤＳ信号は、少ないろ波処理を行うだけであるにもかかわらず、“音声近似”信号であり、従来の音声源／フィルタモデルにおけるランダム波形から各音声フレームを再合成しなければならない方法に対して、音声フレーム間の変化を見るために、単にフィルタにおいて先行音声フレームを変形するだけでよい。In this way, PDS excitation according to the present invention is more effective than when using conventional PES excitation. gives much better results. Therefore, the PDS signal undergoes less filtering. However, it is an “audio approximation” signal and cannot be used with conventional audio sources/filters. Those who have to resynthesize each audio frame from random waveforms in the model For the method, in order to see the changes between audio frames, we simply use the preceding sound in the filter. All you have to do is transform the voice frame.

各実施例において、フレームサイズの増加に伴ってＳＮＲが減少することが分った。特に、単一の励起シーケンスによるＰＤＳを用いた実施例では、特に、フレームサイズの小さいところで高い５ＮＲ４’！性を示し、このことは伝送される音声の明瞭度を高めることを意味している。さらに、単一励起型のＰＤＳを用いる実施例はより小さいフレームサイズで構成することができ、この結果、コード化による遅れを減少させることができる効果か得られる。In each example, it was found that the SNR decreased as the frame size increased. Ta. In particular, in embodiments using PDS with a single excitation sequence, the frequency High 5NR4' in a small room size! This is transmitted This means improving the clarity of speech. Furthermore, using single excitation type PDS implementations can be configured with smaller frame sizes, resulting in code This has the effect of reducing delays caused by conversion.

フィルタパラメータを個々に評価する代わりに、いくつかのパラメータを合わせて評価することによって好都合なことがある。Instead of evaluating filter parameters individually, combine several parameters It may be advantageous to evaluate the

フィルタの係数の個数を増加したり、または、ｄ、の最大値を増加したりすることによって、コード化、デコード化の際のＳＮＲを高めることができることが分った。ＳＮＲの改善を行うために、フレーム当たりの伝送ビットを増加させる必要があるとしても、ビット数を新しいフィルタの係数に対して割当てることよりは、遅延パラメータのために特別なビットを割当てることの方が有利であることは明らかである。It is possible to increase the number of coefficients of the filter or increase the maximum value of d. It was found that the SNR during encoding and decoding can be increased by It was. In order to improve SNR, it is necessary to increase the transmitted bits per frame. Even if necessary, it is better to allocate the number of bits to the coefficients of the new filter. that it is advantageous to allocate a special bit for the delay parameter is clear.

国際調査報告international search report

Claims

[Claims]

(1) The filter parameters are periodically derived from the input audio signal in the coder. sent to the decoder to update the decoder filter response. and the excitation input to the decoder filter is the audio of the decoder filter. A method of voice communication, characterized in that the output is derived in the decoder. .

(2) Deriving the filter parameters includes combining the input audio signal and the data to be performed, so as to reduce the actual error between the coder's output and the coder's output. The method of claim 1, characterized in that:

(3) The decoder includes a composite filter comprising a first and a second filter in series. the first filter has a variable delay parameter and the second filter has a variable delay parameter; router is an infinite impulse response filter, and the filter parameters the input audio signal and the output of the decoder by changing the delay parameter; Derive the corresponding evaluation of said actual error between the forces and then this estimated and selecting said delay parameter values that reduce the actual error. The coder is periodically derived by a process. The method described in claim 2.

(4) said coder includes such a synthesis filter, and said coder includes an evaluation of said actual error; generating a synthesized speech output for each value of said delay parameter; in the coder by comparing the output with the input audio signal. 4. A method according to claim 3, characterized in that:

(5) For each of the delay parameter values, other parameters of the first filter are determined. The values for the meter and for the parameters of the second filter are together calculated and used to derive said estimated actual error. 5. The method according to claim 3 or 4.

(6) The values for the parameters of the second filter are determined outside the iterative process. The values for the other parameters of the first filter are initially evaluated and the values for the other parameters of the first filter are Deriving the estimated actual error calculated for each of the extended parameter values 5. The method according to claim 3, wherein the method is used for:

(7) The filter parameters are periodically derived from the input audio signal in the coder. sent to the decoder to update the decoder filter response. The decoder includes first and second filters in series, and the first and second filters are arranged in series. The filter has a variable delay parameter, and the second filter has an infinite impulse response. the excitation input to the decoder filter is the excitation input to the decoder filter. derived in the decoder from one or more intermediate outputs of the filter; The filter parameters vary the delay parameters and the filter parameters vary between the first and second filters. calculate other parameter values of the input audio signal and the output of the decoder; and then use this value to derive an estimate of the actual error between selecting the delay parameter value to reduce the estimated actual error of is derived in said coder by an iterative process including A method of voice communication.

(8) For each of the delay parameter values, other parameters of the first filter are determined. 5. The meter value is calculated such that the prediction error is low. The method described in any one of (7) to (7) above.

(9) For each of the delay parameter values, other parameters of the first filter are determined. 5. The meter value is calculated in such a way that the actual error is low. 7. The method according to any one of 7.

(10) After selecting the delay parameter, other parameters of the first filter The value of is recalculated so that the actual error is lowered. 7. The method described in any one of 7.

(11) The actual error depends on the spectral region that is perceived to be less important. Claims 1 to 10 are weighted to reduce the The method described in Shifting.

(12) When excited by a signal derived from a previous synthesized speech output, they The difference between the input audio signal and the synthesized audio output generated by the filter with the characteristics of filter characteristics from said input audio signal so as to reduce the estimation of the actual error between A voice code, characterized in that it comprises means configured to periodically derive the gender. -der.

(13) comprising a synthesis filter adapted to generate the synthesized speech output; The actual error evaluation is based on the input audio signal and the synthesized audio output of the synthesis filter. 13. The coder according to claim 12, wherein the error is an error formed between the force and the force.

(14) configured to periodically derive filter characteristics from the input audio signal means, the filter characteristic comprising first and second filters in series. an output produced by a synthesis filter comprising a synthesis filter and said input the first filter is derived to reduce the actual error between the audio signal and the audio signal; comprises a plurality of parallel feedforward filters, and the plurality of parallel feedforward filters At least one of the feedforward filters includes the parallel feedforward filter. Receive one of several different excitation sequences derived from the combination of router outputs and the second filter is an infinite impulse response filter. , said infinite impulse response filters change their characteristics when excited A voice coder comprising:

(15) Changing the delay parameter of a certain first parallel feedforward filter , derive such an estimate of said actual error for such a value, and calculate said actual error Then select that delay parameter value to decrease the parallel feedforward repeating the above steps for the delay parameters of each of the code filters; configured to derive the filter characteristics by a method comprising: and 15. A coder according to claim 14, characterized in that it comprises means.

(16) further comprising a local decoder adapted to generate synthesized speech output; The coder according to any one of claims 12 to 14.

(17) comprising a filter, and upon receiving the coded audio signal, the filter The filter is configured to update the characteristics of the filter, and the filter is configured to update the characteristics of the filter. an audio output connected to a A decoder for coded speech, characterized in that it is excited when used.

(18) comprising a filter, and upon receiving the coded audio signal, the filter The filter is configured to update the characteristics of the filter, and the filter includes a first and a second , the first filter includes a plurality of parallel feed forward filters. filter, at least one of the plurality of parallel feedforward filters One is derived from the combination of the outputs of the parallel feedforward filters. said second fibre, connected to receive one of a plurality of different excitation sequences; A router is an infinite impulse response filter for coded speech. decoder for.

(19) At least one of the parallel feedforward filters includes the second connected to receive the excitation sequence derived from the output of the filter of 19. The decoder according to claim 18, wherein:

(20) A receiver comprising the decoder according to any one of claims 17 to 19.

(21) A receiver substantially equivalent to that described with reference to Figures 2 and 2a. .

(22) A transmitter comprising the coder according to any one of claims 12 to 16.

(23) A transmitter substantially equivalent to that described with reference to FIG. 1 and FIG. 1a. .

(24) An audio communication device comprising the transmitter and receiver according to any one of claims 1 to 23. trust system.

(25) A voice communication method almost equivalent to that described with reference to the drawings.