JPH0736495A

JPH0736495A - Variable rate voice coding device

Info

Publication number: JPH0736495A
Application number: JP5181177A
Authority: JP
Inventors: Shigeru Hosoi; 茂細井; Norio Nomura; 規雄野村
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1993-07-22
Filing date: 1993-07-22
Publication date: 1995-02-07
Anticipated expiration: 2017-03-18
Also published as: JP3265726B2

Abstract

PURPOSE:To execute efficient quantization by adaptively changing the number of quantization bits by the result of the analysis of an input frame. CONSTITUTION:The signal to noise ratio (S/N: SNR) of the synthesized voice signal and input voice signal from an adaptive code book 1 and the synthesized voice signal and input voice signal from a probability code book 2 is calculated by an SNR calculator 16. If the calculated SNR is larger than the preset threshold value, the input voice signal is expressed only by the synthesized voice signal from either of the adaptive code book 1 and the probability code book 2 and the signal (gain code vector number) corresponding to the information from the other is not transmitted, by which the number of the quantization bits is varied and the quantization bits based in the characteristic of the input voice signal are varied.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、固体素子を記憶媒体に
用いた録音留守番電話機などに利用し、入力フレームの
分析結果により量子化ビット数を適応的に変化させて効
率の良い量子化を行う可変レート音声符号化装置に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention is used in a recording answering machine or the like in which a solid-state element is used as a storage medium, and the number of quantization bits is adaptively changed according to the analysis result of an input frame for efficient quantization. The present invention relates to a variable rate audio encoding device for performing.

【０００２】[0002]

【従来の技術】図２は従来のＣＥＬＰ（Code Excited L
inear Predictive coding ）方式による固定レート音声
符号化装置の構成を示すブロック図である。図２におい
て、この可変レート音声符号化装置は、過去の音源を蓄
積する適応コードブック１と、白色雑音を蓄積する確率
的コードブック２と、入力音声の声道フィルタの特性を
分析する線形予測分析器３とを有している。2. Description of the Related Art FIG. 2 shows a conventional CELP (Code Excited L).
It is a block diagram which shows the structure of the fixed rate speech coding apparatus by the inear Predictive coding system. In FIG. 2, the variable-rate speech coding apparatus includes an adaptive codebook 1 for accumulating past sound sources, a stochastic codebook 2 for accumulating white noise, and a linear prediction for analyzing characteristics of a vocal tract filter of input speech. And an analyzer 3.

【０００３】さらに、この可変レート音声符号化装置
は、コードブックから合成音声信号を得る合成フィルタ
４，５と、音源を可変増幅する増幅器６，７と、適応コ
ードブック１及び確率的コードブック２に蓄えられてい
る音源から当該短時間区間（以下フレームという）に最
適な音源（以下、最適コードベクトルと記載する）を得
るための最適コードベクトル選択器８と、合成音声信号
から音源信号を得る予測フィルタ９とを有している。ま
た、この可変レート音声符号化装置は、復号側に伝送す
る情報である線形予測係数（信号）ＳＡ、適応コードブ
ック１中の最適コードベクトルに対する音源ゲイン（信
号）ＳＢ、確率的コードブック２中の最適コードベクト
ルに対する音源ゲイン（信号）ＳＣ、適応コードブック
１中の最適コードベクトルの番号（信号）ＳＤ、確率的
コードブック２中の最適コードベクトルの番号（信号）
ＳＥをそれぞれ量子化する量子化器１０と、増幅器６，
７からの信号を加算する加算器１１と、この加算器１１
からの信号と入力音声信号を減算する減算器１２と、適
応コードブック１を選択する選択スイッチ１３と、確率
的コードブック２を選択する選択スイッチ１４とを有し
ている。Further, this variable rate speech coding apparatus includes synthesis filters 4 and 5 for obtaining synthesized speech signals from a codebook, amplifiers 6 and 7 for variably amplifying a sound source, an adaptive codebook 1 and a stochastic codebook 2. An optimal code vector selector 8 for obtaining an optimal sound source (hereinafter, referred to as an optimum code vector) for the short time section (hereinafter, referred to as a frame) from the sound source stored in, and a sound source signal from a synthesized speech signal. And a prediction filter 9. In addition, this variable rate speech coding apparatus includes a linear prediction coefficient (signal) SA that is information to be transmitted to the decoding side, an excitation gain (signal) SB for the optimum code vector in the adaptive codebook 1, and a stochastic codebook 2. Source code (signal) SC for the optimal code vector of, the optimal code vector number (signal) SD in the adaptive codebook 1 and the optimal code vector number (signal) in the stochastic codebook 2
Quantizer 10 for quantizing each SE, amplifier 6,
Adder 11 for adding signals from 7 and this adder 11
It has a subtracter 12 for subtracting the input signal and the input voice signal, a selection switch 13 for selecting the adaptive codebook 1, and a selection switch 14 for selecting the probabilistic codebook 2.

【０００４】次に、この従来例の構成の動作について説
明する。当該フレームに対応する入力音声信号信号から
線形予測分析器３で声道フィルタの特性が得られる。こ
こで得られたフィルタ係数が、合成フィルタ４，５及び
予測フィルタ９のフィルタ係数となる。次に適応コード
ブック１と確率的コードブック２に格納されている全て
の音源信号（以下、コードベクトルと記載する）が、合
成フィルタ４，５により合成され、それぞれのコードベ
クトルに対応する合成音声信号が得られる。なお、適応
コードブック１により当該フレームの信号に対して、過
去の音源と相関の強い成分が合成される。また、確率的
コードブック２により比較的相関の弱い独立した成分が
合成される。この合成された音声が増幅器６，７で増幅
される。さらに、適応コードブック１からの合成音声信
号と、確率的コードブック２からの合成音声信号が加算
されて最終的な合成音声信号が得られる。このそれぞれ
のコードベクトルに対応する合成音声信号の中で、入力
音声信号との誤差が最小となるコードベクトルが最適コ
ードベクトル選択器８により選択される。また、合成さ
れた音声から予測フィルタ９により音源信号を得る。Next, the operation of this conventional configuration will be described. The characteristics of the vocal tract filter are obtained by the linear prediction analyzer 3 from the input speech signal signal corresponding to the frame. The filter coefficients obtained here become the filter coefficients of the synthesis filters 4, 5 and the prediction filter 9. Next, all the sound source signals (hereinafter, referred to as code vectors) stored in the adaptive codebook 1 and the stochastic codebook 2 are synthesized by the synthesis filters 4 and 5, and synthesized speech corresponding to each code vector. The signal is obtained. It should be noted that the adaptive codebook 1 synthesizes a component having a strong correlation with the past sound source with respect to the signal of the frame. Further, the stochastic codebook 2 synthesizes independent components having a relatively weak correlation. The synthesized voice is amplified by the amplifiers 6 and 7. Further, the synthetic speech signal from the adaptive codebook 1 and the synthetic speech signal from the stochastic codebook 2 are added to obtain a final synthetic speech signal. The optimum code vector selector 8 selects the code vector having the smallest error from the input voice signal among the synthesized voice signals corresponding to the respective code vectors. Further, the predictive filter 9 obtains a sound source signal from the synthesized speech.

【０００５】次フレームのコードベクトル探索のため
に、適応コードブック１に蓄積される。そして、線形予
測係数ＳＡ、適応コードブック１中の最適コードベクト
ルに対する音源ゲインＳＢ、確率的コードブック２中の
最適コードベクトルに対する音源ゲインＳＣ、適応コー
ドブック１中の最適コードベクトルの番号ＳＤ、確率的
コードブック２中の最適コードベクトルの番号ＳＥが量
子化器１０で量子化される。ここでフレーム長さをＬ
（ｓｅｃ）、伝送する線形予測係数ＳＡから番号ＳＥま
での量子化ビット数をそれぞれＮ（ＳＡ）ビット〜Ｎ
（ＳＥ）ビットとすると、伝送ビットレートＴｆ（ビッ
ト／ｓｅｃ）は次式（１）で表される。It is stored in the adaptive codebook 1 for searching the code vector of the next frame. Then, the linear prediction coefficient SA, the excitation gain SB for the optimal code vector in the adaptive codebook 1, the excitation gain SC for the optimal code vector in the probabilistic codebook 2, the optimal code vector number SD in the adaptive codebook 1, the probability The optimum code vector number SE in the dynamic codebook 2 is quantized by the quantizer 10. Where the frame length is L
(Sec), the number of quantization bits from the linear prediction coefficient SA to the number SE to be transmitted is N (SA) bits to N, respectively.
Assuming (SE) bits, the transmission bit rate Tf (bit / sec) is expressed by the following equation (1).

【０００６】Ｔｆ＝Ｎ（ＳＡ）＋Ｎ（ＳＢ）＋Ｎ（ＳＣ）＋Ｎ（ＳＤ）＋Ｎ（ＳＥ）／Ｌ …（１）Tf = N (SA) + N (SB) + N (SC) + N (SD) + N (SE) / L (1)

【０００７】[0007]

【発明が解決しようとする課題】ところで上記の従来の
固定レート音声符号化装置では、入力された音声の特徴
によらず、常に一定のビットレートで量子化しているた
め、メモリの記憶容量に制約がある固体素子を記憶媒体
に用いた録音留守番電話機などでは効率の良い量子化が
出来なかった。例えば、適応コードブック１中のコード
ベクトルのみで入力音声に近い合成音声号が得られた場
合には、確率的コードブック２が不要となってしまう。By the way, in the above-mentioned conventional fixed rate speech coding apparatus, the quantization is always performed at a constant bit rate regardless of the characteristics of the inputted speech, so that the storage capacity of the memory is limited. It was not possible to quantize efficiently with a recording answering machine using a solid element as a storage medium. For example, if a synthetic speech code close to the input speech is obtained only with the code vector in the adaptive codebook 1, the stochastic codebook 2 becomes unnecessary.

【０００８】本発明は、このような従来の問題を解決す
るものであり、入力フレームの分析結果により量子化ビ
ット数を適応的に変化させ、効率の良い量子化が出来る
優れた可変レート音声符号化装置の提供を目的とする。The present invention solves such a conventional problem and is an excellent variable rate speech code capable of performing efficient quantization by adaptively changing the number of quantization bits according to the analysis result of an input frame. The purpose is to provide a chemical conversion device.

【０００９】[0009]

【課題を解決するための手段】上記目的を達成するため
に、本発明は、入力された短時間区間の適応コードベク
トルを格納する適応コードブックと確率コードベクトル
を格納する確率的コードブックを備えたＣＥＬＰ音声符
号化を行う際の可変レート音声符号化装置であって、適
応コードベクトル及び確率コードベクトルから合成音声
信号を生成する合成音声信号生成手段と、この合成音声
信号生成手段からの二つの合成音声信号と入力音声信号
とのそれぞれの信号対雑音比を計算する信号対雑音比計
算手段と、計算された信号対雑音比に基づいて当該区間
の符号化ビットレートを可変する可変手段とを備える構
成としている。To achieve the above object, the present invention comprises an adaptive codebook for storing an input adaptive code vector in a short time interval and a stochastic codebook for storing a stochastic code vector. A variable-rate speech coding apparatus for performing CELP speech coding, comprising: a synthesized speech signal generating means for generating a synthesized speech signal from an adaptive code vector and a probability code vector; and two synthesized speech signal generating means. A signal-to-noise ratio calculating means for calculating a signal-to-noise ratio of each of the synthesized speech signal and the input speech signal, and a varying means for varying the coding bit rate of the section based on the calculated signal-to-noise ratio. It is configured to be equipped.

【００１０】この構成にあって、信号対雑音比計算手段
は、計算された信号対雑音比と比較するしきい値を予め
設定するしきい値設定手段と、設定されたしきい値と信
号対雑音比を比較する比較手段と、しきい値より信号対
雑音比が大きい場合に、一方のコードブックからの合成
音声信号のみで入力音声信号を表せると判断して他方の
コードブックからの情報に対応する信号を伝送しないよ
うに量子化ビット数を可変する可変手段とを備える構成
としている。In this structure, the signal-to-noise ratio calculating means has a threshold setting means for presetting a threshold value to be compared with the calculated signal-to-noise ratio, and the set threshold value and the signal pair. When the signal-to-noise ratio is larger than the threshold and the comparison means for comparing the noise ratio, it is determined that the input voice signal can be represented by only the synthesized voice signal from one codebook, and the information from the other codebook is used. A variable unit for changing the number of quantization bits is provided so as not to transmit a corresponding signal.

【００１１】[0011]

【作用】このような構成により、本発明の可変レート音
声符号化装置は、適応コードブックからの合成音声信号
と入力音声信号及び確率的コードブックからの合成音声
信号と、入力音声信号との信号対雑音比を計算し、計算
された信号対雑音比が、予め設定されたしきい値より大
きい場合には、一方のコードブックからの合成音声信号
のみで入力音声信号を表することが出来ると判断してい
る。この判断によって他方のコードブックからの情報に
対応する信号（ゲイン・コードベクトル番号）を伝送し
ないようにして量子化ビット数を可変している。したが
って、入力音声信号の特徴に基づいた量子化ビットに可
変されて、効率の良い量子化が行われる。With such a configuration, the variable rate speech coding apparatus of the present invention is a signal of the synthesized speech signal from the adaptive codebook, the input speech signal and the synthesized speech signal from the stochastic codebook, and the input speech signal. When the signal-to-noise ratio is calculated and the calculated signal-to-noise ratio is larger than a preset threshold value, it is possible to represent the input speech signal only by the synthesized speech signal from one of the codebooks. Deciding. By this determination, the number of quantization bits is changed so that the signal (gain / code vector number) corresponding to the information from the other codebook is not transmitted. Therefore, the quantization bit is changed to a quantization bit based on the characteristics of the input audio signal, and efficient quantization is performed.

【００１２】[0012]

【実施例】以下、本発明の可変レート音声符号化装置の
実施例を図面を参照して詳細に説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT An embodiment of a variable rate speech coder according to the present invention will be described below in detail with reference to the drawings.

【００１３】図１は本発明の可変レート音声符号化装置
の実施例における構成を示すブロック図である。なお、
以下の文中及び図にあって、従前の図２に示す同一の構
成要素には同一の符号を付した。図１において、この可
変レート音声符号化装置は、過去の音源を蓄積する適応
コードブック１と、白色雑音を蓄積する確率的コードブ
ック２と、入力音声の声道フィルタの特性を分析する線
形予測分析器３と、コードブックから合成音声信号を得
る合成フィルタ４，５と、音源を可変増幅する増幅器
６，７と、適応コードブック１、確率的コードブック２
に蓄えられている音源から当該短時間区間（フレーム）
に最適な音源（コードベクトル）を得るための最適コー
ドベクトル選択器８とを有している。FIG. 1 is a block diagram showing the configuration of an embodiment of a variable rate speech coding apparatus of the present invention. In addition,
In the following text and drawings, the same components shown in the previous FIG. 2 are designated by the same reference numerals. In FIG. 1, this variable rate speech coding apparatus includes an adaptive codebook 1 for accumulating past sound sources, a stochastic codebook 2 for accumulating white noise, and a linear prediction for analyzing characteristics of a vocal tract filter of input speech. An analyzer 3, synthesis filters 4 and 5 for obtaining a synthetic speech signal from a codebook, amplifiers 6 and 7 for variably amplifying a sound source, an adaptive codebook 1, and a stochastic codebook 2
From the sound source stored in
And an optimum code vector selector 8 for obtaining an optimum sound source (code vector).

【００１４】さらに、この可変レート音声符号化装置に
は、合成音声信号から音源信号を得る予測フィルタ９
と、復号側に伝送する情報である線形予測係数ＳＡ、適
応コードブック１中の最適コードベクトルに対する音源
ゲインＳＢ、確率的コードブック２の中の最適コードベ
クトルに対する音源ゲインＳＣ、適応コードブック１中
の最適コードベクトルの番号ＳＤ、確率的コードブック
２中の最適コードベクトルの番号ＳＥ、信号対雑音比
（Ｓ／Ｎ比：ＳＮＲ）計算器１６からの情報ＳＦを量子
化する量子化器１０とを有している。Further, in this variable rate speech coder, a prediction filter 9 for obtaining a sound source signal from a synthesized speech signal.
And a linear prediction coefficient SA which is information to be transmitted to the decoding side, an excitation gain SB for the optimal code vector in the adaptive codebook 1, an excitation gain SC for the optimal code vector in the stochastic codebook 2, and an adaptive codebook 1 The optimum code vector number SD, the optimum code vector number SE in the stochastic codebook 2, and the quantizer 10 for quantizing the information SF from the signal-to-noise ratio (S / N ratio: SNR) calculator 16. have.

【００１５】また、この可変レート音声符号化装置は、
増幅器６，７からの信号を加算する加算器１１と、この
加算器１１からの信号と入力音声信号を減算する減算器
１２と、適応コードブック１を選択する選択スイッチ１
３と、確率的コードブック２を選択スイッチ１４と、適
応コードブック１からの合成音声信号と入力音声信号及
び確率的コードブック２からの合成音声信号と入力音声
信号とのＳＮＲを計算するＳＮＲ計算器１６とを有して
いる。Further, this variable rate speech coding apparatus is
An adder 11 that adds the signals from the amplifiers 6 and 7, a subtracter 12 that subtracts the signal from the adder 11 and the input voice signal, and a selection switch 1 that selects the adaptive codebook 1.
3, a stochastic codebook 2 selection switch 14, an SNR calculation for calculating the SNRs of the synthetic speech signal and the input speech signal from the adaptive codebook 1 and the synthetic speech signal and the input speech signal from the stochastic codebook 2. And a container 16.

【００１６】次に、この実施例の構成における動作につ
いて説明する。フレームに対応した入力音声信号信号
は、線形予測分析器３により、声道フィルタの特性が得
られる。ここで得られたフィルタ係数が、合成フィルタ
４，５及び予測フィルタ９のフィルタ係数となる。次に
適応コードブック１と確率的コードブック２に格納され
ている全てのコードベクトルが、それぞ選択スイッチ１
３，１４を通じて合成フィルタ４，５により合成され、
それぞれのコードベクトルに対応する合成音声信号が得
られる。なお、適応コードブック１により当該フレーム
の信号に対して、過去の音源と相関の強い成分が合成さ
れ、確率的コードブック２からは、比較的相関の弱い独
立した成分が合成される。この合成された音声は、増幅
器６，７によりそれぞれ増幅される。さらに、適応コー
ドブック１からの合成音声信号と、確率的コードブック
２からの合成音声信号が加算器１１で加算され最終的な
合成音声信号が得られる。このそれぞれのコードベクト
ルに対応する合成音声信号の中で、減算器１２を通じた
入力音声信号との誤差が最小となるコードベクトルが最
適コードブック選択器８により選択される。Next, the operation of the configuration of this embodiment will be described. The characteristics of the vocal tract filter of the input speech signal signal corresponding to the frame are obtained by the linear prediction analyzer 3. The filter coefficients obtained here become the filter coefficients of the synthesis filters 4, 5 and the prediction filter 9. Next, all the code vectors stored in the adaptive codebook 1 and the stochastic codebook 2 are selected by the selection switch 1 respectively.
Are synthesized by the synthesis filters 4, 5 through 3, 14,
A synthetic speech signal corresponding to each code vector is obtained. It should be noted that the adaptive codebook 1 synthesizes a component having a strong correlation with a past sound source, and the stochastic codebook 2 synthesizes an independent component having a relatively weak correlation. The synthesized voice is amplified by amplifiers 6 and 7, respectively. Further, the synthetic speech signal from the adaptive codebook 1 and the synthetic speech signal from the stochastic codebook 2 are added by the adder 11 to obtain a final synthetic speech signal. Among the synthesized speech signals corresponding to the respective code vectors, the code vector selector 8 selects the code vector having the minimum error from the input speech signal through the subtractor 12.

【００１７】ここで、当該フレームの適応コードブック
１からの合成音声信号と確率的コードブック２からの合
成音声信号及び入力音声信号がＳＮＲ計算器１６に入力
される。このＳＮＲ計算器１６では、適応コードブック
１からの合成音声信号と入力音声信号とのＳＮＲ（ＳＮ
Ｒａ）及び確率的コードブック２からの合成音声信号と
入力音声信号とのＳＮＲ（ＳＮＲｂ）を計算する。ＳＮ
Ｒ計算器１６には、予めＳＮＲａとＳＮＲｂに対するし
きい値ＳＮＲａｔｈとＳＮＲｂｔｈを設定して、以下の
制御を行う。（１）ＳＮＲａ＞ＳＮＲａｔｈの場合には、適応コー
ドブック１のみで入力音声信号が近似できるものであ
り、確率的コードブック２に関する情報は伝送しない。
量子化器に対して「適応コードブック１のみのモード」
であることを伝える信号ＳＦを出力する。（２）ＳＮＲａ≦ＳＮＲａｔｈ、かつ、ＳＮＲｂ＞ＳＮ
Ｒｂｔｈの場合は、確率的コードブック２のみで入力音
声信号が近似できるものであり、適応コードブック１に
関する情報は伝送しない。量子化器１０に対し、「確率
的コードブック２のみのモード」であることを伝える信
号ＳＦを出力する。（３）これら以外の場合は適応コードブック１及び確率
的コードブック２の双方が必要であり、量子化器１０に
対して「適応コードブック１及び確率的コードブック２
の双方を用いるモード」であることを伝える信号ＳＦを
出力する。（４）ＳＮＲｂ＞ＳＮＲｂｔｈの場合は、確率的コード
ブック２のみのモードとする。（５）ＳＮＲｂ≦ＳＮＲｂｔｈ、かつ、ＳＮＲａ＞ＳＮ
Ｒａｔｈの場合は、適応コードブック１のみのモードと
する。（６）これら以外の場合は、適応コードブック１及び確
率的コードブック２の双方を用いるモードとする。この
ように各種の組み合わせがある。Here, the synthesized speech signal from the adaptive codebook 1 of the frame, the synthesized speech signal from the stochastic codebook 2 and the input speech signal are input to the SNR calculator 16. The SNR calculator 16 calculates the SNR (SN of the synthesized speech signal from the adaptive codebook 1 and the input speech signal).
Ra) and the SNR (SNRb) of the synthetic speech signal from stochastic codebook 2 and the input speech signal are calculated. SN
In the R calculator 16, thresholds SNRath and SNRbth for SNRa and SNRb are set in advance, and the following control is performed. (1) In the case of SNRa> SNRath, the input speech signal can be approximated only by the adaptive codebook 1, and the information regarding the probabilistic codebook 2 is not transmitted.
For quantizer "Mode of adaptive codebook 1 only"
Then, a signal SF indicating that is output. (2) SNRa ≦ SNRath and SNRb> SN
In the case of Rbth, the input speech signal can be approximated only by the stochastic codebook 2, and the information regarding the adaptive codebook 1 is not transmitted. It outputs to the quantizer 10 a signal SF indicating that it is in the "only stochastic codebook 2 mode". (3) In other cases, both the adaptive codebook 1 and the probabilistic codebook 2 are required, and the quantizer 10 is instructed by "adaptive codebook 1 and stochastic codebook 2".
A signal SF indicating that it is a mode in which both of the above are used is output. (4) If SNRb> SNRbth, the probabilistic codebook 2 only mode is used. (5) SNRb ≦ SNRbth and SNRa> SN
In the case of Rath, only the adaptive codebook 1 is used. (6) In cases other than these, the mode is set to use both the adaptive codebook 1 and the stochastic codebook 2. Thus, there are various combinations.

【００１８】また、ＳＮＲ計算器１６によって選択した
音源の合成音声信号から予測フィルタ９によって音源信
号が得られ、次フレームのコードベクトル探索のため
に、適応コードブック１に蓄積される。そして、線型予
測係数ＳＡ、適応コードブック１中の最適コードベクト
ルに対する音源ゲインＳＢ、確率的コードブック２中の
最適コードベクトルに対する音源ゲインＳＣ、適応コー
ドブック１中の最適コードベクトルの番号ＳＤ、確率的
コードブック２中の最適コードベクトルの番号ＳＥ、Ｓ
ＮＲ計算器１６からのモードに関する信号ＳＦが量子化
器１０で量子化される。ここで、フレームの長さをＬ
（ｓｅｃ）、伝送する線型予測係数ＳＡ〜信号ＳＦまで
の量子化ビット数をそれぞれＮ（ＳＡ）ビット〜Ｎ（Ｓ
Ｆ）ビットとすると、伝送ビットレート（ビット／ｓｅ
ｃ）は、上記の３モードを構成する場合には、Ｎ（Ｓ
Ｆ）＝２である。そして、適応コードブック１のみのモ
ードの伝送ビットレート（ビット／ｓｅｃ）Ｔｖ１は次
式（２）で表される。Further, the predictive filter 9 obtains a sound source signal from the synthesized sound signal of the sound source selected by the SNR calculator 16 and stores the sound source signal in the adaptive codebook 1 for searching the code vector of the next frame. Then, the linear prediction coefficient SA, the excitation gain SB for the optimal code vector in the adaptive codebook 1, the excitation gain SC for the optimal code vector in the stochastic codebook 2, the optimal code vector number SD in the adaptive codebook 1, the probability Code code numbers SE and S in the dynamic codebook 2
The signal SF related to the mode from the NR calculator 16 is quantized by the quantizer 10. Where the frame length is L
(Sec), and the number of quantization bits from the linear prediction coefficient SA to the signal SF to be transmitted is N (SA) bits to N (S), respectively.
F) bits, the transmission bit rate (bit / se
c) is N (S
F) = 2. Then, the transmission bit rate (bit / sec) Tv1 in the mode of only the adaptive codebook 1 is expressed by the following equation (2).

【００１９】Ｔｖ１＝Ｎ（ＳＡ）＋Ｎ（ＳＢ）＋Ｎ（ＳＤ）＋Ｎ（ＳＦ）／Ｌ…（２）さらに、確率的コードブック２のみのモードの伝送ビッ
トレート（ビット／ｓｅｃ）Ｔｖ２は次式（３）で表さ
れる。Tv1 = N (SA) + N (SB) + N (SD) + N (SF) / L (2) Further, the transmission bit rate (bit / sec) Tv2 in the mode of only the probabilistic codebook 2 is expressed by the following equation. It is represented by (3).

【００２０】Ｔｖ２＝Ｎ（ＳＡ）＋Ｎ（ＳＣ）＋Ｎ（ＳＥ）＋Ｎ（ＳＦ）／Ｌ…（３）また、適応コードブック１及び確率的コードブック２の
双方を用いるモードの伝送ビットレート（ビット／ｓｅ
ｃ）Ｔｖ３は次式（４）で表される。Tv2 = N (SA) + N (SC) + N (SE) + N (SF) / L (3) Further, the transmission bit rate (bits) in the mode using both the adaptive codebook 1 and the stochastic codebook 2 / Se
c) Tv3 is represented by the following equation (4).

【００２１】Ｔｖ３＝Ｎ（ＳＡ）＋Ｎ（ＳＢ）＋Ｎ（ＳＣ）＋Ｎ（ＳＤ）＋Ｎ（ＳＥ）＋Ｎ（Ｆ）／Ｌ …（４）ここで、適応コードブック１、確率的コードブック２の
モードが選ばれる確率をそれぞれＰ１、Ｐ２とすると、
平均ビットレートＴｖａｖは、次式（５）で表される。Tv3 = N (SA) + N (SB) + N (SC) + N (SD) + N (SE) + N (F) / L (4) Here, adaptive codebook 1 and stochastic codebook 2 modes Let P1 and P2 be the probabilities of being selected, respectively.
The average bit rate Tvav is expressed by the following equation (5).

【００２２】Ｔｖａｖ＝Ｐ１・Ｔｖ１＋Ｐ２・Ｔｖ２＋（１−Ｐ１−Ｐ２）・Ｔｖ３ …（５）したがって、Ｔｖａｖ＜Ｔｆが成立すれば、平均ビット
レートが減少することになり、効率の良い量子化が可能
になる。Tvav = P1 * Tv1 + P2 * Tv2 + (1-P1-P2) * Tv3 (5) Therefore, if Tvav <Tf is satisfied, the average bit rate is reduced and efficient quantization is possible. become.

【００２３】[0023]

【発明の効果】以上の説明から明らかなように、本発明
の可変レート音声符号化装置は、適応コードブックから
の合成音声信号と入力音声信号及び確率的コードブック
からの合成音声信号と入力音声信号との信号対雑音比を
計算し、この計算された信号対雑音比が、予め設定され
たしきい値より大きい場合には、一方のコードブックか
らの合成音声信号のみで入力音声信号を表することが出
来ると判断している。この判断によって他方のコードブ
ックからの情報に対応する信号を伝送しないようにして
量子化ビット数を可変しているため、入力音声信号の特
徴に基づいた量子化ビットに可変されて、効率の良い量
子化が出来るという効果を有する。As is apparent from the above description, the variable-rate speech coding apparatus of the present invention has a synthetic speech signal and an input speech signal from an adaptive codebook and a synthetic speech signal and an input speech from a stochastic codebook. The signal-to-noise ratio with the signal is calculated, and if the calculated signal-to-noise ratio is larger than a preset threshold value, the input speech signal is represented by only the synthesized speech signal from one of the codebooks. I think I can do it. Based on this judgment, the number of quantization bits is changed so that the signal corresponding to the information from the other codebook is not transmitted, so that the number of quantization bits is changed according to the characteristics of the input audio signal, and the efficiency is improved. It has the effect of being able to quantize.

[Brief description of drawings]

【図１】本発明の可変レート音声符号化装置の実施例に
おける構成を示すブロック図FIG. 1 is a block diagram showing the configuration of an embodiment of a variable rate speech coding apparatus of the present invention.

【図２】従来のＣＥＬＰ方式による固定レート音声符号
化装置の構成を示すブロック図FIG. 2 is a block diagram showing the configuration of a conventional fixed-rate speech coding apparatus using the CELP method.

[Explanation of symbols]

１適応コードブック２確率的コードブック３線形予測分析器４，５合成フィルタ８最適コードベクトル選択器９予測フィルタ９１０量子化器１１加算器１２減算器１３，１４選択スイッチ１６ＳＮＲ計算器 1 Adaptive Codebook 2 Stochastic Codebook 3 Linear Prediction Analyzer 4,5 Synthesis Filter 8 Optimum Code Vector Selector 9 Prediction Filter 9 10 Quantizer 11 Adder 12 Subtractor 13, 14 Selection Switch 16 SNR Calculator

Claims

[Claims]

1. A variable rate speech coder for performing CELP speech coding, comprising an adaptive codebook for storing input adaptive codevectors for a short time period and a stochastic codebook for storing stochastic codevectors. Then, a synthetic speech signal generating means for generating a synthetic speech signal from the adaptive code vector and the probability code vector, and a signal-to-noise ratio of each of the two synthetic speech signals from the synthetic speech signal generating means and the input speech signal. A variable-rate speech coding apparatus comprising: a signal-to-noise ratio calculating means for calculating the above-mentioned; and a varying means for varying the coding bit rate of the section based on the calculated signal-to-noise ratio.

2. The signal-to-noise ratio calculating means compares the calculated signal-to-noise ratio with a threshold setting means for presetting a threshold value to be compared with the set threshold value. When the signal-to-noise ratio is larger than the threshold value, it is determined that the input voice signal can be represented by only the synthesized voice signal from one codebook, and the signal corresponding to the information from the other codebook is determined. The variable rate speech coding apparatus according to claim 1, further comprising: a variable unit that varies the number of quantization bits so as not to transmit.