JP3137550B2

JP3137550B2 - Audio encoding / decoding device

Info

Publication number: JP3137550B2
Application number: JP07030625A
Authority: JP
Inventors: 泉賢今
Original assignee: Panasonic Corp; Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Corp; Panasonic Holdings Corp
Priority date: 1995-02-20
Filing date: 1995-02-20
Publication date: 2001-02-26
Anticipated expiration: 2016-02-26
Also published as: JPH08221098A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、音声からピッチ周波
数、各高調波の有声無声判定、スペクトル振幅情報の３
つのパラメータにより符号化・復号化を行なう音声符号
化・復号化装置であって、そのスペクトル振幅情報をＬ
ＰＣ（線形予測）係数によって表現し、そのパラメータ
を効率よく符号化・復号化する音声符号化・復号化装置
に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method of determining pitch frequency, voiced / unvoiced judgment of each harmonic, and spectrum amplitude information from speech.
A speech encoding / decoding device that performs encoding / decoding using two parameters, wherein the spectrum amplitude information is represented by L
The present invention relates to an audio encoding / decoding device that is expressed by PC (linear prediction) coefficients and encodes / decodes its parameters efficiently.

【０００２】[0002]

【従来の技術】近年、ディジタル信号処理技術の発達に
より、ディジタル通信のサービスが多様化し、通信にお
ける伝送容量の制限から、低ビットレート化の要求が高
まっている。高能率音声符号化技術は、その要求を満た
すために欠かすことのできない技術である。ピッチ周波
数、各高調波の有声無声判定、スペクトル振幅情報の３
つのパラメータを符号化するＭＢＥ（Multi Band Excit
ed）符号化法は、低ビットレートにおいても良好な音質
が得られる優れた符号化方法として知られている(IEEE
Trans ASSP VOL 36. NO.8. 1988)。また、スペクトル振
幅情報を表すパラメータとしてＬＰＣ係数を用いるＭＢ
Ｅ符号化法も知られている(Electronics Letters Vol.2
7. No.14. 1991) 。2. Description of the Related Art In recent years, with the development of digital signal processing technology, digital communication services have been diversified, and there has been an increasing demand for lower bit rates due to limitations on transmission capacity in communication. High-efficiency speech coding technology is an indispensable technology to meet the demand. Pitch frequency, voiced / unvoiced judgment of each harmonic, spectrum amplitude information
MBE (Multi Band Excit) that encodes two parameters
ed) The coding method is known as an excellent coding method capable of obtaining good sound quality even at a low bit rate (IEEE
Trans ASSP VOL 36. NO.8. 1988). Also, an MB using an LPC coefficient as a parameter representing spectrum amplitude information
An E encoding method is also known (Electronics Letters Vol.2
7. No.14. 1991).

【０００３】以下にスペクトル振幅情報を表すパラメー
タとしてＬＰＣ係数を用いるＭＢＥ符号化法を用いた符
号化・復号化装置について説明する。まず、図８を参照
にして従来の符号化装置につて説明する。図８におい
て、１は入力音声信号を入力とし、ピッチ周波数を出力
とするピッチ周波数推定部である。２は入力音声信号お
よびピッチ周波数を入力とし、各高調波の有声無声判定
を出力とする有声無声判定部である。３は入力音声信号
を入力とし、ＬＰＣ係数を出力とするＬＰＣ分析部であ
る。４はＬＰＣ係数を入力とし、量子化ＬＰＣ係数を出
力とする量子化器である。５は入力音声信号とＬＰＣ係
数を入力とし、ＬＰＣ予測残差を出力とするＬＰＣ逆フ
ィルタである。６はＬＰＣ予測残差を入力とし、フレー
ムパワーを出力とするフレームパワー算出部である。７
はフレームパワーを入力とし、量子化フレームパワーを
出力とする量子化器である。８はピッチ周波数、有声無
声判定、量子化ＬＰＣ係数、量子化フレームパワーを入
力とし、符号を出力するマルチプレクサである。[0003] An encoding / decoding apparatus using an MBE encoding method using LPC coefficients as parameters representing spectral amplitude information will be described below. First, a conventional encoding device will be described with reference to FIG. In FIG. 8, reference numeral 1 denotes a pitch frequency estimating unit that receives an input voice signal and outputs a pitch frequency. Reference numeral 2 denotes a voiced / unvoiced determination unit which receives an input voice signal and a pitch frequency and outputs voiced / unvoiced determination of each harmonic. Reference numeral 3 denotes an LPC analysis unit that receives an input audio signal and outputs an LPC coefficient. Reference numeral 4 denotes a quantizer that receives LPC coefficients as input and outputs quantized LPC coefficients as output. Reference numeral 5 denotes an LPC inverse filter that receives an input speech signal and LPC coefficients as inputs and outputs an LPC prediction residual. Reference numeral 6 denotes a frame power calculation unit that receives the LPC prediction residual and inputs the frame power. 7
Is a quantizer that receives frame power as input and outputs quantized frame power. A multiplexer 8 receives a pitch frequency, a voiced / unvoiced determination, a quantized LPC coefficient, and a quantized frame power as inputs and outputs a code.

【０００４】次に、図８においてその動作について説明
する。ピッチ周波数推定部１では、入力音声信号からそ
のピッチ周波数を算出する。ピッチ周波数を算出する手
段としては、従来から入力音声信号の相関関数やスペク
トル振幅を利用する方法が知られている。次に、有声無
声判定部２では、入力音声信号のスペクトルを算出し、
ピッチ周波数に基づいて高調波周波数を求め、それらを
もとに各高調波の有声無声判定を行う。各高調波の有声
無声判定方法としては、各高調波を有声と仮定したとき
のスペクトルと入力音声信号のスペクトルの差異をもと
に判定を行う方法が、従来から知られている。ＬＰＣ分
析部３では、入力音声信号のＬＰＣ分析を行い、ＬＰＣ
係数を算出する。量子化器４では、ＬＰＣ係数を従来か
ら知られているベクトル量子化などの効率の良い量子化
法で量子化する。ＬＰＣ逆フィルタ５では、ＬＰＣ係数
を用いて入力音声信号をＬＰＣ逆フィルタに通す。この
結果、入力音声信号のＬＰＣ予測残差が得られる。パワ
ー算出部６では、ＬＰＣ予測残差のパワーのフレーム内
平均値を算出し、それをフレームパワーとして出力す
る。量子化器７では、フレームパワーを、従来から知ら
れているような効率の良い量子化法によって量子化す
る。マルチプレクサ８は、復号化装置に送る情報として
得られた、ピッチ周波数、各高調波の有声無声判定、量
子化ＬＰＣ係数、量子化フレームパワーを効率よく符号
化する。結果として、符号化されたピッチ周波数、有声
無声判定、ＬＰＣ係数、フレームパワーが、この符号化
装置の出力として得られる。Next, the operation will be described with reference to FIG. The pitch frequency estimating unit 1 calculates a pitch frequency from an input voice signal. As a means for calculating the pitch frequency, a method using a correlation function or a spectrum amplitude of an input voice signal has been conventionally known. Next, the voiced / unvoiced determination unit 2 calculates the spectrum of the input voice signal,
Harmonic frequencies are determined based on the pitch frequency, and voiced / unvoiced determination of each harmonic is performed based on the harmonic frequencies. As a voiced / unvoiced determination method of each harmonic, a method of performing determination based on a difference between a spectrum when each harmonic is assumed to be voiced and a spectrum of an input voice signal is conventionally known. The LPC analysis unit 3 performs an LPC analysis of the input audio signal,
Calculate the coefficient. In the quantizer 4, the LPC coefficients are quantized by an efficient quantization method such as conventionally known vector quantization. The LPC inverse filter 5 passes the input audio signal through the LPC inverse filter using the LPC coefficients. As a result, an LPC prediction residual of the input audio signal is obtained. The power calculator 6 calculates an average value of the power of the LPC prediction residual within the frame, and outputs the average value as the frame power. In the quantizer 7, the frame power is quantized by an efficient quantization method as conventionally known. The multiplexer 8 efficiently encodes the pitch frequency, the voiced / unvoiced determination of each harmonic, the quantized LPC coefficient, and the quantized frame power obtained as information to be sent to the decoding device. As a result, the encoded pitch frequency, voiced / unvoiced decision, LPC coefficient, and frame power are obtained as outputs of the encoding device.

【０００５】次に、図９を参照して従来の復号化装置に
ついて説明する。図９において、１１は符号を入力と
し、ピッチ周波数、有声無声判定、量子化フレームパワ
ー、量子化ＬＰＣ係数を出力とするデマルチプレクサで
ある。１２はピッチ周波数、有声無声判定を入力とし、
ＭＢＥ合成モードを出力するモード決定部である。１３
は量子化ＬＰＣ係数を入力とし、逆量子化されたＬＰＣ
係数を出力とする逆量子化器である。１４は逆量子化さ
れたＬＰＣ係数を入力とし、スペクトル強調されたＬＰ
Ｃ係数を出力とするスペクトル強調部である。１５はス
ペクトル強調されたＬＰＣ係数を入力とし、各高調波の
振幅乗数を出力とする高調波振幅乗数算出部である。１
６は量子化フレームパワーを入力とし、逆量子化された
フレームパワーを出力とする逆量子化器である。１７は
ピッチ周波数と、有声無声判定と、フレームパワーを入
力とし、音源信号を出力とする音源生成部である。１８
はＭＢＥ合成モードと、音源信号と、各高調波振幅乗数
を入力とし、復号化音声を出力とするＭＢＥ合成部であ
る。Next, a conventional decoding apparatus will be described with reference to FIG. In FIG. 9, reference numeral 11 denotes a demultiplexer which receives a code and outputs pitch frequency, voiced / unvoiced determination, quantized frame power, and quantized LPC coefficient. 12, a pitch frequency and a voiced / unvoiced judgment are input,
A mode determination unit that outputs the MBE combining mode. 13
Is the input of the quantized LPC coefficient and the inversely quantized LPC
This is an inverse quantizer that outputs coefficients. Reference numeral 14 denotes an input of the inversely quantized LPC coefficient, and a spectrum-emphasized LP.
This is a spectrum emphasis unit that outputs the C coefficient. Reference numeral 15 denotes a harmonic amplitude multiplier calculation unit that receives the spectrum-emphasized LPC coefficient and outputs the amplitude multiplier of each harmonic. 1
Reference numeral 6 denotes an inverse quantizer which receives the quantized frame power as input and outputs the inversely quantized frame power as output. Reference numeral 17 denotes a sound source generation unit which receives a pitch frequency, voiced / unvoiced judgment, and frame power as input and outputs a sound source signal. 18
Is an MBE synthesizing unit which receives an MBE synthesis mode, a sound source signal, and each harmonic amplitude multiplier as inputs, and outputs decoded speech.

【０００６】次に、上記図９においてその動作を説明す
る。まず、デマルチプレクサ１１では、符号化装置側か
ら送られた符号から、ピッチ周波数、有声無声判定、量
子化されたフレームパワー、量子化されたＬＰＣ係数を
それぞれ取り出す。モード決定部１２では、ピッチ周波
数と有声無声判定をもとに、従来から用いられているモ
ード決定法でＭＢＥ合成モードを決定する。逆量子化器
１３は、量子化されたＬＰＣ係数を逆量子化する。スペ
クトル強調部１４は、ＬＰＣ係数に対し、出力音声の主
観的な音質を高めるために、従来から知られているよう
なフォルマント強調や高域強調などの処理を加える。高
調波振幅乗数算出部１５では、ＬＰＣ係数をＦＦＴによ
り周波数領域に変換する。求めた各フーリエ係数の絶対
値を求め、逆数をとることにより、合成音声信号のスペ
クトル包絡を得ることができる。合成音声信号の各高調
波の振幅乗数は、ピッチ周波数ｗ0 の整数倍の周波数ｎ
ｗ0 を求め、スペクトル包絡からその周波数成分を抽出
することによって得られる。逆量子化器１６は、量子化
されたフレームパワーを逆量子化する。音源生成部１７
では、ピッチ周波数と、有声無声判定と、フレームパワ
ーにより、音源を生成する。音源の生成は、まずピッチ
周波数から各高調波の周波数を求め、各高調波の周波数
を算出する。また、有声無声判定をもとに各高調波周波
数ごとに基本音源信号を生成する。これは、従来からそ
の高調波が有声であれば正弦波、無声であれば白色雑音
とする方法が知られている。次にフレームパワーから、
各高調波の音源振幅を推定し、それを基本音源信号に乗
ずることにより音源信号を生成する。ＭＢＥ合成部１８
では、このようにして生成された音源信号と各高調波の
振幅とＭＢＥ合成モードをもとに、従来から知られてい
るＭＢＥ合成を行い、復号化音声信号を出力する。Next, the operation will be described with reference to FIG. First, the demultiplexer 11 extracts a pitch frequency, voiced / unvoiced determination, quantized frame power, and quantized LPC coefficients from the code sent from the encoding device side. The mode determination unit 12 determines the MBE combining mode by a conventionally used mode determination method based on the pitch frequency and the voiced / unvoiced determination. The inverse quantizer 13 inversely quantizes the quantized LPC coefficients. The spectrum emphasizing unit 14 performs conventionally known processing such as formant emphasis and high-frequency emphasis on the LPC coefficients in order to improve the subjective sound quality of the output sound. The harmonic amplitude multiplier calculator 15 converts the LPC coefficient into a frequency domain by FFT. The spectrum envelope of the synthesized speech signal can be obtained by obtaining the absolute value of each of the obtained Fourier coefficients and taking the reciprocal. The amplitude multiplier of each harmonic of the synthesized voice signal is a frequency n which is an integral multiple of the pitch frequency w0.
It is obtained by finding w0 and extracting its frequency components from the spectral envelope. The inverse quantizer 16 inversely quantizes the quantized frame power. Sound source generator 17
Then, a sound source is generated based on the pitch frequency, voiced / unvoiced judgment, and frame power. To generate a sound source, first, the frequency of each harmonic is obtained from the pitch frequency, and the frequency of each harmonic is calculated. In addition, a basic sound source signal is generated for each harmonic frequency based on the voiced / unvoiced determination. Conventionally, a method is known in which a sine wave is used if the harmonic is voiced, and a white noise is used if the harmonic is unvoiced. Next, from frame power,
A sound source signal is generated by estimating a sound source amplitude of each harmonic and multiplying the estimated sound source amplitude by a basic sound source signal. MBE synthesis unit 18
Then, based on the sound source signal generated in this way, the amplitude of each harmonic, and the MBE synthesis mode, conventionally known MBE synthesis is performed, and a decoded audio signal is output.

【０００７】[0007]

【発明が解決しようとする課題】しかしながら、上記の
従来の音声符号化・復号化装置では、ＬＰＣ係数のみか
ら求めたスペクトル包絡から各高調波のパワーを算出し
ているため、ＬＰＣ係数では表現できないような、より
細かいスペクトル包絡が得られないため、各高調波のパ
ワーが正確に符号化されず、復号化音声の品質が低下す
るという問題を有していた。また、フレーム内で入力音
声信号のパワーが大きく変化しても、それを表現する手
段がないために、追従できず、音の立ち上がり部分など
で復号化音声の品質が低下するという問題を有してい
た。However, in the above-mentioned conventional speech coding / decoding apparatus, since the power of each harmonic is calculated from the spectral envelope obtained only from the LPC coefficient, it cannot be represented by the LPC coefficient. Since such a finer spectral envelope cannot be obtained, there is a problem that the power of each harmonic is not accurately encoded, and the quality of the decoded speech is reduced. In addition, even if the power of the input audio signal greatly changes within a frame, there is no means to represent the power, and the input audio signal cannot follow the input signal. I was

【０００８】本発明は、上記従来の問題を解決するもの
で、高い復号化音声品質を実現する優れた音声符号化・
復号化装置を提供することを目的とする。[0008] The present invention solves the above-mentioned conventional problems, and provides an excellent speech encoding and decoding method for realizing high decoded speech quality.
It is an object to provide a decoding device.

【０００９】[0009]

【課題を解決するための手段】本発明は、上記目的を達
成するために、符号化装置に、ピッチ周波数推定部と、
各高調波の有声無声判定をする有声無声判定部と、ＬＰ
Ｃ（線形予測）分析部と、ＬＰＣ係数を量子化する量子
化器と、ＬＰＣ逆フィルタと、ＬＰＣ予測残差から少な
くとも残差フレームパワー以外の残差情報を取り出す残
差情報算出部と、ピッチ周波数と有声無声判定と量子化
されたＬＰＣ係数と残差情報とを符号化するマルチプレ
クサとを備え、復号化装置に、符号からピッチ周波数と
有声無声判定と量子化されたＬＰＣ係数と残差情報とを
復号化するデマルチプレクサと、ＭＢＥ合成モードを決
定するモード決定部と、量子化されたＬＰＣ係数を逆量
子化してＬＰＣ係数を取り出す逆量子化器と、ＬＰＣ係
数から各高調波への振幅乗数を求める高調波振幅乗数算
出部と、ピッチ周波数と有声無声周波数と残差情報とか
ら音源信号を生成する音源生成部と、ＭＢＥ合成部とを
備え、符号化装置における前記残差情報算出部が、フレ
ームパワー算出部と、フレームパワーを量子化する量子
化器と、ＬＰＣ予測残差を正規化する予測残差正規化部
と、予測残差のパワースペクトルを算出するパワースペ
クトル算出部と、残差に含まれる高調波のパワーを算出
する残差高調波パワー算出部と、残差高調波パワーから
スペクトル補正ベクトルを算出するスペクトル補正ベク
トル算出部と、スペクトル補正ベクトルを量子化する量
子化器とを備え、復号化装置における音源生成部が、残
差情報算出部において求められたフレームパワーとスペ
クトル補正ベクトルを用いて音源を生成し、また、符号
化装置および復号化装置が、それぞれピッチ周波数と有
声無声判定をもとにＬＰＣ係数およびスペクトル補正ベ
クトルに対するビット割り当てをフレーム毎に算出する
量子化レベル数算出部を備えるとともに、それぞれＬＰ
Ｃ係数を量子化・逆量子化する量子化器・逆量子化器お
よびスペクトル補正ベクトルを量子化・逆量子化する量
子化器・逆量子化器が、それぞれ量子化レベル数算出部
から与えられたフレーム毎に異なる量子化レベルで量子
化・逆量子化を行い、またマルチプレクサおよびデマル
チプレクサが、それぞれ量子化レベル数算出部で求めら
れたビット割り当てに基づき量子化ＬＰＣ係数および量
子化スペクトル補正ベクトルを符号化・復号化するよう
にしたものである。According to the present invention, in order to achieve the above object, a coding apparatus includes a pitch frequency estimating unit,
A voiced / unvoiced determination unit for voiced / unvoiced determination of each harmonic, and LP
C (linear prediction) analysis unit and a quantum for quantizing LPC coefficients
, LPC inverse filter, and LPC prediction residual
At least the residual information other than the residual frame power
Difference information calculator, pitch frequency, voiced / unvoiced judgment and quantization
To encode the obtained LPC coefficients and residual information
And the decoding device, from the code to the pitch frequency and
The voiced / unvoiced decision, the quantized LPC coefficient and the residual information
Determine the demultiplexer to decode and the MBE combining mode
Mode determining unit to determine the LPC coefficient quantized
An inverse quantizer for extracting LPC coefficients by converting
Harmonic amplitude multiplier to find the amplitude multiplier for each harmonic from the number
Output part, pitch frequency, voiced unvoiced frequency, residual information, etc.
A sound source generation unit for generating a sound source signal from the
Wherein the residual information calculation unit in the encoding device is
Frame power calculation unit and a quantum that quantizes the frame power
And a prediction residual normalization unit for normalizing the LPC prediction residual
And the power spectrum for calculating the power spectrum of the prediction residual
Calculate the power of harmonics contained in the residual by the vector calculator
From the residual harmonic power calculator and the residual harmonic power
Spectral correction vector for calculating the spectral correction vector
Torque calculator and the amount to quantize the spectral correction vector
And a sound generator in the decoding apparatus.
The frame power and spec
A sound source is generated using the vector
The decoding frequency and the decoding frequency are respectively
The LPC coefficient and the spectrum correction
Calculate bit allocation for each frame
A quantization level number calculation unit is provided, and LP
Quantizer / dequantizer for quantizing / dequantizing C coefficient
Amount to quantize and dequantize spectral correction vector and spectrum correction vector
The quantizer and inverse quantizer are each a quantization level calculation unit.
At different quantization levels for each frame given by
Multiplexing and inverse quantization, and multiplexer and demultiplexer
Each of the multiplexors is calculated by the quantization level calculation unit.
LPC coefficient and quantity based on the assigned bit allocation
Encode and decode child spectral correction vectors
It was made .

【００１０】[0010]

【作用】上記構成により、符号化装置において、ＬＰＣ
予測残差から各高調波のパワーを抽出し、復号化装置に
おいて、その各高調波のパワーをもとに音源信号を生成
することで、各高調波のパワーを正確に表現できる。According to the above arrangement, the encoding apparatus uses the LPC
By extracting the power of each harmonic from the prediction residual and generating a sound source signal based on the power of each harmonic in the decoding device, the power of each harmonic can be accurately represented.

【００１１】また、符号化装置において、ＬＰＣ予測残
差からフレーム内でのパワーの変化を抽出し、復号化装
置において、そのフレーム内のパワーの変化をもとに音
源信号を生成することで、フレーム内のパワーの変化に
追従することができる。[0011] Further, the encoding device extracts a power change in the frame from the LPC prediction residual, and the decoding device generates an excitation signal based on the power change in the frame. It can follow changes in power within the frame.

【００１２】[0012]

【Example】

（実施例１）以下、本発明の第１実施例について、図面
を参照しながら説明する。まず、符号化装置について図
１を用いて説明する。図１において、１０１は入力音声
信号を入力とし、ピッチ周波数を出力とするピッチ周波
数推定部である。１０２は入力音声信号およびピッチ周
波数を入力とし、各高調波の有声無声判定を出力とする
有声無声判定部である。１０３は入力音声信号を入力と
し、ＬＰＣ係数を出力とするＬＰＣ分析部である。１０
４はＬＰＣ係数を入力とし、量子化されたＬＰＣ係数を
出力とする量子化器である。１０５は入力音声信号とＬ
ＰＣ係数を入力とし、ＬＰＣ予測残差を出力とするＬＰ
Ｃ逆フィルタである。１０６はＬＰＣ予測残差を入力と
し、フレームパワーを出力とするフレームパワー算出部
である。１０７はフレームパワーを入力とし、量子化フ
レームパワーを出力とする量子化器である。１０８はＬ
ＰＣ予測残差と、フレームパワーを入力とし、正規化予
測残差を出力とする予測残差正規化部である。１０９は
正規化予測残差を入力とし、予測残差のパワースペクト
ルを出力とする残差パワースペクトル算出部である。１
１０は予測残差のパワースペクトルと、ピッチ周波数を
入力とし、残差の各高調波のパワーを出力とする残差高
調波パワー算出部である。１１１は残差の各高調波のパ
ワーを入力とし、スペクトル補正ベクトルを出力とする
スペクトル補正ベクトル算出部である。１１２はスペク
トル補正ベクトルを入力とし、量子化スペクトル補正ベ
クトルを出力とする量子化器である。１１３はフレーム
パワー算出部１０６から量子化器１１２までの要素によ
って構成される残差情報算出部である。１１４は算出さ
れたピッチ周波数、有声無声判定、量子化ＬＰＣ係数、
量子化フレームパワー、量子化スペクトル補正ベクトル
を入力とし、符号を出力するマルチプレクサである。Embodiment 1 Hereinafter, a first embodiment of the present invention will be described with reference to the drawings. First, an encoding device will be described with reference to FIG. In FIG. 1, reference numeral 101 denotes a pitch frequency estimating unit that receives an input speech signal and outputs a pitch frequency. Reference numeral 102 denotes a voiced / unvoiced determination unit which receives an input voice signal and a pitch frequency and outputs voiced / unvoiced determination of each harmonic. An LPC analysis unit 103 receives an input audio signal and outputs LPC coefficients. 10
Reference numeral 4 denotes a quantizer that receives LPC coefficients as input and outputs quantized LPC coefficients as output. 105 is the input audio signal and L
LP with PC coefficient as input and LPC prediction residual as output
C is an inverse filter. Reference numeral 106 denotes a frame power calculation unit that receives the LPC prediction residual and outputs the frame power. A quantizer 107 receives the frame power as input and outputs the quantized frame power as output. 108 is L
A prediction residual normalization unit that receives a PC prediction residual and a frame power as inputs and outputs a normalized prediction residual as an output. Reference numeral 109 denotes a residual power spectrum calculator that receives the normalized prediction residual as an input and outputs the power spectrum of the prediction residual as an output. 1
Reference numeral 10 denotes a residual harmonic power calculator that receives the power spectrum of the prediction residual and the pitch frequency as inputs and outputs the power of each harmonic of the residual. Reference numeral 111 denotes a spectrum correction vector calculation unit which receives the power of each harmonic of the residual and outputs a spectrum correction vector. A quantizer 112 receives a spectrum correction vector as an input and outputs a quantized spectrum correction vector as an output. Reference numeral 113 denotes a residual information calculation unit including elements from the frame power calculation unit 106 to the quantizer 112. 114 is a calculated pitch frequency, voiced / unvoiced determination, quantized LPC coefficient,
This is a multiplexer that receives a quantized frame power and a quantized spectrum correction vector and outputs a code.

【００１３】次に、上記図１においてその動作を説明す
る。ピッチ周波数推定部１０１では、入力音声信号から
そのピッチ周波数を算出する。ピッチ周波数を算出する
手段としては、従来から入力音声信号の相関関数やスペ
クトル振幅を利用する方法が知られている。次に、有声
無声判定部１０２では、入力音声信号のスペクトルを算
出し、ピッチ周波数に基づいて高調波周波数を求め、そ
れらをもとに各高調波の有声無声判定を行う。各高調波
の有声無声判定方法としては、各高調波を有声と仮定し
たときのスペクトルと入力音声信号のスペクトルの差異
をもとに判定を行う方法が、従来から知られている。Ｌ
ＰＣ分析部１０３では、入力音声信号のＬＰＣ分析を行
い、ＬＰＣ係数を算出する。量子化器１０４では、ＬＰ
Ｃ係数を量子化する。ＬＰＣ係数の量子化法としては、
ＬＰＣ係数をＬＳＰ係数に変換し、ベクトル量子化など
の効率の良い量子化法でＬＳＰ係数を量子化する方法が
従来から知られている。ＬＰＣ逆フィルタ１０５では、
ＬＰＣ係数を用いて入力音声信号をＬＰＣ逆フィルタに
通す。この結果、入力音声信号のＬＰＣ予測残差が得ら
れる。Next, the operation will be described with reference to FIG. The pitch frequency estimating unit 101 calculates the pitch frequency from the input voice signal. As a means for calculating the pitch frequency, a method using a correlation function or a spectrum amplitude of an input voice signal has been conventionally known. Next, the voiced / unvoiced determination unit 102 calculates the spectrum of the input voice signal, obtains the harmonic frequencies based on the pitch frequency, and performs voiced / unvoiced determination of each harmonic based on these. As a voiced / unvoiced determination method of each harmonic, a method of performing determination based on a difference between a spectrum when each harmonic is assumed to be voiced and a spectrum of an input voice signal is conventionally known. L
The PC analysis unit 103 performs an LPC analysis on the input audio signal and calculates an LPC coefficient. In the quantizer 104, LP
Quantize the C coefficient. As a method of quantizing LPC coefficients,
2. Description of the Related Art A method of converting an LPC coefficient into an LSP coefficient and quantizing the LSP coefficient by an efficient quantization method such as vector quantization is conventionally known. In the LPC inverse filter 105,
The input audio signal is passed through an LPC inverse filter using the LPC coefficients. As a result, an LPC prediction residual of the input audio signal is obtained.

【００１４】パワー算出部１０６では、ＬＰＣ予測残差
のパワーのフレーム内平均値を算出し、それをフレーム
パワーとして出力する。量子化器１０７では、フレーム
パワーを、従来から知られているような効率の良い量子
化法によって量子化する。予測残差正規化部１０８で
は、フレームパワーを用いて、ＬＰＣ予測残差の正規化
を行う。残差パワースペクトル算出部１０９では、正規
化されたＬＰＣ予測残差を高速フーリエ変換する。この
際、フレーム両端での信号波形の急激な変化によるスペ
クトルへの影響を軽減するために、ハミング窓などの窓
関数をＬＰＣ予測残差に乗じるのが望ましい。求められ
た各フーリエ係数を２乗することによって、予測残差の
パワースペクトルを得る。残差高調波パワースペクトル
算出部１１０では、ＬＰＣ予測残差に含まれているＬ個
の高調波のパワーを算出する。ピッチ周波数ｗ0 の整数
倍のＬ個の周波数ｎｗ0 （１≦ｎ≦Ｌ）を求め、予測残
差のパワースペクトルからその周波数成分を抽出するこ
とによって高調波のパワーを算出することができる。ま
た、音声信号の全帯域ｗc を高調波数Ｌ個の帯域に分割
し、それぞれの帯域において予測残差のパワースペクト
ルの平均値を算出し、それを各高調波のパワーとすれ
ば、音声の高調波の周波数と、求めたピッチ周波数の整
数倍の周波数との誤差が生じても、より正確に各高調波
のパワーを推定できる。スペクトル補正ベクトル算出部
１１１では、求められた予測残差の各高調波のパワーか
ら、スペクトル補正ベクトルを算出する。高調波数Ｌ
は、ピッチ周波数ｗ0 によって変化するため、これを効
率よく量子化するためには、例えば、Ｌ個の高調波の中
から、聴覚的に重要な帯域の高調波を高調波数Ｌに関わ
らず一定の個数Ｎだけ選び、それをスペクトル補正ベク
トルとし、量子化器１１２で、求められたＮ個のスペク
トル補正ベクトルをＮ次元のベクトル量子化を用いて量
子化すればよい。マルチプレクサ１１４は、復号化装置
に送る情報として得られた、ピッチ周波数、各高調波の
有声無声判定、量子化ＬＰＣ係数、量子化フレームパワ
ー、量子化スペクトル補正情報を効率よく符号化する。
結果として、符号化されたピッチ周波数、有声無声判
定、ＬＰＣ係数、フレームパワー、スペクトル補正情報
が、この符号化装置の出力として得られる。The power calculation section 106 calculates an average value of the power of the LPC prediction residual within the frame and outputs the average value as the frame power. In the quantizer 107, the frame power is quantized by an efficient quantization method as conventionally known. The prediction residual normalization unit 108 normalizes the LPC prediction residual using the frame power. The residual power spectrum calculator 109 performs fast Fourier transform on the normalized LPC prediction residual. At this time, it is desirable to multiply the LPC prediction residual by a window function such as a Hamming window in order to reduce the influence on the spectrum due to a sudden change in the signal waveform at both ends of the frame. The power spectrum of the prediction residual is obtained by squaring each of the obtained Fourier coefficients. The residual harmonic power spectrum calculator 110 calculates the power of the L harmonics included in the LPC prediction residual. The power of harmonics can be calculated by finding L frequencies nw0 (1 ≦ n ≦ L) that are integer multiples of the pitch frequency w0 and extracting the frequency components from the power spectrum of the prediction residual. In addition, the entire band wc of the audio signal is divided into L number of harmonics, and the average value of the power spectrum of the prediction residual is calculated in each band. Even if an error occurs between the frequency of the wave and a frequency that is an integral multiple of the obtained pitch frequency, the power of each harmonic can be more accurately estimated. The spectrum correction vector calculation unit 111 calculates a spectrum correction vector from the power of each harmonic of the obtained prediction residual. Harmonic number L
Changes depending on the pitch frequency w0. In order to quantize this efficiently, for example, out of L harmonics, a harmonic in an acoustically important band is fixed regardless of the number L of harmonics. The number N may be selected and used as a spectrum correction vector, and the quantized N spectral correction vectors may be quantized by the quantizer 112 using N-dimensional vector quantization. The multiplexer 114 efficiently encodes the pitch frequency, the voiced / unvoiced determination of each harmonic, the quantized LPC coefficient, the quantized frame power, and the quantized spectrum correction information obtained as information to be sent to the decoding device.
As a result, the encoded pitch frequency, voiced / unvoiced decision, LPC coefficient, frame power, and spectrum correction information are obtained as the output of the encoding device.

【００１５】次に、復号化装置について図２を用いて説
明する。図２において、２０１は符号を入力とし、ピッ
チ周波数、有声無声判定、量子化フレームパワー、量子
化スペクトル補正ベクトル、量子化ＬＰＣ係数を出力と
するデマルチプレクサである。２０２はピッチ周波数、
有声無声判定を入力とし、ＭＢＥ合成モードを出力する
モード決定部である。２０３は量子化ＬＰＣ係数を入力
とし、逆量子化されたＬＰＣ係数を出力とする逆量子化
器である。２０４は逆量子化されたＬＰＣ係数を入力と
し、スペクトル強調されたＬＰＣ係数を出力とするスペ
クトル強調部である。２０５はスペクトル強調されたＬ
ＰＣ係数を入力とし、各高調波の振幅乗数を出力とする
高調波振幅乗数算出部である。２０６は量子化フレーム
パワーを入力とし、逆量子化されたフレームパワーを出
力とする逆量子化器である。２０７は量子化スペクトル
補正ベクトルを入力とし、逆量子化されたスペクトル補
正ベクトルを出力とする逆量子化器である。２０８はピ
ッチ周波数と、有声無声判定と、フレームパワーと、ス
ペクトル補正ベクトルを入力とし、音源信号を出力とす
る音源生成部である。２０９はＭＢＥ合成モードと、音
源信号と、各高調波振幅乗数を入力とし、復号化音声を
出力とするＭＢＥ合成部である。Next, the decoding apparatus will be described with reference to FIG. In FIG. 2, reference numeral 201 denotes a demultiplexer that receives a code and outputs a pitch frequency, a voiced / unvoiced determination, a quantized frame power, a quantized spectrum correction vector, and a quantized LPC coefficient. 202 is a pitch frequency,
A mode determination unit that receives a voiced / unvoiced determination and outputs an MBE combining mode. An inverse quantizer 203 receives the quantized LPC coefficients as input and outputs the inversely quantized LPC coefficients as output. Reference numeral 204 denotes a spectrum emphasizing unit that receives the inversely quantized LPC coefficients and outputs the spectrum-emphasized LPC coefficients. 205 is a spectrum emphasized L
It is a harmonic amplitude multiplier calculation unit that receives a PC coefficient as input and outputs an amplitude multiplier of each harmonic. An inverse quantizer 206 receives the quantized frame power as input and outputs the inversely quantized frame power as output. An inverse quantizer 207 receives a quantized spectrum correction vector as an input, and outputs an inversely quantized spectrum correction vector as an output. Reference numeral 208 denotes a sound source generation unit that receives a pitch frequency, a voiced / unvoiced determination, a frame power, and a spectrum correction vector, and outputs a sound source signal. Reference numeral 209 denotes an MBE synthesizing unit that receives an MBE synthesis mode, a sound source signal, and each harmonic amplitude multiplier as inputs and outputs a decoded speech.

【００１６】次に、上記図２においてその動作を説明す
る。まず、デマルチプレクサ２０１では、符号化装置側
から送られた符号から、ピッチ周波数、有声無声判定、
量子化されたフレームパワー、量子化されたスペクトル
補正ベクトル、量子化されたＬＰＣ係数をそれぞれ取り
出す。モード決定部２０２では、ピッチ周波数と有声無
声判定をもとに、従来から用いられているモード決定法
でＭＢＥ合成モードを決定する。逆量子化器２０３は、
量子化されたＬＰＣ係数を逆量子化する。スペクトル強
調部２０４は、ＬＰＣ係数に対し、出力音声の主観的な
音質を高めるために、従来から知られているようなフォ
ルマント強調や高域強調などの処理を加える。なお、こ
のようなスペクトル強調を行う必要がないときは、この
スペクトル強調部を設ける必要はない。高調波振幅乗算
算出部２０５では、ＬＰＣ係数をＦＦＴにより周波数領
域に変換する。このとき、フレーム両端での信号波形の
急激な変化によるスペクトルへの影響を軽減するため
に、ハミング窓のような窓関数を乗じるのが望ましい。
求めた各フーリエ係数の絶対値を求め、逆数をとること
により、合成音声信号のスペクトル包絡を得ることがで
きる。合成音声信号の各高調波の振幅乗数は、ピッチ周
波数ｗ0 の整数倍の周波数ｎｗ0 を求め、スペクトル包
絡からその周波数成分を抽出することによって得られ
る。また、音声信号の全帯域ｗｃを高調波数Ｌ個の帯域
に分割し、それぞれの帯域においてスペクトル包絡の平
均値を算出し、それを各高調波の振幅乗数とすれば、音
声の高調波の周波数と求めたピッチ周波数の整数倍の周
波数との誤差が生じても、より正確に高調波の振幅乗数
が求められる。逆量子化器２０６は、量子化されたフレ
ームパワーを逆量子化する。逆量子化器２０７は、量子
化されたスペクトル補正ベクトルを逆量子化する。Next, the operation will be described with reference to FIG. First, in the demultiplexer 201, the pitch frequency, voiced / unvoiced judgment,
The quantized frame power, the quantized spectrum correction vector, and the quantized LPC coefficient are respectively extracted. The mode determination unit 202 determines the MBE combining mode by a conventionally used mode determination method based on the pitch frequency and the voiced / unvoiced determination. The inverse quantizer 203
Dequantize the quantized LPC coefficient. The spectrum emphasizing unit 204 performs conventionally known processing such as formant emphasis and high-frequency emphasis on the LPC coefficients in order to improve the subjective sound quality of the output sound. When there is no need to perform such spectrum enhancement, there is no need to provide this spectrum enhancement unit. The harmonic amplitude multiplication calculation unit 205 converts the LPC coefficient into a frequency domain by FFT. At this time, it is desirable to multiply by a window function such as a Hamming window in order to reduce the influence on the spectrum due to a sudden change in the signal waveform at both ends of the frame.
The spectrum envelope of the synthesized speech signal can be obtained by obtaining the absolute value of each of the obtained Fourier coefficients and taking the reciprocal. The amplitude multiplier of each harmonic of the synthesized speech signal is obtained by obtaining a frequency nw0 that is an integral multiple of the pitch frequency w0 and extracting the frequency component from the spectrum envelope. Further, if the entire band wc of the audio signal is divided into L number of harmonic bands, and the average value of the spectrum envelope is calculated in each band, and the average value is used as the amplitude multiplier of each harmonic, the frequency of the harmonic of the audio is calculated. Even if an error occurs between the calculated pitch frequency and an integral multiple of the pitch frequency, the amplitude multiplier of the harmonic can be obtained more accurately. The inverse quantizer 206 inversely quantizes the quantized frame power. The inverse quantizer 207 inversely quantizes the quantized spectrum correction vector.

【００１７】音源生成部２０８では、ピッチ周波数と、
有声無声判定と、フレームパワーとスペクトル補正ベク
トルにより、音源を生成する。音源の生成は、まずピッ
チ周波数から各高調波の周波数を求め、各高調波の周波
数を算出する。また、有声無声判定をもとに各高調波周
波数ごとに基本音源信号を生成する。これは、従来から
その高調波が有声であれば正弦波、無声であれば白色雑
音とする方法が知られている。次にフレームパワーおよ
びスペクトル補正ベクトルから、各高調波の音源振幅を
推定し、それを基本音源信号に乗ずることにより音源信
号を生成する。各高調波の音源振幅を算出する手段とし
ては、まずスペクトル補正ベクトルから、符号化装置側
で残差に含まれていた各高調波のスペクトルパワーを算
出し、フレームパワーと乗じる。理論的には、この結果
とスペクトル包絡により音声信号のスペクトルが再現さ
れるが、ＭＢＥ合成を行う場合、前述したように各高調
波ごとに有声と無声の場合でそのモデル化の方法が異な
るため、有声無声判定それぞれのモデルを仮定し、スペ
クトル算出時の窓関数の影響を取り除くことが望まし
い。影響を取り除く方法としては、フレームパワーとス
ペクトル補正ベクトルから求められた音源の振幅に対
し、それぞれのスペクトルモデルを仮定し、窓関数のス
ペクトルをデコンボリューションする方法が考えられ
る。ＭＢＥ合成部２０９では、このようにして生成され
た音源信号と各高調波の振幅とＭＢＥ合成モードをもと
に、従来から知られているＭＢＥ合成を行い、復号化音
声信号を出力する。In the sound source generator 208, the pitch frequency and
A sound source is generated based on the voiced / unvoiced determination, the frame power, and the spectrum correction vector. To generate a sound source, first, the frequency of each harmonic is obtained from the pitch frequency, and the frequency of each harmonic is calculated. In addition, a basic sound source signal is generated for each harmonic frequency based on the voiced / unvoiced determination. Conventionally, a method is known in which a sine wave is used if the harmonic is voiced, and a white noise is used if the harmonic is unvoiced. Next, the sound source amplitude of each harmonic is estimated from the frame power and the spectrum correction vector, and the sound source signal is generated by multiplying the estimated sound source amplitude by the basic sound source signal. As means for calculating the excitation amplitude of each harmonic, first, the coding apparatus calculates the spectrum power of each harmonic included in the residual from the spectrum correction vector and multiplies it by the frame power. Theoretically, the spectrum of the audio signal is reproduced by this result and the spectral envelope. However, when performing MBE synthesis, the modeling method differs between voiced and unvoiced for each harmonic as described above. , And voiced / unvoiced judgment models, it is desirable to remove the effect of the window function when calculating the spectrum. As a method of removing the influence, a method of deconvolving the spectrum of the window function by assuming respective spectrum models for the amplitude of the sound source obtained from the frame power and the spectrum correction vector can be considered. The MBE combining unit 209 performs conventionally known MBE combining based on the thus generated sound source signal, the amplitude of each harmonic, and the MBE combining mode, and outputs a decoded audio signal.

【００１８】以上のように、本実施例によれば、従来の
符号化装置に、予測残差正規化部１０８と、残差パワー
スペクトル算出部１０９と、残差高調波パワー算出部１
１０と、スペクトル補正ベクトル算出部１１１と、スペ
クトル補正ベクトルを量子化する量子化器１１２とを設
けることにより、ＬＰＣ分析によって取り除くことがで
きなかった各高調波の細かいスペクトルを表すパラメー
タを算出し符号化して、復号化装置に情報として伝達で
きる。したがって、復号化装置に、このスペクトル補正
ベクトルを考慮した音源生成部２０８を設けることによ
り、ＬＰＣ予測残差に含まれる高調波のスペクトルを復
号化時に補正することができるため、合成時の各高調波
のスペクトルをより正確に表現することができ、音声品
質を改善することができる。As described above, according to the present embodiment, the conventional encoding apparatus includes the prediction residual normalizing unit 108, the residual power spectrum calculating unit 109, and the residual harmonic power calculating unit 1
10, a spectrum correction vector calculation unit 111, and a quantizer 112 for quantizing the spectrum correction vector to calculate and code a parameter representing a fine spectrum of each harmonic that could not be removed by the LPC analysis. And can be transmitted as information to the decoding device. Therefore, by providing the excitation device 208 in the decoding device in consideration of the spectrum correction vector, the spectrum of the harmonic included in the LPC prediction residual can be corrected at the time of decoding. The spectrum of the wave can be represented more accurately, and the sound quality can be improved.

【００１９】（実施例２）次に、本発明の第２実施例に
ついて、図３を参照しながら説明する。図３は本実施例
における符号化装置の構成を示し、図１に示した第１実
施例の符号化装置と共通の要素には、同一の番号を付し
て重複した説明は省略する。また、本実施例において、
復号化装置は、図２に示した第１実施例の復号化装置と
共通の構成である。(Embodiment 2) Next, a second embodiment of the present invention will be described with reference to FIG. FIG. 3 shows the configuration of the encoding apparatus according to the present embodiment. Elements common to those of the encoding apparatus according to the first embodiment shown in FIG. 1 are denoted by the same reference numerals, and redundant description is omitted. In this embodiment,
The decoding device has the same configuration as the decoding device of the first embodiment shown in FIG.

【００２０】図３において、１１５は量子化ＬＰＣ係数
を入力とし、逆量子化されたＬＰＣ係数を出力とする逆
量子化器である。１１６は量子化フレームパワーを入力
とし、逆量子化したフレームパワーを出力とする逆量子
化器である。本実施例における残差情報算出部１１３Ａ
は、図１に示した残差情報算出部１１３に逆量子化器１
１６を付加した構成である。逆量子化器１１５は、量子
化器１０４によって量子化されたＬＰＣ係数を逆量子化
している。この結果、この逆量子化器の出力として得ら
れるＬＰＣ係数は、図２の復号化装置における逆量子化
器２０７の出力と全く等しい値である。この量子化によ
って生じた誤差は、逆フィルタ１０５の出力である予測
残差に反映される。また、同様に逆量子化１１６は、量
子化器１０７によって量子化されたフレームパワーを逆
量子化する。この逆量子化されたフレームパワーは、図
２の復号化装置における逆量子化器２０６の出力と等し
い値である。この量子化によって生じた誤差は、予測残
差正規化部１０８の出力である正規化予測算差に反映さ
れる。したがって、これらの量子化誤差は、残差パワー
スペクトル算出部１０９、残差高調波パワー算出部１１
０、スペクトル補正ベクトル算出部１１１によって計算
され、出力されるスペクトル補正ベクトルに反映され
る。つまり、符号化ビット数を削減し、量子化器１０４
または量子化器１０７、あるいはその両方の量子化誤差
が大きくなっても、量子化器１１２に適当なビット割り
当てを行えば、その誤差を吸収することができる。In FIG. 3, reference numeral 115 denotes an inverse quantizer which receives quantized LPC coefficients as input and outputs inversely quantized LPC coefficients as output. An inverse quantizer 116 receives the quantized frame power as input and outputs the inversely quantized frame power as output. Residual information calculation unit 113A in the present embodiment
Indicates that the residual information calculation unit 113 shown in FIG.
16 is added. The inverse quantizer 115 inversely quantizes the LPC coefficients quantized by the quantizer 104. As a result, the LPC coefficient obtained as the output of the inverse quantizer has a value exactly equal to the output of the inverse quantizer 207 in the decoding device of FIG. The error caused by this quantization is reflected on the prediction residual output from the inverse filter 105. Similarly, the inverse quantization 116 inversely quantizes the frame power quantized by the quantizer 107. The inversely quantized frame power has a value equal to the output of the inverse quantizer 206 in the decoding device of FIG. The error generated by the quantization is reflected in the normalized prediction difference output from the prediction residual normalization unit 108. Therefore, these quantization errors are calculated by the residual power spectrum calculator 109 and the residual harmonic power calculator 11.
0, which is calculated by the spectrum correction vector calculation unit 111 and reflected in the output spectrum correction vector. That is, the number of coded bits is reduced and the quantizer 104
Alternatively, even if the quantization error of the quantizer 107 or both becomes large, the error can be absorbed by appropriately assigning bits to the quantizer 112.

【００２１】以上のように、本実施例によれば、符号化
装置に、量子化ＬＰＣ係数を逆量子化する逆量子化器１
１５と、量子化フレームパワーを逆量子化する逆量子化
器１１６とを設け、逆量子されたＬＰＣ係数によってＬ
ＰＣ逆フィルタ１０５を動作させ、逆量子化されたフレ
ームパワーで予測残差正規化部１０８を動作させること
により、それらの量子化誤差をスペクトル補正ベクトル
によって補正することができ、ＬＰＣ係数またはフレー
ムパワーもしくはその両方に与えるビット数を減らして
も、スペクトル補正ベクトルに適当なビットを割り当て
れば、その誤差を吸収することができる。また、このこ
とから、各パラメータに対し、柔軟なビット割り当てを
行うことができる。As described above, according to the present embodiment, the encoding device includes the inverse quantizer 1 for inversely quantizing the quantized LPC coefficient.
15 and an inverse quantizer 116 for inversely quantizing the quantized frame power.
By operating the PC inverse filter 105 and operating the prediction residual normalization unit 108 with the inversely quantized frame power, those quantization errors can be corrected by the spectrum correction vector, and the LPC coefficient or the frame power Alternatively, even if the number of bits given to both is reduced, the error can be absorbed by assigning appropriate bits to the spectrum correction vector. Also, from this, flexible bit allocation can be performed for each parameter.

【００２２】（実施例３）次に、本発明の第３実施例に
ついて、図面を参照しながら説明する。まず、符号化装
置について図４を用いて説明する。図１に示した第１実
施例の符号化装置と共通の要素には、同一の番号を付し
て重複した説明は省略する。図４において、１１７はピ
ッチ周波数と有声無声判定を入力とし、ＬＰＣ係数量子
化レベル数とスペクトル補正ベクトル量子化レベル数と
量子化ビット割り当てを出力する量子化レベル数算出部
である。１１８はＬＰＣ係数とＬＰＣ係数量子化レベル
数を入力とし、量子化ＬＰＣ係数を出力とするレベル可
変量子化器である。１１９はスペクトル補正ベクトルと
スペクトル補正ベクトル量子化レベル数を入力とし、量
子化スペクトル補正ベクトルを出力とするレベル可変量
子化器である。１２０はピッチ周波数と、有声無声判定
と、量子化ＬＰＣ係数と、量子化フレームパワーと、量
子化スペクトル補正ベクトルを入力とし、符号を出力と
するマルチプレクサである。本実施例における残差情報
算出部１１３Ｂは、図１に示した量子化器１１２の代わ
りにレベル可変量子化器１１９を設けたものである。(Embodiment 3) Next, a third embodiment of the present invention will be described with reference to the drawings. First, an encoding device will be described with reference to FIG. The same elements as those in the encoding device of the first embodiment shown in FIG. 1 are denoted by the same reference numerals, and duplicate description will be omitted. In FIG. 4, reference numeral 117 denotes a quantization level calculation unit which receives the pitch frequency and voiced / unvoiced judgment as inputs, and outputs the number of LPC coefficient quantization levels, the number of spectrum correction vector quantization levels, and the allocation of quantization bits. Reference numeral 118 denotes a variable level quantizer that receives an LPC coefficient and the number of LPC coefficient quantization levels and outputs a quantized LPC coefficient. Reference numeral 119 denotes a variable level quantizer which receives the spectrum correction vector and the number of quantization levels of the spectrum correction vector and outputs the quantization spectrum correction vector. Reference numeral 120 denotes a multiplexer that receives a pitch frequency, a voiced / unvoiced determination, a quantized LPC coefficient, a quantized frame power, and a quantized spectrum correction vector, and outputs a code. The residual information calculation unit 113B in this embodiment has a variable level quantizer 119 in place of the quantizer 112 shown in FIG.

【００２３】次に、図４においてその動作を説明する。
量子化レベル数算出部１１７では、ピッチ周波数と有声
無声判定をもとに、フレーム毎にＬＰＣ係数とスペクト
ル補正ベクトルに割り当てるビット数を求め、それに対
応する量子化レベル数を算出する。例えば、ピッチ周波
数が高いとき、高調波数Ｌの値は小さくなるから、スペ
クトル補正を行なうべき高調波の数が少ないため、ＬＰ
Ｃ係数の符号化に、より多くのビット数を割り当てられ
る。また反対に、無声部分などホルマントが存在せず、
スペクトル包絡がなだらかに変化するような場合は、ス
ペクトル補正ベクトルに、より多くのビット数を割り当
てたほうが良い場合がある。このように、それぞれのパ
ラメータに割り当てる量子化ビット数の合計、すなわち
量子化レベル数の合計を一定とし、フレーム毎にＬＰＣ
係数とスペクトル補正ベクトルの情報量によってビット
数の割り当てを変えることにより、より効果的な符号化
を行うことができる。レベル可変量子化器１１８は、割
り当てられたビット数に基づくレベル数でＬＰＣ係数の
量子化を行う。したがって、例えばコードブックサイズ
の異なる複数のベクトル量子化器を用意すれば、効率よ
く量子化することができる。同様に、レベル可変量子化
器１１９は、割り当てられたビット数に基づくレベル数
で、スペクトル補正ベクトルを量子化する。マルチプレ
クサ１２０では、ピッチ周波数、有声無声判定、量子化
フレームパワーとともに、それぞれのビット割り当てに
基づいて、量子化ＬＰＣ係数および量子化スペクトル補
正ベクトルを符号化する。Next, the operation will be described with reference to FIG.
The quantization level calculation unit 117 obtains the number of bits to be allocated to the LPC coefficient and the spectrum correction vector for each frame based on the pitch frequency and the voiced / unvoiced determination, and calculates the corresponding quantization level. For example, when the pitch frequency is high, the value of the number of harmonics L becomes small.
More bits can be allocated to the encoding of the C coefficient. On the contrary, there is no formant such as a silent part,
If the spectrum envelope changes gradually, it may be better to assign a larger number of bits to the spectrum correction vector. As described above, the total number of quantization bits assigned to each parameter, that is, the total number of quantization levels is fixed, and the LPC
By changing the allocation of the number of bits according to the information amount of the coefficient and the spectrum correction vector, more effective encoding can be performed. The variable level quantizer 118 quantizes LPC coefficients with the number of levels based on the number of allocated bits. Therefore, for example, if a plurality of vector quantizers having different codebook sizes are prepared, quantization can be performed efficiently. Similarly, the variable level quantizer 119 quantizes the spectrum correction vector with the number of levels based on the number of allocated bits. The multiplexer 120 encodes the quantized LPC coefficient and the quantized spectrum correction vector based on each bit allocation together with the pitch frequency, voiced / unvoiced decision, and quantized frame power.

【００２４】次に、復号化装置について図５を参照して
説明する。図２に示した第１実施例の復号化装置と共通
の要素には、同一の番号を付して重複した説明は省略す
る。図５において、２１０はピッチ周波数、有声無声判
定を入力とし、ＬＰＣ係数量子化レベル数とスペクトル
補正ベクトル量子化レベル数とビット割り当てを出力す
る量子化レベル数算出部である。２１１は符号およびビ
ット割り当てを入力とし、ピッチ周波数、有声無声判
定、量子化ＬＰＣ係数、量子化フレームパワー、量子化
スペクトル補正ベクトルを出力とするデマルチプレクサ
である。２１２は量子化ＬＰＣ係数と、ＬＰＣ係数量子
化レベル数を入力とし、逆量子化されたＬＰＣ係数を出
力するレベル可変逆量子化器である。２１３は量子化ス
ペクトル補正ベクトルとスペクトル補正ベクトル量子化
レベル数を入力とし、逆量子化されたスペクトル補正ベ
クトルを出力とするレベル可変逆量子化器である。Next, the decoding apparatus will be described with reference to FIG. The same elements as those of the decoding apparatus of the first embodiment shown in FIG. 2 are denoted by the same reference numerals, and the duplicate description will be omitted. In FIG. 5, reference numeral 210 denotes a quantization level number calculation unit which receives the pitch frequency and voiced / unvoiced determination as inputs, and outputs the number of LPC coefficient quantization levels, the number of spectrum correction vector quantization levels, and bit allocation. A demultiplexer 211 receives a code and a bit allocation, and outputs a pitch frequency, a voiced / unvoiced determination, a quantized LPC coefficient, a quantized frame power, and a quantized spectrum correction vector. Reference numeral 212 denotes a level variable inverse quantizer which receives a quantized LPC coefficient and the number of LPC coefficient quantization levels and outputs an inversely quantized LPC coefficient. A variable level inverse quantizer 213 receives the quantized spectrum correction vector and the number of quantization levels of the spectrum correction vector, and outputs the inversely quantized spectrum correction vector.

【００２５】次に、図５においてその動作を説明する。
デマルチプレクサ２１１は、符号からまずピッチ周波数
と、有声無声判定と、量子化フレームパワーを出力す
る。量子化レベル数算出部２１０では、図４の符号化装
置における量子化レベル数算出部１１７と全く等しい手
段により、フレーム毎にＬＰＣ係数量子化レベル数とス
ペクトル補正ベクトル量子化レベル数と割り当てビット
数を算出する。デマルチプレクサ２１１は、得られた割
り当てビット数をもとに量子化ＬＰＣ係数と、量子化ス
ペクトル補正ベクトルを出力する。レベル可変逆量子化
器２１２では、ＬＰＣ係数量子化レベル数に基づいてＬ
ＰＣ係数を逆量子化する。このレベル可変逆量子化器２
１２は、符号化装置におけるレベル可変量子化器１１８
に対応する逆量子化器である。同様にレベル可変逆量子
化器２１３は、符号化装置におけるレベル可変量子化器
１１９に対応する逆量子化器であり、スペクトル補正ベ
クトル量子化レベル数に基づいて、スペクトル補正ベク
トルを逆量子化する。Next, the operation will be described with reference to FIG.
The demultiplexer 211 first outputs a pitch frequency, a voiced / unvoiced determination, and a quantized frame power from the code. In the quantization level number calculation unit 210, the number of LPC coefficient quantization levels, the number of spectrum correction vector quantization levels, and the number of allocated bits are set for each frame by means exactly equal to the quantization level number calculation unit 117 in the encoding apparatus of FIG. Is calculated. The demultiplexer 211 outputs a quantized LPC coefficient and a quantized spectrum correction vector based on the obtained number of allocated bits. In the level variable inverse quantizer 212, the LPC coefficient is quantized based on the number of quantization levels.
Dequantize the PC coefficients. This level variable inverse quantizer 2
12 is a variable level quantizer 118 in the encoding device.
Is an inverse quantizer corresponding to. Similarly, the variable level inverse quantizer 213 is an inverse quantizer corresponding to the variable level quantizer 119 in the encoding device, and inversely quantizes the spectrum correction vector based on the number of quantization levels of the spectrum correction vector. .

【００２６】以上のように、本実施例によれば、符号化
装置に、量子化レベル数算出部１１７と、それに対応す
る、ＬＰＣ係数を量子化するレベル可変量子化器１１８
と、スペクトル補正ベクトルを量子化するレベル可変量
子化器１１９と、マルチプレクサ１２０とを備え、また
復号化装置に、符号化装置の量子化レベル数算出部１１
７と同じ量子化レベル数算出部２１０と、それに対応す
る、デマルチプレクサ２１１と、量子化ＬＰＣ係数を逆
量子化するレベル可変逆量子化器２１２と、量子化スペ
クトル補正ベクトルを逆量子化するレベル可変逆量子化
器２１３を備えることにより、フレーム毎にＬＰＣ係数
とスペクトル補正ベクトルの情報量によってビット数の
割り当てを変え、より効果的な符号化を行うことがで
き、復号化音声の品質を高めることができる。As described above, according to the present embodiment, the encoding apparatus includes a quantization level number calculator 117 and a corresponding variable level quantizer 118 for quantizing LPC coefficients.
, A variable level quantizer 119 for quantizing the spectrum correction vector, and a multiplexer 120, and the decoding apparatus is provided with a quantization level number calculation unit 11 of the encoding apparatus.
7; a demultiplexer 211 corresponding thereto; a variable level dequantizer 212 for dequantizing the quantized LPC coefficient; and a level for dequantizing the quantized spectrum correction vector. By providing the variable inverse quantizer 213, the allocation of the number of bits can be changed depending on the information amount of the LPC coefficient and the spectrum correction vector for each frame, more effective encoding can be performed, and the quality of decoded speech can be improved. be able to.

【００２７】（実施例４）次に、本発明の第４実施例に
ついて、図面を参照しながら説明する。まず、符号化装
置について図６を用いて説明する。図１に示した第１実
施例の符号化装置と共通の要素には、同一の番号を付し
て重複した説明は省略する。図６において、１２１はＬ
ＰＣ予測残差を入力とし、フレーム内のパワーベクトル
を出力とするパワーベクトル算出部である。１２２はパ
ワーベクトルを入力とし、量子化パワーベクトルを出力
とする量子化器である。１２３はピッチ周波数と、有声
無声判定と、量子化ＬＰＣ係数と、量子化パワーベクト
ルを入力とし、符号を出力とするマルチプレクサであ
る。本実施例における残差情報算出部１１３Ｃは、パワ
ースペクトル算出部１２１と量子化器１２２とから構成
される。(Embodiment 4) Next, a fourth embodiment of the present invention will be described with reference to the drawings. First, an encoding device will be described with reference to FIG. The same elements as those in the encoding device of the first embodiment shown in FIG. 1 are denoted by the same reference numerals, and duplicate description will be omitted. In FIG. 6, 121 is L
A power vector calculation unit that receives a PC prediction residual as an input and outputs a power vector in a frame as an output. Reference numeral 122 denotes a quantizer that receives a power vector as an input and outputs a quantized power vector as an output. A multiplexer 123 receives a pitch frequency, a voiced / unvoiced determination, a quantized LPC coefficient, and a quantized power vector, and outputs a code. The residual information calculation unit 113C according to the present embodiment includes a power spectrum calculation unit 121 and a quantizer 122.

【００２８】次に、図６においてその動作を説明する。
パワーベクトル算出部１２１は、まずＬＰＣ予測残差に
おいてそのフレームを複数のサブフレームに等分割し、
各サブフレームでの平均パワーを算出し、それらを合わ
せてフレームのパワーベクトルとして出力する。量子化
器１２２では、そのパワーベクトルをベクトル量子化等
の効率のよい量子化法で量子化する。マルチプレクサ１
２３では、復号化装置に送る情報として得られた、ピッ
チ周波数、各高調波の有声無声判定、量子化ＬＰＣ係
数、量子化パワーベクトルを効率よく符号化する。Next, the operation will be described with reference to FIG.
The power vector calculation unit 121 first divides the frame into a plurality of subframes in the LPC prediction residual,
The average power in each subframe is calculated, and the sum is output as a power vector of the frame. The quantizer 122 quantizes the power vector by an efficient quantization method such as vector quantization. Multiplexer 1
In 23, the pitch frequency, the voiced / unvoiced determination of each harmonic, the quantized LPC coefficient, and the quantized power vector obtained as information to be transmitted to the decoding device are efficiently encoded.

【００２９】次に、復号化装置について図７を用いて説
明する。図２に示した第１実施例の復号化装置と共通の
要素には、同一の番号を付して重複した説明は省略す
る。図７において、２１４は符号を入力とし、ピッチ周
波数と、有声無声判定と、量子化ＬＰＣ係数と量子化パ
ワーベクトルを出力するデマルチプレクサである。２１
５は量子化パワーベクトルを入力とし、逆量子化された
パワーベクトルを出力する量子化器である。２１６はＭ
ＢＥモードと逆量子化されたパワーベクトルを入力と
し、音源を出力とする音源生成部である。Next, the decoding apparatus will be described with reference to FIG. The same elements as those of the decoding apparatus of the first embodiment shown in FIG. 2 are denoted by the same reference numerals, and the duplicate description will be omitted. In FIG. 7, reference numeral 214 denotes a demultiplexer which receives a code and outputs a pitch frequency, a voiced / unvoiced determination, a quantized LPC coefficient, and a quantized power vector. 21
Reference numeral 5 denotes a quantizer which receives a quantized power vector as an input and outputs a dequantized power vector. 216 is M
A sound source generation unit that receives a BE mode and a power vector that has been inversely quantized, and outputs a sound source.

【００３０】次に、図７においてその動作を説明する。
デマルチプレクサ２１４は、符号化装置側から送られた
符号から、ピッチ周波数、有声無声判定、量子化された
ＬＰＣ係数、量子化されたパワーベクトルをそれぞれ取
り出す。逆量子化器２１５は、量子化されたパワーベク
トルを逆量子化する。音源生成部２１６では、パワーベ
クトルとＭＢＥ合成モードにより、音源を生成する。音
源の生成は、まずＭＢＥ合成モードから各高調波の周波
数を求め、各高調波の周波数を算出する。また、有声無
声判定を抽出するとともに、それをもとに各高調波ごと
に基本音源信号を生成する。これは、従来からその高調
波が有声であれば正弦波、無声であれば白色雑音とする
方法が知られている。次に、それぞれの基本波信号を、
符号化装置のパワーベクトル算出部１２１と等しい数の
サブフレームに分割し、パワーベクトルから得られる各
サブフレームのパワーを各高調波の対応するサブフレー
ムに乗じることにより音源信号を生成する。このとき、
サブフレーム間で音源信号が滑らかに連続するように、
パワーを補間するのが望ましい。また、ＭＢＥ合成部２
０９で合成を行う場合、各高調波ごとに有声と無声の場
合でそのモデル化の方法が異なるため、有声無声判定そ
れぞれのモデルを仮定し、スペクトル算出時の窓関数の
影響を取り除くことが望ましい。影響を取り除く方法と
しては、フレームパワーとスペクトル補正ベクトルから
求められた音源の振幅に対し、それぞれのスペクトルモ
デルを仮定し、窓関数のスペクトルをデコンボリューシ
ョンする方法が考えられる。Next, the operation will be described with reference to FIG.
The demultiplexer 214 extracts a pitch frequency, a voiced / unvoiced determination, a quantized LPC coefficient, and a quantized power vector from the code transmitted from the encoder. The inverse quantizer 215 inversely quantizes the quantized power vector. The sound source generation unit 216 generates a sound source based on the power vector and the MBE combining mode. To generate a sound source, first, the frequency of each harmonic is obtained from the MBE synthesis mode, and the frequency of each harmonic is calculated. In addition, a voiced / unvoiced judgment is extracted, and a basic sound source signal is generated for each harmonic based on the extracted voiced / unvoiced judgment. Conventionally, a method is known in which a sine wave is used if the harmonic is voiced, and a white noise is used if the harmonic is unvoiced. Next, each fundamental signal is
An excitation signal is generated by dividing into the same number of subframes as the power vector calculation unit 121 of the encoding apparatus and multiplying the power of each subframe obtained from the power vector by the corresponding subframe of each harmonic. At this time,
To ensure that the sound source signal continues smoothly between subframes,
It is desirable to interpolate the power. In addition, MBE synthesis unit 2
When the synthesis is performed at step 09, since the modeling method differs between voiced and unvoiced for each harmonic, it is desirable to assume each model for voiced and unvoiced determination and remove the effect of the window function when calculating the spectrum. . As a method of removing the influence, a method of deconvolving the spectrum of the window function by assuming respective spectrum models for the amplitude of the sound source obtained from the frame power and the spectrum correction vector can be considered.

【００３１】以上のように、本実施例によれば、符号化
装置にパワーベクトル算出部１２１と、それを量子化す
る量子化器１２２と、マルチプレクサ１２３を備え、復
号化装置にデマルチプレクサ２１４と、量子化パワーベ
クトルを復号化する逆量子化器２１５と、音源生成部２
１６を備えることにより、音声の立ち上がり時のように
フレーム内でパワーが大きく変化するような場合でも、
フレーム内の複数のサブフレームのそれぞれの平均パワ
ーをパワーベクトルとして復号化装置に送り、パワーの
変化を音源生成時に再現できるので、復号化音声の品質
を高めることができる。As described above, according to the present embodiment, the encoding apparatus includes the power vector calculating section 121, the quantizer 122 for quantizing the power vector calculating section 121, and the multiplexer 123, and the decoding apparatus includes the demultiplexer 214. , An inverse quantizer 215 for decoding a quantized power vector, and a sound source generation unit 2
By providing 16, even when the power greatly changes within a frame, such as at the time of voice rising,
The average power of each of the plurality of subframes in the frame is sent to the decoding device as a power vector, and the change in power can be reproduced at the time of generating the sound source, so that the quality of the decoded speech can be improved.

【００３２】[0032]

【発明の効果】以上のように、本発明によれば、符号化
装置に、残差パワースペクトル算出部と、残差高調波パ
ワー算出部と、スペクトル補正ベクトル算出部と、スペ
クトル補正ベクトルを量子化する量子化器とを設けるこ
とにより、ＬＰＣ分析によって取り除くことができなか
った各高調波の細かいスペクトルを表すパラメータを算
出し符号化して、復号化装置に情報として伝達できる。
したがって、復号化装置に、このスペクトル補正ベクト
ルを考慮した音源生成部を設けることにより、ＬＰＣ予
測残差に含まれる高調波のスペクトルを復号化時に補正
することができるため、合成時の各高調波のスペクトル
をより正確に表現することができ、従来と同じビットレ
ートで高い復号化音声品質が得られる優れた音声符号化
・復号化装置を実現できる。As described above, according to the present invention, a coding apparatus is provided with a residual power spectrum calculating section, a residual harmonic power calculating section, a spectrum correction vector calculating section, and a method for quantizing a spectrum correction vector. By providing a quantizer for transforming, it is possible to calculate and encode a parameter representing a fine spectrum of each harmonic that could not be removed by the LPC analysis, and transmit the information to a decoding device as information.
Therefore, by providing the sound generator in consideration of the spectrum correction vector in the decoding device, the spectrum of the harmonic contained in the LPC prediction residual can be corrected at the time of decoding. Can be represented more accurately, and an excellent speech encoding / decoding apparatus capable of obtaining high decoded speech quality at the same bit rate as in the past can be realized.

【００３３】また、本発明によれば、符号化装置に、量
子化ＬＰＣ係数を逆量子化する逆量子化器と、量子化フ
レームパワーを逆量子化する逆量子化器とを設け、逆量
子されたＬＰＣ係数によってＬＰＣ逆フィルタをかけ、
逆量子化されたフレームパワーでＬＰＣ予測残差を正規
化することにより、それらの量子化誤差をスペクトル補
正ベクトルによって補正することができ、ＬＰＣ係数ま
たはフレームパワーもしくはその両方に与えるビット数
を減らしても、スペクトル補正ベクトルに適当なビット
を割り当てれば、その誤差を吸収できる。また、このこ
とから、各パラメータに対し、柔軟なビット割り当てを
行うことができ、従来と同じビットレートで高い復号化
音声品質が得られる優れた音声符号化・復号化装置を実
現できる。Further, according to the present invention, the coding apparatus is provided with an inverse quantizer for inversely quantizing the quantized LPC coefficient and an inverse quantizer for inversely quantizing the quantized frame power. LPC inverse filter is applied by the obtained LPC coefficient,
By normalizing the LPC prediction residuals with the dequantized frame power, those quantization errors can be corrected by the spectral correction vector, and the number of bits given to the LPC coefficient and / or the frame power can be reduced. Also, the error can be absorbed by assigning appropriate bits to the spectrum correction vector. Also, from this, it is possible to flexibly assign bits to each parameter, and to realize an excellent speech encoding / decoding apparatus capable of obtaining high decoded speech quality at the same bit rate as in the related art.

【００３４】また、本発明によれば、符号化装置に、量
子化レベル数算出部と、それに対応する、ＬＰＣ係数を
量子化するレベル可変量子化器と、スペクトル補正ベク
トルを量子化するレベル可変量子化器と、マルチプレク
サとを設け、また復号化装置に、符号化装置の量子化レ
ベル数算出部と同じ量子化レベル数算出部と、それに対
応する、デマルチプレクサと、量子化ＬＰＣ係数を逆量
子化するレベル可変逆量子化器と、量子化スペクトル補
正ベクトルを逆量子化するレベル可変逆量子化器とを設
けることにより、フレーム毎にＬＰＣ係数とスペクトル
補正ベクトルの情報量によってビット数の割り当てを変
更し、より効果的な符号化を行うことができ、従来と同
じビットレートで高い復号化音声品質が得られる優れた
音声符号化・復号化装置を実現できる。Further, according to the present invention, the coding apparatus includes a quantization level number calculator, a corresponding level variable quantizer for quantizing LPC coefficients, and a level variable quantizer for quantizing a spectrum correction vector. A quantizer and a multiplexer are provided, and the decoding device is provided with the same quantization level number calculation unit as the quantization level number calculation unit of the encoding device, and the corresponding demultiplexer and quantized LPC coefficient are inverted. By providing a level-variable inverse quantizer for quantization and a level-variable inverse quantizer for inverse-quantizing the quantized spectrum correction vector, the number of bits can be allocated for each frame according to the information amount of the LPC coefficient and the spectrum correction vector. The superior speech encoding / decoding that can perform more effective encoding and achieve high decoded speech quality at the same bit rate as before The apparatus can be realized.

【００３５】また、本発明によれば、符号化装置に、パ
ワーベクトル算出部と、それを量子化する量子化器と、
マルチプレクサとを設け、復号化装置にデマルチプレク
サと、量子化パワーベクトルを復号化する逆量子化器
と、音源生成部を設けることにより、音声の立ち上がり
時のようにフレーム内でパワーが大きく変化するような
場合でも、フレーム内の複数のサブフレームのそれぞれ
の平均パワーをパワーベクトルとして復号化装置に送
り、パワーの変化を音源生成時に再現できるので、従来
と同じビットレートで高い復号化音声品質が得られる優
れた音声符号化復号化装置を実現できる。According to the present invention, a coding apparatus includes a power vector calculating section, a quantizer for quantizing the power vector calculating section,
By providing a multiplexer and providing a demultiplexer, an inverse quantizer for decoding a quantized power vector, and a sound source generation unit in a decoding device, the power greatly changes in a frame as at the time of rising of a voice. Even in such a case, the average power of each of a plurality of subframes in the frame is sent to the decoding device as a power vector, and the change in power can be reproduced at the time of generating the sound source, so that high decoded voice quality can be obtained at the same bit rate as before. An excellent speech encoding / decoding device obtained can be realized.

[Brief description of the drawings]

【図１】本発明の第１実施例における符号化装置のブロ
ック図FIG. 1 is a block diagram of an encoding device according to a first embodiment of the present invention.

【図２】本発明の第１実施例および第２実施例における
復号化装置のブロック図FIG. 2 is a block diagram of a decoding device according to a first embodiment and a second embodiment of the present invention;

【図３】本発明の第２実施例における符号化装置のブロ
ック図FIG. 3 is a block diagram of an encoding device according to a second embodiment of the present invention.

【図４】本発明の第３実施例における符号化装置のブロ
ック図FIG. 4 is a block diagram of an encoding device according to a third embodiment of the present invention.

【図５】本発明の第３実施例における復号化装置のブロ
ック図FIG. 5 is a block diagram of a decoding device according to a third embodiment of the present invention.

【図６】本発明の第４実施例における符号化装置のブロ
ック図FIG. 6 is a block diagram of an encoding device according to a fourth embodiment of the present invention.

【図７】本発明の第４実施例における復号化装置のブロ
ック図FIG. 7 is a block diagram of a decoding device according to a fourth embodiment of the present invention.

【図８】従来の符号化装置のブロック図FIG. 8 is a block diagram of a conventional encoding device.

【図９】従来の復号化装置のブロック図FIG. 9 is a block diagram of a conventional decoding device.

[Explanation of symbols]

１０１ピッチ周波数推定部１０２有声無声判定部１０３ＬＰＣ分析部１０４量子化器１０５ＬＰＣ逆フィルタ１０６フレームパワー算出部１０７量子化器１０８予測残差正規化部１０９残差パワースペクトル算出部１１０残差高調波パワー算出部１１１スペクトル補正ベクトル算出部１１２量子化器１１３、１１３Ａ、１１３Ｂ、１１３Ｃ残差情報算出
部１１４マルチプレクサ１１５逆量子化器１１６逆量子化器１１７量子化レベル算出部１１８レベル可変量子化器１１９レベル可変量子化器１２０マルチプレクサ１２１パワーベクトル算出部１２２量子化器１２３マルチプレクサ２０１デマルチプレクサ２０２モード決定部２０３逆量子化器２０４スペクトル強調部２０５高調波振幅乗数算出部２０６逆量子化器２０７逆量子化器２０８音源生成部２０９ＭＢＥ合成部２１０量子化レベル数算出部２１１デマルチプレクサ２１２レベル可変逆量子化器２１３レベル可変逆量子化器２１４デマルチプレクサ２１５逆量子化器２１６音源生成部Reference Signs List 101 Pitch frequency estimation unit 102 Voiced / unvoiced judgment unit 103 LPC analysis unit 104 Quantizer 105 LPC inverse filter 106 Frame power calculation unit 107 Quantizer 108 Prediction residual normalization unit 109 Residual power spectrum calculation unit 110 Residual harmonic Power calculator 111 Spectrum correction vector calculator 112 Quantizer 113, 113A, 113B, 113C Residual information calculator 114 Multiplexer 115 Dequantizer 116 Dequantizer 117 Quantization level calculator 118 Level variable quantizer 119 Level variable quantizer 120 multiplexer 121 power vector calculator 122 quantizer 123 multiplexer 201 demultiplexer 202 mode determiner 203 inverse quantizer 204 spectrum enhancer 205 harmonic amplitude multiplier calculator 2 6 Dequantizer 207 Dequantizer 208 Sound source generator 209 MBE synthesizer 210 Quantization level number calculator 211 Demultiplexer 212 Level variable dequantizer 213 Level variable dequantizer 214 Demultiplexer 215 Dequantizer 216 Sound source generator

Claims

(57) [Claims]

1. An encoding apparatus comprising: a pitch frequency estimating unit;
A voiced / unvoiced determination unit for voiced / unvoiced determination of each harmonic, and LP
A C (linear prediction) analyzer, a quantizer for quantizing LPC coefficients, an LPC inverse filter, a residual information calculator for extracting residual information other than at least residual frame power from the LPC predicted residual, A multiplexer for encoding the frequency, voiced / unvoiced determination, quantized LPC coefficient and residual information, wherein the decoding device is configured to determine the pitch frequency, voiced / unvoiced determination, quantized LPC coefficient and residual information from the code. A demultiplexer that decodes the LPC coefficient, a mode determination unit that determines the MBE combining mode, an inverse quantizer that inversely quantizes the quantized LPC coefficient to extract the LPC coefficient, and an amplitude from the LPC coefficient to each harmonic. It includes a harmonic amplitude multiplier calculating unit for obtaining a multiplier, and a sound source generating unit for generating a sound source signal from the pitch frequency and the voiced unvoiced frequency and residual information, and a MBE synthesis unit, encoder Definitive the residual information calculation unit, a frame-Pas
Power calculator and quantizer for quantizing frame power
A prediction residual normalization unit that normalizes the LPC prediction residual;
Power spectrum to calculate power spectrum of prediction residual
And the power of harmonics included in the residual
From the residual harmonic power calculator and the residual harmonic power,
Spectral correction vector to calculate vector correction vector
Calculator and quantization for quantizing the spectrum correction vector
A sound generator in the decoding apparatus includes a residual information
Power and spectrum calculated by the report calculation unit
A sound source is generated using the correction vector, and the encoding device and the decoding device
LPC coefficient and spectrum complement based on voiced and unvoiced decision
Calculate bit assignment for positive vector for each frame
And the number of quantization levels
Quantizer and inverse quantization for quantizing and inverse quantizing LPC coefficients
And inverse quantization of the detector and spectral correction vector
Quantizer and inverse quantizer calculate the quantization level
With different quantization levels for each frame given by the source
Performs quantization and inverse quantization, and also provides multiplexers and
The multiplexer calculates the number of quantization levels by the quantization level calculation unit.
Quantized LPC coefficients and
And quantize spectral correction vector
A speech encoding / decoding device characterized by the above-mentioned.