JPH07287599A

JPH07287599A - Voice-coder

Info

Publication number: JPH07287599A
Application number: JP6077098A
Authority: JP
Inventors: Hiroaki Takeda; 博昭竹田
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1994-04-15
Filing date: 1994-04-15
Publication date: 1995-10-31

Abstract

PURPOSE:To make encoded voice quality excellent when a prediction error signal found by a prediction filter is quantized by linearly predicting and analyzing an input signal. CONSTITUTION:The voice-coder is provided with an impulse response calculator 13 which finds the impulse response of the prediction filter 12 generated from the output of a linear predictive analyzer 11, an impulse response frequency area converter 14 develops the found impulse response in a frequency range, and a formant calculator 15 which finds a formant calculates the position of the formant. Further, the prediction error signal outputted by the prediction filter 12 is developed by a prediction frequency range converter 16 into a frequency range prediction residue signal. The formant part of the prediction error of the output of the prediction filter 12 is quantized by a formant quantizer 17 and other parts are quantized by a quantizer 18.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、音声符号化装置に関
し、特に予測残差量子化器に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a speech coder and, more particularly, to a prediction residual quantizer.

【０００２】[0002]

【従来の技術】音声信号を圧縮し符号化する方法とし
て、線形予測分析を用いた音声符号化装置がある。2. Description of the Related Art As a method for compressing and coding a speech signal, there is a speech coding apparatus using linear prediction analysis.

【０００３】図３は従来のこの種の音声符号化装置の構
成を示すものである。図３において、１は線形予測分析
器、２は予測フィルタ、３は予測残差量子化器である。
また、ａは入力音声信号、ｂはフィルタ係数、ｃは予測
残差信号である。FIG. 3 shows the configuration of a conventional speech coding apparatus of this type. In FIG. 3, 1 is a linear prediction analyzer, 2 is a prediction filter, and 3 is a prediction residual quantizer.
Further, a is an input voice signal, b is a filter coefficient, and c is a prediction residual signal.

【０００４】以下、上記従来例の動作について説明す
る。線形予測分析器１では、入力音声信号ａの声道情報
をフィルタとみなし、そのフィルタ係数を算出してい
る。予測フィルタ２では、線形予測分析器１で求めたフ
ィルタ係数ｂに従い、音声信号を予測残差信号ｃに変換
する。この予測残差信号ｃを予測残差量子化器３で量子
化し、量子化予測残差信号を求める。The operation of the above conventional example will be described below. The linear prediction analyzer 1 regards the vocal tract information of the input voice signal a as a filter and calculates the filter coefficient thereof. The prediction filter 2 converts the audio signal into a prediction residual signal c according to the filter coefficient b obtained by the linear prediction analyzer 1. The prediction residual signal c is quantized by the prediction residual quantizer 3 to obtain a quantized prediction residual signal.

【０００５】この方式の装置では、復号化部に送信する
信号は、線形予測分析器１で求めたフィルタ係数ｂと、
予測残差量子化器３で量子化した量子化予測残差信号で
ある。そして復号化部では、量子化予測残差信号から予
測残差信号を求め、それを合成フィルタに通して復号化
音声信号を得る。In the apparatus of this system, the signal to be transmitted to the decoding unit is the filter coefficient b obtained by the linear prediction analyzer 1,
It is a quantized prediction residual signal quantized by the prediction residual quantizer 3. Then, in the decoding unit, a prediction residual signal is obtained from the quantized prediction residual signal and is passed through a synthesis filter to obtain a decoded speech signal.

【０００６】[0006]

【発明が解決しようとする課題】しかしながら、人間の
音声信号にはホルマントと呼ばれる特徴があり、周波数
的にパワーの強い部分と弱い部分との片寄りがある。こ
れは、音声信号の特徴の一つであり、上記従来の音声符
号化装置では、すべての周波数帯域に対して平等に量子
化しており、この特徴を考慮した量子化を行なっていな
いため、ホルマント部分を忠実に量子化できず、良好な
音質を得られないという問題を有していた。However, a human voice signal has a characteristic called a formant, and there is a bias between a portion having strong frequency and a portion having weak frequency power. This is one of the characteristics of a speech signal, and in the above-described conventional speech coding apparatus, quantization is performed equally for all frequency bands, and since quantization that does not consider this characteristic is not performed, the formant There is a problem that the portion cannot be quantized faithfully and good sound quality cannot be obtained.

【０００７】本発明は、上記従来の問題を解決するもの
であり、ホルマント情報を考慮した予測残差信号の量子
化を行なうことにより優れた音質を再生することのでき
る音声符号化装置を提供することを目的とする。The present invention solves the above-mentioned conventional problems, and provides a speech coding apparatus capable of reproducing excellent sound quality by quantizing a prediction residual signal in consideration of formant information. The purpose is to

【０００８】[0008]

【課題を解決するための手段】上記目的を達成するため
に、本発明の音声符号化装置は、入力音声を線形予測分
析し、予測フィルタで求めた予測残差を量子化する音声
符号化装置において、音声信号のホルマント部分を特定
して他の部分とは独立して量子化する手段を備えたもの
である。In order to achieve the above object, a speech coder according to the present invention is a speech coder for performing linear prediction analysis on input speech and quantizing a prediction residual obtained by a prediction filter. , A means for specifying a formant part of a voice signal and quantizing the formant part independently of other parts is provided.

【０００９】[0009]

【作用】本発明は、上記した構成により、人間の音声信
号の特徴の一つであるホルマントを考慮し、音声信号の
波形的特徴を効果的に量子化することにより、良好な復
号化音声を実現できる。According to the present invention, with the above configuration, a good decoded speech can be obtained by effectively quantizing the waveform characteristic of the speech signal in consideration of the formant which is one of the characteristics of the human speech signal. realizable.

【００１０】[0010]

【実施例】以下、本発明の一実施例について、図面を参
照しながら説明する。図１は本発明の一実施例における
音声符号化装置の構成を示すものである。図１におい
て、１１は入力音声信号のフィルタ係数を算出する線形
予測分析器、１２は求められたフィルタ係数を予測残差
信号に変換する予測フィルタである。１３は予測フィル
タ１２のインパルス応答を算出するインパルス応答算出
器である。１４は求められたインパルス応答を周波数領
域インパルス応答に展開するインパルス応答周波数領域
変換器である。１５は周波数領域インパルス応答からホ
ルマント部分を算出するホルマント算出器である。１６
は予測フィルタ１２で求めた予測残差信号を周波数領域
予測残差信号に展開する予測残差周波数領域変換器であ
る。１７はホルマント算出器１５からの情報に基づいて
ホルマント部分に相当する周波数領域予測残差信号を量
子化するホルマント量子化器である。１８はホルマント
部分以外の周波数領域予測残差信号を量子化する量子化
器である。ａは入力音声信号、ｂはフィルタ係数、ｃは
予測残差信号、ｄは予測フィルタ１２のインパルス応答
である。ｅはインパルス応答ｄを周波数領域に展開した
周波数領域インパルス応答であり、ｆはホルマント部分
確定信号であり、ｇは予測残差信号ｃを周波数領域に展
開した周波数領域予測残差信号である。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described below with reference to the drawings. FIG. 1 shows the configuration of a speech coder according to an embodiment of the present invention. In FIG. 1, 11 is a linear prediction analyzer that calculates the filter coefficient of the input speech signal, and 12 is a prediction filter that converts the obtained filter coefficient into a prediction residual signal. Reference numeral 13 is an impulse response calculator that calculates the impulse response of the prediction filter 12. Reference numeral 14 is an impulse response frequency domain converter that expands the obtained impulse response into a frequency domain impulse response. A formant calculator 15 calculates a formant part from the frequency domain impulse response. 16
Is a prediction residual frequency domain converter that expands the prediction residual signal obtained by the prediction filter 12 into a frequency domain prediction residual signal. A formant quantizer 17 quantizes the frequency domain prediction residual signal corresponding to the formant part based on the information from the formant calculator 15. A quantizer 18 quantizes the frequency domain prediction residual signal other than the formant part. a is an input speech signal, b is a filter coefficient, c is a prediction residual signal, and d is an impulse response of the prediction filter 12. e is a frequency domain impulse response in which the impulse response d is expanded in the frequency domain, f is a formant partial deterministic signal, and g is a frequency domain prediction residual signal in which the prediction residual signal c is expanded in the frequency domain.

【００１１】以上のように構成された音声符号化装置に
ついて、以下その動作を説明する。まず、入力音声信号
ａから予測フィルタ１２のフィルタ係数ｂを線形予測分
析器１１で算出する。フィルタ係数ｂを用いて予測フィ
ルタ１２で入力音声信号ａを予測残差信号ｃに変換す
る。またフィルタ係数ｂから予測フィルタ１２のインパ
ルス応答ｄをインパルス応答算出器１３で算出する。上
記で求めた予測残差信号ｃおよびインパルス応答ｄは時
間軸上の信号である。The operation of the speech coding apparatus configured as above will be described below. First, the linear prediction analyzer 11 calculates the filter coefficient b of the prediction filter 12 from the input audio signal a. The prediction filter 12 uses the filter coefficient b to convert the input speech signal a into a prediction residual signal c. Further, the impulse response calculator 13 calculates the impulse response d of the prediction filter 12 from the filter coefficient b. The prediction residual signal c and the impulse response d obtained above are signals on the time axis.

【００１２】次に、予測残差周波数領域変換器１６で予
測残差信号ｃを周波数領域予測残差信号ｇへと変換す
る。同様に、インパルス応答周波数領域変換器１４で周
波数軸上の信号である周波数インパルス応答ｅへと展開
する。周波数領域インパルス応答ｅは、入力音声信号ａ
の周波数別のパワーを示す。このパワー値により入力音
声信号ａの特徴の一つであるホルマントが図２のように
推測できる。また周波数領域インパルス応答ｅから、ホ
ルマント算出器１５によって周波数領域予測残差信号ｇ
のうちどれがホルマント部分に対応しているか示すホル
マント部分確定信号ｆを算出する。Next, the prediction residual frequency domain converter 16 transforms the prediction residual signal c into a frequency domain prediction residual signal g. Similarly, the impulse response frequency domain converter 14 develops a frequency impulse response e which is a signal on the frequency axis. The frequency domain impulse response e is the input speech signal a
The power of each frequency is shown. From this power value, the formant, which is one of the characteristics of the input audio signal a, can be estimated as shown in FIG. Also, from the frequency domain impulse response e, the formant calculator 15 determines the frequency domain prediction residual signal g
A formant part confirmation signal f indicating which of them corresponds to the formant part is calculated.

【００１３】次に、ホルマント部分に対応する周波数領
域予測残差信号ｇをホルマント部分確定信号ｆにより確
定し、ホルマント量子化器１７によって量子化を行な
う。ホルマント部分に対応していない周波数領域予測残
差信号ｇは量子化器１８によって量子化を行なう。Next, the frequency domain prediction residual signal g corresponding to the formant part is determined by the formant part determination signal f and quantized by the formant quantizer 17. The frequency domain prediction residual signal g that does not correspond to the formant part is quantized by the quantizer 18.

【００１４】復号化器に送る信号は、線形予測分析器１
１で求めたフィルタ係数ｂ、ホルマント量子化器１７の
出力であるホルマント部分量子化残差信号および量子化
器１８の出力である量子化残差信号である。そして、復
号化部では、ホルマント部分量子化残差信号および量子
化残差信号から予測残差信号を復号し、それを合成フィ
ルタに通して復号化音声信号を得る。The signal sent to the decoder is the linear prediction analyzer 1.
1 is the filter coefficient b obtained in step 1, the formant partially quantized residual signal which is the output of the formant quantizer 17, and the quantized residual signal which is the output of the quantizer 18. Then, the decoding unit decodes the prediction residual signal from the formant partially quantized residual signal and the quantized residual signal, and passes it through a synthesis filter to obtain a decoded speech signal.

【００１５】このように、上記実施例によれば、音声信
号のホルマント部分を他の部分と独立して量子化するた
め、音声信号の波形的特徴を復号化音声でも保つことが
でき、良好な復号化音声を再生することが可能である。As described above, according to the above embodiment, the formant portion of the voice signal is quantized independently of the other portions, so that the waveform characteristics of the voice signal can be maintained even in the decoded voice, which is excellent. It is possible to play the decoded audio.

【００１６】[0016]

【発明の効果】以上のように、本発明は、音声信号のホ
ルマント部分を特定して他の部分とは独立して量子化す
る手段を備えているので、音声信号の波形的特徴を復号
化音声でも保つことができ、良好な復号化音声を再生す
ることができるという効果を有する。As described above, the present invention has means for specifying the formant portion of a voice signal and quantizing the formant portion independently of the other portions. Therefore, the waveform characteristic of the voice signal is decoded. This has the effect that even voice can be maintained, and good decoded voice can be reproduced.

[Brief description of drawings]

【図１】本発明の一実施例における音声符号化装置の概
略構成を示すブロック図FIG. 1 is a block diagram showing a schematic configuration of a speech coding apparatus according to an embodiment of the present invention.

【図２】フィルタ係数のインパルス応答から算出された
信号スペクトルを示す特性図FIG. 2 is a characteristic diagram showing a signal spectrum calculated from an impulse response of a filter coefficient.

【図３】従来例における音声符号化装置の概略構成を示
すブロック図FIG. 3 is a block diagram showing a schematic configuration of a speech encoding apparatus in a conventional example.

[Explanation of symbols]

１１線形予測分析器１２予測フィルタ１３インパルス応答算出器１４インパルス応答周波数領域変換器１５ホルマント算出器１６予測残差周波数領域変換器１７ホルマント量子化器１８量子化器 11 Linear Prediction Analyzer 12 Prediction Filter 13 Impulse Response Calculator 14 Impulse Response Frequency Domain Transformer 15 Formant Calculator 16 Prediction Residual Frequency Domain Transformer 17 Formant Quantizer 18 Quantizer

Claims

[Claims]

1. A speech coding apparatus for linearly predicting an input speech and quantizing a prediction residual obtained by a prediction filter, wherein a formant portion of a speech signal is specified and quantized independently of other portions. A speech encoding apparatus having means for performing.

2. A linear prediction analyzer for calculating a filter coefficient of an input speech signal, a prediction filter for converting the obtained filter coefficient into a prediction residual signal, and an impulse response calculation for calculating an impulse response of the filter coefficient. A vessel,
An impulse response frequency domain converter that expands the impulse response into a frequency domain impulse response, a formant calculator that calculates a formant part from the frequency domain impulse response, and a prediction residual signal obtained by a prediction filter from a frequency domain prediction residual. A predictive residual frequency domain converter expanded into a signal, a formant quantizer that quantizes a portion corresponding to the formant part of the frequency domain residual based on information from the formant calculator, and a frequency other than the formant part. A speech coding apparatus comprising: a quantizer for quantizing a region prediction residual signal.