JPH07287599A - Voice-coder - Google Patents

Voice-coder

Info

Publication number
JPH07287599A
JPH07287599A JP6077098A JP7709894A JPH07287599A JP H07287599 A JPH07287599 A JP H07287599A JP 6077098 A JP6077098 A JP 6077098A JP 7709894 A JP7709894 A JP 7709894A JP H07287599 A JPH07287599 A JP H07287599A
Authority
JP
Japan
Prior art keywords
formant
prediction
impulse response
signal
frequency domain
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP6077098A
Other languages
Japanese (ja)
Inventor
Hiroaki Takeda
博昭 竹田
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Panasonic Holdings Corp
Original Assignee
Matsushita Electric Industrial Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Matsushita Electric Industrial Co Ltd filed Critical Matsushita Electric Industrial Co Ltd
Priority to JP6077098A priority Critical patent/JPH07287599A/en
Publication of JPH07287599A publication Critical patent/JPH07287599A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE:To make encoded voice quality excellent when a prediction error signal found by a prediction filter is quantized by linearly predicting and analyzing an input signal. CONSTITUTION:The voice-coder is provided with an impulse response calculator 13 which finds the impulse response of the prediction filter 12 generated from the output of a linear predictive analyzer 11, an impulse response frequency area converter 14 develops the found impulse response in a frequency range, and a formant calculator 15 which finds a formant calculates the position of the formant. Further, the prediction error signal outputted by the prediction filter 12 is developed by a prediction frequency range converter 16 into a frequency range prediction residue signal. The formant part of the prediction error of the output of the prediction filter 12 is quantized by a formant quantizer 17 and other parts are quantized by a quantizer 18.

Description

【発明の詳細な説明】Detailed Description of the Invention

【0001】[0001]

【産業上の利用分野】本発明は、音声符号化装置に関
し、特に予測残差量子化器に関するものである。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a speech coder and, more particularly, to a prediction residual quantizer.

【0002】[0002]

【従来の技術】音声信号を圧縮し符号化する方法とし
て、線形予測分析を用いた音声符号化装置がある。
2. Description of the Related Art As a method for compressing and coding a speech signal, there is a speech coding apparatus using linear prediction analysis.

【0003】図3は従来のこの種の音声符号化装置の構
成を示すものである。図3において、1は線形予測分析
器、2は予測フィルタ、3は予測残差量子化器である。
また、aは入力音声信号、bはフィルタ係数、cは予測
残差信号である。
FIG. 3 shows the configuration of a conventional speech coding apparatus of this type. In FIG. 3, 1 is a linear prediction analyzer, 2 is a prediction filter, and 3 is a prediction residual quantizer.
Further, a is an input voice signal, b is a filter coefficient, and c is a prediction residual signal.

【0004】以下、上記従来例の動作について説明す
る。線形予測分析器1では、入力音声信号aの声道情報
をフィルタとみなし、そのフィルタ係数を算出してい
る。予測フィルタ2では、線形予測分析器1で求めたフ
ィルタ係数bに従い、音声信号を予測残差信号cに変換
する。この予測残差信号cを予測残差量子化器3で量子
化し、量子化予測残差信号を求める。
The operation of the above conventional example will be described below. The linear prediction analyzer 1 regards the vocal tract information of the input voice signal a as a filter and calculates the filter coefficient thereof. The prediction filter 2 converts the audio signal into a prediction residual signal c according to the filter coefficient b obtained by the linear prediction analyzer 1. The prediction residual signal c is quantized by the prediction residual quantizer 3 to obtain a quantized prediction residual signal.

【0005】この方式の装置では、復号化部に送信する
信号は、線形予測分析器1で求めたフィルタ係数bと、
予測残差量子化器3で量子化した量子化予測残差信号で
ある。そして復号化部では、量子化予測残差信号から予
測残差信号を求め、それを合成フィルタに通して復号化
音声信号を得る。
In the apparatus of this system, the signal to be transmitted to the decoding unit is the filter coefficient b obtained by the linear prediction analyzer 1,
It is a quantized prediction residual signal quantized by the prediction residual quantizer 3. Then, in the decoding unit, a prediction residual signal is obtained from the quantized prediction residual signal and is passed through a synthesis filter to obtain a decoded speech signal.

【0006】[0006]

【発明が解決しようとする課題】しかしながら、人間の
音声信号にはホルマントと呼ばれる特徴があり、周波数
的にパワーの強い部分と弱い部分との片寄りがある。こ
れは、音声信号の特徴の一つであり、上記従来の音声符
号化装置では、すべての周波数帯域に対して平等に量子
化しており、この特徴を考慮した量子化を行なっていな
いため、ホルマント部分を忠実に量子化できず、良好な
音質を得られないという問題を有していた。
However, a human voice signal has a characteristic called a formant, and there is a bias between a portion having strong frequency and a portion having weak frequency power. This is one of the characteristics of a speech signal, and in the above-described conventional speech coding apparatus, quantization is performed equally for all frequency bands, and since quantization that does not consider this characteristic is not performed, the formant There is a problem that the portion cannot be quantized faithfully and good sound quality cannot be obtained.

【0007】本発明は、上記従来の問題を解決するもの
であり、ホルマント情報を考慮した予測残差信号の量子
化を行なうことにより優れた音質を再生することのでき
る音声符号化装置を提供することを目的とする。
The present invention solves the above-mentioned conventional problems, and provides a speech coding apparatus capable of reproducing excellent sound quality by quantizing a prediction residual signal in consideration of formant information. The purpose is to

【0008】[0008]

【課題を解決するための手段】上記目的を達成するため
に、本発明の音声符号化装置は、入力音声を線形予測分
析し、予測フィルタで求めた予測残差を量子化する音声
符号化装置において、音声信号のホルマント部分を特定
して他の部分とは独立して量子化する手段を備えたもの
である。
In order to achieve the above object, a speech coder according to the present invention is a speech coder for performing linear prediction analysis on input speech and quantizing a prediction residual obtained by a prediction filter. , A means for specifying a formant part of a voice signal and quantizing the formant part independently of other parts is provided.

【0009】[0009]

【作用】本発明は、上記した構成により、人間の音声信
号の特徴の一つであるホルマントを考慮し、音声信号の
波形的特徴を効果的に量子化することにより、良好な復
号化音声を実現できる。
According to the present invention, with the above configuration, a good decoded speech can be obtained by effectively quantizing the waveform characteristic of the speech signal in consideration of the formant which is one of the characteristics of the human speech signal. realizable.

【0010】[0010]

【実施例】以下、本発明の一実施例について、図面を参
照しながら説明する。図1は本発明の一実施例における
音声符号化装置の構成を示すものである。図1におい
て、11は入力音声信号のフィルタ係数を算出する線形
予測分析器、12は求められたフィルタ係数を予測残差
信号に変換する予測フィルタである。13は予測フィル
タ12のインパルス応答を算出するインパルス応答算出
器である。14は求められたインパルス応答を周波数領
域インパルス応答に展開するインパルス応答周波数領域
変換器である。15は周波数領域インパルス応答からホ
ルマント部分を算出するホルマント算出器である。16
は予測フィルタ12で求めた予測残差信号を周波数領域
予測残差信号に展開する予測残差周波数領域変換器であ
る。17はホルマント算出器15からの情報に基づいて
ホルマント部分に相当する周波数領域予測残差信号を量
子化するホルマント量子化器である。18はホルマント
部分以外の周波数領域予測残差信号を量子化する量子化
器である。aは入力音声信号、bはフィルタ係数、cは
予測残差信号、dは予測フィルタ12のインパルス応答
である。eはインパルス応答dを周波数領域に展開した
周波数領域インパルス応答であり、fはホルマント部分
確定信号であり、gは予測残差信号cを周波数領域に展
開した周波数領域予測残差信号である。
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described below with reference to the drawings. FIG. 1 shows the configuration of a speech coder according to an embodiment of the present invention. In FIG. 1, 11 is a linear prediction analyzer that calculates the filter coefficient of the input speech signal, and 12 is a prediction filter that converts the obtained filter coefficient into a prediction residual signal. Reference numeral 13 is an impulse response calculator that calculates the impulse response of the prediction filter 12. Reference numeral 14 is an impulse response frequency domain converter that expands the obtained impulse response into a frequency domain impulse response. A formant calculator 15 calculates a formant part from the frequency domain impulse response. 16
Is a prediction residual frequency domain converter that expands the prediction residual signal obtained by the prediction filter 12 into a frequency domain prediction residual signal. A formant quantizer 17 quantizes the frequency domain prediction residual signal corresponding to the formant part based on the information from the formant calculator 15. A quantizer 18 quantizes the frequency domain prediction residual signal other than the formant part. a is an input speech signal, b is a filter coefficient, c is a prediction residual signal, and d is an impulse response of the prediction filter 12. e is a frequency domain impulse response in which the impulse response d is expanded in the frequency domain, f is a formant partial deterministic signal, and g is a frequency domain prediction residual signal in which the prediction residual signal c is expanded in the frequency domain.

【0011】以上のように構成された音声符号化装置に
ついて、以下その動作を説明する。まず、入力音声信号
aから予測フィルタ12のフィルタ係数bを線形予測分
析器11で算出する。フィルタ係数bを用いて予測フィ
ルタ12で入力音声信号aを予測残差信号cに変換す
る。またフィルタ係数bから予測フィルタ12のインパ
ルス応答dをインパルス応答算出器13で算出する。上
記で求めた予測残差信号cおよびインパルス応答dは時
間軸上の信号である。
The operation of the speech coding apparatus configured as above will be described below. First, the linear prediction analyzer 11 calculates the filter coefficient b of the prediction filter 12 from the input audio signal a. The prediction filter 12 uses the filter coefficient b to convert the input speech signal a into a prediction residual signal c. Further, the impulse response calculator 13 calculates the impulse response d of the prediction filter 12 from the filter coefficient b. The prediction residual signal c and the impulse response d obtained above are signals on the time axis.

【0012】次に、予測残差周波数領域変換器16で予
測残差信号cを周波数領域予測残差信号gへと変換す
る。同様に、インパルス応答周波数領域変換器14で周
波数軸上の信号である周波数インパルス応答eへと展開
する。周波数領域インパルス応答eは、入力音声信号a
の周波数別のパワーを示す。このパワー値により入力音
声信号aの特徴の一つであるホルマントが図2のように
推測できる。また周波数領域インパルス応答eから、ホ
ルマント算出器15によって周波数領域予測残差信号g
のうちどれがホルマント部分に対応しているか示すホル
マント部分確定信号fを算出する。
Next, the prediction residual frequency domain converter 16 transforms the prediction residual signal c into a frequency domain prediction residual signal g. Similarly, the impulse response frequency domain converter 14 develops a frequency impulse response e which is a signal on the frequency axis. The frequency domain impulse response e is the input speech signal a
The power of each frequency is shown. From this power value, the formant, which is one of the characteristics of the input audio signal a, can be estimated as shown in FIG. Also, from the frequency domain impulse response e, the formant calculator 15 determines the frequency domain prediction residual signal g
A formant part confirmation signal f indicating which of them corresponds to the formant part is calculated.

【0013】次に、ホルマント部分に対応する周波数領
域予測残差信号gをホルマント部分確定信号fにより確
定し、ホルマント量子化器17によって量子化を行な
う。ホルマント部分に対応していない周波数領域予測残
差信号gは量子化器18によって量子化を行なう。
Next, the frequency domain prediction residual signal g corresponding to the formant part is determined by the formant part determination signal f and quantized by the formant quantizer 17. The frequency domain prediction residual signal g that does not correspond to the formant part is quantized by the quantizer 18.

【0014】復号化器に送る信号は、線形予測分析器1
1で求めたフィルタ係数b、ホルマント量子化器17の
出力であるホルマント部分量子化残差信号および量子化
器18の出力である量子化残差信号である。そして、復
号化部では、ホルマント部分量子化残差信号および量子
化残差信号から予測残差信号を復号し、それを合成フィ
ルタに通して復号化音声信号を得る。
The signal sent to the decoder is the linear prediction analyzer 1.
1 is the filter coefficient b obtained in step 1, the formant partially quantized residual signal which is the output of the formant quantizer 17, and the quantized residual signal which is the output of the quantizer 18. Then, the decoding unit decodes the prediction residual signal from the formant partially quantized residual signal and the quantized residual signal, and passes it through a synthesis filter to obtain a decoded speech signal.

【0015】このように、上記実施例によれば、音声信
号のホルマント部分を他の部分と独立して量子化するた
め、音声信号の波形的特徴を復号化音声でも保つことが
でき、良好な復号化音声を再生することが可能である。
As described above, according to the above embodiment, the formant portion of the voice signal is quantized independently of the other portions, so that the waveform characteristics of the voice signal can be maintained even in the decoded voice, which is excellent. It is possible to play the decoded audio.

【0016】[0016]

【発明の効果】以上のように、本発明は、音声信号のホ
ルマント部分を特定して他の部分とは独立して量子化す
る手段を備えているので、音声信号の波形的特徴を復号
化音声でも保つことができ、良好な復号化音声を再生す
ることができるという効果を有する。
As described above, the present invention has means for specifying the formant portion of a voice signal and quantizing the formant portion independently of the other portions. Therefore, the waveform characteristic of the voice signal is decoded. This has the effect that even voice can be maintained, and good decoded voice can be reproduced.

【図面の簡単な説明】[Brief description of drawings]

【図1】本発明の一実施例における音声符号化装置の概
略構成を示すブロック図
FIG. 1 is a block diagram showing a schematic configuration of a speech coding apparatus according to an embodiment of the present invention.

【図2】フィルタ係数のインパルス応答から算出された
信号スペクトルを示す特性図
FIG. 2 is a characteristic diagram showing a signal spectrum calculated from an impulse response of a filter coefficient.

【図3】従来例における音声符号化装置の概略構成を示
すブロック図
FIG. 3 is a block diagram showing a schematic configuration of a speech encoding apparatus in a conventional example.

【符号の説明】[Explanation of symbols]

11 線形予測分析器 12 予測フィルタ 13 インパルス応答算出器 14 インパルス応答周波数領域変換器 15 ホルマント算出器 16 予測残差周波数領域変換器 17 ホルマント量子化器 18 量子化器 11 Linear Prediction Analyzer 12 Prediction Filter 13 Impulse Response Calculator 14 Impulse Response Frequency Domain Transformer 15 Formant Calculator 16 Prediction Residual Frequency Domain Transformer 17 Formant Quantizer 18 Quantizer

Claims (2)

【特許請求の範囲】[Claims] 【請求項1】 入力音声を線形予測分析し、予測フィル
タで求めた予測残差を量子化する音声符号化装置におい
て、音声信号のホルマント部分を特定して他の部分とは
独立して量子化する手段を備えた音声符号化装置。
1. A speech coding apparatus for linearly predicting an input speech and quantizing a prediction residual obtained by a prediction filter, wherein a formant portion of a speech signal is specified and quantized independently of other portions. A speech encoding apparatus having means for performing.
【請求項2】 入力音声信号のフィルタ係数を算出する
線形予測分析器と、前記求められたフィルタ係数を予測
残差信号に変換する予測フィルタと、前記フィルタ係数
のインパルス応答を算出するインパルス応答算出器と、
前記インパルス応答を周波数領域インパルス応答に展開
するインパルス応答周波数領域変換器と、前記周波数領
域インパルス応答からホルマント部分を算出するホルマ
ント算出器と、予測フィルタで求めた予測残差信号を周
波数領域予測残差信号に展開する予測残差周波数領域変
換器と、ホルマント算出器からの情報に基づいて前記周
波数領域残差のホルマント部分に相当する箇所を量子化
するホルマント量子化器と、前記ホルマント部分以外の
周波数領域予測残差信号を量子化する量子化器とを備え
た音声符号化装置。
2. A linear prediction analyzer for calculating a filter coefficient of an input speech signal, a prediction filter for converting the obtained filter coefficient into a prediction residual signal, and an impulse response calculation for calculating an impulse response of the filter coefficient. A vessel,
An impulse response frequency domain converter that expands the impulse response into a frequency domain impulse response, a formant calculator that calculates a formant part from the frequency domain impulse response, and a prediction residual signal obtained by a prediction filter from a frequency domain prediction residual. A predictive residual frequency domain converter expanded into a signal, a formant quantizer that quantizes a portion corresponding to the formant part of the frequency domain residual based on information from the formant calculator, and a frequency other than the formant part. A speech coding apparatus comprising: a quantizer for quantizing a region prediction residual signal.
JP6077098A 1994-04-15 1994-04-15 Voice-coder Pending JPH07287599A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP6077098A JPH07287599A (en) 1994-04-15 1994-04-15 Voice-coder

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP6077098A JPH07287599A (en) 1994-04-15 1994-04-15 Voice-coder

Publications (1)

Publication Number Publication Date
JPH07287599A true JPH07287599A (en) 1995-10-31

Family

ID=13624321

Family Applications (1)

Application Number Title Priority Date Filing Date
JP6077098A Pending JPH07287599A (en) 1994-04-15 1994-04-15 Voice-coder

Country Status (1)

Country Link
JP (1) JPH07287599A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2008083363A (en) * 2006-09-27 2008-04-10 Casio Comput Co Ltd Voice encoding device, voice decoding device, voice encoding method, voice decoding method and program
US7421304B2 (en) 2002-01-21 2008-09-02 Kenwood Corporation Audio signal processing device, signal recovering device, audio signal processing method and signal recovering method

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7421304B2 (en) 2002-01-21 2008-09-02 Kenwood Corporation Audio signal processing device, signal recovering device, audio signal processing method and signal recovering method
JP2008083363A (en) * 2006-09-27 2008-04-10 Casio Comput Co Ltd Voice encoding device, voice decoding device, voice encoding method, voice decoding method and program

Similar Documents

Publication Publication Date Title
JP3592473B2 (en) Perceptual noise shaping in the time domain by LPC prediction in the frequency domain
KR100427753B1 (en) Method and apparatus for reproducing voice signal, method and apparatus for voice decoding, method and apparatus for voice synthesis and portable wireless terminal apparatus
US8010348B2 (en) Adaptive encoding and decoding with forward linear prediction
US7194407B2 (en) Audio coding method and apparatus
JP5154934B2 (en) Joint audio coding to minimize perceptual distortion
Edler et al. Audio coding using a psychoacoustic pre-and post-filter
KR20040005838A (en) Method and system for estimating artificial high band signal in speech codec
GB2359468A (en) Converting an audio signal between data compression formats
US5933802A (en) Speech reproducing system with efficient speech-rate converter
JPH09127995A (en) Signal decoding method and signal decoder
JPH10240299A (en) Voice encoding and decoding device
JP3092653B2 (en) Broadband speech encoding apparatus, speech decoding apparatus, and speech encoding / decoding apparatus
JPH07287599A (en) Voice-coder
KR101038446B1 (en) Audio coding
JP3348759B2 (en) Transform coding method and transform decoding method
JP2797348B2 (en) Audio encoding / decoding device
US9620139B2 (en) Adaptive linear predictive coding/decoding
JP2736157B2 (en) Encoding device
JP2006047561A (en) Audio signal encoding device and audio signal decoding device
JP3496618B2 (en) Apparatus and method for speech encoding / decoding including speechless encoding operating at multiple rates
Holters et al. Delay-free lossy audio coding using shelving pre-and post-filters
JPH1049200A (en) Method and device for voice information compression and accumulation
KR100349329B1 (en) Method of processing of MPEG-2 AAC algorithm
JP2906596B2 (en) Audio coding device
JP4195598B2 (en) Encoding method, decoding method, encoding device, decoding device, encoding program, decoding program