JP2621376B2

JP2621376B2 - Multi-pulse encoder

Info

Publication number: JP2621376B2
Application number: JP63164346A
Authority: JP
Inventors: 哲田口
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1988-06-30
Filing date: 1988-06-30
Publication date: 1997-06-18
Anticipated expiration: 2012-06-18
Also published as: JPH0212300A

Abstract

PURPOSE:To obtain the multi-pulse encoding device which is tolerant to a transmission line error by providing a linear predicting means which inputs the output signal of a 2nd fixed filtering means. CONSTITUTION:A linear prediction LPC coefficient extracting means 3 equipped with the 2nd fixed filtering means 31 and a linear prediction LPC means 33 extracts the linear prediction coefficient of a pre-emphasized voice signal. Further, spectrum structure conversion including partial pre-emphasis is carried out for the voice signal by a 1st fixed filtering means 81 and a time variation filtering means 83 included in a spectrum structure converting means 8 and then a multi-pulse analysis is taken to obtain multi-pulses. The pre-emphasis is therefore taken without spoiling the effect of equivalent S/N improvement by noise shaping. Consequently, the multi-pule encoding device is made tolerant to transmission line errors.

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明はマルチパルス符号化装置に関する。Description: TECHNICAL FIELD The present invention relates to a multi-pulse encoder.

[Conventional technology]

音声信号を効率よく符号化する方法の一つにマルチパ
ルス符号化がある。Multi-pulse encoding is one of the methods for encoding an audio signal efficiently.

マルチパルス符号化は、分析側において、入力音声信
号を線形予測（linear predictive coding:LPC）分析し
て得たαパラメータ,Kパラメータ等のLPC係数によって
スペクトル包絡情報を表現し、インパルス系列（マルチ
パルス）によって音源情報を表現する。スペクトル包絡
情報は音声を発生する声道系のスペクトル分布情報を表
すものである。また、音源情報はスペクトル包絡の微細
構造を表すものであり、入力音声信号からスペクトル分
布情報を除いた残差信号の情報である。上述したマルチ
パルスの各パルスは、残差信号の波形の特徴に対応した
発生時間位置および振幅になるようにそれぞれ決定され
る。合成側では、αパラメータを係数とする全極型のデ
ィジタルフィルタであるLPC合成フィルタをマルチパル
スで励振することにより出力音声信号を合成できる。音
源情報に上述の波形情報を含むので、マルチパルス符号
化は効率がよい。In multi-pulse coding, on the analysis side, spectral envelope information is represented by LPC coefficients such as α parameters and K parameters obtained by linear predictive coding (LPC) analysis of an input speech signal, and an impulse sequence (multi-pulse ) Represents sound source information. The spectrum envelope information represents spectrum distribution information of a vocal tract system that generates a voice. The sound source information represents a fine structure of a spectrum envelope, and is information on a residual signal obtained by removing spectrum distribution information from an input audio signal. Each pulse of the above-described multi-pulse is determined so as to have an occurrence time position and an amplitude corresponding to the characteristic of the waveform of the residual signal. On the synthesis side, an output voice signal can be synthesized by exciting an LPC synthesis filter, which is an all-pole digital filter using the α parameter as a coefficient, with multiple pulses. Since the excitation information includes the above-described waveform information, multi-pulse encoding is efficient.

マルチパルス符号化においてはノイズシェイピイグ
（noise shaping）が行われる。このノイズシェイピン
グは以下述べるようにして行われる。In multi-pulse coding, noise shaping is performed. This noise shaping is performed as described below.

分析側において入力音声信号を、そのスペクトル包絡
の変化を抑圧するように、スペクトル構造変換し、スペ
クトル構造変換した音声信号を分析してマルチパルスを
得る。分析側においてこのマルチパルスを用いて（スペ
クトル構造変換後の音声信号に対応する）音声信号を合
成する。合成されたこの音声信号が含む量子化雑音のス
ペクトル包絡は平坦である。この音声信号を合成側のス
ペクトル構造変換の逆変換によってスペクトル構造変換
し、入力音声信号に対応する出力音声信号を得る。この
出力音声信号が含む量子化雑音のスペクトル包絡は、入
力音声信号のスペクトル包絡とスペクトル構造変換後の
音声信号のスペクトル包絡との差と同じ形状をしてい
る。いいかえれば、上述のノイズシェイピングにより出
力音声信号中の量子化雑音のスペクトル包絡が出力音声
信号のスペクトル包絡にある程度類似するようにシェィ
ピングされる。音声符号化における雑音は量子化雑音が
支配的であり、聴覚には信号成分に近接した周波数の雑
音成分はS/Nがある値以上になると信号成分でマスクさ
れて感じないという特性があるので、ノイズシェイピン
グを行うことにより等価的にS/Nを改善できる。このS/N
改善度はノイズシェイピングの程度、いいかえれば、分
析側でのスペクトル包絡の変化の抑圧の程度と共に単調
に増大するわけではなく、ノイズシェイピングの程度に
は最適値があり、この最適値は聴感テストによって選定
される。なお、後に詳述するように、スペクトル構造変
換した音声信号に対応する音声信号を中間的に得ること
なく、入力音声信号に対応する出力音声信号をマルチパ
ルスから直接合成することができる。On the analysis side, the input audio signal is subjected to spectral structure conversion so as to suppress a change in its spectral envelope, and the spectral structure converted audio signal is analyzed to obtain a multipulse. On the analysis side, an audio signal (corresponding to the audio signal after the spectral structure conversion) is synthesized using the multi-pulse. The spectral envelope of the quantization noise included in the synthesized speech signal is flat. This audio signal is subjected to spectrum structure conversion by inverse conversion of the spectrum structure conversion on the synthesis side to obtain an output audio signal corresponding to the input audio signal. The spectral envelope of the quantization noise included in the output audio signal has the same shape as the difference between the spectral envelope of the input audio signal and the spectral envelope of the audio signal after the spectral structure conversion. In other words, the noise envelope is shaped so that the spectral envelope of the quantization noise in the output audio signal is somewhat similar to the spectral envelope of the output audio signal. The noise in speech coding is dominated by quantization noise.Hearing has the property that noise components at frequencies close to the signal components are masked by the signal components when the S / N exceeds a certain value and are not felt. S / N can be equivalently improved by performing noise shaping. This S / N
The degree of improvement does not increase monotonically with the degree of noise shaping, in other words, the degree of suppression of changes in the spectral envelope on the analysis side.There is an optimum value for the degree of noise shaping, and this optimum value is Selected. As will be described later in detail, an output audio signal corresponding to an input audio signal can be directly synthesized from a multi-pulse without obtaining an intermediate audio signal corresponding to an audio signal subjected to spectral structure conversion.

[Problems to be solved by the invention]

上述したマルチパルス符号化を行う従来のマルチパル
ス符号化装置は伝送路誤りに弱い欠点がある。その原因
はLPC係数の係数感度が高いことにある。The conventional multi-pulse encoding apparatus that performs the above-described multi-pulse encoding has a disadvantage that it is vulnerable to transmission path errors. The cause is that the coefficient sensitivity of the LPC coefficient is high.

この係数感度を低下させる一般的、かつ、合理的な方
法としてプリエンファシスが知られている。音声スペク
トルは平均的に−6dB/オクターブの１次傾斜があるの
で、符号化前にこの１次傾斜を補償するようにプリエン
ファシスすれば、分析側から合成側へ伝送すべき情報が
少くてよい、あるいは、ビット伝送速度が同じなら伝送
する情報により多くの冗長度をもたせることができるか
ら、係数感度を低下させることができる。ところが、マ
ルチパルス符号化装置は先に述べたようにノイズシェイ
ピングを行っており、入力音声信号をプリエンファシス
して（平均的な）１次傾斜を無くしてから従来のマルチ
パルス符号化装置に入力すると、合成された出力音声信
号とこの音声信号が含む量子化雑音との（平均的な）１
次傾斜が一致する。いいかえれば、ノイズシェイピング
がかかり過ぎ、ノイズシェイピングによる等価的なS/N
改善効果が損われてしまう。そのため、従来のマルチパ
ルス符号化装置にプリエンファシスをそのまま適用する
ことはできない。Pre-emphasis is known as a general and rational method for reducing the coefficient sensitivity. Since the speech spectrum has a primary slope of -6 dB / octave on average, if pre-emphasis is performed to compensate for this primary slope before encoding, less information needs to be transmitted from the analysis side to the synthesis side. Alternatively, if the bit transmission rates are the same, the information to be transmitted can be given more redundancy, so that the coefficient sensitivity can be reduced. However, the multi-pulse coding apparatus performs noise shaping as described above, and pre-emphasizes the input speech signal to eliminate the (average) first-order gradient and then inputs the signal to the conventional multi-pulse coding apparatus. Then, the (average) 1 of the synthesized output audio signal and the quantization noise included in the audio signal is obtained.
The next slopes match. In other words, too much noise shaping, equivalent S / N due to noise shaping
The improvement effect is impaired. Therefore, pre-emphasis cannot be directly applied to a conventional multi-pulse encoder.

本発明の目的は、ノイズシェイピングの効果を損うこ
となくプリエンファシスができ、したがって、伝送路誤
りに強いマルチパルス符号化装置を提供することにあ
る。An object of the present invention is to provide a multi-pulse encoding device that can perform pre-emphasis without impairing the effect of noise shaping and is therefore resistant to transmission path errors.

〔課題を解決するための手段〕本発明のマルチパルス符号化装置は、入力音声信号を
線形予測係数抽出手段により分析フレームごとに分析し
て得た線形予測係数によって前記入力音声信号のスペク
トル包絡情報を表現し前記入力音声信号を前記分析フレ
ームごとに前記線形予測係数およびあらかじめ定めた減
衰係数を用いてマルチパルス分析手段により分析し前記
入力音声信号の音源情報の特徴に対応する発生時間位置
および振幅を有する複数のインパルスからなるインパル
ス系列を得てこのインパルス系列によって前記音源情報
を表現するマルチパルス符号化装置において、前記入力
音声信号の平均的なスペクトルの１次傾斜を緩和する第
１の固定フィルタリング手段と前記線形予測係数および
前記減衰係数を用いて前記入力音声信号の前記分析フレ
ームごとのスペクトル包絡を緩和する時変フィルタリン
グ手段とを備え前記入力音声信号のスペクトル構造を変
換して変換結果を前記マルチパルス分析手段へ出力する
スペクトル構造変換手段を含み、前記線形予測係数抽出
手段に前記入力音声信号の平均的なスペクトルの１次傾
斜を補償する第２の固定フィルタリング手段およびこの
第２の固定フィルタリング手段の出力信号を入力する線
形予測手段を備えている。[Means for Solving the Problems] A multi-pulse encoding apparatus according to the present invention is characterized in that an input speech signal is analyzed by a linear prediction coefficient extraction unit for each analysis frame, and a linear prediction coefficient obtained from the spectral envelope information of the input speech signal is obtained. And the input voice signal is analyzed by the multi-pulse analysis means using the linear prediction coefficient and a predetermined attenuation coefficient for each analysis frame, and the occurrence time position and amplitude corresponding to the feature of the sound source information of the input voice signal In a multi-pulse encoding apparatus that obtains an impulse sequence composed of a plurality of impulses having the following and expresses the excitation information by the impulse sequence, a first fixed mode that reduces a first-order gradient of an average spectrum of the input speech signal Using the filtering means and the linear prediction coefficient and the attenuation coefficient before the input audio signal. A time-varying filtering means for relaxing a spectrum envelope for each analysis frame, and a spectrum structure converting means for converting a spectrum structure of the input voice signal and outputting a conversion result to the multi-pulse analyzing means; The extraction means includes second fixed filtering means for compensating for the first-order gradient of the average spectrum of the input voice signal, and linear prediction means for inputting an output signal of the second fixed filtering means.

前記第１の固定フィルタリング手段は、単位遅延手段
と、この単位遅延手段の出力信号をそれぞれの一方の入
力とする第１および第２の掛算手段と、前記第１の掛算
手段の掛算結果を入力信号から減算して減算結果を前記
単位遅延手段へ出力する減算手段と、前記減算結果およ
び前記第２の掛算手段の掛算結果を加算して出力信号と
する加算手段とを有して構成されていてもよい。The first fixed filtering means inputs a unit delay means, first and second multiplication means having the output signal of the unit delay means as one input, and a multiplication result of the first multiplication means. A subtraction means for subtracting the subtraction result from the signal and outputting the subtraction result to the unit delay means; and an addition means for adding the subtraction result and the multiplication result of the second multiplication means to produce an output signal. You may.

又、前記時変フィルタリング手段は、縦続接続したｎ
個（ｎは２以上の整数）の単位遅延手段と、これら単位
遅延手段のそれぞれの出力信号をそれぞれの一方の入力
とするｎ個の第１の掛算手段と、これら第１の掛算手段
の掛算結果を加算する第１の加算手段と、この第１の加
算手段による加算の結果を入力信号から減算して減算結
果を先頭の前記単位遅延手段へ出力する減算手段と、前
記単位遅延手段のそれぞれの出力信号をそれぞれの一方
の入力とするｎ個の第２の掛算手段と、これら第２の掛
算手段の掛算結果ならびに前記減算結果を加算して出力
信号とする第２の加算手段とを有して構成されていても
よい。Further, the time-varying filtering means comprises a cascade-connected n
(N is an integer of 2 or more) unit delay means, n first multiplying means having respective output signals of the unit delay means as one input, and multiplication of these first multiplying means First adding means for adding a result, subtracting means for subtracting the result of the addition by the first adding means from an input signal and outputting a subtraction result to the first unit delay means, And n second multiplying means each of which has one output as the input signal, and second adding means which adds the multiplication result of the second multiplication means and the subtraction result to produce an output signal. It may be configured as:

更に又、前記第２の固定フィルタリング手段は、単位
遅延手段と、この単位遅延手段の出力信号を一方の入力
とする掛算手段と、この掛算手段の掛算結果および前記
単位遅延手段の入力信号を加算して出力信号とする加算
手段とを有して構成されていてもよい。Further, the second fixed filtering means includes a unit delay means, a multiplication means having an output signal of the unit delay means as one input, and a multiplication result of the multiplication means and an input signal of the unit delay means. And an adding means for converting the output signal into an output signal.

〔Example〕

次に、本発明について図面を参照して説明する。 Next, the present invention will be described with reference to the drawings.

第１図は本発明の一実施例の分析側を示すブロック
図、第２図は同実施例の合成側を示すブロック図であ
る。FIG. 1 is a block diagram showing an analysis side of one embodiment of the present invention, and FIG. 2 is a block diagram showing a synthesis side of the embodiment.

第１図に示す分析側は、音声信号Siを入力する遮断周
波数3.4kHzの低域フィルタ１と、低域フィルタ１の出力
信号をサンプリング周波数8kHzでサンプリングし所要の
ビット数に量子化し量子化サンプルとして出力するA/D
変換器２と、入力音声信号の平均的なスペクトルの１次
傾斜を補償するための第２の固定フィルタリング手段を
実現する固定フィルタ31ならびにこの固定フィルタ31に
順次縦続接続した波形切出回路32,LPC分析器33を有しA/
D変換器２の出力信号からＫパラメータk₁〜k_pを抽出し
て出力するLPC係数抽出部３と、LPC係数抽出部３が出力
したＫパラメータk₁〜k_pを量子化するＫ量子化器４と、
Ｋ量子化器４の出力信号を復号してＫパラメータを出力
するＫ復号化器５と、Ｋ復号化器５が出力したＫパラメ
ータをαパラメータα_１〜α_ｐに変換して出力するK/α
変換器６と、αパラメータα_１〜α_ｐならびに減衰係数
γからパラメータγα_１〜γ^ｐα_ｐを生成して出力する
減衰係数印加回路７と、入力音声信号の平均的なスペク
トルの１次傾斜を緩和するための第１の固定フィルタリ
ング手段を実現する固定フィルタ81及びこの固定フィル
タ81に縦続接続され入力音声信号の分析フレーム毎のス
ペクトル包絡を緩和するための時変フィルタリング手段
を実現する時変フィルタ82を有し時変フィルタ82の係数
にαパラメータα_１〜α_ｐならびにパラメータγα_１〜
γ^ｐα_ｐを用いA/D変換器２の出力信号を入力するスペ
クトル構造変換部８と、スペクトル構造変換部８の出力
信号を入力する波形切出回路９と、パラメータγα_１〜
γ^ｐα_ｐを用いて波形切出回路９の出力信号をマルチパ
ルス分析しマルチパルスmpを出力するマルチパルス分析
回路10と、マルチパルスmpの各パルスのうち最大振幅の
パルスの振幅，この振幅で規格化したその他のパルスの
規格化振幅および各パルスの間隔を表すデータを出力す
るマルチパルス量子化器11と、Ｋ量子化器４の出力信号
およびマルチパルス量子化器11の出力データを多重化す
る多重化回路12とを備えて構成されている。The analysis side shown in FIG. 1 includes a low-pass filter 1 having a cut-off frequency of 3.4 kHz for inputting an audio signal Si, and an output signal of the low-pass filter 1 sampled at a sampling frequency of 8 kHz, quantized to a required number of bits, and quantized. A / D output as
A converter 2, a fixed filter 31 for realizing a second fixed filtering means for compensating the first-order gradient of the average spectrum of the input audio signal, and a waveform extracting circuit 32 cascade-connected to the fixed filter 31; A / A with LPC analyzer 33
LPC coefficient extraction unit 3 for extracting and outputting K parameters k _{1 to} k _p from the output signal of D converter 2, and K quantization for quantizing K parameters k _{1 to} k _p output by LPC coefficient extraction unit 3 Vessel 4,
A K decoder 5 for decoding the output signal of the K quantizer 4 and outputting a K parameter; and a K / for converting the K parameter output from the K decoder 5 into α parameters α _{1 to} α _p and outputting the same. α
A converter 6, alpha parameter alpha ₁ to? _P and the damping coefficient application circuit 7 for generating and outputting a parameter γα ₁ ~γ ^_p α ^p from attenuation coefficient gamma, 1 order slope of the average spectrum of the input speech signal Filter 81 that realizes a first fixed filtering means for reducing noise, and a time-varying filter that is cascade-connected to the fixed filter 81 and realizes a time-varying filtering means for reducing the spectral envelope of each analysis frame of the input audio signal. alpha parameter to the coefficients of the time-varying filter 82 has a filter 82 alpha ₁ to? _p and parameters γα ₁ ~
A spectrum structure conversion unit 8 for inputting an output signal of the A / D converter 2 using γ ^p α _p , a waveform cutout circuit 9 for inputting an output signal of the spectrum structure conversion unit 8, and parameters γ α ₁ to
A multi-pulse analysis circuit 10 that performs multi-pulse analysis on an output signal of the waveform extraction circuit 9 and outputs a multi-pulse mp using γ ^p α _p , a maximum amplitude pulse of each pulse of the multi-pulse mp, A multi-pulse quantizer 11 for outputting data representing the normalized amplitude of the other pulses and the intervals between the pulses, and an output signal of the K quantizer 4 and output data of the multi-pulse quantizer 11 And a multiplexing circuit 12.

第２図に示す合成側は、分析側の多重化回路12から出
力され伝送された信号を多重化回路12による多重化の前
の状態に分離する分離回路13と、分離回路13の一方の分
離出力を復号してＫパラメータを出力するＫ復号化器14
と、Ｋパラメータをαパラメータα_１〜α_ｐに変換して
出力するK/α変換器15と、分離回路13の他方の分離出力
からマルチパルスmpを生成するマルチパルス復号化器16
と、αパラメータα_１〜α_ｐを係数としてマルチパルス
mpにより励振されるLPC合成フィルタ17と、LPC合成フィ
ルタの出力端に順次接続された固定フィルタ18,D/A変換
器19および低減フィルタ20とを備えて構成されている。2 includes a separating circuit 13 for separating the signal output from the multiplexing circuit 12 on the analyzing side and transmitted to a state before the multiplexing by the multiplexing circuit 12, and one of the separating circuits 13 K decoder 14 for decoding the output and outputting the K parameter
And a K / α converter 15 for converting the K parameter into α parameters α _{1 to} α _p and outputting the same, and a multi-pulse decoder 16 for generating a multi-pulse mp from the other separated output of the separating circuit 13
And a multi-pulse using α parameters α _{1 to} α _p as coefficients
It comprises an LPC synthesis filter 17 excited by mp, a fixed filter 18, a D / A converter 19, and a reduction filter 20 sequentially connected to the output terminal of the LPC synthesis filter.

第３図は固定フィルタ31のブロック図である。 FIG. 3 is a block diagram of the fixed filter 31.

固定フィルタ31は、入力信号を遅延させる単位遅延回
路311と、単位遅延回路311の出力信号に音声の平均的な
１次自己相関係数（＝0.9）の符号を反転した−を
掛ける掛算器312と、入力信号に掛算器312の出力信号を
加算して出力信号とする加算器313とを有して構成され
ており、A/D変換器２が出力した量子化サンプル系列の
表す音声信号のスペクトルの平均的な１次傾斜を補償し
た、いいかえれば、プリエンファシスした音声信号の量
子化サンプル系列を出力する。波形切出回路32は、固定
フィルタ31が出力した30ms分（240点）の量子化サンプ
ルにハミング窓関数を乗算する窓切出を分析フレーム周
期である20msごとに行う（分析フレーム周波数は50Hzで
ある）。LPC分析器33は、波形切出回路32から入力した
信号を分析フレームごとにLPC分析してＰ次の部分自己
相関係数であるＫパラメータk₁〜k_pを抽出する。したが
って、得られたＫパラメータk₁〜k_pはプリエンファシス
された音声信号のスペクトル包絡情報である。なお、固
定フィルタ31と波形切出回路32との前後関係を第１図に
おけると逆にしてもよい。また、固定フィルタ31はフィ
ルタ係数が１点のトランスバーサルフィルタとして構成
されており、波形切出回路32は240点のハミング係数を
フィルタ係数とするトランスバーサルフィルタで実現で
きるので、１×240＝240点のフィルタ係数をもつトラン
スバーサルフィルタによって固定フィルタ31および波形
切出回路32を置換えることもできる。The fixed filter 31 includes a unit delay circuit 311 for delaying an input signal, and a multiplier 312 for multiplying the output signal of the unit delay circuit 311 by −inverting the sign of an average first-order autocorrelation coefficient (= 0.9) of voice. And an adder 313 that adds the output signal of the multiplier 312 to the input signal and generates an output signal. The adder 313 is configured to output the audio signal represented by the quantized sample sequence output from the A / D converter 2. It outputs a quantized sample sequence of a pre-emphasized audio signal in which the average primary gradient of the spectrum has been compensated, in other words, a pre-emphasized audio signal. The waveform extraction circuit 32 performs window extraction for multiplying the quantized sample for 30 ms (240 points) output from the fixed filter 31 by the Hamming window function every 20 ms which is the analysis frame period (at an analysis frame frequency of 50 Hz). is there). The LPC analyzer 33 performs LPC analysis on the signal input from the waveform extraction circuit 32 for each analysis frame, and extracts K parameters k _{1 to} k _p , which are P-order partial autocorrelation coefficients. Therefore, the obtained K parameters k _{1 to} k _p are the spectral envelope information of the pre-emphasized audio signal. The order of the fixed filter 31 and the waveform extracting circuit 32 may be reversed from that in FIG. Further, the fixed filter 31 is configured as a transversal filter having one filter coefficient, and the waveform extraction circuit 32 can be realized by a transversal filter using a 240-point hamming coefficient as a filter coefficient. The fixed filter 31 and the waveform extracting circuit 32 can be replaced by a transversal filter having filter coefficients at points.

第４図は固定フィルタ81のブロック図である。 FIG. 4 is a block diagram of the fixed filter 81.

固定フィルタ81は、単位遅延回路811と、単位遅延回
路811の出力信号に−γを掛ける掛算器812と、入力信
号から掛算器812の出力信号を減算して減算結果を単位
遅延回路811へ出力する減算器813と、単位遅延回路811
の出力信号に−を掛ける掛算器814と、減算器813およ
び掛算器814の出力信号を加算して出力信号とする加算
器815とを有して構成されている。γは減衰係数であ
り、後に説明する時変フィルタ82が用いる減衰係数と同
じ値にする。単位遅延回路811,掛算器812,加算器813か
らなるループの意味のある最大応答次数をＮ次とする
と、Ｎ個の単位遅延回路816,N個の掛算器817,加算器818
および819からなる、第５図に示す、Ｎ＋１タップのト
ランスバーサルフィルタによって固定フィルタ81を置換
えることもできる。このトランスバーサルフィルタの先
頭タップのタップ係数は１であり、したがって、先頭タ
ップには掛算器はなく、２番目以降の各タップのタップ
係数は順次（γ−１），^２（γ−１）γ，^３（γ
−１）γ²,………^Ｎ（γ−１）γ^N-1に設定する。The fixed filter 81 includes a unit delay circuit 811, a multiplier 812 that multiplies the output signal of the unit delay circuit 811 by −γ, and subtracts the output signal of the multiplier 812 from the input signal and outputs a subtraction result to the unit delay circuit 811. Subtractor 813 and unit delay circuit 811
Of the multiplier 814, and an adder 815 that adds the output signals of the subtractor 813 and the multiplier 814 to produce an output signal. γ is an attenuation coefficient, and has the same value as an attenuation coefficient used by a time-varying filter 82 described later. Assuming that the meaningful maximum response order of the loop including the unit delay circuit 811, the multiplier 812, and the adder 813 is Nth order, N unit delay circuits 816, N multipliers 817, and an adder 818.
The fixed filter 81 can be replaced by an N + 1 tap transversal filter shown in FIG. The tap coefficient of the first tap of this transversal filter is 1, so that there is no multiplier at the first tap, and the tap coefficients of the second and subsequent taps are (γ-1), ² (γ-1) γ , ³ (γ
-1) γ ² ,... ^N (γ-1) γ ^N−1 is set.

第６図は時変フィルタ82のブロック図である。 FIG. 6 is a block diagram of the time-varying filter 82.

時変フィルタ82は、縦続接続したＰ個の単位遅延回路
821と、各単位遅延回路821の出力信号を一方の入力とす
るＰ個の掛算器822と、各掛算器822の出力信号を加算す
る加算器823と、加算器823の加算結果を入力信号から減
算して減算結果を先頭の単位遅延回路821へ出力する減
算器824と、各単位遅延回路821の出力信号を一方の入力
とするＰ個の掛算器825と、減算器824の減算結果ならび
に各掛算器825の掛算結果を加算して出力信号とする加
算器826とを有して構成されている。各掛算器822の他方
の入力として、減衰係数印加回路７からのパラメータγ
α_１〜γ^ｐα_ｐを先頭の掛算器822から順次各掛算器822
に入力する。また、各掛算器825の他方の入力として、K
/α変換器６からのαパラメータα_１〜α_ｐを先頭の掛
算器825から順次各掛算器825に入力する。なお、K/α変
換器６がＫ復号化器５の出力したＫパラメータを用いLP
C係数抽出部３からのＫパラメータk₁〜k_pを用いないの
は、ｋ量子化器４による量子化歪を含むＫパラメータを
分析側で用いることによりこの量子化歪が合成側に与え
る影響を相殺するためである。The time-varying filter 82 is composed of P unit delay circuits connected in cascade.
821, P multipliers 822 having the output signal of each unit delay circuit 821 as one input, an adder 823 for adding the output signal of each multiplier 822, and the addition result of the adder 823 from the input signal. A subtractor 824 for subtracting and outputting the subtraction result to the first unit delay circuit 821, a P number of multipliers 825 having the output signal of each unit delay circuit 821 as one input, a subtraction result of the subtractor 824 and each And an adder 826 for adding the multiplication results of the multiplier 825 to produce an output signal. As the other input of each multiplier 822, the parameter γ from the attenuation coefficient applying circuit 7 is input.
α _{1 to} γ ^p α _p are sequentially assigned to each multiplier 822 from the first multiplier 822.
To enter. Also, as the other input of each multiplier 825, K
The α parameters α _{1 to} α _p from the / α converter 6 are sequentially input to each multiplier 825 from the first multiplier 825. Note that the K / α converter 6 uses the K parameter output from the K
The reason why the K parameters k _{1 to} k _p from the C coefficient extraction unit 3 are not used is that the K side including the quantization distortion by the k quantizer 4 is used on the analysis side, and the influence of this quantization distortion on the synthesis side is obtained. In order to cancel out.

A/D変換器２が出力した量子化サンプル系列は、固定
フィルタ81および時変フィルタ82によってフィルタリン
グされスペクトル構造変換される。減衰係数γはこのス
ペクトル構造変換によるスペクトル包絡の変化の抑圧の
程度を決める定数であり、もしγ＝１とすれば時変フィ
ルタ82の出力信号は入力信号と同時になり、γ＝０とす
れば出力信号は残差信号の量子化サンプル系列になって
しまう。０＜γ＜１の範囲でγの最適値を選ぶ。本実施
例ではγ＝0.8とする。The quantized sample sequence output from the A / D converter 2 is filtered by a fixed filter 81 and a time-varying filter 82 and is subjected to spectral structure conversion. The attenuation coefficient γ is a constant that determines the degree of suppression of the change in the spectral envelope due to the spectral structure conversion. If γ = 1, the output signal of the time-varying filter 82 becomes the same as the input signal, and if γ = 0, The output signal becomes a quantized sample sequence of the residual signal. An optimal value of γ is selected in the range of 0 <γ <1. In this embodiment, γ = 0.8.

固定フィルタ81は、A/D変換器２が出力した量子化サ
ンプル系列を固定フィルタ31とは異った伝達関数でフィ
ルタリングする。第４図と第６図の回路構成の比較から
わかるように、時変フィルタ82が平均的に音声スペクト
ルの１次傾斜をある程度補償するのと同程度に固定フィ
ルタ81は１次傾斜を補償し、完全には平坦化しないの
で、固定フィルタ81および時変フィルタ82によるセペク
トル構造変換によってノイズシェイピングがかかり過ぎ
ることはなく、また、固定フィルタ81自身である程度の
（部分的な）プリエンファシス効果をもっている。な
お、固定フィルタ81と時変フィルタ82との前後関係を第
１図におけると逆にしてもよい。The fixed filter 81 filters the quantized sample sequence output from the A / D converter 2 with a transfer function different from that of the fixed filter 31. As can be seen from a comparison of the circuit configurations of FIGS. 4 and 6, the fixed filter 81 compensates for the primary slope to the same extent as the time-varying filter 82 compensates for the primary slope of the voice spectrum to some extent on average. Is not completely flattened, so that noise shaping is not excessively applied by the spectral structure conversion by the fixed filter 81 and the time-varying filter 82, and the fixed filter 81 itself has a certain (partial) pre-emphasis effect. . Note that the order of the fixed filter 81 and the time-varying filter 82 may be reversed from that in FIG.

スペクトル構造変換部８によって部分的なプリエンフ
ァシスを含むスペクトル構造変換を受けた量子化サンプ
ル系列は、波形切出回路９により分析フレームごとに切
出され、マルチパルス分析回路10に入力される。波形切
出回路９による窓切出は、窓関数として25ms幅の矩形波
形を用いることを除いては、波形切出回路32による窓切
出と同じである。マルチパルス分析回路10は、波形切出
回路９からの信号が表す音声信号の残差信号の波形を極
力正確に表現するように、適当な位置，振幅，極性のイ
ンパルスの系列をマルチパルスmpとして発生する。この
発生は、波形切出回路９から供給される信号が表す波形
のスペクトル包絡を表現するフィルタのインパルス応答
を利用する公知の方法により行われる。このスペクトル
包絡は、パラメータγα_１〜γ^ｐα_ｐで規定される全極
フィルタとマクロな意味での（不完全な）スペクトル包
絡フィルタである１次の全極フィルタとの組合せで表現
可能である。この両フィルタの組合せのインパルス応答
の代りにパラメータγα_１〜γ^ｐα_ｐで規定される全極
フィルタのみのインパルス応答を利用しても実用上は十
分である。マルチパルス分析回路10は、減衰係数印加回
路７から供給されるパラメータγα_１〜γ^ｐα_ｐを用い
てマルチパルスmpを発生している。マルチパルス量子化
器11で得たマルチパルスmpの情報は、Ｋ量子化器４によ
って量子化されたＫパラメータと多重化回路12で多重化
され、合成側へ伝送される。The quantized sample sequence that has undergone spectral structure conversion including partial pre-emphasis by the spectral structure conversion unit 8 is cut out for each analysis frame by the waveform cutout circuit 9 and is input to the multi-pulse analysis circuit 10. The window extraction by the waveform extraction circuit 9 is the same as the window extraction by the waveform extraction circuit 32 except that a rectangular waveform having a width of 25 ms is used as a window function. The multi-pulse analysis circuit 10 uses a sequence of impulses of an appropriate position, amplitude, and polarity as a multi-pulse mp so as to represent the waveform of the residual signal of the audio signal represented by the signal from the waveform extraction circuit 9 as accurately as possible. Occur. This generation is performed by a known method using an impulse response of a filter expressing a spectral envelope of a waveform represented by a signal supplied from the waveform extracting circuit 9. This spectral envelope can be expressed by a combination of an all-pole filter defined by parameters γα _{1 to} γ ^p α _p and a first-order all-pole filter which is a (incomplete) spectral envelope filter in a macro sense. . Both parameters instead of the filter combinations of impulse responses γα ₁ ~γ ^p be utilized impulse response of the all-pole filter only defined by alpha _p practically sufficient. Multi-pulse analysis circuit 10 generates a multi pulse mp using parameters γα ₁ ~γ ^_p α ^p supplied from the damping coefficient application circuit 7. The information of the multipulse mp obtained by the multipulse quantizer 11 is multiplexed by the multiplexing circuit 12 with the K parameter quantized by the K quantizer 4 and transmitted to the combining side.

合成側では、伝送されてきた信号から得たマルチパル
スmpとα_１〜α_ｐとを用いて、まず、固定フィルタ31の
出力信号に対応する量子化サンプル系列を得る。マルチ
パルス復号化器16が出力したマルチパルスmpをパラメー
タγα_１〜γ^ｐα_ｐで規定される全極フィルタでフィル
タリングしてスペクトル構造変換部８の出力信号に対応
する量子化サンプル系列を得る操作は、この量子化サン
プル系列から固定フィルタ31の出力信号に対応する量子
化サンプル系列を得る操作の一部と相殺し、（残りの操
作である）マルチパルスmpをLPC合成フィルタ17でフィ
ルタリングすることのみによって固定フィルタ31の出力
信号に対応する量子化サンプル系列が得られる。得られ
た量子化サンプル系列は固定フィルタ18に入力される。On the combining side, first, a quantized sample sequence corresponding to the output signal of the fixed filter 31 is obtained using the multipulse mp and α _{1 to} α _p obtained from the transmitted signal. An operation of filtering the multi-pulse mp output from the multi-pulse decoder 16 with an all-pole filter defined by parameters γα _{1 to} γ ^p α _p to obtain a quantized sample sequence corresponding to the output signal of the spectrum structure conversion unit 8 Offsets a part of the operation of obtaining a quantized sample sequence corresponding to the output signal of the fixed filter 31 from the quantized sample sequence, and filters the multipulse mp (the remaining operation) by the LPC synthesis filter 17. Only with this, a quantized sample sequence corresponding to the output signal of the fixed filter 31 is obtained. The obtained quantized sample sequence is input to the fixed filter 18.

第７図は固定フィルタ18のブロック図である。 FIG. 7 is a block diagram of the fixed filter 18.

固定フィルタ18は、出力信号を遅延させる単位遅延回
路181と、単位遅延回路181の出力信号に−を掛ける掛
算器182と、入力信号から掛算器182の出力信号を減算し
減算結果を出力信号とする減算器183とを有して構成さ
れており、固定フィルタ31によるプリエンファシスに対
応するデエンファシス特性で、入力する量子化サンプル
系列をデエンファシスし、分析側でD/A変換器２が出力
した量子化サンプル系列に対応する量子化サンプル系列
を出力する。この量子化サンプル系列はD/A変換器19で
アナログ化され、低域フィルタ20で不要高域成分が除去
されて音声信号S₀となる。音声信号S₀が含む量子化雑音
は、スペクトル構造変換部８によるスペクトル構造変換
によって生じたスペクトル包絡の変化分と同じ形状のス
ペクトル包絡をもつ、いいかえれば所要のノイズシェイ
ピングがなされている。The fixed filter 18 includes a unit delay circuit 181 that delays the output signal, a multiplier 182 that multiplies the output signal of the unit delay circuit 181 by −, and a subtraction result of the output signal of the multiplier 182 from the input signal and an output signal. And a de-emphasis characteristic corresponding to the pre-emphasis by the fixed filter 31 to de-emphasize the input quantized sample sequence, and output the D / A converter 2 on the analysis side. A quantized sample sequence corresponding to the quantized sample sequence thus output is output. The quantized samples sequence is an analog of the D / A converter 19, unnecessary high-frequency components in the low pass filter 20 is removed the audio signal S ₀ with. The quantization noise included in the audio signal S ₀ has a spectrum envelope having the same shape as the change in the spectrum envelope generated by the spectrum structure conversion by the spectrum structure conversion unit 8, in other words, required noise shaping is performed.

〔The invention's effect〕

以上説明したように本発明は、第２の固定フィルタリ
ング手段および線形予測手段を備えた線形予測係数抽出
手段によってプリエンファシスした音声信号の線形予測
係数を抽出し、また、スペクトル構造変換手段が含む第
１の固定フィルタリング手段および時変フィルタリング
手段によって音声信号に部分的なプリエンファシスを含
むスペクトル構造変換を施してからマルチパルス分析し
てマルチパルスを得ることにより、ノイズシェイピング
による等価的なS/N改善の効果を損なうことなくプリエ
ンファシスができるので、マルチパルス符号化装置を伝
送路誤りに対して強くできる効果がある。As described above, the present invention extracts a linear prediction coefficient of a speech signal pre-emphasized by a linear prediction coefficient extraction means provided with a second fixed filtering means and a linear prediction means, (1) Speech signal is subjected to spectral structure conversion including partial pre-emphasis by a fixed filtering means and a time-varying filtering means, and then multi-pulse analysis is performed to obtain multi-pulses, thereby enabling equivalent S / N improvement by noise shaping. Since pre-emphasis can be performed without impairing the effect of the above, there is an effect that the multi-pulse encoder can be made robust against transmission path errors.

[Brief description of the drawings]

第１図は本発明の一実施例の分析側を示すブロック図、
第２図は同実施例の合成側を示すブロック図、第３図は
第１図に示す分析側における固定フィルタ31のブロック
図、第４図は同じく固定フィルタ81のブロック図、第５
図は固定フィルタ81と置換え得るトランスバーサルフィ
ルタのブロック図、第６図は第１図に示す分析側におけ
る時変フィルタ82のブロック図、第７図は第２図に示す
合成側における固定フィルタ18のブロック図である。１……低域フィルタ、２……A/D変換器、３……LPC係数
抽出部、４……ｋ量子化器、５……ｋ復号化器、６……
k/α変換器、７……減衰係数印加回路、８……スペクト
ル構造変換部、９……波形切出回路、10……マルチパル
ス分析回路、11……マルチパルス量子化器、12……多重
化回路、13……分離回路、14……ｋ復号化器、15……k/
α変換器、16……マルチパルス復号化器、17……LPC合
成フィルタ、18……固定フィルタ、19……D/A変換器、2
0……低域フィルタ、31……固定フィルタ、32……波形
切出回路、33……LPC分析器、81……固定フィルタ、82
……時変フィルタ。FIG. 1 is a block diagram showing an analysis side of one embodiment of the present invention,
FIG. 2 is a block diagram showing the synthesis side of the embodiment, FIG. 3 is a block diagram of the fixed filter 31 on the analysis side shown in FIG. 1, FIG.
FIG. 6 is a block diagram of a transversal filter that can replace the fixed filter 81, FIG. 6 is a block diagram of a time-varying filter 82 on the analysis side shown in FIG. 1, and FIG. 7 is a fixed filter 18 on the synthesis side shown in FIG. It is a block diagram of. 1 low-pass filter, 2 A / D converter, 3 LPC coefficient extraction unit, 4 k quantizer, 5 k decoder, 6
k / α converter, 7: Attenuation coefficient application circuit, 8: Spectral structure converter, 9: Waveform extraction circuit, 10: Multipulse analysis circuit, 11: Multipulse quantizer, 12 ... Multiplexing circuit, 13 ... separating circuit, 14 ... k decoder, 15 ... k /
α converter, 16 multi-pulse decoder, 17 LPC synthesis filter, 18 fixed filter, 19 D / A converter, 2
0: Low-pass filter, 31: Fixed filter, 32: Waveform extraction circuit, 33: LPC analyzer, 81: Fixed filter, 82
...... Time-varying filter.

Claims

(57) [Claims]

An input speech signal which is analyzed by a linear prediction coefficient extracting means for each analysis frame to express spectral envelope information of the input speech signal, and the input speech signal is expressed for each analysis frame; Using a linear prediction coefficient and a predetermined attenuation coefficient, the multi-pulse analysis means performs analysis to obtain an impulse sequence including a plurality of impulses having an occurrence time position and an amplitude corresponding to the characteristics of the sound source information of the input audio signal. In a multi-pulse encoding device that expresses the sound source information by an impulse sequence, a first fixed filtering unit that reduces a first-order gradient of an average spectrum of the input speech signal and the linear prediction coefficient and the attenuation coefficient are used. A time-varying filter for alleviating the spectral envelope of the input audio signal for each analysis frame. Filtering means for converting the spectral structure of the input voice signal and outputting a conversion result to the multi-pulse analyzing means, wherein the linear prediction coefficient extracting means includes an average spectrum of the input voice signal. A second fixed filtering means for compensating for the first-order slope and a linear prediction means for inputting an output signal of the second fixed filtering means.

2. The first fixed filtering means includes a unit delay means, first and second multiplication means each having one input of an output signal of the unit delay means, and a first multiplication means. Subtraction means for subtracting the multiplication result from the input signal and outputting the subtraction result to the unit delay means; and addition means for adding the subtraction result and the multiplication result of the second multiplication means to obtain an output signal. The multi-pulse encoding device according to claim 1, wherein the multi-pulse encoding device is configured as follows.

3. The time-varying filtering means includes cascaded n (n is an integer of 2 or more) unit delay means, and n output signals of each of the unit delay means having one input thereof. A first multiplying means, a first adding means for adding the multiplication results of the first multiplying means, and a result of the addition by the first adding means being subtracted from the input signal, and the result of the subtraction being the first one. Subtracting means for outputting to the unit delay means, n second multiplying means having each output signal of the unit delay means as one input, a multiplication result of the second multiplication means and the subtraction result 2. The multi-pulse encoding apparatus according to claim 1, further comprising: a second adding means for adding the second signal to an output signal.

4. The second fixed filtering means includes a unit delay means, a multiplication means having an output signal of the unit delay means as one input, a multiplication result of the multiplication means, and an input signal of the unit delay means. 2. The multi-pulse encoding apparatus according to claim 1, further comprising an adding unit that adds the signals to output signals.