JP2832942B2

JP2832942B2 - Multi-pulse encoder

Info

Publication number: JP2832942B2
Application number: JP63137749A
Authority: JP
Inventors: 哲田口; 繁治池田; 昌浩宮崎
Original assignee: Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1988-06-03
Filing date: 1988-06-03
Publication date: 1998-12-09
Anticipated expiration: 2013-12-09
Also published as: JPH01306899A

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明はマルチパルス型符号化装置に関し、特にパル
ス検索が容易にでき装置が簡単なピッチ予測型のマルチ
パルス型符号化装置に関する。Description: BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a multi-pulse coding apparatus, and more particularly, to a pitch prediction type multi-pulse coding apparatus which can easily search for pulses and has a simple apparatus.

[Conventional technology]

音声波形は、いわゆる近接予測性とピッチ予測性との
２つの線形予測性を有することが知られている。初期の
マルチパルス符号化装置は、これらのうち近接予測性の
みを利用して、音声の符号化を実施する構成となってい
た。It is known that a speech waveform has two linear predictabilities, that is, a so-called proximity predictability and a pitch predictability. Early multi-pulse encoding devices were configured to encode speech using only the proximity predictability among them.

従来の近接予測性のみを利用した非ピッチ予測型マル
チパルス型符号化装置では、マルチパルスは近接予測フ
ィルタのインパルス応答と、被分析音声波形との相互相
関係数を介して決定されている。また、近年マルチパル
ス符号化装置の符号化効率を上昇させるために、この装
置にピッチ予測を利用する事が試みられている。In a conventional non-pitch prediction type multi-pulse coding apparatus using only proximity predictability, a multi-pulse is determined via a cross-correlation coefficient between an impulse response of a proximity prediction filter and a speech waveform to be analyzed. In recent years, in order to increase the coding efficiency of a multi-pulse coding device, attempts have been made to use pitch prediction in this device.

従来、この種のピッチ予測を行うマルチパルス型符号
化装置においては、線形予測分析（以下LPC分析と記
す）による近接予測フィルタとピッチ予測フィルタとの
縦続接続を利用して音声を合成している。従ってこの２
つの縦続接続されたフィルタ総合のインパルス応答を算
出し、従来の装置における近接予測フィルタのインパル
ス応答に代えて相互相関係数の算出を行い、マルチパル
スの検索を行っていた。Conventionally, in a multi-pulse coding apparatus that performs this kind of pitch prediction, speech is synthesized using a cascade connection of a proximity prediction filter and a pitch prediction filter by linear prediction analysis (hereinafter, referred to as LPC analysis). . Therefore, this 2
The impulse response of two filters connected in cascade is calculated, the cross-correlation coefficient is calculated in place of the impulse response of the proximity prediction filter in the conventional device, and multi-pulse search is performed.

また、従来のピッチ予測型マルチパルス符号化の方法
では、インパルス応答の実質的持続時間が長く、例えば
20msのフレーム長に対して数倍から十倍に達することも
あるので、マルチパルス符号化と整合しない点があっ
た。Further, in the conventional pitch prediction type multi-pulse encoding method, the substantial duration of the impulse response is long, for example,
Since it can reach several times to ten times the frame length of 20 ms, there is a point that it does not match with the multi-pulse coding.

[Problems to be solved by the invention]

上述した従来のマルチパルス型符号化装置は、マルチ
パルスを検索するにあたってLPC分析による近接フィル
タとピッチ予測フィルタとが縦続して使用されているの
で、演算回数が著しく増大するという欠点がある。The conventional multi-pulse coding apparatus described above has a drawback that the number of operations is significantly increased because a proximity filter and a pitch prediction filter based on LPC analysis are used in cascade in searching for a multi-pulse.

[Means for solving the problem]

本発明のマルチパルス型符号化装置は、入力された音
声信号を符号化した符号化信号を伝送し復号化して出力
するマルチパルス型符号化装置において、前記音声信号
からピッチ情報を抽出しピッチ周期信号とピッチ利得係
数信号とを出力するピッチ抽出手段と、前記音声信号を
フレーム時間長ごとにピッチ周期で区切り前記フレーム
時間長のいずれか一方の端からピッチ周期で区切られた
時間域のうち一つの時間域の音声信号と隣接したその手
前の時間域の音声信号にピッチ利得係数を乗じた音声信
号とを加える操作を順次前記フレーム時間長の他端に達
するまで行って符号化するべき音声信号を出力する線形
加算手段と、加算前の音声信号を用いて加算後の音声信
号より求めたマルチパルスの振幅を補正する振幅補正手
段とを備えている。A multi-pulse coding apparatus according to the present invention is a multi-pulse coding apparatus that transmits, decodes, and outputs an encoded signal obtained by encoding an input audio signal. Pitch extracting means for outputting a signal and a pitch gain coefficient signal; and one of a time domain separated by a pitch cycle from one end of the frame time length by dividing the audio signal by a pitch cycle for each frame time length. An audio signal to be encoded by sequentially performing operations of adding one audio signal in one time domain and an audio signal obtained by multiplying an adjacent audio signal in the previous time domain by a pitch gain coefficient until reaching the other end of the frame time length. And amplitude correction means for correcting the amplitude of the multi-pulse obtained from the audio signal after addition using the audio signal before addition.

〔Example〕

次に、本発明について図面を参照して説明する。本実
施例は特にスペクトル強調を前提としたマルチパルス符
号化装置を対象としているが、これは本発明の本質に係
わらない。Next, the present invention will be described with reference to the drawings. Although the present embodiment is particularly directed to a multi-pulse encoding device on the premise of spectrum enhancement, this is not related to the essence of the present invention.

第１図は本発明の一実施例の構成を示すブロック図、
第２図は本発明を構成するスペクトル強調フィルタの一
例を示すブロック図、第３図（ａ），（ｂ）はそれぞれ
本発明のピッチ加算の作動を示すスペクトル強調された
音声信号の波形図で、（ａ）は入力音声波形図、（ｂ）
は（ａ）に示した入力音声をピッチ加算の前処理をした
入力音声波形図である。FIG. 1 is a block diagram showing the configuration of one embodiment of the present invention,
FIG. 2 is a block diagram showing an example of a spectrum emphasis filter constituting the present invention, and FIGS. 3 (a) and 3 (b) are waveform diagrams of a spectrum-emphasized audio signal showing the operation of pitch addition of the present invention. , (A) is an input voice waveform diagram, (b)
FIG. 6 is an input voice waveform diagram obtained by preprocessing pitch addition of the input voice shown in FIG.

まず、本発明の概要について第２図，第３図（ａ），
（ｂ）を用いて説明する。First, the outline of the present invention will be described with reference to FIGS.
This will be described with reference to FIG.

本発明は、音声波形のピッチ周期に基づき音声波形の
線形加算を行うピッチ加算部を設けることにより、伝送
帯域幅が縮小できかつ線形予測を行う演算回数を減少さ
せてマルチパルス信号を得ると共に、そのマルチパルス
信号を量子化することによって、従来より構成を簡単化
するものである。The present invention provides a multi-pulse signal by providing a pitch addition unit that performs linear addition of an audio waveform based on the pitch period of the audio waveform, thereby reducing the number of calculations that can reduce the transmission bandwidth and performing linear prediction, and By quantizing the multi-pulse signal, the configuration is simplified as compared with the related art.

そこで入力音声信号の概周期がほぼ一定であることに
着目し、本発明はピッチ周期を利用することとしたもの
である。Therefore, focusing on the fact that the approximate period of the input audio signal is substantially constant, the present invention utilizes the pitch period.

しかし、ピッチ予測と近接予測との２箇の線形予測を
行うので、演算回数を減少させてマルチパルス信号を得
るつぎのような方法が採用されている。However, since two linear predictions, pitch prediction and proximity prediction, are performed, the following method of obtaining a multipulse signal by reducing the number of calculations is employed.

従って、ここでは演算回数を減少させてマルチパルス
信号を得るため、音声信号はフレーム周期ごとにピッチ
周期で区切られ、フレームの端部からピッチ周期で区切
られた時間域の音声信号は、順次隣接したフレーム端部
側の時間域の音声信号にピッチ利得係数ｇを乗じた音声
信号をも加えられて線形加算音声信号が出力される。Therefore, in this case, in order to obtain a multi-pulse signal by reducing the number of operations, the audio signal is divided by the pitch period for each frame period, and the audio signals in the time domain separated by the pitch period from the end of the frame are sequentially adjacent. A speech signal obtained by multiplying the speech signal in the time domain on the frame end side by the pitch gain coefficient g is also added, and a linearly added speech signal is output.

かようにした入力音声信号からマルチパルス信号を生
成している。すなわち、音声信号が第３図（ａ）に示す
ように、フレーム周期Ｔごとにピッチ周期信号から得
られるピッチ周期Ｐで区切られた時間域の音声信号e₁〜
e_nが供給される。A multi-pulse signal is generated from the input audio signal. In other words, as the audio signal is shown in FIG. 3 (a), the audio signal e ₁ of time-domain separated by a pitch period P derived from the pitch period signal in each frame period T ~
e _n are supplied.

さらにここで、第３図（ｂ）に示すようにフレームの
端部の１番目の時間域の線形加算音声信号E₁は１番目の
時間域の音声信号e₁と同一となる。フレームの端部から
２番目の線形加算音声信号E₂は、同一時間域の音声信号
e₂とフレーム端部から１番目の時間域の音声信号e₁とピ
ッチ利得制御係数ｇを乗じたge₁との和と同一となる。
従って、これが順次行われ第３図（ｂ）に示す線形加算
音声信号E₁〜E_nが得られる。Here, as shown in FIG. 3 (b), the linearly added audio signal E1 in the _first time domain at the end of the frame is the same as the audio signal e1 in the _first time domain. The second linearly added audio signal E ₂ from the end of the frame is an audio signal in the same time domain
the same as the sum of the ge ₁ multiplied by the audio signal e ₁ and the pitch gain control factor g of the first time domain from e ₂ and the frame end.
Therefore, this linear sum audio signals E ₁ to E _n is obtained as shown in FIG. 3 are performed in sequence (b).

次に、スペクトルの強調を行うスペクトル強調フィル
タについて説明する。Next, a spectrum enhancement filter that enhances the spectrum will be described.

スペクトル強調フィルタでは、標本化された入力音声
信号に対してLPC分析を通してLPC係数αli（ｉ＝1,2…
…Ｐ）が求められる。次にあらかじめ設定された減衰係
数γ（０〜１の間で経験的に定められる）と、LPC係数
α_iとから、伝達関数がで表わされるフィルタで、入力音声信号のスペクトルの
うち大きな振幅のスペクトルの強調が行われ、スペクト
ル強調されて線スペクトルに近い音声信号が得られる。In the spectrum emphasis filter, LPC coefficients αli (i = 1, 2,.
... P) is required. Next, the transfer function is obtained from the preset attenuation coefficient γ (determined empirically between 0 and 1) and the LPC coefficient α _i. The filter represented by ## EQU1 ## emphasizes a spectrum having a large amplitude in the spectrum of the input speech signal, and obtains a speech signal close to a line spectrum by spectrum emphasis.

具体的には第２図に示すように、入力した音声信号が
加算器A₁を通してＰ個縦続接続された伝達関数Z^-1のフ
ィルタ素子C₁〜C_pに供給され、それぞれの素子の出力が
乗算器M₁〜M_pに印加され、ここでそれぞれLPC係数α_iと
あらかじめ設定された減衰係数γⁱとが乗算され、この
出力が加算器A₂で加算され、さらに加算器A₁で入力され
た音声信号に加算される。かような接続において、加算
器A₁の出力すなわち、伝達関数Z^-1のフィルタ素子C₁の
入力が、スペクトル強調された音声信号の出力となる。Specifically, as shown in FIG. 2, an input audio signal is supplied to filter elements C _{1 to} C _p of a transfer function Z ⁻¹ connected in cascade through an adder A ₁ , and the output of each element is output. Is applied to multipliers M _{1 to} M _p , where the LPC coefficient α _i is multiplied by a preset attenuation coefficient γ ^{i, respectively,} and the output is added by adder A ₂ , and further added by adder A ₁ It is added to the input audio signal. In such Such connections, the output of the adder A ₁ i.e., the input of the filter elements C ₁ of the transfer function Z ^-1, the output of the spectrum highlighted audio signal.

次に、本発明の一実施例についてその構成と作動を第
１図を参照して説明する。Next, the configuration and operation of an embodiment of the present invention will be described with reference to FIG.

第１図において、本実施例は、LPC分析手段１と、ス
ペクトル強調手段２と、波形符号化手段３と、波形復号
化手段４と、LPC合成部５と、振幅補正手段６を備えて
構成される。以下各構成素子についてその内部構成の一
例を示し、それぞれの詳細を説明する。1, this embodiment includes an LPC analysis unit 1, a spectrum emphasis unit 2, a waveform encoding unit 3, a waveform decoding unit 4, an LPC synthesis unit 5, and an amplitude correction unit 6. Is done. An example of the internal configuration of each component will be described below, and details of each component will be described.

標本化された入力音声信号100は、LPC分析手段１の波
形切出部11と、スペクトル強調手段２の波形切出部17
と、波形符号化手段３のピッチ抽出部31とにそれぞれ入
力される。The sampled input audio signal 100 is supplied to a waveform extracting section 11 of the LPC analyzing section 1 and a waveform extracting section 17 of the spectrum emphasizing section 2.
, And the pitch extraction unit 31 of the waveform encoding unit 3.

LPC分析手段１は、波形切出部11とLPC分析部12とを備
え、波形切出部11では入力音声信号100は例えば20msご
とに30ms分の信号が切出され、その切出しが滑らかにな
るようにハミング窓を通されたフレーム信号がLPC分析
部12に出力される。LPC分析部12では、入力されたフレ
ーム信号が分析され、その結果として最大次数Ｐ次まで
のLPC係数信号101が、スペクトル強調手段２のＫ量子化
符号化部13に入力される。The LPC analysis means 1 includes a waveform cutout unit 11 and an LPC analysis unit 12. In the waveform cutout unit 11, a signal of 30 ms is cut out from the input audio signal 100 every 20 ms, for example, and the cut out becomes smooth. The frame signal that has passed through the hamming window is output to the LPC analysis unit 12. In the LPC analysis unit 12, the input frame signal is analyzed, and as a result, the LPC coefficient signal 101 up to the maximum order P is input to the K quantization encoding unit 13 of the spectrum emphasis unit 2.

スペクトル強調手段２は、Ｋ量子化復号化部13と、Ｋ
・α変換部14と、減衰係数印加部15と、スペクトル強調
フィルタ16と、波形切出部17とを備えている。The spectrum enhancement means 2 includes a K quantization / decoding section 13 and a K
-It has an α conversion unit 14, an attenuation coefficient application unit 15, a spectrum emphasis filter 16, and a waveform cutout unit 17.

LPC係数信号101が入力されたＫ量子化復号化部13で
は、量子化されたLPC係数信号104が波形符号化手段３の
多重化合成部35に出力されると共に、量子化され復号化
されたLPC係数信号がＫ・α変換部14に出力される。Ｋ
・α変換部14では、復号化されたLPC係数信号が入力さ
れ、LPC係数α_iを表わすLPC係数信号が、減衰係数印加
部15に出力される。In the K quantization decoding unit 13 to which the LPC coefficient signal 101 has been input, the quantized LPC coefficient signal 104 is output to the multiplexing / synthesizing unit 35 of the waveform encoding unit 3 and is quantized and decoded. The LPC coefficient signal is output to the K / α conversion unit 14. K
In the α conversion unit 14, the decoded LPC coefficient signal is input, and the LPC coefficient signal representing the LPC coefficient α _i is output to the attenuation coefficient application unit 15.

LPC係数α_iを表わすLPC係数信号が入力された減衰係
数印加部15では、あらかじめ経験的に選ばれた減衰係数
γⁱとLPC係数α_iとの積が算出され、α_i・γⁱを表わす
信号すなわちα・γ信号103が、スペクトル強調フィル
タ16へ出力されると同時に、波形符号化手段３のインパ
ルス応答部20とスペクトル強調フィルタ21へも出力され
る。In the attenuation coefficient applying unit 15 to which the LPC coefficient signal representing the LPC coefficient α _i is input, the product of the attenuation coefficient γ ⁱ and the LPC coefficient α _i empirically selected in advance is calculated and represents α _i · γ ⁱ The signal, that is, the α · γ signal 103 is output to the spectrum emphasizing filter 16 and simultaneously to the impulse response section 20 and the spectrum emphasizing filter 21 of the waveform encoding means 3.

一方、入力音声信号100が入力された波形切出部17で
は、フレームごとの切出しが、波形切出部11のフレーム
信号と同期して例えば20msごとに40msの切出しが行われ
る。切出された入力音声信号は、スペクトル強調フィル
タ16で、各フレームごとの入力音声信号のうち大きな振
幅のスペクトルの強調が行われ、スペクトル強調された
音声信号102は波形符号化手段３のピッチ加算部27と振
幅補正手段６の最大値検索部61とへ出力される。On the other hand, in the waveform extracting unit 17 to which the input audio signal 100 is input, the extracting for each frame is performed in synchronization with the frame signal of the waveform extracting unit 11, for example, every 40 ms, for example, every 40 ms. The cut-out input audio signal is subjected to spectrum emphasis by a spectrum emphasis filter 16 to emphasize the spectrum of a large amplitude in the input audio signal for each frame. It is output to the unit 27 and the maximum value search unit 61 of the amplitude correction means 6.

次に、波形符号化手段３について説明する。波形符号
化手段３は、インパルス応答部20と、スペクトル強調フ
ィルタ21と、一時メモリ24と、最大値検索補正部25と、
パルス量子化部26と、ピッチ加算部27と、ピッチ抽出部
31と、ピッチ利得量子化復号化部32と、多重化合成部35
とを備えている。Next, the waveform encoding means 3 will be described. The waveform encoding unit 3 includes an impulse response unit 20, a spectrum emphasis filter 21, a temporary memory 24, a maximum value search correction unit 25,
Pulse quantizer 26, pitch adder 27, pitch extractor
31, a pitch gain quantization decoding unit 32, and a multiplexing / synthesizing unit 35
And

LPC係数α_iと減衰係数γⁱとの積で表わされるα・γ
信号103が、スペクトル強調フィルタ21とインパルス応
答部20とに出力される。インパルス応答部20では、α・
γ信号103で決定されるLPC合成フィルタにインパルス信
号が入力され、その応答信号がスペクトル強調フィルタ
21へ入力される。スペクトル強調フィルタ21は、入力さ
れたα・γ信号103により決定される伝送特性を有する
フィルタが形成されており、別にインパルス応答部20か
ら入力されたLPC合成フィルタのインパルス応答信号
が、このフィルタによりスペクトル強調されて、その結
果出力されるインパルス応答信号110が最大値検索補正
部25に出力される。Α ・ γ expressed by the product of LPC coefficient α _i and damping coefficient γ ⁱ
Signal 103 is output to spectrum emphasis filter 21 and impulse response section 20. In the impulse response section 20, α ·
The impulse signal is input to the LPC synthesis filter determined by the γ signal 103, and the response signal is
Entered into 21. The spectrum emphasis filter 21 is formed with a filter having a transmission characteristic determined by the input α / γ signal 103, and the impulse response signal of the LPC synthesis filter separately input from the impulse response unit 20 is used by this filter. The spectrum is emphasized, and the resulting impulse response signal 110 is output to the maximum value search correction unit 25.

一方、標本化された入力音声信号100が入力されたピ
ッチ抽出部31では、例えば自己相関処理などにより、入
力音声信号のピッチが抽出され、これを示すピッチ周期
信号112がピッチ加算部27と多重化合成部35へ出力され
ると共に、ピッチ利得係数（ピッチ予測係数）を表わす
ピッチ利得信号がピッチ利得量子化復号化部32へ出力さ
れる。ピッチ利得量子化復号化部32では、入力されたピ
ッチ利得信号が量子化されたピッチ利得信号113となっ
て多重化合成部35へ出力されると共に、ピッチ利得信号
が量子化され復号化されたピッチ利得信号となってピッ
チ加算部27へ出力される。On the other hand, in the pitch extraction unit 31 to which the sampled input audio signal 100 has been input, the pitch of the input audio signal is extracted by, for example, autocorrelation processing, and the pitch period signal 112 indicating this is multiplexed with the pitch addition unit 27. The pitch gain signal representing the pitch gain coefficient (pitch prediction coefficient) is output to the pitch gain quantization / decoding section 32 while being output to the conversion / synthesis section 35. In the pitch gain quantization / decoding section 32, the input pitch gain signal is output as a quantized pitch gain signal 113 to the multiplexing / synthesizing section 35, and the pitch gain signal is quantized and decoded. The signal is output to the pitch adding section 27 as a pitch gain signal.

スペクトル強調された音声信号102はピッチ加算部27
に入力され、ここで別に入力されたピッチ周期信号112
で表わされるそのピッチ周期ごとにスペクトル強調され
た音声信号が、フレームの端部から区切られる。その区
切られた時間域の音声信号は順次隣接したフレーム端部
側の区切られた時間域の音声信号とピッチ利得信号中の
ピッチ利得係数ｇとを乗じた音声信号が加えられる。さ
らに、この音声信号20ms分が切出された状態の線形加算
音声信号109が、一時メモリ24に出力される。The spectrum-emphasized audio signal 102 is supplied to a pitch adding unit 27.
And the pitch period signal 112 separately input here.
The audio signal spectrally enhanced for each pitch period represented by the following expression is separated from the end of the frame. An audio signal obtained by multiplying the separated time-domain audio signal by the pitch gain coefficient g in the pitch gain signal is sequentially added to the separated time-domain audio signal. Further, the linear addition audio signal 109 in a state where the audio signal of 20 ms is cut out is output to the temporary memory 24.

次に、一時メモリに記憶されている線形加算音声信号
は最大値検索補正部25に出力され、ここでフレーム中の
最大値が検索され、その信号のフレーム上の位置と振幅
とがマルチパルス信号の一部としてパルス量子化部26へ
出力される。また、その位置と振幅とに合わせて変形さ
れた自己相関信号（この自己相関信号はインパルス応答
信号110の自己相関処理によって最大値検索補正部25の
中で生成される）が得られる。さらに最大値検索補正部
25では上述の変形された自己相関信号を一時メモリ24に
記憶された線形加算信号から差引いた信号が、一時メモ
リ24に戻される。ここで一時メモリ24に戻された信号の
最大値がふたたび検索され、その信号のフレーム上の位
置と振幅とが２番目のマルチパルスとして出力される。Next, the linearly added audio signal stored in the temporary memory is output to the maximum value search correction unit 25, where the maximum value in the frame is searched, and the position and amplitude of the signal on the frame are determined by the multi-pulse signal. Is output to the pulse quantization unit 26 as a part of Further, an autocorrelation signal deformed in accordance with the position and the amplitude (this autocorrelation signal is generated in the maximum value search correction unit 25 by the autocorrelation processing of the impulse response signal 110) is obtained. Furthermore, maximum value search correction unit
At 25, a signal obtained by subtracting the modified autocorrelation signal from the linear addition signal stored in the temporary memory 24 is returned to the temporary memory 24. Here, the maximum value of the signal returned to the temporary memory 24 is searched again, and the position and amplitude of the signal on the frame are output as the second multipulse.

以上の繰返しが各フレームの信号について、マルチパ
ルスの数または振幅があらかじめ定められた値になるま
で続けられ、各フレームごとのマルチパルス信号が順次
パルス量子化部26へ出力される。The above-described repetition is continued until the number or amplitude of the multi-pulse of each frame signal reaches a predetermined value, and the multi-pulse signal for each frame is sequentially output to the pulse quantization unit 26.

パルス量子化部26へ出力されたマルチパルス信号の振
幅は、ピッチ加算部27で加工された波形、例えば第３図
（ｂ）に示す波形に基づいて決定されており、当然、ピ
ッチ加算部27で加工される前の波形、例えば第３図
（ａ）に示す波形に基づいて決定されたマルチパルスの
振幅とは異なっている。The amplitude of the multi-pulse signal output to the pulse quantization unit 26 is determined based on the waveform processed by the pitch addition unit 27, for example, the waveform shown in FIG. The amplitude is different from the amplitude of the multi-pulse determined based on the waveform before processing, for example, the waveform shown in FIG.

第３図に於いて区間長Ｐ単位で区分されたＴの最左
端の部分は後述するピッチ予測フィルタ41の影響を受け
ない。In FIG. 3, the leftmost portion of T divided by the section length P is not affected by the pitch prediction filter 41 described later.

従って、この最左端に存在するマルチパルスの振幅
は、ピッチ加算部27で加工される前の波形に基づいて決
定されたマルチパルスの振幅と殆んど一致すべきであ
る。しかしながら、前述のように最左端の波形はＰ単位
で区分された全ての区間の線形加算波形であり、これら
の振幅は一致しない。Therefore, the amplitude of the leftmost multipulse should almost coincide with the amplitude of the multipulse determined based on the waveform before being processed by the pitch adding unit 27. However, as described above, the leftmost waveform is a linearly-added waveform in all sections divided in P units, and their amplitudes do not match.

本実施例では振幅補正手段６を設けることによりこの
問題を解決している。振幅補正手段６は最大値検索部61
と振幅補正部62とから構成される。In this embodiment, this problem is solved by providing the amplitude correction means 6. The amplitude correction means 6 includes a maximum value search unit 61
And an amplitude correction unit 62.

スペクトル強調フィルタ16の出力波形は最大値検索部
61へ供給される。最大値検索部61は第３図（ａ）の最左
端のＰで示される区間内の最大値を検索し、この最大値
を振幅補正部62へ出力する。振幅補正部62はこの最大値
とパルス量子化部26より供給されるマルチパルスのう
ち、最左端のＰで示される区間内の存在する最大パルス
の振幅との比を算出する。更に振幅補正部62は、この比
に基づいて、全てのマルチパルスを補正しパルス量子化
部へ出力する。The output waveform of the spectrum emphasis filter 16 is the maximum value search section
Supplied to 61. The maximum value search unit 61 searches for the maximum value in the section indicated by P at the left end in FIG. 3A, and outputs this maximum value to the amplitude correction unit 62. The amplitude correction unit 62 calculates a ratio between the maximum value and the amplitude of the maximum pulse present in the section indicated by P at the left end of the multi-pulses supplied from the pulse quantization unit 26. Further, the amplitude corrector 62 corrects all multi-pulses based on the ratio and outputs the result to the pulse quantizer.

パルス量子化部26では、入力されたマルチパルス信号
は量子化されて量子化されたマルチパルス信号111とな
り、LPC係数信号104ピッチ周期信号112・ピッチ利得信
号113と共に多重化合成部35に出力される。In the pulse quantization unit 26, the input multi-pulse signal is quantized into a quantized multi-pulse signal 111, which is output to the multiplexing / synthesizing unit 35 together with the LPC coefficient signal 104, the pitch period signal 112, and the pitch gain signal 113. You.

LPC係数信号104とマルチパルス信号111とピッチ周期
信号112とピッチ利得信号113とが多重化合成部35に入力
され、これの信号を多重化した多重化信号105が多重化
合成部35から出力される。多重化信号105は、伝送線を
通じて波形復号化手段４の多重化分離部36に入力され
る。The LPC coefficient signal 104, the multi-pulse signal 111, the pitch period signal 112, and the pitch gain signal 113 are input to the multiplexing / combining unit 35, and a multiplexed signal 105 obtained by multiplexing these signals is output from the multiplexing / combining unit 35. You. The multiplexed signal 105 is input to the demultiplexing unit 36 of the waveform decoding unit 4 via a transmission line.

次に波形復号化手段４について説明する。波形復号化
手段４は、多重化分離部36と、パルス復号化部40と、ピ
ッチ予測フィルタ41と、Ｋ復号化部42と、Ｋ・α変換部
43と、ピッチ利得復号化部44とを備えている。波形符号
化手段３の多重化合成部35から多重化信号105が多重化
分離部36に入力されると、多重化分離部36ではLPC係数
信号104に対応するLPC係数信号114と、マルチパルス信
号111に対応するマルチパルス信号121と、ピッチ周期信
号112に対応するピッチ周期信号122と、ピッチ利得信号
113に対応するピッチ利得信号123とがそれぞれ出力され
る。Next, the waveform decoding means 4 will be described. The waveform decoding unit 4 includes a demultiplexing unit 36, a pulse decoding unit 40, a pitch prediction filter 41, a K decoding unit 42, a K · α conversion unit
43, and a pitch gain decoding unit 44. When the multiplexed signal 105 is input from the multiplexing / combining unit 35 of the waveform encoding unit 3 to the multiplexing / demultiplexing unit 36, the multiplexing / demultiplexing unit 36 outputs an LPC coefficient signal 114 corresponding to the LPC coefficient signal 104 and a multi-pulse signal. A multi-pulse signal 121 corresponding to 111, a pitch cycle signal 122 corresponding to the pitch cycle signal 112, and a pitch gain signal
A pitch gain signal 123 corresponding to 113 is output.

多重化分離部36から出力されるLPC係数信号114は、Ｋ
復号化部42で復号化され、さらにＫ・α変換部43で変換
されたLPC係数信号107となり、LPC合成部５に入力され
る。同じく多重化分離部36から出力されるマルチパルス
の位置と振幅とを示すマルチパルス信号121は、パルス
復号化部40で復号化されてピッチ予測フィルタ41に入力
される。また多重化分離部36から出力されるピッチ周期
信号122は直接、ピッチ予測フィルタ41に入力され、こ
こで復号化されたマルチパルス信号106がLPC合成部５に
入力される。マルチパルス信号106は、LPC係数信号107
に従って制御されつつ復号化されて、出力音声信号108
がLPC合成部５から出力される。The LPC coefficient signal 114 output from the demultiplexing unit 36 is represented by K
The LPC coefficient signal 107 decoded by the decoding unit 42 and further converted by the K / α conversion unit 43 is input to the LPC synthesis unit 5. Similarly, a multipulse signal 121 indicating the position and amplitude of the multipulse output from the demultiplexing unit 36 is decoded by the pulse decoding unit 40 and input to the pitch prediction filter 41. The pitch period signal 122 output from the demultiplexing unit 36 is directly input to the pitch prediction filter 41, and the decoded multipulse signal 106 is input to the LPC synthesis unit 5. The multi-pulse signal 106 is an LPC coefficient signal 107
Are decoded while being controlled in accordance with
Is output from the LPC synthesis unit 5.

〔The invention's effect〕

以上説明したように本発明のマルチパルス符号化装置
は、音声波形のピッチ周期に基づき音声波形の線形加算
を行うピッチ加算部を設けることにより、伝送帯域幅が
縮小できかつ演算回数も減少しパルス検索が容易で構造
も簡単になるという効果がある。As described above, the multi-pulse encoding apparatus according to the present invention includes a pitch addition unit that performs linear addition of a speech waveform based on the pitch cycle of the speech waveform, thereby reducing the transmission bandwidth and the number of operations, thereby reducing the number of pulses. The effect is that the search is easy and the structure is simple.

[Brief description of the drawings]

第１図は本発明の一実施例の構成を示すブロック図、第
２図は本発明を構成するスペクトル強調フィルタの一例
を示すブロック図、第３図（ａ），（ｂ）はそれぞれ本
発明のピッチ加算の作動を示すスペクトル強調された音
声信号の波形図で、（ａ）は入力音声波形図、（ｂ）は
（ａ）に示した入力音声をピッチ加算の前処理をした入
力音声波形図である。１…LPC分析手段、２…スペクトル強調手段、３…波形
符号化手段、４…波形復号化手段、５…LPC合成部、６
…振幅補正手段、11…波形切出部、12…LPC分析部、13
…Ｋ量子化復合化部、14…Ｋ・α変換部、15…減衰係数
印加部、16…スペクトル強調フィルタ、17…波形切出
部、20…インパルス応答部、21…スペクトル強調フィル
タ、24…一時メモリ、25…最大値検索補正部、26…パル
ス量子化部、27…ピッチ加算部、31…ピッチ抽出部、32
…ピッチ利得量子化復号化部、35…多重化合成部、36…
多重化分離部、40…パルス復合化部、41…ピッチ予測フ
ィルタ、42…Ｋ復合化部、43…Ｋ・α変換部、44…ピッ
チ利得復合化部、61…最大値検索部、62…振幅補正部。FIG. 1 is a block diagram showing a configuration of an embodiment of the present invention, FIG. 2 is a block diagram showing an example of a spectrum emphasizing filter constituting the present invention, and FIGS. 3 (a) and 3 (b) show the present invention, respectively. FIGS. 4A and 4B are waveform diagrams of a spectrum-emphasized voice signal showing an operation of pitch addition of FIG. 3A, wherein FIG. 4A is an input voice waveform diagram, and FIG. FIG. DESCRIPTION OF SYMBOLS 1 ... LPC analysis means, 2 ... Spectrum emphasis means, 3 ... Waveform encoding means, 4 ... Waveform decoding means, 5 ... LPC synthesis part, 6
... Amplitude correction means, 11 ... Waveform extraction unit, 12 ... LPC analysis unit, 13
... K quantization decoding section, 14 ... K-α conversion section, 15 ... Attenuation coefficient applying section, 16 ... Spectral emphasis filter, 17 ... Waveform cutout section, 20 ... Impulse response section, 21 ... Spectral emphasis filter, 24 ... Temporary memory, 25: maximum value search / correction unit, 26: pulse quantization unit, 27: pitch addition unit, 31: pitch extraction unit, 32
... Pitch gain quantization decoder, 35 ... Multiplexer, 36 ...
Demultiplexing unit, 40: pulse decoding unit, 41: pitch prediction filter, 42: K decoding unit, 43: K / α conversion unit, 44: pitch gain decoding unit, 61: maximum value search unit, 62 ... Amplitude correction unit.

───────────────────────────────────────────────────── フロントページの続き (58)調査した分野(Int.Cl.⁶，ＤＢ名) G10L 9/14 H03M 1/12 H04B 14/04──────────────────────────────────────────────────続き Continued on the front page (58) Field surveyed (Int.Cl. ⁶ , DB name) G10L 9/14 H03M 1/12 H04B 14/04

Claims

(57) [Claims]

1. A multi-pulse encoder for transmitting, decoding and outputting an encoded signal obtained by encoding an input audio signal, wherein pitch information is extracted from the audio signal, and a pitch period signal and a pitch gain coefficient signal are extracted. And a pitch extracting means for outputting a voice signal in one time range of a time range separated by a pitch cycle from one end of the frame time length by separating the voice signal by a pitch cycle for each frame time length. Linear adding means for sequentially outputting an audio signal to be encoded by sequentially performing operations of adding an audio signal obtained by multiplying an adjacent audio signal in the preceding time range by a pitch gain coefficient to the other end of the frame time length. And an amplitude correction means for correcting the amplitude of the multi-pulse obtained from the audio signal after addition using the audio signal before addition. Luz type encoder.