JP3475958B2

JP3475958B2 - Speech encoding / decoding apparatus including speechless encoding, decoding method, and recording medium recording program

Info

Publication number: JP3475958B2
Application number: JP2003062333A
Authority: JP
Inventors: 芹沢　　昌宏; 伊藤　　博紀
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1999-05-31
Filing date: 2003-03-07
Publication date: 2003-12-10
Anticipated expiration: 2019-10-20
Also published as: JP2003233398A

Abstract

PROBLEM TO BE SOLVED: To provide a device which reduces sound quality deterioration when a voiceless section is encoded at a very low bit rate. SOLUTION: The device is equipped with: a switching means for passing a signal code sequence to a voice part decoding means or voiceless part decoding means according to whether an input signal is in a voiced or voiceless section on the basis of a VAD decision code outputted from a bit sequence decomposing means; a parameter decoding means which outputs a filter coefficient and RMS that the voiceless part decoding means derives from the signal code sequence; a smoothing means which smoothes and outputs the RMS and filter coefficient outputted from the parameter decoding means; a pulse generating means which generates a pulse train signal consisting of pulses having positions and amplitudes generated with random numbers; a pitch generating means which generates a pitch signal; a mixing means which generates an excitation signal of a synthesizing filter by performing linear sum processing between a random number signal, the pulse train signal passed from the pulse generating means and the pitch signal; and a synthesizing means which decodes and outputs the excitation signal by inputting the excitation signal to a filter composed of a smoothed filter coefficient. COPYRIGHT: (C)2003,JPO

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、音声信号等のデジ
タル情報を符号化・復号する装置に関し、特に無音声部
の符号化・復号技術に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a device for coding / decoding digital information such as a voice signal, and more particularly to a coding / decoding technique for a non-voice part.

【０００２】[0002]

【従来の技術】この種の従来の音声符号化・復号装置
は、音声がない区間（「無音声区間」という）を、音声
区間の符号化に比べて非常に低いビットレートで符号化
することにより、伝送する平均ビットレートを低減する
ものであり、例えば、文献１（IEEE Communications
Magazine, 第64−73頁、Sep、1997）等の記載が参照さ
れる。2. Description of the Related Art A conventional voice encoding / decoding device of this type encodes a section without voice (referred to as "non-voice section") at a bit rate very lower than that of encoding a voice section. The average bit rate for transmission is reduced by, for example, Document 1 (IEEE Communications
Magazine, pp. 64-73, Sep, 1997), etc. are referred to.

【０００３】この従来の符号化装置では、入力信号を予
め定めたフレーム（10 msec）毎に音声区間であるか無
音声区間であるかを判別し、音声区間である場合には、
通常の音声符号化方式（ITU−T勧告G.729）により入力
信号を符号化・復号し、一方、無音声区間の場合、符号
化装置では入力信号の特徴パラメータを間欠的に符号化
し、復号装置に伝送する。復号装置では、全てのフレー
ムではなく、間欠的に受信した特徴パラメータの繰り返
しあるいは平滑化を行うことで全フレームの特徴パラメ
ータを計算し、これらを用いて信号を復号する。In this conventional coding apparatus, it is determined whether the input signal is a voice section or a non-voice section for each predetermined frame (10 msec), and when it is a voice section,
The input signal is coded / decoded by a normal speech coding method (ITU-T Recommendation G.729), while in the case of no speech section, the coding device intermittently codes and decodes the characteristic parameter of the input signal. Transmit to device. The decoding apparatus calculates characteristic parameters of all frames by repeating or smoothing the characteristic parameters received intermittently instead of all frames, and decodes the signals using these.

【０００４】音声区間か無音声区間かを判別する方法と
して、上記文献１記載されているように、フレーム毎に
入力信号から計算する二乗平均平方根（root mean sq
uares;「RMS」という）、低周波数領域に対応するRMS、
零交差数、及びスペクトル包絡特性を表すフィルタ係数
を用いる方法がある。これらの変量と各々の無音声区間
における平均値との差分に基づき、閾値処理により判別
を行なう。As a method for discriminating between a voice section and a non-voice section, as described in the above-mentioned reference 1, a root mean sq calculated from an input signal for each frame.
uares; "RMS"), RMS for low frequency range,
There is a method of using the number of zero crossings and a filter coefficient representing the spectral envelope characteristic. Based on the difference between these variables and the average value in each unvoiced section, threshold value determination is performed.

【０００５】音声区間を符号化する方法としては、例え
ば、文献２（ITU−T勧告G.729、COM15−152 July 199
5）に記載されているCELP (Code Excited Linear Pr
ediction Coding；符号励振線形予測符号化)方式があ
る。CELP方式については、文献３（Code−Excited Line
ar Prediction : High Quality Speech at Very LowBit
Rates (IEEE Proc. ICASSP−85、 pp.937− 940、198
5) )の記載も参照される。As a method of encoding a voice section, for example, reference 2 (ITU-T Recommendation G.729, COM15-152 July 199) is used.
5) CELP (Code Excited Linear Pr)
ediction Coding; code excitation linear prediction coding). For the CELP method, refer to Reference 3 (Code-Excited Line
ar Prediction: High Quality Speech at Very LowBit
Rates (IEEE Proc. ICASSP−85, pp.937−940, 198
5)) is also referred to.

【０００６】従来の装置の符号化処理では、入力信号を
予め定めたフレーム毎に線形予測分析して、音声信号の
スペクトル包絡特性を表す線形予測(フィルタ)係数を算
出し、そのスペクトル包絡特性に対応するＬＰ合成フィ
ルタを駆動して励振信号を算出し、それぞれ符号化す
る。In the encoding processing of the conventional apparatus, the input signal is subjected to linear prediction analysis for each predetermined frame to calculate a linear prediction (filter) coefficient representing the spectral envelope characteristic of the voice signal, and the spectral envelope characteristic is calculated. The corresponding LP synthesis filter is driven to calculate the excitation signal, and each excitation signal is encoded.

【０００７】励振信号の符号化は、フレームを更にサブ
フレームに分割してサブフレーム毎に行う。ここで、励
振信号は、入力信号のピッチ周期を表す周期成分と残り
の残差成分とそれらのゲインにより構成される。入力信
号のピッチ周期を表す周期成分は、「適応コードブッ
ク」と呼ばれる過去の励振信号を保持するコードブック
に格納された適応コードベクトルとして表され、前記残
差成分は、複数のパルスからなるマルチパルス信号とし
て表される。The excitation signal is coded for each subframe by further dividing the frame into subframes. Here, the excitation signal is composed of a periodic component representing the pitch period of the input signal, the remaining residual component, and their gains. A periodic component representing the pitch period of the input signal is represented as an adaptive code vector stored in a codebook holding a past excitation signal called an “adaptive codebook”, and the residual component is a multi-pulse composed of a plurality of pulses. It is represented as a pulse signal.

【０００８】また、復号処理では、復号したピッチ周期
成分と残差信号から得た励振信号を、復号したフィルタ
係数で構成する合成フィルタに入力して音声信号を復号
する。In the decoding process, the excitation signal obtained from the decoded pitch period component and the residual signal is input to the synthesis filter composed of the decoded filter coefficient to decode the voice signal.

【０００９】無音声区間を符号化する方法として、上記
文献１に記載されているように、まず、符号化装置で、
入力信号の特徴パラメータとしてRMSとスペクトル特性
を表すフィルタ係数を符号化する。As a method for encoding a non-voice section, as described in the above-mentioned document 1, first, in an encoding device,
The RMS and the filter coefficient representing the spectral characteristic are encoded as the characteristic parameters of the input signal.

【００１０】次に、復号装置では、乱数信号と乱数的に
生成したパルス性信号とピッチ信号の線形和をRMSで調
整し、これをフィルタ係数を用いて構成した合成フィル
タに入力することにより、無音声信号を復号する。Next, in the decoding device, the linear sum of the random number signal, the pulse signal generated in a random number, and the pitch signal is adjusted by RMS, and this is input to the synthesizing filter configured by using the filter coefficient. Decode the unvoiced signal.

【００１１】特徴パラメータは、無音声区間で信号の性
質が変化したフレームでのみ伝送し、それ以外のフレー
ムでは何も伝送しない。但し、特徴パラメータを伝送す
るか否かの情報は別途伝送する。The characteristic parameter is transmitted only in the frame in which the signal property changes in the non-voice section, and nothing is transmitted in the other frames. However, information on whether to transmit the characteristic parameter is transmitted separately.

【００１２】この特徴パラメータを何も伝送しないフレ
ームでは、過去の伝送された特徴パラメータを繰り返し
使用する。但し、波形上での不連続が生じないように、
RMSは、平滑化処理を施している。In a frame in which no characteristic parameter is transmitted, past transmitted characteristic parameters are repeatedly used. However, to prevent discontinuity on the waveform,
The RMS has been smoothed.

【００１３】図８は、従来の符号化装置の構成を示すブ
ロック図である。図８を参照すると、この符号化装置
は、音声部符号化回路12と、無音声部符号化回路14と、
信号判定回路16と、切り替え回路18と、ビット生成回路
20とを備えている。FIG. 8 is a block diagram showing the structure of a conventional coding apparatus. Referring to FIG. 8, this encoding device includes a voice part encoding circuit 12, a voiceless part encoding circuit 14,
Signal determination circuit 16, switching circuit 18, bit generation circuit
It has 20 and.

【００１４】入力端子10は、入力信号を一定フレーム単
位、例えば10msec単位で入力する。信号判定回路16は、
入力端子10からの入力信号を用いてフレームが音声区間
か無音声区間かの判定を行ない、判定結果（VAD判定符
号）を切り替え回路18とビット列生成回路20に渡す。The input terminal 10 inputs an input signal in a unit of a fixed frame, for example, a unit of 10 msec. The signal determination circuit 16 is
The input signal from the input terminal 10 is used to determine whether the frame is a voice section or a non-voice section, and the determination result (VAD determination code) is passed to the switching circuit 18 and the bit string generation circuit 20.

【００１５】音声部符号化回路12は、入力端子10からの
入力信号をフレーム毎に符号化し、信号符号列を切り替
え回路18に渡す。The voice part coding circuit 12 codes the input signal from the input terminal 10 for each frame and passes the signal code string to the switching circuit 18.

【００１６】無音声部符号化回路14は、入力端子10から
の入力信号をフレーム毎に符号化し、信号符号列を切り
替え回路18に渡す。また、無音声区間において信号符号
列を伝送するか否かの判定情報（DTX判定符号）をビッ
ト生成回路20に渡す。The voiceless section coding circuit 14 codes the input signal from the input terminal 10 for each frame and passes the signal code string to the switching circuit 18. Also, the judgment information (DTX judgment code) as to whether or not to transmit the signal code string in the non-voice section is passed to the bit generation circuit 20.

【００１７】切り替え回路18は、信号判定回路16から渡
されるVAD判定符号に基づき、入力信号が音声区間とさ
れた場合には、音声部符号化回路12から渡された信号符
号列を、VAD判定符号で入力信号が無音声区間とされた
場合には、無音声符号化回路14から渡された信号符号列
をビット列生成回路20に渡す。Based on the VAD decision code passed from the signal decision circuit 16, the switching circuit 18 judges the VAD decision code from the signal code string passed from the voice part coding circuit 12 when the input signal is in the voice section. When the code indicates that the input signal is in the non-voice section, the signal code sequence passed from the non-voice encoding circuit 14 is passed to the bit sequence generation circuit 20.

【００１８】ビット列生成回路20は、信号判定回路16か
ら渡されるVAD判定符号と、無音声部符号化回路10から
渡されるDTX判定符号と、切り替え回路18から渡される
信号符号列とを多重して、ビット列を生成し、出力端子
22から出力する。The bit string generation circuit 20 multiplexes the VAD judgment code passed from the signal judgment circuit 16, the DTX judgment code passed from the non-voice part coding circuit 10, and the signal code string passed from the switching circuit 18. , Generate bit string and output terminal
Output from 22.

【００１９】図９は、従来の復号装置を説明するブロッ
ク図である。FIG. 9 is a block diagram for explaining a conventional decoding device.

【００２０】図９を参照すると、この復号装置は、ビッ
ト列分解回路26と、切り替え回路28と、音声部復号回路
30と、無音声部復号回路34とを備えて構成される。ビッ
ト列分解回路26は、入力端子24から入力したビット列を
VAD判定符号とDTX判定符号及び信号符号列に分解し、VA
D判定符号と信号符号列を切り替え回路28に渡し、DTX判
定符号を無音声部復号回路34に渡す。Referring to FIG. 9, this decoding device is provided with a bit string decomposition circuit 26, a switching circuit 28, and a voice part decoding circuit.
30 and a voiceless section decoding circuit 34. The bit string decomposition circuit 26 analyzes the bit string input from the input terminal 24.
Decompose into VAD decision code, DTX decision code and signal code string, and
The D decision code and the signal code string are passed to the switching circuit 28, and the DTX decision code is passed to the voiceless section decoding circuit 34.

【００２１】切り替え回路28は、ビット列分解回路26か
ら渡されたVAD判定符号に基づき、入力信号が音声区間
とされた場合にはビット列分解回路26から渡された信号
符号列を音声部復号回路30に渡し、VAD判定符号で入力
信号が無音声区間とされた場合には無音声部復号回路34
に渡す。The switching circuit 28, on the basis of the VAD decision code passed from the bit string decomposition circuit 26, when the input signal is in the voice section, the signal code string passed from the bit string decomposition circuit 26 to the audio part decoding circuit 30. When the input signal is in the non-voice section by the VAD judgment code, the non-voice section decoding circuit 34
Pass to.

【００２２】音声部復号回路30は、切り替え回路28から
渡された信号符号列を用いて信号を復号し、出力端子32
から出力する。The audio part decoding circuit 30 decodes the signal using the signal code string passed from the switching circuit 28, and outputs it to the output terminal 32.
Output from.

【００２３】無音声部復号回路34は、ビット列分解回路
26から渡されたDTX判定符号と切り替え回路28から渡さ
れた信号符号列を用いて、無音声部の信号を復号し、出
力端子32から出力する。The non-voice part decoding circuit 34 is a bit string decomposition circuit.
The DTX judgment code passed from 26 and the signal code string passed from the switching circuit 28 are used to decode the signal of the non-voice part and output from the output terminal 32.

【００２４】図１０は、従来の復号装置における無音声
復号回路34の構成を示すブロック図である。図１０を参
照すると、無音声復号回路34は、パラメータ復号回路54
と、乱数回路56と、パルス回路53と、混合回路61と、平
滑化回路66と、合成回路68とを備えている。FIG. 10 is a block diagram showing the configuration of the voiceless decoding circuit 34 in the conventional decoding device. Referring to FIG. 10, the voiceless decoding circuit 34 includes a parameter decoding circuit 54.
A random number circuit 56, a pulse circuit 53, a mixing circuit 61, a smoothing circuit 66, and a synthesizing circuit 68.

【００２５】パラメータ復号回路54は、入力端子52で入
力した信号符号列から求めたフィルタ係数とRMSをそれ
ぞれ合成回路68と平滑化回路66に渡す。The parameter decoding circuit 54 passes the filter coefficient and RMS obtained from the signal code string input at the input terminal 52 to the synthesizing circuit 68 and the smoothing circuit 66, respectively.

【００２６】平滑化回路66は、パラメータ復号回路54か
ら渡されたRMSを平滑化して得た平滑化RMSを、混合回路
61に渡す。但し、入力端子50から入力されたDTX判定符
号で信号符号列が伝送されないことが示された場合に
は、前フレームのRMSを用いて平滑化を行なう。The smoothing circuit 66 smoothes the RMS passed from the parameter decoding circuit 54 and obtains the smoothed RMS obtained by the smoothing RMS.
Pass to 61. However, if the DTX decision code input from the input terminal 50 indicates that the signal code string is not transmitted, smoothing is performed using the RMS of the previous frame.

【００２７】各無音声区間中の先頭から数えてnフレー
ム目で使用する平滑化RMS P(n)は、nフレーム目に入力
されたRMS p(n)を用いて次式（１）で計算する。但
し、何も伝送されてこないフレームではp(n)の代わりに
直前に伝送されたRMSを用いて次式（１）を計算する。The smoothed RMS P (n) used in the nth frame counting from the beginning in each unvoiced section is calculated by the following equation (1) using the RMS p (n) input in the nth frame. To do. However, in a frame in which nothing is transmitted, the following equation (1) is calculated using the RMS transmitted immediately before instead of p (n).

【００２８】 P(n)=(1−α)・p(n−1)＋α・p(n) …（1）[0028] P (n) = (1−α) ・ p (n−1) ＋ α ・ p (n)… (1)

【００２９】ここで、αは平滑化の程度を決定する平滑
化係数であり、上記文献１では、固定値0.125を用いて
いる。また、Ｐ(−1)=0である。Here, α is a smoothing coefficient that determines the degree of smoothing, and in the above-mentioned document 1, a fixed value of 0.125 is used. Also, P (−1) = 0.

【００３０】乱数回路56は、乱数を生成し、混合回路61
に渡す。パルス回路53は、乱数で各々生成した位置と振
幅を持つパルスから成るパルス列信号を生成し、混合回
路61に渡す。The random number circuit 56 generates a random number, and the mixing circuit 61
Pass to. The pulse circuit 53 generates a pulse train signal composed of pulses each having a position and an amplitude generated by random numbers, and passes the pulse train signal to the mixing circuit 61.

【００３１】ピッチ回路58は、前述の適応コードベクト
ルからなるピッチ信号を生成し、混合回路61に渡す。適
応コードベクトルを規定するピッチ周期は伝送されない
ことから、代わりに乱数信号を用いる。The pitch circuit 58 generates a pitch signal composed of the above-mentioned adaptive code vector and passes it to the mixing circuit 61. Since the pitch period defining the adaptive code vector is not transmitted, a random number signal is used instead.

【００３２】混合回路61では、乱数回路56から渡された
乱数信号r(i)と、パルス回路53から渡されたパルス列信
号p(i)と、ピッチ回路58から渡されたピッチ信号q(i)と
の線形和処理により、合成フィルタの励振信号x(i)を計
算し、合成回路68に渡す。In the mixing circuit 61, the random number signal r (i) passed from the random number circuit 56, the pulse train signal p (i) passed from the pulse circuit 53, and the pitch signal q (i passed from the pitch circuit 58. ) And the excitation signal x (i) of the synthesizing filter is calculated and passed to the synthesizing circuit 68.

【００３３】線形和の結合係数を計算する方法として、
例えば、上記文献１に記載された方法が用いられる。As a method of calculating the coupling coefficient of the linear sum,
For example, the method described in Document 1 is used.

【００３４】まず、ピッチ信号の結合係数Gqを制限され
た範囲内の値から乱数で選択する。First, the pitch signal coupling coefficient Gq is randomly selected from a value within a limited range.

【００３５】次に、計算したピッチ信号の結合係数Gqを
用いて、ピッチ信号とパルス列信号の線形和から計算し
たRMSが前記平滑化RMSと同一になるように、パルス列信
号の結合係数Gpを計算する。Next, using the calculated pitch signal coupling coefficient Gq, the pulse train signal coupling coefficient Gp is calculated so that the RMS calculated from the linear sum of the pitch signal and the pulse train signal becomes the same as the smoothed RMS. To do.

【００３６】以上で計算した結合係数を用いてピッチ信
号とパルス列信号との線形和 e(i)を次式（２）で計算
する。A linear sum e (i) of the pitch signal and the pulse train signal is calculated by the following equation (2) using the coupling coefficient calculated above.

【００３７】e(i) = Gq ・q(i) + Gp・p(i) …（2）E (i) = Gq.q (i) + Gp.p (i) (2)

【００３８】更に、この線形和 e(i) と乱数信号との
新たな線形和が前記平滑化RMSと同一になるように、線
形和 e(i) の結合係数Grを計算する。ここで、乱数信
号の結合係数は固定値γ=0.6を用いている。Further, the coupling coefficient Gr of the linear sum e (i) is calculated so that the new linear sum of the linear sum e (i) and the random number signal becomes the same as the smoothed RMS. Here, a fixed value γ = 0.6 is used as the coupling coefficient of the random number signal.

【００３９】従って、合成フィルタの励振信号x(i)は次
式（３）で計算される。Therefore, the excitation signal x (i) of the synthesis filter is calculated by the following equation (3).

【００４０】 x(i) = Gr ・[Gq ・q(i) + Gp・p(i)] + γ・r(i) …（3）[0040] x (i) = Gr ・ [Gq ・ q (i) + Gp ・ p (i)] + γ ・ r (i)… (3)

【００４１】合成回路68は、混合回路61から渡される励
振信号を、パラメータ復号回路54から渡されるフィルタ
係数で構成するフィルタに入力することにより、信号を
復号し、出力端子70から出力する。The synthesizing circuit 68 decodes the signal by inputting the excitation signal passed from the mixing circuit 61 to the filter constituted by the filter coefficient passed from the parameter decoding circuit 54, and outputs it from the output terminal 70.

【００４２】[0042]

【発明が解決しようとする課題】しかしながら、上記し
た従来の装置は下記記載の問題点を有している。However, the above-mentioned conventional apparatus has the following problems.

【００４３】第１の問題点は、復号装置において、無音
声区間を復号する際に使用するフィルタ係数が不連続に
変化する場合があり、その結果、復号信号の品質が劣化
する、ということである。The first problem is that, in the decoding apparatus, the filter coefficient used when decoding the non-voice section may change discontinuously, resulting in deterioration of the quality of the decoded signal. is there.

【００４４】その理由は、間欠的に伝送されるフィルタ
係数をそのまま用いている、ためである。The reason is that the filter coefficient transmitted intermittently is used as it is.

【００４５】第２の問題点は、無音声区間における最初
の区間（例えば数百msec）において直前の有音声区間に
よる影響を受ける場合があり、その結果、復号信号でそ
の振幅が実際より高くなったり、エコーを含むことによ
る復号信号の音質劣化が生じる、ということである。The second problem may be affected by the immediately preceding voiced section in the first section (for example, several hundred msec) in the unvoiced section, and as a result, the amplitude of the decoded signal becomes higher than it actually is. Or, the sound quality of the decoded signal deteriorates due to the inclusion of echo.

【００４６】その理由は、無音声区間では、無音声区間
における再生信号が不連続にならないように、RMSの平
滑化処理を、常に行なっている、ためである。The reason is that in the non-voice section, the RMS smoothing process is always performed so that the reproduced signal in the non-voice section does not become discontinuous.

【００４７】第３の問題点は、無音声区間の復号信号が
入力信号の背景雑音とは聴覚的に著しく異なる場合があ
り、その結果、有音声部に含まれる背景雑音と聴覚的な
不連続が生じる、ということである。The third problem is that the decoded signal in the non-voice section may be significantly different from the background noise of the input signal, and as a result, the background noise included in the voiced section and the auditory discontinuity. Is to occur.

【００４８】その理由は、無音声区間において再生フィ
ルタの励振信号を生成する時に、乱数成分に対するパル
ス成分とピッチ成分の比を一定値としている、ためであ
る。The reason is that the ratio of the pulse component to the pitch component to the random number component is set to a constant value when the excitation signal of the reproduction filter is generated in the non-voice section.

【００４９】したがって本発明は、上記問題点に鑑みて
なされたものであって、その主たる目的は、無音声区間
を高性能に符号化することで、無音声部符号化の導入に
より伝送ビットレートの平均値を下げても、高符号化品
質を実現する、装置を提供することにある。Therefore, the present invention has been made in view of the above-mentioned problems, and its main purpose is to encode a non-voice section with high performance, and to introduce a non-voice section coding so as to achieve a transmission bit rate. An object of the present invention is to provide a device that realizes high coding quality even if the average value of is reduced.

【００５０】また本発明の他の目的は、無音声区間復号
時のフィルタ係数の不連続に帰因する復号音質劣化を低
減する復号装置を提供することにある。Another object of the present invention is to provide a decoding apparatus that reduces the deterioration of the decoded sound quality due to the discontinuity of the filter coefficients during the non-voice section decoding.

【００５１】[0051]

【課題を解決するための手段】前記目的を達成する第１
の発明は、各フレームにおいて復号信号が音声区間であ
るか無音声区間であるかの判別情報に従い前記復号信号
の特徴パラメータから信号を復号する方法を切り替える
音声復号装置において、前記特徴パラメータの中で前記
復号信号のスペクトル包絡特性を表す特徴パラメータを
時間方向に平滑化した値を用いて復号する手段を備えて
いる。[Means for Solving the Problems] First to achieve the above object
The present invention, in a voice decoding device for switching the method of decoding a signal from the characteristic parameter of the decoded signal according to the discrimination information whether the decoded signal is a voice section or a non-voice section in each frame, in the characteristic parameter There is provided means for decoding using a value obtained by smoothing the characteristic parameter representing the spectral envelope characteristic of the decoded signal in the time direction.

【００５２】第２の発明は、各フレームにおいて復号信
号が音声区間であるか無音声区間であるかの判別情報に
従い前記復号信号の特徴パラメータから信号を復号する
方法を切り替える音声復号装置において、音声区間から
無音声区間に切り替わってからの時間経過に応じて、前
記特徴パラメータの少なくとも一つについて時間方向に
平滑化する程度を変更した値を用いて復号する手段を備
える。A second aspect of the present invention is a voice decoding apparatus for switching a method of decoding a signal from a characteristic parameter of the decoded signal according to the discrimination information as to whether the decoded signal is a voice section or a non-voice section in each frame. There is provided means for decoding using a value in which a degree of smoothing in at least one of the characteristic parameters is changed in accordance with a lapse of time after the section is switched to the non-voice section.

【００５３】第３の発明は、各フレームにおいて復号信
号が音声区間であるか無音声区間であるかの判別に従い
前記復号信号の特徴パラメータから信号を復号する方法
を切り替える音声復号装置において、音声区間から無音
声区間に切り替わった直後の区間では伝送された特徴パ
ラメータの少なくとも一つを直接使用し、それ以降は、
前記特徴パラメータの内少なくとも一つについて時間方
向に平滑化した値を信号復号で用いて復号する手段を備
える。A third aspect of the present invention is a voice decoding apparatus for switching a method of decoding a signal from a characteristic parameter of the decoded signal according to whether the decoded signal is a voice section or a non-voice section in each frame. In the section immediately after switching from the non-voice section, at least one of the transmitted characteristic parameters is directly used, and thereafter,
There is provided means for decoding by using a value smoothed in the time direction for at least one of the characteristic parameters in signal decoding.

【００５４】第４の発明は、各フレームにおいて復号信
号が音声区間であるか無音声区間であるかの判別に従い
前記復号信号の特徴パラメータから信号を復号する方法
を切り替える音声復号装置において、前記特徴パラメー
タの内少なくとも一つに応じて、前記特徴パラメータの
少なくとも一つについて時間方向に平滑化する程度を変
更した値を用いて復号する手段を備える。A fourth aspect of the present invention is the speech decoding apparatus, wherein the method of decoding a signal from the characteristic parameter of the decoded signal is switched according to whether the decoded signal in each frame is a speech section or a non-speech section. There is provided means for decoding using a value in which the degree of smoothing in the time direction is changed for at least one of the characteristic parameters according to at least one of the parameters.

【００５５】第５の発明は、各フレームにおいて復号信
号が音声区間であるか無音声区間であるかの判別に従い
前記復号信号の特徴パラメータから信号を復号する方法
を切り替える音声復号装置において、前記特徴パラメー
タの内少なくとも一つ及び音声区間から無音声区間に切
り替わってからの時間経過に応じて、前記特徴パラメー
タの少なくとも一つについて時間方向に平滑化する程度
を変更した値を用いて復号する手段を備える。A fifth aspect of the present invention is the speech decoding apparatus, wherein the method for decoding a signal from the characteristic parameter of the decoded signal is switched according to whether the decoded signal in each frame is a speech section or a non-speech section. Decoding means using at least one of the parameters and a value obtained by changing the degree of smoothing in the time direction for at least one of the characteristic parameters according to the time elapsed after switching from the voice section to the non-voice section. Prepare

【００５６】第６の発明は、各フレームにおいて復号信
号が音声区間であるか無音声区間であるかの判別に従い
前記復号信号の特徴パラメータから信号を復号する方法
を切り替える音声復号装置において、前記特徴パラメー
タが予め定めた条件を満たす区間では伝送された特徴パ
ラメータの内少なくとも一つを直接使用し、それ以降
は、前記特徴パラメータの内少なくとも一つについて時
間方向に平滑化した値を信号復号で用いて復号する手段
を備えたことを特徴とする音声復号装置。According to a sixth aspect of the present invention, in the speech decoding apparatus, the method for decoding the signal from the characteristic parameter of the decoded signal is switched according to the determination as to whether the decoded signal is in the voice section or the non-voice section in each frame. At least one of the transmitted characteristic parameters is directly used in the section where the parameter satisfies a predetermined condition, and thereafter, a value smoothed in the time direction for at least one of the characteristic parameters is used for signal decoding. A speech decoding apparatus comprising means for decoding the speech.

【００５７】第７の発明は、各フレームにおいて復号信
号が音声区間であるか無音声区間であるかの判別に従い
前記復号信号の特徴パラメータから信号を復号する方法
を切り替える音声復号装置において、前記特徴パラメー
タの内少なくとも一つ及び音声区間から無音声区間に切
り替わってからの時間経過に応じて、前記特徴パラメー
タの内少なくとも一つについて時間方向に平滑化する程
度を変更した値を用いて復号する手段を備える。A seventh aspect of the present invention is the speech decoding apparatus, wherein the method for decoding a signal from the characteristic parameter of the decoded signal is switched according to whether the decoded signal in each frame is a speech section or a non-speech section. Decoding means using at least one of the parameters and a value in which the degree of smoothing in the time direction is changed for at least one of the characteristic parameters according to the passage of time after switching from the voice section to the non-voice section. Equipped with.

【００５８】第８の発明は、各フレームにおいて復号信
号が音声区間であるか無音声区間であるかの判別に従い
前記復号信号の特徴パラメータから信号を復号する方法
を切り替える音声復号装置において、音声区間から無音
声区間に切り替わった直後且つ前記特徴パラメータが予
め定めた条件を満たす区間では伝送された特徴パラメー
タの少なくとも一つを直接使用し、それ以降は、前記特
徴パラメータの内少なくとも一つについて時間方向に平
滑化した値を信号復号で用いて復号する手段を備える。An eighth aspect of the present invention is a speech decoding apparatus for switching a method of decoding a signal from a characteristic parameter of the decoded signal in accordance with a determination as to whether the decoded signal is a speech section or a non-speech section in each frame. Immediately after switching from the non-voice section to the non-voice section and in the section where the characteristic parameter satisfies a predetermined condition, at least one of the transmitted characteristic parameters is directly used, and thereafter, at least one of the characteristic parameters is temporally And means for decoding by using the smoothed value in the signal decoding.

【００５９】第９の発明は、各フレームにおいて復号信
号が音声区間であるか無音声区間であるかの判別情報に
従い前記復号信号に対応する特徴パラメータから信号を
復号する方法を切り替え、少なくとも一部の区間におい
て、無音声区間の信号を複数種類の信号から成る励振信
号を合成フィルタに入力することにより生成する音声復
号装置において、受信した特徴パラメータの少なくとも
一つに基づき、前記無音声区間における前記複数種類の
信号を加算する際の係数を決定する手段を備える。A ninth aspect of the present invention switches the method of decoding a signal from the characteristic parameter corresponding to the decoded signal according to the discrimination information indicating whether the decoded signal is in the voice section or in the non-voice section in each frame, and at least partly In a speech decoding device for generating a signal in a non-voice section by inputting an excitation signal composed of a plurality of types of signals to a synthesis filter in the section, in the non-voice section based on at least one of the received characteristic parameters. A means for determining a coefficient when adding a plurality of kinds of signals is provided.

【００６０】第１０の発明は、各フレームにおいて復号
信号が音声区間であるか無音声区間であるかの判別情報
に従い前記復号信号に対応する特徴パラメータから信号
を復号する方法を切り替え、無音声区間の信号を複数種
類の信号から成る励振信号を合成フィルタに入力するこ
とにより生成する音声復号装置において、少なくとも一
部の区間において、受信した特徴パラメータの時間方向
に平滑化した平滑化パラメータの少なくとも一つに基づ
き、前記無音声区間における前記複数種類の信号を加算
する際の係数を決定する。A tenth aspect of the present invention switches the method of decoding a signal from the characteristic parameter corresponding to the decoded signal according to the discrimination information whether the decoded signal is a voice section or a non-voice section in each frame, In a speech decoding apparatus for generating the signal of 1) by inputting an excitation signal composed of a plurality of kinds of signals into a synthesis filter, at least one of smoothing parameters smoothed in the time direction of the received characteristic parameter in at least a part of the section. Based on the above, a coefficient for adding the plurality of types of signals in the non-voice section is determined.

【００６１】第１１の発明は、前記第１乃至第１０の発
明において、前記特徴パラメータが、前記復号信号に対
応するスペクトル包絡を表す量とパワーを表す量の少な
くとも一つを含む。In an eleventh aspect based on the first to tenth aspects, the characteristic parameter includes at least one of an amount representing a spectrum envelope corresponding to the decoded signal and an amount representing power.

【００６２】第１２の発明は、各フレームにおいて入力
信号が音声区間であるか無音声区間であるかの判別を行
い前記入力信号の特徴パラメータを符号化する符号化装
置と、第１乃至第１１のいずれかの音声復号装置とを備
える。A twelfth aspect of the present invention is an encoding apparatus for discriminating whether an input signal is in a voice section or a non-voice section in each frame and encoding characteristic parameters of the input signal, and first to eleventh aspects. And a voice decoding device according to any one of 1.

【００６３】[0063]

【発明の実施の形態】本発明の実施の形態について説明
する。本発明の音声復号装置は、第１の実施の形態にお
いて、各フレームにおいて復号信号が音声区間であるか
無音声区間であるかの判別情報に従い前記復号信号の特
徴パラメータから信号を復号する方法を切り替える手段
（図９の28）と、前記特徴パラメータの中で、前記復号
信号のスペクトル包絡特性を表す特徴パラメータを時間
方向に平滑化する手段（図１の64）と、平滑化した特徴
パラメータを用いて復号処理を行なう手段（図１の56、
53、58、61及び68）とを備えている。BEST MODE FOR CARRYING OUT THE INVENTION Embodiments of the present invention will be described. A speech decoding apparatus according to the first embodiment of the present invention provides a method of decoding a signal from a characteristic parameter of the decoded signal according to the discrimination information in each frame whether the decoded signal is a voice section or a non-voice section. Switching means (28 in FIG. 9), means for smoothing the characteristic parameter representing the spectral envelope characteristic of the decoded signal in the characteristic parameter in the time direction (64 in FIG. 1), and the smoothed characteristic parameter Means for performing a decoding process by using (56 in FIG. 1,
53, 58, 61 and 68).

【００６４】本発明の音声復号装置は、第２の実施の形
態において、各フレームにおいて復号信号が音声区間で
あるか無音声区間であるかの判別に従い前記復号信号の
特徴パラメータから信号を復号する方法を切り替える手
段（図２の28）と、前記特徴パラメータの内少なくとも
一つ及び音声区間から無音声区間に切り替わってからの
時間経過に応じて、前記特徴パラメータの少なくとも一
つに関して時間方向に平滑化する手段（図２の36、図３
の49と51）と、この平滑化した特徴パラメータを用いて
復号処理を行なう手段（図３の56、53、58、61及び68）
とを備えている。The speech decoding apparatus of the present invention, in the second embodiment, decodes a signal from the characteristic parameter of the decoded signal according to whether the decoded signal in each frame is a speech section or a non-speech section. Means for switching the method (28 in FIG. 2), and smoothing in time direction with respect to at least one of the characteristic parameters according to at least one of the characteristic parameters and time elapsed after switching from the voice section to the non-voice section. Means for converting (36 in FIG. 2 and FIG. 3)
49 and 51) and means for performing a decoding process using the smoothed characteristic parameters (56, 53, 58, 61 and 68 in FIG. 3).
It has and.

【００６５】本発明の音声復号装置は、第３の実施の形
態において、各フレームにおいて復号信号が音声区間で
あるか無音声区間であるかの判別に従い前記復号信号の
特徴パラメータから信号を復号する方法を切り替える手
段（図２の28）と、音声区間から無音声区間に切り替わ
った直後で前記特徴パラメータが予め定めた条件を満た
す区間では伝送された特徴パラメータの少なくとも一つ
を直接使用し、それ以降は前記特徴パラメータの内少な
くとも一つに関して時間方向に平滑化した値を生成する
手段（図２の３６、図３の４９と５１）、前記平滑化し
た値を用いて復号処理を行なう手段（図３の56、53、5
8、61及び68）とを備えている。The speech decoding apparatus of the present invention, in the third embodiment, decodes a signal from a characteristic parameter of the decoded signal in accordance with the determination of whether the decoded signal is a speech section or a non-speech section in each frame. Means for switching the method (28 in FIG. 2) and directly using at least one of the transmitted characteristic parameters in the section in which the characteristic parameters satisfy a predetermined condition immediately after switching from the voice section to the non-voice section, After that, means for generating a value smoothed in the time direction with respect to at least one of the characteristic parameters (36 in FIG. 2, 49 and 51 in FIG. 3), and means for performing a decoding process using the smoothed value ( 56, 53, 5 in FIG.
8, 61 and 68).

【００６６】本発明の音声復号装置は、第４の実施の形
態において、各フレームにおいて復号信号が音声区間で
あるか無音声区間であるかの判別に従い前記復号信号に
対応する特徴パラメータから信号を復号する方法を切り
替える手段（図４の28）と、無音声区間の信号を複数種
類の信号から成る励振信号を合成フィルタに入力するこ
とにより生成する手段（図５の56、53、58、60、68）
と、受信した特徴パラメータの少なくとも一つに基づき
前記無音声区間における前記複数種類の信号を加算する
際の係数を決定する手段（図４の38）とを備えている。According to the fourth embodiment of the speech decoding apparatus of the present invention, a signal is extracted from a characteristic parameter corresponding to the decoded signal according to whether the decoded signal in each frame is a speech section or a non-speech section. A means for switching the decoding method (28 in FIG. 4) and a means for generating a signal in the non-voice section by inputting an excitation signal composed of a plurality of kinds of signals to a synthesis filter (56, 53, 58, 60 in FIG. 5). , 68)
And a means (38 in FIG. 4) for determining a coefficient for adding the plurality of types of signals in the non-voice section based on at least one of the received characteristic parameters.

【００６７】本発明の音声復号装置は、第５の実施の形
態において、各フレームにおいて復号信号が音声区間で
あるか無音声区間であるかの判別に従い前記復号信号に
対応する特徴パラメータから信号を復号する方法を切り
替える手段（図６の28）と、無音声区間の信号を複数種
類の信号から成る励振信号を合成フィルタに入力するこ
とにより生成する手段（図７の56、53、58、60、68）
と、受信した特徴パラメータの時間方向に平滑化した平
滑化パラメータを計算する手段（図７の64と50）と計算
した平滑化パラメータの少なくとも一つに基づき前記無
音声区間における前記複数種類の信号を加算する際の係
数を決定する手段（図６の38）とを備えている。According to the fifth embodiment of the speech decoding apparatus of the present invention, a signal is extracted from a characteristic parameter corresponding to the decoded signal according to whether the decoded signal in each frame is a speech section or a non-speech section. A means for switching the decoding method (28 in FIG. 6) and a means for generating a signal in the non-voice section by inputting an excitation signal composed of a plurality of kinds of signals to a synthesis filter (56, 53, 58, 60 in FIG. 7). , 68)
And means for calculating a smoothing parameter smoothed in the time direction of the received characteristic parameter (64 and 50 in FIG. 7) and the plurality of types of signals in the non-voice section based on at least one of the calculated smoothing parameters. And a means (38 in FIG. 6) for deciding the coefficient when adding.

【００６８】本発明の音声復号装置は、第６の実施の形
態において、前記特徴パラメータが前記復号信号に対応
するスペクトル包絡を表す量とパワーを表す量の少なく
とも一つを含む。In the sixth embodiment of the speech decoding apparatus of the present invention, the characteristic parameter includes at least one of a quantity representing a spectrum envelope corresponding to the decoded signal and a quantity representing power.

【００６９】本発明の符号化・復号装置は、その好まし
い実施の形態において、各フレームにおいて入力信号が
音声区間であるか無音声区間であるかの判別を行い前記
入力信号の特徴パラメータを符号化する手段（図８参
照）と、前記した第１乃至第６の実施の形態の音声復号
装置を有する。In a preferred embodiment of the encoding / decoding apparatus of the present invention, it is determined in each frame whether the input signal is a voice section or a non-voice section, and the characteristic parameters of the input signal are encoded. Means (see FIG. 8) and the speech decoding apparatus according to the first to sixth embodiments.

【００７０】本発明の実施の形態について動作・原理に
ついて以下に説明する。The operation and principle of the embodiment of the present invention will be described below.

【００７１】本発明においては、音声復号装置におい
て、無音声区間を復号する際に、間欠的に伝送されるフ
ィルタ係数を、RMSと同様に平滑化処理した後に、合成
フィルタで使用する。これにより、間欠的に伝送してい
ることにより生じるフィルタ係数が不連続に変化するこ
とを防ぐことができ、その結果、復号音質を改善でき
る。In the present invention, when decoding a non-voice section in the speech decoding apparatus, the filter coefficient intermittently transmitted is smoothed in the same manner as RMS and then used in the synthesis filter. By this means, it is possible to prevent the filter coefficients that occur due to intermittent transmission from changing discontinuously, and as a result, it is possible to improve decoded sound quality.

【００７２】音声復号装置において、無音声区間で平滑
化されたフィルタ係数やRMSを用いる場合、平滑化処理
により過去のフレームで伝送されたフィルタ係数やRMS
の影響を受けることになる。When the filter coefficient or RMS smoothed in the non-voice section is used in the speech decoding apparatus, the filter coefficient or RMS transmitted in the past frame by the smoothing process.
Will be affected.

【００７３】無音声区間の先頭区間の信号には、直前の
有音声区間の特性が含まれているため、この区間で平滑
化処理を行なうことにより、その区間の特性を含んだ特
徴パラメータを用いて復号することになる。その結果、
復号信号の波形振幅が実際より大きくなったり、復号信
号がエコーを含む等の復号音声の劣化が生じることがあ
る。Since the signal in the head section of the non-voice section includes the characteristic of the immediately preceding voice section, the smoothing process is performed in this section to use the characteristic parameter including the characteristic of the section. Will be decrypted. as a result,
The waveform amplitude of the decoded signal may become larger than it actually is, or the decoded signal may include echoes, etc., resulting in deterioration of the decoded speech.

【００７４】これを防ぐために、音声区間から無音声区
間に入ってからの一定時間や一定フレーム数や、復号さ
れた特徴パラメータが予め定めた条件を満たす場合、例
えば、振幅を表すRMSが予め定めた値より未だ大きい場
合は平滑化を行なわないように、平滑化係数を設定す
る。これにより、先頭区間において平滑化により生ず
る、直前の有音声区間からの影響を削減することができ
る。In order to prevent this, when the fixed time after the voice section enters the non-voice section, the fixed number of frames, and the decoded characteristic parameters satisfy the predetermined conditions, for example, the RMS representing the amplitude is predetermined. The smoothing coefficient is set so that smoothing is not performed when the value is still larger than the above value. As a result, it is possible to reduce the influence from the immediately preceding voiced section, which is caused by the smoothing in the head section.

【００７５】入力信号に重畳した背景雑音の種類によっ
ては、音声部復号回路で復号される信号に含まれる背景
雑音と、無音声復号回路で復号される信号に聴覚的な差
が生じる場合がある。これは、無音声復号回路で、合成
フィルタの励振信号の加算割合を、そのRMSが伝送され
たRMSの平滑化値と同じになるという条件のみで計算し
ているためである。Depending on the type of background noise superimposed on the input signal, an audible difference may occur between the background noise included in the signal decoded by the speech decoding circuit and the signal decoded by the non-voice decoding circuit. . This is because the speechless decoding circuit calculates the addition ratio of the excitation signal of the synthesis filter only under the condition that the RMS becomes the same as the smoothed value of the transmitted RMS.

【００７６】本発明においては、この加算割合を、入力
信号の性質を考慮して決定することにより、前記聴覚的
な差による復号音質の劣化を削減することができる。考
慮の仕方としては、例えば、平均RMSが小さい時は主に
乱数的な雑音を使用し、平均RMSが大きい時、あるいは
フィルタ係数から計算したスペクトルが平坦でない場合
は、主にパルス性信号あるいはピッチ信号を使用する。In the present invention, by determining the addition ratio in consideration of the characteristics of the input signal, it is possible to reduce the deterioration of the decoded sound quality due to the auditory difference. As a method of consideration, for example, when the average RMS is small, mainly random noise is used, and when the average RMS is large, or when the spectrum calculated from the filter coefficient is not flat, it is mainly a pulse signal or pitch. Use signals.

【００７７】[0077]

【実施例】上記した本発明の実施の形態についてさらに
詳細に説明すべく、本発明の実施例について図面を参照
して以下に説明する。以下に説明する本発明の実施例に
おける符号化装置は、その基本構成が図８に示したもの
と同一のものが用いられる。また本発明の一実施例にお
ける復号装置の基本構成は、図９に示したものと同一と
される。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS In order to describe the embodiment of the present invention described above in more detail, an embodiment of the present invention will be described below with reference to the drawings. The encoding apparatus in the embodiment of the present invention described below has the same basic configuration as that shown in FIG. The basic configuration of the decoding device according to the embodiment of the present invention is the same as that shown in FIG.

【００７８】図１は、本発明の第１の実施例の復号装置
における無音声部復号回路の構成を示すブロック図であ
る。図１を参照すると、本発明の第１の実施例における
無音声部復号回路が、図１０に示した無音声部復号回路
34と相違する点は、平滑化回路64をさらに備えているこ
とである。以下では、主に従来の装置との相違点につい
て説明し、同一部分の説明は適宜省略する。FIG. 1 is a block diagram showing the structure of a voiceless decoding circuit in a decoding apparatus according to the first embodiment of the present invention. Referring to FIG. 1, the voiceless portion decoding circuit according to the first embodiment of the present invention is the voiceless portion decoding circuit shown in FIG.
The difference from 34 is that a smoothing circuit 64 is further provided. In the following, differences from the conventional device will be mainly described, and description of the same parts will be appropriately omitted.

【００７９】パラメータ復号回路54は、入力端子52から
入力した信号符号列から求めたフィルタ係数とRMSをそ
れぞれ平滑化回路64と平滑化回路66に渡す。The parameter decoding circuit 54 passes the filter coefficient and RMS obtained from the signal code string input from the input terminal 52 to the smoothing circuit 64 and the smoothing circuit 66, respectively.

【００８０】平滑化回路64は、パラメータ復号回路54か
ら渡されたフィルタ係数を平滑化し、合成回路68に渡
す。但し、入力端子50から入力されたDTX判定符号で信
号符号列が伝送されないことが示された場合は、前フレ
ームのフィルタ係数を用いて平滑化を行なう。The smoothing circuit 64 smoothes the filter coefficient passed from the parameter decoding circuit 54 and passes it to the synthesizing circuit 68. However, when the DTX determination code input from the input terminal 50 indicates that the signal code string is not transmitted, smoothing is performed using the filter coefficient of the previous frame.

【００８１】各無音声区間中の先頭から数えてnフレー
ム目で使用する平滑化フィルタ係数F(n,i),(i=1,...,M)
は、nフレーム目に入力されたフィルタ係数 f(n,i),(i
=1,...,M)を用いて次式（４）で計算する。但し、何も
伝送されてこないフレームでは、f(n,i)の代わりに直前
に伝送されたフィルタ係数を用いて次式（４）を計算す
る。Smoothing filter coefficients F (n, i), (i = 1, ..., M) used in the nth frame counting from the beginning in each unvoiced section
Is the filter coefficient f (n, i), (i
= 1, ..., M) is calculated by the following equation (4). However, in a frame in which nothing is transmitted, the following equation (4) is calculated using the filter coefficient transmitted immediately before instead of f (n, i).

【００８２】 F(n,i) = (1−β)F(n−1,i) + βf(n,i) …（4）[0082] F (n, i) = (1−β) F (n−1, i) + βf (n, i)… (4)

【００８３】ここで、βは平滑化の程度を決定する平滑
化係数である。また、F(−1,i)=0,(i=1,...,M)である。Here, β is a smoothing coefficient that determines the degree of smoothing. Also, F (−1, i) = 0, (i = 1, ..., M).

【００８４】Mはフィルタの次数である。合成回路68
は、混合回路61から渡される励振信号を、平滑回路64か
ら渡されるフィルタ係数で構成するフィルタに入力する
ことにより、信号を復号し、出力端子70から出力する。M is the order of the filter. Synthesis circuit 68
Inputs the excitation signal passed from the mixing circuit 61 into a filter constituted by the filter coefficients passed from the smoothing circuit 64, thereby decoding the signal and outputting it from the output terminal 70.

【００８５】図２は、本発明の第２の実施例における復
号装置の構成を示す図である。本発明の第２の実施例
が、図９に示した従来の復号装置と相違する点は、無音
声部復号回路35の構成が相違することと、平滑化制御回
路36を備えていることである。以下では、主に従来の装
置との相違点について説明し、同一部分の説明は適宜省
略する。FIG. 2 is a diagram showing the configuration of a decoding device according to the second embodiment of the present invention. The second embodiment of the present invention is different from the conventional decoding device shown in FIG. 9 in that the structure of the non-voice part decoding circuit 35 is different and that the smoothing control circuit 36 is provided. is there. In the following, differences from the conventional device will be mainly described, and description of the same parts will be appropriately omitted.

【００８６】ビット列分解回路26は、入力端子24から入
力したビット列をVAD判定符号、DTX判定符号及び信号符
号列に分解し、VAD判定符号を平滑化制御回路36と切り
替え回路28に渡し、信号符号列を切り替え回路28に渡
し、DTX判定符号を無音声部復号回路35に渡す。The bit string decomposition circuit 26 decomposes the bit string input from the input terminal 24 into a VAD judgment code, a DTX judgment code and a signal code string, passes the VAD judgment code to the smoothing control circuit 36 and the switching circuit 28, and outputs the signal code. The column is passed to the switching circuit 28, and the DTX determination code is passed to the voiceless section decoding circuit 35.

【００８７】切り替え回路28は、ビット列分解回路26か
ら渡されたVAD判定符号で入力信号が音声区間とされた
場合はビット列分解回路26から渡された信号符号列を音
声部復号回路30に渡し、VAD判定符号で入力信号が無音
声区間とされた場合は無音声部復号回路35に渡す。The switching circuit 28 passes the signal code string passed from the bit string decomposing circuit 26 to the audio part decoding circuit 30 when the VAD decision code passed from the bit string decomposing circuit 26 indicates that the input signal is in the voice section. If the VAD decision code indicates that the input signal is in the non-voice section, it is passed to the non-voice section decoding circuit.

【００８８】平滑化制御回路36は、ビット列分解回路26
から渡されるVAD判定符号の変化に応じた平滑化係数α
(n)とβ(n)を無音声部復号回路35に渡す。ここで nは
各無音声区間中の先頭から数えたフレーム番号である。The smoothing control circuit 36 includes a bit string decomposition circuit 26.
Smoothing coefficient α according to the change of VAD judgment code passed from
(n) and β (n) are passed to the voiceless decoding circuit 35. Here, n is the frame number counted from the beginning in each non-voice section.

【００８９】例えば、VAD判定符号が無音声区間をある
ことを示す場合、最初の特定フレーム数又は特定時間長
で、平滑化係数α(n)とβ(n)を1とすることにより、無
音声区間における先頭部分に残っている直前の有音声部
による影響を除去することができる。また、同様に伝送
されたフィルタ係数やRMS等が特定の条件を満たす間、
平滑化係数α(n)とβ(n)を1とすることにより、無音声
区間における先頭部分に残っている直前の有音声部によ
る影響を除去することができる。条件の例としては、RM
Sが直前の有音声区間の影響を受けていることを検出す
るための方法として、「RMSが予め定めた閾値以上であ
る」又は「RMSとその無音区間における先頭サブフレー
ムのRMSとが予め定めた閾値以下である」がある。ま
た、フィルタ係数が音声区間の平均スペクトルに類似し
ていることを検出するために、「フィルタ係数が予め定
めた標準フィルタ係数との距離（例えば二乗距離）が予
め定めた閾値以下である」等がある。For example, when the VAD decision code indicates that there is a non-voice section, by setting the smoothing coefficients α (n) and β (n) to 1 at the initial specific frame number or specific time length, It is possible to remove the influence of the immediately preceding voiced portion remaining in the head portion of the voice section. Similarly, while the transmitted filter coefficient, RMS, etc. satisfy certain conditions,
By setting the smoothing coefficients α (n) and β (n) to 1, it is possible to remove the influence of the immediately preceding voiced portion remaining in the head portion in the non-voiced section. As an example of the condition, RM
As a method for detecting that S is affected by the immediately preceding voiced section, "RMS is greater than or equal to a predetermined threshold" or "RMS and RMS of the first subframe in the silent section are predetermined. Is less than or equal to the threshold. " Further, in order to detect that the filter coefficient is similar to the average spectrum of the voice section, "the distance between the filter coefficient and the standard filter coefficient that is set in advance (for example, the squared distance) is equal to or less than a predetermined threshold value", etc. There is.

【００９０】更に、直前の音声区間の長さが一定フレー
ム数あるいは一定時間長よりも短い場合は、その音声区
間の直前の無音声区間と入力信号の性質が類似している
と考えて、フィルタ係数とRMSの平滑化値を計算する時
の初期値Ｐ(-1)、F(−1,i),(i=1,...,M)として、直前の
無音声区間の最終フレームでの平滑化値を用いることが
できる。Further, when the length of the immediately preceding voice section is shorter than the fixed number of frames or the fixed time length, it is considered that the characteristics of the input signal are similar to those of the non-voice section immediately before the voice section, and the filter is used. As the initial values P (-1), F (-1, i), (i = 1, ..., M) when calculating the smoothed value of the coefficient and RMS, in the last frame of the immediately preceding non-voice section. The smoothed value of can be used.

【００９１】無音声部復号回路35は、平滑化制御回路36
から渡された平滑化係数α(n)とβ(n)、ビット列分解回
路26から渡されたDTX判定符号、及び切り替え回路28か
ら渡された信号符号列を用いて無音声区間の信号を復号
し、出力端子32から出力する。The non-voice part decoding circuit 35 includes a smoothing control circuit 36.
Decodes the signal in the non-voice section using the smoothing coefficients α (n) and β (n) passed from the DTX decision code passed from the bit string decomposition circuit 26, and the signal code string passed from the switching circuit 28. Output from the output terminal 32.

【００９２】図３は、本発明の第２の実施例における無
音声部復号回路35の構成を示す図である。本発明の第２
の実施例が、前記第１の実施例における無音声部復号回
路と相違する点は、平滑化回路49と平滑化回路51の構成
である。FIG. 3 is a diagram showing the configuration of the non-voice part decoding circuit 35 in the second embodiment of the present invention. Second of the present invention
The embodiment is different from the non-voice part decoding circuit in the first embodiment in the configuration of the smoothing circuit 49 and the smoothing circuit 51.

【００９３】パラメータ復号回路54は、入力端子52で入
力した信号符号列から求めたフィルタ係数とRMSをそれ
ぞれ平滑化回路49と平滑化回路51に渡す。The parameter decoding circuit 54 passes the filter coefficient and RMS obtained from the signal code string input at the input terminal 52 to the smoothing circuit 49 and the smoothing circuit 51, respectively.

【００９４】平滑化回路49は、パラメータ復号回路54か
ら渡されたフィルタ係数を、入力端子65から入力した平
滑化係数β(n)を用いて平滑化し、合成回路68に渡す。
但し、入力端子50から入力されたDTX判定符号で信号符
号列が伝送されないことが示された場合は、前フレーム
のフィルタ係数を繰り返し使用する。The smoothing circuit 49 smoothes the filter coefficient passed from the parameter decoding circuit 54 using the smoothing coefficient β (n) input from the input terminal 65, and passes it to the synthesizing circuit 68.
However, when the DTX determination code input from the input terminal 50 indicates that the signal code string is not transmitted, the filter coefficient of the previous frame is repeatedly used.

【００９５】各無音声区間中の先頭から数えてnフレー
ム目で使用する平滑化フィルタ係数F(n,i),(i=1,...,M)
は、nフレーム目に入力されたフィルタ係数 f(n,i),(i
=1,...,M)を用いて、上式（4）と同様の次式（５）で計
算する。Smoothing filter coefficients F (n, i), (i = 1, ..., M) used in the nth frame counting from the beginning in each non-voice section
Is the filter coefficient f (n, i), (i
= 1, ..., M) is used to calculate by the following equation (5) similar to the above equation (4).

【００９６】 F(n,i) = (1−β(n))・F(n−1,i) + β(n)・f(n,i) …（5）ここで、β(n)は、各無音声区間中の先頭からの経過フ
レーム数に応じて変化する値であり、経過フレーム数が
少ない時には過去のフレームからの影響を忘却するよう
に1付近の値を取る。例えば、β(1)=β(2)=1.0、β(3)=
β(4)=… =β(L)= 0.7 とすることできる。Lは各無音
声区間のフレーム数である。F (n, i) = (1−β (n)) · F (n−1, i) + β (n) · f (n, i) (5) where β (n) Is a value that changes according to the number of frames that have elapsed from the beginning in each unvoiced section, and takes a value near 1 so that the effect from past frames is forgotten when the number of elapsed frames is small. For example, β (1) = β (2) = 1.0, β (3) =
β (4) =… = β (L) = 0.7. L is the number of frames in each non-voice section.

【００９７】平滑化回路51は、パラメータ復号回路54か
ら渡されたRMSを平滑化し、混合回路61に渡す。但し、
入力端子50から入力されたDTX判定符号で信号符号列が
伝送されないことが示された場合は、直前に伝送された
RMSを用いて平滑化を行なう。各無音声区間中の先頭か
ら数えてnフレーム目で使用する平滑化RMS P(n)は、n
フレーム目に入力されたRMS p(n) を用いて、上式（1）
と同様の次式（６）で計算する。The smoothing circuit 51 smoothes the RMS passed from the parameter decoding circuit 54 and passes it to the mixing circuit 61. However,
If the DTX judgment code input from the input terminal 50 indicates that the signal code string is not transmitted, it was transmitted immediately before.
Smooth using RMS. The smoothed RMS P (n) used in the nth frame counting from the beginning in each unvoiced section is n
Using RMS p (n) input in the frame, the above equation (1)
It is calculated by the following equation (6) similar to.

【００９８】 P(n) = (1−α(n))・P(n−1) + α(n)・p(n) …（6）ここで、α(ｎ)は、β(n)と同様に、各無音声区間中の
先頭からの経過フレーム数に応じて変化する値であり、
経過フレーム数が少ない時には過去のフレームからの影
響を忘却するように1付近の値を取る。例えば、α(1)=
α(2)=1.0、α(3)=α(4)=… =α(L)= 0.7 とすることで
きる。Lは各無音声区間のフレーム数である。P (n) = (1−α (n)) · P (n−1) + α (n) · p (n) (6) where α (n) is β (n) Similarly, is a value that changes according to the number of frames that have elapsed from the beginning in each non-voice section,
When the number of elapsed frames is small, a value near 1 is set so as to forget the influence of past frames. For example, α (1) =
α (2) = 1.0, α (3) = α (4) = ... = α (L) = 0.7. L is the number of frames in each non-voice section.

【００９９】なお、平滑化回路49と平滑化回路51の処理
のいずれか一方の処理のみを行なうこともできる。その
場合は、パラメータ復号回路54から受け渡されるフィル
タ係数あるいはRMSを、直接合成回路68又は混合回路61
に渡すことになる。It is also possible to perform only one of the smoothing circuit 49 and the smoothing circuit 51. In that case, the filter coefficient or RMS passed from the parameter decoding circuit 54 is directly combined with the synthesis circuit 68 or the mixing circuit 61.
Will be passed to.

【０１００】混合回路61では、平滑化回路51から渡され
る平滑化RMSを用いて、乱数回路56から渡された乱数信
号r(i)とパルス回路53から渡されたパルス列信号p(i)と
ピッチ回路58から渡されたピッチ信号q(i)との線形和処
理を行なうことにより、合成フィルタの励振信号x(i)を
計算し、合成回路68に渡す。The mixing circuit 61 uses the smoothed RMS passed from the smoothing circuit 51 to generate the random number signal r (i) passed from the random number circuit 56 and the pulse train signal p (i) passed from the pulse circuit 53. The excitation signal x (i) of the synthesis filter is calculated by performing the linear sum processing with the pitch signal q (i) passed from the pitch circuit 58, and passed to the synthesis circuit 68.

【０１０１】合成回路68は、混合回路61から渡される励
振信号を、平滑化回路49から渡されるフィルタ係数で構
成するフィルタに入力することにより、信号を復号し、
出力端子70から出力する。The synthesizing circuit 68 decodes the signal by inputting the excitation signal passed from the mixing circuit 61 to the filter constituted by the filter coefficient passed from the smoothing circuit 49,
Output from the output terminal 70.

【０１０２】図４は、本発明の第３の実施例における復
号装置の構成を示す図である。本発明の第３の実施例の
復号装置が、従来の復号装置と相違する点は、無音声部
検定回路38と無音声部復号回路37である。FIG. 4 is a diagram showing the structure of a decoding device according to the third embodiment of the present invention. The decoding device of the third embodiment of the present invention is different from the conventional decoding device in the non-voice part verification circuit 38 and the non-voice part decoding circuit 37.

【０１０３】ビット列分解回路26は、入力端子24から入
力したビット列をVAD判定符号とDTX判定符号及び信号符
号列に分解し、VAD判定符号と信号符号列を切り替え回
路28に渡し、DTX判定符号を無音声部復号回路37に渡
す。The bit string decomposition circuit 26 decomposes the bit string input from the input terminal 24 into a VAD judgment code, a DTX judgment code and a signal code string, passes the VAD judgment code and the signal code string to the switching circuit 28, and outputs the DTX judgment code. It is passed to the non-voice part decoding circuit 37.

【０１０４】切り替え回路28は、ビット列分解回路26か
ら渡された信号符号列を、ビット列分解回路26から渡さ
れたVAD判定符号で入力信号が音声区間とされた場合に
は音声部復号回路30に渡し、VAD判定符号で入力信号が
無音声区間とされた場合には無音声部復号回路37に渡
す。The switching circuit 28 sends the signal code string passed from the bit string decomposing circuit 26 to the audio part decoding circuit 30 when the VAD decision code passed from the bit string decomposing circuit 26 determines that the input signal is a voice section. When the VAD decision code indicates that the input signal is in the non-voice section, it is passed to the non-voice section decoding circuit 37.

【０１０５】無音声部検定回路38は、無音声部復号回路
37から渡されたフィルタ係数とRMSを用いて、図５にお
ける混合回路62で用いる線形和の結合係数を調整する設
定パラメータを決定し、無音声部復号回路37に渡す。こ
の調整パラメータの計算に関しては、混合回路62での処
理と合わせて後述する。The voiceless section verification circuit 38 is a voiceless section decoding circuit.
Using the filter coefficient and RMS passed from 37, the setting parameter for adjusting the combination coefficient of the linear sum used in the mixing circuit 62 in FIG. 5 is determined and passed to the non-voice section decoding circuit 37. The calculation of this adjustment parameter will be described later together with the processing in the mixing circuit 62.

【０１０６】無音声部復号回路37は、ビット列分解回路
26から渡されたDTX判定符号、及び切り替え回路28から
渡された信号符号列を用いて無音声区間の信号を復号
し、出力端子32から出力する。The non-voice part decoding circuit 37 is a bit string decomposition circuit.
The signal in the non-voice section is decoded using the DTX determination code passed from 26 and the signal code string passed from the switching circuit 28, and output from the output terminal 32.

【０１０７】図５は、本発明の第３の実施例における無
音声部復号回路37の構成を示す図である。本発明の第３
の実施例における無音声部復号回路37が、前記第１の実
施例における無音声部復号回路35と相違する点は、混合
回路62及びパラメータ復号回路54の出力先である。以下
では、主に従来の装置との相違点について説明し、同一
部分の説明は適宜省略する。FIG. 5 is a diagram showing the configuration of the non-voice part decoding circuit 37 in the third embodiment of the present invention. Third of the present invention
The non-voice part decoding circuit 37 in this embodiment differs from the non-voice part decoding circuit 35 in the first embodiment in the output destinations of the mixing circuit 62 and the parameter decoding circuit 54. In the following, differences from the conventional device will be mainly described, and description of the same parts will be appropriately omitted.

【０１０８】パラメータ復号回路54は、入力端子52で入
力した信号符号列からフィルタ係数とRMSを求め、フィ
ルタ係数を平滑化回路64と出力端子23に渡し、RMSを平
滑化回路66と出力端子25に渡す。The parameter decoding circuit 54 obtains the filter coefficient and RMS from the signal code string input at the input terminal 52, passes the filter coefficient to the smoothing circuit 64 and the output terminal 23, and transfers the RMS to the smoothing circuit 66 and the output terminal 25. Pass to.

【０１０９】平滑化回路66は、パラメータ復号回路54か
ら渡されたRMSを平滑化し、混合回路62に渡す。但し、
入力端子50から入力されたDTX判定符号で信号符号列が
伝送されないことが示された場合は、直前に伝送された
RMSを用いて平滑化を行なう。また、この場合、平滑化
の係数α(n)やβ(n)を零とすることで平滑化したRMSを
更新しないように制御することもできる。The smoothing circuit 66 smoothes the RMS passed from the parameter decoding circuit 54 and passes it to the mixing circuit 62. However,
If the DTX judgment code input from the input terminal 50 indicates that the signal code string is not transmitted, it was transmitted immediately before.
Smooth using RMS. In this case, the smoothing coefficient α (n) or β (n) can be set to zero so that the smoothed RMS is not updated.

【０１１０】乱数回路56は、乱数を生成し、混合回路62
に渡す。The random number circuit 56 generates a random number, and the mixing circuit 62
Pass to.

【０１１１】パルス回路53は、乱数で生成した位置と振
幅を持つパルスから成るパルス列信号を生成し、混合回
路62に渡す。ピッチ回路58は、前述の適応コードベクト
ルからなるピッチ信号を生成し、混合回路62に渡す。The pulse circuit 53 generates a pulse train signal composed of pulses having a position and amplitude generated by random numbers, and passes the pulse train signal to the mixing circuit 62. The pitch circuit 58 generates a pitch signal composed of the above-mentioned adaptive code vector and passes it to the mixing circuit 62.

【０１１２】混合回路62は、入力端子60から入力した設
定パラメータと平滑化回路66から渡された平滑化RMSを
用いて、前述の線形和の結合係数を計算する。The mixing circuit 62 uses the setting parameter input from the input terminal 60 and the smoothing RMS passed from the smoothing circuit 66 to calculate the above-described linear sum coupling coefficient.

【０１１３】また、この結合係数を用いて、乱数回路56
から渡された乱数信号とパルス回路53から渡されたパル
ス列信号とピッチ回路53から渡されたピッチ信号との線
形和信号を計算し、合成回路68に渡す。Further, by using this coupling coefficient, the random number circuit 56
The linear sum signal of the random number signal passed from the pulse circuit 53, the pulse train signal passed from the pulse circuit 53, and the pitch signal passed from the pitch circuit 53 is calculated and passed to the synthesis circuit 68.

【０１１４】合成回路68は、混合回路62から渡される励
振信号を、平滑化回路64から渡されるフィルタ係数で構
成するフィルタに入力することにより、信号を復号し、
出力端子70から出力する。The synthesizing circuit 68 decodes the signal by inputting the excitation signal passed from the mixing circuit 62 to the filter constituted by the filter coefficient passed from the smoothing circuit 64,
Output from the output terminal 70.

【０１１５】無音声部検定回路38と混合回路62について
説明する。The voiceless section verification circuit 38 and the mixing circuit 62 will be described.

【０１１６】無音声部検定回路38において無音声部にお
ける背景雑音の性質を決定し、この性質に従って、混合
回路62におけるピッチ信号、パルス列信号及び乱数信号
の結合係数の計算方法を変更する。変更する設定パラメ
ータとしては、結合係数を決定する順番や、結合係数γ
がある。The nature of background noise in the non-voice part is determined in the non-voice part verification circuit 38, and the calculation method of the coupling coefficient of the pitch signal, the pulse train signal and the random number signal in the mixing circuit 62 is changed according to this property. The setting parameters to be changed include the order of determining the coupling coefficient and the coupling coefficient γ.
There is.

【０１１７】無音性部検定回路38が、無音声部における
背景雑音の性質を検定するための情報としては、例え
ば、RMSとフィルタ係数がある。The information used by the silence section test circuit 38 to test the nature of the background noise in the unvoiced section is, for example, RMS and a filter coefficient.

【０１１８】この情報から前記設定パラメータを操作す
る方法として、例えば、前記RMSが予め定めた閾値より
も小さく、背景雑音がないと見なした場合や、フィルタ
係数から計算した入力信号のスペクトル傾きが平坦な白
色雑音と見なした場合は、乱数信号の寄与を大きくする
方法がある。これは、結合係数の計算順番はそのままで
γを小さくすることと等価である。As a method of operating the setting parameter from this information, for example, when the RMS is smaller than a predetermined threshold value and it is considered that there is no background noise, or when the spectral slope of the input signal calculated from the filter coefficient is If it is regarded as flat white noise, there is a method of increasing the contribution of the random number signal. This is equivalent to reducing γ while keeping the calculation order of the coupling coefficient.

【０１１９】なお、この無音声信号の設定パラメータを
信号符号列に含めて伝送することもできる。The setting parameter of the non-voice signal can be included in the signal code string for transmission.

【０１２０】図６は、本発明の第４の実施例における復
号装置の構成を示す図である。本発明の第４の実施例に
おける復号装置が、前記第２の実施例における復号装置
と相違する点は、無音声部検定回路38と無音声部復号回
路39である。FIG. 6 is a block diagram showing the arrangement of the decoding apparatus according to the fourth embodiment of the present invention. The decoding device according to the fourth embodiment of the present invention is different from the decoding device according to the second embodiment in a voiceless section verification circuit 38 and a voiceless section decoding circuit 39.

【０１２１】ビット列分解回路26は、入力端子24から入
力したビット列をVAD判定符号とDTX判定符号及び信号符
号列に分解し、VAD判定符号を平滑化制御回路36と切り
替え回路28に渡し、信号符号列を切り替え回路28に渡
し、DTX判定符号を無音声部復号回路39に渡す。The bit string decomposition circuit 26 decomposes the bit string input from the input terminal 24 into a VAD judgment code, a DTX judgment code and a signal code string, passes the VAD judgment code to the smoothing control circuit 36 and the switching circuit 28, and outputs the signal code. The column is passed to the switching circuit 28, and the DTX determination code is passed to the voiceless section decoding circuit 39.

【０１２２】切り替え回路28は、ビット列分解回路26か
ら渡されたVAD判定符号で入力信号が音声区間とされた
場合にはビット列分解回路26から渡された信号符号列を
音声部復号回路30に渡し、VAD判定符号で入力信号が無
音声区間とされた場合には無音声部復号回路39に渡す。
無音声部検定回路38と無音声部復号回路39に信号符号列
を渡す。The switching circuit 28 passes the signal code string passed from the bit string decomposition circuit 26 to the audio part decoding circuit 30 when the VAD decision code passed from the bit string decomposition circuit 26 indicates that the input signal is in the voice section. , If the input signal is in the voiceless section by the VAD determination code, it is passed to the voiceless section decoding circuit 39.
The signal code string is passed to the unvoiced part verification circuit 38 and the unvoiced part decoding circuit 39.

【０１２３】平滑化制御回路36は、ビット列分解回路26
から渡されるVAD判定符号の変化に応じた前記平滑化係
数α(n)とβ(n)を無音声部復号回路39に渡す。The smoothing control circuit 36 includes a bit string decomposition circuit 26.
The smoothing coefficients α (n) and β (n) corresponding to the change in the VAD determination code passed from are passed to the non-voice part decoding circuit 39.

【０１２４】無音声部検定回路38は、無音声部復号回路
39から渡された平滑化RMSを用いて、図７における混合
回路62で使用する線形和の結合係数を調整する設定パラ
メータを決定し、無音声部復号回路39に渡す。The voiceless section verification circuit 38 is a voiceless section decoding circuit.
Using the smoothed RMS passed from 39, the setting parameter for adjusting the combination coefficient of the linear sum used in the mixing circuit 62 in FIG. 7 is determined and passed to the non-voice part decoding circuit 39.

【０１２５】無音声部検定回路39での設定パラメータの
決定処理はRMSを平滑化RMSに置き換えることで、前述し
た無音声部検定回路38と同様の処理を適用できる。For the setting parameter determination processing in the voiceless section test circuit 39, the same processing as that of the voiceless section test circuit 38 described above can be applied by replacing RMS with smoothed RMS.

【０１２６】無音声部復号回路39は、ビット列分解回路
26から渡されたDTX判定符号、及び切り替え回路28から
渡された信号符号列、平滑化制御回路36から渡された平
滑化係数α(n)とβ(n)、及び無音声部検定回路38から渡
された設定パラメータを用いて無音声区間の信号を復号
し、出力端子32から出力する。The voiceless decoding circuit 39 is a bit string decomposition circuit.
26, the DTX determination code passed from 26, the signal code string passed from the switching circuit 28, the smoothing coefficients α (n) and β (n) passed from the smoothing control circuit 36, and the non-voice part test circuit 38. The signal in the non-voice section is decoded by using the setting parameter passed from and output from the output terminal 32.

【０１２７】また、図７の平滑化回路50で計算された平
滑化RMSと、平滑化回路64で計算された平滑化フィルタ
係数を無音声部検定回路38に渡す。Further, the smoothing RMS calculated by the smoothing circuit 50 of FIG. 7 and the smoothing filter coefficient calculated by the smoothing circuit 64 are passed to the non-voice part test circuit 38.

【０１２８】図７は、本発明の第４の実施例における無
音声部復号回路39の構成を示す図である。本発明の本発
明の第４の実施例における無音声部復号回路39が、前記
第2の実施例における無音声部復号回路と相違する点
は、平均化回路50と平滑化回路64からの出力が出力端子
69及び出力端子63から出力される構成とされていること
である。FIG. 7 is a diagram showing the structure of the non-voice part decoding circuit 39 in the fourth embodiment of the present invention. The voiceless portion decoding circuit 39 in the fourth embodiment of the present invention is different from the voiceless portion decoding circuit in the second embodiment in that the outputs from the averaging circuit 50 and the smoothing circuit 64. Is the output terminal
69 and the output terminal 63.

【０１２９】上記各実施例では、合成フィルタの励振信
号を計算する時にピッチ信号とパルス列信号と乱数信号
全てを用いているが、いずれかを省く構成としてもよ
い。In each of the above embodiments, the pitch signal, the pulse train signal and the random number signal are all used when calculating the excitation signal of the synthesizing filter, but any of them may be omitted.

【０１３０】本発明は、従来の技術の欄にて説明した符
号化装置とともに、掲題無線端末や無線基地局に搭載し
て、音声信号圧縮技術を用いた無線音声通信システムを
容易に構築することができる。また、既に説明した復号
方法を実行するためのプログラムをフロッピィディスク
等の記録媒体に格納しておき、スピーカー等が接続され
たパーソナルコンピュータにこのプログラムをロードす
ることにより、音声端末を構築することも容易にでき
る。The present invention, together with the encoding device described in the section of the prior art, can be installed in a subject wireless terminal or a wireless base station to easily construct a wireless audio communication system using audio signal compression technology. You can It is also possible to build a voice terminal by storing a program for executing the decoding method already described in a recording medium such as a floppy disk and loading the program on a personal computer to which a speaker or the like is connected. You can easily.

【０１３１】[0131]

【発明の効果】以上説明したように、本発明によれば下
記記載の効果を奏する。As described above, the present invention has the following effects.

【０１３２】本発明の第１の効果は、復号装置におい
て、無音声区間を復号する際に使用するフィルタ係数が
不連続に変化することによる、復号音質の劣化を低減す
る、ということである。The first effect of the present invention is to reduce the deterioration of the decoded sound quality due to the discontinuous change of the filter coefficient used when decoding the non-voice section in the decoding device.

【０１３３】その理由は、本発明においては、間欠的に
伝送されるフィルタ係数を平滑化処理した後に用いてい
るためである。The reason is that, in the present invention, the filter coefficient transmitted intermittently is used after being smoothed.

【０１３４】本発明の第２の効果は、復号装置におい
て、無音声区間の先頭部分で直前の有音声区間による影
響を受けることによる復号音質の劣化を低減する、とい
うことである。The second effect of the present invention is that the decoding apparatus reduces the deterioration of the decoded sound quality due to the influence of the immediately preceding voiced section at the beginning of the non-voiced section.

【０１３５】その理由は、本発明においては、無音声区
間の先頭部分では、特徴パラメータの平滑化処理を行な
わないように平滑化係数を設定している、ためである。The reason is that, in the present invention, the smoothing coefficient is set so that the smoothing process of the characteristic parameter is not performed in the head portion of the non-voice section.

【０１３６】本発明の第３の効果は、復号装置におい
て、音声区間と無音声区間の切り替わりにより生じる聴
覚的な不連続を低減する、ということである。A third effect of the present invention is that the decoding apparatus reduces the auditory discontinuity caused by switching between the voice section and the non-voice section.

【０１３７】その理由は、本発明においては、無音声区
間において再生フィルタの励振信号を生成する時に、乱
数成分に対するパルス成分とピッチ成分の比を入力信号
の性質に応じて変更するためである。The reason is that, in the present invention, when the excitation signal of the reproduction filter is generated in the non-voice section, the ratio of the pulse component to the pitch component to the random number component is changed according to the property of the input signal.

[Brief description of drawings]

【図１】本発明の第1の実施例における無音声部復号回
路の構成を示す図である。FIG. 1 is a diagram showing a configuration of a voiceless portion decoding circuit according to a first embodiment of the present invention.

【図２】本発明の第２の実施例における復号装置の構成
を示す図である。FIG. 2 is a diagram showing a configuration of a decoding device according to a second exemplary embodiment of the present invention.

【図３】本発明の第２の実施例における無音声部復号回
路の構成を示す図である。FIG. 3 is a diagram showing a configuration of a voiceless portion decoding circuit according to a second embodiment of the present invention.

【図４】本発明の第３の実施例における復号装置の構成
を示す図である。FIG. 4 is a diagram showing a configuration of a decoding device in a third exemplary embodiment of the present invention.

【図５】本発明の第３の実施例における無音声部復号回
路の構成を示す図である。FIG. 5 is a diagram showing a configuration of a voiceless portion decoding circuit according to a third embodiment of the present invention.

【図６】本発明の第４の実施例における復号装置の構成
を示す図である。FIG. 6 is a diagram showing a configuration of a decoding device according to a fourth exemplary embodiment of the present invention.

【図７】本発明の第４の実施例における無音声部復号回
路の構成を示す図である。FIG. 7 is a diagram showing a configuration of a voiceless section decoding circuit according to a fourth example of the present invention.

【図８】従来及び本発明の実施例に係る符号化装置の構
成を示す図である。[Fig. 8] Fig. 8 is a diagram illustrating a configuration of an encoding device according to a conventional example and an example of the present invention.

【図９】従来の復号装置の構成を示す図である。FIG. 9 is a diagram showing a configuration of a conventional decoding device.

【図１０】従来の復号装置における無音声部復号回路の
構成を示す図である。FIG. 10 is a diagram showing a configuration of a non-voice part decoding circuit in a conventional decoding device.

[Explanation of symbols]

１０、２４、５０、５２、６５、６７、６０、６３入
力端子１２音声部符号化回路１４無音声部符号化回路１８、２８切り替え回路２０ビット列生成回路２２、２３、２５、３２、７０出力端子２６ビット列分割回路３０音声部復号回路３４、３５、３７、３９無音声部復号回路３６平滑化制御回路３８無音声部検定回路４９、５１、６４、６６平滑化回路５３パルス回路５４パラメータ復号回路５６乱数回路５８ピッチ回路６１、６２混合回路６８合成回路10, 24, 50, 52, 65, 67, 60, 63 Input terminal 12 Voice part coding circuit 14 Non-voice part coding circuit 18, 28 Switching circuit 20 Bit string generation circuit 22, 23, 25, 32, 70 Output terminal 26 bit string division circuit 30 voice part decoding circuit 34, 35, 37, 39 voiceless part decoding circuit 36 smoothing control circuit 38 voiceless part verification circuit 49, 51, 64, 66 smoothing circuit 53 pulse circuit 54 parameter decoding circuit 56 Random number circuit 58 Pitch circuit 61, 62 Mixing circuit 68 Synthesizing circuit

───────────────────────────────────────────────────── フロントページの続き (56)参考文献特開平10−83200（ＪＰ，Ａ) 特開平10−39898（ＪＰ，Ａ) 特開平８−305398（ＪＰ，Ａ) 特開昭62−253200（ＪＰ，Ａ) 特開昭60−262200（ＪＰ，Ａ) 特開2000−267700（ＪＰ，Ａ) 特開2001−249698（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) G10L 11/02 G10L 13/00 G10L 19/00 - 19/14 H04B 14/04 H03M 7/30 ＪＩＣＳＴファイル（ＪＯＩＳ)─────────────────────────────────────────────────── ─── Continuation of the front page (56) References JP 10-83200 (JP, A) JP 10-39898 (JP, A) JP 8-305398 (JP, A) JP 62- 253200 (JP, A) JP 60-262200 (JP, A) JP 2000-267700 (JP, A) JP 2001-249698 (JP, A) (58) Fields investigated (Int. Cl. ⁷ , DB name) G10L 11/02 G10L 13/00 G10L 19/00-19/14 H04B 14/04 H03M 7/30 JISC file (JOIS)

Claims

(57) [Claims]

1. A voice decoding device for decoding a voice signal from a received characteristic parameter according to whether the voice signal is a voice section or a non-voice section. In at least a part of the section, a smoothed value obtained by weighted addition of a smoothed value calculated in the past from a characteristic parameter representing the spectral envelope characteristic of the decoded signal among the characteristic parameters and a characteristic parameter received in the current frame is used. A speech decoding apparatus characterized by using a means for decoding.

2. A voice decoding apparatus for decoding a voice signal from a received characteristic parameter according to whether the voice signal is a voice section or a voiceless section, wherein the decoding of the voice signal in the voiceless section is performed by the voiceless section. In at least a part of the section, weighting a and b of the smoothed value Xpre calculated in the past from the characteristic parameter representing the spectral envelope characteristic of the decoded signal among the characteristic parameters and the characteristic parameter Yn received in the current frame are used. A speech decoding apparatus characterized by using a means for decoding using a smoothed value Xn = aXpre + bYn obtained by the addition.

3. A speech decoding apparatus for decoding a signal from a received characteristic parameter according to whether the decoded signal is a speech section or a non-speech section, and a coefficient for smoothing at least one of the characteristic parameters. Is changed according to the time elapsed after switching from the voice section to the non-voice section, and at least one of the characteristic parameters is smoothed by using the changed coefficient value to obtain the voice signal of the non-voice section. A voice decoding device comprising a non-voice section decoder for decoding.

4. The non-voice section decoder uses at least one of the transmitted characteristic parameters as it is immediately after switching from the voice section to the non-voice section, and thereafter, at least one of the characteristic parameters. The speech decoding apparatus according to claim 3, wherein decoding is performed by using one of the smoothed characteristic parameters.

5. A speech decoding apparatus for decoding a signal from a received characteristic parameter according to whether the decoded signal is a speech section or a non-speech section, and a coefficient for smoothing at least one of the characteristic parameters. According to the characteristic parameter, and using the changed coefficient value, at least one of the characteristic parameters is smoothed to decode the speech signal in the non-speech section. A voice decoding device characterized by the above.

6. The non-voice section decoder uses at least one of the transmitted characteristic parameters as they are while the characteristic parameter satisfies a predetermined condition, and thereafter, at least one of the characteristic parameters is used. The speech decoding apparatus according to claim 5, wherein decoding is performed using one of the smoothed characteristic parameters.

7. A speech decoding apparatus for decoding a signal from a received characteristic parameter according to whether the decoded signal is a speech section or a non-speech section, and a coefficient for smoothing at least one of the characteristic parameters. According to the information indicating whether the characteristic parameter has been transmitted, using the changed coefficient value,
A speech decoding apparatus comprising: a speechless section decoder that smoothes at least one of the characteristic parameters to decode the speech signal in the speechless section.

8. The non-voice section decoder sets a coefficient for smoothing at least one of the characteristic parameters according to a time lapse after switching from a voice section to a non-voice section and the characteristic parameter. The speech decoding apparatus according to claim 3, wherein the speech decoding device is a speechless section decoder that changes and smoothes at least one of the characteristic parameters using the changed coefficient value, and decodes the signal in the speechless section.

9. The non-voice section decoder uses at least one of the transmitted characteristic parameters as it is, and in the following non-voice section, the time lapse after switching from the voice section to the non-voice section and the characteristics. 7. The speech decoding apparatus according to claim 4, wherein the speech decoding apparatus is a non-voice section decoder that decodes using a value obtained by smoothing at least one of the characteristic parameters according to at least one of the parameters.

10. The non-voice section decoder transmits the transmitted characteristic parameter immediately after the decoder switches from the voice section to the non-voice section and while the characteristic parameter satisfies a predetermined condition. The speech decoding apparatus according to claim 3, wherein at least one of the characteristic parameters is used as it is, and thereafter, a speech signal in a non-speech section is decoded using a value obtained by smoothing at least one of the characteristic parameters.

11. The non-voice section decoder includes a coefficient for smoothing at least one of the characteristic parameters,
Characterized in that the characteristic parameter is changed according to information indicating whether or not it is transmitted, and using the changed coefficient value, at least one of the characteristic parameters is decoded using a smoothed characteristic parameter. Claims 1 to 6, 8
10. The audio decoding device according to any one of 10 to 10.

12. The speech decoding apparatus according to claim 11, wherein the non-voice section decoder receives information indicating whether or not the characteristic parameter is transmitted on the transmitting side.

13. When the length of the voice section immediately before the unvoiced section is smaller than a predetermined value, the characteristic parameter transmitted last in the voiceless section immediately before this voice section is smoothed. The speech decoding apparatus according to claim 1, wherein the speech decoding apparatus is used as an initial value.

14. The feature parameter includes at least one of a quantity representing a spectral envelope and a quantity representing power corresponding to the decoded signal.
3. The audio decoding device according to any one of 3 above.

15. An encoding device for determining, in each frame, whether an input signal is a voice section or a non-voice section, and encoding and outputting a characteristic parameter of the input signal.
A voice encoding / decoding device comprising the voice decoding device according to claim 1.

16. A base station apparatus comprising the voice encoding / decoding apparatus or the decoding apparatus according to any one of claims 1 to 15.

17. A communication terminal device comprising the voice encoding / decoding device or the decoding device according to any one of claims 1 to 15.

18. A voice decoding method for decoding a voice signal by changing a decoding operation of a received characteristic parameter according to whether the voice signal is a voice section or a non-voice section, in at least a part of the non-voice section. A smoothing step of calculating a smoothed value obtained by weighted addition of a smoothed value calculated in the past from a characteristic parameter representing the spectral envelope characteristic of the decoded signal among the characteristic parameters, and a characteristic parameter received in the current frame; And a voiceless section voice signal decoding step of decoding the voiceless section signal using the smoothed value calculated in the smoothing step.

19. A voice decoding method for decoding a voice signal by changing a decoding operation of a received characteristic parameter according to whether the voice signal is a voice section or a non-voice section, in at least a part of the non-voice section. , A weighting a of the smoothed value Xpre calculated in the past from the feature parameter representing the spectral envelope characteristic of the decoded signal and the feature parameter Yn received in the current frame among the feature parameters.
Smoothed value Xn = aXpre + b obtained by addition using B and b
A voice decoding method comprising: a smoothing step of calculating Yn; and a voiceless section voice signal decoding step of decoding the signal of the voiceless section using the smoothed value calculated in the smoothing step. .

20. A voice decoding method for decoding a voice signal by changing a decoding operation of a received characteristic parameter according to whether the voice signal is a voice section or a non-voice section, wherein a voice section is switched to a non-voice section. Smoothing step for smoothing at least one of the characteristic parameters according to the passage of time, and a speech-free speech signal for decoding the speech-free signal using the smoothed characteristic parameter A speech decoding method comprising: a decoding step.

21. The smoothing step comprises:
21. The method according to claim 20, comprising the step (b).
The voice decoding method described in. (A) At least one of the transmitted characteristic parameters is used as it is in a certain section immediately after switching from the voice section to the non-voice section, and (b) after that, at least one of the characteristic parameters is used.
Smooth one.

22. A voice decoding method for decoding a voice signal by changing a decoding operation of a received characteristic parameter according to whether the voice signal is in a voice section or a non-voice section, wherein: It is characterized by including a smoothing step of smoothing at least one of the characteristic parameters, and a non-voice section voice signal decoding step of decoding the signal of the non-voice section using the smoothed feature parameter. Speech decoding method.

23. The smoothing step comprises:
23. The method according to claim 22, comprising the step (b).
The voice decoding method described in. (A) At least one of the transmitted characteristic parameters is used as it is while the characteristic parameter satisfies a predetermined condition, and (b) At least one of the characteristic parameters is thereafter used.
Smooth one.

24. A voice decoding method for decoding a voice signal by changing a decoding operation of a received characteristic parameter according to whether the voice signal is a voice section or a non-voice section, wherein the feature parameter is transmitted or not. A smoothing step of smoothing at least one of the characteristic parameters according to information indicating whether or not; and a speech-free speech signal decoding for decoding the speech-free signal using the smoothed characteristic parameter. A voice decoding method comprising the steps of:

25. The smoothing step smoothes at least one of the characteristic parameters according to a time lapse after switching from a voice section to a non-voice section and the characteristic parameter. Item 22. The speech decoding method according to Item 20.

26. After said smoothing step uses at least one of the transmitted feature parameters as is,
24. At least one of the characteristic parameters is smoothed according to a time lapse after switching from a voice section to a non-voice section and at least one of the characteristic parameters. Speech decoding method.

27. The smoothing step comprises:
21. The method according to claim 20, comprising the step (b).
The voice decoding method described in. (A) Immediately after switching from the voice section to the non-voice section and while the characteristic parameter satisfies a predetermined condition, at least one of the transmitted characteristic parameters is directly used, and (b) after that, At least one of the characteristic parameters is smoothed in the time direction.

28. The smoothing step changes a coefficient for smoothing at least one of the characteristic parameters according to information indicating whether or not the characteristic parameters have been transmitted. Claims 18 to 23, 25
27. The speech decoding method according to any one of 27 to 27 .

29. The speech decoding method according to claim 28, wherein the characteristic parameter is the Japanese Features, further comprising receiving information indicating whether or not transmitted.

30. The feature parameter according to claim 18, wherein the feature parameter includes at least one of an amount representing a spectral envelope and an amount representing power corresponding to the decoded signal. The voice decoding method described.

31. An encoding step of determining whether an input signal is in a voice section or a non-voice section in each frame and encoding a characteristic parameter of the input signal and outputting the encoded characteristic parameter. A speech encoding / decoding method in combination with the step of carrying out the speech decoding method described in any one of the above.

32. A recording medium recording a program for executing a voice decoding method for decoding a voice signal by changing a decoding operation of a received characteristic parameter according to whether the voice signal is a voice section or a non-voice section. , Obtained by weighted addition of a smoothed value calculated in the past from the characteristic parameter representing the spectral envelope characteristic of the decoded signal and the characteristic parameter received in the current frame among the characteristic parameters in at least a part of the non-voice section. A record characterized by storing a smoothing step of calculating a smoothed value and a voiceless section voice signal decoding step of decoding the signal of the voiceless section using the smoothed value calculated in the smoothing step. Medium.

33. A recording medium recording a program for executing a voice decoding method for decoding a voice signal by changing a decoding operation of a received characteristic parameter according to whether the voice signal is a voice section or a non-voice section. In at least a part of the non-voice section, a weighting a of the smoothed value Xpre calculated in the past from the characteristic parameter representing the spectral envelope characteristic of the decoded signal and the characteristic parameter Yn received in the current frame among the characteristic parameters a.
Smoothed value Xn = aXpre + b obtained by addition using B and b
A recording medium storing a smoothing step of calculating Yn, and a voiceless section voice signal decoding step of decoding the signal of the voiceless section using the smoothed value calculated in the smoothing step. .

34. A program for executing a voice decoding method for decoding a voice signal by changing a decoding operation of a plurality of types of received characteristic parameters according to whether the voice signal is a voice section or a non-voice section. In the recorded recording medium, a smoothing step of smoothing at least one of the characteristic parameters according to the time elapsed after switching from the voice section to the non-voice section, and using the smoothed characteristic parameter And a non-voice section voice signal decoding step for decoding the signal in the non-voice section.

35. The smoothing step comprises:
35. The method according to claim 34, comprising the step (b).
The recording medium described in. (A) Immediately after switching from the voice section to the non-voice section, at least one of the transmitted characteristic parameters is used as it is. (B) After that, at least one of the characteristic parameters is smoothed.

36. A program for executing a voice decoding method for decoding a voice signal by changing the decoding operation of a plurality of types of received characteristic parameters according to whether the voice signal is a voice section or a non-voice section. In the recorded recording medium, a smoothing step of smoothing at least one of the characteristic parameters according to the characteristic parameter, and a step of decoding the signal in the non-voice section using the smoothed characteristic parameter. A recording medium storing a voice section voice signal decoding step.

37. The smoothing step comprises:
37. The method according to claim 36, comprising the step (b).
The recording medium described in. (A) At least one of the transmitted characteristic parameters is used as it is while the characteristic parameter satisfies a predetermined condition, and (b) After that, at least one of the characteristic parameters is smoothed.

38. A program for executing a voice decoding method for decoding a voice signal by changing a decoding operation of a plurality of types of received characteristic parameters according to whether the voice signal is a voice section or a non-voice section. In the recorded recording medium, a smoothing step of smoothing at least one of the characteristic parameters according to information indicating whether the characteristic parameters have been transmitted, and the smoothing step using the smoothed characteristic parameters. A non-voice section voice signal decoding step for decoding a signal in a non-voice section is stored.

39. The smoothing step smoothes at least one of the characteristic parameters according to a time lapse after switching from a voice section to a non-voice section and the characteristic parameter. Item 34. The recording medium according to Item 34.

40. After the smoothing step uses at least one of the transmitted feature parameters as is,
Claim, characterized in that said smoothing at least one of said feature parameters according to at least one of the time and the feature parameters from the switches to no-speech interval from the speech interval 35 The recording medium according to 37 .

41. The smoothing step comprises:
35. The method according to claim 34, comprising the step (b).
The recording medium described in. (A) Immediately after switching from a voice section to a non-voice section and in a section in which the characteristic parameter satisfies a predetermined condition, at least one of the transmitted characteristic parameters is directly used, and (b) after that, the characteristic is At least one of the parameters is smoothed in the time direction.

42. The smoothing step changes a coefficient for smoothing at least one of the characteristic parameters according to information indicating whether or not the characteristic parameters have been transmitted. Claims 32 to 37, 39
The recording medium according to any one of items 1 to 41 .

43. A recording medium of claim 42, wherein the characteristic parameter is the Japanese Features, further comprising receiving information indicating whether or not transmitted.