JPS63124636A

JPS63124636A - Pseudo signal insertion system in voice semiconductor system

Info

Publication number: JPS63124636A
Application number: JP26976086A
Authority: JP
Inventors: Shigeru Iizuka; 飯塚　茂; Yotaro Hachitsuka; 八塚　陽太郎; Toshihiro Yamazaki; 智弘山崎
Original assignee: Kokusai Denshin Denwa KK
Current assignee: KDDI Corp
Priority date: 1986-11-14
Filing date: 1986-11-14
Publication date: 1988-05-28
Also published as: JPH0515331B2

Abstract

PURPOSE:To improve the voice quality of listener and keep the naturaility of talking by using a synthesis filter driven by a coded parameter of a frame before missing and noise so as to synthesize a pseudo voice and interpolate the missing part. CONSTITUTION:When the silence state is detected from a signal at a terminal 3, a changeover switch 5 selects a D/A converter 6 to the position of a power adjuster 14. An LPC analyzer 10 analyzes an envelope of a spectrum of background noise at the voice presence before the voice signal is silent and obtains a prediction coefficient for a predicting device 12 for a short time in the short period synthesis filter 11 to set the result for a short time prediction device 12. Then the noise from the noise generator 4 is inputted to the filter 11 and a noise having a spectrum similar to the spectrum of the background noise is outputted and given to the power adjuster 14. As a result, the pseudo noise having a spectrum and power similar to the background noise is outputted from the power adjuster 14.

Description

【発明の詳細な説明】（技術分野）本発明は、音声信号が存在する時間区間だけ該音声信号
を伝送路に送出するようにした音声伝送システムに係り
、特には、信号が送出されない時間区間の聴感上の通話
品質の改善を図るために有効な音声伝送システムにおけ
る疑似信号挿入方式％式％）音声回線など会話型の通信にあっては、一方の加入者が
話しているときは他方の加入者は聞き役になるため、一
方向の回線の時間的な使用効率は一般的に著しく劣る。DETAILED DESCRIPTION OF THE INVENTION (Technical Field) The present invention relates to an audio transmission system that transmits an audio signal to a transmission path only in a time period in which the audio signal exists, and particularly relates to a time period in which the signal is not transmitted. A pseudo signal insertion method in a voice transmission system that is effective for improving the perceptual quality of calls (%) In conversation-type communications such as voice lines, when one subscriber is talking, the other subscriber's Since the subscriber acts as a listener, the time efficiency of one-way lines is generally significantly lower.

そこで、従来から回線使用効率を向上させるための技術
が種々提案されている。Therefore, various techniques have been proposed to improve line usage efficiency.

この回線使用効率を改善する代表的な方法として、アナ
ログ回線に用いられている時間割当多重化方式（以下、
ｒＴＡｓＩ方式」と称す）があり、また、ディジタル回
線に用いられているディジタル音声挿入方式（以下、ｒ
ＤｓＩ方式」と称す）あるいは、音声信号が存在する時
（以下、「有音状態」と称す）のみキャリアが割り当て
られる１チャネル１搬送波方式（以下、ｒ　５ＣＰＣＪ
と称す）または、音声をパケット単位で伝送するパケッ
ト音声伝送方式等がある。これらＴＡＳ　Ｉ方式あるい
はＤＳ１方式等は加入者側に接続されている入力回線数
に比べて少ない伝送路（出力回線）を効率的に使用する
ことにより、回線を多重化するものである。A typical method for improving line usage efficiency is the time allocation multiplexing method (hereinafter referred to as
There is also a digital audio insertion method (hereinafter referred to as rTAsI method) used in digital lines.
DsI method) or one channel, one carrier method (r5CPCJ method) in which a carrier is allocated only when a voice signal is present (hereinafter referred to as "speech state").
There is also a packet audio transmission method that transmits audio in packet units. These TAS I systems, DS1 systems, etc. multiplex lines by efficiently using transmission lines (output lines) that are smaller in number than the number of input lines connected to the subscriber side.

ところで、ＤＳＩ　Ｊ？３ＳＣＰＣ方式においては送信
側からの音声がとぎれた無音状態の場合、すなわち有音
状態から無音状態に変化するとある一定時間のハングオ
ーバ時間（通常約１５抛Ｓ）の後伝送路と対応した入カ
ドランクとの接続が解除され、受信側には何ら信号が伝
送されなくなり、通話の不自然さが生じてしまう。この
問題を解決するため、従来では無音状態になると受信側
で白色雑音発生器からの白色雑音を対応したトランクに
挿入し、あたかも伝送路が引続き接続されているかのご
とくし、受信者に対して通話の自然性を保つよう試みて
いる。By the way, DSI J? In the 3SCPC system, when the audio from the transmitting side is interrupted and there is no sound, that is, when the voice state changes from a sound state to a silent state, after a certain hangover time (usually about 15 seconds), the input card link corresponding to the transmission path is connected. The connection is terminated, and no signal is transmitted to the receiving side, resulting in an unnatural conversation. To solve this problem, conventionally, when there is silence, the receiving side inserts white noise from a white noise generator into the corresponding trunk, making it appear as if the transmission line is still connected, and transmitting it to the receiver. We try to keep the call natural.

第１図は従来のディジタル音声伝送システムにおける雑
音挿入方式例のブロック図である。端子３に送られてき
た有音／無音状態信号にもとづき、有音ならば切換スイ
ッチ５を制御し、Ｄ／Ａ変換器６を復号化器２に接続す
ると共に、送信側の符号化器から送られてきた符号化信
号（例えばＰＣＭやＡＤＰＣＭ更にはＡＰＣ（適応予測
符号化）などによって符号化されたディジタル信号）は
受信側入力端子１に入力される。この受信された符号化
信号は復号化器２により復号化された後、Ｄ／Ａ変換器
６を介し、アナログ信号に変換後、アナログフィルタ７
を通して元のアナログ音声信号に復元され出力端子８か
ら出力される。FIG. 1 is a block diagram of an example of a noise insertion method in a conventional digital voice transmission system. Based on the sound/silence state signal sent to the terminal 3, if there is a sound, the selector switch 5 is controlled, the D/A converter 6 is connected to the decoder 2, and the signal is transmitted from the encoder on the transmitting side. The transmitted encoded signal (for example, a digital signal encoded by PCM, ADPCM, APC (adaptive predictive coding), etc.) is input to the receiving side input terminal 1. This received encoded signal is decoded by a decoder 2, then converted to an analog signal via a D/A converter 6, and then converted to an analog signal by an analog filter 7.
The signal is restored to the original analog audio signal and output from the output terminal 8.

一方、有音／無音状態信号から無音を示した場合、切換
スイッチ５を介して雑音発生器４とＤ／Ａ変換器６とを
接続し、雑音発生器４からの白色雑音がＤ／Ａ変換器６
及びアナログフィルタ７を介して端子８から受信者に送
出することにより通話の自然性を保つようにしている。On the other hand, when the sound/silence state signal indicates silence, the noise generator 4 and the D/A converter 6 are connected via the changeover switch 5, and the white noise from the noise generator 4 is converted into a D/A converter. Vessel 6
The naturalness of the call is maintained by transmitting it to the receiver from the terminal 8 via the analog filter 7.

しかし、無音状態に変化する直前の有音状態であるハン
グオーバ時間内で伝送されてくる信号はルーム雑音等の
背景雑音である。このため、これら従来方式では受信側
で挿入される白色雑音は背景雑音のレベルやそのスペク
トルとは異なるため、白色雑音挿入しても受信者に違和
感を与えており、通話の不自然性はさほど改善されてい
ない。However, the signal transmitted during the hangover time, which is the active state immediately before changing to the silent state, is background noise such as room noise. For this reason, in these conventional methods, the white noise inserted on the receiving side is different from the background noise level and its spectrum, so even if white noise is inserted, it gives a strange feeling to the receiver, and the unnaturalness of the call is not so great. Not improved.

また、ＤＳＩ方式やパケット音声伝送方式等の音声伝送
システムでは、音声動作率の著しい上昇による入カドラ
ンクの締出しや伝送時におけるパケットの欠落等により
、受信側で音声信号の一部が受信されない状態が発生す
る。このような音声信号の一部欠落に対して従来は、音
声信号が欠落している間を無音とするかまたは白色雑音
を挿入するか、あるいは欠落した１フレーム前の音声パ
ケットを単に操り返すなどの方法を用いていた。しかし
、いずれの方法も聴感上劣化が大きく、通話音声品質の
改善があまり図れていなかった。特に、１フレーム前の
音声パケットを繰り返して用いる場合には、ｌフレーム
前の音声情報を蓄積しておくためのメモリが必要であっ
た。In addition, in audio transmission systems such as the DSI method and the packet audio transmission method, a part of the audio signal is not received on the receiving side due to a significant increase in the audio operation rate, which locks out the input quadratic rank, or because packets are dropped during transmission. occurs. Conventionally, when a part of the audio signal is missing, methods such as making silence or inserting white noise during the missing audio signal, or simply manipulating the missing audio packet from the previous frame, etc. method was used. However, both methods have resulted in significant auditory deterioration and have not been able to significantly improve call voice quality. In particular, when an audio packet from one frame before is used repeatedly, a memory is required to store audio information from one frame before.

以上のように、従来は無音状態や音声パケットの欠落時
に挿入する疑似信号が聴感上の劣化を招き、通話の自然
さを保つことができないという問題点があった。As described above, in the past, there was a problem in that the pseudo signals inserted when there was silence or when a voice packet was missing caused audible deterioration, making it impossible to maintain the naturalness of the conversation.

（発明の目的及び特徴）本発明は、上述した従来技術の問題点を解決するために
なされたもので、音声信号の欠落時あるいは無音状態時
に通話の自然さを保つことができ、かつ疑似信号を簡単
に作成することができる音声伝送システムの疑似信号挿
入方式を提供することを目的とする。(Objects and Features of the Invention) The present invention has been made to solve the problems of the prior art described above. The purpose of this invention is to provide a pseudo signal insertion method for a voice transmission system that can easily create a pseudo signal.

本発明の特徴は音声信号の欠落時あるいは無音状態時の
直前の有音状態における音声信号とハングオーバ時間内
の信号のうち両方もしくは少なくともハングオーバ時間
の一部に含まれる電力及び送信側の入力信号のスペクト
ラム情報を用い、受信側の雑音発生器から出力された雑
音のレベル及びスペクトルを調整して疑似信号を受信側
で作成して挿入するようにしたことにある。A feature of the present invention is that when an audio signal is missing or when there is no sound, the power included in both the audio signal in the active state immediately before the silent state and the signal within the hangover time, or at least a part of the hangover time, and the input signal on the transmitting side. The present invention uses spectrum information to adjust the level and spectrum of the noise output from the noise generator on the receiving side to create and insert a pseudo signal on the receiving side.

（発明の構成）以下に図面を用いて本発明の詳細な説明する。(Structure of the invention) The present invention will be described in detail below using the drawings.

なお、以下の説明では従来例（第１図）と同一構成につ
いては同一番号を付して説明の重複を省く。In the following description, the same components as those of the conventional example (FIG. 1) are given the same reference numerals to avoid redundant description.

（実施例１）第２図は本発明の一実施例であり、音声伝送システムの
音声状態における疑似雑音信号挿入方式の概略図である
。本実施例では復号化されたディジタル音声をもとに受
信側の情報を処理する例を示す。９は有音状態から無音
状態に変化した際、有音部でのハングオーバ時間の全体
あるいは一部の背景雑音電力を計算する電力計算回路、
１０はその区間内の背景雑音のスペクトルを分析するＬ
ＰＧ（Ｌｉｎｅａｒ　Ｐｒｅｄｉｃｔｉｖｅ　Ｃｏｄｉ
ｎｇ）分析器、１１は該ＬＰＣ分析器１０によって求め
られた短時間予測係数に基づいて雑音のスペクトルのエ
ンベロープを調整する短時間合成フィルタで短時間予測
器１２と加算器１３から成る。合成フィルタ１１の出力
は電力調整器１４に入力され、電力計算回路９からの背
景雑音電力情報に基づいて雑音電力を背景雑音の電力に
等しく調整した後、Ｄ／Ａ変換器６とアナログフィルタ
７を介してアナログ信号に変換する。(Embodiment 1) FIG. 2 is an embodiment of the present invention, and is a schematic diagram of a pseudo noise signal insertion method in the voice state of a voice transmission system. This embodiment shows an example in which information on the receiving side is processed based on decoded digital audio. 9 is a power calculation circuit that calculates the background noise power for all or part of the hangover time in a sound part when changing from a sound state to a silent state;
10 is L that analyzes the spectrum of background noise within that interval.
PG (Linear Predictive Codi)
ng) The analyzer 11 is a short-time synthesis filter that adjusts the envelope of the noise spectrum based on the short-term prediction coefficients obtained by the LPC analyzer 10, and is composed of a short-time predictor 12 and an adder 13. The output of the synthesis filter 11 is input to the power regulator 14, which adjusts the noise power to be equal to the background noise power based on the background noise power information from the power calculation circuit 9. Convert to analog signal via.

次に動作について説明する。入力端子１に入力されたデ
ィジタル信号は復号化器２で復号化されディジタル音声
信号となる。端子３からの有音／無音状態信号が有音で
あることを示すと復号化器２は切換スイッチ５を介して
Ｄ／Ａ変換器６に接続され、アナログフィルタを介して
端子８からアナログ音声が復元される。一方、端子３の
信号から無音状態であることが検知されると、切換スイ
ッチ５はＤ／Ａ変換器６を電力調整器１４に接続する。Next, the operation will be explained. A digital signal input to an input terminal 1 is decoded by a decoder 2 to become a digital audio signal. When the sound/silence state signal from the terminal 3 indicates that there is sound, the decoder 2 is connected to the D/A converter 6 via the changeover switch 5, and outputs analog audio from the terminal 8 via the analog filter. is restored. On the other hand, if a silent state is detected from the signal at the terminal 3, the changeover switch 5 connects the D/A converter 6 to the power regulator 14.

ＬＰＣ分析器１０では復号化された音声信号が無音にな
る前の有音時すなわちハングオーバ時間の背景雑音のス
ペクトラムのエンベロープを分析し、短時間合成フィル
タ１１内の短時間予測器１２のための予測係数を求め、
短時間予測１２に設定する。この時これら計算により得
られた係数に重みをかけた係数を設定してもよい。第３
図は４次の短時間合成フィルタ１１の一構成例を示した
ものである。第３図の短時間予測器１２内のＤ　Ｉ””
　Ｄ　＜は１サンプル遅延メモリを意味し、α１〜α４
は予測係数である。１５は加算器でα１〜α４の係数で
荷重された各メモリＤ、−Ｄ、の出力の合成値で予測値
を与える。短時間合成フィルタ１１に雑音発生器４から
の雑音を入力し、背景雑音のスペクトルに似たスペクト
ルを持つ雑音を出力させ、電力調整器１４に入力する。The LPC analyzer 10 analyzes the envelope of the spectrum of the background noise at the time of speech before the decoded speech signal becomes silent, that is, during the hangover time, and makes a prediction for the short-term predictor 12 in the short-term synthesis filter 11. Find the coefficient,
Set short-term prediction to 12. At this time, coefficients obtained by weighting the coefficients obtained by these calculations may be set. Third
The figure shows an example of the configuration of the fourth-order short-time synthesis filter 11. D I"" in the short-term predictor 12 of FIG.
D < means 1 sample delay memory, α1 to α4
is the prediction coefficient. 15 is an adder which provides a predicted value as a composite value of the outputs of the memories D, -D loaded with coefficients α1 to α4. The noise from the noise generator 4 is input to the short-time synthesis filter 11, which outputs noise having a spectrum similar to the spectrum of background noise, and inputs it to the power regulator 14.

電力調整器１４では短時間合成フィルタ１１から入力さ
れた信号の電力が背景雑音の電力と同一となるように背
景雑音の電力を計算する電力計算回路９からの情報を基
に電力を調整する。The power regulator 14 adjusts the power based on information from the power calculation circuit 9 that calculates the power of background noise so that the power of the signal input from the short-time synthesis filter 11 is the same as the power of the background noise.

この結果この電力調整器１４から背景雑音に似たスペク
トルと電力を持つ疑似雑音が出力され、切換スイッチ５
を介してＤ／Ａ変換器６に送出されアナログフィルタ７
を介してアナログ信号に変換され、端子８から無通話時
の挿入雑音として受話者に伝送される。As a result, pseudo noise having a spectrum and power similar to background noise is output from the power regulator 14, and the changeover switch 5
is sent to the D/A converter 6 via the analog filter 7.
The signal is converted into an analog signal via the terminal 8 and transmitted to the receiver as noise inserted during no call.

上述の方法に示すように送信されてきた背景雑音と同様
なスペクトル及び電力を持つ疑似雑音を受信側にて発生
させ、挿入することにより、無音状態時の違和感を著し
く改善することができる。By generating and inserting pseudo-noise having the same spectrum and power as the transmitted background noise on the receiving side as shown in the method described above, it is possible to significantly improve the sense of discomfort during a silent state.

尚、上述の実施例１では、復号化された音声信号をもと
に背景雑音の信号電力とスペクトル情報を抽出したが、
次に適応予測符号化（以下、ｒＡＰＣ符号化」と称す）
方式を用いた場合について説明する。ＡＰＣ符号化方式
における受信側での雑音挿入方式としては前述した実施
例１のごとく電力計算回路９やＬＰＣ分析器１０は必要
とせずＡＰＣ復号器内の構成を併用することにより詩に
簡単な構成で実現できることを示す。In addition, in the above-mentioned Example 1, the signal power and spectrum information of the background noise were extracted based on the decoded audio signal.
Next, adaptive predictive coding (hereinafter referred to as rAPC coding)
The case where this method is used will be explained. As for the noise insertion method on the receiving side in the APC encoding method, the power calculation circuit 9 and the LPC analyzer 10 are not required as in the first embodiment described above, and the structure inside the APC decoder is used in combination, resulting in a very simple structure. We will show what can be achieved with

また次の実施例ではパケット欠落として１フレームの符
号化信号が全て失われた場合における疑似音声補間方式
についての構成機能も併せ持たせることができるもので
ある。Further, in the next embodiment, a configuration function for a pseudo-speech interpolation method in the case where one frame of encoded signal is completely lost due to packet loss can also be provided.

（実施例２）第４図は本発明による第２の実施例であり、ＡＰＣ符号
化方式を用いた場合における疑似雑音信号挿入方式の概
略図である。(Embodiment 2) FIG. 4 is a second embodiment according to the present invention, and is a schematic diagram of a pseudo noise signal insertion method when using the APC encoding method.

ＡＰＣ符号化方式においては送信側の符号器によリ、入
力信号の長時間相関、例えば音声信号の有声音などのピ
ッチ周期毎に似た波形が繰返されることによるピッチ周
期に対応した相関を長時間予測器を用いて取り除き、更
に短時間スペクトルエンベロープを表す短時間相関を短
時間予測器を用いて取り除き、その結果の残差信号と呼
ぶ信号を量子化・符号化し伝送している。In the APC encoding method, the encoder on the transmitting side processes the long-term correlation of the input signal, for example, the correlation corresponding to the pitch period due to the repetition of a similar waveform for each pitch period of voiced sounds in the audio signal. The short-time correlation representing the short-time spectral envelope is removed using a time predictor, and the short-time correlation representing the short-time spectral envelope is removed using a short-time predictor.The resulting signal, called a residual signal, is quantized and encoded before being transmitted.

符号化された信号は端子１に受信され、符号化パラメー
タを符号化された残差信号は多重分離回路１６により各
々分離される。受信側のＡＰＣ復号化器では残差信号電
力復号器１７からの残差信号電力の情報を基に逆量子化
器１８のステップサイズを決め、受信された残差信号の
逆量子化を逆量子化器１８で行う。端子３からの有音／
無音状態信号から現フレームは有音であることが判った
場合、切換スイッチ５を介し加算器２２と逆量子化器１
８を接続し、更にスイッチ２３が閉じられる。この結果
、逆量子化された残差信号は長時間予測器２１と加算器
２２とからなる長時間合成フィルタ２６に入力され、そ
の後、更に短時間予測器２４と加算器２５とからなる短
時間合成フィルタ２７に入力される。これらの構成はＡ
ＰＣ復号化器として動作するもので短時間合成フィルタ
２７の出力として復元されたディジタル音声が得られる
。Ｄ／Ａ変換器６とアナログフィルタ７を介して端子８
からアナログ音声が送出される。長時間予測係数復号器
１９は伝送されてきた長時間予測係数を復号するもので
、復号された係数を長時間予測器２１に設定する。The encoded signal is received at the terminal 1, and the residual signals encoded with the encoding parameters are demultiplexed by the demultiplexer circuit 16. The APC decoder on the receiving side determines the step size of the inverse quantizer 18 based on the information on the residual signal power from the residual signal power decoder 17, and converts the inverse quantization of the received residual signal into an inverse quantizer. This is done using a converter 18. Sound from terminal 3/
When it is determined from the silence state signal that the current frame is a sound frame, the adder 22 and the inverse quantizer 1 are connected to each other via the changeover switch 5.
8 is connected, and the switch 23 is further closed. As a result, the dequantized residual signal is input to a long-term synthesis filter 26 consisting of a long-term predictor 21 and an adder 22, and then a short-term synthesis filter 26 consisting of a short-term predictor 24 and an adder 25. It is input to the synthesis filter 27. These configurations are A
The reconstructed digital voice is obtained as the output of the short-time synthesis filter 27, which operates as a PC decoder. Terminal 8 via D/A converter 6 and analog filter 7
Analog audio is sent out. The long-term prediction coefficient decoder 19 decodes the transmitted long-term prediction coefficients, and sets the decoded coefficients in the long-term predictor 21.

また、短時間予測係数復号器２０は同様に伝送されてき
た短時間予測係数を復号し、各係数を短時間予測器２４
にそれぞれ設定する。ここで残差信号電力、長時間予測
係数及び短時間予測係数の各パラメータはフレームある
いはサブフレーム毎に伝送されている。Further, the short-time prediction coefficient decoder 20 similarly decodes the transmitted short-time prediction coefficients, and transfers each coefficient to the short-time prediction coefficient 24.
Set each. Here, each parameter of residual signal power, long-term prediction coefficient, and short-term prediction coefficient is transmitted for each frame or subframe.

次に端子３において現フレームが無音状態であることが
判明した場合、端子１には通常符号化信号は送られてこ
ない。従って、切換スイッチ５を介して雑音発生器４と
加算器２２を接続するよう制′４１１すると共に、スイ
ッチ２３を開放にする。短時間予測器２４には前フレー
ムの有音状態での予測係数をそのまま設定しておき、動
作させる。更に、残差信号電力復号器１７からの前フレ
ームの有音状態での背景雑音の残差信号の電力に関、す
る情報をそのまま用いて同一の電力を持った雑音を雑音
発生器４から送出する。この雑音は残差信号とみなされ
、長時間合成フィルタ２６に加えられる。しかし、この
例では、背景雑音は音声のようにピッチ構造はもってい
ないとみなし、長時間予測器２１の出力は加えず、その
まま短時間合成フィルタ２７に入力し背景雑音として合
成する。この出力は送信側の背景雑音に近い電力とスペ
クトルを持った疑偵雑音信号となる。有音状態から無音
状態に変化した際の有音状態のフレームでは背景雑音が
符号化されて伝送されてきており、残差信号電力復号器
１７からはその残差信号の電力、短時間予測係数復号器
２０からはその短時間スペクトルエンベロープが得られ
る。更に、残差信号は一般に相関が取り除かれており雑
音に近い性質を持っていること、及び背景雑音の短時間
スペクトル、残差信号電力ともフレーム毎に太き（変化
しないことなどから、上記の構成により背景雑音を受信
側で合成できる。If it is then determined at terminal 3 that the current frame is silent, no encoded signal is normally sent to terminal 1. Therefore, the noise generator 4 and the adder 22 are controlled to be connected via the changeover switch 5 411, and the switch 23 is opened. The short-time predictor 24 is operated with the prediction coefficients in the voice state of the previous frame set as they are. Further, noise having the same power is sent from the noise generator 4 using the information from the residual signal power decoder 17 regarding the power of the residual signal of the background noise in the active state of the previous frame. do. This noise is considered a residual signal and is added to the long-term synthesis filter 26. However, in this example, it is assumed that background noise does not have a pitch structure like speech, and the output of the long-term predictor 21 is not added, but is directly input to the short-term synthesis filter 27 and synthesized as background noise. This output becomes a suspicious noise signal with power and spectrum close to the background noise on the transmitting side. Background noise is encoded and transmitted in the frame of the voice state when the voice state changes from the voice state to the silent state, and the residual signal power decoder 17 outputs the power of the residual signal and the short-term prediction coefficient. The short-term spectral envelope is obtained from the decoder 20. Furthermore, the residual signal is generally decorrelated and has properties similar to noise, and the short-term spectrum of background noise and the residual signal power are thick (do not change) from frame to frame, so the above-mentioned Depending on the configuration, background noise can be synthesized on the receiving side.

尚、長時間予測器２１はこの例では動作しないようスイ
ッチ２３を設けているが、これは背景雑音にはピンチの
ような長時間相関はないものと考えたためである。一般
にはスイッチ２３を閉じ、前フレームの長時間予測係数
を設定した長時間予測器２１を動作させても、背景雑音
の長時間予測係数が０に近いため、大きな劣化はない。In this example, a switch 23 is provided so that the long-term predictor 21 does not operate, because it is assumed that the background noise does not have a long-term correlation such as a pinch. Generally, even if the switch 23 is closed and the long-term predictor 21, which has been set with the long-term prediction coefficient of the previous frame, is operated, there is no major deterioration because the long-term prediction coefficient of background noise is close to 0.

ここではこれら残差信号電力、予測係数は無音状態が連
続するフレームでは一定としている。しかし、無音状態
が長く連続する場合には、雑音電力を次第に小さくして
もよい。また、背景雑音を合成する際、合成フィルタの
予測係数は復号された値に荷重したものを使用してもよ
い。Here, the residual signal power and the prediction coefficient are constant in frames in which a silent state continues. However, if the silent state continues for a long time, the noise power may be gradually reduced. Furthermore, when synthesizing background noise, the prediction coefficients of the synthesis filter may be weighted with decoded values.

このようにＡＰＣ符号化方式を用いた場合、背景雑音を
合成するにあたり、ＡＰＣ復号器内の合成フィルタ及び
残差信号の電力情報を用いることにより、実施例１に説
明したように各機能を用意することなく背景雑音を合成
でき、これを無通話時に受話者に伝送すれば、通話の不
自然性を解消することができる。When using the APC encoding method in this way, each function is prepared as explained in Example 1 by using the synthesis filter in the APC decoder and the power information of the residual signal when synthesizing background noise. If the background noise can be synthesized without any noise and is transmitted to the receiver when there is no call, the unnaturalness of the call can be eliminated.

このように本発明では無音状態となる直前のハングオー
バ時間内における電力情報及びスペクトル情報を用いて
疑似雑音信号（背景雑音）を簡単に作成し、この作成し
た疑似雑音信号を無音状態時に挿入することにより通話
の自然さを保つことができる。In this way, in the present invention, a pseudo-noise signal (background noise) is easily created using power information and spectrum information during the hangover time immediately before the silent state, and this created pseudo-noise signal is inserted during the silent state. This allows the naturalness of the call to be maintained.

次に、パケット音声伝送システムの如き音声がパケット
単位で伝送されているとき、送信側あるいは伝送途中で
パケットが欠落した場合の音声補間方式にってい説明す
る。Next, when audio is transmitted in packet units, such as in a packet audio transmission system, an audio interpolation method will be described when a packet is lost on the transmitting side or during transmission.

（実施例３）第５図は本発明による他の実施例であり、パケット音声
伝送システムを用いた場合の疑似音声信号の補間方式を
示す概略図である。なお、前述した本発明の実施例２（
第４図）との相違点は、有音／無音状態信号のかわりに
音声パケット欠落情報信号をパケット音声伝送システム
からもらって用いることと、音声のピンチ情報が設定さ
れている長時間予測器２１を常に用いることである。従
って、以下の説明では実施例２と異なる動作について述
べる。(Embodiment 3) FIG. 5 is another embodiment according to the present invention, and is a schematic diagram showing an interpolation method of a pseudo voice signal when a packet voice transmission system is used. Note that Example 2 of the present invention described above (
4) is that a voice packet loss information signal is obtained from the packet voice transmission system and used instead of the voice/silence state signal, and the long-term predictor 21 in which voice pinch information is set is used. Always use it. Therefore, in the following description, operations different from those in the second embodiment will be described.

端子２８から音声パケット欠落情報信号がこない場合、
実施例２の有音状態と同様に切替スイッチ５は逆量子化
器１８側に接続されて、入力端子１から受信したパケッ
ト音声が復号される。If the audio packet loss information signal does not come from the terminal 28,
As in the sound state of the second embodiment, the changeover switch 5 is connected to the inverse quantizer 18 side, and the packet audio received from the input terminal 1 is decoded.

一方、端子２８から音声パケット欠落情報信号を受信す
ると、切替スイッチ５は雑音発生器４側に切替えられて
、雑音発生器４と加算器２２とが接続される。同時に、
長時間予測器２１の出力が加算器２２に接続されていな
い場合は、スイッチ２３により接続される。この状態で
、雑音発生器４にはパケット欠落の直前のフレームある
いはサブフレームにおける残差信号電力復号器１７から
の残差信号電力が設定され、同様にパケット欠落直前フ
レームの短時間予測係数及び長時間予測係数が短時間予
測器２４及び長時間予測器２１にそれぞれ設定される。On the other hand, when the voice packet loss information signal is received from the terminal 28, the selector switch 5 is switched to the noise generator 4 side, and the noise generator 4 and the adder 22 are connected. at the same time,
If the output of the long-term predictor 21 is not connected to the adder 22, it is connected by a switch 23. In this state, the noise generator 4 is set with the residual signal power from the residual signal power decoder 17 in the frame or subframe immediately before the packet loss, and similarly the short-term prediction coefficient and long-term prediction coefficient of the frame immediately before the packet loss. Time prediction coefficients are set in the short-time predictor 24 and the long-term predictor 21, respectively.

従って、雑音発生器４から出力される雑音電力はパケッ
ト欠落の直前フレームの電力に等しく調整されたのち、
長時間合成フィルタ２６のピッチ情報及び短時間合成フ
ィルタ２７のスペクトルエンベロープが調整された疑似
音声信号が作成されて受話者へ送出される。よって、本
発明ではパケットが欠落しても雑音を駆動音源として疑
似音声信号を作成して補間するため、通話品質を大幅に
改善することができる。また、本発明は従来の如き残差
信号を蓄積するためのメモリを必要とせず、雑音を駆動
音源として疑似音声信号を作成するため装置の簡単化も
実現できる。Therefore, the noise power output from the noise generator 4 is adjusted to be equal to the power of the frame immediately before the packet loss, and then
A pseudo speech signal in which the pitch information of the long-term synthesis filter 26 and the spectral envelope of the short-term synthesis filter 27 are adjusted is created and sent to the receiver. Therefore, in the present invention, even if a packet is lost, a pseudo voice signal is created and interpolated using noise as a driving sound source, so that the call quality can be significantly improved. Furthermore, the present invention does not require a memory for storing residual signals as in the prior art, and the apparatus can be simplified because a pseudo voice signal is created using noise as a driving sound source.

（発明の効果）以上のように、本発明は音声信号の欠落時には欠落前フ
レームの符号化パラメータと雑音で駆動された合成フィ
ルタを用いて疑似音声を合成し、欠落部分を補間するこ
とにより、受話者の音声品質を著しく改善し通話の自然
性を保つことができる。更に、無通話時の無音状態では
有音状態の最後のフレームにおける符号化パラメータと
雑音で駆動された合成フィルタを用いて背景雑音を合成
し、これを無音時に挿入することによって通話の自然性
を保つことができ、通話品質を改善するうえでその効果
は極めて大である。(Effects of the Invention) As described above, when a voice signal is missing, the present invention synthesizes pseudo speech using a synthesis filter driven by the coding parameters of the frame before the loss and noise, and interpolates the missing part. It is possible to significantly improve the voice quality of the receiver and maintain the naturalness of the call. Furthermore, in the silent state when there is no talk, background noise is synthesized using a synthesis filter driven by the encoding parameters and noise in the last frame of the active state, and this is inserted into the silent state to improve the naturalness of the call. This is extremely effective in improving call quality.

[Brief explanation of the drawing]

第１図は従来の無音状態時における疑似雑音挿入方式の
概略構造を示すブロック図、第２図は本発明により音声
伝送システムの疑似雑音挿入方式を実現する場合の実施
例を示すブロック図、第３図は本発明に用いる短時間合
成フィルタの構成例を示す接続図、第４図は本発明によ
りＡＰＣ符号化の疑似雑音挿入方式を実現する場合の実
施例を示すブロック図、第５図は本発明によりパケット
音声伝送システムの疑似音声信号方式を実現する場合の
実施例を示すブロック図である。FIG. 1 is a block diagram showing a schematic structure of a conventional pseudo-noise insertion method in a silent state; FIG. 2 is a block diagram showing an embodiment of the pseudo-noise insertion method for a voice transmission system according to the present invention; FIG. 3 is a connection diagram showing an example of the configuration of a short-time synthesis filter used in the present invention, FIG. 4 is a block diagram showing an embodiment of the pseudo-noise insertion method for APC encoding according to the present invention, and FIG. FIG. 2 is a block diagram showing an embodiment in which a pseudo voice signaling system for a packet voice transmission system is implemented according to the present invention.

Claims

[Claims]

(1) The receiving side discretely receives the signal within the sum of the length of time during which the audio signal to be transmitted from the transmitting side to the audio line exists and the predetermined hangover time length as transmission information, and the transmission In a pseudo signal insertion method in an audio transmission system that inserts a pseudo signal created on the receiving side and outputs it to the audio line during a time period in which no information is received, the power and spectrum of the noise signal generated on the receiving side are is configured to create the pseudo signal by adjusting the signal power information and spectrum information included in all of the transmission information received immediately before or at least a part of the hangover time. A method for inserting pseudo signals in audio transmission systems.