JPH064087A

JPH064087A - Speech encoding device

Info

Publication number: JPH064087A
Application number: JP4157113A
Authority: JP
Inventors: Naomi Nishiyama; 直美西山; Mitsuru Tsuboi; 満坪井; Shoji Fujino; 尚司藤野; Koji Okazaki; 晃二岡崎; Naoji Matsuo; 直司松尾; Toshiaki Nobumoto; 俊明信本
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1992-06-17
Filing date: 1992-06-17
Publication date: 1994-01-14

Abstract

PURPOSE:To reduce a feeling of incongruity to a reproduced sound at the time of switching between a voiced sound section and a voiceless sound section of the speech encoding device. CONSTITUTION:The speech encoding device, which has a voiceless sound detection part 1 detecting the voiceless section of an input speech signal and outputting voiceless sound information and an encoding part 2 encoding and outputting the voiceless sound information and inputting speech signal on the transmission side and a decoding part 3 decoding the received voiceless sound information and inputting speech signal and outputting a reproduced speech signal and the voiceless sound information and a noise insertion part 4 inserting a noise of specific level into the voiceless sound section of the reproduced sound signal according to the voiceless sound information outputted by the decoding part and outputting the resulting signal on the reception side, is provided with a time delay part 5 which is connected between the voiceless sound detection part 1 and encoding part 2 on the transmission side and outputs the voiceless sound information with specific time delay behind the point of time when the voiceless sound detection part decides the voiceless sound section; and the noise insertion part 4 on the reception side inserts the noise of specific level into the voiceless sound section of the reproduced speech signal and the certain section right before the voiceless sound section according to the voiceless sound information outputted by the decoding part and outputs the resulting signal.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は音声符号化装置の改良に
関するものである。この際、有音区間と無音区間の切り
替わり時に生じる再生音の違和感を軽減する音声符号化
装置が要望されている。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an improvement of a speech coder. At this time, there is a demand for a voice encoding device that reduces a feeling of strangeness in reproduced sound that occurs when a sound section and a silent section are switched.

【０００２】[0002]

【従来の技術】図６は従来例の音声符号化装置の構成を
示すブロック図である。複数の音声データあるいはその
他のデータを多重化して伝送する音声符号化装置では、
伝送路の伝送効率を高めるために、通話中の音声データ
の大半を占める無音区間を逐次検出して、有音区間にあ
るその他の音声データや伝送要求の出ているその他のデ
ータに伝送路を解放するデータ圧縮伝送方式が採用され
る。2. Description of the Related Art FIG. 6 is a block diagram showing the structure of a conventional speech coder. In a voice encoding device that transmits a plurality of voice data or other data by multiplexing,
In order to improve the transmission efficiency of the transmission path, the silent section that occupies most of the voice data during a call is sequentially detected, and the transmission path is set to other voice data in the voice section or other data for which transmission is requested. A released data compression transmission method is adopted.

【０００３】図６は無音検出方式を採用した際の音声符
号化装置の構成例を示す。同図(a)に示す符号器側にお
いて、無音検出部１では入力された音声データの音声レ
ベルが所定の閾値未満である状態が一定時間以上継続す
ると無音区間であると判定し、その判定結果（以下、
「無音化情報」と称する）を、音声データとともに符号
化部２に加える。符号化部２では、無音区間に対しては
無音化情報を符号化対象とし、有音区間については所定
の圧縮符号化方式により音声データを圧縮符号化して、
伝送路に送出する。FIG. 6 shows an example of the configuration of a speech coder when the silence detection method is adopted. On the encoder side shown in FIG. 3A, the silence detection unit 1 determines that the voice level of the input voice data is less than a predetermined threshold for a certain period of time or more and determines that it is a silence section, and the determination result (Less than,
“Silence information”) is added to the encoding unit 2 together with the voice data. In the encoding unit 2, the silence information is targeted for encoding for the silent section, and the voice data is compressed and encoded by a predetermined compression encoding method for the voiced section,
Send to the transmission path.

【０００４】一方、伝送路を介して対向する音声符号化
装置では、同図(b) に示す復号化部３で、圧縮符号化し
たデータから音声データを再生するとともに無音化情報
を抽出して、雑音挿入部４’に加える。雑音挿入部４’
で、再生された音声データの無音区間に所定の雑音信号
を挿入して出力する。このようにして再生された音声が
全く無音となることを防止していた。On the other hand, in the voice encoding device facing each other through the transmission path, the decoding unit 3 shown in FIG. 1B reproduces the voice data from the compression encoded data and extracts the silence information. , Noise adding section 4 ′. Noise insertion unit 4 '
Then, a predetermined noise signal is inserted and output in the silent section of the reproduced voice data. The sound reproduced in this way is prevented from becoming completely silent.

【０００５】[0005]

【発明が解決しようとする課題】しかしながら上述した
装置の構成においては、音声レベルにより有音／無音の
識別をしており、閾値よりも低いレベルになった後直ち
に無音化処理を行うために、復号器側で音声が途切れた
ような違和感を生じさせ、音質を劣化させるという問題
点があった。However, in the configuration of the above-mentioned device, the presence / absence of voice is discriminated by the voice level, and in order to perform the silence processing immediately after the level becomes lower than the threshold value, There is a problem in that the decoder causes an uncomfortable feeling that the sound is interrupted and the sound quality is deteriorated.

【０００６】又、雑音挿入に際して、所定レベルの雑音
信号を入力側の音声レベルに関わらず挿入しているため
に、再生側では有音処理部の周囲雑音と挿入雑音の違い
により、有音区間と無音区間の間に音声の違和感を生じ
させ、音質を劣化させるという問題点があった。In addition, since a noise signal of a predetermined level is inserted regardless of the voice level of the input side at the time of noise insertion, the voice side section on the playback side may differ due to the difference between the ambient noise of the voice processing unit and the insertion noise. There is a problem in that the sound quality is deteriorated by causing a feeling of strangeness in the sound between the silent section.

【０００７】したがって本発明の目的は、有音区間と無
音区間の切り替わり時に生じる再生音の違和感を軽減す
る音声符号化装置を提供することにある。Therefore, an object of the present invention is to provide a speech coding apparatus for reducing the discomfort of reproduced sound which occurs when a sound section and a silent section are switched.

【０００８】[0008]

【課題を解決するための手段】上記問題点は図１〜図４
に示す装置の構成によって解決される。送信側には、入
力音声信号に対して無音区間を検出して無音化情報を出
力する無音検出部１と、無音化情報と入力音声信号を符
号化して出力する符号化部２とを有し、受信側には、受
信した無音化情報と入力音声信号を複号して再生音声信
号及び無音化情報を出力する復号化部３と、復号化部の
出力の無音化情報に基づいて再生音声信号の無音区間に
所定レベルの雑音を挿入して出力する雑音挿入部４’と
を有する音声符号化装置において、図１の場合（請求項１）、５は送信側の無音検出部１と
符号化部２の間に設けられ、無音検出部で無音と判定し
た時点より所定時間遅延させて無音化情報を出力する時
間遅延部である。[Problems to be Solved by the Invention] The above-mentioned problems are shown in FIGS.
This is solved by the configuration of the device shown in. The transmission side has a silence detector 1 that detects a silence section in an input voice signal and outputs silence information, and an encoder 2 that encodes and outputs the silence information and the input voice signal. On the receiving side, the decoding unit 3 which decodes the received silence information and the input voice signal to output the reproduced voice signal and the silence information, and the reproduced voice based on the silence information output from the decoding unit. In a speech coding apparatus having a noise insertion unit 4 ′ that inserts and outputs a predetermined level of noise in a silent section of a signal, in the case of FIG. 1 (Claim 1), 5 is a silence detection unit 1 on the transmission side and a code. The time delay unit is provided between the conversion units 2 and outputs the silence information after delaying a predetermined time from the time when the silence detection unit determines that there is no sound.

【０００９】そして、受信側の雑音挿入部４で、復号化
部の出力の無音化情報に基づいて再生音声信号の無音区
間及び無音区間の直前の一定区間に所定レベルの雑音を
挿入して出力するように構成する。Then, the noise inserting section 4 on the receiving side inserts noise of a predetermined level into the silent section of the reproduced voice signal and a certain section immediately before the silent section based on the silence information output from the decoding section, and outputs the noise. To configure.

【００１０】図２の場合（請求項２）、５は送信側の無
音検出部１と符号化部２間に設けられ、無音検出部で無
音と判定した時点より所定時間遅延させて無音化情報を
出力する時間遅延部である。In the case of FIG. 2 (Claim 2), 5 is provided between the silence detector 1 and the encoder 2 on the transmitting side, and the silence information is delayed by a predetermined time from the time when the silence detector determines that there is no silence. Is a time delay unit that outputs

【００１１】又６は受信側に設けられ、無音区間の直前
の一定区間で復号化部の出力の再生音声信号と所定レベ
ルの雑音とをミキシングするミキサである。図３の場合（請求項３）、５は送信側の無音検出部１と
符号化部２の間に設けられ、無音検出部で無音と判定し
た時点より所定時間遅延させて無音化情報を出力する時
間遅延部である。A mixer 6 is provided on the receiving side and mixes the reproduced voice signal output from the decoding section with a predetermined level of noise in a certain section immediately before the silent section. In the case of FIG. 3 (Claim 3), 5 is provided between the silence detector 1 and the encoder 2 on the transmission side, and outputs the silence information after a predetermined time delay from the time when the silence detector determines that there is no sound. It is a time delay unit.

【００１２】７は受信側の復号化部３と雑音挿入部４の
間に設けられ、復号化部の出力の再生音声信号のレベル
を算出し、再生音声信号の算出レベルから無音区間に挿
入するための雑音レベルを算出して出力するレベル算出
部である。Reference numeral 7 is provided between the decoding unit 3 and the noise insertion unit 4 on the receiving side, calculates the level of the reproduced voice signal output from the decoding unit, and inserts it from the calculated level of the reproduced voice signal into the silent section. Is a level calculation unit that calculates and outputs a noise level for

【００１３】そして、雑音挿入部４で、復号化部の出力
の再生音声信号の無音区間にレベル算出部で算出された
レベルの雑音を挿入して出力するように構成する。図４の場合（請求項４）、５は送信側の無音検出部１と
符号化部２の間に設けられ、無音検出部で無音と判定し
た時点より所定時間遅延させて無音化情報を出力する時
間遅延部である。Then, the noise insertion unit 4 is configured to insert the noise of the level calculated by the level calculation unit into the silent section of the reproduced voice signal output from the decoding unit and output the noise. In the case of FIG. 4 (Claim 4), 5 is provided between the silence detector 1 and the encoder 2 on the transmission side, and outputs the silence information after a predetermined time delay from the time when the silence detector determines that there is no sound. It is a time delay unit.

【００１４】７は、受信側の復号化部３と雑音挿入部４
の間に接続され、復号化部の出力の再生音声信号のレベ
ルを算出し、再生音声信号の算出レベルから無音区間に
挿入するための雑音レベルを算出して出力するレベル算
出部である。Reference numeral 7 is a decoding unit 3 and a noise insertion unit 4 on the receiving side.
The level calculation unit is connected between the two, calculates the level of the reproduced voice signal output from the decoding unit, and calculates and outputs the noise level for inserting into the silent section from the calculated level of the reproduced voice signal.

【００１５】６は、無音区間の直前の一定区間で、復号
化部の出力の再生音声信号とレベル算出部で算出された
レベルの雑音とをミキシングするミキサである。上記
６、７を受信部に設ける。Numeral 6 is a mixer for mixing the reproduced voice signal output from the decoding section and the noise of the level calculated by the level calculation section in a fixed section immediately before the silent section. The above 6 and 7 are provided in the receiver.

【００１６】[0016]

【作用】図１において、送信側の無音検出部１により無
音と判定した時点に対して、実際に無音化処理を適用す
る時刻を時間遅延部５により所定時間だけ遅延させる。
したがって、音声信号レベルが所定の閾値よりも低くな
った直後に無音化処理することを避けることができ、か
つ受信側の雑音挿入部４で、再生音声信号の無音区間及
び無音区間の直前の一定区間に所定レベルの雑音を挿入
して出力するため、有音区間と無音区間の切り替わり時
に生じる再生音の違和感を軽減することができる。In FIG. 1, the time delay unit 5 delays the time when the silence processing is actually applied from the time when the silence detecting unit 1 on the transmitting side determines that there is no sound by a predetermined time.
Therefore, it is possible to avoid performing the silence processing immediately after the audio signal level becomes lower than the predetermined threshold value, and the noise inserting unit 4 on the receiving side can suppress the silence interval of the reproduced audio signal and a constant interval immediately before the silent interval. Since a predetermined level of noise is inserted and output in a section, it is possible to reduce a feeling of strangeness in reproduced sound that occurs when a sound section and a silent section are switched.

【００１７】図２において、復号器側で無音区間と判断
される直前の再生音声信号を含む有音区間でだけ、無音
区間に挿入する所定レベルの雑音と再生音声信号とをミ
キサ６でミキシングする。この結果、有音区間と無音区
間をオーバーラップさせることにより徐々に移行させる
ため、有音区間と無音区間の切り替わり時に生じる再生
音の違和感を軽減することができる。In FIG. 2, the mixer 6 mixes a predetermined level of noise to be inserted into the silent section with the reproduced voice signal only in the voiced section including the reproduced voice signal immediately before being judged to be a silent section by the decoder side. . As a result, since the voiced section and the silent section are gradually shifted by overlapping each other, it is possible to reduce the discomfort of the reproduced sound generated when the voiced section and the silent section are switched.

【００１８】図３において、復号器側のレベル算出部８
で、無音区間と判断される直前の有音区間の再生音声信
号のレベルから、無音区間に挿入する雑音レベルを算出
する。無音区間と判断される直前の有音区間は時間遅延
部５により無音化を遅らせているため有音となった部分
で、本来無音処理を適用されるべき区間である。In FIG. 3, the level calculator 8 on the decoder side
Then, the noise level to be inserted into the silent section is calculated from the level of the reproduced voice signal in the sound section immediately before being determined as the silent section. The voiced section immediately before being determined to be a voiceless section is a section that becomes voiced because the time delay unit 5 delays the silence, and is a section to which the silence processing should be originally applied.

【００１９】したがって、この区間の再生音は周囲雑音
と判定される。この区間の平均レベルを算出し、それに
相当する雑音を無音区間に挿入する。この結果、有音部
分の周囲雑音と無音部分の挿入雑音が近似されるため、
有音区間と無音区間の切り替わり時に生じる再生音の違
和感を軽減することができる。Therefore, the reproduced sound in this section is determined to be ambient noise. The average level of this section is calculated, and noise corresponding to that is inserted into the silent section. As a result, the ambient noise in the voiced part and the insertion noise in the silent part are approximated,
It is possible to reduce the discomfort of the reproduced sound that occurs when the sound section and the silent section are switched.

【００２０】図４に示す第４の発明は、前述した図３
（第３の発明）の作用で述べた挿入雑音レベルを直前の
有音区間から算出する方法に、図２（第２の発明）の作
用で述べた有音区間と無音区間のミキシング再生方法を
付加するものである。The fourth invention shown in FIG. 4 is the same as that of FIG.
In the method of calculating the insertion noise level from the immediately preceding voiced section described in the operation of (Third invention), the mixing reproduction method of the voiced section and the silent section described in the operation of FIG. 2 (Second invention) is used. It is something to add.

【００２１】この結果、有音区間と無音区間の切り替わ
り時に生じる再生音の違和感を軽減することができる。As a result, it is possible to reduce the discomfort of the reproduced sound which occurs when the sound section and the silent section are switched.

【００２２】[0022]

【実施例】図１は第１の発明の原理図、兼実施例の装置
の構成を示すブロック図である。図２は第２の発明の原
理図、兼実施例の装置の構成を示すブロック図である。DESCRIPTION OF THE PREFERRED EMBODIMENTS FIG. 1 is a principle diagram of the first invention and a block diagram showing the construction of an apparatus according to an embodiment. FIG. 2 is a principle diagram of the second invention and is a block diagram showing the configuration of the apparatus of the embodiment.

【００２３】図３は第３の発明の原理図、兼実施例の装
置の構成を示すブロック図である。図４は第４の発明の
原理図、兼実施例の装置の構成を示すブロック図であ
る。図５は第４の発明の実施例のミキシング方法を説明
するための図である。FIG. 3 is a block diagram showing the principle of the third invention and the structure of the apparatus of the embodiment. FIG. 4 is a principle diagram of the fourth invention and is a block diagram showing the configuration of the apparatus of the embodiment. FIG. 5 is a diagram for explaining the mixing method of the embodiment of the fourth invention.

【００２４】全図を通じて同一符号は同一対象物を示
す。まず第１の発明の実施例について図１を用いて説明
する。同図(a)に示す符号器側の無音検出部１では入力
音声データの音声レベルが所定の閾値未満である状態が
一定時間以上継続した時無音区間であると判定し、その
判定結果である無音化情報を時間遅延部５に加える。時
間遅延部５では、この無音化情報を受信すると所定時間
（例えばτとする）だけ遅延させた後、音声データとと
もに符号化部２に加える。符号化部２では、所定時間
（τ）だけ遅延された無音区間に対して無音化情報を符
号化対象とし、上記所定時間（τ）だけ遅延された区間
を含めて有音区間については所定の圧縮符号化方式によ
り音声データを圧縮符号化して、伝送路に送出する。The same reference numerals denote the same objects throughout the drawings. First, an embodiment of the first invention will be described with reference to FIG. In the silence detector 1 on the encoder side shown in FIG. 6A, when the state in which the voice level of the input voice data is less than a predetermined threshold value continues for a certain time or more, it is determined to be a silent section, and the result is the determination result. The silence information is added to the time delay unit 5. When the time delay unit 5 receives the silence information, the time delay unit 5 delays it by a predetermined time (for example, τ) and then adds it to the encoding unit 2 together with the voice data. The encoding unit 2 encodes the silence information to the silent section delayed by a predetermined time (τ), and determines a predetermined voiced section including a section delayed by the predetermined time (τ). The audio data is compression-encoded by the compression encoding method and sent to the transmission path.

【００２５】一方、伝送路を介して対向する音声符号化
装置では、同図(b) に示す復号化部３で、圧縮符号化し
たデータから音声データを再生するとともに無音化情報
を抽出して、雑音挿入部４に加える。雑音挿入部４で、
再生された音声データの無音区間に所定レベルの雑音信
号を挿入して出力する。On the other hand, in the voice encoding device facing each other through the transmission line, the decoding unit 3 shown in FIG. 1B reproduces the voice data from the compression encoded data and extracts the silence information. , To the noise insertion unit 4. In the noise insertion unit 4,
A noise signal of a predetermined level is inserted and output in the silent section of the reproduced voice data.

【００２６】この結果、音声レベルが所定の閾値より低
くなった直後に無音化処理することを避けることがで
き、かつ雑音挿入部４で、再生音声信号の無音区間及び
無音区間の直前の一定区間に所定レベルの雑音を挿入し
て出力するため、有音区間と無音区間の切り替わり時に
生じる音声の違和感を軽減することができる。As a result, it is possible to avoid the silence processing immediately after the voice level becomes lower than the predetermined threshold value, and the noise insertion unit 4 makes the noise insertion section 4 a silent section and a fixed section immediately before the silent section. Since a predetermined level of noise is inserted and output in, it is possible to reduce the uncomfortable feeling of the voice that occurs at the time of switching between the voiced section and the silent section.

【００２７】次に第２の発明の実施例について、図２を
用いて説明する。図２において、(a) に示す符号器側の
構成は前述した第１の発明の場合と同様であるため、そ
の説明を省略する。Next, an embodiment of the second invention will be described with reference to FIG. In FIG. 2, the configuration on the encoder side shown in (a) is the same as in the case of the first invention described above, and therefore its explanation is omitted.

【００２８】同図(b) に示す復号化部３で、前述したと
同様に圧縮符号化したデータから音声データを再生する
とともに無音化情報を抽出する。この再生音声データと
無音化情報を雑音挿入部４及び新たに設けたミキサ６に
加える。そして、ミキサ６で、無音区間と判定される直
前の有音区間（前述したτの区間とは必ずしも一致しな
くてもよい）の再生音声データに無音区間に挿入する挿
入雑音をミキシングして、この無音区間の直前の有音区
間の時だけ出力する。In the decoding unit 3 shown in FIG. 2B, the voice data is reproduced from the data compression-coded in the same manner as described above, and the silence information is extracted. The reproduced voice data and the silence information are added to the noise insertion unit 4 and the newly provided mixer 6. Then, the mixer 6 mixes the insertion noise to be inserted into the silent section into the reproduced voice data of the sound section immediately before being determined as the silent section (which may not necessarily match the section of τ described above), It is output only in the voiced section immediately before this silent section.

【００２９】又、無音区間及び無音区間の直前以外の有
音区間の時には雑音挿入部４で、再生された音声データ
の無音区間に所定の雑音信号を挿入して出力する。この
結果、有音区間と無音区間をオーバーラップさせること
により徐々に移行させるため、有音区間と無音区間の切
り替わり時に生じる再生音の違和感を軽減することがで
きる。Further, when there is a silent section or a sound section other than immediately before the silent section, the noise inserting section 4 inserts a predetermined noise signal into the silent section of the reproduced voice data and outputs it. As a result, since the voiced section and the silent section are gradually shifted by overlapping each other, it is possible to reduce the discomfort of the reproduced sound generated when the voiced section and the silent section are switched.

【００３０】次に第３の発明の実施例について、図３を
用いて説明する。図３において、(a) に示す符号器側の
構成は前述した第１の発明の場合と同様であるため、そ
の説明を省略する。Next, an embodiment of the third invention will be described with reference to FIG. In FIG. 3, the configuration on the encoder side shown in (a) is the same as in the case of the above-described first invention, and therefore its explanation is omitted.

【００３１】同図(b) に示す復号化部３で、前述したと
同様に圧縮符号化したデータから音声データを再生する
とともに無音化情報を抽出する。そして、この再生音声
データをレベル算出部７に入力して、無音区間と判定さ
れる直前の有音区間（前述したτの区間とは、必ずしも
一致しなくてもよい）の再生音声データのレベルから無
音区間に挿入する挿入雑音レベルを計算する。この挿入
雑音レベルの計算結果を再生音声データとともに雑音挿
入部４に加える。そして、雑音挿入部４で、上記計算に
より得られた雑音レベルに基づいて挿入雑音を発生させ
てこれを無音区間に挿入して出力する。In the decoding section 3 shown in FIG. 3B, the voice data is reproduced from the data compression-coded as described above, and the silence information is extracted. Then, the reproduced voice data is input to the level calculation unit 7, and the level of the reproduced voice data of the voiced section (which does not necessarily coincide with the section of τ described above) immediately before being determined as the silent section. Then, the insertion noise level to be inserted in the silent section is calculated. The calculation result of this insertion noise level is added to the noise insertion unit 4 together with the reproduced voice data. Then, the noise insertion unit 4 generates insertion noise based on the noise level obtained by the above calculation, inserts the insertion noise in the silent section, and outputs it.

【００３２】復号器側で無音区間と判定される直前の有
音区間は時間遅延部５により無音化を遅らせているため
有音区間となった部分で（正確には一致しない時もあ
る）、本来無音処理を適用されるべき区間である。した
がって、この区間の再生音は周囲雑音と判断される。こ
の区間の平均レベルを算出し、それに相当する雑音を無
音区間に挿入する。この結果、有音部分の周囲雑音と無
音部分の挿入雑音が近似されるため、有音区間と無音区
間の切り替わり時に生じる再生音の違和感を軽減するこ
とができる。The voiced section immediately before being determined to be a voiceless section on the decoder side is the voiced section because the time delay unit 5 delays the silence (in some cases, it does not exactly match). This is the section where the silence processing should be applied. Therefore, the reproduced sound in this section is determined to be ambient noise. The average level of this section is calculated, and noise corresponding to that is inserted into the silent section. As a result, since the ambient noise in the voiced portion and the insertion noise in the voiceless portion are approximated, it is possible to reduce the discomfort of the reproduced sound that occurs when the voiced section and the silence section are switched.

【００３３】次に第４の発明の実施例について、図４を
用いて説明する。図４において、(a) に示す符号器側の
構成は前述した第１の発明の場合と同様であるため、そ
の説明を省略する。Next, an embodiment of the fourth invention will be described with reference to FIG. In FIG. 4, the configuration on the encoder side shown in (a) is the same as in the case of the above-described first invention, and therefore its explanation is omitted.

【００３４】同図(b) に示す復号化部３で、前述したと
同様に圧縮符号化したデータから音声データを再生する
とともに無音化情報を抽出する。そして、この再生音声
データを復号化部３内に有するＲＡＭ（図示しない）に
一時記憶し、次の無音化情報の区間まで処理を待つ。次
の再生音声区間が無音と判断された時一時記憶した一区
間前の再生音声データをレベル算出部７に入力して、一
区間前の有音区間の平均レベルを算出し、次の無音区間
の挿入雑音レベルとする。この挿入雑音レベルの計算結
果を雑音挿入部４及びミキサ６に加える。In the decoding section 3 shown in FIG. 3B, the voice data is reproduced from the compression-coded data as described above, and the silence information is extracted. Then, the reproduced voice data is temporarily stored in the RAM (not shown) included in the decoding unit 3, and the processing is waited until the next section of the silence information. When it is determined that the next reproduced voice section is silent, the temporarily stored reproduced voice data of the previous section is input to the level calculation unit 7, the average level of the voiced section of the previous section is calculated, and the next silent section is calculated. The insertion noise level of The calculation result of this insertion noise level is added to the noise insertion unit 4 and the mixer 6.

【００３５】無音区間直前の一定区間ではミキサ６で、
この雑音レベルに基づいて挿入雑音を発生させて、復号
化部３から加えた無音区間直前の有音区間の再生音声デ
ータとミキシングして出力音声データを得る。次に、無
音区間では雑音挿入部４で、レベル算出部７で算出した
雑音レベルに基づいて挿入雑音を発生させて、復号化部
３から加えた再生音声データの無音区間に挿入して出力
する。又、次の再生音声区間が有音の時は復号化部３か
ら雑音挿入部４に加えた再生音声データはそのまま出力
音声データとなる。In the fixed section immediately before the silent section, the mixer 6
Insertion noise is generated based on this noise level, and mixed with the reproduced voice data of the voiced section immediately before the silent section added from the decoding unit 3 to obtain output voice data. Next, in the silent section, the noise insertion unit 4 generates insertion noise based on the noise level calculated by the level calculation unit 7, inserts it into the silent section of the reproduced voice data added from the decoding unit 3, and outputs it. . When the next reproduced voice section is voiced, the reproduced voice data added from the decoding unit 3 to the noise inserting unit 4 becomes the output voice data as it is.

【００３６】尚、上述したミキシングの方法は、有音区
間の再生音と挿入雑音のそれぞれに図５に示すような窓
をかけて（重み付けを行って）加えることにより行う。
この結果、無音区間直前の有音区間では再生音声信号と
挿入雑音をそれぞれ重み付けを行ってミキシングし、無
音区間では挿入雑音をそのまま挿入しているため有音区
間と無音区間が滑らかにつながり、有音区間と無音区間
の切り替わり時に生じる再生音の違和感を軽減すること
ができる。The above-mentioned mixing method is carried out by adding (weighting) a window as shown in FIG. 5 to each of the reproduced sound and the insertion noise in the voiced section.
As a result, in the voiced section immediately before the silence section, the reproduced voice signal and the insertion noise are weighted and mixed, respectively, and in the silence section, the insertion noise is inserted as it is, so that the voiced section and the silence section are smoothly connected. It is possible to reduce the discomfort of the reproduced sound that occurs when switching between the sound section and the silent section.

【００３７】[0037]

【発明の効果】以上説明したように本発明によれば、有
音区間と無音区間の切り替わり時に生じる再生音の違和
感が軽減されるという効果を奏し、音声符号化装置にお
いてその音質の向上に寄与するところが大きい。As described above, according to the present invention, it is possible to reduce an uncomfortable feeling of reproduced sound which occurs when a sound section and a silent section are switched, and to contribute to the improvement of the sound quality in a speech encoding apparatus. There is a lot to do.

[Brief description of drawings]

【図１】は第１の発明の原理図、兼実施例の装置の構成
を示すブロック図、FIG. 1 is a principle diagram of the first invention, and a block diagram showing a configuration of an apparatus according to an embodiment;

【図２】は第２の発明の原理図、兼実施例の装置の構成
を示すブロック図、FIG. 2 is a principle diagram of the second invention, and is a block diagram showing the configuration of the device of the embodiment.

【図３】は第３の発明の原理図、兼実施例の装置の構成
を示すブロック図、FIG. 3 is a principle diagram of a third invention, and a block diagram showing a configuration of a device according to an embodiment;

【図４】は第４の発明の原理図、兼実施例の装置の構成
を示すブロック図、FIG. 4 is a principle diagram of a fourth invention, and is a block diagram showing a configuration of a device according to an embodiment;

【図５】は第４の発明の実施例のミキシング方法を説明
するための図、FIG. 5 is a diagram for explaining a mixing method according to an embodiment of the fourth invention,

【図６】は従来例の音声符号化装置の構成を示すブロッ
ク図である。FIG. 6 is a block diagram showing a configuration of a conventional speech encoding apparatus.

[Explanation of symbols]

１は無音検出部、２は符号化部、３は復号化部、４、
４’は雑音挿入部、５は時間遅延部、６はミキサ、７は
レベル算出部を示す。1 is a silence detector, 2 is an encoder, 3 is a decoder, 4,
Reference numeral 4'denotes a noise insertion unit, 5 a time delay unit, 6 a mixer, and 7 a level calculation unit.

───────────────────────────────────────────────────── フロントページの続き (72)発明者岡崎晃二神奈川県川崎市中原区上小田中1015番地富士通株式会社内 (72)発明者松尾直司神奈川県川崎市中原区上小田中1015番地富士通株式会社内 (72)発明者信本俊明福岡県福岡市博多区博多駅前３丁目22番８号富士通九州ディジタル・テクノロジ株式会社内 ─────────────────────────────────────────────────── ─── Continuation of the front page (72) Koji Okazaki, 1015 Kamiodanaka, Nakahara-ku, Kawasaki-shi, Kanagawa, Fujitsu Limited (72) Inventor Naoji Matsuo, 1015, Kamikodanaka, Nakahara-ku, Kawasaki, Kanagawa (within Fujitsu Limited) 72) Inventor Toshiaki Nobumoto 3-22-8 Hakataekimae, Hakata-ku, Fukuoka City, Fukuoka Prefecture Fujitsu Kyushu Digital Technology Co., Ltd.

Claims

[Claims]

1. A silence detector (1) for detecting a silence section in an input voice signal and outputting silence information to a transmitting side.
And a coding unit (2) for coding and outputting the silence information and the input voice signal, and the receiving side decodes the received silence information and the input voice signal and reproduces the voice. A decoding unit (3) that outputs a signal and silence information, and noise insertion that inserts and outputs a predetermined level of noise in the silence section of the reproduced audio signal based on the silence information output from the decoding unit. A speech coding apparatus having a section (4 ′), connected between the silence detecting section (1) and the coding section (2) on the transmitting side for a predetermined time from the time when the silence detecting section determines that there is no sound. A time delay unit (5) for delaying and outputting the silence information is provided, and the noise insertion unit (4) on the receiving side, based on the silence information output from the decoding unit, silences the reproduced audio signal. A speech coding apparatus characterized in that a predetermined level of noise is inserted and output in a section and a fixed section immediately before the silent section. Place

2. A silence detecting unit (1) for detecting a silence section in an input voice signal and outputting silence information on the transmitting side.
And a coding unit (2) for coding and outputting the silence information and the input voice signal, and the receiving side decodes the received silence information and the input voice signal and reproduces the reproduced voice. A decoding unit (3) that outputs a signal and silence information, and a noise insertion that outputs by inserting a predetermined level of noise into a silence section of the reproduced voice signal based on the silence information output from the decoding unit A speech coding apparatus having a section (4 ′), connected between the silence detecting section (1) and the coding section (2) on the transmitting side for a predetermined time from the time when the silence detecting section determines that there is no sound. A time delay unit (5) for delaying and outputting the silence information is provided, and the reproduced voice signal output from the decoding unit and noise of a predetermined level are provided to the reception side in a certain section immediately before the silence section. A mixer (6) for mixing is provided, and is output from the mixer in a certain section immediately before the silent section,
Further, the speech coding apparatus is characterized in that the noise insertion section (4) outputs the voiced section except the silent section and a certain section immediately before the silent section.

3. A silence detector (1) for detecting a silence section in an input voice signal and outputting silence information on the transmitting side.
And a coding unit (2) for coding and outputting the silence information and the input voice signal, and the receiving side decodes the received silence information and the input voice signal and reproduces the reproduced voice. A decoding unit (3) that outputs a signal and silence information, and a noise insertion that outputs by inserting a predetermined level of noise into a silence section of the reproduced voice signal based on the silence information output from the decoding unit A speech coding apparatus having a section (4 ′), connected between the silence detecting section (1) and the coding section (2) on the transmitting side for a predetermined time from the time when the silence detecting section determines that there is no sound. A time delay unit (5) for delaying and outputting the silence information is provided, and is connected between the decoding unit (3) and the noise insertion unit (4) on the receiving side, and the reproduced voice output from the decoding unit is provided. A level calculator that calculates the signal level, calculates the noise level to be inserted into the silent section from the calculated level of the reproduced audio signal, and outputs the noise level. Part
(7) is provided, and the noise insertion section (4) inserts and outputs the noise of the level calculated by the level calculation section in the silent section of the reproduced voice signal output from the decoding section. A speech coding apparatus characterized by the above.

4. A silence detecting unit (1) for detecting a silence section in an input voice signal and outputting silence information on the transmitting side.
And a coding unit (2) for coding and outputting the silence information and the input voice signal, and the receiving side decodes the received silence information and the input voice signal and reproduces the reproduced voice. A decoding unit (3) that outputs a signal and silence information, and a noise insertion that outputs by inserting a predetermined level of noise into a silence section of the reproduced voice signal based on the silence information output from the decoding unit A speech coding apparatus having a section (4 ′), connected between the silence detecting section (1) and the coding section (2) on the transmitting side for a predetermined time from the time when the silence detecting section determines that there is no sound. A time delay unit (5) for delaying and outputting the silence information is provided, and is connected between the decoding unit (3) and the noise insertion unit (4) on the receiving side, and the reproduced voice output from the decoding unit is provided. A level calculator that calculates the signal level, calculates the noise level to be inserted into the silent section from the calculated level of the reproduced audio signal, and outputs the noise level. Part
(7) and a mixer (6) for mixing the reproduced voice signal output from the decoding unit and the noise of the level calculated by the level calculation unit in a certain section immediately before the silent section, Output from the mixer in a certain section immediately before the silent section,
A speech coding apparatus characterized in that, in the silent section, the noise of the level calculated by the level calculating section is inserted into the silent section of the reproduced speech signal by the noise inserting section 4 and output.