JP2002333900A

JP2002333900A - Sound-encoding/decoding method and sound-transmitting/receiving device

Info

Publication number: JP2002333900A
Application number: JP2001139642A
Authority: JP
Inventors: Akiko Susa; 明子須佐; Toshiyuki Matsuda; 俊幸松田; Toku Tsukada; 徳塚田; Mitsuhiro Noda; 充宏野田
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2001-05-10
Filing date: 2001-05-10
Publication date: 2002-11-22

Abstract

PROBLEM TO BE SOLVED: To provide a sound-encoding/decoding method and a sound-transmitting/receiving device that reduce the unnatural impression given by background noise without increasing the overall amount of transmitted data. SOLUTION: The transmitting-side device sends, as encoded data, the background noise input during the silent section that occurs immediately after the start of a call, and the receiving-side device calculates a noise-synthesis coefficient from the background noise obtained by decoding that data. The transmitting-side device also sends, as encoded data, a prescribed duration of background noise input immediately after the start of each silent section that occurs during the call. The receiving-side device updates the noise-synthesis coefficient on the basis of the background noise obtained by decoding that data and, during the silent period that follows the output of the decoded background noise, outputs pseudo noise generated from the updated noise-synthesis coefficient.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical Field of the Invention] The present invention relates to a speech encoding/decoding method and a speech transmitting/receiving apparatus, and more particularly to a speech encoding/decoding method and a speech transmitting/receiving apparatus in which data transmission from the transmitting side is restricted during silent sections and pseudo background noise generated on the receiving side is output.

[0002]

[Prior Art] In communication systems, the amount of transmitted data is reduced by data compression, suppression of unnecessary data transmission, and similar measures in order to cut communication costs and use communication resources effectively. Digital voice transmission systems, which encode and transmit speech data, adopt silence compression to reduce the amount of transmitted data. In silence compression, the transmitting side analyzes the state of the input signal to determine whether it is voiced or silent. In the voiced state, the encoded data of the input signal (speech signal) is sent to the communication line; in the silent state, transmission of encoded data is suppressed, and pseudo noise generated on the receiving side is output during the silent period.

[0003]

If, for simplicity, pseudo-noise synthesis in silent sections is omitted or plain white noise is output, a long silent section makes the listener worry that the line has been disconnected. The silence compression technique described above removes this worry and makes it possible to reduce the data transmitted during silent sections, which are said to account for about 60% of a voice call. Techniques for synthesizing noise in silent sections are described in, for example, JP-A-05-122165 or JP-A-10-097292. In addition, G.729 Annex B, standardized by the ITU-T, describes silence compression and noise synthesis for the CS-ACELP scheme.

[0004]

[Problems to Be Solved by the Invention] In conventional silence compression, however, during a section judged to be silent the transmitting side either stops sending encoded data or intermittently sends noise information that is smaller than the encoded speech data, and the receiving side outputs predicted pseudo noise as soon as the voiced section ends. The listener therefore hears the actual background noise superimposed on the talker's voice during voiced sections and the pseudo noise synthesized on the receiving side during silent sections. It is difficult, however, to synthesize pseudo noise accurately from a prediction made over a very short period. With conventional silence compression there is consequently a subtle difference between the pseudo noise output in silent sections and the actual background noise heard in voiced sections; the listener perceives this as unnatural, and overall speech quality is degraded. Furthermore, when voiced and silent states are determined from the state of the input signal, there is no guarantee that a silent section contains only noise. Consequently, if the receiving side uses a control scheme that outputs the decoded signal of the received encoded data during voiced sections and switches to pseudo-noise output immediately when a silent section begins, the end of an utterance sounds unnatural at the moment of switching from the voiced section to the silent section, and signal quality is degraded.

[0005]

There is also a method in which noise information is sent at a fixed period even in sections judged to be silent, and the receiving side synthesizes pseudo noise while updating, as needed, the coefficient information required for noise synthesis. Although this intermittent transmission of noise information improves the quality of the pseudo noise, it increases the overall amount of transmitted data and contributes to data delay on the transmission path.

[0006]

An object of the present invention is to provide a speech encoding/decoding method and a speech transmitting/receiving apparatus that can reduce the unnatural impression given by background noise without increasing the overall amount of transmitted data. Another object of the present invention is to provide a speech encoding/decoding method and a speech transmitting/receiving apparatus that can avoid the degradation of speech quality at the moment of switching from a voiced section to a silent section without increasing the overall amount of transmitted data.

[0007]

[Means for Solving the Problems] To solve the above problems, the speech encoding/decoding method of the present invention is applied to a voice transmission system in which a transmitting-side device sends encoded data of input speech at a predetermined period and a receiving-side device decodes and outputs the encoded data. The method is characterized in that: the transmitting-side device sends, as encoded data, the background noise input during the silent section that occurs immediately after the start of a call; the receiving-side device calculates a noise synthesis coefficient from the background noise obtained by decoding that encoded data; the transmitting-side device sends, as encoded data, a predetermined duration of background noise input immediately after the start of each silent section that occurs during the call; and the receiving-side device updates the noise synthesis coefficient on the basis of the background noise obtained by decoding that encoded data and, during the silent period that follows the output period of the decoded background noise, outputs pseudo noise synthesized from the updated noise synthesis coefficient.

[0008]

With this configuration, when a voiced section gives way to a silent section, the background noise input for a predetermined time immediately after the start of the silent section is sent as encoded data, and the receiving-side device outputs the decoded background noise. Even if an input signal judged to be silent contains a faint speech signal from the talker, that faint signal is therefore conveyed to the listener, which removes the unnatural impression at the switch from a voiced section to a silent section. In addition, the receiving-side device calculates the noise synthesis coefficient from the background noise obtained by decoding the encoded data received in the first silent section immediately after the start of the call, and then updates the coefficient on the basis of the background noise obtained by decoding the encoded data received in each subsequent silent section. Pseudo noise can therefore be synthesized so that it tracks changes over time in the background noise on the transmitting side, and the listener hears natural background noise during silent sections.

[0009]

More specifically, the present invention is characterized in that the transmitting-side device sends identification information distinguishing voiced sections from silent sections along with each item of encoded data for input speech and background noise, and the receiving-side device uses this identification information to determine whether received encoded data is for input speech or for background noise and, on receiving encoded data for background noise, executes the processing for calculating or updating the noise synthesis coefficient. In a preferred embodiment of the present invention, the receiving-side device identifies the last item of background-noise encoded data in each silent section during a call from the number of background-noise encoded data items received, and applies correction processing to the background noise obtained by decoding that item so that it gradually approaches the pseudo noise before outputting it.

[0010]

A speech transmitting/receiving apparatus according to the present invention comprises a transmitting unit that converts input speech into encoded data and periodically sends it to a communication line, and a receiving unit that decodes encoded data received from the communication line and outputs it as speech. The transmitting unit comprises: means for determining from the input speech signal whether the current section is voiced or silent; means for converting the speech signal input in voiced sections and the background noise input in silent sections into encoded data; and means for sending to the communication line, together with voiced/silent determination information, the encoded data generated in voiced sections and the encoded data generated for a predetermined time immediately after the start of each silent section. The receiving unit comprises: means for separating the encoded data received from the communication line from the voiced/silent determination information; means for generating first and second control signals from the voiced/silent determination information; means for calculating or updating a noise synthesis coefficient on the basis of feature information extracted from the received encoded data during a specific period indicated by the first control signal; and noise synthesis means for synthesizing pseudo noise at a predetermined period on the basis of the noise synthesis coefficient and outputting the pseudo noise intermittently in accordance with the second control signal. The apparatus is characterized in that, while encoded data is being received, the speech signal or background noise generated by the decoding means is output, and when there is no encoded data, the pseudo noise generated by the noise synthesis means is output.

[0011]

[Embodiments of the Invention] Embodiments of the present invention are described below with reference to the drawings. FIG. 1 shows a first embodiment of a speech encoding/decoding system according to the present invention. In the figure, 101A denotes the transmitting unit of speech transmitting/receiving apparatus A, 102B denotes the receiving unit of speech transmitting/receiving apparatus B, and 100 denotes the communication line (telephone network) connecting apparatuses A and B. Apparatus A also has a receiving unit similar to 102B, and apparatus B also has a transmitting unit similar to 101A; for simplicity, however, the speech encoding/decoding technique of the present invention is described here with attention to the exchange of encoded data between transmitting unit 101A and receiving unit 102B.

[0012]

The speech signal S1, input from the input terminal IN at the sampling period, is fed in sequence to the speech encoding unit 2 and the voiced/silent determination unit 3. The speech encoding unit 2 encodes the speech signal accumulated within a unit time that serves as the encoding processing period (several tens of sampling periods) using the predetermined encoding scheme adopted by the communication system, and converts it into encoded data (a packet) S2. The voiced/silent determination unit 3 extracts speech power or spectrum from the input speech S1 or from part of the encoded data S2 generated by the speech encoding unit 2, compares it with a preset threshold to determine whether the input signal S1 is in the voiced state or the silent state, and outputs the result as the voiced/silent determination signal S3. The determination signal S3 is output, for example, as a pulse signal in which bit "1" represents the voiced state and bit "0" the silent state, and is input to the period determination unit 5 and the data transmitting unit 6. The period determination unit 5 starts determining the transmission period of the encoded data S2 when triggered by a call start signal S0, which is input from an external source such as an exchange at the moment the listener lifts the handset.
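
The threshold comparison performed by the voiced/silent determination unit can be sketched as follows. This is a minimal illustration, assuming mean-square frame power is the only feature used; the function names, the sample frames, and the threshold value are hypothetical and do not come from the specification.

```python
from typing import Sequence


def frame_power(samples: Sequence[float]) -> float:
    """Mean-square power of one coding frame (0.0 for an empty frame)."""
    if not samples:
        return 0.0
    return sum(x * x for x in samples) / len(samples)


def voiced_bit(samples: Sequence[float], threshold: float) -> int:
    """Return 1 (voiced) if the frame power exceeds the threshold, else 0 (silent)."""
    return 1 if frame_power(samples) > threshold else 0


# Hypothetical frames: one loud frame and one near-silent frame.
loud = [0.5, -0.4, 0.6, -0.5]
quiet = [0.01, -0.02, 0.01, 0.0]
s3 = [voiced_bit(frame, threshold=0.01) for frame in (loud, quiet)]
```

A spectral feature could be compared in the same way; power alone is used here only to keep the sketch short.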

[0013]

In FIG. 2, (A) shows the speech input to the input terminal IN, (B) shows the voiced/silent determination signal S3, and (C) shows the state of encoded-data transmission from the transmitting unit 101A to the communication line 100. In this embodiment, as the transmission state in (C) shows, encoded data S2 is sent not only during voiced sections (periods T1 and T3) but also during the silent section immediately after the start of the call at t0 (period T0) and during a predetermined period ΔT set immediately after the start of each subsequent silent section.

[0014]

On receiving the call start signal S0, the period determination unit 5 enters the initial operation mode, turns the data transmission control signal S5 on, and waits for the voiced/silent determination signal S3 to change from "0" to "1". Once S3 has become "1", the unit shifts to the normal operation mode. In the normal operation mode, the period determination unit 5 detects the periods in which S3 is "1" and the predetermined period ΔT after S3 changes from "1" to "0", and turns the data transmission control signal S5 on during these periods. At all other times it keeps S5 off.
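
The two operation modes of the period determination unit 5 can be sketched per coding frame as follows. This is an illustrative sketch only: it assumes that S3 arrives as one bit per frame and that the period ΔT is expressed as a frame count; `send_control` and `n_frames` are hypothetical names.

```python
from typing import List, Sequence


def send_control(s3_bits: Sequence[int], n_frames: int) -> List[int]:
    """Return the per-frame state of S5 (1 = on, 0 = off) for one call.

    Initial mode: S5 stays on until S3 first becomes 1.
    Normal mode: S5 is on while S3 == 1 and for the first n_frames
    silent frames after each 1 -> 0 transition (the period delta-T).
    """
    s5 = []
    initial_mode = True
    zeros_since_voice = 0
    for bit in s3_bits:
        if bit == 1:
            initial_mode = False      # first voiced frame ends the initial mode
            zeros_since_voice = 0
            s5.append(1)
        elif initial_mode:
            s5.append(1)              # leading silent section T0 is sent in full
        else:
            zeros_since_voice += 1
            s5.append(1 if zeros_since_voice <= n_frames else 0)
    return s5


s3 = [0, 0, 1, 1, 0, 0, 0, 0, 1, 0, 0, 0]
s5 = send_control(s3, n_frames=2)
# s5 == [1, 1, 1, 1, 1, 1, 0, 0, 1, 1, 1, 0]
```

Only the frames where S5 is 1 would pass the transmission gate; the remaining silent frames are not sent.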

[0015]

The data transmission control signal S5 is supplied to the transmission gate 4. While S5 is on, the encoded data S2 output from the speech encoding unit 2 passes through the transmission gate and is sent from the data transmitting unit 6 to the communication line 7. The encoded data sent in the initial operation mode during the silent section (noise analysis period) T0 encodes the background noise on the talker's side and serves as the base data for the synthesis coefficient of the pseudo noise that the receiving side generates in silent sections. The encoded data sent in the normal operation mode during the silent section (noise update period) ΔT is used on the receiving side to update the pseudo-noise synthesis coefficient. When a voiced section gives way to a silent section, the transmitting side encodes and sends a predetermined duration of background noise and the receiving side corrects the pseudo-noise synthesis coefficient, so that the listener hears realistic pseudo noise that reflects changes over time in the background noise on the talker's side.

[0016]

The background-noise transmission period ΔT can be set, for example, by counting the bit-"0" pulses that arrive as the voiced/silent determination signal S3 at the encoding processing period and comparing the count with a predetermined threshold N. The period determination unit 5 can be implemented as a logic circuit that, in the normal operation mode, turns the data transmission control signal S5 on when the determination signal S3 becomes "1", keeps S5 on after S3 changes to "0" until the count reaches the threshold N, and, when the count exceeds N or when S3 returns to "1" before the count reaches N, switches S5 off and resets the counter.

[0017]

The period T0 in the initial operation mode can be determined by setting the threshold N to a maximum value that the count can never exceed. In the present invention, noise encoding in the initial operation mode may also be limited to the predetermined period that the receiving side needs for noise analysis. That is, a finite threshold L (where L > N) is set for the initial operation mode; the data transmission control signal S5 is kept on until the count exceeds the threshold L, and at the moment the count exceeds L, or at the moment the determination signal S3 changes to "1", S5 is switched off and the counter is reset. In this case, as shown for example in FIG. 3, a predetermined period ΔT0 immediately after the start of the call (t0) becomes the noise analysis period. The data transmitting unit 6 sends each speech packet to the communication line 7 with the voiced/silent determination bit indicated by the determination signal S3 appended to the encoded data S2.
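
Attaching the voiced/silent determination bit to each item of encoded data can be sketched as follows. The specification does not give a packet layout, so the one-byte flag placed in front of the payload is purely an assumption, as are the names `build_packet` and `split_packet`; a receiver would separate the two fields with the inverse operation.

```python
from typing import Tuple


def build_packet(encoded: bytes, voiced: int) -> bytes:
    """Prefix the encoded frame with a one-byte voiced/silent flag (assumed layout)."""
    if voiced not in (0, 1):
        raise ValueError("voiced must be 0 or 1")
    return bytes([voiced]) + encoded


def split_packet(packet: bytes) -> Tuple[bytes, int]:
    """Inverse of build_packet: return (encoded frame, voiced/silent flag)."""
    if len(packet) < 1:
        raise ValueError("empty packet")
    return packet[1:], packet[0]


# A hypothetical background-noise frame (flag 0) with a three-byte payload.
packet = build_packet(b"\x12\x34\x56", voiced=0)
s10, s11 = split_packet(packet)
```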

[0018]

On the listener's side, the data receiving unit 10 receives the encoded data stream (speech packet stream) sent over the communication line 7 and separates it into encoded data S10 and voiced/silent determination bits S11. The encoded data S10 is input to the speech decoding unit 12, and the decoded speech signal S12 is supplied to the speech/noise output unit 19. Part of the information generated in the speech decoding unit 12 while decoding the encoded data S10 is input to the feature information extraction unit 13, which extracts the feature information needed to synthesize pseudo noise and stores it in the feature information storage unit 14.

[0019]

Meanwhile, the voiced/silent determination bit S11 separated by the data receiving unit 10 is supplied to the period determination unit 15. The period determination unit 15 starts its period determination, for example, when triggered by a call start signal (off-hook signal) S0' generated at the moment the listener lifts the handset. While noise-encoded data is being received (periods T0 and ΔT in FIG. 2), the period determination unit 15 generates a first control signal S15 that instructs synthesis coefficient computation; while noise-encoded data or speech-encoded data is being received (periods T0, T1, ΔT, T2, ΔT, ... in FIG. 2), it generates a second control signal S150 that suppresses output of the synthesized pseudo noise.
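
The relationship between the received determination bit and the two control signals can be sketched as follows. This is a simplified illustration that assumes one reception slot per coding frame, with `None` marking a slot in which no packet arrived; `control_signals` is a hypothetical name, and the sketch ignores the separate handling of the initial and normal operation modes.

```python
from typing import List, Optional, Tuple


def control_signals(flags: List[Optional[int]]) -> List[Tuple[int, int]]:
    """Map per-frame reception state to (S15, S150).

    flags[i] is 1 for a received speech packet, 0 for a received
    background-noise packet, and None when no packet arrived.
    S15 = 1 requests coefficient calculation or update;
    S150 = 1 suppresses pseudo-noise output.
    """
    out = []
    for flag in flags:
        s15 = 1 if flag == 0 else 0            # only noise packets feed the coefficient
        s150 = 1 if flag is not None else 0    # any received packet suppresses pseudo noise
        out.append((s15, s150))
    return out


timeline = [0, 0, 1, 1, 0, None, None, 1]
signals = control_signals(timeline)
```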

[0020]

In the initial operation mode, the period determination unit 15 determines the first silent section (noise analysis period) T0 from the voiced/silent determination bit S11 and, during the noise analysis period, uses the first control signal S15 to instruct the synthesis coefficient calculation/update unit 16 to calculate the synthesis coefficient. While it receives the control signal S15 generated in the initial state, the synthesis coefficient calculation/update unit 16 reads the decoded speech signal (in this case, background noise) from the feature information storage unit 14 and calculates the noise synthesis coefficient. The synthesis coefficient is obtained, for example, by averaging the background noise of the noise analysis period T0 stored in the feature information storage unit 14. When the noise analysis period is the predetermined period ΔT0 shown in FIG. 3, the period determination unit 15 counts the "0" bits indicated by the voiced/silent determination bit S11 and stops outputting the control signal S15 when the count reaches the threshold L.
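
The averaging step that yields the initial synthesis coefficient can be sketched as follows. The sketch assumes that the stored feature information is one numeric vector per frame and that the coefficient is the element-wise mean of those vectors; the specification only says that an average is taken, so the vector representation and the name `initial_coefficients` are assumptions.

```python
from typing import List, Sequence


def initial_coefficients(features: Sequence[Sequence[float]]) -> List[float]:
    """Element-wise mean of the feature vectors stored during the analysis period."""
    if not features:
        raise ValueError("no background-noise frames were stored")
    dim = len(features[0])
    if any(len(f) != dim for f in features):
        raise ValueError("feature vectors must share one length")
    return [sum(f[i] for f in features) / len(features) for i in range(dim)]


# Three hypothetical two-element feature vectors from the analysis period.
stored = [[1.0, 0.5], [3.0, 1.5], [2.0, 1.0]]
coeffs = initial_coefficients(stored)
# coeffs == [2.0, 1.0]
```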

[0021]

After it stops outputting the first control signal S15 immediately after the start of the call, the period determination unit 15 shifts to the normal operation mode. In the normal operation mode, it distinguishes voiced sections from silent sections by the state changes of the voiced/silent determination bit S11, generates the control signal S15 during the noise update period ΔT at the very start (head) of each silent section, and instructs the synthesis coefficient calculation/update unit 16 to update the synthesis coefficient. The noise update period ΔT can be determined by starting to count "0" bits when the determination bit S11 changes from "1" to "0" and comparing the count with the threshold N.

[0022]

Once it has calculated the pseudo-noise synthesis coefficient, the synthesis coefficient calculation/update unit 16 shifts to the normal operation mode; while it receives each subsequent control signal S15, it reads background noise from the feature information storage unit 14 and updates the existing synthesis coefficient. Updating the synthesis coefficient means, for example, combining the currently held noise and the new noise with predetermined weights, such as by averaging. To avoid extreme changes in the synthesis coefficient, the existing coefficient value can be emphasized by giving it a larger weight than the new noise in the average.
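
A weighted update that favours the existing coefficient can be sketched as follows. The weight value 0.75 and the name `update_coefficients` are hypothetical; the specification states only that the existing value may receive the larger weight.

```python
from typing import List, Sequence


def update_coefficients(current: Sequence[float],
                        observed: Sequence[float],
                        keep_weight: float = 0.75) -> List[float]:
    """Weighted average of the existing and newly observed coefficients.

    A keep_weight above 0.5 gives the existing values more weight than
    the new observation, which limits abrupt changes.
    """
    if len(current) != len(observed):
        raise ValueError("length mismatch")
    if not 0.0 <= keep_weight <= 1.0:
        raise ValueError("keep_weight must be in [0, 1]")
    return [keep_weight * c + (1.0 - keep_weight) * o
            for c, o in zip(current, observed)]


updated = update_coefficients([2.0, 1.0], [4.0, 0.0])
# updated == [2.5, 0.75]
```

With `keep_weight=0.5` the update reduces to a plain average of the old and new values.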

[0023]

The period determination unit 15 keeps the pseudo-noise output suppression signal (second control signal) S150 on during the periods, determined from the voiced/silent determination bit S11 as described above, in which background-noise encoded data is received (periods T0 and ΔT in FIG. 2) and in which speech encoded data is received (periods T1, T2, ... in FIG. 2). The noise synthesis unit 18, in synchronization with a clock output from a timer at a predetermined sampling period, sets the noise synthesis coefficient from the synthesis coefficient calculation/update unit 16 in a noise synthesis filter, synthesizes pseudo noise S18 from the white noise generated by the white noise generation unit (excitation source) 17, and supplies it to the speech/noise output unit 19. While the data receiving unit 10 is receiving encoded data, the suppression signal S150 is on, output of the pseudo noise S18 from the noise synthesis unit 18 is suppressed, and the synthesized pseudo-noise data is cleared. Consequently, the speech/noise output unit 19 outputs decoded speech or background noise to the terminal OUT while encoded data is being received, and outputs the pseudo noise synthesized by the noise synthesis unit 18 to the terminal OUT during silent periods in which no encoded data is received.
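
One way to picture the noise synthesis filter and the output selection is the sketch below. It assumes, purely for illustration, that the noise synthesis coefficient is a set of all-pole (autoregressive) filter coefficients plus a gain; the specification does not state the filter structure, and `synthesize_noise`, `select_output`, and all numeric values are hypothetical.

```python
import random
from typing import List, Sequence


def synthesize_noise(ar_coeffs: Sequence[float], gain: float,
                     n_samples: int, seed: int = 0) -> List[float]:
    """Filter white noise through an all-pole filter (assumed structure).

    y[n] = gain * e[n] - sum_k ar_coeffs[k] * y[n - 1 - k]
    """
    rng = random.Random(seed)
    history = [0.0] * len(ar_coeffs)      # most recent output first
    out = []
    for _ in range(n_samples):
        e = rng.uniform(-1.0, 1.0)        # white excitation sample
        y = gain * e - sum(a * h for a, h in zip(ar_coeffs, history))
        out.append(y)
        if history:
            history = [y] + history[:-1]
    return out


def select_output(decoded: Sequence[float], pseudo: Sequence[float],
                  s150: int) -> Sequence[float]:
    """Output the decoded frame while S150 is on, otherwise the pseudo noise."""
    return decoded if s150 == 1 else pseudo


# One hypothetical 160-sample frame of pseudo noise from a first-order filter.
pseudo = synthesize_noise(ar_coeffs=[-0.5], gain=0.1, n_samples=160)
```

The fixed seed only makes the sketch reproducible; a real excitation source would not be reseeded for every frame.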

[0024]

FIG. 4 shows a second embodiment of the speech encoding/decoding system according to the present invention. This embodiment is characterized in that, in the receiving unit 102B shown in FIG. 1, a weighted moving average calculation unit 22 is used in place of the speech/noise output unit 19, and the speech signal and noise decoded by the speech decoding unit 12 are stored temporarily in the decoded speech storage unit 21 before being supplied to the weighted moving average calculation unit 22.

[0025]

While background-noise encoded data is being received (periods T0 and ΔT in FIG. 2) and while speech encoded data is being received (periods T1, T2, ... in FIG. 2), the speech decoding unit 12 outputs decoded speech or decoded noise S12, which is stored in the decoded speech storage unit 21. The speech signal and background noise stored in the decoded speech storage unit 21 are output to the terminal 20 through the speech/noise weighted moving average calculation unit 22. The pseudo noise output from the noise synthesis unit 18 in silent sections is likewise output to the terminal 20 through the speech/noise weighted moving average calculation unit 22.

【００２６】In this embodiment, to eliminate the discontinuity between the decoded noise output from the speech decoding unit 12 and the synthesized noise S18 output from the noise synthesis unit 18, the noise synthesis unit 18 starts outputting the pseudo noise S18 at the timing when the last decoded data of each noise update section is stored in the decoded voice storage unit 21. The voice/noise weighted moving average calculation unit 22 applies weighted moving average processing to the decoded noise data fetched from the decoded voice storage unit 21 and the pseudo noise S18 output from the noise synthesis unit 18, and outputs the result as background noise. In this case, in each noise update section ΔT, the period determination unit 15 compares the count of voiced/silent determination bits S11 whose value is "0" with a threshold "N-1". When the count reaches the threshold "N-1", it returns the suppression signal S150 to the off state, so that the decoded noise and the pseudo noise S18 can be input to the voice/noise weighted moving average calculation unit 22 at the same time.
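The threshold comparison performed by the period determination unit can be sketched as follows. This is only an illustration under stated assumptions: bit "0" is taken to mark a background-noise (silent) frame and "1" a voiced frame, N is taken to be the number of noise frames sent per update section, and the function name `suppression_states` and parameter `n_frames` are hypothetical.

```python
def suppression_states(bits, n_frames):
    """Return the state of S150 (True = on) after each received determination bit.

    Assumption: S150 stays on while data is received, but returns to off once
    n_frames - 1 consecutive "0" bits have been counted, so the last noise
    frame of the update section overlaps with the pseudo noise.
    """
    threshold = n_frames - 1
    silent_count = 0
    states = []
    for bit in bits:
        if bit == 1:
            silent_count = 0            # a voiced frame restarts the count
            states.append(True)
        else:
            silent_count += 1
            states.append(silent_count < threshold)
    return states
```

With `n_frames = 4`, the bit sequence `[1, 1, 0, 0, 0, 0]` gives `[True, True, True, True, False, False]`: the suppression signal drops when the third "0" bit is counted, leaving the final noise frame available for the weighted moving average.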

【００２７】FIG. 5 is a conceptual diagram of the weighted moving average processing. (A) shows the waveform of one packet of decoded noise S21 output from the decoded voice storage unit 21, (B) the weighting coefficient applied to the decoded noise S21, (C) the waveform of the pseudo noise (synthesized noise) S18 output from the noise synthesis unit 18, (D) the weighting coefficient applied to the pseudo noise S18, and (E) the waveform of the corrected pseudo noise output from the voice/noise weighted moving average calculation unit 22.

【００２８】As shown in (B) and (D) of FIG. 5, during the output period of the last encoded data in each noise update section ΔT, the weighting coefficient applied to the decoded noise S21 is gradually decreased while the weighting coefficient applied to the pseudo noise S18 is gradually increased, and the decoded noise and the pseudo noise are averaged with these weights. The background noise output from the voice/noise weighted moving average calculation unit 22 thus shifts smoothly from the decoded noise S21 to the pseudo noise S18. According to this embodiment, therefore, the receiving side can output background noise that sounds natural across voiced and silent sections.
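The weighted averaging described above amounts to a cross-fade over one packet. The sketch below assumes linear weight ramps, which the text does not specify; the function name `crossfade` is hypothetical.

```python
def crossfade(decoded, pseudo):
    """Blend decoded noise into pseudo noise over one packet.

    The weight on the decoded noise falls from 1 to 0 while the weight on the
    pseudo noise rises from 0 to 1 (linear ramps are an assumption).
    """
    n = len(decoded)
    if n != len(pseudo):
        raise ValueError("both packets must have the same length")
    out = []
    for i in range(n):
        w_pseudo = i / (n - 1) if n > 1 else 1.0
        w_decoded = 1.0 - w_pseudo
        out.append(w_decoded * decoded[i] + w_pseudo * pseudo[i])
    return out
```

For example, `crossfade([1.0] * 5, [0.0] * 5)` yields `[1.0, 0.75, 0.5, 0.25, 0.0]`: the output starts as pure decoded noise and ends as pure pseudo noise, with no step between the two.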

【００２９】A simple simulation compared the silence compression technique of the present invention with G.729 Annex B, which ITU-T has standardized as the silence compression scheme for CS-ACELP. The results showed that the present invention can reduce the total amount of data transmitted in silent sections to 1/4 or less and the number of data transmissions to 1/20 or less, and that it can also eliminate the data delay problem caused by intermittent transmission.

【００３０】[0030]

[Effects of the Invention] As is apparent from the above description, the present invention makes it possible to realize a speech encoding/decoding system that does not make the listener uncomfortable even when the amount of transmitted data is reduced. In addition, by transmitting encoded background-noise data at the beginning of each silent period and having the receiving side update the noise synthesis coefficient, pseudo noise can be synthesized that adapts flexibly to continually changing background noise.

[Brief description of the drawings]

FIG. 1 is a block diagram showing a first embodiment of the speech encoding/decoding system according to the present invention.

FIG. 2 is a diagram showing the relationship between voiced/silent sections and encoded-data transmission periods in the present invention.

FIG. 3 is a diagram showing another example of encoded-data transmission in the present invention.

FIG. 4 is a block diagram showing a second embodiment of the speech encoding/decoding system according to the present invention.

FIG. 5 is a diagram for explaining the weighted moving average processing executed by the voice/noise weighted moving average calculation unit 22 of FIG. 4.

[Explanation of symbols]

S0: call start signal, S1: input voice signal, 2: speech encoding unit, S2: encoded data, 3: voiced/silent determination unit, S3: voiced/silent determination bit, 4: transmission gate, 5: period determination unit, 6: data transmission unit, 7: communication line, 10: data reception unit, S10: encoded data, S11: voiced/silent determination bit, 12: speech decoding unit, S12: decoded signal, 13: feature information extraction unit, 14: feature information accumulation unit, 15: period determination unit, 16: synthesis coefficient calculation/update unit, 17: white noise generation unit, 18: noise synthesis unit, S18: pseudo (synthesized) noise, 19: voice/noise output unit, 20: voice output terminal, 21: decoded voice storage unit, 22: voice/noise weighted moving average calculation unit, 101A: transmitting unit, 101B: receiving unit.

Continued from the front page: (72) Inventor Toku Tsukada, c/o Communications Division, Hitachi, Ltd., 216 Totsuka-cho, Totsuka-ku, Yokohama-shi, Kanagawa. (72) Inventor Mitsuhiro Noda, c/o Communications Division, Hitachi, Ltd., 216 Totsuka-cho, Totsuka-ku, Yokohama-shi, Kanagawa. F-terms (reference): 5D045 CC05 DA11

Claims

[Claims]

1. A speech encoding/decoding method for a voice transmission system in which a transmitting-side device transmits encoded data of input speech at a predetermined period and a receiving-side device decodes and outputs the encoded data, wherein the transmitting-side device transmits, as encoded data, background noise input in a silent section occurring immediately after the start of a call; the receiving-side device calculates a noise synthesis coefficient from the background noise obtained by decoding that encoded data; the transmitting-side device transmits, as encoded data, background noise for a predetermined time input immediately after the start of each silent section occurring during the call; and the receiving-side device updates the noise synthesis coefficient on the basis of the background noise obtained by decoding that encoded data, and outputs pseudo noise synthesized on the basis of the updated noise synthesis coefficient during the silent period following the output period of the decoded background noise.

2. The speech encoding/decoding method according to claim 1, wherein the transmitting-side device transmits identification information indicating a voiced section or a silent section together with each piece of encoded data of input speech and background noise, and the receiving-side device identifies from the identification information whether received encoded data is for input speech or for background noise, and performs the process of calculating or updating the noise synthesis coefficient when encoded data for background noise is received.

3. The speech encoding/decoding method according to claim 2, wherein the receiving-side device determines the final encoded data for background noise on the basis of the number of pieces of background-noise encoded data received in each silent section during the call, and corrects the background noise obtained by decoding that encoded data so that it gradually approaches the pseudo noise before outputting it.

4. The speech encoding/decoding method according to any one of claims 1 to 3, wherein the transmitting-side device transmits, as encoded data, background noise for a predetermined time input in the silent section occurring immediately after the start of the call, and the receiving-side device calculates the noise synthesis coefficient from the background noise obtained by decoding the encoded data and outputs pseudo noise synthesized on the basis of the noise synthesis coefficient in the silent section following the output period of the decoded background noise.

5. A voice transmitting/receiving apparatus comprising a transmitting section that periodically transmits input voice to a communication line as encoded data, and a receiving section that decodes encoded data received from the communication line and outputs it as voice, wherein the transmitting section comprises: means for determining whether the input voice signal is in a voiced section or a silent section; means for converting the voice signal input in a voiced section and the background noise input in a silent section into encoded data; and means for transmitting to the communication line, together with voiced/silent determination information, the encoded data generated during a predetermined period immediately after the start of each silent section and the encoded data generated in voiced sections; and the receiving section comprises: means for separating the encoded data received from the communication line from the voiced/silent determination information; means for generating first and second control signals from the voiced/silent determination information; means for calculating or updating a noise synthesis coefficient on the basis of feature information extracted from the received encoded data during a specific period indicated by the first control signal; and noise synthesis means for synthesizing pseudo noise at a predetermined period on the basis of the noise synthesis coefficient and outputting the pseudo noise intermittently in accordance with the second control signal; the receiving section outputting the voice signal or background noise generated by the decoding means while encoded data is being received, and outputting the pseudo noise generated by the noise synthesis means when there is no encoded data.

6. A voice transmitting/receiving apparatus comprising a transmitting section that encodes input voice and periodically transmits the encoded data to a communication line, and a receiving section that decodes encoded data received from the communication line and outputs it as voice, wherein the transmitting section comprises: encoding means for converting an input signal including voice and background noise into encoded data at a predetermined period; means for determining whether the input signal is in a voiced section or a silent section; gate means for suppressing transmission of the encoded data except during voiced sections and a predetermined period set immediately after the start of each silent section; and means for sending the encoded data output from the gate means to the communication line together with determination information indicating a voiced section or a silent section; and the receiving section comprises: decoding means for decoding the received encoded data to generate a voice signal or background noise; feature extraction means for the background noise, connected to the decoding means; means for generating a control signal for noise synthesis coefficient calculation and a pseudo-noise output suppression signal on the basis of the voiced/silent determination information received from the communication line; means for calculating or updating a noise synthesis coefficient from the extracted feature information; noise synthesis means for synthesizing pseudo noise at a predetermined period on the basis of the noise synthesis coefficient and outputting the pseudo noise intermittently in accordance with the output suppression signal; and means for outputting to an output terminal the voice signal or background noise generated by the decoding means and the pseudo noise supplied from the noise synthesis means.