JP3481087B2

JP3481087B2 - Voice communication fluctuation absorption method

Info

Publication number: JP3481087B2
Application number: JP19602397A
Authority: JP
Inventors: 伸司薄葉
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1997-07-22
Filing date: 1997-07-22
Publication date: 2003-12-22
Anticipated expiration: 2017-07-22
Also published as: JPH1141287A

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、非同期網であるロ
ーカル・エリア・ネットワーク（以下、「ＬＡＮ」とい
う）、インタネットプロトコル（Internet Protocol 、
以下「ＩＰ」という）ネットワークであるインタネット
等のパケット通信網において、特にＩＰネットワークを
用いて音声通信を行う場合に、ネットワークのトラフィ
ック等により発生する遅延（これを「ゆらぎ」という）
を吸収するための音声通信ゆらぎ吸収方法に関するもの
である。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a local area network (hereinafter referred to as "LAN") which is an asynchronous network, an Internet protocol,
In the packet communication network such as the Internet which is a network (hereinafter referred to as “IP”), a delay caused by network traffic or the like (this is referred to as “fluctuation”) particularly when voice communication is performed using the IP network.
The present invention relates to a voice communication fluctuation absorbing method for absorbing noise.

【０００２】[0002]

【従来の技術】データ通信としてインタネットに代表さ
れるＩＰネットワークが広く普及している。最近、ＩＰ
ネットワーク通信における経済性に着目し、音声通信を
このＩＰネットワーク上で行ういわゆる「インタネット
電話」が商品化されつつある。しかし、データ通信網と
して発展したＩＰネットワークにおいて、通常の電話機
並みの音声を確保することは困難である。このＩＰネッ
トワークと同様な非同期網でも、非同期転送モード（As
ynchronous Transfer Mode、以下「ＡＴＭ」という）通
信においては、音声・データの統合通信を実現すること
が当初から考慮されているのに比較し、音声通信を意識
されずに発展したＩＰネットワークにおいて、一定の音
声品質を確保する上で課題がある。例えば、データ伝送
のためのフレーム単位として、ＩＰネットワークの基本
となるＬＡＮにおけるフレームは、音声通信を意識した
ＡＴＭが固定長の５３Byteセルであるのに対し、音声通
信に適するパケット交換方式における媒体アクセス制御
（Media Access Control、以下「ＭＡＣ」という）フレ
ーム上のデータは例えば、４６〜１５００Byteの可変長
である。2. Description of the Related Art IP networks represented by the Internet are widely used for data communication. Recently, IP
Focusing on economics in network communication, so-called "Internet telephones" that perform voice communication on this IP network are being commercialized. However, in an IP network that has developed as a data communication network, it is difficult to secure voice as much as a normal telephone. Even in the same asynchronous network as this IP network, the asynchronous transfer mode (As
Synchronous Transfer Mode (hereinafter referred to as “ATM”) communication has been considered from the beginning to realize integrated communication of voice and data, but compared to the case where it has been fixed in an IP network developed without being aware of voice communication. There is a problem in securing the voice quality. For example, as a frame unit for data transmission, a frame in a LAN, which is the basis of an IP network, has a fixed-length 53-byte cell in which ATM, which is conscious of voice communication, has a fixed length, whereas medium access in a packet switching system suitable for voice communication is performed. Data on a control (Media Access Control, hereinafter referred to as “MAC”) frame has a variable length of 46 to 1500 bytes, for example.

【０００３】ＩＰネットワークと音声通信との関連を整
理すると、ＩＰネットワーク側で確保されている通信帯
域が狭い場合は、遅延が大きくなる。逆に、ＩＰネット
ワーク側で確保されている通信帯域が広い場合は、遅延
が少ない。また、長い情報のファイル転送等にＩＰネッ
トワーク側が瞬間的バーストトラフィックがかかる場合
は、瞬間的な遅延が発生する。一方、音声通信において
は、音声の欠落がないほど、通話の明瞭性が確保でき、
また遅延が少ないほど、自然な会話ができる。このよう
なＩＰネットワークと音声通信との関連を考慮し、従
来、例えば図２に示すような音声・データ統合通信シス
テムが提案されている。この音声・データ統合通信シス
テムでは、ＩＰネットワーク１に設けたＬＡＮ間接続装
置であるルータ２に、伝送路３を介して音声集線装置
（以下、「音声ハブ」という）４が接続され、さらにこ
の音声ハブ４に伝送路５を介して、一般アナログ電話機
やＧ３型ファクシミリ装置（以下、「Ｇ３ＦＡＸ」とい
う）等の１台または複数台の端末装置６が接続されてい
る。音声ハブ４は、ＩＰネットワーク１と端末装置６と
の相互接続（ゲートウェイ）機能を有し、装置全体を制
御する中央処理装置（以下、「ＣＰＵ」という）、音声
データ蓄積用の送信バッファ４ａ、音声データ蓄積用の
受信バッファ４ｂ、及びアナログ音声信号をディジタル
信号に変換して送り出すと共にディジタル信号をアナロ
グ音声信号に再生するコーデック等で構成されている。
送信バッファ４ａ及び受信バッファ４ｂは、データを書
込んだ順に読出していくＦＩＦＯ（First In First Ou
t）方式のレジスタ等で構成されている。When the relationship between the IP network and the voice communication is sorted out, the delay becomes large when the communication band secured on the IP network side is narrow. On the contrary, when the communication band secured on the IP network side is wide, the delay is small. Further, when the IP network side receives a momentary burst traffic for file transfer of long information, a momentary delay occurs. On the other hand, in voice communication, the clarity of the call can be secured so that there is no loss of voice.
Also, the less the delay, the more natural conversation can be done. Considering the relationship between the IP network and the voice communication, a voice / data integrated communication system as shown in FIG. 2 has been conventionally proposed. In this integrated voice / data communication system, a voice concentrator (hereinafter referred to as “voice hub”) 4 is connected to a router 2 which is an inter-LAN connecting device provided in the IP network 1 via a transmission line 3, and further, One or a plurality of terminal devices 6 such as general analog telephones and G3 type facsimile devices (hereinafter referred to as “G3 FAX”) are connected to the voice hub 4 via a transmission line 5. The audio hub 4 has a function of interconnecting (gateway) the IP network 1 and the terminal device 6, and has a central processing unit (hereinafter referred to as “CPU”) that controls the entire device, a transmission buffer 4a for accumulating audio data, It is composed of a reception buffer 4b for accumulating voice data, a codec for converting an analog voice signal into a digital signal and transmitting the digital signal, and for reproducing the digital signal into an analog voice signal.
The transmission buffer 4a and the reception buffer 4b are FIFOs (First In First Ou
t) system registers, etc.

【０００４】一般アナログ電話機等の端末装置６からＩ
Ｐネットワーク１内の端末装置へ音声信号を送る場合、
該端末装置６から出力されたアナログ音声信号が伝送路
５を介して音声ハブ４へ送られる。音声ハブ４では、端
末装置６から送られてきたアナログ音声信号を例えばul
awＰＣＭ（Palse Code Modulation ；パルス符号変調）
で符号化して２進数表現の音声データを生成し、コーデ
ックによってディジタル信号に変換した後、送信バッフ
ァ４ａに蓄積していく。音声ハブ４内のＣＰＵでは、送
信バッファ４ａに蓄積した音声データを読出し、この音
声データに、送信手順や相手先アドレス等の制御情報で
あるヘッダを付加して音声パケットを生成し、この音声
パケットを伝送可能なＭＡＣフレームの形にして伝送路
３を介してルータ２へ出力する。ルータ２に送られてき
たＭＡＣフレームは、この音声パケット内のヘッダで指
定されるＩＰネットワーク１内の相手先の端末装置へ送
られる。From a terminal device 6 such as a general analog telephone, I
When sending a voice signal to a terminal device in the P network 1,
The analog audio signal output from the terminal device 6 is sent to the audio hub 4 via the transmission path 5. In the audio hub 4, the analog audio signal sent from the terminal device 6
awPCM (Palse Code Modulation)
Is encoded to generate audio data expressed in a binary number, converted into a digital signal by a codec, and then stored in the transmission buffer 4a. The CPU in the voice hub 4 reads the voice data stored in the transmission buffer 4a, adds a header that is control information such as a transmission procedure and a destination address to the voice data to generate a voice packet. Is transmitted to the router 2 via the transmission line 3 in the form of a transmittable MAC frame. The MAC frame sent to the router 2 is sent to the destination terminal device in the IP network 1 designated by the header in the voice packet.

【０００５】図３は、図２のシステムにおける従来の
音声通信ゆらぎ吸収方法を説明するための波形図であ
る。この図３を参照しつつ、ＩＰネットワーク１から端
末装置６へ音声信号を伝送する際に発生するゆらぎの吸
収方法について説明する。ＩＰネットワーク１内のある
端末装置から音声系ネットワーク内の端末装置６へ音声
信号を送る場合、該ＩＰネットワーク１内のある端末装
置からＭＡＣフレーム化された音声パケットが出力さ
れ、この音声パケットがルータ２を介して送信パケット
ＰＴの形で伝送路３へ送られる。伝送路３へ送られた送
信パケットＰＴは、音声ハブ４において到着パケットＰ
Ｒの形で受信される。音声ハブ４内において、ＣＰＵは
到着パケットＰＲから音声データを取出し、受信バッフ
ァ４ｂに順次書込んでいく。受信バッファ４ｂに一定の
音声データ量（即ち、バッファサイズ）ＢＳまで音声デ
ータが蓄積されると、該受信バッファ４ｂから、書込ま
れた順に音声データが読出され、コーデックによってア
ナログ音声信号に再生され、伝送路５を介して端末装置
６へ送られる。図３に示すように、到着パケットＰＲに
ゆらぎが含まれている場合、これが受信バッファ４ｂの
書込み及び読出し動作によって吸収される。受信バッフ
ァ４ｂの書込みから読出しの時間差が、コーデックによ
って再生されたアナログ音声信号の再生遅延時間とな
る。FIG. 3 is a waveform diagram for explaining a conventional voice communication fluctuation absorbing method in the system of FIG. With reference to FIG. 3, a method of absorbing fluctuations that occur when a voice signal is transmitted from the IP network 1 to the terminal device 6 will be described. When a voice signal is sent from a terminal device in the IP network 1 to a terminal device 6 in the voice network, a MAC framed voice packet is output from the terminal device in the IP network 1 and the voice packet is a router. 2 is sent to the transmission line 3 in the form of a transmission packet PT. The transmission packet PT sent to the transmission line 3 arrives at the voice hub 4 and arrives at the packet P.
It is received in the form of R. In the audio hub 4, the CPU takes out audio data from the arrival packet PR and sequentially writes it in the reception buffer 4b. Fixed in receive buffer 4b
When the voice data is accumulated up to the voice data amount (that is, the buffer size) BS, the voice data is read from the reception buffer 4b in the written order, reproduced by the codec into an analog voice signal, and transmitted via the transmission line 5. Sent to the terminal device 6. As shown in FIG. 3, when the arrival packet PR includes fluctuations, the fluctuations are absorbed by the write and read operations of the reception buffer 4b. The time difference from the writing to the reading of the reception buffer 4b becomes the reproduction delay time of the analog audio signal reproduced by the codec.

【０００６】[0006]

【発明が解決しようとする課題】しかしながら、従来の
音声通信ゆらぎ吸収方法では、次のような課題があっ
た。音声ハブ４内の送信バッファ４ａ及び受信バッファ
４ｂは、短時間のバッファサイズＢＳを持つ単純なＦＩ
ＦＯ方式のバッファで構成されており、この受信バッフ
ァ４ｂによってゆらぎ吸収が行われる。到着パケットＰ
Ｒにおいてゆらぎによる遅延が発生すると、過去の遅延
時間を超える遅延が発生した瞬間のみ無音状態になり、
ゆらぎ吸収バッファサイズＢＳは、その最大遅延時間に
追従する。つまり、ＩＰネットワーク１で発生する最大
遅延時間に応じてバッファサイズＢＳが大きくなる。そ
のため、音声の明瞭性は確保しやすい。しかし、ゆらぎ
吸収時間を超えるゆらぎが発生した場合には吸収できな
い。また、ゆらぎを吸収するための受信バッファ４ｂが
単純なＦＩＦＯ方式のバッファで構成されているので、
これに蓄積される音声パケットの欠落があっても通話品
質には影響しない程度の、頻度の低いバースト遅延が発
生した場合であっても、遅延が残り、呼が切断されるま
で回復できないため、リアルタイムな会話が阻害され
る。本発明は、前記従来技術が持っていた課題を解決
し、会話の連続性及び明瞭性を保証でき、リアルタイム
で自然な会話が行える音声通信ゆらぎ吸収方法を提供す
ることを目的とする。However, the conventional voice communication fluctuation absorbing method has the following problems. The transmission buffer 4a and the reception buffer 4b in the voice hub 4 are simple FIs having a short buffer size BS.
It is composed of a FO type buffer, and fluctuations are absorbed by the reception buffer 4b. Arrival packet P
When a delay due to fluctuations occurs in R, the silence state occurs only at the moment when the delay exceeding the past delay time occurs.
The fluctuation absorption buffer size BS follows its maximum delay time. That is, the buffer size BS increases according to the maximum delay time that occurs in the IP network 1. Therefore, it is easy to ensure the clarity of voice. However, if a fluctuation that exceeds the fluctuation absorption time occurs, it cannot be absorbed. Further, since the receiving buffer 4b for absorbing the fluctuation is composed of a simple FIFO type buffer,
Even if there are infrequent burst delays that do not affect the call quality even if there is a loss of voice packets accumulated in it, the delay remains and cannot be recovered until the call is disconnected. Real-time conversation is disturbed. SUMMARY OF THE INVENTION It is an object of the present invention to provide a method for absorbing voice communication fluctuations, which can solve the problems of the above-mentioned conventional techniques, guarantee continuity and clarity of conversation, and perform natural conversation in real time.

【０００７】[0007]

【課題を解決するための手段】前記課題を解決するため
に、本発明のうちの請求項１に係る発明では、ＩＰネッ
トワークから送られてくるフレーム化された音声パケッ
トから有音及び無音の音声データを取出して、書込みか
ら読出しまでに一定の再生遅延時間を有するＦＩＦＯ方
式の音声データ蓄積用バッファに順次書込み、この書込
まれた順序に従い、前記再生遅延時間遅れて前記音声デ
ータを逐次読出して、デコード手段により音声信号に再
生して端末装置へ与える音声通信において、前記バッフ
ァは、前記ＩＰネットワークで発生する最大遅延時間を
吸収し得る大きさの蓄積容量を有し、かつ該バッファに
蓄積された音声データ量であるバッファサイズが一定量
を超えるとバッファサイズ検出信号を出力する機能を有
している。そして、前記バッファサイズが、前記音声パ
ケットにおいて発生するゆらぎによる遅延時間を吸収し
て大きくなって、前記バッファから前記バッファサイズ
検出信号が出力された場合において、前記バッファサイ
ズが一定量を超える時間が一定時間継続すること、及び
前記バッファに書込まれている音声データが一定量無音
パタンになることを無音パタン検出部で検出した場合
に、前記バッファサイズ検出信号と前記無音パタン検出
部の検出結果とに基づいて生成したリセット信号によ
り、前記バッファをリセットして前記バッファサイズを
クリアすると共に前記無音パタン検出部をリセットする
ようにしている。このような構成を採用したことによ
り、次のような作用が行われる。ＩＰネットワークにお
いては、例えば、通信に関与するルータの段数や、バー
ストデータトラフィックの発生頻度等が不透明である。
このようなＩＰネットワークにおいて、最大ゆらぎ時間
が数十ｍｓから数秒まであり得る。そこで、本発明で
は、バッファの蓄積容量が大きく設定されているので、
ＩＰネットワークにおいて最大のゆらぎ時間が発生した
場合にも、該バッファの動作によって（即ち、ゆらぎに
よる遅延時間に応じてバッファサイズが変化して）、そ
のゆらぎが吸収される。また、発生頻度の比較的低い一
瞬のバースト遅延が発生した場合には、リセット信号に
より、バッファがリセットされてバッファサイズがクリ
アされ、このバッファの再生遅延時間が減少する。 In order to solve the above-mentioned problems, in the invention according to claim 1 of the present invention, voiced and silent voices are generated from a framed voice packet transmitted from an IP network. Data is extracted and written
FIFO method with constant playback delay time from reading to reading
In the voice communication in which the voice data is sequentially written into the buffer for storing voice data, the voice data is sequentially read with the reproduction delay time delayed in accordance with the written order, and the voice data is reproduced by the decoding means and given to the terminal device. The buffer has a storage capacity large enough to absorb the maximum delay time generated in the IP network , and
A fixed amount of buffer size, which is the amount of accumulated audio data
The function to output the buffer size detection signal is exceeded.
is doing. Then, if the buffer size is
Absorbs the delay time caused by fluctuations
Grows from the buffer to the buffer size
When the detection signal is output, the buffer size
For more than a certain amount for a certain period of time, and
There is a certain amount of silence in the audio data written in the buffer.
When the silent pattern detection unit detects that the pattern
, The buffer size detection signal and the silent pattern detection
The reset signal generated based on the
Reset the buffer to change the buffer size
Clear and reset the silent pattern detector
I am trying. By adopting such a configuration, the following operation is performed. In an IP network, for example, the number of router stages involved in communication, the occurrence frequency of burst data traffic, and the like are unclear.
In such an IP network, the maximum fluctuation time can be from several tens ms to several seconds. Therefore, in the present invention, since the storage capacity of the buffer is set large,
Even if the maximum fluctuation time occurs in the IP network, the operation of the buffer (that is, fluctuation
(The buffer size changes depending on the delay time due to) , and the fluctuation is absorbed. In addition, if a momentary burst delay with a relatively low frequency of occurrence occurs, the reset signal
Resets the buffer and clears the buffer size.
The playback delay time of this buffer is reduced.

【０００８】請求項２に係る発明では、請求項１と同
様の音声通信において、請求項１と同様のＦＩＦＯ方式
の音声データ蓄積用バッファにおけるバッファサイズ
が、音声パケットにおいて発生するゆらぎによる遅延時
間を吸収して大きくなって、前記バッファからバッファ
サイズ検出信号が出力された場合において、前記バッフ
ァサイズが一定量を超える時間が一定時間継続するこ
と、及び前記バッファに書込まれている音声データが一
定時間以上無音パタンになることを無音パタン検出部で
検出した場合に、前記バッファサイズ検出信号と前記無
音パタン検出部の検出結果とに基づいて生成したリセッ
ト信号により、前記バッファをリセットして前記バッフ
ァサイズをクリアすると共に前記無音パタン検出部をリ
セットするようにしている。このような構成を採用した
ことにより、定められた遅延時間を超える遅延が高頻度
で発生するような場合、そのような遅延が一定時間継続
すると、リセット信号により、バッファがリセットされ
てバッファサイズがクリアされ、このバッファの再生遅
延時間が減少する。 According to the invention of claim 2, in the voice communication similar to that of claim 1, the same FIFO system as that of claim 1 is used.
Size of the audio data storage buffer
Is delayed due to fluctuations that occur in voice packets
It absorbs the space and grows,
When the size detection signal is output, the buffer
If the size exceeds a certain amount, it may continue for a certain amount of time.
, And the audio data written in the buffer is
The silent pattern detection unit indicates that the silent pattern will be exceeded for a certain period of time.
If detected, the buffer size detection signal and
A reset generated based on the detection result of the sound pattern detection unit
Reset the buffer to reset the buffer
The size of the silence pattern and reset the silent pattern detector.
I'm trying to set it. By adopting such a configuration, if a delay exceeding the predetermined delay time occurs frequently, if such a delay continues for a certain period of time, the reset signal resets the buffer.
Buffer size is cleared by the
Delay time is reduced.

【０００９】[0009]

【発明の実施の形態】第１の実施形態図１は、本発明の第１の実施形態の音声通信ゆらぎ吸収
方法に用いられる音声・データ統合通信用音声ハブを示
す概略の構成図である。この音声・データ統合通信用音
声ハブ４０は、図２の音声・データ統合通信システムの
音声ハブ４に代えて設けられるもので、伝送路３及びル
ータ２を介してＩＰネットワーク１に接続されると共
に、伝送路５を介して一般アナログ電話機、Ｇ３ＦＡＸ
等の１台または複数台の端末装置６に接続され、これら
のＩＰネットワーク１と端末装置６とのゲートウェイ機
能等を持っている。音声ハブ４０は、装置全体をプログ
ラム制御するＣＰＵ４１を有し、このＣＰＵ４１にデー
タ伝送用のバス４２が接続されている。バス４２には、
ＬＡＮ制御部４３が接続され、該ＬＡＮ制御部４３が伝
送路３を介してＩＰネットワーク１のルータ２に接続さ
れている。ＬＡＮ制御部４３は、ルータ２から送られて
くるＭＡＣフレーム化された音声パケットを格納し、該
音声パケットから音声データを取出してＣＰＵ４１に対
して割込みをかける機能を有し、さらに、送信すべき音
声データにヘッダを付加して音声パケットを生成し、該
音声パケットをＭＡＣフレーム化して伝送路３へ出力す
る機能を有している。ここで、音声データは、品質重視
の観点から、例えば非圧縮のｕｌａｗＰＣＭで符号化さ
れたデータである。バス４２には、送信用の音声データ
を蓄積するＦＩＦＯ方式の送信バッファ４４と、受信用
の音声データを蓄積するＦＩＦＯ方式の受信バッファ４
５が接続されている。受信バッファ４５は、ＩＰネット
ワーク１において最大ゆらぎ時間が数十ｍｓから数秒ま
であり得ることを考慮し、これらを吸収し得る極めて大
きな蓄積容量を持っている。これらの送信バッファ４４
及び受信バッファ４５において、該送信バッファ４４に
蓄積された音声データがＣＰＵ４１で読出されてＬＡＮ
制御部４３へ与えられ、該ＬＡＮ制御部４３に格納され
た音声パケット中の音声データが該ＣＰＵ４１によって
非同期に読出されて受信バッファ４５に書込まれるよう
になっている。また、受信バッファ４５は、蓄積された
音声データ量（即ち、バッファサイズ）ＢＳが一定量Ｔ
Ｈを超えると、例えば“１”のバッファサイズ検出信号
Ｓ４５を出力し、リセット端子Ｒに例えば“１”のリセ
ット信号Ｓ４９が入力されると、バッファサイズＢＳが
クリアされる機能を有している。BEST MODE FOR CARRYING OUT THE INVENTION First Embodiment FIG. 1 is a schematic configuration diagram showing a voice hub for integrated voice / data communication used in a voice communication fluctuation absorbing method according to a first embodiment of the present invention. The voice / data integrated communication voice hub 40 is provided in place of the voice hub 4 of the voice / data integrated communication system of FIG. 2, and is connected to the IP network 1 via the transmission line 3 and the router 2. , General analog telephone, G3FAX via transmission line 5
Is connected to one or a plurality of terminal devices 6 and the like, and has a gateway function between the IP network 1 and the terminal devices 6 and the like. The audio hub 40 has a CPU 41 for program-controlling the entire apparatus, and a bus 42 for data transmission is connected to the CPU 41. On the bus 42,
The LAN control unit 43 is connected, and the LAN control unit 43 is connected to the router 2 of the IP network 1 via the transmission line 3. The LAN control unit 43 has a function of storing a MAC framed voice packet sent from the router 2, extracting voice data from the voice packet and interrupting the CPU 41, and further transmitting the voice data. It has a function of adding a header to voice data to generate a voice packet, converting the voice packet into a MAC frame, and outputting the MAC frame to the transmission path 3. Here, the audio data is, for example, data encoded by uncompressed ulawPCM from the viewpoint of quality emphasis. The bus 42 includes a FIFO transmission buffer 44 for storing transmission voice data and a FIFO reception buffer 4 for storing reception voice data.
5 is connected. The reception buffer 45 has an extremely large storage capacity capable of absorbing the maximum fluctuation time of several tens ms to several seconds in the IP network 1 in consideration of the fluctuations. These send buffers 44
Also, in the reception buffer 45, the voice data accumulated in the transmission buffer 44 is read by the CPU 41 to
The voice data in the voice packet given to the control unit 43 and stored in the LAN control unit 43 is asynchronously read by the CPU 41 and written in the reception buffer 45. In addition, the reception buffer 45 is stored
The audio data amount (that is, the buffer size) BS is a fixed amount T
When it exceeds H, the buffer size detection signal S45 of "1" is output, and when the reset signal S49 of "1" is input to the reset terminal R, the buffer size BS is cleared. .

【００１０】送信バッファ４４及び受信バッファ４５に
は、コーデック４６を介してインタフェース部４７が接
続され、このインタフェース部４７が伝送路５を介して
端末装置６に接続されている。コーデック４６は、端末
装置６からインタフェース部４７を介して送られてくる
アナログ音声信号を例えばulawＰＣＭでディジタル信号
の音声データに変換して送信バッファ４４に与えるコー
ド手段と、受信バッファ４５から読出されたディジタル
信号の音声データを例えばulawＰＣＭでアナログ音声信
号に変換してインタフェース部４７に与えるデコード手
段とを有している。インタフェース部４７は、コーデッ
ク４６と端末装置６との間のインタフェースを行う機能
を有している。受信バッファ４５の入力端子側には、無
音パタン検出部４８の入力端子が接続され、この無音パ
タン検出部４８の出力端子と受信バッファ４５の検出信
号Ｓ４５用出力端子とが、２入力ＡＮＤゲート４９の入
力端子に接続されている。ＡＮＤゲート４９の出力端子
は、受信バッファ４５のリセット端子Ｒ、及び無音パタ
ン検出部４８のリセット端子Ｒに接続されている。無音
パタン検出部４８及びＡＮＤゲート４９によって、リセ
ット手段が構成されている。An interface unit 47 is connected to the transmission buffer 44 and the reception buffer 45 via a codec 46, and the interface unit 47 is connected to the terminal device 6 via the transmission line 5. The codec 46 is read from the receiving buffer 45 and a code means for converting the analog audio signal sent from the terminal device 6 through the interface unit 47 into audio data of a digital signal by, for example, ulawPCM and giving it to the transmitting buffer 44. The audio data of the digital signal is converted into an analog audio signal by, for example, ulawPCM, and is supplied to the interface unit 47. The interface unit 47 has a function of interfacing between the codec 46 and the terminal device 6. An input terminal of the silence pattern detection unit 48 is connected to the input terminal side of the reception buffer 45, and the output terminal of the silence pattern detection unit 48 and the detection signal S45 output terminal of the reception buffer 45 are connected to a 2-input AND gate 49. Is connected to the input terminal of. The output terminal of the AND gate 49 is connected to the reset terminal R of the reception buffer 45 and the reset terminal R of the silence pattern detection unit 48. The silence pattern detecting section 48 and the AND gate 49 constitute a resetting means.

【００１１】無音パタン検出部４８は、受信バッファ４
５への書込みデータをモニタし、該受信バッファ４５に
書込まれている音声データが一定量無音パタンになる
と、出力端子から例えば“Ｈ”の無音パタン検出信号Ｓ
４８を出力する回路である。例えば、受信バッファ４５
に蓄積される音声データがulawＰＣＭで符号化されてい
る場合、無音パタン検出部４８では、下位７ビットデー
タが全て“１”（７Ｆ（Ｈ）またはＦＦ（Ｈ））になる
か否かを監視し、全て“１”になったときに“１”の無
音パタン検出信号Ｓ４８を出力する機能を有し、受信バ
ッファ４５と同様のＦＩＦＯ部、無音パタン記憶部、及
び比較部等で構成されている。ＡＮＤゲート４９は、入
力されるバッファサイズ検出信号Ｓ４５及び無音パタン
検出信号Ｓ４８が共に“１”になったときに、“１”の
リセット信号Ｓ４９を出力し、受信バッファ４５及び無
音パタン検出部４８をクリアする機能を有している。The silent pattern detector 48 is provided in the reception buffer 4
When the voice data written in the reception buffer 45 has a certain amount of silence pattern, the write data to be written into the reception buffer 45 has a certain amount of silence pattern.
It is a circuit for outputting 48. For example, the reception buffer 45
When the voice data accumulated in the above is encoded by ulawPCM, the silence pattern detection unit 48 monitors whether or not all the lower 7-bit data are “1” (7F (H) or FF (H)). However, it has a function of outputting the silence pattern detection signal S48 of "1" when all become "1", and is configured by a FIFO unit, a silence pattern storage unit, a comparison unit and the like similar to the reception buffer 45. There is. The AND gate 49 outputs the reset signal S49 of "1" when both the input buffer size detection signal S45 and the silence pattern detection signal S48 become "1", and the reception buffer 45 and the silence pattern detection unit 48. Has the function of clearing.

【００１２】図４は、図１の音声ハブ４０を用いた音声
通信ゆらぎ吸収方法を説明するための波形図である。こ
の図４の波形図を参照しつつ、第１の実施形態の音声通
信ゆらぎ吸収方法を説明する。一般アナログ電話機等の
端末装置６からＩＰネットワーク１へ音声信号を伝送す
る場合、該端末装置６から出力されたアナログ音声信号
は、伝送路５を介して音声ハブ４０内のインタフェース
部４７へ送られる。インタフェース部４７へ送られたア
ナログ音声信号は、コーデック４６によって例えばulaw
ＰＣＭでディジタル信号の音声データに変換され、送信
バッファ４４に書込まれていく。ＣＰＵ４１では、送信
バッファ４４に蓄積された音声データを非同期に読出
し、バス４２を介してＬＡＮ制御部４３へ送る。ＬＡＮ
制御部４３では、送られてきた音声データにヘッダを付
加して音声パケットを生成し、この音声パケットをＭＡ
Ｃフレームの形で出力する。出力されたＭＡＣフレーム
は、伝送路３を介してルータ２へ送られ、該音声パケッ
トのヘッダに基づきＩＰネットワーク１内の相手先の端
末装置等へ送られる。FIG. 4 is a waveform diagram for explaining a voice communication fluctuation absorbing method using the voice hub 40 of FIG. The voice communication fluctuation absorbing method according to the first embodiment will be described with reference to the waveform chart of FIG. When a voice signal is transmitted from the terminal device 6 such as a general analog telephone to the IP network 1, the analog voice signal output from the terminal device 6 is sent to the interface unit 47 in the voice hub 40 via the transmission path 5. . The analog audio signal sent to the interface unit 47 is, for example, ulaw by the codec 46.
It is converted into voice data of a digital signal by PCM and written in the transmission buffer 44. The CPU 41 asynchronously reads the audio data stored in the transmission buffer 44 and sends it to the LAN control unit 43 via the bus 42. LAN
The control unit 43 adds a header to the transmitted voice data to generate a voice packet, and then outputs this voice packet to the MA.
Output in the form of C frame. The output MAC frame is sent to the router 2 via the transmission path 3, and is sent to the other party's terminal device or the like in the IP network 1 based on the header of the voice packet.

【００１３】次に、ＩＰネットワーク１内のある端末装
置から音声系ネットワーク内の端末装置６へ音声信号を
送る場合について説明する。ＩＰネットワーク１内のあ
る端末装置からＭＡＣフレーム化された音声パケットが
出力されると、この音声パケットがルータ２を介して送
信パケットＰＴの形で伝送路３へ送られる。伝送路３へ
送られた送信パケットＰＴが、到着パケットＰＲの形で
音声ハブ４０へ到着すると、この到着パケットＰＲがＬ
ＡＮ制御部４３内に格納される。ＩＰネットワーク１の
トラフィック等によって遅延が発生すると、この遅延が
ゆらぎの形で到着パケットＰＲ内に含まれる。ＣＰＵ４
１は、ＬＡＮ制御部４３から音声データを取出し、受信
バッファ４５に書込んでいく。受信バッファ４５に蓄積
された音声データは、書込まれた順序に従い順次読出さ
れ、コーデック４６によって例えばulawＰＣＭでアナロ
グ音声信号に再生される。再生されたアナログ音声信号
には、図４に示すように受信バッファ４５のバッファサ
イズＢＳに応じた再生遅延時間が生じる。このアナログ
音声信号は、インタフェース部４７及び伝送路５を介し
て端末装置６へ送られる。ＩＰネットワーク１のトラフ
ィック等によって発生する一時的な遅延（即ち、ゆら
ぎ）により、受信バッファ４５内に音声データが存在し
なくなったときには、ＣＰＵ４１から該受信バッファ４
５に対して無音パタンが挿入されるので、端末装置６へ
送られるアナログ音声信号は無音状態になる。このゆら
ぎによる遅延を吸収するため、受信バッファ４５のバッ
ファサイズＢＳが大きくなっていき、該受信バッファ４
５に蓄積された音声データが一定量ＴＨを超えると、該
受信バッファ４５から出力されるバッファサイズ検出信
号Ｓ４５が“１”になる。このとき、一定量ＴＨまでの
バッファサイズＢＳに応じて、再生遅延時間が長くな
る。Next, a case where a voice signal is transmitted from a certain terminal device in the IP network 1 to the terminal device 6 in the voice network will be described. When a MAC framed voice packet is output from a certain terminal device in the IP network 1, this voice packet is sent to the transmission line 3 via the router 2 in the form of a transmission packet PT. When the transmission packet PT sent to the transmission line 3 arrives at the voice hub 40 in the form of the arrival packet PR, the arrival packet PR is changed to L.
It is stored in the AN control unit 43. When a delay occurs due to traffic of the IP network 1 or the like, this delay is included in the arrival packet PR in the form of fluctuation. CPU4
1 retrieves audio data from the LAN control unit 43 and writes it in the reception buffer 45. The audio data stored in the reception buffer 45 is sequentially read out in the written order and reproduced by the codec 46 into an analog audio signal by, for example, ulawPCM. The reproduced analog audio signal has a reproduction delay time corresponding to the buffer size BS of the reception buffer 45 as shown in FIG. This analog audio signal is sent to the terminal device 6 via the interface unit 47 and the transmission path 5. When there is no voice data in the reception buffer 45 due to a temporary delay (that is, fluctuation) caused by the traffic of the IP network 1 or the like, the CPU 41 causes the reception buffer 4 to receive the voice data.
Since the silence pattern is inserted into the voice signal 5, the analog voice signal sent to the terminal device 6 becomes silent. In order to absorb the delay due to this fluctuation, the buffer size BS of the receiving buffer 45 increases, and the receiving buffer 4
When the audio data stored in 5 exceeds a certain amount TH, the buffer size detection signal S45 output from the reception buffer 45 becomes "1". At this time, the reproduction delay time becomes longer according to the buffer size BS up to the fixed amount TH.

【００１４】無音パタン検出部４８は、受信バッファ４
５に書込まれる音声データの量をモニタしており、受信
バッファ４５における蓄積音声データの一定量ＴＨを超
える時間が一定時間継続し（この時間はバッファ４４，
４５の書込み及び読出し用のクロック信号等によってカ
ウントされる）、該受信バッファ４５に書込まれている
音声データが一定量無音パタンになると、該無音パタン
検出部４８から出力される無音パタン検出信号Ｓ４８が
“１”になる。例えば、無音パタン検出部４８では、音
声データがulawＰＣＭで符号化されている場合、下位７
ビットデータが全て“１”（７Ｆ（Ｈ）またはＦＦ
（Ｈ））になるか否かを監視し、全て“１”になると無
音パタン検出信号Ｓ４８を“１”にする。バッファサイ
ズ検出信号Ｓ４５と無音パタン検出信号Ｓ４８が共に
“１”になると、ＡＮＤゲート４９から出力されるリセ
ット信号Ｓ４９が“１”になり、受信バッファ４５及び
無音パタン検出部４８がリセットされ、該受信バッファ
４５のバッファサイズＢＳが０にクリアされる。そのた
め、コーデック４６から出力されるアナログ音声信号の
再生遅延時間も短縮される。The silence pattern detector 48 is provided in the reception buffer 4
5 monitors the amount of audio data to be written in 5, and the time in which the amount of audio data accumulated in the receiving buffer 45 exceeds a certain amount TH continues for a certain period of time (this time, the buffer 44,
45), and when the sound data written in the reception buffer 45 has a certain amount of silence pattern, the silence pattern detection signal output from the silence pattern detection unit 48. S48 becomes "1". For example, in the silent pattern detection unit 48, if the audio data is encoded by ulawPCM, the lower 7
All bit data is "1" (7F (H) or FF
(H)) is monitored, and when all are "1", the silent pattern detection signal S48 is set to "1". When both the buffer size detection signal S45 and the silence pattern detection signal S48 become "1", the reset signal S49 output from the AND gate 49 becomes "1", and the reception buffer 45 and the silence pattern detection unit 48 are reset. The buffer size BS of the reception buffer 45 is cleared to 0. Therefore, the reproduction delay time of the analog audio signal output from the codec 46 is also shortened.

【００１５】この第１の実施形態では、次の（ａ），
（ｂ）のような利点がある。（ａ）ＩＰネットワーク１においては、通信に関与する
ルータ２の段数や、バーストデータトラフィックの発生
頻度等が不透明である。このようなＩＰネットワーク１
において、最大ゆらぎ時間が数十ｍｓから数秒まであり
得る。そこで、本実施形態では、受信バッファ４５の蓄
積容量を極めて大きくしたので、ＩＰネットワーク１に
おいて最大のゆらぎが発生した場合にもこれを確実に吸
収できる。従って、会話の連続性及び明瞭性を保証でき
る。（ｂ）音声パケットの欠落があっても通話品質には影響
しない程度の、発生頻度の比較的低い一瞬のバースト遅
延が発生した場合、バッファサイズＢＳが０に戻される
ので、音声遅延から早期に回復し、リアルタイムな会話
が可能になる。In the first embodiment, the following (a),
There is an advantage like (b). (A) In the IP network 1, the number of stages of the router 2 involved in communication, the occurrence frequency of burst data traffic, and the like are unclear. Such an IP network 1
In, the maximum fluctuation time can be from several tens of ms to several seconds. Therefore, in the present embodiment, the storage of the reception buffer 45
Since the product capacity is extremely large, even when the maximum fluctuation occurs in the IP network 1, this can be reliably absorbed. Therefore, the continuity and clarity of the conversation can be guaranteed. (B) When a momentary burst delay with a relatively low occurrence frequency that does not affect the call quality even if there is a voice packet drop occurs, the buffer size BS is returned to 0, so the voice delay can be performed early. It recovers and enables real-time conversation.

【００１６】この無音パタン検出部４８Ａは、図１の音
声ハブ４０内の無音パタン検出部４８に代えて設けられ
るもので、入力側が図１の受信パタン４５の入力端子に
接続された音声データ蓄積用のＦＩＦＯ部４８ａと、無
音パタンを記憶する無音パタン記憶部４８ｂとを有して
いる。ＦＩＦＯ部４８ａは、例えば受信バッファ４５と
同一の構成であり、このＦＩＦＯ部４８ａ及び無音パタ
ン記憶部４８ｂの出力端子に、比較部４８ｃが接続され
ている。比較部４８ｃは、ＦＩＦＯ部４８ａに蓄積され
た音声データと、無音パタン記憶部４８ｂに記憶された
無音パタンとを比較し、受信バッファ４５に蓄積された
一定量の無音パタンを検出すると、“１”の無音パタン
検出信号Ｓ４８を出力するものであり、ゲート回路等で
構成されている。この無音パタン検出信号Ｓ４８は、図
１の無音パタン検出部４８から出力される無音パタン検
出信号Ｓ４８と同一の信号である。比較部４８ｃの出力
端子には、出力部４８ｄが接続されている。出力部４８
ｄは、無音パタン検出信号Ｓ４８を入力し、一定量の無
音パタンが一定時間ＴＣ連続して受信バッファ４５に蓄
積されたことを検出すると、例えば“１”の無音パタン
検出信号Ｓ４８ｄを出力し、図１のＡＮＤゲート４９に
与える機能を有し、レジスタ等で構成されている。The silent pattern detecting section 48A is provided in place of the silent pattern detecting section 48 in the audio hub 40 of FIG. 1, and the input side is connected to the input terminal of the receiving pattern 45 of FIG. It has a FIFO unit 48a for voice and a silent pattern storage unit 48b for storing silent patterns. The FIFO unit 48a has, for example, the same configuration as the reception buffer 45, and the comparison unit 48c is connected to the output terminals of the FIFO unit 48a and the silent pattern storage unit 48b. The comparison unit 48c compares the voice data stored in the FIFO unit 48a with the silence pattern stored in the silence pattern storage unit 48b, and when a certain amount of silence pattern stored in the reception buffer 45 is detected, “1” is output. It outputs a silent pattern detection signal S48 of "" and is composed of a gate circuit and the like. This silence pattern detection signal S48 is the same signal as the silence pattern detection signal S48 output from the silence pattern detection unit 48 in FIG. The output unit 48d is connected to the output terminal of the comparison unit 48c. Output unit 48
d receives the silent pattern detection signal S48 and outputs a silent pattern detection signal S48d of "1", for example, when detecting that a certain amount of silent pattern is continuously accumulated in the reception buffer 45 for a certain time TC. It has a function to be given to the AND gate 49 in FIG. 1, and is configured by a register or the like.

【００１７】図６は、図５の無音パタン検出部４８Ａを
用いた音声通信ゆらぎ吸収方法を説明するための波形図
である。この波形図を参照しつつ、第２の実施形態の音
声通信ゆらぎ吸収方法を説明する。第１の実施形態で
は、受信バッファ４５に書込まれている音声データが一
定量無音パタンになると、無音パタン検出部４８から出
力される無音パタン検出信号Ｓ４８（これは図５の比較
部４８ｃから出力される無音パタン検出信号Ｓ４８と同
一の信号）が“１”になり、ＡＮＤゲート４９から出力
されるリセット信号Ｓ４９が“１”になって受信バッフ
ァ４５のバッファサイズＢＳが０にクリアされる。この
ゆらぎ吸収方法では、例えば、頻度の低いバースト遅延
が発生して定められた遅延時間を超えるときにはバッフ
ァサイズＢＳを０に戻すようにしているが、その定めら
れた遅延時間を超える遅延が高頻度で発生するような場
合、音声の明瞭性が保証されなくなって自然な会話が阻
害されるおそれがある。そこで、この第２の実施形態の
ゆらぎ吸収方法では、ＦＩＦＯ部４８ａ、無音パタン記
憶部４８ｂ及び比較部４８ｃにより、受信バッファ４５
に蓄積される一定量の無音パタンを検出すると、該比較
部４８ｃから“１”の無音パタン検出信号Ｓ４８を出力
して出力部４８ｄに与える。出力部４８ｄは、クロック
信号ＣＫにより動作し、一定量の無音パタンが一定時間
ＴＣだけ連続していることを検出すると、“１”の無音
パタン検出信号Ｓ４８ｄを出力し、図１のＡＮＤゲート
４９に与える。これにより、ＡＮＤゲート４９から
“１”のリセット信号Ｓ４９が出力され、受信バッファ
４５及び無音パタン検出部４８Ａがリセットされ、該受
信バッファ４５のバッファサイズＢＳが０にクリアさ
れ、コーデック４６から出力されるアナログ音声信号の
再生遅延が短縮される。従って、定められた遅延時間を
超える遅延が高頻度で発生する場合、音声の明瞭性が保
証されて自然な会話を継続することができる。FIG. 6 is a waveform diagram for explaining a voice communication fluctuation absorbing method using the silent pattern detecting section 48A of FIG. The voice communication fluctuation absorbing method according to the second embodiment will be described with reference to this waveform diagram. In the first embodiment, when the voice data written in the reception buffer 45 has a certain amount of silence pattern, the silence pattern detection signal S48 output from the silence pattern detection unit 48 (this is output from the comparison unit 48c in FIG. 5). The same signal as the silent pattern detection signal S48 output) becomes "1", the reset signal S49 output from the AND gate 49 becomes "1", and the buffer size BS of the reception buffer 45 is cleared to 0. . In this fluctuation absorbing method, for example, the buffer size BS is returned to 0 when a low-frequency burst delay occurs and exceeds a predetermined delay time. However, a delay exceeding the predetermined delay time is high. In such cases, the clarity of the voice may not be guaranteed and natural conversation may be disturbed. Therefore, in the fluctuation absorbing method according to the second embodiment, the reception buffer 45 is configured by the FIFO unit 48a, the silent pattern storage unit 48b, and the comparison unit 48c.
When a certain amount of silence pattern accumulated in the above is detected, the comparison unit 48c outputs a silence pattern detection signal S48 of "1" and gives it to the output unit 48d. The output unit 48d operates according to the clock signal CK, and when detecting that a certain amount of silent pattern continues for a certain time TC, outputs the silent pattern detection signal S48d of "1", and the AND gate 49 of FIG. Give to. As a result, the reset signal S49 of "1" is output from the AND gate 49, the reception buffer 45 and the silence pattern detection unit 48A are reset, the buffer size BS of the reception buffer 45 is cleared to 0, and the codec 46 outputs it. The reproduction delay of the analog audio signal is reduced. Therefore, when a delay exceeding the predetermined delay time frequently occurs, the intelligibility of the voice is guaranteed and the natural conversation can be continued.

【００１８】なお、本発明は上記実施形態に限定され
ず、種々の変形が可能である。この変形例としては、例
えば、次の（ｉ），（ii）のようなものがある。（ｉ）図１の音声ハブ４０は、他の回路で構成しても
よい。例えば、図１または図５では、無音パタン検出部
４８，４８Ａ及びＡＮＤゲート４９によってリセット手
段を構成したが、これに限定されない。このリセット手
段は、一定周期、受信バッファ４５に対して書込みデー
タがオーバフローした場合、またそのオーバフローの時
間が一定量を超える場合、無音検出時、あるいはこれら
の組合せ、またはＩＰネットワーク１等のパケット通信
網の混み具合の監視結果をトリガとして、受信バッファ
４５のバッファサイズＢＳを０にクリアする等、種々の
方法を適用できる。（ii）上記実施形態では、受信バッファ４５に蓄積さ
れる音声データは、非圧縮の例えばulawＰＣＭで符号化
されたデータであるが、本発明は他のＡＤＰＣＭ、ＣＳ
−ＡＣＥＬＰ、ＬＤ−ＣＥＬＰ等の符号化方式で符号化
されたデータについても適用可能である。また、音声ハ
ブ４０に到着する到着パケットＰＲは、無音圧縮されて
いる場合でも、ゆらぎが生じる場合があるので、このよ
うな到着パケットＰＲについても本発明のゆらぎ吸収方
法を適用可能である。The present invention is not limited to the above embodiment, and various modifications can be made. Examples of this modification include the following (i) and (ii). (I) The audio hub 40 of FIG. 1 may be configured with other circuits. For example, in FIG. 1 or 5, the silence pattern detection units 48 and 48A and the AND gate 49 constitute the reset means, but the reset means is not limited to this. This reset means is used when a write data overflows to the receive buffer 45 for a certain period, when the overflow time exceeds a certain amount, when a silence is detected, or a combination thereof, or packet communication such as the IP network 1 or the like. Various methods such as clearing the buffer size BS of the reception buffer 45 to 0 can be applied by using the result of monitoring the congestion degree of the network as a trigger. (Ii) In the above embodiment, the audio data accumulated in the reception buffer 45 is uncompressed data encoded by, for example, ulawPCM, but the present invention is not limited to ADPCM and CS.
It is also applicable to data encoded by an encoding method such as -ACELP or LD-CELP. Further, the arrival packet PR arriving at the voice hub 40 may have fluctuations even if it is silence-compressed. Therefore, the fluctuation absorption method of the present invention can be applied to such arrival packet PR.

【００１９】[0019]

【発明の効果】以上詳細に説明したように、本発明のう
ちの請求項１に係る発明によれば、ＦＩＦＯ方式の音声
データ蓄積用バッファは、ＩＰネットワークで発生する
最大遅延時間を吸収し得る大きさの蓄積容量を有するの
で、通信に関与するルータの段数や、バーストデータト
ラフィックの発生頻度等が不透明なＩＰネットワークに
おいて、最大のゆらぎが発生しても、このゆらぎを的確
に吸収できる。つまり、容量の大きいバッファに有音パ
ケットのみならず、会話を自然なものとするために必要
な無音パケットをも蓄積することで、自然な通話を実現
している。このように、本発明では、通話品質を維持す
ることを優先して容量の大きいバッファに有音パケット
及び無音パケットを蓄積し、本当に通話品質に影響の無
い無音部分のみ廃棄し、これにより、会話の連続性及び
明瞭性を保証することができる。さらに、バッファサイ
ズが一定量を超えた場合において、その一定量を超える
時間が一定時間継続すること、及びそのバッファに蓄積
された音声データが一定量無音パタンになることを無音
パタン検出部で検出した場合に、リセット信号により該
バッファをリセットしてバッファサイズをクリアするよ
うにしたので、音声パケットの欠陥があっても通話品質
には影響しない程度の発生頻度の比較的低い一瞬のバー
スト遅延が発生した場合、バッファの再生遅延時間が調
整されて音声遅延から早期に回復し、リアルタイムな会
話が可能になる。As described above in detail, according to the invention of claim 1 of the present invention, the audio of the FIFO system is used.
Since the data storage buffer has a storage capacity large enough to absorb the maximum delay time generated in the IP network, the number of router stages involved in communication, the frequency of occurrence of burst data traffic, etc. are unclear. Even if the maximum fluctuation occurs in a large IP network, this fluctuation can be accurately absorbed. In other words, a natural call is realized by storing not only voiced packets but also silent packets necessary for making a conversation natural in a large capacity buffer. As described above, according to the present invention, the voice packet and the silent packet are accumulated in the buffer having a large capacity with priority given to maintaining the call quality, and only the silent part that does not really affect the call quality is discarded. It is possible to guarantee the continuity and clarity of the. In addition, buffer size
When's exceeds a predetermined amount, silence that the time exceeding the predetermined amount continues for a predetermined time, and that the audio data stored in the buffer becomes constant amount silence pattern
When it is detected by the pattern detector , the reset signal
Since the buffer is reset to clear the buffer size , if there is a momentary burst delay with a relatively low occurrence frequency that does not affect the call quality even if there is a voice packet defect, the playback delay of the buffer The time is adjusted to recover early from voice delays and enable real-time conversation.

【００２０】請求項２に係る発明によれば、ＦＩＦＯ
方式の音声データ蓄積用バッファは、ＩＰネットワーク
で発生する最大遅延時間を吸収し得る大きさの蓄積容量
を有するので、請求項１に係る発明と同様の効果が得ら
れる。さらに、バッファサイズが一定量を超えた場合に
おいて、その一定量を超える時間が一定時間継続するこ
と、及びその蓄積音声データが一定時間以上無音パタン
になることを無音パタン検出部で検出した場合に、リセ
ット信号によりバッファをリセットしてバッファサイズ
をクリアするようにしたので、定められた遅延時間を超
える遅延が高頻度で発生するような場合においても、音
声の明瞭性が保証され、自然な会話を継続することがで
きる。According to the invention of claim 2, a FIFO
The audio data storage buffer of this method has a storage capacity large enough to absorb the maximum delay time generated in the IP network.
Since having the same effect as the invention according to claim 1 is obtained. Furthermore, when the buffer size exceeds a certain amount, when the silent pattern detection unit detects that the time exceeding the certain amount continues for a certain period of time and that the accumulated voice data becomes a silent pattern for a certain period of time or more. , Lyce
Since the buffer is reset by the input signal to clear the buffer size , the clarity of the voice is guaranteed even when the delay exceeding the specified delay time occurs frequently. , Able to continue a natural conversation.

[Brief description of drawings]

【図１】本発明の第１の実施形態で用いられる音声・デ
ータ統合通信用音声ハブの概略の構成図である。FIG. 1 is a schematic configuration diagram of a voice hub for voice / data integrated communication used in a first embodiment of the present invention.

【図２】従来の音声・データ統合通信システムの概略の
構成図である。FIG. 2 is a schematic configuration diagram of a conventional voice / data integrated communication system.

【図３】図２のシステムにおける従来の音声通信ゆらぎ
吸収方法を説明するための波形図である。3 is a waveform diagram for explaining a conventional voice communication fluctuation absorbing method in the system of FIG.

【図４】図１の音声ハブを用いた音声通信ゆらぎ吸収方
法を説明するための波形図である。FIG. 4 is a waveform diagram for explaining a voice communication fluctuation absorbing method using the voice hub of FIG.

【図５】本発明の第２の実施形態に用いられる無音パタ
ン検出部の概略の構成図である。FIG. 5 is a schematic configuration diagram of a silent pattern detection unit used in the second embodiment of the present invention.

【図６】図５の無音パタン検出部を用いた音声通信ゆら
ぎ吸収方法を説明するための波形図である。6 is a waveform diagram for explaining a voice communication fluctuation absorption method using the silent pattern detection unit of FIG.

[Explanation of symbols]

１ＩＰネットワーク２ルータ６端末装置４０音声ハブ４１ＣＰＵ４３ＬＡＮ制御部４４送信バッファ４５受信バッファ４６コーデック４８，４８Ａ無音パタン検出部４８ａＦＩＦＯ部４８ｂ無音パタン記憶部４８ｃ比較部４８ｄ出力部４９ＡＮＤゲート 1 IP network 2 routers 6 Terminal 40 audio hubs 41 CPU 43 LAN control unit 44 send buffer 45 receive buffer 46 codecs 48, 48A silent pattern detector 48a FIFO section 48b silence pattern storage unit 48c comparison section 48d output section 49 AND gate

───────────────────────────────────────────────────── フロントページの続き (58)調査した分野(Int.Cl.⁷，ＤＢ名) H04L 12/56 ─────────────────────────────────────────────────── ─── Continuation of front page (58) Fields surveyed (Int.Cl. ⁷ , DB name) H04L 12/56

Claims

(57) [Claims]

1. Voiced and unvoiced voice data is extracted from a framed voice packet sent from an internet protocol network , and written to and read.
First-in-file with a constant playback delay time at
Writes to the buffer for audio data storage of the write- out method sequentially, and according to the written order , delays the playback delay time.
In the voice communication in which the voice data is sequentially read out , reproduced by the decoding means into a voice signal and given to the terminal device, the buffer has a storage capacity large enough to absorb the maximum delay time generated in the internet protocol network.
Amount of audio data that has a capacity and is accumulated in the buffer
If the buffer size exceeds a certain amount, the buffer size
Has a function of outputting a noise detection signal, and the buffer size is generated in the voice packet.
It absorbs the delay time due to
The buffer size detection signal is output from the buffer.
In this case, the buffer size exceeds a certain amount for a certain period of time, and the buffer size is written.
Make sure that a certain amount of rare audio data becomes a silent pattern.
If it detected by the silence pattern detecting unit, the buffer size
Based on the shift detection signal and the detection result of the silent pattern detection unit.
The reset signal generated by the
To clear the buffer size and
A voice communication fluctuation absorbing method characterized by resetting a sound pattern detecting unit .

2. Voiced and unvoiced voice data is extracted from a framed voice packet sent from an internet protocol network and is read from the written data.
First-in-file with a constant playback delay time at
Writes to the buffer for audio data storage of the write- out method sequentially, and according to the written order , delays the playback delay time.
In the voice communication in which the voice data is sequentially read out , reproduced by the decoding means into a voice signal and given to the terminal device, the buffer has a storage capacity large enough to absorb the maximum delay time generated in the internet protocol network.
Amount of audio data that has a capacity and is accumulated in the buffer
If the buffer size exceeds a certain amount, the buffer size
Has a function of outputting a noise detection signal, and the buffer size is generated in the voice packet.
It absorbs the delay time due to
The buffer size detection signal is output from the buffer.
In this case, the buffer size exceeds a certain amount for a certain period of time, and the buffer size is written.
When the silent pattern detection unit detects that the rare audio data becomes a silent pattern for a certain period of time, the buffer
Size detection signal and the detection result of the silent pattern detection unit
The reset signal generated based on
And clear the buffer size
A method for absorbing fluctuations in voice communication, comprising resetting the silent pattern detector .