JP2000151694A

JP2000151694A - Method for controlling correction for voice fluctuation, voice reproduction device and voice repeater

Info

Publication number: JP2000151694A
Application number: JP31481398A
Authority: JP
Inventors: Yoshi Nakajima; 善中島; Atsushi Niimura; 篤新村
Original assignee: Hitachi Communication Systems Inc
Current assignee: Hitachi Information Technology Co Ltd
Priority date: 1998-11-05
Filing date: 1998-11-05
Publication date: 2000-05-30
Anticipated expiration: 2018-11-05
Also published as: JP3917767B2

Abstract

PROBLEM TO BE SOLVED: To realize voice reproduction with excellent quality by controlling setting of a reproduction wait time optimum dynamically in response to a change in an arrival time even when the arrival time of voice data is fluctuated. SOLUTION: Upon the receipt of a notice, a fluctuation correction control section 105 acquires transmission time information from received data and based on the information, calculates an optimum reproduction wait time by the voice fluctuation correction control method that is a feature of this invention. The control section 105, based on the calculated wait time, writes digital voice data to an address of the reproduction buffer 101 that is corrected from a reception section 100 to apply fluctuation correction control by waiting an optimum reproduction wait time.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、音声通信システム
に係わり、特にインターネット等のパケット通信網を利
用して音声データを送受信するシステムにおいて、音声
データを受信再生する際での音声ゆらぎ補正制御方法、
更には、その音声ゆらぎ補正制御方法に係る音声再生装
置および音声中継装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voice communication system, and more particularly to a voice fluctuation correction control method for receiving and reproducing voice data in a system for transmitting and receiving voice data using a packet communication network such as the Internet. ,
Further, the present invention relates to an audio reproducing device and an audio relay device according to the audio fluctuation correction control method.

【０００２】[0002]

【従来の技術】近年、音声電話をＬＡＮ（Local Area N
etwork）に統合する技術が提案され、電話機間の通話に
インターネット、またはイントラネットを利用する通話
システムの構築やサービスが次第に実施されつつあるの
が現状である。ところで、これまでの構内通信では、音
声電話は専らＰＢＸ（ Private Branch Exchange：構内
交換機）によっている一方では、データ通信にはＬＡＮ
を用いて行われており、両通信網は相互に独立したもの
となっている。そこで、音声もＬＡＮのデータとして送
受信し得るように、音声がＬＡＮの通信プロトコルによ
る通信データに適応したディジタルデータに変換される
ようにすれば、ＬＡＮを利用しての音声通話が可能とな
り、初めて音声電話網とデータ通信網の統合が実現され
ることになる。2. Description of the Related Art In recent years, voice telephones have been switched to local area networks (LANs).
At present, the construction and services of a communication system using the Internet or an intranet for telephone calls between telephones are being gradually implemented. By the way, in conventional private communication, voice telephone is exclusively provided by PBX (Private Branch Exchange: private branch exchange), while data communication is performed by LAN.
The two communication networks are independent of each other. Therefore, if voice is converted to digital data adapted to communication data according to the LAN communication protocol so that voice can be transmitted and received as LAN data, voice communication using the LAN becomes possible. The integration of the voice telephone network and the data communication network will be realized.

【０００３】しかしながら、ＬＡＮで利用される通信プ
ロトコルはインターネットプロトコル（ＩＰ： Interne
t Protocol）が標準となっている。一方、インターネッ
トでは、ＯＳＩ参照モデルのトランスポート層プロトコ
ルとして、ＴＣＰ（Transmission Control Protocol）
およびＵＤＰ（ User Datagram Protocol）が用いられ
ているのが実情である。ＴＣＰはコネクション型の通信
接続を行うもので、パケット順序制御、フロー制御、エ
ラー検出・回復、再送、輻輳制御などの機能を有してい
ることから、したがって、ＴＣＰによる場合には、信頼
性のある通信サービスが提供されるものとなっている。
一方、ＵＤＰはコネクションレス型のプロトコルとさ
れ、ＴＣＰのような機能を有しない分、リアルタイム性
が要求される通信に使用されるものとなっている。例え
ば音声通話を実現するには、一部分の音声データが欠落
しても再送することの重要性は低く、連続的に発生して
いる音声データは順次送信されなければならないものと
なっている。即ち、音声のように、リアルタイム性が要
求される通信にはＵＤＰが用いられているものである。
問題は、インターネット通信は離散的に発生するトラフ
ィックを送受信するには適しているが、音声のように、
連続的に発生するデータはデータパケット各々がＵＤＰ
により順次連続的に高速送信されてから、必ずしも同一
伝送経路を介し伝送されるとは限らないというものであ
る。換言すれば、データパケット各々がＵＤＰにより順
次連続的に高速送信されてから、様々な伝送経路を介し
受信側で受信されるまでの時間は一定ではなく、受信さ
れるまでの時間にはバラツキが生じ、したがって、送信
順に必ずしも受信されるとは限らなく、何等かの制御が
施されなければ、受信順に順次再生出力されたとして
も、連続性ある音声として必ずしも再生され得ないとい
うものである。However, a communication protocol used in a LAN is an Internet protocol (IP: Internet Protocol).
t Protocol) is standard. On the other hand, in the Internet, TCP (Transmission Control Protocol) is used as a transport layer protocol of the OSI reference model.
In fact, UDP (User Datagram Protocol) is used. TCP performs connection-type communication connection and has functions such as packet order control, flow control, error detection / recovery, retransmission, and congestion control. A certain communication service is provided.
On the other hand, UDP is a connectionless protocol, and is used for communication that requires real-time performance because it does not have a function like TCP. For example, in order to realize a voice call, it is of low importance to retransmit even if a part of voice data is lost, and voice data generated continuously must be sequentially transmitted. That is, UDP such as voice is used for communication that requires real-time properties.
The problem is that while Internet communications are good for sending and receiving discrete traffic, like voice,
Continuously generated data consists of UDP data packets.
Therefore, after high-speed transmission is performed sequentially and continuously, transmission is not always performed via the same transmission path. In other words, the time from when each data packet is successively transmitted at high speed by UDP until it is received at the receiving side via various transmission paths is not constant, and the time until it is received varies. Therefore, the sound is not always received in the order of transmission, and unless some control is performed, even if the sound is sequentially reproduced and output in the order of reception, the sound cannot always be reproduced as a continuous sound.

【０００４】そのような音声の不連続的再生を防止すべ
く、これまでにあっては、受信再生側には再生バッファ
が設けられた上、順次受信される音声データパケット各
々はその再生バッファ上に送信順に並べ替えられた状態
として一時蓄積されつつ、連続再生可能となる時間を待
って、初めて再生出力されていたものである。図５には
従来技術に係る音声再生装置が示されているが、これに
ついて簡単ながら説明すれば以下のようである。In order to prevent such discontinuous reproduction of audio, a reception buffer has been provided on the reception / reproduction side, and each audio data packet sequentially received is stored in the reproduction buffer. Is temporarily stored in a state of being rearranged in the order of transmission, and is reproduced and output for the first time after waiting for a time when continuous reproduction is possible. FIG. 5 shows an audio reproducing apparatus according to the prior art, which will be briefly described as follows.

【０００５】即ち、パケット通信網からの音声データパ
ケット各々は受信部５００で順次受信された上、受信さ
れた音声データパケット中のディジタル音声データ（符
号化圧縮状態）は再生バッファ５０１上にパケット送信
順に並べ替えされた状態として順次一時蓄積されるもの
となっている。一方、そのような一時蓄積に並行して、
連続再生可能なディジタル音声データが再生バッファ５
０１上に一時蓄積される一定時間を待って、再生バッフ
ァ５００からはディジタル音声データが再生データ読み
出し部５０２により順次一定周期で読み出されるものと
なっている。順次読み出されたディジタル音声データ
は、その後、復号化伸長された状態としてＤ／Ａ変換部
５０３で順次アナログ音声信号に変換された上、音声出
力部（スピーカやヘッドフォン等）５０４から音声とし
て再生出力されているものである。このように、インタ
ーネットやＬＡＮ、ＡＴＭネットワーク、フレームリレ
ーネットワーク等のパケット通信網（非同期通信網）上
では、音声データパケット各々はその送信順序通りに受
信側で受信されるとは限らなく、受信音声データパケッ
ト各々がその送信順序通りに連続的に並べ替えされた状
態として一時蓄積された上、読み出し再生されるまでに
ある程度の待ち時間が要されていたものである。That is, each voice data packet from the packet communication network is sequentially received by the receiving unit 500, and digital voice data (encoded and compressed state) in the received voice data packet is transmitted to the reproduction buffer 501 by packet transmission. The information is temporarily stored in a state of being rearranged in order. On the other hand, in parallel with such temporary accumulation,
Digital audio data that can be continuously reproduced is stored in a reproduction buffer 5
After a certain period of time temporarily stored in the storage buffer 01, digital audio data is sequentially read from the reproduction buffer 500 by the reproduction data reading unit 502 at a constant period. The sequentially read digital audio data is then sequentially converted into an analog audio signal by a D / A conversion unit 503 in a decoded and expanded state, and then reproduced as audio from an audio output unit (such as a speaker or a headphone) 504. This is what has been output. As described above, on a packet communication network (asynchronous communication network) such as the Internet, a LAN, an ATM network, a frame relay network, etc., each voice data packet is not always received by the receiving side in the transmission order, and the received voice data packet is not always received. Each of the data packets is temporarily stored in a state where the data packets are continuously rearranged in the transmission order, and a certain waiting time is required until the data packets are read and reproduced.

【０００６】[0006]

【発明が解決しようとする課題】以上のように、これま
でにあっては、受信再生側に一定容量の再生バッファが
用意された上、順次受信される音声データパケット各々
は再生バッファにパケット送信順に並べ替えされた状態
として順次一時蓄積される一方では、その再生バッファ
から一定再生待ち時間後に順次読み出し再生されている
が、このような方法による場合、場合如何によっては不
具合を生じるものとなっている。というのは、パケット
通信網が一定再生待ち時間を必要としないような良好な
通信状態であっても、読み出し再生までに一定時間待つ
必要があり、これとは逆に、一定再生待ち時間を上回る
ような受信遅れが起こった場合には、再生上、空白時間
が生じてしまうからである。As described above, up to now, a reproduction buffer having a fixed capacity is prepared on the receiving and reproducing side, and each of the audio data packets sequentially received is transmitted to the reproducing buffer. While the data is temporarily stored in a sequentially sorted state, the data is sequentially read out and reproduced from the reproduction buffer after a predetermined reproduction waiting time. However, such a method may cause a problem depending on the case. I have. This is because, even in a good communication state in which the packet communication network does not require a constant reproduction wait time, it is necessary to wait for a certain time before reading and reproducing, and conversely, it exceeds the constant reproduction wait time. This is because if such a reception delay occurs, a blank time will occur during reproduction.

【０００７】本発明の目的は、送信音声データパケット
各々が受信されるに際し、その受信順序が送信順序とは
異なる状態として受信、即ち、ばらついた状態として受
信される場合に、そのバラツキ程度に応じて動的に最適
な再生待ち時間が随時設定された上、受信音声データパ
ケット各々が良好な音声として再生出力され得る音声ゆ
らぎ補正制御方法を供するにある。また、音声中継装置
で電話網へ中継する際に、通信網の状態により動的に最
適な再生待ち時間で音声再生し、送信することにより良
質な音声送信を実現するものである。[0007] An object of the present invention is to provide a transmission voice data packet which is received in a state where the reception order is different from the transmission order, that is, when received as a scattered state. Therefore, the present invention provides a sound fluctuation correction control method in which an optimum reproduction waiting time is dynamically set at any time, and each received sound data packet can be reproduced and output as good sound. Also, when relaying to a telephone network by a voice relay device, high quality voice transmission is realized by dynamically reproducing and transmitting voice with an optimum reproduction waiting time according to the state of the communication network.

【０００８】[0008]

【課題を解決するための手段】上記課題を解決するため
に、本発明は、第１に通信網を介して、符号化されたデ
ィジタル音声データを含む音声パケットを受信し、ディ
ジタル音声データを取り出し、ディジタル音声データを
再生バッファに一時蓄積して、再生出力する場合におい
て、受信した音声パケットに含まれる送信時に刻印した
送信時刻情報から算出する再生バッファ上の書き込み位
置と現在の再生位置との差、即ち、再生待ち間隔を監視
し、再生待ち間隔に基づいて、音声データ書き込み位置
を補正することを特徴とするものである。In order to solve the above-mentioned problems, the present invention firstly receives a voice packet containing encoded digital voice data via a communication network and extracts the digital voice data. In the case where digital audio data is temporarily stored in the reproduction buffer and reproduced and output, the difference between the current reproduction position and the write position on the reproduction buffer calculated from the transmission time information stamped at the time of transmission included in the received audio packet. That is, the reproduction waiting interval is monitored, and the audio data writing position is corrected based on the reproduction waiting interval.

【０００９】第２に、上記第１の解決手順で、音声パケ
ット受信の際に算出した再生待ち間隔と、予め設定した
許容できる再生待ち間隔の最小値あるいは最大値との比
較により、音声データ書き込み位置を補正することを特
徴とするものである。Second, in the first solution procedure, the audio data writing is performed by comparing the reproduction waiting interval calculated at the time of receiving the audio packet with a preset minimum or maximum allowable reproduction waiting interval. The position is corrected.

【００１０】第３に、上記第２の解決手順で、音声パケ
ット受信の際に算出した再生待ち間隔と、予め設定した
許容できる再生待ち間隔の最小値あるいは最大値との比
較で、音声パケット再生待ち間隔が最小値と最大値の範
囲外の場合に音声データ書き込み位置を補正することを
特徴とするものである。Third, in the second solution procedure, the audio packet reproduction is performed by comparing the reproduction wait interval calculated at the time of receiving the audio packet with a preset minimum or maximum allowable reproduction wait interval. When the waiting interval is out of the range between the minimum value and the maximum value, the voice data writing position is corrected.

【００１１】第４に、上記第３の解決手順で、音声パケ
ット受信の際に算出した再生待ち間隔と、予め設定した
許容できる再生待ち間隔の最小値と最大値との比較で、
音声パケット再生待ち間隔が最小値と最大値の範囲外と
なる事象の発生した頻度に基づいて、音声データ書き込
み位置を補正するものである。Fourth, in the third solution procedure, a comparison is made between the reproduction waiting interval calculated at the time of receiving the voice packet and the preset minimum and maximum allowable reproduction waiting intervals.
The audio data writing position is corrected based on the frequency of occurrence of an event in which the audio packet reproduction waiting interval is out of the range between the minimum value and the maximum value.

【００１２】第５に、上記第２の解決手順で、音声パケ
ット受信の際に算出した再生待ち間隔と、予め設定した
許容できる再生待ち間隔の最小値との比較で、最小値あ
るいは最小値より小となる事象の発生した頻度に基づい
て、音声データ書き込み位置を再生待ち間隔を狭める方
向に補正することを特徴とするものである。Fifth, in the second solution procedure, the reproduction waiting interval calculated at the time of receiving the audio packet is compared with a preset minimum allowable reproduction waiting interval, and the minimum value or the minimum value is determined. The audio data writing position is corrected in a direction to shorten the reproduction waiting interval based on the frequency of occurrence of a small event.

【００１３】第６に本発明による音声再生装置は、パケ
ット通信網から音声データパケットを受信するデータ受
信部と、音声ゆらぎを制御する上記記載の制御方式を使
用したゆらぎ補正制御部と、受信したデータパケット中
のディジタル音声データを保存する再生バッファと、再
生するディジタル音声データを再生バッファから読み出
す再生データ読み出し部と、前記再生データ読み出し部
が出力するディジタル音声データを音声データに変換し
て出力するＤ／Ａ変換部と、前記のＤ／Ａ変換部から出
力された音声データを再生出力する音声出力部を備えた
構成となっている。Sixthly, an audio reproducing apparatus according to the present invention comprises: a data receiving section for receiving an audio data packet from a packet communication network; a fluctuation correction control section using the above-described control method for controlling audio fluctuation; A reproduction buffer for storing digital audio data in the data packet, a reproduction data read unit for reading the digital audio data to be reproduced from the reproduction buffer, and converting the digital audio data output by the reproduction data read unit into audio data for output The configuration includes a D / A conversion unit and an audio output unit that reproduces and outputs the audio data output from the D / A conversion unit.

【００１４】第７に、上記第６に記載の音声再生装置の
パケット通信網は、インターネット、ローカルエリアネ
ットワーク、ＡＴＭネットワークあるいはフレームリレ
ーであって差し支えない。Seventh, the packet communication network of the audio reproducing apparatus according to the sixth aspect may be the Internet, a local area network, an ATM network or a frame relay.

【００１５】第８に、本発明による音声中継装置は、上
記第６に記載の音声再生装置の音声出力部に替えて交換
回線網へ音声を出力する回線対応部を備えた構成となっ
ている。Eighth, an audio relay apparatus according to the present invention is provided with a line corresponding section for outputting audio to a switched line network in place of the audio output section of the above-described audio reproducing apparatus. .

【００１６】第９に、上記第八に記載の音声中継装置
は、交換回線網は、公衆アナログ電話網、公衆ＩＳＤＮ
あるいは内線電話網であって差し支えない。Ninth, in the voice relay apparatus according to the eighth aspect, the switched network is a public analog telephone network, a public ISDN.
Alternatively, it may be an extension telephone network.

【００１７】[0017]

【発明の実施の形態】本発明は、パケット通信網等の非
同期通信網を介して、音声データを受信して音声再生す
る音声再生装置と再生音声を交換回線網に送信する音声
中継装置に適用されるもので、特にインターネットを利
用して音声データを通信するインターネット電話に適用
される。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS The present invention is applied to a voice reproducing apparatus for receiving voice data and reproducing voice via an asynchronous communication network such as a packet communication network and a voice relay apparatus for transmitting reproduced voice to a switching network. In particular, the present invention is applied to an Internet telephone that communicates voice data using the Internet.

【００１８】（実施例１）：以下、本発明の第１の実施
形態について図面に基づき詳細に説明する。図１は本実
施の形態に係る音声再生装置のブロック構成図である。
図１の１００はパケット通信網からデータパケットを受
信する受信部であり、１０５にデータを受信したことを
通知する。１０５は本発明の特徴をなすゆらぎ補正制御
部で、受信部１００からの通知をもとに再生バッファ１
０１への書き込み制御をする。１０１は再生バッファで
１０５から出力されたディジタル音声データを一時的に
保存する。１０２は再生データ読み出し部であり、連続
再生可能なディジタル音声データが１０１に蓄積される
のを待ってからディジタル音声データを読み出し、１０
３へ出力する。１０３はディジタル音声データを音声デ
ータに変換して１０４に出力するＤ／Ａ変換部である。
１０４はスピーカやヘッドホン等の音声出力部である。Example 1 Hereinafter, a first embodiment of the present invention will be described in detail with reference to the drawings. FIG. 1 is a block diagram of the audio reproducing apparatus according to the present embodiment.
A receiving unit 100 receives a data packet from a packet communication network, and notifies 105 that data has been received. Reference numeral 105 denotes a fluctuation correction control unit which is a feature of the present invention.
01 is controlled. A playback buffer 101 temporarily stores digital audio data output from the playback buffer 105. Reference numeral 102 denotes a reproduction data readout unit, which reads digital audio data after the digital audio data that can be continuously reproduced is stored in 101, and reads out the digital audio data.
Output to 3. Reference numeral 103 denotes a D / A conversion unit that converts digital audio data into audio data and outputs the data to 104.
Reference numeral 104 denotes an audio output unit such as a speaker or headphones.

【００１９】以上のように構成された音声再生装置の再
生バッファ１０１、再生データ読み出し部１０２、ゆら
ぎ補正制御部１０５について以下、図２と図３を参照し
つつ詳細に説明する。The playback buffer 101, playback data readout unit 102, and fluctuation correction control unit 105 of the audio playback apparatus configured as described above will be described below in detail with reference to FIGS.

【００２０】図２はゆらぎ補正制御部１０５のバッファ
制御処理を説明する図である。図のブロックの並びは図
１における再生バッファ１０１を表しており、図１のゆ
らぎ補正制御部１０５が音声データを最適位置に書込ん
でいき、再生データ読み出し部１０２が図２の左側のブ
ロックから読み出し、再生していくことを表している。
また、Ｂ０は音声再生する単位ブロックである。この値
は予め一定に設定されたサンプリング時間毎にディジタ
ル音声データ化された音声データの大きさであり、例え
ば、３０ｍ秒間隔のサンプリング時間で８ビット８ｋＨ
ｚのμ法則を用いた場合は２４０バイトのディジタル音
声データとなり、これが音声再生単位ブロックの大きさ
となる。Ｂ１は現在再生してる再生ブロック単位のブロ
ック番号を、Ｂ２は受信した音声データを書き込む再生
ブロック単位のブロック番号をそれぞれ示している。ｄ
は書き込みブロック番号Ｂ２と再生ブロック番号Ｂ１の
間隔を表す値で、すなわち受信した音声データを書き込
んでから再生されるまで待ち合わせる音声データのブロ
ック数を表している。図２の斜線部分が音声データを受
信してから再生されるまでに待ち合わせている音声デー
タを示している。FIG. 2 is a diagram for explaining a buffer control process of the fluctuation correction control unit 105. The arrangement of the blocks in the figure represents the reproduction buffer 101 in FIG. 1. The fluctuation correction control unit 105 in FIG. 1 writes the audio data at the optimum position, and the reproduction data reading unit 102 starts from the left block in FIG. It means reading and reproducing.
B0 is a unit block for sound reproduction. This value is the size of audio data that has been converted into digital audio data at every predetermined sampling time, for example, 8 bits 8 kHz at a sampling time of 30 ms.
When the μ-law of z is used, digital audio data of 240 bytes is obtained, which is the size of an audio reproduction unit block. B1 indicates the block number of the reproduction block currently being reproduced, and B2 indicates the block number of the reproduction block in which the received audio data is written. d
Is a value indicating the interval between the write block number B2 and the reproduction block number B1, that is, the number of audio data blocks that wait until the received audio data is written and then reproduced. The hatched portions in FIG. 2 indicate the audio data waiting from reception of the audio data to reproduction.

【００２１】この再生されるまでに待ち合わせるブロッ
ク数と音声再生には次のような関係がある。即ち、待ち
合わせるブロック数が多いほど、音声データを受信して
から再生されるまでの時間が長くなってしまうが、通信
状態によって到着時間が大きくゆらいでも音声が途切れ
ること無く連続に再生することが出来る。一方、再生さ
れるまでに待ち合わせるブロック数が少ないほど、音声
データを受信してから再生されるまでの時間が短くな
り、少ない遅延時間で再生することが出来るが、再生を
待ち合わせる時間以上の遅れで音声データが到着する場
合は音声が途切れて再生されるという関係がある。この
音声データを受信してから再生するまでに待ち合わせる
間隔の算出、即ち、再生ブロック番号Ｂ１に対する書き
込みブロック番号Ｂ２を決定する処理をゆらぎ補正制御
部１０５で行う。なお、Ｔｍｉｎは最小再生待ち時間を
表し、この値は書き込みブロック番号Ｂ２と再生ブロッ
ク番号Ｂ１との間隔ｄ値がとる最小値を表す。Ｔｍａｘ
は最大再生待ち時間を表し、間隔ｄ値がとる最大値を表
す。このＴｍｉｎ、Ｔｍａｘの値はゆらぎ補正制御処理
のため参照する値であり、音声データ受信前に予め設定
しておく値である。例えば再生単位ブロックＢ０を３０
ｍ秒間隔でサンプリングしたディジタル音声データとし
た場合、最小再生待ち時間を６０ｍ秒、最大３００ｍ秒
の再生遅延時間を許容すると設定した場合は、Ｔｍｉｎ
＝２、Ｔｍａｘ＝１０として設定される。The following relationship exists between the number of blocks waiting to be reproduced and audio reproduction. That is, as the number of waiting blocks increases, the time from when the audio data is received to when the audio data is reproduced becomes longer. However, even if the arrival time greatly fluctuates depending on the communication state, the audio can be reproduced continuously without interruption. . On the other hand, the smaller the number of blocks waiting to be played back, the shorter the time from when the audio data is received to when it is played back, and it can be played back with a small delay time, but with a delay longer than the time waiting for playback. When audio data arrives, the audio is interrupted and played. The fluctuation correction control unit 105 performs the calculation of the waiting interval from the reception of the audio data to the reproduction, that is, the process of determining the write block number B2 for the reproduction block number B1. Note that Tmin represents the minimum reproduction waiting time, and this value represents the minimum value of the interval d between the write block number B2 and the reproduction block number B1. Tmax
Represents the maximum reproduction waiting time, and represents the maximum value of the interval d. The values of Tmin and Tmax are values that are referred to for the fluctuation correction control processing, and are values that are set before receiving audio data. For example, if the reproduction unit block B0 is 30
If digital audio data sampled at m-second intervals is used, a minimum playback wait time of 60 ms and a maximum playback delay time of 300 ms are set to Tmin.
= 2, Tmax = 10.

【００２２】図３はゆらぎ補正制御部１０５の動作を説
明するフローチャートである。図３では、音声が到着す
る度に、当該フローを実行し、再生バッファ上の現在再
生ブロック番号と書き込みブロック番号との間隔ｄを求
め、この間隔ｄを最大値Ｔｍａｘ、最小値Ｔｍｉｎと比
較し、特定時間範囲である場合の発生頻度によって、再
生バッファに書き込む位置を補正する動作を説明してい
る。FIG. 3 is a flowchart for explaining the operation of the fluctuation correction control unit 105. In FIG. 3, each time a voice arrives, the flow is executed, an interval d between the current playback block number and the write block number in the playback buffer is obtained, and this interval d is compared with the maximum value Tmax and the minimum value Tmin. The operation of correcting the position to be written to the reproduction buffer according to the frequency of occurrence in the specific time range is described.

【００２３】図３で処理のために使用している作業用変
数を以下に説明すれば、補正値ＡＪは音声データを書き
込むブロック番号を補正する値で、整数値をとる。音声
データ到着が連続して遅れる状態が続くと、この値は増
加していき、現在再生ブロックより時間的に離れた再生
バッファに音声データを書き込むように補正される。一
方、音声データが連続して早く到着する状態が続くと、
この値は減少していき、現在再生ブロックに時間的に近
い再生バッファに書き込むように補正される。The working variables used for the processing in FIG. 3 will be described below. The correction value AJ is a value for correcting the block number in which the audio data is written, and takes an integer value. When the state where the arrival of the audio data is continuously delayed continues, this value increases and is corrected so that the audio data is written to a reproduction buffer temporally separated from the current reproduction block. On the other hand, if audio data continues to arrive early,
This value decreases, and is corrected so as to write to a reproduction buffer that is temporally closer to the current reproduction block.

【００２４】送信時刻情報Ｔは、音声送信装置にて音声
を符号化したデータ量に依存した時刻を表す情報であ
り、音声データとともに付加して送信する。インターネ
ットでの音声、映像のようなリアルタイムトラフィック
を送受信する場合、ＲＴＰ（Real Time Protocol）が用
いられる。ＲＴＰは主にＵＤＰの上位層で規定されるプ
ロトコルであり、リアルタイムデータを制御する目的で
時刻印やシーケンス番号等の情報を定義している。この
プロトコルの時刻情報部を利用して音声送信時の時刻情
報を送信する。例えば３０ｍ秒間隔のサンプリング時間
で８ビット８ｋＨz のμ法則を用いた場合は、３０ｍ秒
毎に２４０バイトの音声データが生成されるが、このと
きの送信時刻情報Ｔは１パケット毎に２４０づつ加算さ
れた値が設定される。The transmission time information T is information indicating a time depending on the data amount of voice encoded by the voice transmitting device, and is transmitted together with the voice data. When transmitting and receiving real-time traffic such as audio and video on the Internet, RTP (Real Time Protocol) is used. RTP is a protocol mainly defined in the upper layer of UDP, and defines information such as a time stamp and a sequence number for the purpose of controlling real-time data. The time information at the time of voice transmission is transmitted using the time information part of this protocol. For example, when the 8-bit 8 kHz μ-law is used at a sampling time of 30 ms, 240 bytes of audio data are generated every 30 ms, and the transmission time information T at this time is added by 240 for each packet. The set value is set.

【００２５】最小値比較カウンタＣＭｉｎは、間隔ｄが
最小再生待ち時間Ｔｍｉｎ以上である場合が連続して発
生した回数をカウントしておく値で、このカウンタ値Ｃ
Ｍｉｎが予め設定しておく比較値以上連続してカウント
されたとき、音声データはＴｍｉｎより大きな再生待ち
合せ時間で定常的に再生されているので、より早く再生
するよう補正するため、データ書き込みブロック位置を
再生ブロック位置側に近づけるように補正値ＡＪを変更
する。ＣＭｉｎと比較するために予め設定しておく比較
値は補正最高値ＡＪＷｍａｘで、例えば１０回連続でＣ
Ｍｉｎがカウントされたときに補正値ＡＪを変更するよ
う設定する場合は、ＡＪＷｍａｘ＝１０となる。The minimum value comparison counter CMin is a value that counts the number of times that the interval d is continuously equal to or longer than the minimum reproduction waiting time Tmin.
When Min is continuously counted over a preset comparison value, the audio data is constantly reproduced with a reproduction waiting time longer than Tmin. Is changed so that is closer to the reproduction block position side. The comparison value set in advance for comparison with CMin is the corrected maximum value AJWmax.
When setting to change the correction value AJ when Min is counted, AJWmax = 10.

【００２６】補正値増加カウンタＡＪＰは、間隔ｄが最
小再生待ち時間Ｔｍｉｎより小のときが連続して発生し
た回数をカウントする値であって、このカウンタ値が予
め設定しておく比較値以上連続で発生したときに、再生
ブロック位置に時間的に離れる方向に音声データを書き
込むように補正値ＡＪを変更する。ＡＪＰと比較するた
めに予め設定しておく比較値は補正値増加変更カウンタ
ＡＪＰＣｈで、例えば２回連続でＡＪＰがカウントされ
たときに補正値ＡＪを変更するように設定する場合は、
ＡＪＰＣｈ＝２となる。The correction value increase counter AJP is a value for counting the number of times that the interval d is continuously smaller than the minimum reproduction waiting time Tmin, and the counter value is larger than a preset comparison value. When the error occurs, the correction value AJ is changed so that the audio data is written in the direction temporally away from the reproduction block position. The comparison value set in advance for comparison with AJP is a correction value increase change counter AJPCh. For example, when the correction value AJ is changed when AJP is counted twice consecutively,
AJPCh = 2.

【００２７】補正値減少カウンタＡＪＭは、差ｄが最大
再生待ち時間Ｔｍａｘより大のときが連続して発生した
回数をカウントする値であって、このカウンタ値が予め
設定しておく比較値以上連続で発生したときに、再生ブ
ロック位置から時間的に近い再生バッファに音声データ
を書き込むように補正値ＡＪを変更する。ＡＪＭと比較
するために予め設定しておく比較値は補正値減少変更カ
ウンタＡＪＭＣｈで、例えば２回連続でＡＪＭがカウン
トされたときに補正値ＡＪを変更するように設定する場
合は、ＡＪＭＣｈ＝２となる。The correction value decrease counter AJM is a value for counting the number of consecutive occurrences when the difference d is greater than the maximum reproduction waiting time Tmax, and the counter value is continuously larger than a preset comparison value. When the error occurs, the correction value AJ is changed so that the audio data is written to the reproduction buffer temporally close to the reproduction block position. A comparison value set in advance for comparison with AJM is a correction value decrease change counter AJMCh. For example, when the correction value AJ is changed when AJM is counted twice consecutively, AJMCh = 2. Becomes

【００２８】音声再生装置では、新たな音声パケットが
到着する度に、図３のフローを実行し、補正値ＡＪと送
信時刻情報Ｔを元に、再生バッファへ書き込む音声デー
タの書き込み位置を決定するとともに、送信時刻情報Ｔ
から算出する再生バッファ上の書き込み位置と現在の再
生位置との差、即ち、再生待ち間隔に基づいて補正値Ａ
Ｊを算出する。送信時刻情報Ｔは、例えば、音声送信装
置側で音声データを符号化したデータ量に依存する値を
基準値として、音声をサンプリングした時間を刻印した
値であり、音声送信装置はこの値をディジタル音声デー
タとともに付加して送信する。The audio reproducing apparatus executes the flow of FIG. 3 every time a new audio packet arrives, and determines the write position of the audio data to be written into the reproduction buffer based on the correction value AJ and the transmission time information T. And transmission time information T
The correction value A is calculated based on the difference between the write position on the reproduction buffer and the current reproduction position, that is, the reproduction wait interval.
Calculate J. The transmission time information T is, for example, a value obtained by engraving the time at which the audio was sampled with a value depending on the data amount obtained by encoding the audio data on the audio transmitting device side as a reference value. It is transmitted together with the audio data.

【００２９】以下、図３について動作を詳細に説明す
る。新しい音声データを受信する（Ｓ１）と受信データ
から送信時刻情報Ｔを取得する（Ｓ２）。次に現在の再
生ブロック番号Ｂ１を取得する（Ｓ３）。そして、送信
時刻情報Ｔと補正値ＡＪを元に書き込みブロック番号Ｂ
２を算出する（Ｓ４）。この補正値ＡＪは直前に到着し
た受信データ処理により更新されている（初期値は
０）。送信時刻情報Ｔは３０ｍ秒間隔サンプリングで８
ビット８ｋＨz のμ法則の場合、２４０の整数倍の値で
あり、Ｔを２４０で割り、補正値ＡＪを加えることで、
補正された書き込みブロック番号Ｂ２を算出している。Hereinafter, the operation will be described in detail with reference to FIG. When new voice data is received (S1), transmission time information T is obtained from the received data (S2). Next, the current playback block number B1 is obtained (S3). Then, based on the transmission time information T and the correction value AJ, the write block number B
2 is calculated (S4). The correction value AJ has been updated by the immediately preceding received data processing (the initial value is 0). The transmission time information T is 8 at 30 ms interval sampling.
In the case of the μ-law of the bit 8 kHz, the value is an integral multiple of 240. By dividing T by 240 and adding the correction value AJ,
The corrected write block number B2 is calculated.

【００３０】次に、書き込みブロック番号Ｂ２から現在
の再生ブロック番号Ｂ１の値を引き、間隔ｄを算出する
（Ｓ５）。次に、最小値比較カウンタＣＭｉｎを加算
（インクリメント：＋１更新）しておく（Ｓ６）。この
ＣＭｉｎは間隔ｄがＴｍｉｎより大きい場合が連続して
発生する状態のときに、より早く音声再生するよう、書
き込みブロック番号を補正するために用いる値である。
次に、書き込み再生位置ブロック間隔ｄが最小再生待ち
時間Ｔｍｉｎ以下かを判定する（Ｓ７）。判定が偽であ
ったら、次に間隔ｄを最大再生待ち時間Ｔｍａｘと比較
するためにに分岐し、真であったら、間隔ｄがＴｍｉ
ｎより大きい場合が連続しなかったので、最小値比較カ
ウンタＣＭｉｎを初期化する（Ｓ８）。Next, the value of the current reproduction block number B1 is subtracted from the write block number B2 to calculate an interval d (S5). Next, the minimum value comparison counter CMin is incremented (increment: +1 updated) (S6). This CMin is a value used to correct the write block number so that the sound is reproduced earlier when the case where the interval d is larger than Tmin occurs continuously.
Next, it is determined whether the writing / reproducing position block interval d is equal to or less than the minimum reproduction waiting time Tmin (S7). If the determination is false, then the interval d is branched to compare with the maximum reproduction wait time Tmax, and if true, the interval d is set to Tmi.
Since the case where the value is larger than n is not continuous, the minimum value comparison counter CMin is initialized (S8).

【００３１】次に、補正値増加カウンタＡＪＰは補正値
増加変更カウンタＡＪＰＣｈ以上であれば（Ｓ９）補正
値ＡＪを加算（インクリメント）し（Ｓ１０）、補正値
増加カウンタＡＪＰを初期化する（Ｓ１１）。その後、
補正値増加カウンタＡＪＰの加算（インクリメント）と
補正値減少カウンタＡＪＭの初期化をする（Ｓ１２）。
その後、に分岐し、書き込みブロック番号Ｂ２へ受信
データを書き込む（Ｓ２２）。間隔ｄがＴｍｉｎ以下で
あった場合、当該受信データの処理はこれで終了し、
に戻り次受信データ到着を待つ。Next, if the correction value increase counter AJP is equal to or larger than the correction value increase change counter AJPCh (S9), the correction value AJ is added (incremented) (S10), and the correction value increase counter AJP is initialized (S11). . afterwards,
The correction value increase counter AJP is added (incremented) and the correction value decrease counter AJM is initialized (S12).
Thereafter, the flow branches to (2), and the received data is written to the write block number B2 (S22). When the interval d is equal to or smaller than Tmin, the processing of the received data is completed here.
And waits for the next received data to arrive.

【００３２】ステップ（Ｓ７）の判定が偽のとき、書き
込み再生位置ブロック間隔ｄが最大再生待ち時間Ｔｍａ
ｘ以上かを判定する（Ｓ１３）。偽のとき、即ち、間隔
ｄがＴｍｉｎとＴｍａｘとの間であった場合、処理へ
分岐し、書き込みブロック番号Ｂ２へ受信データを書き
込む（Ｓ２２）、補正値ＡＪは変更せずにに戻る。真
のとき補正値減少カウンタＡＪＭが補正値減少変更カウ
ンタＡＪＭＣｈ以上かを判定して（Ｓ１４）、補正値Ａ
Ｊを減算（デクリメント：−１更新）し（Ｓ１５）、補
正値減少カウンタＡＪＭを初期化する（Ｓ１６）。その
後、補正値減少カウンタＡＪＭの加算（インクリメン
ト）と補正値増加カウンタＡＪＰを初期化する（Ｓ１
７）。When the determination in step (S7) is false, the writing / reproducing position block interval d is equal to the maximum reproduction waiting time Tma.
It is determined whether the value is equal to or more than x (S13). When the result is false, that is, when the interval d is between Tmin and Tmax, the process branches to write the received data to the write block number B2 (S22), and returns without changing the correction value AJ. If true, it is determined whether the correction value decrease counter AJM is equal to or greater than the correction value decrease change counter AJMCh (S14), and the correction value A
J is decremented (decrement: -1 updated) (S15), and a correction value decrease counter AJM is initialized (S16). Thereafter, the addition (increment) of the correction value decrease counter AJM and the correction value increase counter AJP are initialized (S1).
7).

【００３３】ステップ（Ｓ７）と（Ｓ１３）の判定がと
もに偽のとき、即ち、間隔ｄがＴｍｉｎとＴｍａｘとの
間であった場合、最小値比較カウンタＣＭｉｎが補正最
高値ＡＪＷｍａｘ以上かを判定する（Ｓ１８）。真なら
補正値ＡＪを減算（デクリメント）（Ｓ１９）、補正値
減少カウンタＡＪＭを初期化（Ｓ２０）し、最小値比較
カウンタＣＭｉｎを初期化する（Ｓ２１）。補正最高値
ＡＪＷｍａｘは予め設定しておく値で、ステップ（Ｓ１
８）の判定で、間隔ｄが最小再生待ち時間Ｔｍｉｎより
大きい場合が、補正最高値ＡＪＷｍａｘの回数分連続し
て起こったら補正値ＡＪを減算（デクリメント）する。
即ち、受信データの到着時間が一定時間幅に定常的に受
信するような、通信網が安定している状態のときは、再
生待ち時間を少なくするように、書き込みブロック番号
を補正するため、補正値ＡＪを減算（デクリメント）す
る。When the determinations in steps (S7) and (S13) are both false, that is, when the interval d is between Tmin and Tmax, it is determined whether the minimum value comparison counter CMin is equal to or greater than the corrected maximum value AJWmax. (S18). If true, the correction value AJ is subtracted (decremented) (S19), the correction value decrease counter AJM is initialized (S20), and the minimum value comparison counter CMin is initialized (S21). The correction maximum value AJWmax is a value set in advance, and is determined in step (S1).
In the determination of 8), when the interval d is longer than the minimum reproduction waiting time Tmin, if the number of times of the maximum correction value AJWmax occurs continuously, the correction value AJ is subtracted (decremented).
That is, when the communication network is stable such that the arrival time of the received data is constantly received within a fixed time width, the write block number is corrected so as to reduce the reproduction waiting time. The value AJ is subtracted (decremented).

【００３４】その後、書き込みブロック番号Ｂ２へ受信
データを書き込む（Ｓ２２）、に戻り次受信データを
待機する。Thereafter, the received data is written to the write block number B2 (S22), and the flow returns to the standby state for the next received data.

【００３５】以上の到着時間ゆらぎ補正制御部の処理制
御により、通信網状態によって音声データの到着時間に
ゆらぎが生じても音声再生待ち時間を動的に補正制御す
ることで、通信網が大きな再生待ち時間を必要としない
ような良好な通信状態のときは再生待ち時間を最小時間
に、逆に音声データ到着が遅れる通信状態のときは大き
な再生待ち時間に変化させられる。By the processing control of the arrival time fluctuation correction control unit described above, even if the arrival time of voice data fluctuates depending on the state of the communication network, the voice reproduction waiting time is dynamically corrected and controlled, so that the communication network has a large reproduction time. The reproduction wait time is changed to the minimum time when the communication state is good such that no wait time is required, and the reproduction wait time is changed to a long reproduction wait time when the communication state delays the arrival of audio data.

【００３６】さて、以上に説明したゆらぎ補正制御部の
動作を踏まえ、ゆらぎ補正制御部を含む音声再生装置の
動作について説明する。Now, the operation of the audio reproducing apparatus including the fluctuation correction control unit will be described based on the operation of the fluctuation correction control unit described above.

【００３７】受信部１００はパケット通信網から自音声
再生装置宛てのディジタル音声データを受信すると、ゆ
らぎ補正制御部１０５へ受信したことを通知する。１０
５は通知を受けると受信データから送信時刻情報Ｔを取
得し、この値から上記で説明したゆらぎ補正制御方法に
より、最適な再生待ち時間を算出する。この算出した最
適再生待ち時間を基にして１０５は、再生バッファ１０
１の最適な位置へディジタル音声データを書き込む。こ
の書き込む位置の動的な最適化により、通信状態に応じ
た再生待ち時間を待つように調整する。１０２は再生バ
ッファ１０１のディジタル音声データを逐次読み出し、
１０３へ出力する。Ｄ／Ａ変換部１０３は１０２から受
け取ったディジタル音声データを逐次、Ｄ／Ａ変換して
音声出力部１０４へ出力する。１０４からは音声として
再生出力する。When receiving the digital audio data addressed to the own audio reproducing apparatus from the packet communication network, the receiving section 100 notifies the fluctuation correction control section 105 of the reception. 10
5 receives the notification, obtains the transmission time information T from the received data, and calculates the optimum reproduction waiting time from the value by the fluctuation correction control method described above. Based on the calculated optimal reproduction waiting time, 105
1. Write digital audio data to the optimum position. By dynamically optimizing the writing position, adjustment is made so as to wait for a reproduction waiting time according to the communication state. 102 sequentially reads out digital audio data from the reproduction buffer 101,
Output to 103. The D / A conversion unit 103 sequentially performs D / A conversion on the digital audio data received from the D / A converter 102 and outputs the digital audio data to the audio output unit 104. 104 reproduces and outputs the voice.

【００３８】以上のような本発明の第一の実施形態によ
れば、上記の音声データ到着時間ゆらぎ補正制御部を含
む音声再生装置によって、インターネットに代表するデ
ータ到着時間にゆらぎが発生する非同期通信網での音声
通信をする際、通信網状態に対応した最適な再生待ち時
間で音声再生が実現される。According to the above-described first embodiment of the present invention, the asynchronous communication in which the data arrival time typified by the Internet fluctuates by the audio reproduction apparatus including the above-described audio data arrival time fluctuation correction control unit. When performing voice communication on a network, voice reproduction is realized with an optimum reproduction waiting time corresponding to the communication network state.

【００３９】（実施例２）：以下本発明の第２の実施形
態について図面に基づき説明すれば、図４は本実施の形
態に係る音声中継装置のブロック構成図である。図４
で、４００はパケット通信網からデータパケットを受信
する受信部であり、４０５にデータを受信したことを通
知する。４０５は本発明の特徴をなすゆらぎ補正制御部
で、受信部４００からの通知をもとに再生バッファ４０
１への書き込み制御をする。４０１は再生バッファで４
０５から出力されたディジタル音声データを一時的に保
存する。４０２は再生データ読み出し部であり、連続再
生可能なディジタル音声データが４０１に蓄積されるの
を待ってからディジタル音声データを読み出し、４０３
へ出力する。４０３はディジタル音声データを音声デー
タに変換して４０４に出力するＤ／Ａ変換部である。４
０４は回線対応部で４０３から出力された音声を電話回
線網へ送信する。Embodiment 2 Hereinafter, a second embodiment of the present invention will be described with reference to the drawings. FIG. 4 is a block diagram of a voice relay apparatus according to the present embodiment. FIG.
A receiving unit 400 receives a data packet from a packet communication network, and notifies a receiving unit 405 that data has been received. Reference numeral 405 denotes a fluctuation correction control unit which is a feature of the present invention.
1 is controlled to be written. 401 is a playback buffer 4
05 is temporarily stored. Reference numeral 402 denotes a reproduction data readout unit which reads digital audio data after waiting for digital audio data that can be continuously reproduced to be stored in 401;
Output to A D / A converter 403 converts digital audio data into audio data and outputs the audio data to 404. 4
Reference numeral 04 denotes a line corresponding unit for transmitting the voice output from 403 to the telephone line network.

【００４０】以上のように構成された音声中継装置につ
いて、その動作について説明すれば、受信部４００はパ
ケット通信網から自音声中継装置宛てのディジタル音声
データを受信すると、ゆらぎ補正制御部４０５へ受信し
たことを通知する。４０５は通知を受けると受信データ
から送信時刻情報Ｔを取得し、この値から上記で説明し
たゆらぎ補正制御方法により、最適な再生待ち時間を算
出する。この算出した最適再生待ち時間を基にして４０
５は、再生バッファ４０１の最適な位置へディジタル音
声データを書き込む。この書き込む位置の動的な最適化
により、通信状態に応じた再生待ち時間を待つように調
整する。４０２は再生バッファ４０１のディジタル音声
データを逐次読み出し、４０３へ出力する。Ｄ／Ａ変換
部４０３は４０２から受け取ったディジタル音声データ
を逐次、Ｄ／Ａ変換して音声出力部４０４へ出力する。
４０４からは電話回線網へ再生出力される。The operation of the voice relay apparatus configured as described above will be described. When receiving the digital voice data addressed to the voice relay apparatus from the packet communication network, the receiving section 400 transmits the digital voice data to the fluctuation correction control section 405. Notify that Upon receiving the notification, the transmission time information 405 obtains transmission time information T from the received data, and calculates an optimum reproduction waiting time from the value by the fluctuation correction control method described above. Based on the calculated optimal reproduction waiting time, 40
5 writes digital audio data to the optimum position of the reproduction buffer 401. By dynamically optimizing the writing position, adjustment is made so as to wait for a reproduction waiting time according to the communication state. 402 sequentially reads out digital audio data from the reproduction buffer 401 and outputs it to 403. The D / A converter 403 sequentially converts the digital audio data received from 402 into a digital signal and outputs the digital audio data to the audio output unit 404.
From 404, it is reproduced and output to the telephone line network.

【００４１】以上の動作により、本発明の第２の実施形
態によれば、上記の音声データ到着時間ゆらぎ補正制御
部を含む音声中継装置によって、インターネットに代表
するデータ到着時間にゆらぎが発生する非同期通信網で
の音声通信の中継をする際、通信網状態に対応した再生
待ち時間で音声再生し、電話回線網へ送信することで最
適な音声中継が実現される。According to the above operation, according to the second embodiment of the present invention, the voice relay apparatus including the voice data arrival time fluctuation correction control unit causes the data arrival time typified by the Internet to fluctuate. When relaying voice communication in a communication network, voice is reproduced with a reproduction waiting time corresponding to the state of the communication network and transmitted to the telephone line network, whereby an optimum voice relay is realized.

【００４２】なお、以上の説明から判るように、再生待
ち時間を小さくするように、音声データの書き込み位置
が変更された上、書き込みが行われる場合は、未再生音
声データ上に上書きされる結果として、その未再生音声
データは無効化されることになる。例えば３０ｍ秒間隔
のサンプリングの場合、一回の書き込み位置補正で３０
ｍ秒分の音声データが無効化されるというものである。
したがって、仮に、連続的に書き込み位置の変更が行わ
れれば、連続的に音声データの無効化が生じる結果とし
て、聴覚上、音声の途切れが感じられることになる。し
かしながら、本発明による場合には、指定回数分、再生
待ち時間を小さくする事象が発生した場合に、初めて書
き込み位置補正処理が行われており、この結果、時間的
に離散した状態として音声データが無効化されているこ
とから、聴覚上、殆ど問題とはされないものとなってい
る。As can be seen from the above description, the writing position of the audio data is changed so that the reproduction waiting time is shortened, and when writing is performed, the result is overwritten on the unreproduced audio data. As a result, the unplayed audio data is invalidated. For example, in the case of sampling at intervals of 30 ms, one writing position correction requires 30
That is, m seconds of audio data are invalidated.
Therefore, if the writing position is continuously changed, as a result of the invalidation of the audio data, the sound is interrupted as perceived. However, according to the present invention, the write position correction process is performed for the first time when an event that reduces the playback wait time by the specified number of times occurs, and as a result, the audio data is in a time-discrete state. Because it has been disabled, it is of little concern for hearing.

【００４３】一方、以上とは逆に、再生待ち時間を大き
くするように、音声データの書き込み位置が変更された
上、書き込みが行われる場合には、その書き込み位置と
既に音声データが書き込みされている位置との間には大
きなアドレス差が生じる結果として、その間での音声の
再生に際しては、音声が全く再生されないか（音声再生
上の途切れに相当）、または既に再生済の音声データが
再生されることになる（音声再生上の不連続再生に相
当）。このような不具合は、直前音声データの繰返し再
生による補間方法によって、ある程度軽減され得るもの
となっている。したがって、仮に、連続的に書き込み位
置の変更が行われれば、音声データの連続性が失われる
結果として、聴覚上、同一音声の補間再生により不自然
さを感じることになる。しかしながら、本発明による場
合、指定回数分、再生待ち時間を大きくする事象が発生
した場合に、初めて書き込み位置補正処理が行われてお
り、この結果、時間的に離散した状態として再生待ち時
間が遅らされていることから、聴覚上、殆ど問題とはな
らないものとなっている。On the other hand, contrary to the above, the writing position of the audio data is changed so as to increase the reproduction waiting time, and when writing is performed, the writing position and the audio data have already been written. As a result, there is a large address difference between the current position and the current position, so that when the sound is reproduced during that time, no sound is reproduced (corresponding to a break in sound reproduction), or already reproduced sound data is reproduced. (Corresponding to discontinuous reproduction in audio reproduction). Such inconvenience can be reduced to some extent by an interpolation method based on repeated reproduction of the immediately preceding audio data. Therefore, if the writing position is continuously changed, the continuity of the audio data is lost, and as a result, the unnaturalness is perceived by the interpolation reproduction of the same audio. However, according to the present invention, the write position correction process is performed for the first time when an event that increases the reproduction waiting time by the specified number of times occurs, and as a result, the reproduction waiting time is delayed as a time discrete state. As a result, there is little problem with hearing.

【００４４】[0044]

【発明の効果】以上、説明したように、本発明によれ
ば、非同期通信網（インターネット等）を介し音声通信
が行われた上、受信ディジタル音声データが音声再生装
置で音声として再生される際に、通信網の状態により動
的に最適な再生待ち時間が設定された上、音声が再生さ
れることによって、良質な音声通話を実現する。また、
非同期通信網を介しての音声通信を行うために受信した
ディジタル音声データを音声中継装置で電話網へ送信す
る際に、通信網の状態により動的に最適な再生待ち時間
で音声再生をすることにより良質な音声送信を実現す
る。As described above, according to the present invention, when voice communication is performed via an asynchronous communication network (such as the Internet) and received digital voice data is reproduced as voice by a voice reproducing apparatus. In addition, the optimal reproduction waiting time is dynamically set according to the state of the communication network, and the high-quality voice communication is realized by reproducing the voice. Also,
When digital voice data received for performing voice communication via an asynchronous communication network is transmitted to a telephone network by a voice relay device, voice playback is dynamically performed at an optimum playback waiting time depending on the state of the communication network. To realize high quality voice transmission.

[Brief description of the drawings]

【図１】図１は、本発明による音声再生装置のブロック
構成を示す図FIG. 1 is a diagram showing a block configuration of an audio reproducing apparatus according to the present invention.

【図２】図２は、その音声再生装置におけるゆらぎ補正
制御処理を説明するための図FIG. 2 is a diagram for explaining fluctuation correction control processing in the audio reproduction device;

【図３】図３は、そのゆらぎ補正制御処理のフローを示
す図FIG. 3 is a diagram showing a flow of the fluctuation correction control processing;

【図４】図４は、本発明による音声中継装置のブロック
構成を示す図FIG. 4 is a diagram showing a block configuration of a voice relay device according to the present invention;

【図５】図５は、従来技術に係る音声再生装置のブロッ
ク構成を示す図FIG. 5 is a diagram showing a block configuration of an audio reproducing apparatus according to the related art.

[Explanation of symbols]

１００，４００…受信部、１０１，４０１…再生バッフ
ァ、１０２，４０２…再生データ読み出し部、１０３，
４０３…Ｄ／Ａ変換部、１０４…音声出力部、１０５，
４０５…ゆらぎ補正制御部、４０４…回線対応部、Ｂ０
…再生単位ブロック、Ｂ１…再生位置、Ｂ２…書き込み
位置100, 400 ... receiving unit, 101, 401 ... reproduction buffer, 102, 402 ... reproduction data reading unit, 103,
403: D / A converter, 104: audio output unit, 105,
405: fluctuation correction control unit, 404: line corresponding unit, B0
... reproduction unit block, B1 ... reproduction position, B2 ... write position

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩテーマコート゛(参考）Ｈ０４Ｑ 11/04 Ｈ０４Ｑ 11/04 ＲＦターム(参考） 5K030 GA11 HA08 HB01 JT01 KA03 KA19 5K033 BA14 DB10 DB13 5K069 AA01 BA02 BA10 CA08 FC03 FC13 FC16 5K101 LL05 SS08 9A001 CZ04 CZ08 HH15 JZ25 KK31──────────────────────────────────────────────────続き Continued on the front page (51) Int.Cl. ⁷ Identification code FI Theme coat ゛ (Reference) H04Q 11/04 H04Q 11/04 RF term (Reference) 5K030 GA11 HA08 HB01 JT01 KA03 KA19 5K033 BA14 DB10 DB13 5K069 AA01 BA02 BA10 CA08 FC03 FC13 FC16 5K101 LL05 SS08 9A001 CZ04 CZ08 HH15 JZ25 KK31

Claims

[Claims]

When receiving a voice packet including encoded digital voice data via a communication network, extracting the digital voice data, temporarily storing the digital voice data in a reproduction buffer, and reproducing and outputting the data, The difference between the write position on the reproduction buffer calculated from the transmission time information stamped at the time of transmission included in the received audio packet and the current reproduction position, that is, the reproduction wait interval is monitored, and the audio data is A sound fluctuation correction control method, wherein a writing position is corrected.

2. The audio data writing position is corrected by comparing a reproduction waiting interval calculated at the time of receiving an audio packet with a preset minimum or maximum allowable reproduction waiting interval. Item 2. The sound fluctuation correction control method according to Item 1.

3. A range between the minimum value and the maximum value of the audio packet reproduction waiting interval, which is obtained by comparing the reproduction waiting interval calculated at the time of receiving the audio packet with a preset minimum or maximum allowable reproduction waiting interval. 3. The audio fluctuation correction control method according to claim 2, wherein the audio data writing position is corrected when the voice data is outside.

4. A range between the minimum value and the maximum value of the voice packet reproduction waiting interval, which is obtained by comparing the reproduction waiting interval calculated at the time of receiving the voice packet with the preset minimum value and maximum value of the allowable reproduction waiting interval. 4. The sound fluctuation correction control method according to claim 3, wherein the sound data writing position is corrected based on the frequency of occurrence of an outside event.

5. A comparison between a playback waiting interval calculated at the time of receiving a voice packet and a preset minimum value of an allowable playback waiting interval, based on a frequency of occurrence of a minimum value or an event smaller than the minimum value. 3. The audio fluctuation correction control method according to claim 2, wherein the audio data writing position is corrected in a direction to shorten the reproduction waiting interval.

6. A voice fluctuation correction control method according to claim 1, wherein the data receiving unit receives a voice data packet from a packet communication network, and controls the fluctuation of the voice data arrival time. , A reproduction buffer for storing digital audio data in a received data packet, a reproduction data readout unit for reading out digital audio data to be reproduced from the reproduction buffer, and a digital output from the reproduction data readout unit. D / A that converts audio data into audio data and outputs it
An audio reproducing apparatus comprising: a conversion unit; and an audio output unit that reproduces and outputs audio data output from the D / A conversion unit.

7. The audio reproducing apparatus according to claim 6, wherein the packet communication network is the Internet.

8. The audio reproducing apparatus according to claim 6, wherein the packet communication network is a local area network.

9. The audio reproducing apparatus according to claim 6, wherein the packet communication network is an ATM network.

10. The audio reproducing apparatus according to claim 6, wherein the packet communication network is a frame relay network.

11. A voice relay device comprising a line corresponding unit for outputting voice to a circuit switching network in place of the voice output unit according to claim 6.

12. The voice relay apparatus according to claim 11, wherein the circuit switching network is a public analog telephone network.

13. The voice relay apparatus according to claim 11, wherein the circuit switching network is a public ISDN.

14. The voice relay apparatus according to claim 11, wherein the circuit switching network is an extension telephone network.