JP2001251342A

JP2001251342A - Packet receiver

Info

Publication number: JP2001251342A
Application number: JP2000060064A
Authority: JP
Inventors: Shinji Hayakawa; 慎司早川; Satoshi Watanabe; 聡渡辺
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 2000-03-06
Filing date: 2000-03-06
Publication date: 2001-09-14
Anticipated expiration: 2020-03-06
Also published as: JP3813046B2

Abstract

PROBLEM TO BE SOLVED: To provide a packet receiver that can enhance sound quality in the case of insertion of a correction packet. SOLUTION: The packet receiver detects whether or not each communication packet is normally received in time series on the basis of an identifier given to each communication packet containing valid voice data by a transmitter side, and decides whether or not a prescribed correction packet not containing valid voice data is to be inserted to a received communication packet series. The packet receiver is provided with a communication packet size detection means that checks a size of a communication packet being that of the received communication packet and a correction packet insertion means that selects at least a size smaller than that of the communication packet as the size of the correction packet when the size of the communication packet exceeds a threshold value of the communication packet size set to the packet insertion means.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明はパケット受信装置に
関し、例えば、符号化した音声信号を収容して伝送され
てきたパケットを受信、復号して音声出力を行う場合な
どに適用し得るものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a packet receiving apparatus, and can be applied to, for example, a case where a packet transmitted containing an encoded voice signal is received, decoded, and voice output is performed. .

【０００２】[0002]

【従来の技術】音声データを収容したパケットを送信
し、受信するパケット音声通信では、受信側装置へのパ
ケット到着の遅れ（“ジッタ”、または、“ゆらぎ”と
呼ぶ）と、通信中にパケットが失われるパケット損失
（または、“パケットロス”）が、通話品質に大きな影
響をおよぼす。ジッタとパケットロスに共通する悪影響
は、受信側装置に搭載された受信バッファが空になるこ
とである。ジッタが大きくなり、受信バッファに蓄積し
てあるデータを使いきるまでに、次のパケットが到着し
ないと受信バッファが空になる。パケットロスが発生す
ると、本来到着するはずのデータが紛失するため、紛失
したパケット分のデータ量が失われ、受信バッファにデ
ータが蓄積されないまま、それまでに蓄積したデータを
消費してしまう。このため、パケットロス後に次パケッ
トが到着するのが遅れると、大きなジッタの場合同様、
受信バッファが空になる。受信バッファが空になると再
生するデータがなくなるため、最悪の場合は音声が途切
れてしまう。2. Description of the Related Art In packet voice communication in which a packet containing voice data is transmitted and received, a packet arrival delay (called "jitter" or "fluctuation") at a receiving apparatus is called a packet during communication. Packet loss (or “packet loss”), which has a significant effect on call quality. An adverse effect common to jitter and packet loss is that the receiving buffer mounted on the receiving device becomes empty. If the next packet does not arrive before the data accumulated in the reception buffer is used up, the reception buffer becomes empty. When a packet loss occurs, the data that should have arrived is lost, so that the data amount of the lost packet is lost, and the data stored so far is consumed without storing the data in the reception buffer. For this reason, if the arrival of the next packet is delayed after a packet loss, as in the case of large jitter,
The receive buffer becomes empty. When the reception buffer becomes empty, there is no data to be reproduced, and in the worst case, the sound is interrupted.

【０００３】パケットロスには、これ以外にも悪影響が
ある。例えば、第１パケットから第３パケットまでを送
信し、第２パケットが紛失した場合、受信バッファに
は、第１パケットの次に第３パケットが蓄積されるた
め、これを再生すると、音飛びが発生する。この音飛び
は、通話品質を低下させるだけでなく、情報を欠落させ
るため通話内容が不明瞭になる。There are other adverse effects on packet loss. For example, if the first packet to the third packet are transmitted, and the second packet is lost, the reception buffer stores the third packet after the first packet. appear. This skipping not only lowers the call quality, but also causes the information to be lost, so that the contents of the call become unclear.

【０００４】これらの問題を改善する方法として、特開
平１０−２８５２１３号公報（文献１）には、受信バッ
ファが空になってもパケットが到着しない場合に限り、
送信パケットと同じサイズのダミーパケットを再生する
方法が示されている。文献１には、これによって、音の
途切れを回避できると記述されている。As a method for solving these problems, Japanese Patent Laid-Open Publication No. Hei 10-285213 (Document 1) discloses a method in which a packet does not arrive even if a reception buffer becomes empty.
A method for reproducing a dummy packet having the same size as a transmission packet is shown. Document 1 describes that this can avoid interruption of sound.

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、前記文
献１に示された方法では、受信バッファが空になって初
めてダミーパケットを再生するため、パケットロスが発
生しても受信バッファが空にならない場合は、従来と同
様、音飛びが発生する。However, according to the method disclosed in the above document 1, dummy packets are reproduced only when the reception buffer becomes empty, so that even if a packet loss occurs, the reception buffer does not become empty. Causes sound skipping as in the prior art.

【０００６】ネットワークのトラフィック状況や受信バ
ッファの容量などの諸条件により、パケットロスが発生
しなくても受信バッファが空になることもある反面、パ
ケットロスが発生しても受信バッファが空にならないこ
ともあり得るからである。[0006] Depending on various conditions such as network traffic conditions and the capacity of the receiving buffer, the receiving buffer may be empty even if no packet loss occurs, but the receiving buffer does not become empty even if a packet loss occurs. It is possible.

【０００７】受信バッファの容量を大きくすれば、パケ
ットロスが発生しなくても受信バッファが空になる可能
性を低減することができるものの、受信バッファの容量
を大きくすることは同時に、パケットロスが発生しても
受信バッファが空にならない可能性を高めることを意味
する。[0007] Increasing the capacity of the receiving buffer can reduce the possibility that the receiving buffer will be empty even if no packet loss occurs, but increasing the capacity of the receiving buffer simultaneously reduces the packet loss. This means increasing the possibility that the receiving buffer will not be empty even if it occurs.

【０００８】[0008]

【課題を解決するための手段】かかる課題を解決するた
めに、本発明は、有効な音声データを収容した各通信パ
ケットに送信側で付与された識別子をもとに、時系列に
受信される各通信パケットの受信が正常に行われている
どうかを検出し、当該検出結果に基づいて、受信された
通信パケット系列の中に、有効な音声データを収容して
いない所定の補正パケットの挿入を行うかどうかを決定
するパケット受信装置において、受信した前記通信パケ
ットのサイズである通信パケットサイズを調べる通信パ
ケットサイズ検出手段と、当該通信パケットサイズに関
する閾値を設定しておき、少なくとも通信パケットサイ
ズが当該閾値を上回った場合には、前記補正パケットの
サイズとして、通信パケットサイズよりも小さなサイズ
を選択する補正パケット挿入手段とを備えることを特徴
とする。In order to solve such a problem, the present invention provides a method for receiving a communication packet containing valid voice data in a time-series manner based on an identifier given by a transmission side to each communication packet. It detects whether each communication packet is normally received, and inserts a predetermined correction packet that does not contain valid voice data into the received communication packet sequence based on the detection result. In the packet receiving device for determining whether to perform the communication packet size, a communication packet size detecting means for checking a communication packet size, which is the size of the received communication packet, and a threshold value for the communication packet size are set, and at least the communication packet size is If the threshold value is exceeded, a correction packet for selecting a size smaller than the communication packet size as the size of the correction packet. Characterized in that it comprises a Tsu preparative insertion means.

【０００９】[0009]

【発明の実施の形態】（Ａ）実施形態以下、本発明のパケット受信装置の実施形態について説
明する。DESCRIPTION OF THE PREFERRED EMBODIMENTS (A) Embodiment An embodiment of a packet receiving apparatus according to the present invention will be described below.

【００１０】第１〜第３の実施形態に共通する特徴は、
受信したパケットのサイズ（１つの受信パケットに含ま
れるフレーム数）が小さい場合には当該パケットと同じ
サイズのエラーパケットを挿入し、受信パケットのサイ
ズが大きい場合には当該パケットよりもサイズの小さい
エラーパケットを挿入することで、エラーパケット部分
の音質劣化を防止する点にある。The features common to the first to third embodiments are as follows.
If the size of the received packet (the number of frames included in one received packet) is small, an error packet having the same size as the packet is inserted. If the size of the received packet is large, an error packet smaller than the packet is inserted. By inserting a packet, the sound quality of the error packet portion is prevented from deteriorating.

【００１１】（Ａ−１）第１の実施形態の構成本実施形態の受信装置２０の構成を図７に示す。(A-1) Configuration of First Embodiment FIG. 7 shows the configuration of a receiving apparatus 20 of the present embodiment.

【００１２】図７において、受信装置２０は、受信端１
と、パケット監視部２と、受信バッファ３と、復号部４
と、出力端５とを備えている。In FIG. 7, a receiving apparatus 20 includes a receiving end 1.
, Packet monitoring unit 2, reception buffer 3, decoding unit 4
And an output terminal 5.

【００１３】受信端１は、図示しない送信側装置から送
信された符号化音声データを収容したパケットを、ネッ
トワークを介して受信し、パケット監視部２に渡す機能
を有する。以下では、送信側装置から送信され、ネット
ワークを介して受信装置２０（あるいは受信端１）に受
信されたパケットを受信パケットと呼ぶ。The receiving end 1 has a function of receiving a packet containing coded voice data transmitted from a transmitting side device (not shown) via a network and passing the packet to a packet monitoring unit 2. Hereinafter, a packet transmitted from the transmitting apparatus and received by the receiving apparatus 20 (or the receiving end 1) via the network is referred to as a received packet.

【００１４】パケット監視部２は、タイムアウト時間の
設定、監視機能、受信パケット到着順の逆転監視機能
と、エラーフレーム生成挿入機能と、受信パケット廃棄
機能を有する部分で、その内部構成は図１に示すもので
ある。The packet monitoring unit 2 has a function of setting and monitoring a time-out period, a function of monitoring the reverse of the order of arrival of received packets, a function of generating and inserting an error frame, and a function of discarding received packets. It is shown.

【００１５】タイムアウト時間や受信パケット到着順の
逆転を監視するためには、その前提として、パケット監
視部２が、受信した各受信パケットを識別する必要があ
る。この識別には、前記送信側装置が、各受信パケット
に付与したパケット識別情報を利用する。パケット識別
情報とは、例えば、受信パケットのシーケンス番号（パ
ケット番号）、受信パケットが前記送信側装置から送信
された時刻や受信パケットが送信側装置において生成さ
れた時刻を示すタイムスタンプ情報なとであってよい。In order to monitor the timeout period and the reverse of the order of arrival of the received packets, the packet monitoring unit 2 needs to identify each received packet as a prerequisite. For this identification, the transmitting side apparatus uses packet identification information given to each received packet. The packet identification information includes, for example, a sequence number (packet number) of a received packet, time stamp information indicating a time at which the received packet was transmitted from the transmitting device and a time at which the received packet was generated at the transmitting device. May be.

【００１６】当該パケット監視部２の内部構成を図１に
示す。FIG. 1 shows the internal configuration of the packet monitoring unit 2.

【００１７】(Ａ−１−１）パケット監視部２の内部構
成図１において、パケット監視部２は、受信正常性検出部
１１と、パケットサイズ検出部１２と、しきい値設定部
１３と、エラーパケットサイズ選択部１４と、エラーパ
ケット挿入部１５とを備えている。(A-1-1) Internal Configuration of Packet Monitoring Unit 2 In FIG. 1, the packet monitoring unit 2 includes a reception normality detection unit 11, a packet size detection unit 12, a threshold setting unit 13, An error packet size selection unit 14 and an error packet insertion unit 15 are provided.

【００１８】このうち受信正常性検出部１１は、上述し
たシーケンス番号やタイムスタンプなどをもとに、送信
側装置が送信したパケットが欠落も順序の逆転も無く、
一定のタイムアウト時間内に、受信パケットとして正常
に受信されているかどうかを検出する部分で、受信され
た各受信パケットに欠落、順序の逆転、タイムアウトな
どが検出された場合には、異常受信信号ＡＳをパケット
サイズ検出部１２に供給する。The reception normality detector 11 detects the packet transmitted by the transmitting apparatus based on the above-mentioned sequence number and time stamp without any loss or reversal of the packet.
In a portion for detecting whether or not a received packet is normally received within a certain timeout period, if a missing, reversed order, or timeout is detected in each received packet, an abnormal reception signal AS Is supplied to the packet size detector 12.

【００１９】パケット番号やタイムアウト時間、タイム
スタンプ情報は、受信パケットの欠落や、受信パケット
到着順の逆転を監視するのが目的であり、これらが検知
できるものならば、いずれも利用できる。例えば、通信
プロトコルの一種である、ＴＣＰ／ＩＰ（Transmission
Control Protocol／Internet Protocol）のシーケンス
番号情報や、ＵＤＰ／ＩＰ（User Datagram Protocol／
Internet Protocol）のＲＴＰ（Realtime Transport Pr
otocol）におけるシーケンス番号やタイムスタンプ情報
などが利用できる。The purpose of the packet number, the timeout time, and the time stamp information is to monitor the loss of the received packet and the reversal of the order of arrival of the received packet, and any information can be used as long as they can be detected. For example, TCP / IP (Transmission
Control Protocol / Internet Protocol (Sequence Number) information, UDP / IP (User Datagram Protocol /
Internet Protocol (RTP) RTP (Realtime Transport Pr
otocol) can be used.

【００２０】具体的には、受信側（当該受信正常性検出
部１１）が、受信パケットのシーケンス番号を監視し、
今回受信パケットのシーケンス番号が、前回受信パケッ
トのシーケンス番号をインクリメントして得られる番号
（前回受信パケットのシーケンス番号＋１）でない場合
は、パケットロス、あるいは、受信パケット到着順の逆
転が起きたものと、前記受信正常性検出部１１が判定す
ることが考えられる。More specifically, the receiving side (the reception normality detecting section 11) monitors the sequence number of the received packet,
If the sequence number of the currently received packet is not the number obtained by incrementing the sequence number of the previously received packet (the sequence number of the previously received packet + 1), it is assumed that packet loss or inversion of the order of arrival of the received packet has occurred. It is conceivable that the reception normality detector 11 makes the determination.

【００２１】また、前回受信パケットのタイムスタンプ
が１６時５２分、今回受信パケットのタイムスタンプが
１６時４０分のような場合、今回受信パケットの方が前
回受信パケットより先に送出されたにも関わらず、今回
受信パケットの方が後に到着しているので、受信パケッ
ト到着順の逆転が発生したと、当該受信正常性検出部１
１が判定することが考えられる。In the case where the time stamp of the previous received packet is 16:52 and the time stamp of the current received packet is 16:40, even if the currently received packet is transmitted earlier than the previous received packet. Regardless, since the currently received packet arrives later, the reception normality detecting unit 1 determines that the order of arrival of the received packet has been reversed.
It is possible that 1 determines.

【００２２】これらの方法は、あくまで一例であり、こ
れに限らず、独自の検知情報を送信側装置で各受信パケ
ットに付加して送信させ、当該検知情報に応じた判定を
受信正常性検出部１１が行うようにしてもよいことは言
うまでもない。These methods are merely examples, and the present invention is not limited to this method. The transmitting apparatus adds unique detection information to each received packet and causes the packet to be transmitted. Needless to say, 11 may be performed.

【００２３】受信正常性検出部１１が出力した異常信号
信号ＡＳを受け取ると、パケットサイズ検出部１２は、
受信している各受信パケットのサイズを調べて、当該サ
イズを、サイズ情報ＳＩとしてエラーパケットサイズ選
択部１４に供給する機能を備えている。When receiving the abnormal signal AS output from the reception normality detecting section 11, the packet size detecting section 12
It has a function of checking the size of each received packet being received and supplying the size to the error packet size selection unit 14 as size information SI.

【００２４】エラーパケットサイズ選択部１４は、受信
パケットに関するしきい値ＴＨの設定を、しきい値設定
部１３から受けており、前記サイズ情報ＳＩと当該しき
い値ＴＨとを比較してサイズ決定信号ＤＳをエラーパケ
ット挿入部１５に出力する部分である。The error packet size selection unit 14 receives the setting of the threshold value TH for the received packet from the threshold value setting unit 13, and determines the size by comparing the size information SI with the threshold value TH. This is a part that outputs the signal DS to the error packet insertion unit 15.

【００２５】エラーパケット挿入部１５は、このサイズ
決定信号ＤＳに応じて規定されるサイズのエラーパケッ
トを、受信パケット系列のなかに挿入する部分である。The error packet inserting section 15 is a section for inserting an error packet having a size defined according to the size determination signal DS into a received packet sequence.

【００２６】エラーパケットとは、エラーフレームを複
数まとめたものである。エラーフレームとは、例えば、
後述するフレーム消去の隠蔽機能で利用されるパケット
で、聴覚特性上、フレームデータの欠損を補完するとと
もに、受信バッファ３中に蓄積されるデータ量（キュー
長）の損失を補填して以降に受信され蓄積される受信パ
ケットの遅延時間（受信装置２０内におけるゆらぎ、す
なわち受信パケット間隔）を制御することを目的として
作られるデータである。An error packet is a collection of a plurality of error frames. An error frame is, for example,
A packet used in the concealment function of frame erasure, which will be described later, compensates for loss of frame data due to auditory characteristics, and compensates for loss in the amount of data (queue length) accumulated in the reception buffer 3 for subsequent reception. This is data created for the purpose of controlling the delay time (fluctuation in the receiving device 20, that is, the receiving packet interval) of the received packets to be stored.

【００２７】エラーフレーム自体の構造については、例
えば、白色雑音、有色雑音、前フレームのデータを模倣
してエラーフレームのデータを作成する方法、前フレー
ムのデータから推測してエラーフレームのデータを作成
する方法などが使用できる。前フレームのデータを模倣
する方法は、例えば、後述するＧ．７２３．１やＧ７２
９に用いられている無音圧縮技術などが利用できる。ま
た、前フレームのデータを模倣する方法は、線形予測
法、適応予測法、ニューラルネットワークによる予測な
ど、様々な方法が利用できる。これらの模倣方法や、予
測方法は、ここに記述したものに限定するものではな
く、独自の方法によっても、本実施形態の効果を得るこ
とが可能であることは言うまでもない。For the structure of the error frame itself, for example, a method of creating error frame data by imitating white noise, colored noise, and data of the previous frame, and creating error frame data by inferring from the data of the previous frame Can be used. A method of imitating the data of the previous frame is described in, for example, G.11. 723.1 and G72
9 can be used. Further, as a method of imitating the data of the previous frame, various methods such as a linear prediction method, an adaptive prediction method, and a prediction using a neural network can be used. These imitation methods and prediction methods are not limited to those described here, and it goes without saying that the effects of the present embodiment can be obtained by a unique method.

【００２８】受信されるパケット系列に対するエラーパ
ケットの挿入は、例えば、図７の受信バッファ（先入れ
先出しタイプのメモリ：ＦＩＦＯ）３に蓄積された最後
尾の受信パケット（符号化音声データ）の後に、当該エ
ラーパケットを書き込むことによって行う。The insertion of the error packet into the received packet sequence is performed, for example, after the last received packet (coded audio data) stored in the receiving buffer (first-in first-out type memory: FIFO) 3 in FIG. This is done by writing an error packet.

【００２９】前記復号部４は、受信バッファ３から読み
出された受信パケット（またはエラーパケット）の持つ
音声データを復号する機能のほかに、フレーム消去の隠
蔽機能を装備している。The decoding unit 4 has a function of decoding the audio data of the received packet (or error packet) read from the reception buffer 3 and a function of concealing the frame erasure.

【００３０】フレーム消去の隠蔽機能とは、復号しよう
とするパケット系列のなかにエラーパケットを検出した
場合、当該エラーパケット部分では、その前後の受信パ
ケットの持っている音声データに基づいて復号部４自身
が出力音声を合成する機能である。この機能によって、
エラーパケット部分で有効な音声データが失われている
ことが音声出力に与える影響を、可及的に低減すること
ができる。The concealment function of frame erasure is that, when an error packet is detected in a packet sequence to be decoded, the error packet portion detects the error packet based on the audio data of the previous and next received packets. This is a function for synthesizing output speech by itself. With this feature,
The effect on the audio output that the valid audio data is lost in the error packet portion can be reduced as much as possible.

【００３１】ここで、音声の符号化、復号方式として
は、例えば、線形ＰＣＭ、ＩＴＵ−Ｔ勧告のＧ．７１１
（μ則ＰＣＭ）、Ｇ．７２６（ＡＤＰＣＭ）、Ｇ．７２
３．１、Ｇ．７２９（ＣＳ−ＡＣＥＬＰ）といった方式
や、ＣＥＬＰ（Code Excited Linear Prediction：符号
励振線形予測）符号化方式、リニアＰＣＭ方式などが利
用できる。また、独自仕様の符号化方式でも本実施形態
の効果が得られることは言うまでもなく、符号化・復号
方式は特に限定するものではない。Here, the coding and decoding schemes of speech include, for example, linear PCM and ITU-T recommended G.264. 711
(Μ law PCM); 726 (ADPCM); 72
3.1, G. 729 (CS-ACELP), CELP (Code Excited Linear Prediction) coding, linear PCM, and the like. In addition, it goes without saying that the effects of the present embodiment can be obtained even with an encoding system of a unique specification, and the encoding / decoding system is not particularly limited.

【００３２】符号化、復号方式は、符号化時、復号時の
１フレームの時間に影響を与える。例えば、上記のＧ．
７２９方式では、１フレームが１０ｍｓ（０．０１秒）
であり、Ｇ．７２３．１方式では、フレームが３０ｍｓ
（０．０３秒）である。また、１パケットに含まれるフ
レーム数（パケットサイズ、パケット長と呼ぶ）も様々
あり、こちらは使用する装置によって任意に決定するこ
とができる。ただし、通常、符号化音声データを複数フ
レーム繋げたものに、通信方式固有の情報であるヘッダ
が付属して１つのパケットを構成する。このため、後述
するように、受信パケットのパケットサイズを小さくす
ると遅延時間が少なくなるかわりに、より高速な通信速
度を要求され、逆に、受信パケットのパケットサイズを
大きくすると通信速度は低減できるが、パケットロス時
の影響や、遅延時間が増加する傾向となる。The encoding and decoding methods affect the time of one frame during encoding and decoding. For example, in the above G.P.
In the 729 system, one frame is 10 ms (0.01 seconds)
G. In the 723.1 system, the frame is 30 ms.
(0.03 seconds). The number of frames included in one packet (called a packet size and a packet length) also varies, and this can be arbitrarily determined depending on the device used. However, one packet is usually formed by connecting coded audio data to a plurality of frames and adding a header which is information unique to the communication system. For this reason, as will be described later, if the packet size of the received packet is reduced, a higher communication speed is required instead of reducing the delay time. Conversely, if the packet size of the received packet is increased, the communication speed can be reduced. However, the influence of packet loss and the delay time tend to increase.

【００３３】前記復号部４から復号結果を受け取る出力
端５は、当該復号結果に応じて音声データを再生する機
能を持つ部分で、例えば音声出力を行うスピーカなどで
あってよい。The output terminal 5 receiving the decoding result from the decoding unit 4 has a function of reproducing audio data in accordance with the decoding result, and may be, for example, a speaker for outputting audio.

【００３４】以下、上記のような構成を有する本実施形
態の動作について説明する。The operation of the present embodiment having the above configuration will be described below.

【００３５】（Ａ−２）第１の実施形態の動作図２のフローチャートに、前記パケット監視部２の動作
を示す。このフローチャートは前記受信正常性としてタ
イムアウト（Time Out）を検出する場合に対応してお
り、Ｓ１〜Ｓ５の各ステップから構成されている。各受
信パケットに対するタイムアウト時間の進行が開始され
るたびに、処理は最初のステップＳ１からスタートす
る。(A-2) Operation of the First Embodiment The operation of the packet monitoring unit 2 is shown in the flowchart of FIG. This flowchart corresponds to a case where a timeout (Time Out) is detected as the reception normality, and includes steps S1 to S5. The process starts from the first step S1 each time the progress of the timeout period for each received packet is started.

【００３６】パケット監視部２内の受信正常性検出部１
１は、各受信パケットの受信処理に応じて受信端１から
供給される信号（Ｓ１）を常に監視している（Ｓ２）。The reception normality detector 1 in the packet monitor 2
1 constantly monitors the signal (S1) supplied from the receiving end 1 in accordance with the reception processing of each received packet (S2).

【００３７】タイムアウト時間内においては、受信パケ
ットの到着（受信）を示す信号が受信正常性検出部１１
に供給されていない期間、ステップＳ２は、受信パケッ
トが受信されていないことを示すＮｏ側の分岐を継続す
るとともに、ステップＳ４では、タイムアウト時間まで
タイマ進行を認識しつつ、Ｎｏ側の分岐を選択する。Within the timeout period, a signal indicating the arrival (reception) of the received packet is received by the reception normality detector 11.
In step S2, the branch on the No side indicating that the received packet is not received is continued, and in step S4, the branch on the No side is selected while recognizing that the timer has advanced until the timeout period. I do.

【００３８】当該タイムアウト時間内のいずれかの時点
に目的のパケット識別情報を付与された受信パケットが
受信されたことを示す信号が受信端１から供給されると
（Ｓ１）、ステップＳ２はＹｅｓ側に分岐して受信バッ
ファ３に当該受信パケット（またはその受信パケットに
収容された符号化音声データ）を受信バッファ３に入力
する（Ｓ３）。このとき前記タイマはリセットされて次
のタイムアウト時間の監視状態に移行する。When a signal indicating that the received packet to which the target packet identification information has been received has been received from the receiving end 1 at any time within the timeout period (S1), step S2 is performed on the Yes side. Then, the received packet (or the encoded voice data contained in the received packet) is input to the reception buffer 3 (S3). At this time, the timer is reset and shifts to the monitoring state for the next timeout period.

【００３９】一方、目的の受信パケットの受信を示す信
号がタイムアウト時間までに受信正常性検出部１１に供
給されなかった場合は、ステップＳ２は、当該タイムア
ウト時間内にＹｅｓ側に分岐することがなく、受信パケ
ットが受信されていないことを示すＮｏ側の分岐を継続
する。On the other hand, if the signal indicating the reception of the target received packet has not been supplied to the reception normality detecting unit 11 before the timeout period, the step S2 does not branch to the Yes side within the timeout period. , The branch on the No side indicating that the received packet is not received is continued.

【００４０】そしてステップＳ２のＮｏ側の分岐の継続
状態がタイムアウト時間の経過時点まで達したことを検
出したとき、前記受信正常性検出部１１は異常受信信号
ＡＳをパケットサイズ検出部１２に供給し、ステップＳ
４はＹｅｓ側に分岐してステップＳ５のエラーパケット
の生成処理が実行される。When detecting that the continuation state of the branch on the No side in step S2 has reached the time point of the elapse of the timeout period, the reception normality detection unit 11 supplies the abnormal reception signal AS to the packet size detection unit 12. , Step S
4 branches to the Yes side to execute the error packet generation processing in step S5.

【００４１】エラーパケットの生成処理（Ｓ５）の内部
では、次のような処理が実行される。In the error packet generation processing (S5), the following processing is executed.

【００４２】最初に、当該異常受信信号ＡＳを受け取っ
た前記パケットサイズ検出部１２が例えば当該タイムア
ウトと判定された受信パケットの直前に受信された受信
パケットのサイズを検出して当該サイズに応じたサイズ
情報ＳＩをエラーパケットサイズ選択部１４に供給す
る。First, the packet size detection unit 12 which has received the abnormal reception signal AS detects, for example, the size of a reception packet received immediately before the reception packet determined to be timed out, and determines a size corresponding to the size. The information SI is supplied to the error packet size selection unit 14.

【００４３】次に、エラーパケットサイズ選択部１４は
当該サイズ情報ＳＩと前記しきい値設定部１３から供給
されるしきい値ＴＨとを比較し、サイズ情報ＳＩの方が
しきい値ＴＨよりも大きい場合には、次の式（５）に基
づいてエラーパケットのサイズを決定する。すなわちこ
のとき、エラーパケットサイズは、ＴＨが指定するサイ
ズ以上で、なおかつＳＩが指定するサイズよりも小さ
い。Next, the error packet size selection unit 14 compares the size information SI with the threshold value TH supplied from the threshold value setting unit 13, and the size information SI is larger than the threshold value TH. If it is larger, the size of the error packet is determined based on the following equation (5). That is, at this time, the error packet size is equal to or larger than the size specified by TH and smaller than the size specified by SI.

【００４４】ＴＨ≦エラーパケットのサイズ＜ＳＩ …（５）反対に、サイズ情報ＳＩの方がしきい値ＴＨよりも小さ
い場合には、次の式（６）に基づいてエラーパケットの
サイズをＳＩが指定するサイズと同サイズに決定する。TH ≦ error packet size <SI (5) On the other hand, if the size information SI is smaller than the threshold value TH, the error packet size is set based on the following equation (6). Is determined to be the same as the size specified by.

【００４５】エラーパケットのサイズ＝ＳＩ …（６）最後に、これら式（５）または（６）に基づいて決定さ
れたエラーパケットサイズを指定するサイズ決定信号Ｄ
Ｓがエラーパケット挿入部１５に供給されると、エラー
パケット挿入部１５は、当該サイズのエラーパケットを
受信バッファ３に書き込むことによってエラーパケット
の挿入を実行する。Error packet size = SI (6) Finally, a size determination signal D that specifies the error packet size determined based on these equations (5) or (6)
When S is supplied to the error packet inserting unit 15, the error packet inserting unit 15 inserts the error packet by writing the error packet of the size into the reception buffer 3.

【００４６】このとき、サイズ決定信号ＤＳの状態は大
きく分けると、式（５）に対応する状態と式（６）に対
応する状態の２通りの状態を取り得るが、式（６）に対
応するサイズ決定信号ＤＳに対しては、エラーパケット
挿入部１５は、挿入するエラーパケットのサイズを受信
パケットのサイズ（ＳＩに対応）と同じとすればよい。At this time, the state of the size determination signal DS can be roughly divided into two states, a state corresponding to the equation (5) and a state corresponding to the equation (6). For the size determination signal DS to be performed, the error packet inserting unit 15 may set the size of the error packet to be inserted to be the same as the size of the received packet (corresponding to SI).

【００４７】エラーパケット挿入部１５によって挿入さ
れるエラーパケットのサイズは、受信される各受信パケ
ットのサイズが通信中に変化しない場合、通信開始時に
決定し、通信毎に変更することも可能である。勿論、受
信パケットのサイズが常に一定の場合は、本方式を搭載
した装置を設置した時点で、エラーパケットのサイズを
決定し、以後、不変としてもよい。この場合、前記パケ
ットサイズのしきい値ＴＨは、装置の設置時点でだけ参
照される。If the size of each received packet does not change during communication, the size of the error packet inserted by the error packet insertion unit 15 is determined at the start of communication, and can be changed for each communication. . Of course, if the size of the received packet is always constant, the size of the error packet may be determined at the time of installing the device equipped with the present method, and may be made constant thereafter. In this case, the threshold value TH of the packet size is referred to only when the device is installed.

【００４８】以上のような図２のフローチャートの動作
は、各受信パケットごと、各タイムアウト時間ごとに繰
り返される。The operation of the flowchart of FIG. 2 described above is repeated for each received packet and for each timeout period.

【００４９】最終的に、出力端５から出力される音声出
力の品質を保証するために重要な機能を持つ前記しきい
値ＴＨについて、図３を参照しながら説明する。Finally, the threshold value TH having an important function for guaranteeing the quality of the audio output from the output terminal 5 will be described with reference to FIG.

【００５０】図３は、受信装置２０で受信パケットのパ
ケットサイズＳＩと同サイズのエラーパケットを挿入し
た場合の、パケットサイズＳＩと音質評価ＱＶの関係を
示す。FIG. 3 shows the relationship between the packet size SI and the sound quality evaluation QV when the receiving device 20 inserts an error packet having the same size as the packet size SI of the received packet.

【００５１】図３において、サイズＳＩ２などの大きな
受信パケットをパケットロスなどによって失い、その部
分に同サイズのエラーパケットを挿入すると、音質評価
値はＱＶ２などの低い値となって、音質劣化が著しくな
ることが分かる。これは上述したフレーム消去の隠蔽機
能の限界に対応し、有効な音声データがない部分（エラ
ーパケット区間）が時間的に長くなるほど、合成音声が
人間の聴覚特性に作用して、品質の劣化をもたらす程度
が著しくなるためであるとも考えられる。In FIG. 3, when a large received packet such as the size SI2 is lost due to packet loss or the like and an error packet of the same size is inserted in that portion, the sound quality evaluation value becomes a low value such as QV2, and the sound quality is significantly deteriorated. It turns out that it becomes. This corresponds to the limit of the concealment function of the frame erasure described above, and the longer the portion without valid voice data (error packet section) becomes longer in time, the more the synthesized voice affects the human auditory characteristics, and the lower the quality is. It is also thought that this is because the degree of effect is significant.

【００５２】反対に、サイズＳＩ２などの非常に小さな
受信パケットの場合には、同サイズのエラーパケットを
挿入しても、音質評価値はＱＶ１などの極めて良好な値
となる。On the other hand, in the case of a very small reception packet such as the size SI2, the sound quality evaluation value becomes an extremely good value such as QV1 even if an error packet of the same size is inserted.

【００５３】ところが、サイズＳＩをあまり小さくする
と、オーバーヘッドの割合が増加するために、サイズＳ
Ｉを小さくする前と同じ量の音声データを単位時間内に
送信するには、必要な通信速度が上昇してしまう。However, if the size SI is too small, the ratio of overhead increases, so that the size S
In order to transmit the same amount of voice data within a unit time as before I was reduced, the required communication speed would increase.

【００５４】なぜなら、音声のパケット通信では、符号
化された音声データに様々なヘッダ情報が付加されてパ
ケットを構成するが、通常このヘッダ情報はパケットサ
イズの大小にかかわらず固定サイズであるから、パケッ
トサイズを小さくすることは、全通信データの中に占め
る音声データの割合を減少することに他ならないためで
ある。This is because, in voice packet communication, a packet is formed by adding various header information to encoded voice data. However, since this header information is usually of a fixed size regardless of the packet size, This is because reducing the packet size is nothing less than reducing the proportion of voice data in all communication data.

【００５５】例えば、ヘッダ情報サイズを１０とし、パ
ケットサイズを１００、送信する全音声データサイズを
９０００と仮定した場合を例に取ると、必要なパケット
数は次の式(１）、（２）より、１００となる。For example, assuming that the header information size is 10, the packet size is 100, and the total audio data size to be transmitted is 9000, the required number of packets is expressed by the following equations (1) and (2). Thus, it becomes 100.

【００５６】１０（ヘッダ）＋ｎ（音声データ量）＝１００（パケットサイズ） …（１）これにより、ｎ＝９０（１パケットに入る音声データ量）よって、必要なパケット数は、９０００／９０＝１００ …(２）同じ前提の下で、パケットサイズだけを半分の５０にし
た場合は、同じく９０００の音声データを送るために必
要なパケット数は、次式（３）、（４）から、２２５パ
ケットであることが分かる。10 (header) + n (sound data amount) = 100 (packet size) (1) Thus, n = 90 (sound data amount included in one packet) Therefore, the required number of packets is 9000/90 = 100 (2) Under the same premise, if only the packet size is halved to 50, the number of packets required to transmit 9000 voice data is calculated from the following equations (3) and (4) as follows: It turns out that it is a packet.

【００５７】１０（ヘッダ）＋ｎ（音声データ量）＝５０（パケットサイズ） …( ３）これにより、ｎ＝４０（１パケットに入る音声データ量）よって、必要なパケット数は、９０００／４０＝２２５ …（４）両パケットサイズとも送信する符号化音声の総データ量
は同一であるため、受信側で音声出力される時間は、同
じ長さである。10 (header) + n (sound data amount) = 50 (packet size) (3) Thus, n = 40 (sound data amount included in one packet) Therefore, the number of necessary packets is 9000/40 = 225 (4) Since the total data amount of the coded voice to be transmitted is the same for both packet sizes, the time for voice output on the receiving side is the same length.

【００５８】したがって、式(２）の１００パケットと
式(４）の２２５パケットは、同じ時間で送信し、受信
する必要があるので、パケットサイズ５０で音声通信す
る場合は、パケットサイズ１００で音声通信する場合に
比べ、データサイズ当たりで１，１２５倍、パケット数
当たりで２，２５倍の通信速度が必要となる。Therefore, it is necessary to transmit and receive 100 packets of equation (2) and 225 packets of equation (4) at the same time. As compared with the case of communication, a communication speed of 1,125 times per data size and 225 times per packet number is required.

【００５９】一般的に通信速度の上昇は、送信側装置、
受信側装置および通信網のコスト増加を招くばかりでな
く、十分な通信速度が得られない通信網等を使用した場
合には、激しい音質劣化をまねくため、好ましくない。
また、通信速度の上昇にはハードウエア的な限界がある
ため、算出された設計値を常に実現できるとは限らず、
実現できたとしても、送信側装置も受信側装置も高速な
ものほど大規模で高価になる傾向がある。In general, an increase in communication speed is caused by a transmitting device,
In addition to increasing the cost of the receiving device and the communication network, the use of a communication network or the like for which a sufficient communication speed cannot be obtained is not preferable because it causes severe deterioration in sound quality.
In addition, since the increase in communication speed has a hardware limitation, the calculated design value cannot always be realized,
Even if it can be realized, there is a tendency that the faster the transmitting device and the receiving device, the larger and the more expensive the device.

【００６０】結局、受信パケットのパケットサイズを小
さくして通話品質劣化現象を防ぐことは、困難や不利益
を伴い、一定の限界がある。As a result, it is difficult and disadvantageous to reduce the packet size of the received packet to prevent the communication quality deterioration phenomenon, and there is a certain limit.

【００６１】したがって、受信パケットのサイズＳＩに
は特に制限を設けず、エラーパケットのサイズには制限
を設けるために、図示したサイズＳＩＸの付近にしきい
値ＴＨを選定するのがよいと考えられる。Therefore, it is considered that it is better to select the threshold value TH near the illustrated size SIX in order to set no limit on the size SI of the received packet and to limit the size of the error packet.

【００６２】サイズＳＩＸはサイズＳＩ１に比べるとか
なり大きいが、その割には当該音質評価値ＱＶＸは良好
で、前記ＱＶ１とほとんど同じレベルである。Although the size SIX is considerably larger than the size SI1, the sound quality evaluation value QVX is good, and the level is almost the same as that of the QV1.

【００６３】（Ａ−３）第１の実施形態の効果以上、本実施形態によれば、受信パケットのパケットサ
イズを小さくして通信速度を上昇しなくても、パケット
ロス時、パケット到着順の逆転時、受信タイムアウト時
などに、エラーパケット挿入部分で生じていた音飛びな
どによる著しい音質劣化を防止することができ、データ
量の損失を補填して以降に受信される受信パケットの遅
延時間を制御することも可能である。(A-3) Effect of the First Embodiment As described above, according to the present embodiment, even if the packet size of the received packet is reduced and the communication speed is not increased, the packet arrival order in the packet arrival order can be reduced. At the time of reversal, reception time-out, etc., it is possible to prevent remarkable sound quality deterioration due to skipping that occurred at the error packet insertion part, and to compensate for the loss of data amount, and to reduce the delay time of received packets received thereafter. It is also possible to control.

【００６４】また本実施形態では、受信バッファが空に
ならなくても、受信正常性検出部が異常受信信号を出力
した場合にはエラーパケットを挿入するため、音飛びの
発生を阻止できる可能性が高まる。Further, in this embodiment, even if the reception buffer does not become empty, an error packet is inserted when the reception normality detection section outputs an abnormal reception signal. Increase.

【００６５】（Ｂ）第２の実施形態以下では、本実施形態が第１の実施形態と相違する点に
ついてのみ説明する。(B) Second Embodiment In the following, only the points of this embodiment different from the first embodiment will be described.

【００６６】第１の実施形態では、正常な受信状態が継
続される場合には時系列に受信される各受信パケットの
パケットサイズに応じて、ある受信パケットとその次の
受信パケットは順次に本来の遅延間隔（受信パケット間
隔）で受信バッファ３内に蓄積されてキュー（待ち行
列）を形成するが、前記式（５）に対応したサイズ決定
信号ＤＳが出力された場合には、挿入されるエラーパケ
ットのサイズは、本来の受信パケットのサイズよりも小
さいため、挿入以降に受信される受信パケットと、挿入
された当該エラーパケットの直前の受信パケットとの間
隔は、本来よりも短くなってしまう。In the first embodiment, when a normal reception state is continued, a certain reception packet and the next reception packet are sequentially transmitted according to the packet size of each reception packet received in time series. Is accumulated in the reception buffer 3 at the delay interval (reception packet interval), and forms a queue (queue). When the size determination signal DS corresponding to the equation (5) is output, it is inserted. Since the size of the error packet is smaller than the size of the original received packet, the interval between the received packet received after insertion and the received packet immediately before the inserted error packet becomes shorter than originally expected. .

【００６７】本実施形態は、第１の実施形態において必
ずしも十分でなかった遅延時間の制御機能の精度を向上
したことを特徴とする。This embodiment is characterized in that the accuracy of the delay time control function, which is not always sufficient in the first embodiment, is improved.

【００６８】（Ｂ−２）第２の実施形態の構成および動
作本実施形態の受信装置２１の構成を図４に示す。この受
信装置２１は基本的に、第１の実施形態の受信装置２０
に対応する装置である。(B-2) Configuration and Operation of the Second Embodiment FIG. 4 shows the configuration of the receiving apparatus 21 of the present embodiment. This receiving device 21 is basically the receiving device 20 of the first embodiment.
It is a device corresponding to.

【００６９】図４において、受信装置２１は、受信端１
と、パケット監視部２と、受信バッファ３と、復号部４
と、出力端５と、残フレーム制御部６とを備えている。In FIG. 4, the receiving device 21 is
, Packet monitoring unit 2, reception buffer 3, decoding unit 4
, An output terminal 5 and a remaining frame control unit 6.

【００７０】このうち残フレーム制御部６以外の構成要
素１〜５の機能は、実質的に、同一の符号を付した図７
の各部と同じである。The functions of the components 1 to 5 other than the remaining frame control unit 6 are substantially the same as those in FIG.
It is the same as each part.

【００７１】図８には、パケット監視部２と当該残フレ
ーム制御部６の内部構成を示している。FIG. 8 shows the internal configuration of the packet monitoring unit 2 and the remaining frame control unit 6.

【００７２】ここで、パケット監視部２を構成する受信
正常性検出部１１、パケットサイズ検出部１２、しきい
値設定部１３、エラーパケットサイズ選択部１４、エラ
ーパケット挿入部１５の機能は、実質的に、同一の符号
を付した図１の各部と同じである。Here, the functions of the reception normality detection unit 11, packet size detection unit 12, threshold value setting unit 13, error packet size selection unit 14, and error packet insertion unit 15 constituting the packet monitoring unit 2 are substantially the same. Therefore, the same reference numerals are used as in FIG.

【００７３】図８において、破線で囲んだ部分が残フレ
ーム制御部６に対応する。In FIG. 8, a portion surrounded by a broken line corresponds to the remaining frame control section 6.

【００７４】すなわち、残フレーム制御部６は、サイズ
差検出部３０と、エラーフレーム連続数決定部３１と、
エラーフレーム挿入部３２とを備えている。That is, the remaining frame control unit 6 includes a size difference detection unit 30, an error frame continuation number determination unit 31,
An error frame insertion unit 32.

【００７５】このうちサイズ差検出部３０は、前記エラ
ーパケットサイズ選択部１４から受け取ったサイズ決定
信号ＤＳと、前記パケットサイズ検出部１２から受け取
ったサイズ情報ＳＩとを比較して、挿入されたエラーパ
ケットサイズ（ＤＳに対応）と受信パケットのパケット
サイズ（ＳＩに対応）のサイズ差（フレーム数の差）を
検出し、当該検出結果に応じたサイズ差情報ＤＤをエラ
ーフレーム連続数決定部３１に供給する部分である。The size difference detection unit 30 compares the size determination signal DS received from the error packet size selection unit 14 with the size information SI received from the packet size detection unit 12 to determine the inserted error. A size difference (difference in the number of frames) between the packet size (corresponding to DS) and the packet size (corresponding to SI) of the received packet is detected, and the size difference information DD corresponding to the detection result is sent to the error frame continuation number determination unit 31. It is a part to supply.

【００７６】すなわちサイズ差検出部３０は、求めるサ
イズ差をＲｅｍとし、受信パケットサイズをＰsize、挿
入したエラーパケットサイズをＥpsizeとすると、次の
式(７)で示される演算操作を行うことになる。That is, if the size difference to be obtained is set to Rem, the received packet size is set to Psize, and the inserted error packet size is set to Epsize, the size difference detection unit 30 performs the arithmetic operation represented by the following equation (7). .

【００７７】Ｒｅｍ＝Ｐsize−Ｅpsize …（７）換言するなら、このサイズ差Ｒｅｍは、エラーパケット
挿入部１５が行うエラーパケットの挿入によって生じた
遅延時間の制御誤差（ＤＤ）を示すものである。Rem = Psize−Epsize (7) In other words, the size difference Rem indicates a delay time control error (DD) caused by error packet insertion performed by the error packet insertion unit 15.

【００７８】エラーフレーム連続数決定部３１では、こ
のサイズ差情報ＤＤ（Ｒｅｍ）、前記しきい値設定部１
３から受け取ったしきい値ＴＨ、被挿入パケットの音声
フレームの数などを参照して、エラーフレームの挿入パ
ターンを決定する。被挿入パケットとは、前記エラーパ
ケット挿入部１５によるエラーパケット挿入の直後に到
着した受信パケット（およびそれ以降に到着した受信パ
ケット）であって、エラーフレームの挿入対象となるパ
ケットのことである。直後に到着した受信パケット以降
に到着した受信パケットの音声データ部に対して挿入操
作が行われるのは、直後の１受信パケットだけではすべ
てのエラーフレームを挿入し切れなかった場合である。In the error frame continuation number determining section 31, the size difference information DD (Rem) and the threshold value setting section 1
3 to determine the insertion pattern of the error frame with reference to the threshold value TH, the number of voice frames of the packet to be inserted, and the like. The inserted packet is a received packet arriving immediately after the error packet insertion unit 15 inserts an error packet (and a received packet arriving after that), and is a packet to which an error frame is to be inserted. The insertion operation is performed on the audio data part of the received packet arriving after the immediately following received packet when all the error frames cannot be completely inserted by only the immediately following received packet.

【００７９】エラーフレーム連続数決定部３１から供給
される挿入フレーム連続数ＭＥに応じたエラーフレーム
の挿入操作を行うエラーフレーム挿入部３２は、前記被
挿入パケットの音声データ部に対してエラーフレームの
挿入を実行する。An error frame insertion unit 32 for performing an operation of inserting an error frame according to the number ME of consecutive inserted frames supplied from the number-of-continuous-error-frames determination unit 31 outputs an error frame of Perform an insert.

【００８０】エラーフレーム挿入部３２は、各挿入操作
ごとに、前記挿入フレーム連続数ＭＥで指定された数の
エラーフレームを、当該被挿入パケットの音声データ部
に挿入してから、前記受信バッファ３に蓄積する。挿入
操作は、１つの被挿入パケットに対して１回または複数
回行われ、前記サイズ差Ｒｅｍに応じた回数だけ繰り返
される。The error frame insertion unit 32 inserts the number of error frames specified by the number ME of consecutive inserted frames into the audio data portion of the packet to be inserted for each insertion operation, To accumulate. The insertion operation is performed once or multiple times for one inserted packet, and is repeated a number of times according to the size difference Rem.

【００８１】前記挿入フレーム連続数ＭＥは音質の観点
から、少なくとも前記しきい値ＴＨ以下に設定する必要
があるが、サイズ差情報ＤＤが指定するサイズよりも大
きくする必要はない。サイズ差情報ＤＤ（Ｒｅｍ）が指
定するサイズが当該しきい値ＴＨよりも大きい場合に
は、できるだけ早期に前記遅延時間の制御誤差ＤＤを解
消するために当該挿入フレーム連続数ＭＥを、前記しき
い値ＴＨと同じ値に設定するとよい。The number ME of consecutive inserted frames needs to be set at least equal to or less than the threshold value TH from the viewpoint of sound quality, but need not be larger than the size specified by the size difference information DD. If the size specified by the size difference information DD (Rem) is larger than the threshold value TH, the number ME of consecutive inserted frames is set to the threshold value ME in order to eliminate the control error DD of the delay time as soon as possible. It may be set to the same value as the value TH.

【００８２】エラーフレーム連続数決定部３１が前記挿
入パターンを決定するに際しては、まず連続して挿入す
るエラーフレームの数の最大値である挿入フレーム最大
連続数ＬＥを検出し、次に、当該ＬＥ以下の値のなかか
ら挿入フレーム連続数ＭＥを決定するようにするとよ
い。When determining the insertion pattern, the error frame continuation number determining section 31 first detects the maximum number of consecutive insertion frames LE which is the maximum value of the number of error frames to be continuously inserted, and then detects the LE. The number ME of consecutive inserted frames may be determined from the following values.

【００８３】挿入フレーム連続数ＭＥは、１回のエラー
パケット挿入で生じた遅延誤差ＤＤを解消するあいだ
（１解消区間）、同じ値を継続するようにしてもよい
が、必要に応じてエラーフレーム挿入のたびに値を変更
（ただし最大値はＬＥ）するようにしてもよい。The number ME of consecutive inserted frames may be the same value while the delay error DD caused by one error packet insertion is eliminated (one cancellation section). The value may be changed (however, the maximum value is LE) at each insertion.

【００８４】被挿入パケットが多くの音声フレームを持
っていて、エラーフレームを挿入可能な候補位置（の時
間軸方向の密度）が多く、なおかつ当該遅延誤差ＤＤが
比較的少ない場合には、ＬＥよりも十分に小さなＭＥ
を、前記１解消区間にわたって継続することができる
が、候補位置が少なくＤＤが多い場合には、ＬＥと同じ
値のＭＥを１解消区間にわたって継続する必要が生じ得
る。If the packet to be inserted has many voice frames, the candidate positions (in the time axis direction) at which error frames can be inserted are large, and the delay error DD is relatively small, the LE is used. ME small enough
Can be continued over the one resolution section. However, if the number of candidate positions is small and the number of DDs is large, it may be necessary to continue the ME having the same value as LE over the one resolution section.

【００８５】また、通常、エラーフレームの挿入を行え
るキュー上の位置は限られている（例えば受信バッファ
３の入力端子または出力端子、すなわちキューの先頭ま
たは最後尾）から、候補位置が少なくＤＤが多い場合な
どには、エラーフレームの挿入パターンを決定しようと
するときには、前記遅延誤差ＤＤを完全に解消できるだ
けの見通しが立たず、その一方で、最初の被挿入パケッ
トの候補位置に対するエラーフレーム挿入は直ちに行わ
なければならないような状況も起こり得る。このような
場合、ＭＥの値は最初の見通しが立たない段階ではＬＥ
と同じ値にしておき、解消の見通しが立った時点から、
ＬＥより小さな値に変更するようにしてもよい。Usually, the positions on the queue where error frames can be inserted are limited (for example, the input terminal or the output terminal of the reception buffer 3, that is, the head or tail of the queue). In many cases, when trying to determine the insertion pattern of the error frame, it is not possible to completely eliminate the delay error DD. On the other hand, error frame insertion at the candidate position of the first packet to be inserted is not performed. There may be situations that must be done immediately. In such a case, the value of ME may be LE
To the same value as above, and from the time when the prospect of cancellation is established,
The value may be changed to a value smaller than LE.

【００８６】さらに、必要に応じて、すべての遅延誤差
ＤＤのうち、可能な部分だけを解消するようなシステム
構成をとることもできる。この場合でも、第１の実施形
態に比べて遅延時間の制御機能の精度は向上する。例え
ば、受信パケットを伝送してくるネットワークの状態が
悪く、受信装置２１において前記エラーパケットの挿入
が頻発するような状況では、このようなシステム構成を
とらざるを得なくなるものと考えられる。Further, if necessary, it is possible to adopt a system configuration in which only a possible portion of all the delay errors DD is eliminated. Even in this case, the accuracy of the delay time control function is improved as compared with the first embodiment. For example, in a situation where the state of the network transmitting the received packet is bad and the error packet is frequently inserted in the receiving device 21, it is considered that such a system configuration has to be adopted.

【００８７】いずれの場合でも、前記エラーフレームの
挿入方法として通話品質上、最も好ましいのは、Ｒｅｍ
分のデータ量を１フレームごとに分割し、それぞれエラ
ーフレームとして任意の間隔、または、等間隔で、次の
到着パケットに挿入するものである。これは、上述した
エラーフレームを挿入可能な候補位置が多く、なおかつ
当該遅延誤差ＤＤが比較的少ない場合などに実行すると
よい。In any case, the most preferable method for inserting the error frame in terms of speech quality is Rem.
This is to divide the amount of data for each frame and insert it into the next arriving packet as an error frame at arbitrary intervals or at equal intervals. This may be performed when there are many candidate positions into which the above-described error frame can be inserted and the delay error DD is relatively small.

【００８８】Ｒｅｍが４フレーム分あり、前記被挿入パ
ケットが次の配列String１に示すように、１０個の音声
フレームから構成されているパケットサイズ１０の場合
を例に、具体的に説明する。The case where the Rem has 4 frames and the packet to be inserted has a packet size of 10 composed of 10 voice frames as shown in the following array String 1 will be specifically described as an example.

【００８９】Ａ、Ｂ、Ｃ、Ｄ、Ｅ、Ｆ、Ｇ、Ｈ、Ｉ、Ｊ …String１ここで、Ａ〜Ｊの各アルファベットは、当該被挿入パケ
ットに収容されている各音声フレームを示している。A, B, C, D, E, F, G, H, I, J... String1 Here, each alphabet of A to J indicates each audio frame contained in the inserted packet. I have.

【００９０】いま、挿入フレーム連続数ＭＥが１フレー
ムを指定しているものとし、挿入する各エラーフレーム
を（１）、（２）、（３）、（４）とし、前記配列Stri
ng１に対して各エラーフレーム（１）〜(４）を等間隔
に挿入するものとすると、配列String１は次の配列Stri
ng２に変換される。Now, it is assumed that the number ME of consecutive frames to be inserted specifies one frame, each error frame to be inserted is (1), (2), (3), and (4).
Assuming that error frames (1) to (4) are inserted at equal intervals into ng1, the array String1 becomes the next array Stri
Converted to ng2.

【００９１】（１）、Ａ、Ｂ、Ｃ、（２）、Ｄ、Ｅ、Ｆ、（３）、Ｇ、Ｈ、Ｉ、（４）、Ｊ …String２ここで、挿入されたエラーフレーム（１）は、先頭音声
フレームであるＡの前に配置され、エラーフレーム
(２）は、連続する音声フレームＣとＤのあいだに挿入
され、エラーフレーム(３）は、連続する音声フレーム
ＦとＧのあいだに挿入され、エラーフレーム(４）は、
連続する音声フレームＩとＧのあいだに挿入されてい
る。 (1) , A, B, C, (2) , D, E, F, (3) , G, H, I, (4), J... String2 Here, the inserted error frame (1) ) Is placed before A, which is the first audio frame, and contains an error frame.
(2) is inserted between successive audio frames C and D, error frame (3) is inserted between consecutive audio frames F and G, and error frame (4) is inserted
It is inserted between successive audio frames I and G.

【００９２】これは、各エラーフレームを等間隔（３フ
レーム間隔）で挿入した例であるが、この間隔は、必ず
しも、すべて等間隔である必要はなく、例えばフレーム
（４）の挿入位置を音声フレームＨとＩのあいだに変更
して、３フレーム間隔の部分と２フレーム間隔の部分が
混在するようにしてもよい。This is an example in which error frames are inserted at equal intervals (three-frame intervals). However, the intervals do not necessarily have to be all equal intervals. By changing between frames H and I, a portion with a three-frame interval and a portion with a two-frame interval may be mixed.

【００９３】変換後のこの配列String２は、変換後の先
頭フレームである左端のフレーム（１）から順番に、受
信バッファ３に蓄積されてキューの最後尾を形成する。
そして最後尾の音声フレームＪのあとには、次に到着し
た受信パケットの音声フレーム（またはエラーパケッ
ト）が蓄積され、以降は同様な動作が繰り返され得る。The converted array String2 is accumulated in the reception buffer 3 in order from the leftmost frame (1), which is the first frame after the conversion, to form the tail of the queue.
After the last voice frame J, the voice frame (or error packet) of the next received packet is accumulated, and the same operation can be repeated thereafter.

【００９４】なお、配列String２では、エラーフレーム
（１）を先頭音声フレームＡの前に配置したが、この挿
入操作は、受信パケット系列中で前記配列String１の直
前に位置するパケットがエラーパケットであるものとす
ると好ましくない。この位置にエラーフレームを配置す
ることは実質的に当該エラーパケットのサイズを大きく
することに等しいからである。ただし、直前に位置する
パケットが被挿入パケットであって、当該被挿入パケッ
トの最後尾に位置するのが音声フレームである場合（つ
まりＲｅｍが４を超え、配列String２が２つ目以降の被
挿入パケットであるケース）には、このような問題はな
い。In the array String2, the error frame (1) is arranged before the head audio frame A. However, in this insertion operation, the packet located immediately before the array String1 in the received packet sequence is an error packet. This is not preferable. This is because placing an error frame at this position is substantially equivalent to increasing the size of the error packet. However, if the packet located immediately before is the inserted packet and the last frame of the inserted packet is an audio frame (that is, Rem exceeds 4 and the array String2 is the second or later inserted packet). Packet case), there is no such problem.

【００９５】同様に、配列String２中で最後尾の音声フ
レームＪの後ろにエラーフレームを配置することは、そ
の直後に通常の受信パケットが蓄積されるのであれば問
題ないが、新たなエラーパケットや後続の被挿入パケッ
トが蓄積されるのであれば、当該エラーパケットのサイ
ズや被挿入パケットに対するエラーフレームの挿入パタ
ーンに制約を与える。Similarly, arranging an error frame after the last audio frame J in the array String2 is not a problem as long as normal received packets are stored immediately after that. If the following inserted packet is accumulated, restrictions are imposed on the size of the error packet and the insertion pattern of the error frame for the inserted packet.

【００９６】配列String２では、挿入されたエラーフレ
ームの連続数は１なので、図３から明らかなように、前
記出力端５から出力される音声の音質の観点からは、最
も好ましい。In the array String2, since the number of consecutive error frames inserted is 1, as is apparent from FIG. 3, it is most preferable from the viewpoint of the sound quality of the sound output from the output terminal 5.

【００９７】ところが、３フレーム間隔の挿入を前提と
すると、前記Ｒｅｍが多すぎる場合などには、このよう
な変換を実行することが適切でない場合もある。例えば
Ｒｅｍが５フレーム分以上あるものとすると、すべての
エラーフレームを前記配列String１の中には挿入するこ
とはできない。However, assuming that a three-frame interval is inserted, it may not be appropriate to execute such conversion when the Rem is too large. For example, if Rem is equal to or more than 5 frames, not all error frames can be inserted into the array String1.

【００９８】このような場合には、前記配列String１を
収容した被挿入パケットの次に受信される受信パケット
の到着を待って、その受信パケットの音声データ部にフ
レーム（５）以降のエラーフレームを挿入することも考
えられるが、ここでは、上述したできるだけ早期に前記
遅延時間の制御誤差を解消するという要求をも満足する
ために、前記しきい値ＴＨが２以上であり、Ｒｅｍが５
フレーム分であるものとして、前記配列String１を次の
配列String３のように変換する。In such a case, after the arrival of the received packet received next to the inserted packet containing the array String1, the error frame after the frame (5) is added to the audio data part of the received packet. Although it is conceivable to insert the threshold value, here, the threshold value TH is 2 or more and Rem is 5 to satisfy the above-described requirement of eliminating the control error of the delay time as soon as possible.
The array String1 is converted as in the following array String3, assuming that it is a frame.

【００９９】Ａ、Ｂ、Ｃ、（１）、（２）、Ｄ、Ｅ、Ｆ、（３）、（４）、Ｇ、Ｈ、Ｉ、（５）、Ｊ …String３この配列String３では、エラーフレーム（１）と(２）
が、音声フレームＣとＤのあいだに挿入され、エラーフ
レーム（３）と(４）が、音声フレームＦとＧのあいだ
に挿入され、音声フレームＩとＪのあいだには、エラー
フレーム（５）だけが挿入されている。この配列String
３は、前記配列String２と比べ、エラーフレームの挿入
位置も異なっているが、挿入位置は、挿入フレーム連続
数ＭＥの供給タイミングなどに応じて適宜変更するよう
にしてよい。A, B, C, (1), (2), D, E, F, (3) , (4), G, H, I, ( 5), J ... String3 In this array String3, an error occurs. Frames (1) and (2)
Is inserted between the audio frames C and D, the error frames (3) and (4) are inserted between the audio frames F and G, and the error frame (5) is inserted between the audio frames I and J. Only have been inserted. This array String
In No. 3, the insertion position of the error frame is also different from that of the array String2, but the insertion position may be appropriately changed according to the supply timing of the number ME of consecutive inserted frames.

【０１００】前記配列String１をこの配列String３に変
換するにあたって、前記エラーフレーム挿入部３２がエ
ラーフレーム連続数決定部３１から受け取る挿入フレー
ム連続数ＭＥは、フレーム（１）、（２）およびフレー
ム(３）、（４）を挿入する際には２で、フレーム
（５）を挿入する際には１である。In converting the array String1 into the array String3, the number ME of consecutive inserted frames received by the error frame inserting unit 32 from the number-of-continuous-error-frames determining unit 31 is represented by frames (1), (2) and (3). ), 2 when inserting (4), and 1 when inserting frame (5).

【０１０１】配列String３では挿入されたエラーフレー
ムの連続数は、２または１であるが、連続数２は、しき
い値ＴＨ未満なので、音質劣化はユーザにほとんど体感
されない程度に抑制される。In the array String3, the number of consecutive error frames inserted is 2 or 1, but since the number of consecutive 2 is less than the threshold value TH, sound quality deterioration is suppressed to such an extent that the user hardly feels it.

【０１０２】ここでは、変換後の配列String２や３は、
受信バッファ３に蓄積してキューの最後尾を形成するも
のとしたが、受信バッファ３を介することなく、直接、
復号部４に供給するようにしてもよい。これは、前記被
挿入パケットが受信バッファ３に蓄積されている期間内
に残フレーム制御部６における処理が行われればよいか
ら、残フレーム制御部６における処理に時間がかかる場
合のタイミング調整などに有利である。Here, the converted arrays String2 and String3 are
The data is accumulated in the reception buffer 3 to form the tail of the queue.
You may make it supply to the decoding part 4. This is because the processing in the remaining frame control unit 6 only needs to be performed during the period in which the inserted packet is accumulated in the reception buffer 3, and is used for timing adjustment when the processing in the remaining frame control unit 6 takes time. It is advantageous.

【０１０３】この場合でも、配列String１からString２
への変換や配列String１からString３への変換などが実
行可能である。In this case, too, the strings String1 to String2
Can be executed, and the conversion from array String1 to String3 can be executed.

【０１０４】（Ｂ−２）第２の実施形態の効果本実施形態によれば、第１の実施形態の効果と同等な効
果を得ることができる。(B-2) Effects of the Second Embodiment According to the present embodiment, the same effects as those of the first embodiment can be obtained.

【０１０５】加えて、本実施形態では、エラーパケット
の挿入による上述した遅延時間の制御誤差を完全に解消
することができ、受信バッファ枯渇による音声途切れの
防止や、音質劣化の改善に有効である。In addition, in the present embodiment, the above-described control error of the delay time due to the insertion of the error packet can be completely eliminated, which is effective for preventing the audio interruption due to the exhaustion of the reception buffer and improving the sound quality deterioration. .

【０１０６】（Ｃ）第３の実施形態以下では、本実施形態が第１、第２の実施形態と相違す
る点についてのみ説明する。(C) Third Embodiment In the following, only differences between the present embodiment and the first and second embodiments will be described.

【０１０７】第２の実施形態では、エラーフレームの挿
入に際して挿入位置の前後の音声フレームの中身につい
ては配慮しなかったが、本実施形態は、エラーフレーム
の挿入部分が聴覚特性に与える品質劣化の程度は隣接す
る音声フレームの内容に応じて相違する点に着目し、エ
ラーフレーム挿入部分の音質劣化を一層効果的に抑制す
ることを特徴とする。In the second embodiment, when inserting an error frame, no consideration is given to the contents of the audio frames before and after the insertion position. However, in the present embodiment, the quality degradation caused by the insertion of the error frame on the auditory characteristics is not considered. Focusing on the fact that the degree differs depending on the content of the adjacent audio frame, it is characterized in that the sound quality degradation at the error frame insertion portion is suppressed more effectively.

【０１０８】（Ｃ−１）第３の実施形態の構成および動
作本実施形態の受信装置２２の構成を図５に示す。この受
信装置２２は基本的に、第１の実施形態の受信装置２
０、第２の実施形態の受信装置２１に対応する装置であ
る。(C-1) Configuration and Operation of the Third Embodiment FIG. 5 shows the configuration of the receiving device 22 of the present embodiment. This receiving device 22 is basically the receiving device 2 of the first embodiment.
0, a device corresponding to the receiving device 21 of the second embodiment.

【０１０９】図５において、受信装置２２は、受信端１
と、パケット監視部２と、受信バッファ３と、復号部４
と、出力端５と、残フレーム制御部６、フレームレベル
識別部７とを備えている。In FIG. 5, the receiving device 22 is
, Packet monitoring unit 2, reception buffer 3, decoding unit 4
, An output terminal 5, a remaining frame control unit 6, and a frame level identification unit 7.

【０１１０】このうちフレームレベル識別部７以外の構
成要素１〜６の機能は、実質的に、同一の符号を付した
図８の各部と同じである。The functions of the components 1 to 6 other than the frame level identification section 7 are substantially the same as those of FIG. 8 to which the same reference numerals are given.

【０１１１】図９には、パケット監視部２と、残フレー
ム制御部６と、当該フレームレベル識別部７の内部構成
を示している。FIG. 9 shows the internal configuration of the packet monitoring unit 2, the remaining frame control unit 6, and the frame level identification unit 7.

【０１１２】図９中の構成要素１１〜１５、３０，３１
の機能は、実質的に、同一の符号を付した図８の各部と
同じである。Components 11 to 15, 30, 31 in FIG.
Are substantially the same as those in FIG. 8 to which the same reference numerals are given.

【０１１３】図９において、一点鎖線で囲んだ部分がフ
レームレベル識別部７に対応する。In FIG. 9, a portion surrounded by a dashed line corresponds to the frame level identifying section 7.

【０１１４】すなわち、フレームレベル識別部７は、無
音フレーム検出部４０と、音圧レベル検出部４１とを備
えている。That is, the frame level identification section 7 includes a silent frame detection section 40 and a sound pressure level detection section 41.

【０１１５】このうち無音フレーム検出部４０は、受信
パケットが収容している各音声フレームを調べて無音フ
レームを検出し当該無音フレームの位置を示す無音位置
ＮＬをエラーフレーム挿入部４２に供給する部分であ
る。無音フレームとは、音声データとしては有効である
が復号しても有意な音声出力が得られない符号化音声デ
ータを収容している音声フレームである。The silence frame detecting section 40 examines each voice frame contained in the received packet, detects the silence frame, and supplies the silence position NL indicating the position of the silence frame to the error frame insertion section 42. It is. A silence frame is a speech frame containing coded speech data that is valid as speech data but does not provide a significant speech output even when decoded.

【０１１６】無音フレーム（無音部）の判定機能として
は、例えば波形の二乗平均値を求め、所定の閾値より小
さい場合に無音と判定する方法や、波形の分散値を求
め、所定の閾値より小さい場合に無音と判定する方法
や、各フレームデータに記載された有音無音の識別コー
ドを参照する方法などがあり、これ以外にも様々な方法
が適用できる。As a function of determining a silence frame (silence portion), for example, a method of determining a root mean square of a waveform and determining that the value is smaller than a predetermined threshold value, or determining a variance value of the waveform and determining a variance value of the waveform, In such a case, there is a method of determining that there is no sound, a method of referring to a sound / silence identification code described in each frame data, and various other methods can be applied.

【０１１７】最後に述べた、各フレームデータに記載さ
れた有音無音の識別コードとは、例えば、前述したＧ．
７２３．１やＧ．７２９などの符号化方式において、無
音圧縮機能（ＶＡＤ機能）を用いて符号化した場合、各
フレームデータの先頭に、有音、無音、過渡音の何れか
のコードが記される。The last-mentioned sound / silence identification code described in each frame data is, for example, the above-mentioned G.100.
723.1 and G.P. In the case of encoding using a silence compression function (VAD function) in an encoding method such as 729, any of speech, silence, and transient sounds is described at the head of each frame data.

【０１１８】また、前記音圧レベル検出部４１は、受信
パケットが収容している各音声フレームを調べて、所定
レベル以上の音圧レベルを持つ符号化音声データを収容
した音声フレームを検出し、その音声フレームの位置を
示す大音圧位置ＬＬをエラーフレーム挿入部４２に供給
する部分である。音圧レベルとは、各音声フレームを復
号することによって得られる音声出力の音の大きさのこ
とである。The sound pressure level detecting section 41 examines each sound frame contained in the received packet and detects a sound frame containing encoded sound data having a sound pressure level equal to or higher than a predetermined level. This is a part that supplies the large sound pressure position LL indicating the position of the audio frame to the error frame insertion unit 42. The sound pressure level is a loudness of an audio output obtained by decoding each audio frame.

【０１１９】音圧レベルの検出は、上述した無音フレー
ムの検出に用いた判定機能と同様に、例えば、波形の二
乗平均値を求めその値を予め設定した音圧レベルと比較
すること等によって行うことができる。The detection of the sound pressure level is performed, for example, by obtaining the mean square value of the waveform and comparing the value with a preset sound pressure level, in the same manner as the above-described determination function used for detecting the silent frame. be able to.

【０１２０】エラーフレームを無音部に挿入すると、挿
入したエラーフレーム分、無音区間が延びるが、聴覚特
性上、それを人間が感知することは現実的に不可能であ
る。When an error frame is inserted into a silent section, a silent section is extended by the length of the inserted error frame, but it is practically impossible for a human to perceive it due to auditory characteristics.

【０１２１】また、エラーフレームを音圧レベルの高い
部分に挿入すると、エラーフレーム直前の大音圧による
マスキング効果で、聴覚特性上、エラーフレーム部の音
声は人間に知覚されにくくなる。このマスキング効果
は、音圧レベルがない音（いわゆる小さい音）の後に、
音圧レベルが高い音（いわゆる大きい音）がきた場合に
も効果を発揮し、音圧レベルの高い音が、直前の音圧レ
ベルがない音を知覚しにくくする。When an error frame is inserted into a portion having a high sound pressure level, a masking effect by a large sound pressure immediately before the error frame makes it difficult for a human to perceive the sound in the error frame portion due to auditory characteristics. This masking effect occurs after a sound without sound pressure level (so-called small sound)
The effect is exhibited even when a sound with a high sound pressure level (a so-called loud sound) comes, and it is difficult for a sound with a high sound pressure level to be perceived as a sound without a previous sound pressure level.

【０１２２】したがって、エラーフレームの挿入位置と
して、無音の音声フレームに隣接した位置や、大音圧レ
ベルの音声フレーム（大音圧レベルフレーム）に隣接し
た位置を選択することにより、音質の劣化がユーザに感
得されにくくなる。Therefore, by selecting a position adjacent to a silent sound frame or a position adjacent to a sound frame of a high sound pressure level (high sound pressure level frame) as an insertion position of an error frame, deterioration of sound quality is reduced. It becomes difficult for the user to feel.

【０１２３】エラーフレーム挿入部４２の機能は、上述
した第２の実施形態のエラーフレーム挿入部３２とは相
違し、前記エラーフレーム連続数決定部３１から受け取
った挿入フレーム連続数ＭＥと、無音フレーム検出部４
０から受け取った無音位置ＮＬと、音圧レベル検出部４
１から受け取った大音圧位置ＬＬとに基づいて、上述し
た被挿入パケットの音声データ部に、エラーフレームの
挿入操作を実行する。The function of the error frame insertion unit 42 is different from that of the error frame insertion unit 32 of the second embodiment described above. Detector 4
0 and the sound pressure level detector 4
Based on the large sound pressure position LL received from No. 1, an operation of inserting an error frame into the audio data portion of the above-described inserted packet is performed.

【０１２４】フレームレベル識別部７は、必ずしも無音
フレーム検出部４０と音圧レベル検出部４１の双方を備
えている必要はなく、いずれか一方だけを装備していて
もよい。The frame level discriminating section 7 does not necessarily need to include both the silent frame detecting section 40 and the sound pressure level detecting section 41, and may include only one of them.

【０１２５】また、無音フレームや大音圧レベルフレー
ムの発生頻度は、ユーザ間の通話状態などに応じて変動
するものなので、１つの被挿入パケットに収容された音
声フレームにおける大音圧レベルフレームと無音フレー
ムの分布状態は未知である。すなわち１つの被挿入パケ
ットに収容された音声フレームに、大音圧レベルフレー
ムは存在するが無音フレームは存在しない場合もあり、
反対に無音フレームは存在するが大音圧レベルフレーム
は存在しない場合や、無音フレームも大音圧レベルフレ
ームも存在しない場合や、双方とも存在する場合もあり
得る。Further, since the frequency of occurrence of a silent frame or a high sound pressure level frame varies depending on the state of communication between users and the like, the high sound pressure level frame and the high sound pressure level frame in the voice frame accommodated in one inserted packet. The distribution state of silence frames is unknown. That is, in the audio frame accommodated in one inserted packet, there is a case where a high sound pressure level frame exists but no silence frame exists,
Conversely, there may be a case where there is a silence frame but no high sound pressure level frame, a case where neither a silence frame nor a high sound pressure level frame exists, or a case where both exist.

【０１２６】ここでは、無音フレームは存在するが大音
圧レベルフレームは存在しない場合を例に、本実施形態
の動作について説明する。Here, the operation of this embodiment will be described by taking as an example a case where a silent frame exists but a high sound pressure level frame does not exist.

【０１２７】前記被挿入パケットが次の配列String４に
示すように、６個の音声フレームから構成されているも
のとする。It is assumed that the inserted packet is composed of six audio frames as shown in the following array String4.

【０１２８】無音１、有音２、有音３、無音４、有音５、有音６ …String４この配列String４を調べた結果、前記無音レベル検出部
４０の出力する無音位置ＮＬは、無音１と無音４の位置
で能動状態となり、他の位置では非能動状態である。ま
た、この例では、有音２、有音３、有音５、有音６の音
圧レベルは高くないため、音圧レベル検出部４１が出力
する大音圧位置ＬＬは、一度も能動状態とならないもの
とする。Silence 1, sound 2, sound 3, sound 4, sound 5, sound 6 ... String4 As a result of examining this array String4, the silence position NL output from the silence level detection unit 40 is silence 1 And the silence 4 is active, and the other positions are inactive. Further, in this example, since the sound pressure levels of the sound 2, the sound 3, the sound 5, and the sound 6 are not high, the large sound pressure position LL output from the sound pressure level detecting unit 41 has never been in the active state. It shall not be.

【０１２９】このとき、挿入フレーム連続数ＭＥが１を
指定しているものとし、前記サイズ差ＤＤが２以上ある
ものとすると、前記配列String４はエラーフレーム挿入
部４２の動作によって、次の配列String５に変換され
る。At this time, assuming that the number ME of consecutive frames to be inserted specifies 1, and that the size difference DD is 2 or more, the array String4 is changed to the next array String5 by the operation of the error frame insertion unit 42. Is converted to

【０１３０】無音１、有音２、有音３、（１）、無音４、（２）、有音５、有音６…Stri ng５ここでは、無音４の左右に１つずつ、合計２つのエラー
フレーム（１）、（２）を挿入したが、無音１の右側に
もいずれかのエラーフレームを挿入することも可能であ
る。Silence 1, sound 2, sound 3, (1) , sound 4, (2) , sound 5, sound 6... Trigger5 Here, one to the left and right of silence 4, a total of two Although error frames (1) and (2) have been inserted, it is also possible to insert any error frame on the right side of silence 1.

【０１３１】もしも、挿入フレーム連続数ＭＥが２であ
る場合には、前記配列String４は次の配列String６に変
換することもできる。If the number ME of consecutive frames to be inserted is two, the array String4 can be converted to the next array String6.

【０１３２】無音１、有音２、有音３、無音４、（１）、（２）、有音５、有音６…Stri ng６ここでも、２つの連続する挿入フレーム（１）、（２）
は、無音４の左側に挿入してもよいし、無音１の右側に
挿入してもよい。Silence 1, sound 2, sound 3, sound 4, (1) , (2) , sound 5, sound 6... Trigger 6 Here, two consecutive inserted frames (1), (2) )
May be inserted to the left of silence 4 or to the right of silence 1.

【０１３３】また、本実施形態では、マスキング効果な
どの大音圧レベルや無音部の聴覚特性を利用するため、
これらを利用しない第２の実施形態に比べて、挿入フレ
ーム最大連続数ＬＥの値を大きくしても音質劣化は感得
されない可能性が高い。したがって、実際に出力端５か
ら出力される再生音を試聴することによって、挿入フレ
ーム最大連続数ＬＥの値を決定するようにしてもよい。In this embodiment, since a large sound pressure level such as a masking effect and the auditory characteristics of a silent part are used,
As compared to the second embodiment that does not use these, even if the value of the maximum consecutive number of inserted frames LE is increased, there is a high possibility that sound quality degradation will not be perceived. Therefore, the value of the maximum number of consecutive insertion frames LE may be determined by listening to the reproduced sound actually output from the output terminal 5.

【０１３４】なお、変換後の配列String５やString６が
受信バッファ３に蓄積されてキューの最後尾を形成する
ことは、第２の実施形態と同じである。It is to be noted that the converted strings String5 and String6 are accumulated in the reception buffer 3 to form the tail of the queue, as in the second embodiment.

【０１３５】また、第２の実施形態と同様に、変換後の
配列String５やString６は、受信バッファ３に蓄積する
ことなく、直接、復号部４に供給するようにしてもよ
い。これは、前記被挿入パケットが受信バッファ３に蓄
積されている期間内に残フレーム制御部６およびフレー
ムレベル識別部７における処理が行われればよいから、
残フレーム制御部６やフレームレベル識別部７における
処理に時間がかかる場合のタイミング調整などに有利で
ある。As in the second embodiment, the converted arrays String5 and String6 may be directly supplied to the decoding unit 4 without being stored in the reception buffer 3. This is because the processing in the remaining frame control unit 6 and the frame level identification unit 7 only needs to be performed during the period in which the inserted packet is accumulated in the reception buffer 3.
This is advantageous for timing adjustment when the processing in the remaining frame control unit 6 and the frame level identification unit 7 takes time.

【０１３６】図６に示した受信装置２３は、この場合に
対応した構成となっている。The receiving device 23 shown in FIG. 6 has a configuration corresponding to this case.

【０１３７】（Ｃ−３）第３の実施形態の効果本実施形態によれば、第１、第２の実施形態と同等な効
果を得ることができる。(C-3) Effects of Third Embodiment According to this embodiment, the same effects as those of the first and second embodiments can be obtained.

【０１３８】加えて、本実施形態では、被挿入パケット
が収容している音声フレームの中身にも配慮してエラー
フレームの挿入を行うので、聴覚特性を巧みに利用し
て、音声品質を第１、第２の実施形態よりも向上するこ
とや、挿入フレーム最大連続数（ＬＥ）の値を大きくし
て、いっそう早期に遅延時間の制御誤差（ＤＤ）を解消
することが可能である。In addition, in the present embodiment, the error frame is inserted in consideration of the contents of the audio frame contained in the packet to be inserted. It is possible to eliminate the delay time control error (DD) more quickly by improving the second embodiment and by increasing the value of the maximum number of consecutive insertion frames (LE).

【０１３９】（Ｄ）他の実施形態なお、第１〜第３の実施形態では、前述したパケット番
号、タイムアウト時間の設定と監視、エラーフレーム生
成、パケット廃棄の機能をパケット監視部２に集約した
が、全て、あるいは、部分的に機能を分けて複数のユニ
ットとして構成してもよい。(D) Other Embodiments In the first to third embodiments, the functions of setting and monitoring the packet number, the timeout time, generating an error frame, and discarding the packet are integrated in the packet monitoring unit 2. However, all or some of the functions may be divided into a plurality of units.

【０１４０】また、第１〜第３の実施形態では、説明の
簡明のために多くの具体的な数値を示したが、これらの
数値は例示的なものであり、本発明は上述した数値以外
についても広く適用することができる。In the first to third embodiments, many specific numerical values are shown for simplicity of explanation. However, these numerical values are merely examples, and the present invention is not limited to the numerical values described above. Can be widely applied.

【０１４１】さらに、第１〜第３の実施形態で示した構
成図や動作フローは、本発明を実現する一例を示してい
るにすぎない。よって本発明の適用にあたっては、前述
した機能を有する要素がどこかに含まれていればよく、
これら以外の機能を有する要素が含まれていても構わな
い。Further, the configuration diagrams and operation flows shown in the first to third embodiments are merely examples for realizing the present invention. Therefore, in applying the present invention, it is sufficient that the element having the function described above is included somewhere.
Elements having functions other than these may be included.

【０１４２】また、第１〜第３の実施形態では、本発明
を主としてハードウエア的に実現したが、本発明はソフ
トウエア的に実現することも可能である。In the first to third embodiments, the present invention is mainly realized by hardware, but the present invention can be realized by software.

【０１４３】[0143]

【発明の効果】以上に説明したように、本発明は、各通
信パケットの受信そのものが正常に行われているどうか
を検出しているので、例えば受信バッファが空になるか
どうか等に依存せずに機能して、音飛びなどが発生する
ことを阻止できる可能性を高める。As described above, since the present invention detects whether or not each communication packet is normally received, it depends on whether or not the reception buffer becomes empty. Function to increase the possibility of preventing the occurrence of sound skipping and the like.

【０１４４】また、本発明では、通信パケットサイズが
大きい場合には、通信パケットサイズよりも小さなサイ
ズの補正パケットを挿入するから、有効な音声データを
収容していない補正パケットを挿入してもその挿入部分
における音質の劣化を抑制し、通話品質を維持すること
が可能である。Further, according to the present invention, when the communication packet size is large, a correction packet smaller than the communication packet size is inserted. It is possible to suppress the deterioration of the sound quality at the insertion portion and maintain the communication quality.

[Brief description of the drawings]

【図１】第１〜第３の実施形態に係る受信装置のパケッ
ト監視部の概略構成を示すブロック図である。FIG. 1 is a block diagram illustrating a schematic configuration of a packet monitoring unit of a receiving device according to first to third embodiments.

【図２】第１〜第３の実施形態の動作説明図である。FIG. 2 is an operation explanatory diagram of the first to third embodiments.

【図３】受信パケットのパケットサイズと同サイズのエ
ラーパケットを挿入した場合の、パケットサイズと音質
評価の関係を示す概略図である。FIG. 3 is a schematic diagram showing the relationship between packet size and sound quality evaluation when an error packet having the same size as the packet size of a received packet is inserted.

【図４】第２の実施形態に係る受信装置の概略構成を示
すブロック図である。FIG. 4 is a block diagram illustrating a schematic configuration of a receiving device according to a second embodiment.

【図５】第３の実施形態に係る受信装置の概略構成を示
すブロック図である。FIG. 5 is a block diagram illustrating a schematic configuration of a receiving device according to a third embodiment.

【図６】第３の実施形態に係る受信装置の変形例の概略
構成を示すブロック図である。FIG. 6 is a block diagram illustrating a schematic configuration of a modified example of the receiving device according to the third embodiment.

【図７】第１の実施形態に係る受信装置の概略構成を示
すブロック図である。FIG. 7 is a block diagram illustrating a schematic configuration of a receiving device according to the first embodiment.

【図８】第２の実施形態に係る受信装置の主要部の概略
構成を示すブロック図である。FIG. 8 is a block diagram illustrating a schematic configuration of a main part of a receiving device according to a second embodiment.

【図９】第３の実施形態に係る受信装置の主要部の概略
構成を示すブロック図である。FIG. 9 is a block diagram illustrating a schematic configuration of a main part of a receiving device according to a third embodiment.

[Explanation of symbols]

２…パケット監視部、３…受信バッファ、４…復号部、
６…残フレーム制御部、７…フレームレベル識別部、１
１…受信正常性検出部、１２…パケットサイズ検出部、
１３…しきい値設定部、１４…エラーパケットサイズ選
択部、１５…エラーパケット挿入部、３０…サイズ差検
出部、３１…エラーフレーム連続数決定部、３２、４２
…エラーフレーム挿入部、４０…無音フレーム検出部、
４１…音圧レベル検出部。2 packet monitoring unit 3 reception buffer 4 decoding unit
6: remaining frame control unit, 7: frame level identification unit, 1
1 ... reception normality detector, 12 ... packet size detector,
13: threshold setting unit, 14: error packet size selection unit, 15: error packet insertion unit, 30: size difference detection unit, 31: error frame continuation number determination unit, 32, 42
... error frame insertion unit, 40 ... silence frame detection unit
41 ... Sound pressure level detection unit.

フロントページの続きＦターム(参考） 5K030 GA11 HA08 HB01 HB28 JT03 KA01 LA06 LB11 LC13 MA12 MB11 MB12 9A001 CC03 EE04 HH15 Continued on the front page F term (reference) 5K030 GA11 HA08 HB01 HB28 JT03 KA01 LA06 LB11 LC13 MA12 MB11 MB12 9A001 CC03 EE04 HH15

Claims

[Claims]

1. A method for detecting whether or not each communication packet received in time series is normally received, based on an identifier assigned to each communication packet containing valid voice data on a transmission side. A packet receiving device that determines whether to insert a predetermined correction packet that does not contain valid voice data into a received communication packet sequence based on the detection result, Communication packet size detecting means for checking the communication packet size which is the size of
A packet receiving apparatus comprising: a correction packet inserting unit that selects a size smaller than the communication packet size as the size of the correction packet when at least the communication packet size exceeds the threshold.

2. The packet receiving apparatus according to claim 1, wherein, when the size of the correction packet inserted in the communication packet sequence is smaller than the communication packet size, a difference between the size of the correction packet and the size of the communication packet is reduced. A correction frame insertion unit for inserting a correction frame that does not contain valid audio data into a backward communication packet located after the correction packet in the communication packet sequence. Receiver.

3. The packet receiving apparatus according to claim 2, wherein the correction frame inserting unit checks valid audio data contained in the backward communication packet, and determines, from the audio data, a portion adjacent to the correction frame. And a correction frame for inserting the correction frame so that the inserted correction frame is adjacent to the non-visible data. A packet receiving device comprising an insertion control unit.