JP3669660B2 - Call system

Info

Publication number: JP3669660B2
Application number: JP00691297A
Authority: JP
Inventors: 弘久飯島; 光夫古村; 玲浜田
Original assignee: Secom Co Ltd; Casio Computer Co Ltd
Current assignee: Secom Co Ltd; Casio Computer Co Ltd
Priority date: 1997-01-17
Filing date: 1997-01-17
Publication date: 2005-07-13
Anticipated expiration: 2017-01-17
Also published as: JPH10210074A

Description

【０００１】
【発明の属する技術分野】
本発明は通話システムに関し、特に、音声情報をインターネット等のパケット通信網を介して送受信する通話システムの音声品質維持のための技術に関する。
【０００２】
【背景技術】
近年、音声メール機能を有するパーソナルコンピュータ等のコンピュータを電話網やＩＳＤＮ(Integrated Services Digital Network) などの公衆網に接続することにより、同様に公衆網に接続されたコンピュータとインターネットを介して音声の送受信ができるようにした通話システムがある。かかる通話システムは、特に音声の通信を海外とする場合には、公衆回線使用料がユーザー端末から接続業者（プロバイダー）までの市内電話料金のみで済み、国際電話網を利用する場合に比して安価に通話を行うことができることから広く普及しつつある。
【０００３】
また、パーソナルコンピュータ等のコンピュータを使用せずに、電話機間の通話接続にインターネットを使用する通話システムも提案されている。図１３は現在提案されている上記通話システムの電話機間の通話接続への適用例を示す図である。同図に示す通話システムでは、電話機１０１は公衆電話網１０２により中継局１０３と接続され、一方、電話機１０４は公衆電話網１０５により中継局１０６と接続される。また、前記中継局１０３と前記中継局１０６とがインターネット１０７により通信接続される。
【０００４】
そして、同図に示す通話システムでは、電話機１０１から送信される音声は中継局１０３で符号化され、例えば、３０ｍｓ分の音声を表す符号毎にパケット化される。そして、３０ｍｓ毎の間隔でＩＰパケットがインターネット１０７を介して中継局１０６に送信され、該中継局１０６に設けられた図示しない受信バッファに一旦蓄えられる。その後、ＩＰパケットが開梱されて元の音声に復号化され、電話機１０１から送信される音声が電話機１０４にて再生される。
【０００５】
【発明が解決しようとする課題】
しかし、インターネット１０７での通信は離散的なデータの通信には適しているものの、電話や放送等における音声や画像データ等の連続的なデータを受信側で即座に再生する形態の通信には不向きであって、上記通話システムにおいては、インターネット１０７の通信トラフィックの混雑量が増減する場合や、中継局１０３と中継局１０６との間の同期が正確に確保されていない場合には、中継局１０６に設けられた受信バッファがオーバーフローやアンダーフローを起こすことがある。
【０００６】
こうした受信バッファのオーバーフローが生じると、電話機１０４において不連続な音声が再生されることになり、電話機１０１と電話機１０４との間の円滑な会話が害される。また、受信バッファのアンダーフローが生じた場合も、電話機１０４において通話音声が再生されずに無音区間が生じ、電話機１０１と電話機１０４との間の円滑な会話が害される。また、この場合、中継局１０６から電話機１０４への音声の送信が累積して遅延するという不具合が発生する。以下、かかる不具合について図面に基づいてさらに詳しく説明する。
【０００７】
図１４は、インターネット１０７の通信トラフィックの混雑量の時間推移の一例を表す図である。同図において、期間Ｐはインターネット１０７の通信トラフィックが通話の基準となる通常の混雑程度の期間を表し、期間Ｑは期間Ｐより混雑しているが比較的小さい混雑程度の期間を表す。また、期間Ｒは期間Ｐに比べ混雑が緩和されている期間を表し、期間Ｓは期間Ｑより更に大きい混雑程度の期間を表す。そして、期間Ｐでは中継局１０３から中継局１０６までに一つのＩＰパケットが伝送するのに必要な時間、すなわち通信所要時間は１００ｍｓであり、期間Ｑでは１１０ｍｓであるとする。また、期間Ｒでは通信所要時間は９０ｍｓであり、期間Ｓでは１３０ｍｓであるとする。
【０００８】
以下、同図に示すような通信トラフィックの時間推移を示すインターネット１０７に８つのＩＰパケット１〜８を３０ｍｓの一定間隔で送信する場合、すなわち、ＩＰパケット１及び２が中継局１０３と中継局１０６との間を期間Ｐに伝送され、ＩＰパケット３及び４が中継局１０３と中継局１０６との間を期間Ｑに伝送され、ＩＰパケット５及び６が中継局１０３と中継局１０６との間を期間Ｒに伝送され、そしてＩＰパケット７及び８が中継局１０３と中継局１０６との間を期間Ｓに伝送される場合について説明する。
【０００９】
図１５は、各期間Ｐ〜Ｓに伝送するＩＰパケット１〜８のインターネット１０７での通信所要時間と、それらのＩＰパケット１〜８が本来電話機１０４へ送信されるべき時刻からの遅延時間である送信遅延時間とを、対応づけて表す図である。同図に示すように、ＩＰパケット３〜６に含まれる音声は本来電話機１０４へ送信されるべき時刻からそれぞれ１０ｍｓ遅れて送信され、ＩＰパケット７及び８に含まれる音声は本来電話機１０４へ送信されるべき時刻からそれぞれ３０ｍｓ遅れて送信される。
【００１０】
以下、かかる送信遅延時間の発生原因について、図１６に基づいて説明する。図１６（ａ）は、電話機１０１から送信される２４０ｍｓの音声を示す図であり、図１６（ｂ）は、中継局１０３においてその音声が３０ｍｓ単位で音声フレームデータに符号化され、ＩＰパケット１〜８にパケット化される様子を示す図である。また、図１６（ｃ）は、中継局１０３から送信されるＩＰパケットが中継局１０６で受信される様子を示す図であり、図１６（ｄ）は、それらのＩＰパケットに含まれる音声フレームデータが再び音声に復号化され、電話機１０４に送信される様子を示す図である。
【００１１】
なお、ここでは、音声の符号化レートは４ｋｂｐｓ、すなわち１秒間の音声が４ｋｂｉｔに圧縮されるものとし、インターネット１０７の転送レート（転送速度、通信速度）は１２ｋｂｐｓとする。従って、インターネット１０７においては、各ＩＰパケット１〜８は元の通話音声時間に対し３分の１の時間に圧縮されて転送される。すなわち、電話機１０１から送信される音声のうちの３０ｍｓ分の音声が正味１０ｍｓ程度のデータに圧縮されて転送される。ここで「正味」とは、パケットのヘッダ情報、誤り検出、誤り訂正等による付加ビットを除外して考えていることを表す。
【００１２】
同図（ａ）及び（ｂ）によれば、まず、電話機１０１から０〜６０ｍｓに送信される音声は、それぞれ３０ｍｓごとに音声フレームデータに符号化された後、ＩＰパケット１及び２にパケット化され、３０ｍｓの間隔で中継局１０３から中継局１０６に送信される。そして、これらのＩＰパケット１及び２は期間Ｐにインターネット１０７に伝送されるため、同図（ｃ）に示すように、それぞれ１００ｍｓ後に中継局１０６に到着する。その後、同図（ｄ）に示すように、パケット１及び２に含まれる音声フレームデータは中継局１０６に到着すると直ちに元の音声に復号化されて電話機１０４に送信される。
【００１３】
また、同図（ａ）及び（ｂ）によれば、電話機１０１から６０〜１２０ｍｓに送信される音声は、それぞれ３０ｍｓごとに音声フレームデータに符号化された後、ＩＰパケット３及び４にパケット化され、３０ｍｓの間隔で中継局１０３から中継局１０６に送信される。そして、これらのＩＰパケット３及び４は、通信トラフィックの混雑が増した期間Ｑにインターネット１０７に伝送されるため、同図（ｃ）に示すように、それぞれ１１０ｍｓ後に中継局１０６に到着する。その後、同図（ｄ）に示すように、ＩＰパケット３及び４に含まれる音声フレームデータは中継局１０６に到着すると直ちに元の音声に復号化されて電話機１０４に送信される。
【００１４】
このとき、期間Ｑで最初に到着するＩＰパケット３は本来の時刻（ＩＰパケット２の中継局１０６への到着後３０ｍｓ後、すなわち１９０ｍｓ）よりも１０ｍｓ遅れて到着し、期間Ｐで最後に到着するＩＰパケット２との到着間隔は４０ｍｓに広がる。その結果、期間Ｑで最初に到着するＩＰパケット３に含まれる音声は、本来電話機１０４に送信されるべき時、すなわち期間Ｐで最後に到着するＩＰパケット２に含まれる音声の電話機１０４への送信終了の時から１０ｍｓ遅れて送信され、電話機１０４への送信音声には１０ｍｓの空白が生じることになる。そして、ＩＰパケット３に含まれる音声は本来電話機１０４に送信されるべき時刻から１０ｍｓ遅れて送信される。
【００１５】
また、期間Ｑで２番目に到着するＩＰパケット４は、通常通り、一つ前のＩＰパケット３と３０ｍｓの間隔を空けて到着するが、ＩＰパケット４に含まれる音声は直前に到着したＩＰパケット３に含まれる音声の電話機１０４への送信の後で続けて送信されるものであるため、期間Ｑで最初に到着するＩＰパケット３の遅延をそのまま引き継いで、本来送信されるべき時間から１０ｍｓ遅れて送信される。
【００１６】
さらに、同図（ａ）及び（ｂ）によれば、電話機１０１から１２０〜１８０ｍｓに送信される音声は、それぞれ３０ｍｓごとに音声フレームデータに符号化された後、ＩＰパケット５及び６にパケット化され、３０ｍｓの間隔で中継局１０３から中継局１０６に送信される。そして、これらのＩＰパケット５及び６は、通信トラフィックの混雑の緩和した期間Ｒにインターネット１０７に伝送されるため、同図（ｃ）に示すように、それぞれ９０ｍｓ後に中継局１０６に到着する。この結果、ＩＰパケット５が２４０ｍｓ、ＩＰパケット６が２７０ｍｓに中継局１０６に到着する。その後、同図（ｄ）に示すように、ＩＰパケット５及び６に含まれる音声フレームデータは中継局１０６に到着すると直ちに元の音声に復号化されて電話機１０４に送信される。
【００１７】
同図に示すように、こうしてインターネット１０７の通信トラフィックの混雑が緩和しても送信遅延時間が減少することはない。これは、直前のＩＰパケット４に含まれる音声の送信が終了する前に、次のＩＰパケット５が中継局１０６に到着したとしても直前のＩＰパケット４を送信した後でなければ送信されないことに起因する。
【００１８】
また、同図（ａ）及び（ｂ）によれば、電話機１０１から１８０〜２４０ｍｓに送信される音声は、それぞれ３０ｍｓごとに音声フレームデータに符号化された後、ＩＰパケット７及び８にパケット化され、３０ｍｓの間隔で中継局１０３から中継局１０６に送信される。そして、これらのＩＰパケット７及び８は、通信トラフィックの混雑が大きく増した期間Ｓにインターネット１０７に伝送されるため、同図（ｃ）に示すように、それぞれ１３０ｍｓ後に中継局１０６に到着する。この結果、ＩＰパケット７が３４０ｍｓ、ＩＰパケット８が３７０ｍｓに中継局１０６に到着する。その後、同図（ｄ）に示すように、ＩＰパケット７及び８に含まれる音声フレームデータは中継局１０６に到着すると直ちに元の音声に復号化されて電話機１０４に送信される。
【００１９】
この場合は、ＩＰパケット７に含まれる音声は、直前に中継局１０６に到着したＩＰパケット６に含まれる音声の電話機１０４への送信終了時から２０ｍｓ遅れて送信される。そのため、ＩＰパケット７及び８に含まれる音声は、ＩＰパケット３に含まれる音声の電話機１０４への送信時に発生した送信遅延時間１０ｍｓに、さらに２０ｍｓが加算され、本来電話機１０４に送信されるべき時刻から３０ｍｓ遅れて送信されることになる。
【００２０】
以上のようにして、ＩＰパケットに含まれる音声の電話機１０４への送信が、直前に中継局１０６に到着するＩＰパケットに含まれる音声の電話機１０４への送信終了時よりも遅れる場合には、それ以降に到着するＩＰパケットに含まれる音声についても、すべてその遅延を引き継ぐ。従って、通話時間が長時間に及べば及ぶほど、その間のインターネット１０７の混雑量の増減により円滑な通話を妨げる程に送信遅延が累積するという不具合が生じる。
【００２１】
本発明はかかる課題に鑑みてなされたものであって、パケット通信網を用いて音声接続を行う場合に、受話端末における再生音声の品質を維持することのできる通話システムを提供することを課題とする。
【００３０】
【課題を解決するための手段】
本発明は、送話端末から送信される音声情報に基づいて生成される音声フレームデータをパケット化し、パケット通信網を介して受話端末側に順次送信する第一の中継局と、該第一の中継局から送信されるパケットを受信し、該パケットに含まれる音声フレームデータに基づいて生成される音声情報を前記受話端末に順次送信する第二の中継局と、を含む通話システムであって、前記第二の中継局は、前記第一の中継局から受信するパケットに含まれるデータのうちの少なくとも音声フレームデータを一時的に蓄積するバッファメモリと、該バッファメモリに蓄積される音声フレームデータの数を計測する音声フレームデータ数計測手段と、前記バッファメモリのオーバーフローを検出するオーバーフロー検出手段と、前記オーバーフロー検出手段により前記バッファメモリから所定個数以上の音声フレームデータが連続してオーバーフローしたことが検出され、かつ、前記音声フレームデータ数計測手段により前記バッファメモリに蓄積される音声フレームデータの数が前記バッファメモリに蓄積可能な音声フレームデータの最大の数に達していると判断される場合に、前記バッファメモリに蓄積される音声フレームデータを所定基準に基づいて選択的に削除する選択削除手段と、を含むことを特徴とする。
【００３１】
本発明によれば、前記オーバーフロー検出手段により前記バッファメモリから所定個数以上の音声フレームデータが連続してオーバーフローしたことが検出された場合、すなわち、前記第二の中継局により連続して受信される所定個数のパケットに含まれる音声フレームデータが全て消失した場合で、さらに、前記音声フレームデータ数計測手段により前記バッファメモリに蓄積されるデータ量がバッファの記憶容量に達していると判断される場合には、所定個数を超えるパケットに含まれる音声フレームデータが連続して消失する虞があると判断し、前記バッファメモリに蓄積されるデータから所定基準に合致する音声フレームデータを選択的に削除する。ここで、所定基準とは、例えば、受信される音声フレームデータ中に音声振幅パワを表すデータが含まれる場合、音声振幅パワが小さい旨を表すデータを含み、発声時以外のものであると推定される音声フレームデータを削除する基準や、ほぼ同じデータが継続して現れ、母音部分と思われる部分の一部を削除する基準などを採用できる。また、受信される音声フレームデータがＣＥＬＰ（ Codebook-Excited Linear Prediction ）方式により圧縮されたものである場合には、コードブックのインデックスとして人の声以外、例えば、背景雑音を表すインデックスを含む音声フレームデータを削除する基準などを採用できる。
【００３２】
こうすれば、重要度の低いデータを予め選択して削除することにより、前記バッファメモリがオーバーフローして必要なデータが連続して消失することを防止することができる。この結果、受話端末に送信されるべき音声情報の一部分が大きくまとまって消失することを防止することができ、パケット通信網を用いて音声接続を行う場合にも受話端末における再生音声の品質を維持することができる。尚、前記送話端末及び前記受話端末は、それぞれ、一方向通信や双方向通信などにおける音声情報の送信側の端末及び受信側の端末を意味する。したがって、双方向通信においては発呼側の端末及び着呼側の端末に限定されるものではない。
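[0036]
[Embodiments of the Invention]
Hereinafter, an embodiment of the present invention will be described in detail with reference to the drawings.
[0037]
[Configuration]
FIG. 1 is a schematic diagram of the call system according to this embodiment. The configuration of the call system is described below with reference to the figure.
[0038]
Reference numeral 1 denotes a calling-side telephone connected to telephone network A, which is a kind of public network. Reference numeral 2 denotes an originating station connected to both telephone network A and Internet B; it includes a client unit 3 and a server unit 4. Reference numeral 5 denotes a terminating station connected to both Internet B and telephone network C; it includes a server unit 6 and a client unit 7. Reference numeral 8 denotes a called-side telephone connected to telephone network C. R1 is an originating-station-side router that determines the communication path on Internet B, and R2 is a terminating-station-side router that determines the communication path on Internet B.
[0039]
In this embodiment, telephone networks A and C, which are public networks, are shown as examples, but a CATV network or the Internet, which are dedicated networks, may be used instead. Likewise, B denotes the Internet, a dedicated network, but a telephone network, which is a public network, or a CATV network, which is a dedicated network, may be used instead.
[0040]
Each component of the call system is described in more detail below.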
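[0041]
First, client unit 3 of originating station 2 is described. FIG. 2 shows the configuration of client unit 3. Client unit 3 shown in the figure includes an NCU (Network Control Unit) 31 having automatic call origination and reception functions, an NCU control unit 32 having a control program that controls NCU 31, a communication interface control unit 33 that controls communication between client unit 3 and server unit 4 over a communication interface such as an RS232C interface, a CODEC 34 that encodes and decodes data such as voice, and a CODEC control unit 35 that controls CODEC 34.
[0042]
When telephone network A is a digital network (for example, ISDN), CODEC 34 compresses or expands data such as voice from one digital signal to another, in addition to encoding or decoding it.
[0043]
Next, server unit 4 of originating station 2 is described. FIG. 3 shows the configuration of server unit 4. Server unit 4 shown in the figure includes a communication interface control unit 41 that controls communication between client unit 3 and server unit 4 over a communication interface such as an RS232C interface, a packet communication unit 42 that has a buffer for temporarily storing call origination information, voice information, and the like and that packetizes this information and transfers it to the terminating side, and a network control unit 43 that controls communication with Internet B.
[0044]
Next, server unit 6 of terminating station 5 is described. FIG. 4 shows the configuration of server unit 6. Server unit 6 shown in the figure includes a network control unit 61 that controls communication with Internet B, a packet communication unit 62 that has a buffer for temporarily storing packets exchanged with originating station 2 and that exchanges voice information with the terminating terminal side, and a communication interface control unit 63 that controls communication between client unit 7 and server unit 6 over a communication interface such as an RS232C interface.
[0045]
Finally, client unit 7 of terminating station 5 is described. FIG. 5 shows the configuration of client unit 7. Client unit 7 shown in the figure includes a communication interface control unit 71 that controls communication between client unit 7 and server unit 6 over a communication interface such as an RS232C interface, an NCU 72 having automatic call origination and reception functions, an NCU control unit 73 having a control program that controls NCU 72, a CODEC 74 that encodes and decodes data such as voice, and a CODEC control unit 75 that controls CODEC 74.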
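[0046]
In this call system, to improve voice quality, packet communication units 42 and 62 are given a function for managing the reception buffer. The configuration for this is the same in the calling-side packet communication unit 42 and the called-side packet communication unit 62, so packet communication unit 62 in server unit 6 of terminating station 5 is described in detail here.
[0047]
FIG. 6 shows the configuration of packet communication unit 62. As its transmission system, packet communication unit 62 shown in the figure includes a packet packing unit 621 that packs encoded voice data into IP packets, and a transmission buffer unit 622 for sending the IP packets output by packet packing unit 621 to Internet B sequentially at predetermined intervals. As its reception system, packet communication unit 62 includes a reception buffer unit 623 that temporarily stores IP packets received from Internet B, a packet unpacking unit 624 that unpacks IP packets read from reception buffer unit 623, and a reception buffer management unit 625 that monitors the input and output of reception buffer unit 623 and manages the data stored in it. Reception buffer unit 623 includes counters 623a and 623b, a buffer memory 623c, and a switch 623d. Reception buffer management unit 625 includes a packet generation/insertion unit 625a, a packet search/deletion unit 625b, a packet unpacking start control unit 625c, and a packet storage count measuring unit 625d.
[0048]
Counter 623a sends a count signal to packet storage count measuring unit 625d of reception buffer management unit 625 each time an IP packet is received from Internet B. Counter 623b sends a count signal to packet storage count measuring unit 625d each time an IP packet is read from buffer memory 623c. Buffer memory 623c is a first-in, first-out buffer memory that can store up to four IP packets, and it temporarily stores the IP packets received from originating station 2. Switch 623d is turned on or off by a timing signal from packet unpacking start control unit 625c; it is an output limiting means for buffer memory 623c that prohibits reading from buffer memory 623c when on and permits reading when off. Packet storage count measuring unit 625d measures the number of IP packets stored in buffer memory 623c based on the count signal input from counter 623a and the count signal input from counter 623b. Packet generation/insertion unit 625a inserts an IP packet representing 30 ms of silence into buffer memory 623c when packet storage count measuring unit 625d measures the number of stored IP packets as 0. Packet search/deletion unit 625b, when packet storage count measuring unit 625d measures that the IP packets stored in buffer memory 623c amount to 75% of its capacity or more, that is, three or more, judges that buffer memory 623c is on the verge of overflow, searches buffer memory 623c for the IP packet representing the voice with the smallest voice amplitude power, and deletes that IP packet. Packet unpacking start control unit 625c, on receiving a call start signal from NCU control unit 73, sends a timing signal to switch 623d to temporarily prohibit the reading of IP packets from buffer memory 623c. Then, when packet storage count measuring unit 625d measures that buffer memory 623c holds 50% of its capacity, that is, two IP packets, packet unpacking start control unit 625c sends a timing signal to switch 623d to lift the prohibition on reading IP packets from buffer memory 623c.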
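The interplay of the two counters, the FIFO memory, and the read switch can be illustrated with a minimal Python sketch. The class name, method names, and the separate `overflowed` tally are hypothetical and chosen only for illustration; the capacity of four packets and the release threshold of two packets follow the values given in the description.

```python
from collections import deque


class ReceiveBuffer:
    """Illustrative model of reception buffer unit 623 (names are hypothetical)."""

    def __init__(self, capacity=4, initial_fill=2):
        self.capacity = capacity          # maximum packets in buffer memory 623c
        self.initial_fill = initial_fill  # buffer initial value that releases the switch
        self.received = 0                 # role of counter 623a: packets written in
        self.read_out = 0                 # role of counter 623b: packets read out
        self.overflowed = 0               # packets lost because the memory was full
        self.memory = deque()             # FIFO buffer memory 623c
        self.read_inhibited = True        # switch 623d "on": reading prohibited

    @property
    def stored(self):
        # Role of measuring unit 625d: stored count derived from the two counters.
        return self.received - self.read_out

    def push(self, packet):
        if len(self.memory) >= self.capacity:
            self.overflowed += 1
            return False
        self.memory.append(packet)
        self.received += 1
        if self.read_inhibited and self.stored >= self.initial_fill:
            self.read_inhibited = False   # role of control unit 625c: permit reading
        return True

    def pop(self):
        if self.read_inhibited or not self.memory:
            return None
        self.read_out += 1
        return self.memory.popleft()
```

In this sketch, reading stays blocked until the second packet arrives, which mirrors the start-up behavior attributed to packet unpacking start control unit 625c; deletion and dummy insertion are deliberately left out.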
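[0049]
[Connection Operation]
Next, the connection operation between calling-side telephone 1 and called-side telephone 8 in the call system configured as described above is explained.
[0050]
First, when the caller takes calling-side telephone 1 off-hook and dials the telephone number of originating station 2, an exchange (not shown) in telephone network A sends a ringing signal to originating station 2. When NCU control unit 32 of client unit 3 receives the ringing signal via NCU 31, calling-side telephone 1 and originating station 2 are connected through telephone network A.
[0051]
Next, when the caller dials the other party's telephone number from calling-side telephone 1, the number is sent to server unit 4 via telephone network A and NCU 31, NCU control unit 32, and communication interface control unit 33 of client unit 3. The telephone number data is sent through communication interface control unit 41 to packet communication unit 42, where it is packetized. The packetized telephone number data is sent from network control unit 43 to router R1 and on to terminating station 5 via Internet B and router R2.
[0052]
At terminating station 5, the received telephone number data is passed from network control unit 61 of server unit 6 to packet communication unit 62. Packet communication unit 62 unpacks the packetized telephone number data and sends it to client unit 7 via communication interface control unit 63. In client unit 7, the telephone number is sent to NCU control unit 73 through communication interface control unit 71. NCU control unit 73 then automatically dials the received number from NCU 72 to the exchange (not shown) of telephone network C. When the other party (the recipient) takes the called-side telephone off-hook, a response signal is sent to NCU control unit 73 of terminating station 5 and NCU control unit 32 of originating station 2, and NCU control units 73 and 32 each send a call start signal to CODEC control units 75 and 35 and to packet communication units 42 and 62. The voice connection through the CODECs, under CODEC control unit 35 of originating station 2 and CODEC control unit 75 of terminating station 5, thus begins.
[0053]
Once the voice connection has started, the caller's voice is sent to packet communication unit 42 via telephone network A, CODEC 34 of client unit 3 of originating station 2, communication interface control unit 33, and communication interface control unit 41. Packet communication unit 42 converts each 30 ms of voice into one packet, and the packet is sent to terminating station 5 via network control unit 43, router R1, Internet B, and router R2. When the voice is encoded by CODEC 34, CODEC control unit 35 attaches the in-call signal "SPEECH" to distinguish it from signals exchanged during connection setup. In this way, one packet is sent from originating station 2 to terminating station 5 every 30 ms.
[0054]
The voice data is then sent to called-side telephone 8 via network control unit 61 and packet communication unit 62 of server unit 6 of terminating station 5, communication interface control units 63 and 71, CODEC control unit 75, CODEC 74, and telephone network C. The voice of the other party (the recipient) is transmitted in the same way in the reverse direction.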
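[0055]
[Reception Buffer Control Operation]
In this call system, when the voice connection starts, control of the reception buffer starts in both packet communication unit 42 in server unit 4 of originating station 2 and packet communication unit 62 in server unit 6 of terminating station 5. Both perform the same control processing, so the following describes the reception buffer control in packet communication unit 62.
[0056]
As described above, reception buffer management unit 625 of packet communication unit 62 includes packet generation/insertion unit 625a, packet search/deletion unit 625b, and packet unpacking start control unit 625c, each of which performs control processing on reception buffer unit 623. The operation of packet unpacking start control unit 625c is described first, with reference to FIG. 7.
[0057]
As shown in the figure, when packet unpacking start control unit 625c receives the call start signal from NCU control unit 73 as described above (S101), it sends a timing signal to switch 623d to temporarily stop the output of reception buffer unit 623 (S102). It then waits for IP packets tagged with the in-call signal "SPEECH" to accumulate in reception buffer unit 623, and when the packet storage count output by the packet storage count measuring means reaches a predetermined buffer initial value (S103), it sends a timing signal to switch 623d again to release the output stop of reception buffer unit 623 (S104). The buffer initial value may be set to, for example, about half of the maximum number of packets that reception buffer unit 623 can store. That is, if reception buffer unit 623 can store at most four IP packets, the buffer initial value may be set to about two.
[0058]
With this control, the call can start with as many packets as the buffer initial value already stored in reception buffer unit 623. As a result, as explained below, a temporary increase in the congestion of communication traffic on Internet B is absorbed and transmission delay of voice to called-side telephone 8 is avoided, so that the quality of the reproduced voice at called-side telephone 8 can be maintained when a voice connection is made over a packet communication network.
[0059]
For example, suppose that each packet corresponds to 30 ms of voice and that two such packets are stored in reception buffer unit 623. To send voice to called-side telephone 8 without interruption, packets should arrive at terminating station 5 with a period of 30 ms or less, but this period can vary with the congestion of communication traffic on Internet B.
[0060]
The effect in this case is explained with reference to FIG. 8. FIG. 8 shows, as an example, the initial state of a call when the capacity of buffer memory 623c is four IP packets and the buffer initial value is 2.
[0061]
First, consider the case in which the congestion of communication traffic on Internet B temporarily increases and the arrival of IP packets at reception buffer unit 623 is delayed. In this case, in the figure, the IP packets in buffer memory 623c are unpacked one after another by packet unpacking unit 624, and the buffer storage count moves into section A. That is, if two 30 ms packets are stored in buffer memory 623c, then even if no packet arrives for 60 ms from that point, the voice in the already stored packets is sent to called-side telephone 8 in sequence, so the voice information is reproduced at called-side telephone 8 without interruption. With this buffer initial value, reception buffer unit 623 can therefore absorb an IP packet arrival delay of up to 60 ms. However, if the buffer initial value is set relatively large, the time needed to accumulate that many packets appears as a delay in transmission to called-side telephone 8, so the communication time between calling-side telephone 1 and called-side telephone 8 increases by that amount. It is therefore preferable to set the buffer initial value so that the communication time between the calling-side and called-side telephones stays within several hundred milliseconds, which is said to be the limit for practical voice communication.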
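[0062]
Next, with reference to FIG. 9, the voice sent from terminating station 5 to called-side telephone 8 is described for the case in which the call system performing the above control operates under the same conditions as in the explanation based on FIGS. 14 to 16, both for the increase and decrease of congestion on Internet B and for the transmission of packets from originating station 2.
[0063]
FIG. 9(a) shows 240 ms of voice transmitted from calling-side telephone 1, and FIG. 9(b) shows how originating station 2 packetizes that voice into IP packets 1 to 8, one per 30 ms. FIG. 9(c) shows how the IP packets transmitted from originating station 2 are received at terminating station 5, and FIG. 9(d) shows how those IP packets are decoded back into voice and sent to called-side telephone 8.
[0064]
According to (a) and (b) of the figure, the voice transmitted from calling-side telephone 1 during 0 to 60 ms is first encoded into voice frame data every 30 ms, packetized into IP packets 1 and 2, and sent from originating station 2 to terminating station 5 at 30 ms intervals. Because IP packets 1 and 2 travel over Internet B during period P, each arrives at terminating station 5 after 100 ms, as shown in (c) of the figure. Then, as shown in (d) of the figure, under the control of reception buffer unit 623 by packet unpacking start control unit 625c, IP packet 1 is first stored in buffer memory 623c of reception buffer unit 623. At this point counter 623a reports to packet storage count measuring unit 625d that one IP packet has passed, and the packet storage count is measured as 1. Switch 623d remains in the state that prohibits reading of IP packets. When the next IP packet 2 is input to buffer memory 623c of reception buffer unit 623, counter 623a again reports to packet storage count measuring unit 625d that one IP packet has passed, and the packet storage count is measured as 2. In response, packet unpacking start control unit 625c sends a timing signal to switch 623d to lift the prohibition on reading IP packets. The voice frame data in IP packet 1 is then decoded into the original voice and sent to called-side telephone 8, followed by IP packet 2, which is likewise decoded and sent to called-side telephone 8.
[0065]
According to (a) and (b) of the figure, the voice transmitted from calling-side telephone 1 during 60 to 120 ms is encoded into voice frame data every 30 ms, packetized into IP packets 3 and 4, and sent from originating station 2 to terminating station 5 at 30 ms intervals. Because IP packets 3 and 4 travel over Internet B during period Q, when congestion has increased, each arrives at terminating station 5 after 110 ms, as shown in (c) of the figure. Then, as shown in (d) of the figure, IP packet 3 is temporarily stored in buffer memory 623c, and once the voice in the preceding IP packet 2 has been sent to called-side telephone 8, it is decoded into the original voice and sent to called-side telephone 8. IP packet 4 is likewise stored temporarily in buffer memory 623c and, once the voice in the preceding IP packet 3 has been sent, is decoded and sent to called-side telephone 8.
[0066]
In a call system without packet unpacking start control unit 625c, a transmission delay arose between IP packet 2 and IP packet 3 and then accumulated (see FIG. 16(d)). In the present call system, by contrast, the packets stored in buffer memory 623c absorb the temporary increase in congestion on Internet B, so the transmission delay of voice is avoided.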
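The effect of starting playback only after two packets are buffered can be checked numerically with a short Python sketch. The arrival times below are derived from the transit times of periods P, Q, R, and S under the assumption that packet n leaves the originating station at n x 30 ms; the function names are hypothetical. The schedule is a fixed one, so the sketch only detects whether any packet would arrive after its scheduled playback start, it does not model how a late packet would shift later ones.

```python
FRAME_MS = 30
# Arrival times of IP packets 1 to 8 at the terminating station (ms),
# assuming packet n is sent at n * 30 ms with transit times
# 100, 100, 110, 110, 90, 90, 130, 130 ms.
ARRIVALS_MS = [130, 160, 200, 230, 240, 270, 340, 370]


def prebuffered_schedule(arrivals, initial_fill=2, frame_ms=FRAME_MS):
    """Playback start times when reading begins once `initial_fill` packets are stored."""
    start = arrivals[initial_fill - 1]
    return [start + i * frame_ms for i in range(len(arrivals))]


def late_by(arrivals, schedule):
    """How many ms each packet arrives after its scheduled playback start (0 if on time)."""
    return [max(0, a - s) for a, s in zip(arrivals, schedule)]


schedule = prebuffered_schedule(ARRIVALS_MS)
print(schedule)                         # [160, 190, 220, 250, 280, 310, 340, 370]
print(late_by(ARRIVALS_MS, schedule))   # [0, 0, 0, 0, 0, 0, 0, 0]
```

Under these assumed timings every packet is available by its playback slot, at the cost of a fixed 30 ms of extra start-up latency compared with playing packet 1 on arrival.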
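[0067]
Next, the operation of packet search/deletion unit 625b is described with reference to FIG. 10.
[0068]
As shown in the figure, packet search/deletion unit 625b first determines whether the packet storage count in reception buffer unit 623, input from packet storage count measuring unit 625d, is 75% of full capacity or more, that is, three or more (S201). If three or more packets are stored, it judges that reception buffer unit 623 is on the verge of overflow, searches the packets stored in buffer memory 623c of reception buffer unit 623 for the packet containing the voice information with the smallest voice amplitude power (S202), and deletes that packet (S203). This processing continues until NCU control unit 73 sends a signal indicating the end of communication (S204).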
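Steps S201 to S203 can be sketched in Python as follows. The packet representation (a dict with `seq` and `power` keys) and the function name are hypothetical, introduced only for illustration; the 75% threshold and the capacity of four follow the description.

```python
def delete_quietest_if_nearly_full(packets, capacity=4, threshold=0.75):
    """Remove and return the lowest-power packet when the buffer is nearly full.

    `packets` is a list of dicts with hypothetical keys 'seq' and 'power'.
    Returns None when the stored count is below the threshold (S201 fails).
    """
    if len(packets) < capacity * threshold:          # S201: fewer than 3 of 4 stored
        return None
    quietest = min(range(len(packets)),              # S202: search for minimum power
                   key=lambda i: packets[i]["power"])
    return packets.pop(quietest)                     # S203: delete that packet


buffered = [
    {"seq": 1, "power": 0.9},
    {"seq": 2, "power": 0.1},
    {"seq": 3, "power": 0.5},
]
removed = delete_quietest_if_nearly_full(buffered)
print(removed["seq"], [p["seq"] for p in buffered])  # 2 [1, 3]
```

A real implementation would call this check repeatedly until the end-of-communication signal (S204); the loop is omitted here to keep the sketch short.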
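[0069]
With this control, when the congestion of communication traffic on Internet B eases, the packet storage count in buffer memory 623c moves into section B shown in FIG. 8, and buffer memory 623c comes to the verge of overflow, a packet is deleted from buffer memory 623c in advance so that the overflow is prevented. In addition, because this control selectively deletes the packet that has the least effect on voice quality among those stored in buffer memory 623c, it prevents packets containing important voice information from being unavoidably lost through overflow.
[0070]
Next, the operation of packet generation/insertion unit 625a is described with reference to FIG. 11.
[0071]
As shown in the figure, packet generation/insertion unit 625a first monitors whether the packet storage count in reception buffer unit 623, input from packet storage count measuring unit 625d, is 0 (S301), and if the packet storage count is 0, inserts a packet representing silence into buffer memory 623c (S302). The packet inserted into buffer memory 623c need not represent silence: the immediately preceding packet may be inserted again, or a packet representing an average voice generated from the voice information in earlier packets may be inserted.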
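Steps S301 and S302, including the alternative of repeating the previous packet, can be sketched in Python as below. The dict keys (`duration_ms`, `power`, `dummy`) and the `repeat_last` flag are hypothetical names used only for this illustration; the averaged-voice alternative is not modeled.

```python
def fill_on_underflow(packets, last_packet=None, repeat_last=False):
    """Insert a dummy packet when the buffer is empty; return True if one was inserted."""
    if packets:                                   # S301: stored count is not 0
        return False
    if repeat_last and last_packet is not None:
        # Alternative from the description: insert the preceding packet again.
        packets.append(dict(last_packet, dummy=True))
    else:
        # Default: a packet representing 30 ms of silence (S302).
        packets.append({"duration_ms": 30, "power": 0.0, "dummy": True})
    return True


queue = []
print(fill_on_underflow(queue), queue)
# True [{'duration_ms': 30, 'power': 0.0, 'dummy': True}]
```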
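[0072]
With this control, when the congestion of communication traffic on Internet B increases, the packet storage count in buffer memory 623c moves into section A shown in FIG. 8, and the interval between packets input to reception buffer unit 623 reaches 60 ms or more, processing at terminating station 5 can still proceed smoothly. In this case the buffer becomes empty (FIG. 8, section D) and packets containing the voice to be reproduced still have not arrived, so the voice information at called-side telephone 8 is interrupted. Under the above control, a packet representing dummy voice is supplied as the processing target of packet unpacking unit 624, the stage following reception buffer unit 623, which keeps processing smooth.
[0073]
As described above, by managing buffer memory 623c, the call system can solve the problems of overflow and underflow, as well as the problem of accumulating transmission delay of voice to calling-side telephone 1 and called-side telephone 8, that arise from the increase and decrease of congestion on the communication path between originating station 2 and terminating station 5, namely Internet B, and from the difficulty of keeping originating station 2 and terminating station 5 synchronized.
[0074]
The call system described above can be modified in various ways. For example, in the above description packet search/deletion unit 625b performs control that prevents overflow of buffer memory 623c, but a predetermined number of overflows may instead be tolerated, with reception buffer unit 623 controlled so as to avoid overflow only when the predetermined number of overflows or more occurs. In this case, packet storage count measuring unit 625d is further configured so that it can detect overflow of buffer memory 623c based on the count signals input from counter 623a and counter 623b.
[0075]
FIG. 12 is a flow diagram explaining a modification of the control operation of packet search/deletion unit 625b. As shown in the figure, packet search/deletion unit 625b first monitors, based on the signal from packet storage count measuring unit 625d indicating that buffer memory 623c has overflowed, whether a predetermined number of consecutive overflows has occurred in buffer memory 623c (S401). If the predetermined number of consecutive overflows has occurred, it next determines whether the buffer storage count input from packet storage count measuring unit 625d equals the maximum number of packets that buffer memory 623c can store (S402). If the predetermined number of consecutive overflows has occurred and the buffer storage count equals the maximum number of packets that buffer memory 623c can store, packet search/deletion unit 625b searches the packets stored in buffer memory 623c for the packet containing the voice information with the smallest voice amplitude power (S403) and deletes that packet (S404). This processing continues until NCU control unit 73 sends a signal indicating the end of communication (S405).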
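The modified flow of FIG. 12 (S401 to S404) can be sketched in Python as below. The class and method names, the packet dict keys, and the choice to reset the consecutive-overflow count after a deletion or a successful store are assumptions made for illustration; the description does not specify how the count is reset.

```python
class OverflowGuard:
    """Tolerate a few consecutive overflows, then free a slot (hypothetical sketch)."""

    def __init__(self, capacity=4, max_consecutive=2):
        self.capacity = capacity
        self.max_consecutive = max_consecutive  # the "predetermined number"
        self.consecutive = 0                    # consecutive overflows seen so far

    def store(self, packets, packet):
        """Try to store a packet; count an overflow if the buffer is full."""
        if len(packets) >= self.capacity:
            self.consecutive += 1
            return False
        packets.append(packet)
        self.consecutive = 0                    # assumption: a successful store resets the run
        return True

    def relieve(self, packets):
        """Delete the lowest-power packet if both conditions hold; else return None."""
        if self.consecutive < self.max_consecutive:   # S401
            return None
        if len(packets) != self.capacity:             # S402
            return None
        quietest = min(range(len(packets)),           # S403
                       key=lambda i: packets[i]["power"])
        self.consecutive = 0                          # assumption: reset after deletion
        return packets.pop(quietest)                  # S404
```

As a usage example, with `capacity=2` and `max_consecutive=2`, two stored packets followed by two rejected ones cause `relieve` to drop the quieter stored packet, after which the next arriving packet fits again.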
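[0076]
With this control, when the congestion of communication traffic on Internet B eases and buffer memory 623c overflows the predetermined number of times in succession (FIG. 8, section C), a packet is deleted from buffer memory 623c so that consecutive packet overflows beyond the predetermined number are prevented. As a result, a large contiguous portion of the voice to be sent to called-side telephone 8 is prevented from being lost. The above description concerns the case in which packet storage count measuring unit 625d detects a predetermined number of consecutive packet overflows, but the same effect is obtained when the trigger is the overflow of a single packet. In that case, the overflow of two consecutive packets is avoided.
[0077]
When buffer memory 623c overflows, the voice that must be sent to called-side telephone 8 is reduced by 30 ms, so the transmission delay to called-side telephone 8 improves by about 30 ms. In other words, an overflow of reception buffer unit 623 has the effect of reducing the transmission delay of voice.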
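[Brief Description of the Drawings]
[FIG. 1] A diagram showing the schematic configuration of the call system.
[FIG. 2] A diagram showing the configuration of the client unit of the originating station.
[FIG. 3] A diagram showing the configuration of the server unit of the originating station.
[FIG. 4] A diagram showing the configuration of the server unit of the terminating station.
[FIG. 5] A diagram showing the configuration of the client unit of the terminating station.
[FIG. 6] A diagram showing the configuration of the packet communication unit of the terminating station.
[FIG. 7] A flow diagram explaining the operation of the packet unpacking start control unit.
[FIG. 8] A diagram showing the stored contents of the buffer memory at the start of a call.
[FIG. 9] A diagram explaining the operation of the call system.
[FIG. 10] A flow diagram explaining the operation of the packet search/deletion unit.
[FIG. 11] A flow diagram explaining the operation of the packet generation/insertion unit.
[FIG. 12] A flow diagram explaining the operation of a modification of the packet search/deletion unit.
[FIG. 13] A diagram explaining the currently proposed call connection between telephones using the Internet.
[FIG. 14] A diagram showing an example of the change over time in the congestion of communication traffic on the Internet.
[FIG. 15] A diagram explaining the operation of the currently proposed call connection between telephones using the Internet.
[FIG. 16] A diagram explaining the operation of the currently proposed call connection between telephones using the Internet.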
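[Explanation of Reference Signs]
B: Internet; 1: calling-side telephone (transmitting terminal, receiving terminal); 2: originating station (relay station); 5: terminating station (relay station); 8: called-side telephone (receiving terminal, transmitting terminal); 623: reception buffer unit; 623a, 623b: counters (overflow determination means, voice frame data count measuring means); 623c: buffer memory; 623d: switch (buffer memory output limiting means); 625: reception buffer management unit; 625a: packet generation/insertion unit (dummy voice insertion means); 625b: packet search/deletion unit (selective deletion means); 625c: packet unpacking start control unit (buffer memory output limiting means); 625d: packet storage count measuring unit (overflow determination means, voice frame data count measuring means).
[0001]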
[Technical Field of the Invention]
The present invention relates to a call system, and more particularly to a technique for maintaining voice quality in a call system that transmits and receives voice information via a packet communication network such as the Internet.
[0002]
[Background]
In recent years, call systems have appeared in which a computer such as a personal computer with a voice mail function is connected to a public network such as a telephone network or ISDN (Integrated Services Digital Network), so that it can exchange voice over the Internet with another computer likewise connected to a public network. Such call systems are becoming widespread because, particularly for overseas voice communication, the only public line charge is the local call charge from the user terminal to the connection provider, which makes calls cheaper than using the international telephone network.
[0003]
A call system has also been proposed that uses the Internet for call connection between telephones without using a computer such as a personal computer. FIG. 13 shows an example in which the currently proposed call system is applied to call connection between telephones. In the call system shown in the figure, telephone 101 is connected to relay station 103 via public telephone network 102, while telephone 104 is connected to relay station 106 via public telephone network 105. Relay station 103 and relay station 106 are connected for communication via the Internet 107.
[0004]
In the call system shown in the figure, the voice transmitted from the telephone 101 is encoded by the relay station 103, and is packetized for each code representing, for example, 30 ms of voice. Then, IP packets are transmitted to the relay station 106 via the Internet 107 at intervals of 30 ms and temporarily stored in a reception buffer (not shown) provided in the relay station 106. Thereafter, the IP packet is unpacked and decoded to the original voice, and the voice transmitted from the telephone set 101 is reproduced by the telephone set 104.
[0005]
[Problems to be solved by the invention]
However, although communication over the Internet 107 is suitable for discrete data, it is not suitable for communication in which continuous data, such as voice or image data in telephony or broadcasting, is reproduced immediately on the receiving side. In the above call system, when the congestion of communication traffic on the Internet 107 increases or decreases, or when synchronization between relay station 103 and relay station 106 is not accurately maintained, the reception buffer provided in relay station 106 may overflow or underflow.
[0006]
When the reception buffer overflows, discontinuous voice is reproduced at telephone 104, and smooth conversation between telephone 101 and telephone 104 is impaired. When the reception buffer underflows, the call voice is not reproduced at telephone 104 and a silent interval occurs, which also impairs smooth conversation between telephone 101 and telephone 104. In this case, moreover, the delay in transmitting voice from relay station 106 to telephone 104 accumulates. This problem is described in more detail below with reference to the drawings.
[0007]
FIG. 14 shows an example of how the congestion of communication traffic on the Internet 107 changes over time. In the figure, period P is a period of normal congestion, which serves as the reference for the call, and period Q is a period that is more congested than period P, but only slightly. Period R is a period in which congestion has eased compared with period P, and period S is a period that is even more congested than period Q. In period P, the time required for one IP packet to travel from relay station 103 to relay station 106, that is, the communication time, is 100 ms; in period Q it is 110 ms. In period R the communication time is 90 ms, and in period S it is 130 ms.
[0008]
The following describes the case in which eight IP packets 1 to 8 are sent at constant 30 ms intervals over the Internet 107 whose communication traffic changes as shown in the figure: IP packets 1 and 2 travel between relay station 103 and relay station 106 in period P, IP packets 3 and 4 in period Q, IP packets 5 and 6 in period R, and IP packets 7 and 8 in period S.
[0009]
FIG. 15 shows, for IP packets 1 to 8 transmitted in periods P to S, the communication time over the Internet 107 together with the transmission delay time, which is the delay from the time at which each packet should originally have been sent to telephone 104. As shown in the figure, the voice in IP packets 3 to 6 is sent 10 ms later than the time at which it should originally have been sent to telephone 104, and the voice in IP packets 7 and 8 is sent 30 ms later than that time.
[0010]
The cause of this transmission delay time is explained below with reference to FIG. 16. FIG. 16(a) shows 240 ms of voice transmitted from telephone 101, and FIG. 16(b) shows how relay station 103 encodes that voice into voice frame data in 30 ms units and packetizes it into IP packets 1 to 8. FIG. 16(c) shows how the IP packets transmitted from relay station 103 are received at relay station 106, and FIG. 16(d) shows how the voice frame data in those IP packets is decoded back into voice and sent to telephone 104.
[0011]
Here, the voice encoding rate is assumed to be 4 kbps, that is, one second of voice is compressed to 4 kbit, and the transfer rate (transfer speed, communication speed) of the Internet 107 is assumed to be 12 kbps. On the Internet 107, therefore, each of IP packets 1 to 8 is transferred in one third of the duration of the original call voice. That is, 30 ms of the voice transmitted from telephone 101 is compressed into data that takes a net 10 ms or so to transfer. Here, "net" means that the additional bits for packet header information, error detection, error correction, and the like are excluded.
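The "net 10 ms" figure follows directly from the two assumed rates. The short Python sketch below reproduces the arithmetic; the function and constant names are hypothetical, and header and error-control bits are ignored, as in the text.

```python
CODEC_RATE_BPS = 4_000   # voice encoding rate: 4 kbps
LINK_RATE_BPS = 12_000   # assumed transfer rate of the Internet path: 12 kbps
FRAME_MS = 30            # voice duration carried by one packet


def frame_bits(codec_rate_bps, frame_ms):
    """Payload bits produced by encoding `frame_ms` of voice."""
    return codec_rate_bps * frame_ms // 1000


def net_transfer_ms(bits, link_rate_bps):
    """Time to transfer `bits` at `link_rate_bps`, excluding headers and check bits."""
    return bits * 1000 / link_rate_bps


bits = frame_bits(CODEC_RATE_BPS, FRAME_MS)
print(bits)                                  # 120
print(net_transfer_ms(bits, LINK_RATE_BPS))  # 10.0
```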
[0012]
According to (a) and (b) of FIG. 16, the voice transmitted from telephone 101 during 0 to 60 ms is first encoded into voice frame data every 30 ms, packetized into IP packets 1 and 2, and sent from relay station 103 to relay station 106 at 30 ms intervals. Because IP packets 1 and 2 travel over the Internet 107 in period P, each arrives at relay station 106 after 100 ms, as shown in (c) of the figure. Then, as shown in (d) of the figure, the voice frame data in packets 1 and 2 is decoded into the original voice as soon as it arrives at relay station 106 and is sent to telephone 104.
[0013]
According to (a) and (b) of the figure, the voice transmitted from telephone 101 during 60 to 120 ms is encoded into voice frame data every 30 ms, packetized into IP packets 3 and 4, and sent from relay station 103 to relay station 106 at 30 ms intervals. Because IP packets 3 and 4 travel over the Internet 107 in period Q, when congestion has increased, each arrives at relay station 106 after 110 ms, as shown in (c) of the figure. Then, as shown in (d) of the figure, the voice frame data in IP packets 3 and 4 is decoded into the original voice as soon as it arrives at relay station 106 and is sent to telephone 104.
[0014]
IP packet 3, the first to arrive in period Q, arrives 10 ms later than its original time (30 ms after IP packet 2 arrives at relay station 106, that is, at 190 ms), so the arrival interval between it and IP packet 2, the last to arrive in period P, widens to 40 ms. As a result, the voice in IP packet 3 is sent 10 ms later than the time at which it should originally have been sent to telephone 104, namely the time at which transmission of the voice in IP packet 2 to telephone 104 ends, and a 10 ms gap appears in the voice sent to telephone 104. The voice in IP packet 3 is thus sent 10 ms later than the time at which it should originally have been sent to telephone 104.
[0015]
IP packet 4, the second to arrive in period Q, arrives at the usual 30 ms interval after the preceding IP packet 3. However, the voice in IP packet 4 is sent immediately after the voice in IP packet 3 has been sent to telephone 104, so it inherits the delay of IP packet 3 unchanged and is sent 10 ms later than the time at which it should originally have been sent.
[0016]
According to (a) and (b) of the figure, the voice transmitted from telephone 101 during 120 to 180 ms is encoded into voice frame data every 30 ms, packetized into IP packets 5 and 6, and sent from relay station 103 to relay station 106 at 30 ms intervals. Because IP packets 5 and 6 travel over the Internet 107 in period R, when congestion has eased, each arrives at relay station 106 after 90 ms, as shown in (c) of the figure. As a result, IP packet 5 arrives at relay station 106 at 240 ms and IP packet 6 at 270 ms. Then, as shown in (d) of the figure, the voice frame data in IP packets 5 and 6 is decoded into the original voice as soon as it arrives at relay station 106 and is sent to telephone 104.
[0017]
As the figure shows, the transmission delay time does not decrease even when congestion on the Internet 107 eases in this way. This is because, even if the next IP packet 5 arrives at relay station 106 before transmission of the voice in the preceding IP packet 4 has finished, it cannot be sent until after the preceding IP packet 4 has been sent.
[0018]
According to (a) and (b) of the figure, the voice transmitted from telephone 101 during 180 to 240 ms is encoded into voice frame data every 30 ms, packetized into IP packets 7 and 8, and sent from relay station 103 to relay station 106 at 30 ms intervals. Because IP packets 7 and 8 travel over the Internet 107 in period S, when congestion has greatly increased, each arrives at relay station 106 after 130 ms, as shown in (c) of the figure. As a result, IP packet 7 arrives at relay station 106 at 340 ms and IP packet 8 at 370 ms. Then, as shown in (d) of the figure, the voice frame data in IP packets 7 and 8 is decoded into the original voice as soon as it arrives at relay station 106 and is sent to telephone 104.
[0019]
In this case, the voice in IP packet 7 is sent 20 ms after the end of transmission to telephone 104 of the voice in IP packet 6, which arrived at relay station 106 immediately before it. The voice in IP packets 7 and 8 therefore has a further 20 ms added to the 10 ms transmission delay that arose when the voice in IP packet 3 was sent to telephone 104, and is sent 30 ms later than the time at which it should originally have been sent to telephone 104.
[0020]
As described above, when transmission to telephone 104 of the voice in an IP packet starts later than the end of transmission of the voice in the IP packet that arrived at relay station 106 immediately before it, the voice in all IP packets arriving afterwards inherits that delay. Consequently, the longer a call lasts, the more the transmission delay accumulates through increases and decreases in congestion on the Internet 107, to the point where it hinders smooth conversation.
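The delay figures in the walkthrough (0, 0, 10, 10, 10, 10, 30, 30 ms for IP packets 1 to 8) can be reproduced with a small Python simulation of the unbuffered relay station. This is a sketch under stated assumptions: packet n is taken to leave the sender at n x 30 ms, which matches the arrival times of 240, 270, 340, and 370 ms quoted for packets 5 to 8, and all names are hypothetical.

```python
FRAME_MS = 30
# Transit time per packet: periods P, P, Q, Q, R, R, S, S.
TRANSIT_MS = [100, 100, 110, 110, 90, 90, 130, 130]


def arrival_times(transit_ms, frame_ms=FRAME_MS):
    """Arrival time of each packet, assuming packet n is sent at n * frame_ms."""
    return [(i + 1) * frame_ms + t for i, t in enumerate(transit_ms)]


def playout_delays(arrivals, frame_ms=FRAME_MS):
    """Delay of each packet's playback start relative to a gapless schedule."""
    ideal_start = arrivals[0]   # playback of packet 1 starts on arrival
    free_at = arrivals[0]       # time at which the previous packet finishes playing
    delays = []
    for i, arrival in enumerate(arrivals):
        start = max(arrival, free_at)   # cannot play before arrival or before the line is free
        delays.append(start - (ideal_start + i * frame_ms))
        free_at = start + frame_ms
    return delays


arrivals = arrival_times(TRANSIT_MS)
print(arrivals)                  # [130, 160, 200, 230, 240, 270, 340, 370]
print(playout_delays(arrivals))  # [0, 0, 10, 10, 10, 10, 30, 30]
```

The `max(arrival, free_at)` step is what makes the delay one-directional: a late packet pushes everything after it back, while an early packet simply waits, so the delay never shrinks.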
[0021]
The present invention has been made in view of these problems, and its object is to provide a call system that can maintain the quality of the reproduced voice at the receiving terminal when a voice connection is made over a packet communication network.
[0030]
[Means for Solving the Problems]
The present invention is a call system including a first relay station that packetizes voice frame data generated from voice information transmitted by a transmitting terminal and sequentially transmits the packets toward a receiving terminal via a packet communication network, and a second relay station that receives the packets transmitted from the first relay station and sequentially transmits, to the receiving terminal, voice information generated from the voice frame data contained in the packets. The second relay station includes: a buffer memory that temporarily stores at least the voice frame data among the data contained in the packets received from the first relay station; voice frame data count measuring means for measuring the number of voice frame data items stored in the buffer memory; overflow detecting means for detecting overflow of the buffer memory; and selective deletion means for selectively deleting, based on a predetermined criterion, voice frame data stored in the buffer memory when the overflow detecting means detects that a predetermined number or more of voice frame data items have overflowed from the buffer memory consecutively and the voice frame data count measuring means determines that the number of voice frame data items stored in the buffer memory has reached the maximum number that the buffer memory can store.
[0031]
According to the present invention, when the overflow detecting means detects that a predetermined number or more of voice frame data have continuously overflowed from the buffer memory, that is, when all the voice frame data included in a predetermined number of packets continuously received by the second relay station have been lost, and when the voice frame data number measuring means further determines that the amount of data stored in the buffer memory has reached the storage capacity of the buffer memory, it is judged that voice frame data included in more than the predetermined number of packets may be lost continuously, and voice frame data meeting a predetermined criterion is selectively deleted from the data stored in the buffer memory. Here, when the received voice frame data includes data representing voice amplitude power, the predetermined criterion may be, for example, a criterion of deleting voice frame data that includes data indicating small voice amplitude power and is therefore presumed not to be part of an utterance, or a criterion of deleting part of a section presumed to be a vowel because nearly identical data appears in succession. When the received voice frame data is compressed by a method such as CELP (Codebook-Excited Linear Prediction), a criterion of deleting voice frame data that includes a codebook index for something other than human voice, for example an index representing background noise, may be adopted.
[0032]
In this way, by selecting and deleting data of low importance in advance, it is possible to prevent the buffer memory from overflowing and necessary data from being lost continuously. As a result, even when voice connection is performed using a packet communication network, it is possible to prevent a large part of the voice information to be transmitted to the receiving terminal from being lost, and to maintain the quality of the reproduced voice at the receiving terminal. The transmitting terminal and the receiving terminal mean the terminal on the transmission side and the terminal on the reception side of voice information, respectively, in one-way communication and in two-way communication. Therefore, in two-way communication they are not limited to the calling terminal and the called terminal, respectively.
[0036]
DETAILED DESCRIPTION OF THE INVENTION
Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.
[0037]
[Constitution]
FIG. 1 is a schematic diagram showing a call system according to the present embodiment. Hereinafter, the configuration of the call system will be described with reference to FIG.
[0038]
Reference numeral 1 denotes a telephone on the calling side connected to a telephone network A, which is a kind of public network. Reference numeral 2 denotes a transmitting station connected to both the telephone network A and the Internet B, and it includes a client unit 3 and a server unit 4 connected to each other. Reference numeral 5 denotes a receiving station connected to both the Internet B and the telephone network C, and it includes a server unit 6 and a client unit 7. Reference numeral 8 denotes a telephone on the called side connected to the telephone network C. R1 is a transmitting station side router that determines a communication path in the Internet B, and R2 is a receiving station side router that determines a communication path in the Internet B.
[0039]
In this embodiment, the telephone network A and the telephone network C, which are public networks, are shown as an example, but a CATV network or the Internet, which is a dedicated network, may be used instead. Further, B indicates the Internet which is a dedicated network, but instead of this, a telephone network which is a public network or a CATV network which is a dedicated network may be used.
[0040]
Hereinafter, each component of the call system will be described in more detail.
[0041]
First, the client unit 3 of the transmitting station 2 will be described. FIG. 2 is a diagram showing the configuration of the client unit 3. The client unit 3 shown in the figure includes an NCU (Network Control Unit) 31 having an automatic transmission/reception function, an NCU control unit 32 having a control program for controlling the NCU 31, a communication interface control unit 33 that controls communication between the client unit 3 and the server unit 4 using a communication interface such as an RS232C interface, a CODEC 34 for encoding and decoding data such as voice, and a CODEC control unit 35 for controlling the CODEC 34.
[0042]
When the telephone network A is a digital network (for example, ISDN), the CODEC 34 compresses or expands data such as voice from a digital signal to a digital signal and encodes or decodes the data.
[0043]
Next, the server unit 4 of the transmitting station 2 will be described. FIG. 3 is a diagram showing the configuration of the server unit 4. The server unit 4 shown in the figure includes a communication interface control unit 41 that controls communication between the client unit 3 and the server unit 4 using a communication interface such as an RS232C interface, a packet communication unit 42 that has a buffer for temporarily storing transmission information, voice information, and the like, and that packetizes the voice information and transfers it to the called side, and a network control unit 43 that controls communication with the Internet B.
[0044]
Next, the server unit 6 of the receiving station 5 will be described. FIG. 4 is a diagram showing the configuration of the server unit 6. The server unit 6 shown in the figure includes a network control unit 61 that controls communication with the Internet B, a packet communication unit 62 that has a buffer for temporarily storing packets exchanged with the transmitting station 2 and that exchanges voice information with the receiving terminal side, and a communication interface control unit 63 that controls communication between the client unit 7 and the server unit 6 using a communication interface such as an RS232C interface.
[0045]
Further, the client unit 7 of the receiving station 5 will be described. FIG. 5 is a diagram showing the configuration of the client unit 7. The client unit 7 shown in the figure includes a communication interface control unit 71 that controls communication between the client unit 7 and the server unit 6 using a communication interface such as an RS232C interface, an NCU 72 having an automatic transmission/reception function, an NCU control unit 73 having a control program for controlling the NCU 72, a CODEC 74 for encoding and decoding data such as voice, and a CODEC control unit 75 for controlling the CODEC 74.
[0046]
Here, in this call system, a function for managing a reception buffer is added to the packet communication units 42 and 62 in order to improve the voice quality. Since the configuration for this is the same in the packet communication unit 42 on the calling side and the packet communication unit 62 on the called side, the packet communication unit 62 included in the server unit 6 of the receiving station 5 will be described in detail here.
[0047]
FIG. 6 is a diagram showing the configuration of the packet communication unit 62. The packet communication unit 62 shown in the figure includes, as a transmission system, a packet packing unit 621 and a transmission buffer unit 622 that sequentially sends the IP packets output from the packet packing unit 621 to the Internet B at predetermined intervals. Further, the packet communication unit 62 includes, as a reception system, a reception buffer unit 623 that temporarily stores IP packets received from the Internet B, a packet unpacking unit 624 that unpacks IP packets read from the reception buffer unit 623, and a reception buffer management unit 625 that monitors the input and output of the reception buffer unit 623 and manages the data accumulated in the reception buffer unit 623. Here, the reception buffer unit 623 includes counters 623a and 623b, a buffer memory 623c, and a switch 623d. The reception buffer management unit 625 includes a packet generation/insertion unit 625a, a packet search/deletion unit 625b, a packet unpacking start control unit 625c, and a packet accumulation number measuring unit 625d.
[0048]
Each time an IP packet is received from the Internet B, the counter 623a transmits a count signal to the packet accumulation number measuring unit 625d of the reception buffer management unit 625. The counter 623b transmits a count signal to the packet accumulation number measuring unit 625d each time an IP packet is read from the buffer memory 623c. The buffer memory 623c can store a maximum of four IP packets in a first-in first-out manner and temporarily stores the IP packets received from the transmitting station 2. The switch 623d is means that is turned on or off by a timing signal from the packet unpacking start control unit 625c; when the switch 623d is turned on, reading from the buffer memory 623c is prohibited, and when it is turned off, reading from the buffer memory 623c is permitted. The packet accumulation number measuring unit 625d is means for measuring the number of IP packets accumulated in the buffer memory 623c based on the count signals input from the counters 623a and 623b. The packet generation/insertion unit 625a inserts an IP packet representing 30 ms of silence into the buffer memory 623c when the packet accumulation number measuring unit 625d measures that the number of IP packets accumulated in the buffer memory 623c is zero. When the packet accumulation number measuring unit 625d measures that the number of IP packets accumulated in the buffer memory 623c is 75% or more of its capacity, that is, three or more, the packet search/deletion unit 625b determines that the buffer memory 623c is on the verge of overflow, searches the buffer memory 623c for the IP packet representing the voice with the smallest voice amplitude power, and deletes that IP packet.
When a call start signal is input from the NCU control unit 73, the packet unpacking start control unit 625c transmits a timing signal to the switch 623d to prohibit reading of IP packets from the buffer memory 623c. When the packet accumulation number measuring unit 625d then measures that the accumulation has reached 50% of the capacity of the buffer memory 623c, that is, that two IP packets are accumulated in the buffer memory 623c, the packet unpacking start control unit 625c transmits a timing signal to the switch 623d to release the prohibition on reading IP packets from the buffer memory 623c.
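The bookkeeping performed by the counters 623a and 623b and the packet accumulation number measuring unit 625d can be sketched roughly as follows. This is only an illustrative Python sketch, not the implementation of the embodiment: the class name `AccumulationMeter` and its method names are hypothetical, and the default capacity of four packets is inferred from the "75%, that is, three" and "50%, that is, two" figures given above.

```python
class AccumulationMeter:
    """Derives the stored-packet count from two event counters."""

    def __init__(self, capacity=4):
        self.capacity = capacity
        self.arrived = 0   # count signals from the input side (counter 623a)
        self.read = 0      # count signals from the output side (counter 623b)

    def on_arrival(self):
        self.arrived += 1

    def on_read(self):
        self.read += 1

    def accumulated(self):
        # Packets still stored = packets written minus packets read out.
        return self.arrived - self.read

    def near_overflow(self):
        # True at 75% of capacity or more (three of four packets).
        return self.accumulated() * 4 >= self.capacity * 3

    def read_may_start(self):
        # True at 50% of capacity or more (two of four packets).
        return self.accumulated() * 2 >= self.capacity
```

Integer comparisons are used for the percentage thresholds so that no rounding question arises for other capacities.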
[0049]
[Connection operation]
Next, a connection operation between the calling side telephone 1 and the called side telephone 8 in the call system having the above-described configuration will be described.
[0050]
First, when the caller takes the calling side telephone 1 off-hook and dials the telephone number of the transmitting station 2, a call signal is sent to the transmitting station 2 by an exchange (not shown) in the telephone network A. When the NCU control unit 32 of the client unit 3 receives the call signal via the NCU 31, the calling side telephone 1 and the transmitting station 2 are connected by the telephone network A.
[0051]
Next, when the caller dials the telephone number of the other party from the calling side telephone 1, the telephone number is transmitted to the server unit 4 via the telephone network A and the NCU 31, the NCU control unit 32, and the communication interface control unit 33 of the client unit 3. The telephone number data of the other party is passed to the packet communication unit 42 via the communication interface control unit 41 and is packetized in the packet communication unit 42. The packetized telephone number data is then transmitted from the network control unit 43 to the router R1 and delivered to the receiving station 5 via the Internet B and the router R2.
[0052]
In the receiving station 5 that has received the telephone number data of the other party, the data is passed from the network control unit 61 of the server unit 6 to the packet communication unit 62. The packet communication unit 62 unpacks the packetized telephone number data and transmits it to the client unit 7 via the communication interface control unit 63. The client unit 7 passes the telephone number of the other party to the NCU control unit 73 via the communication interface control unit 71. The NCU control unit 73 then automatically dials the received telephone number from the NCU 72 to an exchange (not shown) of the telephone network C. When the other party (recipient) takes the called telephone 8 off-hook, a response signal is transmitted to the NCU control unit 73 of the receiving station 5 and the NCU control unit 32 of the transmitting station 2, and the NCU control units 73 and 32 send call start signals to the CODEC control units 75 and 35 and the packet communication units 62 and 42, respectively. Thus, the voice connection between the CODEC control unit 35 of the transmitting station 2 and the CODEC control unit 75 of the receiving station 5 is started.
[0053]
When the voice connection is started, the caller's voice is transmitted to the packet communication unit 42 via the telephone network A, the CODEC 34 of the client unit 3 of the transmitting station 2, the communication interface control unit 33, and the communication interface control unit 41. The packet communication unit 42 converts each 30 ms of voice into one packet. The packet is then transmitted to the receiving station 5 via the network control unit 43, the router R1, the Internet B, and the router R2. When the voice is encoded by the CODEC 34, the CODEC control unit 35 adds an in-call signal “SPEECH”, indicating that a call is in progress, in order to distinguish the voice data from connection signals. Thus, one packet is transmitted from the transmitting station 2 to the receiving station 5 every 30 ms.
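The packetization step described above, in which every 30 ms of voice becomes one packet marked “SPEECH”, might look like the following Python sketch. Several details are assumptions made only for illustration: the 8 kHz sampling rate, the dictionary layout of a packet, and the function name `packetize` do not come from the specification, and no actual CODEC compression is performed here.

```python
FRAME_MS = 30  # one packet carries 30 ms of voice, as in the text


def packetize(samples, sample_rate_hz=8000):
    """Split a sample sequence into 30 ms packets tagged "SPEECH"."""
    # Samples per 30 ms frame (240 at the assumed 8 kHz rate).
    per_frame = sample_rate_hz * FRAME_MS // 1000
    packets = []
    for seq, start in enumerate(range(0, len(samples), per_frame), start=1):
        packets.append({
            "seq": seq,                 # hypothetical sequence number
            "signal": "SPEECH",         # marks in-call voice data
            "payload": samples[start:start + per_frame],
        })
    return packets
```

Under these assumptions, 240 ms of voice (1920 samples) yields eight packets, which matches the eight IP packets used in the FIG. 9 example.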
[0054]
Further, the voice data is transmitted to the called side telephone 8 via the network control unit 61, the packet communication unit 62, and the communication interface control unit 63 of the server unit 6 of the receiving station 5, and then via the communication interface control unit 71, the CODEC control unit 75, the CODEC 74, and the telephone network C. The voice of the other party (recipient) is transmitted in the same way in the opposite direction.
[0055]
[Receive buffer section control operation]
In this call system, when the voice connection is started, the packet communication unit 42 provided in the server unit 4 of the transmitting station 2 and the packet communication unit 62 provided in the server unit 6 of the receiving station 5 each begin control of their reception buffers. Since both perform the same control processing, the reception buffer control processing in the packet communication unit 62 will be described below.
[0056]
As described above, the reception buffer management unit 625 of the packet communication unit 62 is provided with the packet generation/insertion unit 625a, the packet search/deletion unit 625b, and the packet unpacking start control unit 625c, and these perform the control processing for the reception buffer unit 623. Here, first, the operation of the packet unpacking start control unit 625c will be described with reference to FIG. 7.
[0057]
As shown in the figure, when the packet unpacking start control unit 625c receives a call start signal from the NCU control unit 73 as described above (S101), it transmits a timing signal to the switch 623d to temporarily stop the output of the reception buffer unit 623 (S102). It then waits for IP packets carrying the in-call signal “SPEECH” to accumulate in the reception buffer unit 623, and when the packet accumulation number output from the packet accumulation number measuring unit 625d reaches a predetermined buffer initial value (S103), it transmits a timing signal to the switch 623d again to release the stop on the output of the reception buffer unit 623 (S104). The buffer initial value may be set, for example, to about half of the maximum number of packets that can be stored in the reception buffer unit 623. That is, when the maximum number of IP packets that can be stored in the reception buffer unit 623 is four, the buffer initial value may be set to about two.
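The flow of steps S101 to S104 can be sketched as a small gate in front of a FIFO. This is a minimal Python illustration under stated assumptions: the class `StartGate` and its methods are hypothetical names, and the switch 623d is modelled simply as a boolean flag.

```python
from collections import deque


class StartGate:
    """Holds back reads until the buffer reaches its initial value."""

    def __init__(self, initial_value=2):
        self.initial_value = initial_value
        self.buffer = deque()
        self.read_allowed = False   # S102: output stopped at call start

    def on_packet(self, packet):
        self.buffer.append(packet)
        # S103/S104: release the output once enough packets are stored.
        if not self.read_allowed and len(self.buffer) >= self.initial_value:
            self.read_allowed = True

    def read(self):
        # Oldest packet first; None while reads are blocked or nothing is stored.
        if self.read_allowed and self.buffer:
            return self.buffer.popleft()
        return None
```

Once released, the gate stays open for the rest of the call in this sketch, so a later empty buffer does not block reading again.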
[0058]
According to the above control, a call can be started with as many packets as the buffer initial value already stored in the reception buffer unit 623. As a result, as described below, a temporary increase in the congestion of communication traffic on the Internet B can be absorbed and delay in transmitting voice to the called telephone 8 can be avoided, so that the quality of the reproduced voice at the called telephone 8 can be maintained even when voice connection is made using a packet communication network.
[0059]
In other words, assume for example that one packet corresponds to 30 ms of voice and that two packets are stored in the reception buffer unit 623. In this case, to transmit the voice to the called telephone 8 without interruption, packets should ideally arrive at the receiving station 5 at a period of 30 ms or less, but this period can vary with the congestion of communication traffic on the Internet B.
[0060]
The operation in this case will be described with reference to FIG. 8. FIG. 8 shows the initial call state when, for example, the capacity of the buffer memory 623c is four IP packets and the buffer initial value is 2.
[0061]
First, consider a case where congestion of communication traffic on the Internet B temporarily increases and the arrival of IP packets at the reception buffer unit 623 is delayed. In this case, the IP packets in the buffer memory 623c are sequentially unpacked by the packet unpacking unit 624, and the buffer accumulation number shifts into section A in the figure. That is, if two packets of 30 ms each are stored in the buffer memory 623c, then even if no packet arrives for 60 ms from that time, the voice contained in the stored packets is sequentially transmitted to the called telephone 8, so the called telephone 8 reproduces the voice without interruption. Thus, with the buffer initial value set as described above, the reception buffer unit 623 can absorb a delay in IP packet arrival of up to 60 ms. However, if the buffer initial value is set to a relatively large value, the time required for that accumulation appears as a delay in transmission to the called telephone 8; in other words, the time required for communication between the calling telephone 1 and the called telephone 8 increases accordingly. For this reason, it is preferable to set the buffer initial value so that the time required for communication between the calling telephone and the called telephone stays within several hundred milliseconds, which is said to be the practical limit for voice communication.
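The trade-off just described can be put into numbers with a short Python sketch. The function name `prebuffer_tradeoff` is hypothetical, and the second return value rests on an added assumption that is not stated in the text: that packets normally arrive one frame apart, so waiting for N packets instead of one postpones the start of playback by (N - 1) frames.

```python
def prebuffer_tradeoff(initial_value, frame_ms=30):
    """Return (absorbable arrival gap, extra start-up wait) in ms."""
    # With N packets stored, playback can continue for N frames
    # even if nothing further arrives (60 ms for N = 2 in the text).
    absorbable_gap_ms = initial_value * frame_ms
    # Assumption: steady arrivals one frame apart, so waiting for the
    # N-th packet costs (N - 1) frames compared with starting at once.
    extra_wait_ms = (initial_value - 1) * frame_ms
    return absorbable_gap_ms, extra_wait_ms
```

For the buffer initial value of 2 used in the example, this gives an absorbable gap of 60 ms at the cost of roughly 30 ms of additional start-up delay.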
[0062]
Here, for the call system performing the above control, the voice transmitted from the receiving station 5 to the called telephone 8 will be described with reference to FIG. 9, for the case where the increase and decrease in congestion of communication traffic on the Internet B and the transmission of packets from the transmitting station 2 are the same as in the earlier description of the conventional call system.
[0063]
FIG. 9A is a diagram showing 240 ms of voice transmitted from the calling side telephone 1, and FIG. 9B is a diagram showing how the voice is packetized into IP packets 1 to 8 every 30 ms at the transmitting station 2. FIG. 9C is a diagram showing how the IP packets transmitted from the transmitting station 2 are received by the receiving station 5, and FIG. 9D is a diagram showing how those IP packets are decoded into voice again and transmitted to the called telephone 8.
[0064]
According to FIGS. 9A and 9B, the voice transmitted from the calling telephone 1 during 0 to 60 ms is first encoded into voice frame data every 30 ms, then packetized into IP packets 1 and 2 and transmitted from the transmitting station 2 to the receiving station 5 at intervals of 30 ms. Since these IP packets 1 and 2 traverse the Internet B during the period P, each arrives at the receiving station 5 after 100 ms, as shown in FIG. 9C. Thereafter, as shown in FIG. 9D, the IP packet 1 is temporarily stored in the buffer memory 623c of the reception buffer unit 623 under the control exercised over the reception buffer unit 623 by the packet unpacking start control unit 625c. At this time, the counter 623a notifies the packet accumulation number measuring unit 625d that one IP packet has passed, and the packet accumulation number is measured to be one. The switch 623d remains in the state in which reading of IP packets is prohibited. When the next IP packet 2 is input to the buffer memory 623c, the counter 623a again notifies the packet accumulation number measuring unit 625d that one IP packet has passed, and the packet accumulation number is measured to be two. In response, the packet unpacking start control unit 625c transmits a timing signal to the switch 623d to release the prohibition on reading IP packets. Thus, the voice frame data included in the IP packet 1 is decoded into the original voice and transmitted to the called telephone 8, and then the IP packet 2 is decoded into the original voice and transmitted to the called telephone 8.
[0065]
Also, according to FIGS. 9A and 9B, the voice transmitted from the calling telephone 1 during 60 to 120 ms is encoded into voice frame data every 30 ms, then packetized into IP packets 3 and 4 and transmitted from the transmitting station 2 to the receiving station 5 at intervals of 30 ms. Since these IP packets 3 and 4 traverse the Internet B during the period Q, in which traffic congestion has increased, each arrives at the receiving station 5 after 110 ms, as shown in FIG. 9C. Thereafter, as shown in FIG. 9D, the IP packet 3 is temporarily stored in the buffer memory 623c, and when transmission to the called telephone 8 of the voice included in the immediately preceding IP packet 2 is completed, the IP packet 3 is decoded into the original voice and transmitted to the called telephone 8. Similarly, the IP packet 4 is temporarily stored in the buffer memory 623c, and when transmission to the called telephone 8 of the voice included in the preceding IP packet 3 is completed, the IP packet 4 is decoded into the original voice and transmitted to the called telephone 8.
[0066]
As described above, in a call system in which the packet unpacking start control unit 625c is not provided, a transmission delay arises between the IP packet 2 and the IP packet 3 and accumulates thereafter (see FIG. 16). In the present call system, by contrast, the packets stored in the buffer memory 623c absorb a temporary increase in the congestion of communication traffic on the Internet B, so that voice transmission delay can be avoided.
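The difference between the two systems for IP packets 1 to 4 can be checked with a small playout simulation. The Python sketch below is only illustrative and makes assumptions beyond the text: the function name `playout` is hypothetical, and packet k is assumed to leave the transmitting station at 30·k ms, so that the 100 ms transit of period P and the 110 ms transit of period Q give arrival times of 130, 160, 200, and 230 ms.

```python
def playout(arrivals_ms, prebuffer=1, frame_ms=30):
    """Return (playback start time of each packet, total silent gap)."""
    starts = []
    gap_ms = 0
    # Playback may begin once `prebuffer` packets have arrived.
    free_at = arrivals_ms[prebuffer - 1]
    for i, arrival in enumerate(arrivals_ms):
        start = max(arrival, free_at)
        if i > 0 and start > free_at:
            gap_ms += start - free_at   # nothing to play: silence
        starts.append(start)
        free_at = start + frame_ms
    return starts, gap_ms


# Assumed arrival times (ms) of IP packets 1 to 4, see the lead-in.
arrivals = [130, 160, 200, 230]
no_prebuffer = playout(arrivals, prebuffer=1)
with_prebuffer = playout(arrivals, prebuffer=2)
```

Under these assumptions, playing each packet as soon as it arrives leaves a 10 ms gap before IP packet 3, and every later packet is shifted by that amount. Starting only after two packets are stored gives start times of 160, 190, 220, and 250 ms with no gap.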
[0067]
Next, the operation of the packet search/deletion unit 625b will be described with reference to FIG. 10.
[0068]
As shown in the figure, the packet search/deletion unit 625b first determines whether the packet accumulation number in the reception buffer unit 623, input from the packet accumulation number measuring unit 625d, is 75% or more of the full capacity, that is, three or more (S201). If the packet accumulation number is three or more, it determines that the reception buffer unit 623 is on the verge of overflow, searches the packets accumulated in the buffer memory 623c of the reception buffer unit 623 for the packet containing the voice information with the minimum voice amplitude power (S202), and deletes that packet (S203). The above processing is continued until a signal indicating the end of communication is transmitted from the NCU control unit 73 (S204).
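One pass through steps S201 to S203 can be sketched in Python as follows. The function name `delete_if_near_overflow` and the `"power"` field holding each packet's voice amplitude power are hypothetical; how the power value is obtained from the encoded frame is outside this sketch.

```python
def delete_if_near_overflow(buffer, capacity=4):
    """Drop the lowest-power packet when the buffer is 75% full or more.

    `buffer` is a list of dicts with a hypothetical "power" field.
    Returns the deleted packet, or None if nothing was deleted.
    """
    if len(buffer) * 4 < capacity * 3:      # S201: below the 75% mark
        return None
    victim = min(buffer, key=lambda p: p["power"])   # S202: search
    buffer.remove(victim)                            # S203: delete
    return victim
```

Removing the packet in place preserves the first-in first-out order of the remaining packets, so the voice that is kept is still played in sequence.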
[0069]
According to the above control, when the congestion of communication traffic on the Internet B eases, the number of packets stored in the buffer memory 623c shifts into section B shown in FIG. 8, and the buffer memory 623c comes to the verge of overflow, a packet can be deleted from the buffer memory 623c in advance to prevent an overflow from occurring. In addition, since this control selectively deletes, from among the packets stored in the buffer memory 623c, a packet that has little influence on voice quality, packets containing important voice information can be prevented from being lost unavoidably through overflow.
[0070]
Next, the operation of the packet generation/insertion unit 625a will be described with reference to FIG. 11.
[0071]
As shown in the figure, the packet generation/insertion unit 625a first monitors whether the packet accumulation number in the reception buffer unit 623, input from the packet accumulation number measuring unit 625d, is 0 (S301). If the accumulation number is 0, a packet representing silence is inserted into the buffer memory 623c (S302). Note that the packet inserted into the buffer memory 623c need not represent silence; for example, the preceding packet may be inserted again, or a packet representing an average voice generated from the voice information included in preceding packets may be inserted.
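Steps S301 and S302, together with the repeat-the-preceding-packet variant, can be sketched as below. This Python sketch is illustrative only: the function name `fill_if_empty`, the packet layout, the `dummy` marker, and the 240-sample frame (30 ms at an assumed 8 kHz rate) are all assumptions, and the average-voice variant is not shown.

```python
def fill_if_empty(buffer, last_packet=None, frame_samples=240):
    """Insert a dummy packet when the buffer is empty (S301/S302).

    Returns True if a packet was inserted, False otherwise.
    """
    if buffer:
        return False
    if last_packet is not None:
        # Variant from the text: insert the preceding packet again.
        dummy = dict(last_packet, dummy=True)
    else:
        # Default from the text: a packet representing 30 ms of silence.
        dummy = {"payload": [0] * frame_samples, "dummy": True}
    buffer.append(dummy)
    return True
```

Marking inserted packets lets later stages tell real voice from filler, which is a design choice of this sketch rather than something the specification requires.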
[0072]
According to the above control, processing at the receiving station 5 can be simplified when the congestion of communication traffic on the Internet B increases, the number of packets stored in the buffer memory 623c shifts into section A shown in FIG. 8, and the interval between packets input to the reception buffer unit 623 further grows to 60 ms or more. That is, in this case the buffer becomes empty (section D in FIG. 8) and the packet containing the voice to be reproduced does not arrive, so the voice at the called telephone 8 is interrupted. At this time, according to the above control, processing can be simplified by supplying a packet representing dummy voice as the processing target of the packet unpacking unit 624, which is the stage following the reception buffer unit 623.
[0073]
As described above, according to this call system, managing the buffer memory 623c makes it possible to solve the problems of overflow and underflow caused by the increase and decrease in congestion of communication traffic on the communication path between the transmitting station 2 and the receiving station 5, that is, the Internet B, and by the difficulty of ensuring synchronization between the transmitting station 2 and the receiving station 5, as well as the problem of accumulating delay in transmitting voice to the calling telephone 1 and the called telephone 8.
[0074]
Note that the call system described above can be modified in various ways. For example, in the above description the packet search/deletion unit 625b performs control to prevent overflow of the buffer memory 623c, but the reception buffer unit 623 may instead be controlled so as to avoid continuous overflow. In this case, the packet accumulation number measuring unit 625d is further configured to be able to detect an overflow of the buffer memory 623c based on the count signals input from the counters 623a and 623b.
[0075]
FIG. 12 is a flowchart for explaining a modification of the control operation of the packet search/deletion unit 625b. As shown in the figure, the packet search/deletion unit 625b first monitors, based on a signal input from the packet accumulation number measuring unit 625d indicating that an overflow has occurred in the buffer memory 623c, whether the buffer memory 623c has overflowed a predetermined number of times in succession (S401). If it has, the unit then determines whether the buffer accumulation number input from the packet accumulation number measuring unit 625d equals the maximum number of packets that can be accumulated in the buffer memory 623c (S402). If the buffer memory 623c has overflowed the predetermined number of times in succession and the buffer accumulation number equals the maximum number of packets that can be accumulated in the buffer memory 623c, the packet search/deletion unit 625b searches the packets accumulated in the buffer memory 623c for the packet containing the voice information with the minimum voice amplitude power (S403) and deletes that packet (S404). The above processing is continued until a signal indicating the end of communication is transmitted from the NCU control unit 73 (S405).
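The modified control of steps S401 to S404 might be sketched in Python as follows. The class name `OverflowGuard`, the `"power"` field, and the default "predetermined number" of two consecutive overflows are assumptions chosen for illustration. Resetting the consecutive-overflow count after a deletion is likewise a choice of this sketch, since the specification does not state it.

```python
class OverflowGuard:
    """Deletes a low-power packet after repeated consecutive overflows."""

    def __init__(self, capacity=4, limit=2):
        self.capacity = capacity
        self.limit = limit          # the "predetermined number" (assumed 2)
        self.buffer = []
        self.consecutive = 0

    def on_packet(self, packet):
        if len(self.buffer) >= self.capacity:
            self.consecutive += 1   # arriving packet overflows and is lost
        else:
            self.buffer.append(packet)
            self.consecutive = 0
        # S401 and S402: enough consecutive overflows and the buffer is full.
        if self.consecutive >= self.limit and len(self.buffer) == self.capacity:
            victim = min(self.buffer, key=lambda p: p["power"])  # S403
            self.buffer.remove(victim)                           # S404
            self.consecutive = 0
            return victim
        return None
```

After the deletion there is room for the next arriving packet, so the run of lost packets ends at the predetermined number.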
[0076]
According to the above control, when the congestion of communication traffic on the Internet B eases and the buffer memory 623c overflows a predetermined number of times in succession (FIG. 8, section C), a packet is deleted from the buffer memory 623c, so that overflow of more than the predetermined number of consecutive packets can be prevented in advance. As a result, it is possible to prevent a large part of the voice to be transmitted to the called telephone 8 from being lost. In the above description, the case where the packet accumulation number measuring unit 625d detects overflow of a predetermined number of consecutive packets has been described, but the same effect can be obtained by acting when a single packet overflows. In that case, two consecutive packets can be kept from overflowing.
[0077]
If the buffer memory 623c overflows, the voice that must be transmitted to the called telephone 8 is reduced by 30 ms, so that the transmission delay time to the called telephone 8 is improved by about 30 ms. That is, the overflow of the reception buffer unit 623 has an effect of eliminating the audio transmission delay.
[Brief description of the drawings]
FIG. 1 is a diagram showing a schematic configuration of the call system.
FIG. 2 is a diagram illustrating a configuration of a client unit of a transmission station.
FIG. 3 is a diagram illustrating a configuration of a server unit of a transmission station.
FIG. 4 is a diagram showing a configuration of a server unit of a receiving station.
FIG. 5 is a diagram illustrating a configuration of a client unit of a receiving station.
FIG. 6 is a diagram illustrating a configuration of a packet communication unit of a receiving station.
FIG. 7 is a flowchart illustrating the operation of a packet unpacking start control unit.
FIG. 8 is a diagram showing the contents stored in a buffer memory at the start of a call.
FIG. 9 is a diagram for explaining the operation of the call system.
FIG. 10 is a flowchart for explaining the operation of a packet search and deletion unit.
FIG. 11 is a flowchart for explaining the operation of the packet generation/insertion unit.
FIG. 12 is a flowchart for explaining the operation of a modified example of the packet search and deletion unit.
FIG. 13 is a diagram for explaining a call connection between telephones using the Internet currently proposed.
FIG. 14 is a diagram illustrating an example of a change in the amount of congestion of Internet communication traffic.
FIG. 15 is a diagram for explaining the operation of a currently proposed call connection between telephones using the Internet.
FIG. 16 is a diagram for explaining the operation of a currently proposed call connection between telephones using the Internet.
[Explanation of symbols]
B Internet, 1 calling telephone (transmitting terminal, receiving terminal), 2 transmitting station (relay station), 5 receiving station (relay station), 8 called telephone (receiving terminal, transmitting terminal), 623 reception buffer unit, 623a, 623b counter (overflow judgment means, voice frame data number measurement means), 623c buffer memory, 623d switch (buffer memory output restriction means), 625 reception buffer management unit, 625a packet generation and insertion unit (dummy voice insertion means), 625b packet search and deletion unit (selective deletion means), 625c packet unpacking start control unit (buffer memory output restriction means), 625d packet accumulation number measuring unit (overflow judgment means, voice frame data number measurement means).

Claims

A call system comprising:
a first relay station that packetizes voice frame data generated based on voice information transmitted from a transmitting terminal and sequentially transmits the packets toward a receiving terminal via a packet communication network; and
a second relay station that receives the packets transmitted from the first relay station and sequentially transmits, to the receiving terminal, voice information generated based on the voice frame data included in the packets,
wherein the second relay station includes:
a buffer memory that temporarily stores at least the voice frame data among the data included in the packets received from the first relay station;
voice frame data number measuring means for measuring the number of voice frame data stored in the buffer memory;
overflow detection means for detecting an overflow of the buffer memory; and
selective deletion means for selectively deleting, based on a predetermined criterion, voice frame data stored in the buffer memory when the overflow detection means detects that a predetermined number or more of voice frame data have overflowed from the buffer memory in succession and the voice frame data number measuring means determines that the number of voice frame data stored in the buffer memory has reached the maximum number of voice frame data that can be stored in the buffer memory.