JP3992799B2

JP3992799B2 - Voice communication device

Info

Publication number: JP3992799B2
Application number: JP26128997A
Authority: JP
Inventors: 俊光林; 俊明小山; 淳柳沼
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1997-09-26
Filing date: 1997-09-26
Publication date: 2007-10-17
Anticipated expiration: 2017-09-26
Also published as: JPH11103360A

Description

【０００１】
【発明の属する技術分野】
本発明は、端末である音声通信装置間で音声データをやり取りし、会話ができる音声中継システムに用いる音声通信装置に関する。
【０００２】
【従来の技術】
ＬＡＮ（ＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ）に接続されたパーソナルコンピュータ（パソコン）同士で、音声データを送受信して電話をする音声中継システムとしては、例えば、日経データプロ・データコム，１９９５．３，ｐ．６８１に記載された日立製作所のＴａｌｋｗａｒｅが知られている。また、インターネットを使って電話をする音声中継システムとしては、例えば、Ｉｎｔｅｒｎｅｔｗｏｒｋｉｎｇ，１９９６．９，ｐ．１４−１５に記載されたＮｅｔｓｃａｐｅＣｏｍｍｕｎｉｃａｔｉｏｎｓ社のＣｏｏｌＴａｌｋが知られている。
【０００３】
通信ネットワーク上で音声データを送受信する場合、通信ネットワークの負荷により音声データが遅れて相手に届いたりする。音声データを受信した時に、すぐに再生を行うようにすると、音声データが遅れて届くと音切れが発生することになる。そこで、ＴａｌｋｗａｒｅやＣｏｏｌＴａｌｋにおいては、受信した音声データを少し蓄えて再生することにより、音切れを防止するようにしている。
【０００４】
特に、Ｔａｌｋｗａｒｅにおいては、音声データを蓄える量を自動的に計算し、音声の再生処理を行っている。
【０００５】
【発明が解決しようとする課題】
しかしながら、ネットワークの伝送速度が遅くなると、音声データの蓄積量も多くなるため、音声の遅延が大きくなってくる。音声の遅延が大きくなると、受信しているオペレータによっては、受信している音声に違和感を覚える場合もある。また、音声データの蓄積量には、上限値を設けて有るため、この上限値を超えるような音声データの伝送遅れが発生すると、音切れが発生することになる。音切れは、受信している音声が、プツプツと断続的になるため、受信しているオペレータによっては、受信している音声に不快感を覚える場合もある。
【０００６】
音声の遅れに違和感を覚える場合や音切れに不快感を覚える場合は、オペレータによって個人差があるものである。従って、従来のように、音声データを蓄える量を自動的に計算し、音声の再生処理を行う方法では、オペレータの感性に応じたよりよい音質の音声を提供するのが困難であるという問題があった。オペレータによっては、音切れが多少発生してもいいから音声の遅延を少なくしたいようにしたいという場合や、または、遅延が大きくなってもいいから音切れをなくしたいという場合がある。
【０００７】
本発明の目的は、よりよい音質の音声を提供することができる音声通信装置を提供することにある。
【０００８】
【課題を解決するための手段】
（１）上記目的を達成するために、本発明は、音声データの中継を行う音声通信装置において、他の音声通信装置から送られてきた音声データを受信した後一時蓄積する蓄積手段と、前記蓄積手段から送られてきた音声データを音声出力手段により再生する再生手段と、前記蓄積手段に蓄積する音声データの蓄積量の設定値を音声中継中に可変する設定値可変手段と、使用者に前記音声データの設定値の少なくとも一部を表示する表示手段と、使用者により、前記音声データの設定値の入力が指示される入力手段とを有し、前記表示手段は前記音声データの設定値として、前記蓄積手段に蓄積する音声データの量を表す音声データ蓄積量の設定値を表示し、前記入力手段は、使用者により前記音声データ蓄積量を入力され、前記設定値可変手段は、前記入力手段からの前記音声データ蓄積量の設定を、新たな音声データ蓄積量として設定するとともに、前記音声データの設定値は、さらに、前記蓄積手段に一度にまとめて入力し、もしくは前記蓄積手段から一度にまとめて出力する音声データの長さを表す音声データ長、又は、前記音声データ符号化方式の少なくともいずれかを含み、前記音声データ長又は前記音声データ符号化方式の少なくともいずれかは、前記表示手段により表示され、前記入力手段を介して新たな情報を入力され、前記設定値可変手段により前記入力された新たな情報が設定されるようにしたものである。かかる構成により、利用者の判断により、音声データの蓄積量の設定値を音声中継中に可変できるので、利用者にとって聞きやすい状態、即ち、よりよい音質の音声を提供し得るものとなる。
【００１０】
（２）上記（１）において、好ましくは、前記蓄積手段に蓄積する音声データの蓄積量を制御する制御手段を備え、当該制御手段は、前記蓄積手段に蓄積されている音声データの蓄積量が所定値より少なくなると、他の音声通信装置から送られて音声データを前記蓄積手段に蓄積した上で、送られて音声データと同じ音声データの一部を再度前記蓄積手段に蓄積するようにしたものである。かかる構成により、音切れを防止し得るとともに、音切れ防止のために再生される音の音質を向上し得るものとなる。
【００１１】
【発明の実施の形態】
以下、図１〜図１３を用いて、本発明の一実施形態による音声通信装置を用いる音声中継システムについて説明する。
最初に、図１を用いて、本発明の一実施形態による音声中継システムの全体構成について説明する。
【００１２】
ＰＢＸゲートウェイ２００は、電話回線１０と接続されており、電話回線１０からかかってきた電話を受け付けることができる。ＰＢＸゲートウェイ２００は、ＬＡＮ（ＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ）２０を介して、本実施形態による音声通信装置を構成するパーソナルコンピュータ（パソコン）３００，４００に接続されており、ＬＡＮ２０経由で情報交換が可能である。ＰＢＸゲートウェイ２００は、電話から聞こえる音声をパソコン３００，４００に送ったり、パソコン３００，４００から送られてきた音声データを電話回線へ再生することができる。
【００１３】
パソコン３００，４００には、ハンドセット３９０，４９０が接続されており、ハンドセット３９０，４９０を使って音声の入出力が可能である。
パソコン３００の表示部には、設定画面３００Ａが表示可能であり、この設定画面３００Ａを表示している状態では、パソコン３００の設定値の変更が可能である。この設定画面３００Ａの内容及び設定変更の方法については、図１２を用いて後述する。
【００１４】
次に、図２を用いて、本発明の一実施形態による音声中継システムの中のＰＢＸゲートウェイのハードウェア構成について説明する。
【００１５】
ＰＢＸゲートウェイ２００の中の電話インタフェース２０５は、電話回線１０と接続されており、電話着信等を認識できる。電話インタフェース２０５は、電話インタフェースドライバ２１０により制御される。電話インタフェースドライバ２１０は、図示しない内部バッファを有しており、電話回線１０から電話インターフェース２０５を介して送られてくる音声データを蓄積し、また、電話インターフェース２０５を介して電話回線１０に送出する音声データを蓄積する。
【００１６】
キーボードドライバ２１５は、キーボード２２０からの入力を、受け付ける。マウスドライバ２２５は、マウス２３０からの入力を受け付ける。ディスプレイ２３５に対する表示は、ディスプレイコントローラ２４０により制御される。ＰＢＸゲートウェイ２００で実行されるプログラム等は、ディスクコントローラ２４５により、ディスク２５０からメインメモリ２５５へ読み込まれ、ＣＰＵ２６０によって実行される。ＰＢＸゲートウェイ２００は、通信ネットワークであるＬＡＮ２０とは、通信インタフェース２６５により接続されている。
【００１７】
次に、図３を用いて、本発明の一実施形態による音声中継システムの中の音声通信装置であるパソコンのハードウェア構成について説明する。なお、ここでは、パソコン３００を例にとって説明するが、パソコン４００も同様の構成となっている。
【００１８】
音声入出力インタフェース３０５は、ハンドセット３９０と接続されており、ハンドセット３９０の音声入出力をすることができる。音声入出力インタフェース３１０は、音声入出力インタフェースドライバ３１０により制御される。音声インタフェースドライバ３１０は、図示しない内部バッファを有しており、ハンドセット３９０から音声インターフェース３０５を介して送られてくる音声データを蓄積し、また、音声インターフェース３０５を介してハンドセット３９０に送出する音声データを蓄積する。
【００１９】
キーボードドライバ３１５は、キーボード３２０からの入力を、受け付ける。マウスドライバ３２５は、マウス３３０からの入力を受け付ける。ディスプレイ３３５に対する表示は、ディスプレイコントローラ３４０により制御される。パソコン３００で実行されるプログラム等は、ディスクコントローラ３４５により、ディスク３５０からメインメモリ３５５へ読み込まれ、ＣＰＵ３６０によって実行される。パソコン３００は、通信ネットワークであるＬＡＮ２０とは、通信インタフェース３６５により接続されている。
【００２０】
次に、図４及び図５を用いて、本発明の一実施形態による音声中継システムの中のＰＢＸゲートウェイ２００の音声中継の処理について説明する。
【００２１】
ＰＢＸゲートウェイ２００で実行される音声中継プログラムは、ディスクコントローラ２４５により、ディスク２５０からメインメモリ２５５へ読み込まれ、ＣＰＵ２６０によって実行される。音声中継プログラムは、ＦＤやＣＤ−ＲＯＭ等の記憶媒体からディスク２５０にインストールされる。
【００２２】
図４のステップ４１０において、ＰＢＸゲートウェイ２００のＣＰＵ２６０は、通常、電話着信待ちの状態となる。着信待ち状態の時、音声中継プログラムは、図２に示した電話インタフェースドライバ２１０を用いて、電話インタフェース２０５に電話着信があるかどうかを定期的に調べる。
【００２３】
電話着信があると、ステップ４２０において、ＣＰＵ２６０は、ディスク２５０に保存されている音声ファイルを開き、ファイルに記録されている音声データを電話インタフェースドライバ２１０を用いて、音声インタフェース２０５へ送ることにより、音声応答を行う。この音声応答の内容は、例えば、「こちらは、ＰＢＸゲートウェイです。内線番号を押して下さい。」のように、内線番号を押下させることをアナウンスする内容である。
【００２４】
そして、ステップ４３０において、ＣＰＵ２６０は、内線番号を受け付ける。内線番号の受付の際、音声中継プログラムは、図２に示した電話インタフェースドライバ２１０を用いて、電話インタフェース２０５にＤＴＭＦ信号で入力された番号を定期的に調べ、電話インタフェース２０５が認識したＤＴＭＦ信号を電話インタフェースドライバ２１０から取得する。
【００２５】
認識した内線番号を受取ると、ステップ４４０において、ＣＰＵ２６０は、図５に示した内線番号−ＩＰアドレス対応表５００を用いて、ＩＰアドレスを取得し、ＬＡＮ２０で接続されたパソコン３００などへ呼出しメッセージを送る。
【００２６】
ここで、図５を用いて、内線番号−ＩＰアドレス対応表５００の構成について説明する。
図５に示すように、内線番号−ＩＰアドレス対応表５００は、内線番号５１０と、氏名５２０と、ＩＰアドレス５３０とから構成されている。例えば、内線番号５１０が「１２３４」に対応するパソコンのオペレータは、氏名５２０の欄に記載されているように、「ｔａｒｏ−ｙａｍａｍｏｔｏ」であり、使用しているパソコンのＩＰアドレス５３０が、「１９２．１０．１．１２」である。即ち、ステップ４３０において受け付けた内線番号が、「１２３４」である場合には、対応するＩＰアドレス「１９２．１０．１．１２」が取得される。なお、内線番号−ＩＰアドレス対応表５００は、メインメモリ２５５に格納されている。
また、ステップ４４０におけるＰＢＸゲートウェイ２００からの呼び出しを受けたパソコンでは、呼出しベルまたはメッセージを表示することで、着信をパソコンの利用者へ知らせる。そして、このパソコンの利用者が、電話を受け付けるボタンを押すと、受信ＯＫのメッセージが、パソコンからＰＢＸゲートウェイ２００へ送られる。
【００２７】
ステップ４５０において、ＣＰＵ２６０は、パソコンが受信ＯＫであるか否かを判断し、受信ＯＫのメッセージをＰＢＸゲートウェイが受取ると、ステップ４６０において、ＣＰＵ２６０は、音声中継を開始する。また、受信ＯＫでない場合には、ステップ４９０において、ＣＰＵ２６０は、電話を切る。
【００２８】
ステップ４６０において、ＣＰＵ２６０は、音声中継を開始するが、音声中継の処理の詳細については、図９および図１０を用いて後述する。
【００２９】
音声中継開始後、ＣＰＵ２６０の音声中継プログラムは、電話インタフェースドライバ２１０を用いて、定期的に電話切断を調べている。
ステップ４７０において、ＣＰＵ２６０は、電話切断を検出したか否か若しくはパソコン３００からの切断メッセージを受信したか否かを判断する。電話切断を検出したか若しくはパソコン３００からの切断メッセージを受信した場合には、ステップ４８０において、ＣＰＵ２６０は、音声中継を終了する。そして、ステップ４９０において、ＣＰＵ２６０は、電話を切断し、ステップ４１０における電話着信待ち状態に戻る。
【００３０】
次に、図６〜図８を用いて、本発明の一実施形態による音声中継システムの中の音声通信装置であるパソコン３００からパソコン４００を呼出した時の音声中継の処理について説明する。
【００３１】
パソコン３００で実行される音声中継プログラムは、図３に示したディスクコントローラ３４５により、ディスク３５０からメインメモリ３５５へ読み込まれ、ＣＰＵ３６０によって実行される。音声中継プログラムは、ＦＤやＣＤ−ＲＯＭ等の記憶媒体からディスク３５０にインストールされる。
【００３２】
音声中継を開始する時に利用者は、「ＩＰアドレス」を、図７に示した発信画面３００Ｂに入力し、通話する相手を指定した後、「発信」ボタンを押す。
ステップ６１０において、ＣＰＵ３６０は、発信ボタンが押下されたことを確認する。
次に、ステップ６２０において、ＣＰＵ３６０は、発信メッセージを他のパソコン４００へ送る。
【００３３】
次に、ステップ６３０において、ＣＰＵ３６０は、ＬＡＮ２０で接続されたパソコン４００などへ呼出しメッセージを送る。
また、ステップ６３０におけるパソコン３００からの呼び出しを受けたパソコン４００では、呼出しベルまたはメッセージを表示することで、着信をパソコンの利用者へ知らせる。そして、このパソコンの利用者が、電話を受け付けるボタンを押すと、受信ＯＫのメッセージが、パソコン４００からパソコン３００へ送られる。
【００３４】
ステップ６４０において、ＣＰＵ３６０は、パソコンが受信ＯＫであるか否かを判断し、受信ＯＫのメッセージをパソコン３００が受取ると、ステップ６５０において、ＣＰＵ３６０は、音声中継を開始する。また、受信ＯＫでない場合には、ステップ６１０における電話着信待ち状態に戻る。
【００３５】
ステップ６５０において、ＣＰＵ３６０は、音声中継を開始するが、音声中継を開始すると、音声中継中のパソコン３００，４００では、図８に示した音声中継中の画面３００Ｃが表示される。
【００３６】
ステップ６６０において、ＣＰＵ３６０は、音声中継中の画面３００Ｃの切断ボタンが押されたか否か若しくはパソコン３４０からの切断メッセージを受信したか否かを判断する。切断ボタンの押下を検出したか若しくはパソコン４００からの切断メッセージを受信した場合には、ステップ６７０において、ＣＰＵ３６０は、音声中継を終了する。そして、ＣＰＵ３６０は、電話を切断し、ステップ６１０における電話着信待ち状態に戻る。
【００３７】
次に、図９〜図１１を用いて、本発明の一実施形態による音声中継システムにおける音声中継処理の中のデータ送信時及びデータ受信時の処理について説明する。
【００３８】
図４のステップ４６０または図６のステップ６５０において音声中継を開始すると、ＰＢＸゲートウェイ２００またはパソコン３００は、図９に示す相手（パソコン３００またはパソコン４００）に音声データを送信するデータ送信処理と、図１０に示す相手から送られてきた音声データを受取った時のデータ受信処理を開始する。なお、以下の説明では、パソコン３００からパソコン４００にデータを送信する場合について説明する。
【００３９】
最初に、データ送信処理について、図９を用いて説明する。
ステップ９１０において、ＣＰＵ３６０は、音声入出力インタフェースドライバ３１０を用いて、送信する音声データの量を確認する。
ここで、図１１を用いて、音声中継の設定値について説明する。図１１に示すように、設定内容と設定値とから構成されており、これらの設定内容及び設定値は、メインメモリ３５５に格納されている。図１１に示す例では、「音声データの長さ」は、「１２０」バイト（Ｂｙｔｅ）に設定されている。音声データは、所定のバイト数毎にまとめて送信するようにしており、ここで、「音声データの長さ」は、１回に送信する音声データの長さを示している。音声データの長さが１２０バイトということは、約３０ｍＳの長さの音声信号に相当するものである。また、「音声データ蓄積量」は、「４００〜５００」バイトに設定されている。ここで、「音声データ蓄積量」は、音切れを防止するために、音声データを蓄積する量を示している。この詳細については、図１０のステップ１０３０，１０４０において説明する。「符号化方式」は、「ＡＤＰＣＭ」に設定されている。「音量」は、「１０」（任意単位）に設定されている。
【００４０】
ステップ９２０において、ＣＰＵ３６０は、ステップ９１０において確認した音声データ量が、図１１に示した「音声データの長さ」の設定値「１２０」バイト（相手に一度に送る音声データの長さ）より大きいか否かを判断し、設定値に満たなければ、ステップ９１０に戻って、音声データ量の確認を繰り返す。
【００４１】
送信すべき音声データが、「音声データの長さ」の設定値を超えると、ステップ９３０において、ＣＰＵ３６０は、音声入出力インタフェースドライバ３１０を用いて、音声入出力インタフェース３１０から音声データを取り出す。
【００４２】
そして、ステップ９４０において、ＣＰＵ３６０は、取り出した音声データを相手のパソコンへ送る。
【００４３】
ステップ９５０において、ＣＰＵ３６０は、音声中継の終了か否かを判断し、終了でなければ、ステップ９１０からステップ９４０を繰り返す。
【００４４】
次に、図１０を用いて、データ受信時の処理について説明する。なお、パソコン４００において、データ受信処理を行うものであるが、パソコン３００とパソコン４００は同一の構成であるので、ここでは、図３に示したパソコン３００の構成に基づいて説明する。
【００４５】
ステップ１０１０において、ＣＰＵ３６０は、図９のステップ７４０において送られた音声データを相手のパソコンが受取る。
【００４６】
次に、ステップ１０２０において、ＣＰＵ３６０は、音声入出力インタフェース３１０にある音声データの蓄積量を確認する。
図３に示したパソコン３００の構成において、ＬＡＮ２０及び通信インターフェース３６５を介して受信された音声データは、一旦、ＣＰＵ３６０内の内部バッファに取り込まれる。その後、ＣＰＵ３６０の内部バッファ内で音声データは、音声入出力インターフェースドライバ３１０の内部バッファに入力する。音声入出力インターフェースドライバ３１０の内部バッファの容量は、例えば、４ＫＢであるが、常時内部バッファには、４００〜５００バイト程度の音声データが蓄積されている。音声入出力インターフェースドライバ３１０は、一定時間毎，例えば、３０ｍＳ毎に、音声データを音声入出力インタフェース３０５に送出し、ハンドセット３９０から音を再生することになる。音声入出力インターフェースドライバ３１０の内部バッファに蓄積されている音声データがなくなると、ハンドセット３９０から音が再生されなくなり、音切れの状態となる。
そこで、ステップ１０２０において、ＣＰＵ３６０は、音声入出力インタフェースドライバ３１０にある音声データの蓄積量を確認する。
【００４７】
ここで、図１１に示した「音声データ蓄積量」に設定されているように、「４００〜５００」ＫＢとすると、ステップ１０３０，１０４０において、音声入出力インタフェース３１０にある音声データの蓄積量が、この設定された音声データ蓄積量の最大値より多いか、最小値よりも少ないかを判断する。
【００４８】
ステップ１０３０において、ＣＰＵ３６０が、音声入出力インタフェースドライバ３１０にある音声データの蓄積量が、この設定された音声データ蓄積量の最大値より少ないと判断され、また、ステップ１０４０において、ＣＰＵ３６０が、音声入出力インタフェースドライバ３１０にある音声データの蓄積量が、この設定された音声データ蓄積量の最小値より多いと判断されると、ステップ１０６０において、音声データの１回分を、ＣＰＵ３６０の内部バッファから音声入出力インタフェースドライバ３１０の内部バッファに入力する。
即ち、ステップ１０６０の処理が行われる場合は、音声入出力インタフェースドライバ３１０の内部バッファにある音声データの蓄積量は、４００〜５００バイトの適正な範囲内にある。そこで、図１１に示したように、「音声データの長さ」は、「１２０」バイトと設定されているので、音声データの１回分で１２０バイト分の音声データを、ＣＰＵ３６０の内部バッファから音声入出力インタフェースドライバ３１０の内部バッファに入力する。一方、音声入出力インタフェースドライバ３１０は、一定周期で１２０バイトの音声データを音声入出力インターフェース３０５に送り出しているので、音声入出力インタフェースドライバ３１０の内部バッファにある音声データの蓄積量は、４００〜５００バイトの適正な範囲を維持できることになる。従って、ハンドセット３９０から再生される音が音切れを生じることはなくなる。
【００４９】
次に、ステップ１０３０において、ＣＰＵ３６０が、音声入出力インタフェースドライバ３１０にある音声データの蓄積量が、この設定された音声データ蓄積量の最大値より少ないと判断され、また、ステップ１０４０において、ＣＰＵ３６０が、音声入出力インタフェースドライバ３１０にある音声データの蓄積量が、この設定された音声データ蓄積量の最小値より少ないと判断されると、ステップ１０５０において、音声データの１．５回分を、ＣＰＵ３６０の内部バッファから音声入出力インタフェースドライバ３１０の内部バッファに入力する。
即ち、ステップ１０５０の処理が行われる場合は、音声入出力インタフェースドライバ３１０の内部バッファにある音声データの蓄積量は、４００バイトより少なくなった場合である。そこで、図１１に示したように、「音声データの長さ」は、「１２０」バイトと設定されているので、音声データの１．５回分で１８０バイト分の音声データを、ＣＰＵ３６０の内部バッファから音声入出力インタフェースドライバ３１０の内部バッファに入力する。一方、音声入出力インタフェースドライバ３１０は、一定周期で１２０バイトの音声データを音声入出力インターフェース３０５に送り出しているので、音声入出力インタフェースドライバ３１０の内部バッファにある音声データの蓄積量は、順次増加して、適正な範囲である４００〜５００バイトに復帰できることになる。
なお、音声データの１．５回分で１８０バイト分の音声データを、ＣＰＵ３６０の内部バッファから音声入出力インタフェースドライバ３１０の内部バッファに入力するためには、１回分の１２０バイトのデータ及びこの同じ１２０バイトのデータの半分の６０バイトの音声データを送る。即ち、６０バイトの音声データは、全く同じデータを送ることになる。その結果、音声入出力インタフェース３０５によりハンドセット３９０から再生する音は、同じ音が６０バイト分繰り返されることになるが、これが、ハンドセット３９０を利用する者にとって不快感を与えるものではない。その第１の理由としては、６０バイトの音声データは、約１５ｍＳと極めて短い時間であることである。また、このとき再生される音が前の音と同じであることから、人間の聴覚上では、不自然さが発生しない。ここで、疑似的な音声データを音声入出力インタフェース３０５に入力することも考えられるが、疑似的な音声データは、その前に再生された音とは異なるため、人間の聴覚上では、むしろ違和感を感じることになる。
【００５０】
従って、音声入出力インタフェースドライバ３１０の内部バッファにある音声データの蓄積量は、順次増加して、適正な範囲である４００〜５００バイトに復帰できることになるため、ハンドセット３９０から再生される音が音切れを生じることはなくなる。
【００５１】
また、ステップ１０３０において、ＣＰＵ３６０が、音声入出力インタフェースドライバ３１０にある音声データの蓄積量が、この設定された音声データ蓄積量の最大値より多いと判断されると、何もせずに、ステップ１０１０における音声データ受信に戻る。
即ち、この場合には、音声入出力インタフェースドライバ３１０にある音声データの蓄積量が、５００バイト以上あるため、ここで音声データを音声入出力インタフェースドライバ３１０に送ると、音声データが多くなり、音声データの再生に時間がかかることになる。つまり、音声データの遅延が大きくなり、相手の声が遅れて聞こえることになる。
【００５２】
以上のようにして、音声データの再生処理においては、音声データの蓄積量をある設定値内にすることで、音声データが遅れて届いた時でも、音声が途切れることなく再生することができ、設定量を超えた遅延をなくすことができる。
【００５３】
なお、上述した図９，図１０の説明では、パソコンにおける音声再生処理について説明したが、ＰＢＸゲートウェイ２００における音声再生処理の場合は、上述した説明の音声入出力インタフェース３０５を電話インタフェース２０５に、音声入出力インタフェースドライバ３１０を電話インタフェースドライバ２１０に置き換えればよいものであり、ＰＢＸゲートウェイ２００においても同様の音声再生処理を行うことができる。
【００５４】
次に、図１及び図１２を用いて、音声中継設定の処理について説明する。
【００５５】
図１に示した音声中継設定画面３００Ａの「設定」ボタンが押されると、図１２に示す音声中継設定の処理が行われる。
【００５６】
ステップ１２１０において、ＣＰＵ３６０は、設定画面３００Ａの「設定」ボタンが押された事を認識すると、どの設定が変更になったか調べ、それぞれの処理を行う。
【００５７】
ステップ１２２０において、ＣＰＵ３６０は、再生バッファの値が変更されているか否かを判断し、再生バッファの値が変更されている場合には、ステップ１２３０において、ＣＰＵ３６０は、図１１に示した音声中継の設定値の中の「音声データ蓄積量」を、設定された値に変更する。なお、「音声データ蓄積量」に設定幅がある場合は、その設定幅をあらかじめ決めておくものとして、図１１に示した例では、設定幅を「１２０」バイトとしてある。音声中継設定画面３００Ａでは、「音声データ蓄積量」の設定値の下限値を設定するので、「音声データ蓄積量」は、４００〜５００バイトとなる。
本実施形態においては、「音声データ蓄積量」の設定値を変更するだけで、図１０で説明した方式により、音声入出力インタフェースドライバ３１０に蓄積される音声データ量を変えることができる。
【００５８】
パソコン３００のオペレータは、相手からの電話を受信中に、音の遅延が大きくなったと感じたときには、図１に示すように、パソコン３００の表示画面に音声中継設定画面３００Ａを表示して、「音声データ蓄積量」の設定値を小さくする。これによって、「音声データ蓄積量」を小さくすることができるため、再生される音の遅延量を小さくすることができる。また、音切れが発生した場合には、図１に示すように、パソコン３００の表示画面に音声中継設定画面３００Ａを表示して、「音声データ蓄積量」の設定値を大きくする。これによって、「音声データ蓄積量」を大きくすることができるため、再生される音の音切れを防止することができる。
【００５９】
音声の遅れに違和感を覚える場合や音切れに不快感を覚える場合は、オペレータによって個人差があるものであり、オペレータによっては、音切れが多少発生してもいいから音声の遅延を少なくしたいようにしたいという場合や、または、遅延が大きくなってもいいから音切れをなくしたいという場合があるので、オペレータの感性に応じたよりよい音質の音声を得ることができるようになる。
【００６０】
次に、ステップ１２４０において、ＣＰＵ３６０は、符号化方式に変更があったか否かを判断し、符号化方式に変更があった場合は、ステップ１２５０において、ＣＰＵ３６０は、符号化方式を変更する。
ここで、符号化方式とは、ハンドセット３９０から入力された音声を、ＬＡＮ２０で送受信できる形に変更する方式のことである。符号化方式の変更は、音声入出力インタフェースドライバ３１０を使って、音声入出力インタフェース３１０の設定を変えることで行う。
【００６１】
また、ステップ１２６０において、ＣＰＵ３６０は、音声データ長が変更されているか否かを判断し、音声データ長が変更されていた場合は、ステップ１２７０において、ＣＰＵ３６０は、「音声データの長さ」を変更する。この設定値を変更するだけで、図９で説明した音声データ送信時の処理により、送信する音声データの長さを変更することができる。
【００６２】
以上説明したように、本実施形態では、利用者が音声中継の設定を変更できることにより、音切れしてもいいから遅延をなるべくしたい人や遅延してもいいから音切れするのが嫌な人の要望をかなえることができる。
【００６３】
次に、図１３を用いて、電話機等の端末を用いて、音声中継の設定値を変更する場合について説明する。
【００６４】
図１２に示した例では、音声通信装置としてパソコンを用いているため、ディスプレイを用いて設定値の変更を行うことができるが、ディスプレイのない電話機等の音声通信装置の場合は、設定値を設定用のボタンなどを使用し、電話機のボタンを押した時に発生するＤＴＭＦ（ＤｕａｌＴｏｎｅＭｕｌｔｉＦｒｅｑｕｅｎｃｙ）信号を用いて設定できる。
【００６５】
図１３は、ＰＢＸゲートウェイ２００が、電話網１０に接続された電話機から発せられたＤＴＭＦ信号を検出し、設定を変更する処理を説明するフローチャートある。
【００６６】
ステップ１３１０において、ＰＢＸゲートウェイ２００のＣＰＵ２６０は、音声中継中にＤＴＭＦ信号を、電話インタフェースドライバ２１０へ繰り返し問い合わせる。
【００６７】
そして、ステップ１３２０において、ＣＰＵ２６０は、ＤＴＭＦ信号を検出したか否かを判断し、ＤＴＭＦ信号を検出すると、ステップ１３３０において、ＣＰＵ２６０は、電話インタフェースドライバ２１０から、受信したＤＴＭＦ信号を取得する。
【００６８】
次に、取得したＤＴＭＦ信号が設定変更の命令か否かを判断する。ここで、設定変更命令は、あらかじめ決められたものとし、例えば、「＃」を押した後に「４００」を押すと、「音声データ蓄積量」を「４００」にするというように決められている。
【００６９】
そして、ＤＴＭＦ信号が設定変更の命令である場合には、ステップ１３５０において、ＣＰＵ２６０は、図１１に示した音声中継の設定値を変更する。
【００７０】
以上のようにして、本実施形態では、音声中継の音声データ蓄積量の設定をパソコンからだけでなく、電話機側からもできるようになる。
【００７１】
【発明の効果】
本発明によれば、利用者の判断により、利用者にとって聞きやすい状態、即ち、よりよい音質の音声を提供することができるようになる。
【図面の簡単な説明】
【図１】本発明の一実施形態による音声中継システムの全体構成を示すシステム構成図である。
【図２】本発明の一実施形態による音声中継システムの中のＰＢＸゲートウェイのハードウェア構成のブロック図である。
【図３】本発明の一実施形態による音声中継システムの中の音声通信装置であるパソコンのハードウェア構成のブロック図である。
【図４】本発明の一実施形態による音声中継システムの中のＰＢＸゲートウェイの音声中継処理を示すフローチャートである。
【図５】本発明の一実施形態による音声中継システムの中のＰＢＸゲートウェイが有している内線番号−ＩＰアドレス対応表の説明図である。
【図６】本発明の一実施形態による音声中継システムの中の音声通信装置であるパソコンの音声中継処理を示すフローチャートである。
【図７】本発明の一実施形態による音声中継システムの中の音声通信装置であるパソコンの音声中継処理においてパソコンから発信する時の発信画面の説明図である。
【図８】本発明の一実施形態による音声中継システムの中の音声通信装置であるパソコンの音声中継処理において音声中継中の画面の説明図である。
【図９】本発明の一実施形態による音声中継システムにおける音声中継処理の中のデータ送信時の処理を説明するフローチャートである。
【図１０】本発明の一実施形態による音声中継システムにおける音声中継処理の中のデータ受信時の処理を説明するフローチャートである。
【図１１】本発明の一実施形態による音声中継システムにおける音声中継処理において利用する設定値の一例の説明図である。
【図１２】本発明の一実施形態による音声中継システムにおける音声中継処理の中の音声中継設定時の処理を説明するフローチャートである。
【図１３】本発明の一実施形態による音声中継システムにおける音声中継処理の中の電話機等の端末から音声中継設定する際の処理を説明するフローチャートである。
【符号の説明】
１０…電話網
２０…ＬＡＮ（ＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ）
２００…ＰＢＸゲートウェイ装置
２０５…電話インタフェース
２１０…電話インタフェースドライバ
２６０，３６０…ＣＰＵ
３００，４００…パソコン
３０５…音声入出力インタフェース
３１０…音声入出力インタフェースドライバ
３９０，４９０…ハンドセット[0001]
BACKGROUND OF THE INVENTION
The present invention relates to a voice communication apparatus used in a voice relay system in which voice data is exchanged between voice communication apparatuses, which are terminals, and conversation is possible.
[0002]
[Prior art]
As a voice relay system for sending and receiving voice data between personal computers (personal computers) connected to a LAN (Local Area Network), for example, Nikkei Data Pro Datacom, 1995, p. Talkware of Hitachi, Ltd. described in 681 is known. An example of a voice relay system for making a call using the Internet is, for example, Internetworking, 1996.69, p. A Cool Talk of Netscape Communications, described in 14-15, is known.
[0003]
When audio data is transmitted / received over a communication network, the audio data is delayed and reaches the other party due to the load on the communication network. If playback is performed immediately when audio data is received, sound interruption will occur if the audio data arrives late. Therefore, in Talkware and Cool Talk, sound interruption is prevented by accumulating and reproducing a little received audio data.
[0004]
In particular, in Talkware, the amount of audio data stored is automatically calculated and audio reproduction processing is performed.
[0005]
[Problems to be solved by the invention]
However, if the network transmission rate is slow, the amount of voice data stored increases, so the voice delay increases. When the voice delay increases, some received operators may feel uncomfortable with the received voice. In addition, since there is an upper limit for the amount of audio data stored, if a transmission delay of the audio data that exceeds this upper limit occurs, sound interruption will occur. In the sound interruption, since the received voice is intermittent, the received voice may be uncomfortable depending on the receiving operator.
[0006]
When the user feels uncomfortable with the sound delay or feels uncomfortable with the sound interruption, there are individual differences depending on the operator. Therefore, the conventional method of automatically calculating the amount of audio data stored and performing the audio reproduction process has a problem in that it is difficult to provide audio with better sound quality according to the sensitivity of the operator. It was. Depending on the operator, there may be a case where it is desired to reduce the delay of the sound because the sound may be interrupted to some extent, or a case where it is desired to eliminate the sound because the delay may be increased.
[0007]
An object of the present invention is to provide an audio communication apparatus that can provide audio with better sound quality.
[0008]
[Means for Solving the Problems]
(1) In order to achieve the above object, the present invention provides a voice communication device that relays voice data, a storage unit that temporarily stores voice data transmitted from another voice communication device after receiving the voice data; Reproducing means for reproducing the audio data sent from the accumulating means by the audio output means, setting value varying means for varying the setting value of the amount of accumulated audio data accumulated in the accumulating means during voice relay, and A display unit that displays at least a part of the set value of the audio data; and an input unit that is instructed by the user to input the set value of the audio data. The display unit includes the set value of the audio data. A setting value of a voice data storage amount representing the amount of voice data stored in the storage means, and the input means receives the voice data storage amount by a user, and the setting value variable It is a set of the audio data accumulation amount from the input means is set as a new audio data accumulation amount In addition, the set value of the audio data is further input to the storage unit at once, or an audio data length representing the length of audio data to be output from the storage unit at once, or the audio data Including at least one of the encoding methods, the audio data length or at least one of the audio data encoding methods is displayed by the display means, and new information is input via the input means, and the set value The input new information is set by the variable means. It is what I did. With this configuration, the set value of the accumulated amount of voice data can be changed during voice relay according to the user's judgment, so that it is possible to provide a voice that is easy to hear for the user, that is, a voice with better sound quality.
[0010]
( 2 ) In the above (1), preferably, control means for controlling an accumulation amount of audio data accumulated in the accumulation means is provided, and the control means has an accumulation amount of audio data accumulated in the accumulation means as a predetermined value. When the number is less, the voice data sent from another voice communication device is stored in the storage means, and then a part of the same voice data sent as the voice data is stored in the storage means again. is there. With such a configuration, sound interruption can be prevented, and sound quality of sound reproduced for preventing sound interruption can be improved.
[0011]
DETAILED DESCRIPTION OF THE INVENTION
Hereinafter, a voice relay system using a voice communication apparatus according to an embodiment of the present invention will be described with reference to FIGS.
First, the overall configuration of a voice relay system according to an embodiment of the present invention will be described with reference to FIG.
[0012]
The PBX gateway 200 is connected to the telephone line 10 and can accept calls received from the telephone line 10. The PBX gateway 200 is connected via a LAN (Local Area Network) 20 to personal computers (personal computers) 300 and 400 constituting the voice communication apparatus according to the present embodiment, and can exchange information via the LAN 20. The PBX gateway 200 can send voices heard from the telephone to the personal computers 300 and 400 and reproduce voice data sent from the personal computers 300 and 400 on the telephone line.
[0013]
Handsets 390 and 490 are connected to the personal computers 300 and 400, and voice input / output is possible using the handsets 390 and 490.
A setting screen 300A can be displayed on the display unit of the personal computer 300, and the setting value of the personal computer 300 can be changed while the setting screen 300A is displayed. The contents of the setting screen 300A and the setting change method will be described later with reference to FIG.
[0014]
Next, the hardware configuration of the PBX gateway in the voice relay system according to the embodiment of the present invention will be described with reference to FIG.
[0015]
A telephone interface 205 in the PBX gateway 200 is connected to the telephone line 10 and can recognize incoming calls. The telephone interface 205 is controlled by the telephone interface driver 210. The telephone interface driver 210 has an internal buffer (not shown), accumulates voice data sent from the telephone line 10 via the telephone interface 205, and sends it to the telephone line 10 via the telephone interface 205. Accumulate audio data.
[0016]
The keyboard driver 215 accepts input from the keyboard 220. The mouse driver 225 receives input from the mouse 230. Display on the display 235 is controlled by the display controller 240. A program or the like executed by the PBX gateway 200 is read from the disk 250 into the main memory 255 by the disk controller 245 and executed by the CPU 260. The PBX gateway 200 is connected to the LAN 20 that is a communication network by a communication interface 265.
[0017]
Next, the hardware configuration of a personal computer that is a voice communication device in the voice relay system according to the embodiment of the present invention will be described with reference to FIG. Here, the personal computer 300 will be described as an example, but the personal computer 400 has the same configuration.
[0018]
The voice input / output interface 305 is connected to the handset 390 and can input / output voice from the handset 390. The voice input / output interface 310 is controlled by the voice input / output interface driver 310. The audio interface driver 310 has an internal buffer (not shown), accumulates audio data sent from the handset 390 via the audio interface 305, and transmits audio data sent to the handset 390 via the audio interface 305. Accumulate.
[0019]
The keyboard driver 315 accepts input from the keyboard 320. The mouse driver 325 accepts input from the mouse 330. The display on the display 335 is controlled by the display controller 340. A program or the like executed on the personal computer 300 is read from the disk 350 into the main memory 355 by the disk controller 345 and executed by the CPU 360. The personal computer 300 is connected to the LAN 20, which is a communication network, via a communication interface 365.
[0020]
Next, the voice relay processing of the PBX gateway 200 in the voice relay system according to the embodiment of the present invention will be described using FIG. 4 and FIG.
[0021]
The voice relay program executed by the PBX gateway 200 is read from the disk 250 into the main memory 255 by the disk controller 245 and executed by the CPU 260. The voice relay program is installed in the disk 250 from a storage medium such as an FD or a CD-ROM.
[0022]
In step 410 of FIG. 4, the CPU 260 of the PBX gateway 200 normally enters a state waiting for an incoming call. When waiting for an incoming call, the voice relay program periodically checks whether there is an incoming call to the telephone interface 205 using the telephone interface driver 210 shown in FIG.
[0023]
When there is an incoming call, in step 420, the CPU 260 opens the audio file stored in the disk 250 and sends the audio data recorded in the file to the audio interface 205 using the telephone interface driver 210. Make a voice response. The content of this voice response is, for example, a content that announces that the extension number is to be pressed, such as “This is a PBX gateway. Please press the extension number”.
[0024]
In step 430, CPU 260 accepts an extension number. When the extension number is received, the voice relay program periodically checks the number input to the telephone interface 205 with the DTMF signal using the telephone interface driver 210 shown in FIG. 2, and the DTMF signal recognized by the telephone interface 205 is received. Is obtained from the telephone interface driver 210.
[0025]
When the recognized extension number is received, in step 440, the CPU 260 obtains an IP address using the extension number-IP address correspondence table 500 shown in FIG. 5, and sends a call message to the personal computer 300 connected by the LAN 20 or the like. send.
[0026]
Here, the configuration of the extension number-IP address correspondence table 500 will be described with reference to FIG.
As shown in FIG. 5, the extension number / IP address correspondence table 500 includes an extension number 510, a name 520, and an IP address 530. For example, the operator of the personal computer corresponding to the extension number 510 of “1234” is “taro-yamamoto” as described in the name 520 column, and the IP address 530 of the personal computer being used is “192”. .10.12 ". That is, when the extension number received in step 430 is “1234”, the corresponding IP address “192.168.1.12.” Is acquired. The extension number / IP address correspondence table 500 is stored in the main memory 255.
Further, in the personal computer that has received a call from the PBX gateway 200 in step 440, a call bell or a message is displayed to notify the user of the personal computer of the incoming call. When the user of this personal computer presses a button for accepting a call, a reception OK message is sent from the personal computer to the PBX gateway 200.
[0027]
In step 450, the CPU 260 determines whether or not the personal computer is receiving OK, and when the PBX gateway receives the receiving OK message, in step 460, the CPU 260 starts voice relay. If the reception is not OK, the CPU 260 hangs up the telephone in step 490.
[0028]
In step 460, the CPU 260 starts the voice relay. Details of the voice relay process will be described later with reference to FIGS.
[0029]
After starting the voice relay, the voice relay program of the CPU 260 uses the telephone interface driver 210 to periodically check the telephone disconnection.
In step 470, the CPU 260 determines whether the telephone disconnection is detected or whether a disconnect message is received from the personal computer 300. If the telephone disconnection is detected or a disconnection message is received from the personal computer 300, the CPU 260 ends the voice relay at step 480. In step 490, CPU 260 disconnects the telephone and returns to the incoming call waiting state in step 410.
[0030]
Next, voice relay processing when the personal computer 400 is called from the personal computer 300 which is a voice communication device in the voice relay system according to the embodiment of the present invention will be described with reference to FIGS.
[0031]
The voice relay program executed on the personal computer 300 is read from the disk 350 into the main memory 355 by the disk controller 345 shown in FIG. 3 and executed by the CPU 360. The voice relay program is installed on the disk 350 from a storage medium such as an FD or a CD-ROM.
[0032]
When starting the voice relay, the user inputs the “IP address” on the outgoing screen 300B shown in FIG. 7 and designates the other party to call, and then presses the “outgoing” button.
In step 610, CPU 360 confirms that the call button has been pressed.
Next, in step 620, the CPU 360 sends the outgoing message to the other personal computer 400.
[0033]
Next, in step 630, the CPU 360 sends a call message to the personal computer 400 connected via the LAN 20.
The personal computer 400 that received the call from the personal computer 300 in step 630 notifies the user of the personal computer of the incoming call by displaying a call bell or a message. Then, when the user of this personal computer presses a button for accepting a call, a reception OK message is sent from the personal computer 400 to the personal computer 300.
[0034]
In step 640, the CPU 360 determines whether or not the personal computer is reception OK. When the personal computer 300 receives the reception OK message, in step 650, the CPU 360 starts voice relay. On the other hand, if the reception is not OK, the process returns to the incoming call waiting state in step 610.
[0035]
In step 650, the CPU 360 starts voice relay. When the voice relay is started, the personal computer 300, 400 that is voice relaying displays the screen 300C for voice relay shown in FIG.
[0036]
In step 660, CPU 360 determines whether a disconnect button on screen 300C during voice relay has been pressed or whether a disconnect message from personal computer 340 has been received. If it is detected that the disconnect button has been pressed or if a disconnect message is received from the personal computer 400, in step 670, the CPU 360 ends the voice relay. Then, CPU 360 disconnects the phone and returns to the incoming call waiting state in step 610.
[0037]
Next, processing at the time of data transmission and data reception in the voice relay processing in the voice relay system according to the embodiment of the present invention will be described with reference to FIGS.
[0038]
When voice relay is started in step 460 of FIG. 4 or step 650 of FIG. 6, the PBX gateway 200 or the personal computer 300 performs data transmission processing for transmitting voice data to the other party (the personal computer 300 or the personal computer 400) shown in FIG. Data reception processing when the voice data sent from the other party shown in FIG. 10 is received is started. In the following description, a case where data is transmitted from the personal computer 300 to the personal computer 400 will be described.
[0039]
First, the data transmission process will be described with reference to FIG.
In step 910, the CPU 360 confirms the amount of audio data to be transmitted using the audio input / output interface driver 310.
Here, setting values for voice relay will be described with reference to FIG. As shown in FIG. 11, the setting contents and setting values are configured, and these setting contents and setting values are stored in the main memory 355. In the example shown in FIG. 11, “length of audio data” is set to “120” bytes (Bytes). The audio data is transmitted collectively for each predetermined number of bytes. Here, “the length of the audio data” indicates the length of the audio data to be transmitted at one time. An audio data length of 120 bytes corresponds to an audio signal having a length of about 30 mS. Further, the “audio data accumulation amount” is set to “400 to 500” bytes. Here, “sound data storage amount” indicates the amount of sound data stored in order to prevent sound interruption. This will be described in detail in steps 1030 and 1040 in FIG. The “encoding method” is set to “ADPCM”. “Volume” is set to “10” (arbitrary unit).
[0040]
In step 920, the CPU 360 determines that the amount of audio data confirmed in step 910 is larger than the setting value “120” bytes (length of audio data to be sent to the other party at a time) of “length of audio data” shown in FIG. If it does not satisfy the set value, the process returns to step 910 to repeat the confirmation of the audio data amount.
[0041]
When the audio data to be transmitted exceeds the set value of “length of audio data”, in step 930, the CPU 360 uses the audio input / output interface driver 310 to extract the audio data from the audio input / output interface 310.
[0042]
In step 940, the CPU 360 sends the extracted audio data to the partner computer.
[0043]
In step 950, the CPU 360 determines whether or not the voice relay is finished, and if not finished, repeats the steps 910 to 940.
[0044]
Next, processing at the time of data reception will be described with reference to FIG. The personal computer 400 performs data reception processing. Since the personal computer 300 and the personal computer 400 have the same configuration, the description will be made based on the configuration of the personal computer 300 shown in FIG.
[0045]
In step 1010, the CPU 360 receives the audio data sent in step 740 of FIG.
[0046]
Next, in step 1020, the CPU 360 confirms the amount of audio data stored in the audio input / output interface 310.
In the configuration of the personal computer 300 shown in FIG. 3, audio data received via the LAN 20 and the communication interface 365 is once taken into an internal buffer in the CPU 360. Thereafter, the audio data in the internal buffer of the CPU 360 is input to the internal buffer of the audio input / output interface driver 310. The capacity of the internal buffer of the audio input / output interface driver 310 is, for example, 4 KB, but audio data of about 400 to 500 bytes is always stored in the internal buffer. The voice input / output interface driver 310 sends voice data to the voice input / output interface 305 at regular time intervals, for example, every 30 ms, and reproduces sound from the handset 390. When there is no audio data stored in the internal buffer of the audio input / output interface driver 310, no sound is played from the handset 390, and the sound is cut off.
Therefore, in step 1020, the CPU 360 checks the amount of audio data stored in the audio input / output interface driver 310.
[0047]
Here, assuming that “400 to 500” KB is set as “sound data storage amount” shown in FIG. 11, in steps 1030 and 1040, the storage amount of the sound data in the sound input / output interface 310 is the same. Then, it is determined whether it is larger or smaller than the set maximum value of the voice data storage amount.
[0048]
In step 1030, the CPU 360 determines that the amount of audio data stored in the audio input / output interface driver 310 is less than the set maximum value of the audio data storage amount. In step 1040, the CPU 360 If it is determined that the amount of audio data stored in the output interface driver 310 is greater than the set minimum value of the audio data storage amount, in step 1060, one audio data is input from the internal buffer of the CPU 360 into the audio input. Input to the internal buffer of the output interface driver 310.
That is, when the process of step 1060 is performed, the amount of audio data stored in the internal buffer of the audio input / output interface driver 310 is within an appropriate range of 400 to 500 bytes. Therefore, as shown in FIG. 11, since the “length of audio data” is set to “120” bytes, 120 bytes of audio data for one time of audio data is sent from the internal buffer of the CPU 360 to the audio data. Input to the internal buffer of the input / output interface driver 310. On the other hand, since the voice input / output interface driver 310 sends 120 bytes of voice data to the voice input / output interface 305 at a constant cycle, the amount of voice data stored in the internal buffer of the voice input / output interface driver 310 is 400 to 400. An appropriate range of 500 bytes can be maintained. Therefore, the sound reproduced from the handset 390 will not be interrupted.
[0049]
Next, in step 1030, the CPU 360 determines that the accumulated amount of audio data in the audio input / output interface driver 310 is smaller than the set maximum value of the audio data accumulated amount. In step 1040, the CPU 360 If it is determined that the amount of sound data stored in the sound input / output interface driver 310 is less than the minimum value of the set amount of sound data, in step 1050, 1.5 times of sound data is stored in the CPU 360. Input from the internal buffer to the internal buffer of the audio input / output interface driver 310.
In other words, the processing in step 1050 is performed when the amount of audio data stored in the internal buffer of the audio input / output interface driver 310 is less than 400 bytes. Therefore, as shown in FIG. 11, since the “length of audio data” is set to “120” bytes, the audio data for 180 bytes in 1.5 times of the audio data is stored in the internal buffer of the CPU 360. To the internal buffer of the voice input / output interface driver 310. On the other hand, since the voice input / output interface driver 310 sends 120 bytes of voice data to the voice input / output interface 305 at regular intervals, the amount of voice data stored in the internal buffer of the voice input / output interface driver 310 gradually increases. Thus, it is possible to return to the proper range of 400 to 500 bytes.
In addition, in order to input audio data of 180 bytes in 1.5 times of audio data from the internal buffer of the CPU 360 to the internal buffer of the audio input / output interface driver 310, 120 bytes of data for the same time and the same 120 are used. Sends 60 bytes of audio data, half of the byte data. That is, the same data is sent for 60-byte audio data. As a result, the sound reproduced from the handset 390 by the voice input / output interface 305 is repeated for 60 bytes of the same sound, but this does not cause discomfort to those who use the handset 390. The first reason is that the 60-byte audio data has an extremely short time of about 15 mS. Further, since the sound reproduced at this time is the same as the previous sound, no unnaturalness occurs in human hearing. Here, it is conceivable to input pseudo audio data to the audio input / output interface 305. However, since the pseudo audio data is different from the sound reproduced before that, it is rather uncomfortable on human hearing. You will feel.
[0050]
Accordingly, the amount of audio data stored in the internal buffer of the audio input / output interface driver 310 can be gradually increased and restored to the appropriate range of 400 to 500 bytes. No cuts will occur.
[0051]
If the CPU 360 determines in step 1030 that the accumulated amount of audio data stored in the audio input / output interface driver 310 is greater than the maximum value of the set audio data accumulated amount, step 1010 is performed without doing anything. Return to receiving voice data.
That is, in this case, since the amount of audio data stored in the audio input / output interface driver 310 is 500 bytes or more, if the audio data is sent to the audio input / output interface driver 310 here, the audio data increases, It takes time to reproduce the data. That is, the delay of the voice data becomes large, and the other party's voice can be heard with a delay.
[0052]
As described above, in the audio data reproduction process, by setting the accumulated amount of audio data within a certain set value, even when the audio data arrives late, the audio can be reproduced without interruption, The delay exceeding the set amount can be eliminated.
[0053]
In the above description of FIGS. 9 and 10, the sound reproduction process in the personal computer has been described. However, in the case of the sound reproduction process in the PBX gateway 200, the sound input / output interface 305 described above is used as the telephone interface 205. The input / output interface driver 310 may be replaced with the telephone interface driver 210, and the PBX gateway 200 can perform the same audio reproduction process.
[0054]
Next, the voice relay setting process will be described with reference to FIGS.
[0055]
When the “Setting” button on the voice relay setting screen 300A shown in FIG. 1 is pressed, the voice relay setting process shown in FIG. 12 is performed.
[0056]
In step 1210, when recognizing that the “setting” button on the setting screen 300A has been pressed, the CPU 360 checks which setting has been changed and performs each process.
[0057]
In step 1220, the CPU 360 determines whether or not the value of the reproduction buffer has been changed. If the value of the reproduction buffer has been changed, in step 1230, the CPU 360 performs the voice relay shown in FIG. The “audio data storage amount” in the set value is changed to the set value. If the “audio data accumulation amount” has a set range, the set range is determined in advance. In the example shown in FIG. 11, the set range is set to “120” bytes. In the voice relay setting screen 300A, the lower limit value of the set value of “voice data storage amount” is set, so that “voice data storage amount” is 400 to 500 bytes.
In the present embodiment, the amount of audio data accumulated in the audio input / output interface driver 310 can be changed by simply changing the set value of “audio data accumulation amount” according to the method described with reference to FIG.
[0058]
When the operator of the personal computer 300 feels that the sound delay has increased while receiving a call from the other party, the operator displays a voice relay setting screen 300A on the display screen of the personal computer 300 as shown in FIG. Decrease the setting value of “Audio data accumulation amount”. As a result, the “sound data storage amount” can be reduced, and the delay amount of the reproduced sound can be reduced. Further, when sound interruption occurs, as shown in FIG. 1, the voice relay setting screen 300A is displayed on the display screen of the personal computer 300, and the setting value of “voice data accumulation amount” is increased. As a result, the “sound data accumulation amount” can be increased, so that it is possible to prevent interruption of the reproduced sound.
[0059]
If you feel uncomfortable with the sound delay or feel uncomfortable with the sound interruption, there are individual differences depending on the operator, and some operators may want to reduce the sound delay because sound interruption may occur. In some cases, it may be necessary to eliminate the sound interruption because the delay may be large, so it is possible to obtain a sound with better sound quality according to the sensitivity of the operator.
[0060]
Next, in step 1240, the CPU 360 determines whether or not the encoding method has been changed. If the encoding method has changed, in step 1250, the CPU 360 changes the encoding method.
Here, the encoding method is a method of changing the voice input from the handset 390 into a form that can be transmitted and received by the LAN 20. The encoding method is changed by changing the setting of the voice input / output interface 310 using the voice input / output interface driver 310.
[0061]
In step 1260, the CPU 360 determines whether or not the audio data length has been changed. If the audio data length has been changed, the CPU 360 changes “length of audio data” in step 1270. To do. By simply changing this set value, the length of the audio data to be transmitted can be changed by the processing at the time of audio data transmission described with reference to FIG.
[0062]
As described above, in this embodiment, the user can change the voice relay setting so that the sound can be cut off, or the person who wants to delay as much as possible or the person who does not like to cut off the sound because it can be delayed. Can be fulfilled.
[0063]
Next, with reference to FIG. 13, a case where the setting value of voice relay is changed using a terminal such as a telephone will be described.
[0064]
In the example shown in FIG. 12, since a personal computer is used as the voice communication device, the setting value can be changed using a display. However, in the case of a voice communication device such as a telephone without a display, the setting value is changed. A setting button or the like can be used, and a setting can be performed using a DTMF (Dual Tone Multi Frequency) signal generated when the telephone button is pressed.
[0065]
FIG. 13 is a flowchart illustrating a process in which the PBX gateway 200 detects a DTMF signal transmitted from a telephone connected to the telephone network 10 and changes the setting.
[0066]
In step 1310, the CPU 260 of the PBX gateway 200 repeatedly inquires the telephone interface driver 210 for a DTMF signal during voice relay.
[0067]
In step 1320, the CPU 260 determines whether or not a DTMF signal has been detected. When the DTMF signal is detected, the CPU 260 acquires the received DTMF signal from the telephone interface driver 210 in step 1330.
[0068]
Next, it is determined whether or not the acquired DTMF signal is a setting change command. Here, it is assumed that the setting change command is determined in advance. For example, when “400” is pressed after “#” is pressed, “audio data accumulation amount” is set to “400”. .
[0069]
If the DTMF signal is a setting change command, in step 1350, the CPU 260 changes the voice relay setting value shown in FIG.
[0070]
As described above, in the present embodiment, the setting of the voice data storage amount for voice relay can be set not only from the personal computer but also from the telephone side.
[0071]
【The invention's effect】
According to the present invention, it is possible to provide a sound that is easy for the user to hear, that is, a voice with better sound quality, based on the judgment of the user.
[Brief description of the drawings]
FIG. 1 is a system configuration diagram showing the overall configuration of a voice relay system according to an embodiment of the present invention.
FIG. 2 is a block diagram of a hardware configuration of a PBX gateway in a voice relay system according to an embodiment of the present invention.
FIG. 3 is a block diagram of a hardware configuration of a personal computer that is a voice communication device in the voice relay system according to the embodiment of the present invention.
FIG. 4 is a flowchart showing voice relay processing of a PBX gateway in the voice relay system according to the embodiment of the present invention.
FIG. 5 is an explanatory diagram of an extension number-IP address correspondence table possessed by a PBX gateway in a voice relay system according to an embodiment of the present invention.
FIG. 6 is a flowchart showing voice relay processing of a personal computer which is a voice communication device in the voice relay system according to the embodiment of the present invention.
FIG. 7 is an explanatory diagram of a call origination screen when making a call from a personal computer in a voice relay process of a personal computer that is a voice communication device in a voice relay system according to an embodiment of the present invention.
FIG. 8 is an explanatory diagram of a screen during voice relay in voice relay processing of a personal computer that is a voice communication device in the voice relay system according to the embodiment of the present invention.
FIG. 9 is a flowchart illustrating processing at the time of data transmission in voice relay processing in the voice relay system according to the embodiment of the present invention.
FIG. 10 is a flowchart for explaining processing at the time of data reception in voice relay processing in the voice relay system according to the embodiment of the present invention;
FIG. 11 is an explanatory diagram illustrating an example of setting values used in the voice relay processing in the voice relay system according to the embodiment of the invention.
FIG. 12 is a flowchart for explaining processing at the time of voice relay setting in voice relay processing in the voice relay system according to the embodiment of the present invention;
FIG. 13 is a flowchart illustrating processing when voice relay is set from a terminal such as a telephone in voice relay processing in the voice relay system according to the embodiment of the present invention.
[Explanation of symbols]
10 ... Telephone network
20 ... LAN (Local Area Network)
200 ... PBX gateway device
205 ... Telephone interface
210 ... Telephone interface driver
260, 360 ... CPU
300, 400 ... PC
305 ... Voice input / output interface
310 ... Voice input / output interface driver
390, 490 ... Handset

Claims

In a voice communication device that relays voice data,
Storage means for temporarily storing after receiving the voice data sent from another voice communication device;
Reproduction means for reproducing the audio data sent from the storage means by means of an audio output means;
Set value variable means for changing the set value of the amount of voice data stored in the storage means during voice relay;
Display means for displaying at least a part of the set value of the audio data to the user;
An input means for instructing input of a set value of the audio data by a user;
The display means displays a set value of an audio data accumulation amount representing an amount of audio data accumulated in the accumulation means as the audio data set value,
The input means receives the voice data accumulation amount by a user,
The set value variable means sets the setting of the audio data accumulation amount from the input means as a new audio data accumulation amount ,
The set value of the audio data is further input to the storage unit at once, or an audio data length representing the length of audio data to be output from the storage unit at once, or the audio data encoding Including at least one of the methods,
At least one of the audio data length and the audio data encoding method is displayed by the display means, new information is input via the input means, and the new information input by the set value variable means Is set .

The voice communication device according to claim 1,
Control means for controlling the amount of voice data stored in the storage means;
When the storage amount of the voice data stored in the storage unit is less than a predetermined value, the control unit stores the voice data sent from another voice communication device and stores the voice data in the storage unit. A voice communication apparatus, wherein a part of the same voice data as the data is stored again in the storage means.