JP2012212970A - Voice communication device, voice communication system, and voice communication method and program

Info

Publication number: JP2012212970A
Application number: JP2011076335A
Authority: JP
Inventors: Seiji Yamaguchi; 誠二山口
Original assignee: NEC Embedded Products Ltd
Current assignee: NEC Embedded Products Ltd
Priority date: 2011-03-30
Filing date: 2011-03-30
Publication date: 2012-11-01
Anticipated expiration: 2031-03-30
Also published as: JP5763385B2

Abstract

PROBLEM TO BE SOLVED: To provide a voice communication device, a voice communication system, a voice communication method, and a program that can avoid collisions between uplink and downlink voice data in a limited bandwidth when half-duplex communication is performed over a wireless link.

SOLUTION: The voice communication device performs half-duplex wireless communication with an opposite terminal. While voice data received from the opposite terminal is being output from the device, voice data to be sent to the opposite terminal can still be input. During that output, the device keeps the input voice data in memory. When reception of voice data from the opposite terminal has finished, the device sends the opposite terminal a request that prevents the opposite terminal from transmitting voice data, and then transmits the saved voice data to the opposite terminal.

Description

The present invention relates to a voice communication device, a voice communication system, a voice communication method, and a program for wireless communication of voice data.

Doorphones and intercoms installed in buildings and houses are known examples of voice communication systems that transmit and receive voice data between a plurality of terminals (see, for example, Patent Documents 1 and 2). Such systems use wired communication, which makes full-duplex (simultaneous) communication possible. Full-duplex communication over a wireless link can be approximated by switching the communication direction in time division so that communication appears simultaneous. However, the switching operation is complicated, and reproducing the finely divided input data naturally requires a wide-bandwidth communication scheme such as a wireless LAN. Even with a wireless LAN, simultaneous communication may become impossible as the amount of data increases. Another way to achieve full-duplex wireless communication is to provide two separate communication means, one dedicated to transmission and one dedicated to reception, but the cost makes this impractical. For these reasons, a voice communication system that uses wireless communication adopts half-duplex (alternating) communication, in which the same frequency band is used for both transmitting and receiving voice data.

Patent Document 1: JP 2000-224319 A
Patent Document 2: JP 2007-228392 A

ZigBee is one example of a short-range wireless communication standard that provides half-duplex communication. When ZigBee is used in, for example, a doorphone that carries both video and voice, the bandwidth is divided between video and voice at a ratio of about 10:1, and video is given priority. (The voice share is small because reserving bandwidth for both the uplink and the downlink of voice communication would be wasteful.) Because ZigBee has a narrow bandwidth to begin with, this ratio narrows the band available for voice communication even further. As a result, uplink and downlink voice data may collide frequently, and voice communication may break down.
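As a rough illustration of how little bandwidth a 10:1 split leaves for voice, the following sketch divides an assumed link rate between video and voice. The 250 kbit/s figure is an assumption (the nominal raw data rate of the IEEE 802.15.4 2.4 GHz physical layer that ZigBee builds on) and is not stated in this document; usable throughput would be lower in practice. The function name is hypothetical.

```python
def audio_share_kbps(link_kbps: float, video_parts: int = 10, audio_parts: int = 1) -> float:
    """Return the bandwidth left for voice when the link is split video:voice."""
    return link_kbps * audio_parts / (video_parts + audio_parts)


# Assumed nominal link rate; this value is not given in the document.
ASSUMED_LINK_KBPS = 250.0

audio = audio_share_kbps(ASSUMED_LINK_KBPS)
print(f"voice share of the link: {audio:.1f} kbit/s")
# If that share had to carry uplink and downlink at once, each would get half.
print(f"per direction if split evenly: {audio / 2:.1f} kbit/s")
```

Under these assumptions the voice share is a small fraction of an already narrow link, which is why letting only one direction use it at a time is attractive.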

The present invention has been made in view of the above circumstances. Its object is to provide a voice communication device, a voice communication system, a voice communication method, and a program that can avoid collisions between uplink and downlink voice data in a limited bandwidth when half-duplex communication is performed by wireless communication.

To achieve this object, the voice communication device of the present invention is a voice communication device that performs half-duplex communication by wireless communication with an opposite terminal. While voice data received from the opposite terminal is being output to the outside of the device, the device accepts input of voice data to be transmitted to the opposite terminal and stores the input voice data. After reception of voice data from the opposite terminal has finished, the device transmits to the opposite terminal a request for preventing the opposite terminal from transmitting voice data, and then transmits the stored voice data to the opposite terminal.

The voice communication system of the present invention is characterized in that voice data is communicated between voice communication devices of the present invention.

The voice communication method of the present invention is a voice communication method for a terminal that performs half-duplex communication by wireless communication with an opposite terminal. While voice data received from the opposite terminal is being output to the outside of the terminal, the terminal accepts input of voice data to be transmitted to the opposite terminal and stores the input voice data. After reception of voice data from the opposite terminal has finished, the terminal transmits to the opposite terminal a request for preventing the opposite terminal from transmitting voice data, and then transmits the stored voice data to the opposite terminal.

The program of the present invention is executed by a computer of a terminal that performs half-duplex communication by wireless communication with an opposite terminal. It causes the computer to execute: a process of inputting voice data to be transmitted to the opposite terminal while voice data received from the opposite terminal is being output to the outside of the terminal; a process of storing the input voice data while the voice data received from the opposite terminal is being output to the outside of the terminal; and a process of, after reception of voice data from the opposite terminal has finished, transmitting to the opposite terminal a request for preventing the opposite terminal from transmitting voice data and then transmitting the stored voice data to the opposite terminal.

According to the present invention, collisions between uplink and downlink voice data in a limited bandwidth can be avoided when half-duplex communication is performed by wireless communication.

FIG. 1 is a block diagram showing an example of the hardware configuration of a voice communication device according to an embodiment of the present invention.
FIG. 2 is a functional block diagram illustrating an example of the operation of the voice communication device according to an embodiment of the present invention.
FIG. 3 is a functional block diagram illustrating an example of the operation of the voice communication device according to an embodiment of the present invention.
FIG. 4 is a conceptual diagram illustrating an operation example of a voice communication system according to an embodiment of the present invention.

Embodiments for carrying out the present invention are described in detail below with reference to the accompanying drawings.

The voice communication device of this embodiment is a terminal (portable or fixed) that performs half-duplex communication with an opposite terminal (the terminal that is its communication partner) using a short-range wireless communication standard. FIG. 1 shows an example of its hardware configuration. As shown in FIG. 1, the voice communication device has an I/F 1, a system bus 2, an A/D converter 3, a D/A converter 4, a CODEC (AAC) 5, a RAM (Random Access Memory) 6, a CPU (Central Processing Unit) 7, a wireless unit 8, a GPIO 9, a microphone 10, a speaker 11, a button 12, and an LED 13. ZigBee is taken as the example of the short-range wireless communication standard used by the wireless unit 8. Although not shown in FIG. 1, the device may also include a display unit or the like for showing video exchanged together with the voice.

Next, an operation example of the voice communication device of this embodiment is described. The example is communication between two voice communication devices having the configuration shown in FIG. 1, referred to as terminal A and terminal B. FIG. 2 illustrates the operation of terminal A, FIG. 3 illustrates the operation of terminal B, and FIG. 4 illustrates the operation of both terminals. The steps in FIGS. 2 and 3 correspond to the steps in FIG. 4 (for example, "S4" in FIG. 2 denotes the same operation as "S4" in FIG. 4). The operation is described concretely below with reference to FIGS. 2, 3, and 4.

First, assume that terminals A and B are both in the "speaking-right idle" state (standby). The speaking right is the right to transmit voice to the opposite terminal using a predetermined bandwidth. A terminal without the speaking right receives the voice transmitted from the opposite terminal that holds it, and is controlled so that it cannot transmit voice to that opposite terminal. The speaking right moves between terminals as they request it from each other. The speaking-right idle state means that the terminal does not hold the speaking right and is neither transmitting nor receiving voice. The other two states are "speaking right acquired" (transmitting) and "speaking right released" (receiving). Speaking right acquired means that the terminal holds the speaking right and is transmitting voice to the opposite terminal. Speaking right released means that the terminal does not hold the speaking right and is receiving voice from the opposite terminal. In terminals A and B, the CPU 7, for example, always keeps track of which of these three states the terminal is in.
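The three states described above can be sketched as a small enumeration. This is only an illustration: the names `FloorState`, `may_send_voice`, and `may_request_now` are hypothetical and do not appear in the document.

```python
from enum import Enum


class FloorState(Enum):
    """The three speaking-right states tracked by the CPU (names are illustrative)."""
    IDLE = "idle (standby)"                # no speaking right, no voice sent or received
    ACQUIRED = "acquired (transmitting)"   # holds the speaking right, sending voice
    RELEASED = "released (receiving)"      # no speaking right, receiving voice


def may_send_voice(state: FloorState) -> bool:
    """Only the terminal that holds the speaking right may transmit voice."""
    return state is FloorState.ACQUIRED


def may_request_now(state: FloorState) -> bool:
    """An acquisition request is sent immediately only from the idle state."""
    return state is FloorState.IDLE


for s in FloorState:
    print(s.name, "send:", may_send_voice(s), "request now:", may_request_now(s))
```

Keeping the state in a single variable makes the rule "a terminal without the speaking right cannot transmit" a one-line check before every send.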

Here, operator X presses the button 12 on terminal A, says "Who is it?" into the microphone 10, and releases the button 12 when finished speaking (S1, S2). The button 12 is the button an operator presses when they want to input their voice into terminal A.

When the CPU 7 of terminal A detects the input (press) on the button 12 (S3), it transmits a speaking-right acquisition request to terminal B via the wireless unit 8 (S4). In this example the CPU 7 sends the speaking-right acquisition request when it detects input on the button 12, but the trigger is not limited to this. As another example, the request may be sent when input at or above a predetermined volume threshold is detected at the microphone 10. In that case the operator does not need to press the button 12 when speaking, so the terminal need not have a button 12.
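The two triggers described above (a button press, or microphone input at or above a threshold) can be sketched as one decision function. The function name, the normalized level scale, and the choice to support both triggers in the same function are assumptions made for illustration.

```python
from typing import Optional


def should_request_speaking_right(button_pressed: bool,
                                  mic_level: float = 0.0,
                                  level_threshold: Optional[float] = None) -> bool:
    """Decide whether to send a speaking-right acquisition request.

    With no threshold configured, only the button triggers the request.
    With a threshold, a microphone level at or above it also triggers it.
    """
    if button_pressed:
        return True
    if level_threshold is not None:
        return mic_level >= level_threshold
    return False


# Button-driven terminal: loud input alone does nothing.
print(should_request_speaking_right(False, mic_level=0.9))
# Threshold-driven terminal (hypothetical 0.5 threshold): no button needed.
print(should_request_speaking_right(False, mic_level=0.6, level_threshold=0.5))
```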

The words "Who is it?" spoken by operator X while pressing the button 12 are input from the microphone 10 of terminal A (S2) and, at the same time, A/D (analog/digital) converted by the A/D converter 3 and stored in the RAM 6 as a raw sound-source buffer (S5). This raw sound-source buffer is then compressed by the CODEC 5 and stored in the RAM 6 as a compressed sound-source buffer (S6). In the raw and compressed sound-source buffers shown in FIG. 2, buffer 1, for example, holds the voice data for "Who is it?". That is, as shown in FIG. 2, the RAM 6 stores a plurality of raw and compressed sound-source buffers, and each buffer holds either the voice data input while the button 12 was pressed or the voice data input to the microphone 10 at or above the predetermined volume threshold.
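The two-stage buffering in S5 and S6 can be sketched as follows. This is a minimal illustration, not the device's implementation: `zlib` stands in for the AAC codec purely so the example runs with the standard library, and the class and method names are hypothetical.

```python
import zlib
from collections import deque


class VoiceBuffers:
    """Holds raw and compressed sound-source buffers, one entry per utterance."""

    def __init__(self) -> None:
        self.raw = deque()         # digitized (A/D-converted) utterances
        self.compressed = deque()  # the same utterances after compression

    def store_utterance(self, pcm: bytes) -> None:
        """S5/S6: keep the raw buffer, then compress it and keep that too."""
        self.raw.append(pcm)
        # zlib is a stand-in for the AAC codec; the real device does not use zlib.
        self.compressed.append(zlib.compress(pcm))

    def next_to_send(self) -> bytes:
        """Return the oldest compressed buffer (first stored, first sent)."""
        return self.compressed.popleft()


buffers = VoiceBuffers()
buffers.store_utterance(b"\x00\x01" * 100)  # stands in for "Who is it?"
buffers.store_utterance(b"\x02" * 50)       # a second utterance
print(len(buffers.raw), "raw,", len(buffers.compressed), "compressed")
```

A first-in, first-out queue matches the later description that compressed buffers are sent in the order in which they were stored.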

The speaking-right acquisition request transmitted from terminal A is received by terminal B. When the CPU 7 of terminal B receives the speaking-right acquisition request from terminal A via the wireless unit 8 (S7), its own terminal is currently in the speaking-right idle state, so it returns a grant notification accepting the request to terminal A via the wireless unit 8 (S8). At this point the CPU 7 of terminal B transitions from speaking-right idle to speaking right released. As a result of this transition, terminal B enters the state of receiving voice from terminal A and is controlled so that it cannot transmit voice to terminal A.

When the CPU 7 of terminal A receives the grant notification from terminal B via the wireless unit 8 (S9), it transitions from speaking-right idle to speaking right acquired. As a result of this transition, terminal A enters the state of transmitting voice to terminal B. The CPU 7 then starts transmission of the compressed sound-source buffers stored in the RAM 6 to terminal B (S10). Under this control, the compressed sound-source buffers stored in the RAM 6 are transmitted to terminal B via the wireless unit 8, for example in the order in which they were stored in the RAM 6 (S11).
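The request and grant exchange in S7 to S11 can be sketched as two handlers, one per side. The message string `"GRANT"`, the function names, and the decision to return no reply when the receiver is not idle are assumptions; the document only describes the case in which the receiver is idle.

```python
from enum import Enum
from typing import List, Optional, Tuple


class FloorState(Enum):
    IDLE = 1      # speaking-right idle (standby)
    ACQUIRED = 2  # speaking right acquired (transmitting)
    RELEASED = 3  # speaking right released (receiving)


def on_acquisition_request(state: FloorState) -> Tuple[FloorState, Optional[str]]:
    """Receiver side (S7, S8): grant the request when idle and start receiving."""
    if state is FloorState.IDLE:
        return FloorState.RELEASED, "GRANT"
    return state, None  # other cases are not described in the text


def on_grant(state: FloorState, stored: List[bytes]) -> Tuple[FloorState, List[bytes]]:
    """Requester side (S9 to S11): take the speaking right, send in stored order."""
    if state is not FloorState.IDLE:
        return state, []
    to_send = list(stored)  # oldest buffer first
    stored.clear()
    return FloorState.ACQUIRED, to_send


b_state, reply = on_acquisition_request(FloorState.IDLE)  # terminal B
a_buffers = [b"who-is-it"]                                # terminal A's stored voice
a_state, frames = on_grant(FloorState.IDLE, a_buffers)    # terminal A
print(b_state.name, reply, a_state.name, frames)
```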

A compressed sound-source buffer transmitted from terminal A is received by the wireless unit 8 of terminal B and stored in the RAM 6 (S12). The compressed sound-source buffer is then decompressed by the CODEC 5 and stored in the RAM 6 as a raw sound-source buffer (S13), D/A (digital/analog) converted by the D/A converter 4 (S14), and output from the speaker 11 (S15). That is, operator X's words "Who is it?" are output from the speaker 11 of terminal B.

Operator Y can input voice at terminal B while "Who is it?" is still being output from the speaker 11. For example, operator Y presses the button 12 on terminal B, says "Parcel delivery" into the microphone 10, and releases the button 12 when finished speaking (S16, S17).

When the CPU 7 of terminal B detects the input (press) on the button 12 (S18), its own terminal is in the speaking-right-released state, so it holds back the speaking-right acquisition request to terminal A and waits.

The words "Parcel delivery" spoken by operator Y while pressing the button 12 are input from the microphone 10 of terminal B (S17) and, at the same time, A/D converted by the A/D converter 3 and stored in the RAM 6 as a raw sound-source buffer (S19). This raw sound-source buffer is then compressed by the CODEC 5 and stored in the RAM 6 as a compressed sound-source buffer (S20). In the raw and compressed sound-source buffers shown in FIG. 3, buffer 1, for example, holds the voice data for "Parcel delivery".

In terminal A, which holds the speaking right, when the CPU 7 detects that all of the compressed sound-source buffers stored in the RAM 6 have been transmitted (S21), it transitions from speaking right acquired to speaking-right idle. The CPU 7 then transmits to terminal B, via the wireless unit 8, a speaking-right idle notification informing the opposite terminal that its own terminal has entered the speaking-right idle state (S22).

When the CPU 7 of terminal B receives the speaking-right idle notification from terminal A via the wireless unit 8 (S23), it has a speaking-right acquisition request to terminal A waiting, so it transmits that request to terminal A via the wireless unit 8 (S24). At this point the CPU 7 of terminal B transitions from speaking right released to speaking right acquired.
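The deferred request in S18 and S23 to S24 can be sketched with a single pending flag. The class, the state strings, and the message strings are hypothetical; the branch that answers with an idle notification when nothing is pending follows the behavior the document describes for terminal A in S34 and S35.

```python
from typing import Optional


class Terminal:
    """Tracks the speaking-right state and a deferred acquisition request."""

    def __init__(self) -> None:
        self.state = "idle"           # "idle", "acquired", or "released"
        self.request_pending = False  # True while a request is being held back

    def on_talk_trigger(self) -> Optional[str]:
        """Button press or loud input: request at once if idle, else defer (S18)."""
        if self.state == "idle":
            return "ACQUISITION_REQUEST"
        if self.state == "released":
            self.request_pending = True
        return None

    def on_idle_notification(self) -> Optional[str]:
        """Peer finished sending (S23): send the deferred request, if any (S24)."""
        if self.state != "released":
            return None
        if self.request_pending:
            self.request_pending = False
            self.state = "acquired"
            return "ACQUISITION_REQUEST"
        self.state = "idle"
        return "IDLE_NOTIFICATION"  # nothing waiting, so report idle in turn


b = Terminal()
b.state = "released"              # terminal B is receiving from terminal A
print(b.on_talk_trigger())        # request is held back while receiving
print(b.on_idle_notification())   # sent once terminal A reports idle
```

Because the request is only released after the peer reports idle, the two directions never transmit voice at the same time.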

When the CPU 7 of terminal A receives the speaking-right acquisition request from terminal B (S25), it transitions from speaking-right idle to speaking right released. As a result of this transition, terminal A enters the state of receiving voice from terminal B and is controlled so that it cannot transmit voice to terminal B.

The CPU 7 of terminal B, which now holds the speaking right, starts transmission of the compressed sound-source buffers stored in the RAM 6 to terminal A (S26). Under this control, the compressed sound-source buffers stored in the RAM 6 are transmitted to terminal A via the wireless unit 8, for example in the order in which they were stored in the RAM 6 (S27).

A compressed sound-source buffer transmitted from terminal B is received by the wireless unit 8 of terminal A and stored in the RAM 6 (S28). The compressed sound-source buffer is then decompressed by the CODEC 5 and stored in the RAM 6 as a raw sound-source buffer (S29), D/A converted by the D/A converter 4 (S30), and output from the speaker 11 (S31). That is, operator Y's words "Parcel delivery" are output from the speaker 11 of terminal A.

In terminal B, which holds the speaking right, when the CPU 7 detects that all of the compressed sound-source buffers stored in the RAM 6 have been transmitted (S32), it transitions from speaking right acquired to speaking-right idle. The CPU 7 then transmits a speaking-right idle notification to terminal A via the wireless unit 8 (S33).

When the CPU 7 of terminal A receives the speaking-right idle notification from terminal B via the wireless unit 8 (S34), it recognizes that terminal B has entered the speaking-right idle state. Because terminal A has no speaking-right acquisition request to terminal B waiting, it transitions from speaking right released to speaking-right idle. The CPU 7 then transmits a speaking-right idle notification to terminal B via the wireless unit 8 (S35).

The CPU 7 of terminal B receives the speaking-right idle notification from terminal A via the wireless unit 8 (S36) and recognizes that terminal A has entered the speaking-right idle state. This completes the series of operations.

Note that even if voice is input from the microphone 10 of terminal B while voice data from terminal A is being output from the speaker, the speaking right does not move to terminal B as long as terminal A still has a large amount of voice data to send, so the input voice is not transmitted to terminal A and the conversation may fall out of step. One way to address this is for the CPU 7 to measure the time elapsed since the microphone-input voice data was stored in the RAM 6 (for example, the time since it was compressed) and, when that time exceeds a fixed period (a predetermined time that the user can change freely), to delete the voice data automatically without transmitting it to the opposite terminal. In that case, the operator may be notified, for example by an on-screen message, that the input voice data has been deleted.
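The time-limited retention described above can be sketched as a queue that stamps each stored buffer and drops entries that have waited too long. The class name, the use of `time.monotonic()`, and the `now` parameter (added so the behavior can be checked without waiting) are assumptions for illustration.

```python
import time
from collections import deque
from typing import Deque, List, Optional, Tuple


class ExpiringVoiceQueue:
    """Stored voice buffers that are deleted unsent once they exceed max_age_s."""

    def __init__(self, max_age_s: float) -> None:
        self.max_age_s = max_age_s  # user-adjustable retention period
        self._items: Deque[Tuple[float, bytes]] = deque()

    def store(self, data: bytes, now: Optional[float] = None) -> None:
        """Record the buffer together with the time it was stored."""
        stamp = time.monotonic() if now is None else now
        self._items.append((stamp, data))

    def drop_expired(self, now: Optional[float] = None) -> int:
        """Delete buffers older than max_age_s and return how many were deleted."""
        current = time.monotonic() if now is None else now
        dropped = 0
        while self._items and current - self._items[0][0] > self.max_age_s:
            self._items.popleft()
            dropped += 1
        return dropped

    def pending(self) -> List[bytes]:
        """Buffers still waiting to be sent, oldest first."""
        return [data for _, data in self._items]


queue = ExpiringVoiceQueue(max_age_s=5.0)  # 5 s is an arbitrary example value
queue.store(b"parcel-delivery", now=0.0)
deleted = queue.drop_expired(now=6.0)
if deleted:
    print(f"{deleted} stored message(s) deleted without sending")
```

The count returned by `drop_expired` is what a terminal could use to decide whether to show the operator a deletion notice.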

As described above, the voice communication device of this embodiment performs half-duplex communication by wireless communication with an opposite terminal. While voice data received from the opposite terminal is being output to the outside of the device, the device accepts input of voice data to be transmitted to the opposite terminal and stores the input voice data. After reception of voice data from the opposite terminal has finished, the device transmits to the opposite terminal a request for preventing the opposite terminal from transmitting voice data, and then transmits the stored voice data to the opposite terminal. Consequently, when half-duplex communication is performed by wireless communication (for example, under a short-range wireless communication standard), collisions between uplink and downlink voice data in a limited bandwidth can be avoided.

An embodiment of the present invention has been described above, but the invention is not limited to that embodiment, and various modifications are possible without departing from its gist.

For example, the operations of the embodiment described above can be executed by hardware, by software, or by a combination of the two.

When the processing is executed by software, a program in which the processing sequence is recorded may be installed into memory in a computer built into dedicated hardware and executed there. Alternatively, the program may be installed and executed on a general-purpose computer capable of executing various kinds of processing.

For example, the program can be recorded in advance on a hard disk or a ROM (Read Only Memory) serving as a recording medium. Alternatively, the program can be stored (recorded) temporarily or permanently on a removable recording medium such as a CD-ROM (Compact Disc Read Only Memory), an MO (Magneto Optical) disc, a DVD (Digital Versatile Disc), a USB (Universal Serial Bus) memory, a magnetic disk, or a semiconductor memory. Such a removable recording medium can be provided as so-called packaged software.

Besides being installed on a computer from a removable recording medium as described above, the program may be transferred to the computer wirelessly from a download site, or transferred to the computer by wire over a network such as a LAN (Local Area Network) or the Internet. The computer can receive the transferred program and install it on a recording medium such as a built-in hard disk.

The processing need not be executed only in time series in the order described in the above embodiment. It may also be configured to run in parallel or individually, according to the processing capability of the device that executes it or as otherwise required.

The system described in the above embodiment may also be configured as a logical collection of a plurality of devices, or so that the functions of the individual devices are combined.

1 I/F
2 System bus
3 A/D converter
4 D/A converter
5 CODEC
6 RAM
7 CPU
8 Wireless unit
9 GPIO
10 Microphone
11 Speaker
12 Button
13 LED

Claims

1. A voice communication device that performs half-duplex communication by wireless communication with an opposite terminal, wherein
the device accepts input of voice data to be transmitted to the opposite terminal while voice data received from the opposite terminal is being output to the outside of the device,
the device stores the input voice data while the voice data received from the opposite terminal is being output to the outside of the device, and
after reception of voice data from the opposite terminal has finished, the device transmits to the opposite terminal a request for preventing the opposite terminal from transmitting voice data, and then transmits the stored voice data to the opposite terminal.

2. The voice communication device according to claim 1, wherein when a predetermined time has elapsed after the input voice data was stored, the voice data is deleted without being transmitted to the opposite terminal.

3. The voice communication device according to claim 1, wherein a plurality of items of the input voice data are stored and, when transmitting to the opposite terminal, the stored items of voice data are transmitted to the opposite terminal in the order in which they were stored.

4. The voice communication device according to any one of claims 1 to 3, wherein the input voice data is either voice input while a button of the voice communication device is operated, or voice input from a microphone of the voice communication device at a volume equal to or higher than a predetermined threshold.

5. The voice communication device according to claim 1, wherein zigbee is used as the wireless communication standard.

6. A voice communication system, wherein voice data is communicated between voice communication devices according to claim 1.

7. A voice communication method for a terminal that performs half-duplex communication by wireless communication with an opposite terminal, wherein
the terminal accepts input of voice data to be transmitted to the opposite terminal while voice data received from the opposite terminal is being output to the outside of the terminal,
the terminal stores the input voice data while the voice data received from the opposite terminal is being output to the outside of the terminal, and
after reception of voice data from the opposite terminal has finished, the terminal transmits to the opposite terminal a request for preventing the opposite terminal from transmitting voice data, and then transmits the stored voice data to the opposite terminal.

8. A program to be executed by a computer of a terminal that performs half-duplex communication by wireless communication with an opposite terminal, the program causing the computer to execute:
a process of inputting voice data to be transmitted to the opposite terminal while voice data received from the opposite terminal is being output to the outside of the terminal;
a process of storing the input voice data while the voice data received from the opposite terminal is being output to the outside of the terminal; and
a process of, after reception of voice data from the opposite terminal has finished, transmitting to the opposite terminal a request for preventing the opposite terminal from transmitting voice data, and then transmitting the stored voice data to the opposite terminal.