JP2010103853A

JP2010103853A - Sound volume monitoring apparatus and sound volume monitoring method

Info

Publication number: JP2010103853A
Application number: JP2008274759A
Authority: JP
Inventors: Tsuyoki Nishikawa; 剛樹西川; Hiromoto Furukawa; 博基古川; Takeo Kanamori; 丈郎金森
Original assignee: Panasonic Corp
Current assignee: Panasonic Corp
Priority date: 2008-10-24
Filing date: 2008-10-24
Publication date: 2010-05-06

Abstract

<P>PROBLEM TO BE SOLVED: To obtain a sound volume independent of a positional relationship between a speaker and a microphone. <P>SOLUTION: The present invention relates to a sound volume monitoring apparatus 100 used for an acoustic system 1 including communication devices 11 and 12 which exchange sounds with each other via a communication network 10, comprising: a direct wave sound volume calculator 102 for calculating direct wave sound volume information 132 contained in a sound collecting signal m(t) and indicating the sound volume of a direct wave 142 in a sound output from a speaker 111; a distance calculator 101 for calculating distance information 131 indicating a distance between the speaker 111 and a microphone 112; a speaker position sound volume calculator 103 for calculating speaker position sound volume information 133 indicating the sound volume of a sound, output by the speaker 111, at a position spaced apart from the speaker 111 by a predetermined distance, while using the distance information 131 and the direct wave sound volume information 132; and a transmitter 104 for transmitting the speaker position sound volume information 133 to the communication device 12. <P>COPYRIGHT: (C)2010,JPO&INPIT

Description

本発明は、音量監視装置及び音量監視方法に関し、特に、通信網を経由して、互いに音声を送受信する第１の音声通信端末と第２の音声通信端末とを含む音響システムに用いられ、第２の音声通信端末から送信された音声が、第１の音声通信端末で出音される音量を監視する音量監視装置に関する。 The present invention relates to a volume monitoring device and a volume monitoring method, and more particularly, to a sound system including a first voice communication terminal and a second voice communication terminal that transmit and receive voice to and from each other via a communication network. The present invention relates to a volume monitoring device that monitors the volume of sound transmitted from a second voice communication terminal and output from a first voice communication terminal.

音声会議システム及びテレビ会議システムにおいて、一般的に、自地点側では、相手地点側のスピーカ及びマイクロホンの設置条件、及び接続機器の音量設定値などは分からない。これにより、自地点側は、相手地点側でどのような音量で自地点の音声が拡声されているか確認できない。よって、例えば、自地点側のユーザが相手地点側のユーザに質問をした際に、相手地点側のユーザからリアクションがないような場合に、自地点側のユーザは、相手地点側で正しく音声が拡声されている把握できないために、不安になる。このため、自地点側のユーザは、相手地点側のユーザに音声が正しく拡声されているかの確認を行う必要が生じるという問題がある。 In the audio conference system and the video conference system, generally, the installation condition of the speaker and microphone on the partner site side, the volume setting value of the connected device, etc. are not known on the local site side. As a result, the local site cannot confirm at what volume the voice of the local site is amplified at the partner site. Therefore, for example, when a user on the local site side asks the user on the other site side, if there is no reaction from the user on the other site side, the user on the local site side correctly speaks on the other site side. It becomes uneasy because it cannot be grasped because it is loud. For this reason, there is a problem that the user on the local site side needs to confirm whether or not the voice is correctly amplified by the user on the counterpart site side.

この問題に対して、特許文献１には、第１地点側が、当該第１地点側で拡声された音声の音量を取得し、取得した音量を第２地点側に通知する技術が開示されている。具体的には、第１地点側は、第２地点側のユーザの音声が第１地点側で拡声された際の、第１地点側に設置されたマイクロホンの収音信号を用いて、第１地点側における音量を取得する。これにより、第２地点側のユーザは、第１地点側で拡声さている音声の音量を知ることができる。 To deal with this problem, Patent Document 1 discloses a technique in which the first point side acquires the volume of the voice that is loudened on the first point side, and notifies the second point side of the acquired volume. . Specifically, the first point side uses the sound pickup signal of the microphone installed on the first point side when the voice of the user on the second point side is amplified on the first point side, Get the volume on the point side. Thereby, the user on the second point side can know the sound volume of the voice that is loud on the first point side.

しかしながら、第１地点側のマイクロホンの収音信号は、第１地点側のスピーカから拡声された第２地点側のユーザの音声だけでなく、第１地点側のユーザの音声及び第１地点側の背景騒音も混じっている。これにより、特許文献１記載の技術は、第２地点側のユーザの音声の音量のみを正しく取得することが困難であるという問題を有している。 However, the collected sound signal of the microphone on the first point side is not only the voice of the user on the second point side amplified by the speaker on the first point side, but also the voice of the user on the first point side and the sound on the first point side. There is also background noise. As a result, the technique described in Patent Document 1 has a problem that it is difficult to correctly acquire only the volume of the voice of the user on the second point side.

この問題を解決する技術として、特許文献２に記載の技術が知られている。特許文献２記載の技術では、第１地点側が、当該第１地点側が備えるエコーキャンセラで生成される疑似エコーを利用して、当該第１地点側で拡声された音声の音量を取得する。 As a technique for solving this problem, a technique described in Patent Document 2 is known. In the technique described in Patent Document 2, the first point side uses the pseudo echo generated by the echo canceller included in the first point side to acquire the volume of the voice that is amplified on the first point side.

ここで、エコーキャンセラで生成される疑似エコーは、第１地点側のマイクロホンの収音信号に含まれる音声のうち、第１地点側に設置されたスピーカから拡声された第２地点側のユーザの音声に相当する。よって、特許文献２記載の技術は、第１地点側のユーザの音声及び第１地点側の背景雑音の影響を除去した第２地点側のユーザの音声の音量のみを取得できる。
特開２００４−４８３２９号公報特開２００７−２６７２１８号公報 Here, the pseudo echo generated by the echo canceller is the sound of the user on the second point side that is amplified from the speaker installed on the first point side out of the sound included in the collected sound signal of the microphone on the first point side. Corresponds to voice. Therefore, the technique described in Patent Literature 2 can acquire only the volume of the user's voice on the first point side and the sound of the user's voice on the second point side from which the influence of background noise on the first point side is removed.
JP 2004-48329 A JP 2007-267218 A

しかしながら、特許文献２記載の技術では、疑似エコーより取得された音量は第１地点側のマイクロホン位置での音量である。つまり、同じ音量の音声が第１地点側のスピーカから拡声された場合でも、第１地点側のスピーカとマイクロホンとの位置関係に応じて、異なる音量が取得されることとなる。例えば、スピーカとマイクロホンとの位置が近い場合には、マイクロホン位置での音量は大きくなり、スピーカとマイクロホンとの位置が遠い場合には、マイクロホン位置での音量は小さくなる。これにより、第２地点側のユーザは、第１地点側で拡声される、定量的な音量を知ることができない。 However, in the technique described in Patent Document 2, the volume acquired from the pseudo echo is the volume at the microphone position on the first point side. That is, even when the sound with the same volume is amplified from the speaker on the first point side, different sound volumes are acquired according to the positional relationship between the speaker on the first point side and the microphone. For example, when the position of the speaker and the microphone is close, the sound volume at the microphone position increases, and when the position of the speaker and the microphone is far, the sound volume at the microphone position decreases. Thereby, the user on the second point side cannot know the quantitative sound volume that is amplified on the first point side.

以上のように、特許文献２記載の技術は、スピーカとマイクロホンとの位置関係に応じて、取得される音量が変化するという課題を有している。なお、特許文献１記載の技術も同様の課題を有している。 As described above, the technique described in Patent Document 2 has a problem that the acquired sound volume changes according to the positional relationship between the speaker and the microphone. The technique described in Patent Document 1 has a similar problem.

そこで、本発明は、前記従来の課題を解決するものであり、スピーカとマイクロホンとの位置関係に依存しない音量を取得できる音量監視装置及び音量監視方法を提供することを目的とする。 Therefore, the present invention solves the above-described conventional problems, and an object of the present invention is to provide a volume monitoring apparatus and a volume monitoring method that can acquire a volume independent of the positional relationship between a speaker and a microphone.

上記目的を達成するために、本発明に係る音量監視装置は、通信網を経由して、互いに音声を送受信する第１の音声通信端末と第２の音声通信端末とを含む音響システムに用いられ、前記第２の音声通信端末から送信された音声が、前記第１の音声通信端末で出音される音量を監視する音量監視装置であって、前記第１の音声通信端末は、前記第２の音声通信端末により、前記通信網を経由して送信されるスピーカ入力信号に基づき、音声を出音する第１のスピーカと、音声を収音することにより収音信号を生成し、当該収音信号を前記第２の音声通信端末に送信する第１のマイクロホンとを備え、前記音量監視装置は、前記収音信号に含まれる、前記第１のマイクロホンの位置における、前記第１のスピーカから出音された音声のうち直接波の音量である直接波音量を算出する直接波音量算出部と、前記第１のスピーカと前記第１のマイクロホンとの距離を算出する距離算出部と、前記距離算出部により算出された前記距離と、前記直接波音量算出部により算出された前記直接波音量とを用いて、前記第１のスピーカから予め定められた距離における、当該第１のスピーカにより出音される音声の音量である第１音量を算出する第１音量算出部と、前記第１音量を、前記通信網を経由して前記第２の音声通信端末に送信する送信部とを備える。 In order to achieve the above object, a volume monitoring apparatus according to the present invention is used in an acoustic system including a first voice communication terminal and a second voice communication terminal that transmit and receive voice to and from each other via a communication network. , A volume monitoring device for monitoring a volume of sound transmitted from the second voice communication terminal and output from the first voice communication terminal, wherein the first voice communication terminal includes the second voice communication terminal; The sound communication terminal generates a sound collection signal by collecting the first speaker that emits sound based on the speaker input signal transmitted via the communication network, and collects the sound. A first microphone that transmits a signal to the second voice communication terminal, and the volume monitoring device outputs the signal from the first speaker at the position of the first microphone included in the collected sound signal. Directly from the sound A direct wave volume calculation unit that calculates a direct wave volume that is a volume of the first sound, a distance calculation unit that calculates a distance between the first speaker and the first microphone, and the distance calculated by the distance calculation unit The first volume is the volume of the sound output by the first speaker at a predetermined distance from the first speaker using the direct wave volume calculated by the direct wave volume calculation unit. A first volume calculation unit that calculates a volume, and a transmission unit that transmits the first volume to the second voice communication terminal via the communication network.

この構成によれば、本発明に係る音量監視装置は、第１のスピーカから予め定められた距離における、当該第１のスピーカにより出音される音声の音量を算出できる。よって、本発明に係る音量監視装置は、スピーカとマイクロホンとの位置関係に依存しない、スピーカから拡声される音量を取得できる。これにより、第２の音声通信端末のユーザは、自身の声が第１の音声通信端末において正しく拡声されているか否かを、より正確に把握できる。 According to this configuration, the volume monitoring apparatus according to the present invention can calculate the volume of the sound output from the first speaker at a predetermined distance from the first speaker. Therefore, the volume monitoring apparatus according to the present invention can acquire the volume that is loudened from the speaker without depending on the positional relationship between the speaker and the microphone. Thereby, the user of the second voice communication terminal can more accurately grasp whether or not his / her voice is properly amplified in the first voice communication terminal.

また、前記第１音量算出部は、前記第１音量として、前記第１のスピーカの位置における、当該第１のスピーカにより出音される音声の音量であるスピーカ位置音量を算出してもよい。 In addition, the first sound volume calculation unit may calculate a speaker position sound volume that is a sound volume of sound output from the first speaker at the position of the first speaker as the first sound volume.

この構成によれば、本発明に係る音量監視装置は、第１のスピーカの位置における、当該第１のスピーカにより出音される音声の音量を算出できる。よって、本発明に係る音量監視装置は、スピーカとマイクロホンとの位置関係に依存しない、スピーカから拡声される音量を取得できる。 According to this configuration, the volume monitoring device according to the present invention can calculate the volume of the sound output by the first speaker at the position of the first speaker. Therefore, the volume monitoring apparatus according to the present invention can acquire the volume that is loudened from the speaker without depending on the positional relationship between the speaker and the microphone.

また、前記音量監視装置は、さらに、前記第１のスピーカと前記第１のマイクロホンとの間の音響伝達特性を算出する音響伝達特性算出部を備え、前記直接波音量算出部、及び前記距離算出部のうち少なくとも一方は、前記音響伝達特性を用いて、前記直接波音量、又は前記距離を算出してもよい。 The volume monitoring device further includes an acoustic transfer characteristic calculation unit that calculates an acoustic transfer characteristic between the first speaker and the first microphone, the direct wave volume calculation unit, and the distance calculation. At least one of the units may calculate the direct wave volume or the distance using the acoustic transfer characteristic.

この構成によれば、本発明に係る音量監視装置は、従来から用いられているエコーキャンセラで算出される音響伝達特性を用いて、第１のスピーカと第１のマイクロホンとの距離又は直接波音量を算出できる。これにより、第１の音声通信端末に対する機能追加（コスト増加）を抑制しつつ、音量監視装置の機能を実現できる。 According to this configuration, the volume monitoring device according to the present invention uses the acoustic transfer characteristic calculated by the echo canceller conventionally used, or the distance between the first speaker and the first microphone or the direct wave volume. Can be calculated. Thereby, the function of the volume monitoring device can be realized while suppressing the function addition (cost increase) to the first voice communication terminal.

また、前記距離算出部は、前記第１のスピーカから出音された音声が前記第１のマイクロホンに収音されるまでの時間遅延を、前記音響伝達特性を用いて算出し、当該時間遅延から前記距離を算出し、前記直接波音量算出部は、前記時間遅延と、前記スピーカ入力信号と、前記音響伝達特性とを用いて、前記直接波音量を算出してもよい。 In addition, the distance calculation unit calculates a time delay until the sound output from the first speaker is picked up by the first microphone using the acoustic transfer characteristics, and calculates the time delay from the time delay. The distance may be calculated, and the direct wave volume calculation unit may calculate the direct wave volume using the time delay, the speaker input signal, and the acoustic transfer characteristic.

また、前記第１音量算出部は、前記距離算出部により算出された前記距離と、前記直接波音量算出部により算出された前記直接波音量とを用いて、前記第１のスピーカにより出音された音声の、前記第１のスピーカから異なる距離離れた複数の位置における音量である複数の距離別音量を、前記第１音量として算出する距離別音量算出部を備えてもよい。 Further, the first sound volume calculation unit outputs sound from the first speaker using the distance calculated by the distance calculation unit and the direct wave sound volume calculated by the direct wave sound volume calculation unit. A distance-by-distance volume calculation unit that calculates a plurality of distance-by-distance volumes, which are sound volumes at a plurality of positions at different distances from the first speaker, as the first volume.

この構成によれば、本発明に係る音量監視装置は、第１のスピーカから複数の距離における音量を算出できる。つまり、本発明に係る音量監視装置は、より第１の音声通信端末のユーザに聞こえている音量に近い音量を算出できる。 According to this configuration, the volume monitoring device according to the present invention can calculate the volume at a plurality of distances from the first speaker. That is, the volume monitoring device according to the present invention can calculate a volume closer to the volume heard by the user of the first voice communication terminal.

また、前記音量監視装置は、さらに、前記第１のスピーカが設置された空間の残響特性を算出する残響特性算出部を備え、前記第１音量算出部は、さらに、前記残響特性を用いて前記距離別音量算出部により算出された前記複数の距離別音量を調整する残響調整部を備え、前記送信部は、前記第１音量として、前記残響調整部により調整された前記複数の距離別音量を、前記通信網を経由して前記第２の音声通信端末に送信してもよい。 The volume monitoring apparatus further includes a reverberation characteristic calculation unit that calculates a reverberation characteristic of a space in which the first speaker is installed, and the first volume calculation unit further uses the reverberation characteristic to A reverberation adjusting unit that adjusts the plurality of distance-specific volumes calculated by the distance-based sound volume calculating unit, and the transmission unit uses the plurality of distance-specific volumes adjusted by the reverberation adjusting unit as the first sound volume. Then, it may be transmitted to the second voice communication terminal via the communication network.

この構成によれば、本発明に係る音量監視装置は、残響特性を考慮した音量を算出できるので、より第１の音声通信端末のユーザに聞こえている音量に近い音量を算出できる。 According to this configuration, the volume monitoring apparatus according to the present invention can calculate the volume taking into account the reverberation characteristics, and therefore can calculate a volume closer to the volume heard by the user of the first voice communication terminal.

また、前記音量監視装置は、さらに、前記第１のスピーカと前記第１のマイクロホンとの間の音響伝達特性を算出する音響伝達特性算出部を備え、前記残響特性算出部は、前記音響伝達特性を用いて前記残響特性を算出してもよい。 The volume monitoring device further includes an acoustic transfer characteristic calculation unit that calculates an acoustic transfer characteristic between the first speaker and the first microphone, and the reverberation characteristic calculation unit includes the acoustic transfer characteristic. The reverberation characteristics may be calculated using

この構成によれば、本発明に係る音量監視装置は、従来から用いられているエコーキャンセラで算出される音響伝達特性を用いて、残響特性を算出できる。これにより、第１の音声通信端末に対する機能追加（コスト増加）を抑制しつつ、音量監視装置の機能を実現できる。 According to this configuration, the sound volume monitoring apparatus according to the present invention can calculate the reverberation characteristic using the acoustic transfer characteristic calculated by the echo canceller conventionally used. Thereby, the function of the volume monitoring device can be realized while suppressing the function addition (cost increase) to the first voice communication terminal.

また、前記音量監視装置は、さらに、前記第１の音声通信端末における騒音の音量である騒音レベルを算出する騒音レベル算出部を備え、前記第１算出部は、さらに、前記第１音量が前記騒音レベル以下の場合に、当該第１音量を実質的に無音状態に調整する騒音調整部を備え、前記送信部は、前記騒音調整部により調整された前記第１音量を、前記通信網を経由して前記第２の音声通信端末に送信してもよい。 The volume monitoring apparatus further includes a noise level calculation unit that calculates a noise level that is a volume of noise in the first voice communication terminal, and the first calculation unit further includes the first volume being the first volume. A noise adjustment unit that adjusts the first sound volume to a substantially silent state when the noise level is lower than the noise level, and the transmission unit transmits the first sound volume adjusted by the noise adjustment unit via the communication network. Then, it may be transmitted to the second voice communication terminal.

この構成によれば、本発明に係る音量監視装置は、第１の音声通信端末における騒音を考慮した音量を算出できるので、より第１の音声通信端末のユーザに聞こえている音量に近い音量を算出できる。 According to this configuration, the volume monitoring device according to the present invention can calculate the volume in consideration of the noise in the first voice communication terminal, and thus the volume closer to the volume heard by the user of the first voice communication terminal. It can be calculated.

また、前記音量監視装置は、さらに、前記第１音量が予め定められた音量より大きい場合に、前記通信網を経由して、前記第２の音声通信端末に、前記第１の音声通信端末において過大音が拡声されたことを警告する情報を送信する警告部を備えてもよい。 In addition, the volume monitoring device further includes the first voice communication terminal via the communication network to the second voice communication terminal when the first volume is larger than a predetermined volume. You may provide the warning part which transmits the information which alert | reports that an excessive loud sound was amplified.

この構成によれば、本発明に係る音量監視装置は、第１の音声通信端末において、過大音が拡声された場合には、その旨を第２の音声通信端末のユーザに警告できる。 According to this configuration, the sound volume monitoring apparatus according to the present invention can warn the user of the second voice communication terminal when the excessive voice is amplified in the first voice communication terminal.

また、前記音量監視装置は、さらに、前記距離別音量算出部により算出された複数の距離別音量を用いて、前記第１スピーカにより出力された音声が予め定められた音量以上となる前記第１のスピーカからの距離を算出し、当該距離を示す拡声領域情報を生成する拡声領域算出部を備え、前記送信部は、さらに、前記拡声領域情報を、前記通信網を経由して、前記第２の音声通信端末に送信してもよい。 Further, the sound volume monitoring device further uses the plurality of distance sound volumes calculated by the distance sound volume calculation unit to cause the sound output from the first speaker to be equal to or higher than a predetermined sound volume. A loudspeaker area calculation unit that calculates a distance from the speaker and generates loudspeaker area information indicating the distance, and the transmitter further transmits the loudspeaker area information via the communication network to the second It may be transmitted to the voice communication terminal.

この構成によれば、本発明に係る音量監視装置は、第１の音声通信端末において、音声が聞こえている範囲を、音声通信端末に通知できる。これにより、第２の音声通信端末のユーザは、自身の声が第１の音声通信端末のユーザに聞こえているか否かを容易に把握できる。 According to this configuration, the volume monitoring device according to the present invention can notify the voice communication terminal of the range in which the voice is heard in the first voice communication terminal. Thereby, the user of the second voice communication terminal can easily grasp whether his / her voice is heard by the user of the first voice communication terminal.

また、前記第１のマイクロホンは指向性を有する収音信号を生成し、前記音量監視装置は、さらに、前記第１マイクロホンにより生成された前記指向性を有する収音信号を、無指向化する無指向化部を備え、前記直接波音量算出部は、前記無指向化部により無指向化された収音信号に含まれる前記直接波音量を算出してもよい。 The first microphone generates a sound collecting signal having directivity, and the sound volume monitoring device further performs non-directing on the sound collecting signal having directivity generated by the first microphone. The direct wave volume calculating unit may include a directing unit, and the direct wave volume calculating unit may calculate the direct wave volume included in the collected sound signal made omnidirectional by the omnidirectional unit.

この構成によれば、本発明に係る音量監視装置は、第１のマイクロホンが指向性を有する場合でも、スピーカとマイクロホンとの位置関係に依存しない、スピーカから拡声される音量を取得できる。 According to this configuration, even when the first microphone has directivity, the volume monitoring device according to the present invention can acquire the volume that is loudened from the speaker without depending on the positional relationship between the speaker and the microphone.

なお、本発明は、このような音量監視装置として実現できるだけでなく、音量監視装置に含まれる特徴的な手段をステップとする音量監視方法として実現したり、そのような特徴的なステップをコンピュータに実行させるプログラムとして実現したりすることもできる。そして、そのようなプログラムは、ＣＤ−ＲＯＭ等の記録媒体及びインターネット等の伝送媒体を介して流通させることができるのは言うまでもない。 The present invention can be realized not only as such a volume monitoring device but also as a volume monitoring method using characteristic means included in the volume monitoring device as a step, or such a characteristic step in a computer. It can also be realized as a program to be executed. Needless to say, such a program can be distributed via a recording medium such as a CD-ROM and a transmission medium such as the Internet.

さらに、本発明は、このような音量監視装置の機能の一部又は全てを実現する半導体集積回路（ＬＳＩ）として実現したり、このような音量監視装置を備える音声通信端末、音声会議端末、又はテレビ会議端末として実現したり、このような音声通信端末、音声会議端末、又はテレビ会議端末を含む音響システム、音声会議システム、又はテレビ会議システムとして実現したりできる。 Furthermore, the present invention can be realized as a semiconductor integrated circuit (LSI) that realizes part or all of the functions of such a volume monitoring device, or an audio communication terminal, audio conference terminal, or It can be realized as a video conference terminal, or can be realized as an audio system, a voice conference system, or a video conference system including such a voice communication terminal, a voice conference terminal, or a video conference terminal.

以上より、本発明は、スピーカとマイクロホンとの位置関係に依存しない音量を取得できる音量監視装置及び音量監視方法を提供できる。 As described above, the present invention can provide a volume monitoring device and a volume monitoring method that can acquire a volume independent of the positional relationship between the speaker and the microphone.

以下本発明の実施の形態について、図面を参照しながら説明する。 Embodiments of the present invention will be described below with reference to the drawings.

（実施の形態１）
本発明の実施の形態１に係る音量監視装置は、スピーカが設置された位置における、当該スピーカにより出音される音量を算出する。これにより、本発明の実施の形態１に係る音量監視装置は、スピーカとマイクロホンとの位置関係に依存しない定量的な音量を取得できる。 (Embodiment 1)
The volume monitoring apparatus according to Embodiment 1 of the present invention calculates the volume of sound output by the speaker at the position where the speaker is installed. Thereby, the volume monitoring apparatus according to Embodiment 1 of the present invention can acquire a quantitative volume that does not depend on the positional relationship between the speaker and the microphone.

まず、本発明の実施の形態１に係る音量監視装置を含む音響システムの構成を説明する。 First, the configuration of an acoustic system including a volume monitoring device according to Embodiment 1 of the present invention will be described.

図１は、本発明の実施の形態１に係る音響システムの構成を示す図である。
図１に示す音響システム１は、第１の場所に設置されたコミュニケーション装置１１と、第２の場所に設置されたコミュニケーション装置１２と、通信網１０とを含む。このコミュニケーション装置１１とコミュニケーション装置１２とは、通信網１０を経由して接続される。また、コミュニケーション装置１１とコミュニケーション装置１２とは、音声通信端末であり、通信網１０を介して、互いにリアルタイムの音声の送受信を行う。 FIG. 1 is a diagram showing a configuration of an acoustic system according to Embodiment 1 of the present invention.
The acoustic system 1 illustrated in FIG. 1 includes a communication device 11 installed at a first location, a communication device 12 installed at a second location, and a communication network 10. The communication device 11 and the communication device 12 are connected via the communication network 10. The communication device 11 and the communication device 12 are voice communication terminals, and transmit and receive real-time voice to and from each other via the communication network 10.

コミュニケーション装置１１は、スピーカ１１１と、マイクロホン１１２とを備える。
スピーカ１１１は、コミュニケーション装置１２から通信網１０を経由して送信されたスピーカ入力信号ｓ（ｔ）に基づき、音声を出音する。 The communication device 11 includes a speaker 111 and a microphone 112.
The speaker 111 outputs sound based on the speaker input signal s (t) transmitted from the communication device 12 via the communication network 10.

マイクロホン１１２は、コミュニケーション装置１１の周辺の音声を収音することにより、収音信号ｍ（ｔ）を生成する。また、マイクロホン１１２は、この収音信号ｍ（ｔ）を、通信網１０を経由してコミュニケーション装置１２へ送信する。また、この収音信号ｍ（ｔ）には、コミュニケーション装置１１のユーザ１１０が発する音声と、コミュニケーション装置１１の周辺の雑音等と、スピーカ１１１により出音される、コミュニケーション装置１２のユーザ１２０が発した音声とが含まれる。 The microphone 112 collects sound around the communication device 11 to generate a sound collection signal m (t). Further, the microphone 112 transmits the sound collection signal m (t) to the communication device 12 via the communication network 10. In addition, the collected sound signal m (t) is generated by the user 120 of the communication device 12, which is output by the speaker 111, such as a sound emitted by the user 110 of the communication device 11, noise around the communication device 11, and the like. Audio.

コミュニケーション装置１２は、スピーカ１２１と、マイクロホン１２２とを備える。
スピーカ１２１は、コミュニケーション装置１１から通信網１０を経由して送信された収音信号ｍ（ｔ）に基づき、音声を出音する。 The communication device 12 includes a speaker 121 and a microphone 122.
The speaker 121 outputs a sound based on the collected sound signal m (t) transmitted from the communication device 11 via the communication network 10.

マイクロホン１２２は、コミュニケーション装置１２の周辺の音声を収音することにより、収音信号を生成する。また、マイクロホン１２２は、当該収音信号をスピーカ入力信号ｓ（ｔ）として、通信網１０を経由してコミュニケーション装置１１へ送信する。また、この収音信号には、コミュニケーション装置１２のユーザ１２０が発する音声と、コミュニケーション装置１２の周辺の雑音等と、スピーカ１２１により出音される、コミュニケーション装置１１のユーザ１１０が発した音声とが含まれる。 The microphone 122 collects sound around the communication device 12 to generate a sound collection signal. Further, the microphone 122 transmits the collected sound signal as a speaker input signal s (t) to the communication device 11 via the communication network 10. In addition, the sound collection signal includes a sound uttered by the user 120 of the communication device 12, a noise around the communication device 12, and a sound uttered by the user 110 of the communication device 11 that is output from the speaker 121. included.

図２は、本発明の実施の形態１に係る音量監視装置１００のブロック図である。
図２に示すようにコミュニケーション装置１１は、音量監視装置１００を備える。 FIG. 2 is a block diagram of volume monitoring apparatus 100 according to Embodiment 1 of the present invention.
As shown in FIG. 2, the communication device 11 includes a volume monitoring device 100.

音量監視装置１００は、コミュニケーション装置１２から送信された音声が、コミュニケーション装置１１で出音される音量を監視する。言い換えると、音量監視装置１００は、コミュニケーション装置１２のユーザ１２０により発せられた音声が、コミュニケーション装置１１においてどのような音量で拡声されているかを監視する。具体的には、音量監視装置１００は、スピーカ１１１が設置された位置における当該スピーカ１１１が出音する音声の音量を算出し、算出した音量をコミュニケーション装置１２に送信する。 The volume monitoring device 100 monitors the volume at which the voice transmitted from the communication device 12 is output by the communication device 11. In other words, the volume monitoring device 100 monitors the volume of the sound uttered by the user 120 of the communication device 12 at the communication device 11. Specifically, the volume monitoring apparatus 100 calculates the volume of the sound output from the speaker 111 at the position where the speaker 111 is installed, and transmits the calculated volume to the communication apparatus 12.

この音量監視装置１００は、距離算出部１０１と、直接波音量算出部１０２と、スピーカ位置音量算出部１０３と、送信部１０４とを備える。 This volume monitoring apparatus 100 includes a distance calculation unit 101, a direct wave volume calculation unit 102, a speaker position volume calculation unit 103, and a transmission unit 104.

距離算出部１０１は、スピーカ入力信号ｓ（ｔ）と、収音信号ｍ（ｔ）とを用いて、スピーカ１１１とマイクロホン１１２との間の距離を示す距離情報１３１を算出する。 The distance calculation unit 101 calculates distance information 131 indicating the distance between the speaker 111 and the microphone 112 using the speaker input signal s (t) and the sound pickup signal m (t).

直接波音量算出部１０２は、スピーカ入力信号ｓ（ｔ）と、収音信号ｍ（ｔ）と、距離情報１３１とを用いて、収音信号ｍ（ｔ）に含まれる、マイクロホン１１２が設置された位置における、スピーカ１１１から出音された音声のうち直接波の音量を示す直接波音量情報１３２を算出する。 In the direct wave volume calculation unit 102, the microphone 112 included in the sound pickup signal m (t) is installed using the speaker input signal s (t), the sound pickup signal m (t), and the distance information 131. The direct wave volume information 132 indicating the volume of the direct wave in the sound output from the speaker 111 at the selected position is calculated.

ここで、直接波とは、典型的には、スピーカ１１１から出音された音声のうち、反射物（壁）等に反射されずに直接マイクロホン１１２に入力された音声である。 Here, the direct wave is typically the sound that is directly input to the microphone 112 without being reflected by a reflector (wall) or the like, out of the sound output from the speaker 111.

スピーカ位置音量算出部１０３は、直接波音量情報１３２と距離情報１３１とを用いて、スピーカ１１１が設置されている位置における当該スピーカ１１１が拡声する音声の音量であるスピーカ位置音量情報１３３を算出する。 The speaker position volume calculation unit 103 uses the direct wave volume information 132 and the distance information 131 to calculate the speaker position volume information 133 that is the volume of the sound that is loudened by the speaker 111 at the position where the speaker 111 is installed. .

送信部１０４は、スピーカ位置音量情報１３３を、通信網１０を経由して、コミュニケーション装置１２に送信する。 The transmission unit 104 transmits the speaker position volume information 133 to the communication device 12 via the communication network 10.

コミュニケーション装置１２は、さらに、受信部１０５と、表示部１０６とを備える。
受信部１０５は、送信部１０４により送信されたスピーカ位置音量情報１３３を、通信網１０を経由して受信する。 The communication device 12 further includes a receiving unit 105 and a display unit 106.
The receiving unit 105 receives the speaker position volume information 133 transmitted from the transmitting unit 104 via the communication network 10.

表示部１０６は、受信部１０５により受信されたスピーカ位置音量情報１３３に示されるスピーカ１１１の位置における音量を表示するディスプレイ等である。例えば、表示部１０６は、スピーカ位置音量情報１３３で示される音量を、数値、又は映像（メータ等）で表示する。 The display unit 106 is a display or the like that displays the volume at the position of the speaker 111 indicated by the speaker position volume information 133 received by the receiving unit 105. For example, the display unit 106 displays the volume indicated by the speaker position volume information 133 as a numerical value or a video (such as a meter).

なお、ここでは、簡単化のため、コミュニケーション装置１１のみが本発明の実施の形態１に係る音量監視装置１００を備える例を説明するが、コミュニケーション装置１２も音量監視装置１００を備えてもよい。また、コミュニケーション装置１１が受信部１０５及び表示部１０６を備えてもよい。 Here, for the sake of simplicity, an example will be described in which only communication device 11 includes volume monitoring device 100 according to Embodiment 1 of the present invention, but communication device 12 may also include volume monitoring device 100. In addition, the communication device 11 may include the receiving unit 105 and the display unit 106.

以下、以上の様に構成された音量監視装置１００による動作を説明する。
図３は、音量監視装置１００による動作の流れを示すフローチャートである。 Hereinafter, the operation of the volume monitoring apparatus 100 configured as described above will be described.
FIG. 3 is a flowchart showing a flow of operations performed by the volume monitoring device 100.

まず、距離算出部１０１は、スピーカ入力信号ｓ（ｔ）と、収音信号ｍ（ｔ）とを用いて、スピーカ１１１とマイクロホン１１２との間の距離を示す距離情報１３１を算出する（Ｓ１０１）。ここで、距離情報１３１とは、スピーカ１１１とマイクロホン１１２との相対関係を示す情報であり、具体的には、スピーカ１１１とマイクロホン１１２と間の距離、及び、スピーカ１１１とマイクロホン１１２と間の音波の時間遅延のうち少なくとも一方を含む。 First, the distance calculation unit 101 calculates distance information 131 indicating the distance between the speaker 111 and the microphone 112 using the speaker input signal s (t) and the sound collection signal m (t) (S101). . Here, the distance information 131 is information indicating the relative relationship between the speaker 111 and the microphone 112, specifically, the distance between the speaker 111 and the microphone 112, and the sound wave between the speaker 111 and the microphone 112. Including at least one of the time delays.

具体的には、距離算出部１０１は、下記式（１）及び式（２）を用いてスピーカ１１１とマイクロホン１１２と間の音波の時間遅延を算出する。つまり、距離算出部１０１は、下記式（２）に示すようにｃ（τ）が最大となる時間遅延τ_maxを求めることで、スピーカ１１１とマイクロホン１１２との間の音波の時間遅延τ_maxを算出する。ここで、ｃ（τ）は、式（１）に示すように、スピーカ入力信号ｓ（ｔ）と収音信号ｍ（ｔ）との相関関数の絶対値である。 Specifically, the distance calculation unit 101 calculates the time delay of the sound wave between the speaker 111 and the microphone 112 using the following formulas (1) and (2). That is, the distance calculation unit 101, by obtaining the time delay tau _max to be c (tau) is maximum as shown in the following formula (2), the time delay tau _max of the sound wave between the speaker 111 and the microphone 112 calculate. Here, c (τ) is an absolute value of a correlation function between the speaker input signal s (t) and the sound pickup signal m (t) as shown in the equation (1).

また、上記式（１）における＜・＞_tは時間ｔに関する平均値を示す。
また、距離算出部１０１は、スピーカ１１１とマイクロホン１１２との距離ｄ_mを、サンプリング周波数Ｆ_sと音速Ｖ_sと上記時間遅延τ_maxとを用いて、下記式（３）により算出する。 In the above formula (1), <•> _t represents an average value with respect to time t.
The distance calculation unit 101, a distance d _m between the speaker 111 and the microphone 112, by using the sampling frequency F _s and the sound velocity V _s and the time delay tau _max, is calculated by the following equation (3).

なお、距離算出部１０１は、スピーカ１１１とマイクロホン１１２の距離ｄ_mを、コミュニケーション装置１１が備えるカメラ（図示せず）により撮影されたカメラ画像から算出してもよい。 The distance calculation unit 101, a distance d _m of the speaker 111 and the microphone 112 may be calculated from a camera image captured by a camera (not shown) to the communication apparatus 11 is provided.

また、距離算出部１０１は、スピーカ１１１とマイクロホン１１２との時間遅延τ_maxを、上記カメラ画像を用いて算出された距離ｄ_mと、上記式（３）とを用いて算出してもよい。 The distance calculation unit 101, the time delay tau _max between the speaker 111 and the microphone 112, the distance d _m calculated by using the camera image, the equation (3) may be calculated using the.

次に、直接波音量算出部１０２は、スピーカ入力信号ｓ（ｔ）と、収音信号ｍ（ｔ）と、距離情報１３１とを用いて、直接波の音量を示す直接波音量情報１３２を算出する（Ｓ１０２）。 Next, the direct wave volume calculation unit 102 uses the speaker input signal s (t), the collected sound signal m (t), and the distance information 131 to calculate direct wave volume information 132 indicating the volume of the direct wave. (S102).

ここで、コミュニケーション装置１１において、コミュニケーション装置１２から送信されたスピーカ入力信号ｓ（ｔ）がスピーカ１１１から拡声される。この拡声された音声は、スピーカ１１１からマイクロホン１１２への直接の経路と、反射物、壁、天井及び床等に反射する経路とを含む複数の経路を通ってマイクロホン１１２で収音される。 Here, in the communication device 11, the speaker input signal s (t) transmitted from the communication device 12 is amplified from the speaker 111. The amplified sound is picked up by the microphone 112 through a plurality of paths including a direct path from the speaker 111 to the microphone 112 and a path reflecting on a reflector, a wall, a ceiling, a floor, and the like.

図４は、スピーカ１１１により拡声された音声が、マイクロホン１１２で収音される状況を示す図である。図４に示すように、マイクロホン１１２に収音される音声は、スピーカ１１１により拡声され、反射物１４１等に反射せずに直接マイクロホン１１２に入力される直接波１４２に対応する音声と、反射物１４１等により反射された後、マイクロホン１１２に入力される反射波１４３に対応する音声とを含む。 FIG. 4 is a diagram illustrating a situation in which the sound amplified by the speaker 111 is collected by the microphone 112. As shown in FIG. 4, the sound collected by the microphone 112 is amplified by the speaker 111, and the sound corresponding to the direct wave 142 input directly to the microphone 112 without being reflected by the reflector 141 or the like and the reflector And the sound corresponding to the reflected wave 143 input to the microphone 112 after being reflected by 141 or the like.

なお、直接波１４２は、スピーカ１１１とマイクロホン１１２との間の最短の経路を直線状に伝搬した音声に限らない。例えば、スピーカ１１１とマイクロホン１１２との間に壁又は障害物等が配置されている場合には、直接波は収音されなくなるため、音量監視装置１００は、近似的に直接波として、当該壁又は障害物等を迂回してマイクロホン１１２に収音される音声のうち、スピーカ１１１とマイクロホン１１２との間の最短の経路を伝搬した音声を用いる。言い換えると、ここで用いる直接波とは、スピーカ１１１から出音された音声のうち、最初にマイクロホン１１２に入力された音声であり、スピーカ１１１から出音された音声のうち、マイクロホン１１２における音量が最も大きい音声とする。 Note that the direct wave 142 is not limited to sound that has propagated linearly along the shortest path between the speaker 111 and the microphone 112. For example, when a wall or an obstacle is disposed between the speaker 111 and the microphone 112, the direct wave is not collected, so that the volume monitoring device 100 can approximate the wall or Of the sound collected by the microphone 112 while bypassing an obstacle, the sound propagated through the shortest path between the speaker 111 and the microphone 112 is used. In other words, the direct wave used here is the sound first input to the microphone 112 out of the sound output from the speaker 111, and the volume of the microphone 112 out of the sound output from the speaker 111 is Make the loudest voice.

同様に、距離情報１３１で示される距離は、スピーカ１１１とマイクロホン１１２との間の直線の距離に限らず、スピーカ１１１とマイクロホン１１２との間に壁又は障害物等が配置されている場合には、当該距離は、当該壁又は障害物等を迂回してマイクロホン１１２に収音される音声の経路のうち、最短の経路の長さである。言い換えると、距離情報１３１で示される距離は、直接波１４２が伝搬する経路の長さである。 Similarly, the distance indicated by the distance information 131 is not limited to the straight line distance between the speaker 111 and the microphone 112, and when a wall or an obstacle is disposed between the speaker 111 and the microphone 112. The distance is the length of the shortest path among the paths of the sound collected by the microphone 112 bypassing the wall or obstacle. In other words, the distance indicated by the distance information 131 is the length of the path through which the direct wave 142 propagates.

直接波音量算出部１０２は、スピーカ入力信号ｓ（ｔ）と収音信号ｍ（ｔ）と時間遅延τ_maxとを用いて、直接波音量情報１３２を算出する。具体的には、直接波音量情報１３２は、直接波１４２に対応する音声信号である直接波信号ｓ_d（ｔ）と、直接波１４２の音圧である直接波信号音圧Ｐ_dとを含む。 The direct wave volume calculation unit 102 calculates the direct wave volume information 132 using the speaker input signal s (t), the sound collection signal m (t), and the time delay τ _max . Specifically, the direct wave volume information 132 includes a direct wave signal s _d (t) that is an audio signal corresponding to the direct wave 142 and a direct wave signal sound pressure P _d that is a sound pressure of the direct wave 142. .

図５Ａは、スピーカ１１１により拡声されるスピーカ入力信号ｓ（ｔ）の一例を示す図である。また、図５Ｂは、マイクロホン１１２より収音される直接波信号ｓ_d（ｔ）の一例を示す図である。 FIG. 5A is a diagram illustrating an example of a speaker input signal s (t) that is amplified by the speaker 111. FIG. 5B is a diagram illustrating an example of the direct wave signal s _d (t) collected by the microphone 112.

図５Ｂに示すように、直接波１４２に対応する直接波信号ｓ_d（ｔ）は、スピーカ入力信号ｓ（ｔ）から、時間遅延τ_max分遅れて、マイクロホン１１２で収音される。また、マイクロホン１１２に収音される収音信号ｍ（ｔ）には、直接波１４２に対応する直接波信号ｓ_d（ｔ）と、複数の反射波１４３に対応する複数の反射波信号１５３とが含まれる。 As shown in FIG. 5B, the direct wave signal s _d (t) corresponding to the direct wave 142 is picked up by the microphone 112 with a delay of time delay τ _max from the speaker input signal s (t). In addition, the collected sound signal m (t) collected by the microphone 112 includes a direct wave signal s _d (t) corresponding to the direct wave 142 and a plurality of reflected wave signals 153 corresponding to the plurality of reflected waves 143. Is included.

具体的には、直接波音量算出部１０２は、下記式（４）を用いて、収音信号ｍ（ｔ）の中から、直接波１４２に対応する信号を分離することにより、直接波信号ｓ_d（ｔ）を算出する。また、直接波音量算出部１０２は、下記式（５）を用いて、算出した直接波信号ｓ_d（ｔ）から、直接波信号音圧Ｐ_dを算出する。 Specifically, the direct wave volume calculation unit 102 separates a signal corresponding to the direct wave 142 from the collected sound signal m (t) using the following equation (4), thereby obtaining a direct wave signal s. _d (t) is calculated. Further, the direct wave volume calculation unit 102 calculates the direct wave signal sound pressure P _d from the calculated direct wave signal s _d (t) using the following equation (5).

また、実際には障害物による経路の変化の影響、風又は温度変化による経路の変化の影響、及びサンプリング周波数の影響など、様々な要因でτ_maxは微妙に揺らいで変化する場合がある。これらの影響を緩和するために、直接波信号ｓ_d（ｔ）は式（６）のように数ｍｓ〜数１０ｍｓに相当するＮ点のデータ時間的マージンをもって算出しても構わない。 Actually, τ _max may slightly fluctuate and change due to various factors such as the influence of a path change due to an obstacle, the influence of a path change due to a wind or temperature change, and the influence of a sampling frequency. In order to mitigate these effects, the direct wave signal s _d (t) may be calculated with a data time margin of N points corresponding to several ms to several tens of ms as shown in Equation (6).

次に、スピーカ位置音量算出部１０３は、直接波音量情報１３２と距離情報１３１とを用いて、スピーカ１１１が設置されている位置における当該スピーカ１１１により拡声される音声の音量であるスピーカ位置音量情報１３３を算出する（Ｓ１０３）。 Next, the speaker position volume calculation unit 103 uses the direct wave volume information 132 and the distance information 131, and the speaker position volume information that is the volume of the sound that is loudened by the speaker 111 at the position where the speaker 111 is installed. 133 is calculated (S103).

ここで、スピーカ入力信号ｓ（ｔ）に対応するスピーカ１１１により拡声された音声は、距離とともに減衰し、マイクロホン１１２の位置では式（４）で示す時間特性、かつ式（５）で示す音圧レベルで収音される。よって、スピーカ位置音量算出部１０３は、直接波音量情報１３２（直接波信号ｓ_d（ｔ）、及び直接波信号音圧Ｐ_d）と、距離ｄ_mとを用いてスピーカ１１１の位置でのスピーカ位置音量情報１３３を算出できる。ここで、スピーカ１１１の位置とは、スピーカ１１１で拡声される音声の距離減衰特性が、距離に反比例するという関係を満たし始める位置とする。また、以下では、スピーカ１１１に対応するスピーカユニットの設置位置から、この距離に反比例するという関係を満たし始める位置までの距離をｄ_s［ｍ］とする。つまり、スピーカ１１１に対応するスピーカユニットの設置位置からマイクロホン１１２までの距離は、ｄ_m−ｄ_sとなる。 Here, the sound amplified by the speaker 111 corresponding to the speaker input signal s (t) is attenuated with the distance, and at the position of the microphone 112, the time characteristic indicated by the equation (4) and the sound pressure indicated by the equation (5) are obtained. Sound is picked up by level. Therefore, the speaker of the speaker position volume calculation unit 103, the direct wave volume information 132 (direct wave signal s _d (t), and direct wave signal sound pressure P _d), the position of the speaker 111 by using the distance d _m The position volume information 133 can be calculated. Here, the position of the speaker 111 is a position where the distance attenuation characteristic of the sound amplified by the speaker 111 starts to satisfy the relationship that it is inversely proportional to the distance. In the following description, the distance from the installation position of the speaker unit corresponding to the speaker 111 to the position where the relationship of being inversely proportional to this distance starts to be satisfied is referred to as d _s [m]. That is, the distance from the installation position of the speaker unit corresponding to the speaker 111 to the microphone 112 becomes d _m -d _s.

また、スピーカ位置音量情報１３３は、スピーカ１１１の位置での音声信号であるスピーカ位置信号ｓｐ（ｔ）、及びスピーカ１１１の位置での信号音圧であるスピーカ位置信号音圧Ｐ_spを含む。スピーカ位置音量算出部１０３は、直接波信号ｓ_d（ｔ）と、距離ｄ_mとを用いて、下記式（７）により、スピーカ位置信号ｓｐ（ｔ）を算出する。つまり、スピーカ位置音量算出部１０３は、直接波信号ｓ_d（ｔ）を、比率ｄ_m／ｄ_sで乗算することにより、スピーカ位置信号ｓｐ（ｔ）を算出する。 The speaker position sound volume information 133 includes a speaker position signal sp (t) that is an audio signal at the position of the speaker 111 and a speaker position signal sound pressure _Psp that is a signal sound pressure at the position of the speaker 111. Speaker position volume calculation unit 103, the direct wave signal s _d (t), by using the distance d _m, by the following equation (7), calculates a speaker position signal sp (t). That is, the speaker position volume calculation unit 103 calculates the speaker position signal sp (t) by multiplying the direct wave signal s _d (t) by the ratio d _m / d _s .

また、スピーカ位置音量算出部１０３は、直接波信号音圧Ｐ_dと、距離ｄ_mとを用いて、下記式（８）により、スピーカ位置信号音圧Ｐ_spを算出する。 The speaker positions volume calculation unit 103 uses the direct wave signal sound pressure P _d, the distance d _m, by the following equation (8), calculates the speaker position signal sound pressure P _sp.

次に、送信部１０４は、スピーカ位置音量算出部１０３により算出されたスピーカ位置音量情報１３３を、通信網１０を経由して、コミュニケーション装置１２に送信する（Ｓ１０４）。 Next, the transmission unit 104 transmits the speaker position volume information 133 calculated by the speaker position volume calculation unit 103 to the communication device 12 via the communication network 10 (S104).

以上のように、本発明の実施の形態１に係る音量監視装置１００は、スピーカ１１１の位置における、当該スピーカ１１１により拡声される音量を取得し、取得した音量をコミュニケーション装置１２に通知できる。つまり、音量監視装置１００は、スピーカ１１１とマイクロホン１１２との位置関係に依存しない音量を取得できる。これにより、コミュニケーション装置１２のユーザ１２０は、自身の声がコミュニケーション装置１１において正しく拡声されているか否かを定量的に把握できる。 As described above, the volume monitoring apparatus 100 according to Embodiment 1 of the present invention can acquire the volume that is loudened by the speaker 111 at the position of the speaker 111 and notify the communication apparatus 12 of the acquired volume. That is, the volume monitoring apparatus 100 can acquire a volume that does not depend on the positional relationship between the speaker 111 and the microphone 112. As a result, the user 120 of the communication device 12 can quantitatively grasp whether or not his / her voice is correctly amplified in the communication device 11.

なお、上記説明では、音声がモノラルの場合について説明したが、２チャンネル以上の場合、つまりスピーカ１１１が複数の場合にも、本発明を適用できる。つまり、音量監視装置１００は、スピーカ１１１毎のスピーカ位置音量情報１３３を算出してもよい。 In the above description, the case where the sound is monaural has been described, but the present invention can also be applied to a case where there are two or more channels, that is, a plurality of speakers 111. That is, the volume monitoring apparatus 100 may calculate the speaker position volume information 133 for each speaker 111.

また、スピーカ位置音量情報１３３は、直接波信号ｓ_d（ｔ）、及び直接波信号音圧Ｐ_dのうち一方のみを含んでもよい。同様に、距離情報１３１は、時間遅延τ_max及び距離ｄ_mのうち一方のみを含んでもよい。また、直接波音量情報１３２は、直接波信号ｓ_d（ｔ）及び直接波信号音圧Ｐ_dのうち一方のみを含んでもよい。 The speaker location volume information 133, the direct wave signal s _d (t), and may include only one of the direct wave signal sound pressure P _d. Similarly, the distance information 131 may include only one of the time delay tau _max and distance d _m. Moreover, the direct wave volume information 132 may include only one of the direct wave signal s _d (t) and the direct wave signal sound pressure P _d .

また、スピーカ位置音量情報１３３は、直接波信号ｓ_d（ｔ）、及び直接波信号音圧Ｐ_dのうち少なくとも一方を用いて作成した情報であってもよい。例えば、スピーカ位置音量情報１３３は、スピーカ１１１の位置における音量を視覚的に示す画像（メータ等）のデータであってもよい。 The speaker position sound volume information 133 may be information created using at least one of the direct wave signal s _d (t) and the direct wave signal sound pressure P _d . For example, the speaker position volume information 133 may be data of an image (such as a meter) that visually indicates the volume at the position of the speaker 111.

（実施の形態２）
本発明の実施の形態２に係る音量監視装置２００では、距離算出部２０１及び直接波音量算出部２０２がスピーカ１１１とマイクロホン１１２との間の音響伝達特性を用いて、距離情報１３１及び直接波音量情報１３２を算出する。 (Embodiment 2)
In the volume monitoring apparatus 200 according to the second embodiment of the present invention, the distance calculation unit 201 and the direct wave volume calculation unit 202 use the acoustic transfer characteristics between the speaker 111 and the microphone 112, and the distance information 131 and the direct wave volume. Information 132 is calculated.

図６は、本発明の実施の形態２に係る音量監視装置２００の構成を示すブロック図である。なお、図２と同様の要素には同一の符号を付しており、重複する説明は省略する。 FIG. 6 is a block diagram showing a configuration of volume monitoring apparatus 200 according to Embodiment 2 of the present invention. Elements similar to those in FIG. 2 are denoted by the same reference numerals, and redundant description is omitted.

図６に示すように、音量監視装置２００は、実施の形態１に係る音量監視装置１００の構成に加え、さらに、音響伝達特性算出部２０７を備える。また、音量監視装置２００では、実施の形態１に係る音量監視装置１００に対して、距離算出部２０１及び直接波音量算出部２０２の構成が異なる。 As illustrated in FIG. 6, the volume monitoring apparatus 200 includes an acoustic transfer characteristic calculation unit 207 in addition to the configuration of the volume monitoring apparatus 100 according to the first embodiment. Further, the volume monitoring apparatus 200 differs from the volume monitoring apparatus 100 according to Embodiment 1 in the configuration of the distance calculation unit 201 and the direct wave volume calculation unit 202.

音響伝達特性算出部２０７は、スピーカ入力信号ｓ（ｔ）と、収音信号ｍ（ｔ）とを用いて、スピーカ１１１とマイクロホン１１２との間の音響伝達特性２３７を算出する。具体的には、音響伝達特性算出部２０７は、エコーキャンセラを用いて音響伝達特性２３７を推定する。例えば、音響伝達特性算出部２０７は、音響伝達特性２３７を推定するために疑似的にスピーカ入力信号ｓ（ｔ）を生成することによりスピーカ１１１から音声を拡声させる。さらに、音響伝達特性算出部２０７は、当該音声がマイクロホン１１２で収音された収音信号ｍ（ｔ）を用いて、音響伝達特性２３７を推定する。 The sound transfer characteristic calculation unit 207 calculates the sound transfer characteristic 237 between the speaker 111 and the microphone 112 using the speaker input signal s (t) and the sound collection signal m (t). Specifically, the acoustic transfer characteristic calculation unit 207 estimates the acoustic transfer characteristic 237 using an echo canceller. For example, the acoustic transfer characteristic calculation unit 207 amplifies the sound from the speaker 111 by generating a pseudo speaker input signal s (t) in order to estimate the acoustic transfer characteristic 237. Furthermore, the sound transfer characteristic calculation unit 207 estimates the sound transfer characteristic 237 using the sound collection signal m (t) obtained by collecting the sound with the microphone 112.

また、エコーキャンセラとは、従来のコミュニケーション装置において、エコーを除去する際に用いられるものであり、本発明の実施の形態２に係る音量監視装置２００に、エコーキャンセラの機能の一部を用いることで、コミュニケーション装置１１の機能追加（コスト増加）を抑制できる。 The echo canceller is used when removing echoes in a conventional communication device, and a part of the function of the echo canceller is used in the volume monitoring device 200 according to the second embodiment of the present invention. Therefore, the function addition (cost increase) of the communication device 11 can be suppressed.

ここで、エコーキャンセラの動作について簡単に説明する。
エコーキャンセラは、マイクロホン１１２に収音された収音信号ｍ（ｔ）から、スピーカ１１１により拡声された音声に対応する信号を除去する。これにより、スピーカ１１１により拡声された音声が、再度、コミュニケーション装置１２で拡声されることを防止できる。つまり、エコーキャンセラは、スピーカ１１１からマイクロホン１１２へと回り込む音声をキャンセルすることにより、エコー及びハウリングを防止する。 Here, the operation of the echo canceller will be briefly described.
The echo canceller removes a signal corresponding to the voice amplified by the speaker 111 from the collected sound signal m (t) collected by the microphone 112. Thereby, it is possible to prevent the voice amplified by the speaker 111 from being amplified again by the communication device 12. In other words, the echo canceller prevents echo and howling by canceling the sound that circulates from the speaker 111 to the microphone 112.

具体的には、まず、エコーキャンセラは、スピーカ入力信号ｓ（ｔ）を用いて、エコー経路の伝達特性であるエコー伝達特性を推定する。次に、エコーキャンセラは、スピーカ入力信号ｓ（ｔ）と、推定したエコー伝達特性とを乗算することにより、マイクロホン１１２により収音された収音信号ｍ（ｔ）に含まれるスピーカ１１１により拡声された音声に対応する信号である疑似エコーを推定する。次に、エコーキャンセラは、収音信号ｍ（ｔ）から疑似エコーを減算することにより、収音信号ｍ（ｔ）から、スピーカ１１１により拡声された音声に対応する信号を除去する。 Specifically, the echo canceller first estimates an echo transfer characteristic that is a transfer characteristic of the echo path, using the speaker input signal s (t). Next, the echo canceller multiplies the speaker input signal s (t) by the estimated echo transfer characteristic, so that the sound is amplified by the speaker 111 included in the collected sound signal m (t) collected by the microphone 112. The pseudo echo that is a signal corresponding to the voice is estimated. Next, the echo canceller subtracts the pseudo echo from the collected sound signal m (t), thereby removing the signal corresponding to the sound amplified by the speaker 111 from the collected sound signal m (t).

音響伝達特性算出部２０７は、このエコーキャンセラで推定されたエコー伝達特性を、音響伝達特性２３７として距離算出部２０１及び直接波音量算出部２０２に出力する。
距離算出部２０１は、音響伝達特性２３７を用いて距離情報１３１を算出する。 The acoustic transfer characteristic calculation unit 207 outputs the echo transfer characteristic estimated by the echo canceller as the acoustic transfer characteristic 237 to the distance calculation unit 201 and the direct wave volume calculation unit 202.
The distance calculation unit 201 calculates distance information 131 using the acoustic transfer characteristic 237.

直接波音量算出部２０２は、音響伝達特性２３７を用いて直接波音量情報１３２を算出する。 The direct wave volume calculation unit 202 calculates the direct wave volume information 132 using the acoustic transfer characteristic 237.

図７は、音響伝達特性２３７を有限インパルス応答で表記したものであり、横軸が時間τ、縦軸が振幅ｈ（τ）である。図７に示すように振幅のピーク１６２が直接波１４２の振幅となる。次のピーク１６３と、当該ピーク１６３より後に現れるピークが複数の反射波１４３のそれぞれに対応する振幅、つまり残響特性に対応する振幅となる。 FIG. 7 shows the acoustic transfer characteristic 237 represented by a finite impulse response, where the horizontal axis represents time τ and the vertical axis represents amplitude h (τ). As shown in FIG. 7, the amplitude peak 162 is the amplitude of the direct wave 142. The next peak 163 and the peak appearing after the peak 163 have an amplitude corresponding to each of the plurality of reflected waves 143, that is, an amplitude corresponding to reverberation characteristics.

以下、音量監視装置２００の動作を説明する。
図８は、音量監視装置２００による動作の流れを示すフローチャートである。 Hereinafter, the operation of the volume monitoring apparatus 200 will be described.
FIG. 8 is a flowchart showing a flow of operations performed by the volume monitoring apparatus 200.

まず、音響伝達特性算出部２０７は、スピーカ入力信号ｓ（ｔ）と、収音信号ｍ（ｔ）とを用いて、スピーカ１１１とマイクロホン１１２との間の音響伝達特性２３７を算出する（Ｓ２０１）。 First, the sound transfer characteristic calculation unit 207 calculates the sound transfer characteristic 237 between the speaker 111 and the microphone 112 using the speaker input signal s (t) and the sound collection signal m (t) (S201). .

次に、距離算出部２０１は、音響伝達特性算出部２０７により推定された音響伝達特性２３７を用いて、距離情報１３１を算出する（Ｓ２０２）。ここで、図７に示すように直接波１４２の振幅であるピーク１６２の時間がスピーカ１１１とマイクロホン１１２との間の時間遅延τ_maxとなる。よって、距離算出部２０１は、音響伝達特性ｈ（τ）を用いて、下記式（９）により時間遅延τ_maxを算出できる。つまり、距離算出部２０１は、音響伝達特性において振幅のピークが現れるまでの時間を算出することにより、時間遅延τ_maxを算出する。 Next, the distance calculation unit 201 calculates the distance information 131 using the acoustic transfer characteristic 237 estimated by the acoustic transfer characteristic calculation unit 207 (S202). Here, as shown in FIG. 7, the time of the peak 162 that is the amplitude of the direct wave 142 is the time delay τ _max between the speaker 111 and the microphone 112. Therefore, the distance calculation unit 201 can calculate the time delay τ _max by the following equation (9) using the acoustic transfer characteristic h (τ). That is, the distance calculation unit 201 calculates the time delay τ _max by calculating the time until the amplitude peak appears in the acoustic transfer characteristics.

また、距離算出部１０１は、スピーカ１１１とマイクロホン１１２の距離ｄ_mを、サンプリング周波数Ｆ_sと音速Ｖ_sと上記時間遅延τ_maxとを用いて、上記式（３）により算出する。 The distance calculation unit 101, a distance d _m of the speaker 111 and the microphone 112, by using the sampling frequency F _s and the sound velocity V _s and the time delay tau _max, is calculated by the equation (3).

次に、直接波音量算出部２０２は、音響伝達特性ｈ（τ）を用いて直接波音量情報１３２を算出する（Ｓ２０３）。具体的には、直接波音量算出部２０２は、スピーカ入力信号ｓ（ｔ）と音響伝達特性ｈ（τ）と時間遅延τ_maxとを用いて、下記式（１０）により直接波信号ｓ_d（ｔ）を算出する。つまり、直接波音量算出部２０２は、音響伝達特性ｈ（τ）のピークと、時間遅延τ_max分前の時刻におけるスピーカ入力信号ｓ（ｔ）とを乗算することにより、直接波信号ｓ_d（ｔ）を算出する。 Next, the direct wave volume calculation unit 202 calculates the direct wave volume information 132 using the acoustic transfer characteristic h (τ) (S203). Specifically, the direct wave volume calculation unit 202 uses the speaker input signal s (t), the acoustic transfer characteristic h (τ), and the time delay τ _max to calculate the direct wave signal s _d ( t) is calculated. That is, the direct wave volume calculation unit 202 multiplies the peak of the acoustic transfer characteristic h (τ) by the speaker input signal s (t) at the time before the time delay τ _max to thereby generate the direct wave signal s _d ( t) is calculated.

なお、上記式（１０）ではτ_maxという１点のデータを基に直接波信号ｓ_d（ｔ）を算出しているが、τ_maxの前後数１０ｍｓに相当するＮ点のデータを利用して、下記式（１１）から直接波信号ｓ_d（ｔ）を算出してもよい。 Although it calculates the direct wave signal s _d (t) on the basis of the data of one point of the above formula (10) in tau _max, by utilizing the data of N points corresponding to the front and rear number 10ms of tau _max The direct wave signal s _d (t) may be calculated from the following equation (11).

次に、スピーカ位置音量算出部１０３は、実施の形態１と同様に、直接波音量情報１３２と距離情報１３１とを用いて、スピーカ位置音量情報１３３を算出する（Ｓ２０４）。 Next, the speaker position sound volume calculation unit 103 calculates the speaker position sound volume information 133 using the direct wave sound volume information 132 and the distance information 131 as in the first embodiment (S204).

次に、送信部１０４は、スピーカ位置音量算出部１０３により算出されたスピーカ位置音量情報１３３を、通信網１０を経由して、コミュニケーション装置１２に送信する（Ｓ２０５）。 Next, the transmission unit 104 transmits the speaker position volume information 133 calculated by the speaker position volume calculation unit 103 to the communication device 12 via the communication network 10 (S205).

以上より、本発明の実施の形態２に係る音量監視装置２００は、実施の形態１に係る音量監視装置２００と同様に、スピーカ１１１とマイクロホン１１２との位置関係に依存しない音量を取得し、取得した音量をコミュニケーション装置１２に通知できる。これにより、コミュニケーション装置１２のユーザ１２０は、自身の声がコミュニケーション装置１１において正しく拡声されているか否かを定量的に把握できる。 As described above, the volume monitoring apparatus 200 according to Embodiment 2 of the present invention acquires and acquires the volume independent of the positional relationship between the speaker 111 and the microphone 112, as in the volume monitoring apparatus 200 according to Embodiment 1. The communication device 12 can be notified of the sound volume. As a result, the user 120 of the communication device 12 can quantitatively grasp whether or not his / her voice is correctly amplified in the communication device 11.

さらに、本発明の実施の形態２に係る音量監視装置２００は、エコーキャンセラで算出される音響伝達特性２３７を用いて、距離情報１３１及び直接波音量情報１３２を算出する。これにより、音量監視装置２００は、コミュニケーション装置１１に対する機能追加（コスト増加）を抑制できる。 Furthermore, the volume monitoring apparatus 200 according to Embodiment 2 of the present invention calculates the distance information 131 and the direct wave volume information 132 using the acoustic transfer characteristic 237 calculated by the echo canceller. Thereby, the volume monitoring apparatus 200 can suppress the function addition (cost increase) to the communication apparatus 11.

なお、上記説明では、距離算出部２０１及び直接波音量算出部２０２が共に、音響伝達特性２３７を用いた場合について説明したが、距離算出部２０１及び直接波音量算出部２０２のうち一方のみが音響伝達特性２３７を用いてもよい。 In the above description, both the distance calculation unit 201 and the direct wave volume calculation unit 202 have been described using the acoustic transfer characteristic 237, but only one of the distance calculation unit 201 and the direct wave volume calculation unit 202 is an acoustic signal. A transfer characteristic 237 may be used.

また、上記説明では、音声がモノラルの場合について説明したが、２チャンネル以上の場合、つまりスピーカ１１１が複数の場合でも、音響伝達特性算出部２０７がマルチチャンネルの音響伝達特性２３７を算出し、距離算出部２０１及び直接波音量算出部２０２がマルチチャンネルの音響伝達特性２３７を利用することで、スピーカ１１１毎の距離情報１３１及び直接波音量情報１３２を算出できる。 In the above description, the case where the sound is monaural has been described. However, even when there are two or more channels, that is, when there are a plurality of speakers 111, the sound transfer characteristic calculation unit 207 calculates the multi-channel sound transfer characteristic 237, and the distance The calculation unit 201 and the direct wave volume calculation unit 202 can calculate the distance information 131 and the direct wave volume information 132 for each speaker 111 by using the multi-channel acoustic transfer characteristic 237.

（実施の形態３）
本発明の実施の形態３に係る音量監視装置３００は、実施の形態１に係る音量監視装置１００の変形例であり、スピーカ１１１の位置から所定の距離毎における音量を算出する。 (Embodiment 3)
Volume monitoring apparatus 300 according to Embodiment 3 of the present invention is a modification of volume monitoring apparatus 100 according to Embodiment 1, and calculates the volume at a predetermined distance from the position of speaker 111.

図９は、本発明の実施の形態３に係る音量監視装置３００のブロック図である。図９に示す音量監視装置３００は、実施の形態１に係る音量監視装置１００の構成に加えて、さらに、距離別音量算出部３０８を備える。なお、図２と同様の要素には同一の符号を付しており、重複する説明は省略する。 FIG. 9 is a block diagram of volume monitoring apparatus 300 according to Embodiment 3 of the present invention. The volume monitoring apparatus 300 shown in FIG. 9 includes a distance-based volume calculation unit 308 in addition to the configuration of the volume monitoring apparatus 100 according to the first embodiment. Elements similar to those in FIG. 2 are denoted by the same reference numerals, and redundant description is omitted.

距離別音量算出部３０８は、スピーカ位置音量情報１３３を用いて、スピーカ１１１により出音された音声の、スピーカ１１１から異なる距離離れた複数の位置における音量である距離別音量情報３３８を算出する。例えば、距離別音量情報３３８は、スピーカ１１１の位置から所定の距離毎におけるスピーカ１１１により拡声された音声の音量である。 The distance-based sound volume calculation unit 308 uses the speaker position sound volume information 133 to calculate distance-based sound volume information 338 that is the sound volume at a plurality of positions at different distances from the speaker 111 of the sound output from the speaker 111. For example, the volume information 338 by distance is the volume of the sound that is amplified by the speaker 111 at a predetermined distance from the position of the speaker 111.

図１０は、本発明の実施の形態３に係る音量監視装置３００の動作の流れを示すフローチャートである。なお、図１０に示すステップＳ３０１〜Ｓ３０３の処理は、図３に示すステップＳ１０１〜Ｓ１０３の処理と同様であり、説明は省略する。 FIG. 10 is a flowchart showing an operation flow of the sound volume monitoring apparatus 300 according to Embodiment 3 of the present invention. Note that the processing in steps S301 to S303 shown in FIG. 10 is the same as the processing in steps S101 to S103 shown in FIG.

ステップＳ３０３の後、距離別音量算出部３０８は、スピーカ位置音量情報１３３を用いて距離別音量情報３３８を算出する（Ｓ３０４）。ここで、距離別音量情報３３８は、
スピーカ１１１からｒ［ｍ］離れた地点における、スピーカ１１１により拡声された音声信号である距離別信号ｓｐ_r（ｔ）と、スピーカ１１１よりｒ［ｍ］離れた地点における音圧レベルである距離別信号音圧Ｐ_rとを含む。 After step S303, the sound volume calculation unit 308 by distance calculates the sound volume information 338 by distance using the speaker position sound volume information 133 (S304). Here, the volume information 338 by distance is
A distance-specific signal sp _r (t) that is an audio signal amplified by the speaker 111 at a point away from the speaker 111 and a sound pressure level at a point that is r [m] away from the speaker 111 Signal sound pressure _Pr .

具体的には、距離別音量算出部３０８は、スピーカ位置信号ｓｐ（ｔ）を用いて、下記式（１２）により距離別信号ｓｐ_r（ｔ）を算出する。つまり、距離別音量算出部３０８は、スピーカ位置信号ｓｐ（ｔ）を距離ｒで除算することにより、距離別信号ｓｐ_r（ｔ）を算出する。 Specifically, the distance-specific volume calculation unit 308 calculates the distance-specific signal sp _r (t) using the speaker position signal sp (t) according to the following equation (12). That is, the distance-specific sound volume calculation unit 308 calculates the distance-specific signal sp _r (t) by dividing the speaker position signal sp (t) by the distance r.

また、距離別音量算出部３０８は、直接波信号音圧Ｐ_dと距離ｄ_mとを用いて下記式（１３）により距離別信号音圧Ｐ_rを算出する。 The distance by volume calculation unit 308 calculates the distance-signal sound pressure P _r by the following formula (13) using the direct wave signal sound pressure P _d and the distance d _m.

次に、送信部１０４は、距離別音量算出部３０８により算出された距離別音量情報３３８を、通信網１０を経由して、コミュニケーション装置１２に送信する（Ｓ３０５）。 Next, the transmission unit 104 transmits the distance-specific volume information 338 calculated by the distance-specific volume calculation unit 308 to the communication device 12 via the communication network 10 (S305).

コミュニケーション装置１２の受信部１０５は、送信部１０４により送信された距離別音量情報３３８を受信する。表示部１０６は、受信部１０５により受信された距離別音量情報３３８で示される距離別の音量を、数値、又は映像（分布図等）で表示する。 The receiving unit 105 of the communication device 12 receives the volume information 338 according to distance transmitted by the transmitting unit 104. The display unit 106 displays the distance-specific volume indicated by the distance-specific volume information 338 received by the reception unit 105 as a numerical value or a video (distribution map or the like).

以上により、本発明の実施の形態３に係る音量監視装置３００は、スピーカ１１１とマイクロホン１１２との位置関係に依存しない、スピーカ１１１からの距離別の、当該スピーカ１１１から拡声される音量を取得し、取得した距離別の音量をコミュニケーション装置１２に通知できる。これにより、コミュニケーション装置１２のユーザ１２０は、自身の声がコミュニケーション装置１１において正しく拡声されているか否かを定量的に把握できる。 As described above, the volume monitoring apparatus 300 according to Embodiment 3 of the present invention acquires the volume that is loudened from the speaker 111 for each distance from the speaker 111 and does not depend on the positional relationship between the speaker 111 and the microphone 112. The communication device 12 can be notified of the acquired volume for each distance. As a result, the user 120 of the communication device 12 can quantitatively grasp whether or not his / her voice is correctly amplified in the communication device 11.

さらに、コミュニケーション装置１２において、スピーカ１１１の位置の音量のみでなく、スピーカ１１１からの距離別の音量が表示されるので、コミュニケーション装置１２のユーザ１２０は、よりコミュニケーション装置１１のユーザ１１０に聞こえている音量に近い音量を把握できる。 Furthermore, since not only the volume at the position of the speaker 111 but also the volume according to the distance from the speaker 111 is displayed on the communication device 12, the user 120 of the communication device 12 can be heard more by the user 110 of the communication device 11. The volume close to the volume can be grasped.

なお、上記説明では、距離別音量算出部３０８は、スピーカ１１１の位置から所定の距離毎の複数の距離別音量情報３３８を算出するとしたが、スピーカ１１１から予め定められた距離における１つの距離別音量情報３３８のみを算出してもよい。 In the above description, the volume-by-distance calculation unit 308 calculates a plurality of distance-by-distance volume information 338 for each predetermined distance from the position of the speaker 111, but for each distance at a predetermined distance from the speaker 111. Only the volume information 338 may be calculated.

また、距離別音量算出部３０８は、スピーカ１１１の位置から任意の複数の距離における複数の距離別音量情報３３８を算出してもよい。 Further, the volume-by-distance calculation unit 308 may calculate a plurality of distance-by-distance volume information 338 at an arbitrary plurality of distances from the position of the speaker 111.

また、距離別音量情報３３８は、スピーカ位置音量情報１３３を含んでもよい。
また、距離別音量情報３３８は、距離別信号ｓｐ_r（ｔ）、及び距離別信号音圧Ｐ_rのうち一方のみを含んでもよい。また、距離別音量情報３３８は、距離別信号ｓｐ_r（ｔ）、及び距離別信号音圧Ｐ_rのうち少なくとも一方を用いて作成したデータであってもよい。例えば、スピーカ位置音量情報１３３は、距離別の音量を視覚的に示す画像（分布図等）のデータであってもよい。 Moreover, the volume information 338 by distance may include speaker position volume information 133.
The distance-specific volume information 338 may include only one of the distance-specific signal sp _r (t) and the distance-specific signal sound pressure _Pr . The distance-specific volume information 338 may be data created using at least one of the distance-specific signal sp _r (t) and the distance-specific signal sound pressure _Pr . For example, the speaker position volume information 133 may be data of an image (distribution map or the like) that visually indicates the volume for each distance.

また、上記説明では、音量監視装置２００は、スピーカ位置音量情報１３３を算出したのちに、当該スピーカ位置音量情報１３３を用いて距離別音量情報３３８を算出するとしたが、上記式（１２）及び式（１３）に示すように、距離情報１３１及び直接波音量情報１３２とを用いて、直接的に距離別音量情報３３８を算出してもよい。 In the above description, the volume monitoring apparatus 200 calculates the speaker position volume information 133 and then calculates the distance volume information 338 using the speaker position volume information 133. However, the above formula (12) and formula As shown in (13), the distance-specific volume information 338 may be directly calculated using the distance information 131 and the direct wave volume information 132.

（実施の形態４）
本発明の実施の形態４に係る音量監視装置４００は、実施の形態３に係る音量監視装置３００の変形例であり、さらに、残響及び騒音の影響を考慮して、距離別音量情報３３８を調整する。 (Embodiment 4)
Volume monitoring device 400 according to Embodiment 4 of the present invention is a modification of volume monitoring device 300 according to Embodiment 3, and further adjusts volume-specific volume information 338 in consideration of the effects of reverberation and noise. To do.

図１１は、本発明の実施の形態４に係る音量監視装置４００のブロック図である。図１１に示す音量監視装置４００は、実施の形態３に係る音量監視装置３００の構成に加えて、さらに、残響特性算出部４０９と、騒音レベル算出部４１０と、距離別音量調整部４１１とを備える。なお、図９と同様の要素には同一の符号を付しており、重複する説明は省略する。 FIG. 11 is a block diagram of volume monitoring apparatus 400 according to Embodiment 4 of the present invention. In addition to the configuration of volume monitoring apparatus 300 according to Embodiment 3, volume monitoring apparatus 400 shown in FIG. 11 further includes reverberation characteristic calculation section 409, noise level calculation section 410, and volume-by-distance adjustment section 411. Prepare. In addition, the same code | symbol is attached | subjected to the element similar to FIG. 9, and the overlapping description is abbreviate | omitted.

残響特性算出部４０９は、スピーカ入力信号ｓ（ｔ）と収音信号ｍ（ｔ）とを用いて、スピーカ１１１が設置された空間の残響特性４３９を算出する。 The reverberation characteristic calculation unit 409 calculates the reverberation characteristic 439 of the space in which the speaker 111 is installed, using the speaker input signal s (t) and the sound collection signal m (t).

騒音レベル算出部４１０は、収音信号ｍ（ｔ）を用いて、騒音レベル４４０を生成する。騒音レベル４４０とは、コミュニケーション装置１１における騒音の音量であり、言い換えると、収音信号ｍ（ｔ）に含まれる騒音の音量である。 The noise level calculation unit 410 generates a noise level 440 using the collected sound signal m (t). The noise level 440 is the volume of noise in the communication device 11, in other words, the volume of noise included in the collected sound signal m (t).

距離別音量調整部４１１は、残響特性４３９と騒音レベル４４０とを用いて、距離別音量情報３３８を調整することにより、調整距離別音量情報４４１を生成する。具体的には、距離別音量調整部４１１は、直接波１４２のみに対応する距離別音量情報３３８に、残響の成分を加えることで、より実際の音量に近い調整距離別音量情報４４１を生成する。また、距離別音量調整部４１１は、距離別音量情報３３８が騒音レベル４４０以下の場合に、当該距離別音量を実質的に無音状態に調整する。言い換えると、距離別音量調整部４１１は、距離別音量情報３３８に含まれる音声のうち、騒音レベル４４０以下の音量の時間区間の音量を実質的にゼロにすることにより、距離別音量情報３３８を調整する。 The distance-by-distance volume adjusting unit 411 generates the adjustment-distance-by-distance volume information 441 by adjusting the distance-by-distance volume information 338 using the reverberation characteristics 439 and the noise level 440. Specifically, the volume control unit 411 for each distance generates volume information 441 for each adjustment distance that is closer to the actual volume by adding a reverberation component to the volume information 338 for each distance corresponding to only the direct wave 142. . Also, the distance-by-distance volume adjustment unit 411 adjusts the distance-by-distance sound volume substantially silent when the distance-by-distance volume information 338 is equal to or lower than the noise level 440. In other words, the sound volume adjusting unit 411 for each distance sets the sound volume information 338 for each distance by substantially reducing the sound volume in the time interval of the sound volume of the noise level 440 or less among the sounds included in the sound information 338 for each distance. adjust.

以上の様に構成された音量監視装置４００の動作を説明する。
図１２は、音量監視装置４００の動作の流れを示すフローチャートである。なお、図１２に示すステップＳ４０１〜Ｓ４０４の処理は、図１０に示すステップＳ３０１〜Ｓ３０４の処理と同様であり、説明は省略する。 The operation of the volume monitoring device 400 configured as described above will be described.
FIG. 12 is a flowchart showing an operation flow of the volume monitoring device 400. Note that the processing in steps S401 to S404 shown in FIG. 12 is the same as the processing in steps S301 to S304 shown in FIG.

ステップＳ４０４の後、残響特性算出部４０９は、スピーカ入力信号ｓ（ｔ）と収音信号ｍ（ｔ）とを用いて、残響特性４３９を算出する（Ｓ４０５）。この残響特性４３９は、直接波１４２と残響成分との比率α、及び残響成分に対応する音声信号である残響信号ｓ_rev（ｔ）のうち少なくとも一方を含む。ここで、残響成分とは、壁、床、又は物体などに複数回以上反射してマイクロホンに到達した信号成分である。 After step S404, the reverberation characteristic calculation unit 409 calculates the reverberation characteristic 439 using the speaker input signal s (t) and the collected sound signal m (t) (S405). The reverberation characteristic 439 includes at least one of a ratio α between the direct wave 142 and the reverberation component and a reverberation signal s _rev (t) that is an audio signal corresponding to the reverberation component. Here, the reverberation component is a signal component that reaches the microphone after being reflected more than once on the wall, floor, or object.

まず、残響特性算出部４０９は、スピーカ入力信号ｓ（ｔ）と収音信号ｍ（ｔ）と直接波１４２の時間遅延τ_maxとを用いて、下記式（１４）により、比率αを算出する。下記式（１４）に示すように比率αは、スピーカ入力信号ｓ（ｔ）及び収音信号ｍ（ｔ）の相関関数を利用することで、算出できる。 First, the reverberation characteristic calculation unit 409 calculates the ratio α by the following equation (14) using the speaker input signal s (t), the sound collection signal m (t), and the time delay τ _max of the direct wave 142. . As shown in the following formula (14), the ratio α can be calculated by using a correlation function of the speaker input signal s (t) and the sound pickup signal m (t).

なお、式（１４）におけるτ_revは残響成分の時間遅延である。また、残響成分は、直接波１４２に比べ、音圧が小さく、微々たる環境変動の影響を受けやすいため、振幅の値は揺らぎがちである。よって、下記式（１５）及び式（１６）のように平均値を用いて、比率αを算出することも有効である。 Note that τ _rev in the equation (14) is a time delay of the reverberation component. In addition, the reverberation component has a smaller sound pressure than the direct wave 142 and is easily affected by slight environmental fluctuations, and therefore the amplitude value tends to fluctuate. Therefore, it is also effective to calculate the ratio α using the average value as in the following formulas (15) and (16).

ここで、式（１５）及び式（１６）におけるＮは、平均値をとる時間区間であり、数ｍｓから１００ｍｓ程度の時間に相当するデータ長である。また、残響特性算出部４０９は、比率αを用いて、下記式（１７）により残響成分に対応する音声信号である残響信号ｓ_rev（ｔ）を算出できる。つまり、残響特性算出部４０９は、直接波信号ｓ_d（ｔ）に比率αを乗算することにより、残響信号ｓ_rev（ｔ）を算出できる。 Here, N in the equations (15) and (16) is a time interval in which the average value is taken, and is a data length corresponding to a time of about several ms to 100 ms. In addition, the reverberation characteristic calculation unit 409 can calculate the reverberation signal s _rev (t), which is an audio signal corresponding to the reverberation component, using the ratio α, according to the following equation (17). That is, the reverberation characteristic calculation unit 409 can calculate the reverberation signal s _rev (t) by multiplying the direct wave signal s _d (t) by the ratio α.

次に、距離別音量調整部４１１は、残響特性４３９を用いて、距離別音量情報３３８を調整する（Ｓ４０６）。具体的には、距離別音量調整部４１１は、距離別信号ｓｐ_r（ｔ）と残響信号ｓ_rev（ｔ）とを用いて、下記式（１８）により、残響調整距離別信号ｓｐ_r（ｔ）’を算出する。つまり、距離別音量調整部４１１は、距離別信号ｓｐ_r（ｔ）と、残響信号ｓ_rev（ｔ）とを加算することにより、残響調整距離別信号ｓｐ_r（ｔ）’を算出する。 Next, the distance-specific volume adjustment unit 411 adjusts the distance-specific volume information 338 using the reverberation characteristic 439 (S406). Specifically, the sound volume adjustment unit 411 for each distance uses the signal for each distance sp _r (t) and the reverberation signal s _rev (t), and the signal for each reverberation adjustment distance sp _r (t) according to the following equation (18). ) 'Is calculated. That is, the distance-specific volume adjustment unit 411 calculates the reverberation-adjusted distance-specific signal sp _r (t) ′ by adding the distance-specific signal sp _r (t) and the reverberation signal s _rev (t).

また、距離別音量調整部４１１は、比率αと直接波信号音圧Ｐ_dと距離ｄ_mとを用いて、下記式（１９）により残響調整距離別信号音圧Ｐ_r’を算出する。 The distance by volume adjusting unit 411, by using the ratio α and the direct wave signal sound pressure P _d and the distance d _m, to calculate the reverberation adjustment distance-signal sound pressure P _r 'by the following equation (19).

また、騒音レベル算出部４１０は、収音信号ｍ（ｔ）を用いて、騒音レベル４４０を生成する（Ｓ４０７）。具体的には、騒音レベルは、下記式（２０）に示す非音声区間の収音信号ｍ（ｔ）である騒音信号ｍ_n（ｔ）、及び、騒音信号ｍ_n（ｔ）の音圧である騒音信号音圧Ｐ_nとを含む。ここで非音声区間とは、コミュニケーション装置１１において音声が収音されない区間である。つまり、非音声区間とは、コミュニケーション装置１１のユーザ１１０、及びコミュニケーション装置１２のユーザ１２０が共に、発声していない区間である。また、騒音信号音圧Ｐ_nは、非音声区間の収音信号ｍ（ｔ）の音圧である。具体的には、騒音レベル算出部４１０は、下記式（２１）に示すように、雑音信号ｍ_n（ｔ）の二乗の平均値を用いて騒音信号音圧Ｐ_nを算出する。 Further, the noise level calculation unit 410 generates a noise level 440 using the collected sound signal m (t) (S407). Specifically, the noise level is a noise signal m _n (t), which is a sound collection signal m (t) in the non-speech section shown in the following equation (20), and a sound pressure of the noise signal m _n (t). Including a certain noise signal sound pressure P _n . Here, the non-speech section is a section in which voice is not collected by the communication device 11. That is, the non-speech section is a section in which neither the user 110 of the communication device 11 nor the user 120 of the communication device 12 is speaking. The noise signal sound pressure P _n is the sound pressure of the sound collection signal m (t) in the non-speech section. Specifically, the noise level calculation unit 410 calculates the noise signal sound pressure P _n using the mean value of the squares of the noise signal m _n (t) as shown in the following equation (21).

上記では、コミュニケーション装置１１のユーザ１１０、及びコミュニケーション装置１２のユーザ１２０が共に、発声していない区間における収音信号ｍ（ｔ）を用いて雑音信号を算出する例について述べたが、コミュニケーション装置１１のユーザ１１０のみが発声していない区間におけるエコーキャンセラ出力信号を用いても構わない。 In the above description, the example in which the noise signal is calculated using the collected sound signal m (t) in the section where the user 110 of the communication device 11 and the user 120 of the communication device 12 are not speaking has been described. An echo canceller output signal in a section where only the user 110 is not speaking may be used.

次に、距離別音量調整部４１１は、騒音レベル４４０を用いて、距離別音量情報３３８を調整することにより、調整距離別音量情報４４１を算出する（Ｓ４０８）。ここで、スピーカ１１１により拡声される音声のうち、ユーザ１１０に聞き取れる音声は、騒音信号音圧Ｐ_nより大きな音圧の音声である。よって、距離別音量調整部４１１は、騒音レベル４４０を用いて距離別音量情報に制限を掛ける。 Next, the volume control unit 411 for each distance calculates the volume information 441 for each adjustment distance by adjusting the volume information 338 for each distance using the noise level 440 (S408). Here, of the sound that is amplified by the speaker 111, the sound that can be heard by the user 110 is a sound having a sound pressure higher than the noise signal sound pressure _Pn . Thus, the distance-specific volume adjustment unit 411 uses the noise level 440 to limit the distance-specific volume information.

また、調整距離別音量情報４４１は、調整距離別信号ｓｐ_r（ｔ）’’と、調整距離別信号音圧Ｐ_r’’とを含む。 The volume information 441 by adjustment distance includes a signal sp _r (t) ″ by adjustment distance and a signal sound pressure P _r ″ by adjustment distance.

具体的には、距離別音量調整部４１１は、雑音信号ｍ_n（ｔ）と残響調整距離別信号ｓｐ_r（ｔ）’とを用いて、下記式（２２）により、補正距離別音声信号ｓｐ_r（ｔ）’’を算出する。つまり、距離別音量調整部４１１は、残響調整距離別信号ｓｐ_r（ｔ）’に含まれる音声信号のうち、雑音信号ｍ_n（ｔ）より小さい振幅の時間区間の音声信号を実質的にゼロにすることにより、補正距離別音声信号ｓｐ_r（ｔ）’’を算出する。 Specifically, the sound volume adjustment unit 411 for each distance uses the noise signal m _n (t) and the signal for each reverberation adjustment distance sp _r (t) ′, and the sound signal sp for each correction distance according to the following equation (22). _r (t) '' is calculated. That is, the sound volume adjusting unit 411 for each distance substantially eliminates the sound signal in the time interval having an amplitude smaller than the noise signal m _n (t) among the sound signals included in the signal for each reverberation adjusting distance sp _r (t) ′. Thus, the audio signal sp _r (t) ″ for each correction distance is calculated.

また、距離別音量調整部４１１は、騒音信号音圧Ｐ_nと残響調整距離別信号音圧Ｐ_r’とを用いて、下記式（２３）により、補正距離別音声信号音圧Ｐ_r’’を算出する。つまり、距離別音量調整部４１１は、残響調整距離別信号音圧Ｐ_r’が騒音信号音圧Ｐ_nより小さい場合に、残響調整距離別信号音圧Ｐ_r’を実質的にゼロにすることで、補正距離別音声信号音圧Ｐ_r’’を算出する。 The distance by volume adjusting unit 411, the noise signal sound pressure P _n and reverberation adjustment distance-signal sound pressure P _r 'using the, by the following equation (23), the correction distance-audio signal sound pressure P _r' ' Is calculated. That is, the sound volume adjustment unit 411 for each distance sets the signal sound pressure P _r 'for each reverberation adjustment distance to substantially zero when the signal sound pressure P _r ' for each reverberation adjustment distance is smaller than the noise signal sound pressure P _n. Then, the sound signal sound pressure P _r ″ for each correction distance is calculated.

次に、送信部１０４は、距離別音量調整部４１１により算出された調整距離別音量情報４４１を、通信網１０を経由して、コミュニケーション装置１２に送信する（Ｓ４０９）。 Next, the transmission unit 104 transmits the volume information 441 by adjustment distance calculated by the volume adjustment unit 411 by distance to the communication device 12 via the communication network 10 (S409).

コミュニケーション装置１２の受信部１０５は、送信部１０４により送信された調整距離別音量情報４４１を受信する。表示部１０６は、受信部１０５により受信された調整距離別音量情報４４１で示される距離別の音量を、数値、又は映像（分布図等）で表示する。 The reception unit 105 of the communication device 12 receives the volume information 441 by adjustment distance transmitted by the transmission unit 104. The display unit 106 displays the volume for each distance indicated by the volume information 441 for each adjustment distance received by the reception unit 105 as a numerical value or a video (distribution map or the like).

以上より、本発明の実施の形態４に係る音量監視装置４００は、残響及び雑音の影響を考慮した音量を算出できるので、実際にユーザが聞く音量に、よりマッチした音量を取得し、取得した音量をコミュニケーション装置１２に送信できる。 As described above, the volume monitoring apparatus 400 according to the fourth embodiment of the present invention can calculate the volume in consideration of the effects of reverberation and noise, and thus acquires and acquires a volume that more closely matches the volume actually heard by the user. The volume can be transmitted to the communication device 12.

なお、上記説明では、距離別音量調整部４１１は、距離別音量情報３３８を、残響特性４３９を用いて調整した（Ｓ４０６）後に、さらに騒音レベル４４０を用いて調整する（Ｓ４０８）場合について説明したが、距離別音量調整部４１１は、距離別音量情報３３８を、騒音レベル４４０を用いて調整した（Ｓ４０８）後、騒音レベル４４０を用いて調整（Ｓ４０８）してもよい。 In the above description, the distance-dependent volume adjustment unit 411 adjusts the distance-specific volume information 338 using the reverberation characteristics 439 (S406), and further adjusts using the noise level 440 (S408). However, the sound volume adjusting unit 411 for each distance may adjust the sound volume information 338 for each distance using the noise level 440 (S408) and then adjusting the sound level information 338 using the noise level 440 (S408).

また、距離別音量調整部４１１は、距離別音量情報３３８に対して、残響特性４３９を用いた調整、及び、騒音レベル４４０を用いた調整のうち、いずれか一方のみを行ってもよい。 Further, the sound volume adjustment unit 411 according to distance may perform only one of the adjustment using the reverberation characteristic 439 and the adjustment using the noise level 440 with respect to the sound volume information 338 according to distance.

具体的には、距離別音量調整部４１１が距離別音量情報３３８に対して、騒音レベル４４０を用いた調整のみを行う場合、距離別音量調整部４１１により算出される調整距離別信号ｓｐ_r（ｔ）’’と調整距離別信号音圧Ｐ_r’’とは、下記式（２４）、及び式（２５）で示される。 Specifically, when the distance-specific volume adjustment unit 411 performs only the adjustment using the noise level 440 on the distance-specific volume information 338, the distance-specific volume adjustment unit 411 calculates the adjustment distance-specific signal sp _r ( t) ″ and the signal sound pressure P _r ″ for each adjustment distance are expressed by the following equations (24) and (25).

なお、調整距離別音量情報４４１は、調整距離別信号ｓｐ_r（ｔ）’’、及び調整距離別信号音圧Ｐ_r’’のうち一方のみを含んでもよい。同様に、残響特性４３９は、比率α、及び残響信号ｓ_rev（ｔ）のうち一方のみを含んでもよい。また、騒音レベル４４０は、騒音信号ｍ_n（ｔ）、及び騒音信号音圧Ｐ_nのうち一方のみを含んでもよい。 Note that the adjustment distance-specific volume information 441 may include only one of the adjustment distance-specific signal sp _r (t) ″ and the adjustment distance-specific signal sound pressure P _r ″. Similarly, the reverberation characteristic 439 may include only one of the ratio α and the reverberation signal s _rev (t). In addition, the noise level 440 may include only one of the noise signal m _n (t) and the noise signal sound pressure P _n .

また、実施の形態１又は２に係る音量監視装置１００又は２００が、さらに、騒音レベル算出部４１０及び距離別音量調整部４１１を備えてもよい。言い換えると、距離別音量調整部４１１は、スピーカ位置音量情報１３３に対して、上述した騒音レベル４４０を用いた調整を行ってもよい。 Further, the sound volume monitoring apparatus 100 or 200 according to the first or second embodiment may further include a noise level calculation unit 410 and a sound volume adjustment unit 411 for each distance. In other words, the sound volume adjustment unit 411 for each distance may perform adjustment using the noise level 440 described above on the speaker position sound volume information 133.

（実施の形態５）
本発明の実施の形態５に係る音量監視装置５００は、実施の形態４に係る音量監視装置４００の変形例であり、実施の形態２に係る音量監視装置２００と同様に、音響伝達特性２３７を用いて残響特性４３９を推定する。 (Embodiment 5)
A volume monitoring apparatus 500 according to Embodiment 5 of the present invention is a modification of the volume monitoring apparatus 400 according to Embodiment 4, and has an acoustic transfer characteristic 237 similar to the volume monitoring apparatus 200 according to Embodiment 2. The reverberation characteristic 439 is estimated by using it.

図１３は、本発明の実施の形態５に係る音量監視装置５００の構成を示すブロック図である。なお、図６及び図１１と同様の要素には同一の符号を付しており、重複する説明は省略する。 FIG. 13 is a block diagram showing a configuration of volume monitoring apparatus 500 according to Embodiment 5 of the present invention. Elements similar to those in FIGS. 6 and 11 are denoted by the same reference numerals, and redundant description is omitted.

図１３に示すように、音量監視装置５００は、実施の形態４に係る音量監視装置４００の構成に加え、さらに、音響伝達特性算出部２０７を備える。また、音量監視装置５００では、実施の形態４に係る音量監視装置４００に対して、距離算出部２０１と直接波音量算出部２０２と残響特性算出部５０９との構成が異なる。 As shown in FIG. 13, volume monitoring apparatus 500 further includes an acoustic transfer characteristic calculation unit 207 in addition to the configuration of volume monitoring apparatus 400 according to Embodiment 4. Further, in the volume monitoring device 500, the configuration of the distance calculation unit 201, the direct wave volume calculation unit 202, and the reverberation characteristic calculation unit 509 is different from the volume monitoring device 400 according to the fourth embodiment.

なお、音響伝達特性算出部２０７と距離算出部２０１と直接波音量算出部２０２との構成は、実施の形態２と同様であり説明は省略する。 Note that the configurations of the acoustic transfer characteristic calculation unit 207, the distance calculation unit 201, and the direct wave volume calculation unit 202 are the same as those in the second embodiment, and a description thereof will be omitted.

ここで、実施の形態２と同様に、音響伝達特性算出部２０７は、従来から用いられているエコーキャンセラの機能の一部を用いることで実現できる。よって、本発明の実施の形態５に係る音量監視装置５００は、コミュニケーション装置１１の機能追加（コスト増加）を抑制できる。 Here, as in the second embodiment, the acoustic transfer characteristic calculation unit 207 can be realized by using a part of functions of an echo canceller that has been conventionally used. Therefore, volume monitoring apparatus 500 according to Embodiment 5 of the present invention can suppress the function addition (cost increase) of communication apparatus 11.

残響特性算出部５０９は、音響伝達特性２３７を用いて残響特性４３９を算出する。具体的には、残響特性算出部５０９は、音響伝達特性ｈ（τ）を用いて、下記式（２６）、式（２７）又は式（２８）により、直接波１４２と残響成分との比率αを算出する。 The reverberation characteristic calculation unit 509 calculates the reverberation characteristic 439 using the acoustic transfer characteristic 237. Specifically, the reverberation characteristic calculation unit 509 uses the acoustic transfer characteristic h (τ) to calculate the ratio α between the direct wave 142 and the reverberation component according to the following equation (26), equation (27), or equation (28). Is calculated.

また、残響特性算出部５０９は、音響伝達特性ｈ（τ）を用いて、下記式（２９）により、残響信号ｓ_rev（ｔ）を算出する。 In addition, the reverberation characteristic calculation unit 509 calculates the reverberation signal s _rev (t) by the following equation (29) using the acoustic transfer characteristic h (τ).

なお、上記式（２９）ではτ_revという１点のデータを基に残響信号ｓ_rev（ｔ）を算出したが、τ_revの前後数ｍｓ〜数１００ｍｓに相当するＮ点のデータを利用して式（３０）のように算出してもよい。 The above formula (29) in was calculated reverberation signal s _rev (t) on the basis of the data of one point of tau _rev, by utilizing the data of N points corresponding to the front and rear number ms~ number 100ms of tau _rev You may calculate like Formula (30).

以上より、本発明の実施の形態５に係る音量監視装置５００は、従来から用いられているエコーキャンセラで算出される音響伝達特性２３７を用いて、残響特性４３９を算出する。これにより、音量監視装置５００は、コミュニケーション装置１１に対する機能追加（コスト増加）を抑制できる。 From the above, the sound volume monitoring apparatus 500 according to Embodiment 5 of the present invention calculates the reverberation characteristic 439 using the acoustic transfer characteristic 237 calculated by the echo canceller conventionally used. Thereby, the volume monitoring apparatus 500 can suppress the function addition (cost increase) to the communication apparatus 11.

なお、上記説明では、距離算出部２０１、直接波音量算出部２０２及び残響特性算出部５０９のうち全てが、音響伝達特性２３７を用いた場合について説明したが、距離算出部２０１、直接波音量算出部２０２及び残響特性算出部５０９のうち一以上が音響伝達特性２３７を用いてもよい。 In the above description, the distance calculation unit 201, the direct wave volume calculation unit 202, and the reverberation characteristic calculation unit 509 all use the acoustic transfer characteristic 237. However, the distance calculation unit 201, direct wave volume calculation One or more of the unit 202 and the reverberation characteristic calculation unit 509 may use the acoustic transfer characteristic 237.

また、上記説明では、音声がモノラルの場合について説明したが、２チャンネル以上の場合、つまりスピーカ１１１が複数の場合でも、音響伝達特性算出部２０７がマルチチャンネルの音響伝達特性２３７を算出し、残響特性算出部５０９がマルチチャンネルの音響伝達特性２３７を利用することで、スピーカ１１１毎の残響特性４３９を算出できる。 In the above description, the case where the sound is monaural has been described. However, even when there are two or more channels, that is, when there are a plurality of speakers 111, the sound transfer characteristic calculation unit 207 calculates the multi-channel sound transfer characteristics 237, and the reverberation. The characteristic calculation unit 509 can calculate the reverberation characteristic 439 for each speaker 111 by using the multi-channel acoustic transfer characteristic 237.

（実施の形態６）
本発明の実施の形態６に係る音量監視装置６００は、実施の形態１に係る音量監視装置１００の変形例であり、音量監視装置１００の機能に加え、スピーカ１１１により拡声される音声が所定の音量以上の場合、コミュニケーション装置１２に警告を送る機能を有する。 (Embodiment 6)
Volume monitoring apparatus 600 according to Embodiment 6 of the present invention is a modification of volume monitoring apparatus 100 according to Embodiment 1, and in addition to the function of volume monitoring apparatus 100, the sound that is loudened by speaker 111 is predetermined. When the volume is higher than the volume, the communication device 12 has a function of sending a warning.

図１４は、本発明の実施の形態６に係る音量監視装置６００の構成を示すブロック図である。なお、図２と同様の要素には同一の符号を付しており、重複する説明は省略する。 FIG. 14 is a block diagram showing a configuration of volume monitoring apparatus 600 according to Embodiment 6 of the present invention. Elements similar to those in FIG. 2 are denoted by the same reference numerals, and redundant description is omitted.

図１４に示す音量監視装置６００は、音量監視装置１００の構成に加え、さらに、警告部６１２を備える。 A volume monitoring apparatus 600 shown in FIG. 14 includes a warning unit 612 in addition to the configuration of the volume monitoring apparatus 100.

警告部６１２は、スピーカ位置音量算出部１０３により算出されたスピーカ位置音量情報１３３で示される音量が所定の音量以上であるか否かを判定する。また、警告部６１２は、スピーカ位置音量情報１３３で示される音量が所定の音量以上であると判定した場合、送信部１０４及び通信網１０を経由して、スピーカ１１１により拡声された音声の音量が所定の音量以上であることを示す警告情報６４２を、コミュニケーション装置１２に送信する。言い換えると、警告情報６４２は、コミュニケーション装置１１において、過大音が拡声されたことを警告する情報である。 The warning unit 612 determines whether or not the volume indicated by the speaker position volume information 133 calculated by the speaker position volume calculation unit 103 is equal to or higher than a predetermined volume. Further, when the warning unit 612 determines that the volume indicated by the speaker position volume information 133 is equal to or higher than the predetermined volume, the volume of the sound amplified by the speaker 111 via the transmission unit 104 and the communication network 10 is increased. Warning information 642 indicating that the volume is above a predetermined level is transmitted to the communication device 12. In other words, the warning information 642 is information that warns that an excessive sound has been amplified in the communication device 11.

以上の構成により、本発明の実施の形態６に係る音量監視装置６００は、コミュニケーション装置１２から送信された音声が、スピーカ１１１により非常に大きい音で拡声されている場合には、コミュニケーション装置１２に警告を通知できる。 With the above configuration, the volume monitoring device 600 according to Embodiment 6 of the present invention allows the communication device 12 to communicate with the communication device 12 when the sound transmitted from the communication device 12 is loudly louded by the speaker 111. A warning can be notified.

なお、送信部１０４は、スピーカ位置音量情報１３３、及び警告情報６４２を共に、コミュニケーション装置１２に送信してもよいし、警告情報６４２のみをコミュニケーション装置１２に送信してもよい。 The transmitting unit 104 may transmit both the speaker position volume information 133 and the warning information 642 to the communication device 12 or may transmit only the warning information 642 to the communication device 12.

また、上記説明では、実施の形態１に係る音量監視装置１００が、さらに、警告部６１２を備える例を説明したが、実施の形態２〜５に係る音量監視装置２００〜５００が、さらに、警告部６１２を備えてもよい。言い換えると、警告部６１２は、スピーカ１１１から所定の距離ｒにおける音量が所定の音量以上である場合に、警告情報６４２を、コミュニケーション装置１２に送信してもよい。 In the above description, the volume monitoring apparatus 100 according to the first embodiment further includes the warning unit 612. However, the volume monitoring apparatuses 200 to 500 according to the second to fifth embodiments further include a warning. A portion 612 may be provided. In other words, the warning unit 612 may transmit the warning information 642 to the communication device 12 when the volume at the predetermined distance r from the speaker 111 is equal to or higher than the predetermined volume.

（実施の形態７）
本発明の実施の形態７に係る音量監視装置７００は、実施の形態１に係る音量監視装置１００の変形例であり、指向性マイクロホン７１２ａ及び７１２ｂにより、収音された複数の収音信号７４２ａ及び７４２ｂを無指向性の収音信号ｍ（ｔ）に変換した後に、当該変換した収音信号ｍ（ｔ）を用いて、スピーカ位置における音量を算出する。 (Embodiment 7)
A volume monitoring apparatus 700 according to Embodiment 7 of the present invention is a modification of the volume monitoring apparatus 100 according to Embodiment 1, and a plurality of sound collection signals 742a collected by directional microphones 712a and 712b and After converting 742b into an omnidirectional sound pickup signal m (t), the volume at the speaker position is calculated using the converted sound pickup signal m (t).

図１５は、本発明の実施の形態７に係る音量監視装置７００の構成を示すブロック図である。なお、図２と同様の要素には同一の符号を付しており、重複する説明は省略する。 FIG. 15 is a block diagram showing a configuration of volume monitoring apparatus 700 according to Embodiment 7 of the present invention. Elements similar to those in FIG. 2 are denoted by the same reference numerals, and redundant description is omitted.

図１５に示す音量監視装置７００は、音量監視装置１００の構成に加え、さらに、無指向化部７１３を備える。また、コミュニケーション装置１１は、可変の指向性を有する複数のマイクロホン７１２ａ及び７１２ｂを備える。 A volume monitoring apparatus 700 shown in FIG. 15 further includes an omnidirectional unit 713 in addition to the configuration of the volume monitoring apparatus 100. The communication device 11 also includes a plurality of microphones 712a and 712b having variable directivities.

無指向化部７１３は、マイクロホン７１２ａ及び７１２ｂにより収音された指向性を有する収音信号７４２ａ及び７４２ｂを、無指向化することにより、収音信号ｍ（ｔ）を生成する。例えば、複数の指向性マイクロホンで全方位の音を収音するようなマイクロホンシステムの場合、無指向化部７１３は、各マイクロホンの収音信号を、各方向からの音が均等の音圧で収音されるように加算することにより、収音信号ｍ（ｔ）を生成する。 The omnidirectional unit 713 generates a sound collection signal m (t) by making the sound collection signals 742a and 742b having directivity collected by the microphones 712a and 712b omnidirectional. For example, in the case of a microphone system that collects omnidirectional sound with a plurality of directional microphones, the omnidirectional unit 713 collects the sound collected from each microphone with sound pressure equal to the sound from each direction. The sound pickup signal m (t) is generated by adding the sound so as to be heard.

また、距離算出部１０１及び直接波音量算出部１０２は、無指向化部７１３により無指向化された収音信号ｍ（ｔ）を用いて、距離情報１３１及び直接波音量情報１３２を算出する。 In addition, the distance calculation unit 101 and the direct wave volume calculation unit 102 calculate the distance information 131 and the direct wave volume information 132 using the sound collection signal m (t) that has been omnidirectional by the omnidirectional unit 713.

以上の構成により、本発明の実施の形態７に係る音量監視装置７００は、指向性のマイクロホンが使用するコミュニケーション装置１１においても、スピーカ１１１とマイクロホン１１２との位置関係に依存しない、スピーカ１１１から拡声される音量を取得できる。 With the above configuration, the volume monitoring device 700 according to Embodiment 7 of the present invention is louder than the speaker 111 that does not depend on the positional relationship between the speaker 111 and the microphone 112 even in the communication device 11 used by the directional microphone. Can be obtained volume.

なお、上記説明では、コミュニケーション装置１１は無指向化した収音信号ｍ（ｔ）を、通信網１０を経由して、コミュニケーション装置１２に送信しているが、指向性を有する複数の収音信号７４２ａ〜７４２ｂをコミュニケーション装置１２に送信してもよい。 In the above description, the communication device 11 transmits the omnidirectional sound pickup signal m (t) to the communication device 12 via the communication network 10, but a plurality of sound pickup signals having directivity are provided. 742a to 742b may be transmitted to the communication device 12.

また、指向性マイクロホンの数は、２以上であってもよい。
また、上記説明では、実施の形態１に係る音量監視装置１００が、さらに、無指向化部７１３を備える例を説明したが、実施の形態２〜６に係る音量監視装置２００〜６００が、さらに、無指向化部７１３を備えてもよい。 The number of directional microphones may be two or more.
In the above description, the sound volume monitoring apparatus 100 according to Embodiment 1 further includes an omnidirectional unit 713, but the sound volume monitoring apparatuses 200 to 600 according to Embodiments 2 to 6 are further provided. The omnidirectional unit 713 may be provided.

（実施の形態８）
本発明の実施の形態８に係る音量監視装置８００は、実施の形態３に係る音量監視装置３００の変形例であり、さらに、スピーカ１１１により拡声された音声がユーザ１１０に聞こえる範囲を示す拡声領域情報８４４を算出する。 (Embodiment 8)
Volume monitoring apparatus 800 according to Embodiment 8 of the present invention is a modification of volume monitoring apparatus 300 according to Embodiment 3, and further includes a loudspeaker region that indicates a range in which user 110 can hear the sound that is loudened by speaker 111. Information 844 is calculated.

図１６は、本発明の実施の形態８に係る音量監視装置８００の構成を示すブロック図である。なお、図９と同様の要素には同一の符号を付しており、重複する説明は省略する。 FIG. 16 is a block diagram showing a configuration of volume monitoring apparatus 800 according to Embodiment 8 of the present invention. In addition, the same code | symbol is attached | subjected to the element similar to FIG. 9, and the overlapping description is abbreviate | omitted.

図１６に示す音量監視装置８００は、音量監視装置３００の構成に加え、さらに、拡声領域算出部８１４を備える。 A volume monitoring apparatus 800 shown in FIG. 16 includes a loudspeaker area calculation unit 814 in addition to the configuration of the volume monitoring apparatus 300.

拡声領域算出部８１４、距離別音量情報３３８を用いて、スピーカ１１１により拡声された音声が、予め定められた音量より大きくなる範囲（スピーカ１１１からの距離）を示す拡声領域情報８４４を算出する。具体的には、拡声領域算出部８１４は、複数の距離における距離別信号音圧Ｐ_rが、予め定められた値より大きくなる距離ｒを拡声領域情報８４４として算出する。 Using the loudness area calculation unit 814 and the volume information 338 by distance, loudness area information 844 indicating a range (distance from the speaker 111) in which the sound loudened by the speaker 111 is larger than a predetermined volume is calculated. Specifically, loudspeaker region calculation unit 814, distance-signal sound pressure P _r in a plurality of distances, to calculate a larger distance r from a predetermined value as the loudspeaker region information 844.

また、送信部１０４は、拡声領域算出部８１４により算出された拡声領域情報８４４を、通信網１０を経由して、コミュニケーション装置１２に送信する。 In addition, the transmission unit 104 transmits the sound area information 844 calculated by the sound area calculation unit 814 to the communication device 12 via the communication network 10.

なお、送信部１０４は、拡声領域情報８４４のみをコミュニケーション装置１２に送信してもよいし、拡声領域情報８４４と、距離別音量情報３３８とをコミュニケーション装置１２に送信してもよい。 Note that the transmission unit 104 may transmit only the sound area information 844 to the communication apparatus 12 or may transmit the sound area information 844 and the sound volume information 338 for each distance to the communication apparatus 12.

また、拡声領域情報８４４は、スピーカ１１１により拡声された音声が、予め定められた音量より大きくなる範囲を視覚的に示す画像のデータであってもよい。また、音量監視装置８００は、拡声領域情報８４４及び距離別音量情報３３８を用いて、距離別の音量を視覚的に示す画像（分布図等）に、予め定められた音量より大きくなる範囲を視覚的に示す情報（当該範囲を示す枠等）を加えた画像を生成し、当該画像のデータをコミュニケーション装置１２に送信してもよい。 Further, the sound enhancement area information 844 may be image data that visually indicates a range in which the sound amplified by the speaker 111 is larger than a predetermined volume. In addition, the volume monitoring apparatus 800 uses the loudness area information 844 and the distance-specific volume information 338 to visually recognize a range that is larger than a predetermined volume on an image (distribution map or the like) that visually indicates the volume by distance. An image to which information (such as a frame indicating the range) is added may be generated, and data of the image may be transmitted to the communication device 12.

また、上記説明では、実施の形態３に係る音量監視装置３００が、さらに、拡声領域算出部８１４を備える例を説明したが、実施の形態４又は５に係る音量監視装置４００又は５００が、さらに、拡声領域算出部８１４を備えてもよい。 In the above description, the sound volume monitoring apparatus 300 according to the third embodiment has been described as further including the loudness area calculation unit 814. However, the sound volume monitoring apparatus 400 or 500 according to the fourth or fifth embodiment is further provided. The loudspeaker area calculation unit 814 may be provided.

また、上記実施の形態１〜８の説明では、音声を受信する側のコミュニケーション装置１１が、音量監視装置１００〜８００を備えるとしたが、音声を送信する側のコミュニケーション装置１２が、音量監視装置１００〜８００を備えてもよい。さらに、通信網１０に接続されるサーバ等が、音量監視装置１００〜８００を備えてもよい。 In the description of the first to eighth embodiments, the communication device 11 on the receiving side includes the sound volume monitoring devices 100 to 800. However, the communication device 12 on the transmitting side transmits the sound monitoring device. You may provide 100-800. Further, a server or the like connected to the communication network 10 may include the volume monitoring devices 100 to 800.

（実施の形態９）
本発明の実施の形態９では、上述した実施の形態１〜８に係る音量監視装置１００〜８００のいずれかを備えるコミュニケーション装置１１を含むコミュニケーションシステムについて説明する。 (Embodiment 9)
In the ninth embodiment of the present invention, a communication system including the communication device 11 including any one of the sound volume monitoring devices 100 to 800 according to the first to eighth embodiments described above will be described.

図１７は、本発明の実施の形態９に係る２拠点でのコミュニケーションサービスを実現するコミュニケーションシステム２の構成を示す図である。 FIG. 17 is a diagram showing a configuration of a communication system 2 that implements a communication service at two locations according to Embodiment 9 of the present invention.

図１７において、第１の拠点に配置されるコミュニケーション装置１１と、第２の拠点に配置されるコミュニケーション装置１２とは、通信機能を有する映像音声制御装置であり、通信網１０を介して相互接続が可能である。 In FIG. 17, the communication device 11 disposed at the first base and the communication device 12 disposed at the second base are video / audio control devices having a communication function, and are interconnected via the communication network 10. Is possible.

コミュニケーション装置１１は、第１の拠点におけるリアルタイムな映像音声データを、カメラ及びマイクから取得し、取得した映像音声データを、通信網１０を介してコミュニケーション装置１２に送信する。また、コミュニケーション装置１１は、第２の拠点におけるリアルタイムな映像音声データを、コミュニケーション装置１２から受信し、自装置のディスプレイ及びスピーカに出力する。 The communication device 11 acquires real-time video / audio data at the first site from the camera and the microphone, and transmits the acquired video / audio data to the communication device 12 via the communication network 10. Further, the communication device 11 receives the real-time video / audio data at the second site from the communication device 12 and outputs it to the display and the speaker of the own device.

また、通信網１０を介しているにもかかわらず、距離による影響を低減した、よりリアルなコミュニケーションサービスを提供するために、コミュニケーション装置１１及び１２は、複数個のディスプレイ、カメラ、マイク、及びスピーカを備える。これらの入出力装置は、予め適した位置に配置されており、この配置に特徴を有している。これについては、図を用いて後で詳細に説明する。 In addition, in order to provide a more realistic communication service that reduces the influence of distance despite the communication network 10, the communication devices 11 and 12 include a plurality of displays, cameras, microphones, and speakers. Is provided. These input / output devices are arranged in advance at suitable positions, and are characterized by this arrangement. This will be described in detail later with reference to the drawings.

通信網１０は、有線回線でも無線回線でもよく、また、この両方の組み合わせであってもよい。また、インターネット又は公衆電話回線などのパブリックネットワークであってもよく、ＬＡＮ（ＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ）などの限られたドメインの中でクローズされたローカルネットワークであってもよく、また、この両方の組み合わせであってもよい。 The communication network 10 may be a wired line or a wireless line, or a combination of both. Further, it may be a public network such as the Internet or a public telephone line, a local network closed in a limited domain such as a LAN (Local Area Network), or a combination of both. There may be.

また、コミュニケーション装置１１とコミュニケーション装置１２との間の通信には、例えば、ＲＴＰ（Ｒｅａｌ−ｔｉｍｅＴｒａｎｓｐｏｒｔＰｒｏｔｏｃｏｌ）又はＴＣＰ（ＴｒａｎｓｍｉｓｓｉｏｎＣｏｎｔｒｏｌＰｒｏｔｏｃｏｌ）を用いたデジタル通信が用いられる。 For communication between the communication device 11 and the communication device 12, for example, digital communication using RTP (Real-time Transport Protocol) or TCP (Transmission Control Protocol) is used.

また、コミュニケーション装置１１及び１２は、ネットワーク上の位置を示すアドレス情報としてＩＰアドレスが割り当てられているものとする。なお、ＩＰアドレスでなく、電話番号など他の情報をアドレス情報として用いてもよい。 Further, it is assumed that the communication devices 11 and 12 are assigned IP addresses as address information indicating positions on the network. Other information such as a telephone number may be used as the address information instead of the IP address.

また、コミュニケーション装置１１及び１２が送受信するデータは、リアルタイムな映像音声データとしたが、光ディスク又はハードディスクなどの記憶媒体に記録されている映像音声データも、リアルタイムな映像音声データと共に送受信することができる。また、コミュニケーション装置１１及び１２が送受信するデータは、静止画データ、テキスト、又はＨＴＭＬなどの文書データでもよい。 Further, the data transmitted and received by the communication devices 11 and 12 is real-time video / audio data. However, the video / audio data recorded on a storage medium such as an optical disk or a hard disk can also be transmitted / received together with the real-time video / audio data. . The data transmitted and received by the communication devices 11 and 12 may be still image data, text, or document data such as HTML.

以上により、コミュニケーション装置１１とコミュニケーション装置１２とは、予め適した位置に配置された複数個の入出力装置を用いて、他拠点のリアルタイムな映像及び音声を出力することが可能となり、よりリアルなコミュニケーションサービスを提供することができる。 As described above, the communication device 11 and the communication device 12 can output real-time video and audio of other bases by using a plurality of input / output devices arranged in advance in a suitable position. Communication services can be provided.

また、図１７に示した２拠点でのコミュニケーションサービスだけでなく、３拠点以上での相互接続によるコミュニケーションサービスが可能である。図１８は、前述したコミュニケーション装置１１及び１２以外の機器も備えたコミュニケーションシステム３の構成の一例を示している。 Further, not only the communication service at the two bases shown in FIG. 17 but also the communication service by the interconnection at three or more bases is possible. FIG. 18 shows an example of the configuration of the communication system 3 including devices other than the communication devices 11 and 12 described above.

図１８に示すコミュニケーションシステム３では、コミュニケーション装置１１と、コミュニケーション装置１２と、ノートＰＣ（パーソナルコンピュータ）１３と、ＰＤＡ（ＰｅｒｓｏｎａｌＤｉｇｉｔａｌＡｓｓｉｓｔａｎｔ）１５と、携帯電話１６と、ディスクトップＰＣ１９とが接続され、５拠点でのコミュニケーションシステムが実施される。なお、ここでは、公衆電話回線である通信網１０とインターネット１８とが、インターネットサービスプロバイダであるサーバ１７を介して接続されているものとする。 In the communication system 3 shown in FIG. 18, a communication device 11, a communication device 12, a notebook PC (personal computer) 13, a PDA (Personal Digital Assistant) 15, a mobile phone 16, and a desktop PC 19 are connected. A communication system is implemented at five locations. Here, it is assumed that the communication network 10 that is a public telephone line and the Internet 18 are connected via a server 17 that is an Internet service provider.

また、ノートＰＣ１３、ＰＤＡ１５、携帯電話１６、及びディスクトップＰＣ１９のうち１以上が、上述した音量監視装置１００〜８００のいずれかを備えてもよい。 One or more of the notebook PC 13, PDA 15, mobile phone 16, and desktop PC 19 may include any of the volume monitoring devices 100 to 800 described above.

コミュニケーション装置１１及び１２は、図１７と同じであるため、それ以外の機器について説明する。なお、図１８に示した通り、コミュニケーション装置１１及び１２を構成する構成要素の一部又は全部は、１個のシステムＬＳＩ（ＬａｒｇｅＳｃａｌｅＩｎｔｅｇｒａｔｉｏｎ：大規模集積回路）から構成されているとしてもよい。つまり、上述した音量監視装置１００〜８００を構成する構成要素の一部又は全部は、１個のシステムＬＳＩから構成されているとしてもよい。 Since the communication devices 11 and 12 are the same as those in FIG. 17, other devices will be described. As shown in FIG. 18, some or all of the constituent elements constituting the communication devices 11 and 12 may be configured by one system LSI (Large Scale Integration). That is, some or all of the constituent elements of the volume monitoring devices 100 to 800 described above may be configured by one system LSI.

ノートＰＣ１３は、カメラ機能を内蔵しておらず、外付けでカメラ１４が接続されている。カメラ１４は、デジタルビデオカメラなどの動画撮影が可能な機器である。ノートＰＣ１３は、カメラ１４により撮影された映像音声データを、自拠点でのリアルタイムな映像音声データとして、通信網１０を介して、他機器に送信する。なお、カメラ１４が、動画撮影機能を有しておらず、静止画撮影機能のみの場合、撮影した静止画データを一定間隔で送信してもよい。 The notebook PC 13 does not have a built-in camera function, and the camera 14 is connected externally. The camera 14 is a device capable of moving image shooting such as a digital video camera. The notebook PC 13 transmits the video / audio data captured by the camera 14 to other devices via the communication network 10 as real-time video / audio data at the local site. When the camera 14 does not have a moving image shooting function but has only a still image shooting function, the shot still image data may be transmitted at regular intervals.

ＰＤＡ１５は、カメラ機能を有しておらず、自拠点のリアルタイムな映像データを送信することができない。ＰＤＡ１５は、通信網１０を介して、受信した映像データをディスプレイ及びスピーカに出力するとともに、自拠点でのリアルタイムな音声データを他機器に送信する。なお、カメラ機能を有している場合は、映像音声データを送受信することが可能となる。 The PDA 15 does not have a camera function and cannot transmit real-time video data of its own base. The PDA 15 outputs the received video data to the display and the speaker via the communication network 10 and transmits real-time audio data at the local site to other devices. If the camera function is provided, video / audio data can be transmitted and received.

携帯電話１６は、ＣＣＤカメラなどのカメラ付きの携帯電話であり、自拠点でのリアルタイムな映像音声データをカメラ及びマイクから取得し、通信網１０を介して他機器に送信する。また、受信した映像音声データを自装置のディスプレイ及びスピーカに出力する。また、携帯電話１６は、ＰＤＣ（ＰｅｒｓｏｎａｌＤｉｇｉｔａｌＣｏｍｍｕｎｉｃａｔｉｏｎｓ）方式、ＣＤＭＡ（ＣｏｄｅＤｉｖｉｓｉｏｎＭｕｌｔｉｐｌｅＡｃｃｅｓｓ）方式、ＧＳＭ（ＧｌｏｂａｌＳｙｓｔｅｍｆｏｒＭｏｂｉｌｅＣｏｍｍｕｎｉｃａｔｉｏｎｓ）方式、Ｗ−ＣＤＭＡ（ＷｉｄｅｂａｎｄーＣｏｄｅＤｉｖｉｓｉｏｎＭｕｌｔｉｐｌｅＡｃｃｅｓｓ）方式、ＣＤＭＡ１ｘ（ＣｏｄｅＤｉｖｉｓｉｏｎＭｕｌｔｉｐｌｅＡｃｃｅｓｓ）方式、及びＬＴＥ（ＬｏｎｇＴｅｒｍＥｖｏｌｕｔｉｏｎ）などのうち、いずれの通信方式を用いてもよい。また、携帯電話１６は、ＳＤカードなどの記録媒体である蓄積メディアを装着可能なスロット部を有しており、記録メディアに記録されているデータを、コミュニケーションサービスに参加している他機器と共有することが可能である。さらに、携帯電話１６による通信網１０への接続は、ＷｉＭＡＸなど他の無線通信機能を用いてもよい。 The mobile phone 16 is a mobile phone with a camera such as a CCD camera, and acquires real-time video and audio data at the local site from the camera and microphone and transmits them to other devices via the communication network 10. Also, the received video / audio data is output to the display and speaker of the device itself. Further, the mobile phone 16 includes a PDC (Personal Digital Communications) system, a CDMA (Code Division Multiple Access) system, a GSM (Global System for Mobile Communications) system, and a W-CDMA (Wide CDMA (Wide CDMA) system. Any communication method may be used, such as a Division Multiple Access (LTE) method and LTE (Long Term Evolution). In addition, the mobile phone 16 has a slot portion in which a storage medium that is a recording medium such as an SD card can be mounted, and data recorded on the recording medium is shared with other devices participating in the communication service. Is possible. Furthermore, connection to the communication network 10 by the mobile phone 16 may use other wireless communication functions such as WiMAX.

ディスクトップＰＣ１９は、カメラ機能を内蔵しており、自拠点でのリアルタイムな映像音声データをカメラ及びマイクから取得し、インターネット１８と通信網１０を介して他機器に送信する。なお、インターネット１８と通信網１０とは、インターネットサービスプロバイダのサーバ１７を介して接続しているものとする。また、ディスクトップＰＣ１９は、受信した映像音声データを自装置のディスプレイ及びスピーカに出力する。なお、ディスクトップＰＣ１９は、光ディスク又はＳＤカードなどの記憶媒体である蓄積メディアの読み取りが可能なデバイスと、外付けＨＤＤ又は内部メモリとのうち１以上を有しており、これらに記録されているデータを、コミュニケーションサービスに参加している他機器と共有することが可能である。 The desktop PC 19 has a built-in camera function, acquires real-time video and audio data at its own location from the camera and microphone, and transmits them to other devices via the Internet 18 and the communication network 10. It is assumed that the Internet 18 and the communication network 10 are connected via a server 17 of an Internet service provider. Further, the desktop PC 19 outputs the received video / audio data to the display and the speaker of its own apparatus. The desktop PC 19 has at least one of a device capable of reading a storage medium such as an optical disk or an SD card and an external HDD or an internal memory, and is recorded on these. Data can be shared with other devices participating in the communication service.

また、各機器は、ネットワーク上の位置を示すアドレス情報として電話番号又はＩＰアドレスが割り当てられているものとする。なお、ＩＰｖ６対応のＩＰアドレスを用いることで、各機器が物理的に移動しても、同じアドレスを用いてコミュニケーションサービスに参加することが可能となる。 Each device is assigned a telephone number or an IP address as address information indicating a position on the network. Note that by using an IPv6-compatible IP address, it is possible to participate in the communication service using the same address even if each device physically moves.

また、各機器は、コミュニケーションサービスに参加している他機器へマルチキャストで映像音声データの送信を行なってもよい。また、特定の機器（例えばコミュニケーション装置１１）をサーバと設定し、サーバが他機器から映像音声データを受信して処理を行なった後、他機器へマルチキャストしてもよい。 Further, each device may transmit the video / audio data by multicast to other devices participating in the communication service. In addition, a specific device (for example, the communication device 11) may be set as a server, and after the server receives and processes video / audio data from another device, it may be multicast to the other device.

以上により、コミュニケーション装置１１とコミュニケーション装置１２は、複数拠点に位置する各機器が送信したリアルタイムな映像や音声を出力することが可能となり、よりリアルなコミュニケーションサービスを提供することができる。 As described above, the communication device 11 and the communication device 12 can output real-time video and audio transmitted from each device located at a plurality of locations, and can provide a more realistic communication service.

次に、コミュニケーション装置１１及び１２が備える入出力装置の配置について説明する。図１９は、コミュニケーション装置１１及び１２の構成の一例を示す図である。 Next, the arrangement of input / output devices included in the communication devices 11 and 12 will be described. FIG. 19 is a diagram illustrating an example of the configuration of the communication devices 11 and 12.

コミュニケーション装置１１及び１２は、本体２０と、ディスプレイ２１ａ、２１ｂ及び２１ｃと、カメラ２２ａ、２２ｂ、２２ｃ、２２ｄ及び２２ｅと、マイク２３と、スピーカ２４ａ、２４ｂ及び２４ｃと、リモコン２６とを備えている。また、各入出力装置（ディスプレイ２１ａ、２１ｂ及び２１ｃ、カメラ２２ａ、２２ｂ、２２ｃ、２２ｄ及び２２ｅ、マイク２３、スピーカ２４ａ、２４ｂ及び２４ｃ、及びリモコン２６）は、本体２０と接続されている。この接続は、有線回線であっても無線回線であってもよい。また、コミュニケーションサービスに参加する１人以上のユーザは、ディスプレイ２１ａ、２１ｂ及び２１ｃの方向に向いて机２５の席につくことを想定している。 The communication devices 11 and 12 include a main body 20, displays 21a, 21b, and 21c, cameras 22a, 22b, 22c, 22d, and 22e, a microphone 23, speakers 24a, 24b, and 24c, and a remote control 26. . Each input / output device (displays 21a, 21b and 21c, cameras 22a, 22b, 22c, 22d and 22e, microphone 23, speakers 24a, 24b and 24c, and remote control 26) is connected to the main body 20. This connection may be a wired line or a wireless line. In addition, it is assumed that one or more users participating in the communication service are seated on the desk 25 facing the displays 21a, 21b, and 21c.

本体２０は、ＣＰＵ及びメモリを備えた情報処理装置である。本体２０は、各入出力装置の制御と、入出力装置から入力された映像音声データの符号化処理と、通信網１０を介した通信制御処理と、通信網１０を介して受信した映像音声データの復号化処理と、復号化した映像音声データの入出力装置への出力処理などとを行なう。 The main body 20 is an information processing apparatus including a CPU and a memory. The main body 20 controls each input / output device, encodes video / audio data input from the input / output device, performs communication control processing via the communication network 10, and receives video / audio data received via the communication network 10. And the output processing of the decoded video / audio data to the input / output device.

ディスプレイ２１ａ、２１ｂ及び２１ｃは、映像などを表示する装置であり、例えば、ＬＣＤ（ＬｉｇｕｉｄＣｒｙｓｔａｌＤｉｓｐｌａｙ）又はＰＤＰ（ＰｌａｓｍａＤｉｓｐｌａｙＰａｎｅｌ）である。このディスプレイ２１ａ、２１ｂ及び２１ｃは、コミュニケーションサービスに参加するユーザの正面に位置するよう、机２５の前面に並べて配置される。ここでは３個のディスプレイが接続されている。この３個のディスプレイには、他拠点での参加者の映像が、机２５の席についているよう表示される。つまり、他拠点の映像は、ディスプレイの個数に合わせて分割して表示される。なお、４個以上のディスプレイを接続してもよい。 The displays 21a, 21b, and 21c are devices that display video and the like, and are, for example, an LCD (Liquid Crystal Display) or a PDP (Plasma Display Panel). The displays 21a, 21b and 21c are arranged side by side on the front surface of the desk 25 so as to be positioned in front of the user participating in the communication service. Here, three displays are connected. On these three displays, images of participants at other bases are displayed as if they are seated at the desk 25. That is, the video of another base is divided and displayed according to the number of displays. Four or more displays may be connected.

カメラ２２ａ、２２ｂ、２２ｃ、２２ｄ及び２２ｅは、デジタルビデオカメラなどの動画撮影機能を有する撮影装置である。このカメラ２２ａ、２２ｂ、２２ｃ、２２ｄ及び２２ｅは、ディスプレイ２１ａ、２１ｂ及び２１ｃの上部に配置される。ここでは５個のカメラが接続されている。カメラ２２ａは、左に位置するディスプレイ２１ａの左右方向の中央に配置される。カメラ２２ｅは、右に位置するディスプレイ２１ｃの左右方向の中央に配置される。カメラ２２ｂ、２２ｃ及び２２ｄは、中央に位置するディスプレイ２１ｂの左右方向の中央に並べて配置される。また、隣り合わせに配置されたカメラの撮影対象は、映像の端が一部重なるものとする。これにより、コミュニケーション装置１１及び１２は、ディスプレイ２１ａ、２１ｂ及び２１ｃの方向に向いて机２５の席についたユーザの映像を、切れ目なく撮影して、他拠点に送信することが可能となる。 The cameras 22a, 22b, 22c, 22d, and 22e are photographing devices having a moving image photographing function such as a digital video camera. The cameras 22a, 22b, 22c, 22d and 22e are arranged on the tops of the displays 21a, 21b and 21c. Here, five cameras are connected. The camera 22a is disposed at the center in the left-right direction of the display 21a located on the left. The camera 22e is disposed at the center in the left-right direction of the display 21c located on the right. The cameras 22b, 22c and 22d are arranged side by side at the center in the left-right direction of the display 21b located at the center. In addition, it is assumed that the shooting targets of the cameras arranged adjacent to each other partially overlap the video edges. As a result, the communication devices 11 and 12 can capture the video of the user who is seated at the desk 25 facing the direction of the displays 21a, 21b, and 21c, and can transmit it to other bases.

マイク２３は、周辺の音声の集音を行なう入力装置である。このマイク２３は、机２５の中央に配置される。また、机２５の席についたユーザの人数に合わせた個数の指向性マイクを、各ユーザの正面位置するように配置してもよい。また、１個の無指向性マイクと１個以上の指向性マイクを組み合わせて配置してもよい。これにより、他拠点において、どの方向からの音声かを把握することが可能となり、他拠点は音声を出力する方向を制御することが可能となる。 The microphone 23 is an input device that collects surrounding sounds. The microphone 23 is arranged at the center of the desk 25. Moreover, you may arrange | position the directional microphone of the number according to the number of users who seated on the desk 25 so that it may be located in front of each user. One omnidirectional microphone and one or more directional microphones may be arranged in combination. As a result, it is possible to grasp the direction from which the sound is transmitted at the other site, and the other site can control the direction in which the sound is output.

スピーカ２４ａ、２４ｂ及び２４ｃは、ディスプレイ２１ａ、２１ｂ及び２１ｃの背後に配置される。ディスプレイの個数に合わせて、ここでは３個のスピーカが接続されている。これにより、ディスプレイ２１ａ、２１ｂ及び２１ｃに表示されている映像に合わせて、音声を出力するスピーカを制御することが可能となる。つまり、ディスプレイ２１ａに表示されているユーザの声が、スピーカ２４ａから出力されることになる。 The speakers 24a, 24b and 24c are arranged behind the displays 21a, 21b and 21c. Here, three speakers are connected in accordance with the number of displays. This makes it possible to control the speaker that outputs sound in accordance with the images displayed on the displays 21a, 21b, and 21c. That is, the user's voice displayed on the display 21a is output from the speaker 24a.

リモコン２６は、ユーザからの入力指示を受け、本体２０への操作入力信号を送信する操作入力装置である。なお、リモコン２６は、机２５の席についたユーザにより操作可能であればよい。また、ここでは、操作入力装置はリモコンとしたが、キーボード及びマウスなど、他の操作入力装置を用いてもよい。また、机２５の席についたユーザの人数に合わせた個数の操作入力装置を、各ユーザの正面に位置するように配置してもよい。 The remote control 26 is an operation input device that receives an input instruction from a user and transmits an operation input signal to the main body 20. The remote controller 26 may be operated by a user who sits on the desk 25. Although the operation input device is a remote control here, other operation input devices such as a keyboard and a mouse may be used. Moreover, you may arrange | position the operation input device of the number according to the number of users who seated on the desk 25 so that it may be located in front of each user.

以上により、コミュニケーション装置１１及び１２は、他拠点とのコミュニケーションサービスを提供することが可能となる。つまり、ユーザがリモコン２６を操作して接続先（他拠点）を設定して通信を確立し、コミュニケーションサービスを開始する。コミュニケーションサービス実行中は、カメラ２２ａ、２２ｂ、２２ｃ、２２ｄ及び２２ｅが、机２５の席についたユーザの映像を撮影し、同時に、マイク２３が、音声を収音する。 As described above, the communication devices 11 and 12 can provide communication services with other bases. That is, the user operates the remote control 26 to set a connection destination (another base), establish communication, and start a communication service. During the execution of the communication service, the cameras 22a, 22b, 22c, 22d, and 22e capture the image of the user who sits on the desk 25, and at the same time, the microphone 23 collects sound.

本体２０は、カメラ２２ａ、２２ｂ、２２ｃ、２２ｄ及び２２ｅとマイク２３とから、自拠点のリアルタイムな映像音声データを取得し、取得した映像音声データに符号化処理を行い、符号化した映像音声データを他拠点へ送信する。また、本体２０は、他拠点の映像音声データを受信し、受信した映像音声データに復号化処理を行い、復号化した映像音声データをディスプレイ２１ａ、２１ｂ及び２１ｃとスピーカ２４ａ、２４ｂ及び２４ｃとへ出力する。 The main body 20 acquires real-time video / audio data of the local site from the cameras 22a, 22b, 22c, 22d and 22e and the microphone 23, performs encoding processing on the acquired video / audio data, and encodes the encoded video / audio data. To other locations. Further, the main body 20 receives the video / audio data of the other base, performs a decoding process on the received video / audio data, and sends the decoded video / audio data to the displays 21a, 21b and 21c and the speakers 24a, 24b and 24c. Output.

これにより、複数拠点間で相互にリアルタイムな映像及び音声を出力することが可能となり、よりリアルなコミュニケーションサービスを提供することができる。また、コミュニケーションサービス実行中に、ユーザがリモコン２６を操作して、光ディスク又はＳＤカードなどの記憶媒体である蓄積メディアに記録されているデータを取得し、通信網１０を介して送受信することで、コミュニケーションサービスに参加している他機器と当該データを共有することが可能となる。 Thereby, it becomes possible to mutually output real-time video and audio between a plurality of sites, and a more realistic communication service can be provided. In addition, during execution of the communication service, the user operates the remote control 26 to acquire data recorded in a storage medium that is a storage medium such as an optical disc or an SD card, and transmits / receives the data via the communication network 10. The data can be shared with other devices participating in the communication service.

図２０は、コミュニケーション装置１１及び１２の構成を示すブロック図である。
コミュニケーション装置１１及び１２は、制御部３０、音声符号化部３１、音声復号化部３２、画像符号化部３３、画像復号化部３４、電源回路３５、タイマー回路３６、音声処理部３７、音声出力部３８、音声入力部３９、表示処理部４０、表示部４１、画像処理部４２、画像入力部４３、操作入力制御部４４、操作入力部４５、通信制御部４６、及び送受信回路４７を備える。各処理部は、バスラインを通じて互いに接続されている。また、必要に応じて、バスラインには、ハードディスク装置４８及び読取装置４９を接続することが可能である。ハードディスク装置４８と読取装置４９とは、それぞれインタフェースを通じてバスラインに接続される。 FIG. 20 is a block diagram showing the configuration of the communication devices 11 and 12.
The communication devices 11 and 12 include a control unit 30, a voice encoding unit 31, a voice decoding unit 32, an image encoding unit 33, an image decoding unit 34, a power supply circuit 35, a timer circuit 36, a voice processing unit 37, a voice output. A unit 38, an audio input unit 39, a display processing unit 40, a display unit 41, an image processing unit 42, an image input unit 43, an operation input control unit 44, an operation input unit 45, a communication control unit 46, and a transmission / reception circuit 47. Each processing unit is connected to each other through a bus line. If necessary, a hard disk device 48 and a reading device 49 can be connected to the bus line. The hard disk device 48 and the reading device 49 are each connected to a bus line through an interface.

制御部３０は、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）５０、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）５１、及びＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）５２を備え、コミュニケーション装置全体の制御を行う。ＣＰＵ５０は、単一のＣＰＵで構成されても良く、複数のＣＰＵで構成されても良い。ＲＯＭ５１は、ＣＰＵ５０の動作を規定するコンピュータプログラムを記憶している。コンピュータプログラムは、ハードディスク装置４８に記憶させることもできる。ＣＰＵ５０は、ＲＯＭ５１又はハードディスク装置４８が格納するコンピュータプログラムを、必要に応じてＲＡＭ５２に書き込みつつ、コンピュータプログラムが規定する処理を実行する。ＲＡＭ５２は、ＣＰＵ５０が処理を実行するのに伴って発生するデータを一時的に記憶する媒体としても機能する。ＲＯＭ５１には、フラッシュＲＯＭのように書き込みが可能で、電源を切っても記憶内容を保持できる不揮発性のメモリ及び記憶媒体も含まれる。また、ＲＡＭ５２には、電源を切ると記憶内容が保持されない揮発性のメモリ及び記憶媒体が含まれる。 The control unit 30 includes a CPU (Central Processing Unit) 50, a ROM (Read Only Memory) 51, and a RAM (Random Access Memory) 52, and controls the entire communication apparatus. The CPU 50 may be composed of a single CPU or a plurality of CPUs. The ROM 51 stores a computer program that defines the operation of the CPU 50. The computer program can also be stored in the hard disk device 48. The CPU 50 executes processing defined by the computer program while writing the computer program stored in the ROM 51 or the hard disk device 48 into the RAM 52 as necessary. The RAM 52 also functions as a medium for temporarily storing data generated as the CPU 50 executes processing. The ROM 51 includes a nonvolatile memory and a storage medium that can be written like a flash ROM and can retain stored contents even when the power is turned off. In addition, the RAM 52 includes a volatile memory and a storage medium that do not retain stored contents when the power is turned off.

音声符号化部３１は、音声処理部３７から通知された音声データを、特定の符号化方法によって圧縮符号化することにより、符号化音声データに変換し、変換した符号化音声データを通信制御部４６に出力する。 The audio encoding unit 31 converts the audio data notified from the audio processing unit 37 into encoded audio data by compression encoding using a specific encoding method, and converts the converted encoded audio data into a communication control unit. Output to 46.

音声復号化部３２は、通信制御部４６から通知された符号化音声データを、特定の復号化方法で復号化することにより再生可能な音声データを生成し、生成した音声データを音声処理部３７に出力する。 The audio decoding unit 32 generates reproducible audio data by decoding the encoded audio data notified from the communication control unit 46 by a specific decoding method, and the generated audio data is converted into an audio processing unit 37. Output to.

画像符号化部３３は、画像処理部４２から通知された画像データを、特定の符号化方法によって圧縮符号化することにより、符号化画像データに変換し、変換した符号化画像データを通信制御部４６に出力する。 The image encoding unit 33 converts the image data notified from the image processing unit 42 into encoded image data by compression encoding using a specific encoding method, and converts the converted encoded image data into a communication control unit. Output to 46.

画像復号化部３４は、通信制御部４６から通知された符号化画像データを、特定の復号化方法で復号化することにより再生可能な画像データを生成し、生成した画像データを表示処理部４０に出力する。 The image decoding unit 34 generates reproducible image data by decoding the encoded image data notified from the communication control unit 46 by a specific decoding method, and the generated image data is displayed on the display processing unit 40. Output to.

電源回路３５は、電源キーのオン操作により、バッテリーパック又はアダプタ経由で受け取った電力を各処理部に供給することにより、コミュニケーション装置を動作可能な状態に起動する。また、コミュニケーション装置１１及び１２は、通常モード及び省電力モードを含む複数のモードを備えでもよい。省電力モード時には、電源回路３５は、一部の処理部にのみに電力を供給することで、必要な電力を低減することができる。例えば、コミュニケーション装置１１及び１２は、他拠点からの着呼待ち状態の場合、通信制御部４６及び送受信回路４７のみに電力を供給する省電力モードで動作し、着呼を受けた際に省電力モードから通常モードに遷移することで、他処理部の動作を開始してもよい。 The power supply circuit 35 activates the communication device in an operable state by supplying power received via the battery pack or the adapter to each processing unit by turning on the power key. The communication devices 11 and 12 may include a plurality of modes including a normal mode and a power saving mode. In the power saving mode, the power supply circuit 35 can reduce the necessary power by supplying power only to some of the processing units. For example, the communication devices 11 and 12 operate in a power saving mode in which power is supplied only to the communication control unit 46 and the transmission / reception circuit 47 when waiting for an incoming call from another base. The operation of the other processing unit may be started by transitioning from the mode to the normal mode.

タイマー回路３６は、一定の周期でタイマー割込信号を出力する装置である。
音声出力部３８は、音声データを出力する装置であり、図１９で説明したスピーカ２４ａ、２４ｂ及び２４ｃに相当する。音声出力部３８は、音声処理部３７から通知された音声データを出音する。 The timer circuit 36 is a device that outputs a timer interrupt signal at a constant cycle.
The audio output unit 38 is a device that outputs audio data, and corresponds to the speakers 24a, 24b, and 24c described in FIG. The audio output unit 38 outputs the audio data notified from the audio processing unit 37.

音声入力部３９は、周辺の音声の集音を行なう入力装置であり、図１９で説明したマイク２３に相当する。音声入力部３９は、集音した音声信号を音声処理部３７に通知する。 The voice input unit 39 is an input device that collects peripheral voices, and corresponds to the microphone 23 described with reference to FIG. The voice input unit 39 notifies the voice processing unit 37 of the collected voice signal.

音声処理部３７は、１つ以上の音声入力部３９から通知された音声信号をデジタル変換し、変換したデジタル音声信号に対して合成又は加工処理などを行い、当該処理を行った音声信号を音声符号化部３１に通知する。また、音声処理部３７が、ノイズキャンセル機能などにより、よりクリアな音声データを生成することもできる。また、音声入力部３９が複数個存在する場合、音声処理部３７は、複数個の音声入力部３９の配置情報を管理し、配置に適した音声データを生成することが可能となる。さらに、音声処理部３７は、音声復号化部３２から通知された音声データの分割及び加工処理などを行い、生成した音声データを音声出力部３８に通知する。音声処理部３７は、複数個の音声出力部３８の配置情報を管理し、各音声出力部３８に適した複数個の音声データを生成することが可能となる。 The audio processing unit 37 digitally converts the audio signal notified from the one or more audio input units 39, performs synthesis or processing on the converted digital audio signal, and converts the audio signal subjected to the processing into audio Notify the encoding unit 31. Further, the sound processing unit 37 can generate clearer sound data by a noise cancellation function or the like. Further, when there are a plurality of voice input units 39, the voice processing unit 37 can manage arrangement information of the plurality of voice input units 39 and generate voice data suitable for the arrangement. Further, the voice processing unit 37 performs division and processing of the voice data notified from the voice decoding unit 32 and notifies the generated voice data to the voice output unit 38. The sound processing unit 37 can manage arrangement information of the plurality of sound output units 38 and generate a plurality of sound data suitable for each sound output unit 38.

表示部４１は、画像及び文字等を表示する装置である。図１９で説明したディスプレイ２１ａ、２１ｂ及び２１ｃに相当する。表示部４１は、表示処理部４０から通知された表示データを画面に表示する。 The display unit 41 is a device that displays images, characters, and the like. This corresponds to the displays 21a, 21b, and 21c described in FIG. The display unit 41 displays the display data notified from the display processing unit 40 on the screen.

表示処理部４０は、画像復号化部３４から通知された画像データの分割及び加工処理などを行い、生成した表示データを表示部４１に通知する。表示処理部４０は、複数個の表示部４１の配置情報を管理し、各表示部４１に適した複数個の表示データを生成することが可能となる。また、表示処理部４０は、画像処理部４２から自拠点の画像データを受け取り、表示部４１に出力することで、自拠点の画像を表示することも可能である。 The display processing unit 40 divides and processes the image data notified from the image decoding unit 34 and notifies the display unit 41 of the generated display data. The display processing unit 40 manages the arrangement information of the plurality of display units 41, and can generate a plurality of display data suitable for each display unit 41. The display processing unit 40 can also display the image of the local site by receiving the image data of the local site from the image processing unit 42 and outputting the image data to the display unit 41.

画像入力部４３は、デジタルビデオカメラなどの動画撮影機能を有する撮影装置であり、図１９で説明したカメラ２２ａ、２２ｂ、２２ｃ、２２ｄ及び２２ｅに相当する。画像入力部４３は、撮影した画像データを画像処理部４２に通知する。また、画像入力部４３は、デジタルスチルカメラなどの静止画撮影機能のみの撮影装置であってもよく、撮影した静止画データを一定間隔で画像処理部４２に通知してもよい。 The image input unit 43 is a shooting device having a moving image shooting function such as a digital video camera, and corresponds to the cameras 22a, 22b, 22c, 22d, and 22e described in FIG. The image input unit 43 notifies the image processing unit 42 of the captured image data. The image input unit 43 may be a photographing device having only a still image photographing function, such as a digital still camera, and may notify the image processing unit 42 of photographed still image data at regular intervals.

画像処理部４２は、１つ以上の画像入力部４３から通知される画像データの合成及び加工処理などを行い、画像符号化部３３に通知する。画像処理部４２は、複数個の画像入力部４３の配置情報を管理し、配置に適した画像データを生成することが可能となる。また、画像処理部４２は、表示処理部４０に通知することで、自拠点の画像データを表示部４１に直接表示することも可能である。 The image processing unit 42 performs composition and processing of image data notified from one or more image input units 43 and notifies the image encoding unit 33 of the result. The image processing unit 42 can manage the arrangement information of the plurality of image input units 43 and generate image data suitable for the arrangement. The image processing unit 42 can also display the image data of its own base directly on the display unit 41 by notifying the display processing unit 40.

操作入力部４５は、ユーザからの入力指示を受け、当該入力指示に対応する操作入力信号を操作入力制御部４４に通知する装置であり、図１９で説明したリモコン２６に相当する。なお、操作入力部４５は、キーボード及びマウスなど、他の操作入力装置であってもよい。また、操作入力部４５は、ジャイロ機能及びセンサー機能などを用いて、入力指示を受けてもよい。 The operation input unit 45 is a device that receives an input instruction from the user and notifies the operation input control unit 44 of an operation input signal corresponding to the input instruction, and corresponds to the remote controller 26 described with reference to FIG. The operation input unit 45 may be another operation input device such as a keyboard and a mouse. The operation input unit 45 may receive an input instruction using a gyro function, a sensor function, or the like.

操作入力制御部４４は、操作入力部４５から通知された操作入力信号を受け取り、受け取った操作入力信号を対応する制御データに変換したうえで制御部３０に出力する。 The operation input control unit 44 receives the operation input signal notified from the operation input unit 45, converts the received operation input signal into corresponding control data, and outputs the control data to the control unit 30.

送受信回路４７は、ネットワークを介したデータ送受信を行なう回路である。例えば、ネットワークが無線回線の場合、送受信回路４７は、通信制御部４６から受け取った送信データを、所定の方式で変調し、変調した送受信データを無線搬送波に乗せて送信する機能と、アンテナに誘起した高周波信号の中から所定の周波数帯の信号を受信し、受信した信号を復調したうえで通信制御部４６に通知する機能を有する。 The transmission / reception circuit 47 is a circuit that performs data transmission / reception via a network. For example, when the network is a wireless line, the transmission / reception circuit 47 modulates transmission data received from the communication control unit 46 by a predetermined method, transmits the modulated transmission / reception data on a wireless carrier wave, and induces to the antenna. It has a function of receiving a signal in a predetermined frequency band from among the high-frequency signals, and notifying the communication control unit 46 after demodulating the received signal.

通信制御部４６は、送受信回路４７を用いて、他機器と自身との間で通信を確立したうえで、ネットワークを介した映像音声データの送受信を行なう。例えば、呼接続制御（ＣａｌｌＣｏｎｔｒｏｌ）機能、及びデータ通信制御（ＩＰ、ＲＴＰ、ＴＣＰなど）機能を有している。また、音声符号化部３１から通知された符号化音声データと、画像符号化部３３から通知された符号化画像データとを受け取り、受け取った符号化音声データ及び符号化画像データを所定の方式で多重化することにより、送信する映像音声データを生成する。また、受信した映像音声データを、所定の方式で多重分離することにより、符号化音声データと符号化画像データとに分離し、分離した符号化音声データを音声復号化部３２に通知し、分離した符号化画像データを画像復号化部３４に通知する。また、コミュニケーション装置１１及び１２が複数の異なるネットワークに接続可能である場合、コミュニケーション装置１１及び１２は、各ネットワークに対応する複数個の通信制御部４６と送受信回路４７とを備える構成でもよい。 The communication control unit 46 uses the transmission / reception circuit 47 to establish communication between the other device and itself, and transmits / receives video / audio data via the network. For example, it has a call connection control (Call Control) function and a data communication control (IP, RTP, TCP, etc.) function. Also, the encoded audio data notified from the audio encoding unit 31 and the encoded image data notified from the image encoding unit 33 are received, and the received encoded audio data and encoded image data are received by a predetermined method. By multiplexing, video / audio data to be transmitted is generated. Further, the received video / audio data is demultiplexed by a predetermined method to separate into encoded audio data and encoded image data, and the separated encoded audio data is notified to the audio decoding unit 32 for separation. The encoded image data is notified to the image decoding unit 34. Further, when the communication devices 11 and 12 can be connected to a plurality of different networks, the communication devices 11 and 12 may include a plurality of communication control units 46 and transmission / reception circuits 47 corresponding to the respective networks.

ハードディスク装置４８は、内蔵するハードディスクに対して、コンピュータプログラム、又はデータを書き込み及び読み出す装置である。 The hard disk device 48 is a device that writes and reads a computer program or data to and from the built-in hard disk.

読取装置４９は、記録媒体（例えばＣＤ、ＤＶＤ又はメモリカードなど）に記録されたコンピュータプログラム、又はデータを読み取る装置である。 The reading device 49 is a device that reads a computer program or data recorded on a recording medium (for example, a CD, a DVD, or a memory card).

なお、上述した音量監視装置１００〜８００に含まれる送信部１０４以外の処理部は、音声処理部３７に相当し、送信部１０４及び受信部１０５は、通信制御部４６及び送受信回路４７に相当する。また、スピーカ１１１及び１２１は、音声出力部３８に相当し、マイクロホン１１２及び１２２は、音声入力部３９に相当し、表示部１０６は、表示処理部４０及び表示部４１に相当する。 The processing units other than the transmission unit 104 included in the volume monitoring devices 100 to 800 described above correspond to the audio processing unit 37, and the transmission unit 104 and the reception unit 105 correspond to the communication control unit 46 and the transmission / reception circuit 47. . The speakers 111 and 121 correspond to the audio output unit 38, the microphones 112 and 122 correspond to the audio input unit 39, and the display unit 106 corresponds to the display processing unit 40 and the display unit 41.

以上のように、コミュニケーション装置１１及び１２は、複数の入出力装置を備えたコンピュータとして構成されており、他拠点とのコミュニケーションサービスを提供することが可能となる。 As described above, the communication devices 11 and 12 are configured as computers having a plurality of input / output devices, and can provide communication services with other bases.

（その他変形例）
なお、本発明を上記実施の形態に基づいて説明してきたが、本発明は、上記の実施の形態に限定されないのはもちろんである。以下のような場合も本発明に含まれる。 (Other variations)
Although the present invention has been described based on the above embodiment, it is needless to say that the present invention is not limited to the above embodiment. The following cases are also included in the present invention.

（１）上記の各装置は、具体的には、マイクロプロセッサ、ＲＯＭ、ＲＡＭ、ハードディスクユニット、ディスプレイユニット、キーボード、マウスなどから構成されるコンピュータシステムである。前記ＲＡＭ又はハードディスクユニットには、コンピュータプログラムが記憶されている。前記マイクロプロセッサが、前記コンピュータプログラムにしたがって動作することにより、各装置は、その機能を達成する。ここでコンピュータプログラムは、所定の機能を達成するために、コンピュータに対する指令を示す命令コードが複数個組み合わされて構成されたものである。 (1) Each of the above devices is specifically a computer system including a microprocessor, ROM, RAM, a hard disk unit, a display unit, a keyboard, a mouse, and the like. A computer program is stored in the RAM or hard disk unit. Each device achieves its functions by the microprocessor operating according to the computer program. Here, the computer program is configured by combining a plurality of instruction codes indicating instructions for the computer in order to achieve a predetermined function.

（２）上記の各装置を構成する構成要素の一部又は全部は、１個のシステムＬＳＩ（ＬａｒｇｅＳｃａｌｅＩｎｔｅｇｒａｔｉｏｎ：大規模集積回路）から構成されているとしてもよい。システムＬＳＩは、複数の構成部を１個のチップ上に集積して製造された超多機能ＬＳＩであり、具体的には、マイクロプロセッサ、ＲＯＭ及びＲＡＭなどを含んで構成されるコンピュータシステムである。前記ＲＡＭには、コンピュータプログラムが記憶されている。前記マイクロプロセッサが、前記コンピュータプログラムにしたがって動作することにより、システムＬＳＩは、その機能を達成する。 (2) A part or all of the constituent elements constituting each of the above-described devices may be constituted by one system LSI (Large Scale Integration). The system LSI is an ultra-multifunctional LSI manufactured by integrating a plurality of components on a single chip. Specifically, the system LSI is a computer system including a microprocessor, a ROM, a RAM, and the like. . A computer program is stored in the RAM. The system LSI achieves its functions by the microprocessor operating according to the computer program.

（３）上記の各装置を構成する構成要素の一部又は全部は、各装置に脱着可能なＩＣカード又は単体のモジュールから構成されているとしてもよい。前記ＩＣカード又は前記モジュールは、マイクロプロセッサ、ＲＯＭ、ＲＡＭなどから構成されるコンピュータシステムである。前記ＩＣカード又は前記モジュールは、上記の超多機能ＬＳＩを含むとしてもよい。マイクロプロセッサが、コンピュータプログラムにしたがって動作することにより、前記ＩＣカード又は前記モジュールは、その機能を達成する。このＩＣカード又はこのモジュールは、耐タンパ性を有するとしてもよい。 (3) Part or all of the constituent elements constituting each of the above devices may be configured from an IC card that can be attached to and detached from each device or a single module. The IC card or the module is a computer system including a microprocessor, a ROM, a RAM, and the like. The IC card or the module may include the super multifunctional LSI described above. The IC card or the module achieves its function by the microprocessor operating according to the computer program. This IC card or this module may have tamper resistance.

（４）本発明は、上記に示す方法であるとしてもよい。また、これらの方法をコンピュータにより実現するコンピュータプログラムであるとしてもよいし、前記コンピュータプログラムからなるデジタル信号であるとしてもよい。 (4) The present invention may be the method described above. Further, the present invention may be a computer program that realizes these methods by a computer, or may be a digital signal composed of the computer program.

また、本発明は、前記コンピュータプログラム又は前記デジタル信号をコンピュータ読み取り可能な記録媒体、例えば、フレキシブルディスク、ハードディスク、ＣＤ−ＲＯＭ、ＭＯ、ＤＶＤ、ＤＶＤ−ＲＯＭ、ＤＶＤ−ＲＡＭ、ＢＤ（Ｂｌｕ−ｒａｙＤｉｓｃ）、又は半導体メモリなどに記録したものとしてもよい。また、これらの記録媒体に記録されている前記デジタル信号であるとしてもよい。 The present invention also provides a computer-readable recording medium such as a flexible disk, hard disk, CD-ROM, MO, DVD, DVD-ROM, DVD-RAM, BD (Blu-ray Disc). Or recorded in a semiconductor memory or the like. The digital signal may be recorded on these recording media.

また、本発明は、前記コンピュータプログラム又は前記デジタル信号を、電気通信回線、無線又は有線通信回線、インターネットを代表とするネットワーク、データ放送等を経由して伝送するものとしてもよい。 Further, the present invention may transmit the computer program or the digital signal via an electric communication line, a wireless or wired communication line, a network represented by the Internet, a data broadcast, or the like.

また、本発明は、マイクロプロセッサとメモリとを備えたコンピュータシステムであって、前記メモリは、上記コンピュータプログラムを記憶しており、前記マイクロプロセッサは、前記コンピュータプログラムにしたがって動作するとしてもよい。 The present invention may be a computer system including a microprocessor and a memory, wherein the memory stores the computer program, and the microprocessor operates according to the computer program.

また、前記プログラム又は前記デジタル信号を前記記録媒体に記録して移送することにより、又は前記プログラム又は前記デジタル信号を、前記ネットワーク等を経由して移送することにより、独立した他のコンピュータシステムにより実施するとしてもよい。 In addition, the program or the digital signal is recorded on the recording medium and transferred, or the program or the digital signal is transferred via the network or the like and executed by another independent computer system. You may do that.

（５）上記実施の形態及び上記変形例をそれぞれ組み合わせるとしてもよい。 (5) The above embodiment and the above modifications may be combined.

本発明は、音量監視装置に適用でき、特に、音声会議システム、テレビ会議システム、及びハンズフリー電話などに用いられる音量監視装置に適用できる。 The present invention can be applied to a volume monitoring apparatus, and in particular, can be applied to a volume monitoring apparatus used for an audio conference system, a video conference system, a hands-free telephone, and the like.

本発明の実施の形態１に係る音響システムの構成を示す図である。It is a figure which shows the structure of the acoustic system which concerns on Embodiment 1 of this invention. 本発明の実施の形態１に係る音量監視装置の構成を示すブロック図である。It is a block diagram which shows the structure of the volume monitoring apparatus which concerns on Embodiment 1 of this invention. 本発明の実施の形態１に係る音量監視装置の動作の流れを示すフローチャートである。It is a flowchart which shows the flow of operation | movement of the volume monitoring apparatus which concerns on Embodiment 1 of this invention. 本発明の実施の形態１に係るスピーカにより拡声された音声信号が、マイクロホンで収音される状況を示す図である。It is a figure which shows the condition where the audio | voice signal amplified by the speaker which concerns on Embodiment 1 of this invention is picked up with a microphone. 本発明の実施の形態１に係るスピーカにより拡声されるスピーカ入力信号の一例を示す図である。It is a figure which shows an example of the speaker input signal amplified by the speaker which concerns on Embodiment 1 of this invention. 本発明の実施の形態１に係るマイクロホンより収音される直接波信号の一例を示す図である。It is a figure which shows an example of the direct wave signal picked up by the microphone which concerns on Embodiment 1 of this invention. 本発明の実施の形態２に係る音量監視装置の構成を示すブロック図である。It is a block diagram which shows the structure of the volume monitoring apparatus which concerns on Embodiment 2 of this invention. 本発明の実施の形態２に係る音響伝達特性の一例を示す図である。It is a figure which shows an example of the acoustic transmission characteristic which concerns on Embodiment 2 of this invention. 本発明の実施の形態２に係る音量監視装置の動作の流れを示すフローチャートである。It is a flowchart which shows the flow of operation | movement of the volume monitoring apparatus which concerns on Embodiment 2 of this invention. 本発明の実施の形態３に係る音量監視装置の構成を示すブロック図である。It is a block diagram which shows the structure of the volume monitoring apparatus which concerns on Embodiment 3 of this invention. 本発明の実施の形態３に係る音量監視装置の動作の流れを示すフローチャートである。It is a flowchart which shows the flow of operation | movement of the volume monitoring apparatus which concerns on Embodiment 3 of this invention. 本発明の実施の形態４に係る音量監視装置の構成を示すブロック図である。It is a block diagram which shows the structure of the volume monitoring apparatus which concerns on Embodiment 4 of this invention. 本発明の実施の形態４に係る音量監視装置の動作の流れを示すフローチャートである。It is a flowchart which shows the flow of operation | movement of the volume monitoring apparatus which concerns on Embodiment 4 of this invention. 本発明の実施の形態５に係る音量監視装置の構成を示すブロック図である。It is a block diagram which shows the structure of the volume monitoring apparatus which concerns on Embodiment 5 of this invention. 本発明の実施の形態６に係る音量監視装置の構成を示すブロック図である。It is a block diagram which shows the structure of the volume monitoring apparatus which concerns on Embodiment 6 of this invention. 本発明の実施の形態７に係る音量監視装置の構成を示すブロック図である。It is a block diagram which shows the structure of the volume monitoring apparatus which concerns on Embodiment 7 of this invention. 本発明の実施の形態８に係る音量監視装置の構成を示すブロック図である。It is a block diagram which shows the structure of the volume monitoring apparatus which concerns on Embodiment 8 of this invention. 本発明の実施の形態９に係る２拠点でのコミュニケーションシステムの構成を示す図である。It is a figure which shows the structure of the communication system in 2 bases concerning Embodiment 9 of this invention. 本発明の実施の形態９に係る多拠点でのコミュニケーションシステムの構成の一例を示す図である。It is a figure which shows an example of a structure of the communication system in the multiple bases concerning Embodiment 9 of this invention. 本発明の実施の形態９に係るコミュニケーション装置が備える入出力装置の配置の一例を示す図である。It is a figure which shows an example of arrangement | positioning of the input / output device with which the communication apparatus which concerns on Embodiment 9 of this invention is provided. 本発明の実施の形態９に係るコミュニケーション装置の構成の一例を示すブロック図である。It is a block diagram which shows an example of a structure of the communication apparatus which concerns on Embodiment 9 of this invention.

Explanation of symbols

１音響システム
２、３コミュニケーションシステム
１０通信網
１１、１２コミュニケーション装置
１３ノートＰＣ
１４カメラ
１５ＰＤＡ
１６携帯電話
１７サーバ
１８インターネット
１９ディスクトップＰＣ
２０本体
２１ａ、２１ｂ、２１ｃディスプレイ
２２ａ、２２ｂ、２２ｃ、２２ｄ、２２ｅカメラ
２３マイク
２４ａ、２４ｂ、２４ｃスピーカ
２５机
２６リモコン
３０制御部
３１音声符号化部
３２音声復号化部
３３画像符号化部
３４画像復号化部
３５電源回路
３６タイマー回路
３７音声処理部
３８音声出力部
３９音声入力部
４０表示処理部
４１表示部
４２画像処理部
４３画像入力部
４４操作入力制御部
４５操作入力部
４６通信制御部
４７送受信回路
５０ＣＰＵ
５１ＲＯＭ
５２ＲＡＭ
１００、２００、３００、４００、５００、６００、７００、８００音量監視装置
１０１、２０１距離算出部
１０２、２０２直接波音量算出部
１０３スピーカ位置音量算出部
１０４送信部
１０５受信部
１０６表示部
１１０、１２０ユーザ
１１１、１２１スピーカ
１１２、１２２、７１２ａ、７１２ｂマイクロホン
１３１距離情報
１３２直接波音量情報
１３３スピーカ位置音量情報
１４１反射物
１４２直接波
１４３反射波
１５３反射波信号
１６２、１６３ピーク
２０７音響伝達特性算出部
２３７音響伝達特性
３０８距離別音量算出部
３３８距離別音量情報
４０９、５０９残響特性算出部
４１０騒音レベル算出部
４１１距離別音量調整部
４３９残響特性
４４０騒音レベル
４４１調整距離別音量情報
６１２警告部
６４２警告情報
７１３無指向化部
７４２ａ、７４２ｂ収音信号
８１４拡声領域算出部
８４４拡声領域情報 DESCRIPTION OF SYMBOLS 1 Acoustic system 2, 3 Communication system 10 Communication network 11, 12 Communication apparatus 13 Notebook PC
14 Camera 15 PDA
16 Mobile phone 17 Server 18 Internet 19 Desktop PC
20 Main body 21a, 21b, 21c Display 22a, 22b, 22c, 22d, 22e Camera 23 Microphone 24a, 24b, 24c Speaker 25 Machine 26 Remote control 30 Control unit 31 Audio encoding unit 32 Audio decoding unit 33 Image encoding unit 34 Image Decoding unit 35 Power supply circuit 36 Timer circuit 37 Audio processing unit 38 Audio output unit 39 Audio input unit 40 Display processing unit 41 Display unit 42 Image processing unit 43 Image input unit 44 Operation input control unit 45 Operation input unit 46 Communication control unit 47 Transceiver circuit 50 CPU
51 ROM
52 RAM
100, 200, 300, 400, 500, 600, 700, 800 Volume monitoring device 101, 201 Distance calculation unit 102, 202 Direct wave volume calculation unit 103 Speaker position volume calculation unit 104 Transmission unit 105 Reception unit 106 Display unit 110, 120 User 111, 121 Speaker 112, 122, 712a, 712b Microphone 131 Distance information 132 Direct wave volume information 133 Speaker position volume information 141 Reflector 142 Direct wave 143 Reflected wave 153 Reflected wave signal 162, 163 Peak 207 Sound transfer characteristic calculation unit 237 Sound transfer characteristics 308 Sound volume calculation unit by distance 338 Volume information by distance 409, 509 Reverberation characteristic calculation unit 410 Noise level calculation unit 411 Sound volume adjustment unit by distance 439 Reverberation characteristic 440 Noise level 441 Volume information by adjustment distance 612 WARNING 642 warning information 713 non-directional unit 742a, 742b collected sound signal 814 loudspeaker area calculating unit 844 loudspeaker area information

Claims

The sound transmitted from the second voice communication terminal is used in an acoustic system including a first voice communication terminal and a second voice communication terminal that transmit and receive voice to and from each other via a communication network. A volume monitoring device for monitoring the volume of sound output from one voice communication terminal,
The first voice communication terminal is
A first speaker that emits voice based on a speaker input signal transmitted by the second voice communication terminal via the communication network;
A first microphone that generates a collected sound signal by collecting sound and transmits the collected sound signal to the second voice communication terminal;
The volume monitoring device includes:
A direct wave volume calculation unit that calculates a direct wave volume that is a volume of a direct wave among sounds output from the first speaker at the position of the first microphone included in the collected sound signal;
A distance calculation unit for calculating a distance between the first speaker and the first microphone;
Using the distance calculated by the distance calculator and the direct wave volume calculated by the direct wave volume calculator, the first speaker at a predetermined distance from the first speaker. A first volume calculation unit that calculates a first volume that is the volume of the sound that is output;
A volume monitoring device comprising: a transmission unit that transmits the first volume to the second voice communication terminal via the communication network.

The sound volume according to claim 1, wherein the first sound volume calculation unit calculates a speaker position sound volume that is a sound volume of sound output from the first speaker at the position of the first speaker as the first sound volume. Monitoring device.

The volume monitoring device further includes:
An acoustic transfer characteristic calculator that calculates an acoustic transfer characteristic between the first speaker and the first microphone;
The volume monitoring apparatus according to claim 1, wherein at least one of the direct wave volume calculation unit and the distance calculation unit calculates the direct wave volume or the distance using the acoustic transfer characteristic.

The distance calculation unit calculates a time delay until the sound output from the first speaker is picked up by the first microphone using the acoustic transfer characteristic, and calculates the distance from the time delay. To calculate
The volume monitoring apparatus according to claim 3, wherein the direct wave volume calculation unit calculates the direct wave volume using the time delay, the speaker input signal, and the acoustic transfer characteristic.

The first volume calculation unit
Using the distance calculated by the distance calculation unit and the direct wave volume calculated by the direct wave volume calculation unit, the sound output from the first speaker from the first speaker. The volume monitoring apparatus according to claim 1, further comprising: a distance-by-distance calculation unit that calculates a plurality of distance-by-distance volumes, which are sound volumes at a plurality of positions separated by different distances, as the first volume.

The volume monitoring device further includes:
A reverberation characteristic calculating unit for calculating a reverberation characteristic of a space in which the first speaker is installed;
The first volume calculation unit further includes:
A reverberation adjusting unit that adjusts the plurality of distance-based volumes calculated by the distance-based volume calculation unit using the reverberation characteristics;
The said transmission part transmits the said several sound volume according to distance adjusted by the said reverberation adjustment part to the said 2nd audio | voice communication terminal via the said communication network as said 1st sound volume. Volume monitoring device.

The volume monitoring device further includes:
An acoustic transfer characteristic calculator that calculates an acoustic transfer characteristic between the first speaker and the first microphone;
The volume monitoring device according to claim 6, wherein the reverberation characteristic calculation unit calculates the reverberation characteristic using the acoustic transfer characteristic.

The volume monitoring device further includes:
A noise level calculation unit that calculates a noise level that is a volume of noise in the first voice communication terminal;
The first calculation unit further includes:
A noise adjusting unit that adjusts the first volume to a substantially silent state when the first volume is equal to or lower than the noise level;
The volume according to any one of claims 1 to 7, wherein the transmission unit transmits the first volume adjusted by the noise adjustment unit to the second voice communication terminal via the communication network. Monitoring device.

The volume monitoring device further includes:
When the first volume is larger than a predetermined volume, the second voice communication terminal is warned via the communication network that an excessive sound has been loudened in the first voice communication terminal. The sound volume monitoring apparatus according to claim 1, further comprising a warning unit that transmits information.

The volume monitoring device further includes:
Using a plurality of distance-specific sound volumes calculated by the distance-specific sound volume calculating unit, calculating a distance from the first speaker at which the sound output from the first speaker is equal to or higher than a predetermined sound volume, A loudspeaker area calculation unit for generating loudspeaker area information indicating the distance,
The sound volume monitoring apparatus according to claim 5, wherein the transmission unit further transmits the loudspeak area information to the second voice communication terminal via the communication network.

The first microphone generates a sound collection signal having directivity,
The volume monitoring device further includes:
An omnidirectional unit that omnidirectionally collects the directivity sound-collected signal generated by the first microphone;
The volume monitoring apparatus according to any one of claims 1 to 10, wherein the direct wave volume calculation unit calculates the direct wave volume included in a sound collection signal made omnidirectional by the omnidirectional unit.

The sound transmitted from the second voice communication terminal is used in an acoustic system including a first voice communication terminal and a second voice communication terminal that transmit and receive voice to and from each other via a communication network. A volume monitoring method for monitoring the volume of sound output from one voice communication terminal,
The first voice communication terminal is
A first speaker that emits voice based on a speaker input signal transmitted by the second voice communication terminal via the communication network;
A first microphone that generates a collected sound signal by collecting sound and transmits the collected sound signal to the second voice communication terminal;
The volume monitoring method includes:
A direct wave volume calculating step for calculating a direct wave volume, which is a volume of a direct wave among sounds output from the first speaker at the position of the first microphone included in the sound pickup signal;
A distance calculating step for calculating a distance between the first speaker and the first microphone;
The first speaker at a predetermined distance from the first speaker using the distance calculated in the distance calculating step and the direct wave volume calculated in the direct wave volume calculating step. A first sound volume calculating step for calculating a first sound volume that is a sound volume of the sound to be output;
A transmission step of transmitting the first volume to the second voice communication terminal via the communication network.

The sound transmitted from the second voice communication terminal is used in an acoustic system including a first voice communication terminal and a second voice communication terminal that transmit and receive voice to and from each other via a communication network. A volume monitoring method program for monitoring the volume of sound output from one voice communication terminal,
The first voice communication terminal is
A first speaker that emits voice based on a speaker input signal transmitted by the second voice communication terminal via the communication network;
A first microphone that generates a collected sound signal by collecting sound and transmits the collected sound signal to the second voice communication terminal;
The program is
A direct wave volume calculating step for calculating a direct wave volume, which is a volume of a direct wave among sounds output from the first speaker at the position of the first microphone included in the sound pickup signal;
A distance calculating step for calculating a distance between the first speaker and the first microphone;
The first speaker at a predetermined distance from the first speaker using the distance calculated in the distance calculating step and the direct wave volume calculated in the direct wave volume calculating step. A first sound volume calculating step for calculating a first sound volume that is a sound volume of the sound to be output;
A program for causing a computer to execute a transmission step of transmitting the first volume to the second voice communication terminal via the communication network.

The sound transmitted from the second voice communication terminal is used in an acoustic system including a first voice communication terminal and a second voice communication terminal that transmit and receive voice to and from each other via a communication network. 1. An integrated circuit for monitoring the volume of sound output from one voice communication terminal,
The first voice communication terminal is
A first speaker that emits voice based on a speaker input signal transmitted by the second voice communication terminal via the communication network;
A first microphone that generates a collected sound signal by collecting sound and transmits the collected sound signal to the second voice communication terminal;
The integrated circuit comprises:
A direct wave volume calculation unit that calculates a direct wave volume that is a volume of a direct wave among sounds output from the first speaker at the position of the first microphone included in the collected sound signal;
A distance calculation unit for calculating a distance between the first speaker and the first microphone;
Using the distance calculated by the distance calculator and the direct wave volume calculated by the direct wave volume calculator, the first speaker at a predetermined distance from the first speaker. A first volume calculation unit that calculates a first volume that is the volume of the sound that is output;
An integrated circuit comprising: a transmission unit configured to transmit the first volume to the second voice communication terminal via the communication network.