JP2017083216A

JP2017083216A - Sound operation system, operation method, and computer program

Info

Publication number: JP2017083216A
Application number: JP2015209438A
Authority: JP
Inventors: 俊羅; Shun Ami
Original assignee: Lenovo Singapore Pte Ltd
Current assignee: Lenovo Singapore Pte Ltd
Priority date: 2015-10-25
Filing date: 2015-10-25
Publication date: 2017-05-18
Anticipated expiration: 2035-10-25
Also published as: JP6140385B2

Abstract

PROBLEM TO BE SOLVED: To operate an object device by utilizing a simple operation sound.SOLUTION: A plurality of object devices 13a to 13e mounted with microphones 201a to 201e respectively are arranged in a space. A user 10 holds a reference device 15 mounted with a microphone 301. The user causes his/her fingers to sound at a first operation position 18a in order to operate the object device 13a, and then, generates operation sounds by causing his/her fingers to sound at a second operation position 18b heading to the object device 13d from the operation position. A network server 11 calculates a sound source distance from the operation positions 18a, 18b to the object device with respect to each object device on the basis of a difference between the propagation time to the reference device of the operation sound and the propagation time to the object device so that the object device 13d is specified. The network server transmits an operation command to the object device 13d.SELECTED DRAWING: Figure 1

Description

本発明は、単純な操作音で対象装置を制御する技術に関し、さらには、無指向性の操作音でユーザが意図する対象装置を特定する技術に関する。 The present invention relates to a technology for controlling a target device with a simple operation sound, and further to a technology for specifying a target device intended by a user with a non-directional operation sound.

テレビ、空調機、および照明器具といったホーム・アプライアンスは、専用のまたは集約した赤外線式のリモート・コントローラで制御することができる。近年、ＩｏＴ（Internet of Things）およびＭ２Ｍ（Machine to Machine）といった、個別に稼働している機器をネットワークで接続し、各機器が生成したデータをリアルタイムで統合して機器の制御に利用するシステムが発展を遂げてきている。ホーム・アプライアンスは、ＩｏＴ／Ｍ２Ｍシステムに組み込んだ携帯式電子機器を通じて制御することも可能である。 Home appliances such as televisions, air conditioners, and luminaires can be controlled by dedicated or integrated infrared remote controllers. In recent years, systems that individually operate devices such as IoT (Internet of Things) and M2M (Machine to Machine) are connected via a network, and data generated by each device is integrated in real time and used for device control. Has been developing. The home appliance can also be controlled through a portable electronic device built into the IoT / M2M system.

特許文献１は、携帯端末装置に入力した音声コマンドを音声認識して操作する方法を開示する。特許文献２は、ＶＴＲやテレビジョン受像機などを音声コマンドで操作する音声コマンド遠隔制御装置を開示する。同文献は、利用者が、所望の電子機器に操作部を向けて押ボタンスイッチを押した後に、その電子機器に対するコマンドを音声で入力すると、その音声信号が無線でベース部に送信されてＶＴＲの再生動作が開始することを記載する。 Patent Document 1 discloses a method for recognizing and operating a voice command input to a mobile terminal device. Patent Document 2 discloses a voice command remote control device that operates a VTR, a television receiver, or the like with a voice command. According to this document, when a user inputs a command for an electronic device by voice after the operation unit is directed to a desired electronic device and the user presses the push button switch, the audio signal is wirelessly transmitted to the base unit and is transmitted to the VTR. The start of the playback operation is described.

特開２００６−２２１２７０号公報JP 2006-221270 A 特開平１１−１２０６４７号公報Japanese Patent Laid-Open No. 11-120647

ホーム・アプライアンスの操作における１つの課題として、操作を意図するユーザがその場ですみやかに操作できることを挙げることができる。専用のリモート・コントローラは、操作したい瞬間にユーザの手の届く範囲にあるとはいえず、さらにボタン操作は、すみやかな操作という要請に十分に応えることはできない。ＩｏＴ／Ｍ２Ｍシステムを構成する携帯式電子機器から操作コマンドを送る方法も同様である。 One problem in the operation of the home appliance is that the user who intends to operate it can operate immediately on the spot. The dedicated remote controller cannot be said to be within the reach of the user's hand at the moment of operation, and the button operation cannot sufficiently meet the demand for prompt operation. The same applies to the method of sending an operation command from the portable electronic device constituting the IoT / M2M system.

このような課題に応える１つの方法として、カメラでユーザのジェスチャを撮影して操作する方法がある。また、音声認識装置を搭載するホーム・アプライアンスまたはＩｏＴ／Ｍ２Ｍシステムが、ユーザ発声した、「オープン」、「クローズ」などのような言語コマンドを認識して対応する操作をする方法もある。しかし、画像認識装置および音声認識装置はシステムの規模が大がかりであることや認識精度などの問題が残る。 One method for responding to such a problem is a method of photographing and manipulating a user's gesture with a camera. There is also a method in which a home appliance or an IoT / M2M system equipped with a voice recognition device recognizes a language command such as “open” or “close” uttered by a user and performs a corresponding operation. However, the image recognition apparatus and the voice recognition apparatus have problems such as the large scale of the system and the recognition accuracy.

そこで本発明の目的は、単純な操作音を利用して対象装置を操作する方法を提供することにある。さらに本発明の目的は、ユーザが意図するときにただちに対象装置を操作できる方法を提供することにある。さらに本発明の目的は、そのような方法を実現する音響操作システム、制御装置、ネットワーク・サーバ、携帯式電子機器およびコンピュータ・プログラムを提供することにある。 Therefore, an object of the present invention is to provide a method for operating a target apparatus using simple operation sounds. A further object of the present invention is to provide a method capable of operating a target device immediately when the user intends. It is another object of the present invention to provide an acoustic operation system, a control device, a network server, a portable electronic device, and a computer program that realize such a method.

本発明は、複数の対象装置の中から音響操作で選択した目的対象装置を操作する音響操作システムを提供する。音響操作システムは、第１の操作位置と第２の操作位置で音源が生成した操作音を検出する複数の対象マイクロフォンを利用して操作音のデータを受け取り、音源から対象マイクロフォンまでの操作音の伝搬時間に基づいて、第１の操作位置から対象マイクロフォンまでの第１の音源距離および第２の操作位置から対象マイクロフォンまでの第２の音源距離を計算する音源距離計算部と、第１の音源距離、第２の音源距離および第１の操作位置から第２の操作位置までの移動距離に基づいて選択した対象マイクロフォンに関連する目的対象装置を特定する対象装置特定部を有する。 The present invention provides an acoustic operation system for operating a target target device selected by acoustic operation from among a plurality of target devices. The acoustic operation system receives operation sound data using a plurality of target microphones that detect operation sounds generated by the sound source at the first operation position and the second operation position, and receives operation sound data from the sound source to the target microphone. A sound source distance calculation unit that calculates a first sound source distance from the first operation position to the target microphone and a second sound source distance from the second operation position to the target microphone based on the propagation time; and a first sound source A target device specifying unit that specifies a target device related to the target microphone selected based on the distance, the second sound source distance, and the moving distance from the first operation position to the second operation position;

操作音は空間において指向性を有しないが、音響操作システムは、第１の操作位置と第２の操作位置で発生した操作音の伝搬時間と移動距離を計算することで、操作音にユーザが意図する方向情報を抽出する。本発明では、音声認識をする必要がないため、指を鳴らしたり掌を叩いたりあるいは携帯式電子機器の筐体を叩いたりする単純なユーザの人体の動きによって操作音を生成するだけで実現できるため、ユーザがその場でただちに操作するのに都合がよい。なお、本発明ではユーザが発生する音声を単純な操作音として利用することもできる。対象マイクロフォンは、対象装置に搭載されていてもよい。 Although the operation sound does not have directivity in space, the acoustic operation system calculates the propagation time and movement distance of the operation sound generated at the first operation position and the second operation position, so that the user can obtain the operation sound. Extract the intended direction information. In the present invention, since it is not necessary to perform voice recognition, it can be realized only by generating an operation sound by a simple user's movement of the user, such as ringing a finger, hitting a palm, or hitting a casing of a portable electronic device. Therefore, it is convenient for the user to operate immediately on the spot. In the present invention, the voice generated by the user can be used as a simple operation sound. The target microphone may be mounted on the target device.

第２の操作位置は、目的対象装置に関連付けられた対象マイクロフォンから第１の操作位置に向かう方向に存在するように設定することができる。この場合、ユーザは第１の操作位置で音響操作をしたあとに、目的対象装置に関連付けられた対象マイクロフォンを目安にして第２の操作位置を選択することで、音響操作システムに容易に目的対象装置を選択する意思を伝えることができる。 The second operation position can be set so as to exist in a direction from the target microphone associated with the target target device toward the first operation position. In this case, after the user performs an acoustic operation at the first operation position, the user can easily select the second operation position using the target microphone associated with the target device as a guide, so that the user can easily perform the target operation on the acoustic operation system. Can communicate the intention of selecting a device.

音源距離計算部は、操作音の生成時刻と対象マイクロフォンが検出した操作音の検出時刻の時間差から第１の音源距離および第２の音源距離を計算することができる。また、音源距離計算部は、第１の操作位置および第２の操作位置において音源から既知の参照距離に配置した参照マイクロフォンを利用して操作音のデータを受信し、対象マイクロフォンが検出した操作音の検出時刻、参照マイクロフォンが検出した操作音の検出時刻および参照距離を利用して第１の音源距離および第２の音源距離を計算することができる。この場合、音源が生成した操作音の生成時刻を取得する必要がないため、指先や掌のような人体の部位を音源にすることが可能になる。 The sound source distance calculation unit can calculate the first sound source distance and the second sound source distance from the time difference between the operation sound generation time and the operation sound detection time detected by the target microphone. The sound source distance calculation unit receives operation sound data using a reference microphone disposed at a known reference distance from the sound source at the first operation position and the second operation position, and the operation sound detected by the target microphone. The first sound source distance and the second sound source distance can be calculated using the detected time, the detection time of the operation sound detected by the reference microphone, and the reference distance. In this case, since it is not necessary to acquire the generation time of the operation sound generated by the sound source, a human body part such as a fingertip or a palm can be used as the sound source.

参照マイクロフォンをユーザが装着するウェアラブル・デバイスに搭載し、音源をユーザの指先または掌とすることができる。この場合、第１の操作位置と第２の操作位置で参照距離を保ちながら音源を容易に第２の操作位置に移動させることができる。さらに、指先や掌による操作音の生成は、直感的かつ迅速な操作を可能にする。参照マイクロフォンをユーザが保持して使用する携帯式電子機器に搭載し、音源を携帯式電子機器の筐体の一部とすることができる。 The reference microphone can be mounted on a wearable device worn by the user, and the sound source can be the fingertip or palm of the user. In this case, the sound source can be easily moved to the second operation position while maintaining the reference distance between the first operation position and the second operation position. Furthermore, the operation sound generated by the fingertip or palm enables an intuitive and quick operation. The reference microphone can be mounted on a portable electronic device that is held and used by the user, and the sound source can be a part of the casing of the portable electronic device.

対象マイクロフォンおよび参照マイクロフォンが検出した雑音を含む検出音から操作音の時間的な発生パターンに基づいて音響操作の有効性を判断する音響操作認識部を備えれば、雑音による誤操作を防ぐことができる。音響操作認識部は、対象マイクロフォンが検出した検出音の検出時刻と参照マイクロフォンが検出した検出音の検出時刻の差に基づいて同一の発生音について対象マイクロフォンと参照マイクロフォンが検出した検出音のペアが操作音のペアを形成しないと判断したときに、音響操作を無効にすることができる。この場合、雑音を誤って操作音と認識したときや、対象マイクロフォンの音響データから操作音を認識できないようなときの誤操作を防ぐことができる。 Providing an acoustic operation recognition unit that determines the effectiveness of an acoustic operation based on a temporal generation pattern of an operation sound from detection sounds including noise detected by the target microphone and the reference microphone can prevent erroneous operation due to noise. . The acoustic operation recognizing unit generates a pair of detection sounds detected by the target microphone and the reference microphone for the same generated sound based on a difference between the detection time of the detection sound detected by the target microphone and the detection time of the detection sound detected by the reference microphone. When it is determined not to form a pair of operation sounds, the acoustic operation can be invalidated. In this case, it is possible to prevent erroneous operation when noise is mistakenly recognized as an operation sound or when the operation sound cannot be recognized from the acoustic data of the target microphone.

音響操作認識部は、参照マイクロフォンまたは対象マイクロフォンが検出した先行する検出音の検出時刻と後続の検出音の検出時刻の時間差が所定値以上のときに、当該検出音を破棄して検出音から雑音を取り除くことができる。音響操作システムは、第１の操作位置および第２の操作位置においてそれぞれ生成した複数個の操作音に対応する操作コマンドを目的対象装置に出力する操作コマンド生成部を有していてもよい。 When the time difference between the detection time of the preceding detection sound detected by the reference microphone or the target microphone and the detection time of the subsequent detection sound is equal to or greater than a predetermined value, the acoustic operation recognition unit discards the detection sound and generates noise from the detection sound. Can be removed. The acoustic operation system may include an operation command generation unit that outputs operation commands corresponding to a plurality of operation sounds respectively generated at the first operation position and the second operation position to the target device.

本発明の第２の態様は、複数の対象装置の中から選択した目的対象装置を音響操作に応じて音響操作システムが操作する方法を提供する。音源が第１の操作位置で生成した操作音と第２の操作位置で生成した操作音を複数の対象装置が検出する。つづいて音源から対象装置までの操作音の伝搬時間に基づいて、第１の操作位置から対象装置までの第１の音源距離および第２の操作位置から対象装置までの第２の音源距離を計算する。第１の操作位置と第２の操作位置の移動距離を取得する。つづいて、第１の音源距離、第２の音源距離および移動距離に基づいて目的対象装置を特定する。つづいて、目的対象装置に操作コマンドを送る。 A second aspect of the present invention provides a method in which an acoustic operation system operates a target target device selected from a plurality of target devices according to an acoustic operation. The plurality of target devices detect the operation sound generated by the sound source at the first operation position and the operation sound generated at the second operation position. Subsequently, based on the propagation time of the operation sound from the sound source to the target device, the first sound source distance from the first operation position to the target device and the second sound source distance from the second operation position to the target device are calculated. To do. The movement distance between the first operation position and the second operation position is acquired. Subsequently, the target device is specified based on the first sound source distance, the second sound source distance, and the movement distance. Subsequently, an operation command is sent to the target device.

第２の操作位置が、第１の操作位置から目的対象装置に向かう方向に存在する場合に、第１の音源距離と第２の音源距離の差から参照距離を減算した評価パラメータを所定の閾値と比較することで目的対象装置を特定することができる。あるいは複数の対象装置についてそれぞれ計算した評価パラメータを相互に比較することで目的対象装置を特定することができる。 When the second operation position is present in the direction from the first operation position toward the target device, an evaluation parameter obtained by subtracting the reference distance from the difference between the first sound source distance and the second sound source distance is set to a predetermined threshold value. The target device can be specified by comparing with. Alternatively, the target target device can be specified by comparing the evaluation parameters calculated for each of the plurality of target devices.

音源から既知の参照距離で操作音を検出した第１の検出時刻を取得し、複数の対象装置が操作音を検出した第２の検出時刻を取得し、第２の検出時刻と第１の検出時刻の差を計算して、第１の音源距離と第２の音源距離を計算することができる。第１の操作位置で第１の個数の操作音を検出し、第２の操作位置で第２の個数の操作音を検出し、第１の個数と第２の個数の組み合わせで複数の操作コマンドを生成し、第１の個数または第２の個数が所定値以上のときに音響操作を無効にすることができる。 The first detection time at which the operation sound is detected at a known reference distance from the sound source is acquired, the second detection time at which the plurality of target devices detect the operation sound is acquired, the second detection time and the first detection time By calculating the time difference, the first sound source distance and the second sound source distance can be calculated. A first number of operation sounds are detected at the first operation position, a second number of operation sounds are detected at the second operation position, and a plurality of operation commands are combined by combining the first number and the second number. And the acoustic operation can be disabled when the first number or the second number is equal to or greater than a predetermined value.

本発明により、単純な操作音を利用して対象装置を操作する方法を提供することができた。さらに本発明により、ユーザが意図するときにただちに対象装置を操作できるようにする方法を提供することができた。さらに本発明により、そのような方法を実現する音響操作システム、制御装置、ネットワーク・サーバおよび携帯式電子機器を提供することができた。 According to the present invention, it is possible to provide a method for operating a target apparatus using simple operation sounds. Furthermore, according to the present invention, it is possible to provide a method that allows the target device to be operated immediately when the user intends. Furthermore, according to the present invention, an acoustic operation system, a control device, a network server, and a portable electronic device that realize such a method can be provided.

本実施の形態にかかる音響操作システム１００の概要を説明するための図である。It is a figure for demonstrating the outline | summary of the acoustic operation system 100 concerning this Embodiment. 音響操作システム１００を構成する参照デバイス１５の一例を説明するための図である。FIG. 3 is a diagram for explaining an example of a reference device 15 constituting the acoustic operation system 100. 操作音を検出して音源１７から対象装置１３までの音源距離ｄを計算する方法を説明するための図である。It is a figure for demonstrating the method to detect the operation sound and to calculate the sound source distance d from the sound source 17 to the target apparatus 13. FIG. 音源距離ｄと移動距離ｄ０から操作を意図する対象装置１３ｄを特定する方法を説明するための図である。It is a figure for demonstrating the method to specify the target apparatus 13d which intends operation from the sound source distance d and the movement distance d0. 音響システム１００の構成の一例を説明するための概略の機能ブロック図である。1 is a schematic functional block diagram for explaining an example of a configuration of an acoustic system 100. FIG. 参照デバイス１５および対象装置１３が検出した検出音３０の波形の一例を模式的に示した図である。It is the figure which showed typically an example of the waveform of the detection sound 30 which the reference device 15 and the target apparatus 13 detected. 音響操作認識部４０３が、各対象装置１３について音響操作の有効性を判断する方法を説明するための図である。FIG. 6 is a diagram for explaining a method by which the acoustic operation recognition unit 403 determines the effectiveness of the acoustic operation for each target device 13. 音響操作認識部４０３が、音響操作の有効性を判断する方法を説明するためのフローチャートである。It is a flowchart for demonstrating the method by which the acoustic operation recognition part 403 judges the effectiveness of acoustic operation. 音響操作システム１００の動作を説明するためのフローチャートである。3 is a flowchart for explaining the operation of the acoustic operation system 100. 音響操作システム６００の構成を説明するための概略の機能ブロック図である。3 is a schematic functional block diagram for explaining the configuration of an acoustic operation system 600. FIG. 音響操作システム７００の構成を説明するための概略の機能ブロック図である。4 is a schematic functional block diagram for explaining the configuration of an acoustic operation system 700. FIG.

［定義］
本願明細書で使用する用語の意味を説明する。操作音は、ユーザが音響操作により発生させる時間軸上での離散的な音を意味する。音響操作は、ユーザが第１の操作位置と第２の操作位置のそれぞれで１個または複数個の操作音を生成して複数の対象装置の中から操作を意図する目的対象装置を選択して制御する操作を意味する。発生音は、操作音と雑音を含んだ音響操作の実行環境で発生するすべての音を意味する。検出音は、音響操作システムが検出した発生音を意味する。 [Definition]
The meanings of terms used in this specification will be described. The operation sound means a discrete sound on the time axis that is generated by a sound operation by the user. In the acoustic operation, the user generates one or a plurality of operation sounds at each of the first operation position and the second operation position, and selects a target target device intended for the operation from the plurality of target devices. Means the operation to be controlled. The generated sound means all sounds generated in the acoustic operation execution environment including the operation sound and noise. The detected sound means a generated sound detected by the acoustic operation system.

［音響操作システム］
図１は、本実施の形態にかかる音響操作システム１００の概要を説明するための図である。図２は、音響操作システム１００を構成する参照デバイス１５の一例を説明するための図である。ユーザ１０が存在する３次元空間には、操作対象となる対象装置１３ａ〜１３ｅ、ネットワーク・サーバ１１および参照デバイス１５が存在する。対象装置１３ａ〜１３ｅは特に限定する必要はないが、一例として、空調機、照明装置、テレビ、ドア施錠装置、およびカーテンなどのホーム・アプライアンスとすることができる。対象装置１３ａ〜１３ｅは、それぞれマイクロフォン２０１ａ〜２０１ｅを搭載している。 [Sound operation system]
FIG. 1 is a diagram for explaining the outline of the acoustic operation system 100 according to the present embodiment. FIG. 2 is a diagram for explaining an example of the reference device 15 constituting the acoustic operation system 100. In the three-dimensional space in which the user 10 exists, there are target devices 13a to 13e, a network server 11, and a reference device 15 to be operated. The target devices 13a to 13e are not particularly limited, but may be home appliances such as an air conditioner, a lighting device, a television, a door locking device, and a curtain as an example. The target devices 13a to 13e are equipped with microphones 201a to 201e, respectively.

マイクロフォン２０１ａ〜２０１ｅは必ずしも対象装置１３ａ〜１３ｅに搭載する必要はなく、対応する対象装置１３ａ〜１３ｅに接続され、かつ、ユーザにとって両者の関連性が理解できるようになっていればよい。以下においては、対象装置１３ａ〜１３ｅ、およびマイクロフォン２０１ａ〜２０１ｅを個々に区別する必要がないときは、それらを総称して対象装置１３、およびマイクロフォン２０１と記載することにする。対象装置１３と記載した場合には、単一の対象装置を示す場合と複数の対象装置を示す場合がある。 The microphones 201a to 201e do not necessarily have to be mounted on the target devices 13a to 13e, but only need to be connected to the corresponding target devices 13a to 13e so that the user can understand the relationship between them. In the following, when it is not necessary to individually distinguish the target devices 13a to 13e and the microphones 201a to 201e, they will be collectively referred to as the target device 13 and the microphone 201. When the target device 13 is described, a single target device may be indicated and a plurality of target devices may be indicated.

ユーザ１０は、複数の対象装置１３のなかから選択した対象装置１３ｄ（たとえばテレビ）に、ネットワーク・サーバ１１を通じてオン／オフや音量調整などの操作コマンドを送ろうとしている。対象装置１３は、既存のリモート・コントローラやネットワーク・デバイスとの操作を併用してもよい。ユーザ１０は、対象装置１３ｄの選択と操作コマンドの送信を第１の操作位置１８ａと第２の操作位置１８ｂにおける音響操作で行う。 The user 10 is about to send an operation command such as on / off or volume adjustment to the target device 13d (for example, a television) selected from the plurality of target devices 13 through the network server 11. The target apparatus 13 may use an operation with an existing remote controller or network device. The user 10 performs selection of the target device 13d and transmission of an operation command by acoustic operation at the first operation position 18a and the second operation position 18b.

ユーザ１０と対象装置１３ｄの間には、第１の操作位置１８ａで音響操作をした後に音響操作を行う第２の操作位置１８ｂを決めるための仮想的な方向直線１９を定義する。ユーザ１０は、第１の操作位置１８ａで音響操作をした後に、方向直線１９を意識しながら第２の操作位置１８ｂを決めることができる。第１の操作位置１８ａと第２の操作位置１８ｂの距離を移動距離ｄ０ということにする。 A virtual directional straight line 19 is defined between the user 10 and the target device 13d for determining the second operation position 18b for performing the acoustic operation after performing the acoustic operation at the first operation position 18a. The user 10 can determine the second operation position 18b while paying attention to the direction straight line 19 after performing the acoustic operation at the first operation position 18a. The distance between the first operation position 18a and the second operation position 18b is referred to as a movement distance d0.

ユーザ１０の手１０ａには、装着または保持といった態様で参照デバイス１５が存在している。参照デバイス１５はマイクロフォン３０１を搭載しており、操作音４０を生成した音源１７から各対象装置１３までの距離を測定するための参照情報を提供する。参照デバイス１５は、ユーザ１０の手１０ａを音源にするのに都合がよく、かつ第１の操作場所１８ａから第２の操作場所１８ｂまでの移動が容易な携帯式電子機器とすることができる。携帯式電子機器の一例として腕時計のように腕に装着するウェアラブル・デバイス１５ａを採用することができる（図２（Ａ））。 The reference device 15 is present in the hand 10a of the user 10 in a manner such as wearing or holding. The reference device 15 includes a microphone 301 and provides reference information for measuring the distance from the sound source 17 that generated the operation sound 40 to each target device 13. The reference device 15 can be a portable electronic device that is convenient for using the hand 10a of the user 10 as a sound source and that can be easily moved from the first operation location 18a to the second operation location 18b. As an example of a portable electronic device, a wearable device 15a attached to an arm like a wristwatch can be employed (FIG. 2A).

図２でウェアラブル・デバイス１５ａは、マイクロフォン３０１ａを実装している。参照デバイス１５としてウェアラブル・デバイス１５ａを採用する場合は、ユーザが、親指に押し当てた中指をはじいて指を鳴らすことで、指先で操作音４０ａを生成する音源１７ａを構成することができる。音源１７で発生した１個または複数個の操作音４０ａは、空中を球面状に伝搬してマイクロフォン３０１、２０１に到達する。 In FIG. 2, the wearable device 15a includes a microphone 301a. When the wearable device 15a is employed as the reference device 15, the user can configure the sound source 17a that generates the operation sound 40a with the fingertip by repelling the middle finger pressed against the thumb. One or a plurality of operation sounds 40 a generated by the sound source 17 propagates in the air in a spherical shape and reaches the microphones 301 and 201.

マイクロフォン３０１は、少なくともユーザが操作音４０を生成する際に音源１７から既知の参照距離ｒに存在すればよい。マイクロフォン２０１は音響操作の際に音源１７から未知の距離に存在する。ウェアラブル・デバイス１５ａの場合は、音源１７ａとなる指先とマイクロフォン３０１ａの参照距離ｒを一般的な人間の掌の大きさとしてあらかじめ想定しておくことができる。また、ウェアラブル・デバイス１５ａでは、空間をユーザ１０が移動したり姿勢を変えたりしても参照距離ｒが変化しないので都合がよい。音源１７ａは、ユーザが指先に装着する器具や打楽器を利用して構成してもよい。 The microphone 301 may be present at a known reference distance r from the sound source 17 at least when the user generates the operation sound 40. The microphone 201 exists at an unknown distance from the sound source 17 during acoustic operation. In the case of the wearable device 15a, the reference distance r between the fingertip serving as the sound source 17a and the microphone 301a can be assumed in advance as a general human palm size. Also, the wearable device 15a is convenient because the reference distance r does not change even when the user 10 moves in the space or changes the posture. The sound source 17a may be configured using an instrument or percussion instrument worn by the user on the fingertip.

参照デバイス１５としてのウェアラブル・デバイスは、めがねのように顔に装着したりベルトのように腰に装着したりするものでもよい。音源１７ａは、音響操作の際にマイクロフォン３０１ａから既知の参照距離ｒに存在していれば指先に限定する必要はなく、ユーザ１０の声でもよいしクラッピングする掌でもよい。このような音響操作は、リモート・コントローラやネットワーク・デバイスから操作するよりも、ユーザが意図するときにただちに直感的に行うことができる。参照デバイス１５は、少なくとも操作音４０を生成する際にマイクロフォン３０１が音源１７から既知の参照距離ｒに存在するものであれば、ウェアラブル・デバイス１５ａに限定する必要はない。 The wearable device as the reference device 15 may be worn on the face like glasses or on the waist like a belt. The sound source 17a need not be limited to the fingertip as long as it exists at a known reference distance r from the microphone 301a during the acoustic operation, and may be a voice of the user 10 or a palm that is clapped. Such an acoustic operation can be performed intuitively immediately when the user intends, rather than operating from a remote controller or network device. The reference device 15 need not be limited to the wearable device 15a as long as the microphone 301 exists at a known reference distance r from the sound source 17 when generating the operation sound 40 at least.

図２（Ｂ）は、参照デバイス１５の他の例としてユーザ１０が手に保持して使用する携帯式電子機器１５ｂを示している。携帯式電子機器１５ｂは、マイクロフォン３０１ｂを搭載するスマートフォンやタブレット端末とすることができる。ユーザは、マイクロフォン３０１ｂから既知の参照距離にある筐体の所定の部位を指やペンで叩いて操作音４０ｂを生成する。叩かれる筐体の所定の部位が音源１７ｂとなる。操作音４０ｂは空中を伝搬してマイクロフォン２０１に到達し、空中および筐体を伝搬してマイクロフォン３０１に到達する。 FIG. 2B shows a portable electronic device 15b that is held and used by the user 10 as another example of the reference device 15. The portable electronic device 15b can be a smartphone or a tablet terminal equipped with the microphone 301b. The user generates an operation sound 40b by hitting a predetermined part of the housing at a known reference distance from the microphone 301b with a finger or a pen. A predetermined part of the case to be hit is the sound source 17b. The operation sound 40b propagates through the air and reaches the microphone 201, and propagates through the air and the casing to reach the microphone 301.

音源１７ａ、１７ｂは、人体の単純な動作で操作音４０ａ、４０ｂを生成することができる。また、ウェアラブル・デバイス１５ａや携帯式電子機器１５ｂは常にユーザの身近に存在する。したがって、ユーザが意図するときにただちに方向直線１９上の第１の操作位置１８ａ、第２の操作位置１８ｂで音響操作をするのに都合がよい。また、音源１７ａ、１７ｂからマイクロフォン３０１ａ、３０１ｂまでの参照距離ｒは事前に想定でき、かつ、第１の操作位置１８ａ、第２の操作位置１８ｂで参照距離ｒを保つのに都合がよい。ただし、本発明は参照デバイス１５および音源１７をここに例示した範囲に限定するものではなく、本発明の思想を逸脱しない範囲で当業者が容易に想起できるさまざまな参照デバイス１５および音源１７を採用することができる。 The sound sources 17a and 17b can generate the operation sounds 40a and 40b with a simple movement of the human body. Also, the wearable device 15a and the portable electronic device 15b are always close to the user. Therefore, it is convenient to perform an acoustic operation at the first operation position 18a and the second operation position 18b on the direction straight line 19 immediately when the user intends. Further, the reference distance r from the sound sources 17a and 17b to the microphones 301a and 301b can be assumed in advance, and it is convenient to maintain the reference distance r at the first operation position 18a and the second operation position 18b. However, the present invention does not limit the reference device 15 and the sound source 17 to the ranges exemplified here, and various reference devices 15 and the sound sources 17 that can be easily conceived by those skilled in the art without departing from the spirit of the present invention. can do.

図１に戻って、音響操作システム１００は原理上、方向直線１９上に複数の対象装置１３が重なって存在しないことを前提にする。参照デバイス１５と対象装置１３は、無線ネットワークでネットワーク・サーバ１１に接続している。無線ネットワークの規格は特に限定する必要はなく、無線ＬＡＮやBluetooth（登録商標）などを採用することができる。なお、本発明は有線ネットワークの適用を排除しない。特にネットワーク・サーバ１１と対象装置１３は、位置を変化させる必要がないため、有線ネットワークで接続しても音響操作には支障がない。 Returning to FIG. 1, the acoustic operation system 100 is based on the premise that a plurality of target devices 13 do not exist on the direction line 19 in principle. The reference device 15 and the target device 13 are connected to the network server 11 via a wireless network. The standard of the wireless network is not particularly limited, and a wireless LAN, Bluetooth (registered trademark), or the like can be adopted. The present invention does not exclude the application of a wired network. In particular, since the network server 11 and the target device 13 do not need to change their positions, even if they are connected via a wired network, there is no problem in acoustic operation.

ネットワーク・サーバ１１は、ネットワークを通じて参照デバイス１５および対象装置１３と操作音４０の処理に必要なデータをリアルタイムで交換する。一例において、ネットワーク・サーバ１１は、操作音４０を処理して、対象装置１３を操作するための操作コマンドを生成する。参照デバイス１５、対象装置１３およびネットワーク・サーバ１１は、一例においてＩｏＴ／Ｍ２Ｍシステムとして構成することができる。他の例では、ネットワーク・サーバ１１を利用しないで、操作音４０の処理に必要なデータを参照デバイス１５または対象装置１３の一方が処理したり、両方で分散して処理したりすることもできる。 The network server 11 exchanges data necessary for processing of the operation sound 40 with the reference device 15 and the target device 13 through the network in real time. In one example, the network server 11 processes the operation sound 40 and generates an operation command for operating the target device 13. The reference device 15, the target device 13, and the network server 11 can be configured as an IoT / M2M system in an example. In another example, without using the network server 11, the data required for processing the operation sound 40 can be processed by either the reference device 15 or the target device 13, or can be distributed and processed by both. .

音源１７で発生した操作音４０は、ほぼ球面波となって対象装置１３まで伝搬する。音源１７は、無指向性の点音源とすることができる。音響操作システム１００は、操作音４０を時間軸上で離散的に存在する物理的な音として利用するため、操作音４０はユーザ１０と対象装置１３を関連付けるための方向の情報を保有する必要がない。したがって、各対象装置１３のマイクロフォン２０１が操作音４０を検出しただけでは、ユーザ１０が操作を意図する対象装置１３ｄを特定することができない。ここで、たとえば同じ操作音４０を検出するマイクロフォン２０１ｃ、２０１ｄは音源１７からの距離が相違するため伝搬時間に差が生ずることに着目する。本実施の形態では以下に説明するように、伝搬時間の差を利用して、方向直線１９上に存在する対象装置１３ｄとそれ以外の対象装置を区別して操作を意図する対象装置１３ｄを特定する。 The operation sound 40 generated by the sound source 17 propagates to the target device 13 as a substantially spherical wave. The sound source 17 can be an omnidirectional point sound source. Since the acoustic operation system 100 uses the operation sound 40 as a physical sound that exists discretely on the time axis, the operation sound 40 needs to have information on a direction for associating the user 10 with the target device 13. Absent. Therefore, the target device 13d that the user 10 intends to operate cannot be specified only by detecting the operation sound 40 by the microphone 201 of each target device 13. Here, attention is paid to the fact that, for example, the microphones 201c and 201d that detect the same operation sound 40 have different propagation times because the distances from the sound source 17 are different. In this embodiment, as will be described below, the target device 13d intended for the operation is specified by distinguishing the target device 13d existing on the direction straight line 19 from the other target devices using the difference in propagation time. .

音響操作は、第１の操作位置１８ａおよび第２の操作位置で操作音４０を生成する。ユーザ１０は、第１の操作位置１８ａで１回目の操作音４０を生成し、第１の操作位置１８ａと対象装置１３ｄのマイクロフォン２０１ｄを結ぶ方向直線１９上の第２の操作位置１８ｂで２回目の操作音を生成する。このとき参照デバイス１５のマイクロフォン３０１と音源１７はともに既知の参照距離ｒを保ちながら第１の操作位置から第２の操作位置まで移動する。 The acoustic operation generates an operation sound 40 at the first operation position 18a and the second operation position. The user 10 generates the first operation sound 40 at the first operation position 18a and performs the second operation at the second operation position 18b on the directional line 19 connecting the first operation position 18a and the microphone 201d of the target device 13d. The operation sound is generated. At this time, both the microphone 301 and the sound source 17 of the reference device 15 move from the first operation position to the second operation position while maintaining a known reference distance r.

参照デバイス１５および対象装置１３は、同じ操作音４０を異なる検出時刻で検出する。参照デバイス１５と各対象装置１３は操作音４０を瞬時にデータ化してネットワーク・サーバ１１に送る。ネットワーク・サーバ１１は、第１の操作位置１８ａおよび第２の操作位置１８ｂにおける音源１７から各対象装置１３が搭載するマイクロフォン２０１までの操作音の伝搬時間の差、および参照距離ｒに基づいて各マイクロフォン２０１と音源１７の距離を計算することができる。各マイクロフォン２０１と音源１７の距離を音源距離ｄ（図３）という。音源距離ｄに対して対象装置１３のサイズが十分に小さい場合は、音源距離ｄを対象装置１３と音源１７の距離として扱うことができる。 The reference device 15 and the target device 13 detect the same operation sound 40 at different detection times. The reference device 15 and each target device 13 instantaneously convert the operation sound 40 into data and send it to the network server 11. The network server 11 is configured based on the difference in the propagation time of the operation sound from the sound source 17 to the microphone 201 mounted on each target device 13 at the first operation position 18a and the second operation position 18b, and the reference distance r. The distance between the microphone 201 and the sound source 17 can be calculated. The distance between each microphone 201 and the sound source 17 is referred to as a sound source distance d (FIG. 3). When the size of the target device 13 is sufficiently small with respect to the sound source distance d, the sound source distance d can be handled as the distance between the target device 13 and the sound source 17.

２つの操作位置１８ａ、１８ｂでそれぞれ計算した音源距離ｄは、ユーザが操作を意図する対象装置１３ｄを他の対象装置から区別する情報を与える。ネットワーク・サーバ１１は、特定した対象装置１３ｄにあらかじめ定義した操作コマンドを送る。音響操作は、第１の操作位置１８ａおよび第２の操作位置１８ｂでそれぞれ１個の操作音４０を生成してもよい。 The sound source distance d calculated at each of the two operation positions 18a and 18b gives information for distinguishing the target device 13d that the user intends to operate from other target devices. The network server 11 sends a predefined operation command to the identified target device 13d. The acoustic operation may generate one operation sound 40 at each of the first operation position 18a and the second operation position 18b.

この場合、対象装置１３に対して現在の状態を変更する１個の情報を操作コマンドで送ることができる。具体的には、動作している対象装置を停止させたり、停止している対象装置を動作させたりする操作コマンドを送ることができる。操作コマンドの数をさらに増やすために、第１の操作位置１８ａおよび第２の操作位置１８ｂでそれぞれ連続的に複数個の操作音４０を生成することができる。第１の操作位置１８ａと第２の操作位置１８ｂのそれぞれにおける操作音４０の数の複数の組み合わせに対して複数の操作コマンドを対応付けることができる。 In this case, one piece of information for changing the current state can be sent to the target device 13 by an operation command. Specifically, it is possible to send an operation command for stopping the target device that is operating or operating the target device that is stopped. In order to further increase the number of operation commands, a plurality of operation sounds 40 can be generated continuously at the first operation position 18a and the second operation position 18b, respectively. A plurality of operation commands can be associated with a plurality of combinations of the number of operation sounds 40 at each of the first operation position 18a and the second operation position 18b.

［音源距離の計算］
つぎに図３を参照して、操作音４０を検出して音源１７から対象装置１３までの音源距離ｄを計算する方法を説明する。操作音４０の発生時刻をｔ０とし、音源１７とマイクロフォン３０１の参照距離をｒとし、音源１７とマイクロフォン２０１の音源距離をｄとする。図３（Ａ）は、マイクロフォン３０１、音源１７およびマイクロフォン２０１の空間的な位置関係に対応する図で、図３（Ｂ）はそれらを時間軸上に配置した図である。マイクロフォン３０１は少なくとも音響操作をする際に、音源１７を中心とし参照距離ｒを半径とする円１６上に存在する。 [Calculation of sound source distance]
Next, a method for detecting the operation sound 40 and calculating the sound source distance d from the sound source 17 to the target device 13 will be described with reference to FIG. The generation time of the operation sound 40 is t0, the reference distance between the sound source 17 and the microphone 301 is r, and the sound source distance between the sound source 17 and the microphone 201 is d. 3A is a diagram corresponding to the spatial positional relationship among the microphone 301, the sound source 17, and the microphone 201, and FIG. 3B is a diagram in which they are arranged on the time axis. The microphone 301 exists on a circle 16 having the sound source 17 as a center and a reference distance r as a radius when performing acoustic operation at least.

任意の発生時刻ｔ０でユーザ１０が音響操作をする。音源１７で発生した操作音４０は音速ｓで空間を伝搬する。マイクロフォン３０１は操作音４０を検出時刻ｔ１で検出し、マイクロフォン２０１は同じ操作音４０を検出時刻ｔ２で検出する。操作音４０がマイクロフォン３０１まで伝搬する時間ｔｔ１は空気中の音速をｓとしたとき、ｔｔ１＝ｔ１−ｔ０＝ｒ／ｓとなる。したがって、ネットワーク・サーバ１１はマイクロフォン３０１の検出音を利用すると操作音４０の発生時刻ｔ０を、ｔ０＝ｔ１−ｒ／ｓで計算することができる。操作音４０がマイクロフォン２０１まで伝搬する時間ｔｔ２は、ｔｔ２＝ｔ２−ｔ０＝ｄ／ｓとなる。したがってネットワーク・サーバ１１は音源距離ｄを
ｄ＝（ｔ２−ｔ０）ｓ（１）
で計算することができる。 The user 10 performs an acoustic operation at an arbitrary occurrence time t0. The operation sound 40 generated by the sound source 17 propagates through the space at the sound speed s. The microphone 301 detects the operation sound 40 at the detection time t1, and the microphone 201 detects the same operation sound 40 at the detection time t2. The time tt1 for the operation sound 40 to propagate to the microphone 301 is tt1 = t1−t0 = r / s, where s is the speed of sound in the air. Accordingly, when the detection sound of the microphone 301 is used, the network server 11 can calculate the generation time t0 of the operation sound 40 by t0 = t1−r / s. The time tt2 for the operation sound 40 to propagate to the microphone 201 is tt2 = t2−t0 = d / s. Therefore, the network server 11 sets the sound source distance d to d = (t2−t0) s (1)
Can be calculated with

式（１）で音源距離ｄを計算するネットワーク・サーバ１１は、音源１７が操作音４０を生成する発生時刻ｔ０を電気的に取得する必要がある。たとえば、参照デバイス１５が、携帯式電子機器１５ｂの場合は、音源１７ｂの直下に操作音、振動、または圧力などを検知するセンサを設けて、センサが動作した時刻を発生時刻ｔ０とすることができる。ここで、式（１）は、マイクロフォン３０１が検出した検出時刻ｔ１を使って、
ｄ＝（ｔ２−ｔ１）ｓ＋ｒ（２）
で計算することができる。 The network server 11 that calculates the sound source distance d using Equation (1) needs to electrically acquire the generation time t0 when the sound source 17 generates the operation sound 40. For example, when the reference device 15 is the portable electronic device 15b, a sensor that detects operation sound, vibration, pressure, or the like is provided immediately below the sound source 17b, and the time when the sensor is operated may be set as the generation time t0. it can. Here, the expression (1) is obtained by using the detection time t1 detected by the microphone 301.
d = (t2-t1) s + r (2)
Can be calculated with

ｔ２−ｔ１は、音源１７からマイクロフォン３０１およびマイクロフォン２０１までの操作音４０の伝搬時間の差に相当する。式（２）は、操作音４０のマイクロフォン２０１での検出時刻ｔ２、マイクロフォン３０１での検出時刻ｔ１、および参照距離ｒを取得すれば音源距離ｄを計算できることを示している。式（２）は、指先や掌のような発生時刻ｔ０を計時することが困難な音源１７を採用する場合でも適用できる。 t2−t1 corresponds to a difference in propagation time of the operation sound 40 from the sound source 17 to the microphone 301 and the microphone 201. Expression (2) indicates that the sound source distance d can be calculated by obtaining the detection time t2 of the operation sound 40 at the microphone 201, the detection time t1 at the microphone 301, and the reference distance r. Expression (2) can be applied even when the sound source 17 that is difficult to measure the occurrence time t0 such as a fingertip or a palm is employed.

参照デバイス１５が携帯式電子機器１３ｂの場合は、マイクロフォン３０１ｂから参照距離ｒの筐体の所定の位置を叩いたときに、操作音４０ｂが空気中と筐体を伝搬する。マイクロフォン３０１ｂが筐体を伝搬する検出する操作音４０を検出する場合は、筐体の媒質の音速をｍとすれば、ｔ０＝ｔ１−ｒ／ｍとなり、ネットワーク・サーバ１１は音源距離ｄを、
ｄ＝（ｔ２−ｔ１）ｓ＋ｒｓ／ｍ（３）
で計算することができる。 In the case where the reference device 15 is the portable electronic device 13b, the operation sound 40b propagates in the air and the case when a predetermined position of the case at the reference distance r is hit from the microphone 301b. When the operation sound 40 detected by the microphone 301b propagating through the casing is detected, if the sound speed of the medium of the casing is m, t0 = t1-r / m, and the network server 11 determines the sound source distance d,
d = (t2-t1) s + rs / m (3)
Can be calculated with

［ユーザの操作を意図する対象装置の特定］
図４は、音源距離ｄと移動距離ｄ０から操作を意図する対象装置１３ｄを特定する方法を説明するための図である。ここでは、操作を意図する対象装置１３ｄを対象装置１３ｃから区別する場合を例にして説明するが、対象装置１３ｄを他の対象装置から区別する原理も同様に理解することができる。ユーザ１０は、最初に任意の操作位置１８ａに音源１７を位置付けて操作音４０を生成する。 [Identification of target device intended for user operation]
FIG. 4 is a diagram for explaining a method of specifying the target device 13d intended for the operation from the sound source distance d and the movement distance d0. Here, a case where the target device 13d intended to be operated is distinguished from the target device 13c will be described as an example. However, the principle of distinguishing the target device 13d from other target devices can be understood in the same manner. The user 10 first generates the operation sound 40 by positioning the sound source 17 at an arbitrary operation position 18a.

任意の操作位置１８ａは、ユーザが操作を意図した場所として自由に選択することができる。ネットワーク・サーバ１１は、図３を参照して説明した方法で、操作位置１８ａからマイクロフォン２０１ｃ、２０１ｄまでの音源距離ｄ１、ｄ３を計算する。つぎにユーザ１０は方向直線１９上の対象装置１３ｄに近い第２の操作位置１８ｂに音源１７を位置付けて操作音４０を生成する。第２の操作位置１８ｂは、第１の操作位置１８ａに対して、対象装置１３ｄより遠い位置とすることもできる。移動距離ｄ０はあらかじめ決めておいてもよいが、本実施の形態では任意の移動距離ｄを後に説明するように参照デバイス１５を利用して、または他の方法でネットワーク・サーバ１１が計測する。 The arbitrary operation position 18a can be freely selected as a place where the user intends to operate. The network server 11 calculates sound source distances d1 and d3 from the operation position 18a to the microphones 201c and 201d by the method described with reference to FIG. Next, the user 10 generates the operation sound 40 by positioning the sound source 17 at the second operation position 18 b close to the target device 13 d on the directional line 19. The second operation position 18b can be a position farther from the target device 13d than the first operation position 18a. The movement distance d0 may be determined in advance, but in the present embodiment, the network server 11 measures an arbitrary movement distance d using the reference device 15 as described later or by another method.

第２の操作位置１８ｂは、厳密に方向直線１９上に存在する必要はない。第２の操作位置１８ｂが方向直線１７から離れるときの許容値は、音響操作システム１００の対象装置１３に対する分解能または対象装置１３の相互間の距離に依存する。ネットワーク・サーバ１１は図３で説明した方法で、第２の操作位置１８ｂからマイクロフォン２０１ｃ、２０１ｄまでの音源距離ｄ２、ｄ４を計算する。第１の操作位置１８ａにおける参照距離ｒと、第２の操作位置１８ｂにおける参照距離ｒはネットワーク・サーバ１１が距離を認識していれば異なる値でもよい。 The second operating position 18b does not have to be strictly on the direction line 19. The allowable value when the second operation position 18b moves away from the directional line 17 depends on the resolution of the acoustic operation system 100 with respect to the target device 13 or the distance between the target devices 13. The network server 11 calculates the sound source distances d2 and d4 from the second operation position 18b to the microphones 201c and 201d by the method described in FIG. The reference distance r at the first operation position 18a and the reference distance r at the second operation position 18b may be different values as long as the network server 11 recognizes the distance.

ネットワーク・サーバ１１は、音源距離ｄ１、ｄ２の差および音源距離ｄ３、ｄ４の差を計算する。ここで、音源距離ｄ１、ｄ２の差の絶対値および音源距離ｄ３、ｄ４の差の絶対値をそれぞれΔ１２、Δ３４とする。第２の操作位置１８ｂがほぼ方向直線１９上に存在すれば、対象装置１３ｄに関するΔ１２は、Δ１２＝ａｂｓ（ｄ１−ｄ２）≒ｄ０となる。また、対象装置１３ｃに関するΔ３４は、三角形の３つの辺の大小関係より、移動距離ｄ０より小さくなって、Δ３４＝ａｂｓ（ｄ３−ｄ４）≠ｄ０となる。 The network server 11 calculates the difference between the sound source distances d1 and d2 and the difference between the sound source distances d3 and d4. Here, the absolute value of the difference between the sound source distances d1 and d2 and the absolute value of the difference between the sound source distances d3 and d4 are Δ12 and Δ34, respectively. If the second operation position 18b is substantially on the directional line 19, Δ12 related to the target device 13d is Δ12 = abs (d1−d2) ≈d0. Further, Δ34 related to the target device 13c is smaller than the moving distance d0 because of the size relationship of the three sides of the triangle, and Δ34 = abs (d3−d4) ≠ d0.

ここで、Δ１２およびΔ３４と移動距離ｄ０の差を評価パラメータＺということにする。ユーザが操作を意図する対象装置１３ｄの評価パラメータＺは、他の対象装置１３ｃの評価パラメータＺよりも小さくなる。この性質を利用してネットワーク・サーバ１１は、以下の方法で評価パラメータＺから対象装置１３ｄを特定することができる。 Here, the difference between Δ12 and Δ34 and the movement distance d0 is referred to as an evaluation parameter Z. The evaluation parameter Z of the target device 13d that the user intends to operate is smaller than the evaluation parameter Z of the other target device 13c. Using this property, the network server 11 can specify the target device 13d from the evaluation parameter Z by the following method.

第１の方法では、ネットワーク・サーバ１１が、第１の操作位置１８ａにおける音源距離をｄｐとし、第２の操作位置１８ｂにおける音源距離をｄｑとして、すべての対象装置１３について評価パラメータＺを計算する。ネットワーク・サーバ１１は、評価パラメータＺに対して所定の閾値Ｔｈを設定して、式（４）を満たす対象装置１３をユーザが操作を意図する対象装置１３ｄであると認識することができる。 In the first method, the network server 11 calculates the evaluation parameter Z for all the target devices 13 with the sound source distance at the first operation position 18a as dp and the sound source distance at the second operation position 18b as dq. . The network server 11 sets a predetermined threshold Th for the evaluation parameter Z, and can recognize that the target device 13 that satisfies Expression (4) is the target device 13d that the user intends to operate.

Ｚ＝ａｂｓ（ｄｐ−ｄｑ）−ｄ０＜Ｔｈ（４）
式（４）は、評価パラメータＺを閾値Ｔｈで絶対値評価しているため、ユーザが不用意に行った音響操作に対して、いずれの対象装置１３も反応しないようにすることができる。他方で閾値Ｔｈを操作環境ごとに設定する必要があり、第２の操作位置１８ｂが方向直線１９から外れる程度によっては、対象装置１３ｄを特定できない場合もでてくる。 Z = abs (dp−dq) −d0 <Th (4)
In Expression (4), since the absolute value of the evaluation parameter Z is evaluated with the threshold Th, it is possible to prevent any target device 13 from reacting to the acoustic operation performed carelessly by the user. On the other hand, it is necessary to set the threshold Th for each operation environment, and depending on the degree to which the second operation position 18b deviates from the direction straight line 19, the target device 13d may not be specified.

第２の方法では、すべての対象装置１３について計算した評価パラメータＺの中で、式（５）を満たすように評価パラメータＺが最小を示した対象装置１３ｄを操作対象として特定することができる。
Ｚ＝ａｂｓ（ｄｐ−ｄｑ）−ｄ０＝ｍｉｎ（５）
式（５）は、評価パラメータＺを相対値評価しているため、第２の操作位置１８ｂが方向直線１９から外れても、操作直線１９に最も近いいずれかの対象装置１３が特定される。ただし、ユーザが不用意に行った音響操作に対してもいずれかの対象装置１３が反応することになる。いずれの特定方法を採用するかは、音響操作システム１００を導入する環境に基づいて選択することができる。 In the second method, among the evaluation parameters Z calculated for all the target devices 13, the target device 13d whose evaluation parameter Z indicates the minimum so as to satisfy Expression (5) can be specified as the operation target.
Z = abs (dp−dq) −d0 = min (5)
Since Expression (5) evaluates the evaluation parameter Z as a relative value, even if the second operation position 18b deviates from the directional line 19, any target device 13 that is closest to the operation line 19 is specified. However, any one of the target devices 13 reacts to an acoustic operation performed carelessly by the user. Which specific method is adopted can be selected based on the environment in which the acoustic operation system 100 is introduced.

［音響操作システム］
図５は、音響操作システム１００の構成の一例を説明するための概略の機能ブロック図である。音響操作システム１００は一例において、複数の対象装置１３、参照デバイス１５およびネットワーク・サーバ１１で構成する。音響操作システム１００は、他の例において図１０、図１１を参照して説明するように、ネットワーク・サーバ１１を利用しないで構成することもできる。対象装置１３、参照デバイス１５およびネットワーク・サーバ１１はペアリングを完了すると、ネットワークを通じてデータ交換をして音響操作システム１００としてまとまりのある動作をする。ネットワークを通じたデータ交換の速度は、音響が空間や個体媒質を伝搬する時間よりも十分に短い。 [Sound operation system]
FIG. 5 is a schematic functional block diagram for explaining an example of the configuration of the acoustic operation system 100. In one example, the acoustic operation system 100 includes a plurality of target devices 13, a reference device 15, and a network server 11. The acoustic operation system 100 can also be configured without using the network server 11 as described with reference to FIGS. 10 and 11 in another example. When the pairing is completed, the target device 13, the reference device 15, and the network server 11 exchange data through the network and perform a coherent operation as the acoustic operation system 100. The speed of data exchange through the network is sufficiently shorter than the time for sound to propagate through space and solid media.

複数の対象装置１３はそれぞれ、操作音４０を検出するマイクロフォン２０１、Ａ／Ｄ変換部２０３、対象装置機能２０５、対象装置操作部２０７およびネットワーク・インターフェース２０９を備えている。対象装置機能２０５は、たとえば、テレビ、空調機といった各ホーム・アプライアンスの本来の機能に相当する。対象装置操作部２０７は、ネットワーク・サーバ１１から操作コマンドを受け取って対象装置機能２０５を制御する。対象装置操作部２０７は、赤外線式のリモート・コントローラから操作信号を受け取って対象装置機能２０５を制御してもよい。 Each of the plurality of target devices 13 includes a microphone 201 that detects the operation sound 40, an A / D conversion unit 203, a target device function 205, a target device operation unit 207, and a network interface 209. The target device function 205 corresponds to an original function of each home appliance such as a television and an air conditioner. The target device operation unit 207 receives an operation command from the network server 11 and controls the target device function 205. The target device operation unit 207 may control the target device function 205 by receiving an operation signal from an infrared remote controller.

参照デバイス１５は、マイクロフォン３０１、Ａ／Ｄ変換部３０３、モーション・センサ３０５、Ａ／Ｄ変換部３０７およびネットワーク・インターフェース３０９を備えている。モーション・センサ３０５は、加速度、角速度および地磁気をそれぞれ３軸で検出する。モーション・センサ３０５は、第１の操作位置１８ａから第２の操作位置１８ｂまでの移動距離ｄ０を計測するための状態信号を出力する。Ａ／Ｄ変換部２０３、３０３、３０７は、マイクロフォン２０１，３０１またはモーション・センサ３０５から受け取ったアナログ信号に対して標本化処理および量子化処理をしてディジタル・データに変換し、ネットワーク・サーバ１１に出力する。 The reference device 15 includes a microphone 301, an A / D conversion unit 303, a motion sensor 305, an A / D conversion unit 307, and a network interface 309. The motion sensor 305 detects acceleration, angular velocity, and geomagnetism on three axes. The motion sensor 305 outputs a state signal for measuring the movement distance d0 from the first operation position 18a to the second operation position 18b. The A / D conversion units 203, 303, and 307 perform sampling processing and quantization processing on the analog signals received from the microphones 201 and 301 or the motion sensor 305 to convert them into digital data, and then convert them to the network server 11. Output to.

ネットワーク・サーバ１１は、ネットワーク・インターフェース４０１、音響操作認識部４０３、移動距離計算部４０５、計時部４０７、音源距離計算部４０９、対象装置特定部４１１および操作コマンド生成部４１３を含んでいる。これらの要素は、ＯＳやアプリケーション・プログラムのようなソフトウェア資源とそれらを実行するＣＰＵ、チップ・セット、およびシステム・メモリなどのハードウェア資源の協働で構成することができる。ソフトウェア資源は、あらかじめネットワーク・サーバ１１の不揮発性メモリに格納したり、ネットワークを通じてダウンロードしたりすることができる。 The network server 11 includes a network interface 401, an acoustic operation recognition unit 403, a movement distance calculation unit 405, a timing unit 407, a sound source distance calculation unit 409, a target device identification unit 411, and an operation command generation unit 413. These elements can be configured by cooperation of software resources such as an OS and application programs and hardware resources such as a CPU, a chip set, and a system memory that execute the software resources. Software resources can be stored in advance in a non-volatile memory of the network server 11 or downloaded through a network.

音響操作認識部４０３は、複数の対象装置１３および参照デバイス１５から受け取った検出音の音響データから、有効な音響操作を認識する。音響操作認識部４０３は、有効な音響操作を認識したときに、各対象装置１３の検出時刻ｔ２および参照デバイス１５の検出時刻ｔ１を音源距離計算部４０９および移動距離計算部４０５に送る。音響操作認識部４０３は、第１の操作位置１８ａおよび第２の操作位置１８ｂで行われたそれぞれの操作音４０の数を操作コマンド生成部４１３に送る。 The acoustic operation recognition unit 403 recognizes an effective acoustic operation from the acoustic data of the detected sound received from the plurality of target devices 13 and the reference device 15. When the acoustic operation recognition unit 403 recognizes an effective acoustic operation, the acoustic operation recognition unit 403 sends the detection time t2 of each target device 13 and the detection time t1 of the reference device 15 to the sound source distance calculation unit 409 and the movement distance calculation unit 405. The acoustic operation recognition unit 403 sends the number of operation sounds 40 performed at the first operation position 18a and the second operation position 18b to the operation command generation unit 413.

計時部４０７は時刻を計時して、音響操作認識部４０３に提供する。音源距離計算部４０９は、あらかじめ音速ｓ、ｍ、および参照距離ｒを取得している。音源距離計算部４０９は、図３を参照して説明した方法で、各対象装置１３について第１の操作位置１８ａおよび第２の操作位置１８ｂにおける音源距離ｄを計算する。 The timer 407 measures the time and provides it to the acoustic operation recognition unit 403. The sound source distance calculation unit 409 obtains the sound speeds s and m and the reference distance r in advance. The sound source distance calculation unit 409 calculates the sound source distance d at the first operation position 18a and the second operation position 18b for each target device 13 by the method described with reference to FIG.

移動距離計算部４０５は、検出時刻ｔ１、ｔ２におけるモーション・センサ３０５の状態信号から計算した移動距離ｄ０を対象装置特定部４１１に送る。対象装置特定部４１１は、評価パラメータＺを使った絶対値評価または相対値評価でユーザが操作を意図する対象デバイス１３ｄを特定してその識別子を出力する。操作コマンド生成部４１３は、音響操作認識部４０３から受け取った第１の操作位置１８ａ、第２の操作位置１８ｂのそれぞれの操作音４０の個数と、対象装置特定部４１１から受け取った対象装置１３ｄの識別子から操作コマンドを生成してネットワーク・インターフェース４０１を通じて対象装置操作部２０７に出力する。 The movement distance calculation unit 405 sends the movement distance d0 calculated from the state signal of the motion sensor 305 at the detection times t1 and t2 to the target device identification unit 411. The target device specifying unit 411 specifies the target device 13d that the user intends to operate by absolute value evaluation or relative value evaluation using the evaluation parameter Z, and outputs the identifier. The operation command generation unit 413 receives the number of operation sounds 40 at each of the first operation position 18a and the second operation position 18b received from the acoustic operation recognition unit 403 and the target device 13d received from the target device specifying unit 411. An operation command is generated from the identifier and output to the target device operation unit 207 through the network interface 401.

たとえば第１の操作位置１８ａ、第２の操作位置１８ｂでそれぞれ１個または２個の操作音を生成するときは、２つの操作位置１８ａ、１８ｂで行う１回の音響操作で、（１個，１個）、（１個，２個）、（２個，１個）、（２個，２個）の４つの操作情報を生成することができる。操作コマンド生成部４１３は、あらかじめ各対象装置１３について、操作情報と操作コマンドを対応づけておくことができる。 For example, when one or two operation sounds are generated at the first operation position 18a and the second operation position 18b, respectively, one acoustic operation performed at the two operation positions 18a and 18b is performed by (one, Four pieces of operation information of (1 piece), (1 piece, 2 pieces), (2 pieces, 1 piece), and (2 pieces, 2 pieces) can be generated. The operation command generation unit 413 can associate operation information and operation commands for each target device 13 in advance.

［音響操作の認識］
つぎに、音響操作認識部４０３の動作の一例を図６、図７、図８を参照して説明する。図６は、参照デバイス１５および対象装置１３が検出した検出音３０の波形の一例を模式的に示した図である。図７は音響操作認識部４０３が、各対象装置１３について音響操作の有効性を判断する方法を説明するための図である。図８は、音響操作認識部４０３が音響操作の有効性を判断する手順を説明するためのフローチャートである。 [Sound operation recognition]
Next, an example of the operation of the acoustic operation recognition unit 403 will be described with reference to FIGS. FIG. 6 is a diagram schematically illustrating an example of the waveform of the detection sound 30 detected by the reference device 15 and the target device 13. FIG. 7 is a diagram for explaining a method in which the acoustic operation recognition unit 403 determines the effectiveness of the acoustic operation for each target device 13. FIG. 8 is a flowchart for explaining a procedure by which the acoustic operation recognizing unit 403 determines the effectiveness of the acoustic operation.

マイクロフォン２０１、３０１が検出する検出音３０には操作音４０に加えて雑音も含まれている。また検出音３０の音圧レベルが小さいと音響操作認識部４０３は操作音４０を認識できない。したがって、誤操作を防ぐために雑音と操作音４０を区別する必要がある。また、複数個の操作音４０で複数の操作コマンドを生成する場合は、連続して生成される後続の操作音４０と、第２の操作位置１８ｂで生成される操作音４０を区別する必要がある。音響操作認識部４０３は、対象装置１３および参照デバイス１５から受け取った雑音を含む検出音３０から有効な操作音４０を認識して検出時刻および操作音４０の個数を抽出する。 The detection sound 30 detected by the microphones 201 and 301 includes noise in addition to the operation sound 40. If the sound pressure level of the detected sound 30 is low, the acoustic operation recognition unit 403 cannot recognize the operation sound 40. Accordingly, it is necessary to distinguish between the noise and the operation sound 40 in order to prevent erroneous operation. In addition, when a plurality of operation commands are generated using a plurality of operation sounds 40, it is necessary to distinguish between the subsequent operation sounds 40 generated continuously and the operation sounds 40 generated at the second operation position 18b. is there. The acoustic operation recognition unit 403 recognizes an effective operation sound 40 from the detection sound 30 including noise received from the target device 13 and the reference device 15 and extracts the detection time and the number of operation sounds 40.

図６は、操作位置１８ａ、１８ｂにおいてそれぞれ最大３個の操作音で構成する音響操作を例示して、検出音３０から音響操作の有効性を判断する方法の一例を説明するための図である。図６は、第１の操作位置１８ａにおいて発生時刻ｔ０で発生した連続する２個の発生音について、参照デバイス１５が検出した検出音３１ａ、３２ａと、対象装置１３が検出した検出音３１ｂ、３２ｂを示している。図６はまた、第２の操作位置１８ｂにおいて発生時刻ｔ００で発生した連続する３個の発生音について、参照デバイス１５が検出した検出音３３ａ〜３５ａと、対象装置１３が検出した検出音３３ｂ〜３５ｂを示している。 FIG. 6 is a diagram for explaining an example of a method for determining the effectiveness of the acoustic operation from the detection sound 30 by exemplifying an acoustic operation composed of a maximum of three operation sounds at the operation positions 18a and 18b. . FIG. 6 shows detected sounds 31a and 32a detected by the reference device 15 and detected sounds 31b and 32b detected by the target device 13 for two consecutive generated sounds generated at the generation time t0 at the first operation position 18a. Is shown. FIG. 6 also shows detection sounds 33a to 35a detected by the reference device 15 and detection sounds 33b to 33b detected by the target device 13 for three consecutive generation sounds generated at the generation time t00 at the second operation position 18b. 35b is shown.

この時点で音響操作認識部４０３は発生時刻ｔ０、ｔ００を認識せず、また、検出音３０と操作場所１８ａ、１８ｂの関係を認識していない。音響操作認識部４０３は検出音３０に対して、第１の操作位置１８ａおよび第２の操作位置１８ｂで最初に検出した検出音３１ａ、３３ａの検出時刻ｔ０、ｔ００からの経過時間に、それぞれの操作位置１８ａ、１８ｂで音響操作を完了するまでの閾値Ｔｈ１を設定している。第１の操作位置１８ａおよび第２の操作位置１８ｂを認識する方法は後に説明する。 At this time, the acoustic operation recognition unit 403 does not recognize the occurrence times t0 and t00, and does not recognize the relationship between the detected sound 30 and the operation locations 18a and 18b. The acoustic operation recognizing unit 403 detects the detected sound 30 at the elapsed time from the detection times t0 and t00 of the detected sounds 31a and 33a first detected at the first operation position 18a and the second operation position 18b. A threshold value Th1 until the acoustic operation is completed at the operation positions 18a and 18b is set. A method for recognizing the first operation position 18a and the second operation position 18b will be described later.

また、音響操作認識部４０３は検出音３０に対して、第１の操作位置１８ａで最初に検出した検出音３１ａの検出時刻ｔ０からの経過時間に、第２の操作位置１８ｂでの音響操作を開始および終了するまでの時間としての閾値Ｔｈ２、Ｔｈ３を設定している。最小の閾値Ｔｈ２は、第１の操作位置１８ａから第２の操作位置１８ｂまで音源１７が移動するまでの移動時間の最小値に基づいて設定することができる。最大の閾値Ｔｈ３は、第１の操作位置１８ａで音響操作をしたのちに第２の操作位置１８ｂで音響操作をしない場合に、音響操作システム１００が今回の音響操作をリセットするまでの許容時間に基づいて設定することができる。 In addition, the acoustic operation recognition unit 403 performs an acoustic operation at the second operation position 18b on the detection sound 30 during the elapsed time from the detection time t0 of the detection sound 31a first detected at the first operation position 18a. Threshold values Th2 and Th3 are set as time until start and end. The minimum threshold Th2 can be set based on the minimum value of the movement time until the sound source 17 moves from the first operation position 18a to the second operation position 18b. The maximum threshold Th3 is an allowable time until the acoustic operation system 100 resets the current acoustic operation when the acoustic operation is not performed at the second operation position 18b after the acoustic operation is performed at the first operation position 18a. Can be set based on.

図８のブロック５０１で音響操作認識部４０３はリセットされている。リセットされた音響操作認識部４０３は、リセット後に最初に取得した検出音３１ａ、３１ｂを、第１の操作位置１８ａで生成されたものとして扱う。音響操作認識部４０３は、各対象装置１３および参照デバイス１５から音響データを受信して検出音３０の分析を開始する。ブロック５０３で音響操作認識部４０３は、時間軸上で離散的に存在する各検出音３０の発生時刻と消滅時刻を特定するために、一例において振幅に対して所定の振幅レベル３６を設定する。音響操作認識部４０３は、音源１７から対象装置１３までの音源距離ｄが長い場合に、必要に応じて音圧レベルの減衰を補正するために波形処理をすることができる。 In block 501 of FIG. 8, the acoustic operation recognition unit 403 is reset. The reset acoustic operation recognizing unit 403 treats the detection sounds 31a and 31b acquired first after the reset as being generated at the first operation position 18a. The acoustic operation recognition unit 403 receives acoustic data from each target device 13 and the reference device 15 and starts analyzing the detected sound 30. In block 503, the acoustic operation recognizing unit 403 sets a predetermined amplitude level 36 for the amplitude in one example in order to specify the generation time and the disappearance time of each detection sound 30 that exists discretely on the time axis. When the sound source distance d from the sound source 17 to the target device 13 is long, the acoustic operation recognition unit 403 can perform waveform processing to correct the attenuation of the sound pressure level as necessary.

音響操作認識部４０３は、振幅レベル３６で各検出音３０の検出時刻と消滅時刻を特定して持続時間ｗを計算する。音響操作認識部４０３は、あらかじめ音源１７が生成する操作音４０の持続時間や周波数成分などの特徴情報を登録しておく。音響操作認識部４０３は、検出音３０の持続時間ｗに対して、下限の閾値Ｔｈ４および上限の閾値Ｔｈ５を設定し、
Ｔｈ４＜ｗ≦Ｔｈ５（６）
を満たさない検出音を雑音とみなすことができる。音響操作認識部４０３は、登録した周波数成分と検出音３０の周波数成分とを比較して操作音４０か否かを認識する。音響操作認識部４０３は、ブロック５５１で持続時間ｗや周波数成分から操作音４０として認識できない検出音３０を雑音として破棄することができる。 The acoustic operation recognition unit 403 calculates the duration w by specifying the detection time and disappearance time of each detected sound 30 at the amplitude level 36. The acoustic operation recognition unit 403 registers feature information such as the duration and frequency component of the operation sound 40 generated by the sound source 17 in advance. The acoustic operation recognition unit 403 sets a lower limit threshold Th4 and an upper limit threshold Th5 for the duration w of the detection sound 30,
Th4 <w ≦ Th5 (6)
Detected sound that does not satisfy can be regarded as noise. The acoustic operation recognition unit 403 compares the registered frequency component with the frequency component of the detection sound 30 to recognize whether or not the operation sound is 40. The acoustic operation recognizing unit 403 can discard the detected sound 30 that cannot be recognized as the operation sound 40 from the duration w or frequency component in block 551 as noise.

ブロック５０５で音響操作認識部４０３は、各検出音３０の検出時刻に基づいて先行する検出音と後続の検出音の時間差ａを計算する。図６では、検出音３１ａ、３２ａの時間差ａ１、検出音３３ａ、３４ａの時間差ａ２、検出音３４ａ、３５ａの時間差ａ３を示している。対象装置１３が検出した検出音についての時間差ａも同様に計算することができる。なお、第１の操作位置１８ａで最後に検出した検出音３２ａと第２の操作位置１８ｂで最初に検出した検出音３３ａの時間差は時間差ａから除外する。 In block 505, the acoustic operation recognition unit 403 calculates a time difference “a” between the preceding detection sound and the subsequent detection sound based on the detection time of each detection sound 30. FIG. 6 shows a time difference a1 between the detection sounds 31a and 32a, a time difference a2 between the detection sounds 33a and 34a, and a time difference a3 between the detection sounds 34a and 35a. The time difference a for the detected sound detected by the target device 13 can be calculated in the same manner. The time difference between the detected sound 32a detected last at the first operation position 18a and the detected sound 33a detected first at the second operation position 18b is excluded from the time difference a.

音響操作認識部４０３は、時間差ａに対して最小の閾値Ｔｈ６および最大の閾値Ｔｈ７を設定する。最小の閾値Ｔｈ６は、ユーザが連続的に操作音４０を生成することができる最小の時間差に基づいて設定し、最大の閾値Ｔｈ７は、閾値Ｔｈ１と操作音の個数から設定することができる。音響操作認識部４０３は、
Ｔｈ６＜ａ≦Ｔｈ７（７）
の条件を満たさない後続の検出音３０を雑音と見なしてブロック５５１で破棄することができる。 The acoustic operation recognition unit 403 sets a minimum threshold Th6 and a maximum threshold Th7 for the time difference a. The minimum threshold Th6 can be set based on the minimum time difference that allows the user to continuously generate the operation sound 40, and the maximum threshold Th7 can be set from the threshold Th1 and the number of operation sounds. The acoustic operation recognition unit 403
Th6 <a ≦ Th7 (7)
The subsequent detection sound 30 that does not satisfy the above condition can be regarded as noise and discarded at block 551.

ここまでの手順で処理された検出音３０は、操作音４０であっても十分な音圧レベルがないために雑音として破棄されたり、ブロック５０３、５０５で分別できない雑音が混入していたりする可能性がある。本実施の形態では、さらに以下の手順を実行して音響操作認識部４０３が、検出音３０の時間的発生パターンに基づいて音響操作の有効性を判断することができる。音響操作システム１００は、図３を参照して説明したように音源距離ｄを計測するために、同一の操作音４０について、参照デバイス１５が検出した操作音４０と各対象装置１３が検出した操作音４０で、操作音４０のペアを形成する必要がある。 The detected sound 30 processed in the procedure so far may be discarded as noise because there is not a sufficient sound pressure level even if it is the operation sound 40, or noise that cannot be distinguished in the blocks 503 and 505 may be mixed. There is sex. In the present embodiment, the acoustic operation recognition unit 403 can further determine the effectiveness of the acoustic operation based on the temporal generation pattern of the detected sound 30 by executing the following procedure. The acoustic operation system 100 measures the sound source distance d as described with reference to FIG. 3, and the operation sound 40 detected by the reference device 15 and the operation detected by each target device 13 for the same operation sound 40. The sound 40 needs to form a pair of operation sounds 40.

ブロック５０７で音響操作認識部４０３は、音響操作の有効性を判断するために、雑音を含む同一の発生音に対する検出音３０のペア候補が操作音４０のペアを形成するか否かを判断する。音響操作認識部４０３は、対象装置１３で検出したたとえば検出音３１ｂに対して、参照デバイス１５が検出した先行する最も近い検出音（この場合検出音３１ａ）を検出音３０のペア候補として選択することができる。図６は、検出音３０のペア候補（３１ａ，３１ｂ）、（３２ａ，３２ｂ）、（３３ａ，３３ｂ）、（３４ａ，３４ｂ）および（３５ａ，３５ｂ）を示している。 In block 507, the acoustic operation recognizing unit 403 determines whether or not the pair candidate of the detection sound 30 for the same generated sound including noise forms a pair of the operation sound 40 in order to determine the effectiveness of the acoustic operation. . The acoustic operation recognizing unit 403 selects, for example, the detected sound 31b detected by the target device 13 as the closest candidate detected sound (in this case, the detected sound 31a) detected by the reference device 15 as a pair candidate of the detected sound 30. be able to. FIG. 6 shows pair candidates (31a, 31b), (32a, 32b), (33a, 33b), (34a, 34b), and (35a, 35b) of the detected sound 30.

音響操作システム１００は、音響操作の利用環境における音源距離ｄの最大値を、あらかじめ想定することができる。音響操作認識部４０３は、検出音３０のペア候補の有効性を判断するために、検出時刻の時間差（ｔ２−ｔ１）に対して最小の閾値Ｔｈ８と最大の閾値Ｔｈ９を設定する。最小の閾値Ｔｈ８は、想定する最小の音源距離ｄと参照距離ｒとの差に基づいて設定することができる。最大の閾値値Ｔｈ９は、想定する最大の音源距離ｄと参照距離ｒとの差に基づいて設定することができる。 The acoustic operation system 100 can assume in advance the maximum value of the sound source distance d in the environment in which acoustic operation is used. The acoustic operation recognizing unit 403 sets a minimum threshold Th8 and a maximum threshold Th9 for the time difference (t2−t1) of the detection time in order to determine the validity of the pair candidate of the detection sound 30. The minimum threshold Th8 can be set based on the difference between the assumed minimum sound source distance d and the reference distance r. The maximum threshold value Th9 can be set based on the difference between the assumed maximum sound source distance d and the reference distance r.

音響操作認識部４０３は検出音３０のペア候補が、
Ｔｈ８＜（ｔ２−ｔ１）≦Ｔｈ９（８）
を満たさないときに当該対象装置１３については操作音４０のペアが形成できないため、ブロック５５３で音響操作を無効として扱う。式（８）を満たす対象装置１３については、ブロック５０９に移行する。図７（Ａ）は、対象装置１３が検出音３１ｂを検出しないために、音響操作認識部４０３は、後続の検出音３２ｂと先行する検出音３１ａによる検出音３０のペア候補が有効でないことを示している。 The acoustic operation recognizing unit 403 determines whether the detection sound 30 pair candidate is
Th8 <(t2-t1) ≦ Th9 (8)
Since the pair of operation sounds 40 cannot be formed for the target device 13 when the condition is not satisfied, the acoustic operation is treated as invalid in block 553. For the target device 13 that satisfies Expression (8), the process proceeds to block 509. FIG. 7A shows that since the target device 13 does not detect the detection sound 31b, the acoustic operation recognition unit 403 indicates that the pair candidate of the detection sound 30 by the subsequent detection sound 32b and the preceding detection sound 31a is not valid. Show.

音響操作認識部４０３は、各対象装置１３について、リセット後に最初に検出した検出音３１ａ、３１ｂおよび式（７）を満たす後続の検出音３２ａ、３２ｂを第１の操作位置１８ａで発生した検出音３０とみなす。音響操作認識部４０３は、閾値Ｔｈ１を経過した後に最初に検出した検出音３３ａ、３３ｂおよび式（７）を満たす後続の検出音３４ａ、３５ａ、３４ｂ、３５ｂを第２の操作位置１８ｂで発生した検出音３０とみなす。ブロック５０９で音響操作認識部４０３は、第１の操作位置１８ａで最初に検出した検出音３１ａと第２の操作位置１８ｂで最初に検出した検出音３３ａの時間差ｂを計算する。対象装置１３についても同様に計算することができる。 The acoustic operation recognizing unit 403 detects, for each target device 13, first detected sounds 31a, 31b detected after reset and subsequent detected sounds 32a, 32b that satisfy Expression (7) at the first operation position 18a. Consider 30. The acoustic operation recognizing unit 403 generates the detected sounds 33a and 33b that are detected first after the threshold value Th1 and the subsequent detected sounds 34a, 35a, 34b, and 35b that satisfy Expression (7) at the second operation position 18b. The detection sound 30 is considered. In block 509, the acoustic operation recognizing unit 403 calculates a time difference b between the detected sound 31a first detected at the first operation position 18a and the detected sound 33a first detected at the second operation position 18b. The same calculation can be performed for the target device 13.

音響操作認識部４０３は、時間差ｂが
Ｔｈ２＜ｂ≦Ｔｈ３（９）
を満たさないときに当該対象装置１３については、ブロック５５３で当該音響操作を無効として扱う。式（９）を満たす対象装置１３については、ブロック５１１に移行する。図７（Ｂ）は、検出音３３ａを検出するまでの時間差ｂが長すぎて、当該対象装置１３については、音響操作認識部４０３がリセットされるときの様子を示している。 In the acoustic operation recognition unit 403, the time difference b is Th2 <b ≦ Th3 (9)
When the target device 13 is not satisfied, the acoustic operation is treated as invalid in block 553. For the target device 13 that satisfies Equation (9), the process proceeds to block 511. FIG. 7B shows a state in which the time difference b until the detection sound 33a is detected is too long and the acoustic operation recognition unit 403 is reset for the target device 13.

ブロック５１１で音響操作認識部４０３は、検出時刻ｔ１から閾値Ｔｈ１の間に検出した検出音３０の数および、検出時刻ｔ００から閾値Ｔｈ１の間に検出した検出音３０の数を計算する。音響操作認識部４０３は、検出音３０の数が設定した最大個数（この場合は３個）を超える場合は、雑音が混入しているとみなして、当該対象装置１３については、ブロック５５３で音響操作を無効として扱う。最大個数以下の対象装置１３については、ブロック５１３に移行する。図７（Ｃ）は、閾値Ｔｈ１の間に、４個の検出音３１ａ、３２ａ、３６ａ、３７ａが検出されて音響操作が無効になるときの様子を示している。 In block 511, the acoustic operation recognition unit 403 calculates the number of detection sounds 30 detected between the detection time t1 and the threshold Th1, and the number of detection sounds 30 detected between the detection time t00 and the threshold Th1. When the number of detected sounds 30 exceeds the set maximum number (in this case, three), the acoustic operation recognizing unit 403 considers that noise is mixed, and the target device 13 performs acoustic processing in block 553. Treat the operation as invalid. For the target devices 13 having the maximum number or less, the process proceeds to block 513. FIG. 7C shows a state in which the acoustic operation is disabled by detecting four detection sounds 31a, 32a, 36a, and 37a during the threshold Th1.

ブロック５１３で音響操作認識部４０３は、有効な音響操作を認識した対象装置１３について図９のブロック６０７に移行する。ブロック５５３で音響操作認識部４０３は、有効な音響操作を認識しない対象装置１３について音響操作認識部４０３をリセットして図９のブロック６０３に移行し、新たに第１の操作位置１８ａで音響操作が行われるまで待機する。ブロック５０３〜５１１の手順は、音響操作の環境に応じて適宜選択して採用することができる。また、処理の順番は図８に例示したものに限定する必要はなく、たとえば、ブロック５０３、５０５は順番を入れ替えてもよく、また、ブロック５０７〜５１１も順番を入れ替えてもよい。 In block 513, the acoustic operation recognition unit 403 proceeds to block 607 in FIG. 9 for the target device 13 that has recognized an effective acoustic operation. In block 553, the acoustic operation recognizing unit 403 resets the acoustic operation recognizing unit 403 for the target device 13 that does not recognize an effective acoustic operation, and proceeds to block 603 in FIG. 9 to newly perform the acoustic operation at the first operation position 18a. Wait until is done. The procedures of blocks 503 to 511 can be appropriately selected and adopted according to the acoustic operation environment. Further, the order of processing does not have to be limited to that illustrated in FIG. 8. For example, the order of the blocks 503 and 505 may be changed, and the order of the blocks 507 to 511 may also be changed.

［音響操作システムの動作］
図９は音響操作システム１００の動作を説明するためのフローチャートである。ブロック６０１で対象装置１３、ネットワーク・サーバ１１および参照デバイス１５が相手のネットワーク・アドレスを取得して相互認証を完了し、ネットワークを構築している。ブロック６０３で音響操作認識部４０３は発生時刻ｔ０、ｔ００で検出音３０を受け取る。 [Operation of acoustic operation system]
FIG. 9 is a flowchart for explaining the operation of the acoustic operation system 100. In block 601, the target device 13, the network server 11, and the reference device 15 acquire the other party's network address, complete mutual authentication, and construct a network. In block 603, the acoustic operation recognition unit 403 receives the detection sound 30 at the occurrence times t0 and t00.

ブロック６０５で音響操作認識部４０３は、図８の手順で各対象装置１３について音響操作の有効性を判断する。ブロック６０７で音響操作認識部４０３は、音響操作が有効であると認識した対象装置１３について、たとえば図６に例示した検出音３０に対応する操作音４０を認識する。音響操作認識部４０３は、第１の操作位置１８ａおよび第２の操作位置１８ｂに関する検出時刻ｔ１、ｔ２、ｔ１１、ｔ１２を音源距離計算部４０９と移動距離計算部４０５に送る。 In block 605, the acoustic operation recognizing unit 403 determines the effectiveness of the acoustic operation for each target device 13 in the procedure of FIG. In block 607, the acoustic operation recognition unit 403 recognizes, for example, the operation sound 40 corresponding to the detection sound 30 illustrated in FIG. 6 for the target device 13 that has recognized that the acoustic operation is valid. The acoustic operation recognizing unit 403 sends detection times t1, t2, t11, and t12 related to the first operation position 18a and the second operation position 18b to the sound source distance calculation unit 409 and the movement distance calculation unit 405.

ブロック６０９で音響操作認識部４０３は、第１の操作位置１８ａにおける操作音４０の個数（２個）と第２の操作位置１８ｂにおける操作音４０の個数（３個）を操作コマンド生成部４１３に送る。ブロック６１１で移動距離計算部４０５は、検出時刻ｔ１、ｔ２２と、モーション・センサ３０５が出力した状態データから移動距離ｄ０を計算して対象装置特定部４１１に送る。 In block 609, the acoustic operation recognizing unit 403 informs the operation command generation unit 413 of the number (two) of operation sounds 40 at the first operation position 18a and the number (three) of operation sounds 40 at the second operation position 18b. send. In block 611, the movement distance calculation unit 405 calculates the movement distance d0 from the detection times t1 and t22 and the state data output from the motion sensor 305, and sends the movement distance d0 to the target device identification unit 411.

ブロック６１３で音源距離計算部４０９が各対象装置１３について式（１）〜（３）のいずれかを利用して、各対象装置１３について、第１の操作位置１８ａおよび第２の操作位置１８ｂにおける音源距離ｄを計算する。ブロック６１５で対象装置特定部４１１は、音響操作認識部４０３が有効な音響操作を認識した対象装置について式（４）または式（５）のいずれかを適用し、操作を意図している対象装置１３ｄを特定する。ブロック６１７で操作コマンド生成部４１３は、特定した対象装置１３ｄの対象装置操作部２０７に対して、操作音の個数の組み合わせに対応する操作コマンドを出力して音響操作システム１００をリセットする。 In block 613, the sound source distance calculation unit 409 uses any one of the formulas (1) to (3) for each target device 13, and at each of the target devices 13 at the first operation position 18a and the second operation position 18b. The sound source distance d is calculated. In block 615, the target device identification unit 411 applies either Equation (4) or Equation (5) to the target device for which the acoustic operation recognition unit 403 has recognized an effective acoustic operation, and intends the operation. 13d is specified. In block 617, the operation command generation unit 413 outputs an operation command corresponding to the combination of the number of operation sounds to the target device operation unit 207 of the specified target device 13d, and resets the acoustic operation system 100.

音響操作システム１００は、ネットワーク・サーバ１１の機能を対象装置１３および参照デバイス１５に分散することができる。たとえば、音響操作認識部４０３、計時部４０７を対象装置１３および参照デバイス１５に分散して搭載することもできる。計時部４０７を対象装置１３および参照デバイス１５に分散するときは、必要に応じてＮＴＰサーバや同期コマンドを使って、分散した計時部４０７のＲＴＣ（Real Time Clock）を同期することができる。計時部４０７を分散するときは、音響操作認識部４０３は、対象装置１３と参照デバイス１５から検出時刻ｔ１、ｔ２を受け取るようにしてもよい。移動距離計算部４０５は参照デバイス１５に実装することもできる。 The acoustic operation system 100 can distribute the functions of the network server 11 to the target device 13 and the reference device 15. For example, the acoustic operation recognizing unit 403 and the time measuring unit 407 can be distributed and mounted on the target device 13 and the reference device 15. When distributing the time measuring unit 407 to the target device 13 and the reference device 15, it is possible to synchronize the RTC (Real Time Clock) of the distributed time measuring unit 407 using an NTP server or a synchronization command as necessary. When distributing the time measuring unit 407, the acoustic operation recognizing unit 403 may receive the detection times t1 and t2 from the target device 13 and the reference device 15. The movement distance calculation unit 405 can also be mounted on the reference device 15.

マイクロフォン３０１と移動距離ｄ０を測定するデバイスは参照デバイス１５から分離することができる。このとき移動距離ｄ０をモーション・センサ３０５に代えてステレオ・カメラで計測するようにしてもよい。音響操作システム１００の分解能は、対象装置１３の相互間の距離に依存する。音源距離ｄの測定精度は要求される分解能に相応のものにすることができる。この場合、図２（Ａ）に示したように指を鳴らして操作音４０ａを生成するときに、第１の操作位置１８ａを胸の近くに設定し、前腕を伸ばした位置を第２の操作位置１８ｂに設定して、標準的な上腕の長さを移動距離ｄ０に設定するようにしてもよい。 The microphone 301 and the device for measuring the moving distance d0 can be separated from the reference device 15. At this time, the movement distance d0 may be measured by a stereo camera instead of the motion sensor 305. The resolution of the acoustic operation system 100 depends on the distance between the target devices 13. The measurement accuracy of the sound source distance d can be made appropriate to the required resolution. In this case, when the operation sound 40a is generated by ringing a finger as shown in FIG. 2A, the first operation position 18a is set near the chest, and the position where the forearm is extended is the second operation. The position may be set to the position 18b, and the standard upper arm length may be set to the movement distance d0.

［音響操作システムの他の構成例］
図１０、図１１は、音響操作システム６００、７００の構成を説明するための概略の機能ブロック図である。図１０、図１１では、図５と同一の要素または図５から容易に機能が理解できる要素に同一の参照番号を付して説明を省略する。音響操作システム６００、７００はネットワーク・サーバ１１を利用しないで構成している。音響操作システム６００はネットワークで接続した、参照デバイス６０５と対象装置６０３で構成している。音響操作システム６００では、ネットワーク・サーバの機能を参照デバイス６０５に組み込んでいる。ネットワーク・インターフェース６０７には、複数の対象装置６０３が接続される。 [Other examples of acoustic operation system]
10 and 11 are schematic functional block diagrams for explaining the configuration of the acoustic operation systems 600 and 700. FIG. 10 and 11, the same reference numerals are given to the same elements as those in FIG. 5 or elements whose functions can be easily understood from FIG. The acoustic operation systems 600 and 700 are configured without using the network server 11. The acoustic operation system 600 includes a reference device 605 and a target device 603 connected via a network. In the acoustic operation system 600, the function of the network server is incorporated in the reference device 605. A plurality of target devices 603 are connected to the network interface 607.

音響操作システム７００は、対象装置７０３と参照デバイス７０５で構成している。音響操作システム７００では、ネットワーク・サーバの機能を対象装置７０３に組み込んでいる。ネットワーク・インターフェース３０９には、複数の対象装置７０３が接続される。音響操作システム６００、７００においても、参照デバイス６０５および対象装置７０３が搭載する機能を分散することができる。本発明は図５、図１０、図１１で説明した音響操作システム１００、６００、７００に限定するものではなく、所定の機能を他の方法で分散したり類似の機能を採用したりする場合も本発明の思想を含み当業者が容易に想起できるものは本発明の範囲に含む。 The acoustic operation system 700 includes a target device 703 and a reference device 705. In the acoustic operation system 700, the function of the network server is incorporated in the target device 703. A plurality of target devices 703 are connected to the network interface 309. Also in the acoustic operation systems 600 and 700, the functions installed in the reference device 605 and the target device 703 can be distributed. The present invention is not limited to the acoustic operation systems 100, 600, and 700 described with reference to FIGS. 5, 10, and 11, and a predetermined function may be distributed by another method or a similar function may be adopted. Anything that can be easily conceived by those skilled in the art including the concept of the present invention is included in the scope of the present invention.

これまで本発明について図面に示した特定の実施の形態をもって説明してきたが、本発明は図面に示した実施の形態に限定されるものではなく、本発明の効果を奏する限り、これまで知られたいかなる構成であっても採用することができることはいうまでもないことである。 Although the present invention has been described with the specific embodiments shown in the drawings, the present invention is not limited to the embodiments shown in the drawings, and is known so far as long as the effects of the present invention are achieved. It goes without saying that any configuration can be adopted.

１０ユーザ
１１ネットワーク・サーバ
１００、６００、７００音響操作システム
１３、１３ａ〜１３ｅ、６０３、７０３対象装置
１５、６０５、７０５参照デバイス
１５ａウェアラブル・デバイス（参照デバイス）
１５ｂ携帯式電子機器（参照デバイス）
１７、１７ａ、１７ｂ音源
１８ａ第１の操作位置
１８ｂ第２の操作位置
１９方向直線
３０、３１ａ〜３５ａ、３１ｂ〜３５ｂ検出音
４０、４１ａ〜４５ａ、４１ｂ〜４５ｂ操作音
２０１、２０１ａ〜２０１ｅ、３０１、３０１ａ、３０１ｂマイクロフォン
ｄ音源距離
ｒ参照距離 10 User 11 Network server 100, 600, 700 Acoustic operation system 13, 13a-13e, 603, 703 Target device 15, 605, 705 Reference device 15a Wearable device (reference device)
15b Portable electronic device (reference device)
17, 17a, 17b Sound source 18a First operation position 18b Second operation position 19 Directional straight line 30, 31a-35a, 31b-35b Detection sound 40, 41a-45a, 41b-45b Operation sound 201, 201a-201e, 301 , 301a, 301b Microphone d Sound source distance r Reference distance

Claims

An acoustic operation system for operating a target target device selected by acoustic operation from a plurality of target devices,
The operation sound data is received using a plurality of target microphones that detect operation sounds generated by the sound source at the first operation position and the second operation position, and the operation sounds from the sound source to the target microphone are received. A sound source distance calculator that calculates a first sound source distance from the first operation position to the target microphone and a second sound source distance from the second operation position to the target microphone based on a propagation time;
The target device associated with the target microphone selected based on the first sound source distance, the second sound source distance, and the movement distance from the first operation position to the second operation position is specified. An acoustic operation system having a target device specifying unit.

The acoustic operation system according to claim 1, wherein the second operation position is present in a direction from the target microphone associated with the target device toward the first operation position.

The sound source distance calculation unit calculates the first sound source distance and the second sound source distance from a time difference between a generation time of the operation sound and a detection time of the operation sound detected by the target microphone. Acoustic operation system.

The sound source distance calculation unit receives the operation sound data using a reference microphone disposed at a known reference distance from the sound source at the first operation position and the second operation position, and the target microphone The first sound source distance and the second sound source distance are calculated using the detected time of the detected operation sound, the detected time of the operation sound detected by the reference microphone, and the reference distance. Acoustic operation system.

The acoustic operation system according to claim 4, wherein the reference microphone is mounted on a portable electronic device, and the sound source is a user's fingertip or palm.

The acoustic operation system according to claim 5, wherein the portable electronic device is equipped with a motion sensor that outputs state information for acquiring the moving distance.

The acoustic operation system according to claim 4, wherein the reference microphone is mounted on a portable electronic device, and the sound source is a part of a casing of the portable electronic device.

The acoustic operation system according to claim 4, further comprising: an acoustic operation recognition unit that determines the effectiveness of the acoustic operation based on a temporal generation pattern of detection sound including noise detected by the target microphone and the reference microphone.

The acoustic operation recognition unit detects the same generated sound by the target microphone and the reference microphone based on a difference between a detection time of the detection sound detected by the target microphone and a detection time of the detection sound detected by the reference microphone. The acoustic operation system according to claim 4, wherein the acoustic operation is invalidated when it is determined that the detected sound pair does not form the operation sound pair.

The acoustic operation recognition unit discards the detection sound when a time difference between a detection time of a preceding detection sound detected by the reference microphone or the target microphone and a detection time of a subsequent detection sound is a predetermined value or more. 4. The acoustic operation system according to 4.

The acoustic operation system according to claim 1, further comprising an operation command generation unit that outputs operation commands corresponding to a plurality of operation sounds respectively generated at the first operation position and the second operation position to the target device. .

A network server capable of configuring an acoustic operation system for operating a target target device selected by acoustic operation from a plurality of target devices,
A first operation sound data is received from a reference device having a reference microphone disposed at a known reference distance from the sound source at the first operation position and the second operation position so that the operation sound generated by the sound source can be detected. And the interface
A second interface for receiving the operation sound data from a plurality of target devices each including a target microphone for detecting the operation sound generated by the sound source at the first operation position and the second operation position;
A first sound source between the target microphone and the first operation position based on a difference between a propagation time of the operation sound from the sound source to the target microphone, a propagation time from the sound source to the reference microphone, and the reference distance A sound source distance calculating unit for calculating a distance and a second sound source distance between the target microphone and the second operation position;
A target device that identifies the target target device on which the reference microphone selected based on the first sound source distance, the second sound source distance, and the moving distance from the first operation position to the second operation position is mounted A specific part,
A network server including an operation command generation unit that transmits an operation command to the target device through the second interface;

A control device mounted on a target device that can be operated by acoustic operation,
An interface for receiving the operation sound data from a reference device having a reference microphone disposed at a known reference distance from the sound source at the first operation position and the second operation position so that the operation sound generated by the sound source can be detected; ,
A target microphone for detecting the operation sound;
A first sound source between the target microphone and the first operation position based on a difference between a propagation time of the operation sound from the sound source to the target microphone, a propagation time from the sound source to the reference microphone, and the reference distance A sound source distance calculation unit for calculating a distance and a second sound source distance between the target microphone and the second operation position;
A target device specifying unit that recognizes the target device based on the first sound source distance, the second sound source distance, and a moving distance from the first operation position to the second operation position. Control device.

A portable electric device capable of operating a target target device selected by acoustic operation from a plurality of target devices,
A reference microphone disposed at a known reference distance from the sound source at the first operation position and the second operation position so that the operation sound generated by the sound source can be detected;
An interface for receiving the operation sound data from the plurality of target devices each having a target microphone;
A first sound source between the target microphone and the first operation position based on a difference between a propagation time of the operation sound from the sound source to the target microphone, a propagation time from the sound source to the reference microphone, and the reference distance A sound source distance calculation unit for calculating a distance and a second sound source distance between the target microphone and the second operation position;
A target device specifying unit for specifying the target target device selected based on the first sound source distance, the second sound source distance, and the movement distance from the first operation position to the second operation position;
A portable electronic device comprising: an operation command generation unit that transmits an operation command to the target device through the interface.

A method in which an acoustic operation system operates a target target device selected from a plurality of target devices according to an acoustic operation,
The plurality of target devices detecting an operation sound generated by the sound source at the first operation position and an operation sound generated at the second operation position;
Based on the propagation time of the operation sound from the sound source to the target device, a first sound source distance from the first operation position to the target device and a second distance from the second operation position to the target device. Calculating the sound source distance;
Obtaining a moving distance between the first operation position and the second operation position;
A method comprising: specifying the target device based on the first sound source distance, the second sound source distance, and the moving distance; and sending an operation command to the target device.

The second operation position exists in a direction from the first operation position toward the target device;
16. The step of identifying the target device includes comparing an evaluation parameter obtained by subtracting the reference distance from a difference between the first sound source distance and the second sound source distance with a predetermined threshold value. Method.

The second operation position exists in a direction from the first operation position toward the target device;
The method according to claim 16, wherein identifying the target device includes comparing the evaluation parameters calculated for the plurality of target devices with each other.

Calculating the first sound source distance and the second sound source distance;
Obtaining a first detection time at which the operation sound is detected at a known reference distance from the sound source;
Obtaining a second detection time at which each of the plurality of target devices detects the operation sound;
16. The method of claim 15, comprising calculating a difference between the second detection time and the first detection time.

Detecting a first number of operation sounds at the first operation position;
Detecting a second number of operation sounds at the second operation position;
Generating a plurality of the operation commands by a combination of the first number and the second number;
The method according to claim 15, further comprising disabling the acoustic operation when the first number or the second number is equal to or greater than a predetermined value.

In an acoustic operation system for operating a target target device selected from a plurality of target devices in accordance with an acoustic operation,
A function of the plurality of target devices to detect an operation sound generated by the sound source at the first operation position and an operation sound generated at the second operation position;
Based on the propagation time of the operation sound from the sound source to the target device, a first sound source distance from the first operation position to the target device and a second distance from the second operation position to the target device. A function to calculate the sound source distance,
A function of acquiring a movement distance between the first operation position and the second operation position;
A computer program for realizing a function of specifying the target device based on the first sound source distance, the second sound source distance, and the moving distance.