JP2003061184A

JP2003061184A - Information processing apparatus and method, information generating device and method, recording medium and program

Info

Publication number: JP2003061184A
Application number: JP2001245991A
Authority: JP
Inventors: Tetsujiro Kondo; 哲二郎近藤; Yuji Okumura; 裕二奥村; Koichi Fujishima; 幸一藤島; Tomoyuki Otsuki; 知之大月
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2001-08-14
Filing date: 2001-08-14
Publication date: 2003-02-28
Anticipated expiration: 2021-08-14
Also published as: JP4752153B2

Abstract

PROBLEM TO BE SOLVED: To provide an image processing apparatus that can utilize voice uttered by a user so as to detect a direction of the face of the user. SOLUTION: Microphone amplifiers 2L, 2R amplify sound signal of voice collected by microphones 1L, 1R placed at a prescribed position and uttered by a prescribed sound source, generate processing sound signals L, R and supply them to a detector 3. The detector 3 respectively calculates sound pressures L, R of the processing sound signals L, R and also calculates a sound pressure ratio (= sound pressure R/sound pressure L). The detector 3 detects a direction of the sound source corresponding to the calculated sound pressure ratio on the basis of data denoting a cross-reference between the sound pressure ratio and the direction of the sound source possessed by itself.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、情報処理装置およ
び方法、情報生成装置および方法、記録媒体、並びにプ
ログラムに関し、特に、音源が発した音声を利用して、
音源の位置や向きを検出することができるようにした情
報処理装置および方法、情報生成装置および方法、記録
媒体、並びにプログラムに関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an information processing apparatus and method, an information generating apparatus and method, a recording medium, and a program.
The present invention relates to an information processing apparatus and method, an information generating apparatus and method, a recording medium, and a program capable of detecting a position and an orientation of a sound source.

【０００２】[0002]

【従来の技術】例えば、遠隔している複数の会議室にお
ける画像や音声を、ネットワークを介して相互に通信
し、各会議室において、他の会議室の映像や音声を再生
することにより、あたかも１つのテーブルを囲んでいる
かのように会議を行うことができる遠隔システムが存在
する。2. Description of the Related Art For example, images and sounds in a plurality of remote conference rooms are communicated with each other via a network, and in each conference room, images and sounds in other conference rooms are reproduced, so that the images and sounds are reproduced. There are remote systems that allow meetings to take place as if surrounding a single table.

【０００３】この遠隔会議システムにおいては、会議に
参加しているユーザの顔の向きに応じて、映像の表示や
音声の出力が制御される。すなわち、ユーザの顔の向き
が検出されるようになされている。In this remote conference system, display of video and output of audio are controlled in accordance with the direction of the face of the user participating in the conference. That is, the direction of the user's face is detected.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、特殊な
センサを利用したり、ユーザの顔の部分の画像データを
利用して、ユーザの顔の向きを検出する方法は、開示さ
れているが、ユーザが発した音声を利用して、ユーザの
顔の向きを検出する方法は、まだ開示されていない。However, a method for detecting the orientation of the user's face using a special sensor or using image data of the user's face has been disclosed. A method for detecting the orientation of the user's face using the voice emitted by the user has not been disclosed yet.

【０００５】本発明はこのような状況に鑑みてなされた
ものであり、音源が発した音声を利用して、音源の顔の
向きを検出することができるようにするものである。[0005] The present invention has been made in view of such circumstances, and it is an object of the present invention to detect the direction of the face of a sound source using the sound emitted by the sound source.

【０００６】[0006]

【課題を解決するための手段】本発明の情報処理装置
は、所定の位置に設置された複数の集音装置により集音
された、所定の音源が発した音声に対応する音声信号か
ら、音源の状態に対応する特徴量を算出する第１の算出
手段と、第１の算出手段により算出された特徴量に基づ
いて、音源の状態を検出する状態検出手段とを備えるこ
とを特徴とする。According to the present invention, there is provided an information processing apparatus comprising: a sound source corresponding to a sound generated by a predetermined sound source, which is collected by a plurality of sound collecting devices installed at predetermined positions; And a state detecting means for detecting the state of the sound source based on the characteristic amount calculated by the first calculating means.

【０００７】状態は、所定の位置にある音源の向きとす
ることができる。The state can be the direction of a sound source at a predetermined position.

【０００８】状態検出手段は、予め求められた、特徴量
と状態との対応関係を示す対応関係データを保持し、対
応関係データおよび特徴量に基づいて、状態を検出する
ことができる。[0008] The state detecting means holds correspondence data indicating a correspondence relationship between the characteristic amount and the state, which is obtained in advance, and can detect the state based on the correspondence data and the characteristic amount.

【０００９】第１の算出手段は、第１の集音装置により
集音された、音源が発した音声に対応する第１の音声信
号、および第２の集音装置により集音された、音源が発
した音声に対応する第２の音声信号から、第１の音声信
号の音圧と第２の音声信号の音圧との音圧比を、特徴量
として算出し、状態検出手段は、予め求められた、音圧
比と状態との対応関係を示す対応関係データを保持し、
対応関係データおよび音圧比に基づいて、状態を検出す
ることができる。The first calculating means comprises: a first sound signal corresponding to the sound emitted by the sound source, collected by the first sound collecting device; and a sound source collected by the second sound collecting device. The sound pressure ratio between the sound pressure of the first sound signal and the sound pressure of the second sound signal is calculated as a feature amount from the second sound signal corresponding to the sound emitted by the device. Holding the corresponding relationship data indicating the corresponding relationship between the sound pressure ratio and the state,
The state can be detected based on the correspondence data and the sound pressure ratio.

【００１０】第１の算出手段は、第１の集音装置により
集音された、音源が発した音声に対応する第１の音声信
号、および第２の集音装置により集音された、音源が発
した音声に対応する第２の音声信号から、第１の音声信
号の高周波部分からなる第１の高周波音声信号の音圧
と、第２の音声信号の高周波部分からなる第２の高周波
音声信号の音圧との音圧比を、特徴量として算出し、状
態検出手段は、予め求められた、音圧比と状態との対応
関係を示す対応関係データを保持し、対応関係データお
よび音圧比に基づいて、状態を検出することができる。The first calculating means includes a first sound signal corresponding to the sound emitted by the sound source, collected by the first sound collecting device, and a sound source collected by the second sound collecting device. From the second sound signal corresponding to the sound generated by the first sound signal, the sound pressure of the first high-frequency sound signal consisting of the high-frequency part of the first sound signal, and the second high-frequency sound consisting of the high-frequency part of the second sound signal The sound pressure ratio between the sound pressure of the signal and the sound pressure ratio is calculated as a characteristic amount, and the state detection unit holds the previously obtained correspondence data indicating the correspondence between the sound pressure ratio and the state. Based on this, the state can be detected.

【００１１】第１の算出手段は、第１の集音装置により
集音された、音源が発した音声に対応する第１の音声信
号、および第２の集音装置により集音された、音源が発
した音声に対応する第２の音声信号から、第１の音声信
号の位相と第２の音声信号の位相との位相差を、特徴量
として算出し、状態検出手段は、予め求められた、位相
差と状態との対応関係を示す対応関係データを保持し、
対応関係データおよび位相差に基づいて、状態を検出す
ることができる。The first calculating means includes a first sound signal corresponding to the sound emitted by the sound source, collected by the first sound collecting device, and a sound source collected by the second sound collecting device. The phase difference between the phase of the first audio signal and the phase of the second audio signal is calculated as a feature amount from the second audio signal corresponding to the voice emitted by the device, and the state detection means is obtained in advance. Holds correspondence data indicating the correspondence between the phase difference and the state,
The state can be detected based on the correspondence data and the phase difference.

【００１２】第１の算出手段は、第１の集音装置、第２
の集音装置、および第３の集音装置により集音された、
所定の範囲内に位置する音源が発した音声に対応する第
１の音声信号、第２の音声信号、および第３の音声信号
から、第１の音声信号の位相と第２の音声信号の位相と
の第１の位相差、および第１の音声信号の位相と第３の
音声信号の位相との第２の位相差を、特徴量として算出
し、状態検出手段は、第１の位相差および第２の位相差
に基づいて、音源の、平面上の位置を検出することがで
きる。The first calculating means includes a first sound collecting device, a second sound collecting device,
A sound collector, and a sound collected by a third sound collector,
From the first audio signal, the second audio signal, and the third audio signal corresponding to the audio emitted from the sound source located within the predetermined range, the phases of the first audio signal and the second audio signal are obtained. , And a second phase difference between the phase of the first audio signal and the phase of the third audio signal are calculated as a feature quantity, and the state detecting means calculates the first phase difference and Based on the second phase difference, the position of the sound source on a plane can be detected.

【００１３】第１の集音装置、第２の集音装置、および
第３の集音装置のそれぞれにより集音された、所定の範
囲内に設けられた複数の実験位置に位置した音源が発し
た音声に対応する第４の音声信号、第５の音声信号、お
よび第６の音声信号から、第４の音声信号の位相と第５
の音声信号の位相との第３の位相差、および第４の音声
信号の位相と第６の音声信号との第４の位相差を、実験
位置毎に算出し、さらに、第３の位相差と実験位置との
関係に基づく第１の係数、および第４の位相差と実験位
置との関係に基づく第２の係数を算出する第２の算出手
段と、第２の算出手段により算出された第１の係数およ
び第２の係数を記憶する記憶手段とをさらに設け、状態
検出手段には、第１の位相差と第１の係数との関係、お
よび第２の位相差と第２の係数との関係に基づいて、音
源の、平面上の位置を検出させることができる。[0013] Sound sources located at a plurality of experimental positions provided within a predetermined range, which are collected by each of the first sound collecting device, the second sound collecting device, and the third sound collecting device, are emitted. From the fourth audio signal, the fifth audio signal, and the sixth audio signal corresponding to the extracted audio, the phase of the fourth audio signal and the fifth
A third phase difference between the phase of the audio signal and the fourth phase difference between the phase of the fourth audio signal and the sixth audio signal are calculated for each experimental position. A second coefficient based on the relationship between the first phase difference and the experimental position, and a second coefficient based on the relationship between the fourth phase difference and the experimental position. Storage means for storing the first coefficient and the second coefficient is further provided; and the state detection means includes a relation between the first phase difference and the first coefficient, and a relation between the second phase difference and the second coefficient. Based on the relationship, the position of the sound source on the plane can be detected.

【００１４】第１の算出手段は、第２の音声信号および
第３の音声信号から、第２の音声信号の音圧と第３の音
声信号の音圧との第１の音圧比を、特徴量として、さら
に算出し、状態検出手段は、第１の音圧比および検出し
た音源の位置に基づいて、音源の向きを検出することが
できる。The first calculating means is configured to calculate a first sound pressure ratio between the sound pressure of the second sound signal and the sound pressure of the third sound signal from the second sound signal and the third sound signal. The amount is further calculated, and the state detection means can detect the direction of the sound source based on the first sound pressure ratio and the detected position of the sound source.

【００１５】第２の集音装置および第３の集音装置のそ
れぞれにより集音された、複数の基準位置に位置し、所
定の方向を向いた音源が発した音声に対応する第４の音
声信号および第５の音声信号から、第４の音声信号の音
圧と第５の音声信号の音圧の第２の音圧比を算出し、さ
らに、第２の音圧比と所定の方向との関係に基づく係数
を、基準位置毎に算出する第２の算出手段と、係数を、
基準位置に対応させて記憶する記憶手段とをさらに設
け、状態検出手段には、検出した音源の位置に対応する
基準位置を検出するとともに、記憶手段から、検出した
基準位置に対応する係数を選択し、選択した係数および
第１の音圧比との関係に基づいて、音源の向きを検出さ
せることができる。A fourth sound corresponding to a sound emitted by a sound source which is located at a plurality of reference positions and is oriented in a predetermined direction, which is collected by each of the second sound collecting device and the third sound collecting device; A second sound pressure ratio between the sound pressure of the fourth sound signal and the sound pressure of the fifth sound signal is calculated from the signal and the fifth sound signal, and further, a relationship between the second sound pressure ratio and a predetermined direction. A second calculating means for calculating a coefficient based on each of the reference positions,
Storage means for storing the reference position corresponding to the reference position, wherein the state detection means detects a reference position corresponding to the position of the detected sound source, and selects a coefficient corresponding to the detected reference position from the storage means. Then, the direction of the sound source can be detected based on the relationship between the selected coefficient and the first sound pressure ratio.

【００１６】第１の算出手段は、第１の集音装置、第２
の集音装置、第３の集音装置、および第４の集音装置に
より集音された、所定の範囲内に位置する音源が発した
音声に対応する第１の音声信号、第２の音声信号、第３
の音声信号、および第４の音声信号から、第１の音声信
号の位相と第２の音声信号の位相との第１の位相差、第
２の音声信号の位相と第３の音声信号の位相との第２の
位相差、および第３の音声信号の位相と第４の音声信号
の位相との第３の位相差を、特徴量として算出し、状態
検出手段は、第１の位相差、第２の位相差、および第３
の位相差に基づいて、音源の、空間上の位置を検出する
ことができる。The first calculating means comprises a first sound collecting device, a second sound collecting device,
A first sound signal and a second sound corresponding to sounds emitted by a sound source located within a predetermined range, which are collected by the sound collecting device, the third sound collecting device, and the fourth sound collecting device. Signal, third
From the audio signal and the fourth audio signal, a first phase difference between the phase of the first audio signal and the phase of the second audio signal, the phase of the second audio signal, and the phase of the third audio signal , And a third phase difference between the phase of the third audio signal and the phase of the fourth audio signal are calculated as a feature quantity, and the state detection unit calculates the first phase difference, A second phase difference, and a third
Based on the phase difference of the sound source, the position of the sound source in space can be detected.

【００１７】第１の集音装置、第２の集音装置、第３の
集音装置、および第４の集音装置のそれぞれにより集音
された、所定の範囲内に設けられた複数の実験位置に位
置した音源が発した音声に対応する第５の音声信号、第
６の音声信号、第７の音声信号、および第８の音声信号
から、第５の音声信号の位相と第６の音声信号の位相と
の第４の位相差、第６の音声信号の位相と第７の音声信
号の位相との第５の位相差、および第７の音声信号の位
相と第８の音声信号の位相との第６の位相差を、実験位
置毎に算出し、さらに、第４の位相差と実験位置との関
係に基づく第１の係数、第５の位相差と実験位置との関
係に基づく第２の係数、および第６の位相差と実験位置
との関係に基づく第３の係数を算出する第２の算出手段
と、第２の算出手段により算出された第１の係数、第２
の係数、および第３の係数を記憶する記憶手段とをさら
に設け、状態検出手段には、第１の位相差と第１の係数
との関係、第２の位相差と第２の係数との関係、および
第３の位相差と第３の係数との関係に基づいて、音源
の、空間上の位置を検出させることができる。A plurality of experiments provided within a predetermined range and collected by each of the first sound collecting device, the second sound collecting device, the third sound collecting device, and the fourth sound collecting device. From the fifth audio signal, the sixth audio signal, the seventh audio signal, and the eighth audio signal corresponding to the audio emitted by the sound source located at the position, the phase of the fifth audio signal and the sixth audio signal A fourth phase difference from the signal phase, a fifth phase difference between the sixth audio signal phase and the seventh audio signal phase, and a seventh audio signal phase and an eighth audio signal phase Is calculated for each experimental position, a first coefficient based on the relationship between the fourth phase difference and the experimental position, and a sixth coefficient based on the relationship between the fifth phase difference and the experimental position. A second calculating means for calculating a second coefficient and a third coefficient based on a relationship between the sixth phase difference and the experimental position; and a second calculating means. First coefficient calculated by the second
And a storage means for storing a third coefficient and a third coefficient. The state detection means includes a relation between the first phase difference and the first coefficient, and a relation between the second phase difference and the second coefficient. The position of the sound source in space can be detected based on the relationship and the relationship between the third phase difference and the third coefficient.

【００１８】第１の算出手段は、第１の音声信号、第２
の音声信号、第３の音声信号、および第４の音声信号か
ら、第１の音声信号の音圧と第２の音声信号の音圧との
第１の音圧比、第２の音声信号の音圧と第３の音声信号
の音圧との第２の音圧比、および第３の音声信号の音圧
と第４の音声信号の音圧との第３の音圧比を、特徴量と
して、さらに算出し、状態検出手段は、第１の音圧比、
第２の音圧比、および第３の音圧比、並びに検出した音
源の位置に基づいて、音源の向きを検出することができ
る。The first calculating means comprises a first audio signal, a second audio signal,
The first sound pressure ratio between the sound pressure of the first sound signal and the sound pressure of the second sound signal, and the sound of the second sound signal from the sound signal, the third sound signal, and the fourth sound signal. The second sound pressure ratio between the pressure and the sound pressure of the third sound signal, and the third sound pressure ratio between the sound pressure of the third sound signal and the sound pressure of the fourth sound signal are further defined as characteristic amounts. The state detection means calculates the first sound pressure ratio,
The direction of the sound source can be detected based on the second sound pressure ratio, the third sound pressure ratio, and the detected position of the sound source.

【００１９】第１の集音装置、第２の集音装置、第３の
集音装置、および第４の集音装置のそれぞれにより集音
された、複数の基準位置に位置し、所定の方向を向いた
音源が発した音声に対応する第５の音声信号、第６の音
声信号、第７の音声信号、および第８の音声信号から、
第５の音声信号の音圧と第６の音声信号の音圧との第４
の音圧比、第６の音声信号の音圧と第７の音声信号の音
圧との第５の音圧比、および第７の音声信号の音圧と第
８の音声信号の音圧との第６の音圧比をそれぞれ算出す
る第２の算出手段と、基準位置に対応させて、第４の音
圧比、第５の音圧比、および第６の音圧比、並びに方向
を記憶する記憶手段とをさらに設け、状態検出手段に
は、記憶手段から、検出した音源の位置に対応する基準
位置、および第１の音圧比、第２の音圧比と第３の音圧
比に対応する第４の音圧比、第５の音圧比と第６の音圧
比に対応する方向を、音源の向きとして検出させること
ができる。The sound pickup device is located at a plurality of reference positions, each of which is collected by each of the first sound collection device, the second sound collection device, the third sound collection device, and the fourth sound collection device, in a predetermined direction. From the fifth audio signal, the sixth audio signal, the seventh audio signal, and the eighth audio signal corresponding to the audio emitted by the sound source directed to
The fourth of the sound pressure of the fifth sound signal and the sound pressure of the sixth sound signal
Sound pressure ratio, a fifth sound pressure ratio between the sound pressure of the sixth sound signal and the sound pressure of the seventh sound signal, and a fifth sound pressure ratio between the sound pressure of the seventh sound signal and the sound pressure of the eighth sound signal. And second storage means for storing the fourth sound pressure ratio, the fifth sound pressure ratio, the sixth sound pressure ratio, and the direction corresponding to the reference position. The state detecting means is further provided with a reference position corresponding to the position of the detected sound source, and a fourth sound pressure ratio corresponding to the first sound pressure ratio, the second sound pressure ratio, and the third sound pressure ratio. , The direction corresponding to the fifth sound pressure ratio and the sixth sound pressure ratio can be detected as the direction of the sound source.

【００２０】本発明の情報処理方法は、所定の位置に設
置された複数の集音装置により集音された、所定の音源
が発した音声に対応する音声信号から、音源の状態に対
応する特徴量を算出する算出ステップと、算出ステップ
の処理で算出された特徴量に基づいて、音源の状態を検
出する状態検出ステップとを含むことを特徴とする。The information processing method according to the present invention is characterized in that a sound signal corresponding to the state of a sound source is obtained from a sound signal corresponding to a sound emitted from a predetermined sound source, which is collected by a plurality of sound collecting devices installed at predetermined positions. It is characterized by including a calculating step of calculating the amount and a state detecting step of detecting the state of the sound source based on the characteristic amount calculated in the processing of the calculating step.

【００２１】本発明の第１の記録媒体のプログラムは、
所定の位置に設置された複数の集音装置により集音され
た、所定の音源が発した音声に対応する音声信号から、
音源の状態に対応する特徴量を算出する算出ステップ
と、算出ステップの処理で算出された特徴量に基づい
て、音源の状態を検出する状態検出ステップとを含むこ
とを特徴とする。[0021] The program of the first recording medium of the present invention comprises:
From sound signals corresponding to sounds emitted by a predetermined sound source, collected by a plurality of sound collection devices installed at predetermined positions,
The method includes a calculating step of calculating a feature amount corresponding to a state of the sound source, and a state detecting step of detecting a state of the sound source based on the feature amount calculated in the processing of the calculating step.

【００２２】本発明の第１のプログラムは、所定の位置
に設置された複数の集音装置により集音された、所定の
音源が発した音声に対応する音声信号から、音源の状態
に対応する特徴量を算出する算出ステップと、算出ステ
ップの処理で算出された特徴量に基づいて、音源の状態
を検出する状態検出ステップとを含むことを特徴とす
る。A first program according to the present invention corresponds to a state of a sound source from an audio signal corresponding to a sound emitted by a predetermined sound source, which is collected by a plurality of sound collection devices installed at predetermined positions. It is characterized by including a calculation step of calculating a feature amount, and a state detection step of detecting a state of the sound source based on the feature amount calculated in the processing of the calculation step.

【００２３】本発明の情報処理装置および方法、並びに
第１のプログラムにおいては、所定の位置に設置された
複数の集音装置により集音された、所定の音源が発した
音声に対応する音声信号から、音源の状態に対応する特
徴量が算出され、算出された特徴量に基づいて、音源の
状態が検出される。[0023] In the information processing apparatus and method and the first program according to the present invention, an audio signal corresponding to a sound emitted from a predetermined sound source, collected by a plurality of sound collection devices installed at predetermined positions. Then, the feature amount corresponding to the state of the sound source is calculated, and the state of the sound source is detected based on the calculated feature amount.

【００２４】本発明の情報生成装置は、第１の集音装
置、第２の集音装置、および第３の集音装置のそれぞれ
により集音された、所定の範囲内に設けられた複数の実
験位置に位置した音源が発した音声に対応する第４の音
声信号、第５の音声信号、および第６の音声信号から、
第４の音声信号の位相と第５の音声信号の位相との第３
の位相差、および第４の音声信号の位相と第６の音声信
号との第４の位相差を、実験位置毎に算出する第１の算
出手段と、第３の位相差と実験位置との関係に基づい
て、第１の係数を算出し、第４の位相差と実験位置との
関係に基づいて、第２の係数を算出する第２の算出手段
とを備えることを特徴とする。The information generating apparatus according to the present invention comprises a plurality of sound collecting units provided within a predetermined range and collected by each of the first sound collecting unit, the second sound collecting unit, and the third sound collecting unit. From the fourth audio signal, the fifth audio signal, and the sixth audio signal corresponding to the audio emitted by the sound source located at the experiment position,
The third of the phase of the fourth audio signal and the phase of the fifth audio signal
A first calculating means for calculating, for each experimental position, a phase difference between the third phase difference and the experimental position, and a fourth phase difference between the fourth audio signal and the sixth audio signal. A second coefficient calculating means for calculating a first coefficient based on the relation and calculating a second coefficient based on a relation between the fourth phase difference and the experimental position.

【００２５】検出装置は、第２の集音装置および第３の
集音装置のそれぞれにより集音された、複数の基準位置
に位置する音源が発した音声に対応する第７の音声信号
および第８の音声信号から、第７の音声信号の音圧と第
８の音声信号の音圧の音圧比を算出し、さらに、音圧比
と第３の係数との関係に基づいて、音源の向きをさらに
検出し、第２の集音装置および第３の集音装置のそれぞ
れにより集音された、複数の基準位置に位置する音源が
発した音声に対応する第９の音声信号および第１０の音
声信号から、第９の音声信号の音圧と第１０の音声信号
の音圧の音圧比を算出する第３の算出手段と、第９の音
声信号の音圧と第１０の音声信号の音圧の音圧比と実験
位置との関係に基づいて、第３の係数を算出する第４の
算出手段とをさらに設けることができる。The detection device includes a seventh sound signal and a seventh sound signal corresponding to sounds emitted by sound sources located at a plurality of reference positions, which are collected by the second sound collection device and the third sound collection device, respectively. The sound pressure ratio between the sound pressure of the seventh sound signal and the sound pressure of the eighth sound signal is calculated from the sound signal of No. 8, and the direction of the sound source is further determined based on the relationship between the sound pressure ratio and the third coefficient. The ninth sound signal and the tenth sound corresponding to the sounds emitted from the sound sources located at the plurality of reference positions, which are detected and collected by the second sound collecting device and the third sound collecting device, respectively. Third calculating means for calculating the sound pressure ratio between the sound pressure of the ninth sound signal and the sound pressure of the tenth sound signal from the signal, and the sound pressure of the ninth sound signal and the sound pressure of the tenth sound signal And a fourth calculating means for calculating a third coefficient based on a relationship between the sound pressure ratio of the first position and the experimental position. It can be provided.

【００２６】本発明の情報生成方法は、第１の集音装
置、第２の集音装置、および第３の集音装置のそれぞれ
により集音された、所定の範囲内に設けられた複数の実
験位置に位置した音源が発した音声に対応する第４の音
声信号、第５の音声信号、および第６の音声信号から、
第４の音声信号の位相と第５の音声信号の位相との第３
の位相差、および第４の音声信号の位相と第６の音声信
号との第４の位相差を、実験位置毎に算出する第１の算
出ステップと、第３の位相差と実験位置との関係に基づ
いて、第１の係数を算出し、第４の位相差と実験位置と
の関係に基づいて、第２の係数を算出する第２の算出ス
テップとを含むことを特徴とする。[0026] The information generating method of the present invention comprises the steps of: collecting a plurality of sounds provided by a first sound collecting device, a second sound collecting device, and a third sound collecting device and provided in a predetermined range; From the fourth audio signal, the fifth audio signal, and the sixth audio signal corresponding to the audio emitted by the sound source located at the experiment position,
The third of the phase of the fourth audio signal and the phase of the fifth audio signal
A first calculating step of calculating, for each experimental position, a phase difference between the third phase difference and the experimental position, and a fourth phase difference between the fourth audio signal and the sixth audio signal. A second calculating step of calculating a first coefficient based on the relationship and calculating a second coefficient based on a relationship between the fourth phase difference and the experimental position.

【００２７】本発明の第２の記録媒体のプログラムは、
第１の集音装置、第２の集音装置、および第３の集音装
置のそれぞれにより集音された、所定の範囲内に設けら
れた複数の実験位置に位置した音源が発した音声に対応
する第４の音声信号、第５の音声信号、および第６の音
声信号から、第４の音声信号の位相と第５の音声信号の
位相との第３の位相差、および第４の音声信号の位相と
第６の音声信号との第４の位相差を、実験位置毎に算出
する第１の算出ステップと、第３の位相差と実験位置と
の関係に基づいて、第１の係数を算出し、第４の位相差
と実験位置との関係に基づいて、第２の係数を算出する
第２の算出ステップとを含むことを特徴とする。The program of the second recording medium of the present invention comprises:
The sound collected by each of the first sound collecting device, the second sound collecting device, and the third sound collecting device, and emitted from the sound sources located at a plurality of experimental positions provided within a predetermined range. A third phase difference between the phase of the fourth audio signal and the phase of the fifth audio signal, and the fourth audio from the corresponding fourth, fifth, and sixth audio signals A first calculating step of calculating a fourth phase difference between the signal phase and the sixth audio signal for each experimental position, and a first coefficient based on a relationship between the third phase difference and the experimental position. And a second calculating step of calculating a second coefficient based on the relationship between the fourth phase difference and the experimental position.

【００２８】本発明の第２のプログラムは、第１の集音
装置、第２の集音装置、および第３の集音装置のそれぞ
れにより集音された、所定の範囲内に設けられた複数の
実験位置に位置した音源が発した音声に対応する第４の
音声信号、第５の音声信号、および第６の音声信号か
ら、第４の音声信号の位相と第５の音声信号の位相との
第３の位相差、および第４の音声信号の位相と第６の音
声信号との第４の位相差を、実験位置毎に算出する第１
の算出ステップと、第３の位相差と実験位置との関係に
基づいて、第１の係数を算出し、第４の位相差と実験位
置との関係に基づいて、第２の係数を算出する第２の算
出ステップとを含む処理をコンピュータに実行させるこ
とを特徴とする。The second program according to the present invention includes a plurality of programs provided within a predetermined range and collected by each of the first sound collecting device, the second sound collecting device, and the third sound collecting device. From the fourth, fifth, and sixth audio signals corresponding to the audio emitted by the sound source located at the experimental position, the phase of the fourth audio signal and the phase of the fifth audio signal are calculated. And the fourth phase difference between the phase of the fourth audio signal and the sixth audio signal is calculated for each experimental position.
And the first coefficient is calculated based on the relationship between the third phase difference and the experimental position, and the second coefficient is calculated based on the relationship between the fourth phase difference and the experimental position. The program is characterized by causing a computer to execute a process including a second calculating step.

【００２９】本発明の情報生成装置および方法、並びに
第２のプログラムにおいては、第１の集音装置、第２の
集音装置、および第３の集音装置のそれぞれにより集音
された、所定の範囲内に設けられた複数の実験位置に位
置した音源が発した音声に対応する第４の音声信号、第
５の音声信号、および第６の音声信号から、第４の音声
信号の位相と第５の音声信号の位相との第３の位相差、
および第４の音声信号の位相と第６の音声信号との第４
の位相差が、実験位置毎に算出され、第３の位相差と実
験位置との関係に基づいて、第１の係数が算出され、第
４の位相差と実験位置との関係に基づいて、第２の係数
が算出される。According to the information generating apparatus and method of the present invention, and the second program, the predetermined sound collected by each of the first sound collecting device, the second sound collecting device, and the third sound collecting device. From the fourth, fifth, and sixth audio signals corresponding to the sounds emitted by the sound sources located at the plurality of experimental positions provided in the range of the fourth audio signal, A third phase difference from the phase of the fifth audio signal,
And the fourth of the phase of the fourth audio signal and the sixth audio signal
Is calculated for each experimental position, a first coefficient is calculated based on the relationship between the third phase difference and the experimental position, and based on the relationship between the fourth phase difference and the experimental position, A second coefficient is calculated.

【００３０】[0030]

BEST MODE FOR CARRYING OUT THE INVENTION

【００３１】図１は、本発明を適用した音源状態検出シ
ステムの第１の実施の形態の構成例を示している。この
システムでは、所定の位置にある音源が発した音声か
ら、音源の向きが検出される。なお、ここでは、音源
は、所定の位置にあるユーザの口元とし、ユーザが発し
た音声から、ユーザの口元の向き（ユーザの顔の向き）
が検出される。FIG. 1 shows a configuration example of a first embodiment of a sound source state detection system to which the present invention is applied. In this system, the direction of the sound source is detected from the sound emitted by the sound source at a predetermined position. Here, the sound source is the mouth of the user at a predetermined position, and the direction of the mouth of the user (the direction of the face of the user) is determined based on the voice uttered by the user.
Is detected.

【００３２】マイクロフォン（以下、マイクと略称す
る）１Ｌおよびマイク１Ｒは、図２（Ａ）に示すよう
に、ユーザの頭部の中心点が位置Ａに位置する場合にお
いて、ユーザの顔が正面方向を向いているとき、マイク
１Ｌとユーザの口元（音源）の距離Ｌと、マイクＲと音
源の距離Ｒが、それぞれ等しくなる位置に設置されてい
る。As shown in FIG. 2A, when the center point of the user's head is located at position A, the microphone (hereinafter referred to as microphone) 1L and microphone 1R Is located at a position where the distance L between the microphone 1L and the mouth (sound source) of the user and the distance R between the microphone R and the sound source are equal to each other.

【００３３】図１に戻り、マイク１Ｌおよびマイク１Ｒ
のそれぞれは、音源であるユーザの口元から発せられた
音声を集音し、マイクアンプ２Ｌおよびマイクアンプ２
Ｒに供給する。Returning to FIG. 1, the microphone 1L and the microphone 1R
Collects the sound emitted from the mouth of the user, which is the sound source, and collects the microphone amplifier 2L and the microphone amplifier 2L.
Supply to R.

【００３４】マイクアンプ２Ｌは、マイク１Ｌからの音
声信号を増幅して、処理用音声信号Ｌを生成し、検出装
置３に供給する。マイクアンプ２Ｒは、マイク１Ｒから
の音声信号を増幅して、処理用音声信号Ｒを生成し、検
出装置３に供給する。The microphone amplifier 2L amplifies the audio signal from the microphone 1L, generates a processing audio signal L, and supplies it to the detection device 3. The microphone amplifier 2R amplifies the audio signal from the microphone 1R, generates a processing audio signal R, and supplies the processing audio signal R to the detection device 3.

【００３５】検出装置３は、マイクアンプ２Ｌから供給
される処理用音声信号Ｌおよびマイクアンプ２Ｒから供
給される処理用音声信号Ｒから、処理用音声信号Ｌの音
圧Ｌと処理用音声信号Ｒの音圧Ｒとの音圧比ＲＬ（＝音
圧Ｒ／音圧Ｌ）を算出する。From the processing audio signal L supplied from the microphone amplifier 2L and the processing audio signal R supplied from the microphone amplifier 2R, the detection device 3 determines the sound pressure L of the processing audio signal L and the processing audio signal R The sound pressure ratio RL (= sound pressure R / sound pressure L) to the sound pressure R is calculated.

【００３６】検出装置３は、算出した音圧比ＲＬに対応
する音源の向き、すなわち、この例の場合、ユーザの口
元の向き（ユーザの顔の向き）を、図３に示す、音圧比
ＲＬと口元（音源）の向きとの対応関係を示すデータを
参照することで検出し、それを、図示せぬ外部の装置に
出力する。The detection device 3 determines the direction of the sound source corresponding to the calculated sound pressure ratio RL, that is, in this example, the direction of the mouth of the user (the direction of the face of the user) to the sound pressure ratio RL shown in FIG. Detection is performed by referring to data indicating a correspondence relationship with the direction of the mouth (sound source), and the detection is output to an external device (not shown).

【００３７】ところで、音声の音圧は、音源からの距離
が短ければ短いほど大きくなることから、図２（Ｃ）に
示すように、ユーザの顔（口元）が正面方向に対して右
側を向いており、距離Ｒが、距離Ｌより短い場合、この
とき検出される、マイク１Ｒにより集音される、ユーザ
が発した音声の音圧（処理用音声信号Ｒの音圧Ｒ）は、
マイク１Ｌにより集音される、ユーザが発した音声の音
圧（処理用音声信号Ｌの音圧Ｌ）より大きなる。すなわ
ちこの場合、音圧比ＲＬは、１より大きな所定の値とな
る。By the way, since the sound pressure of the voice increases as the distance from the sound source decreases, as shown in FIG. 2C, the face (mouth) of the user faces rightward with respect to the front direction. If the distance R is shorter than the distance L, the sound pressure detected by the microphone 1R and detected by the user (sound pressure R of the processing sound signal R) is:
The sound pressure is higher than the sound pressure of the sound emitted by the user (sound pressure L of the processing sound signal L) collected by the microphone 1L. That is, in this case, the sound pressure ratio RL becomes a predetermined value larger than 1.

【００３８】また、図２（Ｂ）に示すように、口元が正
面方向に対して左側を向いており、距離Ｒが、距離Ｌよ
り長い場合、音圧Ｒは、音圧Ｌより小さくなるので、音
圧比ＲＬは、１より小さい所定の値となる。Further, as shown in FIG. 2B, when the mouth is directed to the left with respect to the front direction and the distance R is longer than the distance L, the sound pressure R becomes smaller than the sound pressure L. , The sound pressure ratio RL is a predetermined value smaller than 1.

【００３９】また、図２（Ａ）に示すように、口元が正
面方向を向いており、距離Ｒが、距離Ｌと等しい場合、
音圧Ｒは、音圧Ｌと等しくなるので、音圧比ＲＬは、１
となる。Also, as shown in FIG. 2A, when the mouth is in the front direction and the distance R is equal to the distance L,
Since the sound pressure R is equal to the sound pressure L, the sound pressure ratio RL is 1
It becomes.

【００４０】このように、音圧比ＲＬ（＝音圧Ｒ／音圧
Ｌ）は、口元（音源）の向き（正確には、口元とマイク
１との距離）に対応するので、例えば、口元が向く方向
を変え、そのとき発せられた音声の音圧比ＲＬ、および
口元が向く方向を表す値を集計することで、図３に示す
データが得られる。As described above, the sound pressure ratio RL (= sound pressure R / sound pressure L) corresponds to the direction of the mouth (sound source) (more precisely, the distance between the mouth and the microphone 1). The data shown in FIG. 3 is obtained by changing the heading direction and summing up the sound pressure ratio RL of the sound emitted at that time and the value indicating the heading direction.

【００４１】図４は、検出装置３の構成例を示してい
る。FIG. 4 shows an example of the configuration of the detection device 3.

【００４２】入力部１１は、マイクアンプ２Ｌおよびマ
イクアンプ２Ｒのそれぞれから適宜供給される処理用音
声信号Ｌおよび処理用音声信号Ｒを入力する。入力部１
１は、入力した、処理用音声信号Ｌ，Ｒの一部（例え
ば、所定の時間内に入力した部分）を、特徴量算出部１
２に供給する。The input section 11 inputs the processing audio signal L and the processing audio signal R appropriately supplied from each of the microphone amplifier 2L and the microphone amplifier 2R. Input unit 1
Reference numeral 1 designates a part of the input processing audio signals L and R (for example, a part input within a predetermined time) as a feature value calculation unit 1.
Feed to 2.

【００４３】特徴量算出部１２は、入力部１１から供給
された処理用音声信号Ｌ，Ｒから、それぞれの音圧Ｌ，
Ｒを算出するとともに、音圧比ＲＬ（＝音圧Ｒ／音圧
Ｌ）を算出し、それを、検出部１３に供給する。＄検出
部１３は、図３に示した、音圧比ＲＬと口元（音源）の
対応関係を示すデータを保持しており、そのデータに基
づいて、特徴量算出部１２から供給された音圧比ＲＬに
対応する口元の向きを検出し、出力部１４に供給する。The feature quantity calculating section 12 converts the sound pressures L, R from the processing sound signals L, R supplied from the input section 11 into respective sound pressures L, R.
R is calculated, and a sound pressure ratio RL (= sound pressure R / sound pressure L) is calculated and supplied to the detection unit 13. The detection unit 13 holds data indicating the correspondence between the sound pressure ratio RL and the mouth (sound source) shown in FIG. 3, and based on the data, the sound pressure ratio RL supplied from the feature amount calculation unit 12. Is detected, and supplied to the output unit 14.

【００４４】出力部１４は、検出部１３から供給された
口元（音源）の向きを、外部の装置に供給する。The output unit 14 supplies the direction of the mouth (sound source) supplied from the detection unit 13 to an external device.

【００４５】図５は、特徴量算出部１２の構成例を示し
ている。入力部１１から供給された処理用音声信号Ｌ
は、音圧算出部２１Ｌに供給され、処理用音声信号Ｒ
は、音圧算出部２１Ｒに、それぞれ供給される。FIG. 5 shows an example of the configuration of the feature amount calculation unit 12. Processing audio signal L supplied from input unit 11
Is supplied to the sound pressure calculation unit 21L, and the processing audio signal R
Are supplied to the sound pressure calculation unit 21R.

【００４６】音圧算出部２１Ｌは、例えば、入力された
処理用音声信号Ｌが示す値の２乗平均値、２乗平均の平
方根、または絶対値の平均値を、処理用音声信号Ｌの音
圧Ｌとして算出し、その算出結果を、音圧比算出部２２
に供給する。音圧算出部２１Ｒは、音圧算出部２１Ｌと
同様の方法で、処理用音声信号Ｒの音圧Ｒを算出し、音
圧比算出部２２に供給する。The sound pressure calculating section 21L calculates, for example, the mean square value, the square root of the mean square, or the average value of the absolute values of the values indicated by the input processing audio signal L, Pressure L, and the result of the calculation is used as the sound pressure ratio calculation unit 22.
To supply. The sound pressure calculation unit 21R calculates the sound pressure R of the processing audio signal R in the same manner as the sound pressure calculation unit 21L, and supplies the sound pressure R to the sound pressure ratio calculation unit 22.

【００４７】音圧比算出部２２は、音圧算出部２１Ｌか
らの音圧Ｌと、音圧算出部２１Ｒからの音圧Ｒに基づい
て、音圧比ＲＬ（＝音圧Ｒ／音圧Ｌ）を算出し、その算
出結果を、検出部１３に供給する。The sound pressure ratio calculator 22 calculates the sound pressure ratio RL (= sound pressure R / sound pressure L) based on the sound pressure L from the sound pressure calculator 21L and the sound pressure R from the sound pressure calculator 21R. It calculates and supplies the calculation result to the detection unit 13.

【００４８】次に、第１の実施の形態における音源状態
検出システムの動作を、図６のフローチャートを参照し
て説明する。Next, the operation of the sound source state detection system according to the first embodiment will be described with reference to the flowchart of FIG.

【００４９】ステップＳ１において、マイク１Ｌ，１Ｒ
のそれぞれは、所定の方向を向いているユーザが発した
（所定の方向を向いている口元からの）音声を集音し、
その結果得られた音声信号を、マイクアンプ２Ｌ，２Ｒ
に供給する。In step S1, the microphones 1L, 1R
Each collects the sound (from the lips facing the predetermined direction) emitted by the user facing the predetermined direction,
The resulting audio signal is transmitted to microphone amplifiers 2L, 2R
To supply.

【００５０】ステップＳ２において、マイクアンプ２
Ｌ，２Ｒのそれぞれは、マイク１Ｌ，１Ｒから供給され
た音声信号を増幅して、処理用音声信号Ｌ，Ｒを生成
し、検出装置３に供給する。In step S2, the microphone amplifier 2
Each of L and 2R amplifies the audio signal supplied from the microphones 1L and 1R, generates processing audio signals L and R, and supplies the processed audio signals L and R to the detection device 3.

【００５１】ステップＳ３において、検出装置３は、マ
イクアンプ２Ｌ，２Ｒのそれぞれから供給された処理用
音声信号Ｌ，Ｒに基づいて、口元（音源）の向きを検出
する。ここでの処理の詳細は、図７のフローチャートに
示されている。In step S3, the detection device 3 detects the direction of the mouth (sound source) based on the processing audio signals L and R supplied from the microphone amplifiers 2L and 2R, respectively. Details of the processing here are shown in the flowchart of FIG.

【００５２】ステップＳ１１において、検出装置３の入
力部１１は、入力したマイクアンプ２Ｌ，２Ｒのそれぞ
れからの処理用音声信号Ｌ，Ｒの一部（所定の時間内に
入力した部分）を、特徴量算出部１２に供給する。In step S11, the input unit 11 of the detecting device 3 outputs a part of the input processing audio signals L and R from the microphone amplifiers 2L and 2R (portion input within a predetermined time). It is supplied to the quantity calculation unit 12.

【００５３】次に、ステップＳ１２において、特徴量算
出部１２（音圧算出部２１Ｌ，２１Ｒ）は、入力部１１
から供給された処理用音声信号Ｌ，Ｒから、それらの音
圧Ｌ，Ｒをそれぞれ算出する。Next, in step S12, the characteristic amount calculation unit 12 (sound pressure calculation units 21L and 21R)
From the processing audio signals L and R supplied from the computer, respectively.

【００５４】ステップＳ１３において、特徴量算出部１
２（音圧比算出部２２）は、ステップＳ１２で算出した
音圧Ｌ，Ｒに基づいて、音圧比ＲＬ（＝音圧Ｒ／音圧
Ｌ）を算出し、検出部１３に供給する。In step S13, the characteristic amount calculation unit 1
2 (sound pressure ratio calculation unit 22) calculates a sound pressure ratio RL (= sound pressure R / sound pressure L) based on the sound pressures L and R calculated in step S <b> 12, and supplies the calculated sound pressure ratio RL to the detection unit 13.

【００５５】次に、ステップＳ１４において、検出部１
３は、音圧比ＲＬと口元（音源）の向きとの対応関係を
示すデータ（図３）を参照して、特徴量算出部１２から
供給された音圧比ＲＬに対応する音源の向きを検出し、
出力部１４に供給する。Next, in step S14, the detecting unit 1
3 detects the direction of the sound source corresponding to the sound pressure ratio RL supplied from the feature amount calculation unit 12 with reference to the data (FIG. 3) indicating the correspondence between the sound pressure ratio RL and the direction of the mouth (sound source). ,
It is supplied to the output unit 14.

【００５６】ステップＳ１５において、出力部１４は、
検出部１３から供給された口元（音源）の向きを、外部
の装置に出力する。In step S15, the output unit 14
The direction of the mouth (sound source) supplied from the detection unit 13 is output to an external device.

【００５７】その後、検出装置３の処理は終了し、音源
状態検出システムの動作（図６）も、終了する。Thereafter, the processing of the detection device 3 ends, and the operation of the sound source state detection system (FIG. 6) also ends.

【００５８】図８は、本発明を適用した音源状態検出シ
ステムの第２の実施の形態の構成例を示している。この
システムでは、第１の実施の形態（図１）における場合
と同様に、所定の位置にある音源が発した音声から、音
源の向きが検出される。FIG. 8 shows a configuration example of a sound source state detection system according to a second embodiment of the present invention. In this system, as in the case of the first embodiment (FIG. 1), the direction of the sound source is detected from the sound emitted by the sound source at a predetermined position.

【００５９】このシステムには、図１に示す検出装置３
に代えて、検出装置３１が設けられている。その他の構
成は、図１における場合と同様であり、その説明は適宜
省略する。This system includes a detecting device 3 shown in FIG.
, A detection device 31 is provided. Other configurations are the same as those in FIG. 1, and the description thereof will be omitted as appropriate.

【００６０】マイク１Ｌとマイク１Ｒは、第１の実施の
形態における場合と同様の位置に設置されている（図
９）。The microphone 1L and the microphone 1R are installed at the same positions as in the first embodiment (FIG. 9).

【００６１】検出装置３１は、マイクアンプ２Ｌから供
給される処理用音声信号Ｌおよびマイクアンプ２Ｒから
供給される処理用音声信号Ｒから、処理用音声信号Ｌの
高周波部分（以下、高周波処理用音声信号Ｌhと称す
る）の音圧Ｌhと、処理用音声信号Ｒの高周波部分（以
下、高周波処理用音声信号Ｒhと称する）の音圧Ｒhとの
音圧比ＲＬh（＝音圧Ｒh／音圧Ｈh）を算出する。The detecting device 31 extracts a high-frequency portion (hereinafter referred to as a high-frequency processing audio signal) of the processing audio signal L from the processing audio signal L supplied from the microphone amplifier 2L and the processing audio signal R supplied from the microphone amplifier 2R. A sound pressure ratio RLh (= sound pressure Rh / sound pressure Hh) between a sound pressure Lh of the signal Lh) and a sound pressure Rh of a high-frequency portion of the processing sound signal R (hereinafter, referred to as a high-frequency processing sound signal Rh). Is calculated.

【００６２】検出装置３１は、算出した音圧比ＲＬhに
対応する口元（音源）の向きを、図１０に示す、音圧比
ＲＬhと口元の向きとの対応関係を示すデータに基づい
て検出し、それを、図示せぬ外部の装置に出力する。The detecting device 31 detects the direction of the mouth (sound source) corresponding to the calculated sound pressure ratio RLh based on the data shown in FIG. 10 indicating the correspondence between the sound pressure ratio RLh and the direction of the mouth. Is output to an external device (not shown).

【００６３】ところで、音声、特にその高周波部分には
指向性があるので、その高周波部分の音圧は、音源に対
する角度が小さければ小さいほど大きくなる。このこと
から、図９（Ｃ）に示すように、口元が正面方向に対し
て右側を向いており、角度Ｒが、角度Ｌより小さい場
合、このとき検出される、マイク１Ｒにより集音され
る、ユーザが発した音声の高周波部分の音圧（高周波処
理用音声信号Ｒhの音圧Ｒh）は、マイク１Ｌにより集音
される、ユーザが発した音声の高周波部分の音圧（高周
波処理用音声信号Ｌhの音圧Ｌh）よりも大きくなる。す
なわちこの場合、音圧比ＲＬhは、１より大きい所定の
値となる。By the way, since sound, especially its high frequency part has directivity, the sound pressure of the high frequency part becomes larger as the angle with respect to the sound source becomes smaller. From this, as shown in FIG. 9C, when the mouth faces rightward with respect to the front direction and the angle R is smaller than the angle L, sound is collected by the microphone 1R detected at this time. The sound pressure of the high-frequency part of the sound emitted by the user (the sound pressure Rh of the sound signal Rh for high-frequency processing) is collected by the microphone 1L, It becomes larger than the sound pressure Lh) of the signal Lh. That is, in this case, the sound pressure ratio RLh is a predetermined value larger than 1.

【００６４】また、図９（Ｂ）に示すように、口元が正
面方向に対して左側を向いており、角度Ｒが、角度Ｌよ
り大きい場合、音圧Ｒhは、音圧Ｌhよりも小さくなるの
で、音圧比ＲＬhは、１より小さい所定の値となる。As shown in FIG. 9 (B), when the mouth is directed to the left with respect to the front direction and the angle R is larger than the angle L, the sound pressure Rh becomes smaller than the sound pressure Lh. Therefore, the sound pressure ratio RLh becomes a predetermined value smaller than 1.

【００６５】また、図９（Ａ）に示すように、口元が正
面方向を向いており、角度Ｒが、角度Ｌと等しい場合、
音圧Ｒhは、音圧Ｌhと等しくなるので、音圧比ＲＬh
は、１となる。Further, as shown in FIG. 9A, when the mouth is directed to the front and the angle R is equal to the angle L,
Since the sound pressure Rh is equal to the sound pressure Lh, the sound pressure ratio RLh
Becomes 1.

【００６６】このように、音圧比ＲＬhは、口元（音
源）の向き（正確には、口元が向いている方向と、口元
の位置からのマイク１が位置する方向との角度）に対応
するので、口元が向く方向を変え、そのとき発せられた
音声の音圧比ＲＬh、および口元が向く方向を表す値を
集計することで、図１０に示すデータが得られる。As described above, the sound pressure ratio RLh corresponds to the direction of the mouth (sound source) (more precisely, the angle between the direction in which the mouth is facing and the direction in which the microphone 1 is located from the position of the mouth). The direction shown in FIG. 10 is obtained by changing the direction in which the lips face and changing the sound pressure ratio RLh of the sound emitted at that time and the value indicating the direction in which the lips face.

【００６７】図１１は、検出装置３１の構成例を示して
いる。この検出装置３１には、図４の検出装置３の特徴
量算出部１２および検出部１３に代えて、特徴量算出部
４１および検出部４２が設けられている。他の部分は、
図４における場合と同様である。FIG. 11 shows a configuration example of the detection device 31. The detection device 31 includes a feature value calculation unit 41 and a detection unit 42 instead of the feature value calculation unit 12 and the detection unit 13 of the detection device 3 in FIG. The other part is
This is the same as in FIG.

【００６８】特徴量算出部４１は、入力部１１から供給
された処理用音声信号Ｌ，Ｒの高周波部分のみを通過さ
せることで得られた、高周波処理用音声信号Ｌhおよび
高周波処理用音声信号Ｒhから、それらの音圧Ｌhおよび
音圧Ｒhをそれぞれ算出するとともに、算出した音圧Ｌh
と音圧Ｒhの音圧比ＲＬh（＝音圧Ｒh／音圧Ｌh）を算出
し、検出部４２に供給する。The feature value calculating section 41 outputs the high-frequency processing audio signal Lh and the high-frequency processing audio signal Rh obtained by passing only the high-frequency portions of the processing audio signals L and R supplied from the input section 11. From these, the sound pressure Lh and the sound pressure Rh are calculated respectively, and the calculated sound pressure Lh
And a sound pressure ratio RLh (= sound pressure Rh / sound pressure Lh) of the sound pressure Rh and the sound pressure Rh.

【００６９】検出部４２は、図１０に示した、音圧比Ｒ
Ｌhと口元（音源）の向きとの対応関係を示すデータを
保持しており、そのデータに基づいて、特徴量算出部４
１から供給された音圧比ＲＬhに対応する口元の向きを
検出し、出力部１４に供給する。The detector 42 detects the sound pressure ratio R shown in FIG.
It holds data indicating the correspondence between Lh and the direction of the mouth (sound source), and based on the data, the feature amount calculation unit 4
The direction of the mouth corresponding to the sound pressure ratio RLh supplied from 1 is detected and supplied to the output unit 14.

【００７０】図１２は、特徴量算出部４１の構成例を示
している。この特徴量算出部４１には、図５の特徴量算
出部１２の音圧算出部２１Ｌおよび音圧算出部２１Ｒの
前段に、ハイパスフィルタ（ＨＰＦ）５１Ｌおよびハイ
パスフィルタ５１Ｒが設けられている。他の構成は、図
５における場合と同様である。FIG. 12 shows an example of the configuration of the feature amount calculator 41. In the feature amount calculation unit 41, a high-pass filter (HPF) 51L and a high-pass filter 51R are provided before the sound pressure calculation unit 21L and the sound pressure calculation unit 21R of the feature amount calculation unit 12 in FIG. Other configurations are the same as those in FIG.

【００７１】ハイパスフィルタ５１Ｌは、入力部１１か
ら供給された処理用音声信号Ｌの高周波部分のみを通過
させて、音圧算出部２１Ｌに供給する。すなわち、高周
波処理用音声信号Ｌhが、音圧算出部２１Ｌに供給され
る。The high-pass filter 51L passes only the high-frequency portion of the processing audio signal L supplied from the input unit 11 and supplies it to the sound pressure calculation unit 21L. That is, the high-frequency processing audio signal Lh is supplied to the sound pressure calculation unit 21L.

【００７２】ハイパスフィルタ５１Ｒは、入力部１１か
ら供給された処理用音声信号Ｒの高周波部分のみを通過
させて、音圧算出部２１Ｒに供給する。すなわち、高周
波処理用音声信号Ｒhが、音圧算出部２１Ｒに供給され
る。The high-pass filter 51R allows only the high-frequency portion of the processing audio signal R supplied from the input unit 11 to pass and supplies it to the sound pressure calculation unit 21R. That is, the high-frequency processing audio signal Rh is supplied to the sound pressure calculation unit 21R.

【００７３】図１３は、ハイパスフィルタ５１のカット
オフ特性を示している。カットオフ周波数ｆは、例え
ば、それを固定の値とすることや、ユーザの音声のピッ
チ周波数に比例する周波数とすることができる。FIG. 13 shows the cut-off characteristic of the high-pass filter 51. The cutoff frequency f can be, for example, a fixed value or a frequency proportional to the pitch frequency of the user's voice.

【００７４】ここでピッチ周波数とは、図１４に示すよ
うなユーザの音声の波形の中で繰り返される基本的な波
形の周期Ｔの逆数である。ピッチ周波数は、人それぞれ
により異なる。Here, the pitch frequency is the reciprocal of the period T of the basic waveform repeated in the waveform of the user's voice as shown in FIG. The pitch frequency is different for each person.

【００７５】図１２に戻り、音圧算出部２１Ｌは、ハイ
パスフィルタ５１Ｌから供給される高周波処理用音声信
号Ｌhの音圧Ｌhを算出し、音圧比算出部２２に供給す
る。音圧算出部２１Ｒは、ハイパスフィルタ５１Ｒから
供給される高周波処理用音声信号Ｒhの音圧Ｒhを算出
し、音圧比算出部２２に供給する。Returning to FIG. 12, the sound pressure calculator 21L calculates the sound pressure Lh of the high-frequency processing sound signal Lh supplied from the high-pass filter 51L, and supplies the sound pressure Lh to the sound pressure ratio calculator 22. The sound pressure calculation unit 21R calculates the sound pressure Rh of the high-frequency processing audio signal Rh supplied from the high-pass filter 51R, and supplies the sound pressure Rh to the sound pressure ratio calculation unit 22.

【００７６】音圧比算出部２２は、音圧算出部２１Ｌか
らの音圧Ｌhと、音圧算出部２１Ｒからの音圧Ｒhとの音
圧比ＲＬh（＝音圧Ｒh／音圧Ｌh）を算出し、検出部４
２に供給する。The sound pressure ratio calculator 22 calculates a sound pressure ratio RLh (= sound pressure Rh / sound pressure Lh) between the sound pressure Lh from the sound pressure calculator 21L and the sound pressure Rh from the sound pressure calculator 21R. , Detection unit 4
Feed to 2.

【００７７】次に、第２の実施の形態における音源状態
検出システムの動作を、図１５のフローチャートを参照
して説明する。Next, the operation of the sound source state detection system according to the second embodiment will be described with reference to the flowchart of FIG.

【００７８】ステップＳ２１，Ｓ２２においては、図６
のステップＳ１，Ｓ２における場合と同様の処理が実行
されるので、その説明は省略する。In steps S21 and S22, FIG.
Since the same processes as those in steps S1 and S2 are performed, description thereof will be omitted.

【００７９】ステップＳ２３において、検出装置３１
は、マイクアンプ２Ｌ，２Ｒのそれぞれから供給された
処理用音声信号Ｌ，Ｒに基づいて、口元（音源）の向き
を検出する。ここでの処理の詳細は、図１６のフローチ
ャートに示されている。In step S23, the detecting device 31
Detects the direction of the mouth (sound source) based on the processing audio signals L and R supplied from the microphone amplifiers 2L and 2R, respectively. Details of the processing here are shown in the flowchart of FIG.

【００８０】ステップＳ３１において、入力部１１は、
マイクアンプ２Ｌ，２Ｒのそれぞれからの処理用音声信
号Ｌ，Ｒの一部（例えば、所定の時間内に入力した部
分）を、特徴量算出部４１に供給する。In step S31, the input unit 11
A part of the processing audio signals L and R from each of the microphone amplifiers 2L and 2R (for example, a part input within a predetermined time) is supplied to the feature amount calculation unit 41.

【００８１】次に、ステップＳ３２において、特徴量算
出部４１（ハイパスフィルタ５１Ｌ，５１Ｒ）は、入力
部１１から供給された処理用音声信号Ｌ，Ｒの高周波部
分を濾過し、高周波処理用音声信号Ｌh，Ｒhを生成す
る。Next, in step S 32, the characteristic amount calculating section 41 (high-pass filters 51 L, 51 R) filters the high-frequency portions of the processing audio signals L, R supplied from the input section 11, and Lh and Rh are generated.

【００８２】ステップＳ３３において、特徴量算出部４
１（音圧算出部２１Ｌ，２１Ｒ）は、ステップＳ３２で
生成した高周波処理用音声信号Ｌh，Ｒhから、音圧Ｌ
h，Ｒhをそれぞれ算出する。In step S33, the characteristic amount calculating section 4
1 (sound pressure calculation units 21L and 21R) calculates the sound pressure L from the high-frequency processing sound signals Lh and Rh generated in step S32.
h and Rh are calculated respectively.

【００８３】次に、ステップＳ３４において、特徴量算
出部４１（音圧比算出部２２）は、ステップＳ３３で算
出した音圧Ｌh，Ｒhに基づいて、音圧比ＲＬh（＝音圧
Ｒh／音圧Ｌh）を算出し、検出部４２に供給する。Next, in step S34, the characteristic amount calculation unit 41 (sound pressure ratio calculation unit 22) uses the sound pressure ratio RLh (= sound pressure Rh / sound pressure Lh) based on the sound pressures Lh and Rh calculated in step S33. ) Is calculated and supplied to the detection unit 42.

【００８４】ステップＳ３５において、検出部４２は、
音圧比ＲＬhと口元（音源）の向きとの対応関係を示す
データ（図１０）を参照して、特徴量算出部４１から供
給された音圧比ＲＬhに対応する音源の向きを検出し、
出力部１４に供給する。In step S35, the detection unit 42
With reference to data (FIG. 10) indicating the correspondence between the sound pressure ratio RLh and the direction of the mouth (sound source), the direction of the sound source corresponding to the sound pressure ratio RLh supplied from the feature amount calculation unit 41 is detected.
It is supplied to the output unit 14.

【００８５】次に、ステップＳ３６において、出力部１
４は、検出部４２から供給された口元（音源）を、外部
の装置に出力する。Next, in step S36, the output unit 1
4 outputs the mouth (sound source) supplied from the detection unit 42 to an external device.

【００８６】その後、検出装置３１の処理は終了し、音
源状態検出システムの動作（図１５）も終了する。Thereafter, the processing of the detection device 31 ends, and the operation of the sound source state detection system (FIG. 15) also ends.

【００８７】図１７は、本発明を適用した音源状態検出
システムの第３の実施の形態の構成例を示している。こ
のシステムでは、第１の実施の形態（図１）および第２
の実施の形態（図８）における場合と同様に、所定の位
置にある音源が発した音声から、音源の向きが検出され
る。FIG. 17 shows a configuration example of a third embodiment of a sound source state detection system to which the present invention is applied. In this system, the first embodiment (FIG. 1) and the second embodiment
As in the case of the embodiment (FIG. 8), the direction of the sound source is detected from the sound emitted by the sound source at the predetermined position.

【００８８】このシステムには、図１に示す検出装置３
に代えて、検出装置６１が設けられている。他の構成に
ついては同様である。＄マイク１Ｌとマイク１Ｒは、第
１の実施の形態における場合と同様の位置に設置されて
いる（図２）。This system includes the detecting device 3 shown in FIG.
, A detection device 61 is provided. The same applies to other configurations. (4) The microphone 1L and the microphone 1R are installed at the same positions as those in the first embodiment (FIG. 2).

【００８９】検出装置６１は、マイクアンプ２Ｌから供
給される処理用音声信号Ｌおよびマイクアンプ２Ｒから
供給される処理用音声信号Ｒから、処理用音声信号Ｒの
位相に対する処理用音声信号Ｌの位相のずれ（位相差）
を算出する。検出装置６１は、算出した位相差に対応す
るユーザの口元（音源）の向きを、図１８に示す、位相
差と口元（音源）の向きとの対応関係を示すデータに基
づいて検出し、図示せぬ外部の装置に出力する。The detection device 61 determines the phase of the processing audio signal L with respect to the phase of the processing audio signal R from the processing audio signal L supplied from the microphone amplifier 2L and the processing audio signal R supplied from the microphone amplifier 2R. Deviation (phase difference)
Is calculated. The detection device 61 detects the direction of the user's lip (sound source) corresponding to the calculated phase difference based on the data indicating the correspondence between the phase difference and the direction of the lip (sound source) shown in FIG. Output to an external device not shown.

【００９０】ところで、音声は、マイク１と音源との距
離が長ければ長いほど、マイク１に到達するのに時間が
かかり、また後に到達した音声の位相は、先に到達した
音声の位相に比べ遅れる特徴がある。このことから、例
えば、図２（Ｂ）に示すように、口元が正面方向に対し
て左側を向いており、距離Ｒが、距離Ｌより長い場合、
このときユーザから発せられた音声は、マイク１Ｌに先
に到達し、その後マイク１Ｒに到達するので、このとき
検出されるマイク１Ｒにより集音される音声（処理用音
声信号Ｒ）の位相（図１９（Ｂ））は、マイク１Ｌによ
り集音される音声（処理用音声信号Ｌ）の位相（図１９
（Ａ））に比べ遅れる。すなわちこの場合、処理用音声
信号Ｒの位相に対する処理用音声信号Ｌの位相の差は、
負の値となる。By the way, the longer the distance between the microphone 1 and the sound source, the longer it takes to reach the microphone 1, and the phase of the sound that arrives later is smaller than the phase of the sound that arrives first. There is a delay feature. From this, for example, as shown in FIG. 2 (B), when the mouth is directed to the left with respect to the front direction and the distance R is longer than the distance L,
At this time, since the sound emitted from the user reaches the microphone 1L first and then reaches the microphone 1R, the phase of the sound (processing sound signal R) collected by the microphone 1R detected at this time (see FIG. 19 (B)) is the phase of the sound (processing sound signal L) collected by the microphone 1L (FIG. 19).
(A)). That is, in this case, the difference between the phase of the processing audio signal R and the phase of the processing audio signal R is
This will be a negative value.

【００９１】なおこのとき、距離Ｌは、距離Ｒより短い
ので、図１９（Ａ）に示す処理用音声信号Ｌのレベル
は、図１９（Ｂ）に示す処理用音声信号Ｒのレベルより
高くなる。At this time, since the distance L is shorter than the distance R, the level of the processing audio signal L shown in FIG. 19A is higher than the level of the processing audio signal R shown in FIG. 19B. .

【００９２】また、図２（Ｃ）に示すように、口元が正
面方向に対して右側を向いており、距離Ｒが、距離Ｌよ
り短い場合、このときユーザから発せられた音声は、マ
イク１Ｒに先に到達し、その後マイク１Ｌに到達するの
で、処理用音声信号Ｒ（図２０（Ｂ））の位相は、処理
用音声信号Ｌ（図２０（Ａ））の位相に比べ進む。すな
わちこの場合、処理用音声信号Ｒの位相に対する処理用
音声信号Ｌの位相の差は、正の値となる。＄なおこのと
き、距離Ｒは、距離Ｌより短いので、図２０（Ｂ）に示
す処理用音声信号Ｒのレベルは、図２０（Ａ）に示す処
理用音声信号Ｌのレベルより高い。As shown in FIG. 2 (C), when the mouth is directed to the right with respect to the front direction and the distance R is shorter than the distance L, the voice uttered by the user at this time is the microphone 1R , And then reaches the microphone 1L, so that the phase of the processing audio signal R (FIG. 20B) advances as compared with the phase of the processing audio signal L (FIG. 20A). That is, in this case, the difference between the phase of the processing audio signal R and the phase of the processing audio signal L is a positive value. At this time, since the distance R is shorter than the distance L, the level of the processing audio signal R shown in FIG. 20B is higher than the level of the processing audio signal L shown in FIG.

【００９３】また、図２（Ａ）に示すように、ユーザの
顔が正面方向を向いており、距離Ｒが、距離Ｌと等しい
場合、このときユーザから発せられた音声は、マイク１
Ｒとマイク１Ｌに同時到達するので、処理用音声信号Ｒ
（図２１（Ｂ））の位相と、処理用音声信号Ｌ（図２１
（Ａ））の位相は、一致する。すなわちこの場合、処理
用音声信号Ｒの位相に対する処理用音声信号Ｌの位相の
位相差は、零となる。As shown in FIG. 2A, when the user's face is facing the front and the distance R is equal to the distance L, the voice uttered by the user at this time is
R and the microphone 1L at the same time, so that the processing audio signal R
(FIG. 21B) and the processing audio signal L (FIG. 21B).
The phases of (A)) match. That is, in this case, the phase difference between the phase of the processing audio signal L and the phase of the processing audio signal L is zero.

【００９４】なおこのとき、距離Ｒと距離Ｌは等しいの
で、図２１（Ｂ）に示す処理用音声信号Ｒのレベルと図
２１（Ａ）に示す処理用音声信号Ｌのレベルは同じにな
る。At this time, since the distance R is equal to the distance L, the level of the processing audio signal R shown in FIG. 21B and the level of the processing audio signal L shown in FIG.

【００９５】このように、位相差は、口元（音源）の向
きに対応するので、口元が向く方向を変え、そのとき発
せられた音声の位相差、および口元が向く方向を表す値
を集計することで、図１８に示すデータが得られる。As described above, since the phase difference corresponds to the direction of the mouth (sound source), the direction of the mouth is changed, and the phase difference of the sound emitted at that time and the value indicating the direction of the mouth are totaled. Thus, the data shown in FIG. 18 is obtained.

【００９６】図２２は、検出装置６１の構成例を示して
いる。この検出装置６１には、図４の検出装置３の特徴
量算出部１２および検出部１３に代えて、特徴量算出部
７１および検出部７２が設けられている。他の構成は、
図４における場合と同様である。FIG. 22 shows a configuration example of the detection device 61. The detection device 61 includes a feature value calculation unit 71 and a detection unit 72 instead of the feature value calculation unit 12 and the detection unit 13 of the detection device 3 in FIG. Other configurations are
This is the same as in FIG.

【００９７】特徴量算出部７１は、入力部１１から供給
された処理用音声信号Ｌと処理用音声信号Ｒから、処理
用音声信号Ｒの位相に対する処理用音声信号Ｌの位相の
位相差を算出する。The feature quantity calculating section 71 calculates a phase difference between the phase of the processing audio signal R and the phase of the processing audio signal R from the processing audio signal L and the processing audio signal R supplied from the input section 11. I do.

【００９８】具体的には、特徴量算出部７１は、処理用
音声信号Ｌ，Ｒを、所定のサンプリング周期Ｔでサンプ
リングして、量子化する。これにより、処理用音声信号
Ｌからは、Ｎ個のサンプルＬ₁乃至Ｌ_Nが、処理用音声信
号Ｒからは、Ｎ個のサンプルＲ₁乃至Ｒ_Nがそれぞれ得ら
れる。なお、サンプルＬまたはサンプルＲに付されてい
る下付の番号は、サンプリング順を表してる。すなわ
ち、サンプルＬ₁は、最初にサンプリングされたサンプ
ル（第１番目のサンプル）を示す。More specifically, the feature quantity calculating section 71 samples and quantizes the processing audio signals L and R at a predetermined sampling period T. Thus, the processing audio signals L, N samples L ₁ through L _N is, from the processing speech signals R, N samples R ₁ through R _N are obtained, respectively. The subscript number given to the sample L or the sample R indicates the sampling order. That is, samples L ₁ represents the first sampled samples (first-th sample).

【００９９】次に、特徴量算出部７１は、ｊが１乃至ｎ
であるときの式（１）をそれぞれ算出し、その算出結果
の値が最大となるｊの値ｍを検出する。そして特徴量算
出部７１は、検出した値ｍにサンプリング周期Ｔを掛け
合わせて得られた値（時間）を、処理用音声信号Ｒの位
相に対する処理用音声信号Ｌの位相の位相差とする。Next, the feature quantity calculation unit 71 determines that j is 1 to n.
Is calculated, and the value m of j at which the value of the calculation result is maximum is detected. Then, the feature amount calculating unit 71 sets a value (time) obtained by multiplying the detected value m by the sampling period T as a phase difference between the phase of the processing audio signal R and the phase of the processing audio signal L.

【数１】 (Equation 1)

【０１００】式中、Ｌi-jは、サンプルＬ_i-jの値を示
し、Ｒiは、サンプルＲ_iの値を示す。[0100] In the formula, Li-j indicates the value of the sample L _ij, Ri indicates the value of the sample R _i.

【０１０１】すなわち、処理用音声信号Ｒと処理用音声
信号Ｌとの相関値が、処理用音声信号Ｒに対して、処理
用音声信号Ｌが、ｎ個のサンプル分の時間まで、１サン
プリング時間ずつずらされて算出され、その算出結果に
おいて最も高い相関値が得られたときの時間のずれが、
処理用音声信号Ｒの位相に対する処理用音声信号Ｌの位
相の差とされる。That is, the correlation value between the processing audio signal R and the processing audio signal L is such that the processing audio signal L is equal to the processing audio signal R for one sampling time up to the time of n samples. Are calculated at the same time, and the time lag when the highest correlation value is obtained in the calculation result is
The difference is the difference between the phase of the processing audio signal R and the phase of the processing audio signal L.

【０１０２】検出部７２は、図１８に示した、位相差と
口元（音源）の向きとの対応関係を示すデータを保持し
ており、そのデータに基づいて、特徴量算出部７１から
供給された位相差に対応する音源の向きを検出し、出力
部１４に供給する。The detecting section 72 holds data indicating the correspondence between the phase difference and the direction of the mouth (sound source) shown in FIG. 18, and is supplied from the feature amount calculating section 71 based on the data. The direction of the sound source corresponding to the detected phase difference is detected and supplied to the output unit 14.

【０１０３】次に、第３の実施の形態における音源状態
検出システムの動作を、図２３のフローチャートを参照
して説明する。Next, the operation of the sound source state detection system according to the third embodiment will be described with reference to the flowchart in FIG.

【０１０４】ステップＳ４１，Ｓ４２においては、図６
のステップＳ１，Ｓ２における場合と同様の処理が実行
されるので、その説明は省略する。In steps S41 and S42, FIG.
Since the same processes as those in steps S1 and S2 are performed, description thereof will be omitted.

【０１０５】ステップＳ４３において、検出装置６１
は、マイクアンプ２Ｌ，２Ｒのそれぞれから供給された
処理用音声信号Ｌ，Ｒに基づいて、口元（音源）の向き
を検出する。ここでの処理の詳細は、図２４のフローチ
ャートに示されている。In step S43, the detecting device 61
Detects the direction of the mouth (sound source) based on the processing audio signals L and R supplied from the microphone amplifiers 2L and 2R, respectively. Details of the processing here are shown in the flowchart of FIG.

【０１０６】ステップＳ５１において、入力部１１は、
マイクアンプ２Ｌ，２Ｒのそれぞれからの処理用音声信
号Ｌ，Ｒの一部（例えば、所定の時間内に入力した部
分）を、特徴量算出部７１に供給する。In step S51, the input unit 11
A part of the processing audio signals L and R from each of the microphone amplifiers 2L and 2R (for example, a part input within a predetermined time) is supplied to the feature amount calculation unit 71.

【０１０７】次に、ステップＳ５２において、特徴量算
出部７１は、入力部１１から供給された処理用音声信号
Ｌ，Ｒから、処理用音声信号Ｒの位相に対する処理用音
声信号Ｌの位相の位相差を算出し、検出部７２に供給す
る。Next, in step S 52, the feature quantity calculating section 71 calculates the position of the phase of the processing audio signal L with respect to the phase of the processing audio signal R from the processing audio signals L and R supplied from the input section 11. The phase difference is calculated and supplied to the detection unit 72.

【０１０８】ステップＳ５３において、検出部７２は、
位相差と口元（音源）の向きとの対応関係を示すデータ
（図１８）を参照して、特徴量算出部７１から供給され
た位相差に対応する音源の向きを検出し、出力部１４に
供給する。In step S53, the detection unit 72
With reference to the data (FIG. 18) indicating the correspondence between the phase difference and the direction of the mouth (sound source), the direction of the sound source corresponding to the phase difference supplied from the feature amount calculation unit 71 is detected. Supply.

【０１０９】ステップＳ５４において、出力部１４は、
検出部７２から供給された口元（音源）の向きを、外部
の装置に出力する。In step S54, the output unit 14
The direction of the mouth (sound source) supplied from the detection unit 72 is output to an external device.

【０１１０】その後、検出装置６１の処理は終了し、音
源状態検出システムの動作（図２３）も、終了する。Thereafter, the processing of the detection device 61 ends, and the operation of the sound source state detection system (FIG. 23) also ends.

【０１１１】図２５は、本発明を適用した音源状態検出
システムの第４の実施の形態の構成例を示している。第
１の実施の形態乃至第３の実施の形態においては、音源
は、移動しないことを前提として、所定の位置ある音源
の向きが検出されたが、このシステムでは、所定の範囲
内で移動する音源の位置およびその向きが検出される。FIG. 25 shows a configuration example of a sound source state detection system according to a fourth embodiment of the present invention. In the first to third embodiments, the direction of the sound source at a predetermined position is detected on the assumption that the sound source does not move. However, in this system, the sound source moves within a predetermined range. The position of the sound source and its direction are detected.

【０１１２】このシステムには、図１に示す検出装置３
に代えて、検出装置１０１が設けられているとともに、
マイク１Ｌ、マイク１Ｃ、およびマイク１Ｒの３個のマ
イク１、並びにマイク１Ｌにより集音された音声を増幅
するマイクアンプ２Ｌ、マイク１Ｃにより集音された音
声を増幅するマイクアンプ２Ｃ、およびマイク１Ｒによ
り集音された音声を増幅するマイクアンプ２Ｒの３個の
マイクアンプ２が設けられている。In this system, the detecting device 3 shown in FIG.
Instead of the detection device 101 is provided,
Microphone 1L, microphone 1C, and microphone 1R, three microphones 1, microphone amplifier 2L for amplifying audio collected by microphone 1L, microphone amplifier 2C for amplifying audio collected by microphone 1C, and microphone 1R The three microphone amplifiers 2 of the microphone amplifier 2R for amplifying the sound collected by the microphone are provided.

【０１１３】マイク１Ｌおよびマイク１Ｒは、図２６に
示すように、例えば、ユーザの頭部の中心点が位置Ａに
位置する場合において、口元が正面方向を向いていると
き、マイク１Ｌと口元の距離Ｌと、マイク１Ｒと口元の
距離Ｒが、それぞれ等しくなる位置に設置されている。
マイク１Ｃは、正面方向に、位置Ａから所定の距離だけ
離れた位置に設置されている。As shown in FIG. 26, for example, when the center of the user's head is located at position A and the mouth is facing the front as shown in FIG. The distance L and the distance R between the microphone 1R and the mouth are set to be equal to each other.
The microphone 1C is installed at a position away from the position A by a predetermined distance in the front direction.

【０１１４】図２５に戻り、検出装置１０１は、マイク
アンプ２Ｃから供給される信号（以下、処理用音声信号
Ｃと称する）の位相に対する、マイクアンプ２Ｌから供
給された処理用音声信号Ｌの位相の位相差ＣＬ、および
処理用音声信号Ｃの位相に対する、マイクアンプ２Ｒか
ら供給された処理用音声信号Ｒの位相の位相差ＣＲ、お
よび所定の係数（後述）に基づいて、所定の範囲内で移
動する口元（音源）の位置を検出する。Returning to FIG. 25, the detection apparatus 101 detects the phase of the processing audio signal L supplied from the microphone amplifier 2L with respect to the phase of the signal supplied from the microphone amplifier 2C (hereinafter referred to as the processing audio signal C). Within a predetermined range based on the phase difference CL of the phase of the processing audio signal R supplied from the microphone amplifier 2R with respect to the phase difference CL of the phase and the phase of the processing audio signal C, and a predetermined coefficient (described later). The position of the moving mouth (sound source) is detected.

【０１１５】この例の場合、音源の位置は、図２７に示
すように、ユーザの頭部の中心点が位置Ａにある場合に
おいて、ユーザの顔が正面方向を向いているときの口元
（音源）の位置（以下、基準位置と称する）を原点とす
る座標（ｘ，ｙ）で示される。In the case of this example, as shown in FIG. 27, when the center of the user's head is at position A, as shown in FIG. ) (Hereinafter referred to as a reference position) as an origin (x, y).

【０１１６】検出装置１０１はまた、検出した口元（音
源）の位置に対応する所定の係数（後述）、および処理
用音声信号Ｌの音圧Ｌと処理用音声信号Ｒの音圧Ｒとの
音圧比ＲＬ（＝音圧Ｒ／音圧Ｌ）に基づいて、音源の向
きを検出する。The detection apparatus 101 also generates a predetermined coefficient (described later) corresponding to the detected position of the mouth (sound source), and the sound pressure L of the processing audio signal L and the sound pressure R of the processing audio signal R. The direction of the sound source is detected based on the pressure ratio RL (= sound pressure R / sound pressure L).

【０１１７】この例の場合、音源の向きは、図２７に示
すように、Ｙ軸に対する角度θで示される。In the case of this example, the direction of the sound source is indicated by an angle θ with respect to the Y axis as shown in FIG.

【０１１８】図２８は、検出装置１０１の構成例を示し
ている。FIG. 28 shows a configuration example of the detection device 101.

【０１１９】この検出装置１０１には、図４の検出装置
３の特徴量算出部１２および検出部１３に代えて、特徴
量算出部１１１および検出部１１２が設けられている。
他の構成は、図４における場合と同様である。The detection device 101 is provided with a feature value calculation unit 111 and a detection unit 112 instead of the feature value calculation unit 12 and the detection unit 13 of the detection device 3 in FIG.
Other configurations are the same as those in FIG.

【０１２０】入力部１１は、マイクアンプ２Ｌ，２Ｃ，
２Ｒのそれぞれから適宜供給される処理用音声信号Ｌ，
Ｃ，Ｒを入力し、特徴量算出部１１１に供給する。The input section 11 includes microphone amplifiers 2L, 2C,
2R, the processing audio signals L,
C and R are input and supplied to the feature amount calculation unit 111.

【０１２１】特徴量算出部１１１は、入力部１１から供
給された処理用音声信号Ｌ、処理用音声信号Ｃ、および
処理用音声信号Ｒから、位相差ＣＬ（処理用音声信号Ｃ
の位相に対する処理用音声信号Ｌの位相の位相差）、お
よび位相差ＣＲ（処理用音声信号Ｃの位相に対する処理
用音声信号Ｒの位相の位相差）を算出する。The characteristic amount calculating section 111 converts the phase difference CL (processing audio signal C) from the processing audio signal L, the processing audio signal C, and the processing audio signal R supplied from the input section 11.
, And a phase difference CR (a phase difference between the phase of the processing audio signal R and the phase of the processing audio signal C).

【０１２２】特徴量算出部１１１はまた、処理用音声信
号Ｌの音圧Ｌと処理用音声信号Ｒの音圧Ｒとの音圧比Ｒ
Ｌ（＝音圧Ｒ／音圧Ｌ）を算出する。The feature quantity calculation unit 111 also calculates a sound pressure ratio R between the sound pressure L of the processing audio signal L and the sound pressure R of the processing audio signal R.
L (= sound pressure R / sound pressure L) is calculated.

【０１２３】特徴量算出部１１１は、それぞれ算出した
位相差ＣＬおよび位相差ＣＲ、並びに音圧比ＲＬを、検
出部１１２に出力する。The characteristic amount calculation section 111 outputs the calculated phase difference CL and phase difference CR, and the sound pressure ratio RL to the detection section 112.

【０１２４】検出部１１２は、特徴量算出部１１１から
の位相差ＣＬおよび位相差ＣＲ、並びに所定の係数（後
述）に基づいて、所定の範囲内で移動する口元（音源）
の位置を検出する。The detection unit 112 is configured to move within a predetermined range (sound source) based on the phase difference CL and the phase difference CR from the feature amount calculation unit 111 and a predetermined coefficient (described later).
Detect the position of.

【０１２５】検出部１１２はまた、検出した口元（音
源）の位置に対応する所定の係数（後述）（正確には、
検出された音源の位置に対応する基準位置に対応する係
数）、および特徴量算出部１１１からの音圧比ＲＬに基
づいて、音源の向きを検出する。The detecting unit 112 also determines a predetermined coefficient (described later) corresponding to the detected position of the mouth (sound source) (more precisely,
The direction of the sound source is detected based on the detected position of the sound source and the sound pressure ratio RL from the feature amount calculation unit 111.

【０１２６】検出部１１２は、検出した口元の位置およ
びその向きを、出力部１４に供給する。The detecting section 112 supplies the detected position of the mouth and its orientation to the output section 14.

【０１２７】図２９は、特徴量算出部１１１の構成例を
示している。FIG. 29 shows an example of the configuration of the feature amount calculation unit 111.

【０１２８】位相差算出部１２１は、位相差ＣＬ（処理
用音声信号Ｃの位相に対する処理用音声信号Ｌの位相の
位相差）を算出する。The phase difference calculator 121 calculates a phase difference CL (a phase difference between the phase of the processing audio signal C and the phase of the processing audio signal L).

【０１２９】具体的には、位相差算出部１２１は、処理
用音声信号Ｌ，Ｃを、所定のサンプリング周期Ｔでサン
プリングして、量子化する。これにより、処理用音声信
号Ｌからは、Ｎ個のサンプルＬ₁乃至Ｌ_Nが、処理用音声
信号Ｃからは、Ｎ個のサンプルＣ₁乃至Ｃ_Nがそれぞれ得
られる。More specifically, the phase difference calculator 121 samples and quantizes the processing audio signals L and C at a predetermined sampling period T. Thus, N samples L _{1 to} L _N are obtained from the processing audio signal L, and N samples C _{1 to} C _N are obtained from the processing audio signal C.

【０１３０】位相差算出部１２１は、次に、ｊが１乃至
ｎであるときの式（２）をそれぞれ算出し、その算出結
果の値が最大となるｊの値ｍを検出する。そして位相差
算出部１２１は、検出した値ｍにサンプリング周期Ｔを
乗算し、その乗算結果（時間）を、位相差ＣＬとして、
検出部１１２に出力する。Next, the phase difference calculation section 121 calculates the equation (2) when j is 1 to n, and detects the value m of j at which the value of the calculation result is the maximum. Then, the phase difference calculation unit 121 multiplies the detected value m by the sampling period T, and uses the multiplication result (time) as a phase difference CL.
Output to the detection unit 112.

【数２】 (Equation 2)

【０１３１】すなわち、処理用音声信号Ｌと処理用音声
信号Ｃとの相関値が、処理用音声信号Ｃに対して、処理
用音声信号Ｌが、ｎ個のサンプル分の時間まで、１サン
プリング時間ずつずらされて算出され、その算出結果に
おいて最も高い相関値が得られたときの時間（値ｍ×サ
ンプリング周期Ｔ）が、位相差ＣＬとされる。That is, the correlation value between the processing audio signal L and the processing audio signal C is such that the processing audio signal L is equal to the processing audio signal C by one sampling time up to the time of n samples. The time (value m × sampling period T) when the highest correlation value is obtained in the calculation result is defined as the phase difference CL.

【０１３２】位相差算出部１２２は、位相差ＣＲ（処理
用音声信号Ｃの位相に対する処理用音声信号Ｒの位相の
位相差）を算出する。The phase difference calculation section 122 calculates a phase difference CR (a phase difference between the phase of the processing audio signal C and the phase of the processing audio signal R).

【０１３３】具体的には、位相差算出部１２２は、処理
用音声信号Ｃ，Ｒを、所定のサンプリング周期Ｔでサン
プリングして、量子化する。これにより、処理用音声信
号Ｒからは、Ｎ個のサンプルＲ₁乃至Ｒ_Nが、処理用音声
信号Ｃからは、Ｎ個のサンプルＣ₁乃至Ｃ_Nがそれぞれ得
られる。More specifically, the phase difference calculator 122 samples and quantizes the processing audio signals C and R at a predetermined sampling period T. Thus, from the processing speech signals R, N samples R ₁ through R _N are from processing audio signals C, N samples C ₁ to C _N are obtained, respectively.

【０１３４】次に、位相差算出部１２２は、ｊが１乃至
ｎであるときの式（３）をそれぞれ算出し、その算出結
果の値が最大となるｊの値ｍを検出する。そして位相差
算出部１２２は、検出した値ｍにサンプリング周期Ｔを
乗算し、その乗算結果（時間）を、位相差ＣＲとして、
検出部１１２に出力する。Next, the phase difference calculation unit 122 calculates the equation (3) when j is 1 to n, and detects the value m of j that maximizes the calculation result. Then, the phase difference calculation unit 122 multiplies the detected value m by the sampling period T, and uses the multiplication result (time) as a phase difference CR.
Output to the detection unit 112.

【数３】 [Equation 3]

【０１３５】すなわち、処理用音声信号Ｒと処理用音声
信号Ｃとの相関値が、処理用音声信号Ｃに対して、処理
用音声信号Ｒが、ｎ個のサンプル分の時間まで、１サン
プリング時間ずつずらされて算出され、その算出結果に
おいて最も高い相関値が得られたときの時間（値ｍ×サ
ンプリング周期Ｔ）が、位相差ＣＲとされる。That is, the correlation value between the processing audio signal R and the processing audio signal C is such that the processing audio signal R is different from the processing audio signal C by one sampling time up to the time of n samples. The time (value m × sampling period T) when the highest correlation value is obtained in the calculation result is defined as the phase difference CR.

【０１３６】音圧算出部１２３は、入力された処理用音
声信号Ｌが示す値の２乗平均値、２乗平均の平方根、ま
たは絶対値の平均値を、処理用音声信号Ｌの音圧Ｌとし
て算出し、その算出結果を、音圧比算出部１２５に供給
する。音圧算出部１２４は、音圧算出部１２３と同様の
方法で、処理用音声信号Ｒの音圧Ｒを算出し、音圧比算
出部１２５に供給する。The sound pressure calculating section 123 calculates the mean square value, the square root of the square mean, or the average value of the absolute values of the values indicated by the input processing audio signal L, and calculates the sound pressure L of the processing audio signal L. And the calculation result is supplied to the sound pressure ratio calculation unit 125. The sound pressure calculator 124 calculates the sound pressure R of the processing audio signal R in the same manner as the sound pressure calculator 123 and supplies the sound pressure R to the sound pressure ratio calculator 125.

【０１３７】音圧比算出部１２５は、音圧算出部１２３
からの音圧Ｌと、音圧算出部１２４からの音圧Ｒに基づ
いて、音圧比ＲＬ（＝音圧Ｒ／音圧Ｌ）を算出し、その
算出結果を、検出部１１２に供給する。The sound pressure ratio calculating section 125 includes a sound pressure calculating section 123
The sound pressure ratio RL (= sound pressure R / sound pressure L) is calculated based on the sound pressure L from the sound pressure calculation unit 124 and the sound pressure R from the sound pressure calculation unit 124, and the calculation result is supplied to the detection unit 112.

【０１３８】図３０は、検出部１１２の構成例を示して
いる。FIG. 30 shows a configuration example of the detection unit 112.

【０１３９】位置算出部１３１は、特徴量算出部１１１
（位相差算出部１２１，１２２）からの位相差ＣＬおよ
び位相差ＣＲ、並びに位置係数記憶部１３２に記憶され
ている、係数ａ_L、係数ｂ_L、および係数ｃ_Lからなるxお
よびｙの一次方程式（式（４））、並びに係数ａ_R、係
数ｂ_R、および係数ｃ_Rからなるxおよびｙの一次方程式
（式（５））を解き、座標（ｘ，ｙ）（図２７）を算出
する。ａ_L×ｘ＋ｂ_L×ｙ＋ｃ_L＝位相差CL…（４）ａ_R×ｘ＋ｂ_R×ｙ＋ｃ_R＝位相差CR…（５）The position calculating section 131 is provided with a feature amount calculating section 111.
(Phase difference CL and phase difference CR from (Phase difference calculation units 121 and 122), and primary of x and y consisting of coefficient a _L , coefficient b _L , and coefficient c _L stored in the position coefficient storage unit 132 Solving the equation (Equation (4)) and the linear equation (Equation (5)) of x and y composed of coefficient a _R , coefficient b _R , and coefficient c _R to calculate coordinates (x, y) (FIG. 27) I do. a _L × x + b _L × y + c _L = phase difference CL (4) a _R × x + b _R × y + c _R = phase difference CR (5)

【０１４０】位置算出部１３１は、算出した位置（座標
（ｘ，ｙ））を、向き係数決定部１３３および出力部１
４に供給する。The position calculation section 131 converts the calculated position (coordinates (x, y)) into the direction coefficient determination section 133 and the output section 1.
4

【０１４１】ところで、マイク１Ｃと口元（音源）の距
離Ｃと、マイク１Ｌと音源の距離Ｌ、およびマイク１Ｒ
と音源の距離Ｒの関係は、式（６）に示すように表すこ
とができる。距離Ｌ−距離Ｃ＝Δ_L 距離Ｒ−距離Ｃ＝Δ_R…（６）By the way, the distance C between the microphone 1C and the mouth (sound source), the distance L between the microphone 1L and the sound source, and the distance between the microphone 1R and the microphone 1R
And the distance R between the sound source and the sound source can be expressed as shown in Expression (6). Distance L- distance C = delta _L distance R- distance C = Δ _R ... (6)

【０１４２】Δ_LおよびΔ_Rは、図３１に示すように、Ｘ
軸、Ｙ軸、および距離の軸からなる座標空間において、
なだらかな曲面で表される。すなわち、音源が、原点か
ら比較的近い範囲内で移動する場合、Δ_LおよびΔ_Rを、
ほぼ平面に近似することができるので、Δ_LおよびΔ
_Rを、式（７）に示すように、ｘおよびｙの一次方程式
で表すことができる。ａ'_L×ｘ＋ｂ'_L×ｙ＋ｃ'_L＝Δ_L ａ'_R×ｘ＋ｂ'_R×ｙ＋ｃ'_R＝Δ_R…（７）[0142] delta _L and delta _R, as shown in FIG. 31, X
In a coordinate space consisting of an axis, a Y axis, and a distance axis,
It is represented by a gentle curved surface. That is, the sound source is, when moving within relatively close range from the origin, the delta _L and delta _R,
It is possible to approximate the substantially planar, delta _L and delta
_R can be represented by a linear equation of x and y, as shown in equation (7). _{a 'L × x + b'} L × y + c 'L = Δ L a' R × x + b 'R × y + c' R = Δ R ... (7)

【０１４３】一方、音声は、上述したように、音源から
の距離が長ければ長いほど伝達に時間がかかるので、例
えば、ユーザが、図３２に示すように、正面方向に対し
て左側を向いているとき、距離Ｃは、距離Ｌより長くな
るので、マイク１Ｃには、マイク１Ｌに比べ音源からの
音声が遅れて到達する。On the other hand, as described above, the longer the distance from the sound source is, the longer it takes to transmit the voice. Therefore, for example, as shown in FIG. Since the distance C is longer than the distance L, the sound from the sound source reaches the microphone 1C later than the microphone 1L.

【０１４４】また、遅れて到達した音声の位相は、先に
到達した音声の位相に比べ遅れるので、すなわち、先に
到達した音声の位相は、後に到達した音声の位相に比べ
進むので、例えば、図３２の例の場合、処理用音声信号
Ｌの位相は、図３３（Ａ）に示すように、図３３（Ｂ）
に示す処理用音声信号Ｃの位相より進む。このときの位
相のずれが、位相差ＣＬである。Also, since the phase of a voice arriving late is later than the phase of a voice arriving earlier, that is, the phase of a voice arriving earlier advances compared to the phase of a voice arriving later, for example, In the case of the example of FIG. 32, the phase of the processing audio signal L is, as shown in FIG.
The phase advances from the phase of the processing audio signal C shown in FIG. The phase shift at this time is the phase difference CL.

【０１４５】一方、距離Ｃが距離Ｒより長い場合、音源
からの音声は、マイク１Ｃよりマイク１Ｒに先に到達す
るので、処理用音声信号Ｒの位相は、図３３（Ｃ）に示
すように、処理用音声信号Ｃの位相より進む。このとき
の位相のずれが、位相差ＣＲである。On the other hand, when the distance C is longer than the distance R, the sound from the sound source reaches the microphone 1R earlier than the microphone 1C, so that the phase of the processing sound signal R is as shown in FIG. , From the phase of the processing audio signal C. The phase shift at this time is the phase difference CR.

【０１４６】このように、位相差ＣＬおよび位相差ＣＲ
は、上述したように、距離Ｌ、距離Ｃ、および距離Ｒに
対応する。すなわち、位相差ＣＬおよび位相差ＣＲは、
音源が原点から比較的近い範囲内で移動し、Δ_Lおよび
Δ_Rを、平面に近似することができる場合、Δ_LおよびΔ
_Rに対応するので、位相差ＣＬおよび位相差ＣＲは、上
述した式（４）および式（５）に示すように、ｘおよび
ｙの一次方程式で表すことができる。As described above, the phase difference CL and the phase difference CR
Corresponds to the distance L, the distance C, and the distance R as described above. That is, the phase difference CL and the phase difference CR are
If the sound source moves within relatively close range from the origin, the delta _L and delta _R, can be approximated to a plane, delta _L and delta
Since the phase difference CL corresponds to _R , the phase difference CL and the phase difference CR can be expressed by linear equations of x and y as shown in the above-described equations (4) and (5).

【０１４７】このシステムでは、所定の範囲内で移動す
る音源の位置および向きが検出されるが、所定の範囲と
は、Δ_LおよびΔＲを、平面に近似することができる範
囲内、すなわち、式（７）が成り立つ範囲を意味する。[0147] range in this system, the position and orientation of a sound source which moves within a predetermined range is detected, the predetermined range, the delta _L and [Delta] R, can be approximated to a plane, i.e., the formula It means the range where (7) holds.

【０１４８】位置係数記憶部１３２に記憶されている係
数ａ_L、係数ｂ_L、係数ｃ_L、係数ａ_R、係数ｂ_R、および
係数ｃ_Rは、次に説明するように、それら係数に関する
連立一次方程式を、最小自乗法を用いて解くことで求め
られる。The coefficient a _L , coefficient b _L , coefficient c _L , coefficient a _R , coefficient b _R , and coefficient c _R stored in the position coefficient storage unit 132 are, as described below, It is obtained by solving a linear equation using the least squares method.

【０１４９】口元（音源）を、所定の複数の位置（座標
（ｘ_k，ｙ_k）（ｋ＝１，２，…Ｋ）で特定されるＫ箇所
の位置）に移動させ、各位置にあるときの音源から発せ
られた音声を集音し、位相差ＣＬ_kおよび位相差ＣＲ_kを
算出する実験を行うことで、式（８）および式（９）に
示すように、それぞれＫ個の、ｘおよびｙの一次方程式
が求められる。The mouth (sound source) is moved to a plurality of predetermined positions (K positions specified by coordinates (x _k , y _k ) (k = 1, 2,..., K)). The sound emitted from the sound source at the time is collected, and an experiment for calculating the phase difference CL _k and the phase difference CR _k is performed. As shown in Expressions (8) and (9), K values of A linear equation of x and y is determined.

【数４】 (Equation 4)

【数５】 (Equation 5)

【０１５０】式（８）は、式（１０）そしてさらに式
（１１）に示すように展開することで解くことができ
る。これにより、係数ａ_L、係数ｂ_L、および係数ｃ_Lが
求められる。式（９）も、式（１２）そしてさらに式
（１３）に示すように展開することで解くことができ
る。これにより、係数ａ_R、係数ｂ_R、および係数ｃ_Rが
求められる。Equation (8) can be solved by expanding as shown in equation (10) and further as equation (11). Thereby, the coefficient a _L , the coefficient b _L , and the coefficient c _L are obtained. Equation (9) can also be solved by expanding as shown in equation (12) and further equation (13). Thus, a coefficient a _R , a coefficient b _R , and a coefficient c _R are obtained.

【数６】 (Equation 6)

【数７】 (Equation 7)

【数８】 (Equation 8)

【数９】 (Equation 9)

【０１５１】なお、式（１１）および式（１３）中、”
＋”が上付された行列は、擬似逆行列である。In the formulas (11) and (13), "
The matrix with the superscript "+" is a pseudo inverse matrix.

【０１５２】図３０に戻り、向き係数決定部１３３は、
位置算出部１３１から供給された座標（ｘ，ｙ）に対応
する基準位置（後述）を検出する。Referring back to FIG. 30, the orientation coefficient determining unit 133
A reference position (described later) corresponding to the coordinates (x, y) supplied from the position calculation unit 131 is detected.

【０１５３】向き係数決定部１３３はまた、Ｐ個の基準
位置ｐ（ｐ＝１，２，…Ｐ）のそれぞれに対応する係数
α_p、係数β_p、係数γ_p、および係数δ_pを記憶してお
り、記憶しているそれらの係数から、検出した基準位置
に対応する、係数α、係数β、係数γ、および係数δを
選択し、向き算出部１３４に供給する。The direction coefficient determination unit 133 also stores a coefficient α _p , a coefficient β _p , a coefficient γ _p , and a coefficient δ _p corresponding to each of the P reference positions p (p = 1, 2,... P). A coefficient α, a coefficient β, a coefficient γ, and a coefficient δ corresponding to the detected reference position are selected from the stored coefficients, and are supplied to the direction calculation unit 134.

【０１５４】向き算出部１３４は、向き係数決定部１３
３からの係数α、係数β_、係数γ、および係数δ、並び
に特徴量算出部１１１からの音圧比ＲＬに基づいて、式
（１４）に示すθの３次方程式を解いてθを算出し、出
力部１４に供給する。The direction calculating section 134 is provided for the direction coefficient determining section 13.
Based on the coefficient α, the coefficient β _{, the} coefficient γ, and the coefficient δ from 3 and the sound pressure ratio RL from the feature amount calculation unit 111, a cubic equation of θ shown in Expression (14) is solved to calculate θ. It is supplied to the output unit 14.

【数１０】 (Equation 10)

【０１５５】ところで、音圧比ＲＬは、音源の位置およ
び向きにより異なる。例えば、図３４に示すように、音
源が、座標（ｘ_p，ｙ_p）（ｐ＝１，２，…９）で特定さ
れる基準位置ｐにあるときに発せられた音声による音圧
比ＲＬは、それぞれ異なり、また向きによっても異な
る。By the way, the sound pressure ratio RL differs depending on the position and the direction of the sound source. For example, as shown in FIG. 34, the sound pressure ratio RL of the sound emitted when the sound source is at the reference position p specified by the coordinates (x _p , y _p ) (p = 1, 2,... 9) is , Respectively, and also depending on the orientation.

【０１５６】図３５（Ａ）は、基準位置ｐ（＝４）に位
置し、角度θ_L乃至角度θ_Rの方向を向いているときの音
源から発生された音声による音圧比ＲＬ₄を表してい
る。図３５（Ｂ）は、基準位置ｐ（＝５）に位置し、角
度θ_L乃至角度θ_Rの方向を向いているときの音源から発
生された音声による音圧比ＲＬ₅を表している。図３５
（Ｃ）は、基準位置ｐ（＝６）に位置し、角度θ_L乃至
角度θ_Rの方向を向いているときの音源から発生された
音声による音圧比ＲＬ₆を表している。[0156] Figure 35 (A) is located at the reference position p (= 4), representative of the sound pressure ratio RL ₄ voice generated from the sound source when facing the direction of the angle theta _L to the angle theta _R I have. Fig. 35 (B) located at the reference position p (= 5), represents the sound pressure ratio RL ₅ voice generated from the sound source when facing the direction of the angle theta _L to angle theta _R. FIG.
(C) is located at the reference position p (= 6), it represents the sound pressure ratio RL ₆ voice generated from the sound source when facing the direction of the angle theta _L to angle theta _R.

【０１５７】図３５（Ａ）乃至（Ｃ）に示す軌跡から、
音圧比ＲＬは、式（１４）に示すように、角度θの３次
式で近似される。From the trajectories shown in FIGS. 35A to 35C,
The sound pressure ratio RL is approximated by a cubic expression of the angle θ as shown in Expression (14).

【０１５８】向き係数決定部１３３に記憶されている、
Ｐ個の基準位置ｐのそれぞれに対応する係数α_p、係数
β_p、係数γ_p、および係数δ_pは、次に説明するように
して求められる。The direction coefficient determination unit 133 stores
The coefficient α _p , coefficient β _p , coefficient γ _p , and coefficient δ _p corresponding to each of the P reference positions p are obtained as described below.

【０１５９】音源の位置および向きにより音圧比ＲＬが
異なるので、音源が基準位置ｐに位置し、所定の方向
（角度θ_q（ｑ＝１，２，…Ｑ）の方向）を向いている
ときの音源から発せられる音声の音圧比ＲＬ_pqを求める
ことで、基準位置１乃至基準位置Ｐ毎に、Ｑ個の、θの
３次元方程式が求められる。Since the sound pressure ratio RL differs depending on the position and the direction of the sound source, the sound source is located at the reference position p and is oriented in a predetermined direction (the direction of the angle θ _q (q = 1, 2,... Q)). By _calculating the sound pressure ratio RL _pq of the sound emitted from the sound source, Q three-dimensional equations of θ are obtained for each of the reference positions 1 to P.

【０１６０】例えば、基準位置１に位置する音源が、そ
れぞれ角度θ₁乃至θ_Qの方向を向いているときに発した
音声から、音圧比ＲＬ₁₁乃至ＲＬ_1Qを求めることで、式
（１５）に示すような、Ｑ個の、θの３次元方程式が求
められる。[0160] For example, a sound source located at the reference position 1, the sound produced by the time that each oriented at an angle theta ₁ to theta _Q, by obtaining the sound pressure ratio RL ₁₁ to RL _1Q, formula (15) The Q three-dimensional equations of θ as shown in FIG.

【数１１】 [Equation 11]

【０１６１】同様にして、基準位置２乃至基準位置Ｐに
ついても、それぞれＱ個の、θの３次元方程式が得ら
れ、これらを、ニュートン法やシンプレックス法などに
よる非線形最小自乗法により解くことで、基準位置１乃
至基準位置Ｐ毎に、係数α、係数β、係数γ、および係
数δが求められる。Similarly, for each of the reference positions 2 to P, Q three-dimensional equations of θ are obtained, and these are solved by the nonlinear least squares method such as the Newton method or the simplex method. A coefficient α, a coefficient β, a coefficient γ, and a coefficient δ are obtained for each of the reference positions 1 to P.

【０１６２】次に、検出装置１０１の動作を、図３６の
フローチャートを参照して説明する。Next, the operation of the detection apparatus 101 will be described with reference to the flowchart in FIG.

【０１６３】ステップＳ６１において、検出装置１０１
の入力部１１は、入力したマイクアンプ２Ｌ，２Ｃ，２
Ｒのそれぞれからの処理用音声信号Ｌ，Ｃ，Ｒの一部
（所定の時間内に入力した部分）を、特徴量算出部１１
１に供給する。In step S61, the detecting device 101
Input unit 11 receives the input microphone amplifiers 2L, 2C, 2
Each of the processing audio signals L, C, and R from each of the R (portion input within a predetermined time) is converted into a feature amount calculation unit 11.
Supply 1

【０１６４】次に、ステップＳ６２において、特徴量算
出部１１１（位相差算出部１２１，１２２）は、入力部
１１から供給された処理用音声信号Ｃ，Ｌから、位相差
ＣＬを算出するとともに、処理用音声信号Ｃ，Ｒから、
位相差ＣＲを算出する。算出された位相差ＣＬおよび位
相差ＣＲは、検出部１１２に供給される。Next, in step S62, the feature amount calculation unit 111 (phase difference calculation units 121 and 122) calculates the phase difference CL from the processing audio signals C and L supplied from the input unit 11, and From the processing audio signals C and R,
The phase difference CR is calculated. The calculated phase difference CL and phase difference CR are supplied to the detection unit 112.

【０１６５】ステップＳ６３において、特徴量算出部１
１１（音圧算出部１２３，１２４、音圧比算出部１２
５）は、処理用音声信号Ｌ，Ｒの音圧Ｌ，Ｒをそれぞれ
算出するとともに、音圧比ＲＬ（＝音圧Ｒ／音圧Ｌ）を
算出する。算出された音圧比ＲＬは、検出部１１２に供
給される。In step S63, the characteristic amount calculation unit 1
11 (sound pressure calculation units 123 and 124, sound pressure ratio calculation unit 12
5) calculates the sound pressures L and R of the processing audio signals L and R, and calculates the sound pressure ratio RL (= sound pressure R / sound pressure L). The calculated sound pressure ratio RL is supplied to the detection unit 112.

【０１６６】次に、ステップＳ６４において、検出部１
１２（位置算出部１３１）は、特徴量算出部１１１から
供給された位相差ＣＬ、並びに自分自身（位置係数記憶
部１３２）が記憶する係数ａ_L、係数ｂ_L、および係数ｃ
_Lからなる式（４）と、位相差ＣＲ、並びに係数ａ_R、係
数ｂ_R、および係数ｃ_Rからなる式（５）の連立方程式を
解き、ｘおよびｙを算出する。算出された座標（ｘ，
ｙ）は、出力部１４に供給される。Next, in step S64, the detecting unit 1
12 (the position calculating unit 131), the phase difference CL supplied from the feature amount calculating unit 111, and the coefficients a _L , b _L , and c stored by itself (the position coefficient storing unit 132).
X and y are calculated by solving a simultaneous equation of equation (4) consisting of _L and equation (5) consisting of phase difference CR and coefficients a _R , b _R , and c _R. The calculated coordinates (x,
y) is supplied to the output unit 14.

【０１６７】ステップＳ６５において、検出部１１２
（向き係数決定部１３３）は、算出した座標（ｘ，ｙ）
で特定される位置に最も近い基準位置を検出する。In step S65, the detecting unit 112
The (direction coefficient determination unit 133) calculates the calculated coordinates (x, y)
The reference position closest to the position specified by is detected.

【０１６８】次に、ステップＳ６６において、検出部１
１２（向き係数決定部１３３）は、記憶している、Ｐ個
の基準位置ｐのそれぞれに対応する係数α_p、係数β_p、
係数γ_p、および係数δ_pから、決定した基準位置に対応
する、係数α、係数β、係数γ、および係数δを選択す
る。Next, in step S66, the detecting unit 1
12 (orientation coefficient determination unit 133) stores the stored coefficient α _p , coefficient β _p ,
A coefficient α, a coefficient β, a coefficient γ, and a coefficient δ corresponding to the determined reference position are selected from the coefficient γ _p and the coefficient δ _p .

【０１６９】ステップＳ６７において、検出部１１２
（向き算出部１３４）は、特徴量算出部１１１からの音
圧比ＲＬ、並びにステップＳ６６で選択した係数α、係
数β、係数γ、および係数δからなる式（１４）を解
き、角度θを算出する。算出された角度θは、出力部１
４に供給される。At step S67, the detecting unit 112
The (direction calculation unit 134) calculates the angle θ by solving the expression (14) including the sound pressure ratio RL from the feature amount calculation unit 111 and the coefficients α, β, γ, and δ selected in step S66. I do. The calculated angle θ is output unit 1
4 is supplied.

【０１７０】次に、ステップＳ６８において、出力部１
４は、検出部１１２から供給された座標（ｘ，ｙ）を、
口元（音源）の位置として、そして角度θを、音源の向
きとして外部の装置に出力する。Next, in step S68, the output unit 1
4 represents the coordinates (x, y) supplied from the detection unit 112,
The position of the mouth (sound source) and the angle θ are output to an external device as the direction of the sound source.

【０１７１】その後、処理は終了する。Thereafter, the processing ends.

【０１７２】なお、以上においては、検出部１１２にお
いて、算出された座標（ｘ，ｙ）に対応する基準位置が
検出されるとともに、記憶されている、それぞれＰ個の
係数α_p、係数β_p、係数γ_p、および係数δ_pのうちか
ら、検出された基準位置に対応する係数α、係数β、係
数γ、および係数δが選択されたが、検出部１１２が、
式（１６）乃至式（１９）に示すように、係数αが算出
されるｘ，ｙの関数ｅ、係数βが算出されるｘ，ｙの関
数ｆ、係数γが算出されるｘ，ｙの関数ｇ、および係数
δが算出されるｘ，ｙの関数ｈを記憶し、検出された座
標（ｘ，ｙ）でそれらの関数を演算することで、係数
α、係数β、係数γ、および係数δのそれぞれを算出す
ることもできる。 α＝ｅ（ｘ，ｙ）…（１６） β＝ｆ（ｘ，ｙ）…（１７） γ＝ｇ（ｘ，ｙ）…（１８） δ＝ｈ（ｘ，ｙ）…（１９）In the above description, the detection unit 112 detects the reference position corresponding to the calculated coordinates (x, y), and stores and stores the P coefficients α _p and β _p , respectively. , Coefficient γ _p , and coefficient δ _p , the coefficient α, coefficient β, coefficient γ, and coefficient δ corresponding to the detected reference position are selected.
As shown in Expressions (16) to (19), a function e of x and y for calculating the coefficient α, a function f of x and y for calculating the coefficient β, and a function f of x and y for calculating the coefficient γ The function g and the function h of x and y for which the coefficient δ is calculated are stored, and those functions are calculated at the detected coordinates (x, y) to obtain the coefficient α, the coefficient β, the coefficient γ, and the coefficient Each of δ can also be calculated. α = e (x, y) (16) β = f (x, y) (17) γ = g (x, y) (18) δ = h (x, y) (19)

【０１７３】この場合の検出装置１０１の動作を、図３
７のフローチャートを参照して説明する。The operation of the detection device 101 in this case is described in FIG.
This will be described with reference to the flowchart of FIG.

【０１７４】ステップＳ７１乃至ステップＳ７４、およ
びステップＳ７７においては、図３６のステップＳ６１
乃至ステップＳ６４、およびステップＳ６８における場
合と同様の処理が行われるので、その説明は省略する。In steps S71 to S74 and step S77, step S61 in FIG.
Since the same processing as in steps S64 and S68 is performed, the description thereof is omitted.

【０１７５】ステップＳ７５において、検出装置１０１
の検出部１１２は、関数ｅ（式（１６））、関数ｆ（式
（１７））、関数ｇ（式（１８））、および関数ｈ（式
（１９））をそれぞれ演算し、係数α、係数β、係数
γ、および係数δを算出する。In step S75, the detecting device 101
, The function e (equation (16)), the function f (equation (17)), the function g (equation (18)), and the function h (equation (19)) are calculated, and the coefficient α, A coefficient β, a coefficient γ, and a coefficient δ are calculated.

【０１７６】次に、ステップＳ７６において、検出部１
１２は、特徴量算出部１１１からの音圧比ＲＬ、並びに
ステップＳ７５で算出した係数α、係数β、係数γ、お
よび係数δで、式（１４）を演算し、角度θを算出す
る。Next, in step S76, the detecting unit 1
12 calculates the angle θ by calculating the equation (14) using the sound pressure ratio RL from the feature amount calculation unit 111 and the coefficients α, β, γ, and δ calculated in step S75.

【０１７７】また、以上においては、特徴量算出部１１
１は、処理用音声信号Ｌ，Ｒから直接音圧Ｌ，Ｒを算出
し、そして音圧比ＲＬを算出する場合を例として説明し
たが、処理用音声信号Ｌ，Ｒの高周波部分の信号（高周
波処理用音声信号Ｌh，Ｒh）の音圧Ｌh，Ｒhを算出し、
それらの音圧比ＲＬhを算出することができる。Further, in the above description, the feature amount calculating unit 11
1 has been described as an example in which the sound pressures L and R are calculated directly from the processing audio signals L and R, and then the sound pressure ratio RL is calculated. The sound pressures Lh and Rh of the processing audio signals Lh and Rh) are calculated,
The sound pressure ratio RLh can be calculated.

【０１７８】この例の場合における特徴量算出部１１１
の構成例を、図３８に示す。この特徴量算出部には、図
２９の特徴量算出部１１１の音圧算出部１２３および音
圧算出部１２４の前段に、ハイパスフィルタ１４１およ
びハイパスフィルタ１４２が設けられている。In this example, the characteristic amount calculation unit 111 is used.
38 is shown in FIG. In the feature amount calculation unit, a high-pass filter 141 and a high-pass filter 142 are provided at a stage preceding the sound pressure calculation unit 123 and the sound pressure calculation unit 124 of the feature amount calculation unit 111 in FIG.

【０１７９】ハイパスフィルタ１４１は、入力部１１か
ら供給された処理用音声信号Ｌの高周波部分のみを通過
させて、音圧算出部１２３に供給する。ハイパスフィル
タ１４２は、入力部１１から供給された処理用音声信号
Ｒの高周波部分のみを通過させて、音圧算出部１２４に
供給する。The high-pass filter 141 passes only the high-frequency portion of the processing audio signal L supplied from the input unit 11 and supplies the processed audio signal L to the sound pressure calculation unit 123. The high-pass filter 142 passes only the high-frequency portion of the processing audio signal R supplied from the input unit 11 and supplies the processed audio signal R to the sound pressure calculation unit 124.

【０１８０】音圧算出部１２３は、高周波処理用音声信
号Ｌhの音圧Ｌhを算出し、音圧比算出部１２５に供給す
る。音圧算出部１２４は、高周波処理用音声信号Ｒhの
音圧Ｒhを算出し、音圧比算出部１２５に供給する。The sound pressure calculation section 123 calculates the sound pressure Lh of the high-frequency processing audio signal Lh and supplies it to the sound pressure ratio calculation section 125. The sound pressure calculator 124 calculates the sound pressure Rh of the high-frequency processing audio signal Rh, and supplies the calculated sound pressure Rh to the sound pressure ratio calculator 125.

【０１８１】音圧比算出部１２５は、音圧比ＲＬhを算
出し、検出部１１２に供給する。The sound pressure ratio calculator 125 calculates the sound pressure ratio RLh and supplies it to the detector 112.

【０１８２】図３９は、本発明を適用した音源状態検出
システムの第５の実施の形態の構成例を示している。こ
のシステムでは、第４の実施の形態における場合と同様
に、所定の範囲内で移動する音源の位置およびその向き
の両方が検出されるが、この例の場合、３次元空間上の
位置および向きが検出される。FIG. 39 shows a configuration example of a sound source state detection system according to a fifth embodiment of the present invention. In this system, as in the fourth embodiment, both the position and the direction of the sound source moving within the predetermined range are detected. In this example, the position and the direction in the three-dimensional space are detected. Is detected.

【０１８３】このシステムには、図１に示す検出装置３
に代えて、検出装置２０１が設けられている他、４個の
マイク１―１乃至マイク１−４、および４個のマイクア
ンプ２−１乃至２−４が設けられている。This system includes the detecting device 3 shown in FIG.
, A detection device 201 is provided, and in addition, four microphones 1-1 to 1-4 and four microphone amplifiers 2-1 to 2-4 are provided.

【０１８４】マイク１−１乃至マイク１−４は、図４０
に示すような、それぞれ直交するＸ軸、Ｙ軸、およびＺ
軸からなる３次元空間上に一定の広がりを持って配置さ
れている。なお、図４０中の原点、Ｘ軸、およびＹ軸
は、図２７の原点、Ｘ軸、およびＹ軸に対応するもので
ある。すなわち、図４０に示すＺ軸は、図２７において
は、ユーザから見れば、原点から垂直方向に延びる軸で
ある。Microphones 1-1 to 1-4 are shown in FIG.
X-axis, Y-axis and Z-axis, respectively, as shown in
They are arranged with a certain extent in a three-dimensional space consisting of axes. Note that the origin, X axis, and Y axis in FIG. 40 correspond to the origin, X axis, and Y axis in FIG. That is, the Z axis shown in FIG. 40 is an axis extending in the vertical direction from the origin as viewed from the user in FIG.

【０１８５】図３９に戻り、マイク１−１乃至マイク１
−４は、音源からの音声（ユーザが発した音声）を集音
し、音声信号として、マイクアンプ２−１乃至２−４に
供給する。Returning to FIG. 39, the microphones 1-1 to 1
-4 collects the sound from the sound source (the sound uttered by the user) and supplies it to the microphone amplifiers 2-1 to 2-4 as a sound signal.

【０１８６】マイクアンプ２−１乃至２−４は、マイク
１−１乃至１−４からの音声信号を増幅し、処理用音声
信号Ｓ１乃至Ｓ４として、検出装置２０１に供給する。The microphone amplifiers 2-1 to 2-4 amplify the audio signals from the microphones 1-1 to 1-4 and supply the amplified audio signals to the detection device 201 as processing audio signals S1 to S4.

【０１８７】検出装置２０１は、音源（口元）の位置に
対応する、図４０の３次元空間における座標（ｘ，ｙ，
ｚ）を検出するとともに、口元の向きに対応する、Ｚ軸
の回りの角度θ、Ｙ軸の回りの角度φ、およびＸ軸の回
りの角度ψのそれぞれを検出する。The detection device 201 calculates the coordinates (x, y, x, y, y) in the three-dimensional space in FIG. 40 corresponding to the position of the sound source (mouth).
z), and an angle θ around the Z axis, an angle φ around the Y axis, and an angle ψ around the X axis corresponding to the orientation of the mouth are detected.

【０１８８】図４１は、検出装置２０１の構成例を示し
ている。この検出装置２０１には、図４の検出装置３の
特徴量算出部１２に代えて特徴量算出部２１１が、また
検出部１３に代えて、検出部２１２が設けられている。FIG. 41 shows a configuration example of the detection device 201. The detection device 201 includes a feature value calculation unit 211 instead of the feature value calculation unit 12 of the detection device 3 in FIG. 4, and a detection unit 212 instead of the detection unit 13.

【０１８９】入力部１１は、マイクアンプ２−１乃至２
−４のそれぞれから適宜供給される処理用音声信号Ｓ１
乃至Ｓ４を入力し、特徴量算出部２１１に供給する。The input section 11 is provided with microphone amplifiers 2-1 to 2
-4, the processing audio signal S1 appropriately supplied from
To S4, and supplies them to the feature amount calculation unit 211.

【０１９０】特徴量算出部２１１は、処理用音声信号Ｓ
１の位相と処理用音声信号Ｓ２の位相との差（以下、位
相差Ａ１と称する）、処理用音声信号Ｓ２の位相と処理
用音声信号Ｓ３の位相との差（以下、位相差Ａ２と称す
る）、および処理用音声信号Ｓ３の位相と処理用音声信
号Ｓ４の位相との差（以下、位相差Ａ３と称する）を、
それぞれ算出する。The feature amount calculating section 211 outputs the processing audio signal S
1 and the phase of the processing audio signal S2 (hereinafter referred to as a phase difference A1), and the difference between the phase of the processing audio signal S2 and the phase of the processing audio signal S3 (hereinafter referred to as a phase difference A2). ) And the difference between the phase of the processing audio signal S3 and the phase of the processing audio signal S4 (hereinafter referred to as phase difference A3)
Calculate each.

【０１９１】特徴量算出部２１１はまた、処理用音声信
号Ｓ１乃至Ｓ４の音圧をそれぞれ算出するとともに、処
理用音声信号Ｓ１の音圧と処理用音声信号Ｓ２の音圧の
比（以下、音圧比Ｂ１と称する）、処理用音声信号Ｓ２
の音圧と処理用音声信号Ｓ３の音圧の比（以下、音圧比
Ｂ２と称する）、処理用音声信号Ｓ３の音圧と処理用音
声信号Ｓ４の音圧の比（以下、音圧比Ｂ３と称する）
を、それぞれ算出する。The feature amount calculation unit 211 calculates the sound pressures of the processing audio signals S1 to S4, and also calculates the ratio of the sound pressure of the processing audio signal S1 to the sound pressure of the processing audio signal S2 (hereinafter referred to as the sound pressure). Pressure ratio B1), processing audio signal S2
, And the ratio of the sound pressure of the processing audio signal S3 (hereinafter referred to as the sound pressure ratio B2), and the ratio of the sound pressure of the processing audio signal S3 to the sound pressure of the processing audio signal S4 (hereinafter, referred to as the sound pressure ratio B3). Name)
Are calculated respectively.

【０１９２】特徴量算出部２１１は、算出した位相差Ａ
１乃至位相差Ａ３、および音圧比Ｂ１乃至音圧比Ｂ３
を、検出部２１２に出力する。The feature amount calculating section 211 calculates the calculated phase difference A
1 to phase difference A3, and sound pressure ratio B1 to sound pressure ratio B3
Is output to the detection unit 212.

【０１９３】検出部２１２は、特徴量算出部２１１から
の位相差Ａ１乃至位相差Ａ３、および所定の係数（後
述）に基づいて、口元（音源）の位置を表す座標（ｘ，
ｙ，ｚ）を算出する。Based on the phase differences A1 to A3 from the feature amount calculation unit 211 and a predetermined coefficient (described later), the detection unit 212 determines the coordinates (x,
y, z) is calculated.

【０１９４】検出部２１２はまた、予め求められた、所
定の位置、所定の向き、並びにその位置およびその向き
の音源から発せられた音声の音圧比Ｂ１乃至Ｂ３の対応
関係を表すデータから、算出した音圧比Ｂ１乃至Ｂ３お
よび算出した音源の位置（座標）に対応する向きを、こ
のときの音源の向きとして検出する。The detecting unit 212 also calculates a predetermined position, a predetermined direction, and a corresponding relationship between the sound pressure ratios B1 to B3 of the sound emitted from the sound source of the position and the direction, which are obtained in advance. The direction corresponding to the determined sound pressure ratios B1 to B3 and the calculated position (coordinates) of the sound source is detected as the direction of the sound source at this time.

【０１９５】図４２は、特徴量算出部２１１の構成例を
示している。FIG. 42 shows an example of the configuration of the feature amount calculation unit 211.

【０１９６】位相差算出部２２１は、位相差Ａ１を算出
し、検出部２１２に供給する。位相差算出部２２２は、
位相差Ａ２を算出し、検出部２１２に供給する。位相差
算出部２２３は、位相差Ａ３を算出し、検出部２１２に
供給する。The phase difference calculator 221 calculates the phase difference A1 and supplies it to the detector 212. The phase difference calculation unit 222
The phase difference A2 is calculated and supplied to the detection unit 212. The phase difference calculation unit 223 calculates the phase difference A3 and supplies the same to the detection unit 212.

【０１９７】音圧算出部２２４は、処理用音声信号Ｓ１
の音圧を算出し、音圧比算出部２２８に供給する。音圧
算出部２２５は、処理用音声信号Ｓ２の音圧を算出し、
音圧比算出部２２８および音圧比算出部２２９に供給す
る。The sound pressure calculating section 224 outputs the processing sound signal S1.
Is calculated and supplied to the sound pressure ratio calculation unit 228. The sound pressure calculation unit 225 calculates the sound pressure of the processing audio signal S2,
It is supplied to the sound pressure ratio calculation unit 228 and the sound pressure ratio calculation unit 229.

【０１９８】音圧算出部２２６は、処理用音声信号Ｓ３
の音圧を算出し、音圧比算出部２２９および音圧比算出
部２３０に供給する。音圧算出部２２７は、処理用音声
信号Ｓ４の音圧を算出し、音圧比算出部２３０に供給す
る。The sound pressure calculating section 226 outputs the processing sound signal S3
Is calculated and supplied to the sound pressure ratio calculation unit 229 and the sound pressure ratio calculation unit 230. The sound pressure calculator 227 calculates the sound pressure of the processing audio signal S4 and supplies the calculated sound pressure to the sound pressure ratio calculator 230.

【０１９９】音圧比算出部２２８は、音圧比Ｂ１を算出
し、検出部２１２に供給する。音圧比算出部２２９は、
音圧比Ｂ２を算出し、検出部２１２に供給する。音圧比
算出部２３０は、音圧比Ｂ３を算出し、検出部２１２に
供給する。The sound pressure ratio calculation section 228 calculates the sound pressure ratio B1 and supplies it to the detection section 212. The sound pressure ratio calculation unit 229
The sound pressure ratio B2 is calculated and supplied to the detection unit 212. The sound pressure ratio calculation unit 230 calculates the sound pressure ratio B3 and supplies it to the detection unit 212.

【０２００】図４３は、検出部２１２の構成例を示して
いる。FIG. 43 shows an example of the configuration of the detection section 212.

【０２０１】位置算出部２４１には、特徴量算出部２１
１（位相差算出部２２１乃至位相差算出部２２３）から
の位相差Ａ１乃至位相差Ａ３、並びに位置係数記憶部２
４２に記憶されている係数ａ_1、係数ｂ₁、係数ｃ₁、およ
び係数ｄ₁、係数ａ_2、係数ｂ₂、係数ｃ₂、および係数
ｄ₂、並びに係数ａ_3、係数ｂ₃、係数ｃ₃、および係数ｄ₃
が供給される。The position calculating section 241 includes the feature amount calculating section 21
1 (phase difference calculation units 221 to 223) and the phase coefficient storage unit 2
42 coefficients a ₁ stored in the _coefficient b _1, coefficients c _1, and coefficient d _1, coefficients a _2, the coefficient b _2, coefficients c _2, and the coefficient d _2, and the coefficient a _3, the coefficient b _3, coefficient c ₃ and coefficient d ₃
Is supplied.

【０２０２】位置算出部２４１は、係数ａ_1、係数ｂ₁、
係数ｃ₁、および係数ｄ₁、並びに位相差Ａ１からなる
ｘ，ｙ，ｚの１次方程式（式（２０））、係数ａ_2、係数
ｂ₂、係数ｃ₂、および係数ｄ₂、並びに位相差Ａ２から
なるｘ，ｙ，ｚの１次方程式（式（２１））、そして係
数ａ_3、係数ｂ₃、係数ｃ₃、および係数ｄ₃、並びに位相
差Ａ３からなるｘ，ｙ，ｚの１次方程式（式（２２））
の連立方程式を演算し、ｘ，ｙ，ｚを算出する。ａ₁×ｘ＋ｂ₁×ｙ＋ｃ₁×ｚ＋ｄ₁＝位相差Ａ１…（２０）ａ₂×ｘ＋ｂ₂×ｙ＋ｃ₂×ｚ＋ｄ₂＝位相差Ａ２…（２１）ａ₃×ｘ＋ｂ₃×ｙ＋ｃ₃×ｚ＋ｄ₃＝位相差Ａ３…（２２）The position calculating section 241 calculates the coefficient a _{1, the} coefficient b ₁ ,
X, y, z linear equations (Equation (20)) consisting of the coefficients c ₁ and d ₁ and the phase difference A1, the coefficients a _2, b ₂ , c ₂ and d ₂ , and x consisting retardation A2, y, 1 equations of z (formula (21)), and the coefficient a _3, the coefficient b _3, coefficients c _3, and the coefficient d _3, and consists of a phase difference A3 x, y, and z Linear equation (Equation (22))
X, y, and z are calculated. a ₁ × x + b ₁ × y + c ₁ × z + d ₁ = phase difference A1 (20) a ₂ × x + b ₂ × y + c ₂ × z + d ₂ = phase difference A2 (21) a ₃ × x + b ₃ × y + c ₃ × z + d ₃ = Phase difference A3 (22)

【０２０３】位置算出部２４１は、算出した座標（ｘ，
ｙ，ｚ）（音源の位置）を、向き検出部２４３および出
力部１４に供給する。The position calculator 241 calculates the calculated coordinates (x,
y, z) (the position of the sound source) is supplied to the direction detection unit 243 and the output unit 14.

【０２０４】位置係数記憶部２４２が記憶する、係数ａ
₁乃至ｄ₁、係数ａ₂乃至ｄ₂、および係数ａ₃乃至ｄ₃は、
基本的には、第４の実施の形態における係数ａ_L乃至ｃ_L
および係数ａ_R乃至ｃ_Rを算出する場合と同様にして求め
られる。The coefficient a stored in the position coefficient storage unit 242
_{1 to} d ₁ , coefficients a _{2 to} d ₂ , and coefficients a _{3 to} d ₃ are:
Basically, the coefficients a _{L to} c _L in the fourth embodiment are used.
And the coefficients a _{R to} c _R are calculated in the same manner as in the calculation.

【０２０５】すなわち、口元（音源）を、所定の複数の
位置（座標（ｘ_k，ｙ_k，ｚ_k））に移動させ、各位置に
あるときの音源から発せられた音声を集音し、位相差Ａ
１_K、位相差Ａ２_k、および位相差Ａ３_kを算出する実験
を行うことで、式（２３）乃至式（２５）に示すような
連立方程式が求められる。That is, the mouth (sound source) is moved to a plurality of predetermined positions (coordinates (x _k , y _k , z _k )), and the sound emitted from the sound source at each position is collected. Phase difference A
By conducting an experiment for calculating 1 _K , the phase difference A2 _k , and the phase difference A3 _k , simultaneous equations as shown in Expressions (23) to (25) are obtained.

【数１２】 (Equation 12)

【数１３】 (Equation 13)

【数１４】 [Equation 14]

【０２０６】式（２３）乃至式（２５）を、最小自乗法
により解くことで、係数ａ₁乃至ｄ₁、係数ａ₂乃至ｄ₂、
および係数ａ₃乃至ｄ₃のそれぞれが求められる。By solving the equations (23) to (25) by the least square method, the coefficients a _{1 to} d ₁ , the coefficients a _{2 to} d ₂ ,
And the coefficients a _{3 to} d ₃ are obtained.

【０２０７】向き検出部２４３は、図４４に示すよう
な、予め求められた、所定の位置（座標（ｘ，ｙ，
ｚ））、所定の向き（角度θ，φ，ψ）、並びにその位
置およびその向きの音源から発せられた音声の音圧比Ｂ
１，Ｂ２，Ｂ３の対応関係を表す対応表を記憶してい
る。すなわち、向き検出部２４３は、位置算出部２４１
からの座標（ｘ，ｙ，ｚ）および特徴量算出部２１１か
らの音圧比Ｂ１，Ｂ２，Ｂ３に基づいて、それらに対応
する角度θ，φ，ψを、音源の向きとして検出し、出力
部１４に出力する。The direction detecting section 243 determines a predetermined position (coordinates (x, y,
z)), predetermined directions (angles θ, φ, ψ), and the sound pressure ratio B of the sound emitted from the sound source at the position and the direction.
A correspondence table representing the correspondence between 1, B2, and B3 is stored. That is, the direction detection unit 243 includes the position calculation unit 241
Based on the coordinates (x, y, z) from the data and the sound pressure ratios B1, B2, and B3 from the feature amount calculation unit 211, the angles θ, φ, and 対応 corresponding thereto are detected as the direction of the sound source, and the output unit 14 is output.

【０２０８】この例の場合の検出装置２０１の動作を、
図４５のフローチャートを参照して説明する。The operation of the detecting device 201 in this example is as follows.
This will be described with reference to the flowchart in FIG.

【０２０９】ステップＳ８１において、検出装置２０１
の入力部１１は、入力したマイクアンプ２−１乃至２−
４のそれぞれからの処理用音声信号Ｓ１乃至Ｓ４の一部
（所定の時間内に入力した部分）を、特徴量算出部２１
１に供給する。At step S81, the detecting device 201
Input unit 11 is provided with the input microphone amplifiers 2-1 through 2-
4 (parts input within a predetermined time) from the feature amount calculating unit 21.
Supply 1

【０２１０】次に、ステップＳ８２において、特徴量算
出部２１１（位相差算出部２２１乃至２２３）は、マイ
クアンプ２―１乃至２−４から供給された処理用音声信
号Ｓ１乃至Ｓ４から、処理用音声信号Ｓ１の位相と処理
用音声信号Ｓ２の位相との位相差Ａ１、処理用音声信号
Ｓ２の位相と処理用音声信号Ｓ３の位相との位相差Ａ
２、および処理用音声信号Ｓ３の位相と処理用音声信号
Ｓ４の位相との位相差Ａ３をそれぞれ算出する。算出さ
れた位相差Ａ１乃至位相差Ａ３は、検出部２１２に供給
される。Next, in step S82, the characteristic amount calculation unit 211 (phase difference calculation units 221 to 223) converts the processing audio signals S1 to S4 supplied from the microphone amplifiers 2-1 to 2-4 into processing signals. The phase difference A1 between the phase of the audio signal S1 and the phase of the processing audio signal S2, and the phase difference A between the phase of the processing audio signal S2 and the phase of the processing audio signal S3.
2, and a phase difference A3 between the phase of the processing audio signal S3 and the phase of the processing audio signal S4. The calculated phase differences A1 to A3 are supplied to the detection unit 212.

【０２１１】ステップＳ８３において、特徴量算出部２
１１（音圧算出部２２４乃至２２７、音圧比算出部２２
８乃至２３０）は、処理用音声信号Ｓ１，Ｓ２，Ｓ３，
Ｓ４の音圧を算出するとともに、処理用音声信号Ｓ１，
Ｓ２の音圧から、音圧比Ｂ１を、処理用音声信号Ｓ２，
Ｓ３の音圧から、音圧比Ｂ２を、そして処理用音声信号
Ｓ３，Ｓ４の音圧から、音圧比Ｂ３を、それぞれ算出す
る。算出された音圧比Ｂ１乃至音圧比Ｂ３は、検出部２
１２に供給される。In step S83, the characteristic amount calculation unit 2
11 (sound pressure calculation units 224 to 227, sound pressure ratio calculation unit 22
8 to 230) are processing audio signals S1, S2, S3,
While calculating the sound pressure of S4, the processing sound signal S1,
From the sound pressure of S2, the sound pressure ratio B1 is converted to the processing sound signal S2.
The sound pressure ratio B2 is calculated from the sound pressure of S3, and the sound pressure ratio B3 is calculated from the sound pressures of the processing audio signals S3 and S4. The calculated sound pressure ratio B1 to sound pressure ratio B3 is
12 is supplied.

【０２１２】次に、ステップＳ８４において、検出部２
１２（位置算出部２４１）は、特徴量算出部２１１から
供給された位相差Ａ１乃至位相差Ａ３、並びに自分自身
（位置係数記憶部２４２）が記憶する係数ａ₁乃至ｄ₁、
係数ａ₂乃至ｄ₂、および係数ａ₃乃至ｄ₃の式（２０）乃
至式（２２）からなる連立方程式を解き、ｘ、ｙ、およ
びｚを算出する。算出された座標（ｘ，ｙ，ｚ）は、出
力部１４に供給される。Next, in step S84, the detecting unit 2
12 (the position calculation unit 241) includes the phase differences A1 to A3 supplied from the feature amount calculation unit 211, and the coefficients a _{1 to} d ₁ stored in itself (the position coefficient storage unit 242).
X, y, and z are calculated by solving simultaneous equations composed of the equations (20) to (22) of the coefficients a _{2 to} d ₂ and the coefficients a _{3 to} d ₃ . The calculated coordinates (x, y, z) are supplied to the output unit 14.

【０２１３】ステップＳ８５において、検出部２１２
（向き検出部２４３）は、図４４の対応表から、算出さ
れた座標（ｘ，ｙ，ｚ）に最も近い座標および算出され
た音圧比Ｂ１，Ｂ２，Ｂ３に最も近い音圧比を検出する
とともに、それらに対応する角度θ，φ，角度ψを検出
する。検出された角度θ、角度φ、および角度ψは、出
力部１４に供給される。At step S85, the detecting section 212
The (direction detection unit 243) detects the coordinates closest to the calculated coordinates (x, y, z) and the sound pressure ratios closest to the calculated sound pressure ratios B1, B2, and B3 from the correspondence table in FIG. , And the angles θ, φ, and ψ corresponding to them are detected. The detected angle θ, angle φ, and angle ψ are supplied to the output unit 14.

【０２１４】ステップＳ８６において、出力部１４は、
検出部２１２から供給された座標（ｘ，ｙ，ｚ）を、口
元（音源）の位置として、並びに角度θ、角度φ、およ
び角度ψを、口元の向きとして、外部の装置に出力す
る。In step S86, the output unit 14
The coordinates (x, y, z) supplied from the detection unit 212 are output to an external device as the position of the mouth (sound source), and the angles θ, φ, and ψ as the directions of the mouth.

【０２１５】その後、処理は終了する。Thereafter, the processing ends.

【０２１６】なお、以上においては、特徴量算出部２１
１は、処理用音声信号Ｓ１乃至Ｓ４から直接音圧を算出
し、そして音圧比Ｂ１乃至Ｂ３を算出する場合を例とし
て説明したが、処理用音声信号Ｓ１乃至Ｓ４の高周波部
分の信号（高周波処理用音声信号Ｓ１h乃至Ｓ４h）の音
圧を算出し、それらの音圧比Ｂ１h乃至Ｂ３hを算出する
こともできる。高周波部分の信号の音圧比は、音源との
距離に応じても変化するが、指向性があることから、そ
れを利用することで、より正確に音源の向きを検出する
ことができる。特に有効である。In the above description, the feature value calculating unit 21
1 has been described as an example in which the sound pressure is directly calculated from the processing audio signals S1 to S4 and the sound pressure ratios B1 to B3 are calculated, but the signals of the high frequency portions of the processing audio signals S1 to S4 (high-frequency processing It is also possible to calculate the sound pressures of the audio signals for use S1h to S4h) and calculate their sound pressure ratios B1h to B3h. The sound pressure ratio of the signal in the high-frequency portion changes depending on the distance from the sound source, but since it has directivity, the direction of the sound source can be detected more accurately by using it. Especially effective.

【０２１７】この例の場合における特徴量算出部２１１
の構成例を、図４６に示す。この特徴量算出部２１１に
は、図４２の特徴量算出部２１１の音圧算出部２２４乃
至２２７の前段に、ハイパスフィルタ２５１乃至２５４
が設けられている。In this example, the feature amount calculation unit 211 is used.
46 is shown in FIG. This feature amount calculation unit 211 includes high-pass filters 251 to 254 before the sound pressure calculation units 224 to 227 of the feature amount calculation unit 211 in FIG.
Is provided.

【０２１８】ハイパスフィルタ２５１は、入力部１１か
ら供給された処理用音声信号Ｓ１の高周波部分のみを通
過させて、音圧算出部２２４に供給する。ハイパスフィ
ルタ２５２は、入力部１１から供給された処理用音声信
号Ｓ２の高周波部分のみを通過させて、音圧算出部２２
５に供給する。The high-pass filter 251 passes only the high-frequency portion of the processing audio signal S 1 supplied from the input unit 11 and supplies it to the sound pressure calculation unit 224. The high-pass filter 252 allows only the high-frequency portion of the processing audio signal S2 supplied from the input unit 11 to pass, and
5

【０２１９】ハイパスフィルタ２５３は、入力部１１か
ら供給された処理用音声信号Ｓ３の高周波部分のみを通
過させて、音圧算出部２２６に供給する。ハイパスフィ
ルタ２５４は、入力部１１から供給された処理用音声信
号Ｓ４の高周波部分のみを通過させて、音圧算出部２２
７に供給する。The high-pass filter 253 allows only the high-frequency portion of the processing audio signal S3 supplied from the input unit 11 to pass and supplies it to the sound pressure calculation unit 226. The high-pass filter 254 allows only the high-frequency portion of the processing audio signal S4 supplied from the input unit 11 to pass, and
7

【０２２０】音圧算出部２２４は、高周波処理用音声信
号Ｓ１hの音圧を算出し、音圧比算出部２２８に供給す
る。音圧算出部２２５は、高周波処理用音声信号Ｓ２h
の音圧を算出し、音圧比算出部２２８および音圧比算出
部２２９に供給する。[0220] The sound pressure calculation section 224 calculates the sound pressure of the audio signal S1h for high-frequency processing and supplies it to the sound pressure ratio calculation section 228. The sound pressure calculation unit 225 outputs the high-frequency processing audio signal S2h.
Is calculated and supplied to the sound pressure ratio calculation unit 228 and the sound pressure ratio calculation unit 229.

【０２２１】音圧算出部２２６は、高周波処理用音声信
号Ｓ３hの音圧を算出し、音圧比算出部２２９および音
圧比算出部２３０に供給する。音圧算出部２２７は、高
周波処理用音声信号Ｓ４hの音圧を算出し、音圧比算出
部２３０に供給する。The sound pressure calculation section 226 calculates the sound pressure of the high frequency processing audio signal S3h, and supplies it to the sound pressure ratio calculation section 229 and the sound pressure ratio calculation section 230. The sound pressure calculation section 227 calculates the sound pressure of the high-frequency processing audio signal S4h and supplies it to the sound pressure ratio calculation section 230.

【０２２２】音圧比算出部２２８は、高周波処理用音声
信号Ｓ１hの音圧と高周波処理用音声信号Ｓ２hの音圧と
の音圧比Ｂ１を算出し、検出部２１２に供給する。音圧
比算出部２２９は、高周波処理用音声信号Ｓ２hの音圧
と高周波処理用音声信号Ｓ３hの音圧との音圧比Ｂ２を
算出し、検出部２１２に供給する。音圧比算出部２３０
は、高周波処理用音声信号Ｓ３hの音圧と高周波処理用
音声信号Ｓ４hの音圧との音圧比Ｂ３を算出し、検出部
２１２に供給する。The sound pressure ratio calculating section 228 calculates the sound pressure ratio B1 between the sound pressure of the high-frequency processing audio signal S1h and the sound pressure of the high-frequency processing audio signal S2h, and supplies it to the detection section 212. The sound pressure ratio calculation unit 229 calculates a sound pressure ratio B2 between the sound pressure of the high-frequency processing audio signal S2h and the sound pressure of the high-frequency processing audio signal S3h, and supplies this to the detection unit 212. Sound pressure ratio calculator 230
Calculates the sound pressure ratio B3 between the sound pressure of the high-frequency processing sound signal S3h and the sound pressure of the high-frequency processing sound signal S4h, and supplies the sound pressure ratio B3 to the detection unit 212.

【０２２３】なお、以上においては、音源（口元）の状
態（位置、または向き）を検出するタイミングについて
は、言及しなかったが、検出装置３，３１，６１，１０
１，２０１に、操作部を設け、その操作部が操作された
とき、音源の状態が検出されるようにしたり、また所定
の周期で、所定の期間継続して検出されるようにしたり
することができる。Although the timing of detecting the state (position or direction) of the sound source (mouth) has not been described above, the detection devices 3, 31, 61, 10
An operation unit is provided in each of the devices 201 and 201 so that when the operation unit is operated, the state of the sound source is detected, or is detected at a predetermined cycle and continuously for a predetermined period. Can be.

【０２２４】また、以上においては、第１の実施の形態
乃至第５の実施の形態について、それぞれ別個に説明し
たが、それらを組み合わせて、ユーザの状態を検出する
ようにすることもできる。例えば、第４の実施の形態お
よび第５の実施の形態においては、音源の位置および向
きの両方を算出する場合を例として説明したが、音源の
位置のみを検出することもできる。Further, in the above, the first to fifth embodiments have been described separately, but they may be combined to detect the state of the user. For example, in the fourth and fifth embodiments, the case where both the position and the direction of the sound source are calculated has been described as an example. However, only the position of the sound source may be detected.

【０２２５】また、第４の実施の形態および第５の実施
の形態において、処理用音声信号から直接音圧を算出
し、その音圧で音圧比を算出する場合と、高周波処理用
音声信号の音圧を算出し、その音圧で音圧比を算出する
場合を別個に説明したが、それらを組み合わせて利用す
ることもできる。In the fourth and fifth embodiments, the case where the sound pressure is directly calculated from the processing audio signal and the sound pressure ratio is calculated based on the calculated sound pressure is different from the case where the sound pressure ratio is calculated based on the sound pressure. Although the case where the sound pressure is calculated and the sound pressure ratio is calculated based on the sound pressure has been described separately, they can be used in combination.

【０２２６】上述した一連の処理は、ハードウエアによ
り実現させることもできるが、ソフトウエアにより実現
させることもできる。一連の処理をソフトウエアにより
実現する場合には、そのソフトウエアを構成するプログ
ラムがコンピュータにインストールされ、そのプログラ
ムがコンピュータで実行されることより、上述した検出
装置３，３１，６１，１０１，２０１が機能的に実現さ
れる。The series of processes described above can be realized by hardware, but can also be realized by software. When a series of processing is realized by software, a program constituting the software is installed in a computer, and the program is executed by the computer, so that the detection devices 3, 31, 61, 101, 201 described above are used. Is functionally realized.

【０２２７】図４７は、上述のような検出装置３，３
１，６１，１０１，１０２として機能するコンピュータ
５０１の一実施の形態の構成を示すブロック図である。
CPU（Central Processing Unit）５１１にはバス５１５
を介して入出力インタフェース５１６が接続されてお
り、CPU５１１は、入出力インタフェース５１６を介し
て、ユーザから、キーボード、マウスなどよりなる入力
部５１８から指令が入力されると、例えば、ROM（Read
Only Memory）５１２、ハードディスク５１４、または
ドライブ５２０に装着される磁気ディスク５３１、光デ
ィスク５３２、光磁気ディスク５３３、若しくは半導体
メモリ５３４などの記録媒体に格納されているプログラ
ムを、RAM（Random Access Memory）５１３にロードし
て実行する。これにより、上述した各種の処理が行われ
る。さらに、CPU５１１は、その処理結果を、例えば、
入出力インタフェース５１６を介して、LCD（Liquid Cr
ystalDisplay）などよりなる表示部５１７に必要に応じ
て出力する。なお、プログラムは、ハードディスク５１
４やROM５１２に予め記憶しておき、コンピュータ５０
１と一体的にユーザに提供したり、磁気ディスク５３
１、光ディスク５３２、光磁気ディスク５３３，半導体
メモリ５３４等のパッケージメディアとして提供した
り、衛星、ネットワーク等から通信部５１９を介してハ
ードディスク５１４に提供することができる。FIG. 47 shows the detection devices 3 and 3 as described above.
FIG. 1 is a block diagram illustrating a configuration of an embodiment of a computer 501 that functions as 1, 61, 101, and 102.
A bus 515 is provided for a CPU (Central Processing Unit) 511.
An input / output interface 516 is connected via the input / output interface 516. When a command is input from the user via the input / output interface 516 to the input unit 518 including a keyboard, a mouse, and the like, the CPU 511 reads, for example, a ROM (Read
Only memory) 512, a hard disk 514, or a program stored in a recording medium such as a magnetic disk 531, an optical disk 532, a magneto-optical disk 533, or a semiconductor memory 534 mounted on the drive 520 is stored in a random access memory (RAM) 513. And run it. Thereby, the various processes described above are performed. Further, the CPU 511 transmits the processing result to, for example,
LCD (Liquid Cr) via the input / output interface 516
Output to a display unit 517 such as a ystalDisplay) as necessary. The program is stored on the hard disk 51
4 or ROM 512 in advance, and the computer 50
1 and the magnetic disk 53
1. It can be provided as a package medium such as an optical disk 532, a magneto-optical disk 533, or a semiconductor memory 534, or can be provided to a hard disk 514 via a communication unit 519 from a satellite, a network, or the like.

【０２２８】なお、本明細書において、記録媒体により
提供されるプログラムを記述するステップは、記載され
た順序に沿って時系列的に行われる処理はもちろん、必
ずしも時系列的に処理されなくとも、並列的あるいは個
別に実行される処理をも含むものである。In the present specification, the step of describing a program provided by a recording medium may be performed not only in chronological order according to the described order but also in chronological order. This also includes processing executed in parallel or individually.

【０２２９】また、本明細書において、システムとは、
複数の装置により構成される装置全体を表すものであ
る。[0229] In this specification, the system is
It represents the entire device composed of a plurality of devices.

【０２３０】[0230]

【発明の効果】本発明の情報処理装置および方法、並び
に第１のプログラムによれば、所定の位置に設置された
複数の集音装置により集音された、所定の音源が発した
音声に対応する音声信号から、音源の状態に対応する特
徴量を算出し、算出された特徴量に基づいて、音源の状
態を検出するようにしたので、音源が発した音声から、
音源の状態を検出することができる。According to the information processing apparatus and method and the first program of the present invention, it is possible to cope with a sound emitted by a predetermined sound source, which is collected by a plurality of sound collection devices installed at predetermined positions. From the audio signal to be calculated, a feature amount corresponding to the state of the sound source is calculated, and based on the calculated feature amount, the state of the sound source is detected.
The state of the sound source can be detected.

【０２３１】本発明の情報生成装置および方法、並びに
第２のプログラムによれば、第１の集音装置、第２の集
音装置、および第３の集音装置のそれぞれにより集音さ
れた、所定の範囲内に設けられた複数の実験位置に位置
した音源が発した音声に対応する第４の音声信号、第５
の音声信号、および第６の音声信号から、第４の音声信
号の位相と第５の音声信号の位相との第３の位相差、お
よび第４の音声信号の位相と第６の音声信号との第４の
位相差を、実験位置毎に算出するようにしたので、第３
の位相差と実験位置との関係に基づいて、第１の係数を
算出し、第４の位相差と実験位置との関係に基づいて、
第２の係数が算出することができる。According to the information generating apparatus and method of the present invention, and the second program, the sound collected by each of the first sound collecting apparatus, the second sound collecting apparatus, and the third sound collecting apparatus, A fourth sound signal corresponding to a sound emitted from a sound source located at a plurality of experiment positions provided within a predetermined range, a fifth sound signal
From the audio signal and the sixth audio signal, the third phase difference between the phase of the fourth audio signal and the phase of the fifth audio signal, and the phase of the fourth audio signal and the sixth audio signal. Since the fourth phase difference is calculated for each experimental position,
A first coefficient is calculated based on the relationship between the phase difference and the experimental position, and based on the relationship between the fourth phase difference and the experimental position,
A second coefficient can be calculated.

[Brief description of the drawings]

【図１】本発明を適用した音源状態検出システムの第１
の実施の形態の構成例を示すブロック図である。FIG. 1 shows a first example of a sound source state detection system to which the present invention is applied.
It is a block diagram which shows the example of a structure of 1st Embodiment.

【図２】マイク１の設置位置を示す図である。FIG. 2 is a diagram showing an installation position of a microphone 1;

【図３】音圧比ＲＬと音源の向きとの対応関係を示す図
である。FIG. 3 is a diagram showing a correspondence relationship between a sound pressure ratio RL and a direction of a sound source.

【図４】図１の検出装置３の構成例を示すブロック図で
ある。FIG. 4 is a block diagram showing a configuration example of a detection device 3 of FIG.

【図５】図４の特徴量算出部１２の構成例を示すブロッ
ク図である。FIG. 5 is a block diagram illustrating a configuration example of a feature amount calculation unit 12 in FIG. 4;

【図６】音源状態検出システムの動作を説明するフロー
チャートである。FIG. 6 is a flowchart illustrating an operation of the sound source state detection system.

【図７】図６のステップＳ３の処理の詳細を説明するフ
ローチャートである。FIG. 7 is a flowchart illustrating details of a process in step S3 of FIG. 6;

【図８】本発明を適用した音源状態検出システムの第２
の実施の形態の構成例を示すブロック図である。FIG. 8 shows a second example of the sound source state detection system to which the present invention is applied.
It is a block diagram which shows the example of a structure of 1st Embodiment.

【図９】マイク１の設置位置を示す他の図である。FIG. 9 is another diagram showing an installation position of the microphone 1;

【図１０】音圧比ＲＬhと音源の向きとの対応関係を示
す図である。FIG. 10 is a diagram showing a correspondence relationship between a sound pressure ratio RLh and a direction of a sound source.

【図１１】図８の検出装置３１の構成例を示すブロック
図である。11 is a block diagram illustrating a configuration example of a detection device 31 in FIG.

【図１２】図１１の特徴量算出部４１の構成例を示すブ
ロック図である。12 is a block diagram illustrating a configuration example of a feature amount calculation unit 41 in FIG. 11;

【図１３】カットオフ特性を示す図である。FIG. 13 is a diagram illustrating cutoff characteristics.

【図１４】ピッチ周波数を説明する図である。FIG. 14 is a diagram illustrating a pitch frequency.

【図１５】音源状態検出システムの他の動作を説明する
フローチャートである。FIG. 15 is a flowchart illustrating another operation of the sound source state detection system.

【図１６】図１５のステップＳ２３の処理の詳細を説明
するフローチャートである。FIG. 16 is a flowchart illustrating details of a process in step S23 in FIG. 15;

【図１７】本発明を適用した音源状態検出システムの第
３の実施の形態の構成例を示すブロック図である。FIG. 17 is a block diagram illustrating a configuration example of a third embodiment of a sound source state detection system to which the present invention has been applied.

【図１８】位相差と顔の向きとの対応関係を示す図であ
る。FIG. 18 is a diagram illustrating a correspondence relationship between a phase difference and a face direction.

【図１９】位相差を説明する図である。FIG. 19 is a diagram illustrating a phase difference.

【図２０】位相差を説明する他の図である。FIG. 20 is another diagram illustrating a phase difference.

【図２１】位相差を説明する他の図である。FIG. 21 is another diagram illustrating a phase difference.

【図２２】図１７の検出装置６１の構成例を示すブロッ
ク図である。22 is a block diagram illustrating a configuration example of a detection device 61 in FIG.

【図２３】音源状態検出システムの他の動作を説明する
フローチャートである。FIG. 23 is a flowchart illustrating another operation of the sound source state detection system.

【図２４】図２３のステップＳ４３の処理の詳細を説明
するフローチャートである。FIG. 24 is a flowchart illustrating details of a process in step S43 of FIG. 23;

【図２５】本発明を適用した音源状態検出システムの第
４の実施の形態の構成例を示すブロック図である。FIG. 25 is a block diagram illustrating a configuration example of a fourth embodiment of a sound source state detection system to which the present invention has been applied.

【図２６】マイク１の配置位置を説明する図である。FIG. 26 is a diagram illustrating an arrangement position of the microphone 1.

【図２７】位置および向きを説明する図である。FIG. 27 is a diagram illustrating a position and an orientation.

【図２８】図２５の検出装置１０１の構成例を示すブロ
ック図である。FIG. 28 is a block diagram illustrating a configuration example of a detection device 101 in FIG. 25.

【図２９】図２８の特徴量算出部１１１の構成例を示す
ブロック図である。29 is a block diagram illustrating a configuration example of a feature amount calculation unit 111 in FIG. 28.

【図３０】図２８の検出部１１２の構成例を示すブロッ
ク図である。30 is a block diagram illustrating a configuration example of a detection unit 112 in FIG. 28.

【図３１】ΔＬおよびΔＲを説明する図である。FIG. 31 is a diagram illustrating ΔL and ΔR.

【図３２】距離Ｃおよび距離Ｌを説明する図である。FIG. 32 is a diagram illustrating a distance C and a distance L.

【図３３】位相差ＣＬおよび位相差ＣＲを説明する図で
ある。FIG. 33 is a diagram illustrating a phase difference CL and a phase difference CR.

【図３４】基準位置ｐを説明する図である。FIG. 34 is a diagram illustrating a reference position p.

【図３５】音圧比ＲＬを説明する図である。FIG. 35 is a diagram illustrating a sound pressure ratio RL.

【図３６】検出装置１０１の動作を説明するフローチャ
ートである。FIG. 36 is a flowchart illustrating an operation of the detection device 101.

【図３７】検出装置１０１の他の動作を説明するフロー
チャートである。FIG. 37 is a flowchart illustrating another operation of the detection device 101.

【図３８】特徴量算出部１１１の他の構成例を示すブロ
ック図である。38 is a block diagram illustrating another configuration example of the feature amount calculation unit 111. FIG.

【図３９】本発明を適用した音源状態検出システムの第
５の実施の形態の構成例を示すブロック図である。FIG. 39 is a block diagram illustrating a configuration example of a fifth embodiment of a sound source state detection system to which the present invention has been applied.

【図４０】位置および向きを説明する他の図である。FIG. 40 is another diagram illustrating a position and an orientation.

【図４１】図３９の検出装置２０１の構成例を示すブロ
ック図である。41 is a block diagram illustrating a configuration example of a detection device 201 in FIG. 39.

【図４２】図４１の特徴量算出部２１１の構成例を示す
ブロック図である。42 is a block diagram illustrating a configuration example of a feature amount calculation unit 211 in FIG. 41.

【図４３】図４１の検出部２１２の構成例を示すブロッ
ク図である。FIG. 43 is a block diagram illustrating a configuration example of a detection unit 212 in FIG. 41.

【図４４】座標、音圧比、および角度の対応関係を示す
対応表の例を示す図である。FIG. 44 is a diagram showing an example of a correspondence table showing a correspondence relationship among coordinates, sound pressure ratios, and angles.

【図４５】検出装置２０２の動作を説明するフローチャ
ートである。FIG. 45 is a flowchart illustrating an operation of the detection device 202.

【図４６】特徴量算出部２１１の他の構成例を示すブロ
ック図である。FIG. 46 is a block diagram illustrating another configuration example of the feature amount calculation unit 211.

【図４７】パーソナルコンピュータ５０１の構成例を示
すブロック図である。FIG. 47 is a block diagram illustrating a configuration example of a personal computer 501.

【符号の説明】１マイクロフォン，２マイクアンプ，３検出
装置，１１入力部，１２特徴量算出部，１３
検出部，１４出力部，２１音圧算出部，２
２音圧比算出部，３１検出装置，４１特徴量
算出部，４２検出部，５１ハイパスフィルタ，
６１検出装置，７１特徴量算出部，７２検出
部，１０１検出装置，１１１特徴量算出部，
１１２検出部，１２１位相差算出部，１２２位
相差算出部，１２３音圧算出部，１２４音圧算
出部，１２５音圧比算出部，１３１位置算出
部，１３２位置係数記憶部，１３３向き係数決
定部，１３４向き算出部，１４１ハイパスフィ
ルタ，１４２ハイパスフィルタ，２０１検出装
置，２１１特徴量算出部，２１２検出部，２
２１位相差算出部，２２２位相差算出部，２２
３位相差算出部，２２４音圧算出部，２２５
音圧算出部，２２６音圧算出部，２２７音圧算
出部，２２８音圧比算出部，２２９音圧比算出
部，２３０音圧比算出部，２４１位置算出部，
２４２位置係数記憶部，２４３向き検出部，
２５１ハイパスフィルタ，２５２ハイパスフィル
タ，２５３ハイパスフィルタ，２５４ハイパス
フィルタ[Description of Signs] 1 microphone, 2 microphone amplifier, 3 detection device, 11 input unit, 12 feature amount calculation unit, 13
Detection unit, 14 output unit, 21 sound pressure calculation unit, 2
2 sound pressure ratio calculation section, 31 detection device, 41 feature quantity calculation section, 42 detection section, 51 high-pass filter,
61 detection device, 71 feature amount calculation unit, 72 detection unit, 101 detection device, 111 feature amount calculation unit,
112 detection section, 121 phase difference calculation section, 122 phase difference calculation section, 123 sound pressure calculation section, 124 sound pressure calculation section, 125 sound pressure ratio calculation section, 131 position calculation section, 132 position coefficient storage section, 133 direction coefficient determination section , 134 orientation calculation unit, 141 high-pass filter, 142 high-pass filter, 201 detection device, 211 feature amount calculation unit, 212 detection unit, 2
21 phase difference calculator, 222 phase difference calculator, 22
3 phase difference calculator, 224 sound pressure calculator, 225
Sound pressure calculating section, 226 sound pressure calculating section, 227 sound pressure calculating section, 228 sound pressure ratio calculating section, 229 sound pressure ratio calculating section, 230 sound pressure ratio calculating section, 241 position calculating section,
242 position coefficient storage unit, 243 direction detection unit,
251 high-pass filter, 252 high-pass filter, 253 high-pass filter, 254 high-pass filter

───────────────────────────────────────────────────── フロントページの続き (72)発明者藤島幸一東京都品川区北品川６丁目７番35号ソニー株式会社内 (72)発明者大月知之東京都品川区北品川６丁目７番35号ソニー株式会社内Ｆターム(参考） 5D018 BB23 5D020 BB04 ────────────────────────────────────────────────── ─── Continuation of front page (72) Inventor Koichi Fujishima 6-7-35 Kita Shinagawa, Shinagawa-ku, Tokyo Soni ー Inc. (72) Inventor Tomoyuki Otsuki 6-7-35 Kita Shinagawa, Shinagawa-ku, Tokyo Soni ー Inc. F term (reference) 5D018 BB23 5D020 BB04

Claims

[Claims]

1. A method for calculating a feature amount corresponding to a state of a sound source from an audio signal corresponding to a sound emitted from a predetermined sound source, which is collected by a plurality of sound collecting devices installed at a predetermined position. An information processing apparatus, comprising: a first calculating unit; and a state detecting unit that detects a state of the sound source based on the feature amount calculated by the first calculating unit.

2. The information processing apparatus according to claim 1, wherein the state is a direction of the sound source at a predetermined position.

3. The apparatus according to claim 2, wherein the state detecting means is obtained in advance.
2. The apparatus according to claim 1, wherein correspondence data indicating a correspondence relationship between the feature amount and the state is held, and the state is detected based on the correspondence data and the feature amount.
An information processing apparatus according to claim 1.

4. The first sound signal corresponding to the sound emitted by the sound source, collected by the first sound collecting device, and the second sound collecting device. A sound pressure ratio between a sound pressure of the first sound signal and a sound pressure of the second sound signal, from a second sound signal corresponding to a sound emitted by the sound source collected by Calculated as an amount, the state detection means holds the previously obtained correspondence data indicating the correspondence between the sound pressure ratio and the state, based on the correspondence data and the sound pressure ratio,
The information processing apparatus according to claim 3, wherein the state is detected.

5. The first calculating means includes: a first sound signal corresponding to a sound emitted by the sound source, collected by the first sound collecting device; and a second sound collecting device. A sound pressure of a first high-frequency sound signal composed of a high-frequency portion of the first sound signal from the second sound signal corresponding to the sound emitted by the sound source, and a second sound A sound pressure ratio between a sound pressure of a second high-frequency sound signal composed of a high-frequency portion of the signal and the sound pressure is calculated as the characteristic amount. Holding the corresponding relationship data shown, based on the corresponding relationship data and the sound pressure ratio,
The information processing apparatus according to claim 3, wherein the state is detected.

6. The first sound signal corresponding to the sound emitted by the sound source, collected by the first sound collecting device, and the second sound collecting device. A phase difference between the phase of the first audio signal and the phase of the second audio signal from the second audio signal corresponding to the audio emitted by the sound source, collected as Calculating, the state detection means holds the correspondence data indicating the correspondence between the phase difference and the state obtained in advance, based on the correspondence data and the phase difference,
The information processing apparatus according to claim 3, wherein the state is detected.

7. The first calculating means is located within a predetermined range collected by the first sound collecting device, the second sound collecting device, and the third sound collecting device. From the first audio signal, the second audio signal, and the third audio signal corresponding to the audio emitted by the sound source, the phase of the first audio signal and the phase of the second audio signal And the second phase difference between the phase of the first audio signal and the phase of the third audio signal are calculated as the characteristic amount. And the second
The information processing apparatus according to claim 1, wherein the position of the sound source on a plane is detected based on the phase difference of the sound source.

8. A plurality of experimental positions provided within a predetermined range and collected by each of the first sound collecting device, the second sound collecting device, and the third sound collecting device. From the fourth audio signal, the fifth audio signal, and the sixth audio signal corresponding to the audio emitted by the located sound source, the phase of the fourth audio signal and the phase of the fifth audio signal Calculating a third phase difference with respect to the phase and a fourth phase difference between the phase of the fourth sound signal and the sixth sound signal for each of the experimental positions; Second calculating means for calculating a first coefficient based on a relationship between the first phase difference and the experimental position, and a second coefficient based on a relationship between the fourth phase difference and the experimental position; and the second calculating means. Storage means for storing the first coefficient and the second coefficient calculated by The state detecting means includes: on the plane of the sound source, based on a relationship between the first phase difference and the first coefficient and a relationship between the second phase difference and the second coefficient. The information processing apparatus according to claim 7, wherein the position of the information processing device is detected.

9. The method according to claim 1, wherein the first calculating unit calculates a sound pressure of the second sound signal and a sound pressure of the third sound signal from the second sound signal and the third sound signal. 1 is further calculated as the characteristic amount, and the state detection means detects the direction of the sound source based on the first sound pressure ratio and the detected position of the sound source. The information processing device according to claim 7.

10. A sound source, which is collected by each of the second sound collecting device and the third sound collecting device, is located at a plurality of reference positions and corresponds to a sound emitted by the sound source directed in a predetermined direction. Calculating a second sound pressure ratio between the sound pressure of the fourth sound signal and the sound pressure of the fifth sound signal from the fourth sound signal and the fifth sound signal. A second calculating unit that calculates a coefficient based on the relationship between the sound pressure ratio and the predetermined direction for each of the reference positions; and a storage unit that stores the coefficient in association with the reference position. The state detection means detects the reference position corresponding to the detected position of the sound source, selects the coefficient corresponding to the detected reference position from the storage means, and selects the selected coefficient and the first coefficient. Based on the relationship with the sound pressure ratio of The information processing apparatus according to claim 9, characterized in that detecting the direction of the source.

11. The first calculating means, wherein sound is collected by a first sound collecting device, a second sound collecting device, a third sound collecting device, and a fourth sound collecting device. From the first audio signal, the second audio signal, the third audio signal, and the fourth audio signal corresponding to the audio emitted by the sound source located within a predetermined range, A first phase difference between the phase of the audio signal and the phase of the second audio signal, a second phase difference between the phase of the second audio signal and the phase of the third audio signal, and A third phase difference between the phase of the third audio signal and the phase of the fourth audio signal is calculated as the feature quantity, and the state detection unit calculates the first phase difference and the second phase difference And detecting a spatial position of the sound source based on the third phase difference.
An information processing apparatus according to claim 1.

12. Within a predetermined range collected by each of the first sound collecting device, the second sound collecting device, the third sound collecting device, and the fourth sound collecting device. A fifth sound signal, a sixth sound signal, and a seventh sound signal corresponding to sounds emitted by the sound sources located at the provided plurality of experimental positions.
From the audio signal and the eighth audio signal, the fourth phase difference between the phase of the fifth audio signal and the phase of the sixth audio signal, the phase of the sixth audio signal and the fourth Calculating a fifth phase difference from the phase of the audio signal No. 7 and a sixth phase difference between the phase of the seventh audio signal and the phase of the eighth audio signal for each of the experimental positions; A first coefficient based on the relationship between the fourth phase difference and the experimental position, a second coefficient based on the relationship between the fifth phase difference and the experimental position, and the sixth phase difference A second calculating means for calculating a third coefficient based on a relationship with an experimental position; and the first coefficient calculated by the second calculating means;
A storage unit that stores the second coefficient and the third coefficient, wherein the state detection unit determines a relationship between the first phase difference and the first coefficient, the second phase difference The position of the sound source in space is detected based on a relationship between the sound source and the second coefficient and a relationship between the third phase difference and the third coefficient. An information processing apparatus according to claim 1.

13. The method according to claim 1, wherein the first calculating unit calculates the first audio signal from the first audio signal, the second audio signal, the third audio signal, and the fourth audio signal. A first sound pressure ratio between a sound pressure and a sound pressure of the second sound signal; a second sound pressure ratio between a sound pressure of the second sound signal and a sound pressure of the third sound signal; A third sound pressure ratio between the sound pressure of the sound signal of No. 3 and the sound pressure of the fourth sound signal is further calculated as the characteristic amount. The information processing apparatus according to claim 11, wherein the direction of the sound source is detected based on the sound pressure ratio of 2, the third sound pressure ratio, and the detected position of the sound source.

14. A plurality of reference positions collected by each of the first sound collecting device, the second sound collecting device, the third sound collecting device, and the fourth sound collecting device. A fifth sound signal, a sixth sound signal, a seventh sound signal, and an eighth sound signal corresponding to sounds emitted by the sound source positioned and directed in a predetermined direction; 5
The fourth of the sound pressure of the sound signal of
, A fifth sound pressure ratio between the sound pressure of the sixth sound signal and the sound pressure of the seventh sound signal, and the sound pressure of the seventh sound signal and the sound of the eighth sound signal. Second calculating means for calculating a sixth sound pressure ratio with respect to the pressure, respectively, and the fourth sound pressure ratio, the fifth sound pressure ratio, and the sixth sound pressure ratio, A storage unit for storing the direction, wherein the state detection unit stores the reference position corresponding to the detected position of the sound source from the storage unit;
The direction corresponding to the sound pressure ratio, the fourth sound pressure ratio corresponding to the second sound pressure ratio and the third sound pressure ratio, and the direction corresponding to the fifth sound pressure ratio and the sixth sound pressure ratio is defined as the direction of the sound source. The information processing apparatus according to claim 13, wherein the information is detected.

15. A calculation for calculating a characteristic amount corresponding to a state of a sound source from an audio signal corresponding to a sound emitted by a predetermined sound source, which is collected by a plurality of sound collection devices installed at a predetermined position. And a state detecting step of detecting a state of the sound source based on the feature amount calculated in the processing of the calculating step.

16. A calculation for calculating a feature amount corresponding to a state of a sound source from an audio signal corresponding to a sound emitted by a predetermined sound source, which is collected by a plurality of sound collection devices installed at a predetermined position. And a state detecting step of detecting a state of the sound source based on the feature amount calculated in the processing of the calculating step. A recording medium in which a computer-readable program is recorded.

17. A calculation for calculating a feature amount corresponding to a state of a sound source from an audio signal corresponding to a sound emitted by a predetermined sound source, which is collected by a plurality of sound collection devices installed at a predetermined position. A program causing a computer to execute a process including: a step; and a state detection step of detecting a state of the sound source based on the feature amount calculated in the processing of the calculation step.

18. A first sound collecting device, a second sound collecting device, and a first sound collecting device, the first sound collecting device corresponding to the sound emitted from a sound source located within a predetermined range, which is collected by a third sound collecting device. Audio signal,
From the second audio signal and the third audio signal, the first
Of the phase of the second audio signal and the phase of the second audio signal
And the phase of the first audio signal and the third
Calculating a second phase difference with the phase of the audio signal of
The first coefficient and the first coefficient for a detection device that detects the position of the sound source based on the relationship between the phase difference and the first coefficient, and the relationship between the second phase difference and the second coefficient. An information generation device that generates a second coefficient, provided in a predetermined range collected by each of the first sound collection device, the second sound collection device, and the third sound collection device. From the fourth sound signal, the fifth sound signal, and the sixth sound signal corresponding to sounds emitted by the sound source located at the plurality of test positions, the phase of the fourth sound signal and Calculating a third phase difference between the phase of the fifth audio signal and a fourth phase difference between the phase of the fourth audio signal and the sixth audio signal for each experimental position; Calculating means, based on a relationship between the third phase difference and the experimental position,
An information generating apparatus comprising: a second calculating unit configured to calculate the first coefficient and calculate the second coefficient based on a relationship between the fourth phase difference and the experimental position. .

19. The detection device according to claim 1, wherein said detection device is configured to generate a sound corresponding to a sound emitted by said sound source located at a plurality of reference positions and collected by each of said second sound collection device and said third sound collection device. Calculating a sound pressure ratio between the sound pressure of the seventh sound signal and the sound pressure of the eighth sound signal from the sound signal of No. 7 and the eighth sound signal; Based on the relationship, the direction of the sound source is further detected, and the sound emitted by the sound source located at a plurality of reference positions is collected by each of the second sound collecting device and the third sound collecting device. Third calculating means for calculating a sound pressure ratio between the sound pressure of the ninth sound signal and the sound pressure of the tenth sound signal from the ninth sound signal and the tenth sound signal corresponding to The sound pressure ratio between the sound pressure of the ninth sound signal and the sound pressure of the tenth sound signal The experiments on the basis of the relationship between the position information generating apparatus according to claim 18, further comprising a fourth calculating means for calculating the third coefficient.

20. A first sound collecting device, a second sound collecting device, and a first sound collecting device, the first sound collecting device corresponding to a sound emitted from a sound source located within a predetermined range, which is collected by a third sound collecting device. Audio signal,
From the second audio signal and the third audio signal, the first
Of the phase of the second audio signal and the phase of the second audio signal
And the phase of the first audio signal and the third
Calculating a second phase difference with the phase of the audio signal of
The first coefficient and the first coefficient for a detection device that detects the position of the sound source based on the relationship between the phase difference and the first coefficient, and the relationship between the second phase difference and the second coefficient. In the information generation method of the information generation device that generates a second coefficient, a predetermined sound collected by each of the first sound collection device, the second sound collection device, and the third sound collection device The fourth sound signal, the fifth sound signal, and the sixth sound signal corresponding to sounds emitted by the sound sources located at a plurality of experimental positions provided in a range, from the fourth sound signal, A third phase difference between the signal phase and the fifth audio signal, and a fourth phase difference between the fourth audio signal and the sixth audio signal are determined for each of the experimental positions. A first calculating step of calculating; and a relation between the third phase difference and the experimental position. On the basis of,
A second calculating step of calculating the first coefficient and calculating the second coefficient based on a relationship between the fourth phase difference and the experimental position. .

21. A first sound collecting device, a second sound collecting device, and a first sound collecting device, the first sound collecting device corresponding to a sound emitted from a sound source located within a predetermined range, which is collected by a third sound collecting device. Audio signal,
From the second audio signal and the third audio signal, the first
Of the phase of the second audio signal and the phase of the second audio signal
And the phase of the first audio signal and the third
Calculating a second phase difference with the phase of the audio signal of
The first coefficient and the first coefficient for a detection device that detects the position of the sound source based on the relationship between the phase difference and the first coefficient, and the relationship between the second phase difference and the second coefficient. A program of an information generating device for generating a second coefficient, wherein a predetermined sound collected by each of the first sound collecting device, the second sound collecting device, and the third sound collecting device. The fourth sound signal, the fifth sound signal, and the sixth sound signal corresponding to sounds emitted by the sound sources located at a plurality of experimental positions provided in a range, from the fourth sound signal, A third phase difference between the signal phase and the fifth audio signal, and a fourth phase difference between the fourth audio signal and the sixth audio signal are determined for each of the experimental positions. A first calculating step of calculating; and a relationship between the third phase difference and the experimental position. Based on,
A second calculating step of calculating the first coefficient and calculating the second coefficient based on a relationship between the fourth phase difference and the experimental position. A recording medium on which a possible program is recorded.

22. A first sound collecting device, a second sound collecting device, and a first sound collecting device, the first sound collecting device corresponding to the sound emitted from a sound source located within a predetermined range, collected by the third sound collecting device. Audio signal,
From the second audio signal and the third audio signal, the first
Of the phase of the second audio signal and the phase of the second audio signal
And the phase of the first audio signal and the third
Calculating a second phase difference with the phase of the audio signal of
The first coefficient and the first coefficient for a detection device that detects the position of the sound source based on the relationship between the phase difference and the first coefficient, and the relationship between the second phase difference and the second coefficient. A program of an information generating device for generating a second coefficient, wherein a predetermined sound collected by each of the first sound collecting device, the second sound collecting device, and the third sound collecting device. The fourth sound signal, the fifth sound signal, and the sixth sound signal corresponding to sounds emitted by the sound sources located at a plurality of experimental positions provided in a range, from the fourth sound signal, A third phase difference between the signal phase and the fifth audio signal, and a fourth phase difference between the fourth audio signal and the sixth audio signal are determined for each of the experimental positions. A first calculating step of calculating; and a relationship between the third phase difference and the experimental position. Based on,
Calculating a first coefficient, and causing a computer to execute processing including a second calculation step of calculating the second coefficient based on a relationship between the fourth phase difference and the experimental position. Features program.