JP4841863B2 - バイノーラル信号に基づいた音源定位 - Google Patents
バイノーラル信号に基づいた音源定位 Download PDFInfo
- Publication number
- JP4841863B2 JP4841863B2 JP2005152195A JP2005152195A JP4841863B2 JP 4841863 B2 JP4841863 B2 JP 4841863B2 JP 2005152195 A JP2005152195 A JP 2005152195A JP 2005152195 A JP2005152195 A JP 2005152195A JP 4841863 B2 JP4841863 B2 JP 4841863B2
- Authority
- JP
- Japan
- Prior art keywords
- sound source
- itd
- ild
- probability distribution
- frequency
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S3/00—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received
- G01S3/80—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received using ultrasonic, sonic or infrasonic waves
- G01S3/802—Systems for determining direction or deviation from predetermined direction
- G01S3/803—Systems for determining direction or deviation from predetermined direction using amplitude comparison of signals derived from receiving transducers or transducer systems having differently-oriented directivity characteristics
- G01S3/8034—Systems for determining direction or deviation from predetermined direction using amplitude comparison of signals derived from receiving transducers or transducer systems having differently-oriented directivity characteristics wherein the signals are derived simultaneously
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S3/00—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received
- G01S3/80—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received using ultrasonic, sonic or infrasonic waves
- G01S3/802—Systems for determining direction or deviation from predetermined direction
- G01S3/808—Systems for determining direction or deviation from predetermined direction using transducers spaced apart and measuring phase or time difference between signals therefrom, i.e. path-difference systems
- G01S3/8083—Systems for determining direction or deviation from predetermined direction using transducers spaced apart and measuring phase or time difference between signals therefrom, i.e. path-difference systems determining direction of source
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
Description
第1に、2つの信号3、4は、2つの2D時間周波数スペクトラム6(各信号3、4につき1つ)を得るために事前処理される(ステップ5)。次いで、2D時間周波数スペクトラムの時間ステップごとのITDおよびILD測定値を相関アルゴリズムを使用して算出し(ステップ7)、方位位置に依存するITDおよびILDの2D周波数対変位マトリクスを別個に抽出する。相関は、信号3の時間周波数スペクトラムからの点別ウィンドウ領域と、信号4の時間周波数スペクトラムからの対応するウィンドウ領域とを比較することによって算出される。ITDについて、点別比較は、例えばSSD(差の2乗和)または標準相関係数によって算出可能である。ILDについては、スペクトラムの対数事前処理の後に絶対値のノルムの差を算出することによって実行される。相関を(2D時間周波数スペクトラムが相互にシフトされることを意味する)全必要変位について算出し、すべての可能な時間シフトを検出する。最大変位パラメータは経験的に判断され、周波数帯域、ヘッドの形状および2つの検出器間の距離に左右される。
次いで、未知の位置の音源の測定ITDおよびILD周波数対変位マトリクス10、11は、周波数チャネルごとに音源位置の確率分布18、19を得るために、学習周波数対変位マトリクス14、15と比較される(ステップ16、17)。比較はITDおよびILDについて別個に実行される。従って、ITD比較16は、ITDの測定周波数対変位マトリクス10と、学習周波数対変位マトリクス(2D基準パターン)14とを比較し、ITD確率分布マトリクス18を出力することである。
3、4 音信号成分
30 バイノーラル検出器
31、32 音響センサ
33 音源位置
a 方位位置
d 特定の距離
S 音源
Claims (7)
- a)バイノーラル信号を構成し、かつ既知の位置で既知の音源から発信される2つの信号成分を処理して、2つの信号の各々の2D時間周波数スペクトラムを得るステップと、
b)前記時間周波数スペクトラムのITDおよびILD測定値を合同算出して、一方が前記ITD測定値用の、他方が前記ILD測定値用の、前記音源を共に特徴付ける2つの周波数対変位マトリクスを生成するステップと、
c)前記音源の異なる位置のそれぞれにおいてステップa)およびb)を反復し、前記ITDおよび前記ILD測定値について別個に前記音源のそれぞれの位置における平均をとり周波数対変位マトリクスを学習するステップと、
d)前記ITDおよび前記ILD測定値について別個に、未知の音源の測定周波数対変位マトリクスとステップc)で学習されたマトリクスとを比較して、周波数チャネルごとに音源位置の確率分布を得るステップと、
e)前記ITDおよびILD確率分布マトリクスを結合して、音源定位について単一の結合確率分布を得るステップと、
f)前記結合ITDおよびILD確率分布マトリクスに基づいて前記音源位置を推定するステップと、
を有する音源定位のための方法。 - 前記ITDおよびILD測定結果の結合が、周波数依存的に実行されることを特徴とする、請求項1に記載の方法。
- 前記ITDおよびILD確率分布の結合が、音源位置パラメータに応じて実行されることを特徴とする、請求項1に記載の方法。
- 抽出された音源定位の確率分布が、複数の音源についての情報を得るために用いられることを特徴とする、請求項1から3のいずれかに記載の方法。
- 最終時間ステップの確率分布が適切に伝搬/推定されて、次の確率分布の予測を獲得し、該予測は次いで新たな測定確率分布と結合されて、それを経時的に改善し、移動音源を追跡することを特徴とする、請求項1乃至請求項4のいずれかに記載の方法。
- バイノーラル検出器と、請求項1乃至請求項5のいずれかに記載の方法によって前記バイノーラル検出器の出力を処理するように設計されている計算ユニットと、を有するシステム。
- 計算装置で稼動する場合に、請求項1乃至請求項5のいずれかに記載の方法を実現するためのコンピュータソフトウェアプログラム。
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP04012473.7 | 2004-05-26 | ||
EP04012473 | 2004-05-26 | ||
EP04030651A EP1600791B1 (en) | 2004-05-26 | 2004-12-23 | Sound source localization based on binaural signals |
EP04030651.6 | 2004-12-23 |
Publications (2)
Publication Number | Publication Date |
---|---|
JP2005338086A JP2005338086A (ja) | 2005-12-08 |
JP4841863B2 true JP4841863B2 (ja) | 2011-12-21 |
Family
ID=34927979
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2005152195A Expired - Fee Related JP4841863B2 (ja) | 2004-05-26 | 2005-05-25 | バイノーラル信号に基づいた音源定位 |
Country Status (4)
Country | Link |
---|---|
US (1) | US7693287B2 (ja) |
EP (1) | EP1600791B1 (ja) |
JP (1) | JP4841863B2 (ja) |
CN (1) | CN1703118B (ja) |
Families Citing this family (55)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1119218B1 (en) * | 2000-01-21 | 2018-06-20 | Oticon A/S | Electromagnetic feedback reduction in communication device |
EP1600791B1 (en) | 2004-05-26 | 2009-04-01 | Honda Research Institute Europe GmbH | Sound source localization based on binaural signals |
EP1806593B1 (en) | 2006-01-09 | 2008-04-30 | Honda Research Institute Europe GmbH | Determination of the adequate measurement window for sound source localization in echoic environments |
EP1862813A1 (en) | 2006-05-31 | 2007-12-05 | Honda Research Institute Europe GmbH | A method for estimating the position of a sound source for online calibration of auditory cue to location transformations |
EP1947471B1 (en) * | 2007-01-16 | 2010-10-13 | Harman Becker Automotive Systems GmbH | System and method for tracking surround headphones using audio signals below the masked threshold of hearing |
US8767975B2 (en) | 2007-06-21 | 2014-07-01 | Bose Corporation | Sound discrimination method and apparatus |
US8611554B2 (en) | 2008-04-22 | 2013-12-17 | Bose Corporation | Hearing assistance apparatus |
KR101038574B1 (ko) * | 2009-01-16 | 2011-06-02 | 전자부품연구원 | 3차원 오디오 음상 정위 방법과 장치 및 이와 같은 방법을 구현하는 프로그램이 기록되는 기록매체 |
US8174932B2 (en) * | 2009-06-11 | 2012-05-08 | Hewlett-Packard Development Company, L.P. | Multimodal object localization |
DK2449798T4 (da) * | 2009-08-11 | 2020-12-21 | Sivantos Pte Ltd | System og fremgangsmåde til estimering af en lyds ankomstretning |
CN101673549B (zh) * | 2009-09-28 | 2011-12-14 | 武汉大学 | 一种移动音源空间音频参数预测编解码方法及系统 |
JP5397131B2 (ja) * | 2009-09-29 | 2014-01-22 | 沖電気工業株式会社 | 音源方向推定装置及びプログラム |
CN101762806B (zh) * | 2010-01-27 | 2013-03-13 | 华为终端有限公司 | 声源定位方法和装置 |
CN101957443B (zh) * | 2010-06-22 | 2012-07-11 | 嘉兴学院 | 声源定位方法 |
EP2423702A1 (en) * | 2010-08-27 | 2012-02-29 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for resolving ambiguity from a direction of arrival estimate |
KR101694822B1 (ko) * | 2010-09-20 | 2017-01-10 | 삼성전자주식회사 | 음원출력장치 및 이를 제어하는 방법 |
CN101982793B (zh) * | 2010-10-20 | 2012-07-04 | 武汉大学 | 一种基于立体声信号的移动音源定位方法 |
US9078077B2 (en) | 2010-10-21 | 2015-07-07 | Bose Corporation | Estimation of synthetic audio prototypes with frequency-based input signal decomposition |
CN102809742B (zh) * | 2011-06-01 | 2015-03-18 | 杜比实验室特许公司 | 声源定位设备和方法 |
US9435873B2 (en) | 2011-07-14 | 2016-09-06 | Microsoft Technology Licensing, Llc | Sound source localization using phase spectrum |
CN103403801B (zh) * | 2011-08-29 | 2015-11-25 | 华为技术有限公司 | 参数多通道编码器和解码器 |
CN102438189B (zh) * | 2011-08-30 | 2014-07-09 | 东南大学 | 基于双通路声信号的声源定位方法 |
CN102565759B (zh) * | 2011-12-29 | 2013-10-30 | 东南大学 | 一种基于子带信噪比估计的双耳声源定位方法 |
KR102192361B1 (ko) | 2013-07-01 | 2020-12-17 | 삼성전자주식회사 | 머리 움직임을 이용한 사용자 인터페이스 방법 및 장치 |
BR122022016682B1 (pt) * | 2014-03-28 | 2023-03-07 | Samsung Electronics Co., Ltd | Método de renderização de um sinal acústico, e aparelho para renderização de um sinal acústico |
CN103901401B (zh) * | 2014-04-10 | 2016-08-17 | 北京大学深圳研究生院 | 一种基于双耳匹配滤波器的双耳声音源定位方法 |
WO2016025812A1 (en) * | 2014-08-14 | 2016-02-18 | Rensselaer Polytechnic Institute | Binaurally integrated cross-correlation auto-correlation mechanism |
CN104837106B (zh) * | 2015-05-25 | 2018-01-26 | 上海音乐学院 | 一种用于空间化声音的音频信号处理方法及装置 |
CN105159066B (zh) * | 2015-06-18 | 2017-11-07 | 同济大学 | 一种智能音乐厅调控方法及调控装置 |
WO2017063688A1 (en) * | 2015-10-14 | 2017-04-20 | Huawei Technologies Co., Ltd. | Method and device for generating an elevated sound impression |
JP6538624B2 (ja) * | 2016-08-26 | 2019-07-03 | 日本電信電話株式会社 | 信号処理装置、信号処理方法および信号処理プログラム |
CN106501772B (zh) * | 2016-10-18 | 2018-12-14 | 武汉轻工大学 | 一种基于双耳线索的空间音源定位方法及系统 |
US10375498B2 (en) * | 2016-11-16 | 2019-08-06 | Dts, Inc. | Graphical user interface for calibrating a surround sound system |
US11322169B2 (en) * | 2016-12-16 | 2022-05-03 | Nippon Telegraph And Telephone Corporation | Target sound enhancement device, noise estimation parameter learning device, target sound enhancement method, noise estimation parameter learning method, and program |
WO2019002831A1 (en) | 2017-06-27 | 2019-01-03 | Cirrus Logic International Semiconductor Limited | REPRODUCTIVE ATTACK DETECTION |
GB2563953A (en) | 2017-06-28 | 2019-01-02 | Cirrus Logic Int Semiconductor Ltd | Detection of replay attack |
GB201713697D0 (en) | 2017-06-28 | 2017-10-11 | Cirrus Logic Int Semiconductor Ltd | Magnetic detection of replay attack |
GB201801527D0 (en) | 2017-07-07 | 2018-03-14 | Cirrus Logic Int Semiconductor Ltd | Method, apparatus and systems for biometric processes |
GB201801532D0 (en) | 2017-07-07 | 2018-03-14 | Cirrus Logic Int Semiconductor Ltd | Methods, apparatus and systems for audio playback |
GB201801526D0 (en) | 2017-07-07 | 2018-03-14 | Cirrus Logic Int Semiconductor Ltd | Methods, apparatus and systems for authentication |
GB201803570D0 (en) | 2017-10-13 | 2018-04-18 | Cirrus Logic Int Semiconductor Ltd | Detection of replay attack |
GB201801874D0 (en) | 2017-10-13 | 2018-03-21 | Cirrus Logic Int Semiconductor Ltd | Improving robustness of speech processing system against ultrasound and dolphin attacks |
GB201801664D0 (en) | 2017-10-13 | 2018-03-21 | Cirrus Logic Int Semiconductor Ltd | Detection of liveness |
GB201801659D0 (en) | 2017-11-14 | 2018-03-21 | Cirrus Logic Int Semiconductor Ltd | Detection of loudspeaker playback |
EP3717531A4 (en) | 2017-12-01 | 2021-08-25 | The Regents of the University of California | BIOLOGICAL SOIL RESISTANT COATINGS AND THEIR MANUFACTURING AND USE PROCESSES |
US11264037B2 (en) | 2018-01-23 | 2022-03-01 | Cirrus Logic, Inc. | Speaker identification |
US11929091B2 (en) | 2018-04-27 | 2024-03-12 | Dolby Laboratories Licensing Corporation | Blind detection of binauralized stereo content |
EP3785453B1 (en) * | 2018-04-27 | 2022-11-16 | Dolby Laboratories Licensing Corporation | Blind detection of binauralized stereo content |
US10692490B2 (en) | 2018-07-31 | 2020-06-23 | Cirrus Logic, Inc. | Detection of replay attack |
US10915614B2 (en) | 2018-08-31 | 2021-02-09 | Cirrus Logic, Inc. | Biometric authentication |
US11037574B2 (en) | 2018-09-05 | 2021-06-15 | Cirrus Logic, Inc. | Speaker recognition and speaker change detection |
US10880669B2 (en) * | 2018-09-28 | 2020-12-29 | EmbodyVR, Inc. | Binaural sound source localization |
JP7252785B2 (ja) * | 2019-02-28 | 2023-04-05 | 株式会社デンソーテン | 音像予測装置および音像予測方法 |
JP2022535416A (ja) | 2019-06-05 | 2022-08-08 | ザ リージェンツ オブ ザ ユニバーシティ オブ カリフォルニア | 耐生物付着コーティングならびにその作製および使用方法 |
CN112946578B (zh) * | 2021-02-02 | 2023-04-21 | 上海头趣科技有限公司 | 双耳定位方法 |
Family Cites Families (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3838593A (en) * | 1972-11-06 | 1974-10-01 | Exxon Research Engineering Co | Acoustic leak location and detection system |
SE456279B (sv) * | 1986-09-16 | 1988-09-19 | Bost & Co Ab | Sett och anordning for att tidsbestemma en akustisk puls |
US5724313A (en) * | 1996-04-25 | 1998-03-03 | Interval Research Corp. | Personal object detector |
US6307941B1 (en) * | 1997-07-15 | 2001-10-23 | Desper Products, Inc. | System and method for localization of virtual sound |
US6990205B1 (en) * | 1998-05-20 | 2006-01-24 | Agere Systems, Inc. | Apparatus and method for producing virtual acoustic sound |
US7215786B2 (en) * | 2000-06-09 | 2007-05-08 | Japan Science And Technology Agency | Robot acoustic device and robot acoustic system |
US20020140804A1 (en) * | 2001-03-30 | 2002-10-03 | Koninklijke Philips Electronics N.V. | Method and apparatus for audio/image speaker detection and locator |
US20030035553A1 (en) * | 2001-08-10 | 2003-02-20 | Frank Baumgarte | Backwards-compatible perceptual coding of spatial cues |
ES2323294T3 (es) * | 2002-04-22 | 2009-07-10 | Koninklijke Philips Electronics N.V. | Dispositivo de decodificacion con una unidad de decorrelacion. |
US7333622B2 (en) * | 2002-10-18 | 2008-02-19 | The Regents Of The University Of California | Dynamic binaural sound capture and reproduction |
JP4521549B2 (ja) * | 2003-04-25 | 2010-08-11 | 財団法人くまもとテクノ産業財団 | 上下、左右方向の複数の音源の分離方法、そのためのシステム |
EP1586421B1 (en) | 2004-04-16 | 2008-03-05 | Honda Research Institute Europe GmbH | Self-calibrating orienting system for a manipulating device |
EP1600791B1 (en) | 2004-05-26 | 2009-04-01 | Honda Research Institute Europe GmbH | Sound source localization based on binaural signals |
-
2004
- 2004-12-23 EP EP04030651A patent/EP1600791B1/en not_active Expired - Fee Related
-
2005
- 2005-05-25 JP JP2005152195A patent/JP4841863B2/ja not_active Expired - Fee Related
- 2005-05-25 US US11/137,981 patent/US7693287B2/en not_active Expired - Fee Related
- 2005-05-26 CN CN2005100746215A patent/CN1703118B/zh not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
US7693287B2 (en) | 2010-04-06 |
EP1600791B1 (en) | 2009-04-01 |
CN1703118B (zh) | 2013-05-08 |
US20050276419A1 (en) | 2005-12-15 |
CN1703118A (zh) | 2005-11-30 |
JP2005338086A (ja) | 2005-12-08 |
EP1600791A1 (en) | 2005-11-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP4841863B2 (ja) | バイノーラル信号に基づいた音源定位 | |
US10531198B2 (en) | Apparatus and method for decomposing an input signal using a downmixer | |
EP3369260B1 (en) | Apparatus and method for generating a filtered audio signal realizing elevation rendering | |
EP1522868B1 (en) | System for determining the position of a sound source and method therefor | |
Raspaud et al. | Binaural source localization by joint estimation of ILD and ITD | |
US10068586B2 (en) | Binaurally integrated cross-correlation auto-correlation mechanism | |
US11032663B2 (en) | System and method for virtual navigation of sound fields through interpolation of signals from an array of microphone assemblies | |
Tylka et al. | Soundfield navigation using an array of higher-order ambisonics microphones | |
Woodruff et al. | Sequential organization of speech in reverberant environments by integrating monaural grouping and binaural localization | |
Parisi et al. | Cepstrum prefiltering for binaural source localization in reverberant environments | |
US8923536B2 (en) | Method and apparatus for localizing sound image of input signal in spatial position | |
Hammond et al. | Robust full-sphere binaural sound source localization | |
Park et al. | A model of sound localisation applied to the evaluation of systems for stereophony | |
Braasch | Sound localization in the presence of multiple reflections using a binaurally integrated cross-correlation/auto-correlation mechanism | |
Wang et al. | Method of estimating three-dimensional direction-of-arrival based on monaural modulation spectrum | |
Pörschmann et al. | Spatial upsampling of individual sparse head-related transfer function sets by directional equalization | |
Ji et al. | Robust noise PSD estimation for binaural hearing aids in time-varying diffuse noise field | |
Hammond et al. | Robust median-plane binaural sound source localization | |
EP4346235A1 (en) | Apparatus and method employing a perception-based distance metric for spatial audio | |
Lopez et al. | Compensating first reflections in non-anechoic head-related transfer function measurements | |
Kobayashi et al. | Temporal convolutional neural networks to generate a head-related impulse response from one direction to another | |
Pörschmann et al. | Full Reviewed Paper at ICSA 2019 | |
Li et al. | Cross-validation of acquisition methods for near-field Head-Related Transfer Functions with a high distance resolution | |
Ravi | Design of equalization filter for non-linear distortion of the loudspeaker array with listener's movement | |
Liu et al. | Probabilistic binaural multiple sources localization based on time-delay compensation estimator and clustering analysis |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20071019 |
|
A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20100917 |
|
A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20100929 |
|
A601 | Written request for extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A601 Effective date: 20101216 |
|
A602 | Written permission of extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A602 Effective date: 20101221 |
|
TRDD | Decision of grant or rejection written | ||
A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20110920 |
|
A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 |
|
A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20111005 |
|
R150 | Certificate of patent or registration of utility model |
Free format text: JAPANESE INTERMEDIATE CODE: R150 |
|
FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20141014 Year of fee payment: 3 |
|
LAPS | Cancellation because of no payment of annual fees |