CN108680900A - A kind of ESPRIT sound localization methods based on RANSAC - Google Patents

A kind of ESPRIT sound localization methods based on RANSAC Download PDF

Info

Publication number
CN108680900A
CN108680900A CN201810486052.2A CN201810486052A CN108680900A CN 108680900 A CN108680900 A CN 108680900A CN 201810486052 A CN201810486052 A CN 201810486052A CN 108680900 A CN108680900 A CN 108680900A
Authority
CN
China
Prior art keywords
signal
submatrix
ransac
matrix
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810486052.2A
Other languages
Chinese (zh)
Inventor
徐正勤
伍世虔
张俊勇
邹谜
张琴
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan University of Science and Engineering WUSE
Wuhan University of Science and Technology WHUST
Original Assignee
Wuhan University of Science and Engineering WUSE
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan University of Science and Engineering WUSE filed Critical Wuhan University of Science and Engineering WUSE
Priority to CN201810486052.2A priority Critical patent/CN108680900A/en
Publication of CN108680900A publication Critical patent/CN108680900A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01SRADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S5/00Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations
    • G01S5/18Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations using ultrasonic, sonic, or infrasonic waves
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Computational Linguistics (AREA)
  • General Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)

Abstract

The invention discloses a kind of, and the ESPRIT sound localization methods based on RANSAC obtain a frame voice signal X first to one section of voice signal X (t) progress sub-frame processingw(t);To Xw(t) being decomposed into two has invariable rotary characteristic submatrix, the first submatrix signal Xw1(t) and the second submatrix signal Xw2(t), and data covariance matrix R is calculated;Then Eigenvalues Decomposition, the signal subspace U that the big characteristic vector to obtain the first submatrix is turned into are carried out to data covariance matrixs1The signal subspace U being turned into the big characteristic vector of the second submatrixs2;The constant matrix Ψ of the rotation relationship containing target signal direction position is obtained followed by signal subspace;Then matrix Ψ constant to rotation relationship carries out Eigenvalues Decomposition and obtains K characteristic value, the location parameter of corresponding K signal;Finally after obtaining the sound source position parameter of each frame, the corrupt data in these parameters is eliminated, an optimal sample set is obtained and then fits accurate location parameter.The present invention can obtain more accurate positional parameter.

Description

A kind of ESPRIT sound localization methods based on RANSAC
Technical field
The invention belongs to auditory localization technical fields, are related to a kind of sound localization method, and in particular to one kind is based on The ESPRIT sound localization methods of RANSAC.
Background technology
Based on microphone array acoustic model, the direction of arrival (DOA) of voice signal can be by formula X (t)=AS (t)+N (t) it obtains.Wherein A is the array manifold of signal, and S (t) is echo signal, and N (t) is noise signal.ESPRIT algorithms hypothesis is made an uproar Acoustical signal N (t) is to be received and due to the complexity of communication environments in actual conditions with the incoherent Gaussian noise of echo signal Be in signal have existing for coherent signal source, including co-channel interference and due to background object reflect caused by multipath-propagated signal, The accuracy of ESPRIT algorithms can be caused to decline at this time.
Invention content
In order to solve the above technical problem, the present invention provides a kind of ESPRIT sound localization methods based on RANSAC, The fitting effect for capableing of positional parameter by this method is more preferable, improves accuracy and the robustness of ESPRIT location algorithms.
The technical solution adopted in the present invention is:A kind of ESPRIT sound localization methods based on RANSAC, feature exist In including the following steps:
Step 1:Sub-frame processing X is carried out by window function w (t) to one section of voice signal X (t)w(t)=X (t) * w (t), obtain To a frame voice signal Xw(t);
Step 2:To Xw(t) two submatrix signals with invariable rotary characteristic are decomposed into, wherein the first submatrix signal is remembered For Xw1(t), the second submatrix signal is denoted as Xw2(t), and data covariance matrix R is calculated;
Step 3:Eigenvalues Decomposition is carried out to data covariance matrix, the big characteristic vector to obtain the first submatrix is turned into Signal subspace Us1The signal subspace U being turned into the big characteristic vector of the second submatrixs2
Step 4:The constant matrix of the rotation relationship containing target signal direction position is obtained using signal subspace
Step 5:Matrix Ψ constant to rotation relationship carries out Eigenvalues Decomposition and obtains K characteristic value, the position of corresponding K signal Set parameter;
Step 6:After obtaining the sound source position parameter of each frame, eliminated in these parameters not using RANSAC algorithms Authentic data obtains an optimal sample set and then fits accurate location parameter.
Influence present invention reduces coherent to positioning.Removed in position fixing process due to noise by RANSAC algorithms The unreliable point generated so that the fitting effect of positional parameter is more preferable, improves accuracy and the robustness of ESPRIT location algorithms.
Description of the drawings
Fig. 1 is the flow chart of the embodiment of the present invention;
Fig. 2 is the experimental result comparison diagram of the embodiment of the present invention.
Specific implementation mode
Understand for the ease of those of ordinary skill in the art and implement the present invention, with reference to the accompanying drawings and embodiments to this hair It is bright to be described in further detail, it should be understood that implementation example described herein is merely to illustrate and explain the present invention, not For limiting the present invention.
Referring to Fig.1, a kind of ESPRIT sound localization methods based on RANSAC provided by the invention, include the following steps:
Step 1:Sub-frame processing X is carried out by window function w (t) to one section of voice signal X (t)w(t)=X (t) * w (t), obtain To a frame voice signal Xw(t);
Step 2:To Xw(t) two submatrix signals with invariable rotary characteristic are decomposed into, wherein the first submatrix signal is remembered For Xw1(t), the second submatrix signal is denoted as Xw2(t), and data covariance matrix R is calculated;
Step 3:Eigenvalues Decomposition is carried out to data covariance matrix, the big characteristic vector to obtain the first submatrix is turned into Signal subspace Us1The signal subspace U being turned into the big characteristic vector of the second submatrixs2
Step 4:The constant matrix of the rotation relationship containing target signal direction position is obtained using signal subspace
In the present embodiment, Xw(t) the submatrix signal X that there is invariable rotary characteristic with twow1(t) and Xw2(t) pass between System is:
Data covariance matrix R is:
R=E [Xw(t)Xw(t)H] (2)
Eigenvalues Decomposition is carried out to data covariance matrix and obtains the signal subspace U of two data matrixess1And Us2
In formula, UsFor the signal subspace that the corresponding characteristic vector of big characteristic value is turned into, ∑sTo include the spy of big characteristic value Value indicative diagonal matrix, UNFor the noise subspace that the corresponding characteristic vector of small characteristic value is turned into, ∑NTo be corresponded to comprising small characteristic value Characteristic value diagonal matrix;
Calculate the constant matrix Ψ of rotation relationship:
Step 5:Matrix Ψ constant to rotation relationship carries out Eigenvalues Decomposition and obtains K characteristic value, the position of corresponding K signal Set parameter;
Step 6:After obtaining the sound source position parameter of each frame, eliminated in these parameters not using RANSAC algorithms Authentic data obtains an optimal sample set and then fits accurate location parameter.
The wherein described RANSAC algorithms, including following sub-step:
Step 6.1:Select point set V of the N frame alignment parameter as RANSAC iteration;
Step 6.2:N value is randomly selected in V fits a preliminary sound source incident angleAnd remaining N-n Incident angle is θi
Step 6.3:Assuming that error threshold is ε, a consistency set V is definedk, willθiIt is put into consistent Property set, and record VkIn data amount check;
Step 6.4:Assuming that VkIn data amount check and V in data amount check ratio be this calculating accuracy δk, when δkWhen less than preset model accuracy δ, step 6.2 and step 6.3 are repeated;If working as δkIt is correct more than preset model When rate δ, by consistency set Vk'sAs optimal solution;
Step 6.5:When obtaining most homogeneous set V in step 6.4kAfterwards, optimal solution is found out using least square method.
It is the experimental result comparison diagram of the present embodiment see Fig. 2, comparison algorithm improves front and back as signal-to-noise ratio (SNR) becomes The root-mean-square error (RMSE) of change, as shown in Fig. 2, when signal-to-noise ratio is reduced to -20dB, algorithm is still kept preferable after improvement Positioning accuracy, and algorithm produces larger error before improving.Illustrate that algorithm has higher precision and stronger after improving Noise immunity.
The auditory localization of ESPRIT algorithms can be removed in the process due to the corrupt data that noise generates by this method, obtained To more accurate positional parameter.
It should be understood that the part that this specification does not elaborate belongs to the prior art.
It should be understood that the above-mentioned description for preferred embodiment is more detailed, can not therefore be considered to this The limitation of invention patent protection range, those skilled in the art under the inspiration of the present invention, are not departing from power of the present invention Profit requires under protected ambit, can also make replacement or deformation, each fall within protection scope of the present invention, this hair It is bright range is claimed to be determined by the appended claims.

Claims (3)

1. a kind of ESPRIT sound localization methods based on RANSAC, which is characterized in that include the following steps:
Step 1:Sub-frame processing X is carried out by window function w (t) to one section of voice signal X (t)w(t)=X (t) * w (t), obtain one Frame voice signal Xw(t);
Step 2:To Xw(t) two submatrix signals with invariable rotary characteristic are decomposed into, wherein the first submatrix signal is denoted as Xw1 (t), the second submatrix signal is denoted as Xw2(t), and data covariance matrix R is calculated;
Step 3:Eigenvalues Decomposition, the letter that the big characteristic vector to obtain the first submatrix is turned into are carried out to data covariance matrix Work song space Us1The signal subspace U being turned into the big characteristic vector of the second submatrixs2
Step 4:The constant matrix of the rotation relationship containing target signal direction position is obtained using signal subspace
Step 5:Matrix Ψ constant to rotation relationship carries out Eigenvalues Decomposition and obtains K characteristic value, the location parameter of corresponding K signal;
Step 6:After obtaining the sound source position parameter of each frame, it is unreliable in these parameters to be eliminated using RANSAC algorithms Data obtain an optimal sample set and then fit accurate location parameter.
2. the ESPRIT sound localization methods according to claim 1 based on RANSAC, which is characterized in that Xw1(t) and Xw2 (t) relationship between is:
Data covariance matrix R is:
R=E [Xw(t)Xw(t)H] (2)
Eigenvalues Decomposition is carried out to data covariance matrix and obtains the signal subspace U of two data matrixess1And Us2
In formula, UsFor the signal subspace that the corresponding characteristic vector of big characteristic value is turned into, ∑sTo include the characteristic value of big characteristic value Diagonal matrix, UNFor the noise subspace that the corresponding characteristic vector of small characteristic value is turned into, ∑NTo include the corresponding spy of small characteristic value Value indicative diagonal matrix;
Calculate the constant matrix Ψ of rotation relationship:
3. the ESPRIT sound localization methods according to claim 1 based on RANSAC, which is characterized in that institute in step 6 State RANSAC algorithms, including following sub-step:
Step 6.1:Select point set V of the N frame alignment parameter as RANSAC iteration;
Step 6.2:N value is randomly selected in V fits a preliminary sound source incident angleAnd remaining N-n incident Angle is θi
Step 6.3:Assuming that error threshold is ε, a consistency set V is definedk, willθiIt is put into consistency collection It closes, and records VkIn data amount check;
Step 6.4:Assuming that VkIn data amount check and V in data amount check ratio be this calculating accuracy δk, work as δkIt is small When preset model accuracy δ, step 6.2 and step 6.3 are repeated;If working as δkMore than preset model accuracy δ When, by consistency set Vk'sAs optimal solution;
Step 6.5:When obtaining most homogeneous set V in step 6.4kAfterwards, the set is fitted using least square method Find out optimal solution.
CN201810486052.2A 2018-05-21 2018-05-21 A kind of ESPRIT sound localization methods based on RANSAC Pending CN108680900A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810486052.2A CN108680900A (en) 2018-05-21 2018-05-21 A kind of ESPRIT sound localization methods based on RANSAC

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810486052.2A CN108680900A (en) 2018-05-21 2018-05-21 A kind of ESPRIT sound localization methods based on RANSAC

Publications (1)

Publication Number Publication Date
CN108680900A true CN108680900A (en) 2018-10-19

Family

ID=63805284

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810486052.2A Pending CN108680900A (en) 2018-05-21 2018-05-21 A kind of ESPRIT sound localization methods based on RANSAC

Country Status (1)

Country Link
CN (1) CN108680900A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102879764A (en) * 2012-10-16 2013-01-16 浙江大学 Underwater sound source direction estimating method
CN104766319A (en) * 2015-04-02 2015-07-08 西安电子科技大学 Method for improving registration precision of images photographed at night
CN105894000A (en) * 2016-03-28 2016-08-24 江南大学 RANSAC-based laser network mark image feature extraction
CN108401565B (en) * 2015-05-28 2017-12-15 西北工业大学 Remote sensing image registration method based on improved KAZE features and Pseudo-RANSAC algorithms

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102879764A (en) * 2012-10-16 2013-01-16 浙江大学 Underwater sound source direction estimating method
CN104766319A (en) * 2015-04-02 2015-07-08 西安电子科技大学 Method for improving registration precision of images photographed at night
CN108401565B (en) * 2015-05-28 2017-12-15 西北工业大学 Remote sensing image registration method based on improved KAZE features and Pseudo-RANSAC algorithms
CN105894000A (en) * 2016-03-28 2016-08-24 江南大学 RANSAC-based laser network mark image feature extraction

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
SYLVESTER0510: "RANSAC介绍(Matlab版直线拟合+平面拟合)", 《HTTPS://BLOG.CSDN.NET/U010128736/ARTICLE/DETAILS/53422070?SPM=1001.2014.3001.5501》 *
王园: "基于阵列转换的二维MUSIC算法的二维DOA估计", 《中国优秀博硕士学位论文全文数据库(硕士)信息科技辑》 *
马先哲: "基于TDOA的声源定位及与视频信息融合的研究", 《中国优秀博硕士学位论文全文数据库(硕士)信息科技辑》 *

Similar Documents

Publication Publication Date Title
US10979805B2 (en) Microphone array auto-directive adaptive wideband beamforming using orientation information from MEMS sensors
US7626889B2 (en) Sensor array post-filter for tracking spatial distributions of signals and noise
US9100734B2 (en) Systems, methods, apparatus, and computer-readable media for far-field multi-source tracking and separation
US9633651B2 (en) Apparatus and method for providing an informed multichannel speech presence probability estimation
US7613310B2 (en) Audio input system
US20200342887A1 (en) Microphone array-based target voice acquisition method and device
US7813923B2 (en) Calibration based beamforming, non-linear adaptive filtering, and multi-sensor headset
US20160322055A1 (en) Processing multi-channel audio waveforms
EP2197219B1 (en) Method for determining a time delay for time delay compensation
US8947978B2 (en) System and method for estimating the direction of arrival of a sound
CN110085247B (en) Double-microphone noise reduction method for complex noise environment
CN106292732A (en) Intelligent robot rotating method based on sound localization and Face datection
Taseska et al. Informed spatial filtering for sound extraction using distributed microphone arrays
CN106340305B (en) Self-calibration far-field voice interaction device and far-field voice self-calibration method
CN104021798A (en) Method for soundproofing an audio signal by an algorithm with a variable spectral gain and a dynamically modulatable hardness
CN106872945A (en) Sound localization method, device and electronic equipment
CN106992010A (en) Without the microphone array speech enhancement device under the conditions of direct sound wave
CN108008394B (en) Biomimetic sonar two dimension undergoes drawing drawing method, storage medium and equipment
CN114167356A (en) Sound source positioning method and system based on polyhedral microphone array
CN116312602B (en) Voice signal beam forming method based on interference noise space spectrum matrix
CN108680900A (en) A kind of ESPRIT sound localization methods based on RANSAC
Levin et al. Near-field signal acquisition for smartglasses using two acoustic vector-sensors
CN108717183A (en) A kind of Toeplitz sound localization methods based on RANSAC
CN114563764A (en) Self-adaptive beam generation method and system
CN110858485B (en) Voice enhancement method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20181019