CN104715758A - Branched processing array type speech positioning and enhancement method - Google Patents
Branched processing array type speech positioning and enhancement method Download PDFInfo
- Publication number
- CN104715758A CN104715758A CN201510066532.XA CN201510066532A CN104715758A CN 104715758 A CN104715758 A CN 104715758A CN 201510066532 A CN201510066532 A CN 201510066532A CN 104715758 A CN104715758 A CN 104715758A
- Authority
- CN
- China
- Prior art keywords
- signal
- filter
- noise
- component
- self
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Landscapes
- Circuit For Audible Band Transducer (AREA)
Abstract
The invention provides a branched processing array type speech positioning and enhancement method. The method relates to the basic structure of a generalized sidelobe canceller, the design of a block matrix, the design of a component filter and an external Wiener filtering part. According to the method, a component structure is used for reference, a rear Wiener filter is additionally arranged, a part of adaptive technology is used, the denoising performance of an algorithm is guaranteed, noncoherent noise and coherent noise are effectively restrained, the rate of convergence of the algorithm is increased, and the operational complexity is lowered. Compared with a microphone array speech enhancement system of a traditional generalized sidelobe canceller, the improved speed enhancement system has a higher output signal-to-noise ratio. A simulation experiment test structure shows that compared with the microphone array speech enhancement system based on the full-band generalized sidelobe canceller, the method can achieve the higher output signal-to-noise ratio.
Description
Technical field
The present invention relates to voice process technology field, particularly relate to a kind of voice location and Enhancement Method.
Background technology
Adaptive beamformer technology is widely used in Aeronautics and Astronautics, radar and communication system.Adaptive beamformer, by regulating the weighted amplitude of each array element and weighted phases, changes the directional diagram of array, makes the main lobe of array antenna aim at desired user, and zero point of array antenna and secondary lobe aim at other users simultaneously, thus improve received signal to noise ratio.
Linear constraint minimal variance (LCMV) criterion is the most frequently used Adaptive beamformer method.Generalized Sidelobe Canceller (GSC) be a kind of equivalence of LCMV realize structure, the constrained optimization question variation of Adaptive beamformer is unconfined optimization problem by GSC structure, be divided into self-adaptation and non-self-adapting two branch roads, be called main branch road and auxiliary branch, require that wanted signal can only pass through from the main branch road of non-self-adapting, and only containing interference and noise component in adaptive auxiliary branch, when high s/n ratio, some wanted signal is leaked in auxiliary branch, has occurred signal cancellation phenomenon.
Summary of the invention
In order to solve the problems of the prior art, the present invention proposes a kind of branch process formula array voice location and Enhancement Method, adopt transfer function generalized sidelobe canceller structure and rearmounted S filter structure, the input signal of its adaptation module is decomposed in component filter structure and processes, and hyperchannel Wiener filtering module is introduced the non-self-adapting branch road of transport function, can more effectively suppress coherent noise and noncoherent noise, and the effective speed of convergence accelerating speech-enhancement system.
The present invention is achieved through the following technical solutions:
A kind of branch's process formula array voice location and Enhancement Method, comprise the following steps:
Step 1: acoustic pickup is arranged with linear array, acoustic pickup number more than 3 or 3, accepts external sound signal;
Step 2: utilize the cross correlation function of every road signal to obtain the time delay of Received signal strength, carries out delay compensation to every road signal and to align every road signal, utilizes the time delay obtained accurately can estimate Sounnd source direction, the location of realize target source of students;
Step 3: the digital signal after compensation comprises noise, sends into the self-adaptation blocking matrix module after optimizing, separates the noise signal on a road fewer than total way;
Step 4: this noise signal is sent into adaptive component filters, obtains directivity noisiness after filtering;
Step 5: this step and step 4 are carried out simultaneously, sends the signal after compensating into the rearmounted S filter module of hyperchannel, obtains elementary denoising result, now still comprise directivity noise;
Step 6: the result of step 5 and step 4 result are done difference and drawn final speech de-noising signal.
The invention has the beneficial effects as follows: the method mirror component structure that the present invention proposes, additional rearmounted S filter, utilize partial adaptivity technology, ensure that the denoising performance of algorithm, effectively suppress noncoherent noise and coherent noise, accelerate convergence of algorithm speed, reduce computational complexity, relative to the microphone array speech enhancement system of traditional Generalized Sidelobe Canceller, the speech-enhancement system improved is adopted to have higher output signal-to-noise ratio.
Accompanying drawing explanation
Fig. 1 is branch of the present invention process formula array voice location and Enhancement Method process flow diagram;
Fig. 2 is self-adaptation blocking matrix design drawing of the present invention;
Fig. 3 is the sample modulation design of filter figure of M passage.
Embodiment
In order to make object of the present invention, technical scheme and advantage clearly understand, below in conjunction with drawings and Examples, the present invention is further elaborated.Should be appreciated that specific embodiment described herein only in order to explain the present invention, be not intended to limit the present invention.
Accompanying drawing 1 is branch of the present invention process formula array voice location and Enhancement Method process flow diagram, and method of the present invention comprises the following steps:
Step 1: acoustic pickup is arranged with linear array, acoustic pickup number more than 3 or 3, accepts external sound signal; For acoustic pickup number 4, interval 2.25cm, accept extraneous voice analog signal then AD conversion be digital signal.
Step 2: because sound source may from different directions, each road acknowledge(ment) signal sometimes.Utilize the cross correlation function of every road signal to obtain time delay, delay compensation is carried out to every road signal and to align every road signal.Utilize the time delay obtained accurately can estimate Sounnd source direction, the location of realize target sound source.
Step 3: the digital signal after compensation comprises noise, sends into the self-adaptation blocking matrix module after optimizing, separates the noise signal on a road fewer than total way.
Step 4: this noise signal is sent into adaptive component filters, obtains directivity noisiness after filtering.
Step 5: this step and step 4 are carried out simultaneously, sends the signal after compensating into the rearmounted Wiener filtering module of hyperchannel, obtains elementary denoising result (still comprising directivity noise).
Step 6: a upper result and step 4 result are done difference and drawn final speech de-noising signal, reaches more significantly speech enhan-cement effect.
Wherein, step 2 is specially: by four road time-domain signals by x in Fourier transform to frequency domain
i(n)=s (n-τ
i)+v
i(n) → X
i(ω)=Se
j ω τ i+ V, then utilizes
calculate cross-power spectrum peak, obtain time delay τ.Sound in atmosphere velocity of propagation v is about 340m/s, so orient sound source Plane Angle is
finally respectively all the other sound channel signals are postponed τ respectively, 2 τ, 3 τ ..., obtain the speech data alignd.
Step 3 replaces corresponding transition function by utilizing the ratio of transition function, designs a kind of self-adaptation blocking matrix and meets output only noisy requirement.In accompanying drawing 2, the matrix of design meets the requirements, and the component of signal that non-self-adapting branch road exports is not original signal S, but the component of signal A that first row array element receives
is, wherein A
ithe coefficient of signal after Short Time Fourier Transform.Acknowledge(ment) signal can be expressed as x
i=a
i* s+v
ii=1,2 ..., after the Fourier transform of M both sides, have X
i=A
is+V
i,wherein v
ibe noise, * represents convolution, and M is acoustic pickup number.
The basic thought of step 4 self-adaptation component filters group is that the noise signal of input is carried out band splitting, and by the component signal that signal decomposition becomes to be positioned on different frequency bands, the feature then for each component signal processes respectively.Self-adaptation component filters is made up of analysis filter and composite filter.Input signal bank of filters H by analysis in accompanying drawing 3
iz () resolves into a series of component signal y
in (), becomes y after the process such as coding, compression, transmission
i' (n), finally by synthesis filter banks G
iz () reconstructs signal x (n).
Its analysis filter transfer function H
i(z) and synthesis filter transport function G
i(z), i=1,2 ..., M carries out cosine modulation acquisition respectively by low-pass filter l (z) having linear phase characteristic to.Therefore analysis filter H
i(z) and composite filter G
iz the impulse response of () is as follows:
K=0 in formula, 1 ..., K-1, i=1,2 ..., M, M are decomposition channels numbers, and K is filter order.This shows that analysis filter and composite filter have following relation
g
ithe conjugate transpose of (z).
In step 5 coefficient of S filter be according to each Channel Received Signal between auto-correlation and cross-correlation obtain, namely the coefficient of S filter is adaptive change.Noisy speech signal after postponing-adding up obtains the target voice estimated signal under minimum mean square error criterion after Wiener filtering.This method obtains good de-noising performance by relatively less microphone number under the environment of uncorrelated noise.
The invention has the beneficial effects as follows: the method mirror component structure that the present invention proposes, additional rearmounted S filter, utilize partial adaptivity technology, ensure that the denoising performance of algorithm, effectively suppress noncoherent noise and coherent noise, accelerate convergence of algorithm speed, reduce computational complexity, relative to the microphone array speech enhancement system of traditional Generalized Sidelobe Canceller, the speech-enhancement system improved is adopted to have higher output signal-to-noise ratio.Emulation experiment test structure shows, relative to the microphone array speech enhancement system based on full band generalized sidelobe canceller, method of the present invention has higher output signal-to-noise ratio.
Above content is in conjunction with concrete preferred implementation further description made for the present invention, can not assert that specific embodiment of the invention is confined to these explanations.For general technical staff of the technical field of the invention, without departing from the inventive concept of the premise, some simple deduction or replace can also be made, all should be considered as belonging to protection scope of the present invention.
Claims (7)
1. branch's process formula array voice location and an Enhancement Method, comprises the following steps:
Step 1: acoustic pickup is arranged with linear array, acoustic pickup number more than 3 or 3, accepts external sound signal;
Step 2: utilize the cross correlation function of every road signal to obtain the time delay of Received signal strength, carries out delay compensation to every road signal and to align every road signal, utilizes the time delay obtained accurately can estimate Sounnd source direction, the location of realize target source of students;
Step 3: the digital signal after compensation comprises noise, sends into the self-adaptation blocking matrix module after optimizing, separates the noise signal on a road fewer than total way;
Step 4: this noise signal is sent into adaptive component filters, obtains directivity noisiness after filtering;
Step 5: this step and step 4 are carried out simultaneously, sends the signal after compensating into the rearmounted S filter module of hyperchannel, obtains elementary denoising result, now still comprise directivity noise;
Step 6: the result of step 5 and step 4 result are done difference and drawn final speech de-noising signal.
2. method according to claim 1, is characterized in that: the acoustic pickup in described step 1 is four, and interval d=2.25cm arranges in a line, accept voice analog signal then AD conversion be digital signal.
3. method according to claim 1, is characterized in that: described step 2 is specially: every road time-domain signal by Fourier transform in frequency domain
Then utilize
calculate cross-power spectrum peak, obtain time delay τ, orienting sound source Plane Angle is
finally respectively all the other sound channel signals are postponed τ respectively, 2 τ, 3 τ ..., obtain the speech data alignd.
4. method according to claim 1, is characterized in that: the self-adaptation blocking matrix in described step 3 replaces corresponding transition function to design by utilizing the ratio of transition function, satisfied output only noisy requirement, and described self-adaptation blocking matrix is:
The component of signal that non-self-adapting branch road exports is not original signal S, but the component of signal A that first row array element receives
is, wherein A
ibe the coefficient of signal after Short Time Fourier Transform, Received signal strength can be expressed as x
i=a
i* s+v
ii=1,2 ..., after the Fourier transform of M both sides, have X
i=A
is+V
i,wherein v
ibe noise, * represents convolution, and M is acoustic pickup number.
5. method according to claim 1, it is characterized in that: the noise signal of input is carried out band splitting by the self-adaptation component filters group in described step 4, by the component signal that signal decomposition becomes to be positioned on different frequency bands, the feature then for each component signal processes respectively.Self-adaptation component filters is made up of analysis filter and composite filter, input noise signal bank of filters H by analysis
iz () resolves into a series of component signal y
in (), becomes y after the process such as coding, compression, transmission
i' (n), finally by synthesis filter banks G
iz () reconstructs signal x (n); Its analysis filter transfer function H
i(z) and synthesis filter transport function G
i(z), i=1,2 ..., M carries out cosine modulation acquisition respectively by the low-pass filter having linear phase characteristic to.
6. method according to claim 5, is characterized in that: analysis filter H
i(z) and composite filter G
iz the impulse response of () is as follows:
K=0 in formula, 1 ..., K-1, i=1,2 ..., M, M are decomposition channels numbers, and K is filter order.This shows that analysis filter and composite filter have following relation
g
ithe conjugate transpose of (z).
7. method according to claim 1, is characterized in that: in described step 5 coefficient of S filter be according to each Channel Received Signal between auto-correlation and cross-correlation obtain, namely the coefficient of S filter is adaptive change; Noisy speech signal after postponing-adding up obtains the target voice estimated signal under minimum mean square error criterion after Wiener filtering.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510066532.XA CN104715758A (en) | 2015-02-06 | 2015-02-06 | Branched processing array type speech positioning and enhancement method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510066532.XA CN104715758A (en) | 2015-02-06 | 2015-02-06 | Branched processing array type speech positioning and enhancement method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN104715758A true CN104715758A (en) | 2015-06-17 |
Family
ID=53415019
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510066532.XA Pending CN104715758A (en) | 2015-02-06 | 2015-02-06 | Branched processing array type speech positioning and enhancement method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104715758A (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105489226A (en) * | 2015-11-23 | 2016-04-13 | 湖北工业大学 | Wiener filtering speech enhancement method for multi-taper spectrum estimation of pickup |
CN109061567A (en) * | 2018-08-15 | 2018-12-21 | 广东海洋大学 | Voice accurate positioning method under multi-source environment |
CN111863017A (en) * | 2020-07-20 | 2020-10-30 | 上海汽车集团股份有限公司 | In-vehicle directional pickup method based on double-microphone array and related device |
CN112530453A (en) * | 2020-11-27 | 2021-03-19 | 五邑大学 | Voice recognition method and device suitable for noise environment |
CN113362846A (en) * | 2021-06-29 | 2021-09-07 | 辽宁工业大学 | Voice enhancement method based on generalized sidelobe cancellation structure |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101976565A (en) * | 2010-07-09 | 2011-02-16 | 瑞声声学科技(深圳)有限公司 | Dual-microphone-based speech enhancement device and method |
CN102509552A (en) * | 2011-10-21 | 2012-06-20 | 浙江大学 | Method for enhancing microphone array voice based on combined inhibition |
CN102664023A (en) * | 2012-04-26 | 2012-09-12 | 南京邮电大学 | Method for optimizing speech enhancement of microphone array |
CN102831898A (en) * | 2012-08-31 | 2012-12-19 | 厦门大学 | Microphone array voice enhancement device with sound source direction tracking function and method thereof |
US20140372129A1 (en) * | 2013-06-14 | 2014-12-18 | GM Global Technology Operations LLC | Position directed acoustic array and beamforming methods |
-
2015
- 2015-02-06 CN CN201510066532.XA patent/CN104715758A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101976565A (en) * | 2010-07-09 | 2011-02-16 | 瑞声声学科技(深圳)有限公司 | Dual-microphone-based speech enhancement device and method |
CN102509552A (en) * | 2011-10-21 | 2012-06-20 | 浙江大学 | Method for enhancing microphone array voice based on combined inhibition |
CN102664023A (en) * | 2012-04-26 | 2012-09-12 | 南京邮电大学 | Method for optimizing speech enhancement of microphone array |
CN102831898A (en) * | 2012-08-31 | 2012-12-19 | 厦门大学 | Microphone array voice enhancement device with sound source direction tracking function and method thereof |
US20140372129A1 (en) * | 2013-06-14 | 2014-12-18 | GM Global Technology Operations LLC | Position directed acoustic array and beamforming methods |
Non-Patent Citations (1)
Title |
---|
栗晓丽: "基于GSC结构的麦克风阵列语音增强算法研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105489226A (en) * | 2015-11-23 | 2016-04-13 | 湖北工业大学 | Wiener filtering speech enhancement method for multi-taper spectrum estimation of pickup |
CN109061567A (en) * | 2018-08-15 | 2018-12-21 | 广东海洋大学 | Voice accurate positioning method under multi-source environment |
CN109061567B (en) * | 2018-08-15 | 2022-08-09 | 广东海洋大学 | Voice accurate positioning method under multi-source environment |
CN111863017A (en) * | 2020-07-20 | 2020-10-30 | 上海汽车集团股份有限公司 | In-vehicle directional pickup method based on double-microphone array and related device |
CN112530453A (en) * | 2020-11-27 | 2021-03-19 | 五邑大学 | Voice recognition method and device suitable for noise environment |
CN112530453B (en) * | 2020-11-27 | 2022-04-05 | 五邑大学 | Voice recognition method and device suitable for noise environment |
CN113362846A (en) * | 2021-06-29 | 2021-09-07 | 辽宁工业大学 | Voice enhancement method based on generalized sidelobe cancellation structure |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8223989B2 (en) | Signal processing system and method for calibrating channel signals supplied from an array of sensors having different operating characteristics | |
CN104715758A (en) | Branched processing array type speech positioning and enhancement method | |
CN102509552B (en) | Method for enhancing microphone array voice based on combined inhibition | |
CN109889186B (en) | Broadband beam forming method based on multistage filter bank | |
CN103854660B (en) | A kind of four Mike's sound enhancement methods based on independent component analysis | |
CN105355210A (en) | Preprocessing method and device for far-field speech recognition | |
CN104835503A (en) | Improved GSC self-adaptive speech enhancement method | |
CN102831898A (en) | Microphone array voice enhancement device with sound source direction tracking function and method thereof | |
CN101964934A (en) | Binary microphone microarray voice beam forming method | |
CN105590631A (en) | Method and apparatus for signal processing | |
CN106772457A (en) | A kind of satellite navigation anti-interference method based on space-time frequency framework | |
CN104683614A (en) | Adaptive echo cancellation method adopting memory proportionate affine projection and based on M-estimation | |
CN202307119U (en) | Multiple-microphone-array noise eliminating device and system thereof | |
CN112180320B (en) | Unmanned aerial vehicle passive positioning system and method | |
CN109637554A (en) | MCLP speech dereverberation method based on CDR | |
CN110047507A (en) | A kind of identification of sound source method and device | |
CN104865584B (en) | A kind of implementation method of null tone adaptive navigation Anti-interference algorithm | |
CN104777491A (en) | Blind beam broadband interference suppression method and device | |
CN102165708A (en) | Signal processing method, signal processing device, and signal processing program | |
CN113504549B (en) | Navigation space-time anti-interference method based on generalized sidelobe canceller | |
CN103916199A (en) | Device and method for time delay and phase adjustment of antenna signal | |
CN110166024B (en) | Weak sound beacon signal enhancement method | |
Stolbov et al. | Speech enhancement with microphone array using frequency-domain alignment technique | |
CN110310650A (en) | A kind of voice enhancement algorithm based on second-order differential microphone array | |
Oliinyk et al. | Center weighted median filter application to time delay estimation in non-Gaussian noise environment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20150617 |