CN106601265A - Method for eliminating noise in millimeter wave biological radar voice - Google Patents

Method for eliminating noise in millimeter wave biological radar voice Download PDF

Info

Publication number
CN106601265A
CN106601265A CN201611163151.4A CN201611163151A CN106601265A CN 106601265 A CN106601265 A CN 106601265A CN 201611163151 A CN201611163151 A CN 201611163151A CN 106601265 A CN106601265 A CN 106601265A
Authority
CN
China
Prior art keywords
imf
voice
noise
signal
frequency
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201611163151.4A
Other languages
Chinese (zh)
Other versions
CN106601265B (en
Inventor
陈扶明
王健琪
李盛
李钊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fourth Military Medical University FMMU
904th Hospital of the Joint Logistics Support Force of PLA
Original Assignee
中国人民解放军第四军医大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中国人民解放军第四军医大学 filed Critical 中国人民解放军第四军医大学
Priority to CN201611163151.4A priority Critical patent/CN106601265B/en
Publication of CN106601265A publication Critical patent/CN106601265A/en
Application granted granted Critical
Publication of CN106601265B publication Critical patent/CN106601265B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L21/0224Processing in the time domain
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L21/0232Processing in the frequency domain
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272Voice signal separating
    • G10L21/0308Voice signal separating characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L2021/02087Noise filtering the noise being separate speech, e.g. cocktail party

Abstract

The invention discloses a method for eliminating noise in millimeter wave biological radar voice. After an acquired millimeter wave radar voice signal is subjected to empirical mode decomposition, the method, on the basis of analysis of noise content in an intrinsic mode function, de-noises a high-frequency part, a intermediate-frequency part, and a low-frequency part by using an adaptive threshold according to the characteristics of the noise distribution in the intrinsic mode function so as to eliminate the noise content in the voice in a targeted way based on the amount of the noise content, has strong adaptability and effectiveness, in particular can well retain detail information of the voice, and contributes to a significant improvement in radar voice quality. An example using the method shows that the targeted empirical mode decomposition method can effectively eliminate the noise content in the millimeter wave radar voice. Compared with a conventional voice enhancement method, the method has strong adaptability and can significantly improve voice intelligibility on the basis of eliminating the noise content in the radar voice.

Description

A kind of method for eliminating millimeter wave bioradar noise in voice
【Technical field】
The present invention relates to make an uproar in radar field of voice signal, more particularly to a kind of elimination millimeter wave bioradar voice The method of sound.
【Background technology】
There is the strong speech detection method of noncontact, remote, acoustic resistive noise jamming ability, bioradar as one Speech detection technological break-through microphone is easily by acoustic noise interference and other speech detections for needing with contact human skin The limitation of device, is gradually applied and is developed in speech detection field.At present, although millimeter wave bioradar can have Effect obtain 20 meters outside human body voice signal, but obtain voice signal often by electromagnetism harmonic noise, circuit noise and Ambient noise etc. is disturbed, and the presence of these noises reduces to a certain extent the quality of radar voice, or even have impact on language The intelligibility of sound, therefore, how the effective noise content eliminated in radar voice exists to millimeter wave bioradar technology Further apply significant in speech detection field.
【The content of the invention】
It is an object of the invention to:A kind of method for eliminating noise in millimeter wave bioradar is provided, the method is according to Jing Radar voice noise characteristic distributions after empirical mode decomposition, on the premise of voice signal distortion and excessive damage is not caused, Effectively remove the noise content in radar voice.For bioradar speech detection technology development provide speech de-noising in terms of Technical support.
Effectively to realize above-mentioned purpose, the technical solution used in the present invention is:
A kind of method for eliminating millimeter wave bioradar noise in voice, comprises the following steps:
1) bioradar voice is sampled, obtains grandfather tape and make an uproar radar voice signal;
2) radar voice signal of making an uproar to the grandfather tape of collection carries out empirical mode decomposition for intrinsic mode function IMF;
3) noise content analysis is carried out to the intrinsic mode function after empirical mode decomposition;
4) according to radar voice noise characteristic distributions, the selection of noise component(s) number is carried out using Mutual information entropy, to distinguish High frequency and intermediate frequency separation, using given threshold intermediate frequency and low frequency separation are distinguished;
5) intrinsic mode function Jing after self-adaptive solution is reconstructed and obtains enhanced radar voice.
As a further improvement on the present invention, step 2) specifically include following steps:
Assume that given voice signal is x (t), empirical mode decomposition method is adaptive by the original letter by screening process Number intrinsic mode function is decomposed into, and each intrinsic mode function IMF is a sub- frequency content of primary signal, its tool The screening process of body is as follows:
2.1) all of maximum of signal x (t) and minimum point are found out;
2.2) maximum and minimum point are fitted respectively by three sample interpolations, respectively obtain maximum envelope eu(t) With minimum envelope ed(t);
2.3) average m of maximum envelope and minimum envelope is calculated1(t)=(eu(t)+ed(t))/2, and with original Signal deducts the average and can obtain h1(t)=x (t)-m1(t);
2.4) h is judged1T whether () meet two screening primary conditions of intrinsic mode function IMF:
A () in all of signal length, the zero passage of each intrinsic mode function IMF points must count out phase with extreme value Same, or both number is more or less the same in 1;
B () is zero in the average of any sample point, maximum envelope and minimum envelope, that is, meet IMF with regard to the time Axle is geometrically symmetric;
Now, if h1T () meets (a) and (b) two primary conditions of intrinsic mode function IMF, then IMF1(t)=h1 (t);
If 2.5) h1T () is unsatisfactory for two primary conditions of IMF, then make h1T () is used as new primary signal repeat step 2.1), 2.2), 2.3) h is obtained2(t)=h1(t)-m2(t);If h2T () meets IMF and sieves condition then IMF substantially1(t)=h2 (t), if now h2T () is unsatisfactory for IMF screenings condition and then performs stopping screening threshold value, the stopping criterion such as following formula:
Stopping screening threshold value SD span should meet between 0.2 and 0.3;
If h2T () meets stopping criterion, then IMF1(t)=h2(t);
If h2T () is unsatisfactory for, then by h2(t) repeat step 2.1) -2.4), under continuation h is goti(t), until hi(t) Meet two primary conditions or meet stopping criterion, then IMF1(t)=hi(t);
2.6) when getting IMF1After (t), it is removed from primary signal and obtains residual signals r1(t):r1(t)=x (t)-IMF1(t), using residual signals as a new primary signal, repeat step 2.1) -2.5), until obtaining next residual Difference signal, then decompose finally, then residual signals are represented with following formula:rn(t)=rn-1(t)-IMFn(t);
Now, rnT () is a monotonic sequence, after the completion of screening, primary signal is broken down into multiple intrinsic mode function IMF1 (t)、IMF2(t)…IMFn(t) and residual sequence rn(t);N is positive integer;Therefore, primary signal is represented by:
As a further improvement on the present invention, by original radar voice signal Jing empirical mode decompositions, it is decomposed into radio-frequency head Point, intermediate-frequency section, low frequency part, noise concentrates on HFS, and intermediate-frequency section is useful primary speech signal, and low frequency is language The detailed information of message number.
As a further improvement on the present invention, step 4) in, bioradar voice intrinsic mode function calculates mutual trust entropy side Method is as follows:
The mutual trust entropy of two variable Xs and Y can be calculated according to the following formula:
Wherein, p (x, y) is the Joint Distribution of variable X and Y, and p (x) and p (y) is respectively marginal distribution, or Mutual information entropy Following formula is expressed as with the form of entropy:
I(X;Y)=H (X)-H (X | Y) (4)
Wherein:
H (X) is the intrinsic entropy of variable X, and H (X | Y) it is the entropy of variable X on the premise of variable Y occurs.
As a further improvement on the present invention, step 4) in, high frequency, intermediate frequency separation, intermediate frequency, low frequency separation according to Lower step determines:
The mutual trust entropy between adjacent IMF is calculated, according to mutual trust entropy theory, adjacent Mutual information entropy there will be following rule: First reduce and increase afterwards, then in increase:
According to IMF during formula (7) selection information entropy minimum of a value as the separation for distinguishing high frequency and intermediate frequency, and distinguish intermediate frequency With the separation of low frequency by carrying out to radar voice signal after empirical mode decomposition, according to noise in each intrinsic mode function point Cloth situation finally determines that fixed threshold FT is 10-1;If the amplitude of the maximum of IMFs is less than threshold value FT, then IMFs is as low Frequency part.
As a further improvement on the present invention, step 5) in, to containing noisy intrinsic mode function Jing adaptive thresholds Denoising, is that the function of wherein adaptive threshold is to radar speech de-noising using empirical mode decomposition method:
Wherein, N is the length of signal, and σ is the noise variance that each rank IMF estimates:
To HFS using formula (8) threshold function table, the collapse threshold that intermediate-frequency section is adopted for:
Soft threshold method is used to eliminate the noise signal of high frequency and intermediate-frequency section:
Wherein, sign is sign function, and i represents the i-th rank intrinsic mode function.
As a further improvement on the present invention, step 5) in, the high frequency with remaining low frequency component and Jing after threshold denoising Voice signal is reconstructed with intermediate frequency component, reconstruction signal is represented with following formula:
Wherein, the high fdrequency component number that k is to determine, the intrinsic mode number of components for distinguishing high frequency and intermediate frequency that l is to determine, n It is the intrinsic mode number after empirical modal boundary.
The present invention is due to using above-mentioned technology, being allowed to the good effect having compared with existing radar speech enhancement technique It is:
The present invention is by the millimeter radar voice signal to obtaining Jing after empirical mode decomposition, in intrinsic mode function On the basis of noise content is analyzed, according in intrinsic mode function the characteristics of noise profile, using adaptive threshold to height Frequently, intermediate frequency, low frequency part carry out denoising such that it is able to according to noise content number targetedly eliminate voice in noise Content, with stronger adaptability and validity, particularly can be good at retaining the detailed information of voice, contribute in radar Voice quality is significantly improved.Show this can have with targetedly empirical mode decomposition method using the example of the method Effect eliminates the noise content in millimetre-wave radar voice, compared with traditional sound enhancement method, with stronger adaptability, and The intelligibility of voice can be significantly improved on the basis of radar noise in voice content is eliminated.Do not causing radar voice signal On the premise of distortion, radar noise in voice is efficiently removed.The present invention can be for later using bioradar detection human body language Sound aspect provides effective technical support.Therefore, the present invention has stronger use value in terms of radar voice noise is eliminated And application prospect.
Further, the intrinsic mode function to decomposing is divided into high frequency, intermediate frequency and the part of low frequency three, by observing eigen mode Noise profile situation in state function, be by radar noise in voice characteristic distributions:Noise focuses primarily upon HFS, intermediate frequency portion Divide predominantly useful primary speech signal, but still contain a small amount of noise signal, it is thin that low frequency part is mainly voice signal Section signal.On this basis high frequency and intermediate frequency component number are selected by Mutual information entropy, wherein, HFS contains Much noise, arranges higher threshold value.And the separation for distinguishing intermediate frequency and low frequency finds out fixed threshold 10 according to experiment-1.If The amplitude of the maximum of IMFs is less than threshold value FT, then these IMFs are low frequency part.For intermediate-frequency section arranges higher-frequency The less adaptive threshold in part, low frequency part does not do denoising.
【Description of the drawings】
Fig. 1 is a kind of method flow diagram of elimination millimeter wave bioradar noise in voice;
Fig. 2 is one section of radar phonetic material Jing after empirical mode decomposition, wherein (a) is containing noisy original radar language Message number;B () is each rank intrinsic mode function after empirical mode decomposition;
Fig. 3 is the original radar voice signal of radar system collection, wherein, when (a) being millimetre-wave radar voice signal Domain waveform;B () is the sound spectrograph of millimetre-wave radar voice signal;
Fig. 4 is that traditional algorithm spectrum-subtraction carries out the voice signal after de-noising, wherein, (a) it is millimetre-wave radar voice signal Time domain waveform;B () is the sound spectrograph of millimetre-wave radar voice signal;
Fig. 5 is that the empirical mode decomposition described in this patent and Mutual information entropy algorithm eliminate the radar voice signal after noise, Wherein, (a) be millimetre-wave radar voice signal time domain waveform;B () is the sound spectrograph of millimetre-wave radar voice signal.
【Specific embodiment】
Below in conjunction with the accompanying drawings, the specific embodiment of the present invention is described in detail, but the invention is not restricted to the enforcement Example.In order that the public has to the present invention thoroughly understanding, in present invention below preferably applies example concrete details is described in detail.
Referring to Fig. 1, the present invention eliminates the general principle of the method for millimeter wave bioradar noise in voice and is:First to original Beginning radar voice signal carries out empirical mode decomposition;Noise profile in intrinsic mode function Jing after empirical mode decomposition is carried out Divide;The separation of HFS and intermediate-frequency section is divided using Mutual information entropy, each rank intrinsic mode function is carried out Adaptive Wavelet Thrinkage;Radar voice signal after process is reconstructed.
A kind of method of elimination millimeter wave bioradar noise in voice of the present invention, it is comprised the following steps that:
First, bioradar voice is sampled, obtains grandfather tape and make an uproar radar voice signal;
Secondly, the original radar voice signal to gathering carries out empirical mode decomposition;
It is as follows that millimetre-wave radar voice signal of making an uproar to described band carries out empirical mode decomposition step:
Assume that given voice signal is x (t), empirical mode decomposition method is original by this by " screening " process adaptive Signal decomposition is intrinsic mode function (IMFs), and each IMF is a sub- frequency content of primary signal.Its is specific Screening process is as follows:
1) all of maximum of signal x (t) and minimum point are found out.
2) maximum and minimum point are fitted respectively by three sample interpolations, respectively obtain maximum envelope eu(t) and Minimum envelope ed(t)。
3) average m of maximum envelope and minimum envelope is calculated1(t)=(eu(t)+ed(t))/2, and use original letter Number deducting the average can obtain h1(t)=x (t)-m1(t).
4) h is judged1T whether () meet two screening primary conditions of IMF:
(a) in all of signal length, the zero passage of each IMF points must count out with extreme value it is identical, or at most Difference one.
B () is necessary for zero in the average of any sample point, maximum envelope and minimum envelope, that is to say, that IMF is It is geometrically symmetric with regard to time shaft.
Now, if h1T () meets two primary conditions of IMF, then IMF1(t)=h1(t)
If 5) h1T () is unsatisfactory for two primary conditions of IMF, then h1T () is used as new primary signal repeat step 1), 2), 3) h is obtained2(t)=h1(t)-m2(t), if h2T () meets IMF and sieves condition then IMF substantially1(t)=h2(t).Now such as Fruit h2T () is unsatisfactory for IMF screenings condition then a new stopping screening threshold value is performed.The stopping criterion such as following formula:
Generally, SD spans are between 0.2 and 0.3.If h2T () meets stopping criterion, then IMF1(t)= h2(t).If h2T () is unsatisfactory for, then by h2(t) repeat step 1) -4), under continuation h is goti(t), until hiT () meets two Individual stop condition meets stopping criterion.Then IMF1(t)=hi(t)。
6) when getting IMF1After (t), it is removed from primary signal and obtains residual signals r1(t):r1(t)=x (t)- IMF1(t). now, residual signals are used as a new primary signal, repeat step 1-5, until obtaining next residual signals. So decompose finally, then residual signals can be represented with following formula:rn(t)=rn-1(t)-IMFn(t)。
Now, rnT () is a monotonic sequence, after the completion of screening, primary signal is broken down into multiple intrinsic mode function IMF1 (t),IMF2(t),…IMFn(t) and residual sequence rn(t). therefore, primary signal is represented by:
It is intrinsic mode function by original radar voice signal Jing empirical mode decompositions.Fig. 2 is one section of Jing empirical modal point Radar phonetic material after solution.Fig. 2 (a) is containing noisy original radar voice signal.Fig. 2 (b) is after empirical mode decomposition Each rank intrinsic mode function, from top to bottom, the frequency of the intrinsic mode function is gradually lowered.Due to radar voice it is main Noise source be white Gaussian noise, therefore, in general, noise can be distributed in whole intrinsic mode function, from figure (b) in can To find out, voice signal waveform, predominantly noise signal are substantially not visible in first three rank intrinsic mode function, and from 4-9 ranks Intrinsic mode starts, mainly voice signal waveform, and there is partial noise signal.From the 10th rank to last single order, eigen mode The frequency of state function is low and amplitude very little, but the wherein detailed information comprising some voices, therefore, we can make such Assume:By original radar voice signal Jing empirical mode decompositions, HFS can be decomposed into, intermediate-frequency section, low frequency part is made an uproar Sound focuses primarily upon HFS, and intermediate-frequency section is mainly useful primary speech signal, and low frequency is believed for the details of voice signal Breath.
Again, noise content analysis is carried out to the intrinsic mode function after empirical mode decomposition.
Then, according to radar voice noise characteristic distributions, the selection of noise component(s) number is carried out using Mutual information entropy, with area Divide high frequency and intermediate frequency separation, using given threshold intermediate frequency and low frequency separation are distinguished.
It is as follows that bioradar voice intrinsic mode function calculates mutual trust entropy method:
Mutual information entropy is all generally that the mutual trust entropy of nonnegative number, two variable Xs and Y can be calculated according to the following formula:
Wherein, p (x, y) is the Joint Distribution of variable X and Y, and p (x) and p (y) are respectively marginal distributions.Or Mutual information entropy Following formula is expressed as with the form of entropy:
I(X;Y)=H (X)-H (X | Y) (4)
Wherein:
H (X) is the intrinsic entropy of variable X, and H (X | Y) it is the entropy of variable X on the premise of variable Y occurs.
When event X is more uncertain, entropy is bigger. and in general, the correlation between two variables is stronger, then mutual between variable Information entropy is bigger.According to above feature, the mutual trust entropy between adjacent IMF is calculated, according to mutual trust entropy theory, adjacent mutual trust entropy will Have following rule:First reduce and increase afterwards, then in increase:
According to IMF during formula (7) selection information entropy minimum of a value as the separation for distinguishing high frequency and intermediate frequency, and distinguish intermediate frequency With the separation of low frequency by carrying out to radar voice signal after empirical mode decomposition, according to noise in each intrinsic mode function point Cloth situation finally determines that fixed threshold (FT) is 10-1;If the amplitude of the maximum of IMFs is less than threshold value FT, then IMFs is Low frequency part.
Finally, the intrinsic mode function Jing after self-adaptive solution is reconstructed and obtains enhanced radar voice.
To containing noisy intrinsic mode function Jing Adaptive Wavelet Thrinkages, its step is as follows:
Using empirical mode decomposition method in radar speech de-noising, the selection of threshold value plays vital effect.It is logical Often the generic function of threshold value by:
Herein, N is the length of signal, and σ is the noise variance that each rank IMF estimates:
Herein, the threshold function table of formula (8) is adopted to HFS, it is relatively small for intermediate-frequency section noisiness, To avoid voice signal distortion, the collapse threshold that intermediate-frequency section is adopted for:
Soft threshold method is used to eliminate the noise signal of high frequency and intermediate-frequency section:
Then, the high frequency and intermediate frequency component with remaining low frequency component and Jing after threshold denoising carries out weight to voice signal Structure, reconstruction signal is represented with following formula:
Wherein, the high fdrequency component number that k is to determine, the intrinsic mode number of components for distinguishing high frequency and intermediate frequency that l is to determine, n It is the intrinsic mode number after empirical modal boundary.
Generally speaking, the radar voice enhancement algorithm that the application is proposed is mainly following steps,
1. original radar voice signal x (t) is sieved using empirical mode decomposition.
2. the Energy-Entropy of each component is calculated by formula (5) and (6).
3. the Mutual information entropy for calculating each adjacent component presses formula (4).
4. the separation of high frequency and intermediate frequency component is determined according to formula (7).
5. the separation of intermediate frequency and low frequency component is determined using IMF fixed thresholds.
6. the noise in high frequency and intermediate frequency voice signal is leached using formula (8)-(11).
7. voice signal is reconstructed with the height after the low frequency component and formula (12) denoising not processed, intermediate frequency component.
Embodiment
In order to ensure the uniformity of sound source, the recording materials " 1-2-3-4-5-6 " of a women experimenter is selected, and by sound Case is being played at 5 meters of radar system.
Radar voice is strengthened with following step:
1) bioradar voice is sampled, obtains grandfather tape and make an uproar radar voice signal;
2) the original radar voice signal to gathering carries out empirical mode decomposition;
3) noise content analysis is carried out to the intrinsic mode function after empirical mode decomposition;
4) according to radar voice noise characteristic distributions, using Mutual information entropy the selection of noise component(s) number is carried out;
5) intrinsic mode function Jing after self-adaptive solution is reconstructed and obtains enhanced radar voice.
Result is contrasted, and be refer to shown in Fig. 3, Fig. 4, Fig. 5, wherein (a) is millimetre-wave radar voice signal in each figure Time domain waveform, scheme (b) be millimetre-wave radar voice signal sound spectrograph.
Fig. 2 is the original radar voice signal gathered by radar system, either from the time domain of original radar voice signal In waveform or sound spectrograph it can be seen that, voice signal receive noise jamming, these noise sources mainly have electromagnetism harmonic noise, Circuit noise and ambient noise.The presence of these noises often have impact on the quality of radar voice signal.
Fig. 4 is to carry out the voice signal after de-noising by traditional algorithm spectrum-subtraction, as can be seen that voice letter from time domain waveform Noise in number is significantly eliminated, the noise especially mourned in silence in section, again it can be seen that Jing from Fig. 4 (b) sound spectrographs After spectrum-subtraction process, the noise mourned in silence in section is significantly eliminated, but in HFS, some non-primary radar voice letters Composition in number is introduced, and is referred to as " music noise ", and this often affects voice quality acoustically.
Fig. 5 is that the empirical mode decomposition described in this patent and Mutual information entropy algorithm eliminate the radar voice signal after noise. It can be seen that the noise in radar voice signal has also obtained significantly suppressing, although can see in Fig. 5 (b) Arrive, in non-speech segment, however it remains part residual noise, it is to avoid causing voice signal distortion that this is the method, using adaptive Answer the result of threshold value.For the validity of further checking the method, Subjective audiometry experiment, experimental result table are synchronously carried out Bright, compared with spectrum cuts algorithm, the method for proposition effectively eliminates radar voice letter on the premise of voice signal distortion is not caused Noise in number, improves voice quality.
Although the method for eliminating the noise in millimetre-wave radar voice that the present invention is discussed has specific aim, it is especially suitable for Millimetre-wave radar voice signal, but the scope that the present invention is used is not limited to millimetre-wave radar voice signal, for centimetre Ripple radar voice signal, and some acoustic speech signals for gathering under equivalent environment equally have important directive significance and Reference value.Embodiments of the present invention are explained in detail above in conjunction with accompanying drawing, not the present invention is imposed any restrictions, it is all It is substantially above embodiment is made any simple modification, change and equivalent structure change according to the technology of the present invention, Still fall within the protection domain of technical solution of the present invention.

Claims (7)

1. it is a kind of eliminate millimeter wave bioradar noise in voice method, it is characterised in that comprise the following steps:
1) bioradar voice is sampled, obtains grandfather tape and make an uproar radar voice signal;
2) radar voice signal of making an uproar to the grandfather tape of collection carries out empirical mode decomposition for intrinsic mode function IMF;
3) noise content analysis is carried out to the intrinsic mode function after empirical mode decomposition;
4) according to radar voice noise characteristic distributions, the selection of noise component(s) number is carried out using Mutual information entropy, to distinguish high frequency With intermediate frequency separation, using given threshold intermediate frequency and low frequency separation are distinguished;
5) intrinsic mode function Jing after self-adaptive solution is reconstructed and obtains enhanced radar voice.
2. it is according to claim 1 it is a kind of eliminate millimeter wave bioradar noise in voice method, it is characterised in that:Step It is rapid 2) to specifically include following steps:
Assume that given voice signal is x (t), empirical mode decomposition method is adaptive by the primary signal point by screening process Solve as intrinsic mode function, and each intrinsic mode function IMF is a sub- frequency content of primary signal, and its is specific Screening process is as follows:
2.1) all of maximum of signal x (t) and minimum point are found out;
2.2) maximum and minimum point are fitted respectively by three sample interpolations, respectively obtain maximum envelope euIt is (t) and minimum Value envelope ed(t);
2.3) average m of maximum envelope and minimum envelope is calculated1(t)=(eu(t)+ed(t))/2, and use primary signal Deducting the average can obtain h1(t)=x (t)-m1(t);
2.4) h is judged1T whether () meet two screening primary conditions of intrinsic mode function IMF:
(a) in all of signal length, the zero passage of each intrinsic mode function IMF points must count out with extreme value it is identical, Or both number be more or less the same in 1;
B () is zero in the average of any sample point, maximum envelope and minimum envelope, that is, meet IMF is with regard to time shaft Geometrically symmetric;
Now, if h1T () meets (a) and (b) two primary conditions of intrinsic mode function IMF, then IMF1(t)=h1(t);
If 2.5) h1T () is unsatisfactory for two primary conditions of IMF, then make h1T () is used as new primary signal repeat step 2.1), 2.2), 2.3) h is obtained2(t)=h1(t)-m2(t);If h2T () meets IMF and sieves condition then IMF substantially1(t)=h2 (t), if now h2T () is unsatisfactory for IMF screenings condition and then performs stopping screening threshold value, the stopping criterion such as following formula:
S D ( i ) = Σ t = 0 N | h i - 1 ( t ) - h i ( t ) | 2 h i - 1 2 ( t ) - - - ( 1 )
Stopping screening threshold value SD span should meet between 0.2 and 0.3;
If h2T () meets stopping criterion, then IMF1(t)=h2(t);
If h2T () is unsatisfactory for, then by h2(t) repeat step 2.1) -2.4), under continuation h is goti(t), until hiT () meets Two primary conditions meet stopping criterion, then IMF1(t)=hi(t);
2.6) when getting IMF1After (t), it is removed from primary signal and obtains residual signals r1(t):r1(t)=x (t)- IMF1(t), using residual signals as a new primary signal, repeat step 2.1) -2.5), until obtaining next residual error letter Number, then decompose finally, then residual signals are represented with following formula:rn(t)=rn-1(t)-IMFn(t);
Now, rnT () is a monotonic sequence, after the completion of screening, primary signal is broken down into multiple intrinsic mode function IMF1(t)、 IMF2(t)…IMFn(t) and residual sequence rn(t);N is positive integer;Therefore, primary signal is represented by:
x ( t ) = Σ i = 1 n IMF i ( t ) + r n ( t ) . - - - ( 2 )
3. it is according to claim 2 it is a kind of eliminate millimeter wave bioradar noise in voice method, it is characterised in that:Will Original radar voice signal Jing empirical mode decompositions, are decomposed into HFS, intermediate-frequency section, low frequency part, and noise concentrates on height Frequency part, intermediate-frequency section is useful primary speech signal, and low frequency is the detailed information of voice signal.
4. it is according to claim 1 it is a kind of eliminate millimeter wave bioradar noise in voice method, it is characterised in that:Step It is rapid 4) in, bioradar voice intrinsic mode function calculate mutual trust entropy method it is as follows:
The mutual trust entropy of two variable Xs and Y can be calculated according to the following formula:
I ( X ; Y ) = Σ Σ p ( x , y ) log 2 ( p ( x , y ) p ( x ) p ( y ) ) - - - ( 3 )
Wherein, p (x, y) is the Joint Distribution of variable X and Y, and p (x) and p (y) is respectively marginal distribution, or Mutual information entropy entropy Form be expressed as following formula:
I(X;Y)=H (X)-H (X | Y) (4)
Wherein:
H ( X ) = - Σ x ∈ Ω x p ( x ) log 2 ( p ( x ) ) - - - ( 5 )
H ( X | Y ) = - Σ Σ x ∈ Ω x p ( x , y ) log 2 ( p ( x | y ) ) - - - ( 6 )
H (X) is the intrinsic entropy of variable X, and H (X | Y) it is the entropy of variable X on the premise of variable Y occurs.
5. it is according to claim 4 it is a kind of eliminate millimeter wave bioradar noise in voice method, it is characterised in that:Step It is rapid 4) in, high frequency, intermediate frequency separation, intermediate frequency, low frequency separation according to following steps determine:
The mutual trust entropy between adjacent IMF is calculated, according to mutual trust entropy theory, adjacent mutual trust entropy there will be following rule:First reduce After increase, then increase:
I f I ( IMF i , IMF i + 1 ) ↓ a n d I ( IMF i + 1 , IMF i + 2 ) ↑ k = f i r s t ( arg min 1 ≤ i ≤ n - 1 [ I ( IMF i , IMF i + 1 ) ] ) - - - ( 7 )
IMF is used as the separation for distinguishing high frequency and intermediate frequency when selecting information entropy minimum of a value according to formula (7), and distinguishes intermediate frequency and low The separation of frequency by carrying out to radar voice signal after empirical mode decomposition, according to noise profile feelings in each intrinsic mode function Condition finally determines that fixed threshold FT is 10-1;If the amplitude of the maximum of IMFs is less than threshold value FT, then IMFs is low frequency portion Point.
6. according to a kind of method of the elimination millimeter wave bioradar noise in voice described in claim 1, it is characterised in that:Step 5) in, to containing noisy intrinsic mode function Jing Adaptive Wavelet Thrinkages, being to radar language using empirical mode decomposition method The function of sound denoising, wherein adaptive threshold is:
Thr i = σ i 2 log ( N ) - - - ( 8 )
Wherein, N is the length of signal, and σ is the noise variance that each rank IMF estimates:
σ = m e d i a n { | IMF 1 ( t ) - m e d i a n { IMF 1 ( t ) } | } 0.675 - - - ( 9 )
To HFS using formula (8) threshold function table, the collapse threshold that intermediate-frequency section is adopted for:
Thr i = σ i 2 log ( N ) / i - - - ( 10 )
Soft threshold method is used to eliminate the noise signal of high frequency and intermediate-frequency section:
IMF i ′ ( t ) = s i g n { IMF i ( t ) } { IMF i ( t ) - Thr i } | IMF i ( t ) | ≥ Thr i 0 | IMF i ( t ) | ≤ Thr i - - - ( 11 )
Wherein, sign is sign function, and i represents the i-th rank intrinsic mode function.
7. according to a kind of method of the elimination millimeter wave bioradar noise in voice described in claim 6, it is characterised in that:Step 5) in, the high frequency and intermediate frequency component with remaining low frequency component and Jing after threshold denoising is reconstructed to voice signal, reconstruct letter Number represented with following formula:
y ( t ) = Σ i = 1 k IMF i ′ ( t ) + Σ i = k + 1 l IMF i ′ ( t ) + Σ i = l + 1 n IMF i ( t ) - - - ( 12 )
Wherein, the high fdrequency component number that k is to determine, the intrinsic mode number of components for distinguishing high frequency and intermediate frequency that l is to determine, n is Jing The intrinsic mode number tested after mode boundary.
CN201611163151.4A 2016-12-15 2016-12-15 A method of eliminating millimeter wave bioradar noise in voice Expired - Fee Related CN106601265B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611163151.4A CN106601265B (en) 2016-12-15 2016-12-15 A method of eliminating millimeter wave bioradar noise in voice

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611163151.4A CN106601265B (en) 2016-12-15 2016-12-15 A method of eliminating millimeter wave bioradar noise in voice

Publications (2)

Publication Number Publication Date
CN106601265A true CN106601265A (en) 2017-04-26
CN106601265B CN106601265B (en) 2019-08-13

Family

ID=58801731

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611163151.4A Expired - Fee Related CN106601265B (en) 2016-12-15 2016-12-15 A method of eliminating millimeter wave bioradar noise in voice

Country Status (1)

Country Link
CN (1) CN106601265B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107993316A (en) * 2017-11-28 2018-05-04 广东工业大学 A kind of rifle cabinet management method, system, medium and equipment
CN109086684A (en) * 2018-07-11 2018-12-25 盐城线尚天使科技企业孵化器有限公司 Biometric discrimination method under multi-modal more application scenarios
CN110602621A (en) * 2019-09-09 2019-12-20 惠州市锦好医疗科技股份有限公司 Noise reduction method and system for digital hearing aid and special DSP
CN111276154A (en) * 2020-02-26 2020-06-12 中国电子科技集团公司第三研究所 Wind noise suppression method and system and shot sound detection method and system
CN113178198A (en) * 2021-04-16 2021-07-27 杭州环木信息科技有限责任公司 Non-contact voiceprint biometric authentication method based on millimeter wave perception

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101320566A (en) * 2008-06-30 2008-12-10 中国人民解放军第四军医大学 Non-air conduction speech reinforcement method based on multi-band spectrum subtraction
CN103778921A (en) * 2013-12-28 2014-05-07 中国人民解放军兰州军区乌鲁木齐总医院 Method for eliminating nonuniform noise in speech collected by radar
CN104299620A (en) * 2014-09-22 2015-01-21 河海大学 Speech enhancement method based on EMD algorithm
CN105788603A (en) * 2016-02-25 2016-07-20 深圳创维数字技术有限公司 Audio identification method and system based on empirical mode decomposition

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101320566A (en) * 2008-06-30 2008-12-10 中国人民解放军第四军医大学 Non-air conduction speech reinforcement method based on multi-band spectrum subtraction
CN103778921A (en) * 2013-12-28 2014-05-07 中国人民解放军兰州军区乌鲁木齐总医院 Method for eliminating nonuniform noise in speech collected by radar
CN104299620A (en) * 2014-09-22 2015-01-21 河海大学 Speech enhancement method based on EMD algorithm
CN105788603A (en) * 2016-02-25 2016-07-20 深圳创维数字技术有限公司 Audio identification method and system based on empirical mode decomposition

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107993316A (en) * 2017-11-28 2018-05-04 广东工业大学 A kind of rifle cabinet management method, system, medium and equipment
CN109086684A (en) * 2018-07-11 2018-12-25 盐城线尚天使科技企业孵化器有限公司 Biometric discrimination method under multi-modal more application scenarios
CN110602621A (en) * 2019-09-09 2019-12-20 惠州市锦好医疗科技股份有限公司 Noise reduction method and system for digital hearing aid and special DSP
CN110602621B (en) * 2019-09-09 2021-06-01 惠州市锦好医疗科技股份有限公司 Noise reduction method and system for digital hearing aid and special DSP
CN111276154A (en) * 2020-02-26 2020-06-12 中国电子科技集团公司第三研究所 Wind noise suppression method and system and shot sound detection method and system
CN111276154B (en) * 2020-02-26 2022-12-09 中国电子科技集团公司第三研究所 Wind noise suppression method and system and shot sound detection method and system
CN113178198A (en) * 2021-04-16 2021-07-27 杭州环木信息科技有限责任公司 Non-contact voiceprint biometric authentication method based on millimeter wave perception

Also Published As

Publication number Publication date
CN106601265B (en) 2019-08-13

Similar Documents

Publication Publication Date Title
CN106601265A (en) Method for eliminating noise in millimeter wave biological radar voice
US10319390B2 (en) Method and system for multi-talker babble noise reduction
Madhu et al. The potential for speech intelligibility improvement using the ideal binary mask and the ideal wiener filter in single channel noise reduction systems: Application to auditory prostheses
Daqrouq et al. An investigation of speech enhancement using wavelet filtering method
Pichevar et al. Auditory-inspired sparse representation of audio signals
Edraki et al. Speech intelligibility prediction using spectro-temporal modulation analysis
Nongpiur Impulse noise removal in speech using wavelets
Vaz et al. A two-step technique for MRI audio enhancement using dictionary learning and wavelet packet analysis.
Nongpiur et al. Impulse-noise suppression in speech using the stationary wavelet transform
Islam et al. Rayleigh modeling of teager energy operated perceptual wavelet packet coefficients for enhancing noisy speech
Whitmal et al. Reducing correlated noise in digital hearing aids
Hamid et al. Speech enhancement using EMD based adaptive soft-thresholding (EMD-ADT)
CN106782588A (en) A kind of millimeter wave bioradar sound enhancement method based on sense of hearing wavelet packet
Siafarikas et al. Wavelet packet based speaker verification
Geravanchizadeh et al. Monaural speech enhancement based on multi-threshold masking
Nisa et al. The speech signal enhancement approach with multiple sub-frames analysis for complex magnitude and phase spectrum recompense
Hajiaghababa et al. An undecimated wavelet-based method for cochlear implant speech processing
Yann Transform based speech enhancement techniques
Kumar et al. Noise Reduction Algorithm for Speech Enhancement
Upadhyay An improved multi-band speech enhancement utilizing masking properties of human hearing system
Ravi Performance analysis of adaptive wavelet denosing by speech discrimination and thresholding
Singh et al. Noise removal in single channel Hindi speech patterns by using binary mask thresholding function in various mother wavelets
Kim et al. A new binary mask based on noise constraints for improved speech intelligibility
Ravi et al. Wavelet based noise reduction techniques for real time speech enhancement
Sudheer Kumar et al. Noise Reduction in Audio File Using Spectral Gatting and FFT by Python Modules

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20200305

Address after: 710032 No. 169 West Changle Road, Shaanxi, Xi'an

Co-patentee after: No. 94 Hospital of the Joint Logistics Support Force of the Chinese People's Liberation Army

Patentee after: FOURTH MILITARY MEDICAL University

Address before: 710032 No. 169 West Changle Road, Xincheng District, Shaanxi, Xi'an

Patentee before: FOURTH MILITARY MEDICAL University

TR01 Transfer of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20190813

Termination date: 20201215

CF01 Termination of patent right due to non-payment of annual fee