CN106601265B - A method of eliminating millimeter wave bioradar noise in voice - Google Patents

A method of eliminating millimeter wave bioradar noise in voice Download PDF

Info

Publication number
CN106601265B
CN106601265B CN201611163151.4A CN201611163151A CN106601265B CN 106601265 B CN106601265 B CN 106601265B CN 201611163151 A CN201611163151 A CN 201611163151A CN 106601265 B CN106601265 B CN 106601265B
Authority
CN
China
Prior art keywords
noise
imf
voice
signal
frequency
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201611163151.4A
Other languages
Chinese (zh)
Other versions
CN106601265A (en
Inventor
陈扶明
王健琪
李盛
李钊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fourth Military Medical University FMMU
904th Hospital of the Joint Logistics Support Force of PLA
Original Assignee
中国人民解放军第四军医大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中国人民解放军第四军医大学 filed Critical 中国人民解放军第四军医大学
Priority to CN201611163151.4A priority Critical patent/CN106601265B/en
Publication of CN106601265A publication Critical patent/CN106601265A/en
Application granted granted Critical
Publication of CN106601265B publication Critical patent/CN106601265B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L21/0224Processing in the time domain
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L21/0232Processing in the frequency domain
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272Voice signal separating
    • G10L21/0308Voice signal separating characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L2021/02087Noise filtering the noise being separate speech, e.g. cocktail party

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Machine Translation (AREA)
  • Noise Elimination (AREA)

Abstract

The invention discloses a kind of methods for eliminating millimeter wave bioradar noise in voice.The present invention be by the millimeter radar voice signal to acquisition after empirical mode decomposition, on the basis of analyzing noise content in intrinsic mode function, according in intrinsic mode function the characteristics of noise profile, high frequency, intermediate frequency, low frequency part are denoised using adaptive threshold, so as to according to noise content number it is targeted eliminate voice in noise content, with stronger adaptability and validity, it particularly can be good at the detailed information for retaining voice, facilitate significantly improving in radar voice quality.Using the example of this method show it is this have targetedly empirical mode decomposition method can effectively eliminate noise content in millimetre-wave radar voice, compared with traditional sound enhancement method, with stronger adaptability, and the intelligibility of voice can be significantly improved on the basis of eliminating radar noise in voice content.

Description

A method of eliminating millimeter wave bioradar noise in voice
[technical field]
The present invention relates to make an uproar in radar field of voice signal more particularly to a kind of elimination millimeter wave bioradar voice The method of sound.
[background technique]
As one with the speech detection method non-contact, remote, acoustic resistive noise jamming ability is strong, bioradar Speech detections that speech detection technological break-through microphone is interfered vulnerable to acoustic noise and other need to contact with human skin The limitation of device is gradually applied and is developed in speech detection field.Currently, although millimeter wave bioradar can have Effect obtain 20 meters outside human body voice signal, however obtain voice signal often by electromagnetism harmonic noise, circuit noise and Ambient noise etc. is interfered, and the presence of these noises reduces the quality of radar voice to a certain extent, or even affects language The intelligibility of sound, therefore, how the effective noise content eliminated in radar voice exists to millimeter wave bioradar technology Further application in speech detection field is of great significance.
[summary of the invention]
It is an object of the invention to: a kind of method for eliminating noise in millimeter wave bioradar is provided, this method is according to warp Radar voice noise characteristic distributions after empirical mode decomposition, under the premise of not causing voice signal to be distorted and excessive damage, Effectively remove the noise content in radar voice.For bioradar speech detection technology development provide speech de-noising in terms of Technical support.
Effectively to realize above-mentioned purpose, the technical solution adopted by the present invention is that:
A method of eliminating millimeter wave bioradar noise in voice, comprising the following steps:
1) bioradar voice is sampled, obtains grandfather tape and makes an uproar radar voice signal;
2) to the grandfather tape of acquisition make an uproar radar voice signal carry out empirical mode decomposition be intrinsic mode function IMF;
3) noise content analysis is carried out to the intrinsic mode function after empirical mode decomposition;
4) according to radar voice noise characteristic distributions, the selection of noise component(s) number is carried out using Mutual information entropy, to distinguish High frequency and intermediate frequency separation distinguish intermediate frequency and low frequency separation using given threshold;
5) intrinsic mode function after self-adaptive solution is reconstructed to obtain enhanced radar voice.
As a further improvement of the present invention, step 2) specifically includes the following steps:
Assuming that be x (t) to speech signal, empirical mode decomposition method by screening process it is adaptive by the original letter Number it is decomposed into intrinsic mode function, and each intrinsic mode function IMF is a sub- frequency content of original signal, tool The screening process of body is as follows:
2.1) all maximum of signal x (t) and minimum point are found out;
2.2) maximum and minimum point are fitted by sample interpolation three times respectively, respectively obtain maximum envelope eu(t) With minimum envelope ed(t);
2.3) the mean value m of maximum envelope and minimum envelope is calculated1(t)=(eu(t)+ed(t))/2, and with original Signal, which subtracts the mean value, can obtain h1(t)=x (t)-m1(t);
2.4) judge h1(t) whether meet two screening primary conditions of intrinsic mode function IMF:
(a) in all signal lengths, the zero passage points of each intrinsic mode function IMF must be with extreme point number phase Same, or both number is not much different in 1;
(b) at any sampled point, the mean value of maximum envelope and minimum envelope is zero, that is, meets IMF about the time Axis is geometrically symmetric;
At this point, if h1(t) meet (a) and (b) two primary conditions of intrinsic mode function IMF, then IMF1(t)=h1 (t);
If 2.5) h1(t) two primary conditions of IMF are unsatisfactory for, then make h1(t) step is repeated as new original signal 2.1), 2.2), 2.3) obtain h2(t)=h1(t)-m2(t);If h2(t) meet IMF and sieve condition then IMF substantially1(t)=h2 (t), at this time if h2(t) it is unsatisfactory for IMF screening condition and then executes stopping screening threshold value, the stopping criterion such as following formula:
Stopping screening threshold value SD value range should meet between 0.2 and 0.3;
If h2(t) meet stopping criterion, then IMF1(t)=h2(t);
If h2(t) it is unsatisfactory for, then by h2(t) step 2.1) -2.4 is repeated), get h under continuingi(t), until hi(t) Meet two primary conditions or meet stopping criterion, then IMF1(t)=hi(t);
2.6) when getting IMF1(t) after, it is removed from original signal and obtains residual signals r1(t):r1(t)=x (t)-IMF1(t), the original signal that residual signals are new as one repeats step 2.1) -2.5), until obtaining next residual Difference signal, then decomposing finally, then residual signals are indicated with following formula: rn(t)=rn-1(t)-IMFn(t);
At this point, rnIt (t) is a monotonic sequence, after the completion of screening, original signal is broken down into multiple intrinsic mode function IMF1 (t)、IMF2(t)…IMFn(t) and residual sequence rn(t);N is positive integer;Therefore, original signal may be expressed as:
As a further improvement of the present invention, original radar voice signal is decomposed into radio-frequency head through empirical mode decomposition Divide, intermediate-frequency section, low frequency part, noise concentrates on high frequency section, and intermediate-frequency section is useful primary speech signal, and low frequency is language The detailed information of sound signal.
As a further improvement of the present invention, in step 4), bioradar voice intrinsic mode function calculates Mutual information entropy Method is as follows:
The Mutual information entropy of two variable Xs and Y can calculate according to the following formula:
Wherein, p (x, y) is variable X and the Joint Distribution of Y, and p (x) and p (y) are marginal distribution or Mutual information entropy respectively Following formula is expressed as with the form of entropy:
I(X;Y)=H (X)-H (X | Y) (4)
Wherein:
H (X) is the intrinsic entropy of variable X, and H (X | Y) it is the entropy of variable X under the premise of variable Y occurs.
As a further improvement of the present invention, in step 4), high frequency, intermediate frequency separation, intermediate frequency, low frequency separation according to Lower step determines:
The mutual information entropy between adjacent IMF is calculated, according to Mutual information entropy theory, adjacent Mutual information entropy there will be following rule Rule: first reduce and increase afterwards, then increasing:
IMF is as distinguishing the separation of high frequency and intermediate frequency when selecting information entropy minimum value according to formula (7), and distinguishes intermediate frequency With the separation of low frequency by carrying out empirical mode decomposition to radar voice signal after, according to noise in each intrinsic mode function point Cloth situation finally determines that fixed threshold FT is 10-1;If the amplitude of the maximum value of IMFs is less than threshold value FT, IMFs is as low Frequency part.
As a further improvement of the present invention, in step 5), to noise-containing intrinsic mode function through adaptive threshold Denoising is using empirical mode decomposition method to radar speech de-noising, wherein the function of adaptive threshold are as follows:
Wherein, N is the length of signal, and σ is the noise variance of each rank IMF estimation:
The threshold function table that formula (8) are used to high frequency section, the collapse threshold that intermediate-frequency section is used are as follows:
Soft threshold method is used to eliminate the noise signal of high frequency and intermediate-frequency section:
Wherein, sign is sign function, and i represents the i-th rank intrinsic mode function.
As a further improvement of the present invention, in step 5), with remaining low frequency component and the high frequency after threshold denoising Voice signal is reconstructed with intermediate frequency component, reconstruction signal is indicated with following formula:
Wherein, k is determining high fdrequency component number, and l is the intrinsic mode number of components of determining differentiation high frequency and intermediate frequency, n It is the intrinsic mode number after empirical modal boundary.
The present invention is allowed to the good effect having compared with existing radar speech enhancement technique due to using above-mentioned technology It is:
The present invention is through the millimeter radar voice signal to acquisition after empirical mode decomposition, in intrinsic mode function On the basis of noise content is analyzed, according in intrinsic mode function the characteristics of noise profile, using adaptive threshold to height Frequently, intermediate frequency, low frequency part are denoised, so as to according to noise content number it is targeted eliminate voice in noise Content has stronger adaptability and validity, particularly can be good at the detailed information for retaining voice, facilitates in radar Voice quality significantly improves.Using the example of this method show it is this have targetedly empirical mode decomposition method can have Effect eliminates the noise content in millimetre-wave radar voice, compared with traditional sound enhancement method, has stronger adaptability, and The intelligibility of voice can be significantly improved on the basis of eliminating radar noise in voice content.Do not causing radar voice signal Under the premise of distortion, radar noise in voice is efficiently removed.The present invention can detect human body language using bioradar to be later Effective technical support is provided in terms of sound.Therefore, the present invention has stronger use value in terms of eliminating radar voice noise And application prospect.
Further, high frequency, intermediate frequency and low frequency three parts are divided into the intrinsic mode function of decomposition, by observing eigen mode Noise profile situation in state function, by radar noise in voice characteristic distributions are as follows: noise focuses primarily upon high frequency section, intermediate frequency portion Divide predominantly useful primary speech signal, but still contain a small amount of noise signal, low frequency part is mainly that voice signal is thin Save signal.High frequency and intermediate frequency component number are selected by Mutual information entropy on this basis, wherein high frequency section contains Higher threshold value is arranged in much noise.And the separation for distinguishing intermediate frequency and low frequency finds out fixed threshold 10 according to experiment-1.If The amplitude of the maximum value of IMFs is less than threshold value FT, then these IMFs are low frequency part.Higher-frequency is arranged for intermediate-frequency section The lesser adaptive threshold in part, low frequency part do not do denoising.
[Detailed description of the invention]
Fig. 1 is a kind of method flow diagram for eliminating millimeter wave bioradar noise in voice;
Fig. 2 is one section of radar phonetic material after empirical mode decomposition, wherein (a) is noise-containing original radar language Sound signal;(b) it is each rank intrinsic mode function after empirical mode decomposition;
Fig. 3 is the original radar voice signal of radar system acquisition, wherein when (a) is millimetre-wave radar voice signal Domain waveform;(b) be millimetre-wave radar voice signal sound spectrograph;
Fig. 4 is that traditional algorithm spectrum-subtraction carries out the voice signal after de-noising, wherein (a) is millimetre-wave radar voice signal Time domain waveform;(b) be millimetre-wave radar voice signal sound spectrograph;
Fig. 5 is that empirical mode decomposition described in this patent and Mutual information entropy algorithm eliminate the radar voice signal after noise, Wherein, (a) is the time domain waveform of millimetre-wave radar voice signal;(b) be millimetre-wave radar voice signal sound spectrograph.
[specific embodiment]
With reference to the accompanying drawing, a specific embodiment of the invention is described in detail, but the present invention is not limited to the implementations Example.In order to make the public have thorough understanding to the present invention, is preferably applied in following present invention and concrete details is described in detail in example.
Referring to Fig. 1, the present invention eliminates the basic principle of the method for millimeter wave bioradar noise in voice are as follows: first to original Beginning radar voice signal carries out empirical mode decomposition;Noise profile in intrinsic mode function after empirical mode decomposition is carried out It divides;It is divided using separation of the Mutual information entropy to high frequency section and intermediate-frequency section, each rank intrinsic mode function is carried out Adaptive Wavelet Thrinkage;To treated, radar voice signal is reconstructed.
A kind of method of elimination millimeter wave bioradar noise in voice of the invention, the specific steps of which are as follows:
Firstly, sampled to bioradar voice, obtains grandfather tape and make an uproar radar voice signal;
Secondly, the original radar voice signal to acquisition carries out empirical mode decomposition;
Making an uproar to described band, steps are as follows for millimetre-wave radar voice signal progress empirical mode decomposition:
Assuming that being x (t) to speech signal, empirical mode decomposition method is original by this by " screening " process adaptive Signal decomposition is intrinsic mode function (IMFs), and each IMF is a sub- frequency content of original signal.It is specific Screening process is as follows:
1) all maximum of signal x (t) and minimum point are found out.
2) maximum and minimum point are fitted by sample interpolation three times respectively, respectively obtain maximum envelope eu(t) and Minimum envelope ed(t)。
3) the mean value m of maximum envelope and minimum envelope is calculated1(t)=(eu(t)+ed(t))/2, and with original letter H can be obtained by number subtracting the mean value1(t)=x (t)-m1(t).
4) judge h1(t) whether meet two screening primary conditions of IMF:
(a) in all signal lengths, the zero passage points of each IMF must be identical as extreme point number, or at most Difference one.
(b) at any sampled point, the mean value of maximum envelope and minimum envelope is necessary for zero, that is to say, that IMF is It is geometrically symmetric about time shaft.
At this point, if h1(t) meet two primary conditions of IMF, then IMF1(t)=h1(t)
If 5) h1(t) two primary conditions of IMF are unsatisfactory for, then h1(t) step 1) is repeated as new original signal, 2), 3) h is obtained2(t)=h1(t)-m2(t), if h2(t) meet IMF and sieve condition then IMF substantially1(t)=h2(t).At this time such as Fruit h2(t) being unsatisfactory for IMF screening condition, then a new stopping screening threshold value is performed.The stopping criterion such as following formula:
Under normal circumstances, SD value range is between 0.2 and 0.3.If h2(t) meet stopping criterion, then IMF1(t)= h2(t).If h2(t) it is unsatisfactory for, then by h2(t) step 1) -4 is repeated), get h under continuingi(t), until hi(t) meet two A stop condition meets stopping criterion.Then IMF1(t)=hi(t)。
6) when getting IMF1(t) after, it is removed from original signal and obtains residual signals r1(t):r1(t)=x (t)- IMF1(t) is at this point, the residual signals original signal new as one, repeats step 1-5, until obtaining next residual signals. It so decomposes finally, then residual signals can be indicated with following formula: rn(t)=rn-1(t)-IMFn(t)。
At this point, rnIt (t) is a monotonic sequence, after the completion of screening, original signal is broken down into multiple intrinsic mode function IMF1 (t),IMF2(t),…IMFn(t) and residual sequence rn(t) therefore, original signal may be expressed as:
By original radar voice signal through empirical mode decomposition be intrinsic mode function.Fig. 2 is one section through empirical modal point Radar phonetic material after solution.Fig. 2 (a) is noise-containing original radar voice signal.After Fig. 2 (b) is empirical mode decomposition Each rank intrinsic mode function, from top to bottom, the frequency of the intrinsic mode function gradually decreases.Since radar voice is main Noise source be white Gaussian noise, therefore, in general, noise can be distributed in entire intrinsic mode function, can from figure (b) To find out, it is substantially not visible voice signal waveform, predominantly noise signal in first three rank intrinsic mode function, and from 4-9 rank Intrinsic mode starts, mainly voice signal waveform, and there are partial noise signals.From the 10th rank single order to the end, eigen mode The frequency of state function is low and amplitude very little, however wherein includes the detailed information of some voices, and therefore, we can make such Assuming that: by original radar voice signal through empirical mode decomposition, it can be decomposed into high frequency section, intermediate-frequency section, low frequency part is made an uproar Sound focuses primarily upon high frequency section, and intermediate-frequency section is mainly useful primary speech signal, and low frequency is that the details of voice signal is believed Breath.
Again, noise content analysis is carried out to the intrinsic mode function after empirical mode decomposition.
Then, according to radar voice noise characteristic distributions, the selection of noise component(s) number is carried out, using Mutual information entropy with area Divide high frequency and intermediate frequency separation, given threshold is utilized to distinguish intermediate frequency and low frequency separation.
It is as follows that bioradar voice intrinsic mode function calculates Mutual information entropy method:
Mutual information entropy is all nonnegative number under normal circumstances, and the Mutual information entropy of two variable Xs and Y can calculate according to the following formula:
Wherein, p (x, y) is variable X and the Joint Distribution of Y, and p (x) and p (y) are marginal distribution respectively.Or Mutual information entropy Following formula is expressed as with the form of entropy:
I(X;Y)=H (X)-H (X | Y) (4)
Wherein:
H (X) is the intrinsic entropy of variable X, and H (X | Y) it is the entropy of variable X under the premise of variable Y occurs.
When event X is more uncertain, in general, the correlation between two variables is stronger, then mutual between variable by the bigger of entropy Information entropy is bigger.According to the above feature, the mutual information entropy between adjacent IMF is calculated, according to Mutual information entropy theory, adjacent mutual trust Breath entropy will have following rule: first reduces and increases afterwards, then increasing:
IMF is as distinguishing the separation of high frequency and intermediate frequency when selecting information entropy minimum value according to formula (7), and distinguishes intermediate frequency With the separation of low frequency by carrying out empirical mode decomposition to radar voice signal after, according to noise in each intrinsic mode function point Cloth situation finally determines that fixed threshold (FT) is 10-1;If the amplitude of the maximum value of IMFs is less than threshold value FT, IMFs is Low frequency part.
Finally, the intrinsic mode function after self-adaptive solution is reconstructed to obtain enhanced radar voice.
To noise-containing intrinsic mode function through Adaptive Wavelet Thrinkage, its step are as follows:
Using empirical mode decomposition method in radar speech de-noising, the selection of threshold value is played a crucial role.It is logical The generic function of normal threshold value by:
Herein, N is the length of signal, and σ is the noise variance of each rank IMF estimation:
Herein, the threshold function table of formula (8) is used to high frequency section, it is relatively small for intermediate-frequency section noisiness, To avoid voice signal from being distorted, to the collapse threshold of intermediate-frequency section use are as follows:
Soft threshold method is used to eliminate the noise signal of high frequency and intermediate-frequency section:
Then, weight is carried out to voice signal with remaining low frequency component and the high frequency after threshold denoising and intermediate frequency component Structure, reconstruction signal are indicated with following formula:
Wherein, k is determining high fdrequency component number, and l is the intrinsic mode number of components of determining differentiation high frequency and intermediate frequency, n It is the intrinsic mode number after empirical modal boundary.
Generally speaking, the radar voice enhancement algorithm that the application proposes is mainly following steps,
1. original radar voice signal x (t) is sieved using empirical mode decomposition.
2. calculating the Energy-Entropy of each component by formula (5) and (6).
3. the Mutual information entropy for calculating each adjacent component presses formula (4).
4. determining the separation of high frequency and intermediate frequency component according to formula (7).
5. determining the separation of intermediate frequency and low frequency component using IMF fixed threshold.
6. being filtered out using formula (8)-(11) to the noise in high frequency and intermediate frequency voice signal.
7. voice signal is reconstructed with the height after low frequency component and formula (12) denoising not processed, intermediate frequency component.
Embodiment
In order to guarantee the consistency of sound source, the recording materials " 1-2-3-4-5-6 " of a women experimenter is selected, and by sound Case plays at away from 5 meters of radar system.
Radar voice is enhanced with following step:
1) bioradar voice is sampled, obtains grandfather tape and makes an uproar radar voice signal;
2) empirical mode decomposition is carried out to the original radar voice signal of acquisition;
3) noise content analysis is carried out to the intrinsic mode function after empirical mode decomposition;
4) according to radar voice noise characteristic distributions, the selection of noise component(s) number is carried out using Mutual information entropy;
5) intrinsic mode function after self-adaptive solution is reconstructed to obtain enhanced radar voice.
Processing result comparison, please refers to shown in Fig. 3, Fig. 4, Fig. 5, wherein (a) is millimetre-wave radar voice signal in each figure Time domain waveform, figure (b) be millimetre-wave radar voice signal sound spectrograph.
Fig. 2 is the original radar voice signal acquired by radar system, the either time domain from original radar voice signal In waveform or sound spectrograph it can be seen that, voice signal by noise jamming, these noise sources mainly have electromagnetism harmonic noise, Circuit noise and ambient noise.The presence of these noises often affects the quality of radar voice signal.
Fig. 4 is that the voice signal after de-noising is carried out by traditional algorithm spectrum-subtraction, as can be seen that voice letter from time domain waveform Noise in number is significantly eliminated, the noise in especially silent section, from Fig. 4 (b) sound spectrograph again it can be seen that through After spectrum-subtraction processing, the noise in silent section is significantly eliminated, but in high frequency section, some non-primary radar voice letters Ingredient in number is introduced, referred to as " music noise ", this is acoustically often influencing voice quality.
Fig. 5 is that empirical mode decomposition described in this patent and Mutual information entropy algorithm eliminate the radar voice signal after noise. It can be seen from the figure that the noise in radar voice signal has also obtained significantly inhibiting, although can see in Fig. 5 (b) It arrives, in non-speech segment, however it remains part residual noise, this is that this method is to avoid voice signal distortion, and use is adaptive Answer the result of threshold value.For the validity for further verifying this method, synchronizes and carried out Subjective audiometry experiment, experimental result table Bright, compared with spectrum cuts algorithm, the method for proposition effectively eliminates radar voice letter under the premise of not causing voice signal to be distorted Noise in number, improves voice quality.
Although the discussed method for eliminating the noise in millimetre-wave radar voice of the invention has specific aim, it is especially suitable for Millimetre-wave radar voice signal, but the range that the present invention uses is not limited to millimetre-wave radar voice signal, for centimetre Wave radar voice signal and some acoustic speech signals acquired under identical environment equally have important directive significance and Reference value.Embodiments of the present invention are explained in detail above in conjunction with attached drawing, is not intended to limit the invention in any way, all It is according to the technical essence of the invention to any simple modification, change and equivalent structural changes made by the above embodiment, It still falls in the protection scope of technical solution of the present invention.

Claims (6)

1. a kind of method for eliminating millimeter wave bioradar noise in voice, which comprises the following steps:
1) bioradar voice is sampled, obtains grandfather tape and makes an uproar radar voice signal;
2) to the grandfather tape of acquisition make an uproar radar voice signal carry out empirical mode decomposition be intrinsic mode function IMF;
3) noise content analysis is carried out to the intrinsic mode function after empirical mode decomposition;
4) according to radar voice noise characteristic distributions, the selection of noise component(s) number is carried out, using Mutual information entropy to distinguish high frequency With intermediate frequency separation, intermediate frequency and low frequency separation are distinguished using given threshold;
5) intrinsic mode function after self-adaptive solution is reconstructed to obtain enhanced radar voice;
Step 2) specifically includes the following steps:
Assuming that being x (t) to speech signal, empirical mode decomposition method divides the original signal by the way that screening process is adaptive Solution is intrinsic mode function, and each intrinsic mode function IMF is a sub- frequency content of original signal, specific Screening process is as follows:
2.1) all maximum of signal x (t) and minimum point are found out;
2.2) maximum and minimum point are fitted by sample interpolation three times respectively, respectively obtain maximum envelope eu(t) and it is minimum It is worth envelope ed(t);
2.3) the mean value m of maximum envelope and minimum envelope is calculated1(t)=(eu(t)+ed(t))/2, and original signal is used H can be obtained by subtracting the mean value1(t)=x (t)-m1(t);
2.4) judge h1(t) whether meet two screening primary conditions of intrinsic mode function IMF:
(a) in all signal lengths, the zero passage points of each intrinsic mode function IMF must be identical as extreme point number, Or both number be not much different in 1;
(b) at any sampled point, the mean value of maximum envelope and minimum envelope is zero, that is, meeting IMF about time shaft is Geometrically symmetric;
At this point, if h1(t) meet (a) and (b) two primary conditions of intrinsic mode function IMF, then IMF1(t)=h1(t);
If 2.5) h1(t) two primary conditions of IMF are unsatisfactory for, then make h1(t) step is repeated as new original signal 2.1), 2.2), 2.3) obtain h2(t)=h1(t)-m2(t);If h2(t) meet IMF and sieve condition then IMF substantially1(t)=h2 (t), at this time if h2(t) it is unsatisfactory for IMF screening condition and then executes stopping screening threshold value, the stopping criterion such as following formula:
Stopping screening threshold value SD value range should meet between 0.2 and 0.3;
If h2(t) meet stopping criterion, then IMF1(t)=h2(t);
If h2(t) it is unsatisfactory for, then by h2(t) step 2.1) -2.4 is repeated), get h under continuingi(t), until hi(t) meet Two primary conditions meet stopping criterion, then IMF1(t)=hi(t);
2.6) when getting IMF1(t) after, it is removed from original signal and obtains residual signals r1(t):r1(t)=x (t)- IMF1(t), the original signal that residual signals are new as one repeats step 2.1) -2.5), until obtaining next residual error letter Number, then decomposing finally, then residual signals are indicated with following formula: rn(t)=rn-1(t)-IMFn(t);
At this point, rnIt (t) is a monotonic sequence, after the completion of screening, original signal is broken down into multiple intrinsic mode function IMF1(t)、 IMF2(t)…IMFn(t) and residual sequence rn(t);N is positive integer;Therefore, original signal may be expressed as:
2. a kind of method for eliminating millimeter wave bioradar noise in voice according to claim 1, it is characterised in that: will Original radar voice signal is decomposed into high frequency section, intermediate-frequency section, low frequency part through empirical mode decomposition, and noise concentrates on height Frequency part, intermediate-frequency section are useful primary speech signal, and low frequency is the detailed information of voice signal.
3. a kind of method for eliminating millimeter wave bioradar noise in voice according to claim 1, it is characterised in that: step It is rapid 4) in, bioradar voice intrinsic mode function calculate Mutual information entropy method it is as follows:
The Mutual information entropy of two variable Xs and Y can calculate according to the following formula:
Wherein, p (x, y) is variable X and the Joint Distribution of Y, and p (x) and p (y) are marginal distribution or Mutual information entropy entropy respectively Form be expressed as following formula:
I(X;Y)=H (X)-H (X | Y) (4)
Wherein:
H (X) is the intrinsic entropy of variable X, and H (X | Y) it is the entropy of variable X under the premise of variable Y occurs.
4. a kind of method for eliminating millimeter wave bioradar noise in voice according to claim 3, it is characterised in that: step It is rapid 4) in, high frequency, intermediate frequency separation, intermediate frequency, low frequency separation according to following steps determine:
The mutual information entropy between adjacent IMF is calculated, according to Mutual information entropy theory, adjacent Mutual information entropy there will be following rule: First reduce and increase afterwards, is then increasing:
IMF is as distinguishing the separation of high frequency and intermediate frequency when selecting information entropy minimum value according to formula (7), and distinguishes intermediate frequency and low After the separation of frequency is by carrying out empirical mode decomposition to radar voice signal, according to noise profile feelings in each intrinsic mode function Condition finally determines that fixed threshold FT is 10-1;If the amplitude of the maximum value of IMFs is less than threshold value FT, IMFs is low frequency portion Point.
5. according to a kind of method for eliminating millimeter wave bioradar noise in voice described in claim 1, it is characterised in that: step It 5) is using empirical mode decomposition method to radar language to noise-containing intrinsic mode function through Adaptive Wavelet Thrinkage in Sound denoises, wherein the function of adaptive threshold are as follows:
Wherein, N is the length of signal, and σ is the noise variance of each rank IMF estimation:
The threshold function table that formula (8) are used to high frequency section, the collapse threshold that intermediate-frequency section is used are as follows:
Soft threshold method is used to eliminate the noise signal of high frequency and intermediate-frequency section:
Wherein, sign is sign function, and i represents the i-th rank intrinsic mode function.
6. according to a kind of method of elimination millimeter wave bioradar noise in voice described in claim 5, it is characterised in that: step 5) in, voice signal is reconstructed with remaining low frequency component and the high frequency after threshold denoising and intermediate frequency component, reconstruct letter It number is indicated with following formula:
Wherein, k is determining high fdrequency component number, and l is the intrinsic mode number of components of determining differentiation high frequency and intermediate frequency, n be through Intrinsic mode number after testing mode boundary.
CN201611163151.4A 2016-12-15 2016-12-15 A method of eliminating millimeter wave bioradar noise in voice Expired - Fee Related CN106601265B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611163151.4A CN106601265B (en) 2016-12-15 2016-12-15 A method of eliminating millimeter wave bioradar noise in voice

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611163151.4A CN106601265B (en) 2016-12-15 2016-12-15 A method of eliminating millimeter wave bioradar noise in voice

Publications (2)

Publication Number Publication Date
CN106601265A CN106601265A (en) 2017-04-26
CN106601265B true CN106601265B (en) 2019-08-13

Family

ID=58801731

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611163151.4A Expired - Fee Related CN106601265B (en) 2016-12-15 2016-12-15 A method of eliminating millimeter wave bioradar noise in voice

Country Status (1)

Country Link
CN (1) CN106601265B (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107993316B (en) * 2017-11-28 2020-09-11 广东工业大学 Gun cabinet management method, system, medium and equipment
CN109086684A (en) * 2018-07-11 2018-12-25 盐城线尚天使科技企业孵化器有限公司 Biometric discrimination method under multi-modal more application scenarios
CN110602621B (en) * 2019-09-09 2021-06-01 惠州市锦好医疗科技股份有限公司 Noise reduction method and system for digital hearing aid and special DSP
CN111276154B (en) * 2020-02-26 2022-12-09 中国电子科技集团公司第三研究所 Wind noise suppression method and system and shot sound detection method and system
CN113178198A (en) * 2021-04-16 2021-07-27 杭州环木信息科技有限责任公司 Non-contact voiceprint biometric authentication method based on millimeter wave perception

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101320566A (en) * 2008-06-30 2008-12-10 中国人民解放军第四军医大学 Non-air conduction speech reinforcement method based on multi-band spectrum subtraction
CN103778921A (en) * 2013-12-28 2014-05-07 中国人民解放军兰州军区乌鲁木齐总医院 Method for eliminating nonuniform noise in speech collected by radar
CN104299620A (en) * 2014-09-22 2015-01-21 河海大学 Speech enhancement method based on EMD algorithm
CN105788603A (en) * 2016-02-25 2016-07-20 深圳创维数字技术有限公司 Audio identification method and system based on empirical mode decomposition

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101320566A (en) * 2008-06-30 2008-12-10 中国人民解放军第四军医大学 Non-air conduction speech reinforcement method based on multi-band spectrum subtraction
CN103778921A (en) * 2013-12-28 2014-05-07 中国人民解放军兰州军区乌鲁木齐总医院 Method for eliminating nonuniform noise in speech collected by radar
CN104299620A (en) * 2014-09-22 2015-01-21 河海大学 Speech enhancement method based on EMD algorithm
CN105788603A (en) * 2016-02-25 2016-07-20 深圳创维数字技术有限公司 Audio identification method and system based on empirical mode decomposition

Also Published As

Publication number Publication date
CN106601265A (en) 2017-04-26

Similar Documents

Publication Publication Date Title
CN106601265B (en) A method of eliminating millimeter wave bioradar noise in voice
Hu et al. Speech enhancement based on wavelet thresholding the multitaper spectrum
Chatlani et al. EMD-based filtering (EMDF) of low-frequency noise for speech enhancement
Sheikhzadeh et al. An improved wavelet-based speech enhancement system
US7620546B2 (en) Isolating speech signals utilizing neural networks
CN110277087B (en) Pre-judging preprocessing method for broadcast signals
Eshaghi et al. Voice activity detection based on using wavelet packet
Upadhyay et al. An improved multi-band spectral subtraction algorithm for enhancing speech in various noise environments
Morales-Cordovilla et al. Feature extraction based on pitch-synchronous averaging for robust speech recognition
Sanam et al. Enhancement of noisy speech based on a custom thresholding function with a statistically determined threshold
Nongpiur et al. Impulse-noise suppression in speech using the stationary wavelet transform
Hsu et al. Voice activity detection based on frequency modulation of harmonics
Islam et al. Rayleigh modeling of teager energy operated perceptual wavelet packet coefficients for enhancing noisy speech
Hamid et al. Speech enhancement using EMD based adaptive soft-thresholding (EMD-ADT)
Özaydın et al. Speech enhancement using maximal overlap discrete wavelet transform
Rao et al. Speech enhancement using sub-band cross-correlation compensated Wiener filter combined with harmonic regeneration
JP2003510665A (en) Apparatus and method for de-esser using adaptive filtering algorithm
Sulong et al. Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing.
Farooq et al. Mel-scaled wavelet filter based features for noisy unvoiced phoneme recognition
Upadhyay et al. Single channel speech enhancement utilizing iterative processing of multi-band spectral subtraction algorithm
Sulong et al. Speech enhancement based on wiener filter and compressive sensing
Sanam et al. A combination of semisoft and μ-law thresholding functions for enhancing noisy speech in wavelet packet domain
Kumar et al. Noise Reduction Algorithm for Speech Enhancement
Ravi et al. Wavelet based noise reduction techniques for real time speech enhancement
Kim et al. A new binary mask based on noise constraints for improved speech intelligibility

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20200305

Address after: 710032 No. 169 West Changle Road, Shaanxi, Xi'an

Co-patentee after: No. 94 Hospital of the Joint Logistics Support Force of the Chinese People's Liberation Army

Patentee after: FOURTH MILITARY MEDICAL University

Address before: 710032 No. 169 West Changle Road, Xincheng District, Shaanxi, Xi'an

Patentee before: FOURTH MILITARY MEDICAL University

TR01 Transfer of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20190813

Termination date: 20201215

CF01 Termination of patent right due to non-payment of annual fee