CN110875054B - Far-field noise suppression method, device and system - Google Patents

Far-field noise suppression method, device and system Download PDF

Info

Publication number
CN110875054B
CN110875054B CN201811012141.XA CN201811012141A CN110875054B CN 110875054 B CN110875054 B CN 110875054B CN 201811012141 A CN201811012141 A CN 201811012141A CN 110875054 B CN110875054 B CN 110875054B
Authority
CN
China
Prior art keywords
noise
energy
voice
obtaining
path
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201811012141.XA
Other languages
Chinese (zh)
Other versions
CN110875054A (en
Inventor
余涛
银鞍
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201811012141.XA priority Critical patent/CN110875054B/en
Publication of CN110875054A publication Critical patent/CN110875054A/en
Application granted granted Critical
Publication of CN110875054B publication Critical patent/CN110875054B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0264Noise filtering characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02165Two microphones, one receiving mainly the noise signal and the other one mainly the speech signal
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T90/00Enabling technologies or technologies with a potential or indirect contribution to GHG emissions mitigation

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)

Abstract

The application discloses a far-field noise suppression method, and a far-field noise suppression device and system. The far-field noise suppression method comprises the following steps: according to the method, the energy of a voice beam and the energy of a noise beam are obtained, the energy of the voice beam and the energy of the noise beam are compared and analyzed, two-path noise energy is obtained, according to the two-path noise energy and the energy of the voice beam, a first parameter for eliminating non-stationary noise is obtained, and according to the first parameter, the voice beam is subjected to filtering processing, so that first enhanced voice data are obtained. On the basis of the voice wave beam, the noise wave beam is further combined to accurately estimate the non-stationary noise in the voice signal, so that the elimination of the non-stationary noise is realized, and the accuracy of far-field voice recognition can be improved in a complex environment.

Description

Far-field noise suppression method, device and system
Technical Field
The application relates to the field of far-field voice signal processing, in particular to a far-field noise suppression method. The application also relates to a far-field noise suppression device and a far-field noise suppression system.
Background
With the continuous development of artificial intelligence technology, far-field speech recognition is becoming more and more important as a key technology of man-machine interaction, and people hope that a machine can understand human speech instructions, so that control over the machine is realized. Despite the rapid development of speech recognition technology in the past decades, the dependence of far-field speech recognition on the environment is still strong, and the accuracy of far-field speech recognition is severely degraded by a large amount of environmental noise. As the application scenarios of the voice communication device (e.g., malls, stations, streets, etc.) are increasing, the types of noise signals accompanying the voice signals are also increasing, wherein there are not only a large number of stationary noise signals but also a large number of non-stationary noise signals. This requires a better adaptation of the method for noise estimation of the speech signal, and an accurate estimate of the noise in the speech signal is made, thus suppressing the noise signal in the far-field speech signal.
In order to solve the above-mentioned problems, in the prior art, noise in a speech signal is generally estimated by using a single-path noise estimation method, so that the noise signal in the speech recognition process can be eliminated to a certain extent, and the accuracy of speech signal recognition is improved. However, the single-pass noise estimation method also exposes the following drawbacks: because only one path of target voice wave beam is used as a basis, noise estimation has certain limitation, and noise can not be eliminated well, and especially in complex environmental noise, the reliability of the noise estimation is low, the result of the noise estimation is inaccurate, and therefore, the accuracy of far-field voice recognition is low.
Disclosure of Invention
The application provides a far-field noise suppression method, which aims to solve the problems of low reliability of noise estimation and inaccuracy of noise estimation results in complex environmental noise in the prior art. The present application additionally provides a far-field noise suppression device and system.
The far-field noise suppression method provided by the application comprises the following steps:
acquiring a voice beam acquired by a first microphone and a noise beam acquired by a second microphone;
according to the voice beam and the noise beam, obtaining the energy of the voice beam and the energy of the noise beam;
comparing and analyzing the energy of the voice beam and the energy of the noise beam to obtain two-path noise energy, wherein the two-path noise energy is the energy of non-stationary noise;
obtaining a first parameter for eliminating non-stationary noise according to the two-way noise energy and the energy of the voice beam;
and filtering the voice beam according to the first parameter to obtain first enhanced voice data.
Optionally, the far-field noise suppression method further includes:
carrying out single-path noise estimation on the voice wave beam to obtain single-path noise energy, wherein the single-path noise energy is the energy of stable noise;
Performing noise comprehensive analysis according to the two-path noise energy, the one-path noise energy and the energy of the voice beam to obtain comprehensive noise energy;
obtaining a second parameter for eliminating non-stationary noise and stationary noise according to the integrated noise energy and the energy of the voice beam;
and filtering the voice beam according to the second parameter to obtain second enhanced voice data.
Optionally, the first parameter is a first nanofiltration coefficient for suppressing non-stationary noise data;
the second parameter is a second wiener filter coefficient for suppressing stationary noise data and non-stationary noise data.
Optionally, the obtaining the energy of the voice beam and the energy of the noise beam specifically includes: obtaining the energy of the voice beam at the first time point and the energy of the noise beam at the first time point;
the comparing and analyzing the energy of the voice beam and the energy of the noise beam to obtain the double-path noise energy specifically comprises:
obtaining the probability of occurrence of the voice beam at the first time point according to the comparison result of the energy of the voice beam at the first time point and the energy of the noise beam at the first time point;
And obtaining double-path noise energy according to the occurrence probability of the voice wave beam at the first time point.
Optionally, the acquiring the voice beam acquired by the first microphone and the noise beam acquired by the second microphone specifically includes:
acquiring position information of the first microphone and the second microphone and a distance value between the first microphone and the second microphone;
and acquiring a voice beam acquired by the first microphone and a noise beam acquired by the second microphone according to the position information and the distance value.
Optionally, the far-field noise suppression method further includes:
obtaining the occurrence probability of the voice wave beam at the first time point;
and determining a smoothing factor of the first microphone at the first time point according to the probability of occurrence of the voice wave beam at the first time point.
Optionally, the obtaining the two-way noise energy according to the probability of occurrence of the voice beam at the first time point specifically includes: :
determining a smoothing factor of the first microphone at a first time point according to the occurrence probability of the voice wave beam at the first time point;
and determining double-path noise energy according to the smoothing factor of the first time point and the energy of the voice beam of the first time point.
Optionally, the performing noise comprehensive analysis according to the two-way noise energy, the one-way noise energy and the energy of the voice beam to obtain comprehensive noise energy specifically includes:
obtaining a noise comprehensive analysis algorithm:
wherein,,for the single-path noise energy, +.>For the two-way noise energy, alpha is the energy of the voice beam, and J is the comprehensive noise energy;
and inputting the two-path noise energy, the one-path noise energy and the energy of the voice beam into the noise comprehensive analysis algorithm to perform noise comprehensive analysis, so as to obtain comprehensive noise energy.
Optionally, the obtaining a second parameter for eliminating non-stationary noise and stationary noise according to the integrated noise energy and the energy of the voice beam specifically includes:
obtaining a ratio of the speech beam energy to the noise synthesis energy and the speech beam energy;
determining the second parameter for eliminating non-stationary noise and stationary noise according to the ratio;
wherein, the ratio is: g=speech beam energy/(speech beam energy + noise integrated energy), where g is the second parameter.
Optionally, the performing single-path noise estimation on the voice beam to obtain single-path noise energy specifically includes:
Preliminary estimation is carried out on the voice wave beam, and the minimum value of the voice wave beam is obtained;
obtaining the probability of occurrence of the voice wave beam according to the minimum value of the voice wave beam;
and estimating the energy of the single-path noise by using recursive average according to the occurrence probability of the voice wave beam.
Correspondingly, the application also provides a far-field noise suppression device, which comprises:
the acquisition unit is used for acquiring the voice beam acquired by the first microphone and the noise beam acquired by the second microphone;
the first obtaining unit is used for obtaining the energy of the voice beam and the energy of the noise beam according to the voice beam and the noise beam;
the first analysis unit is used for comparing and analyzing the energy of the voice beam and the energy of the noise beam to obtain two-path noise energy, wherein the two-path noise energy is the energy of non-stationary noise;
a second obtaining unit configured to obtain a first parameter for eliminating non-stationary noise according to the two-way noise energy and the energy of the voice beam;
and the first processing unit is used for carrying out filtering processing on the voice wave beam according to the first parameter to obtain first enhanced voice data.
Optionally, the far-field noise suppression device further includes:
The third obtaining unit is used for carrying out single-path noise estimation on the voice wave beam to obtain single-path noise energy, wherein the single-path noise energy is the energy of stationary noise;
the second analysis unit is used for carrying out noise comprehensive analysis according to the two-path noise energy, the one-path noise energy and the energy of the voice beam to obtain comprehensive noise energy;
a fourth obtaining unit for obtaining a second parameter for eliminating non-stationary noise and stationary noise according to the integrated noise energy and the energy of the voice beam;
and the second processing unit is used for carrying out filtering processing on the voice wave beam according to the second parameter to obtain second enhanced voice data.
Optionally, the first parameter is a first nanofiltration coefficient for suppressing non-stationary noise data;
the second parameter is a second wiener filter coefficient for suppressing stationary noise data and non-stationary noise data.
Optionally, the obtaining the energy of the voice beam and the energy of the noise beam specifically includes: obtaining the energy of the voice beam at the first time point and the energy of the noise beam at the first time point;
the comparing and analyzing the energy of the voice beam and the energy of the noise beam to obtain the double-path noise energy specifically comprises:
Obtaining the probability of occurrence of the voice beam at the first time point according to the comparison result of the energy of the voice beam at the first time point and the energy of the noise beam at the first time point;
and obtaining double-path noise energy according to the occurrence probability of the voice wave beam at the first time point.
Optionally, the acquiring the voice beam acquired by the first microphone and the noise beam acquired by the second microphone specifically includes:
acquiring position information of the first microphone and the second microphone and a distance value between the first microphone and the second microphone;
and acquiring a voice beam acquired by the first microphone and a noise beam acquired by the second microphone according to the position information and the distance value.
Optionally, the far-field noise suppression device further includes:
obtaining the occurrence probability of the voice wave beam at the first time point;
and determining a smoothing factor of the first microphone at the first time point according to the probability of occurrence of the voice wave beam at the first time point.
Optionally, the obtaining the two-way noise energy according to the probability of occurrence of the voice beam at the first time point specifically includes: :
determining a smoothing factor of the first microphone at a first time point according to the occurrence probability of the voice wave beam at the first time point;
And determining double-path noise energy according to the smoothing factor of the first time point and the energy of the voice beam of the first time point.
Optionally, the performing noise comprehensive analysis according to the two-way noise energy, the one-way noise energy and the energy of the voice beam to obtain comprehensive noise energy specifically includes:
obtaining a noise comprehensive analysis algorithm:
wherein,,for the single-path noise energy, +.>For the two-way noise energy, alpha is the energy of the voice beam, and J is the comprehensive noise energy;
and inputting the two-path noise energy, the one-path noise energy and the energy of the voice beam into the noise comprehensive analysis algorithm to perform noise comprehensive analysis, so as to obtain comprehensive noise energy.
Optionally, the obtaining a second parameter for eliminating non-stationary noise and stationary noise according to the integrated noise energy and the energy of the voice beam specifically includes:
obtaining a ratio of the speech beam energy to the noise synthesis energy and the speech beam energy;
determining the second parameter for eliminating non-stationary noise and stationary noise according to the ratio;
wherein, the ratio is: g=speech beam energy/(speech beam energy + noise integrated energy), where g is the second parameter.
Optionally, the performing single-path noise estimation on the voice beam to obtain single-path noise energy specifically includes:
preliminary estimation is carried out on the voice wave beam, and the minimum value of the voice wave beam is obtained;
obtaining the probability of occurrence of the voice wave beam according to the minimum value of the voice wave beam;
and estimating the energy of the single-path noise by using recursive average according to the occurrence probability of the voice wave beam.
Correspondingly, the application also provides a far-field noise suppression system, which comprises: the far-field noise suppression device according to any one of the above technical solutions.
Correspondingly, the application also provides electronic equipment, which comprises:
a processor; and
a memory for storing a program of a far-field noise suppression method, the apparatus, after powering on and running the program of the far-field noise suppression method by the processor, performing the steps of:
acquiring a voice beam acquired by a first microphone and a noise beam acquired by a second microphone;
according to the voice beam and the noise beam, obtaining the energy of the voice beam and the energy of the noise beam;
comparing and analyzing the energy of the voice beam and the energy of the noise beam to obtain two-path noise energy, wherein the two-path noise energy is the energy of non-stationary noise;
Obtaining a first parameter for eliminating non-stationary noise according to the two-way noise energy and the energy of the voice beam;
and filtering the voice beam according to the first parameter to obtain first enhanced voice data.
Accordingly, the present application further provides a storage device storing a program of the far-field noise suppression method, the program being executed by a processor to perform the steps of:
acquiring a voice beam acquired by a first microphone and a noise beam acquired by a second microphone;
according to the voice beam and the noise beam, obtaining the energy of the voice beam and the energy of the noise beam;
comparing and analyzing the energy of the voice beam and the energy of the noise beam to obtain two-path noise energy, wherein the two-path noise energy is the energy of non-stationary noise;
obtaining a first parameter for eliminating non-stationary noise according to the two-way noise energy and the energy of the voice beam;
and filtering the voice beam according to the first parameter to obtain first enhanced voice data.
Compared with the prior art, the application has the following advantages:
the application provides a far-field noise suppression method, in particular to a far-field noise suppression method under a complex environment. The method comprises the steps of obtaining a voice beam collected by a first microphone and a noise beam collected by a second microphone, obtaining the energy of the voice beam and the energy of the noise beam according to the voice beam and the noise beam, comparing and analyzing the energy of the voice beam and the energy of the noise beam to obtain two-way noise energy, wherein the two-way noise energy is the energy of non-stationary noise, obtaining a first parameter for eliminating non-stationary noise according to the two-way noise energy and the energy of the voice beam, and performing filtering processing on the voice beam according to the first parameter to obtain first enhanced voice data. On the basis of the voice wave beam, the noise wave beam is further combined to accurately estimate the non-stationary noise in the voice signal, so that the elimination of the non-stationary noise is realized, and the accuracy of far-field voice recognition can be improved in a complex environment.
Drawings
FIG. 1 is a flow chart of an embodiment of a far field noise suppression method of the present application;
FIG. 2 is a schematic diagram of an embodiment of a far field noise suppression device of the present application;
FIG. 3 is a schematic diagram of an embodiment of a far field noise suppression electronic device of the present application;
FIG. 4 is a flowchart of an embodiment of the far field noise suppression system of the present application;
fig. 5 is a flow chart of an embodiment of a method of suppressing stationary noise and non-stationary noise according to the present application.
Detailed Description
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present application. This application is, however, susceptible of embodiment in many other ways than those herein described and similar generalizations can be made by those skilled in the art without departing from the spirit of the application and, therefore, the application is not limited to the specific embodiments disclosed below.
Embodiments thereof are described in detail below based on the far-field noise suppression method of the present application. In addition, in the following description, each step of the present method will be described in detail. Please refer to fig. 1, which is a flowchart illustrating an embodiment of a far-field noise suppression method of the present application.
Step S101: and acquiring a voice beam acquired by the first microphone and a noise beam acquired by the second microphone.
In the far-field voice recognition process, people need to acquire voice data through a multi-microphone array, and then further process the acquired voice data, so that the far-field voice data can be accurately recognized. However, in the actual voice data acquisition process, a large amount of non-stationary noise data is often mixed with the acquired voice signal data due to interference of background noise, so that the use requirement cannot be met. Therefore, in this embodiment, a specific method for eliminating noise data in a speech signal by using a speech enhancement manner is provided, where, to effectively suppress non-stationary noise in a complex background noise environment, a first microphone is required to collect a speech beam, and on the basis of the speech beam, a second microphone is further utilized to collect a noise beam to perform two-way noise estimation, so as to track and eliminate the non-stationary noise. It should be noted that, in this embodiment, the classical beam forming method MVDR/LCMV is used to generate the voice beam and the noise beam, where constraint conditions for generating the voice beam coefficient are as follows:
w speech =armin w (w H R NN w),s.t w H a(θ)=1
the noise beam is obtained from coefficients of the noise beam, the noise beam coefficients resulting from the following constraints:
Wherein R is NN A (θ) is a steering vector corresponding to the target speech direction θ, which is a noise correlation matrix.
Step S102: and obtaining the energy of the voice beam and the energy of the noise beam according to the voice beam and the noise beam.
In this embodiment, the main process of the two-way noise estimation method is: the existence probability of the voice wave beam at the frequency point in the current frame is judged through the energy difference of the same frame and the same frequency point among channels, a smoothing factor is determined according to the existence probability of the voice wave beam, and noise estimation is carried out by combining the voice spectrum information containing noise signals collected by the microphone. When the frequency point is determined to have voice, the noise estimation result is updated in real time, the energy value of the current frame is used as the noise estimation result of the current frame, and when the frequency point is determined to have voice, the noise estimation result is obtained from the noise estimation result of the previous frame of the current frame. The energy of the voice beam and the energy of the noise beam obtained according to the voice beam and the noise beam described in the present embodiment are specifically the energy of the voice beam at the first time point and the energy of the noise beam at the first time point. After the energy of the voice beam at the first time point and the energy of the noise beam at the first time point are obtained, further comparing and analyzing the energy of the voice beam at the first time point and the energy of the noise beam at the first time point to obtain the probability of occurrence of the voice beam at the first time point, and further obtaining the double-path noise energy according to the probability of occurrence of the voice beam at the first time point.
Step S103: and comparing and analyzing the energy of the voice beam and the energy of the noise beam to obtain double-path noise energy, wherein the double-path noise energy is the energy of non-stationary noise.
In this embodiment, the specific process of comparing and analyzing the energy of the voice beam and the energy of the noise beam is: assuming that the moment of the first time point is t, the probability of occurrence of the voice beam at the first time point is obtained according to the following formula:
wherein atan is an arctangent function,for the energy of the speech beam at said first point in time,/for the first point in time>The energy of the noise beam at the first time point, P (t) is the probability of occurrence of the speech beam at the first time point.
Further, the two-way noise energy is obtained according to the following formula:
wherein, gamma is a smoothing factor;for the single-path noise energy, +.>And t is the first time point for the two-way noise energy.
The energy of the two-way noise in this embodiment is the energy of the non-stationary noise.
Step S104: and obtaining a first parameter for eliminating non-stationary noise according to the two-path noise energy and the energy of the voice beam.
In this embodiment, the first parameter is a first nanofiltration coefficient for suppressing non-stationary noise data, and the obtaining the first parameter for eliminating non-stationary noise according to the two-way noise energy and the energy of the voice beam specifically includes: obtaining a solving formula of wiener filter coefficients: g=speech beam energy/(speech beam energy+noise energy), where g is the obtained first wiener coefficient, and inputting the two-way noise energy and the speech beam energy into a derivative formula of the wiener filter coefficient, thereby obtaining the first wiener filter coefficient for suppressing non-stationary noise data.
Step S105: and filtering the voice beam according to the first parameter to obtain first enhanced voice data.
In this embodiment, the filtering process is performed on the speech beam according to the first wiener enhancement coefficient, and the specific process is to obtain a wiener filtering process algorithm formula: s=g×y, where g is the first nanofiltration coefficient, y is the speech data before removing the non-stationary noise, and s is the speech data after removing the non-stationary noise. And inputting the first wiener enhancement coefficient to the algorithm formula of the wiener filtering processing process to obtain first enhanced voice data after suppressing non-stationary noise data.
In addition, the embodiment of the present application further provides a preferred implementation manner, and please refer to fig. 5, which is a flowchart of an embodiment of a method for suppressing stationary noise and non-stationary noise in the present application.
On the basis of suppressing non-stationary noise in the above embodiment, the preferred embodiment further performs single-path noise estimation on the single-path voice beam to obtain single-path noise energy. Wherein the single-path noise energy is the energy of stationary noise. And carrying out noise comprehensive analysis according to the obtained two-way noise energy, the single-way noise energy and the energy of the voice beam to obtain comprehensive noise energy, so as to obtain second parameters for eliminating non-stationary noise and stationary noise according to the comprehensive noise energy, and carrying out filtering processing on the voice beam to be identified through the second parameters to obtain second enhanced voice data. Wherein the second parameter is a second wiener filter coefficient for suppressing stationary noise data and non-stationary noise data.
It should be noted that, the performing the one-way noise estimation on the speech beam to obtain one-way noise energy specifically includes: and carrying out preliminary estimation on the voice beam by using a minimum tracking method to obtain the minimum value of the voice beam, obtaining the occurrence probability of the voice beam through the minimum value of the voice beam, and estimating and obtaining the energy of the single-path noise by using recursive average according to the occurrence probability of the voice beam.
The noise comprehensive analysis process is carried out according to the two-way noise energy, the one-way noise energy and the energy of the voice beam, and specifically comprises the following steps: obtaining a noise comprehensive analysis algorithm:wherein (1)>For the single-path noise energy, +.>And inputting the two-path noise energy, the single-path noise energy and the energy of the voice beam into a noise comprehensive analysis algorithm to perform noise comprehensive analysis to obtain comprehensive noise energy, so that non-stationary noise and stationary noise are eliminated at the same time, and the accuracy of voice recognition is further improved.
The application provides a far-field noise suppression method, in particular to a far-field noise suppression method under a complex environment. The method comprises the steps of obtaining a voice beam collected by a first microphone and a noise beam collected by a second microphone, obtaining the energy of the voice beam and the energy of the noise beam according to the voice beam and the noise beam, comparing and analyzing the energy of the voice beam and the energy of the noise beam to obtain two-way noise energy, wherein the two-way noise energy is the energy of non-stationary noise, obtaining a first parameter for eliminating non-stationary noise according to the two-way noise energy and the energy of the voice beam, and performing filtering processing on the voice beam according to the first parameter to obtain first enhanced voice data. On the basis of the voice wave beam, the noise wave beam is further combined to accurately estimate the non-stationary noise in the voice signal, so that the elimination of the non-stationary noise is realized, and the accuracy of far-field voice recognition can be improved in a complex environment.
Corresponding to the far-field noise suppression method, the present application further provides a far-field noise suppression device, where the far-field noise suppression method can be applied to the device, and please refer to fig. 2, which is a schematic diagram of an embodiment of the far-field noise suppression device of the present application. Since the present apparatus embodiment is similar to the method embodiment, the description is relatively simple, and the relevant points are only required to be referred to in the system embodiment section, and the apparatus embodiment is described below only schematically.
An acquisition unit 201, configured to acquire a voice beam acquired by the first microphone and a noise beam acquired by the second microphone.
In this embodiment, a specific method for eliminating noise data in a speech signal by using a speech enhancement manner is provided, where, to effectively suppress non-stationary noise in a complex background noise environment, a first microphone is required to collect a speech beam, and on the basis of the speech beam, a second microphone is further utilized to collect a noise beam to perform two-way noise estimation, so as to track and eliminate the non-stationary noise. It should be noted that, in this embodiment, the classical beam forming method MVDR/LCMV is used to generate the voice beam and the noise beam, where the voice beam coefficient is generated from the following constraint conditions:
w speech =armin w (w H R NN w),s.t w H a(θ)=1
The noise beam is obtained from coefficients of the noise beam, the noise beam coefficients resulting from the following constraints:
wherein R is NN A (θ) is a steering vector corresponding to the target speech direction θ, which is a noise correlation matrix.
A first obtaining unit 202, configured to obtain the energy of the voice beam and the energy of the noise beam according to the voice beam and the noise beam.
In this embodiment, the obtaining the energy of the voice beam and the energy of the noise beam according to the voice beam and the noise beam specifically includes obtaining the energy of the voice beam at the first time point and the energy of the noise beam at the first time point, comparing and analyzing the energy of the voice beam at the first time point and the energy of the noise beam at the first time point to obtain the probability of occurrence of the voice beam at the first time point, and further obtaining the two-way noise energy according to the probability of occurrence of the voice beam at the first time point.
The first analysis unit 203 is configured to perform a comparative analysis on the energy of the voice beam and the energy of the noise beam, so as to obtain two-path noise energy, where the two-path noise energy is energy of non-stationary noise.
In this embodiment, the specific process of performing the comparative analysis on the energy of the voice beam and the energy of the noise beam is to obtain the probability of occurrence of the voice beam at the first time point according to the following formula, assuming that the time of the first time point is t:
Wherein atan is an arctangent function,for the first time pointIs +.>The energy of the noise beam at the first time point, P (t) is the probability of occurrence of the speech beam at the first time point.
Further, the two-way noise energy is obtained according to the following formula:
wherein, gamma is a smoothing factor;for the single-path noise energy, +.>And t is the first time point for the two-way noise energy.
The energy of the two-way noise in this embodiment is the energy of the non-stationary noise.
A second obtaining unit 204, configured to obtain a first parameter for eliminating non-stationary noise according to the two-way noise energy and the energy of the voice beam.
In this embodiment, the first parameter is a first wiener filter coefficient for suppressing non-stationary noise data, and the specific process according to the two-path noise energy and the energy of the voice beam is to obtain a solution formula of the wiener filter coefficient: g=speech beam energy/(speech beam energy+noise energy), where g is the obtained first wiener coefficient, and inputting the two-way noise energy and the speech beam energy into a derivative formula of the wiener filter coefficient, thereby obtaining the first wiener filter coefficient for suppressing non-stationary noise data.
The first processing unit 205 is configured to perform filtering processing on the voice beam according to the first parameter, so as to obtain first enhanced voice data.
In this embodiment, the filtering process is performed on the speech beam according to the first wiener enhancement coefficient, and the specific process is to obtain a wiener filtering process algorithm formula: s=g×y, where g is the first nanofiltration coefficient, y is the speech data before removing the non-stationary noise, and s is the speech data after removing the non-stationary noise. And inputting the first wiener enhancement coefficient to the algorithm formula of the wiener filtering processing process to obtain first enhanced voice data after suppressing non-stationary noise data.
In addition, the embodiment of the present application further provides a preferred implementation manner, and please refer to fig. 5, which is a flowchart of an embodiment of a method for suppressing stationary noise and non-stationary noise in the present application.
On the basis of suppressing non-stationary noise in the above embodiment, the preferred embodiment further performs single-path noise estimation on the single-path voice beam to obtain single-path noise energy. Wherein the single-path noise energy is the energy of stationary noise. And carrying out noise comprehensive analysis according to the obtained two-way noise energy, the single-way noise energy and the energy of the voice beam to obtain comprehensive noise energy, so as to obtain second parameters for eliminating non-stationary noise and stationary noise according to the comprehensive noise energy, and carrying out filtering processing on the voice beam to be identified through the second parameters to obtain second enhanced voice data. Wherein the second parameter is a second wiener filter coefficient for suppressing stationary noise data and non-stationary noise data.
It should be noted that, the performing the one-way noise estimation on the speech beam to obtain one-way noise energy specifically includes: and carrying out preliminary estimation on the voice beam by using a minimum tracking method to obtain the minimum value of the voice beam, obtaining the occurrence probability of the voice beam through the minimum value of the voice beam, and estimating and obtaining the energy of the single-path noise by using recursive average according to the occurrence probability of the voice beam.
Corresponding to the far-field noise suppression device, the present application further provides a far-field noise suppression system, where the far-field noise suppression device can be applied to the system, and please refer to fig. 4, which is a flowchart illustrating an embodiment of the far-field noise suppression system of the present application. Since the present system embodiment is similar to the device embodiment, the description is relatively simple, and the relevant points are only required to be referred to in the system embodiment section, and the device embodiment is described below only by way of illustration.
In this embodiment, if the far-field noise suppression system is to suppress far-field noise data, firstly, dereverberation processing is required to be performed on multi-channel far-field speech data, data of each channel is selected and sent to an endpoint detection module to obtain an estimate of start-stop time of the noise data and the speech data, further multi-channel data after dereverberation is sent to a sound source positioning module to obtain speaker position information, meanwhile, beam forming processing is performed on the multi-channel data after dereverberation together with the endpoint detection information and angle information to obtain two audio streams of a speech beam and a noise beam, wherein the first microphone collects the speech beam, the second microphone collects the noise beam, noise estimation is performed on the two audio streams of the collected speech beam and the noise beam, so that noise data is suppressed, and a processed clean speech is obtained.
Corresponding to the method for recording database information provided above, the embodiment of the present application further provides an electronic device for far-field noise suppression, including: the device comprises a processor and a memory, wherein the memory is specifically used for storing a program of a far-field noise suppression method, and after the device is powered on and the program of the far-field noise suppression method is run by the processor, the following steps are executed:
acquiring a voice beam acquired by a first microphone and a noise beam acquired by a second microphone;
according to the voice beam and the noise beam, obtaining the energy of the voice beam and the energy of the noise beam;
comparing and analyzing the energy of the voice beam and the energy of the noise beam to obtain two-path noise energy, wherein the two-path noise energy is the energy of non-stationary noise;
obtaining a first parameter for eliminating non-stationary noise according to the two-way noise energy and the energy of the voice beam;
and filtering the voice beam according to the first parameter to obtain first enhanced voice data.
It should be noted that, for the detailed description of an electronic device provided in the embodiments of the present application, reference may be made to the related description of a far-field noise suppression method provided in the embodiments of the present application, which is not repeated here.
Corresponding to the method for recording database information provided above, the embodiment of the present application further provides a storage device for far-field noise suppression, including: the far-field noise suppression method described in the above embodiment.
In the storage device, a program of a far-field noise suppression method is stored, which is executed by a processor, performing the steps of:
acquiring a voice beam acquired by a first microphone and a noise beam acquired by a second microphone;
according to the voice beam and the noise beam, obtaining the energy of the voice beam and the energy of the noise beam;
comparing and analyzing the energy of the voice beam and the energy of the noise beam to obtain two-path noise energy, wherein the two-path noise energy is the energy of non-stationary noise;
obtaining a first parameter for eliminating non-stationary noise according to the two-way noise energy and the energy of the voice beam;
and filtering the voice beam according to the first parameter to obtain first enhanced voice data.
It should be noted that, for the detailed description of a storage device provided in the embodiments of the present application, reference may be made to the related description of a far-field noise suppression method provided in the embodiments of the present application, which is not repeated here.
While the preferred embodiment has been described, it is not intended to limit the invention thereto, and any person skilled in the art may make variations and modifications without departing from the spirit and scope of the present invention, so that the scope of the present invention shall be defined by the claims of the present application.

Claims (23)

1. A method of far-field noise suppression, comprising:
acquiring a voice beam acquired by a first microphone and a noise beam acquired by a second microphone;
according to the voice beam and the noise beam, obtaining the energy of the voice beam and the energy of the noise beam;
comparing and analyzing the energy of the voice beam and the energy of the noise beam to obtain two-path noise energy, wherein the two-path noise energy is the energy of non-stationary noise;
obtaining a first parameter for eliminating non-stationary noise according to the two-way noise energy and the energy of the voice beam;
and filtering the voice beam according to the first parameter to obtain first enhanced voice data.
2. The far-field noise suppression method according to claim 1, further comprising:
carrying out single-path noise estimation on the voice wave beam to obtain single-path noise energy, wherein the single-path noise energy is the energy of stable noise;
Performing noise comprehensive analysis according to the two-path noise energy, the one-path noise energy and the energy of the voice beam to obtain comprehensive noise energy;
obtaining a second parameter for eliminating non-stationary noise and stationary noise according to the integrated noise energy and the energy of the voice beam;
and filtering the voice beam according to the second parameter to obtain second enhanced voice data.
3. The method of far-field noise suppression according to claim 2, wherein the first parameter is a first nanofiltration coefficient for suppressing non-stationary noise data;
the second parameter is a second wiener filter coefficient for suppressing stationary noise data and non-stationary noise data.
4. The far-field noise suppression method according to claim 1, wherein the obtaining the energy of the speech beam and the energy of the noise beam specifically comprises: obtaining the energy of the voice beam at the first time point and the energy of the noise beam at the first time point;
the comparing and analyzing the energy of the voice beam and the energy of the noise beam to obtain the double-path noise energy specifically comprises:
obtaining the probability of occurrence of the voice beam at the first time point according to the comparison result of the energy of the voice beam at the first time point and the energy of the noise beam at the first time point;
And obtaining double-path noise energy according to the occurrence probability of the voice wave beam at the first time point.
5. The method of claim 1, wherein the acquiring the voice beam acquired by the first microphone and the noise beam acquired by the second microphone specifically comprises:
acquiring position information of the first microphone and the second microphone and a distance value between the first microphone and the second microphone;
and acquiring a voice beam acquired by the first microphone and a noise beam acquired by the second microphone according to the position information and the distance value.
6. The far-field noise suppression method according to claim 4, further comprising:
obtaining the occurrence probability of the voice wave beam at the first time point;
and determining a smoothing factor of the first microphone at the first time point according to the probability of occurrence of the voice wave beam at the first time point.
7. The method of far-field noise suppression according to claim 6, wherein the obtaining the two-way noise energy according to the probability of occurrence of the voice beam at the first time point specifically includes: :
determining a smoothing factor of the first microphone at a first time point according to the occurrence probability of the voice wave beam at the first time point;
And determining double-path noise energy according to the smoothing factor of the first time point and the energy of the voice beam of the first time point.
8. The far-field noise suppression method according to claim 2, wherein the performing noise comprehensive analysis according to the two-way noise energy, the one-way noise energy and the energy of the voice beam to obtain comprehensive noise energy specifically includes:
obtaining a noise comprehensive analysis algorithm:
wherein,,for the single-path noise energy, +.>For the two-way noise energy, alpha is the energy of the voice beam, and J is the comprehensive noise energy;
and inputting the two-path noise energy, the one-path noise energy and the energy of the voice beam into the noise comprehensive analysis algorithm to perform noise comprehensive analysis, so as to obtain comprehensive noise energy.
9. The far-field noise suppression method according to claim 2, wherein the obtaining the second parameter for eliminating the non-stationary noise and stationary noise based on the integrated noise energy and the energy of the voice beam specifically includes:
obtaining a ratio of the speech beam energy to the noise synthesis energy and the speech beam energy;
determining the second parameter for eliminating non-stationary noise and stationary noise according to the ratio;
Wherein, the ratio is: g=speech beam energy/(speech beam energy + noise integrated energy), where g is the second parameter.
10. The far-field noise suppression method according to claim 2, wherein the performing single-path noise estimation on the voice beam to obtain single-path noise energy specifically comprises:
preliminary estimation is carried out on the voice wave beam, and the minimum value of the voice wave beam is obtained;
obtaining the probability of occurrence of the voice wave beam according to the minimum value of the voice wave beam;
and estimating the energy of the single-path noise by using recursive average according to the occurrence probability of the voice wave beam.
11. A far-field noise suppression device, comprising:
the acquisition unit is used for acquiring the voice beam acquired by the first microphone and the noise beam acquired by the second microphone;
the first obtaining unit is used for obtaining the energy of the voice beam and the energy of the noise beam according to the voice beam and the noise beam;
the first analysis unit is used for comparing and analyzing the energy of the voice beam and the energy of the noise beam to obtain two-path noise energy, wherein the two-path noise energy is the energy of non-stationary noise;
A second obtaining unit configured to obtain a first parameter for eliminating non-stationary noise according to the two-way noise energy and the energy of the voice beam;
and the first processing unit is used for carrying out filtering processing on the voice wave beam according to the first parameter to obtain first enhanced voice data.
12. The far-field noise suppression device of claim 11, further comprising:
the third obtaining unit is used for carrying out single-path noise estimation on the voice wave beam to obtain single-path noise energy, wherein the single-path noise energy is the energy of stationary noise;
the second analysis unit is used for carrying out noise comprehensive analysis according to the two-path noise energy, the one-path noise energy and the energy of the voice beam to obtain comprehensive noise energy;
a fourth obtaining unit for obtaining a second parameter for eliminating non-stationary noise and stationary noise according to the integrated noise energy and the energy of the voice beam;
and the second processing unit is used for carrying out filtering processing on the voice wave beam according to the second parameter to obtain second enhanced voice data.
13. The far-field noise suppression device according to claim 12, wherein the first parameter is a first nanofiltration coefficient for suppressing non-stationary noise data;
The second parameter is a second wiener filter coefficient for suppressing stationary noise data and non-stationary noise data.
14. The far-field noise suppression device according to claim 11, wherein the obtaining the energy of the speech beam and the energy of the noise beam comprises: obtaining the energy of the voice beam at the first time point and the energy of the noise beam at the first time point;
the comparing and analyzing the energy of the voice beam and the energy of the noise beam to obtain the double-path noise energy specifically comprises:
obtaining the probability of occurrence of the voice beam at the first time point according to the comparison result of the energy of the voice beam at the first time point and the energy of the noise beam at the first time point;
and obtaining double-path noise energy according to the occurrence probability of the voice wave beam at the first time point.
15. The far-field noise suppression device according to claim 11, wherein the acquiring the speech beam acquired by the first microphone and the noise beam acquired by the second microphone specifically comprises:
acquiring position information of the first microphone and the second microphone and a distance value between the first microphone and the second microphone;
And acquiring a voice beam acquired by the first microphone and a noise beam acquired by the second microphone according to the position information and the distance value.
16. The far-field noise suppression device of claim 14, further comprising:
obtaining the occurrence probability of the voice wave beam at the first time point;
and determining a smoothing factor of the first microphone at the first time point according to the probability of occurrence of the voice wave beam at the first time point.
17. The far-field noise suppression device according to claim 16, wherein the obtaining the two-way noise energy according to the probability of occurrence of the voice beam at the first time point specifically comprises: :
determining a smoothing factor of the first microphone at a first time point according to the occurrence probability of the voice wave beam at the first time point;
and determining double-path noise energy according to the smoothing factor of the first time point and the energy of the voice beam of the first time point.
18. The far-field noise suppression device according to claim 12, wherein the noise synthesis analysis is performed according to two-way noise energy, one-way noise energy and the energy of the voice beam to obtain synthesized noise energy, and specifically comprises:
Obtaining a noise comprehensive analysis algorithm:
wherein,,for the single-path noise energy, +.>For the two-way noise energy, alpha is the energy of the voice beam, and J is the comprehensive noise energy;
and inputting the two-path noise energy, the one-path noise energy and the energy of the voice beam into the noise comprehensive analysis algorithm to perform noise comprehensive analysis, so as to obtain comprehensive noise energy.
19. The far-field noise suppression device according to claim 12, wherein the obtaining a second parameter for canceling non-stationary noise and stationary noise from the integrated noise energy and the energy of the speech beam comprises:
obtaining a ratio of the speech beam energy to the noise synthesis energy and the speech beam energy;
determining the second parameter for eliminating non-stationary noise and stationary noise according to the ratio;
wherein, the ratio is: g=speech beam energy/(speech beam energy + noise integrated energy), where g is the second parameter.
20. The far-field noise suppression apparatus according to claim 12, wherein the performing a one-way noise estimation on the voice beam to obtain one-way noise energy specifically comprises:
Preliminary estimation is carried out on the voice wave beam, and the minimum value of the voice wave beam is obtained;
obtaining the probability of occurrence of the voice wave beam according to the minimum value of the voice wave beam;
and estimating the energy of the single-path noise by using recursive average according to the occurrence probability of the voice wave beam.
21. A far field noise suppression system, comprising: far field noise suppression apparatus according to any one of the preceding claims 11-20.
22. An electronic device, comprising:
a processor; and
a memory for storing a program of a far-field noise suppression method, the apparatus, after powering on and running the program of the far-field noise suppression method by the processor, performing the steps of:
acquiring a voice beam acquired by a first microphone and a noise beam acquired by a second microphone;
according to the voice beam and the noise beam, obtaining the energy of the voice beam and the energy of the noise beam;
comparing and analyzing the energy of the voice beam and the energy of the noise beam to obtain two-path noise energy, wherein the two-path noise energy is the energy of non-stationary noise;
obtaining a first parameter for eliminating non-stationary noise according to the two-way noise energy and the energy of the voice beam;
And filtering the voice beam according to the first parameter to obtain first enhanced voice data.
23. A storage device storing a program of a far-field noise suppression method, the program being executed by a processor to perform the steps of:
acquiring a voice beam acquired by a first microphone and a noise beam acquired by a second microphone;
according to the voice beam and the noise beam, obtaining the energy of the voice beam and the energy of the noise beam;
comparing and analyzing the energy of the voice beam and the energy of the noise beam to obtain two-path noise energy, wherein the two-path noise energy is the energy of non-stationary noise;
obtaining a first parameter for eliminating non-stationary noise according to the two-way noise energy and the energy of the voice beam;
and filtering the voice beam according to the first parameter to obtain first enhanced voice data.
CN201811012141.XA 2018-08-31 2018-08-31 Far-field noise suppression method, device and system Active CN110875054B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811012141.XA CN110875054B (en) 2018-08-31 2018-08-31 Far-field noise suppression method, device and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811012141.XA CN110875054B (en) 2018-08-31 2018-08-31 Far-field noise suppression method, device and system

Publications (2)

Publication Number Publication Date
CN110875054A CN110875054A (en) 2020-03-10
CN110875054B true CN110875054B (en) 2023-07-25

Family

ID=69715316

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811012141.XA Active CN110875054B (en) 2018-08-31 2018-08-31 Far-field noise suppression method, device and system

Country Status (1)

Country Link
CN (1) CN110875054B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111696567B (en) * 2020-06-12 2022-04-01 思必驰科技股份有限公司 Noise estimation method and system for far-field call
CN112037825B (en) * 2020-08-10 2022-09-27 北京小米松果电子有限公司 Audio signal processing method and device and storage medium
CN111968615A (en) * 2020-08-31 2020-11-20 Oppo广东移动通信有限公司 Noise reduction processing method and device, terminal equipment and readable storage medium
CN112969130A (en) * 2020-12-31 2021-06-15 维沃移动通信有限公司 Audio signal processing method and device and electronic equipment

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107316649A (en) * 2017-05-15 2017-11-03 百度在线网络技术(北京)有限公司 Audio recognition method and device based on artificial intelligence

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8046219B2 (en) * 2007-10-18 2011-10-25 Motorola Mobility, Inc. Robust two microphone noise suppression system
CN101964934A (en) * 2010-06-08 2011-02-02 浙江大学 Binary microphone microarray voice beam forming method
CN102456351A (en) * 2010-10-14 2012-05-16 清华大学 Voice enhancement system
EP2659487B1 (en) * 2010-12-29 2016-05-04 Telefonaktiebolaget LM Ericsson (publ) A noise suppressing method and a noise suppressor for applying the noise suppressing method
CN102938254B (en) * 2012-10-24 2014-12-10 中国科学技术大学 Voice signal enhancement system and method
CN102969002B (en) * 2012-11-28 2014-09-03 厦门大学 Microphone array speech enhancement device capable of suppressing mobile noise
US10482899B2 (en) * 2016-08-01 2019-11-19 Apple Inc. Coordination of beamformers for noise estimation and noise suppression
CN107017003B (en) * 2017-06-02 2020-07-10 厦门大学 Microphone array far-field speech enhancement device

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107316649A (en) * 2017-05-15 2017-11-03 百度在线网络技术(北京)有限公司 Audio recognition method and device based on artificial intelligence

Also Published As

Publication number Publication date
CN110875054A (en) 2020-03-10

Similar Documents

Publication Publication Date Title
CN110875054B (en) Far-field noise suppression method, device and system
JP4842583B2 (en) Method and apparatus for multisensory speech enhancement
US20030177007A1 (en) Noise suppression apparatus and method for speech recognition, and speech recognition apparatus and method
RU2642353C2 (en) Device and method for providing informed probability estimation and multichannel speech presence
JP4986393B2 (en) Method for determining an estimate for a noise reduction value
US8712074B2 (en) Noise spectrum tracking in noisy acoustical signals
Krueger et al. Model-based feature enhancement for reverberant speech recognition
EP1891627B1 (en) Multi-sensory speech enhancement using a clean speech prior
US8296135B2 (en) Noise cancellation system and method
Chang et al. Speech enhancement: new approaches to soft decision
CN111312275B (en) On-line sound source separation enhancement system based on sub-band decomposition
CN110176243B (en) Speech enhancement method, model training method, device and computer equipment
Saleem et al. Low rank sparse decomposition model based speech enhancement using gammatone filterbank and Kullback–Leibler divergence
Jiang et al. An improved unsupervised single-channel speech separation algorithm for processing speech sensor signals
EP2745293B1 (en) Signal noise attenuation
JP2003177770A (en) Method and device for calculating voice absence probability and method and device for noise elimination using these method and device
JP2008145923A (en) Speech signal section estimating device, method thereof, program thereof, and recording medium
Jeon et al. Mechanical noise suppression based on non-negative matrix factorization and multi-band spectral subtraction for digital cameras
CN103270772B (en) Signal handling equipment, signal processing method
Nakatani et al. Simultaneous denoising, dereverberation, and source separation using a unified convolutional beamformer
Malek et al. Speaker extraction using LCMV beamformer with DNN-based SPP and RTF identification scheme
Sehr et al. Model-based dereverberation in the Logmelspec domain for robust distant-talking speech recognition
KR101051035B1 (en) Wide Probability Based Wide Decision Method for Secondary Conditions for Speech Enhancement
Ji et al. Improved a priori SAP Estimator in Complex Noisy Environment for Dual Channel Microphone System.
Ince et al. Robust ego noise suppression of a robot

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 40024971

Country of ref document: HK

GR01 Patent grant
GR01 Patent grant