CN110010117B - Voice active noise reduction method and device - Google Patents

Voice active noise reduction method and device Download PDF

Info

Publication number
CN110010117B
CN110010117B CN201910289962.6A CN201910289962A CN110010117B CN 110010117 B CN110010117 B CN 110010117B CN 201910289962 A CN201910289962 A CN 201910289962A CN 110010117 B CN110010117 B CN 110010117B
Authority
CN
China
Prior art keywords
voice
signal
microphone
noise
voice signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201910289962.6A
Other languages
Chinese (zh)
Other versions
CN110010117A (en
Inventor
胡书山
王鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hubei University
Original Assignee
Hubei University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hubei University filed Critical Hubei University
Priority to CN201910289962.6A priority Critical patent/CN110010117B/en
Publication of CN110010117A publication Critical patent/CN110010117A/en
Application granted granted Critical
Publication of CN110010117B publication Critical patent/CN110010117B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K11/00Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/16Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/175Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound
    • G10K11/178Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase
    • G10K11/1781Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase characterised by the analysis of input or output signals, e.g. frequency range, modes, transfer functions
    • G10K11/17821Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase characterised by the analysis of input or output signals, e.g. frequency range, modes, transfer functions characterised by the analysis of the input signals only
    • G10K11/17823Reference signals, e.g. ambient acoustic environment
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K11/00Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/16Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/175Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound
    • G10K11/178Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase
    • G10K11/1781Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase characterised by the analysis of input or output signals, e.g. frequency range, modes, transfer functions
    • G10K11/17821Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase characterised by the analysis of input or output signals, e.g. frequency range, modes, transfer functions characterised by the analysis of the input signals only
    • G10K11/17825Error signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K11/00Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/16Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/175Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound
    • G10K11/178Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase
    • G10K11/1787General system configurations
    • G10K11/17879General system configurations using both a reference signal and an error signal
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L2021/02082Noise filtering the noise being echo, reverberation of the speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02166Microphone arrays; Beamforming

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Soundproofing, Sound Blocking, And Sound Damping (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

The invention relates to a device applied to voice active noise reduction, which is characterized in that a first voice signal and current environment noise data are collected through a first microphone, and the first microphone is externally arranged in the voice active noise reduction device; acquiring a second voice signal through the second microphone, wherein the second microphone is arranged in the voice active noise reduction device and is symmetrical to the first microphone in position; acquiring a third voice signal by the third microphone, the third microphone being interposed between the first microphone and the second microphone; acquiring a fourth voice signal through a fourth microphone, the fourth microphone being interposed between the first microphone and the third microphone; and the voice active noise reduction device carries out self-adaptive synthesis on the first, second, third and fourth voice signals according to the current environmental noise data so as to actively reduce the noise of the voice.

Description

Voice active noise reduction method and device
Technical Field
The invention relates to the technical field of audio, in particular to a method and a device for voice active noise reduction.
Background
The active voice noise reduction device (such as a far-field sound box, an intelligent earphone, a headset and the like) is a brand-new technology, and a voice active noise reduction algorithm is applied to a conventional sound source, so that the purpose of reducing noise is achieved.
At present, when a voice active noise reduction device is used for playing audio or in a call (such as amazon ECHO), the surrounding environment is often unexpected, some are quite quiet, some are quite noisy, and even the surrounding noise is larger than 100 dB. In the prior art, in order to solve the problem, a traditional voice active noise reduction algorithm has been developed to eliminate noise; however, the technology is not suitable for any scene (especially noisy scene), and the voice quality is poor.
Disclosure of Invention
Therefore, it is necessary to provide a method and an apparatus for active noise reduction of speech for solving the problem of how to adaptively synthesize different noise environments in a manner of active noise reduction of speech in the prior art.
The embodiment of the invention provides a method for actively reducing noise of voice, which comprises the following steps:
acquiring a first voice signal and current environment noise data through the first microphone, wherein the first microphone is externally arranged in the voice active noise reduction device;
acquiring a second voice signal through the second microphone, wherein the second microphone is arranged in the voice active noise reduction device and is symmetrical to the first microphone in position;
acquiring a third voice signal by the third microphone, the third microphone being interposed between the first microphone and the second microphone;
acquiring a fourth voice signal through a fourth microphone, the fourth microphone being interposed between the first microphone and the third microphone;
and the voice active noise reduction device carries out self-adaptive synthesis on the first, second, third and fourth voice signals according to the current environmental noise data so as to form the voice active noise reduction self-adaptive synthesis voice active noise reduction device.
In one embodiment, the apparatus for adaptively performing active speech noise reduction by adaptively synthesizing speech for the first, second, third and fourth speech signals according to the current ambient noise data comprises:
and if the current environmental noise power is smaller than a first threshold value, using the first or the fourth voice signal as a voice input signal of the voice active noise reduction device.
In one embodiment, adaptively synthesizing the first, second, third and fourth speech signals to actively reduce noise in speech based on the current ambient noise data comprises:
if the current environmental noise power is greater than or equal to a first threshold value and less than or equal to a second threshold value, mixing the first voice signal with the second voice signal and/or the fourth voice signal according to a preset proportion, and taking the mixed voice signal as a voice input signal of the voice active noise reduction device, wherein the second threshold value is greater than the first threshold value.
In one embodiment, adaptively synthesizing the first, second, third and fourth speech signals to actively reduce noise in speech based on the current ambient noise data comprises:
if the current environmental noise power is larger than a second threshold value, estimating noise by using an adaptive filtering algorithm, and generating an audio signal with the same amplitude and opposite phase to the noise to offset the noise, wherein the third voice signal is used as an expected signal of the adaptive filter, the first voice signal is used as an input signal of the adaptive filter, an error signal is obtained through the adaptive filtering algorithm, the difference between the third voice signal and the error signal is the noise, and the audio signal with the same amplitude and opposite phase to the noise is generated to offset the noise;
and eliminating echo in the second voice signal by using an adaptive filtering algorithm, wherein the second voice signal is used as an expected signal of the adaptive filtering algorithm, the signal with the noise offset is used as an input signal of the adaptive filtering algorithm, an error signal generated by processing the second voice signal by using the adaptive filtering algorithm is used as the signal with the echo eliminated, and the signal with the echo eliminated and the fourth voice signal are mixed in proportion to generate a voice input signal of the voice active noise reduction device.
In one embodiment, the obtaining current voice active noise reduction device environmental noise data includes:
and estimating the environmental noise in the first voice signal by using an adaptive filtering algorithm, wherein the second voice signal is used as an input signal of the adaptive filtering algorithm, the first voice signal is used as a desired signal of the adaptive filtering algorithm, and an error signal output by the algorithm is estimated environmental noise data.
The embodiment of the present invention further provides a voice active noise reduction device, which includes a first microphone, a second microphone, a third microphone, a fourth microphone and a controller,
the first microphone is used for acquiring a first voice signal and current environment noise data, and the first microphone is externally arranged in the voice active noise reduction device;
the second microphone is used for acquiring a second voice signal, and the second microphone is arranged in the voice active noise reduction device and is symmetrical to the first microphone in position;
the third microphone is used for collecting a third voice signal, and the third microphone is arranged between the first microphone and the second microphone;
acquiring a fourth voice signal through a fourth microphone, the fourth microphone being interposed between the first microphone and the third microphone;
the controller is used for acquiring the environmental noise data of the current voice active noise reduction device, and carrying out self-adaptive synthesis on the first, second, third and fourth voice signals according to the current environmental noise data so as to actively reduce noise and self-adaptively synthesize the voice active noise reduction device.
In one embodiment, the controller is configured to adaptively synthesize the first, second, third and fourth speech signals according to the current environmental noise data to perform active speech noise reduction, and the apparatus includes:
and if the current environmental noise power is smaller than a first threshold value, using the first or the fourth voice signal as a voice input signal of the voice active noise reduction device.
In one embodiment, the controller is configured to adaptively synthesize the first, second, third and fourth speech signals according to the current ambient noise data to perform active speech noise reduction, and includes:
if the current environmental noise power is greater than or equal to a first threshold value and less than or equal to a second threshold value, mixing the first voice signal with the second voice signal and/or the fourth voice signal according to a preset proportion, and taking the mixed voice signal as a voice input signal of the voice active noise reduction device, wherein the second threshold value is greater than the first threshold value.
In one embodiment, the controller is configured to adaptively synthesize the first, second, third and fourth speech signals according to the current ambient noise data to perform active speech noise reduction according to the current ambient noise data, and includes:
estimating noise by using an adaptive filtering algorithm, generating an audio signal with the same amplitude and opposite phase to the noise to cancel the noise, wherein the third voice signal is used as a desired signal of an adaptive filter, the first voice signal is used as an input signal of the adaptive filter, an error signal is obtained by the adaptive filtering algorithm, the difference between the third voice signal and the error signal is the noise, and the audio signal with the same amplitude and opposite phase to the noise is generated to cancel the noise;
and eliminating echo in the second voice signal by using an adaptive filtering algorithm, wherein the second voice signal is used as an expected signal of the adaptive filtering algorithm, the signal with the noise offset is used as an input signal of the adaptive filtering algorithm, an error signal generated by processing the second voice signal by using the adaptive filtering algorithm is used as the signal with the echo eliminated, and the signal with the echo eliminated and the fourth voice signal are mixed in proportion to generate a voice input signal of the voice active noise reduction device.
In one embodiment, the controller is configured to obtain current voice active noise reduction device ambient noise data, and includes:
and estimating the environmental noise in the first voice signal by using an adaptive filtering algorithm, wherein the second voice signal is used as an input signal of the adaptive filtering algorithm, the first voice signal is used as a desired signal of the adaptive filtering algorithm, and an error signal output by the algorithm is estimated environmental noise data.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings of the embodiments can be obtained according to the drawings without creative efforts.
FIG. 1 is a flow diagram of a method for active noise reduction of speech in an embodiment;
fig. 2 is a schematic diagram of an internal structure of an active voice noise reduction device according to an embodiment.
Detailed Description
In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in detail below. In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention, and in order to provide a better understanding of the present invention. This invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete. This invention can be embodied in many different forms than those herein described and many modifications may be made by those skilled in the art without departing from the spirit of the invention.
Furthermore, the terms "first", "second" and "first" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defined as "first" or "second" may explicitly or implicitly include at least one such feature. In the description of the present invention, "a plurality" means at least two, e.g., two, three, etc., unless specifically limited otherwise. In the description of the present invention, "a plurality" means at least one, e.g., one, two, etc., unless specifically limited otherwise.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. As used herein, the term "and/or" includes any and all combinations of one or more of the associated listed items.
As shown in fig. 1, an embodiment of the present invention provides a method for active noise reduction of a voice, applied to an active noise reduction device of a voice, the active noise reduction device of a voice having a fourth microphone, a third microphone, a second microphone, and a first microphone, wherein the method includes:
s101, acquiring a first voice signal and current environment noise data through a first microphone, wherein the first microphone is externally arranged in the voice active noise reduction device;
s102, collecting a second voice signal through a second microphone, wherein the second microphone is arranged in the voice active noise reduction device and is symmetrical to the first microphone in position; s103, collecting a third voice signal through the third microphone, wherein the third microphone is arranged between the first microphone and the second microphone;
in one embodiment, the acquiring current ambient noise data may specifically be:
the adaptive filtering algorithm is used to estimate the environmental noise in the first speech signal, wherein the second speech signal is used as the input signal of the adaptive filtering algorithm, the first speech signal is used as the expected signal of the adaptive filtering algorithm, and the error signal e1 output by the algorithm is the estimated environmental noise data. Alternatively, the ambient noise power P may be used as the ambient noise data.
In the embodiment of the present invention, the external noise data needs to be collected in real time or at regular time to determine the current external noise scene, and optionally, the external noise scene may be defined as a plurality of scenes, such as quiet, ordinary noisy, comparatively noisy, and very noisy. For example, quiet noise size ranges are below 30 decibels, such as in office, home, etc. scenarios; the size range of the general noisy noise can be 30-70 decibels, such as in the scenes of shopping malls, supermarkets and the like; the range of the noise with noise is 70-90 decibels, such as high-speed rail and other scenes, and the noise with noise is more than 90 decibels, such as airplane take-off, concert and other scenes.
S104, collecting a fourth voice signal through a fourth microphone, wherein the fourth microphone is arranged between the first microphone and the third microphone; the fourth microphone is used as a supplement of the first microphone, and can form an external microphone array together with the first microphone to realize dual-channel voice input; furthermore, the fourth microphone may be arranged after the first microphone, and the noise interference may be low compared to the first microphone. In addition, the fourth microphone can be set to be sensitive to low frequency and insensitive to high frequency, and meanwhile, the first microphone can be set to be sensitive to high frequency and insensitive to low frequency, and the first microphone and the second microphone complement each other to realize that the input of the audio signal is more stable.
And S105, performing self-adaptive synthesis on the first, second, third and fourth voice signals according to the current environmental noise data to actively reduce noise of voice.
In the embodiment of the invention, an active adaptive synthesis voice input signal mode is creatively provided, by picking up voice signals at different positions and adaptively changing the voice input signal of a voice active noise reduction device according to different environmental noise data, and by mixing a first voice signal and a second voice signal, or mixing the first voice signal, the second voice signal and the third voice signal according to different modes, or mixing the first voice signal, the second voice signal, the third voice signal and the fourth voice signal according to different modes, the input mode of the voice signals is adaptively switched in different noise scenes (such as low noise, medium noise and high noise, or low noise, medium noise, high noise and ultrahigh noise) so as to realize the input of high-quality voice signals. The noise reduction, echo cancellation, high-quality voice pickup and normal communication under the high-decibel noise environment are achieved.
In one embodiment, adaptively synthesizing the first, second, third and fourth speech signals to actively reduce noise in speech based on the current ambient noise data comprises:
if the current environmental noise power is smaller than a first threshold (for example, 30 db), the first or the fourth voice signal is used as the voice input signal of the voice active noise reduction apparatus. (since the tone of the speech signal collected by the first/fourth MIC is better than that collected by the second microphone)
In one embodiment, adaptively synthesizing the first, second, third and fourth speech signals to actively reduce noise of speech according to the current ambient noise data includes:
if the current environmental noise power is greater than or equal to a first threshold value and less than or equal to a second threshold value (for example, 70 decibels), mixing the first voice signal with the second voice signal and/or the fourth voice signal according to a preset proportion, and taking the mixed voice signal as a voice input signal of the voice active noise reduction device, wherein the second threshold value is greater than the first threshold value. That is, in this case, the voice signal collected by the second microphone or the third MIC is low in noise but lacks in tone color, the voice signal collected by the first MIC is used to compensate for it, and the voice signals collected by the first and/or fourth MIC and the third MIC are mixed in a ratio such that the sound is relatively comfortable and natural.
In one embodiment, adaptively synthesizing the first, second, third and fourth speech signals to actively reduce noise in speech based on the current ambient noise data comprises:
if the current environmental noise power is larger than a second threshold value, estimating noise by using an adaptive filtering algorithm, and generating an audio signal with the same amplitude and opposite phase with the noise to cancel the noise, wherein the third voice signal is used as a desired signal of the adaptive filter, the first voice signal is used as an input signal of the adaptive filter, an error signal e2 is obtained through the adaptive filtering algorithm, the difference between the third voice signal and the error signal e2 is the noise, and the audio signal with the same amplitude and opposite phase with the noise is generated to cancel the noise;
and eliminating echo in the second voice signal by using an adaptive filtering algorithm, wherein the second voice signal is used as a desired signal of the adaptive filtering algorithm, the signal after noise cancellation is used as an input signal of the adaptive filtering algorithm, an error signal generated by processing the second voice signal by using the adaptive filtering algorithm is a signal e3 after echo cancellation, and the signal e3 after echo cancellation and the fourth voice signal are proportionally mixed to generate a voice input signal of the voice active noise reduction device.
The adaptive filtering algorithm is a voice active noise reduction algorithm, and voice active noise reduction is carried out through a voice active noise reduction algorithm ANC. The basic principle of an Active Noise Control (ANC) algorithm is that a Noise reduction system generates reverse sound waves equal to external Noise to neutralize the Noise. The ANC chip can be placed in the voice active noise reduction device or be externally placed on the voice active noise reduction device. In the embodiment of the present invention, algorithms such as an OMLSA (optimal-modified log-spectral amplitude speech estimator) and an mcra (minimum controlled recursive noise estimator) may be adopted to perform active voice noise reduction, which is not described in detail in the embodiment of the present invention.
The embodiment of the present invention further provides a voice active noise reduction device 20, in the embodiment of the present invention, the voice active noise reduction device 20 may be a sound box, such as a passive sound box, an active sound box, or a far-field sound box, and may also be an ear-plugging type, in-ear type, head-wearing type, or ear-hanging type earphone. The voice active noise reduction device can be communicated with a terminal in a wired or wireless (such as Bluetooth and infrared) mode, and data transmission is achieved. The terminal may be any terminal device including a mobile phone, a tablet computer, a PDA (Personal Digital Assistant), a Point of Sales (POS), a vehicle-mounted computer, a wearable device, and the like.
As shown in fig. 2, the active voice noise reduction apparatus includes: a voice active noise reduction device housing 21, a controller 22, a first microphone 23, a second microphone 24, a third microphone 25 and a fourth microphone 26 in the voice active noise reduction device housing 21, wherein the first microphone is externally arranged in the voice active noise reduction device, the second microphone is internally arranged in the voice active noise reduction device and is symmetrical to the first microphone in position, the third microphone is arranged between the first microphone and the second microphone, the fourth microphone is arranged between the first microphone and the third microphone, and the controller 22 is electrically connected with the first microphone 23, the second microphone 24, the third microphone 25 and the fourth microphone 26.
The first microphone 23 is used for collecting a first voice signal and current environmental noise;
the second microphone 24 is used for collecting a second voice signal;
the third microphone 25 is used for collecting a third voice signal;
the fourth microphone 26 is used for collecting a fourth voice signal;
the controller 22 is configured to obtain current environmental noise data of the active voice noise reduction device, and perform adaptive synthesis on the first, second, third, and fourth voice signals according to the current environmental noise data to perform active voice noise reduction adaptive synthesis.
In one embodiment, the controller 22 obtains the current ambient noise data, which may specifically be:
the adaptive filtering algorithm is used to estimate the environmental noise in the first speech signal, wherein the second speech signal is used as the input signal of the adaptive filtering algorithm, the first speech signal is used as the expected signal of the adaptive filtering algorithm, and the error signal e1 output by the algorithm is the estimated environmental noise data. Alternatively, the ambient noise power P may be used as the ambient noise data.
In the embodiment of the present invention, external noise data needs to be collected in real time or at regular time to determine the current external noise scene, and optionally, the external noise scene may be defined as three scenes, namely quiet, generally noisy, and relatively noisy. For example, quiet noise size ranges are below 30 decibels, such as in office, home, etc. scenarios; the size range of the general noisy noise can be 30-70 decibels, such as in downtown scenes of shopping malls, supermarkets and the like; the range of the noise with noise can be more than 70 decibels, such as scenes of takeoff of an airplane, high-speed rails, subways and the like.
In one embodiment, the controller 22 adaptively synthesizes the first, second, third and fourth speech signals according to the current environmental noise data to perform active speech noise reduction, and includes:
if the current environmental noise power is smaller than a first threshold (e.g. 40 db), the first or the fourth speech signal is used as the speech input signal. (since the tone of the voice signal collected by the first or fourth MIC is better than that collected by the second microphone of the voice active noise reduction device.)
In one embodiment, the controller 22 adaptively synthesizes the first, second, third and fourth speech signals according to the current environmental noise data and the current environmental noise data to perform active speech noise reduction, specifically:
if the current environmental noise power is greater than or equal to a first threshold value and less than or equal to a second threshold value (for example, 60 decibels), mixing the first voice signal with the second voice signal and/or the fourth voice signal according to a preset proportion, and taking the mixed voice signal as a voice input signal, wherein the second threshold value is greater than the first threshold value.
In one embodiment, the controller 22 adaptively synthesizes the first, second, third and fourth speech signals according to the current ambient noise data to perform active speech noise reduction according to the current ambient noise data, including:
if the current environmental noise power is larger than a second threshold value, estimating noise by using an adaptive filtering algorithm, and generating an audio signal with the same amplitude and opposite phase with the noise to cancel the noise, wherein the third voice signal is used as a desired signal of the adaptive filter, the first voice signal is used as an input signal of the adaptive filter, an error signal e2 is obtained through the adaptive filtering algorithm, the difference between the third voice signal and the error signal e2 is the noise, and the audio signal with the same amplitude and opposite phase with the noise is generated to cancel the noise;
and eliminating echo in the second voice signal by using an adaptive filtering algorithm, wherein the second voice signal is used as a desired signal of the adaptive filtering algorithm, the signal after noise cancellation is used as an input signal of the adaptive filtering algorithm, an error signal generated by processing the second voice signal by using the adaptive filtering algorithm is a signal e3 after echo cancellation, and the signal e3 after echo cancellation and the fourth voice signal are proportionally mixed to generate the voice active noise reduction device.
The adaptive filtering algorithm is a voice active noise reduction algorithm, and voice active noise reduction is carried out through a voice active noise reduction algorithm ANC. The basic principle of an Active Noise Control (ANC) algorithm is that a Noise reduction system generates reverse sound waves equal to external Noise to neutralize the Noise. The ANC chip can be placed in the earphone or be externally placed on the earphone, and the reference microphone is arranged on the earphone ear cover to collect the ambient noise. And (4) collecting residual noise after noise reduction treatment in an earphone by using an Error mic (Error Microphone). Speaker plays ANC processed anti-noise. In the embodiment of the present invention, algorithms such as an OMLSA (optimal-modified log-spectral amplitude speech estimator) and an mcra (minimum controlled recursive noise estimator) may be adopted to perform active voice noise reduction, which is not described in detail in the embodiment of the present invention.
The technical features of the embodiments described above may be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the embodiments described above are not described, but should be considered as being within the scope of the present specification as long as there is no contradiction between the combinations of the technical features. It should be noted that "in one embodiment," "for example," "as another example," and the like, are intended to illustrate the application and are not intended to limit the application.
The above-mentioned embodiments only express several embodiments of the present invention, and the description thereof is more specific and detailed, but not construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the inventive concept, which falls within the scope of the present invention. Therefore, the protection scope of the present patent shall be subject to the appended claims.

Claims (8)

1. A method for active noise reduction of speech, the method comprising:
acquiring a first voice signal and current environment noise data through a first microphone, wherein the first microphone is externally arranged in a voice active noise reduction device;
acquiring a second voice signal through a second microphone, wherein the second microphone is arranged in the voice active noise reduction device and is symmetrical to the first microphone in position;
acquiring a third voice signal by a third microphone, the third microphone being interposed between the first microphone and the second microphone;
acquiring a fourth voice signal through a fourth microphone, the fourth microphone being interposed between the first microphone and the third microphone;
the voice active noise reduction device carries out self-adaptive synthesis on the first, second, third and fourth voice signals according to the current environmental noise data so as to carry out voice active noise reduction;
wherein adaptively synthesizing the first, second, third and fourth speech signals to actively reduce noise in speech according to the current ambient noise data comprises:
and if the current environmental noise power is smaller than a first threshold value, using the first or the fourth voice signal as a voice input signal of the voice active noise reduction device.
2. The method of claim 1, wherein adaptively synthesizing the first, second, third and fourth speech signals for active speech noise reduction based on the current ambient noise data comprises:
if the current environmental noise power is greater than or equal to a first threshold value and less than or equal to a second threshold value, mixing the first voice signal with the second voice signal and/or the fourth voice signal according to a preset proportion, and taking the mixed voice signal as a voice input signal of the voice active noise reduction device, wherein the second threshold value is greater than the first threshold value.
3. The method of claim 2, wherein adaptively synthesizing the first, second, third and fourth speech signals for active speech noise reduction based on the current ambient noise data comprises:
if the current environmental noise power is larger than a second threshold value, estimating noise by using an adaptive filtering algorithm, and generating an audio signal with the same amplitude and opposite phase to the noise to offset the noise, wherein the third voice signal is used as an expected signal of the adaptive filter, the first voice signal is used as an input signal of the adaptive filter, an error signal is obtained through the adaptive filtering algorithm, the difference between the third voice signal and the error signal is the noise, and the audio signal with the same amplitude and opposite phase to the noise is generated to offset the noise;
and eliminating echo in the second voice signal by using an adaptive filtering algorithm, wherein the second voice signal is used as an expected signal of the adaptive filtering algorithm, the signal with the noise offset is used as an input signal of the adaptive filtering algorithm, an error signal generated by processing the second voice signal by using the adaptive filtering algorithm is used as the signal with the echo eliminated, and the signal with the echo eliminated and the fourth voice signal are mixed in proportion to generate a voice input signal of the voice active noise reduction device.
4. The method of any of claims 1-3, wherein the obtaining current ambient noise data comprises:
and estimating the environmental noise in the first voice signal by using an adaptive filtering algorithm, wherein the second voice signal is used as an input signal of the adaptive filtering algorithm, the first voice signal is used as a desired signal of the adaptive filtering algorithm, and an error signal output by the algorithm is estimated environmental noise data.
5. A voice active noise reduction device is characterized in that the voice active noise reduction device comprises a first microphone, a second microphone, a third microphone, a fourth microphone and a controller, wherein,
the first microphone is used for acquiring a first voice signal and current environment noise data, and the first microphone is externally arranged in the voice active noise reduction device;
the second microphone is used for acquiring a second voice signal, and the second microphone is arranged in the voice active noise reduction device and is symmetrical to the first microphone in position;
the third microphone is used for collecting a third voice signal, and the third microphone is arranged between the first microphone and the second microphone;
acquiring a fourth voice signal through a fourth microphone, the fourth microphone being interposed between the first microphone and the third microphone;
the controller is used for acquiring the environmental noise data of the current voice active noise reduction device, and performing self-adaptive synthesis on the first, second, third and fourth voice signals according to the current environmental noise data so as to perform voice active noise reduction self-adaptive synthesis;
wherein the controller is configured to adaptively perform adaptive synthesis on the first, second, third and fourth speech signals according to the current environmental noise data to perform active speech noise reduction, and the apparatus includes:
and if the current environmental noise power is smaller than a first threshold value, using the first or the fourth voice signal as a voice input signal of the voice active noise reduction device.
6. The active speech noise reduction device of claim 5, wherein the controller is configured to adaptively synthesize the first, second, third and fourth speech signals according to the current ambient noise data to reduce active speech noise, and comprises:
if the current environmental noise power is greater than or equal to a first threshold value and less than or equal to a second threshold value, mixing the first voice signal with the second voice signal and/or the fourth voice signal according to a preset proportion, and taking the mixed voice signal as a voice input signal of the voice active noise reduction device, wherein the second threshold value is greater than the first threshold value.
7. The active speech noise reduction device of claim 6, wherein the controller is configured to adaptively synthesize the first, second, third and fourth speech signals according to the current ambient noise data to reduce the active speech noise according to the current ambient noise data, and comprises:
estimating noise by using an adaptive filtering algorithm, generating an audio signal with the same amplitude and opposite phase to the noise to cancel the noise, wherein the third voice signal is used as a desired signal of an adaptive filter, the first voice signal is used as an input signal of the adaptive filter, an error signal is obtained by the adaptive filtering algorithm, the difference between the third voice signal and the error signal is the noise, and the audio signal with the same amplitude and opposite phase to the noise is generated to cancel the noise;
and eliminating echo in the second voice signal by using an adaptive filtering algorithm, wherein the second voice signal is used as an expected signal of the adaptive filtering algorithm, the signal with the noise offset is used as an input signal of the adaptive filtering algorithm, an error signal generated by processing the second voice signal by using the adaptive filtering algorithm is used as the signal with the echo eliminated, and the signal with the echo eliminated and the fourth voice signal are mixed in proportion to generate a voice input signal of the voice active noise reduction device.
8. The active speech noise reducer according to any one of claims 5-7, wherein the controller is configured to obtain current active speech noise reducer ambient noise data, and comprises:
and estimating the environmental noise in the first voice signal by using an adaptive filtering algorithm, wherein the second voice signal is used as an input signal of the adaptive filtering algorithm, the first voice signal is used as a desired signal of the adaptive filtering algorithm, and an error signal output by the algorithm is estimated environmental noise data.
CN201910289962.6A 2019-04-11 2019-04-11 Voice active noise reduction method and device Expired - Fee Related CN110010117B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910289962.6A CN110010117B (en) 2019-04-11 2019-04-11 Voice active noise reduction method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910289962.6A CN110010117B (en) 2019-04-11 2019-04-11 Voice active noise reduction method and device

Publications (2)

Publication Number Publication Date
CN110010117A CN110010117A (en) 2019-07-12
CN110010117B true CN110010117B (en) 2021-06-25

Family

ID=67171132

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910289962.6A Expired - Fee Related CN110010117B (en) 2019-04-11 2019-04-11 Voice active noise reduction method and device

Country Status (1)

Country Link
CN (1) CN110010117B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111464868A (en) * 2020-03-31 2020-07-28 深圳Tcl数字技术有限公司 Display terminal shutdown control method, device, equipment and readable storage medium
CN112102825B (en) * 2020-08-11 2021-11-26 湖北亿咖通科技有限公司 Audio processing method and device based on vehicle-mounted machine voice recognition and computer equipment
CN112420066B (en) * 2020-11-05 2024-05-14 深圳市卓翼科技股份有限公司 Noise reduction method, device, computer equipment and computer readable storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0720878A (en) * 1993-06-21 1995-01-24 Daikin Ind Ltd Control device of active muffler
CN101989423A (en) * 2009-07-30 2011-03-23 Nxp股份有限公司 Active noise reduction method using perceptual masking
CN104754436A (en) * 2015-03-13 2015-07-01 钰太芯微电子科技(上海)有限公司 Active noise reducing method and noise reducing earphone
CN106792315A (en) * 2017-01-05 2017-05-31 歌尔科技有限公司 A kind of method and apparatus and a kind of active noise reduction earphone for offsetting ambient noise

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1947642B1 (en) * 2007-01-16 2018-06-13 Apple Inc. Active noise control system
US9208769B2 (en) * 2012-12-18 2015-12-08 Apple Inc. Hybrid adaptive headphone

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0720878A (en) * 1993-06-21 1995-01-24 Daikin Ind Ltd Control device of active muffler
CN101989423A (en) * 2009-07-30 2011-03-23 Nxp股份有限公司 Active noise reduction method using perceptual masking
CN104754436A (en) * 2015-03-13 2015-07-01 钰太芯微电子科技(上海)有限公司 Active noise reducing method and noise reducing earphone
CN106792315A (en) * 2017-01-05 2017-05-31 歌尔科技有限公司 A kind of method and apparatus and a kind of active noise reduction earphone for offsetting ambient noise

Also Published As

Publication number Publication date
CN110010117A (en) 2019-07-12

Similar Documents

Publication Publication Date Title
CN110996209B (en) Active noise reduction method and system and earphone
US9922663B2 (en) Voice signal processing method and apparatus
US9749731B2 (en) Sidetone generation using multiple microphones
CN110933554B (en) Active noise reduction method and system and earphone
KR101482488B1 (en) Integrated psychoacoustic bass enhancement (pbe) for improved audio
CN110010117B (en) Voice active noise reduction method and device
US10475434B2 (en) Electronic device and control method of earphone device
CN100508536C (en) Filter coefficient setting device, filter coefficient setting method, and program
CN105612576A (en) Limiting active noise cancellation output
CN106716526A (en) Method and apparatus for enhancing sound sources
JP2011527025A (en) System and method for providing noise suppression utilizing nulling denoising
JP2006139307A (en) Apparatus having speech effect processing and noise control and method therefore
CN112399301B (en) Earphone and noise reduction method
CN112954530B (en) Earphone noise reduction method, device and system and wireless earphone
CN110856072A (en) Earphone conversation noise reduction method and earphone
CN109155802A (en) For generating the device of audio output
CN110972018B (en) Method and system for carrying out transparent transmission on earphone and earphone
JP2023551556A (en) Audio signal processing method and system for echo suppression
CN111801727A (en) Active Noise Cancellation (ANC) system with selectable sample rate
CN110012378B (en) Method for reducing noise of voice, earplug and computer storage medium
CN116208879A (en) Earphone with active noise reduction function and active noise reduction method
US10297245B1 (en) Wind noise reduction with beamforming
CN115250397A (en) TWS earphone and playing method and device thereof
CN113038318B (en) Voice signal processing method and device
CN110896512B (en) Noise reduction method and system for semi-in-ear earphone and semi-in-ear earphone

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20210625