CN106816156B - Method and device for enhancing audio quality - Google Patents

Method and device for enhancing audio quality Download PDF

Info

Publication number
CN106816156B
CN106816156B CN201710064271.7A CN201710064271A CN106816156B CN 106816156 B CN106816156 B CN 106816156B CN 201710064271 A CN201710064271 A CN 201710064271A CN 106816156 B CN106816156 B CN 106816156B
Authority
CN
China
Prior art keywords
audio signal
signal
audio
sound source
energy spectrum
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710064271.7A
Other languages
Chinese (zh)
Other versions
CN106816156A (en
Inventor
张晨
张兴涛
孙学京
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Tuoling Inc
Original Assignee
Beijing Tuoling Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Tuoling Inc filed Critical Beijing Tuoling Inc
Priority to CN201710064271.7A priority Critical patent/CN106816156B/en
Publication of CN106816156A publication Critical patent/CN106816156A/en
Application granted granted Critical
Publication of CN106816156B publication Critical patent/CN106816156B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information

Abstract

The application relates to a method and a device for enhancing audio quality, wherein the method comprises the following steps: acquiring an audio signal with a preset format; preprocessing the audio signals, wherein the preprocessing comprises calculating an average signal of each audio signal in the audio signals and/or performing beamforming processing on the audio signals; and carrying out noise suppression processing on the audio signal based on the signal obtained by preprocessing to obtain the audio signal with enhanced tone quality. The method and the device for enhancing the audio quality can effectively improve the audio quality of the three-dimensional microphone array.

Description

Method and device for enhancing audio quality
Technical Field
The present application relates to the field of audio processing technologies, and in particular, to a method and an apparatus for enhancing audio quality.
Background
With the development of scientific technology, the pursuit of audio quality in various fields is higher and higher, and the objects of audio research are gradually transited from the original mono (mono) to stereo (stereo), surround (surround) and 3D (3-dimensional) audio. Unlike single-channel audio, multiple channels of audio are typically obtained through an array of microphones. For 3D audio, to pick up audio in various directions, a stereo microphone array is typically used, which can obtain three-dimensional information of the horizontal azimuth, vertical azimuth and distance of the sound source from the microphone array reference point.
In the prior art, the audio enhancement technology aiming at the linear microphone array and the planar microphone array can obtain effective effects. However, for a stereo microphone array, the prior art has not been able to achieve an effective audio enhancement effect.
Disclosure of Invention
The present application aims to provide a method and an apparatus for enhancing audio quality, which can effectively improve the audio quality of a stereo microphone array.
To achieve the above object, an aspect of the present application provides an audio quality enhancement method, including: acquiring an audio signal with a preset format; preprocessing the audio signals, wherein the preprocessing comprises calculating an average signal of each audio signal in the audio signals and/or performing beamforming processing on the audio signals; and carrying out noise suppression processing on the audio signal based on the signal obtained by preprocessing to obtain the audio signal with enhanced tone quality.
Further, when the preprocessing is to calculate an average signal of each audio signal in the audio signals, the step of performing noise suppression processing on the audio signals based on the preprocessed signals specifically includes: determining a noise energy spectrum and a signal energy spectrum corresponding to the audio signal according to the average signal; and according to the noise energy spectrum and the signal energy spectrum, carrying out noise suppression processing on the audio signal to obtain the audio signal with enhanced tone quality.
Further, when the preprocessing is to perform beamforming processing on the audio signal, the step of performing noise suppression processing on the audio signal based on a signal obtained by the preprocessing specifically includes: performing inner product processing on the audio signals by using a first guide vector and a second guide vector opposite to the first guide vector in direction to obtain a first path of audio signals and a first path of audio signals after the inner product processing; obtaining an audio signal of a preset position in the audio signal according to the first guide vector; determining a noise energy spectrum and a signal energy spectrum corresponding to the audio signal according to the first path of audio signal and the second path of audio signal after the inner product processing; and according to the noise energy spectrum and the signal energy spectrum, carrying out noise suppression processing on the first path of audio signal after the inner product processing to obtain an audio signal with enhanced tone quality.
Further, when the preprocessing is to calculate an average signal of each audio signal in the audio signals and perform beamforming processing on the audio signals, the step of performing noise suppression processing on the audio signals based on the signals obtained by the preprocessing specifically includes: carrying out inner product processing on the audio signal by utilizing a first guide vector to obtain an audio signal after inner product processing; obtaining an audio signal of a preset position in the audio signal according to the first guide vector; determining a noise energy spectrum and a signal energy spectrum corresponding to the audio signal according to the average signal; and according to the noise energy spectrum and the signal energy spectrum, carrying out noise suppression processing on the audio signal subjected to the inner product processing to obtain an audio signal with enhanced tone quality.
Further, when the preprocessing is to calculate an average signal of each audio signal in the audio signals and perform beamforming processing on the audio signals, the step of performing noise suppression processing on the audio signals based on the signals obtained by the preprocessing specifically includes: performing inner product processing on the audio signal by using a first guide vector and a second guide vector opposite to the first guide vector in direction to obtain a first path of audio signal and a second path of audio signal after the inner product processing; obtaining an audio signal of a preset position in the audio signal according to the first guide vector; determining a noise suppression factor corresponding to the audio signal according to the average signal and the first and second paths of audio signals after the inner product processing; and according to the noise suppression factor, performing noise suppression processing on the first path of audio signal after the inner product processing to obtain an audio signal with enhanced tone quality.
Further, before pre-processing the audio signal, the method further comprises: acquiring sound field parameters of the audio signal, wherein the sound field parameters comprise at least one of sound source orientation, sound source energy and sound source divergence.
Further, estimating the noise energy spectrum corresponding to the audio signal specifically includes: judging the magnitude between the sound source energy of the Z signal in the sound field parameters and a first threshold, and when the sound source energy of the Z signal in the sound field parameters is larger than the first threshold, estimating a noise energy spectrum corresponding to the audio signal by using a smoothing factor with a numerical value smaller than a second threshold; and when the sound source energy of the Z signal in the sound field parameters is smaller than or equal to the first threshold, estimating a noise energy spectrum corresponding to the audio signal by using a smoothing factor with a value larger than or equal to the second threshold.
Further, the performing the beamforming processing on the audio signal specifically includes: determining a target guide vector according to the sound source orientation in the sound field parameters; and performing inner product processing on the target steering vector and the audio signal to obtain a beam-formed audio signal.
Further, based on the signal obtained by the preprocessing, the noise suppression processing on the audio signal specifically includes: determining an adjusting factor for noise suppression processing according to the sound source divergence in the sound field parameters; and carrying out noise suppression processing on the audio signal according to the determined adjusting factor.
Further, determining an adjustment factor for performing noise suppression processing according to the sound source divergence in the sound field parameters specifically includes: judging the size between the sound source divergence degree in the sound field parameters and a third threshold value, and determining an adjusting factor with a numerical value larger than a fourth threshold value when the sound source divergence degree is larger than the third threshold value; determining an adjustment factor having a value less than or equal to the fourth threshold when the sound source divergence in the sound field parameters is less than or equal to the third threshold.
To achieve the above object, another aspect of the present application further provides an audio quality enhancement method, including: acquiring an audio signal with a preset format; and performing beam forming processing on the audio signal to obtain the audio signal with enhanced tone quality.
Further, the beamforming specifically includes: and carrying out inner product processing by combining the guide vector in the preset direction with the audio signal to obtain the audio signal enhanced in the preset direction.
Further, before pre-processing the audio signal, the method further comprises: acquiring sound field parameters of the audio signal, wherein the sound field parameters comprise at least one of sound source orientation, sound source energy and sound source divergence.
Further, the performing the beamforming processing on the audio signal specifically includes: determining a target guide vector according to the sound source orientation in the sound field parameters; and carrying out inner product processing by utilizing the target guide vector and the audio signal to obtain an audio signal enhanced in the target direction.
To achieve the above object, another aspect of the present application further provides an apparatus for audio quality enhancement, the apparatus comprising: the audio signal acquisition unit is used for acquiring an audio signal in a preset format; the preprocessing unit is used for preprocessing the audio signals, and the preprocessing comprises calculating average signals of all paths of audio signals in the audio signals and/or performing beam forming processing on the audio signals; and the noise suppression processing unit is used for carrying out noise suppression processing on the audio signal based on the signal obtained by preprocessing so as to obtain the audio signal with enhanced tone quality.
The method and the device for enhancing the audio quality provided by the embodiment of the invention can perform audio enhancement processing on the signal with the preset format, and further perform noise suppression processing and beam forming processing by combining sound field parameters (sound source direction, sound source energy and sound source divergence), so that the audio quality can be effectively improved, and the expected effect is achieved.
Drawings
FIG. 1 is a flow chart of a method for audio quality enhancement in one embodiment of the present application;
FIG. 2 is a diagram of four audio signals according to an embodiment of the present application;
FIG. 3 is a flow chart of a method of audio quality enhancement in another embodiment of the present application;
FIG. 4 is a flow chart of a method of audio quality enhancement in another embodiment of the present application;
FIG. 5 is a flow chart of a method of audio quality enhancement in another embodiment of the present application;
FIG. 6 is a flow chart of a method of audio quality enhancement in another embodiment of the present application;
FIG. 7 is a flow chart of a method of audio quality enhancement in another embodiment of the present application;
fig. 8 is a functional block diagram of an apparatus for audio quality enhancement according to an embodiment of the present application.
Detailed Description
In order to make those skilled in the art better understand the technical solutions in the present application, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only a part of the embodiments of the present application, and not all of the embodiments. All other embodiments obtained by a person of ordinary skill in the art without any inventive work based on the embodiments in the present application shall fall within the scope of protection of the present application.
Referring to fig. 1, an embodiment of the present application provides a method for audio quality enhancement, which includes the following steps.
S1: and acquiring an audio signal with a preset format.
In this embodiment, the audio signal in the preset format may be an audio signal in an Ambisonic a format. The audio signal of the Ambisonic A format is a four-way audio signal (LFU, RFD, LBD, RBU). The four audio signals may be as shown in fig. 2.
S2: and preprocessing the audio signals, wherein the preprocessing comprises calculating an average signal of each audio signal in the audio signals and/or performing beam forming processing on the audio signals.
In this embodiment, the audio signal in Ambisonic a format may be preprocessed, and the purpose of the preprocessing is to perform enhancement processing on the audio signal. Specifically, in this embodiment, the preprocessing manner may include calculating an average signal of each of the audio signals and/or performing beamforming processing on the audio signals.
Wherein, the average signal x of each audio signal in the audio signalave(n):
Figure BDA0001220339480000051
Wherein n is the label of the sampling point in the audio time domain signal, L is the frame length of the audio signal processing, xi(n) is the ith audioOf the time domain signal.
Beamforming process xbf(n):
Figure BDA0001220339480000052
Wherein θ is [0,360 ]]Azimuthal angle in range, piAnd (theta) is a guide vector in the theta direction.
When estimating the noise energy spectrum corresponding to the audio signal, an average signal of each path of audio signal in the audio signal may be calculated, and then a smoothing factor for estimating the noise energy spectrum may be determined according to the average signal. The smoothing factor may be represented, for example, by the following equation:
αs(λ,k)=αd+(1-αd)p(λ,k)
wherein λ represents the number of audio frames in the audio signal, k represents the number of frequency points in the audio signal, αs(λ, k) denotes the designated audio frame and the corresponding smoothing factor at the designated frequency point, αdAnd the smoothing coefficient is represented and takes a value of 0.85, and p (lambda, k) represents the corresponding average signal at the designated audio frame and the designated frequency point. Thus, different audio frames and frequency points can correspond to different smoothing factors, and the smoothing factors can be determined by the average signal.
In this embodiment, a noise energy spectrum corresponding to the audio signal may be estimated according to the smoothing factor. Specifically, the formula for estimating the noise energy spectrum may be as follows:
D(λ,k)=αs(λ,k)D(λ-1,k)+(1-αs(λ,k))|Y(λ,k)|2
where D (λ, k) represents the corresponding estimated noise energy spectrum at the specified audio frame and at the specified frequency point, and Y (λ, k) represents the audio amplitude at the specified audio frame and at the specified frequency point.
In this embodiment, please refer to fig. 3, a beamforming process may be performed on the audio signal. Specifically, a steering vector (steering vector) of a preset direction may be inner-product-processed with the audio signal, so that the audio signal may be enhanced in the preset direction. This effectively enhances the sound source in a particular direction.
In one embodiment of the present application, referring to fig. 4, the noise energy spectrum can be estimated in combination with the sound field parameters. Specifically, sound field parameters of the audio signal may be acquired, the sound field parameters including at least one of a sound source orientation (sound location), a sound source energy (sound power), and a sound source divergence (sound divergence). The sound field parameters may be acquired by a Direction of Arrival (DOA) method.
In this embodiment, the smoothing factor may have different values according to different audio frames and frequency points, and thus the actually used smoothing factor may be determined according to the magnitude between the sound source energy of the Z signal in the sound field parameter and the first threshold. Specifically, when the sound source energy of the Z signal in the sound field parameters is greater than the first threshold, a smoothing factor with a value smaller than a second threshold is used to estimate a noise energy spectrum corresponding to the audio signal; and when the sound source energy of the Z signal in the sound field parameters is smaller than or equal to the first threshold, estimating a noise energy spectrum corresponding to the audio signal by using a smoothing factor with a value larger than or equal to the second threshold. Specifically, if there are a plurality of smoothing factors smaller than the second threshold, any one of the smoothing factors may be used for the estimation. Similarly, if there are more than one smoothing factor greater than or equal to the second threshold, any one of the smoothing factors may be used for estimation. Specifically, the first threshold range is [0.3,0.6], and the second threshold range is [0.05,0.4 ].
Wherein, the Z signal is obtained according to the conversion matrix A:
Figure BDA0001220339480000071
wherein the transformation matrix A ═ a11a12a13a14]Element a of said A11,a12,......,a14Is constant and is determined by different sound source scenes.
The energy of the Z signal is
Figure BDA0001220339480000072
In this embodiment, please refer to fig. 5, a beam forming process can be performed in combination with the sound field parameters. Specifically, a target steering vector may be adaptively determined according to a sound source direction in the sound field parameters, and then an inner product process may be performed with the audio signal using the target steering vector to obtain a beamformed audio signal.
S3: and carrying out noise suppression processing on the audio signal based on the signal obtained by preprocessing to obtain the audio signal with enhanced tone quality.
In this embodiment, after the audio signal is preprocessed, the audio signal may be subjected to noise suppression processing, so as to obtain an audio signal with enhanced sound quality. Specifically, the noise suppression may be performed by a spectral subtraction method, or may be performed by a wiener filtering method. Both the spectral subtraction method and the wiener filtering method can be realized in a frequency domain. The noise suppression process may be performed in the entire frequency band or in the sub-bands.
In this embodiment, referring to fig. 6, after the audio signal is beamformed, noise suppression processing may be performed. Specifically, a first steering vector and a second steering vector opposite to the first steering vector in direction may be respectively used to perform inner product processing on the audio signal, so as to obtain a first channel of audio signal and a second channel of audio signal after the inner product processing; obtaining an audio signal of a preset position in the audio signal according to the first guide vector; then, the first path of audio signal and the second path of audio signal after the inner product processing can be respectively transformed into frequency domain signals, and noise suppression processing is performed in the frequency domain.
Specifically, the beamforming process is as follows:
Figure BDA0001220339480000081
wherein θ is [0,360 ]]Within the range ofAzimuth angle, pi(theta) is a guide vector in the theta direction, xiAnd (n) is the ith audio time domain signal.
The time domain signal is transformed into the frequency domain signal, which can be realized by Discrete Fourier Transform (DFT), Fast Fourier Transform (FFT) or Modified Discrete Cosine Transform (MDCT).
It should be noted that the present embodiment may also perform beamforming processing only on the audio signal. Specifically, the embodiment of the present application provides a method for enhancing audio quality, the method comprising:
acquiring an audio signal with a preset format;
performing beamforming processing on the audio signal, wherein the beamforming processing specifically includes:
and carrying out inner product processing on the audio signal by combining a guide vector in a preset direction so as to enhance the audio signal in the preset direction.
Referring to fig. 7, it is understood that noise suppression processing may be performed in combination with the sound field parameters. Specifically, a first steering vector and a second steering vector opposite to the first steering vector in direction may be respectively used to perform inner product processing on the audio signal, so as to obtain a first channel of audio signal and a second channel of audio signal after the inner product processing; obtaining an audio signal of a preset position in the audio signal according to the first guide vector; then, the first path of audio signal and the second path of audio signal after the inner product processing may be respectively converted into frequency domain signals, an adjustment factor for performing noise suppression processing may be determined according to a sound source divergence in the sound field parameters, and finally, noise suppression processing may be performed on the audio signals according to the determined adjustment factor. Specifically, in the step of determining an adjustment factor for performing noise suppression processing according to the sound source divergence in the sound field parameter, the size between the sound source divergence in the sound field parameter and a third threshold may be determined, and when the sound source divergence is greater than the third threshold, an adjustment factor having a value greater than a fourth threshold is determined; determining an adjustment factor having a value less than or equal to the fourth threshold when the sound source divergence in the sound field parameters is less than or equal to the third threshold. Specifically, the third threshold range is [0.3,0.5], and the fourth threshold range is [0.05,0.5 ].
Referring to fig. 8, an embodiment of the present application further provides an apparatus for enhancing audio quality, where the apparatus includes:
an audio signal acquiring unit 100 configured to acquire an audio signal in a preset format;
a preprocessing unit 200, configured to perform preprocessing on the audio signal, where the preprocessing includes calculating an average signal of each of the audio signals and/or performing beamforming on the audio signal;
and the noise suppression processing unit 300 is configured to perform noise suppression processing on the audio signal based on the signal obtained through the preprocessing, so as to obtain an audio signal with enhanced tone quality.
In an embodiment of the present application, the preprocessing unit 200 specifically includes:
the average signal calculation module is used for calculating the average signal of each path of audio signal in the audio signals;
a smoothing factor determination module for determining a smoothing factor for estimating a noise energy spectrum from the averaged signal;
and the estimation module is used for estimating the noise energy spectrum corresponding to the audio signal according to the smoothing factor.
The method and the device for enhancing the audio quality provided by the embodiment of the invention can perform audio enhancement processing on the signal with the preset format, and further perform noise suppression processing and beam forming processing by combining sound field parameters (sound source direction, sound source energy and sound source divergence), so that the audio quality can be effectively improved, and the expected effect is achieved.
The foregoing description of various embodiments of the present application is provided for the purpose of illustration to those skilled in the art. It is not intended to be exhaustive or to limit the invention to a single disclosed embodiment. As described above, various alternatives and modifications of the present application will be apparent to those skilled in the art to which the above-described technology pertains. Thus, while some alternative embodiments have been discussed in detail, other embodiments will be apparent or relatively easy to derive by those of ordinary skill in the art. This application is intended to cover all alternatives, modifications, and variations of the invention that have been discussed herein, as well as other embodiments that fall within the spirit and scope of the above-described application.

Claims (3)

1. A method of audio quality enhancement, the method comprising:
acquiring an audio signal with a preset format;
preprocessing the audio signals, wherein the preprocessing comprises calculating average signals of all paths of audio signals in the audio signals and performing beam forming processing on the audio signals;
based on the signal obtained by preprocessing, carrying out noise suppression processing on the audio signal to obtain an audio signal with enhanced tone quality;
the step of performing noise suppression processing on the audio signal based on the signal obtained by the preprocessing specifically includes:
carrying out inner product processing on the audio signal by utilizing a first guide vector to obtain an audio signal after inner product processing; obtaining an audio signal of a preset position in the audio signal according to the first guide vector;
determining a noise energy spectrum and a signal energy spectrum corresponding to the audio signal according to the average signal;
according to the noise energy spectrum and the signal energy spectrum, noise suppression processing is carried out on the audio signal after the inner product processing, and an audio signal with enhanced tone quality is obtained;
determining a smoothing factor for estimating a noise energy spectrum according to the average signal, and estimating a noise energy spectrum corresponding to the audio signal according to the smoothing factor;
prior to pre-processing the audio signal, the method further comprises:
acquiring sound field parameters of the audio signal, wherein the sound field parameters comprise at least one of sound source orientation, sound source energy and sound source divergence;
accordingly, estimating the noise energy spectrum corresponding to the audio signal specifically includes:
judging the magnitude between the sound source energy of the Z signal in the sound field parameters and a first threshold, and when the sound source energy of the Z signal in the sound field parameters is larger than the first threshold, estimating a noise energy spectrum corresponding to the audio signal by using a smoothing factor with a numerical value smaller than a second threshold;
when the sound source energy of the Z signal in the sound field parameters is smaller than or equal to the first threshold, adopting a smoothing factor with the value larger than or equal to the second threshold to estimate the noise energy spectrum corresponding to the audio signal;
correspondingly, based on the signal obtained by preprocessing, the noise suppression processing of the audio signal specifically includes:
determining an adjusting factor for noise suppression processing according to the sound source divergence in the sound field parameters;
carrying out noise suppression processing on the audio signal according to the determined adjusting factor;
wherein the Z signal is obtained from a product of the transformation matrix and the four audio signals.
2. The method of claim 1, wherein determining the adjustment factor for noise suppression according to the sound source divergence in the sound field parameters specifically comprises:
judging the size between the sound source divergence degree in the sound field parameters and a third threshold value, and determining an adjusting factor with a numerical value larger than a fourth threshold value when the sound source divergence degree is larger than the third threshold value;
determining an adjustment factor having a value less than or equal to the fourth threshold when the sound source divergence in the sound field parameters is less than or equal to the third threshold.
3. An apparatus for audio quality enhancement, the apparatus comprising:
the audio signal acquisition unit is used for acquiring an audio signal in a preset format;
the preprocessing unit is used for preprocessing the audio signals, and the preprocessing comprises calculating average signals of all paths of audio signals in the audio signals and performing beam forming processing on the audio signals;
the noise suppression processing unit is used for carrying out noise suppression processing on the audio signal based on the signal obtained by preprocessing to obtain an audio signal with enhanced tone quality;
the step of performing noise suppression processing on the audio signal based on the signal obtained by the preprocessing specifically includes:
carrying out inner product processing on the audio signal by utilizing a first guide vector to obtain an audio signal after inner product processing; obtaining an audio signal of a preset position in the audio signal according to the first guide vector;
determining a noise energy spectrum and a signal energy spectrum corresponding to the audio signal according to the average signal;
according to the noise energy spectrum and the signal energy spectrum, noise suppression processing is carried out on the audio signal after the inner product processing, and an audio signal with enhanced tone quality is obtained;
determining a smoothing factor for estimating a noise energy spectrum according to the average signal, and estimating a noise energy spectrum corresponding to the audio signal according to the smoothing factor;
before preprocessing the audio signal, the method further comprises:
acquiring sound field parameters of the audio signal, wherein the sound field parameters comprise at least one of sound source orientation, sound source energy and sound source divergence;
accordingly, estimating the noise energy spectrum corresponding to the audio signal specifically includes:
judging the magnitude between the sound source energy of the Z signal in the sound field parameters and a first threshold, and when the sound source energy of the Z signal in the sound field parameters is larger than the first threshold, estimating a noise energy spectrum corresponding to the audio signal by using a smoothing factor with a numerical value smaller than a second threshold;
when the sound source energy of the Z signal in the sound field parameters is smaller than or equal to the first threshold, adopting a smoothing factor with the value larger than or equal to the second threshold to estimate the noise energy spectrum corresponding to the audio signal;
correspondingly, based on the signal obtained by preprocessing, the noise suppression processing of the audio signal specifically includes:
determining an adjusting factor for noise suppression processing according to the sound source divergence in the sound field parameters;
carrying out noise suppression processing on the audio signal according to the determined adjusting factor;
wherein the Z signal is obtained from a product of the transformation matrix and the four audio signals.
CN201710064271.7A 2017-02-04 2017-02-04 Method and device for enhancing audio quality Active CN106816156B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710064271.7A CN106816156B (en) 2017-02-04 2017-02-04 Method and device for enhancing audio quality

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710064271.7A CN106816156B (en) 2017-02-04 2017-02-04 Method and device for enhancing audio quality

Publications (2)

Publication Number Publication Date
CN106816156A CN106816156A (en) 2017-06-09
CN106816156B true CN106816156B (en) 2020-06-30

Family

ID=59111991

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710064271.7A Active CN106816156B (en) 2017-02-04 2017-02-04 Method and device for enhancing audio quality

Country Status (1)

Country Link
CN (1) CN106816156B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107920303B (en) * 2017-11-21 2019-12-24 北京时代拓灵科技有限公司 Audio acquisition method and device
CN108520756B (en) * 2018-03-20 2020-09-01 北京时代拓灵科技有限公司 Method and device for separating speaker voice
GB2577885A (en) * 2018-10-08 2020-04-15 Nokia Technologies Oy Spatial audio augmentation and reproduction
CN113077787A (en) * 2020-12-22 2021-07-06 珠海市杰理科技股份有限公司 Voice data identification method, device, chip and readable storage medium

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1809105B (en) * 2006-01-13 2010-05-12 北京中星微电子有限公司 Dual-microphone speech enhancement method and system applicable to mini-type mobile communication devices
CN100524465C (en) * 2006-11-24 2009-08-05 北京中星微电子有限公司 A method and device for noise elimination
WO2010079526A1 (en) * 2009-01-06 2010-07-15 三菱電機株式会社 Noise cancellation device and noise cancellation program
CN102801861B (en) * 2012-08-07 2015-08-19 歌尔声学股份有限公司 A kind of sound enhancement method and device being applied to mobile phone
CN104065798B (en) * 2013-03-21 2016-08-03 华为技术有限公司 Audio signal processing method and equipment
US10186277B2 (en) * 2015-03-19 2019-01-22 Intel Corporation Microphone array speech enhancement

Also Published As

Publication number Publication date
CN106816156A (en) 2017-06-09

Similar Documents

Publication Publication Date Title
JP7011075B2 (en) Target voice acquisition method and device based on microphone array
CN106816156B (en) Method and device for enhancing audio quality
AU2012343907B2 (en) Apparatus and method for microphone positioning based on a spatial power density
US10694306B2 (en) Apparatus, method or computer program for generating a sound field description
EP2647221B1 (en) Apparatus and method for spatially selective sound acquisition by acoustic triangulation
EP3120355B1 (en) Noise suppression
US8213623B2 (en) Method to generate an output audio signal from two or more input audio signals
US8654990B2 (en) Multiple microphone based directional sound filter
US8947978B2 (en) System and method for estimating the direction of arrival of a sound
US20150310857A1 (en) Apparatus and method for providing an informed multichannel speech presence probability estimation
US10856094B2 (en) Method and device for sound source localization
US20220150657A1 (en) Apparatus, method or computer program for processing a sound field representation in a spatial transform domain
US10854210B2 (en) Device and method for capturing and processing a three-dimensional acoustic field
Corey et al. Motion-tolerant beamforming with deformable microphone arrays
Goetze et al. Direction of arrival estimation based on the dual delay line approach for binaural hearing aid microphone arrays
US20230319469A1 (en) Suppressing Spatial Noise in Multi-Microphone Devices
Tachioka et al. Direction of arrival estimation by cross-power spectrum phase analysis using prior distributions and voice activity detection information
CN108845292B (en) Sound source positioning method and device
WO2023148426A1 (en) Apparatus, methods and computer programs for enabling rendering of spatial audio
Zhang et al. A frequency domain approach for speech enhancement with directionality using compact microphone array.
Tomita et al. Quantitative evaluation of segregated signal with frequency domain binaural model

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant