CN116994596A - Howling suppression method and device, storage medium and electronic equipment - Google Patents

Howling suppression method and device, storage medium and electronic equipment Download PDF

Info

Publication number
CN116994596A
CN116994596A CN202311133870.1A CN202311133870A CN116994596A CN 116994596 A CN116994596 A CN 116994596A CN 202311133870 A CN202311133870 A CN 202311133870A CN 116994596 A CN116994596 A CN 116994596A
Authority
CN
China
Prior art keywords
howling
state variable
processed
variable filter
audio frame
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311133870.1A
Other languages
Chinese (zh)
Inventor
肖强
黄荣均
肖全之
刘帅
程子胜
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhuhai Jieli Technology Co Ltd
Original Assignee
Zhuhai Jieli Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhuhai Jieli Technology Co Ltd filed Critical Zhuhai Jieli Technology Co Ltd
Priority to CN202311133870.1A priority Critical patent/CN116994596A/en
Publication of CN116994596A publication Critical patent/CN116994596A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L21/0232Processing in the frequency domain
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/02Circuits for transducers, loudspeakers or microphones for preventing acoustic reaction, i.e. acoustic oscillatory feedback
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Quality & Reliability (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • General Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

The application discloses a howling suppression method, a howling suppression device, a storage medium and electronic equipment, wherein the howling suppression method can be used for acquiring an audio frame to be processed; detecting whether a howling frequency point exists in an audio frame to be processed; if yes, updating parameters of the state variable filter based on the howling frequency points; if not, judging whether the undetected howling frequency points exist in the audio frame to be processed or not based on the state variable filter, and correspondingly processing parameters of the state variable filter according to a judging result; and filtering the audio frame to be processed by using a state variable filter so as to inhibit the howling frequency points. The scheme can improve the effect of suppressing howling.

Description

Howling suppression method and device, storage medium and electronic equipment
Technical Field
The present application relates to the field of audio signal processing technologies, and in particular, to a howling suppression method and apparatus, a storage medium, and an electronic device.
Background
In general, howling often occurs when a speaker such as a sound box is used for speaker amplification, particularly when a microphone is used indoors as a sound pickup. The reason why howling occurs is that: when the reflected sound of the original sound and the generated echo pass through the microphone, the sound is picked up again by the microphone, so that the echo enters sound amplifying equipment such as sound equipment for amplification; and when the echo is the same as the original sound, the two sounds are overlapped and enhanced, the enhanced sound and the regenerated echo pass through the microphone again and enter loudspeaker equipment such as sound equipment to be amplified, so that the echo is enhanced again, and the repeated operation is repeated, so that strong and harsh howling is generated. Howling can not only produce signal distortion and harsh sounds but also affect voice communications, causing components to burn out even when the output exceeds the power of the speaker system. Meanwhile, when howling occurs, the signal output by the speaker generates a high-intensity oscillation phenomenon by single sound at a certain frequency, thereby causing a strong uncomfortable feeling to the user.
The traditional howling suppression method comprises a notch method, a frequency shift method, an adaptive feedback cancellation method and the like. Among them, the notch method is most widely used in practice. However, the notch method has the defects of insufficient attenuation of notch points and high computational complexity of the notch filter, so that the suppression effect on howling is not ideal.
Disclosure of Invention
The application provides a howling suppression method, a howling suppression device, a storage medium and electronic equipment, which can improve the howling suppression effect.
In a first aspect, the present application provides a howling suppression method, including:
acquiring an audio frame to be processed;
detecting whether howling frequency points exist in the audio frame to be processed;
if yes, updating parameters of a state variable filter based on the howling frequency points;
if not, judging whether the undetected howling frequency points exist in the audio frame to be processed or not based on the state variable filter, and correspondingly processing parameters of the state variable filter according to a judging result;
and filtering the audio frame to be processed by using the state variable filter so as to inhibit the howling frequency points.
In the howling suppression method provided by the application, if yes, updating parameters of a state variable filter based on the howling frequency point, including:
When detecting that a howling frequency point exists in an audio frame to be processed, judging whether the howling frequency point exists in a state variable filter or not;
if yes, increasing the attenuation gain of the state variable filter and reducing the bandwidth factor of the state variable filter;
if not, adding the howling frequency point into the state variable filter, and initializing the state variable filter.
In the howling suppression method provided by the present application, the initializing the state variable filter includes:
setting the attenuation gain of the state variable filter to 0 and setting the bandwidth factor of the state variable filter to a first preset value.
In the howling suppression method provided by the application, the corresponding processing of the parameters of the state variable filter according to the judgment result includes:
when the audio frame to be processed has undetected howling frequency points, a counter is started to count;
and correspondingly processing the parameters of the state variable filter according to the counting result.
In the howling suppression method provided by the present application, the performing corresponding processing on the parameters of the state variable filter according to the counting result includes:
When the continuous frame number of the audio frames to be processed of the undetected howling frequency points is larger than the preset frame number, increasing the bandwidth factor of the state variable filter;
and when the continuous frame number of the audio frames to be processed of the undetected howling frequency points is smaller than or equal to the preset frame number, maintaining the parameters of the state variable filter.
In the howling suppression method provided by the present application, the determining, based on the state variable filter, whether there are undetected howling frequency points in the audio frame to be processed includes:
comparing the howling frequency point in the state variable filter with the frequency point in the audio frame to be processed;
and judging whether undetected howling frequency points exist in the audio frame to be processed according to the comparison result.
In the howling suppression method provided by the present application, the detecting whether the howling frequency point exists in the audio frame to be processed includes:
acquiring a peak adjacent power ratio and/or a peak harmonic power ratio of the audio frame to be processed;
and detecting whether howling frequency points exist in the audio frame to be processed according to the peak adjacent power ratio and/or the peak harmonic power ratio.
In a second aspect, the present application provides a howling suppressing apparatus comprising:
An audio acquisition unit for acquiring an audio frame to be processed;
the howling detection unit is used for detecting whether howling frequency points exist in the audio frame to be processed;
the first updating unit is used for updating parameters of the state variable filter based on the howling frequency point when the howling frequency point exists in the audio frame to be processed;
the second updating unit is used for judging whether the howling frequency point which is not detected exists in the audio frame to be processed or not based on the state variable filter when the howling frequency point does not exist in the audio frame to be processed, and correspondingly processing the parameters of the state variable filter according to a judging result;
and the howling suppression unit is used for performing filtering processing on the audio frame to be processed by using the state variable filter so as to suppress the howling frequency points.
In a third aspect, the present application provides a storage medium storing a plurality of instructions adapted to be loaded by a processor to perform the howling suppression method of any one of the above.
In a fourth aspect, the present application provides an electronic device, including a memory, a processor, and a computer program stored on the memory and executable on the processor, wherein the processor implements any one of the above howling suppression methods when executing the computer program.
In summary, the howling suppression method provided by the embodiment of the present application may obtain an audio frame to be processed; detecting whether howling frequency points exist in the audio frame to be processed; if yes, updating parameters of a state variable filter based on the howling frequency points; if not, judging whether the undetected howling frequency points exist in the audio frame to be processed or not based on the state variable filter, and correspondingly processing parameters of the state variable filter according to a judging result; and filtering the audio frame to be processed by using the state variable filter so as to inhibit the howling frequency points. According to the method and the device, parameters of the state variable filter can be adjusted in real time according to the howling frequency points, and then the state variable filter after the parameters are adjusted is adopted to conduct filtering processing on the audio frames to be processed with the howling frequency points, so that the filtering calculation complexity is effectively reduced, and simultaneously, each howling frequency point is restrained in real time, and therefore the suppression effect on howling is improved.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings that are needed in the description of the embodiments will be briefly described below, it being obvious that the drawings in the following description are only some embodiments of the present application, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
Fig. 1 is a flow chart of a howling suppression method according to an embodiment of the present application.
Fig. 2 is another flow chart of the howling suppression method according to the embodiment of the present application.
Fig. 3 is a schematic structural diagram of a howling suppression apparatus according to an embodiment of the present application.
Fig. 4 is a schematic structural diagram of an electronic device according to an embodiment of the present application.
Detailed Description
Reference will now be made in detail to exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, the same numbers in different drawings refer to the same or similar elements, unless otherwise indicated. The implementations described in the following exemplary examples do not represent all implementations consistent with the application. Rather, they are merely examples of apparatus and methods consistent with aspects of the application as detailed in the accompanying claims.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, the element defined by the phrase "comprising one … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element, and furthermore, elements having the same name in different embodiments of the application may have the same meaning or may have different meanings, the particular meaning of which is to be determined by its interpretation in this particular embodiment or by further combining the context of this particular embodiment.
It should be understood that, although the steps in the flowcharts in the embodiments of the present application are shown in order as indicated by the arrows, these steps are not necessarily performed in order as indicated by the arrows. The steps are not strictly limited in order and may be performed in other orders, unless explicitly stated herein. Moreover, at least some of the steps in the figures may include multiple sub-steps or stages that are not necessarily performed at the same time, but may be performed at different times, the order of their execution not necessarily occurring in sequence, but may be performed alternately or alternately with other steps or at least a portion of the other steps or stages.
It should be noted that, in this document, step numbers such as 101 and 102 are used for the purpose of more clearly and briefly describing the corresponding contents, and not to constitute a substantial limitation on the sequence, and those skilled in the art may execute 102 first and then execute 101 when they are implemented, which is within the scope of the present application.
Reference herein to "an embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment may be included in at least one embodiment of the application. The appearances of such phrases in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments. Those of skill in the art will explicitly and implicitly appreciate that the embodiments described herein may be combined with other embodiments.
In the following description, suffixes such as "module", "part" or "unit" for representing elements are used only for facilitating the description of the present application, and have no specific meaning per se. Thus, "module," "component," or "unit" may be used in combination.
In the description of the present application, it should be noted that the directions or positional relationships indicated by the terms "upper", "lower", "left", "right", "inner", "outer", "middle", etc. are based on the directions or positional relationships shown in the drawings, are merely for convenience of describing the present application and simplifying the description, and do not indicate or imply that the apparatus or elements referred to must have a specific orientation, be constructed and operated in a specific orientation, and thus should not be construed as limiting the present application. Furthermore, the terms "first," "second," and the like are used for descriptive purposes only and are not to be construed as indicating or implying relative importance.
The traditional howling suppression method comprises a notch method, a frequency shift method, an adaptive feedback cancellation method and the like. Among them, the notch method is most widely used in practice. However, the notch method has the defects of insufficient attenuation of notch points and high computational complexity of the notch filter, so that the suppression effect on howling is not ideal.
Based on this, the embodiment of the application provides a howling suppression method, a howling suppression device, a storage medium and an electronic device, wherein the howling suppression device can be integrated in the electronic device, and the electronic device can be a server or a terminal and other devices; the terminal can comprise a mobile phone, a wearable intelligent device, a tablet computer, a notebook computer, a personal computer (PC, personal Computer) and the like; the server may be a single server, may be a server cluster composed of a plurality of servers, and may be an entity server or a virtual server.
The technical schemes shown in the application will be respectively described in detail through specific examples. The following description of the embodiments is not intended to limit the priority of the embodiments.
Referring to fig. 1, fig. 1 is a flowchart illustrating a howling suppression method according to an embodiment of the application. The specific flow of the howling suppression method can be as follows:
101. and obtaining the audio frame to be processed.
In some embodiments, the audio signal may be acquired by a microphone or other manner of the electronic device, and then the audio signal is subjected to framing processing, so as to obtain an audio frame to be processed. It can be understood that the specific acquisition mode of the audio signal can be determined according to the actual use requirement, and the embodiment of the application is not limited.
The framing process may be to frame the audio signal according to a fixed time length, and then encapsulate each frame of data, so as to obtain at least one audio frame to be processed. Each audio frame to be processed may include the entire audio frequency band or may include a portion of the audio frequency band. In a practical scenario, the howling phenomenon may occur in a frame of audio to be processed, or may occur in multiple frames of audio to be processed that are continuous or segmented.
102. And detecting whether howling frequency points exist in the audio frames to be processed.
In some embodiments, the audio frame to be processed may be transformed from a time domain space to a frequency domain space, thereby determining whether howling frequency points are present based on the audio frame to be processed in the frequency domain space. Specifically, the audio frame to be processed may be subjected to fourier transform or windowed processing and then fourier transform to obtain a spectrum signal corresponding to the audio frame to be processed, where the spectrum signal is the audio frame to be processed in the frequency domain space.
It is understood that the spectral signal corresponding to the audio frame to be processed includes a plurality of frequency bins. In the practical application process, whether a frequency point with howling occurs in the frequency spectrum signal can be determined by analyzing a plurality of frequency points of the frequency spectrum signal, namely whether the howling frequency point exists in an audio frame to be processed or not is determined, and then the howling frequency point can be directly restrained, so that the instantaneity and the accuracy of the howling restraining are improved.
Because the power or energy of the howling frequency point is very high and far exceeds the power or energy of other audio or noise frequency points, the howling frequency point can be obtained by analyzing the power or energy of each frequency point in the frequency spectrum signal and the like. For example, the howling frequency bin of the spectral signal may be detected based on a Peak-to-Neighboring Power Raio (PNPR), and/or a Peak-to-Harmonics Power Raio (PHPR), peak-to-harmonic power ratio (Peak-to-Harmonics Power Raio), to determine whether the howling frequency bin is present in the audio frame to be processed. In this embodiment, a specific acquisition mode of the howling frequency point is not limited.
103. If the howling frequency point exists in the audio frame to be processed, updating parameters of the state variable filter based on the howling frequency point.
It will be appreciated that no one procedure is perfectly possible to set. Therefore, even if the program setting for detecting the howling frequency point is nearly complete, there is a possibility that a miss occurs in detecting the presence of the howling frequency point in the audio frame to be processed. In some embodiments, the audio frame to be processed may be further processed by a state variable filter to reduce undetected howling frequency points.
In the embodiment of the application, a preset number of historical howling frequency points are prestored in the state variable filter. It can be understood that the historical howling frequency point may be the howling frequency point detected in the previous howling suppression process, or may be a howling frequency point set according to actual experience.
The parameters of the state variable filter comprise attenuation gain, bandwidth factor Q, frequency mapping g, gain A, first filter coefficient R, second filter coefficient beta, third filter coefficient alpha, high-pass gain chp, low-pass gain clp, band-pass gain cbp and the like. The specific expression of the parameters is as follows:
g=tan(fc*π/fs)
beta=g+2*R
chp=1
clp=1
where fc is a howling frequency point and fs is a sampling rate. As can be seen from the above expression, most of the parameters of the state variable filter can be adjusted by adjusting the attenuation gain and the bandwidth factor Q.
Specifically, when the howling frequency point exists in the audio frame to be processed, judging whether the state variable filter has the howling frequency point or not; if yes, increasing the attenuation gain of the state variable filter and reducing the bandwidth factor of the state variable filter; if not, adding the howling frequency point into the state variable filter, and initializing the state variable filter.
The initializing the state variable filter may specifically be setting the attenuation gain of the state variable filter to 0, and setting the bandwidth factor of the state variable filter to a first preset value, so as to adjust other parameters of the state variable filter.
In the embodiment of the present application, the first preset value is a minimum value of the bandwidth factor.
Before adding the howling frequency points to the state variable filter, it is necessary to determine whether the number of howling frequency points stored in the state variable filter reaches a maximum value, so that corresponding processing is performed according to a determination result.
Specifically, when the number of howling frequency points stored in the state variable filter does not reach the maximum value, the howling frequency points can be directly added to the state variable filter, and then the state variable filter is initialized. When the number of howling frequency points stored in the state variable filter reaches the maximum value, the maximum value of the bandwidth factor of the state variable filter can be searched, the maximum value of the bandwidth factor is replaced, and then the state variable filter is initialized.
104. If the howling frequency points do not exist in the audio frame to be processed, judging whether the howling frequency points which are not detected exist in the audio frame to be processed or not based on the state variable filter, and correspondingly processing parameters of the state variable filter according to a judging result.
In some embodiments, when there is an undetected howling frequency point in the audio frame to be processed, the howling frequency point in the state variable filter may be compared with the frequency point in the audio frame to be processed, so as to determine whether there is an undetected howling frequency point in the audio frame to be processed according to the comparison result.
It can be appreciated that comparing the howling frequency point in the state variable filter with the frequency point in the audio frame to be processed is to determine whether the audio frame to be processed has the same frequency point as the howling frequency point in the state variable filter.
In the embodiment of the application, when undetected howling frequency points exist in the audio frame to be processed, a counter is started for counting; and correspondingly processing parameters of the state variable filter according to the counting result.
Specifically, when the number of continuous frames of the audio frames to be processed of the undetected howling frequency points is greater than a preset number of frames, increasing the bandwidth factor of the state variable filter; when the number of continuous frames of the audio frames to be processed of the howling frequency points which are not detected is smaller than or equal to the preset number of frames, maintaining the parameters of the state variable filter.
The preset frame number can be set according to actual requirements. In the embodiment of the present application, the preset frame number is 4 frames.
In the embodiment of the application, the step of increasing the attenuation gain is gain max /(time fs), step of increasing bandwidth factor is (Q) max -Q min ) /(time fs), decreasing the step of bandwidth factor to (Q) min -Q max )/(time*fs)。
Wherein, gain max For maximum attenuation gain, time is the state variable filter fade-in fade-out time, and fs is the sampling rate. Q (Q) max Maximum value of bandwidth factor, Q min Is the minimum of the bandwidth factor.
105. And filtering the audio frame to be processed by using a state variable filter so as to inhibit the howling frequency points.
It can be understood that the howling frequency point in the audio frame to be processed can be suppressed by performing filtering processing on the audio frame to be processed by using the state variable filter after the parameters are adjusted.
In summary, the howling suppression method provided in the present embodiment may obtain an audio frame to be processed; detecting whether a howling frequency point exists in an audio frame to be processed; if yes, updating parameters of the state variable filter based on the howling frequency points; if not, judging whether the undetected howling frequency points exist in the audio frame to be processed or not based on the state variable filter, and correspondingly processing parameters of the state variable filter according to a judging result; and filtering the audio frame to be processed by using a state variable filter so as to inhibit the howling frequency points. According to the method and the device, parameters of the state variable filter can be adjusted in real time according to the howling frequency points, and then the state variable filter after the parameters are adjusted is adopted to conduct filtering processing on the audio frames to be processed with the howling frequency points, so that the filtering calculation complexity is effectively reduced, and simultaneously, each howling frequency point is restrained in real time, and therefore the suppression effect on howling is improved.
In addition, the howling frequency point detection and suppression are carried out on the audio frame to be processed through the state variable filter, other analog circuits are not required to be additionally relied on, and the cost is low.
The embodiment of the application also provides another howling suppression method, which is further described. Wherein the meaning of the noun is the same as that in the above-mentioned howling suppression method, and specific implementation details can be referred to the description in the above-mentioned howling suppression method.
201. And obtaining the audio frame to be processed.
In some embodiments, the audio signal may be acquired by a microphone or other manner of the electronic device, and then the audio signal is subjected to framing processing, so as to obtain an audio frame to be processed. It can be understood that the specific acquisition mode of the audio signal can be determined according to the actual use requirement, and the embodiment of the application is not limited.
202. A peak adjacent power ratio and/or a peak harmonic power ratio of the audio frame to be processed is obtained.
The peak adjacent power ratio refers to the power ratio of the frequency points adjacent to the first candidate howling frequency point. Specifically, a first peak point of the spectrum signal corresponding to the audio frame to be processed may be obtained, and the first peak point is used as a first candidate howling frequency point.
The peak harmonic power ratio refers to the power ratio of the power of the second candidate howling frequency point to the power of the harmonic thereof. Specifically, a second peak point of the spectrum signal corresponding to the audio frame to be processed may be obtained, and the second peak point is used as a second candidate howling frequency point.
203. And detecting whether howling frequency points exist in the audio frame to be processed according to the peak adjacent power ratio and/or the peak harmonic power ratio.
In some embodiments, after determining the first candidate howling frequency point according to the peak adjacent power ratio, it may be further determined whether the power of the first candidate howling frequency point is higher than the powers of the left and right M adjacent frequency points, and if the condition is satisfied, the first candidate howling frequency point is determined to be a howling frequency point. Wherein M can be set according to the need, and preferably M is 5. The specific expression of the peak adjacent power ratio is as follows:
wherein PNPR (k, w) is peak adjacent power ratio, |Y (k, w) | 2 Power of the first candidate howling frequency point, |y (k, w+w) i )| 2 And the adjacent frequency point power of the first candidate howling frequency point is obtained.
In another embodiment, the spectral signal of the speech audio signal has harmonic peaks, whereas the spectral signal corresponding to howling noise does not contain harmonic peaks. Therefore, the peak harmonic power ratio of the second candidate howling frequency point can be compared with a preset threshold value, so that whether the second candidate howling frequency point is the howling frequency point or not can be determined, wherein the preset threshold value can be set according to requirements. It can be understood that when the peak harmonic power of the second candidate howling frequency point is greater than the preset threshold, the second candidate howling frequency point is determined to be the howling frequency point. Wherein, the expression of the peak harmonic power ratio is as follows:
Wherein PHPR (k, w) is peak harmonic power ratio, |Y (k, w) | 2 Power of second candidate howling frequency point, |y (k, mw) | 2 And the power of the harmonic frequency point of the second candidate howling frequency point.
In the practical application process, other ways can be adopted to detect whether the howling frequency point exists in the audio frame to be processed. Such as peak-to-average power ratio, inter-frame peak hold, and inter-frame amplitude slope deviation, the specific implementation of which will not be described in detail herein.
In the embodiment of the present application, the howling frequency point may be determined according to any one of the peak-to-average power ratio, the inter-frame peak hold, and the inter-frame amplitude slope deviation, or may be determined by combining two or more methods, which is not limited herein.
It can be appreciated that the more ways to determine the howling frequency points, the higher the accuracy of detecting the howling frequency points.
204. If the howling frequency point exists in the audio frame to be processed, updating parameters of the state variable filter based on the howling frequency point.
For specific implementation details, reference may be made to the description of the howling suppression method, which is not described herein.
205. If the howling frequency points do not exist in the audio frame to be processed, judging whether the howling frequency points which are not detected exist in the audio frame to be processed or not based on the state variable filter, and correspondingly processing parameters of the state variable filter according to a judging result.
For specific implementation details, reference may be made to the description of the howling suppression method, which is not described herein.
206. And filtering the audio frame to be processed by using a state variable filter so as to inhibit the howling frequency points.
For specific implementation details, reference may be made to the description of the howling suppression method, which is not described herein.
In summary, the howling suppression method provided in the present embodiment may obtain an audio frame to be processed; acquiring a peak adjacent power ratio and/or a peak harmonic power ratio of an audio frame to be processed; detecting whether a howling frequency point exists in the audio frame to be processed according to the peak adjacent power ratio and/or the peak harmonic power ratio; if yes, updating parameters of the state variable filter based on the howling frequency points; if not, judging whether the undetected howling frequency points exist in the audio frame to be processed or not based on the state variable filter, and correspondingly processing parameters of the state variable filter according to a judging result; and filtering the audio frame to be processed by using a state variable filter so as to inhibit the howling frequency points. According to the method and the device, parameters of the state variable filter can be adjusted in real time according to the howling frequency points, and then the state variable filter after the parameters are adjusted is adopted to conduct filtering processing on the audio frames to be processed with the howling frequency points, so that the filtering calculation complexity is effectively reduced, and simultaneously, each howling frequency point is restrained in real time, and therefore the suppression effect on howling is improved.
In addition, the howling frequency point detection and suppression are carried out on the audio frame to be processed through the state variable filter, other analog circuits are not required to be additionally relied on, and the cost is low.
In order to facilitate better implementation of the howling suppression method provided by the embodiment of the application, the embodiment of the application also provides a howling suppression device. Wherein the meaning of the noun is the same as in the howling suppression method described above, specific implementation details may be referred to the description in the method embodiment.
Referring to fig. 3, fig. 3 is a schematic structural diagram of a howling suppression device according to an embodiment of the application. The howling suppression apparatus may include an audio acquisition unit 201, a howling detection unit 202, a first update unit 203, a second update unit 204, and a howling suppression unit 205. Wherein, the liquid crystal display device comprises a liquid crystal display device,
an audio acquisition unit 201 for acquiring an audio frame to be processed;
a howling detection unit 202, configured to detect whether a howling frequency point exists in an audio frame to be processed;
a first updating unit 203, configured to update parameters of the state variable filter based on the howling frequency point when the howling frequency point exists in the audio frame to be processed;
a second updating unit 204, configured to determine, based on the state variable filter, whether there is an undetected howling frequency point in the audio frame to be processed when there is no howling frequency point in the audio frame to be processed, and perform corresponding processing on parameters of the state variable filter according to a determination result;
The howling suppression unit 205 is configured to perform filtering processing on the audio frame to be processed by using the state variable filter, so as to suppress the howling frequency point.
The specific embodiments of the above units may be referred to the above embodiments of the howling suppression method, and will not be described herein in detail.
In summary, the howling suppression device provided by the embodiment of the present application may acquire an audio frame to be processed through the audio acquisition unit 201; detecting whether a howling frequency point exists in the audio frame to be processed by the howling detection unit 202; when a howling frequency point exists in the audio frame to be processed, the first updating unit 203 updates parameters of the state variable filter based on the howling frequency point; when no howling frequency point exists in the audio frame to be processed, the second updating unit 204 judges whether the undetected howling frequency point exists in the audio frame to be processed or not based on the state variable filter, and carries out corresponding processing on parameters of the state variable filter according to a judging result; the audio frame to be processed is subjected to filtering processing by the howling suppression unit 205 using a state variable filter to suppress howling frequency points. According to the method and the device, parameters of the state variable filter can be adjusted in real time according to the howling frequency points, and then the state variable filter after the parameters are adjusted is adopted to conduct filtering processing on the audio frames to be processed with the howling frequency points, so that the filtering calculation complexity is effectively reduced, and simultaneously, each howling frequency point is restrained in real time, and therefore the suppression effect on howling is improved.
In addition, the howling frequency point detection and suppression are carried out on the audio frame to be processed through the state variable filter, other analog circuits are not required to be additionally relied on, and the cost is low.
The embodiment of the present application further provides an electronic device, in which the howling suppression device according to the embodiment of the present application may be integrated, as shown in fig. 4, which shows a schematic structural diagram of the electronic device according to the embodiment of the present application, specifically:
the electronic device may include Radio Frequency (RF) circuitry 601, memory 602 including one or more computer readable storage media, input unit 603, display unit 604, sensor 605, audio circuitry 606, wireless fidelity (WiFi, wireless Fidelity) module 607, processor 608 including one or more processing cores, and power supply 609. Those skilled in the art will appreciate that the electronic device structure shown in fig. 4 is not limiting of the electronic device and may include more or fewer components than shown, or may combine certain components, or may be arranged in different components.
Wherein:
the RF circuit 601 may be used for receiving and transmitting signals during a message or a call, and in particular, after receiving downlink information of a base station, the downlink information is processed by one or more processors 608; in addition, data relating to uplink is transmitted to the base station. Typically, RF circuitry 601 includes, but is not limited to, an antenna, at least one amplifier, a tuner, one or more oscillators, a subscriber identity module (SIM, subscriber Identity Module) card, a transceiver, a coupler, a low noise amplifier (LNA, low Noise Amplifier), a duplexer, and the like. In addition, the RF circuitry 601 may also communicate with networks and other devices through wireless communications. The wireless communication may use any communication standard or protocol including, but not limited to, global system for mobile communications (GSM, global System of Mobile communication), general packet radio service (GPRS, general Packet Radio Service), code division multiple access (CDMA, code Division Multiple Access), wideband code division multiple access (WCDMA, wideband Code Division Multiple Access), long term evolution (LTE, long Term Evolution), email, short message service (SMS, short Messaging Service), and the like.
The memory 602 may be used to store software programs and modules, and the processor 608 may execute various functional applications and information processing by executing the software programs and modules stored in the memory 602. The memory 602 may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program (such as a sound playing function, an image playing function, etc.) required for at least one function, and the like; the storage data area may store data created according to the use of the electronic device (such as audio data, phonebooks, etc.), and the like. In addition, the memory 602 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid-state storage device. Accordingly, the memory 602 may also include a memory controller to provide access to the memory 602 by the processor 608 and the input unit 603.
The input unit 603 may be used to receive input numeric or character information and to generate keyboard, mouse, joystick, optical or trackball signal inputs related to user settings and function control. In particular, in one particular embodiment, the input unit 603 may include a touch-sensitive surface, as well as other input devices. The touch-sensitive surface, also referred to as a touch display screen or a touch pad, may collect touch operations thereon or thereabout by a user (e.g., operations thereon or thereabout by a user using any suitable object or accessory such as a finger, stylus, etc.), and actuate the corresponding connection means according to a predetermined program. Alternatively, the touch-sensitive surface may comprise two parts, a touch detection device and a touch controller. The touch detection device detects the touch azimuth of a user, detects a signal brought by touch operation and transmits the signal to the touch controller; the touch controller receives touch information from the touch detection device and converts it into touch point coordinates, which are then sent to the processor 608, and can receive commands from the processor 608 and execute them. In addition, touch sensitive surfaces may be implemented in a variety of types, such as resistive, capacitive, infrared, and surface acoustic waves. The input unit 603 may comprise other input devices in addition to a touch sensitive surface. In particular, other input devices may include, but are not limited to, one or more of a physical keyboard, function keys (such as volume control keys, switch keys, etc.), a trackball, mouse, joystick, etc.
The display unit 604 may be used to display information entered by a user or provided to a user as well as various graphical user interfaces of the electronic device, which may be composed of graphics, text, icons, video, and any combination thereof. The display unit 604 may include a display panel, which may be optionally configured in the form of a liquid crystal display (LCD, liquid Crystal Display), an Organic Light-Emitting Diode (OLED), or the like. Further, the touch-sensitive surface may overlay a display panel, and upon detection of a touch operation thereon or thereabout, the touch-sensitive surface is passed to the processor 608 to determine the type of touch event, and the processor 608 then provides a corresponding visual output on the display panel based on the type of touch event. Although in fig. 4 the touch sensitive surface and the display panel are implemented as two separate components for input and output functions, in some embodiments the touch sensitive surface may be integrated with the display panel to implement the input and output functions.
The electronic device may also include at least one sensor 605, such as a light sensor, a motion sensor, and other sensors. In particular, the light sensor may include an ambient light sensor that may adjust the brightness of the display panel according to the brightness of ambient light, and a proximity sensor that may turn off the display panel and/or backlight when the electronic device is moved to the ear. As one of the motion sensors, the gravity acceleration sensor can detect the acceleration in all directions (generally three axes), and can detect the gravity and the direction when the mobile phone is stationary, and can be used for applications of recognizing the gesture of the mobile phone (such as horizontal and vertical screen switching, related games, magnetometer gesture calibration), vibration recognition related functions (such as pedometer and knocking), and the like; other sensors such as gyroscopes, barometers, hygrometers, thermometers, infrared sensors, etc. that may also be configured with the electronic device are not described in detail herein.
Audio circuitry 606, speakers, and a microphone may provide an audio interface between the user and the electronic device. The audio circuit 606 may transmit the received electrical signal after audio data conversion to a speaker, where the electrical signal is converted to a sound signal for output; on the other hand, the microphone converts the collected sound signals into electrical signals, which are received by the audio circuit 606 and converted into audio data, which are processed by the audio data output processor 608 for transmission via the RF circuit 601 to, for example, another electronic device, or which are output to the memory 602 for further processing. The audio circuit 606 may also include an ear bud jack to provide communication of the peripheral ear bud with the electronic device.
WiFi belongs to a short-distance wireless transmission technology, and the electronic equipment can help a user to send and receive emails, browse webpages, access streaming media and the like through the WiFi module 607, so that wireless broadband Internet access is provided for the user. Although fig. 4 shows a WiFi module 607, it is understood that it does not belong to the necessary constitution of the electronic device, and can be omitted entirely as needed within the scope of not changing the essence of the invention.
The processor 608 is a control center of the electronic device that uses various interfaces and lines to connect the various parts of the overall handset, performing various functions of the electronic device and processing the data by running or executing software programs and/or modules stored in the memory 602, and invoking data stored in the memory 602, thereby performing overall monitoring of the handset. Optionally, the processor 608 may include one or more processing cores; preferably, the processor 608 may integrate an application processor that primarily handles operating systems, user interfaces, applications, etc., with a modem processor that primarily handles wireless communications. It will be appreciated that the modem processor described above may not be integrated into the processor 608.
The electronic device also includes a power supply 609 (e.g., a battery) for powering the various components, which may be logically connected to the processor 608 via a power management system so as to perform functions such as managing charge, discharge, and power consumption via the power management system. The power supply 609 may also include one or more of any components, such as a direct current or alternating current power supply, a recharging system, a power failure detection circuit, a power converter or inverter, a power status indicator, and the like.
Although not shown, the electronic device may further include a camera, a bluetooth module, etc., which will not be described herein. In particular, in this embodiment, the processor 608 in the electronic device loads executable files corresponding to the processes of one or more application programs into the memory 602 according to the following instructions, and the processor 608 executes the application programs stored in the memory 602, so as to implement various functions, for example:
acquiring an audio frame to be processed;
detecting whether a howling frequency point exists in an audio frame to be processed;
if yes, updating parameters of the state variable filter based on the howling frequency points;
if not, judging whether the undetected howling frequency points exist in the audio frame to be processed or not based on the state variable filter, and correspondingly processing parameters of the state variable filter according to a judging result;
And filtering the audio frame to be processed by using a state variable filter so as to inhibit the howling frequency points.
In summary, the electronic device provided by the embodiment of the application acquires the audio frame to be processed; detecting whether a howling frequency point exists in an audio frame to be processed; if yes, updating parameters of the state variable filter based on the howling frequency points; if not, judging whether the undetected howling frequency points exist in the audio frame to be processed or not based on the state variable filter, and correspondingly processing parameters of the state variable filter according to a judging result; and filtering the audio frame to be processed by using a state variable filter so as to inhibit the howling frequency points. According to the method and the device, parameters of the state variable filter can be adjusted in real time according to the howling frequency points, and then the state variable filter after the parameters are adjusted is adopted to conduct filtering processing on the audio frames to be processed with the howling frequency points, so that the filtering calculation complexity is effectively reduced, and simultaneously, each howling frequency point is restrained in real time, and therefore the suppression effect on howling is improved.
In the foregoing embodiments, the descriptions of the embodiments are focused on, and for those portions of an embodiment that are not described in detail, reference may be made to the foregoing detailed description of the howling suppression method, which is not repeated herein.
It should be noted that, for the howling suppression method in the embodiment of the present application, it will be understood by those skilled in the art that all or part of the flow of implementing the howling suppression method in the embodiment of the present application may be implemented by controlling related hardware by a computer program, where the computer program may be stored in a computer readable storage medium, such as a memory of a terminal, and executed by at least one processor in the terminal, and the execution may include, for example, the flow of the embodiment of the howling suppression method.
For the howling suppression device in the embodiment of the application, each functional module may be integrated in one processing chip, or each module may exist alone physically, or two or more modules may be integrated in one module. The integrated modules may be implemented in hardware or in software functional modules. The integrated modules may also be stored in a computer readable storage medium if implemented as software functional modules and sold or used as a stand-alone product.
To this end, an embodiment of the present application provides a storage medium having stored therein a plurality of instructions capable of being loaded by a processor to perform the steps of any one of the howling suppression methods provided in the embodiment of the present application. The storage medium may be a magnetic disk, an optical disk, a Read Only MeMory (ROM), a random access MeMory (RAM, random Access Memory), or the like.
The howling suppression method, device, storage medium and electronic equipment provided by the application are respectively described in detail, and specific examples are applied to illustrate the principle and implementation of the application, and the description of the above examples is only used for helping to understand the core idea of the application; meanwhile, as those skilled in the art will vary in the specific embodiments and application scope according to the ideas of the present application, the present description should not be construed as limiting the present application in summary.

Claims (10)

1. A howling suppression method, characterized by comprising:
acquiring an audio frame to be processed;
detecting whether howling frequency points exist in the audio frame to be processed;
if yes, updating parameters of a state variable filter based on the howling frequency points;
if not, judging whether the undetected howling frequency points exist in the audio frame to be processed or not based on the state variable filter, and correspondingly processing parameters of the state variable filter according to a judging result;
and filtering the audio frame to be processed by using the state variable filter so as to inhibit the howling frequency points.
2. The howling suppression method according to claim 1, wherein if yes, updating parameters of a state variable filter based on the howling frequency point includes:
when detecting that a howling frequency point exists in an audio frame to be processed, judging whether the howling frequency point exists in a state variable filter or not;
if yes, increasing the attenuation gain of the state variable filter and reducing the bandwidth factor of the state variable filter;
if not, adding the howling frequency point into the state variable filter, and initializing the state variable filter.
3. The howling suppression method according to claim 2, wherein said initializing said state variable filter comprises:
setting the attenuation gain of the state variable filter to 0 and setting the bandwidth factor of the state variable filter to a first preset value.
4. The howling suppression method according to claim 1, wherein said performing a corresponding process on the parameters of the state variable filter according to the determination result comprises:
when the audio frame to be processed has undetected howling frequency points, a counter is started to count;
And correspondingly processing the parameters of the state variable filter according to the counting result.
5. The howling suppression method according to claim 4, wherein said performing a corresponding process on the parameters of said state variable filter according to the count result comprises:
when the continuous frame number of the audio frames to be processed of the undetected howling frequency points is larger than the preset frame number, increasing the bandwidth factor of the state variable filter;
and when the continuous frame number of the audio frames to be processed of the undetected howling frequency points is smaller than or equal to the preset frame number, maintaining the parameters of the state variable filter.
6. The howling suppression method according to claim 1, wherein said determining whether there are undetected howling frequency points in the audio frame to be processed based on the state variable filter comprises:
comparing the howling frequency point in the state variable filter with the frequency point in the audio frame to be processed;
and judging whether undetected howling frequency points exist in the audio frame to be processed according to the comparison result.
7. The howling suppression method according to claim 1, wherein said detecting whether a howling frequency point exists in the audio frame to be processed comprises:
Acquiring a peak adjacent power ratio and/or a peak harmonic power ratio of the audio frame to be processed;
and detecting whether howling frequency points exist in the audio frame to be processed according to the peak adjacent power ratio and/or the peak harmonic power ratio.
8. A howling suppressing apparatus, comprising:
an audio acquisition unit for acquiring an audio frame to be processed;
the howling detection unit is used for detecting whether howling frequency points exist in the audio frame to be processed;
the first updating unit is used for updating parameters of the state variable filter based on the howling frequency point when the howling frequency point exists in the audio frame to be processed;
the second updating unit is used for judging whether the howling frequency point which is not detected exists in the audio frame to be processed or not based on the state variable filter when the howling frequency point does not exist in the audio frame to be processed, and correspondingly processing the parameters of the state variable filter according to a judging result;
and the howling suppression unit is used for performing filtering processing on the audio frame to be processed by using the state variable filter so as to suppress the howling frequency points.
9. A storage medium storing a plurality of instructions adapted to be loaded by a processor to perform the howling suppression method of any one of claims 1-7.
10. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor implements the howling suppression method according to any of claims 1-7 when executing the computer program.
CN202311133870.1A 2023-09-04 2023-09-04 Howling suppression method and device, storage medium and electronic equipment Pending CN116994596A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311133870.1A CN116994596A (en) 2023-09-04 2023-09-04 Howling suppression method and device, storage medium and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311133870.1A CN116994596A (en) 2023-09-04 2023-09-04 Howling suppression method and device, storage medium and electronic equipment

Publications (1)

Publication Number Publication Date
CN116994596A true CN116994596A (en) 2023-11-03

Family

ID=88526822

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311133870.1A Pending CN116994596A (en) 2023-09-04 2023-09-04 Howling suppression method and device, storage medium and electronic equipment

Country Status (1)

Country Link
CN (1) CN116994596A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117641218A (en) * 2024-01-25 2024-03-01 北京中电慧声科技有限公司 Howling detection and suppression method, system and medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117641218A (en) * 2024-01-25 2024-03-01 北京中电慧声科技有限公司 Howling detection and suppression method, system and medium
CN117641218B (en) * 2024-01-25 2024-04-12 北京中电慧声科技有限公司 Howling detection and suppression method, system and medium

Similar Documents

Publication Publication Date Title
CN107277912B (en) Radio frequency interference processing method, device, storage medium and terminal
CN107393548B (en) Method and device for processing voice information collected by multiple voice assistant devices
CN109768810B (en) Signal processing circuit, terminal equipment and signal processing method
EP4175349A1 (en) Method for processing relax measurement parameter update and terminal device
CN106528545B (en) Voice information processing method and device
CN106384597B (en) Audio data processing method and device
CN109379145B (en) Signal processing circuit, terminal equipment and signal processing method
CN109817241B (en) Audio processing method, device and storage medium
CN110266323B (en) Signal control circuit, terminal equipment and signal control method
CN109243488B (en) Audio detection method, device and storage medium
CN111093259A (en) Bluetooth power adjustment method, device, storage medium and terminal
CN108492837B (en) Method, device and storage medium for detecting audio burst white noise
JP7324949B2 (en) Application sharing method, first electronic device and computer readable storage medium
CN116994596A (en) Howling suppression method and device, storage medium and electronic equipment
CN109041190B (en) Power control method and terminal
CN114520672A (en) Power calibration method for maximum power fluctuation, terminal device and storage medium
CN106126170B (en) Sound effect setting method of terminal and terminal
CN110636530B (en) Method for processing adjacent channel interference and terminal equipment
CN110995302B (en) Gain stage adjusting method and electronic equipment
CN106817324B (en) Frequency response correction method and device
CN111372003A (en) Camera switching method and device and terminal
CN111327343A (en) Common receiving channel radio frequency device, method and mobile terminal
CN107728990B (en) Audio playing method, mobile terminal and computer readable storage medium
CN105635379B (en) Noise suppression method and device
CN113990363A (en) Audio playing parameter adjusting method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination