CN111797708A

CN111797708A - Airflow noise detection method and device, terminal and storage medium

Info

Publication number: CN111797708A
Application number: CN202010536859.XA
Authority: CN
Inventors: 吴锐兴; 田晓晖; 叶利剑
Original assignee: AAC Technologies Pte Ltd
Current assignee: AAC Technologies Pte Ltd
Priority date: 2020-06-12
Filing date: 2020-06-12
Publication date: 2020-10-20
Also published as: WO2021248522A1

Abstract

The embodiment of the invention discloses an airflow noise detection method, which comprises the steps of collecting an original audio signal in a micro loudspeaker, extracting the characteristics of the original audio signal to obtain audio characteristics, and finally detecting whether the original audio signal has airflow noise or not according to the audio characteristics by a machine learning classifier. In addition, an airflow noise detection device, a computer device and a storage medium are also provided.

Description

Airflow noise detection method and device, terminal and storage medium

[ technical field ] A method for producing a semiconductor device

The invention relates to the technical field of computers, in particular to an airflow noise detection method, an airflow noise detection device, a terminal and a storage medium.

[ background of the invention ]

The micro-speaker cavity is narrow and small, and the air current that the vibrating diaphragm vibration caused during operation flows and is not smooth and easy. Under the condition of high voltage, the motion amplitude of the diaphragm is large, air in the cavity generates turbulence, and airflow noise of ' sand and ' fizz ' is caused under certain frequencies, so that the user experience is seriously influenced.

With the popularization of power amplifier applications of intelligent devices such as mobile phones and tablets, the problem of airflow noise when music is played under a large voltage is increasingly serious. To improve this problem, it is necessary to detect whether or not there is a noise in the micro-speaker. At present, the existing detection method for the noise of the airflow comprises the following steps: (1) the micro loudspeaker of the intelligent equipment plays audio signals and judges and detects the audio signals through human ears; (2) and analyzing and detecting the audio signal sent by the micro loudspeaker of the intelligent equipment by adopting an electroacoustic instrument. These detection methods have high process cost, long cycle, difficult accuracy guarantee, and limited versatility, and therefore, a new method for detecting the noise of the airflow is urgently needed.

[ summary of the invention ]

In view of the above, the present invention provides an airflow noise detection method, an airflow noise detection device, a computer device, and a storage medium, which are used to solve the problem of low efficiency in airflow noise elimination detection in the prior art.

The specific technical scheme of the embodiment of the invention is as follows:

in a first aspect, an embodiment of the present invention provides an airflow noise detection method, which is applied to a micro speaker, and the method includes:

collecting original audio signals in a micro loudspeaker;

extracting the characteristics of the original audio signal to obtain audio characteristics;

and detecting whether the original audio signal has air flow noise or not according to the audio features through a machine learning classifier.

In a second aspect, an embodiment of the present invention further provides an airflow noise detection apparatus, where the apparatus includes:

the signal acquisition module is used for acquiring original audio signals in the micro loudspeaker;

the characteristic extraction module is used for extracting the characteristics of the original audio signal to obtain audio characteristics;

and the noise detection module is used for detecting whether the original audio signal has airflow noise or not through a machine learning classifier according to the audio features.

In a third aspect, an embodiment of the present invention further provides a computer device, including a memory, a processor, and a computer program stored on the memory and executable on the processor, where the processor implements the steps of the airflow noise detection method as described above when executing the computer program.

In a fourth aspect, embodiments of the present invention also provide a computer-readable storage medium, which includes computer instructions, when the computer instructions are executed on a computer, cause the computer to execute the steps of the airflow noise detection method as described above.

The embodiment of the invention has the following beneficial effects:

after the method, the device, the terminal and the storage medium for detecting the airflow noise are adopted, the original audio signal in the micro loudspeaker is collected, then the characteristic extraction is carried out on the original audio signal to obtain the audio characteristic, finally, the machine learning classifier is used for detecting whether the airflow noise exists in the original audio signal according to the audio characteristic, and the machine learning classifier can reflect the inherent characteristics of the existence of the airflow noise or the absence of the airflow noise through training, so that the machine learning classifier is used for detecting whether the airflow noise exists in the original audio signal, the accuracy rate of the airflow noise detection is obviously improved, the cost is reduced, the real-time detection or the off-line detection can be realized, and the applicability of the detection method is improved.

[ description of the drawings ]

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.

Wherein:

FIG. 1 is a flow chart of a method of detecting noise in an air flow according to one embodiment;

FIG. 2 is a flow diagram of a method for audio feature extraction according to one embodiment;

FIG. 3 is a flow chart of a method of detecting airflow noise according to another embodiment;

FIG. 4 is a flow diagram of a method for machine learning classifier training in accordance with one embodiment;

FIG. 5 is a flow diagram of a sample audio signal set acquisition method according to one embodiment;

FIG. 6 is a schematic diagram of an embodiment of an apparatus for detecting noise in an airflow;

fig. 7 is a schematic diagram of an internal structure of a computer device for operating the airflow noise detection method according to an embodiment.

[ detailed description ] embodiments

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

The problem of low accuracy and high cost of manual detection or instrument detection in the traditional technology is solved.

In view of the above problems, in the present embodiment, an airflow noise detection method is particularly proposed. The method may be implemented in dependence on a computer program which is executable on a computer system based on the von neumann architecture.

As shown in fig. 1, the airflow noise detection method provided in this embodiment is applied to a micro speaker, and specifically includes the following steps:

step 102: the original audio signal in the micro-speaker is captured.

Wherein the original audio signal refers to an unprocessed signal in the WeChat loudspeaker. Specifically, sound information on the micro-speaker may be collected as an original audio signal by a sound card, and video or audio data may also be collected as an original audio signal from the micro-speaker. Generally, the original audio signal is small in the cavity of the micro-speaker and precise in structure, but the amplitude of the diaphragm is large, so that the air flow easily forms turbulence in the cavity and generates flow-induced noise.

Step 104: and extracting the characteristics of the original audio signal to obtain audio characteristics.

The audio features are features for characterizing an audio signal, wherein the audio features may be audio features including time domain features, frequency domain features, cepstral domain features, or a combination of these features. Specifically, the original audio signal may be subjected to feature extraction by a toolkit such as C/C + +, Python, MATLAB, and the like, to obtain an audio feature, and more specifically, the time domain feature, the frequency domain feature, and the cepstral domain feature may be a mel-frequency cepstrum coefficient (MFCC), a linear prediction coefficient (LPCC), a component envelope, and the like. Preferably, in order to distinguish the airflow noise more accurately, a combination of several features, i.e., a time domain feature, a frequency domain feature and a cepstrum domain feature, may be used as the audio feature, so as to improve the accuracy of analyzing the airflow noise in the original audio signal.

Step 106: and detecting whether the airflow noise exists in the original audio signal according to the audio features through a machine learning classifier.

The Machine Learning (ML) classifier is a Machine Learning algorithm model with classification capability after training. The machine learning classifier of the present embodiment is configured to classify an original audio signal characterized by audio features into one of an air flow noise signal and an air flow noise signal. The Machine learning classifier may adopt an SVM (Support Vector Machine) classifier, logistic regression, an integration method, a random forest, a neural network model, and the like, wherein the integration method may be a Boosting algorithm, a Bagging algorithm, and a variation thereof. In practice, the recognition accuracy reaches more than 98% when the SVM classifier is used for detecting the airflow noise, and the recognition accuracy reaches more than 97% when the integration method is used for detecting the airflow noise.

Specifically, the classifier that can utilize at least one machine learning model for classification is used as part of the training of the machine learning classifier, the training input is the audio features of various acquired original audio signals, and a relational classifier is established that corresponds the audio features to whether or not there is an airflow noise. The machine learning classifier is enabled to have the capability of judging whether the input audio features have airflow noise or not. In this embodiment, the machine learning classifier is a two-classifier, and two classification results are obtained, that is, there is an airflow noise or there is no airflow noise. The machine learning classifier can reflect the inherent characteristics of existence or nonexistence of the airflow noise through training, so that whether the airflow noise exists in the original audio signal or not is detected by the machine learning classifier, and the accuracy of airflow noise detection is obviously improved.

It is worth mentioning that, in this embodiment, the machine learning classifier is used, and whether the original audio signal has the airflow noise is detected according to the audio features, so that the method is applicable to an airflow noise detection scene of offline detection or real-time detection, and the applicability of the airflow noise detection method is improved.

According to the airflow noise detection method, the original audio signals in the micro loudspeaker are collected, feature extraction is carried out on the original audio signals to obtain audio features, and finally, the machine learning classifier is used for detecting whether the original audio signals have the airflow noise or not according to the audio features.

As shown in fig. 2, in an embodiment, performing feature extraction on an original audio signal to obtain an audio feature includes:

step 104A: segmenting an original audio signal to obtain a plurality of signal segments;

step 104B: respectively extracting the characteristics of each signal segment to obtain sub-audio characteristics;

step 104C: and combining the sub-audio features according to the signal segments to obtain the audio features.

The signal segment refers to a segment of audio signal in the original audio signal, and specifically, the original audio signal may be divided into a plurality of audio frames, the division may be performed in an overlap framing manner, that is, an end portion (e.g., 100 milliseconds) of a previous frame of the original audio signal is used as a start portion of a next frame of the original audio signal, and a plurality of signal segments of the original audio signal may be obtained through overlap framing. The original audio signal may also be framed by a framing function in the MATLAB tool, such as enframe (), to obtain a plurality of signal segments of the original audio signal. And then, respectively performing feature extraction on each divided signal segment to obtain sub-audio features corresponding to each signal segment, where the sub-audio features in this embodiment are the same as the audio features in step 104, and are not described herein again. And finally, combining the plurality of sub-audio features according to the sequence corresponding to the signal segment, wherein the combination mode can be that all the sub-audio features are combined, or a specific number of sub-audio features are selected to be combined to obtain the audio features. It can be understood that the original audio signal is segmented and then subjected to feature extraction, so that the features of each signal segment are more uniform and accurate, and the subsequent analysis is more reliable.

Furthermore, the original audio signals in different time periods are acquired in an off-line mode, and after the original audio signals are segmented, the speed of feature extraction can be increased by extracting features, so that the features of the signal segments are more uniform and abundant, and the accuracy of audio features is improved.

In one embodiment, before performing feature extraction on each signal segment to obtain sub-audio features, the method further includes:

and carrying out normalization processing on the signal segment.

Wherein, each sub-audio feature distribution that each signal section corresponds can be wider, can adjust these signal sections to predetermineeing the interval through the normalization and handle for the characteristic of each signal section is obvious abundantly more, thereby has further guaranteed the unity of each signal section, is favorable to improving the degree of accuracy of the sub-audio feature that the signal section corresponds.

As shown in fig. 3, in one embodiment, the method further comprises:

step 108: acquiring sample audio signals and corresponding marks in the sample audio signal set, wherein the marks of the positive sample audio signals are free of air flow noise, and the marks of the negative sample audio signals are air flow noise;

step 110: extracting each sample audio characteristic in the sample audio signal set to obtain a sample audio characteristic set;

step 112: and training a machine learning classifier according to the sample audio feature set and the corresponding marks.

Specifically, the method includes collecting original audio signals played by a WeChat speaker, wherein a part of the original audio signals contain obvious airflow noise, and the other part of the original audio signals do not contain airflow noise, dividing the original audio signals into signal segments for labeling, then extracting features of each audio signal, wherein the features include time domain features, frequency domain features, cepstrum domain features and the like, mixing audio features of audio signals of positive samples with audio features of audio signals of negative samples, wherein random mixing can be adopted, all sample audio features and corresponding labels form a sample audio feature set, in the embodiment, the sample audio signals comprise positive samples and negative samples in a centralized manner, the negative samples comprise characteristics of the audio features with airflow noise, and a machine learning classifier trained by using the sample audio signal set can learn more accurate classification rules, thereby further improving the accuracy of the detection of the airflow noise.

In one embodiment, the audio features include at least one of time domain features, frequency domain features, and cepstral domain features.

Wherein, the time domain characteristics comprise a short-time average zero crossing rate and a short-time autocorrelation function; the frequency domain characteristics comprise extracting a short-time power spectral density function; cepstral domain feature mel-frequency cepstral coefficients and linear prediction cepstral coefficients. In this embodiment, at least one audio feature is extracted to analyze audio signals by using characteristics of different audio features, thereby improving the accuracy of detection.

It should be noted that, in this embodiment, after the time domain feature, the frequency domain feature, the cepstrum domain feature, or the combination of these features is trained by the machine learning classifier, the optimal audio feature is determined according to the machine learning classifier.

As shown in fig. 4, in one embodiment, training a machine learning classifier based on a sample audio feature set and corresponding labels includes:

step 112A: acquiring a discrete parameter value set of a machine learning classifier;

step 112B: training a machine learning classifier corresponding to each parameter value according to each parameter value in the discrete parameter value set and the sample audio feature set, and obtaining the classification prediction accuracy of the machine learning classifier corresponding to the corresponding parameter value;

step 112C: screening out the maximum classification prediction accuracy, acquiring corresponding parameter values and audio features of the sample audio feature set, and training a machine learning classifier according to the acquired parameter values and the sample audio feature set.

The discrete parameter value set is a set formed by a plurality of discrete parameter values. The parameter value is a value of a parameter required by training the machine learning classifier. Specifically, sampling can be performed in a continuous parameter value range according to a first step length to obtain a series of discrete parameter values, so as to form a discrete parameter value set. If the machine learning classifier includes a plurality of parameters to be learned, a discrete parameter value set corresponding to each parameter can be obtained. If the machine learning classifier adopts an SVM classifier, the parameters are punishment coefficients. Specifically, each parameter value in the discrete parameter value set can be traversed, the currently traversed parameter value and the audio feature corresponding to the sample audio feature set are respectively utilized to train the machine learning classifier, and the classification prediction accuracy corresponding to the machine learning classifier is obtained until all the parameter values in the discrete parameter value set and the audio feature corresponding to the sample audio feature set are traversed.

Further, dividing the sample audio feature set into a training set and a testing set, traversing each parameter value in the discrete parameter value set and the audio feature corresponding to the sample audio feature set, training a machine learning classifier by using the currently traversed parameter value and the audio feature corresponding to the sample audio feature set in the training set, predicting the testing set by using the trained machine learning classifier, obtaining the known classification result of the testing set, and comparing the predicted result with the known classification result to obtain the classification prediction accuracy of the corresponding machine learning classifier. And comparing the classification prediction accuracy rates obtained in the step 112B to find out the maximum classification prediction accuracy rate, and obtaining the parameter value used by the machine learning classifier for training the maximum classification prediction accuracy rate and the audio feature corresponding to the sample audio feature set, so as to continue training the machine learning classifier by using the obtained parameter value and the audio feature corresponding to the sample audio feature set.

In this embodiment, the sample audio feature set is used to quickly find out appropriate parameter values and audio features, so that the parameter values, the audio features and the sample audio feature set are used for training, and the efficiency of training the machine learning classifier can be improved.

As shown in fig. 5, in one embodiment, obtaining sample audio signals and corresponding labels in a sample audio signal set comprises:

step 108A: under the same environment, the same audio acquisition equipment is adopted, and a sample audio signal set is obtained by adjusting the gain acquisition of the micro loudspeaker;

step 108B: dividing sample audio signals in the sample audio signal set to obtain a plurality of sample audio segments corresponding to each sample audio signal;

step 108C: and determining the mark of the sample audio signal through a preset loudspeaker model according to the sample audio segment.

Specifically, in order to ensure diversity and watchband property of the acquired original audio signals, it is ensured that one part of the original audio signals contains airflow noise and the other part of the original audio signals does not have airflow noise, so that hardware and human interference are avoided in the same environment. Simultaneously, the same audio acquisition equipment is adopted, a sample audio signal set is acquired by adjusting the gain of the micro loudspeaker, namely, the high point of the voltage is controlled by adjusting the gain, so that the vibration amplitude of the vibrating diaphragm is controlled, the richness of the acquired original audio signal is ensured, and the richness and the comprehensiveness of the sample audio signal set are improved.

The method comprises the steps of dividing sample audio signals in a sample audio signal set to obtain a plurality of sample audio segments corresponding to each sample audio signal, dividing the sample audio signals to be beneficial to improving the uniformity of the sample audio segments, determining the marks of the sample audio signals through a preset loudspeaker model according to the sample audio segments, and ensuring the accuracy of the marks of the sample audio signals through a continuous learning method. In the embodiment, the accuracy of training of the machine learning classifier is improved by acquiring a comprehensive and abundant sample audio signal set and accurately marking.

Based on the same inventive concept, an embodiment of the present invention provides an airflow noise detection apparatus 600, as shown in fig. 6, including: a signal collecting module 602, configured to collect an original audio signal in the micro speaker; a feature extraction module 604, configured to perform feature extraction on the original audio signal to obtain an audio feature; and a noise detection module 606, configured to detect whether an airflow noise exists in the original audio signal according to the audio feature by using a machine learning classifier.

Specifically, the airflow noise detection apparatus 600 of the present embodiment, as shown in fig. 6, includes: a signal collecting module 602, configured to collect an original audio signal in the micro speaker; a feature extraction module 604, configured to perform feature extraction on the original audio signal to obtain an audio feature; and a noise detection module 606, configured to detect whether an airflow noise exists in the original audio signal according to the audio feature by using a machine learning classifier. The machine learning classifier can reflect the inherent characteristics of existence or nonexistence of the air flow noise through training, so that whether the air flow noise exists in the original audio signal or not is detected by the machine learning classifier, the accuracy rate of air flow noise detection is obviously improved, the cost is reduced, and the applicability of the detection method is improved.

It should be noted that, the implementation of the apparatus for detecting an airflow noise in this embodiment is consistent with the implementation concept of the method for detecting an airflow noise, and the implementation principle thereof is not described herein again, and specific reference may be made to the corresponding content in the method.

FIG. 7 is a diagram illustrating an internal structure of a computer device in one embodiment. The computer device may specifically be a server or a terminal. As shown in fig. 7, the computer device 700 includes a processor 710, a memory 720, and a network interface 730, all connected via a system bus. The memory 720 includes a nonvolatile storage medium and an internal memory. The non-volatile storage medium of the computer device stores an operating system and may also store a computer program that, when executed by the processor, causes the processor to implement a method of airflow noise detection. The internal memory may also have stored therein a computer program that, when executed by the processor, causes the processor to perform a method of airflow noise detection. Those skilled in the art will appreciate that the architecture shown in fig. 7 is merely a block diagram of some of the structures associated with the disclosed aspects and is not intended to limit the computing devices to which the disclosed aspects apply, as a particular computing device may include more or less components than those shown in fig. 7, or may combine certain components, or have a different arrangement of components.

In one embodiment, the method for detecting the noise of the air flow provided by the present application can be realized in the form of a computer program, and the computer program can be run on a computer device as shown in fig. 7. The memory of the computer device may have stored therein the various program modules that make up the means for airflow noise detection. For example, the signal acquisition module 602, the feature extraction module 604, and the noise detection module 606.

A computer device comprising a memory and a processor, the memory storing a computer program that, when executed by the processor, causes the processor to perform the steps of: collecting original audio signals in a micro loudspeaker; extracting the characteristics of the original audio signal to obtain audio characteristics; and detecting whether the original audio signal has air flow noise or not according to the audio features through a machine learning classifier.

In one embodiment, the performing feature extraction on the original audio signal to obtain an audio feature includes: segmenting the original audio signal to obtain a plurality of signal segments; respectively extracting the characteristics of each signal segment to obtain sub-audio characteristics; and combining the sub-audio features according to the signal segments to obtain the audio features.

In one embodiment, before the performing the feature extraction on each signal segment to obtain the sub-audio feature, the method further includes: and carrying out normalization processing on the signal segment.

In one embodiment, the method further comprises: acquiring sample audio signals and corresponding marks in the sample audio signal set, wherein the marks of the positive sample audio signals are free of air flow noise, and the marks of the negative sample audio signals are air flow noise; extracting each sample audio characteristic in the sample audio signal set to obtain a sample audio characteristic set; and training a machine learning classifier according to the sample audio feature set and the corresponding marks.

In one embodiment, training a machine learning classifier based on the sample audio feature set and corresponding labels includes: acquiring a discrete parameter value set of a machine learning classifier; training a machine learning classifier corresponding to each parameter value according to each parameter value in the discrete parameter value set and the sample audio feature set, and obtaining the classification prediction accuracy of the machine learning classifier corresponding to the corresponding parameter value; screening out the maximum classification prediction accuracy, acquiring corresponding parameter values and audio features of the sample audio feature set, and training a machine learning classifier according to the acquired parameter values and the sample audio feature set.

In one embodiment, the obtaining of sample audio signals and corresponding labels in a sample audio signal set comprises: under the same environment, the same audio acquisition equipment is adopted, and the sample audio signal set is obtained by adjusting the gain acquisition of the micro loudspeaker; dividing the sample audio signals in the sample audio signal set to obtain the plurality of sample audio segments corresponding to each sample audio signal; and determining the mark of the sample audio signal through a preset loudspeaker model according to the sample audio segment.

A computer-readable storage medium storing a computer program, the computer program when executed by a processor implementing the steps of: collecting original audio signals in a micro loudspeaker; extracting the characteristics of the original audio signal to obtain audio characteristics; and detecting whether the original audio signal has air flow noise or not according to the audio features through a machine learning classifier.

It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by a computer program, which can be stored in a non-volatile computer-readable storage medium, and can include the processes of the embodiments of the methods described above when the program is executed. Any reference to memory, storage, database, or other medium used in the embodiments provided herein may include non-volatile and/or volatile memory, among others. Non-volatile memory can include read-only memory (ROM), Programmable ROM (PROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), Double Data Rate SDRAM (DDRSDRAM), Enhanced SDRAM (ESDRAM), Synchronous Link DRAM (SLDRAM), Rambus Direct RAM (RDRAM), direct bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM).

The above disclosure is only for the purpose of illustrating the preferred embodiments of the present invention, and it is therefore to be understood that the invention is not limited by the scope of the appended claims.

Claims

1. An airflow noise detection method applied to a micro-speaker, the method comprising:

collecting original audio signals in a micro loudspeaker;

2. The method for detecting airflow noise according to claim 1, wherein said extracting the characteristics of the original audio signal to obtain the audio characteristics comprises:

segmenting the original audio signal to obtain a plurality of signal segments;

respectively extracting the characteristics of each signal segment to obtain sub-audio characteristics;

and combining the sub-audio features according to the signal segments to obtain the audio features.

3. The method according to claim 2, wherein before said separately performing feature extraction on each of said signal segments to obtain sub-audio features, the method further comprises:

and carrying out normalization processing on the signal segment.

4. The airflow noise detection method of claim 1, further comprising:

acquiring sample audio signals and corresponding marks in the sample audio signal set, wherein the marks of the positive sample audio signals are free of air flow noise, and the marks of the negative sample audio signals are air flow noise;

extracting each sample audio characteristic in the sample audio signal set to obtain a sample audio characteristic set;

and training a machine learning classifier according to the sample audio feature set and the corresponding marks.

5. The airflow noise detection method of claim 1, wherein the audio features include at least one of time domain features, frequency domain features, and cepstral domain features.

6. The airflow noise detection method of claim 5, wherein training a machine learning classifier based on the sample audio feature set and corresponding labels comprises:

acquiring a discrete parameter value set of a machine learning classifier;

training a machine learning classifier corresponding to each parameter value according to each parameter value in the discrete parameter value set and the sample audio feature set, and obtaining the classification prediction accuracy of the machine learning classifier corresponding to the corresponding parameter value;

screening out the maximum classification prediction accuracy, acquiring corresponding parameter values and audio features of the sample audio feature set, and training a machine learning classifier according to the acquired parameter values and the sample audio feature set.

7. The airflow noise detection method of claim 4, wherein said obtaining sample audio signals and corresponding labels in a sample audio signal set comprises:

under the same environment, the same audio acquisition equipment is adopted, and the sample audio signal set is obtained by adjusting the gain acquisition of the micro loudspeaker;

dividing the sample audio signals in the sample audio signal set to obtain a plurality of sample audio segments corresponding to each sample audio signal;

and determining the mark of the sample audio signal through a preset loudspeaker model according to the sample audio segment.

8. An airflow noise detection apparatus, the apparatus comprising:

9. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor when executing the computer program performs the steps of the airflow noise detection method of any one of claims 1 to 7.

10. A computer readable storage medium comprising computer instructions which, when run on a computer, cause the computer to perform the steps of the airflow noise detection method of any of claims 1 to 7.