CN117727314A - Filtering enhancement method for ecological audio information - Google Patents

Filtering enhancement method for ecological audio information Download PDF

Info

Publication number
CN117727314A
CN117727314A CN202410179556.5A CN202410179556A CN117727314A CN 117727314 A CN117727314 A CN 117727314A CN 202410179556 A CN202410179556 A CN 202410179556A CN 117727314 A CN117727314 A CN 117727314A
Authority
CN
China
Prior art keywords
time sequence
audio
frequency domain
sequence interval
bird
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202410179556.5A
Other languages
Chinese (zh)
Other versions
CN117727314B (en
Inventor
高树会
宋佳男
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Bainiao Data Technology Beijing Co ltd
Original Assignee
Bainiao Data Technology Beijing Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Bainiao Data Technology Beijing Co ltd filed Critical Bainiao Data Technology Beijing Co ltd
Priority to CN202410179556.5A priority Critical patent/CN117727314B/en
Publication of CN117727314A publication Critical patent/CN117727314A/en
Application granted granted Critical
Publication of CN117727314B publication Critical patent/CN117727314B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Circuit For Audible Band Transducer (AREA)

Abstract

The application relates to the technical field of voice processing, and provides a filtering enhancement method for ecological audio information, which comprises the following steps: acquiring bird audio signal data; acquiring a frequency spectrum energy difference index according to bird audio signal data; acquiring an audio noise receiving factor according to the spectrum energy difference index; acquiring a noise disturbance index according to the audio noise factor; acquiring a wind sound interference degree index according to bird audio signal data; acquiring a wind sound interference adjustment coefficient according to the wind sound interference degree index; obtaining a wiener filter smooth adjustment coefficient according to the noise disturbance index and the wind noise disturbance adjustment coefficient; obtaining a wiener filter smoothing coefficient according to the wiener filter smoothing adjustment coefficient; and acquiring bird audio signal data after the filtering enhancement based on the wiener filtering smoothing coefficient by using a wiener filtering algorithm. According to the bird audio signal data filtering enhancement method and device, the adaptive wiener filtering smoothing coefficient is used, and the filtering enhancement effect on bird audio signal data is improved.

Description

Filtering enhancement method for ecological audio information
Technical Field
The application relates to the technical field of voice processing, in particular to a filtering enhancement method for ecological audio information.
Background
With the development of society, people are increasingly conscious of ecological environment protection, people are becoming aware that many wild birds and their habitat are not protected, and more species may be extinct, so that people continuously strengthen the monitoring protection of birds. At present, an ecological system AI voiceprint monitoring technology is mostly adopted, bird audio information is monitored through voiceprint sensing equipment arranged in a field environment, and monitoring protection of birds is achieved through the bird audio information.
Because the positions of the voiceprint sensing equipment are different, the moving range of birds is larger, and the bird audio information is weaker in the process of collecting the audio data, so that the bird audio information is difficult to distinguish, and the filter enhancement methods such as Kalman filtering, wiener filtering and the like are generally required to be adopted, so that the identification degree of bird singing in the audio is improved.
In addition, the audio data capable of representing birds actually has related audio information of environmental noise besides the bird song, and the related audio information of the environmental noise generally affects the recognition degree of the bird song in the audio, so that the audio data needs to be removed as noise in the process of denoising. However, the traditional filtering enhancement algorithm has poor filtering enhancement effect on bird audio information, and misjudgment is generated on bird song detection. For example, in the traditional wiener filtering algorithm, the influence of environmental noise on the audio is not analyzed, the accuracy of the smoothing coefficient is low, and the filtering enhancement effect of bird audio information is poor.
Disclosure of Invention
The application provides a filtering enhancement method for ecological audio information, which aims to solve the problem of poor filtering enhancement effect of bird audio information, and the adopted technical scheme is as follows:
one embodiment of the present application provides a filtering enhancement method for eco-audio information, the method comprising the steps of:
acquiring bird audio signal data;
acquiring a frequency domain waveform diagram of each time sequence interval according to bird audio signal data, and acquiring all frequency component peak points and oscillation trailing wave peak points in the frequency domain waveform diagram of each time sequence interval according to the frequency domain waveform diagram of each time sequence interval; acquiring an oscillation tailing effect characteristic sequence of each frequency component peak point in the frequency domain waveform diagram of each time sequence interval and the bandwidth of each frequency component peak point in the frequency domain waveform diagram of each time sequence interval according to the frequency component peak point and the oscillation tailing peak point; acquiring noise disturbance indexes of bird audio signal data according to the oscillation tailing effect characteristic sequence of each frequency component peak point in the frequency domain waveform diagram of each time sequence interval and the bandwidth of each frequency component peak point in the frequency domain waveform diagram of each time sequence interval;
acquiring all bird song existence time sequence intervals and ecological environment audio time sequence intervals according to the bird audio signal data; acquiring a wind-sound interference adjustment coefficient of bird audio signal data according to a bird song existence time sequence interval and an ecological environment audio time sequence interval; obtaining a wiener filter smooth adjustment coefficient according to the noise disturbance index and the wind noise disturbance adjustment coefficient of the bird audio signal data;
and acquiring wiener filter smoothing coefficients according to the wiener filter smoothing adjustment coefficients, and acquiring bird audio signal data after filter enhancement based on the wiener filter smoothing coefficients by using a wiener filter algorithm.
Preferably, the method for acquiring the frequency domain waveform diagram of each time sequence interval according to the bird audio signal data and acquiring all the frequency component peak points and the oscillation trailing peak points in the frequency domain waveform diagram of each time sequence interval according to the frequency domain waveform diagram of each time sequence interval comprises the following steps:
taking the interval of each first preset parameter in the acquisition time of the bird audio signal data as a time sequence interval;
taking bird audio signal data in each time sequence interval as input of Fourier transform, and obtaining frequency domain data of each time sequence interval by utilizing the Fourier transform; drawing the frequency domain data of each time sequence interval by utilizing MATLAB software to obtain a frequency domain waveform diagram of each time sequence interval, wherein the horizontal axis of the frequency domain waveform diagram is frequency, and the vertical axis of the frequency domain waveform diagram is frequency domain energy;
for the frequency domain waveform diagram of each time sequence interval, taking the frequency domain energy of all wave crest points in the frequency domain waveform diagram as input of an Ojin threshold method, and obtaining a division threshold value by using the Ojin threshold method; and taking each wave peak point with the frequency domain energy larger than or equal to the division threshold value in the frequency domain waveform as a frequency component wave peak point, and taking each wave peak point with the frequency domain energy smaller than the division threshold value in the frequency domain waveform as an oscillation trailing wave peak point.
Preferably, the method for obtaining the oscillation tailing effect characteristic sequence of each frequency component peak point in the frequency domain waveform diagram of each time sequence interval and the bandwidth of each frequency component peak point in the frequency domain waveform diagram of each time sequence interval according to the frequency component peak point and the oscillation tailing peak point comprises the following steps:
taking each frequency component peak point in the frequency domain waveform diagram of each time sequence interval as a target frequency component peak point, and taking a sequence formed by all oscillation tailing peak points between the target frequency component peak point and the next frequency component peak point according to the ascending order of the frequencies as an oscillation tailing effect characteristic sequence of the target frequency component peak point;
and regarding the frequency domain waveform diagram of each time sequence interval, taking the frequency domain energy which is the frequency domain energy multiplied by the second preset parameter of the minimum frequency domain energy of all the frequency component peak points in the frequency domain waveform diagram as bandwidth construction energy, and taking the width of the waveform corresponding to each frequency component peak point, where the transverse straight line where the bandwidth construction energy is located, passing through each frequency component peak point as the bandwidth of each frequency component peak point in the frequency domain waveform diagram.
Preferably, the method for obtaining the noise disturbance index of the bird audio signal data according to the oscillation tailing effect characteristic sequence of each frequency component peak point in the frequency domain waveform diagram of each time sequence interval and the bandwidth of each frequency component peak point in the frequency domain waveform diagram of each time sequence interval comprises the following steps:
taking the difference value of the frequency domain energy of each frequency component peak point in the frequency domain waveform diagram and the frequency domain energy average value of all oscillation trailing wave peak points in the oscillation trailing effect characteristic sequence of each frequency component peak point as a molecule for the frequency domain waveform diagram of each time sequence interval; calculating the absolute value of the difference between the frequency domain energy of each frequency component peak point and the frequency domain energy mean value of all frequency component peak points in the frequency domain waveform chart, and taking the sum of the absolute value and a third preset parameter as a denominator; taking the average value of the sum of the absolute values of the ratios of the numerator and the denominator on the frequency domain waveform diagram as the spectrum energy difference index of the time sequence interval;
acquiring an audio noise receiving factor of each time sequence interval according to the frequency spectrum energy difference index of each time sequence interval;
and taking the average value of the sum of the spectrum energy difference index and the audio noise factor and the sum of the spectrum energy difference index and the audio noise factor over all time sequence intervals as the noise disturbance index of the bird audio signal data.
Preferably, the method for obtaining the audio noise factor of each time sequence interval according to the spectrum energy difference index of each time sequence interval comprises the following steps:
taking a spectrum energy difference index of each time sequence interval as a molecule for the frequency domain waveform diagram of each time sequence interval;
calculating the absolute value of the difference between the bandwidth of each frequency component peak point in the frequency domain waveform diagram and the bandwidth average value of all frequency component peak points, calculating the accumulation sum of the absolute value on the frequency domain waveform diagram, and taking the product of the accumulation sum and the frequency domain energy of the fundamental frequency in the frequency domain waveform diagram as a denominator;
the ratio of the numerator to the denominator is used as the audio noise factor of the time sequence interval.
Preferably, the method for acquiring all the bird song existence time sequence intervals and the ecological environment audio time sequence intervals according to the bird audio signal data comprises the following steps:
taking the amplitude values of all audio signals in the bird audio signal data as the input of an Ojin threshold method, and obtaining a bird song audio dividing threshold value by using the Ojin threshold method;
calculating the average value of the amplitude values of all audio signals in the bird audio signal data in each time sequence interval, taking each time sequence interval with the average value of the amplitude values being more than or equal to a bird song audio frequency dividing threshold value as a bird song existence time sequence interval, and taking each time sequence interval with the average value of the amplitude values being less than the bird song audio frequency dividing threshold value as an ecological environment audio frequency time sequence interval.
Preferably, the method for obtaining the wind noise interference adjustment coefficient of the bird audio signal data according to the bird song existence time sequence interval and the ecological environment audio time sequence interval comprises the following steps:
taking each ecological environment audio time sequence interval as a target ecological environment audio time sequence interval, and taking the total number of frequency component peak points in a frequency domain waveform diagram of the target ecological environment audio time sequence interval as bird behavior audio weight factors of the target ecological environment audio time sequence interval when the audio noise receiving factor corresponding to the target ecological environment audio time sequence interval is smaller than or equal to the audio noise receiving factor of the last time sequence interval; when the audio noise factor corresponding to the target ecological environment audio time sequence interval is larger than the audio noise factor of the previous time sequence interval, calculating the difference value of the audio noise factor corresponding to the target ecological environment audio time sequence interval and the audio noise factor of the previous time sequence interval, and taking the sum of the total number of the frequency component peak points and the difference value as a wind-up audio weight factor of the target ecological environment audio time sequence interval;
acquiring a wind sound interference degree index of the target ecological environment audio time sequence interval according to a wind sound weighting factor of the target ecological environment audio time sequence interval;
and taking the average value of the wind noise interference degree indexes corresponding to all the ecological environment audio time sequence intervals as a wind noise interference adjustment coefficient of the bird audio signal data.
Preferably, the method for obtaining the wind noise interference degree index of the target ecological environment audio time sequence interval according to the wind noise weight factor of the target ecological environment audio time sequence interval comprises the following steps:
calculating the amplitude average value of all audio signals in the bird audio signal data in each ecological environment audio time sequence interval, calculating the absolute value of the difference between the amplitude average value corresponding to the target ecological environment audio time sequence interval and the amplitude average value corresponding to each ecological environment audio time sequence interval, and taking the natural constant as a base number and taking the mapping result of the sum of the absolute value on the total number of the ecological environment audio time sequence intervals as an index as a first summation factor;
taking the air-blast audio weight factors of the target ecological environment audio time sequence interval as molecules; calculating a measurement distance between a maximum amplitude point of the target ecological environment audio time sequence interval and a maximum amplitude point of each bird song existence time sequence interval, and taking the sum of the measurement distances on the total number of the bird song existence time sequence intervals as a denominator;
and taking the sum of the ratio of the numerator and the denominator and the first summation factor as an index of the wind noise interference degree of the target ecological environment audio time sequence interval.
Preferably, the method for obtaining the wiener filter smooth adjustment coefficient according to the noise disturbance index and the wind noise disturbance adjustment coefficient of the bird audio signal data comprises the following steps:
and calculating the product of the noise disturbance index and the wind noise disturbance adjustment coefficient of the bird audio signal data, and taking the product as a wiener filtering smooth adjustment coefficient.
Preferably, the method for obtaining wiener filter smoothing coefficients according to wiener filter smoothing adjustment coefficients and obtaining bird audio signal data after filter enhancement based on the wiener filter smoothing coefficients by using a wiener filter algorithm comprises the following steps:
calculating the sum of the wiener filter smoothing adjustment coefficient and a fourth preset parameter, and taking the product of the sum and the preset smoothing coefficient as the wiener filter smoothing coefficient;
and taking the bird audio signal data and the wiener filter smoothing coefficient as inputs of a wiener filter algorithm respectively, and obtaining the bird audio signal data after the filter enhancement by using the wiener filter algorithm.
The beneficial effects of this application are: through the frequency domain characteristics of noise interference of field audio data and analysis of behavior activity sound of wind noise interference, noise interference indexes and wind noise interference adjustment coefficients of the audio data are constructed, further wiener filter smooth adjustment coefficients are constructed, and smooth coefficients in a wiener filter algorithm are improved, so that the wiener filter algorithm can adaptively adjust the smooth coefficients according to noise conditions of the field audio and the existence condition of the wind noise interference audio, more noise is removed on the basis of reserving bird behavior audio, more accurate field bird audio is obtained, and more accurate data support is provided for follow-up field bird automatic monitoring. Therefore, the method improves the filtering enhancement effect on bird audio signal data through the self-adaptive wiener filtering smoothing coefficient.
Drawings
In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings that are required in the embodiments or the description of the prior art will be briefly described below, it being obvious that the drawings in the following description are only some embodiments of the present application, and that other drawings may be obtained according to these drawings without inventive faculty for a person skilled in the art.
Fig. 1 is a flowchart of a filtering enhancement method for ecological audio information according to an embodiment of the present application;
fig. 2 is a flowchart of an implementation of a filtering enhancement method for eco-audio information according to an embodiment of the present application.
Detailed Description
The following description of the embodiments of the present application will be made clearly and fully with reference to the accompanying drawings, in which it is evident that the embodiments described are only some, but not all, of the embodiments of the present application. All other embodiments, which can be made by one of ordinary skill in the art without undue burden from the present disclosure, are within the scope of the present disclosure.
Referring to fig. 1, a flowchart of a filtering enhancement method for ecological audio information according to an embodiment of the present application is shown, the method includes the following steps:
and S001, acquiring bird audio information data in a field ecological environment.
The method comprises the steps of collecting bird audio signal data by arranging voiceprint sensing equipment in a field ecological environment, firstly collecting audio signal data for 5min, determining the maximum amplitude value of the audio signal data through a time sequence formed by the amplitude values of the audio signal data according to the time ascending order, formally starting to collect the bird audio signal data, setting the sampling rate to be twice the maximum amplitude value, collecting the total duration to be 720min, and recording the data as bird audio signal data.
Thus, bird audio signal data in the field ecological environment are obtained.
Step S002, obtaining a frequency spectrum energy difference index according to the bird audio signal data, obtaining an audio noise receiving factor according to the frequency spectrum energy difference index, and obtaining a noise interference index according to the audio noise receiving factor.
Because the positions of the voiceprint sensing equipment are relatively fixed, the moving range, time and the like of field birds are greatly changed, and the sound intensity and characteristics of the birds are weak in audio data collected in many times, the birds are difficult to accurately monitor relative to the bird audio data in the audio data. Thus, filtering enhancement is typically performed on the collected bird audio signal data.
In addition, because more than one bird may exist in the monitored area, the audio data segment with higher aliasing effect may be the result of multiple birds ringing at the same time, so that the actual noise influence needs to be analyzed to determine the influence of the noise on the bird audio signal data, thereby improving the filtering effect and enhancing the bird voice data.
Specifically, each 1min interval in the acquisition time of the acquired bird audio signal data is divided into a time sequence interval, bird audio signal data in each time sequence interval is used as input of Fourier transform, output of the Fourier transform is used as frequency domain data of each time sequence interval, MATLAB software is utilized to draw a frequency domain waveform diagram of each time sequence interval based on the frequency domain data of each time sequence interval, and the horizontal axis of the frequency domain waveform diagram is frequency and the vertical axis is frequency domain energy. Wherein the fourier transform is a well-known technique, and the process thereof is not described in detail in this application.
Further, in the frequency domain waveform diagram, each frequency corresponds to one frequency domain energy, the greater the degree of noise interference of the audio signal, the more additional frequency components are, and the greater the bandwidth of the fundamental frequency, the greater the difference of the fundamental frequency energy in each time sequence interval.
Specifically, for the frequency domain waveform diagram of the x time sequence interval, all wave peak points are obtained, wherein the wave peak points comprise all frequency component wave peak points and oscillation trailing wave peak points generated by frequency leakage, the frequency domain energy of all wave peak points is used as input of an Ojin threshold method, the output of the Ojin threshold method is used as a dividing threshold of the wave peak points, each wave peak point with the frequency domain energy being greater than or equal to the dividing threshold is used as a frequency component wave peak point, and each wave peak point with the frequency domain energy being smaller than the dividing threshold is used as an oscillation trailing wave peak point. The method of threshold value of Ojin is a well-known technique, and the process thereof is not described in detail in this application.
Further, since the oscillation tailing is usually followed by each frequency component, the sequence of all the oscillation tailing peaks from each frequency component peak to the next frequency component peak in ascending order of frequency is used as the oscillation tailing characteristic sequence of each frequency component peak, and the oscillation tailing characteristic sequence of the c-th frequency component peak is recorded as. At the same time, the minimum frequency domain energy of the peak points of all frequency components>The frequency domain energy is taken as bandwidth construction energy, a transverse straight line where the bandwidth construction energy is located in the frequency domain waveform diagram passes through waveform diagrams of all frequency component peak points, and the width of the waveform passing through each frequency component peak point is recorded as the bandwidth of the frequency component peak point.
Calculating a noise disturbance index of the bird audio signal data:
in the method, in the process of the invention,for the spectrum energy difference index of the xth time sequence interval, V is the total number of the wave peak points of the frequency component in the frequency domain waveform diagram of the xth time sequence interval, +.>Frequency domain energy of peak point of c-th frequency component in frequency domain waveform diagram of x-th time sequence interval,/L>Is the frequency domain energy average value of all oscillation trailing wave peak points in the oscillation trailing effect characteristic sequence of the c-th frequency component wave peak point in the frequency domain waveform diagram of the x-th time sequence interval,/the frequency domain energy average value of all oscillation trailing wave peak points in the c-th frequency component wave peak point in the frequency domain waveform diagram of the x-th time sequence is->Is the frequency domain energy mean value of all frequency component peak points in the frequency domain waveform diagram of the xth time sequence interval,/L>The error parameter is avoided to be 0, and the experience value of the error parameter is 0.1;
audio noise factor for the xth time interval, +.>Frequency domain energy of fundamental frequency in frequency domain waveform diagram of xth time sequence interval, +.>Bandwidth of peak point of c-th frequency component in frequency domain waveform diagram of x-th time sequence interval, +.>The average value of the bandwidths of all the frequency component peak points in the frequency domain waveform diagram of the xth time sequence interval;
n is the noise disturbance index of the bird audio signal data, and N is the total number of time sequence intervals in the bird audio signal data.
When the difference of the frequency domain energy corresponding to each frequency component peak point in the frequency domain waveform diagram of the xth time sequence interval is smaller, the frequency domain energy corresponding to each frequency component peak point and the oscillation tailing effect corresponding to each frequency component peak point are smallerThe larger the difference in the frequency domain energy mean value should be, i.eThe larger the value of the frequency component difference in the spectrogram of the audio data in the time sequence interval is, the larger the difference between the frequency component difference and the energy generated by the oscillation tail is, the more likely the audio data is interfered by noise, and the larger the spectrum energy difference index is; the smaller the energy of the frequency domain corresponding to the fundamental frequency in the frequency domain waveform diagram of the xth time interval is, the smaller the bandwidth difference corresponding to each frequency component peak point is, i.e.)>The larger the value of the frequency component is, the smaller the ratio of the energy of the fundamental frequency in the time sequence interval to the total energy in the audio data is, and the smaller the bandwidth difference of the frequency components is, the larger the corresponding audio data receives noise interference degree is, and the larger the audio noise receiving factor is. The greater the audio noise level in the xth time sequence interval, the more complex the frequency domain energy variation, i.e +>The larger the value of (c) is, the more aliased noise in the audio and the greater the interference level, i.e. the greater the N, the greater the smoothing coefficient should be.
Thus, the noise disturbance index of the bird audio signal data is obtained.
Step S003, obtaining a wind noise interference adjustment coefficient according to bird audio signal data, obtaining a wiener filter smooth adjustment coefficient according to a noise disturbance index and the wind noise interference adjustment coefficient, and obtaining the wiener filter smooth coefficient according to the wiener filter smooth adjustment coefficient.
However, in the outdoor sound collection process, the wind sound in the environment is the wind sound with the greatest interference to the bird sound, and compared with other environmental noises, the wind sound has the characteristics of variable duration, irregular appearance time and the like, and when the wind sound appears in the environment and the wind sound is larger, the influence on the definition of the bird sound is larger, so that the recognition and monitoring of the birds are further influenced.
In order to ensure that the bird voice data can be finally and clearly identified from the field voice data, besides the adjustment of the filtering effect by judging the noise interference degree received by the audio data, the wind noise condition should be analyzed. Specifically, when bird song is found in the field audio data collected by the voiceprint sensing device, the time sequence interval corresponding to the bird song is possibly influenced by wind sound, and then the audio of the bird song is more unclear.
In the field audio data before bird song, when birds do not generate sound of behavioral activities, only environmental noise should exist in the audio data, which is usually relatively stable, and when wind starts to generate in the field environment, the original environmental noise of the field audio data generates a certain change relative to the previous one, and the aliasing condition of the specific audio is enhanced, the sound source is increased, and a new larger amplitude condition may be generated.
Specifically, since the amplitude of the audio generated by the bird song is much higher than the amplitude generated by the environmental noise under normal conditions, the amplitude of all the audio signals in the bird audio signal data is used as the input of the dyadic threshold method, the output of the dyadic threshold method is used as the bird song audio dividing threshold value of the bird audio signal data, the amplitude average value of all the audio signals in the bird audio signal data in each time sequence interval is calculated, each time sequence interval, of which the amplitude average value is greater than or equal to the bird song audio dividing threshold value, is used as one bird song existence time sequence interval, and each time sequence interval, of which the amplitude average value is smaller than the bird song audio dividing threshold value, is used as one ecological environment audio time sequence interval. The method of threshold value of Ojin is a well-known technique, and the process thereof is not described in detail in this application.
Calculating the wind-sound interference adjustment coefficient of the bird audio signal data:
in the method, in the process of the invention,a ventilating audio weight factor of the t th ecological environment audio time sequence interval,/a ventilating audio weight factor>The total number of the peak points of the frequency components in the frequency domain waveform diagram of the t time sequence interval is +.>The audio noise receiving factors of the t-th ecological environment audio time sequence interval and the time sequence intervals above the t-th ecological environment audio time sequence interval are respectively obtained;
wind noise interference degree index for the t-th time interval,>the average value of the amplitude values of all audio signals in the bird audio signal data in the t-th and y-th ecological environment audio time sequence intervals are respectively>Is the total number of the time sequence intervals of the ecological environment audio frequency>Total number of timing intervals for bird song, +.>The maximum amplitude point of the t-th ecological environment audio time sequence interval and the z-th bird song existence time sequence interval are respectively +.>Is a Euclidean distance function, ">The Euclidean distance between the maximum amplitude point of the time sequence interval of the t ecological environment audio frequency and the time sequence interval of the z-th bird song;
and D is the wind noise interference adjustment coefficient of the bird audio signal data.
When the noise reception of the t-th eco-environment audio time sequence interval is enhanced to some extent compared with the noise reception of the previous time sequence interval,the audio frequency in the time sequence interval of the ecological environment audio frequency is possibly added with new components, the stronger the aliasing effect of the time sequence interval of the ecological environment audio frequency is, and when the aliasing effect of the time sequence interval of the t-th ecological environment audio frequency is enhanced, the closer and more the time sequence interval is to the existence of bird song, the more the time sequence interval is>The larger the value of the (C) is, and the average value of the amplitude corresponding to the time sequence interval of the ecological environment audio is larger than the average value of the amplitude of the rest time sequence intervals of the ecological environment audio, namely the first summation factor +.>The greater the value of (2), i.e +.>The larger the time sequence interval is, the greater the possibility of existence of wind sound frequency is, and the greater the interference influence degree on the bird song frequency is. Thus, the greater the likelihood that bird behavioral audio is present in the field audio data>The greater the value of D, the greater the degree of wind noise interference with the bird song in the field audio data, the greater the filter smoothing coefficient should be increased for the audio to better enhance the portion of the bird song in the audio.
Further, according to the noise disturbance index and the wind noise disturbance adjustment coefficient of the bird audio signal data, a wiener filtering smooth adjustment coefficient is constructed:
in the method, in the process of the invention,smooth adjustment coefficients for wiener filtering, +.>The noise disturbance index of the bird audio signal data is represented by D, the wind noise disturbance adjustment coefficient of the bird audio signal data is represented by e, and the natural constant is represented by e.
When the influence degree of the bird voice frequency signal data by noise is larger, and the interference degree of the bird song voice frequency data by wind sound in the voice frequency is more obvious,the greater the value of (2), i.e +.>The larger the smoothing coefficient of wiener filtering should be increased.
Further, the adjusted wiener filter smoothing coefficients are calculated:
in the method, in the process of the invention,for the adjusted wiener filter smoothing coefficients, < >>For the initial wiener filter smoothing coefficients, the empirical value is 0.55,/for the smoothing coefficients>Smooth adjustment coefficients for wiener filtering, +.>To adjust the parameters, the empirical value was taken to be 0.5.
So far, the adjusted wiener filter smoothing coefficient is obtained.
And S004, acquiring bird audio signal data after the filtering enhancement based on the wiener filtering smoothing coefficient by using a wiener filtering algorithm.
The bird audio signal data is subjected to filter enhancement based on wiener filtering by using the adjusted wiener filtering smoothing coefficients, the bird audio signal data is used as input of a wiener filtering algorithm, the adjusted wiener filtering smoothing coefficients are used as smoothing parameters of the wiener filtering algorithm, and output of the wiener filtering algorithm is used as the bird audio signal data subjected to filter enhancement, and the wiener filtering algorithm is a known technology and is not repeated. A flow chart of an implementation of the present application is shown in fig. 2.
Thus, the filtering enhancement method for the ecological audio information is completed.
In this specification, each embodiment is described in a progressive manner, and identical and similar parts of each embodiment are all referred to each other, and each embodiment mainly describes differences from other embodiments. The foregoing description of the preferred embodiments of the present application is not intended to be limiting, but rather is intended to cover any and all modifications, equivalents, alternatives, and improvements within the principles of the present application.

Claims (10)

1. A filter enhancement method for eco-audio information, characterized in that the method comprises the steps of:
acquiring bird audio signal data;
acquiring a frequency domain waveform diagram of each time sequence interval according to bird audio signal data, and acquiring all frequency component peak points and oscillation trailing wave peak points in the frequency domain waveform diagram of each time sequence interval according to the frequency domain waveform diagram of each time sequence interval; acquiring an oscillation tailing effect characteristic sequence of each frequency component peak point in the frequency domain waveform diagram of each time sequence interval and the bandwidth of each frequency component peak point in the frequency domain waveform diagram of each time sequence interval according to the frequency component peak point and the oscillation tailing peak point; acquiring noise disturbance indexes of bird audio signal data according to the oscillation tailing effect characteristic sequence of each frequency component peak point in the frequency domain waveform diagram of each time sequence interval and the bandwidth of each frequency component peak point in the frequency domain waveform diagram of each time sequence interval;
acquiring all bird song existence time sequence intervals and ecological environment audio time sequence intervals according to the bird audio signal data; acquiring a wind-sound interference adjustment coefficient of bird audio signal data according to a bird song existence time sequence interval and an ecological environment audio time sequence interval; obtaining a wiener filter smooth adjustment coefficient according to the noise disturbance index and the wind noise disturbance adjustment coefficient of the bird audio signal data;
and acquiring wiener filter smoothing coefficients according to the wiener filter smoothing adjustment coefficients, and acquiring bird audio signal data after filter enhancement based on the wiener filter smoothing coefficients by using a wiener filter algorithm.
2. The method for enhancing the filtering of the ecological audio information according to claim 1, wherein the method for obtaining the frequency domain waveform of each time sequence interval according to the bird audio signal data and obtaining all the frequency component peak points and the oscillation trailing wave peak points in the frequency domain waveform of each time sequence interval according to the frequency domain waveform of each time sequence interval is as follows:
taking the interval of each first preset parameter in the acquisition time of the bird audio signal data as a time sequence interval;
taking bird audio signal data in each time sequence interval as input of Fourier transform, and obtaining frequency domain data of each time sequence interval by utilizing the Fourier transform; drawing the frequency domain data of each time sequence interval by utilizing MATLAB software to obtain a frequency domain waveform diagram of each time sequence interval, wherein the horizontal axis of the frequency domain waveform diagram is frequency, and the vertical axis of the frequency domain waveform diagram is frequency domain energy;
for the frequency domain waveform diagram of each time sequence interval, taking the frequency domain energy of all wave crest points in the frequency domain waveform diagram as input of an Ojin threshold method, and obtaining a division threshold value by using the Ojin threshold method; and taking each wave peak point with the frequency domain energy larger than or equal to the division threshold value in the frequency domain waveform as a frequency component wave peak point, and taking each wave peak point with the frequency domain energy smaller than the division threshold value in the frequency domain waveform as an oscillation trailing wave peak point.
3. The method for enhancing the filtering of the ecological audio information according to claim 1, wherein the method for obtaining the oscillation tailing effect characteristic sequence of each frequency component peak point in the frequency domain waveform diagram of each time sequence interval and the bandwidth of each frequency component peak point in the frequency domain waveform diagram of each time sequence interval according to the frequency component peak point and the oscillation tailing peak point is as follows:
taking each frequency component peak point in the frequency domain waveform diagram of each time sequence interval as a target frequency component peak point, and taking a sequence formed by all oscillation tailing peak points between the target frequency component peak point and the next frequency component peak point according to the ascending order of the frequencies as an oscillation tailing effect characteristic sequence of the target frequency component peak point;
and regarding the frequency domain waveform diagram of each time sequence interval, taking the frequency domain energy which is the frequency domain energy multiplied by the second preset parameter of the minimum frequency domain energy of all the frequency component peak points in the frequency domain waveform diagram as bandwidth construction energy, and taking the width of the waveform corresponding to each frequency component peak point, where the transverse straight line where the bandwidth construction energy is located, passing through each frequency component peak point as the bandwidth of each frequency component peak point in the frequency domain waveform diagram.
4. The method for enhancing filtering of ecological audio information according to claim 1, wherein the method for obtaining noise disturbance index of bird audio signal data according to the oscillation tailing effect characteristic sequence of each frequency component peak point in the frequency domain waveform diagram of each time sequence interval and the bandwidth of each frequency component peak point in the frequency domain waveform diagram of each time sequence interval comprises the following steps:
taking the difference value of the frequency domain energy of each frequency component peak point in the frequency domain waveform diagram and the frequency domain energy average value of all oscillation trailing wave peak points in the oscillation trailing effect characteristic sequence of each frequency component peak point as a molecule for the frequency domain waveform diagram of each time sequence interval; calculating the absolute value of the difference between the frequency domain energy of each frequency component peak point and the frequency domain energy mean value of all frequency component peak points in the frequency domain waveform chart, and taking the sum of the absolute value and a third preset parameter as a denominator; taking the average value of the sum of the absolute values of the ratios of the numerator and the denominator on the frequency domain waveform diagram as the spectrum energy difference index of the time sequence interval;
acquiring an audio noise receiving factor of each time sequence interval according to the frequency spectrum energy difference index of each time sequence interval;
and taking the average value of the sum of the spectrum energy difference index and the audio noise factor and the sum of the spectrum energy difference index and the audio noise factor over all time sequence intervals as the noise disturbance index of the bird audio signal data.
5. The filtering enhancement method for ecological audio information as claimed in claim 4, wherein the method for obtaining the audio noise receiving factor of each time interval according to the spectrum energy difference index of each time interval is as follows:
taking a spectrum energy difference index of each time sequence interval as a molecule for the frequency domain waveform diagram of each time sequence interval;
calculating the absolute value of the difference between the bandwidth of each frequency component peak point in the frequency domain waveform diagram and the bandwidth average value of all frequency component peak points, calculating the accumulation sum of the absolute value on the frequency domain waveform diagram, and taking the product of the accumulation sum and the frequency domain energy of the fundamental frequency in the frequency domain waveform diagram as a denominator;
the ratio of the numerator to the denominator is used as the audio noise factor of the time sequence interval.
6. The method for enhancing filtering of ecological audio information according to claim 1, wherein the method for acquiring all of the bird song presence timing intervals and the ecological environment audio timing intervals from the bird audio signal data is as follows:
taking the amplitude values of all audio signals in the bird audio signal data as the input of an Ojin threshold method, and obtaining a bird song audio dividing threshold value by using the Ojin threshold method;
calculating the average value of the amplitude values of all audio signals in the bird audio signal data in each time sequence interval, taking each time sequence interval with the average value of the amplitude values being more than or equal to a bird song audio frequency dividing threshold value as a bird song existence time sequence interval, and taking each time sequence interval with the average value of the amplitude values being less than the bird song audio frequency dividing threshold value as an ecological environment audio frequency time sequence interval.
7. The filtering enhancement method for ecological audio information according to claim 1, wherein the method for acquiring the wind noise interference adjustment coefficient of the bird audio signal data according to the bird song presence time sequence interval and the ecological environment audio time sequence interval is as follows:
taking each ecological environment audio time sequence interval as a target ecological environment audio time sequence interval, and taking the total number of frequency component peak points in a frequency domain waveform diagram of the target ecological environment audio time sequence interval as bird behavior audio weight factors of the target ecological environment audio time sequence interval when the audio noise receiving factor corresponding to the target ecological environment audio time sequence interval is smaller than or equal to the audio noise receiving factor of the last time sequence interval; when the audio noise factor corresponding to the target ecological environment audio time sequence interval is larger than the audio noise factor of the previous time sequence interval, calculating the difference value of the audio noise factor corresponding to the target ecological environment audio time sequence interval and the audio noise factor of the previous time sequence interval, and taking the sum of the total number of the frequency component peak points and the difference value as a wind-up audio weight factor of the target ecological environment audio time sequence interval;
acquiring a wind sound interference degree index of the target ecological environment audio time sequence interval according to a wind sound weighting factor of the target ecological environment audio time sequence interval;
and taking the average value of the wind noise interference degree indexes corresponding to all the ecological environment audio time sequence intervals as a wind noise interference adjustment coefficient of the bird audio signal data.
8. The method for enhancing filtering of audio information according to claim 7, wherein the method for obtaining the wind noise interference level index of the target audio time sequence section according to the wind noise weight factor of the target audio time sequence section comprises:
calculating the amplitude average value of all audio signals in the bird audio signal data in each ecological environment audio time sequence interval, calculating the absolute value of the difference between the amplitude average value corresponding to the target ecological environment audio time sequence interval and the amplitude average value corresponding to each ecological environment audio time sequence interval, and taking the natural constant as a base number and taking the mapping result of the sum of the absolute value on the total number of the ecological environment audio time sequence intervals as an index as a first summation factor;
taking the air-blast audio weight factors of the target ecological environment audio time sequence interval as molecules; calculating a measurement distance between a maximum amplitude point of the target ecological environment audio time sequence interval and a maximum amplitude point of each bird song existence time sequence interval, and taking the sum of the measurement distances on the total number of the bird song existence time sequence intervals as a denominator;
and taking the sum of the ratio of the numerator and the denominator and the first summation factor as an index of the wind noise interference degree of the target ecological environment audio time sequence interval.
9. The method for enhancing the filtering of the ecological audio information according to claim 1, wherein the method for obtaining the wiener filter smoothing adjustment coefficient according to the noise disturbance index and the wind noise disturbance adjustment coefficient of the bird audio signal data is as follows:
and calculating the product of the noise disturbance index and the wind noise disturbance adjustment coefficient of the bird audio signal data, and taking the product as a wiener filtering smooth adjustment coefficient.
10. The method for enhancing the filtering of the ecological audio information according to claim 1, wherein the method for obtaining wiener filter smoothing coefficients according to wiener filter smoothing adjustment coefficients and obtaining the bird audio signal data after the filtering enhancement based on the wiener filter smoothing coefficients by using a wiener filter algorithm is as follows:
calculating the sum of the wiener filter smoothing adjustment coefficient and a fourth preset parameter, and taking the product of the sum and the preset smoothing coefficient as the wiener filter smoothing coefficient;
and taking the bird audio signal data and the wiener filter smoothing coefficient as inputs of a wiener filter algorithm respectively, and obtaining the bird audio signal data after the filter enhancement by using the wiener filter algorithm.
CN202410179556.5A 2024-02-18 2024-02-18 Filtering enhancement method for ecological audio information Active CN117727314B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202410179556.5A CN117727314B (en) 2024-02-18 2024-02-18 Filtering enhancement method for ecological audio information

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202410179556.5A CN117727314B (en) 2024-02-18 2024-02-18 Filtering enhancement method for ecological audio information

Publications (2)

Publication Number Publication Date
CN117727314A true CN117727314A (en) 2024-03-19
CN117727314B CN117727314B (en) 2024-04-26

Family

ID=90205709

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202410179556.5A Active CN117727314B (en) 2024-02-18 2024-02-18 Filtering enhancement method for ecological audio information

Country Status (1)

Country Link
CN (1) CN117727314B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103440871A (en) * 2013-08-21 2013-12-11 大连理工大学 Method for suppressing transient noise in voice
CN105513605A (en) * 2015-12-01 2016-04-20 南京师范大学 Voice enhancement system and method for cellphone microphone
CN108899042A (en) * 2018-06-25 2018-11-27 天津科技大学 A kind of voice de-noising method based on mobile platform
US20190206420A1 (en) * 2017-12-29 2019-07-04 Harman Becker Automotive Systems Gmbh Dynamic noise suppression and operations for noisy speech signals
CN112750442A (en) * 2020-12-25 2021-05-04 浙江弄潮儿智慧科技有限公司 Nipponia nippon population ecosystem monitoring system with wavelet transformation and wavelet transformation method thereof
WO2021197566A1 (en) * 2020-03-30 2021-10-07 Harman Becker Automotive Systems Gmbh Noise supression for speech enhancement

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103440871A (en) * 2013-08-21 2013-12-11 大连理工大学 Method for suppressing transient noise in voice
CN105513605A (en) * 2015-12-01 2016-04-20 南京师范大学 Voice enhancement system and method for cellphone microphone
US20190206420A1 (en) * 2017-12-29 2019-07-04 Harman Becker Automotive Systems Gmbh Dynamic noise suppression and operations for noisy speech signals
CN108899042A (en) * 2018-06-25 2018-11-27 天津科技大学 A kind of voice de-noising method based on mobile platform
WO2021197566A1 (en) * 2020-03-30 2021-10-07 Harman Becker Automotive Systems Gmbh Noise supression for speech enhancement
CN112750442A (en) * 2020-12-25 2021-05-04 浙江弄潮儿智慧科技有限公司 Nipponia nippon population ecosystem monitoring system with wavelet transformation and wavelet transformation method thereof

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
郭业才;陈小燕;王超;: "LCMV分频的改进维纳滤波后置波束形成算法", 电子测量与仪器学报, no. 10, 15 October 2017 (2017-10-15) *

Also Published As

Publication number Publication date
CN117727314B (en) 2024-04-26

Similar Documents

Publication Publication Date Title
KR101034831B1 (en) System for suppressing wind noise
US9916841B2 (en) Method and apparatus for suppressing wind noise
CN106052852B (en) A kind of detection method and device of pulse acoustical signal
CN117033911B (en) Step counting analysis method based on intelligent glasses data
CN105590630B (en) Orientation noise suppression method based on nominated bandwidth
CN113642484B (en) Magnetotelluric signal noise suppression method and system based on BP neural network
CN110974189A (en) Method, device, equipment and system for detecting signal quality of pulse wave
CN106504760B (en) Broadband ambient noise and speech Separation detection system and method
WO2014206265A1 (en) Harmonic analysis method and device and inter-harmonic clutter determination method and device
CN103778921A (en) Method for eliminating nonuniform noise in speech collected by radar
CN116626408A (en) Power supply ripple noise detection method based on machine learning
CN117727314B (en) Filtering enhancement method for ecological audio information
CN112183407B (en) Tunnel seismic wave data denoising method and system based on time-frequency domain spectral subtraction
CN111839847B (en) Myoelectric prosthetic limb control method, myoelectric prosthetic limb control device, electronic equipment and computer readable medium
CN109542021A (en) A kind of sensor weak signal data acquisition method and device
US10013992B2 (en) Fast computation of excitation pattern, auditory pattern and loudness
Qiao et al. Spectral entropy based dolphin whistle detection algorithm and its possible application for biologically inspired communication
CN113375065B (en) Method and device for eliminating trend signal in pipeline leakage monitoring
CN112908344B (en) Intelligent bird song recognition method, device, equipment and medium
CN113948088A (en) Voice recognition method and device based on waveform simulation
CN113838476A (en) Noise estimation method and device for noisy speech
CN114970647B (en) Electromagnetic data identification method and system based on probabilistic neural network
CN117727313A (en) Intelligent noise reduction method for wild bird sound data
CN117727332B (en) Ecological population assessment method based on language spectrum feature analysis
CN116935880B (en) Integrated machine man-machine interaction system and method based on artificial intelligence

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant