CN105810213A

CN105810213A - Typical abnormal sound detection method and device

Info

Publication number: CN105810213A
Application number: CN201410850883.5A
Authority: CN
Inventors: 高娅; 乔刚; 张兴明
Original assignee: Zhejiang Dahua Technology Co Ltd
Current assignee: Zhejiang Dahua Technology Co Ltd
Priority date: 2014-12-30
Filing date: 2014-12-30
Publication date: 2016-07-27

Abstract

The invention discloses a typical abnormal sound detection method and a device, relates to the voice signal processing field and particularly provides an abnormal sound detection method based on spectrograms. According to the method, a sound signal is acquired, the sound signal is pre-processed, and effective signal segments of the sound signal are determined; the spectrograms of the effective signal segments are acquired, a to-be-tested identification characteristic matrix is determined, and the to-be-tested identification characteristic matrix is used for representing sound intensity distribution of the sound signal on the time-frequency domain; similarities between the to-be-tested identification characteristic matrix and each standard identification characteristic matrix in an abnormal sound model database are calculated, and abnormal sound types of the effective signal segments are determined according to calculation results; computational complexity in an abnormal sound identification process is simplified, and detection accuracy is improved.

Description

A kind of typical case's abnormal sound detection method and device

Technical field

The present invention relates to field of voice signal, particularly relate to a kind of typical case's abnormal sound detection method and device.

Background technology

Abnormal sound refers to lower sound that should not occur of home, is often referred in the normal productive life of the mankind, the sound relevant to people's security of the lives and property occurred suddenly.Abnormal sound detection refers to the end points (beginning and end) finding out abnormal sound from one section of continuous print acoustical signal, and is extracted from which by effective abnormal sound signal.The aspect ratio of glass breaking sound is more apparent, detection is got up more convenient in actual applications, and general all along with glass breaking sound when blasting in some occasions such as market, bus, street etc., therefore carry out detecting certain practical value as a kind of typical abnormal sound using glass breaking sound.

Technology in the outer speech recognition technology being both along about the characteristic parameter adopted in the Study of recognition of abnormal sound of Present Domestic and parameter, the correlational study about abnormal sound detection mainly has:

First kind of way: carry out abnormal sound detection based on cepstral analysis, namely mainly adopts cepstral domain feature parameter to carry out the feature description of sound.Generally the research of glass breaking sound is continued to use mostly the technology of speech recognition, utilize mel cepstrum coefficients, short-time energy etc. are as the eigenvalue of this abnormal sound, to the eigenvalue extracted, utilize some models, such as HMM, neural network model, gauss hybrid models etc., are trained eigenvalue identifying, detect glass breaking sound.But only when the output of each passband is added up impartial, mel cepstrum coefficients could express maximum quantity of information, and in fact, abnormal sound distribution in frequency band is not necessarily equally distributed, and utilize the mode of training to identify, complexity is higher, can be subject to certain restrictions in actual applications, and glass breaking sound can not be well identified by the method for therefore traditional speech recognition.

The second way: voice data is divided into 50ms mono-frame, using ZCR (zero-crossing rate), LPC (linear prediction), LPCC (linear prediction residue error) etc. as feature set, gauss hybrid models is as Multilayer Classifier, its core manifestation is in multi-level categorizing process, ground floor distinguishes background sound, the second layer distinguishes voice and non-voice, third layer identification abnormal sound.Although this method is it can be considered that the impact of background noise, but it is extremely complex to realize process, it is not easy to be applied to video and the Voice Surveillance field of reality.

Summary of the invention

The embodiment of the present invention provides a kind of typical case's abnormal sound detection method and device, it is possible to the feature based on sound spectrogram detects abnormal sound, simplifies the computation complexity in exceptional sound recognition process, improves the accuracy rate of detection.

The embodiment of the present invention provides a kind of typical case's abnormal sound detection method, and the method includes:

Collected sound signal, carries out pretreatment to described acoustical signal, it is determined that the useful signal fragment of described acoustical signal；

Obtaining the sound spectrogram of described useful signal fragment and determine identification eigenmatrix to be tested according to described sound spectrogram, described identification eigenmatrix to be tested is for representing the distribution situation of acoustical signal intensity of sound on time-frequency domain；

Calculate the similarity of described identification eigenmatrix to be tested and each standard identification eigenmatrix in abnormal sound model library, determine the abnormal sound type in described useful signal fragment according to result of calculation.

The embodiment of the present invention also provides for a kind of typical case's abnormal sound detecting device, and this device includes:

Collecting unit, for collected sound signal, carries out pretreatment to described acoustical signal, it is determined that the useful signal fragment of described acoustical signal；Obtain the sound spectrogram of described useful signal fragment；

Determining unit, for determining the identification eigenmatrix of abnormal sound sound spectrogram, described identification eigenmatrix to be tested is for representing the distribution situation of acoustical signal intensity of sound on time-frequency domain；

Computing unit, for calculating the similarity of described identification eigenmatrix to be tested and each standard identification eigenmatrix in abnormal sound model library, determines the abnormal sound type in described useful signal fragment according to result of calculation.

Visible, the embodiment of the present invention does not use the method for traditional speech recognition to detect abnormal sound, but uses sound spectrogram that abnormal sound is detected.Avoid in characteristic extraction procedure owing to using the extraction such as mel cepstrum coefficients of the speech characteristic value of routine, cause because of abnormal sound uneven distribution in frequency band, cause that the output statistics of each passband is unequal, so that mel cepstrum coefficients can not well characterize the feature of sound, and the problem that the recognition accuracy caused is not high.It addition, the method that the embodiment of the present invention adopts sound spectrogram detects abnormal sound, computation complexity is relatively low, makes it possible to well be applied in actual field of video monitoring, expands the scope used.

Accompanying drawing explanation

In order to be illustrated more clearly that the technical scheme in the embodiment of the present invention, below the accompanying drawing used required during embodiment is described is briefly introduced, apparently, accompanying drawing in the following describes is only some embodiments of the present invention, for those of ordinary skill in the art, under the premise not paying creative work, it is also possible to obtain other accompanying drawing according to these accompanying drawings.

The schematic flow sheet of a kind of typical case's abnormal sound detection method that Fig. 1 provides for the embodiment of the present invention；

The schematic flow sheet of a kind of abnormal sound detection method based on sound spectrogram that Fig. 2 provides for the embodiment of the present invention；

Fig. 3 extracts the schematic flow sheet of process for the identification eigenmatrix a kind of to be tested that the embodiment of the present invention provides；

A kind of schematic flow sheet calculating similarity that Fig. 4 provides for the embodiment of the present invention；

The structural representation of a kind of typical case's abnormal sound detecting device that Fig. 5 provides for the embodiment of the present invention.

Detailed description of the invention

In order to make the object, technical solutions and advantages of the present invention clearly, below in conjunction with accompanying drawing, the present invention is described in further detail, it is clear that described embodiment is only a part of embodiment of the present invention, rather than whole embodiments.Based on the embodiment in the present invention, all other embodiments that those of ordinary skill in the art obtain under not making creative work premise, broadly fall into the scope of protection of the invention.

The embodiment of the present invention proposes a kind of method that feature based on sound spectrogram detects abnormal sound.For acoustic information, people rely on tone color, pitch, three features of volume to make a distinction, and these features all can by the parameter characterization in sound spectrogram time-frequency structure.Sound spectrogram is a kind of three-dimensional perception figure describing acoustical signal, is made up of frequency, time, three dimensional information of intensity of sound.And the time-frequency structure of different abnormal sound sound spectrograms towards with fine degree in have significant difference, different abnormal sound signals has the announcement of essence simultaneously possess again good distinctive.Using sound spectrogram as primitive character source, according to certain rule, extract the eigenmatrix of sound spectrogram, by calculating the distance identified between eigenmatrix in identification eigenmatrix to be tested and abnormal sound model library, detect abnormal sound.The embodiment of the present invention can better characterize the feature of sound, improves the accuracy rate of detection, and identifies that process computation is fairly simple, and complexity is relatively low, it is possible to is well applied in actual video monitoring.

Fig. 1 illustrates the schematic flow sheet of a kind of typical abnormal sound detection method that the embodiment of the present invention provides, as it is shown in figure 1, the method may include that

Step 11: collected sound signal, carries out pretreatment to acoustical signal, extracts the useful signal fragment of acoustical signal.

Step 12: obtaining the sound spectrogram of this useful signal fragment and extract identification eigenmatrix to be tested according to this sound spectrogram, this identification eigenmatrix to be tested is for representing the distribution situation of acoustical signal intensity of sound on time-frequency domain.

Step 13: calculate the similarity of identification eigenmatrix to be tested and each standard identification eigenmatrix in abnormal sound model library, determine the abnormal sound type in useful signal fragment according to result of calculation.

Optionally, in above-mentioned steps 12, the oscillogram of useful signal fragment is converted to the sound spectrogram being made up of the frequency of useful signal fragment, time and three dimensional information of intensity of sound；Extract the identification eigenmatrix to be tested being used for characterizing the intensity of sound distribution situation of sound spectrogram.

Optionally, in above-mentioned steps 13, calculate the similarity of identification eigenmatrix to be tested and each standard identification eigenmatrix in abnormal sound model library, by the noise type corresponding to the standard identification eigenmatrix in the abnormal sound model library maximum with identifying eigenmatrix similarity, it is determined that for the abnormal sound type of useful signal fragment.

Optionally, in above-mentioned steps, useful signal fragment windowing is divided into several frames；Each frame being carried out Short Time Fourier Transform, obtains the spectrum information of this frame, this spectrum information is for representing the relation between the frequency of this frame and intensity of sound；Connecting the spectrum information of all frames, obtain the sound spectrogram of useful signal fragment, sound spectrogram is made up of several points, and (x, y) is used for representing that this point is in the x moment coordinate of any point, intensity of sound corresponding in y frequency.

Optionally, in above-mentioned steps, frame at each point in sound spectrogram, frequency and intensity of sound, extract sound spectrogram identification eigenmatrix；For each frame, extract intensity of sound value front K frequency values from big to small in sound spectrogram matrix；According to the relative position in time domain of each frequency values in K frequency values, generate the identification eigenmatrix of intensity of sound distribution situation for characterizing sound spectrogram.

Optionally, in above-mentioned steps 13, for the either standard identification eigenmatrix in described abnormal sound model library, element by the last string of the first row of the element of the first row first row of described standard identification eigenmatrix to described standard identification eigenmatrix, determine respectively and in described identification eigenmatrix to be tested difference between the value of the first row first row less than each element in the described standard identification eigenmatrix of predetermined threshold value, carry out each element present position respectively as starting point setting window operation, respectively obtain the window matrix that this window covers, this window is identical with the dimension of identification eigenmatrix to be tested；Calculate business's matrix of each window matrix and identification eigenmatrix to be tested, each row element in described business's matrix is multiplied by default weighted value respectively, obtain the similarity of each window matrix and described identification eigenmatrix to be tested；Whole similarity according to each standard identification eigenmatrix in calculated described identification eigenmatrix to be tested and abnormal sound model library, is defined as the similarity of described standard identification eigenmatrix and described identification eigenmatrix to be tested by similarity maximum for the value in whole similarities.

Optionally, in above-mentioned steps 11, acoustical signal is carried out preemphasis process, obtain the acoustical signal after preemphasis；Acoustical signal after preemphasis is carried out framing windowing process, useful signal fragment will be defined as in short time period self-energy value more than one section of acoustical signal of predetermined threshold value.

It should be noted that, the abnormal sound data base that the noise model used in this method can be through investigating the related data arranging the research of domestic and international abnormal sound and obtains includes various unlike material, different size, glass breaking sound when different-thickness, in order to close to actual use occasion, be also added into the background noise of typical public place: railway station and large-scale square.The embodiment of the present invention only carries out citing with above-mentioned noise model and describes, but the embodiment of the present invention is not limited only to detection glass breaking sound, it is also possible to be preferably applicable to detection and the identification of other typical case's abnormal sounds.

Below the embodiment of the present invention is described in detail.

Fig. 2 illustrates the schematic flow sheet of a kind of abnormal sound detection method based on sound spectrogram that the embodiment of the present invention provides, as in figure 2 it is shown, the method may include that

Step 21: collected sound signal, carries out pretreatment to the acoustical signal collected, and extracts useful signal fragment.

Concrete, acoustical signal is carried out preemphasis process, obtains the acoustical signal after preemphasis；Acoustical signal after preemphasis is carried out framing windowing process, useful signal fragment will be defined as in short time period self-energy value more than one section of acoustical signal of predetermined threshold value.

When implementing, in embodiments of the present invention the acoustical signal collected is carried out pretreatment and can include preemphasis, end-point detection and framing windowing etc..In the embodiment of the present invention, first, the acoustical signal collected is carried out preemphasis, utilize the HFS of the specific high pass filter acoustical signal to collecting to compensate；Then, acoustical signal after preemphasis is carried out end-point detection, determine the starting point of useful signal in this acoustical signal, in general, would be likely to occur the quiet or blank of a period of time due in the acoustical signal that collects, in order to improve the detection efficiency of abnormal sound, (namely the embodiment of the present invention can be determined by the effective acoustic information in acoustical signal, useful signal) starting point, then useful signal is carried out abnormal sound coupling or detection again；Subsequently, to having determined that the useful signal of starting point characteristic parameter within the specific limits carries out framing windowing process so that it is meet statistical property steady.

Step 22: the oscillogram of useful signal fragment is converted to the sound spectrogram being made up of the frequency of useful signal fragment, time and three dimensional information of intensity of sound.

When implementing, sound spectrogram is widely used in analysis and the process of audio signal.It is X-Y scheme, although each corresponding three-dimensional value on it.Its transverse axis is the time, and the longitudinal axis is frequency.(x, y) corresponding point represents that, at moment x, the intensity of sound on frequency y, this is showed by different colors with coordinate.From the sound spectrogram of acoustical signal, it is possible to investigate out distribution and the situation of change of intensity of sound within the scope of whole T/F；And this is cannot to present in oscillogram.For obtaining sound spectrogram, acoustical signal is divided into very short frame, and consecutive frame has certain overlap.Then each frame is done Short Time Fourier Transform and obtains the spectrum information of correspondence, owing to sound spectrogram is made up of frequency, time, three dimensional information of intensity of sound, it is therefore desirable to the value of intensity of sound is calculated.Spectrum information connects into complete sound spectrogram the most at last.

For example, useful signal fragment windowing is divided into several frames by the embodiment of the present invention；Each frame being carried out Short Time Fourier Transform, obtains the spectrum information of this frame, spectrum information is for representing the relation between the frequency of this frame and intensity of sound；Connecting the spectrum information of all frames, obtain the sound spectrogram of useful signal fragment, sound spectrogram is made up of several points, and (x, y) is used for representing that this point is in the x moment coordinate of any point, intensity of sound corresponding in y frequency.In embodiments of the present invention, the clock signal of sound being carried out Fourier's change in short-term, the length of Fourier transformation is 2N point, and the signal of so each frame can obtain the frequency spectrum that length is N, and the sound pressure level of every bit is expressed as: P=20*log₁₀|x(1/N)|

Wherein, P is the sound pressure level of this point, and x is the spectrum value of this frame signal.

Step 23: distinguish abnormal sound and background noise according to sound spectrogram.

When implementing, it is found through experiments, when glass breaking occurs, owing to different tensions produces some overtones, therefore in sound spectrogram except including the harmonics of fundamental frequency integral multiple, also have non-integral multiple harmonics, the time-frequency structure of sound spectrogram shows as each frequency range the band of irregular ripples grain pattern occurs.And the background noise energy of public place is evenly distributed, its sound spectrogram does not have obvious time-frequency result, and sound spectrogram therefore can be utilized to remove the noise impact on voice recognition.It should be noted that the embodiment of the present invention is not limited only to produced abnormal sound during detection glass breaking, it is also with various abnormal sound model and other various types of abnormal sounds are detected and identify.

Step 24: frame at each point in sound spectrogram, frequency and intensity of sound, generates sound spectrogram matrix.

When implementing, each point in sound spectrogram actually corresponds to the intensity of sound value in preset time, given frequency, therefore selects sound spectrogram can better show the time-frequency structure of acoustical signal as primitive character source.

For example, frequency spectrum is pressed predefined parameter discretization by the process of Fourier transformation, frequency spectrum array corresponding for every frame is coupled together, is the formation of sound spectrogram matrix.Wherein, the row in sound spectrogram matrix are corresponding to the frame on the time period, and the row in sound spectrogram matrix is corresponding to the frequency on frequency band.

Step 25: extract the identification eigenmatrix to be tested being used for characterizing the intensity of sound distribution situation of sound spectrogram from sound spectrogram matrix.

When implementing, identification eigenmatrix to be tested is extracted from sound spectrogram matrix, because the frequency that intensity is less is little on the impact of sound effect, therefore for each frame data, it is ranked up by the intensity level according to each frequency, it is dynamically K frequency values of every frame preservation maximum intensity, ignore other frequency, minimum space so can be used to preserve the most significant information, greatly reduce the memory space of feature, so it is possible not only to reduce computation complexity, and useful information can be preserved preferably, do not lose the representational of content.

Due to when generating sound spectrogram, the frequency domain set is unified, discrete is uniform, what therefore identical in sound spectrogram matrix line identifier was corresponding is identical frequency, the time dependent situation of the frequency being identical that identical row is corresponding, therefore, line identifier is equivalent to the strongest frequency relative position on frequency domain.Fig. 3 illustrates that the identification eigenmatrix a kind of to be tested that the embodiment of the present invention provides extracts the schematic flow sheet of process, as it is shown on figure 3, this process can include but not limited to following steps:

First, sound spectrogram matrix finds the line identifier that in each column, maximum is corresponding (such as, what the first row was corresponding is designated A, what the second row was corresponding is designated B, what the third line was corresponding is designated C, ... by that analogy, repeat no more here), these line identifiers are stored in the first row identifying eigenmatrix.Arranging corresponding to the frame on the time period, row is corresponding to the frequency on frequency band, and therefore line identifier can regard the conversion expression-form of frequency values as, say, that line identifier is the positional information that peak frequency is corresponding；Different line identifiers is stored in identify eigenmatrix the first row can be understood as identification eigenmatrix in store the different frequency value that diverse location is corresponding.If it should be noted that there is multiple maximum, then preserve minimum line identifier (that is, line identifier from small to large is followed successively by A～Z).Secondly, find out the line identifier that Second Largest Value in each column is corresponding, be stored in the second row identifying eigenmatrix, by that analogy, repeat K time altogether, constitute the eigenmatrix of K row, and its columns is equal to frame number.Owing to experiment finds, when the value of K is more than 5, the impact of experimental result is little, therefore the value of K can be defined as 5 by the embodiment of the present invention, so, is possible not only to reduce computation complexity, moreover it is possible to well represent the feature of sound.Further, what last column of identification eigenmatrix to be tested was deposited is the short-time average magnitude value that front 5 maximums are corresponding, constitutes the identification eigenmatrix to be tested of (K+1) * frame number.And the line identifier deposited in identification eigenmatrix to be tested represents the strongest frequency relative position in frequency domain.

Step 26: calculate the similarity of identification eigenmatrix to be tested and each standard identification eigenmatrix in abnormal sound model library.

When implementing, owing to the data in identification eigenmatrix to be tested are not continuous print characteristic vector values, Euclidean distance can not well represent the characteristic of two matrixes, and therefore the embodiment of the present invention calculates the similarity of two matrixes in the following ways.

Preferably, abnormal sound model library storage in embodiments of the present invention has the standard identification eigenmatrix corresponding to various types of typical abnormal sound template types.

It should be noted that in embodiments of the present invention, only it is only used as, with following steps S41～step S47, the citing calculating similarity and is described in detail, the method being not limited to step S41～step S47 when practical application.In abnormal sound detection process, after performing step 26, continue executing with step 27.

Fig. 4 illustrates a kind of schematic flow sheet calculating similarity that the embodiment of the present invention provides, and as shown in Figure 4, this process may include that

Step S41: judge that in identification eigenmatrix to be tested, whether the value of the first row first row element is closest with the value of the first row i-th column element of the standard identification eigenmatrix in abnormal sound model library, if so, then performs step S43；Otherwise, step S42 is performed.

Step S42: sliding window is moved rearwards by 1 row by the i-th row in the standard identification eigenmatrix in abnormal sound model library, makes i=i+1.And perform S41 (that is, what now compare is the value that in identification eigenmatrix to be tested, the first row i-th (i=i+1) of the standard identification eigenmatrix in the value of the first row first row element and abnormal sound model library arranges), until the standard identification eigenmatrix in abnormal sound model library finding the value of string and identifying that in eigenmatrix, the value of the first row first row element is closest.

Step S43: i-th in the standard identification eigenmatrix in abnormal sound model library is arranged the first row as sliding window, using the dimension of the identification eigenmatrix to be tested dimension as sliding window, it is determined that go out the matrix of and identification eigenmatrix same dimension to be tested.

Step S44: calculate business's matrix of sliding window place matrix and identification eigenmatrix to be tested.

For example, sliding window the data point of element in a matrix divided by the value of the element being in same position in identification eigenmatrix to be tested, element each in two matrixes is sequentially carried out a division operation, thus obtaining business's matrix of sliding window place matrix and identification eigenmatrix to be tested, if the element value being in two matrix same positions is equal, then it is designated as 1 in the value of business's matrix corresponding position.

Step S45: the similarity according to business's matrix calculus sliding window place matrix Yu identification eigenmatrix to be tested.

Step S46: judge whether this window moves to the last string of the first row of standard identification eigenmatrix, if so, then continues executing with step S47, otherwise, performs step S42.

When implementing, until this sliding window moves to the final position (the last string position of the first row) of standard identification eigenmatrix, obtain each similarity of this window place matrix and described identification eigenmatrix to be tested.

It should be noted that, in abnormal sound model library, storage has various typical case's abnormal sound model, each corresponding standard identification eigenmatrix corresponding with the characteristic of this typical case's abnormal sound of typical case's abnormal sound model, only any one the standard identification eigenmatrix in abnormal sound model library is mated by above-mentioned steps S41～step S46, in order to mate with the typical abnormal sound model of every kind in abnormal sound model library, the embodiment of the present invention can also for other typical case's abnormal sound model of typical case's abnormal sound model, perform above-mentioned steps S41～step S46 successively, until to the whole typical case's abnormal sound models in abnormal sound model library (namely, whole standard identification eigenmatrixes) it is sequentially carried out coupling, so that it is determined that go out the abnormal sound type of useful signal fragment.After to standard identification eigenmatrix standard identification eigenmatrix, continue executing with step S47.

Step S47: similarity maximum for the value in each similarity is defined as the similarity of described standard identification eigenmatrix and described identification eigenmatrix to be tested.

In embodiments of the present invention, to ensure that the dimension of two matrixes is equal when calculating business's matrix, and the duration of generally corresponding than each standard identification eigenmatrix in the abnormal sound model library sound of the duration of identification eigenmatrix correspondence sound to be tested is short, it is thus desirable to find out the dimension identical with identification eigenmatrix to be tested, one can be utilized always to start to slide into the last string of the first row with the first row first row of the sliding window standard identification eigenmatrix from abnormal sound model library identifying eigenmatrix same dimension, and compare one by one.In order to improve computational efficiency, first find with the first row the first train value close proximity in identification eigenmatrix to be tested as original position, mark the window matrix of a piece and identification eigenmatrix formed objects to be tested, calculate business's matrix of identification eigenmatrix to be tested and window matrix.Due to the first row the strongest frequency values of correspondence, corresponding strong frequency values of the second row in identification eigenmatrix to be tested, and so on.As a rule, intensity is more weak, is more easily subject to external influence, and error is more big.Treat it is thus desirable to each row to be regarded as different significance levels, it is possible to being multiplied by weights for each row, intensity is more little, weights are more little, through many experiments, the embodiment of the present invention can be set to secondary from first row to last leu for the weight (that is, each row element in business's matrix being multiplied by default weighted value respectively) of the business's matrix calculated: 1,1,0.8,0.6,0.5,0.6, thus calculating the similarity of two matrixes.Followed by moving window, if first position is not right in window, avoid the need for continuing to compare backward, direct moving window, until finding new original position, therefore calculated similarity is likely to more than one, select similarity maximum as such, it is desirable to continue executing with step 27.It should be noted that default weighted value in embodiments of the present invention only illustrates for above-mentioned more preferably value, this default weighted value can also be other value, no longer repeats one by one here.

For example, standard identification eigenmatrix in described abnormal sound model library finds and the immediate numerical value present position of value of the first row first row in described identification eigenmatrix to be tested, and this position is set window as starting point, this window is identical with the dimension of identification eigenmatrix to be tested, calculate business's matrix of this window and identification eigenmatrix to be tested, by the business's matrix calculated and default weighted value, obtain the similarity of window and identification eigenmatrix to be tested.Then continuing to moving window, if the first of new window position is kept off with recognition matrix to be tested, then continuing to be moved rearwards by window without comparing, until finding new original position.So can obtain more than one similarity, using wherein maximum as the Similarity value identifying eigenmatrix and identification eigenmatrix to be tested in model library.Further, each the standard identification eigenmatrix in abnormal sound model library is performed aforesaid operations, and obtains several similarities.

Step 27: by the abnormal sound model corresponding to each standard identification eigenmatrix in the abnormal sound model library maximum with identification eigenmatrix similarity to be tested, it is determined that for the abnormal sound type of useful signal fragment.

Based on identical technology design, the embodiment of the present invention also provides for a kind of device that can be used for performing above-mentioned abnormal sound detection method, and Fig. 5 illustrates a kind of typical case's abnormal sound detecting device that the embodiment of the present invention provides, as it is shown in figure 5, this device may include that

Collecting unit 51, for collected sound signal, carries out pretreatment to described acoustical signal, it is determined that the useful signal fragment of described acoustical signal；Obtain the sound spectrogram of described useful signal fragment；

Determining unit 52, for determining the identification eigenmatrix of abnormal sound sound spectrogram, described identification eigenmatrix to be tested is for representing the distribution situation of acoustical signal intensity of sound on time-frequency domain；

Computing unit 53, for calculating the similarity of described identification eigenmatrix to be tested and each standard identification eigenmatrix in abnormal sound model library, determines the abnormal sound type in described useful signal fragment according to result of calculation.

Optionally, described collecting unit 51 specifically for: the oscillogram of described useful signal fragment is converted to the sound spectrogram being made up of the frequency of described useful signal fragment, time and three dimensional information of intensity of sound；Described determine that unit 52 is specifically for obtaining the identification eigenmatrix of intensity of sound distribution situation for characterizing described sound spectrogram；Described computing unit 53 specifically for: calculate the similarity of each standard identification eigenmatrix in described identification eigenmatrix to be tested and abnormal sound model library, by the abnormal sound model maximum with described identification eigenmatrix similarity to be tested, it is determined that for the abnormal sound type of described useful signal fragment.

Optionally, described collecting unit 51 specifically for: described useful signal fragment is divided into several frames according to default sample rate；Any frame being carried out Short Time Fourier Transform, obtains the spectrum information of this frame, described spectrum information is for representing the relation between the frequency of this frame and intensity of sound；Connecting the spectrum information of all frames, obtain the sound spectrogram of described useful signal fragment, described sound spectrogram is made up of several points, and (x, y) is used for representing that this point is in the x moment coordinate of any point, intensity of sound corresponding in y frequency.

Optionally, described determine that unit 52 is specifically for frame at each point in described sound spectrogram, frequency and intensity of sound, it is determined that sound spectrogram identification eigenmatrix；For each frame, it is determined that intensity of sound value front K frequency values from big to small in described sound spectrogram identification eigenmatrix；According to the relative position in time domain of each frequency values in described K frequency values, generate the identification eigenmatrix to be tested of intensity of sound distribution situation for characterizing described sound spectrogram.

Optionally, described computing unit 53 specifically for: for the either standard identification eigenmatrix in described abnormal sound model library, element by the last string of the first row of the element of the first row first row of described standard identification eigenmatrix to described standard identification eigenmatrix, determine respectively and in described identification eigenmatrix to be tested difference between the value of the first row first row less than each element in the described standard identification eigenmatrix of predetermined threshold value, carry out each element present position respectively as starting point setting window operation, respectively obtain the window matrix that this window covers, this window is identical with the dimension of identification eigenmatrix to be tested；Calculate business's matrix of each window matrix and identification eigenmatrix to be tested, each row element in described business's matrix is multiplied by default weighted value respectively, obtain the similarity of each window matrix and described identification eigenmatrix to be tested；Whole similarity according to each standard identification eigenmatrix in calculated described identification eigenmatrix to be tested and abnormal sound model library, is defined as the similarity of described standard identification eigenmatrix and described identification eigenmatrix to be tested by similarity maximum for the value in whole similarities.

Optionally, described collecting unit 51 specifically for: described acoustical signal is carried out preemphasis process, obtains the acoustical signal after preemphasis；Acoustical signal after preemphasis is carried out framing windowing process, useful signal fragment will be defined as in short time period self-energy value more than one section of acoustical signal of predetermined threshold value.

The present invention is that flow chart and/or block diagram with reference to method according to embodiments of the present invention, equipment (system) and computer program describe.It should be understood that can by the combination of the flow process in each flow process in computer program instructions flowchart and/or block diagram and/or square frame and flow chart and/or block diagram and/or square frame.These computer program instructions can be provided to the processor of general purpose computer, special-purpose computer, Embedded Processor or other programmable data processing device so that the function that the instruction performed by the processor of this computer or other programmable data processing device can be specified in a flow process in flowchart or multiple flow process and/or one square frame of block diagram or multiple square frame.

These computer program instructions may be alternatively stored in and can guide in the computer-readable memory that computer or other programmable data processing device work in a specific way, the instruction making to be stored in this computer-readable memory produces to include the manufacture of command device, and this command device realizes the function specified in one flow process of flow chart or multiple flow process and/or one square frame of block diagram or multiple square frame.

These computer program instructions also can be loaded in computer or other programmable data processing device, make on computer or other programmable devices, to perform sequence of operations step to produce computer implemented process, thus the instruction performed on computer or other programmable devices provides for realizing the step of function specified in a flow process of flow chart or a square frame of multiple flow process and/or block diagram or multiple square frame.

Although preferred embodiments of the present invention have been described, but those skilled in the art are once know basic creative concept, then these embodiments can be made other change and amendment.So, claims are intended to be construed to include preferred embodiment and fall into all changes and the amendment of the scope of the invention.

Obviously, the present invention can be carried out various change and modification without deviating from the spirit and scope of the present invention by those skilled in the art.So, if these amendments of the present invention and modification belong within the scope of the claims in the present invention and equivalent technologies thereof, then the present invention is also intended to comprise these change and modification.

Claims

1. a typical abnormal sound detection method, it is characterised in that the method includes:

2. the method for claim 1, it is characterised in that the sound spectrogram of described acquisition described useful signal fragment also determines identification eigenmatrix to be tested according to described sound spectrogram, specifically includes:

The oscillogram of described useful signal fragment is converted to the sound spectrogram being made up of the frequency of described useful signal fragment, time and three dimensional information of intensity of sound；

Determine the identification eigenmatrix to be tested of intensity of sound distribution situation for characterizing described sound spectrogram；

The described identification eigenmatrix to be tested of described calculating and the similarity of each standard identification eigenmatrix in abnormal sound model library, determine the abnormal sound type in described useful signal fragment according to result of calculation, specifically include:

Calculate the similarity of described identification eigenmatrix to be tested and each standard identification eigenmatrix in abnormal sound model library, by the abnormal sound model maximum with described identification eigenmatrix similarity to be tested, it is determined that for the abnormal sound type of described useful signal fragment.

3. method as claimed in claim 2, it is characterised in that the described oscillogram by described useful signal fragment is converted to sound spectrogram, specifically includes:

Described useful signal fragment windowing is divided into several frames；

Each frame being carried out Short Time Fourier Transform, obtains the spectrum information of this frame, described spectrum information is for representing the relation between the frequency of this frame and intensity of sound；

Connecting the spectrum information of all frames, obtain the sound spectrogram of described useful signal fragment, described sound spectrogram is made up of several points, and (x, y) is used for representing that this point is in the x moment coordinate of any point, intensity of sound corresponding in y frequency.

4. method as claimed in claim 3, it is characterised in that described determine identification eigenmatrix to be tested according to described sound spectrogram, specifically includes:

Frame at each point in described sound spectrogram, frequency and intensity of sound, it is determined that sound spectrogram matrix；

For each frame, it is determined that intensity of sound value front K frequency values from big to small in described sound spectrogram matrix；

According to the relative position on frequency domain of each frequency values in described K frequency values, generate the identification eigenmatrix to be tested of intensity of sound distribution situation for characterizing described sound spectrogram.

5. the method as according to any one of claim 1-4, it is characterised in that the described identification eigenmatrix to be tested of described calculating and the similarity of each standard identification eigenmatrix in abnormal sound model library, specifically includes:

For the either standard identification eigenmatrix in described abnormal sound model library, element by the last string of the first row of the element of the first row first row of described standard identification eigenmatrix to described standard identification eigenmatrix, determine respectively and in described identification eigenmatrix to be tested difference between the value of the first row first row less than each element in the described standard identification eigenmatrix of predetermined threshold value, carry out each element present position respectively as starting point setting window operation, respectively obtain the window matrix that this window covers, this window is identical with the dimension of identification eigenmatrix to be tested；Calculate business's matrix of each window matrix and identification eigenmatrix to be tested, each row element in described business's matrix is multiplied by default weighted value respectively, obtain the similarity of each window matrix and described identification eigenmatrix to be tested；

Whole similarity according to each standard identification eigenmatrix in calculated described identification eigenmatrix to be tested and abnormal sound model library, is defined as the similarity of described standard identification eigenmatrix and described identification eigenmatrix to be tested by similarity maximum for the value in whole similarities.

6. the method as according to any one of claim 1-5, it is characterised in that described described acoustical signal is carried out pretreatment, it is determined that the useful signal fragment of described acoustical signal, specifically includes:

Described acoustical signal is carried out preemphasis process, obtains the acoustical signal after preemphasis；

Acoustical signal after preemphasis is carried out framing windowing process, useful signal fragment will be defined as in short time period self-energy value more than one section of acoustical signal of predetermined threshold value.

7. a typical abnormal sound detecting device, it is characterised in that this device includes:

Determining unit, for determining the identification eigenmatrix to be tested of abnormal sound sound spectrogram, described identification eigenmatrix to be tested is for representing the distribution situation of acoustical signal intensity of sound on time-frequency domain；

Computing unit, for calculating the described to be tested similarity identifying eigenmatrix and each standard identification eigenmatrix in abnormal sound model library, determines the abnormal sound type in described useful signal fragment according to result of calculation.

8. device as claimed in claim 7, it is characterised in that described collecting unit specifically for: the oscillogram of described useful signal fragment is converted to the sound spectrogram being made up of the frequency of described useful signal fragment, time and three dimensional information of intensity of sound；

Described determine unit specifically for: determine the identification eigenmatrix of intensity of sound distribution situation for characterizing described sound spectrogram；

Described computing unit specifically for: calculate the similarity of each standard identification eigenmatrix in described identification eigenmatrix to be tested and abnormal sound model library, by the abnormal sound model maximum with described identification eigenmatrix similarity to be tested, it is determined that for the abnormal sound type of described useful signal fragment.

9. device as claimed in claim 8, it is characterised in that described collecting unit specifically for:

Described useful signal fragment windowing is divided into several frames；

Each frame being carried out Short Time Fourier Transform, obtains the spectrum information of this frame, described spectrum information is for representing the relation between the frequency of this frame and intensity of sound；Connecting the spectrum information of all frames, obtain the sound spectrogram of described useful signal fragment, described sound spectrogram is made up of several points, and (x, y) is used for representing that this point is in the x moment coordinate of any point, intensity of sound corresponding in y frequency.

10. device as claimed in claim 9, it is characterised in that described determine unit specifically for:

For each frame, it is determined that intensity of sound value front K frequency values from big to small in described sound spectrogram matrix；According to the relative position on frequency domain of each frequency values in described K frequency values, generate the identification eigenmatrix to be tested of intensity of sound distribution situation for characterizing described sound spectrogram.

11. the device as according to any one of claim 7-10, it is characterised in that described computing unit specifically for:

12. the device as according to any one of claim 7-11, it is characterised in that described collecting unit specifically for:

Described acoustical signal is carried out preemphasis process, obtains the acoustical signal after preemphasis；Acoustical signal after preemphasis is carried out framing windowing process, useful signal fragment will be defined as in short time period self-energy value more than one section of acoustical signal of predetermined threshold value.