CN109920448A

CN109920448A - System and method for recognizing special sounds in the traffic environment of an autonomous vehicle

Info

Publication number: CN109920448A
Application number: CN201910141320.1A
Authority: CN
Inventors: 王若平; 李仁仁; 房宇; 王雪钊; 李文睿
Original assignee: Jiangsu University
Current assignee: Jiangsu University
Priority date: 2019-02-26
Filing date: 2019-02-26
Publication date: 2019-06-21

Abstract

The present invention provides a system and method for recognizing special sounds in the traffic environment of an autonomous vehicle. The system comprises an information acquisition unit, an information processing unit, a sound recognition unit, a storage unit and a control unit. The information acquisition unit captures the real-time ambient sound signal in the traffic environment and passes it to the information processing unit. The information processing unit performs abnormal-sound detection on the captured signal; once a signal suspected of containing a target special sound is detected, it extracts features from that signal and passes them to the sound recognition unit. The sound recognition unit identifies the sound type from the extracted features and sends the result to the on-board ECU, giving the vehicle a perception of its acoustic traffic environment. By detecting special sounds and identifying their type, the invention fills the information blind spots of conventional cameras and radar and provides the autonomous vehicle with more comprehensive traffic information.

Description

A system and method for recognizing special sounds in the traffic environment of an autonomous vehicle

Technical field

The invention belongs to the field of environment perception for autonomous driving, and in particular relates to a system and method for recognizing special sounds in the traffic environment of an autonomous vehicle.

Background art

In recent years, with the development of artificial intelligence and autonomous vehicles, the safety and reliability of autonomous driving have attracted growing attention. An autonomous driving system consists of three main parts: perception, decision-making and control. The environment perception system is the foundation of autonomous driving, and multi-sensor fusion is the inevitable trend in environment perception. Existing on-board sensors mainly include lidar, millimeter-wave radar and cameras. In urban traffic, however, obstacles such as tall buildings block the line of sight, so traffic incidents that cannot be "seen" are difficult to detect.

Summary of the invention

To solve the above problem effectively, the invention proposes a system and method for recognizing special sounds in the traffic environment of an autonomous vehicle, which detects special sounds, identifies their type, and thereby provides the autonomous vehicle with more comprehensive traffic information. In this patent, "special sounds" mainly means sounds that can indicate a traffic event, such as sirens, horns, collision sounds and screams.

To achieve the above object, the invention adopts the following technical solution:

A system for recognizing special sounds in the traffic environment of an autonomous vehicle, characterized in that it comprises an information acquisition unit, an information processing unit, a sound recognition unit, a storage unit and a control unit.

The information acquisition unit acquires the traffic-environment sound signal, converts it into a digital signal, and transmits it to the information processing unit.

The input of the information processing unit is connected to the information acquisition unit and its output is connected to the sound recognition unit. It processes the incoming digital traffic-environment sound signal, detects whether that signal contains an abnormal sound signal, extracts the characteristic parameters of the abnormal sound segment, and transmits them to the sound recognition unit.

The output of the sound recognition unit is connected to the vehicle ECU. The unit identifies the motion state of the sound source and which kind of special sound signal is present, and sends the recognition result to the vehicle ECU so that the vehicle can take appropriate avoidance measures. The storage unit comprises a sound-state storage area and a sound-characteristic-parameter storage area, which store the motion states and the characteristic parameters of sample sounds, respectively.

The sound recognition unit and the storage unit are each connected to the control unit, and the control unit can call the data held in the storage unit to train and update the recognition model.

Further, the storage unit can be updated automatically with sound characteristic parameters obtained from the Internet, or can obtain them from a removable storage disk, in order to update the recognition model.

Further, the information acquisition unit is a microphone array mounted on the vehicle roof.

The recognition method of the system for recognizing special sounds in the traffic environment of an autonomous vehicle is characterized in that it comprises the following steps:

S1: the information acquisition unit captures the ambient sound on the road, converts it into a digital signal, and sends it to the signal processing unit.

S2: the signal processing unit pre-processes the digital ambient-sound stream by framing and windowing, then performs endpoint detection on the digital signal to determine whether an abnormal sound signal is present; if so, the abnormal sound signal is isolated using blind source separation.

S3: the sound processing unit extracts the time-frequency-domain features and the cepstral-domain features of the abnormal sound signal.

S4: the sound recognition unit performs recognition using two-stage classification:

First-stage recognition: distinguish mechanical sounds from human voices, and identify the motion state of the sound source.

Second-stage recognition: identify which kind of special sound signal is present.

Further, in S1, a microphone array of n microphones located on the vehicle roof captures airborne sound signals in the traffic environment; when the system is switched on, the microphone array starts acquiring airborne sound signals from the traffic environment in real time.

Further, the endpoint detection in S2 uses the autocorrelation function method: after the signal has been divided into frames, the short-time autocorrelation function of each frame is computed with formula (1), and the difference between the autocorrelation functions of noise and of special sounds is then used to detect the special sound signal:

$$R_i(k) = \sum_{m=1}^{L-k} y_i(m)\, y_i(m+k) \tag{1}$$

In the formula: R is the autocorrelation function of the signal; i is the frame index, denoting the i-th frame; k is the lag; L is the frame length; y is the signal amplitude; m is the sample index within the frame.
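The short-time autocorrelation used for endpoint detection can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patent's implementation: the function names are hypothetical, indexing is 0-based, and the use of the normalized peak `R_i(k)/R_i(0)` as a measure of how periodic a frame is is an assumption, since the patent does not say which property of the autocorrelation it compares between noise and special sounds.

```python
def short_time_autocorr(frame, k):
    """R_i(k) = sum over m of y_i(m) * y_i(m + k) for one frame (0-based m)."""
    L = len(frame)
    return sum(frame[m] * frame[m + k] for m in range(L - k))


def normalized_peak(frame, min_lag=1):
    """Largest autocorrelation at a non-zero lag, divided by R_i(0).

    Assumption: values near 1 suggest a strongly periodic frame (tonal sound),
    values near 0 suggest an impulse-like or noise-like frame.
    """
    r0 = short_time_autocorr(frame, 0)
    lags = range(min_lag, len(frame) // 2)
    if r0 == 0 or not lags:
        return 0.0
    return max(short_time_autocorr(frame, k) for k in lags) / r0
```

A frame would then be flagged as a candidate special sound when `normalized_peak(frame)` exceeds a threshold chosen by experiment; that threshold is not given in the patent.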

Further, the method for the blind source separating is: the process of sound mix is interpreted as m source sound by mixing System receives n signal by n microphone, and the hybrid matrix A of sound mix system is found out using its corresponding relationship, recycles Algorithm finds out separation system matrix B, so as to find out the abnormal sound signal isolated；

During the blind source separating, m source sound S_i(t) after mixing, n signal X is received by n microphone_j (t), X_j(t) and S_i(t) shown in relationship such as formula (2),

In above formula: m is the number of source sound；N is the number of microphone；T is the time；X_j(t) it is received for microphone array The voice signal arrived；V_jIt (t) is the engine noise and road noise in environment；S_iIt (t) is source voice signal；a_jiFor sound mix system The hybrid parameter of system；

The voice signal Y isolated_i(t) and X_i(t) shown in relationship such as formula (3):

In above formula: m is the number of source sound；N is the number of microphone；T is the time；X_j(t) it is received for microphone array The voice signal arrived；Y_i(t) the source voice signal to isolate；b_ijFor the separation parameter of sound separation system.

The hybrid parameter a of hybrid system is found out using fast independent component analysis (FastICA) algorithm_jiWith separation system Separation parameter b_ij, source voice signal Y is found out using the corresponding relationship of formula (2) and formula (3)_i(t), to complete voice signal Separation.
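The roles of the mixing parameters a_ji and the separation parameters b_ij can be illustrated with a two-source, two-microphone toy example. This sketch only demonstrates formulas (2) and (3): it assumes a known mixing matrix, zero noise V_j(t), and takes B as the exact inverse of A, whereas the patent estimates the parameters blindly with FastICA. All function names and numbers are hypothetical.

```python
def mix(A, S, V):
    """Formula (2): X_j(t) = sum_i a_ji * S_i(t) + V_j(t)."""
    T = len(S[0])
    return [[sum(a * s[t] for a, s in zip(row, S)) + v[t] for t in range(T)]
            for row, v in zip(A, V)]


def separate(B, X):
    """Formula (3): Y_i(t) = sum_j b_ij * X_j(t)."""
    T = len(X[0])
    return [[sum(b * x[t] for b, x in zip(row, X)) for t in range(T)] for row in B]


def invert_2x2(A):
    """Exact inverse of a 2x2 matrix; stands in for the FastICA estimate of B."""
    (a, b), (c, d) = A
    det = a * d - b * c
    if det == 0:
        raise ValueError("mixing matrix is singular")
    return [[d / det, -b / det], [-c / det, a / det]]


# Two made-up source signals, four samples each.
S = [[1.0, 0.0, -1.0, 0.0], [0.5, 0.5, -0.5, -0.5]]
A = [[1.0, 0.5], [0.3, 1.0]]        # hypothetical mixing parameters a_ji
V = [[0.0] * 4, [0.0] * 4]          # noise set to zero for the illustration
X = mix(A, S, V)                    # what the two microphones would record
Y = separate(invert_2x2(A), X)      # recovers S up to rounding error
```

In a real blind setting A is unknown, and the sources are recovered only up to ordering and scale.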

Further, the time-frequency-domain characteristic parameters are extracted as follows: the time-frequency spectrum of the abnormal sound is computed first; the time-frequency signal is then tracked using the maximum power spectral density at each time point, and the number of peaks per unit time is obtained as a characteristic parameter; the tracked signal is then smoothed, and the time-frequency derivative is obtained as a further characteristic parameter.

The cepstral-domain features are extracted as follows:

(1) The abnormal sound signal is denoted S(t). After filtering, pre-emphasis, framing and windowing, the special sound signal S(t) of each frame is obtained. A discrete Fourier transform (DFT) is applied to S(t) to obtain the frequency-domain signal X(ω), converting the time domain into the frequency domain. X(ω) is expressed as:

$$X(\omega) = \sum_{t=0}^{N-1} x(t)\, e^{-j 2\pi \omega t / N}, \qquad 0 \le \omega \le N-1 \tag{4}$$

In formula (4): t is time; ω is frequency; X(ω) is the frequency-domain signal; x(t) is the time-domain signal after framing; N is the length of the Fourier transform interval.

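(2) The square of X(ω), i.e. the energy spectrum, is computed. The energy spectrum is filtered by a filter bank made up of M filters; the center frequency of the m-th filter is f(m), m = 1, 2, ..., M. The transfer function of the m-th triangular filter is:

$$H_m(\omega) = \begin{cases} 0, & \omega < f(m-1) \\ \dfrac{\omega - f(m-1)}{f(m) - f(m-1)}, & f(m-1) \le \omega \le f(m) \\ \dfrac{f(m+1) - \omega}{f(m+1) - f(m)}, & f(m) < \omega \le f(m+1) \\ 0, & \omega > f(m+1) \end{cases} \tag{5}$$

In formula (5), H_m(ω) is the transfer function of the m-th triangular filter and f(m) is its center frequency.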

(3) The logarithm of the energy output by each filter is computed and denoted S(m):

$$S(m) = \ln\left( \sum_{\omega=0}^{N-1} |X(\omega)|^{2}\, H_m(\omega) \right), \qquad 1 \le m \le M \tag{6}$$

In formula (6): S(m) is the log energy; M is the number of filters in the filter bank; X(ω) is the frequency-domain signal; H_m(ω) is the triangular filter transfer function.

(4) The mel-frequency cepstral coefficients (MFCC) are then obtained by a discrete cosine transform (DCT):

$$C(n) = \sum_{m=1}^{M} S(m) \cos\left( \frac{\pi n (m - 0.5)}{M} \right), \qquad n = 1, 2, \ldots, L \tag{7}$$

In formula (7): C(n) is the mel-frequency cepstral coefficient; S(m) is the log energy; M is the number of filters in the filter bank; L is the dimension of the mel-frequency cepstral coefficients.

Further, the dimension of the mel-frequency cepstral coefficients is taken as L = 12.

Further, the number M of filters in the filter bank is close to the number of critical bands, with M taken as 22 to 26; the filters are triangular filters.
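Steps (1) to (4) can be sketched for a single frame as follows. This is an illustration under stated assumptions rather than the patent's code: the mel-scale conversion `2595 * log10(1 + f / 700)` and the placement of filter edges uniformly on the mel scale from 0 Hz to half the sample rate are common conventions that the patent does not spell out, the small constant `eps` is added only to avoid taking the logarithm of zero, and filtering, pre-emphasis and windowing are assumed to have been applied already. Function names are hypothetical.

```python
import cmath
import math


def hz_to_mel(f):
    return 2595.0 * math.log10(1.0 + f / 700.0)


def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def power_spectrum(frame):
    """Step (1) and first half of step (2): |X(w)|^2 for bins 0..N/2 via a direct DFT."""
    N = len(frame)
    spectrum = []
    for w in range(N // 2 + 1):
        X = sum(frame[t] * cmath.exp(-2j * math.pi * w * t / N) for t in range(N))
        spectrum.append(abs(X) ** 2)
    return spectrum


def triangular_filterbank(M, N, sample_rate):
    """Step (2): M triangular filters H_m over DFT bins 0..N/2 (assumed mel spacing)."""
    top = hz_to_mel(sample_rate / 2.0)
    # M + 2 edge points f(0)..f(M+1), expressed as fractional DFT bin positions.
    edges = [mel_to_hz(top * i / (M + 1)) * N / sample_rate for i in range(M + 2)]
    bank = []
    for m in range(1, M + 1):
        low, centre, high = edges[m - 1], edges[m], edges[m + 1]
        row = []
        for w in range(N // 2 + 1):
            if low <= w <= centre:
                row.append((w - low) / (centre - low))
            elif centre < w <= high:
                row.append((high - w) / (high - centre))
            else:
                row.append(0.0)
        bank.append(row)
    return bank


def mfcc(frame, sample_rate, M=24, L=12):
    """Steps (3) and (4): log filter energies S(m), then DCT to get C(1)..C(L)."""
    power = power_spectrum(frame)
    bank = triangular_filterbank(M, len(frame), sample_rate)
    eps = 1e-12  # assumption: guards against log(0) for empty bands
    S = [math.log(sum(p * h for p, h in zip(power, row)) + eps) for row in bank]
    return [sum(S[m - 1] * math.cos(math.pi * n * (m - 0.5) / M) for m in range(1, M + 1))
            for n in range(1, L + 1)]
```

With the defaults `M=24` and `L=12`, which fall inside the ranges the text gives, each frame yields a 12-element feature vector. The direct DFT is quadratic in the frame length and is used here only to keep the sketch dependency-free.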

Further, the sound recognition process is as follows:

(1) First-stage recognition, based on the extracted time-frequency features: mechanical sounds and human voices are preliminarily distinguished by frequency band range, number of peaks per unit time and zero-crossing rate per unit time. In addition, the motion state of the sound source is identified from the number of peaks per unit time and the zero-crossing rate per unit time in the extracted time-frequency domain: if both gradually increase, the source is approaching; if both gradually decrease, the source is moving away; if both remain essentially unchanged, the source is relatively stationary.

(2) Second-stage recognition: a recognition model is trained on the MFCC parameters of standard special sounds obtained by experiment, and the preliminarily classified special sound signal is then fed into the trained SVM recognition model to complete sound recognition.
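The motion-state rule in the first stage (both features rising means approaching, both falling means moving away, both steady means relatively stationary) can be sketched as below. The patent does not say how "gradually" is measured, so the relative change between the first and last observation and the `tolerance` value are assumptions, as are the function names and the extra "undetermined" outcome for mixed trends.

```python
def trend(values, tolerance=0.05):
    """Classify a sequence as 'rising', 'falling' or 'flat' by its relative change."""
    first, last = values[0], values[-1]
    scale = abs(first) if first != 0 else 1.0
    change = (last - first) / scale
    if change > tolerance:
        return "rising"
    if change < -tolerance:
        return "falling"
    return "flat"


def motion_state(peak_counts, zero_crossing_rates, tolerance=0.05):
    """Map the trends of the two features to a motion state of the sound source."""
    trends = (trend(peak_counts, tolerance), trend(zero_crossing_rates, tolerance))
    if trends == ("rising", "rising"):
        return "approaching"
    if trends == ("falling", "falling"):
        return "receding"
    if trends == ("flat", "flat"):
        return "stationary"
    return "undetermined"  # assumption: the patent does not cover mixed trends
```

For example, `motion_state([10, 11, 13], [0.10, 0.12, 0.15])` reports an approaching source because both sequences grow by more than the tolerance.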

Specifically, the SVM recognition model algorithm is as follows:

An SVM classification model is established. The SVM classifier is trained on the characteristic parameters C(n) of the sample special sounds held in the storage unit, and the characteristic parameters C'(n) of the abnormal sound obtained in real time are then used as input for classification. The nonlinear SVM algorithm is as follows:

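The training samples are the pairs $(x_i, d_i)$, where $x_i \in R^{l}$ is the i-th input pattern and $d_i \in \{+1, -1\}$ is its corresponding desired output. A nonlinear mapping $\varphi(x): R^{l} \to R^{N}$ maps the input vectors of the original space into an N-dimensional feature space, in which the optimal hyperplane is:

$$W^{T}\varphi(x) + b = 0$$

In the formula, W is the normal of the hyperplane, an adjustable weight vector, and b is the bias, which determines the optimal position relative to the origin. Under certain conditions, the above problem can be converted into the problem of minimizing ||W||, which can be expressed in the following form:

$$\min_{W,\,b,\,\xi}\ \frac{1}{2}\|W\|^{2} + C\sum_{i}\xi_i \qquad \text{s.t.}\quad d_i\left(W^{T}\varphi(x_i) + b\right) \ge 1 - \xi_i,\quad \xi_i \ge 0$$

In the formula, C is a constant that affects decision accuracy, and ξ is the slack variable. The above optimization problem is solved using Lagrange multipliers, with the expression:

$$L(W, b, \xi, \alpha, \beta) = \frac{1}{2}\|W\|^{2} + C\sum_{i}\xi_i - \sum_{i}\alpha_i\left[d_i\left(W^{T}\varphi(x_i) + b\right) - 1 + \xi_i\right] - \sum_{i}\beta_i\xi_i$$

In the formula, α_i and β_i are non-negative Lagrange coefficients. Setting:

$$\frac{\partial L}{\partial W} = 0,\qquad \frac{\partial L}{\partial b} = 0,\qquad \frac{\partial L}{\partial \xi_i} = 0$$

gives:

$$W = \sum_{i}\alpha_i d_i \varphi(x_i),\qquad \sum_{i}\alpha_i d_i = 0,\qquad C - \alpha_i - \beta_i = 0$$

To evaluate the dot product φ(x_i)^T·φ(x_j) in the high-dimensional feature space, the nonlinear mapping need not be known explicitly: it suffices to find a kernel function on the original space that satisfies the Mercer condition, such that K(x_i, x_j) = φ(x_i)^T·φ(x_j). The problem can therefore be restated as:

$$\max_{\alpha}\ \sum_{i}\alpha_i - \frac{1}{2}\sum_{i}\sum_{j}\alpha_i\alpha_j d_i d_j K(x_i, x_j) \qquad \text{s.t.}\quad \sum_{i}\alpha_i d_i = 0,\quad 0 \le \alpha_i \le C$$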

Here the Gaussian kernel function is chosen as the kernel function:

$$K(x_i, x_j) = \exp\left(-\frac{\|x_i - x_j\|^{2}}{2\sigma^{2}}\right)$$

In the formula, σ is the bandwidth, which controls the radial range of influence.

After training on the sample data, the classifier formed by the optimal hyperplane and the associated parameters classifies according to the following formula:

$$f(x) = \operatorname{sgn}\left(\sum_{i}\alpha_i d_i K(x_i, x) + b\right)$$

This completes the classification and recognition of special sounds.
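Once training has produced the multipliers α_i, the bias b and the support vectors, classifying a new feature vector only requires evaluating the kernel against each support vector. The sketch below shows that decision step with a Gaussian kernel of the form exp(-||x - z||² / (2σ²)). It is a minimal illustration: the support vectors, multipliers and bias in the usage example are made-up numbers rather than trained values, returning +1 when the score is exactly zero is an assumption, and the patent does not describe how the two-class decision is extended to several special sound types.

```python
import math


def gaussian_kernel(x, z, sigma):
    """K(x, z) = exp(-||x - z||^2 / (2 * sigma^2))."""
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, z))
    return math.exp(-squared_distance / (2.0 * sigma ** 2))


def svm_decide(x, support_vectors, alphas, labels, bias, sigma):
    """Sign of sum_i alpha_i * d_i * K(x_i, x) + b; returns +1 or -1."""
    score = bias + sum(alpha * d * gaussian_kernel(sv, x, sigma)
                       for sv, alpha, d in zip(support_vectors, alphas, labels))
    return 1 if score >= 0 else -1


# Hypothetical two-dimensional example with one support vector per class.
support_vectors = [(0.0, 0.0), (2.0, 2.0)]
alphas = [1.0, 1.0]
labels = [+1, -1]
near_first = svm_decide((0.1, 0.1), support_vectors, alphas, labels, bias=0.0, sigma=1.0)
near_second = svm_decide((1.9, 2.0), support_vectors, alphas, labels, bias=0.0, sigma=1.0)
```

In the system described here the input `x` would be the 12-dimensional MFCC vector C'(n) of the detected sound.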

The invention helps an autonomous vehicle obtain more comprehensive road traffic information and fills the information blind spots of conventional cameras and radar. Using the time-frequency-domain characteristic parameters, it can identify not only the sound type but also the motion state of the sound source, for example distinguishing a siren that is relatively stationary from one that is in relative motion. The invention can resolve overlap among a small number of special sounds. It recognizes the sound signal in two stages, using the time-frequency features and the cepstral features of the sound, which improves recognition accuracy. In addition, the two-stage classification reduces the dimension of the characteristic parameters while maintaining a given recognition accuracy, which improves recognition speed.

Brief description of the drawings

Fig. 1 is a structural schematic diagram of the system for recognizing special sounds in the traffic environment of an autonomous vehicle;

Fig. 2 is a flow chart of special sound recognition;

Fig. 3 is a schematic diagram of blind source separation.

Detailed description of the embodiments

The invention is described in detail below with reference to the drawings and specific embodiments. These embodiments do not limit the invention; structural, methodological or functional modifications made by those of ordinary skill in the art on the basis of these embodiments all fall within the scope of protection of the invention.

Fig. 1 is a schematic diagram of the system of the invention for recognizing special sounds in the traffic environment of an autonomous vehicle. The system mainly comprises an information acquisition unit, an information processing unit, a sound recognition unit, a storage unit and a control unit.

The information acquisition unit is a microphone array mounted on the vehicle roof. It acquires traffic-environment sound signals, including sirens, horns, screams and other sounds, converts them into digital signals, and transmits them to the information processing unit.

The input of the information processing unit is connected to the information acquisition unit and its output is connected to the sound recognition unit. The information processing unit receives the real-time ambient sound signal, processes the incoming digital traffic-environment sound signal, detects whether it contains an abnormal sound signal, extracts the characteristic parameters of the abnormal sound segment, and transmits them to the sound recognition unit.

The sound recognition unit identifies the motion state of the sound source and which kind of special sound signal is present. Its output is connected to the vehicle ECU; it sends the recognition result to the vehicle ECU, providing information about the acoustic traffic environment so that the vehicle can take appropriate avoidance measures.

The storage unit comprises a sound-state storage area and a sound-characteristic-parameter storage area, which store sound motion states and sound characteristic parameters, respectively. The sound recognition unit and the storage unit are each connected to the control unit, and the control unit can call the data held in the storage unit to train and update the recognition model.

The storage unit can receive characteristic parameters of standard target special sounds, obtained by experiment, from the Internet or from a removable hard disk, and supply them to the sound recognition unit so that the recognition model can be updated, which improves recognition accuracy and increases the number of sounds that can be recognized.

The system works as follows: when the vehicle is started and the vehicle control system is switched on, the microphone array captures the real-time ambient sound signal in the traffic environment and passes it to the information processing unit. The information processing unit performs special-sound detection on the captured signal; once a signal suspected of containing a target special sound is detected, it extracts features from that signal and passes them to the sound recognition unit. The sound recognition unit identifies the sound type from the extracted features and sends the result to the on-board ECU, giving the vehicle a perception of its acoustic traffic environment.
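The data flow just described (detect, extract features, recognize, report to the ECU) can be summarized as a small pipeline skeleton. This is only a structural sketch: the `RecognitionResult` type, the `process_block` function and the idea of passing each stage in as a callable are hypothetical choices made for illustration, and the stages themselves are left to the caller.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Sequence


@dataclass
class RecognitionResult:
    sound_type: str    # e.g. "siren" or "horn"
    motion_state: str  # e.g. "approaching"


def process_block(
    samples: Sequence[float],
    detect: Callable[[Sequence[float]], Optional[Sequence[float]]],
    extract: Callable[[Sequence[float]], List[float]],
    recognize: Callable[[List[float]], RecognitionResult],
    send_to_ecu: Callable[[RecognitionResult], None],
) -> Optional[RecognitionResult]:
    """Run one block of digitized sound through the processing chain.

    detect returns the suspected special-sound segment, or None when the block
    contains nothing of interest, in which case no further work is done.
    """
    segment = detect(samples)
    if segment is None:
        return None
    result = recognize(extract(segment))
    send_to_ecu(result)
    return result
```

Keeping detection as a cheap first gate means feature extraction and classification run only on blocks that are suspected of containing a special sound.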

Specifically, the method for recognizing special sounds in the traffic environment of an autonomous vehicle, shown in Fig. 2, comprises the following steps:

S1: the roof-mounted microphone array captures the ambient sound on the road, converts it into a digital signal, and sends it to the signal processing unit.

S2: the signal processing unit frames and windows the digital ambient-sound stream, then performs endpoint detection on the signal to determine whether an abnormal sound signal is present; if so, the abnormal sound signal is isolated using blind source separation.
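Framing and windowing, the first operations in S2, can be sketched as follows. The patent names neither the window type nor the frame length or overlap, so the Hamming window and the `frame_length` and `hop` parameters here are assumptions, and the function names are hypothetical.

```python
import math


def frame_signal(samples, frame_length, hop):
    """Split a sample sequence into overlapping frames; a trailing partial frame is dropped."""
    return [list(samples[start:start + frame_length])
            for start in range(0, len(samples) - frame_length + 1, hop)]


def hamming(frame_length):
    """Hamming window coefficients (assumed window type)."""
    if frame_length == 1:
        return [1.0]
    return [0.54 - 0.46 * math.cos(2.0 * math.pi * n / (frame_length - 1))
            for n in range(frame_length)]


def apply_window(frames, window):
    """Multiply every frame sample by the matching window coefficient."""
    return [[s * w for s, w in zip(frame, window)] for frame in frames]
```

A hop smaller than the frame length gives overlapping frames, which is the usual choice so that events falling on a frame boundary are not attenuated by the window in every frame that contains them.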

The endpoint detection is illustrated here with the autocorrelation function method. After the signal has been divided into frames, the short-time autocorrelation function of each frame is computed with formula (1):

$$R_i(k) = \sum_{m=1}^{L-k} y_i(m)\, y_i(m+k) \tag{1}$$

The difference between the autocorrelation functions of noise and of special sounds is then used to detect the special sound signal. In formula (1): R is the autocorrelation function of the signal; i is the frame index, denoting the i-th frame; k is the lag; L is the frame length; y is the signal amplitude; m is the sample index within the frame.

The energy and duration of the special sound signal are compared with an energy threshold E₀ and a time threshold T₀ obtained by experiment to determine whether special sound signals overlap; if they do, blind source separation is used to recover the overlapping special sound signals.
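The overlap check can be sketched as a pair of threshold comparisons. The patent states only that energy and duration are compared with E₀ and T₀; it does not give the direction of the comparison or how the two tests are combined. The rule below, which flags overlap when both the energy and the duration exceed their thresholds, is therefore an assumption made for illustration, as are the function names.

```python
def segment_energy(samples):
    """Sum of squared amplitudes over the detected segment."""
    return sum(v * v for v in samples)


def needs_separation(samples, sample_rate, energy_threshold, time_threshold):
    """Assumed rule: overlap is suspected when energy > E0 and duration > T0."""
    duration = len(samples) / sample_rate
    return segment_energy(samples) > energy_threshold and duration > time_threshold
```

When the function returns `False`, the segment would go straight to feature extraction without blind source separation.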

The principle of blind source separation is shown in Fig. 3. The mixing process is modeled as m source sounds passing through a mixing system and being received as n signals by n microphones. The mixing matrix A of the mixing system is found from this relationship, and an algorithm is then used to find the separation matrix B, from which the separated abnormal sound signals are obtained. The relationship between the received signals X_j(t) and the source signals S_i(t) is given by formula (2):

$$X_j(t) = \sum_{i=1}^{m} a_{ji}\, S_i(t) + V_j(t), \qquad j = 1, 2, \ldots, n \tag{2}$$

In formula (2): m is the number of source sounds; n is the number of microphones; t is time; X_j(t) is the sound signal received by the microphone array; V_j(t) is the engine noise and road noise in the environment; S_i(t) is the source sound signal; a_ji is the mixing parameter of the sound mixing system.

The relationship between the separated sound signal Y_i(t) and X_j(t) is given by formula (3):

$$Y_i(t) = \sum_{j=1}^{n} b_{ij}\, X_j(t), \qquad i = 1, 2, \ldots, m \tag{3}$$

In formula (3): m is the number of source sounds; n is the number of microphones; t is time; X_j(t) is the sound signal received by the microphone array; Y_i(t) is the separated source sound signal; b_ij is the separation parameter of the sound separation system.

S3: after signal separation, or when it is determined that no special sound signals overlap, the sound processing unit extracts the time-frequency-domain features and the cepstral-domain features of the abnormal sound signal.

The time-frequency-domain characteristic parameters are extracted as follows: the time-frequency spectrum of the abnormal sound is computed first; the time-frequency signal is then tracked using the maximum power spectral density at each time point, and the number of peaks per unit time is obtained as a characteristic parameter; the tracked signal is then smoothed, and the time-frequency derivative is obtained as a further characteristic parameter.

Taking a siren as an example, the time-frequency spectrum of the siren is computed first; the time-frequency signal is then tracked using the maximum power spectral density at each time point, and the characteristic parameters of the time-frequency signal are obtained: the number of peaks per unit time, the frequency band range, the zero-crossing rate per unit time, and so on.
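The tracking step and two of the resulting features can be sketched with the standard library alone. The sketch makes several assumptions that the patent leaves open: the "maximum power spectral density at each time point" is taken to be the DFT bin with the largest power in each frame (excluding the DC bin), a peak is counted as a strict local maximum of the tracked sequence, and the zero-crossing rate is computed on the waveform samples. Function names are hypothetical.

```python
import cmath
import math


def dominant_bin(frame):
    """Index of the DFT bin in 1..N/2 with the largest power for one frame."""
    N = len(frame)
    best_bin, best_power = 0, -1.0
    for w in range(1, N // 2 + 1):
        X = sum(frame[t] * cmath.exp(-2j * math.pi * w * t / N) for t in range(N))
        power = abs(X) ** 2
        if power > best_power:
            best_bin, best_power = w, power
    return best_bin


def track_dominant_frequency(frames, sample_rate):
    """Dominant frequency in Hz for each frame, forming the tracked time-frequency signal."""
    return [dominant_bin(frame) * sample_rate / len(frame) for frame in frames]


def count_peaks(track):
    """Number of strict local maxima in the tracked sequence."""
    return sum(1 for i in range(1, len(track) - 1)
               if track[i - 1] < track[i] > track[i + 1])


def zero_crossing_rate(samples):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(samples, samples[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(samples) - 1)
```

Dividing `count_peaks` of the track by the duration it covers gives the number of peaks per unit time; for a siren whose pitch sweeps up and down, this corresponds to the sweep repetition rate.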

The cepstral-domain parameters extracted are the mel-frequency cepstral coefficients C(n), obtained as follows:

(1) The original target special sound signal, or the signal isolated by blind source separation, i.e. the abnormal sound signal, is denoted S(t). After filtering, pre-emphasis, framing and windowing, the special sound signal S(t) of each frame is obtained. A discrete Fourier transform (DFT) is applied to S(t) to obtain the frequency-domain signal X(ω), converting the time domain into the frequency domain. X(ω) is expressed as:

$$X(\omega) = \sum_{t=0}^{N-1} x(t)\, e^{-j 2\pi \omega t / N}, \qquad 0 \le \omega \le N-1 \tag{4}$$

In formula (4): t is time; ω is frequency; X(ω) is the frequency-domain signal; x(t) is the time-domain signal after framing; N is the length of the Fourier transform interval.

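(2) The square of X(ω), i.e. the energy spectrum, is computed and filtered by a filter bank of M filters. The number of filters is close to the number of critical bands, and the filters are triangular filters. The center frequency of the m-th filter is f(m), m = 1, 2, ..., M, with M usually taken as 22 to 26. The transfer function of the m-th triangular filter is:

$$H_m(\omega) = \begin{cases} 0, & \omega < f(m-1) \\ \dfrac{\omega - f(m-1)}{f(m) - f(m-1)}, & f(m-1) \le \omega \le f(m) \\ \dfrac{f(m+1) - \omega}{f(m+1) - f(m)}, & f(m) < \omega \le f(m+1) \\ 0, & \omega > f(m+1) \end{cases} \tag{5}$$

In formula (5), H_m(ω) is the transfer function of the m-th triangular filter and f(m) is its center frequency.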

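(3) The logarithm of the energy output by each filter is computed and denoted S(m):

$$S(m) = \ln\left( \sum_{\omega=0}^{N-1} |X(\omega)|^{2}\, H_m(\omega) \right), \qquad 1 \le m \le M \tag{6}$$

In formula (6): S(m) is the log energy; M is the number of filters in the filter bank; X(ω) is the frequency-domain signal; H_m(ω) is the triangular filter transfer function.

(4) The mel-frequency cepstral coefficients (MFCC) are then obtained by a discrete cosine transform (DCT):

$$C(n) = \sum_{m=1}^{M} S(m) \cos\left( \frac{\pi n (m - 0.5)}{M} \right), \qquad n = 1, 2, \ldots, L \tag{7}$$

In formula (7): C(n) is the mel-frequency cepstral coefficient; S(m) is the log energy; M is the number of filters in the filter bank; L is the dimension of the mel-frequency cepstral coefficients.

After the MFCC are obtained, the dimension of the mel-frequency cepstral coefficients is chosen for the required accuracy; L = 12 is preferred.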

S4: sound recognition. The recognition process uses two-stage classification: first-stage recognition distinguishes mechanical sounds, human voices and the relative motion state of the sound source; second-stage recognition identifies the specific sound signal.

(1) First-stage recognition, based on the extracted time-frequency features: the frequency band range, the number of peaks per unit time and the zero-crossing rate per unit time allow mechanical sounds and human voices to be preliminarily distinguished. In addition, the number of peaks per unit time and the zero-crossing rate per unit time in the extracted time-frequency domain allow the motion state of the sound source to be identified: if both gradually increase, the source is approaching; if both gradually decrease, the source is moving away; if both remain essentially unchanged, the source is relatively stationary. Distinguishing mechanical sounds from human voices does not require a recognition model such as a hidden Markov model or a support vector machine; it is implemented by a selection algorithm based on threshold comparison.
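A first-stage check of this kind can be a plain comparison of features against stored ranges, with no trained model. The sketch below assumes that each class is described by an allowed range per feature; the feature names and every numeric range are placeholders invented for illustration and would have to be replaced by experimentally determined thresholds.

```python
# Placeholder ranges only; not taken from the patent.
PLACEHOLDER_RANGES = {
    "mechanical": {
        "dominant_hz": (500.0, 2000.0),
        "peaks_per_second": (0.0, 5.0),
        "zero_crossing_rate": (0.00, 0.15),
    },
    "voice": {
        "dominant_hz": (2000.0, 4000.0),
        "peaks_per_second": (5.0, 50.0),
        "zero_crossing_rate": (0.15, 0.60),
    },
}


def first_stage_class(features, ranges=PLACEHOLDER_RANGES):
    """Return the first class whose ranges contain every feature, else 'unknown'."""
    for label, limits in ranges.items():
        if all(low <= features[name] <= high for name, (low, high) in limits.items()):
            return label
    return "unknown"
```

Because the check involves only a handful of comparisons, it is cheap enough to run on every detected segment before the SVM stage.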

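(2) Second-stage recognition: a recognition model is trained on the MFCC parameters of standard special sounds obtained by experiment, and the preliminarily classified special sound signal is then fed into the trained SVM recognition model to complete sound recognition. The SVM recognition model algorithm is as follows:

With W the weight vector, b the bias, ξ_i the slack variables, C the constant, d_i ∈ {+1, −1} the desired outputs and φ the nonlinear mapping, as defined in the summary above, the optimization problem is solved using Lagrange multipliers:

$$L(W, b, \xi, \alpha, \beta) = \frac{1}{2}\|W\|^{2} + C\sum_{i}\xi_i - \sum_{i}\alpha_i\left[d_i\left(W^{T}\varphi(x_i) + b\right) - 1 + \xi_i\right] - \sum_{i}\beta_i\xi_i$$

In the formula, α_i and β_i are non-negative Lagrange coefficients. Setting:

$$\frac{\partial L}{\partial W} = 0,\qquad \frac{\partial L}{\partial b} = 0,\qquad \frac{\partial L}{\partial \xi_i} = 0$$

gives:

$$W = \sum_{i}\alpha_i d_i \varphi(x_i),\qquad \sum_{i}\alpha_i d_i = 0,\qquad C - \alpha_i - \beta_i = 0$$

Here the Gaussian kernel function is chosen as the kernel function:

$$K(x_i, x_j) = \exp\left(-\frac{\|x_i - x_j\|^{2}}{2\sigma^{2}}\right)$$

In the formula, σ is the bandwidth, which controls the radial range of influence.

The trained classifier then classifies according to:

$$f(x) = \operatorname{sgn}\left(\sum_{i}\alpha_i d_i K(x_i, x) + b\right)$$

This completes the classification and recognition of special sounds.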

The control module sends the recognition result to the vehicle ECU, which fuses the sound, image and radar signals to make further decisions and control actions.

The embodiment described is a preferred embodiment of the invention, but the invention is not limited to it. Any obvious improvement, substitution or modification that a person skilled in the art can make without departing from the substance of the invention falls within the scope of protection of the invention.

Claims

1. A system for recognizing special sounds in the traffic environment of an autonomous vehicle, characterized in that it comprises an information acquisition unit, an information processing unit, a sound recognition unit, a storage unit and a control unit,

the information acquisition unit being used to acquire the traffic-environment sound signal, convert the sound signal into a digital signal, and transmit it to the information processing unit;

the input of the information processing unit being connected to the information acquisition unit and its output being connected to the sound recognition unit, the information processing unit being used to process the incoming digital traffic-environment sound signal, detect whether the digital ambient-sound signal contains an abnormal sound signal, extract the characteristic parameters of the abnormal sound segment, and transmit the characteristic parameters to the sound recognition unit;

the sound recognition unit being used to identify the motion state of the sound source and which kind of special sound signal is present;

the storage unit comprising a sound-state storage area and a sound-characteristic-parameter storage area, used to store sound motion states and sound characteristic parameters, respectively;

the sound recognition unit and the storage unit each being connected to the control unit, the control unit being able to call the data held in the storage unit to train and update the recognition model.

2. The system for recognizing special sounds in the traffic environment of an autonomous vehicle according to claim 1, characterized in that the storage unit can be updated automatically with sound characteristic parameters obtained from the Internet, or can obtain them from a removable storage disk, in order to update the recognition model.

3. The system for recognizing special sounds in the traffic environment of an autonomous vehicle according to claim 1, characterized in that the information acquisition unit is a microphone array mounted on the vehicle roof.

4. A recognition method of the system for recognizing special sounds in the traffic environment of an autonomous vehicle according to claim 1, characterized in that it comprises the following steps:

S1: the information acquisition unit captures the ambient sound on the road, converts it into a digital signal, and sends it to the signal processing unit;

S2: the signal processing unit pre-processes the digital ambient-sound stream by framing and windowing, then performs endpoint detection on the digital signal to determine whether an abnormal sound signal is present; if so, the abnormal sound signal is isolated using blind source separation;

5. The method for recognizing special sounds in the traffic environment of an autonomous vehicle according to claim 3, characterized in that the endpoint detection in S2 uses the autocorrelation function method: after the signal has been divided into frames, the short-time autocorrelation function of each frame is computed with formula (1), and the difference between the autocorrelation functions of noise and of special sounds is then used to detect the special sound signal;

$$R_i(k) = \sum_{m=1}^{L-k} y_i(m)\, y_i(m+k) \tag{1}$$

In formula (1): R is the autocorrelation function of the signal; i is the frame index, denoting the i-th frame; k is the lag; L is the frame length; y is the signal amplitude; m is the sample index within the frame.

6. The method for recognizing special sounds in the traffic environment of an autonomous vehicle according to claim 3, characterized in that

the blind source separation method is as follows: the mixing process is modeled as m source sounds passing through a mixing system and being received as n signals by n microphones; the mixing matrix A of the mixing system is found from this relationship, and an algorithm is then used to find the separation matrix B, from which the separated abnormal sound signals are obtained;

in the blind source separation, the m source sounds S_i(t) are mixed and received by the n microphones as n signals X_j(t), the relationship between X_j(t) and S_i(t) being given by formula (2):

$$X_j(t) = \sum_{i=1}^{m} a_{ji}\, S_i(t) + V_j(t), \qquad j = 1, 2, \ldots, n \tag{2}$$

In formula (2): m is the number of source sounds; n is the number of microphones; t is time; X_j(t) is the sound signal received by the microphone array; V_j(t) is the engine noise and road noise in the environment; S_i(t) is the source sound signal; a_ji is the mixing parameter of the sound mixing system;

the relationship between the separated sound signal Y_i(t) and X_j(t) being given by formula (3):

$$Y_i(t) = \sum_{j=1}^{n} b_{ij}\, X_j(t), \qquad i = 1, 2, \ldots, m \tag{3}$$

In formula (3): m is the number of source sounds; n is the number of microphones; t is time; X_j(t) is the sound signal received by the microphone array; Y_i(t) is the separated source sound signal; b_ij is the separation parameter of the sound separation system;

the mixing parameters a_ji of the mixing system and the separation parameters b_ij of the separation system are found using the fast independent component analysis (FastICA) algorithm, and the source sound signals Y_i(t) are obtained from the relationships in formulas (2) and (3), completing the separation of the sound signals.

7. the recognition methods of automatic driving vehicle traffic environment special type sound according to claim 3, which is characterized in that institute The extracting method for stating time and frequency domain characteristics parameter is first to find out the time-frequency spectrum of abnormal sound, then uses the maximum power of each time point Spectrum density tracks time frequency signal, seeks the number of peaks of characteristic parameter unit time；Then the signal that tracking obtains is carried out Smoothing processing finds out characteristic parameter time-frequency derivative；

(1) Let the abnormal sound signal be S(t). After filtering, pre-emphasis, framing and windowing, the special sound signal x(t) of each frame is obtained. A discrete Fourier transform (DFT) of x(t) gives the frequency-domain signal X(ω), converting the time domain into the frequency domain. X(ω) is expressed as:

$$X(\omega) = \sum_{t=0}^{N-1} x(t)\, e^{-j 2\pi \omega t / N}, \quad 0 \le \omega \le N-1 \tag{4}$$

In formula (4): t is time; ω is frequency; X(ω) is the frequency-domain signal; x(t) is the time-domain signal after framing; N is the length of the Fourier transform interval;

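(2) Square X(ω) to obtain the energy spectrum, and filter the energy spectrum with a filter bank made up of M filters, where the centre frequency of the m-th filter is f(m), m = 1, 2, ..., M. The transfer function of the m-th triangular filter is given by formula (5), in which $H_m(\omega)$ is the triangular filter transfer function. The logarithm of the energy passed by each filter is then taken:

$$S(m) = \ln\left(\sum_{\omega=0}^{N-1} |X(\omega)|^2 H_m(\omega)\right), \quad m = 1, 2, \dots, M \tag{6}$$

In formula (6): S(m) is the logarithm of the summed energy; M is the number of filters in the bank; X(ω) is the frequency-domain signal; $H_m(\omega)$ is the triangular filter transfer function;

A discrete cosine transform of S(m) gives the Mel cepstral coefficients:

$$C(n) = \sum_{m=1}^{M} S(m) \cos\left(\frac{\pi n (m - 0.5)}{M}\right), \quad n = 1, 2, \dots, L \tag{7}$$

In formula (7): C(n) is the Mel cepstral coefficient; S(m) is the logarithm of the summed energy; M is the number of filters in the bank; L is the dimension of the Mel cepstral coefficients.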
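The time-frequency tracking described at the start of claim 7 can be sketched as follows. This is a minimal sketch under stated assumptions: the time-frequency spectrum is taken as already computed (a list of frames, each a list of power values), "peaks" are read as strict local maxima of the tracked frequency contour, smoothing is a moving average, and the derivative is a first difference. None of these choices, nor the function names, are specified by the claim.

```python
def track_peak_frequency(spectrogram, freqs):
    """For each time frame, return the frequency with the largest power spectral density."""
    return [freqs[max(range(len(frame)), key=frame.__getitem__)]
            for frame in spectrogram]

def peaks_per_unit_time(track, frame_rate):
    """Count strict local maxima of the contour, divided by its duration in seconds."""
    peaks = sum(1 for k in range(1, len(track) - 1)
                if track[k - 1] < track[k] > track[k + 1])
    return peaks / (len(track) / frame_rate)

def smooth(track, width=3):
    """Centred moving average; the window shrinks at the two ends."""
    half = width // 2
    out = []
    for k in range(len(track)):
        window = track[max(0, k - half):k + half + 1]
        out.append(sum(window) / len(window))
    return out

def time_frequency_derivative(track, frame_rate):
    """First difference of the contour, in Hz per second."""
    return [(track[k + 1] - track[k]) * frame_rate
            for k in range(len(track) - 1)]

# Hypothetical spectrogram: 8 frames at 4 frames per second, 3 frequency bins.
freqs = [500.0, 1000.0, 1500.0]
peak_bins = [0, 1, 2, 1, 0, 1, 2, 1]
spectrogram = [[1.0 if b == p else 0.1 for b in range(3)] for p in peak_bins]

track = track_peak_frequency(spectrogram, freqs)
rate = peaks_per_unit_time(track, frame_rate=4.0)
slope = time_frequency_derivative(smooth(track), frame_rate=4.0)
```

A wailing siren would give a contour that rises and falls periodically, so its peak count per unit time is a compact description of the sweep rate.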

8. The method for recognizing special sounds in the traffic environment of an autonomous vehicle according to claim 7, characterized in that the dimension of the Mel cepstral coefficients is taken as L = 12; the number M of filters in the filter bank is close to the number of critical bands, with M taken as 22 to 26; and the filters are triangular filters.
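The MFCC steps of claim 7 with the parameter choices of claim 8 (L = 12, M = 24, triangular filters) might look like the sketch below for a single frame. Several details are assumptions not taken from the claims: the Hz-to-Mel conversion 2595·log10(1 + f/700), filters with unit peak height, an 8 kHz sampling rate, a 256-sample Hamming-windowed frame, and a small floor inside the logarithm. The function names are illustrative.

```python
import cmath
import math

def dft(frame):
    """Direct DFT: X(w) = sum_t x(t) * exp(-j*2*pi*w*t/N)."""
    n = len(frame)
    return [sum(frame[t] * cmath.exp(-2j * math.pi * w * t / n) for t in range(n))
            for w in range(n)]

def hz_to_mel(f):
    return 2595.0 * math.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(num_filters, n_fft, fs):
    """Unit-peak triangular filters with centres equally spaced on the Mel scale."""
    top = hz_to_mel(fs / 2.0)
    edges_hz = [mel_to_hz(top * k / (num_filters + 1)) for k in range(num_filters + 2)]
    bins = [int(math.floor((n_fft + 1) * f / fs)) for f in edges_hz]
    bank = []
    for m in range(1, num_filters + 1):
        lo, mid, hi = bins[m - 1], bins[m], bins[m + 1]
        h = [0.0] * (n_fft // 2 + 1)
        for w in range(lo, mid):                 # rising edge
            h[w] = (w - lo) / (mid - lo)
        for w in range(mid, hi + 1):             # peak and falling edge
            h[w] = (hi - w) / (hi - mid) if hi > mid else 1.0
        bank.append(h)
    return bank

def mfcc(frame, fs, num_filters=24, num_coeffs=12):
    n_fft = len(frame)
    spectrum = dft(frame)
    # Energy spectrum; bins above fs/2 are skipped because every filter is zero there.
    power = [abs(spectrum[w]) ** 2 for w in range(n_fft // 2 + 1)]
    bank = mel_filterbank(num_filters, n_fft, fs)
    # Logarithm of the energy collected by each filter (floored to avoid log(0)).
    s = [math.log(max(sum(p * h for p, h in zip(power, filt)), 1e-12))
         for filt in bank]
    # Discrete cosine transform of the log energies.
    return [sum(s[m - 1] * math.cos(math.pi * k * (m - 0.5) / num_filters)
                for m in range(1, num_filters + 1))
            for k in range(1, num_coeffs + 1)]

# Hypothetical frame: a Hamming-windowed 1 kHz tone, 256 samples at 8 kHz.
fs = 8000
n = 256
frame = [(0.54 - 0.46 * math.cos(2 * math.pi * t / (n - 1)))
         * math.sin(2 * math.pi * 1000 * t / fs) for t in range(n)]
coeffs = mfcc(frame, fs)
```

The direct DFT is quadratic in the frame length and is used here only to keep the sketch self-contained; a production implementation would normally use an FFT.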

9. The method for recognizing special sounds in the traffic environment of an autonomous vehicle according to claim 3, characterized in that the sound recognition process is as follows:

(1) First-level recognition: from the extracted time-frequency features, mechanical sounds and human voices are preliminarily distinguished by the frequency band range, the number of peaks per unit time and the zero-crossing rate per unit time. In addition, the motion state of the sound is identified from the number of peaks per unit time and the zero-crossing rate per unit time in the extracted time-frequency domain, as follows: if both quantities gradually increase, the sound source is approaching; if both gradually decrease, it is moving away; if both are essentially unchanged, it is relatively stationary;

(2) Second-level recognition: a recognition model is trained on the MFCC parameters of standardized special sounds obtained by testing, and the preliminarily classified special sound signal is then input to the trained SVM recognition model to complete the sound recognition.
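The motion-state rule of the first-level recognition can be sketched as below. The claim does not say how "gradually increases" or "essentially unchanged" are measured, so the 5% relative-change tolerance, the first-to-last comparison and the `"undetermined"` fallback for conflicting trends are all assumptions, as are the function names.

```python
def zero_crossing_rate(samples, fs):
    """Sign changes between consecutive samples, scaled to crossings per second."""
    crossings = sum(1 for a, b in zip(samples, samples[1:])
                    if (a >= 0) != (b >= 0))
    return crossings * fs / len(samples)

def trend(values, tolerance=0.05):
    """Relative change from first to last value: +1 rising, -1 falling, 0 roughly flat."""
    change = (values[-1] - values[0]) / max(abs(values[0]), 1e-12)
    if change > tolerance:
        return 1
    if change < -tolerance:
        return -1
    return 0

def motion_state(peak_counts, zero_crossing_rates, tolerance=0.05):
    """Combine the trends of peaks per unit time and zero-crossing rate."""
    p = trend(peak_counts, tolerance)
    z = trend(zero_crossing_rates, tolerance)
    if p > 0 and z > 0:
        return "approaching"
    if p < 0 and z < 0:
        return "receding"
    if p == 0 and z == 0:
        return "relatively stationary"
    return "undetermined"   # the two features disagree; not covered by the claim

# Hypothetical feature sequences measured over three successive windows.
state = motion_state([10, 11, 12], [200, 220, 240])
```

In this example both features rise by more than the tolerance, so `state` is `"approaching"`.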

10. The method for recognizing special sounds in the traffic environment of an autonomous vehicle according to claim 9, characterized in that the algorithm of the SVM recognition model is as follows:

An SVM classification model is established. The SVM classifier is trained with the feature parameters C(n) of the sample special sounds stored in the storage unit, and the feature parameters C′(n) of the abnormal sound obtained in real time are taken as input for classification and recognition. The nonlinear SVM algorithm is as follows:

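The training samples are $\{(x_i, d_i)\}$, where $x_i \in R^l$ is the i-th input pattern and $d_i \in \{+1, -1\}$ is its corresponding desired output. Through the nonlinear mapping $\varphi(x): R^l \to R^n$, the input vectors of the original space are mapped to an n-dimensional feature space, in which the optimal hyperplane is $W^T \varphi(x) + b = 0$, where W is the normal of the hyperplane and an adjustable weight vector, and b is the bias, which determines the position of the hyperplane relative to the origin. When certain conditions are met, this can be converted into the problem of minimizing $\|W\|$, expressed in the following form:

$$\min_{W, b, \xi} \; \frac{1}{2}\|W\|^2 + C \sum_i \xi_i \quad \text{s.t.} \quad d_i\left(W^T \varphi(x_i) + b\right) \ge 1 - \xi_i, \; \xi_i \ge 0$$

In the formula, C is a constant that affects the decision accuracy, and $\xi_i$ is the slack variable;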

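The above optimization problem is solved with Lagrange multipliers, with the expression:

$$L(W, b, \xi, \alpha, \beta) = \frac{1}{2}\|W\|^2 + C \sum_i \xi_i - \sum_i \alpha_i \left[ d_i\left(W^T \varphi(x_i) + b\right) - 1 + \xi_i \right] - \sum_i \beta_i \xi_i$$

In the formula, $\alpha_i$ and $\beta_i$ are non-negative Lagrange coefficients. Setting:

$$\frac{\partial L}{\partial W} = 0, \quad \frac{\partial L}{\partial b} = 0, \quad \frac{\partial L}{\partial \xi_i} = 0$$

gives:

$$W = \sum_i \alpha_i d_i \varphi(x_i), \quad \sum_i \alpha_i d_i = 0, \quad C - \alpha_i - \beta_i = 0$$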

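To evaluate the dot product $\varphi(x_i)^T \varphi(x_j)$ in the high-dimensional feature space, the nonlinear mapping need not be known explicitly: a kernel function in the original space that satisfies the Mercer condition can be found such that $K(x_i, x_j) = \varphi(x_i)^T \varphi(x_j)$, so the problem is restated as:

$$\max_{\alpha} \; \sum_i \alpha_i - \frac{1}{2} \sum_i \sum_j \alpha_i \alpha_j d_i d_j K(x_i, x_j) \quad \text{s.t.} \quad \sum_i \alpha_i d_i = 0, \; 0 \le \alpha_i \le C$$

The Gaussian kernel function is selected as the kernel function:

$$K(x_i, x_j) = \exp\left(-\frac{\|x_i - x_j\|^2}{2\sigma^2}\right)$$

In the formula, σ is the bandwidth, which controls the radial range of influence.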

After training on the sample data, the classifier formed by the optimal hyperplane and the associated parameters classifies according to the following formula:

$$f(x) = \operatorname{sgn}\left(\sum_i \alpha_i d_i K(x_i, x) + b\right)$$

thereby completing the classification and recognition of the special sound.
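The classification step can be illustrated with the sketch below, which evaluates a kernel decision function for an already trained model. It assumes the Gaussian kernel in the common form exp(-‖x − y‖² / (2σ²)) and a decision of the form sgn(Σ αᵢ dᵢ K(xᵢ, x) + b). The support vectors, coefficients, bias and σ are made-up values for illustration, not trained results, and the function names are hypothetical. Training itself (solving for the αᵢ) is not shown.

```python
import math

def gaussian_kernel(x, y, sigma):
    """Gaussian (RBF) kernel with bandwidth sigma."""
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-squared_distance / (2.0 * sigma ** 2))

def svm_classify(x, support_vectors, alphas, labels, bias, sigma):
    """Return +1 or -1 from the sign of the kernel decision function."""
    score = sum(a * d * gaussian_kernel(sv, x, sigma)
                for sv, a, d in zip(support_vectors, alphas, labels)) + bias
    return 1 if score >= 0 else -1

# Hypothetical two-dimensional model with one support vector per class.
support_vectors = [[0.0, 0.0], [2.0, 2.0]]
alphas = [1.0, 1.0]
labels = [+1, -1]
bias = 0.0
sigma = 1.0

near_first = svm_classify([0.2, 0.1], support_vectors, alphas, labels, bias, sigma)
near_second = svm_classify([1.9, 2.2], support_vectors, alphas, labels, bias, sigma)
```

In the method described here the input vector would be the 12-dimensional MFCC feature C′(n) rather than a two-dimensional point. Distinguishing more than two sound types would need several such binary classifiers combined, for example one-versus-rest, which is a design choice the claim does not specify.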