CN106907697A - A kind of intelligent road-lamp of built-in collision sound detection function - Google Patents
A kind of intelligent road-lamp of built-in collision sound detection function Download PDFInfo
- Publication number
- CN106907697A CN106907697A CN201710256831.9A CN201710256831A CN106907697A CN 106907697 A CN106907697 A CN 106907697A CN 201710256831 A CN201710256831 A CN 201710256831A CN 106907697 A CN106907697 A CN 106907697A
- Authority
- CN
- China
- Prior art keywords
- module
- road
- lamp
- sound
- information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
Classifications
-
- F—MECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
- F21—LIGHTING
- F21V—FUNCTIONAL FEATURES OR DETAILS OF LIGHTING DEVICES OR SYSTEMS THEREOF; STRUCTURAL COMBINATIONS OF LIGHTING DEVICES WITH OTHER ARTICLES, NOT OTHERWISE PROVIDED FOR
- F21V33/00—Structural combinations of lighting devices with other articles, not otherwise provided for
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/02—Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/18—Artificial neural networks; Connectionist approaches
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
-
- F—MECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
- F21—LIGHTING
- F21W—INDEXING SCHEME ASSOCIATED WITH SUBCLASSES F21K, F21L, F21S and F21V, RELATING TO USES OR APPLICATIONS OF LIGHTING DEVICES OR SYSTEMS
- F21W2131/00—Use or application of lighting devices or systems not provided for in codes F21W2102/00-F21W2121/00
- F21W2131/10—Outdoor lighting
- F21W2131/103—Outdoor lighting of streets or roads
Abstract
The invention discloses a kind of intelligent road-lamp of built-in collision sound detection function, sound acquisition module, impact sound identification module, memory module and for Street lamps control and the controller for road lamp of information transfer are set in street lamp;The voice signal that sound acquisition module is used in continuous acquisition road;Memory module is used to store the voice signal that sound acquisition module is gathered;Impact sound identification module be used to recognizing in gathered voice signal it is whether miscellaneous be sent to controller for road lamp containing impact sound and by recognition result, whether controller for road lamp judges anomalous event occur in road and will send information to remote monitoring center with this.Using technical scheme, realize that impact sound is recognized using deep neural network, so as to improve impact sound accuracy of identification, while using the further perfect conduct monitoring at all levels of road safety of monitor mode that audio frequency and video are combined, and can be in the road traffic of early warning in time anomalous event.
Description
Technical field
The present invention relates to intelligent transportation field, more particularly to a kind of built-in collision sound detection function of being applied to road lighting
Intelligent road-lamp.
Background technology
In recent years, with cloud computing, big data, artificial intelligence, the especially maturation of high-performance calculation hardware technology, machine
The maturation of learning areas deep learning (deep learning, DL) theory and automated characterization learning neural network model, depth
, in artificial intelligence field (intelligent image and voice recognition) extensive use, major IT giants are by artificial intelligence for neutral net
As next core technology growth point, meanwhile, the integrated core of large quantities of build-in depths neutral net frameworks has also been expedited the emergence of in market
Piece, such as the tall and handsome chip Tesla P100 that first design of starting from scratch exclusively for deep learning is released up to (NVIDIA), the core
Sheet data processing speed is release GPU series for its 2014 12 times;Google is the chip TPU of machine learning customization by hardware
Can be promoted to equivalent to the level after developing 7 years according to Moore's Law;In artificial intelligence field, also unwillingness falls domestic relevant enterprise
Afterwards, such as Chengdu is opened Ying Tailun science and technology and releases a intelligent sound chip CI1006, is the artificial intelligence language based on ASIC frameworks
Sound identification chip, contains deep neural network treatment hardware cell, perfect can support DNN computing frameworks, carries out high-performance
Data parallel, can greatly improve treatment effeciency of the artificial intelligence deep learning voice technology to mass data;
CI1006 can reduce dependence of the product for network, lifting intelligent sound identification response using local Neural Network Data treatment
And control speed, can be widely applied to the product scopes such as intelligent household appliances, robot, wisdom toy.
Therefore, by deep neural network be implanted in existing embedded intelligence control system technically completely into
It is ripe.Although the street lamp of prior art has formed intelligent control network, its intellectuality for only realizing road lighting, and street lamp
System, if integrated security monitoring function, will be able to play street lamp bigger as a huge network in physical space
Effect, such as major traffic accidents monitoring.When major traffic accidents occur, travel condition of vehicle have sent corresponding change,
Especially when high velocity impact, with the sound of sharp impacts, by recognizing that impact sound can just detect traffic accident.So
And, ambient noise is noisy in road, and impact sound cannot be accurately detected using existing sound detection technology.
Therefore, for drawbacks described above present in currently available technology, it is necessary to be studied in fact, to provide a kind of scheme,
Solve defect present in prior art.
The content of the invention
In view of this, it is necessory to provide the built-in intelligent road-lamp for colliding sound detection function such that it is able to quick detection weight
Big traffic accident, timely early warning, rescue, it is to avoid accident is sent out after causing.
In order to overcome the defect of prior art, technical scheme is as follows:
A kind of intelligent road-lamp of built-in collision sound detection function, sets sound acquisition module, impact sound identification in street lamp
Module, memory module and for Street lamps control and the controller for road lamp of information transfer;
The voice signal that the sound acquisition module is used in continuous acquisition road;
The memory module is used to store the voice signal that the sound acquisition module is gathered;
Whether the impact sound identification module is used to recognizing in gathered voice signal miscellaneous containing impact sound and will identification knot
Fruit is sent to controller for road lamp, and whether the controller for road lamp judges anomalous event occur in road and will send information to this
Remote monitoring center;
The impact sound identification module includes the extraction module of feature first, the first normalization module, neural network classification mould
Block and neural metwork training module, wherein,
The fisrt feature extraction module is used to receive acquired original voice data, and acquired original voice data is carried out
Feature extraction;
The first normalization module is used to carry out the data after feature extraction Gaussian normalization processing, output normalization
Data;
The neural network classification module is used to receive normalization data and the good deep neural network of training in advance, and leads to
Crossing deep neural network carries out Classification and Identification acquisition sorting result information to the normalization data, and the sorting result information is
Whether the miscellaneous probable value containing impact sound of acquired original voice data, when the probable value exceedes threshold value set in advance, then judges
Occurs anomalous event in road;
The neural metwork training module is used to receive training data and the training of neutral net is carried out according to training data,
Obtain abundant learning tape to make an uproar the deep neural network of the Nonlinear Mapping relation between sample and clean sample, and export the depth
Spend neutral net to the neural network classification module;
The neural metwork training module includes that second feature extraction module, the second normalization module, unsupervised learning are pre-
Training module and supervised learning optimization module, wherein, the unsupervised learning pre-training module is used to find depth in input data
Layer abstract characteristics, using be restricted Boltzmann machine (RBM) model carry out pre-training and by unsupervised learning by the way of by
Layer progressive learning neural network parameter;The supervised learning optimization module uses backpropagation (back-propagation, BP)
Algorithm, the intense adjustment for having supervision is carried out to neural network parameter using labeled data.
Preferably, the fisrt feature extraction module/second feature extraction module further includes that framing module, DFT become
Mold changing block and log power spectrum processing module, wherein, the framing module is used to carry out sub-frame processing to input data;It is described
DFT transform module obtains frequency domain information for carrying out discrete Fourier transform to the data after sub-frame processing;The log power
Spectrum processing module is used to carry out log power spectrum treatment to frequency domain information.
Preferably, the impact sound identification module also includes average energy detection module and frame energy comparison module, wherein,
The average energy detection module is used to calculate the average energy value of present frame log power spectrum and be sent to the frame energy ratio
Compared with module;Whether the frame energy comparison module is used to calculate the difference of consecutive frame the average energy value and judges the difference beyond pre-
If threshold value, if it exceeds then opening the neural network classification module.
Preferably, the frame energy comparison module receives current information of vehicle flowrate, and calculates present frame the average energy value and be
The no energy value scope beyond corresponding to default corresponding information of vehicle flowrate, if it exceeds then opening the neural network classification mould
Block.
Preferably, at the sound acquisition module further includes to be made up of multiple microphones microphone array, audio
Reason module and control module, wherein,
It is multiple in the microphone array that microphone is in certain geometrical shape and each microphone has unique ID;
The audio processing modules are used to synchronously obtain and identify the voice signal of each microphone collection and to the sound
Message number exports audio-frequency information after being processed;
The control module is connected with the audio processing modules, for control the audio processing modules work simultaneously
Audio-frequency information after audio processing modules treatment is stored in the memory module.
Preferably, also including impact sound locating module, the impact sound locating module is connected with the controller for road lamp,
For obtaining the particular location of impact sound and positional information being sent into the controller for road lamp;
When the impact sound identification module judges anomalous event occur, the impact sound locating module obtains the abnormal thing
The temporal information of part simultaneously obtains the audio-frequency information that corresponding microphone array is listed in the temporal information from the memory module, and root
The parameter information of the position relationship fixed according to each microphone and each microphone correspondence audio-frequency information in the temporal information
Determine the particular location residing for impact sound.
Preferably, the parameter information be each microphone in the temporal information correspondence audio-frequency information peak strength with
And each microphone is in the peak strength corresponding time difference.
Preferably, the controller for road lamp is also connected with Rotatable camera device, and the Rotatable camera device is set
Rotated on light pole and according to the control instruction of the controller for road lamp;
When the impact sound identification module judges anomalous event occur in road, the controller for road lamp control is described can
Rotating pick-up device is rotated to particular location determined by the impact sound locating module.
Preferably, the multiple microphones in the microphone array are arranged on lamp surface with certain geometrical shape.
Preferably, the impact sound identification module is realized using the artificial intelligence chip of build-in depths neutral net.
Compared with prior art, be integrated in impact sound identification technology in street lamp and road monitoring neck is applied to by the present invention
Domain, realizes that impact sound is recognized using deep neural network, so that impact sound accuracy of identification is improved, the monitoring combined using audio frequency and video
The further perfect conduct monitoring at all levels of road safety of mode, and can be in the road traffic of early warning in time anomalous event.
Brief description of the drawings
Fig. 1 is the theory diagram of the intelligent road-lamp of the built-in collision sound detection function of the present invention.
Fig. 2 is the theory diagram of impact sound identification module in the built-in intelligent road-lamp for colliding sound detection function of the present invention.
Fig. 3 is to be restricted Boltzmann machine (RBM) structural representation.
Fig. 4 is the pre-training schematic diagram of RBM in the present invention.
Fig. 5 is the structured flowchart of the deep neural network that training is obtained.
Fig. 6 is the theory diagram of characteristic extracting module in the present invention.
Fig. 7 is the theory diagram of impact sound identification module another embodiment in the present invention.
Fig. 8 is the theory diagram of sound acquisition module in the present invention.
Fig. 9 is the schematic diagram of microphone array arrangement.
Figure 10 is the theory diagram of another preferred embodiment of the invention.
Figure 11 is the Organization Chart of the intelligent sound Processing with Neural Network chip CI1006 that the present invention is used.
Specific examples below will further illustrate the present invention with reference to above-mentioned accompanying drawing.
Specific embodiment
The intelligent road-lamp of the built-in collision sound detection function of being provided the present invention below with reference to accompanying drawing is described further.
As background technology is introduced, machine learning field deep learning (deep learning, DL) it is theoretical and from
The maturation of dynamic feature learning neural network model, deep neural network in artificial intelligence field (intelligent image and voice recognition)
Through extensive use, the speech recognition technology of prior art can recognize the voice of people, Er Qieneng under complicated Background environmental noise
Relatively accurately identify semanteme.In technical field of voice recognition, recognized whether under complicated Background environmental noise voice (or
Person other sound) and in the absence of technical difficulty, and real difficulty is semantics recognition, the conversion of voice accuracy high is written
Word not enough, will understand what the mankind saying, what to be expressed and is intended to, and this is only the jewel on imperial crown.This is primarily due to
The species of voice has almost countless and different people to be even more different accents, while the ambient noise ring residing for voice
Border is even more changeable, and almost each voice scene can have different ambient noises.Accordingly, it would be desirable to huge amount of calculation could be completed
Real-time semantic analysis.
Relative to the applied environment of the application, although ambient noise is complex in road, but car crass sound type
It is to be relatively fixed, the especially impact sound produced by high velocity impact, the complexity of impact sound identification is not as good as the ten thousand of voice complexity
/ mono-, while impact sound instantaneous strength is greatly, the easy subregion of sound characteristic.Using the intelligent sound of deep neural network (DNN)
Sound identification is that conventional acoustic treatment technology needs to assume various preferable shapes relative to the advantage of conventional acoustic treatment technology
State, the desirability that these are assumed turns into the key factor of influence performance naturally, and DNN is with little need for any other condition
It is assumed that can constantly be approached by constantly study, so as to reach the purpose of accurate identification.That is DNN by multilayer and it is thousands of on
Ten thousand neuron nodes with computing capability are superimposed as a depth network structure, then this DNN are trained, with big
Road noise training DNN under amount clean sample and various situations, the purpose is to know to enough from known data learning
Know, be then generalized to following emerging data, make effective decision-making.Namely made an uproar sample and clean as learning tape with DNN
The regression model of the Nonlinear Mapping relation between sample, using the depth structure and non-linear simulation ability of DNN, Ke Yichong
The complicated interaction relationship for dividing learning tape to make an uproar between sample and clean sample, the learning process of neutral net is one unlimited
The process of approaching to reality, can automatically adjust the parameter and weight of neutral net according to the data of input, and the data of its training are more,
The result of identification is more accurate.After DNN training is completed, when actually detected, the road acoustical signal input that will be gathered in real road
In DNN, so as to judge whether the voice signal is miscellaneous containing impact sound.
Referring to Fig. 1, the theory diagram of the intelligent road-lamp of the built-in collision sound detection function of the present invention is shown, set in street lamp
Put sound acquisition module, impact sound identification module, memory module and for Street lamps control and the controller for road lamp of information transfer;
The voice signal that sound acquisition module is used in continuous acquisition road;Memory module is used to store what sound acquisition module was gathered
Voice signal;Whether impact sound identification module is used to recognizing in gathered voice signal miscellaneous sends out containing impact sound and by recognition result
Controller for road lamp is sent to, whether controller for road lamp judges anomalous event occur in road and will send information to remote monitoring with this
Center.
Referring to Fig. 2, the original of impact sound identification module in the intelligent road-lamp of the built-in collision sound detection function of the present invention is shown
Reason block diagram, including the extraction module of feature first, the first normalization module, neural network classification module and neural metwork training mould
Block, wherein, fisrt feature extraction module is used to receive acquired original voice data, and carries out feature to acquired original voice data
Extract;First normalization module is used to carry out the data after feature extraction Gaussian normalization processing, exports normalization data;
Neural network classification module is used to receive normalization data and the good deep neural network of training in advance, and by depth
Degree neutral net carries out Classification and Identification to normalization data and obtains sorting result information, and sorting result information is acquired original sound
The no miscellaneous probable value containing impact sound of data, when the probable value exceedes threshold value set in advance, then judges exception occur in road
Event;
Neural metwork training module is used to receive training data and the training of neutral net is carried out according to training data, obtains
Abundant learning tape is made an uproar the deep neural network of the Nonlinear Mapping relation between sample and clean sample, exports deep neural network
To neural network classification module;
Neural metwork training module includes second feature extraction module, the second normalization module, unsupervised learning pre-training
Module and supervised learning optimization module, wherein, second feature extraction module is complete with the functional structure of fisrt feature extraction module
It is identical, the feature for extracting training data;Second normalization module is identical with the functional structure of the first normalization module,
Feature to being extracted carries out Gaussian normalization, i.e., into 0, it is 1 that variance is regular to the mean normalization of all training datas.It is unsupervised
Study pre-training module carries out unsupervised learning initial training using training data is pre-processed as input, deep for initializing generation
The structure of neutral net is spent, the successively progressive learning neural network parameter by way of unsupervised learning, in finding input data
The abstract characteristics of deep layer.Every layer of use of neutral net is restricted Boltzmann machine (RBM) model and carries out pre-training, by multiple
RBM is superimposed as a depth network structure.Referring to Fig. 3, showing is restricted Boltzmann machine (RBM) structural representation, RBM's
Symmetrical connection is existed only between aobvious node layer and hidden node, and does not have any shape in the inside of aobvious node layer and hidden node
The connection of formula, it is believed that interlayer is full connection, is connectionless in layer.RBM is used as a kind of condition random field, each of which neuron
Node describes a distribution situation for stochastic variable, and the higher order statistical phase in input vector is captured by each neuron node
Closing property trains the potential rule included in input vector to explain and find.
Due to all not connected inside the aobvious layers of RBM and hidden layer, can very easily obtain each under data and model profile
The conditional expectation of state.For given training quantity v, the state of hidden node can be calculated by below equation (1):
P(hi=1 | v)=σ (bj+∑viwij) (1)
Then use to sdpecific dispersion algorithm (CD1 algorithms, Contrastive Divergence, CD) to train RBM, then make
RBM parameters are updated with gradient descent algorithm:
In above formula (2), η is parameter renewal learning speed, and i is iterations.By the model parameter for adjusting RBM so that
Reduced by the energy of the aobvious layer data specified of RBM, so that increase the probability that aobvious layer data occurs, and then training is arrived in RBM study
True distribution P (v) of data.
According to the method described above training complete a RBM after, study to weight fix, by training data calculate
The RBM hidden layer states for obtaining can be used to as the input data for training another RBM, namely train first using training data
Individual RBM obtains a hidden layer L1 and its network weight W1, reuses the output of previous hidden layer as input data, successively instructs
The follow-up RBM of white silk obtains 2~W3 of hidden layer L2~L3 and network weight matrix W.Specific training process show this referring to Fig. 4
The pre-training schematic diagram of RBM, all of network weight is successively initialized with this Greedy in invention, so that further unsupervised
Study RBM Hidden units between dependence.After all of RBM has been trained, each RBM is superimposed, then most
Later layer is superimposed one softmax layers, thus constitute one it is bottom-up feedforward, deep layer, distinction for classifying
Deep-neural-network.Because the accumulation using RBM constitutes a depth network structure, having in this, as deep-neural-network
Initialization networking weight during supervised training, can prevent it to be absorbed in local optimum.
Trained by RBM generative natures successively, we can find a more preferable region in weight space, from this
Region is set out, and the supervised learning of distinction can be made to optimize (intense adjustment) and proceed by supervision relative to from random initializtion
Practise optimization (intense adjustment) and obtain better performance lifting, can also substantially reduce the possibility of over-fitting.Supervised in the present invention and learned
Backpropagation (back-propagation, BP) algorithm that optimization module is commonly used using prior art is practised, using labeled data pair
Neural network parameter carries out the intense adjustment for having supervision.In the algorithm, two steps are generally divided into:1) response is propagated forward, will be defeated
Enter and exciter response is obtained by each hidden layer, and the output of last layer is next layer of input, is to the last predicted for one layer
Value;2) backward error propagate, according to forward response travel to last layer, the prediction to signal can be obtained, this predicted value and
The difference of reference signal, exactly needs the mistake of backpropagation.Have the reversely mistake passed back, it is possible to according to this mistake come
Adjust each weight and the biasing of neutral net.It is ready to after the input data of DNN and output data, it is possible to start to update
The weight and offset parameter of network, i.e. W and b, shown in equation below 3:
Here λ represents learning rate, and E represents an object function for optimizing, can be accurate using least mean-square error
Then;RepresentThe parameter for having weight to be learned and biasing of layer.L represents the number of hidden layer really, then L+1 is just
Represent output layer.By above-mentioned formula as can be seen that in the renewal process of model parameter, being almost set without any hypothesis,
Therefore, DNN can well be fitted the non-linear relation that band is made an uproar between sample and clean sample.
In actual neural metwork training, it is the key factor for influenceing accuracy of detection that whether training data complete.In the present invention,
Build " impact sound training dataset " and " road noise training dataset ", wherein, road noise training dataset is by reality
The voice data of various situations is gathered under the road environment of border and data are labeled;Impact sound training dataset collects various cars
The voice data of type impact test, and data are labeled according to impact strength;Clean impact sound collection is made an uproar with road respectively
Sound is added together, obtains band and makes an uproar sample.By the above training data sample input neural network model training network weight and
Offset parameter.Referring to Fig. 5, the structured flowchart of the deep neural network that training is obtained is shown, neutral net includes 1 input
Layer, L1~L3 and output layer of 3 hidden layers.During input signal feature extraction, signal is sampled 8KHz, corresponding every
Individual frame length is set to 256 sample points (32 milliseconds), and it is 128 sample points that frame is moved, and Short Time Fourier Analysis are used to calculate
The DFT coefficient of each overlapping frame, therefore, input layer uses 128 nodes, and the dimension of correspondence input data, output layer is three-dimensional
Data output, corresponds to pure noise, miscellaneous containing impact sound and miscellaneous containing voice respectively.Every node layer is 2048 in L1~L3, and it takes
Certainly in the number of training data, 2048 correspondences, 1,000,000 training datas.The iteration of the pre-training of each limited Boltzmann machine
Number of times is 50 times, and the learning rate of pre-training is 0.0005, and the learning rate of the first two tuning for having supervision of ten times is 0.1, so
Learning rate is successively decreased 10 every time afterwards, total iterations is 100 times.
The deep neural network trained using aforesaid way, with the increase of training data, systematic function is improved constantly,
In class test, actual discrimination reaches 80%, the threshold value of early warning can be set into 60% in practice, can be used as road exception
The effective evaluation index of event early warning.
Referring to Fig. 6, the theory diagram of characteristic extracting module in the present invention is shown, fisrt feature extraction module/the second is special
Levy extraction module and further include framing module, DFT transform module and log power spectrum processing module, wherein, framing module is used
In sub-frame processing is carried out to input data, using overlapping segmentation, the proportion that general frame shifting accounts for frame length is 0-50%;DFT transform mould
Block obtains frequency domain information for carrying out discrete Fourier transform to the data after sub-frame processing;Log power spectrum processing module is used for
Log power spectrum treatment is carried out to frequency domain information, the quadratic sum equivalent to each coefficient modulus after DFT transform is taken the logarithm, and it is right to take
Number can simulate nonlinear perception characteristic of the human ear to the sound intensity, and information, than more complete, is not almost lost on log power spectrum in addition
Any information is lost, is conducive to improving accuracy of detection.
In a preferred embodiment, impact sound identification module uses the artificial intelligence chip of build-in depths neutral net
Realize.Although having the artificial intelligence chip of many powerful build-in depths neutral nets in the prior art, chip-scale
Calculating performance can't compare favourably with PC grades of calculating performance after all, cannot generally meet the requirement of real-time;Meanwhile, in this Shen
In applied environment please, car crass after all or small probability event (especially major traffic accidents), it is therefore not necessary to open in real time
Neural network classification module is opened to be identified.Referring to Fig. 7, impact sound identification module another embodiment in the present invention is shown
Theory diagram, impact sound identification module also include average energy detection module and frame energy comparison module, wherein, average energy
Detection module is used to calculate the average energy value of present frame log power spectrum and be sent to frame energy comparison module;Frame energy comparison
Whether module is used to calculate the difference of consecutive frame the average energy value and judges the difference beyond default threshold value, if it exceeds then opening
Open neural network classification module.Relative to the microphone that position is fixed, voice signal Energy distribution and the actual sound field of its collection
Distribution proportion relation.And in the applied environment of the application, under normal circumstances, the energy of voice signal is relatively flat
Steady fluctuation, its fluctuation range can be previously obtained by practical measurement;And impact sound is a flash energy for burst, its
The far super normal fluctuation range of intensity level.Therefore can decide whether pulse occur by front and rear frame energy comparison, if strong arteries and veins
Rush energy to uprush, whether impact sound is (main to be then turned on flash during neural network classification module carries out judging this section audio signal
Distinguish sound of blowing a whistle).Using above-mentioned technical proposal, unlocking condition is used as by frame energy anticipation, deep neural network is without real
Shi Yunhang, can reduce system power dissipation.
In a preferred embodiment, frame energy comparison module receives current information of vehicle flowrate, and it is flat to calculate present frame
Whether equal energy value is beyond the energy value scope preset corresponding to corresponding information of vehicle flowrate, if it exceeds then opening neutral net point
Generic module.Analyzed as above-mentioned, the noise that traffic current is produced is in proportionate relationship with vehicle flowrate, although various vehicle meetings
There is different, be substantially opposite the energy range that specific vehicle flowrate has correspondence voice signal, we can be by adopting
Voice data in collection real road occasion is simultaneously analyzed to mass data this energy range is obtained ahead of time.Therefore, lead to
Cross judge current energy value whether correspondence vehicle flowrate energy range within as neural network classification module open strip
Part, accuracy of detection can be further improved while neutral net operating time is reduced.
Referring to Fig. 8, show the theory diagram of sound acquisition module in the present invention, sound acquisition module further include by
Microphone array, audio processing modules and control module that multiple microphones are constituted, wherein, multiple microphones in microphone array
In certain geometrical shape and each microphone has unique ID;Audio processing modules are used to synchronously obtain and identify each wheat
The voice signal of gram elegance collection simultaneously exports audio-frequency information after processing voice signal;Control module and audio processing modules phase
Connection, the audio-frequency information for controlling the work of audio processing modules and after audio processing modules are processed is stored in memory module
In.Using above-mentioned technical proposal, by the audio signal of audio processing modules synchronous acquisition microphone array, the wheat of any one ID
Gram wind all gathers continuous audio-frequency information, and stores in a storage module.Due to setting multiple microphones, so as to ensure to be gathered
The integrality of audio signal, compensate for the defect of audio quality difference when prior art video monitoring is applied in the road;Meanwhile, it is many
Individual microphone is fixedly installed in certain geometrical shape, referring to Fig. 9, show the schematic diagram of microphone array arrangement, multiple unique
The mark rounded setting of microphone and synchronized sampling, in theory, when sound source sends sound in region, because sound source distance is every
Individual microphone is apart from different, therefore each microphone receives the time that the intensity and signal of signal reaches and difference occurs
It is different, therefore, the relative position information that the otherness signal received according to each microphone and each microphone determine just can be true
Determine the location of sound source.
In order to determine the particular location of impact sound in time, referring to Figure 10, another side of being preferable to carry out of the invention is shown
The theory diagram of formula, also including impact sound locating module, impact sound locating module is connected with controller for road lamp, is touched for obtaining
Hit the particular location of sound and positional information is sent to controller for road lamp;When impact sound identification module judges anomalous event occur
When, impact sound locating module obtains the temporal information of the anomalous event and corresponding microphone array is obtained from memory module and is listed in this
Audio-frequency information in temporal information, and according to each microphone fix position relationship and each microphone in the temporal information
The parameter information of middle correspondence audio-frequency information determines the particular location residing for impact sound.Further, parameter information is each microphone
The peak strength and each microphone of correspondence audio-frequency information are in the peak strength corresponding time difference in the temporal information.Namely
After impact sound identification module judges anomalous event occur in road, impact sound locating module can quickly determine anomalous event
Position such that it is able to which work is dredged in timely and effective development rescue.
In a preferred embodiment, controller for road lamp is also connected with Rotatable camera device, rotatable shooting dress
Put and be arranged on light pole and rotated according to the control instruction of controller for road lamp;In impact sound identification module judges road
When there is anomalous event, controller for road lamp control Rotatable camera device is rotated to specific position determined by impact sound locating module
Put.Using above-mentioned technical proposal, Rotatable camera device can more accurately collection site video, so as to pass through audio frequency and video knot
Close, reduce the blind area of monitoring.
In the prior art, roadway lighting system is used to carry out Based Intelligent Control to each road lamp, so as to provide one comfortably
Road lighting environment.Sound acquisition module can be integrated in road lamp, it is possible to reduce the quantities of system wiring.More preferably
Ground, microphone array can be arranged in the lamp surface of road lamp, so that when existing road is transformed, without rewiring.
In a preferred embodiment, also including traffic flow detecting device, traffic flow detecting device is used to gather wagon flow
Amount information is simultaneously sent to remote monitoring center or controller for road lamp.
In a preferred embodiment, the impact sound identification module uses the artificial intelligence of build-in depths neutral net
Chip is realized.Referring to Figure 11, the Organization Chart of the intelligent sound Processing with Neural Network chip CI1006 of present invention use is shown, be
Artificial intelligent voice identification chip based on ASIC frameworks, contains deep neural network treatment hardware cell, perfect can prop up
DNN computing frameworks are held, high performance data parallel is carried out, artificial intelligence deep learning voice technology pair is can greatly improve
The treatment effeciency of mass data;CI1006 can reduce dependence of the product for network using local Neural Network Data treatment,
Lifting intelligent sound identification response and control speed.The chip since phonetic entry, speech detection, speech feature extraction and
DNN computings are designed using hardware structure completely, and software is substantially carried out tone decoding and voice broadcast, compared to AP chip softwares
DNN schemes, with advantages such as operational performance and low cost, low-power consumption, small sizes higher.The chip can support local voice
Detection, wake-up, and the hundreds of identifications of offline order entry.Directly can also be replaced by the general controls interface of this chip
The original control MCU of equipment, realizes the voice-intelligent of equipment.The chip also has abundant Peripheral Interface, can pass through
The external WIFI chips of the interfaces such as SPI, UART are connected to high in the clouds, after local wake-up, can be connected by high in the clouds and realize that nature is man-machine
Interaction, or dock the types of applications service in high in the clouds.Off-network state is such as in, then automatically switches to local offline order word identification
Function.
The explanation of above example is only intended to help and understands the method for the present invention and its core concept.It should be pointed out that right
For those skilled in the art, under the premise without departing from the principles of the invention, the present invention can also be carried out
Some improvement and modification, these are improved and modification is also fallen into the protection domain of the claims in the present invention.
The foregoing description of the disclosed embodiments, enables professional and technical personnel in the field to realize or uses the present invention.
Various modifications to these embodiments will be apparent for those skilled in the art, as defined herein
General Principle can be realized in other embodiments without departing from the spirit or scope of the present invention.Therefore, the present invention
The embodiments shown herein is not intended to be limited to, and is to fit to and principles disclosed herein and features of novelty phase one
The scope most wide for causing.
Claims (10)
1. it is a kind of it is built-in collision sound detection function intelligent road-lamp, it is characterised in that in street lamp set sound acquisition module, touch
Hit sound identification module, memory module and for Street lamps control and the controller for road lamp of information transfer;
The voice signal that the sound acquisition module is used in continuous acquisition road;
The memory module is used to store the voice signal that the sound acquisition module is gathered;
Whether the impact sound identification module is used to recognizing in gathered voice signal miscellaneous sends out containing impact sound and by recognition result
Controller for road lamp is sent to, whether the controller for road lamp judges anomalous event occur in road and will send information to long-range with this
Surveillance center;
The impact sound identification module include the extraction module of feature first, first normalization module, neural network classification module and
Neural metwork training module, wherein,
The fisrt feature extraction module is used to receive acquired original voice data, and carries out feature to acquired original voice data
Extract;
The first normalization module is used to carry out the data after feature extraction Gaussian normalization processing, output normalization number
According to;
The neural network classification module is used to receive normalization data and the good deep neural network of training in advance, and by depth
Degree neutral net carries out Classification and Identification to the normalization data and obtains sorting result information, and the sorting result information is original
Whether the collection miscellaneous probable value containing impact sound of voice data, when the probable value exceedes threshold value set in advance, then judges road
In there is anomalous event;
The neural metwork training module is used to receive training data and the training of neutral net is carried out according to training data, obtains
Abundant learning tape is made an uproar the deep neural network of the Nonlinear Mapping relation between sample and clean sample, and exports the depth god
Through network to the neural network classification module;
The neural metwork training module includes second feature extraction module, the second normalization module, unsupervised learning pre-training
Module and supervised learning optimization module, wherein, the unsupervised learning pre-training module is used to find input data mid-deep strata
Abstract characteristics, using be restricted Boltzmann machine (RBM) model carry out pre-training and by unsupervised learning by the way of successively gradually
Enter learning neural network parameter;The supervised learning optimization module uses backpropagation (back-propagation, BP) algorithm,
The intense adjustment for having supervision is carried out to neural network parameter using labeled data.
2. it is according to claim 1 it is built-in collision sound detection function intelligent road-lamp, it is characterised in that the fisrt feature
Extraction module/second feature extraction module further includes framing module, DFT transform module and log power spectrum processing module,
Wherein, the framing module is used to carry out sub-frame processing to input data;After the DFT transform module is used for sub-frame processing
Data carry out discrete Fourier transform and obtain frequency domain information;The log power spectrum processing module is right for being carried out to frequency domain information
Number power spectrum treatment.
3. it is according to claim 1 and 2 it is built-in collision sound detection function intelligent road-lamp, it is characterised in that the collision
Sound identification module also includes average energy detection module and frame energy comparison module, wherein, the average energy detection module is used
In calculate present frame log power spectrum the average energy value and be sent to the frame energy comparison module;The frame energy comparison mould
Whether block is used to calculate the difference of consecutive frame the average energy value and judges the difference beyond default threshold value, if it exceeds then opening
The neural network classification module.
4. it is according to claim 3 it is built-in collision sound detection function intelligent road-lamp, it is characterised in that the frame energy ratio
Current information of vehicle flowrate is received compared with module, and whether calculates present frame the average energy value beyond default corresponding information of vehicle flowrate institute
Corresponding energy value scope, if it exceeds then opening the neural network classification module.
5. it is according to claim 1 and 2 it is built-in collision sound detection function intelligent road-lamp, it is characterised in that the sound
Acquisition module further includes microphone array, audio processing modules and the control module being made up of multiple microphones, wherein,
It is multiple in the microphone array that microphone is in certain geometrical shape and each microphone has unique ID;
The audio processing modules are used to synchronously obtain and identify the voice signal of each microphone collection and the sound are believed
Audio-frequency information is exported after number being processed;
The control module is connected with the audio processing modules, for controlling the work of the audio processing modules and by institute
The audio-frequency information after audio processing modules treatment is stated to store in the memory module.
6. it is according to claim 5 it is built-in collision sound detection function intelligent road-lamp, it is characterised in that also including impact sound
Locating module, the impact sound locating module is connected with the controller for road lamp, for obtain impact sound particular location simultaneously
Positional information is sent to the controller for road lamp;
When the impact sound identification module judges anomalous event occur, the impact sound locating module obtains the anomalous event
Temporal information simultaneously obtains the audio-frequency information that corresponding microphone array is listed in the temporal information from the memory module, and according to every
Position relationship and each microphone parameter information of correspondence audio-frequency information in the temporal information that individual microphone is fixed determine
Particular location residing for impact sound.
7. it is according to claim 6 it is built-in collision sound detection function intelligent road-lamp, it is characterised in that the parameter information
It is that each microphone peak strength and each microphone of correspondence audio-frequency information in the temporal information is corresponding in peak strength
Time difference.
8. it is according to claim 6 it is built-in collision sound detection function intelligent road-lamp, it is characterised in that the Street lamps control
Device is also connected with Rotatable camera device, and the Rotatable camera device is arranged on light pole and according to the Street lamps control
The control instruction of device is rotated;
When the impact sound identification module judges anomalous event occur in road, the controller for road lamp control is described rotatable
Camera head is rotated to particular location determined by the impact sound locating module.
9. it is according to claim 5 it is built-in collision sound detection function intelligent road-lamp, it is characterised in that the microphone array
Multiple microphones in row are arranged on lamp surface with certain geometrical shape.
10. it is according to claim 1 and 2 it is built-in collision sound detection function intelligent road-lamp, it is characterised in that the collision
Sound identification module is realized using the artificial intelligence chip of build-in depths neutral net.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710256831.9A CN106907697A (en) | 2017-04-19 | 2017-04-19 | A kind of intelligent road-lamp of built-in collision sound detection function |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710256831.9A CN106907697A (en) | 2017-04-19 | 2017-04-19 | A kind of intelligent road-lamp of built-in collision sound detection function |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106907697A true CN106907697A (en) | 2017-06-30 |
Family
ID=59210119
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710256831.9A Withdrawn CN106907697A (en) | 2017-04-19 | 2017-04-19 | A kind of intelligent road-lamp of built-in collision sound detection function |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106907697A (en) |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108288392A (en) * | 2018-01-23 | 2018-07-17 | 罗秦昕 | Traffic light control system and traffic control system |
CN108764042A (en) * | 2018-04-25 | 2018-11-06 | 深圳市科思创动科技有限公司 | A kind of exception traffic information recognition methods, device and terminal device |
CN109903769A (en) * | 2017-12-08 | 2019-06-18 | Tcl集团股份有限公司 | A kind of method, apparatus and terminal device of terminal device interaction |
CN110276990A (en) * | 2019-07-22 | 2019-09-24 | 福建中宝万通物联网科技有限公司 | A kind of highway corrugated guardrail surface vibration sound wave conduction detection identifying system and its recognition methods |
CN111247815A (en) * | 2017-10-27 | 2020-06-05 | 昕诺飞控股有限公司 | Microphone calibration system |
CN111445926A (en) * | 2020-04-01 | 2020-07-24 | 杭州叙简科技股份有限公司 | Rural road traffic accident warning condition identification method based on sound |
CN112908313A (en) * | 2021-03-08 | 2021-06-04 | 深圳市英特飞电子有限公司 | Smart street lamp voice interaction method and device, computer equipment and storage medium |
CN113112681A (en) * | 2020-01-13 | 2021-07-13 | 阿里健康信息技术有限公司 | Vending equipment, and shipment detection method and device |
CN113506454A (en) * | 2021-05-18 | 2021-10-15 | 江西方兴科技有限公司 | Expressway induction lamp system and method based on audio event detection |
CN113966026A (en) * | 2021-11-02 | 2022-01-21 | 厦门人达科技有限公司 | Intelligent dimming light control method based on acceleration vibration perception vehicle |
CN115223370A (en) * | 2022-08-31 | 2022-10-21 | 四川九通智路科技有限公司 | Traffic accident detection method and system |
-
2017
- 2017-04-19 CN CN201710256831.9A patent/CN106907697A/en not_active Withdrawn
Cited By (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111247815A (en) * | 2017-10-27 | 2020-06-05 | 昕诺飞控股有限公司 | Microphone calibration system |
CN111247815B (en) * | 2017-10-27 | 2022-02-01 | 昕诺飞控股有限公司 | Microphone calibration system |
CN109903769A (en) * | 2017-12-08 | 2019-06-18 | Tcl集团股份有限公司 | A kind of method, apparatus and terminal device of terminal device interaction |
CN108288392A (en) * | 2018-01-23 | 2018-07-17 | 罗秦昕 | Traffic light control system and traffic control system |
CN108764042B (en) * | 2018-04-25 | 2021-05-28 | 深圳市科思创动科技有限公司 | Abnormal road condition information identification method and device and terminal equipment |
CN108764042A (en) * | 2018-04-25 | 2018-11-06 | 深圳市科思创动科技有限公司 | A kind of exception traffic information recognition methods, device and terminal device |
CN110276990A (en) * | 2019-07-22 | 2019-09-24 | 福建中宝万通物联网科技有限公司 | A kind of highway corrugated guardrail surface vibration sound wave conduction detection identifying system and its recognition methods |
CN113112681A (en) * | 2020-01-13 | 2021-07-13 | 阿里健康信息技术有限公司 | Vending equipment, and shipment detection method and device |
CN111445926A (en) * | 2020-04-01 | 2020-07-24 | 杭州叙简科技股份有限公司 | Rural road traffic accident warning condition identification method based on sound |
CN111445926B (en) * | 2020-04-01 | 2023-01-03 | 杭州叙简科技股份有限公司 | Rural road traffic accident warning condition identification method based on sound |
CN112908313A (en) * | 2021-03-08 | 2021-06-04 | 深圳市英特飞电子有限公司 | Smart street lamp voice interaction method and device, computer equipment and storage medium |
CN113506454A (en) * | 2021-05-18 | 2021-10-15 | 江西方兴科技有限公司 | Expressway induction lamp system and method based on audio event detection |
CN113966026A (en) * | 2021-11-02 | 2022-01-21 | 厦门人达科技有限公司 | Intelligent dimming light control method based on acceleration vibration perception vehicle |
CN113966026B (en) * | 2021-11-02 | 2023-11-03 | 厦门人达科技有限公司 | Intelligent dimming lamp control method based on acceleration vibration sensing vehicle |
CN115223370A (en) * | 2022-08-31 | 2022-10-21 | 四川九通智路科技有限公司 | Traffic accident detection method and system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106907697A (en) | A kind of intelligent road-lamp of built-in collision sound detection function | |
CN107086036A (en) | A kind of freeway tunnel method for safety monitoring | |
CN106898346A (en) | A kind of freeway tunnel safety monitoring system | |
CN206931362U (en) | A kind of freeway tunnel safety monitoring system | |
US11908455B2 (en) | Speech separation model training method and apparatus, storage medium and computer device | |
WO2020220439A1 (en) | Highway traffic flow state recognition method based on deep neural network | |
CN103839373B (en) | A kind of unexpected abnormality event Intelligent Recognition alarm device and warning system | |
CN107194346A (en) | A kind of fatigue drive of car Forecasting Methodology | |
CN112216271B (en) | Audio-visual dual-mode speech recognition method based on convolution block attention mechanism | |
CN110717389B (en) | Driver fatigue detection method based on generation countermeasure and long-short term memory network | |
CN106203283A (en) | Based on Three dimensional convolution deep neural network and the action identification method of deep video | |
CN108269133A (en) | A kind of combination human bioequivalence and the intelligent advertisement push method and terminal of speech recognition | |
CN105469065A (en) | Recurrent neural network-based discrete emotion recognition method | |
CN109473119B (en) | Acoustic target event monitoring method | |
CN110007366A (en) | A kind of life searching method and system based on Multi-sensor Fusion | |
CN111354371B (en) | Method, device, terminal and storage medium for predicting running state of vehicle | |
CN112784730B (en) | Multi-modal emotion recognition method based on time domain convolutional network | |
CN110853656B (en) | Audio tampering identification method based on improved neural network | |
CN103994820B (en) | A kind of moving target recognition methods based on micropore diameter microphone array | |
CN206648005U (en) | A kind of intelligent road-lamp of built-in impact sound detection function | |
CN111274886A (en) | Deep learning-based pedestrian red light violation analysis method and system | |
Zaheer et al. | A preliminary study on deep-learning based screaming sound detection | |
Smailov et al. | A Novel Deep CNN-RNN Approach for Real-time Impulsive Sound Detection to Detect Dangerous Events | |
CN113989893A (en) | Expression and voice bimodal-based children emotion recognition algorithm | |
CN113674768A (en) | Call-for-help detection method, device, equipment and storage medium based on acoustics |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WW01 | Invention patent application withdrawn after publication | ||
WW01 | Invention patent application withdrawn after publication |
Application publication date: 20170630 |