CN109979441A

CN109979441A - A kind of birds recognition methods based on deep learning

Info

Publication number: CN109979441A
Application number: CN201910264817.2A
Authority: CN
Inventors: 吕坤朋; 孙斌; 赵玉晓
Original assignee: China Jiliang University
Current assignee: China Jiliang University
Priority date: 2019-04-03
Filing date: 2019-04-03
Publication date: 2019-07-05

Abstract

The birds recognition methods based on deep learning that the present invention relates to a kind of, belongs to birdvocalization identification technology field.It mainly comprises the steps that and time frequency analysis is carried out to variety classes chirm first, obtain the time-frequency spectrum of variety classes chirm, the characteristics of image for extracting time-frequency spectrum by convolutional neural networks again, finally passes through classifier, carries out birds Classification and Identification according to feature.This method has the ability of stronger anticrossed jam item, and resolution ratio is higher, the various changeful syllable characteristics of birds is extracted as classification foundation, characteristic parameter representativeness is stronger, weak by Environmental Noise Influence.

Description

A kind of birds recognition methods based on deep learning

Technical field

The birds recognition methods based on deep learning that the present invention relates to a kind of, belongs to birdvocalization identification technology field.

Background technique

The song of birds is its important biological property, identical as other morphological features of birds, due to the difference of evolution Property, the song of birds is also unique between different plant species, so that carrying out birds identification using song is provided with feasibility.

Though birdvocalization identification technology there are many research achievements in recent years, all in all develop relatively slowly, side There are limitations for method.Research is concentrated mainly on characteristic parameter selection, disaggregated model technique study etc., wherein common special Sign parameter has amplitude, frequency, syllable length, sonograph, spectrogram, short-time energy, linear prediction residue error (Linear Predictive Cepstral Coding, LPCC) and mel cepstrum coefficients (Mel-Frequency Cepstrum Coefficient, MFCC) etc., common recognition methods and disaggregated model have dynamic time warping (Dynamic Time Warping, DTW) algorithm, error back propagation algorithm (Error Back Propagation, BP) algorithm, hidden Markov model (Hidden Markov Model, HMM) and gauss hybrid models (Gaussian Mixture Model, GMM) etc..There are The problems such as characteristic parameter representativeness is not strong enough, and larger by Environmental Noise Influence.

Summary of the invention

For the shortcoming of existing method, the present invention provides a kind of birds recognition methods based on deep learning.The party Method has the ability of stronger anticrossed jam item, and resolution ratio is higher, and the various changeful song characteristics of birds are extracted As classification foundation, characteristic parameter representativeness is stronger, and small by Environmental Noise Influence, convolutional network is integrated in software, operates phase To simple, recognition accuracy can also increase with the increase of convolutional neural networks training samples number.

The present invention is realized using following scheme: a kind of birds recognition methods based on deep learning, it is characterised in that including Following steps:

Step 1, the song for acquiring variety classes bird will wherein include the segment composition of effective syllable after voice signal pretreatment Sample database；

After step 2, sample data normalization and preemphasis processing, time-frequency spectrum is obtained by time frequency analysis algorithm；

Step 3, the characteristics of image that time-frequency spectrum is extracted by convolutional neural networks；

Step 4, by classifier, birds classification, identification are carried out according to feature；

The present invention is changing in more violent problem, pretreatment is adopted relative to conventional method in face of song segment duration Carry out noise reduction with to signal, and cut out the various segments with complete pitch period, sing, pipe syllable, will be effective Signal data is normalized and preemphasis, improves treatment effeciency to a certain extent, using adaptive optimal kernel time frequency analysis Method: Adaptive optimal kernel time-frequency representation (AOK), time frequency resolution is high, And the ability with very strong anticrossed jam item, time domain, frequency domain and the energy feature of signal can be accurately showed, volume is passed through Product Neural Network Data data mining duty, can accurately extract the feature of time frequency analysis figure, compiled good after time frequency analysis figure gray processing Convolutional neural networks algorithm extract feature, be input with grayscale image, the type of bird is output, and training neural network obtains most Excellent network returns classifier through Softmax, and so that feature is multiplied propertyization to recognition result influences, and improves recognition accuracy.

Detailed description of the invention

Fig. 1 is the overall flow figure of this method.

Fig. 2 is the convolutional neural networks structural schematic diagram of this method.

Specific embodiment

In conjunction with attached drawing, to the present invention, a kind of birds recognition methods based on deep learning is described further, such as Fig. 1 institute Show, the main foundation including chirping of birds sample database, sample preprocessing, time frequency analysis, time-frequency spectrum gray processing, convolutional neural networks are special Sign is extracted and Softmax returns six parts of classifier, the specific steps are as follows:

Step 1, the song for acquiring variety classes bird, by voice signal noise reduction and cut, will wherein have complete cycle sound The segment of section forms the respective sample database of every kind of birds, and for every kind of birds, randomly selecting for equivalent is part of as training Sample；

Step 2, compiling adaptive optimal accounting method, set relevant parameter, by the normalization of training sample data, preemphasis, pre-add Repeated factor takes 0.9375, then obtains time-frequency spectrum by adaptive optimal kernel time frequency analysis algorithm, image is carried out gray processing Processing obtains gray matrix, to reduce neural network computing amount, adjusts the size of image, is adjusted to 64*64 herein；

Step 3, as shown in Fig. 2, herein use single layer convolutional neural networks, according to experiment, convolutional layer takes 10 convolution kernels, size Sampling matrix size for 7*7, sub-sampling layer is 2*2, and full articulamentum connects characteristic pattern entirely, after training sample time frequency analysis Grayscale image as input, import convolutional neural networks and extract characteristics of image, it is trained to obtain using the type of bird as outputting standard Optimal network；

Step 4 returns classifier by Softmax, carries out birds Classification and Identification according to feature；

Above-described specific embodiment has carried out further in detail the purpose of the present invention, technical scheme and beneficial effects Illustrate, it should be understood that the foregoing is merely a specific embodiment of the invention, the guarantor that is not intended to limit the present invention Range is protected, all within the spirits and principles of the present invention, any modification, equivalent substitution, improvement and etc. done should be included in this Within the protection scope of invention.

Claims

1. a kind of birds recognition methods based on deep learning, which comprises the following steps:

Step 4, by classifier, birds Classification and Identification is carried out according to feature；

A kind of birds recognition methods based on deep learning according to claim (1), which is characterized in that described in step 1 Voice signal pretreatment includes noise reduction and cuts, and the feature of effective syllable has randomness and diversity.

2. a kind of birds recognition methods based on deep learning according to claim (1), which is characterized in that step 2 institute It states time frequency analysis algorithm and one-dimensional clock signal is converted into two-dimentional time-frequency spectrum, and include energy information, frequency division when described Analysis method includes but is not limited to wavelet transformation, adaptive optimal kernel etc..

3. a kind of birds recognition methods based on deep learning according to claim (1), which is characterized in that step 3 institute Convolutional neural networks are stated first using the time-frequency spectrum of gray processing as input, characteristics of image are extracted, using the type of known bird as defeated Standard trains the network out.

4. a kind of birds recognition methods based on deep learning according to claim (1), which is characterized in that step 4 institute Stating classifier is that Softmax returns classifier.