CN114373482A

CN114373482A - Method and system for recognizing animal emotion through voice based on convolutional neural network

Info

Publication number: CN114373482A
Application number: CN202111582306.9A
Authority: CN
Inventors: 杨兴海; 漆国强; 杨兴荣; 李建州; 李建新
Original assignee: Shijihengtong Technology Co ltd
Current assignee: Shijihengtong Technology Co ltd
Priority date: 2021-12-22
Filing date: 2021-12-22
Publication date: 2022-04-19

Abstract

The invention discloses a method for recognizing animal emotion through voice based on a convolutional neural network, which comprises the following steps: (1) collecting animal sound and converting the animal sound into a digital signal; (2) processing and compressing the digital signal and transmitting the digital signal to a sound characteristic identification module; (3) the voice feature recognition module extracts voice features by using a trained convolutional neural network model, converts the voice features and outputs the converted voice features to the emotion category recognition module; the trained convolutional neural network model is obtained by inputting the animal sound marked with sound characteristics into a convolutional neural network algorithm for training; (4) the emotion classification recognition module obtains emotion classifications corresponding to animal sounds by applying an SVM model; the method realizes the effect of recognizing the emotion of the animal according to the voice of the animal, so that the human can better understand the animal.

Description

Method and system for recognizing animal emotion through voice based on convolutional neural network

Technical Field

The invention relates to the field of voice recognition, in particular to a method for recognizing animal emotion through voice based on a convolutional neural network, and further relates to a system for recognizing animal emotion through voice based on the convolutional neural network.

Background

With the development of animal behavioral research, how to recognize the emotion of animals and apply the emotion to production and life becomes a future direction, for example, recognizing the emotion of a guide dog makes blind people more aware of the behavior of the guide dog, and recognizing the emotion of a police dog makes workers better understand the intention of the blind people. Animal emotion can often be expressed from the sound emitted by the animal emotion recognition device, but the animal emotion recognition device usually needs a plurality of experienced persons to accurately judge and recognize the animal emotion, and in a special case, when the animal emotion recognition person is inexperienced, task failure can be caused, and even serious consequences can be caused.

Disclosure of Invention

In view of the above, one of the objects of the present invention is to provide a method for recognizing emotion of an animal through voice based on a convolutional neural network, which can accurately recognize emotion of the animal according to voice of the animal without professional personnel, so that humans can better understand the animal.

One of the purposes of the invention is realized by the following technical scheme:

the method for recognizing the animal emotion through the voice based on the convolutional neural network comprises the following steps:

(1) collecting animal sound and converting the animal sound into a digital signal;

(2) processing and compressing the digital signal and transmitting the digital signal to a sound characteristic identification module;

(3) the voice feature recognition module extracts voice features by using a trained convolutional neural network model, converts the voice features and outputs the converted voice features to the emotion category recognition module; the trained convolutional neural network model is obtained by inputting the animal voice marked with the voice characteristics into a convolutional neural network algorithm for training;

(4) and the emotion category recognition module obtains the emotion category corresponding to the animal voice by using the SVM model.

Further, in the step (1), a microphone is used for collecting animal sound and converting the animal sound into a digital signal.

Further, the digital signal is processed in the step (2) to establish a data matrix that can be identified by the trained convolutional neural network model.

Further, the emotion categories in the step (4) are sad, happy and neutral.

The second purpose of the invention is realized by the following technical scheme:

the system for recognizing the animal emotion through voice based on the convolutional neural network comprises a voice input and converter, a data preprocessor, a voice feature recognition module, an emotion category recognition module and an animal emotion output module;

the voice recording and converting device is used for collecting animal voice and converting the animal voice into digital signals;

the data preprocessor is used for processing and compressing the digital signal and then transmitting the digital signal to the sound characteristic identification module;

the voice feature recognition module extracts voice features by using a trained convolutional neural network model, converts the voice features and outputs the converted voice features to the emotion category recognition module; the trained convolutional neural network model is obtained by inputting animal sounds with marked sound features into a convolutional neural network algorithm for training;

the emotion type recognition module obtains emotion types corresponding to animal sounds by using an SVM model;

the animal emotion output module is used for outputting animal emotion.

The invention has the beneficial effects that:

the method for recognizing the animal emotion through the voice based on the convolutional neural network comprises the steps of firstly extracting animal voice characteristics through a trained convolutional neural network model, then inputting the animal voice characteristics into an SVM model to obtain emotion categories corresponding to animal voices, achieving the effect of recognizing the emotion of the animals according to the animal voices, and enabling human beings to understand the animals better.

Additional advantages, objects, and features of the invention will be set forth in part in the description which follows and in part will become apparent to those having ordinary skill in the art upon examination of the following or may be learned from practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof.

Drawings

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be further described in detail with reference to the accompanying drawings, in which:

FIG. 1 is a flow chart of a method for recognizing animal emotion through voice based on a convolutional neural network.

FIG. 2 is a convolutional neural network model in the method for recognizing animal emotion through voice based on convolutional neural network.

FIG. 3 shows the classification of the emotion of the SVM model in the method for recognizing the emotion of an animal by voice based on the convolutional neural network.

Detailed Description

Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings. It should be understood that the preferred embodiments are illustrative of the invention only and are not limiting upon the scope of the invention.

As shown in fig. 1-3, the method for recognizing animal emotion through voice based on convolutional neural network comprises the following steps:

(1) collecting animal sounds by a microphone and converting the animal sounds into digital signals;

(2) processing and compressing the digital signal and transmitting the digital signal to a sound characteristic identification module; processing the digital signal to establish a data matrix which can be identified by a trained convolutional neural network model;

(3) the voice feature recognition module extracts voice features by using a trained convolutional neural network model, converts the voice features and outputs the converted voice features to the emotion category recognition module; the trained convolutional neural network model is used for extracting sound characteristics from keyword time points of 3 angles of a time domain, a frequency domain and a cepstrum domain of sound; the trained convolutional neural network model is obtained by inputting the animal sound marked with sound characteristics into a convolutional neural network algorithm for training;

the convolutional neural network algorithm is as follows:

wherein L (x, z) is:

L(x,z)＝-lnp(z|x)

setting:

(4) the emotion classification recognition module obtains emotion classifications corresponding to animal sounds by applying an SVM model; the mood categories are sad, happy and neutral.

The SVM model algorithm is as follows:

wherein w is a target function and α is a langerhan multiplier;

the method for recognizing the animal emotion through the voice based on the convolutional neural network has two using modes, wherein the first mode can be offline animal emotion calculation, and corresponding emotion classification results are directly output after corresponding emotion classes are recognized by a voice feature recognition module and an emotion class recognition module; and secondly, performing real-time online emotion category calculation, namely inputting voice into a voice characteristic recognition module according to fixed time frequency, and obtaining an emotion result of an animal timeline through the emotion category recognition module, so that the intention of the animal can be well understood.

the voice feature recognition module extracts voice features by using a trained convolutional neural network model, converts the voice features and outputs the converted voice features to the emotion category recognition module; the trained convolutional neural network model is obtained by inputting animal sound with a marked sound characteristic into a convolutional neural network algorithm for training;

the emotion classification recognition module obtains emotion classifications corresponding to animal sounds by applying an SVM model;

animal emotion output module for outputting animal emotion

Finally, the above embodiments are only for illustrating the technical solutions of the present invention and not for limiting, although the present invention is described in detail with reference to the preferred embodiments, it should be understood by those skilled in the art that modifications or equivalent substitutions can be made on the technical solutions of the present invention without departing from the spirit and scope of the technical solutions, and all that should be covered by the claims of the present invention.

Claims

1. A method for recognizing animal emotion through voice based on a convolutional neural network is characterized in that: the method comprises the following steps:

(3) the voice feature recognition module extracts voice features by using a trained convolutional neural network model, converts the voice features and outputs the converted voice features to the emotion category recognition module; the trained convolutional neural network model is obtained by inputting the animal sound marked with sound characteristics into a convolutional neural network algorithm for training;

2. The method for recognizing emotion of animal by voice based on convolutional neural network as claimed in claim 1, wherein: and (2) adopting a microphone to collect animal sound and converting the animal sound into a digital signal in the step (1).

3. The method for recognizing emotion of animal by voice based on convolutional neural network as claimed in claim 1 or 2, wherein: and (3) processing the digital signals in the step (2) to establish a data matrix which can be identified by the trained convolutional neural network model.

4. The method for recognizing emotion of animal by voice based on convolutional neural network as claimed in claim 1 or 2 or 3, wherein: the emotion categories in the step (4) are sad, happy and neutral.

5. A system for implementing the method for recognizing emotion of animal by voice based on convolutional neural network as claimed in claim 1, wherein: the system comprises a voice input and converter, a data preprocessor, a voice feature recognition module, an emotion category recognition module and an animal emotion output module;

the animal emotion output module is used for outputting animal emotion.