CN109994102A - Intelligent outbound call system based on emotion recognition - Google Patents
Intelligent outbound call system based on emotion recognition
- Publication number
- CN109994102A CN109994102A CN201910303368.8A CN201910303368A CN109994102A CN 109994102 A CN109994102 A CN 109994102A CN 201910303368 A CN201910303368 A CN 201910303368A CN 109994102 A CN109994102 A CN 109994102A
- Authority
- CN
- China
- Prior art keywords
- module
- signal
- connect
- voice
- paging system
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
- G10L15/26—Speech to text systems
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/63—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for estimating an emotional state
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/50—Centralised arrangements for answering calls; Centralised arrangements for recording messages for absent or busy subscribers; Centralised arrangements for recording messages
- H04M3/51—Centralised call answering arrangements requiring operator intervention, e.g. call or contact centers for telemarketing
- H04M3/5166—Centralised call answering arrangements requiring operator intervention, e.g. call or contact centers for telemarketing, in combination with interactive voice response systems or voice portals, e.g. as front-ends
Abstract
The invention discloses an intelligent outbound call system based on emotion recognition, comprising a voice communication module, a voice acquisition module, an audio dimension analysis module, a text transcription module, a scene model generation module, a prompt generation module, a text semantic analysis module, a database, a state comparison module, a real-time alert module, an agent video recording module, a user video recording module and a display screen. The voice communication module is connected to the voice acquisition module by a signal connection; the voice acquisition module is connected to the audio dimension analysis module; the audio dimension analysis module and the text semantic analysis module are connected to the text transcription module; and the agent video recording module and the user video recording module are connected to the text semantic analysis module. The system adds voice- and text-based artificial intelligence analysis to ordinary agent outbound calls and uses it to supervise and guide the emotions of both parties, so that the whole call is more standardized and more humanized, improving the user experience.
Description
Technical field
The present invention relates to the field of speech emotion processing, and specifically to an intelligent outbound call system based on emotion recognition.
Background art
In pattern recognition, researchers around the world have applied almost every available technique to speech emotion processing, and new methods and comparisons continue to emerge: neural network classifiers, Bayes classifiers, K-nearest-neighbor classifiers, SVM, GMM and HMM classifiers have all been used. Although a great deal of research on speech emotion recognition has been carried out, the field of speech emotion information processing as a whole remains at a rather low level. First, the effective features that can be extracted are limited: almost all researchers use prosodic features, or combinations or derivatives of these features, as analysis parameters. Second, as to the means of pattern recognition, although many different methods have been applied, the data used by different research projects differ, which makes comparison between these studies nearly impossible. The research objects in the literature vary widely and so do the results; recognition rates alone range from 53% to 90%, yet a method with a high recognition rate cannot simply be declared better than one with a lower rate, since the figures are not comparable.
In summary, speech emotion recognition is still at an exploratory research stage, and many problems and difficulties remain to be solved. Where speech emotion technology is currently applied to voice information inquiry systems, the correct recognition rate of emotion is generally low, and a breakthrough in this field will require the joint efforts of all researchers.
Summary of the invention
The purpose of the present invention is to provide an intelligent outbound call system based on emotion recognition, so as to solve the problems raised in the background art above.
In order to solve the above technical problems, the present invention provides the following technical solution: an intelligent outbound call system based on emotion recognition, comprising a voice communication module, a voice acquisition module, an audio dimension analysis module, a text transcription module, a scene model generation module, a prompt generation module, a text semantic analysis module, a database, a state comparison module, a real-time alert module, an agent video recording module, a user video recording module and a display screen. The voice communication module is connected to the voice acquisition module by a signal connection; the voice acquisition module is connected to the audio dimension analysis module; the audio dimension analysis module and the text semantic analysis module are connected to the text transcription module; the agent video recording module and the user video recording module are connected to the text semantic analysis module; the text transcription module is connected to the scene model generation module; the scene model generation module is connected to the prompt generation module and to the database; the database is connected to the state comparison module; the state comparison module is connected to the prompt generation module; and the prompt generation module is connected to the display screen.
According to the above technical solution, the state comparison module is connected to the real-time alert module by a signal connection.
According to the above technical solution, the agent video recording module and the user video recording module are connected to the database by signal connections.
According to the above technical solution, the database and the scene model generation module are bidirectionally connected.
According to the above technical solution, the audio dimension analysis module comprises a speech-rate speech-signal feature analysis unit, an amplitude speech-signal feature analysis unit and a fundamental-frequency speech-signal feature analysis unit.
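As a rough illustration of the three analysis units named above, the sketch below computes one feature per dimension from a single audio frame: RMS energy for amplitude, zero-crossing rate as a crude speech-rate proxy, and an autocorrelation estimate of fundamental frequency. The specific estimators are our own assumptions for illustration; the patent does not disclose its feature algorithms.

```python
import numpy as np

def frame_features(frame, sr=8000):
    """One illustrative feature per analysis unit: RMS energy (amplitude),
    zero-crossing rate (a crude speech-rate proxy), and an autocorrelation
    estimate of fundamental frequency in a 60-400 Hz search band.
    Assumes len(frame) > sr // 60 so the pitch-lag search stays in range."""
    rms = float(np.sqrt(np.mean(frame ** 2)))              # amplitude
    zcr = float(np.mean(np.abs(np.diff(np.sign(frame)))) / 2)  # rate proxy
    ac = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    lo, hi = sr // 400, sr // 60                           # candidate pitch lags
    lag = lo + int(np.argmax(ac[lo:hi]))
    f0 = sr / lag                                          # fundamental frequency
    return rms, zcr, f0
```

In a full system, these per-frame values would be aggregated over an utterance before being passed on for emotion classification.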
According to the above technical solution, the audio dimension analysis module is based on a Parzen probabilistic neural network.
According to the above technical solution, the state comparison module comprises a historical-baseline comparison unit and an average-reference-value comparison unit.
Compared with the prior art, the beneficial effects of the present invention are as follows. This intelligent outbound call system based on emotion recognition applies text-dependent, speaker-independent speech emotion recognition to a voice information inquiry system. It uses Bayes minimum-error-rate decision theory to determine an optimal threshold and proposes a new speech-signal endpoint detection algorithm. It studies three classes of speech-signal features, namely speech rate, amplitude and fundamental frequency, analyzes the effectiveness of these features for emotion classification using fuzzy entropy theory, and then selects an optimal combination of feature parameters for speech emotion recognition. It studies classifiers suitable for speech emotion recognition and uses a Parzen probabilistic neural network to recognize the speech emotional state, substantially improving the overall recognition rate of the system.
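A Parzen probabilistic neural network of the kind referred to above centres one Gaussian kernel on every training pattern and scores each emotion class by its average kernel response. The sketch below is a minimal, hypothetical illustration; the `sigma` smoothing parameter and the toy labels are ours, not values disclosed in the patent.

```python
import numpy as np

def pnn_classify(train_X, train_y, x, sigma=0.5):
    """Parzen probabilistic neural network: a Gaussian kernel is centred on
    every training pattern, each class is scored by its mean kernel response,
    and the class with the largest score is returned."""
    scores = {}
    for label in np.unique(train_y):
        patterns = train_X[train_y == label]
        d2 = np.sum((patterns - x) ** 2, axis=1)       # squared distances to x
        scores[label] = float(np.mean(np.exp(-d2 / (2.0 * sigma ** 2))))
    return max(scores, key=scores.get)
```

In this system, `train_X` would hold aggregated speech-rate/amplitude/fundamental-frequency feature vectors and `train_y` the emotion labels; `sigma` controls how much the kernel density estimate is smoothed.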
Description of the drawings
The accompanying drawings are provided for further understanding of the present invention and constitute a part of the specification. Together with the embodiments of the invention, they serve to explain the invention and are not to be construed as limiting it. In the accompanying drawings:
Fig. 1 is a system flow chart of the invention.
Detailed description of the embodiments
The technical solutions in the embodiments of the present invention will be described clearly and completely below in combination with the drawings. Obviously, the described embodiments are only some, not all, of the embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative effort shall fall within the protection scope of the present invention.
Referring to Fig. 1, the present invention provides a technical solution: an intelligent outbound call system based on emotion recognition, comprising a voice communication module, a voice acquisition module, an audio dimension analysis module, a text transcription module, a scene model generation module, a prompt generation module, a text semantic analysis module, a database, a state comparison module, a real-time alert module, an agent video recording module, a user video recording module and a display screen. The voice communication module is connected to the voice acquisition module by a signal connection, and the voice acquisition module is connected to the audio dimension analysis module. The audio dimension analysis module comprises a speech-rate speech-signal feature analysis unit, an amplitude speech-signal feature analysis unit and a fundamental-frequency speech-signal feature analysis unit, and is based on a Parzen probabilistic neural network. The audio dimension analysis module and the text semantic analysis module are connected to the text transcription module. The agent video recording module and the user video recording module are connected to the text semantic analysis module and to the database. The text transcription module is connected to the scene model generation module; the scene model generation module is connected to the prompt generation module and bidirectionally connected to the database; the database is connected to the state comparison module; the state comparison module, which comprises a historical-baseline comparison unit and an average-reference-value comparison unit, is connected to the real-time alert module and to the prompt generation module; and the prompt generation module is connected to the display screen.
The user and the agent communicate normally by voice through the voice communication module. Meanwhile, the voice acquisition module obtains the audio data streams of the user and the agent; the audio dimension analysis module performs audio dimension analysis on the voice, and the text transcription module transcribes the voice into text for text semantic analysis. The text semantic analysis module combines the user and agent portraits provided by the agent video recording module and the user video recording module with the above analysis results, and the scene model generation module generates a model of the current scene. According to the model result, the prompt generation module and the display screen prompt the agent with the agent's own emotion, the user's emotion and suggestions. Meanwhile, through real-time analysis of the agent's calls, the intonation and speech rate of the agent's voice are compared against the agent's historical baseline values, and the real-time alert module gives real-time alerts on abnormal emotional behavior of the agent. In addition, big-data analysis across a large number of agents yields the relevant voice data of the agents with the best marketing effectiveness, and this standard is used to supervise and guide the marketing of the other agents.
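The baseline comparison just described might be sketched as follows: a live intonation or speech-rate measurement is compared against the agent's own history and against a team average, and an alert flag is raised when the live value drifts beyond `k` standard deviations from the historical mean. All names and the threshold rule are illustrative assumptions; the patent does not specify a comparison formula.

```python
import numpy as np

def check_alert(live_value, history, team_mean, k=2.0):
    """Compare one live measurement (e.g. speech rate or pitch) with the
    agent's historical baseline; flag an alert when it drifts more than
    `k` standard deviations from the historical mean. The rule and the
    default k=2.0 are illustrative assumptions, not the patent's."""
    mu, sd = float(np.mean(history)), float(np.std(history))
    drift = abs(live_value - mu)
    return {
        "baseline_dev": drift,                    # deviation from own history
        "team_dev": abs(live_value - team_mean),  # deviation from team average
        "alert": sd > 0 and drift > k * sd,
    }
```

A real-time alert module could call this once per analysis window and forward any `alert=True` result to the agent's supervisor.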
It should be noted that, in this document, relational terms such as "first" and "second" are used merely to distinguish one entity or operation from another and do not necessarily require or imply any actual relationship or order between those entities or operations. Moreover, the terms "include", "comprise" or any other variant thereof are intended to cover non-exclusive inclusion, so that a process, method, article or device comprising a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to such a process, method, article or device.
Finally, it should be noted that the foregoing is only a preferred embodiment of the present invention and is not intended to limit the invention. Although the present invention has been described in detail with reference to the foregoing embodiments, those skilled in the art may still modify the technical solutions described in the foregoing embodiments or make equivalent replacements of some of the technical features. Any modification, equivalent replacement or improvement made within the spirit and principles of the present invention shall be included within the protection scope of the present invention.
Claims (7)
1. An intelligent outbound call system based on emotion recognition, comprising a voice communication module, a voice acquisition module, an audio dimension analysis module, a text transcription module, a scene model generation module, a prompt generation module, a text semantic analysis module, a database, a state comparison module, a real-time alert module, an agent video recording module, a user video recording module and a display screen, characterized in that: the voice communication module is connected to the voice acquisition module by a signal connection; the voice acquisition module is connected to the audio dimension analysis module by a signal connection; the audio dimension analysis module and the text semantic analysis module are connected to the text transcription module by signal connections; the agent video recording module and the user video recording module are connected to the text semantic analysis module by signal connections; the text transcription module is connected to the scene model generation module by a signal connection; the scene model generation module is connected to the prompt generation module and to the database by signal connections; the database is connected to the state comparison module by a signal connection; the state comparison module is connected to the prompt generation module by a signal connection; and the prompt generation module is connected to the display screen by a signal connection.
2. The intelligent outbound call system based on emotion recognition according to claim 1, characterized in that: the state comparison module is connected to the real-time alert module by a signal connection.
3. The intelligent outbound call system based on emotion recognition according to claim 1, characterized in that: the agent video recording module and the user video recording module are connected to the database by signal connections.
4. The intelligent outbound call system based on emotion recognition according to claim 1, characterized in that: the database and the scene model generation module are bidirectionally connected.
5. The intelligent outbound call system based on emotion recognition according to claim 1, characterized in that: the audio dimension analysis module comprises a speech-rate speech-signal feature analysis unit, an amplitude speech-signal feature analysis unit and a fundamental-frequency speech-signal feature analysis unit.
6. The intelligent outbound call system based on emotion recognition according to claim 1, characterized in that: the audio dimension analysis module is based on a Parzen probabilistic neural network.
7. The intelligent outbound call system based on emotion recognition according to claim 1, characterized in that: the state comparison module comprises a historical-baseline comparison unit and an average-reference-value comparison unit.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title
---|---|---|---
CN201910303368.8A (published as CN109994102A) | 2019-04-16 | 2019-04-16 | Intelligent outbound call system based on emotion recognition
Publications (1)
Publication Number | Publication Date
---|---
CN109994102A | 2019-07-09
Family ID: 67133635
Family Applications (1)
Application Number | Title | Priority Date | Filing Date
---|---|---|---
CN201910303368.8A (pending) | Intelligent outbound call system based on emotion recognition | 2019-04-16 | 2019-04-16
Country Status (1)
Country | Link
---|---
CN | CN109994102A (en)
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title
---|---|---|---|---
CN112215927A (granted as CN112215927B, 2023-06-23) | 2020-09-18 | 2021-01-12 | 腾讯科技(深圳)有限公司 | Method, device, equipment and medium for synthesizing face video
CN112651237A (granted as CN112651237B, 2024-03-19) | 2019-10-11 | 2021-04-13 | 武汉渔见晚科技有限责任公司 | User portrait establishing method and device based on user emotion standpoint and user portrait visualization method
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title
---|---|---|---|---
US9299343B1 | 2014-03-31 | 2016-03-29 | Noble Systems Corporation | Contact center speech analytics system having multiple speech analytics engines
CN105700682A | 2016-01-08 | 2016-06-22 | 北京乐驾科技有限公司 | Intelligent gender and emotion recognition detection system and method based on vision and voice
CN107256392A | 2017-06-05 | 2017-10-17 | 南京邮电大学 | Comprehensive emotion recognition method combining image and voice
CN107480270A | 2017-08-18 | 2017-12-15 | 北京点易通科技有限公司 | Real-time personalized recommendation method and system based on user feedback data streams
CN108174046A | 2017-11-10 | 2018-06-15 | 大连金慧融智科技股份有限公司 | Personnel monitoring system and method for a call center
CN108764010A | 2018-03-23 | 2018-11-06 | 姜涵予 | Emotional state determination method and device
Worldwide Applications (1)
- 2019-04-16: CN application CN201910303368.8A (published as CN109994102A), status: Pending
Similar Documents
- Zhou et al.: Modality attention for end-to-end audio-visual speech recognition
- CN103700370B: Radio and television speech recognition method and system
- CN105700682A: Intelligent gender and emotion recognition detection system and method based on vision and voice
- US20040122675A1: Visual feature extraction procedure useful for audiovisual continuous speech recognition
- CN105446146A: Intelligent terminal control method, system and intelligent terminal based on semantic analysis
- CN109994102A: Intelligent outbound call system based on emotion recognition
- Ntalampiras et al.: Acoustic detection of human activities in natural environments
- Dov et al.: Kernel-based sensor fusion with application to audio-visual voice activity detection
- JP5302505B2: Dialog status separation estimation method, dialog status estimation method, dialog status estimation system, and dialog status estimation program
- CN112165599A: Automatic conference summary generation method for video conferences
- US8954327B2: Voice data analyzing device, voice data analyzing method, and voice data analyzing program
- Karanasou et al.: Speaker diarisation and longitudinal linking in multi-genre broadcast data
- US11194303B2: Method and system for anomaly detection and notification through profiled context
- US8335332B2: Fully learning classification system and method for hearing aids
- KR100308028B1: Method and apparatus for adaptive speech detection and computer-readable medium using the method
- CN109192197A: Internet-based big data speech recognition system
- CN113436618A: Signal accuracy adjusting system for voice instruction capture
- Ferras et al.: System fusion and speaker linking for longitudinal diarization of TV shows
- Krishnakumar et al.: A comparison of boosted deep neural networks for voice activity detection
- Imoto et al.: Acoustic scene classification based on generative model of acoustic spatial words for distributed microphone array
- Chen et al.: VB-HMM speaker diarization with enhanced and refined segment representation
- Zhang et al.: A novel speaker clustering algorithm via supervised affinity propagation
- US20130295973A1: Method and apparatus for managing interruptions from different modes of communication
- KR20050058161A: Speech recognition method and device by integrating audio, visual and contextual features based on neural networks
- Han et al.: Robust speaker clustering strategies to data source variation for improved speaker diarization
Legal Events
Date | Code | Title | Description
---|---|---|---
| PB01 | Publication |
| SE01 | Entry into force of request for substantive examination |
| RJ01 | Rejection of invention patent application after publication | Application publication date: 20190709