CN114021576A

CN114021576A - Text-based natural language understanding decision method

Info

Publication number: CN114021576A
Application number: CN202111257879.4A
Authority: CN
Inventors: 李晓燕; 刘楚雄
Original assignee: Sichuan Qiruike Technology Co Ltd
Current assignee: Sichuan Qiruike Technology Co Ltd
Priority date: 2021-10-27
Filing date: 2021-10-27
Publication date: 2022-02-08

Abstract

The invention provides a text-based natural language understanding decision method, which comprises the following steps of: establishing semantic cache; acquiring voice information of a user, and performing semantic analysis; and feeding back the optimal semantic analysis result to the user, and writing the semantic analysis result irrelevant to the context into a semantic cache. And obtaining cache semantics according to the received requested text statement, and returning the cache semantics if corresponding cache is matched. And if no matched cache exists, performing semantic analysis. And after the semantic analysis result is obtained, returning the optimal semantic analysis result on one side, and judging the write cache operation on the other side. The semantic meaning is clear, the semantic meaning irrelevant to the context can be written into the cache, so that the analysis result in the cache can be directly obtained and returned when the same semantic request comes next time, and the analysis efficiency is improved.

Description

Text-based natural language understanding decision method

Technical Field

The invention relates to the technical field of self-language understanding, in particular to a text-based natural language understanding decision method.

Background

With the rise of artificial intelligence, natural language understanding is an important direction in the field of artificial intelligence, and the theory and method of human and computer communication through natural language are mainly studied. At present, a smart television is popular when being operated by voice, and users can meet different requirements such as weather checking, video watching, music listening and the like through voice interaction. The process is to recognize the voice data to obtain corresponding text data, understand the natural language according to the text and feed back the understood result. Because Chinese is complex, a single semantic understanding mode cannot meet requirements, so that semantic understanding can be performed in multiple modes, semantic understanding results in different modes may be different, and analysis needs to be performed through a certain strategy if understanding which best meets user requirements is selected from multiple semantic understandings.

Disclosure of Invention

The invention aims to provide a text-based natural language understanding decision method. So as to solve the technical problems existing in the background technology.

In order to achieve the purpose, the invention adopts the following technical scheme:

a text-based natural language understanding decision method, comprising the steps of:

establishing semantic cache;

acquiring voice information of a user, and performing semantic analysis;

and feeding back the optimal semantic analysis result to the user, and writing the semantic analysis result irrelevant to the context into a semantic cache.

In some embodiments, the semantic cache includes a fixed semantic cache and a real-time semantic cache, where the fixed semantic cache is a semantic analysis result write-in cache of some commonly used statements counted in advance; the real-time semantic cache is the semantic analyzed after the user requests in real time, and the semantic analysis which is high in semantic analysis confidence and not influenced by context is stored in the cache.

In some embodiments, the obtaining voice information of a user and performing semantic parsing includes: semantic analysis is carried out by adopting three modes, wherein a, semantic analysis is carried out by adopting a grammar rule; b. performing semantic analysis by adopting an RNN algorithm; c. semantic parsing is carried out by using an Elastic Search (ES) intelligent search, and the confidence of a parsing result is calculated in each mode.

In some embodiments of the present invention, the,

semantic analysis results are screened, and the results have three conditions: a. the semantic analysis of the three modes is the same; b. the semantic analysis of the two modes is the same; c. the semantic analysis of the three modes is different; a and b directly select the result with the same semantic resolution, and c needs to be selected through three dimensions.

In some embodiments, the dimensions are, respectively, the current application of the television, the confidence level, the user profile.

Advantageous effects

Through the text-based natural language understanding decision method provided by the invention, the user experience is improved.

And obtaining cache semantics according to the received requested text statement, and returning the cache semantics if corresponding cache is matched. And if no matched cache exists, performing semantic analysis. And after the semantic analysis result is obtained, returning the optimal semantic analysis result on one side, and judging the write cache operation on the other side. The semantic meaning is clear, the semantic meaning irrelevant to the context can be written into the cache, so that the analysis result in the cache can be directly obtained and returned when the same semantic request comes next time, and the analysis efficiency is improved.

Drawings

FIG. 1 is a flow chart of obtaining semantic parsing.

Fig. 2 is a flow of semantic parsing result selection.

Detailed Description

In order to make the objects, technical solutions and advantages of the present application more apparent, the present application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.

On the contrary, this application is intended to cover any alternatives, modifications, equivalents, and alternatives that may be included within the spirit and scope of the application as defined by the appended claims. Furthermore, in the following detailed description of the present application, certain specific details are set forth in order to provide a better understanding of the present application. It will be apparent to one skilled in the art that the present application may be practiced without these specific details.

The text-based natural language understanding decision method according to the embodiment of the present application will be described in detail below with reference to fig. 1-2. It is to be noted that the following examples are only for explaining the present application and do not constitute a limitation to the present application.

As shown in fig. 1, the text-based natural language understanding decision method includes the following steps:

establishing semantic cache; the semantic cache is divided into fixed semantic cache and real-time semantic cache. The fixed semantic cache is a cache into which semantic parsing results of some commonly used sentences counted in advance are written. The real-time semantic cache is the semantic analyzed after the user requests in real time, and the semantic analysis which has high semantic analysis confidence and is not influenced by the context is stored in the cache, so that the same statement request can be fed back very quickly. And if the requested statement is matched with the statement in the cache, directly outputting the cache result.

Acquiring voice information of a user, and performing semantic analysis; semantic analysis is carried out by adopting three modes, wherein a, semantic analysis is carried out by adopting a grammar rule; b. performing semantic analysis by adopting an RNN algorithm; c. semantic parsing is carried out by using an Elastic Search (ES) intelligent search, and the confidence of a parsing result is calculated in each mode. The present invention does not describe the method of semantic parsing in detail.

Because different modes are adopted for semantic analysis, analysis results may be inconsistent, semantic analysis results need to be screened, and the results have three conditions: a. the semantic analysis of the three modes is the same; b. the semantic analysis of the two modes is the same; c. the semantic analysis of the three modes is different; a and b directly select the result with the same semantic resolution, and c needs to be selected through three dimensions. For the case of c we can choose through three dimensions.

The dimensions are the current application of the television, the confidence level, and the user profile, respectively. The semantic analysis gives the domain to which the given semantic belongs, and if the domain after the semantic analysis is the same as the currently applied domain, the result is closer to the real semantic. If the request of "play you through the ocean" is that you can be both a tv show and a song, then it can be distinguished according to the current application, if the current application is a video application, the field is selected to be the parsing of the video, if the current application is a music application, the field is selected to be the parsing of the music.

If the analyzed fields are the same, the judgment can be carried out according to the confidence level of semantic analysis, and the preference with high confidence level is given. At this time, if the optimum analysis result is not selected, it can be judged by combining the user image. The user can be portrayed according to historical request data of the user, the request intention of the user is analyzed according to usual habit characteristics of the user, and semantic analysis is selected. At this time, if the result of semantic analysis can not be selected, the result is selected according to the order of grammar, algorithm and search, thereby obtaining the final analysis result.

And feeding back the optimal semantic analysis result to the user, and writing the semantic analysis result irrelevant to the context into a semantic cache. Some semantics of the final result after the semantic analysis are clear, and the semantic analysis irrelevant to the context can be written into the cache, so that the analysis result can be directly obtained and returned to the user when the same semantic request comes next time.

As shown in fig. 1, after receiving a requested text statement, cache semantics are obtained, and if there is a corresponding cache match, the cache semantics are returned. And if no matched cache exists, performing semantic analysis.

Semantic analysis is divided into three modes, namely grammar rule analysis, RNN algorithm analysis and elastic search analysis. And the grammar rule analysis is to define grammar rules through a grammar network and analyze the matching degree of the request text and the grammar rules to carry out semantic analysis. The RNN algorithm analysis is semantic analysis by using a recurrent neural network algorithm. The elastic search parsing is semantic parsing through an elastic search (es) algorithm. The present invention does not describe these three semantic parsing methods in detail. The semantic analysis of the three modes is carried out simultaneously.

After the semantic parsing of the three modes is finished, the parsing result which best accords with the request text is selected. As a result, there are three cases: a. the semantic analysis of the three modes is the same; b. the semantic analysis of the two modes is the same; c. the semantic parsing of the three modes is different. a and b directly select the result with the same semantic resolution, and the case of c needs to be selected through three dimensions. The three dimensions are the current application of the television, the confidence level, and the user profile, respectively. The semantic analysis in the three modes can provide the domain, intention, related attribute, attribute value and confidence degree to which the semantic belongs.

As shown in fig. 2, the flow of selecting the semantic parsing result is as follows:

firstly, judging whether the analyzed field is the same as the field currently applied by the television. If the request of 'playing the Drift sea to see you' is the request, the Drift sea to see you are both TV shows and songs in our database, the method can be distinguished according to the current application, if the current application is a video application, the selected field is the analysis result of the video, and if the current application is a music application, the selected field is the analysis result of the music. And judging the value of the confidence coefficient if the analysis result cannot be distinguished by the current application. And selecting the analysis result with the highest confidence coefficient.

If the analysis result cannot be distinguished by the confidence, the user image is combined for judgment. The user portrait is used for analyzing the usual habit characteristics of the user according to the historical request data of the user and portraying the user. If "play floating ocean and see you", the current television application is neither a video application nor a music application, and the confidence degrees of the two analysis modes are the same, then see the user portrait. If the user's image is favorite music, the analysis result of the music field is selected.

And if the best analysis result cannot be distinguished through three-dimensional analysis, selecting the optimal analysis result according to the grammar rule, the RNN algorithm and the elastic search sequence, and obtaining the final analysis result. Since the accuracy of semantic parsing is, from the top to the bottom, grammar rules, RNN algorithms, elastic search searches, according to a large number of test data analyses.

And after the semantic analysis result is obtained, returning the optimal semantic analysis result on one side, and judging the write cache operation on the other side. The semantic meaning is clear, the semantic meaning irrelevant to the context can be written into the cache, so that the analysis result in the cache can be directly obtained and returned when the same semantic request comes next time, and the analysis efficiency is improved.

The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents and improvements made within the spirit and principle of the present invention are intended to be included within the scope of the present invention.

Claims

1. A text-based natural language understanding decision method is characterized by comprising the following steps:

establishing semantic cache;

acquiring voice information of a user, and performing semantic analysis;

2. The text-based natural language understanding decision method according to claim 1, wherein the semantic cache comprises a fixed semantic cache and a real-time semantic cache, wherein the fixed semantic cache is a semantic parsing result write cache of some commonly used sentences counted in advance; the real-time semantic cache is the semantic analyzed after the user requests in real time, and the semantic analysis which is high in semantic analysis confidence and not influenced by context is stored in the cache.

3. The text-based natural language understanding decision method according to claim 1, wherein the obtaining of the speech information of the user and the semantic parsing comprise: semantic analysis is carried out by adopting three modes, wherein a, semantic analysis is carried out by adopting a grammar rule; b. performing semantic analysis by adopting an RNN algorithm; c. semantic parsing is carried out by using an Elastic Search (ES) intelligent search, and the confidence of a parsing result is calculated in each mode.

4. The text-based natural language understanding decision method according to claim 3, wherein semantic parsing results need to be filtered, and the results have three conditions: a. the semantic analysis of the three modes is the same; b. the semantic analysis of the two modes is the same; c. the semantic analysis of the three modes is different; a and b directly select the result with the same semantic resolution, and c needs to be selected through three dimensions.

5. The method of claim 4, wherein the dimensions are current application, confidence level, and user profile of the TV.