CN117217235A

CN117217235A - Intent recognition method and system based on large language model

Info

Publication number: CN117217235A
Application number: CN202311389237.9A
Authority: CN
Inventors: 翁鹤鸣; 周豪; 张瀚亓; 邓颖茹; 陈威
Original assignee: Shenzhen Turing Collaborative Technology Co ltd
Current assignee: Shenzhen Turing Collaborative Technology Co ltd
Priority date: 2023-10-24
Filing date: 2023-10-24
Publication date: 2023-12-12

Abstract

The invention relates to the technical field of electronic information processing, in particular to an intention recognition method and system based on a large language model. The identification method comprises the following steps: the method comprises the steps of obtaining and identifying the type of input information, analyzing the input information, and generating first text information; keyword extraction processing is carried out on the first text information through the large language model, and keyword information is obtained; the historical record information is called, and the large language model is utilized to combine and analyze the keyword information and the historical record information so as to generate operation content; and processing operation content, and generating and pushing response options. The identification method and the system provided by the invention realize more accurate intention identification, and the AI assistant can more accurately understand the requirements of the user by analyzing the specific input content of the user; more personalized response options the AI assistant can provide personalized response options according to the needs of the understanding.

Description

Intent recognition method and system based on large language model

Technical Field

The invention belongs to the technical field of electronic information processing, and particularly relates to an intention recognition method and system based on a large language model.

Background

At present, an AI assistant technology is adopted in many software systems to assist user operation, so that the difficulty of user operation is expected to be reduced, and the user experience is improved.

However, existing AI helper techniques understand the needs of the user primarily through text input entered by the user. The main problem with this approach is that it requires users to express their needs explicitly and completely, which can be difficult for many users. In addition to this, some AI helper technologies are interactive ways that employ preset commands or options, but such ways generally limit the freedom of interaction between the user and the AI helper.

Therefore, in the practical application process, the prior AI assistant technology has too high requirement on the capability of the user or limited AI interactive content. It is difficult to achieve the desired effect and the user experience is generally felt.

Disclosure of Invention

In view of the above problems, the present invention provides an intent recognition method based on a large language model, the recognition method including:

the method comprises the steps of obtaining and identifying the type of input information, analyzing the input information, and generating first text information;

keyword extraction processing is carried out on the first text information through the large language model, and keyword information is obtained;

the historical record information is called, and the large language model is utilized to combine and analyze the keyword information and the historical record information so as to generate operation content;

and generating and pushing response options according to the operation content.

Further, the type of the input information comprises a text, a picture, a file and system information;

the type of the system information comprises a literal text and/or a picture and/or a file.

Further, the analyzing the input information includes:

when the input information is a text, directly outputting the text information as first text information;

when the input information is a picture, recognizing a text in the picture through an OCR text recognition system and outputting the text as first text information;

when the input information is a file, recognizing a text of the text in the file through a document analysis system and outputting the text as first text information.

Further, the history information includes a user history operation record, a time of the corresponding operation record, and a user characteristic.

Further, the generating operation content includes:

extracting all history information related to the keyword information according to the keyword information;

combining the keyword information and the history information, and calculating the response relativity of each history information and the keyword;

and analyzing the operation content related to a plurality of history information with high response relevance.

The invention also provides an intention recognition system based on the large language model, which comprises:

the input module is used for acquiring and identifying the type of the input information, analyzing and processing the input information and generating first text information;

the analysis module is used for extracting keywords from the first text information through the large language model to obtain keyword information; the method comprises the steps of acquiring historical record information, and generating operation content by utilizing a large language model to combine and analyze keyword information and the historical record information;

the database is used for storing the history record information;

and the output module is used for generating and pushing response options according to the operation content.

Further, the input module includes:

a type identification unit for identifying and judging the type of the input information; when the input information is a text, directly outputting the text information as first text information; when the input information is a picture, the picture is sent to a picture identification unit; when the input information is a file, the file is sent to a document analysis unit;

the picture recognition unit is used for recognizing text in the picture through the OCR text recognition system and outputting the text as first text information;

and the document analysis unit is used for identifying the text of the characters in the file through the document analysis system and outputting the text as first text information.

Further, the input module further includes:

the disassembly unit is used for receiving the system information sent by the type identification unit, carrying out disassembly processing on the system information, and directly outputting the text obtained by processing as first text information; and/or sending the picture obtained by processing to a picture identification unit; and/or sending the file obtained by processing to a document parsing unit.

Further, the analysis module includes:

the keyword unit is used for extracting keywords from the first text information through the large language model to obtain keyword information; the method comprises the steps of calling out all history record information related to keyword information according to the keyword information;

the central processing unit is used for combining the keyword information and the history information and calculating the response relativity of each history information and the keyword; the method is used for analyzing the operation content related to a plurality of history information with high response relevance.

The beneficial effects of the invention are as follows:

1. the identification method and the system provided by the invention realize more accurate intention identification, and the AI assistant can more accurately understand the requirements of the user by analyzing the specific input content of the user; more personalized response options the AI assistant can provide personalized response options according to the needs of the understanding.

2. The invention enriches the types of input information, and besides inputting text, a user can also input the whole file in a direct dragging mode. The operation difficulty of the user is reduced, the diversification of the input information is realized, the dimension of recognition analysis is increased, and the accuracy of recognition of the user intention is higher.

Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention may be realized and attained by the structure particularly pointed out in the written description and drawings.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions of the prior art, the following description will briefly explain the drawings used in the embodiments or the description of the prior art, and it is obvious that the drawings in the following description are some embodiments of the present invention, and other drawings can be obtained according to these drawings without inventive effort for a person skilled in the art.

FIG. 1 is a flow chart of an identification method according to an embodiment of the present invention;

fig. 2 shows a schematic structural diagram of an identification system according to an embodiment of the present invention.

Detailed Description

For the purpose of making the objects, technical solutions and advantages of the embodiments of the present invention more apparent, the technical solutions of the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention, and it is apparent that the described embodiments are some embodiments of the present invention, but not all embodiments of the present invention. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.

The embodiment of the invention provides an intention recognition method based on a large language model, as shown in fig. 1, wherein the recognition method comprises the following steps:

and generating and pushing response options according to the operation content.

The identification method provided by the embodiment of the invention realizes more accurate intention identification, and the AI assistant can more accurately understand the requirements of the user by analyzing the specific input content of the user; more personalized response options the AI assistant can provide personalized response options according to the needs of the understanding.

Specifically, the types of the input information include, but are not limited to, text, pictures, files and system information.

The file formats include, but are not limited to, word, PDF, and text documents.

The analyzing processing of the input information comprises the following steps:

when the input information is a text, the input information is directly output as the first text information.

When the input information is a picture, recognizing a text in the picture through an OCR text recognition system and outputting the text as first text information.

It should be noted that the system information includes, but is not limited to, contact information and chat records. When the input information is system information, it is also necessary to identify the type of system information content, which includes text and/or pictures and/or files.

The invention enriches the types of input information, and besides inputting text, a user can also input the whole file in a direct dragging mode. The operation difficulty of the user is reduced, the diversification of the input information is realized, the dimension of recognition analysis is increased, and the accuracy of recognition of the user intention is higher.

Illustratively, the large language model may employ, but is not limited to, a LLaMA (Large Language Model Meta AI) language model or a ChatGLM (Chat General Language Model) model. The large language model is based on a general model, and keyword information in user input information can be identified and extracted through a large amount of corpus training.

In particular, the history information includes, but is not limited to, a user history, a time of the corresponding operation, and a user characteristic.

Illustratively, the user characteristics include, but are not limited to, one or more of the user's gender, age, preferences.

Further, the generating operation content includes:

For example, the keyword information and the history information are respectively converted into the ebedding sentence vector, and the response correlation degree of each history information and the keyword is calculated by using a cosine similarity calculation formula. And sequencing the response relativity of each user history operation record and the keyword information from big to small, and selecting the user history operation records with the top three ranks. The operation content in the three user history operation records is analyzed and read out. And generating corresponding response options by combining the operation content in the user history operation record. The user can select the response option autonomously to enter the next operation experience.

The identification method provided by the embodiment of the invention supports various input information, reduces the use requirement and the operation difficulty of the user, and simultaneously more accurately understands the requirement of the user. And provides personalized response options to improve the user experience.

In order to achieve the above-mentioned intent recognition method based on a large language model, the embodiment of the present invention further provides an intent recognition system based on a large language model, as shown in fig. 2, where the recognition system includes:

the database is used for storing the history record information;

The recognition system provided by the embodiment of the invention realizes more accurate intention recognition, and the AI assistant can more accurately understand the requirements of the user by analyzing the specific input content of the user; more personalized response options the AI assistant can provide personalized response options according to the needs of the understanding.

Specifically, the input module includes:

The recognition system provided by the invention enriches the types of input information, and besides inputting text, a user can also input the whole file in a direct dragging mode. The operation difficulty of the user is reduced, the diversification of the input information is realized, the dimension of recognition analysis is increased, and the accuracy of recognition of the user intention is higher.

Preferably, the input module further comprises:

It should be noted that the system information includes, but is not limited to, contact information and chat records. When the input information is system information, it is also necessary to identify the type of the content of the system information, which includes text and/or pictures and/or files. Since the system information may contain multiple data types, it is necessary to disassemble the system information first.

The identification system provided by the embodiment of the invention supports the identification analysis of the system information, further enriches the types of the input information, and improves the accuracy of identifying the intention of the user.

Illustratively, the large language model may employ, but is not limited to, a LLaMA (Large Language Model Meta AI) language model or a ChatGLM (Chat General Language Model) model. The large language model is based on a general model, and keyword information in user input information can be identified through a large amount of corpus training.

Further, the analysis module includes:

For example, the central processing unit converts the keyword information and the history information into the ebedding sentence vectors, respectively, and calculates the response correlation degree of each history information and the keyword by using a cosine similarity calculation formula. And sequencing the response relativity of each user history operation record and the keyword information from big to small, and selecting the user history operation records with the top three ranks. The operation content in the three user history operation records is analyzed and read out. And combining the historical operation record of the user, adjusting the operation content, and generating corresponding response options. The user can select the response option autonomously to enter the next operation experience.

The identification system provided by the embodiment of the invention supports various input information, reduces the use requirement and the operation difficulty of a user, and simultaneously more accurately understands the requirement of the user. And provides personalized response options to improve the user experience.

Although the invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical scheme described in the foregoing embodiments can be modified or some technical features thereof can be replaced by equivalents; such modifications and substitutions do not depart from the spirit and scope of the technical solutions of the embodiments of the present invention.

Claims

1. An intent recognition method based on a large language model, the recognition method comprising:

and generating and pushing response options according to the operation content.

2. The large language model based intent recognition method as claimed in claim 1, wherein the type of the input information includes a literal text, a picture, a file and system information;

3. The large language model based intent recognition method as claimed in claim 2, wherein parsing the input information includes:

4. The large language model based intent recognition method of claim 1, wherein the history information includes user history operation records, time of corresponding operation records, and user characteristics.

5. The large language model based intent recognition method as claimed in claim 1, wherein generating operation contents includes:

6. An intent recognition system based on a large language model, the recognition system comprising:

the database is used for storing the history record information;

7. The large language model based intent recognition system of claim 6, wherein the input module comprises:

8. The large language model based intent recognition system of claim 7, wherein the input module further comprises:

9. The large language model based intent recognition system of claim 6, wherein the analysis module comprises: