CN117093679A - Large language model intelligent inquiry dialogue method, system, equipment and medium - Google Patents
- Publication number
- CN117093679A (application CN202310728502.5A)
- Authority
- CN
- China
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
- G06F16/3329 — Natural language query formulation or dialogue systems
- G06F16/319 — Indexing structures; Inverted lists
- G06F40/30 — Handling natural language data; Semantic analysis
- G06N3/042 — Knowledge-based neural networks; Logical representations of neural networks
- G06N3/045 — Neural network architectures; Combinations of networks
- G06N5/01 — Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound
- G06N5/022 — Knowledge engineering; Knowledge acquisition
- G06N5/04 — Inference or reasoning models
- G16H40/67 — ICT for the remote operation of medical equipment or devices
- G16H80/00 — ICT specially adapted for facilitating communication between medical practitioners or patients
Abstract
The invention discloses a large language model intelligent inquiry dialogue method, system, device and medium, comprising the following steps: acquiring pre-inquiry user information; performing user intention prediction and related knowledge search according to the pre-inquiry user information to obtain a first knowledge search result, which serves as the initial dynamic knowledge set; acquiring user input content and taking the dynamic knowledge set as background information; fusing the user input content with the background information to construct a target prompt; inputting the target prompt into a large language model to obtain the inquiry reasoning; while the inquiry reasoning proceeds, asynchronously performing user intention prediction and related knowledge search based on the existing dialogue history and the user information to obtain a second knowledge search result, which is merged into the dynamic knowledge set; and using the updated dynamic knowledge set as background information for the subsequent consultation dialogue. Through the pre-computed and asynchronously updated dynamic knowledge set, the invention reduces factual errors while preserving the real-time performance of dialogue responses.
Description
Technical Field
The invention relates to the technical field of intelligent medical treatment, and in particular to a large language model intelligent inquiry dialogue method, system, device and medium.
Background
Conventional dialogue systems comprise Natural Language Understanding (NLU), dialogue management, Natural Language Generation (NLG) and similar components. For the system to understand input information well and output the desired results, considerable manual effort is usually required for tasks such as feature mining, rule writing and template design. Novel dialogue models based on large language models (Large Language Model, LLM), such as ChatGPT, have strong natural language understanding and generation capability, can automatically learn and infer the intention and semantics of a user, and can generate natural, fluent answers, bringing a new paradigm and new prospects for the realization and application of dialogue systems.
Medical consultation dialogue is a highly serious application scenario with very high requirements on the authenticity and accuracy of the reply content. While an LLM can generate natural, fluent replies, it is difficult for it to avoid factual errors. To reduce such errors, a common approach is to combine the LLM with a search engine, as in the New Bing chat function: relevant content is searched before replying, and the LLM then reasons over the user input and the search results to produce the reply. However, this search-augmented LLM approach struggles to meet the requirements of dialogue systems with strict real-time constraints, such as online voice or video inquiry. While LLM reasoning can guarantee response real-time performance through streaming responses, the search process cannot be streamed, which creates a bottleneck in the response time of such dialogue systems.
Disclosure of Invention
In order to solve the problems, the technical scheme provided by the invention is as follows:
an intelligent inquiry dialogue method for a large language model, comprising the following steps:
acquiring user information before inquiry;
according to the user information before inquiry, carrying out user intention prediction and related knowledge search for a subsequent inquiry dialogue to obtain a first knowledge search result, and taking the first knowledge search result as an initial dynamic knowledge set;
acquiring user input content, and taking the dynamic knowledge set as background information; fusing the user input content with the background information to construct a target prompt; inputting the target prompt into a large language model to obtain the inquiry reasoning that answers the input content;
asynchronous user intention prediction and related knowledge search are carried out according to the existing dialogue history information and the pre-consultation user information, a second knowledge search result is obtained, and the second knowledge search result is updated to the dynamic knowledge set; and taking the updated dynamic knowledge set as background information of a follow-up consultation dialogue.
The present invention is further arranged such that the user intention prediction comprises: constructing a user intention prediction prompt according to the pre-consultation user information or the dialogue history information, and calling a large language model to predict the questions the user is likely to ask in the subsequent dialogue.
The invention is further arranged such that the knowledge search comprises: performing a knowledge search on an external knowledge base according to the predicted user questions, to obtain the first knowledge search result or the second knowledge search result;
if the external knowledge base is structured data, analyzing the predicted user questions with entity recognition, relationship recognition or intention understanding algorithms, and converting them into query conditions on the external knowledge base to obtain the knowledge search result;
if the external knowledge base is unstructured data, using text search technology: the predicted user questions are processed by a query analysis model for entity recognition, intention understanding or query rewriting; a candidate set related to the query is recalled from the indexed data via inverted-index recall or semantic-vector recall; and a ranking model or ranking strategy performs fine screening on the recalled candidate set to obtain the final knowledge search result.
The invention further provides that the first knowledge search result or the second knowledge search result is temporarily stored in memory or a cache as background information for the subsequent consultation dialogue.
The invention is further arranged such that the target prompt comprises an instruction part, a background information part and an input content part; the instruction part is the task to be executed by the large language model, the background information part is the knowledge search result obtained by the search, and the input content part is the user input content.
The invention is further arranged to record, during the consultation dialogue, the dialogue ID, the pre-consultation user information, the dialogue history between the user and the system, and the dynamic knowledge set.
The invention is further configured such that the pre-consultation user information includes at least one of the user's age, gender, past medical history, surgical history, allergy history, medication history, chief-complaint symptoms, and consultation demands.
A large language model intelligent inquiry dialogue system adopting the above large language model intelligent inquiry dialogue method, comprising:
the information collection module is used for collecting user information before inquiry;
the knowledge storage module, which comprises a user intention prediction sub-module, a knowledge search sub-module, a knowledge storage sub-module and an information updating sub-module; the user intention prediction sub-module predicts the questions the user is likely to ask in the subsequent consultation dialogue according to the pre-consultation user information or the dialogue history information; the knowledge search sub-module performs a knowledge search according to the predicted user questions to acquire the knowledge content required by the subsequent dialogue; the knowledge storage sub-module temporarily stores the retrieved knowledge content in memory or a cache for the subsequent consultation dialogue; and the information updating sub-module updates the information required for predicting user intention as the dialogue progresses, using the updated information for user intention prediction and thereby updating the dynamic knowledge set associated with the current inquiry dialogue;
the inquiry dialogue module, which comprises a dialogue management module and a large language model reasoning module; the dialogue management module is responsible for maintaining and updating the state of the consultation dialogue, recording the dialogue ID, the pre-consultation user information, the dialogue history between the user and the system, and the dynamic knowledge set during the consultation dialogue; and the large language model reasoning module reasons over the user's input content and the dynamic knowledge set through the large language model to reply to the user's consultation.
An apparatus comprising a memory and a processor, the memory storing a computer program that, when executed by the processor, causes the processor to perform the large language model intelligent inquiry dialogue method described above.
A storage medium having stored thereon computer program instructions which, when executed by a processor, implement the large language model intelligent inquiry dialogue method described above.
Compared with the prior art, the technical scheme provided by the invention has the following beneficial effects:
according to the technical scheme, the possible intention of the user in the subsequent dialogue is predicted according to the user information collected before the inquiry and the history information in the dialogue process, and the knowledge search is carried out to obtain a dynamic knowledge set, so that background knowledge is provided for the subsequent dialogue. The coverage rate of knowledge is guaranteed by predicting the follow-up intention of the user based on the current information, and the real-time performance of the response of the dialogue process is guaranteed by asynchronous calculation of knowledge updating.
Drawings
FIG. 1 is a flow chart of a large language model intelligent inquiry dialogue method according to an embodiment of the invention.
FIG. 2 is a schematic diagram of a large language model intelligent inquiry dialogue system according to an embodiment of the present invention.
FIG. 3 is a flow chart illustrating the operation of the knowledge storage module according to an embodiment of the present invention.
Fig. 4 is a flowchart illustrating the operation of the inquiry dialogue module according to an embodiment of the present invention.
Fig. 5 is a schematic diagram of user intention prediction by using GPT3.5 according to an embodiment of the present invention.
Fig. 6 is a schematic diagram of query reasoning reply using GPT3.5 according to an embodiment of the present invention.
Fig. 7 is a schematic diagram of an apparatus according to an embodiment of the present invention.
Detailed Description
For the purpose of making the objects, technical solutions and advantages of the embodiments of the present invention more apparent, the technical solutions of the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention, and it is apparent that the described embodiments are some embodiments of the present invention, but not all embodiments of the present invention. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
Example 1
With reference to fig. 1, the technical scheme of the invention is a large language model intelligent inquiry dialogue method, which comprises the following steps:
s100, acquiring user information before inquiry;
s200, according to the user information before the inquiry, carrying out user intention prediction and related knowledge search for a follow-up inquiry dialogue to obtain a first knowledge search result, and taking the first knowledge search result as an initial dynamic knowledge set;
s300, acquiring user input content, and taking the dynamic knowledge set as background information; fusing the user input content with the background information to construct a target prompt; inputting the target prompt into a large language model to obtain the inquiry reasoning that answers the input content;
s400, carrying out asynchronous user intention prediction and related knowledge search according to the existing dialogue history information and the pre-consultation user information to obtain a second knowledge search result, and updating the second knowledge search result into the dynamic knowledge set; and taking the updated dynamic knowledge set as background information of a follow-up consultation dialogue.
In the above embodiment, the first knowledge search result is background knowledge stored in the dynamic knowledge set before the inquiry, and the second knowledge search result is background knowledge updated in the dynamic knowledge set during the inquiry.
In the above embodiment, in a typical online consultation scenario, the user is asked to pre-fill consultation-related information so that the doctor can understand the patient's condition before the consultation dialogue begins. The information may be collected through a dialogue or a form, and the collected results are stored as JSON, for example: {"ID": "user_A_0001", "age": 30, "sex": "female", "crowd label": "pregnant woman", "past medical history": "none", "allergy history": "none", "medication history": "none", "chief complaint": "abdominal pain"}.
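A minimal sketch of collecting and serializing the pre-consultation form into the JSON record described above; the function name and field defaults are assumptions for illustration, not part of the patent.

```python
import json

def collect_pre_consultation_info(form_fields: dict) -> str:
    """Serialize the pre-consultation form into a JSON record (hypothetical helper)."""
    record = {
        "ID": form_fields.get("ID", ""),
        "age": form_fields.get("age"),
        "sex": form_fields.get("sex", ""),
        "crowd_label": form_fields.get("crowd_label", ""),
        # Histories default to "none" when the user leaves them blank.
        "past_medical_history": form_fields.get("past_medical_history", "none"),
        "allergy_history": form_fields.get("allergy_history", "none"),
        "medication_history": form_fields.get("medication_history", "none"),
        "chief_complaint": form_fields.get("chief_complaint", ""),
    }
    return json.dumps(record, ensure_ascii=False)

info = collect_pre_consultation_info({
    "ID": "user_A_0001", "age": 30, "sex": "female",
    "crowd_label": "pregnant woman", "chief_complaint": "abdominal pain",
})
```

The JSON string can then be stored with the dialogue record and fed into the intent-prediction prompt.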
In this embodiment, the user intention prediction includes: constructing a user intention prediction prompt according to the pre-consultation user information or the conversation history information, and calling a large language model to predict the questions the user is most likely to ask in the subsequent conversation.
In the above embodiment, the predicted questions depend on the large language model used and do not limit the scope of the claims; specifically, they may be the 3-5 questions with the highest consultation probability.
In the above embodiment, the questions the user may ask in subsequent conversations are obtained by constructing the user intention prediction prompt and invoking the large language model. Taking the information of user A_0001 above as an example, the user intention prediction prompt is constructed as follows: "This is an online inquiry scenario. The consulting user's information is as follows: age 30, sex female, crowd label pregnant woman, no past medical history, no allergy history, no medication history, chief-complaint symptom abdominal pain. Return the 3 question types the user is most likely to ask in the subsequent dialogue, each giving one most likely question (as complete as possible), as [{"type": "xxx", "question": "xxx"}], without explanation." As shown in FIG. 5, user intention prediction is performed with GPT-3.5, and the questions the user may ask are "What is causing my abdominal pain?", "How can I safely treat abdominal pain during my pregnancy?" and "What should I pay attention to during pregnancy to prevent abdominal pain?".
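The intent-prediction step above can be sketched as building the prompt and parsing the model's JSON list. The stub `fake_llm` stands in for GPT-3.5 so the sketch runs offline; all names here are assumptions, not the patent's API.

```python
import json

INTENT_PROMPT_TEMPLATE = (
    "This is an online inquiry scenario. The consulting user's information is: {profile}. "
    "Return the {n} question types the user is most likely to ask in the subsequent "
    'dialogue, each with one most likely question, as '
    '[{{"type": "xxx", "question": "xxx"}}], without explanation.'
)

def predict_user_intent(profile: str, llm_call, n: int = 3) -> list:
    """Build the intent-prediction prompt and parse the model's JSON reply."""
    prompt = INTENT_PROMPT_TEMPLATE.format(profile=profile, n=n)
    return json.loads(llm_call(prompt))

def fake_llm(prompt: str) -> str:
    # Offline stand-in for a GPT-3.5 call; a real system would call the model API here.
    return json.dumps([
        {"type": "cause", "question": "What is causing my abdominal pain?"},
        {"type": "treatment", "question": "How can I safely treat abdominal pain during pregnancy?"},
        {"type": "prevention", "question": "What should I watch for during pregnancy to prevent abdominal pain?"},
    ])

questions = predict_user_intent(
    "age 30, female, pregnant woman, chief complaint: abdominal pain", fake_llm)
```

Each predicted question then seeds a knowledge search against the external knowledge base.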
In this embodiment, the knowledge search includes: performing a knowledge search on an external knowledge base according to the predicted user questions, to obtain the first knowledge search result or the second knowledge search result;
if the external knowledge base is structured data, analyzing the predicted user questions with entity recognition, relationship recognition or intention understanding algorithms, and converting them into query conditions on the external knowledge base to obtain the knowledge search result;
if the external knowledge base is unstructured data, using text search technology: the predicted user questions are processed by a query analysis model for entity recognition, intention understanding or query rewriting; a candidate set related to the query is recalled from the indexed data via inverted-index recall or semantic-vector recall; and a ranking model or ranking strategy performs fine screening on the recalled candidate set to obtain the final knowledge search result.
In the above embodiment, again taking the information of user A_0001 as an example, a knowledge search is performed based on the questions the user may ask, as predicted by the user intention prediction, to obtain the knowledge content required by the subsequent dialogue.
In the above embodiment, if the external knowledge base is structured data, the predicted user question "What is causing my abdominal pain?" is converted into a knowledge base query condition of the form: SELECT _cause FROM table_knowledge WHERE _type = 'symptom' AND _name = 'abdominal pain'; where table_knowledge denotes the knowledge base table, _name the query object name, _type the query object type, and _cause the "cause" attribute of the query object.
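A runnable sketch of the structured-knowledge-base branch, using an in-memory SQLite table with the column names from the example; the table contents are invented for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE table_knowledge (_name TEXT, _type TEXT, _cause TEXT)")
conn.execute(
    "INSERT INTO table_knowledge VALUES (?, ?, ?)",
    ("abdominal pain", "symptom",
     "In pregnant women, may be physiological or pathological; "
     "common causes include gastroenteritis."))

def query_structured_kb(conn, name: str, obj_type: str) -> list:
    """Turn a parsed question (entity + type) into the SQL condition from the example."""
    rows = conn.execute(
        "SELECT _cause FROM table_knowledge WHERE _type = ? AND _name = ?",
        (obj_type, name),
    ).fetchall()
    return [r[0] for r in rows]

# Entity recognition on the predicted question would yield ("abdominal pain", "symptom").
causes = query_structured_kb(conn, "abdominal pain", "symptom")
```

Parameterized queries are used rather than string concatenation so that extracted entities cannot inject SQL.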
In the above embodiment, if the external knowledge base is unstructured data, the text search technology used comprises an offline preparation part and an online computation part. The offline preparation work includes query analysis model training, index construction, ranking model training and the like. The online computation flow comprises the main links of query analysis, recall and ranking: the query analysis stage mainly covers entity recognition, intention understanding and query rewriting; the recall stage quickly screens query-related candidates from a large amount of data, primarily via inverted-index recall and semantic-vector recall; and a ranking model or strategy performs further screening on the recalled candidate set to obtain the final knowledge search result.
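The recall-then-rank flow can be sketched with a toy inverted index and a term-overlap ranker; a production system would use a trained ranking model, and the documents here are invented.

```python
from collections import defaultdict

DOCS = {
    "k1": "common causes of abdominal pain in pregnant women include gastroenteritis",
    "k2": "self relief methods for abdominal pain during pregnancy",
    "k3": "diet advice for hypertension",
}

# Offline preparation: build the inverted index (term -> doc ids).
index = defaultdict(set)
for doc_id, text in DOCS.items():
    for term in text.split():
        index[term].add(doc_id)

def search(query: str, top_k: int = 2) -> list:
    """Online flow: recall candidates via the inverted index, then rank by term overlap."""
    terms = query.lower().split()
    candidates = set().union(*(index.get(t, set()) for t in terms))
    ranked = sorted(candidates,
                    key=lambda d: sum(t in DOCS[d] for t in terms),
                    reverse=True)
    return ranked[:top_k]

hits = search("causes of abdominal pain")
```

Semantic-vector recall would add an embedding index alongside the inverted index; the ranking stage then merges both candidate sets.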
In this embodiment, the first knowledge search result or the second knowledge search result is temporarily stored in memory or a cache and used as background information for the subsequent consultation dialogue.
In the above embodiment, each knowledge point may be represented and stored in key-value form, where the key is the name of the knowledge point and the value is its content. An inquiry dialogue generally requires more than one knowledge point, and multiple knowledge points form a knowledge set. Taking the information of user A_0001 above as an example, suppose a knowledge search is performed on the two knowledge points "What are the common causes of abdominal pain in pregnant women?" and "What are the self-relief methods for abdominal pain in pregnant women?", yielding the following knowledge content:
{"A_0001_dynamic knowledge set": {"What are the common causes of abdominal pain in pregnant women?": "The causes of abdominal pain in pregnant women can be roughly divided into two types, one physiological and the other pathological...", "What are the self-relief methods for abdominal pain in pregnant women?": "If a pregnant woman has symptoms of abdominal pain, the specific cause needs to be examined first, so that targeted conditioning can then help relieve the symptoms..."}}. The knowledge set may be stored in Redis together with the inquiry dialogue ID for use in subsequent inquiry dialogues.
In this embodiment, the target prompt includes an instruction portion, a background information portion and an input content portion; the instruction portion is the task to be executed by the large language model, the background information portion is the knowledge search result obtained by the search, and the input content portion is the user input content.
In the above embodiment, taking the consultation of user A_0001 as an example, the target prompt may be constructed as follows: "This is an online inquiry task; start a dialogue with the user, referring to the background knowledge and the dialogue history. Background knowledge: common causes of abdominal pain in pregnant women: abdominal pain in pregnant women may be caused by gastroenteritis, premature placental abruption, threatened abortion, etc. If the pregnant woman eats unclean food, gastroenteritis may result, easily irritating the gastrointestinal mucosa and causing abdominal pain. If the pregnant woman has hypertension or the abdomen is injured, premature placental abruption may occur, and the abruption may cause abdominal pain. Threatened abortion may be caused by embryonic chromosome abnormalities and environmental factors, which may trigger uterine contractions, so symptoms of abdominal pain and vaginal bleeding may appear. User information: pregnant woman, 30 years old, abdominal pain. The following is the dialogue history: User: I just ate a piece of watermelon and now my belly hurts. Why? Predict the next doctor's reply:" As shown in FIG. 6, the target prompt is used as input to the large language model, which is invoked for reasoning, yielding the following reply: "You may have gastroenteritis caused by unclean food; the gastrointestinal mucosa is easily irritated, which causes abdominal pain. You are advised to pay attention to dietary hygiene, drink more water and rest more. If symptoms worsen or are accompanied by other symptoms, prompt medical attention is advised."
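The three-part target prompt above can be assembled as a simple string template; the function name and section labels are assumptions reflecting the example, not a fixed format from the patent.

```python
def build_target_prompt(instruction: str, background: list,
                        user_profile: str, dialogue_history: list) -> str:
    """Assemble the target prompt: instruction, background knowledge, input content."""
    return "\n".join([
        instruction,
        "Background knowledge: " + " ".join(background),
        "User information: " + user_profile,
        "The following is the dialogue history:",
        *dialogue_history,
        "Predict the next doctor's reply:",
    ])

prompt = build_target_prompt(
    instruction=("This is an online inquiry task; start a dialogue with the user, "
                 "referring to the background knowledge and the dialogue history."),
    background=["Abdominal pain in pregnant women may be caused by gastroenteritis, "
                "premature placental abruption, or threatened abortion."],
    user_profile="pregnant woman, 30 years old, abdominal pain",
    dialogue_history=["User: I just ate a piece of watermelon and now my belly hurts. Why?"],
)
```

The resulting string is passed to the large language model; because the background section is read from the pre-computed dynamic knowledge set, prompt assembly itself adds no search latency.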
In the above embodiment, the initial dynamic knowledge set of the inquiry dialogue is obtained through user intention prediction and related knowledge search based on the user information and demands collected before the inquiry. As the inquiry dialogue progresses, the user information and demands may change, and so will the knowledge points required in the dialogue. The knowledge update and the dialogue process are asynchronous, and dialogue reply reasoning does not wait for the knowledge update. During the conversation, user intention prediction and related knowledge search are performed based on the current dialogue history and user information, and the dynamic knowledge set is then updated.
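The asynchronous relationship between reply reasoning and knowledge updating can be sketched with a background thread: replies read whatever the dynamic knowledge set currently holds and never block on the update. The sleep and the hard-coded knowledge are stand-ins for the real intent-prediction and search latency.

```python
import threading
import time

knowledge_set = {"initial": "pre-consultation background"}
lock = threading.Lock()

def update_knowledge_async(dialogue_history: list) -> threading.Thread:
    """Run intent prediction + knowledge search in the background (illustrative stub)."""
    def worker():
        # Stands in for the LLM intent-prediction call and knowledge search.
        time.sleep(0.05)
        with lock:
            knowledge_set["abdominal pain after eating"] = "possible gastroenteritis knowledge"
    t = threading.Thread(target=worker, daemon=True)
    t.start()
    return t

def reply(user_input: str) -> str:
    """Reply immediately using whatever the dynamic knowledge set currently holds."""
    with lock:
        background = "; ".join(knowledge_set.values())
    return f"[reply using background: {background}]"

t = update_knowledge_async(["User: my belly hurts after eating"])
first = reply("my belly hurts")      # returned immediately; does not wait for the update
t.join()
second = reply("what should I do")   # a later turn sees the updated knowledge set
```

In a production system the same decoupling would typically use an async task queue rather than a raw thread, but the invariant is identical: reasoning reads the latest available snapshot of the knowledge set.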
In this embodiment, during the inquiry dialogue, the dialogue ID, the pre-consultation user information, the dialogue history between the user and the system, and the dynamic knowledge set are recorded.
In this embodiment, the pre-consultation user information includes at least one of the user's age, gender, past medical history, surgical history, allergy history, medication history, chief-complaint symptoms, and consultation demands.
In this embodiment, GPT-3.5 is taken as an example of the large language model; other large language models may be used instead.
According to the above technical scheme, the user's likely intentions in the subsequent dialogue are predicted from the user information collected before the inquiry and from the history information accumulated during the dialogue, and knowledge search is performed to obtain a dynamic knowledge set that provides background knowledge for the subsequent dialogue. Predicting the user's subsequent intentions from the current information guarantees knowledge coverage, and the asynchronous computation of knowledge updates guarantees the real-time responsiveness of the dialogue process.
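The knowledge-search step in this loop is a two-stage search (claim 3 below describes recall followed by fine screening for unstructured data). A toy sketch, in which simple term overlap stands in for inverted-index or semantic-vector recall and for a learned ranking model; all names are illustrative:

```python
def knowledge_search(predicted_question, corpus, recall_k=10, final_k=3):
    """Two-stage search: coarse recall of a candidate set, then fine rerank.
    Term overlap is a toy stand-in for inverted-index/semantic recall and
    for a ranking model."""
    q_terms = set(predicted_question.lower().split())

    def overlap(doc):
        return len(q_terms & set(doc.lower().split()))

    # Stage 1: recall a candidate set related to the query.
    candidates = sorted(corpus, key=overlap, reverse=True)[:recall_k]
    # Stage 2: fine screening of the recalled candidates with a ranking
    # function (overlap again, tie-broken by brevity as a crude proxy).
    reranked = sorted(candidates, key=lambda d: (overlap(d), -len(d)), reverse=True)
    return reranked[:final_k]

corpus = [
    "abdominal pain in pregnant women common causes",
    "aircraft maintenance manual",
    "gastroenteritis diet and hygiene advice",
]
results = knowledge_search("causes of abdominal pain in pregnant women",
                           corpus, final_k=2)
```

A real system would replace both stages with the index-backed recall and ranking models named in claim 3.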
Example 2
Referring to fig. 2 to fig. 4, the technical scheme of the invention is a large language model intelligent inquiry dialogue system adopting the large language model intelligent inquiry dialogue method described in embodiment 1, comprising:
the information collection module is used for collecting user information before inquiry;
the knowledge storage module comprises a user intention prediction submodule, a knowledge search submodule, a knowledge storage submodule and an information updating submodule; the user intention prediction submodule predicts the questions the user may ask in the subsequent consultation dialogue according to the pre-consultation user information or the dialogue history information; the knowledge search submodule performs a knowledge search according to the predicted user questions to acquire the knowledge content required by the subsequent dialogue; the knowledge storage submodule temporarily stores the retrieved knowledge content in memory or a cache for the subsequent consultation dialogue; the information updating submodule updates, as the dialogue progresses, the information required for predicting user intention and uses the updated information for intention prediction, thereby updating the dynamic knowledge set associated with the current inquiry dialogue;
the inquiry dialogue module comprises a dialogue management module and a large language model reasoning module; the dialogue management module is responsible for maintaining and updating the state of the consultation dialogue, recording the dialogue ID, the pre-consultation user information, the dialogue history between the user and the system, and the dynamic knowledge set during the consultation dialogue; and the large language model reasoning module reasons about and replies to the user's consultation through the large language model according to the user's input content and the dynamic knowledge set.
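The dialogue state that the dialogue management module maintains could be modeled as a simple record holding the four recorded items; the class name and field layout below are illustrative, not from the patent:

```python
from dataclasses import dataclass, field

@dataclass
class InquiryDialogueState:
    """State recorded during an inquiry dialogue: dialogue ID, pre-inquiry
    user information, user/system turn history and the dynamic knowledge set."""
    dialogue_id: str
    pre_inquiry_user_info: dict                           # age, gender, history, ...
    history: list = field(default_factory=list)           # (speaker, utterance) turns
    dynamic_knowledge: list = field(default_factory=list)

    def add_turn(self, speaker, utterance):
        self.history.append((speaker, utterance))

state = InquiryDialogueState(
    dialogue_id="a_0001",
    pre_inquiry_user_info={"age": 30, "gender": "female",
                           "complaint_symptom": "abdominal pain"},
)
state.add_turn("user", "I just ate a piece of watermelon; why does my belly hurt?")
```

Each turn appends to `history`, and the knowledge storage module rewrites `dynamic_knowledge` as its asynchronous updates complete.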
Example 3
Referring to fig. 7, the technical solution of the present invention is an apparatus comprising a memory 200 and a processor 100; the memory 200 stores a computer program which, when executed by the processor 100, causes the processor 100 to execute the large language model intelligent inquiry dialogue method described in embodiment 1.
Example 4
The technical scheme of the invention is a storage medium storing computer program instructions which, when executed by a processor, implement the large language model intelligent inquiry dialogue method described in embodiment 1.
In this description, each embodiment is described in a progressive manner; each embodiment focuses on its differences from the other embodiments, and identical or similar parts among the embodiments may be referred to mutually.
It should also be noted that in this specification, relational terms such as first and second are used solely to distinguish one entity or action from another, without necessarily requiring or implying any actual such relationship or order between such entities or actions. Moreover, the terms "comprises," "comprising," or any other variation thereof are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements includes not only those elements but may also include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.
The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.
Claims (10)
1. A large language model intelligent inquiry dialogue method, characterized by comprising the following steps:
acquiring user information before inquiry;
according to the user information before inquiry, carrying out user intention prediction and related knowledge search for a subsequent inquiry dialogue to obtain a first knowledge search result, and taking the first knowledge search result as an initial dynamic knowledge set;
acquiring user input content, and taking the dynamic knowledge set as background information; fusing the user input content and the background information to construct a target prompt; inputting the target prompt into a large language model to obtain an inquiry reasoning reply to the input content;
performing asynchronous user intention prediction and related knowledge search according to the existing dialogue history information and the pre-consultation user information to obtain a second knowledge search result, and updating the second knowledge search result into the dynamic knowledge set; and taking the updated dynamic knowledge set as background information for the subsequent consultation dialogue.
2. The large language model intelligent inquiry dialogue method as claimed in claim 1, wherein said user intention prediction comprises:
constructing a user intention prediction prompt according to the pre-consultation user information or the dialogue history information, and calling a large language model to predict a plurality of questions the user may consult in the subsequent dialogue.
3. The large language model intelligent inquiry dialogue method according to claim 2, wherein the knowledge search comprises:
performing a knowledge search on an external knowledge base according to the predicted user questions to obtain the first knowledge search result or the second knowledge search result;
if the external knowledge base is structured data, analyzing the predicted user questions with entity recognition, relationship recognition or intention understanding algorithms, and converting them into query conditions on the external knowledge base to obtain a knowledge search result;
if the external knowledge base is unstructured data, using text search technology: performing entity recognition, intention understanding or query rewriting on the predicted user questions through a query analysis model, recalling a query-related candidate set from the indexed data through inverted index recall or semantic vector recall, and then finely screening the recalled candidate set with a ranking model or ranking strategy to obtain the final knowledge search result.
4. The method of claim 3, wherein the first knowledge search result or the obtained second knowledge search result is temporarily stored in a memory or a cache as background information of a subsequent consultation session.
5. The large language model intelligent consultation dialogue method according to claim 4, wherein the target prompt comprises an instruction part, a background information part and an input content part; the instruction part is a task to be executed by the large language model, the background information part is a knowledge search result obtained by searching, and the input content part is the user input content.
6. A large language model intelligent consultation dialogue method according to any of claims 1 to 5 characterised in that dialogue ID, pre-consultation user information, user and system dialogue history and dynamic knowledge sets are recorded during the consultation dialogue.
7. A large language model intelligent consultation dialogue method according to any one of claims 1 to 5, characterized in that the pre-consultation user information includes at least one of the user's age, gender, past medical history, surgical history, allergy history, medication history, complaint symptoms and consultation appeal.
8. A large language model intelligent inquiry dialogue system, characterized in that the large language model intelligent inquiry dialogue method as claimed in any one of claims 1 to 7 is adopted, comprising:
the information collection module is used for collecting user information before inquiry;
the knowledge storage module comprises a user intention prediction submodule, a knowledge search submodule, a knowledge storage submodule and an information updating submodule; the user intention prediction submodule predicts the questions the user may ask in the subsequent consultation dialogue according to the pre-consultation user information or the dialogue history information; the knowledge search submodule performs a knowledge search according to the predicted user questions to acquire the knowledge content required by the subsequent dialogue; the knowledge storage submodule temporarily stores the retrieved knowledge content in memory or a cache for the subsequent consultation dialogue; the information updating submodule updates, as the dialogue progresses, the information required for predicting user intention and uses the updated information for intention prediction, thereby updating the dynamic knowledge set associated with the current inquiry dialogue;
the inquiry dialogue module comprises a dialogue management module and a large language model reasoning module; the dialogue management module is responsible for maintaining and updating the state of the consultation dialogue, recording the dialogue ID, the pre-consultation user information, the dialogue history between the user and the system, and the dynamic knowledge set during the consultation dialogue; and the large language model reasoning module reasons about and replies to the user's consultation through the large language model according to the user's input content and the dynamic knowledge set.
9. An apparatus comprising a memory and a processor, the memory storing a computer program that, when executed by the processor, causes the processor to perform the large language model intelligent inquiry dialogue method of any one of claims 1 to 7.
10. A storage medium having stored thereon computer program instructions which, when executed by a processor, implement the large language model intelligent inquiry dialogue method of any one of claims 1 to 7.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310728502.5A CN117093679B (en) | 2023-06-19 | 2023-06-19 | Large language model intelligent inquiry dialogue method, system, equipment and medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN117093679A true CN117093679A (en) | 2023-11-21 |
CN117093679B CN117093679B (en) | 2024-04-02 |
Family
ID=88780657
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310728502.5A Active CN117093679B (en) | 2023-06-19 | 2023-06-19 | Large language model intelligent inquiry dialogue method, system, equipment and medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117093679B (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117932041A (en) * | 2024-03-21 | 2024-04-26 | 南京信息工程大学 | Emotion support dialogue generation method, system and device based on thinking chain reasoning |
CN118039138A (en) * | 2024-04-12 | 2024-05-14 | 北京惠每云科技有限公司 | Controllable path pre-consultation method and device based on large model |
CN117932041B (en) * | 2024-03-21 | 2024-06-11 | 南京信息工程大学 | Emotion support dialogue generation method, system and device based on thinking chain reasoning |
Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106844506A (en) * | 2016-12-27 | 2017-06-13 | 竹间智能科技(上海)有限公司 | The knowledge retrieval method and the automatic improving method of knowledge base of a kind of artificial intelligence dialogue |
CN111221947A (en) * | 2019-12-26 | 2020-06-02 | 北京邮电大学 | Multi-round conversation realization method and device of ophthalmologic pre-inquiry device |
CN111651609A (en) * | 2020-04-24 | 2020-09-11 | 中国电力科学研究院有限公司 | Multi-turn dialogue method and system integrating knowledge graph and emotion supervision |
CN114781402A (en) * | 2022-05-12 | 2022-07-22 | 平安科技(深圳)有限公司 | Method and device for identifying inquiry intention, electronic equipment and readable storage medium |
CN114780694A (en) * | 2022-03-28 | 2022-07-22 | 北京智谱华章科技有限公司 | Zero-fine-tuning anthropomorphic session generation method and equipment based on pre-training language model |
CN115599899A (en) * | 2022-11-08 | 2023-01-13 | 中国空气动力研究与发展中心计算空气动力研究所(Cn) | Intelligent question-answering method, system, equipment and medium based on aircraft knowledge graph |
WO2023038654A1 (en) * | 2021-09-07 | 2023-03-16 | Google Llc | Using large language model(s) in generating automated assistant response(s) |
CN115827838A (en) * | 2022-11-23 | 2023-03-21 | 北京智谱华章科技有限公司 | Dialog generation method and system based on story continuous writing and dynamic knowledge base |
CN115952272A (en) * | 2023-03-10 | 2023-04-11 | 杭州心识宇宙科技有限公司 | Method, device and equipment for generating dialogue information and readable storage medium |
CN116136870A (en) * | 2023-02-21 | 2023-05-19 | 深圳市网联安瑞网络科技有限公司 | Intelligent social conversation method and conversation system based on enhanced entity representation |
CN116187442A (en) * | 2022-12-28 | 2023-05-30 | 国家电网有限公司信息通信分公司 | Dynamically updated intelligent customer service knowledge base system |
Non-Patent Citations (4)
Title |
---|
MENGJUAN LIU: "Improving knowledge-based dialogue generation through two-stage knowledge selection and knowledge selection-guided pointer network", Journal of Intelligent Information Systems, vol. 59, 22 June 2022 (2022-06-22), pages 591 - 611 *
ZHANG, WEI-NAN; ZHU, QINGFU; WANG, YIFA et al.: "Neural personalized response generation as domain adaptation", World Wide Web, vol. 22, 31 December 2019 (2019-12-31), pages 1427 - 1446 *
YU SIWEI; FAN HAO; WANG FEI; XU LEI: "Research on intelligent healthcare based on knowledge graphs", Medical and Health Equipment (医疗卫生装备), no. 03, 15 March 2017 (2017-03-15), pages 116 - 118 *
ZHANG DINGHUI; SHAO HUIHE: "Human-machine dialogue modes of a real-time fault diagnosis expert system", Application Research of Computers (计算机应用研究), no. 04, 30 July 1998 (1998-07-30), pages 69 - 71 *
Also Published As
Publication number | Publication date |
---|---|
CN117093679B (en) | 2024-04-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109065172B (en) | Disease information acquisition method and device, computer equipment and storage medium | |
CN112016295B (en) | Symptom data processing method, symptom data processing device, computer equipment and storage medium | |
CN117093679B (en) | Large language model intelligent inquiry dialogue method, system, equipment and medium | |
CN110111886A (en) | A kind of intelligent interrogation system and method based on XGBoost disease forecasting | |
JP2000501869A (en) | How to select medical and biochemical diagnostic tests using neural network related applications | |
CN110993093A (en) | Deep learning-based ophthalmic pre-interrogation method and device | |
WO2021151356A1 (en) | Triage data processing method and apparatus, and computer device and storage medium | |
CN112837772A (en) | Pre-inquiry case history generation method and device | |
CN111696661A (en) | Patient clustering model construction method, patient clustering method and related equipment | |
US20210089965A1 (en) | Data Conversion/Symptom Scoring | |
CN114999676A (en) | Method, system, apparatus and medium for automatically replying to medical consultation | |
CN112037904B (en) | Online diagnosis and treatment data processing method and device, computer equipment and storage medium | |
CN112259260B (en) | Intelligent medical question-answering method, system and device based on intelligent wearable equipment | |
CN111933285B (en) | Organ age prediction system, method and device and storage medium | |
CN113722504A (en) | Pre-inquiry information generation method and device | |
Ranjini et al. | Machine learning techniques for assisted reproductive technology: A review | |
KR102306953B1 (en) | Method of medical support based on voice recognition | |
CN114328864A (en) | Ophthalmic question-answering system based on artificial intelligence and knowledge graph | |
CN111783473A (en) | Method and device for identifying best answer in medical question and answer and computer equipment | |
CN117238527B (en) | Medical information management method and system based on big data | |
US11869674B2 (en) | Translation of medical evidence into computational evidence and applications thereof | |
Vairavel et al. | Data Analytics on Unpredictable Pregnancy Data Records Using Ensemble Neuro-Fuzzy Techniques. | |
CN116561286B (en) | Dialogue method and device | |
CN115762810A (en) | Inquiry request processing method, inquiry reply method and inquiry data processing method | |
CN118116620A (en) | Medical question answering method and device and electronic equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||