WO2022134421A1 - Multi-knowledge graph based intelligent reply method and apparatus, computer device and storage medium - Google Patents

Multi-knowledge graph based intelligent reply method and apparatus, computer device and storage medium Download PDF

Info

Publication number
WO2022134421A1
WO2022134421A1 PCT/CN2021/091291 CN2021091291W WO2022134421A1 WO 2022134421 A1 WO2022134421 A1 WO 2022134421A1 CN 2021091291 W CN2021091291 W CN 2021091291W WO 2022134421 A1 WO2022134421 A1 WO 2022134421A1
Authority
WO
WIPO (PCT)
Prior art keywords
current
entity name
question
entity
knowledge graph
Prior art date
Application number
PCT/CN2021/091291
Other languages
French (fr)
Chinese (zh)
Inventor
张松
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2022134421A1 publication Critical patent/WO2022134421A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation or dialogue systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/242Dictionaries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/247Thesauruses; Synonyms
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • G06F40/295Named entity recognition

Definitions

  • the present application relates to the field of knowledge graphs of big data, and in particular, to an intelligent reply method, apparatus, computer equipment and storage medium based on multi-knowledge graphs.
  • chat robots carried in servers are generally classified into chat robots, task (vertical) robots, and question answering (QA) robots according to application scenarios.
  • QA question answering
  • the embodiments of the present application provide an intelligent answering method, device, computer equipment and storage medium based on multi-knowledge graphs, aiming to solve the problem of accurate keyword matching in the process of intelligent questioning and answering with users in the prior art.
  • an embodiment of the present application provides an intelligent reply method based on a multi-knowledge graph, which includes:
  • Receive the dialogue text sent by the user terminal perform intention recognition on the dialogue text to obtain an intention recognition keyword set, and obtain the current entity name corresponding to each intention recognition keyword in the intention recognition keyword set, so as to form the current entity name set;
  • the corresponding first current associated question set is obtained in the local first knowledge graph library, and the first current associated question set is sent to the client; wherein the first current associated question set is The number of association question subsets included in the current association question set is the same as the number of current entity names included in the current entity name set, and each current entity name corresponds to one of the association questions in the first current association question set Subset;
  • the first question click instruction corresponding to the first current associated question set sent by the client is detected, the first target entity name corresponding to the first question click instruction is obtained, and the first target entity name corresponding to the first target entity name is obtained.
  • a set of associated entity names obtaining a corresponding second current associated problem set in the local first knowledge graph library according to the names of each first associated entity in the first associated entity name set, and sending the second current associated problem set to The user terminal; wherein, the number of association question subsets included in the second current association question set is the same as the number of first association entity names included in the first association entity name set, and each first association entity a name corresponding to one of the subset of contextual questions in the second current set of contextual questions; and
  • the second target entity name corresponding to the second question click instruction is acquired and the completion flag is added as the first output result, and the The first output result and the current user portrait are mapped one-to-one for binding.
  • an embodiment of the present application provides an intelligent replying device based on a multi-knowledge graph, which includes:
  • connection establishment unit configured to establish a connection with the user terminal and obtain a current user portrait corresponding to the user terminal if an intelligent session connection instruction sent by the user terminal is detected
  • the current entity recognition unit is used to receive the dialogue text sent by the user terminal, perform intention recognition on the dialogue text to obtain an intention recognition keyword set, and obtain the current entity corresponding to each intention recognition keyword in the intention recognition keyword set. name, to make up the current set of entity names;
  • the first question set sending unit is configured to obtain the corresponding first current associated question set in the local first knowledge graph library according to each current entity name in the current entity name set, and send the first current associated question set to User terminal; wherein, the number of association question subsets included in the first current association question set is the same as the number of current entity names included in the current entity name set, and each current entity name corresponds to the first one of the subset of related questions in the current set of related questions;
  • the second question set sending unit is configured to acquire the name of the first target entity corresponding to the first question click instruction and obtain the The first associated entity name set corresponding to the first target entity name, according to each first associated entity name in the first associated entity name set, obtain the corresponding second current associated problem set in the local first knowledge graph library, The second current associated question set is sent to the client; wherein, the number of associated question subsets included in the second current associated question set and the number of first associated entity names included in the first associated entity name set are the same, and each first associated entity name corresponds to one of the associated problem subsets in the second current associated problem set; and
  • the result binding unit is configured to acquire the second target entity name addition completion identifier corresponding to the second question click instruction corresponding to the second question click instruction as the first an output result, the first output result and the current user portrait are mapped one-to-one to bind.
  • an embodiment of the present application further provides a computer device, which includes a memory, a processor, and a computer program stored on the memory and executable on the processor, where the processor executes the computer
  • the program implements the following steps:
  • Receive the dialogue text sent by the user terminal perform intention recognition on the dialogue text to obtain an intention recognition keyword set, and obtain the current entity name corresponding to each intention recognition keyword in the intention recognition keyword set, so as to form the current entity name set;
  • the corresponding first current associated question set is obtained in the local first knowledge graph library, and the first current associated question set is sent to the client; wherein the first current associated question set is The number of association question subsets included in the current association question set is the same as the number of current entity names included in the current entity name set, and each current entity name corresponds to one of the association questions in the first current association question set Subset;
  • the first question click instruction corresponding to the first current associated question set sent by the client is detected, the first target entity name corresponding to the first question click instruction is obtained, and the first target entity name corresponding to the first target entity name is obtained.
  • a set of associated entity names obtaining a corresponding second current associated problem set in the local first knowledge graph library according to the names of each first associated entity in the first associated entity name set, and sending the second current associated problem set to The user terminal; wherein, the number of association question subsets included in the second current association question set is the same as the number of first association entity names included in the first association entity name set, and each first association entity a name corresponding to one of the subset of contextual questions in the second current set of contextual questions; and
  • the second target entity name corresponding to the second question click instruction is acquired and the completion flag is added as the first output result, and the The first output result and the current user portrait are mapped one-to-one for binding.
  • embodiments of the present application further provide a computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, and when executed by a processor, the computer program causes the processor to perform the following operations :
  • Receive the dialogue text sent by the user terminal perform intention recognition on the dialogue text to obtain an intention recognition keyword set, and obtain the current entity name corresponding to each intention recognition keyword in the intention recognition keyword set, so as to form the current entity name set;
  • the corresponding first current associated question set is obtained in the local first knowledge graph library, and the first current associated question set is sent to the client; wherein the first current associated question set is The number of association question subsets included in the current association question set is the same as the number of current entity names included in the current entity name set, and each current entity name corresponds to one of the association questions in the first current association question set Subset;
  • the first question click instruction corresponding to the first current associated question set sent by the client is detected, the first target entity name corresponding to the first question click instruction is obtained, and the first target entity name corresponding to the first target entity name is obtained.
  • a set of associated entity names obtaining a corresponding second current associated problem set in the local first knowledge graph library according to the names of each first associated entity in the first associated entity name set, and sending the second current associated problem set to The user terminal; wherein, the number of association question subsets included in the second current association question set is the same as the number of first association entity names included in the first association entity name set, and each first association entity a name corresponding to one of the subset of contextual questions in the second current set of contextual questions; and
  • the second target entity name corresponding to the second question click instruction is acquired and the completion flag is added as the first output result, and the The first output result and the current user portrait are mapped one-to-one for binding.
  • the embodiments of the present application provide an intelligent answering method, device, computer equipment and storage medium based on multiple knowledge graphs, to determine whether a user establishes a session with an intelligent dialogue robot for the first time, so as to call associated question sets from different knowledge graph libraries. Users with different levels of knowledge can recommend corresponding information accordingly, which can effectively improve the efficiency and accuracy of information acquisition by users.
  • FIG. 1 is a schematic diagram of an application scenario of an intelligent reply method based on a multi-knowledge graph provided by an embodiment of the present application;
  • FIG. 2 is a schematic flowchart of an intelligent reply method based on a multi-knowledge graph provided by an embodiment of the present application
  • FIG. 3 is a schematic block diagram of an intelligent replying device based on a multi-knowledge graph provided by an embodiment of the present application
  • FIG. 4 is a schematic block diagram of a computer device provided by an embodiment of the present application.
  • FIG. 1 is a schematic diagram of an application scenario of the multi-knowledge graph-based intelligent reply method provided by the embodiment of the application
  • FIG. 2 is a schematic flowchart of the multi-knowledge graph-based intelligent reply method provided by the embodiment of the application. , the multi-knowledge graph-based intelligent reply method is applied in the server, and the method is executed by the application software installed in the server.
  • the method includes steps S101-S109.
  • S101 Receive and save a first knowledge graph library and a second knowledge graph library uploaded by a business server; wherein, the first knowledge graph library includes a plurality of first sub-knowledge graphs, and each first sub-knowledge graph corresponds to several entity name, and there is an association relationship between entities corresponding to the entity name; the second knowledge graph library includes a plurality of second sub-knowledge graphs, each second sub-knowledge graph corresponds to several entity names, and the entity name corresponds to There are associations between entities.
  • the R&D personnel can upload the pre-built first knowledge graph library and the second knowledge graph library to the server.
  • the knowledge graph library Once the above-mentioned knowledge graph library is updated, it can be directly uploaded from the business server to the server for updating.
  • product knowledge or information from the server can be insurance products, wealth management products, electronic products, sports equipment products, etc.
  • the multiple first sub-knowledge maps included in the first knowledge map database correspond to A collection of answer texts for questions concerned by users in the first understanding stage of some products (which can be understood as the demand stimulation stage, where users pay more attention to the acquisition of product popularization knowledge), and multiple first subsections included in the first knowledge graph library.
  • the knowledge graph is mainly based on product-related popular science introduction; the multiple second sub-knowledge graphs included in the second knowledge graph base correspond to the second understanding stage of some products by the user (it can be understood as the concept introduction stage, users pay more attention to The acquisition of detailed parameters of the product) is a collection of answer texts for the concerned question, and the plurality of second sub-knowledge graphs included in the second knowledge graph base are the detailed parameters related to the product.
  • the second is the server.
  • the server can store the first knowledge graph library and the second knowledge graph library uploaded by the business server, and also store the user portraits corresponding to multiple users. These data are used as the background of the intelligent dialogue robot deployed in the server. data.
  • the relationship between the tag keywords in the user portrait and the entity names in the above two knowledge graph bases includes completely different, similar, and identical relationships.
  • a corresponding set of associated questions can be obtained from the first knowledge graph library or the second knowledge graph library through the tag keywords corresponding to the user and sent to the user terminal.
  • the third is the client.
  • Each client stores a corresponding user portrait in the server. If a certain client establishes a dialogue with the intelligent dialogue robot in the server, the user portrait of the user of the client can be obtained in the server.
  • the server After the server receives the first knowledge graph library and the second knowledge graph library uploaded by the business server, and stores the user portraits corresponding to multiple users, these data can be used in the dialog established between the user end and the intelligent dialog robot to conduct Recommendations for questions or knowledge points.
  • the server After a certain client (for example, denoted as client A) establishes a communication connection with the server and opens an intelligent session, the server establishes a connection with the client, and obtains the current user portrait corresponding to the client .
  • client A for example, denoted as client A
  • the server establishes a connection with the client, and obtains the current user portrait corresponding to the client .
  • the current user portrait can be understood as the basic portrait feature of the user.
  • the process of obtaining the user portrait corresponding to the user terminal is as follows:
  • the user unique identification code corresponding to the user terminal is obtained, and the current user portrait corresponding to the user terminal is obtained from the local user portrait library according to the user unique identification code.
  • the server in order to obtain the user portrait of the client more accurately, after the client establishes a connection with the server, the server obtains the user's unique identification code (such as a phone number, ID number, etc.) corresponding to the client. At the same time, according to the user's unique identification code in the local user portrait library of the server, the current user portrait uniquely corresponding to the user terminal can be obtained.
  • the user's unique identification code such as a phone number, ID number, etc.
  • S103 Receive the dialogue text sent by the user terminal, perform intention recognition on the dialogue text to obtain an intention recognition keyword set, and acquire the current entity names corresponding to each intention recognition keyword in the intention recognition keyword set, so as to form a current entity name. Entity name set.
  • the user terminal when the user terminal establishes a connection with the server and starts a dialogue with the intelligent dialogue robot, it is necessary to preliminarily analyze the user's intention according to the received dialogue text. Based on the conversation topics that the current user may be generating, extract the entities of the knowledge graph associated with this conversation process (such as the two entities of child + health). Since it is the first time for the client to establish a session with the intelligent dialogue robot, the sub-knowledge in the demand-stimulation stage can be called in the first knowledge map library based on the core entities extracted from this dialogue flow (such as the child + health in the above example). Atlas.
  • the step of performing intent recognition on the dialogue text in step S103 to obtain an intent recognition keyword set includes:
  • the dialogue text is subjected to word segmentation based on a probabilistic and statistical word segmentation model, and a word segmentation result corresponding to the dialogue text is obtained;
  • keywords that do not exceed the preset ranking threshold after the word segmentation results are sorted in descending order by the frequency-inverse text frequency index are extracted to form an intent identification keyword set.
  • the dialogue text is segmented based on the probability and statistical word segmentation model
  • C C1C2...Cm
  • C is the Chinese character string to be segmented
  • W W1W2...Wn
  • W is the cut
  • Wa, Wb, ..., Wk are all possible segmentation schemes of C.
  • the word segmentation model based on probability statistics can find the target word string W, so that W satisfies: P(W
  • C) MAX(P(Wa
  • the word string W obtained by the above word segmentation model is the word string with the largest estimated probability.
  • the dialogue text can be segmented based on the probability and statistics word segmentation model, so as to obtain a word segmentation result corresponding to the dialogue text.
  • the word frequency-inverse text frequency index model is used to extract the keywords in the word segmentation results.
  • the word frequency-inverse text frequency index model is the TF-IDF model, and TF-IDF is the abbreviation of Term Frequency-Inverse Document Frequency. Extracting the keywords that are located before the preset ranking value after the word segmentation results are sorted in descending order of frequency-inverse text frequency index, to form an intent-identifying keyword set.
  • the word frequency-inverse text frequency index model can accurately extract the core keywords in the text without manual reading.
  • step S103 the step of obtaining the current entity names corresponding to each intent identification keyword in the intent identification keyword set, so as to form the current entity name set, includes:
  • the current entity names corresponding to each intent identification keyword are obtained from the entity name database, so as to form a current entity name set.
  • the server locally pre-stores an entity name library composed of entity names in the first knowledge graph library and the sub-knowledge graphs included in the second knowledge graph library, and also stores a database for judging words and A thesaurus dictionary of similarity between words.
  • all the words included in the synonym words are organized in one or several tree structures (this tree or trees are recorded as the word forest), and the two words that need to be judged for the similarity of the words are found in the word forest. node, the path length of the two nodes can be used as the semantic distance between the two words (which can also be understood as the word similarity).
  • the intent recognition keyword set includes two intent keywords of child and health, and the word with the closest path length to the child in the word forest is parenting.
  • the intent keyword of child can be converted is the current entity name of parenting; in the same way, the word with the closest path length to health in the word forest is health big data.
  • the intention keyword of health can be converted into health big data.
  • S104 Acquire a corresponding first current associated question set in a local first knowledge graph library according to each current entity name in the current entity name set, and send the first current associated question set to the client; wherein, the The number of association question subsets included in the first current association question set is the same as the number of current entity names included in the current entity name set, and each current entity name corresponds to one of the first current association question sets. Associated problem subsets.
  • the local first knowledge graph library of each current entity name in the current entity name set can be used at this time. Obtain the corresponding first current related question set from , so as to guide users to consult health big data + parenting related questions.
  • the intelligent dialogue robot mainly calls the sub-knowledge map corresponding to the entity of parenting in the first knowledge map database, and also calls the sub-knowledge map corresponding to the entity of health big data in the first knowledge map database, and obtains the above-mentioned sub-knowledge After the graph, the respective associated questions corresponding to the above sub-knowledge graphs are called to form a first first current associated question set, and the first current associated question set is sent to the user terminal.
  • the first question click instruction corresponding to the first current set of related questions sent by the user terminal obtain the first target entity name corresponding to the first question click instruction, and obtain the first target entity name corresponding to the first question click instruction.
  • the first set of associated entity names, according to the name of each first associated entity in the first associated entity name set, the corresponding second current associated problem set is obtained in the local first knowledge graph library, and the second current associated problem set is Send to the user terminal; wherein, the number of association question subsets included in the second current association question set is the same as the number of first association entity names included in the first association entity name set, and each first association The associated entity name corresponds to one of the associated problem subsets in the second current associated problem set.
  • the user terminal can manually click to select one of the most interesting questions to generate the first question Click Command. For example, if a user clicks on a breast-related question under the entity name of health big data (specifically, breast health big data in 2019), the key entity name (mammary gland) in this question is extracted, and then according to the entity name of breast, and The name of the entity (health big data) to which the currently viewing (2019 breast health big data) is associated is related to another entity (breast hyperplasia + female cancer) according to the association relationship between the entities in the first knowledge atlas ), and at the same time show the related questions below these two related entities (how to avoid breast hyperplasia in post-80s women, and what are the diseases with high incidence of female cancer?).
  • entity name of health big data specifically, breast health big data in 2019
  • the name of the entity (health big data) to which the currently viewing (2019 breast health big data) is associated is related to another entity (breast hyperplasia + female cancer) according to the association relationship between
  • the two associated entities in the above example are composed of the first associated entity name set corresponding to the first target entity name. After the first associated entity name set is obtained, the corresponding entity can be obtained in the local first knowledge graph library of the server.
  • the second current associated question set is sent to the user terminal, so as to guide the user to inquire about breast hyperplasia + female cancer-related questions.
  • the user terminal can manually click to select one of the most interesting questions to generate the second question Click Command. For example, if the user finally clicks on the diseases with high incidence of female cancer, according to this related question, it is extracted that the user A has completed the demand stimulation stage. At this time, a "completed scientific knowledge of women's cancer" is output as the first output result.
  • This first data result can be added as a user label to the current user portrait corresponding to the user, or the first data result can be added to the user's current user portrait.
  • the output result is mapped one-to-one with the current user portrait for binding.
  • the server when the server detects another intelligent session connection instruction sent by the client, it may be that the previous user A and the intelligent dialogue robot establish a session again to learn more about the relevant details of the product (that is, the user It has already gone through the demand stimulation stage and entered the concept introduction stage), it is also possible that another user B other than user A has established a conversation with the intelligent dialogue robot for the first time to understand the popular science knowledge of the product (that is, the user has not yet experienced the demand stimulation stage), At this time, the corresponding target user portrait can be obtained according to the connection instruction with another intelligent session. Then, it is determined whether the target user portrait corresponds to a first output result, so as to determine whether the user establishes a conversation with the intelligent dialogue robot for the first time.
  • S108 Determine whether the target user portrait corresponds to the binding of the first output result.
  • judging whether the target user portrait corresponds to the first output result is to judge whether the user establishes a session with the intelligent dialogue robot for the first time, so that the first output result can be called more accurately according to the user's situation in the subsequent steps.
  • the target user portrait is bound with the first output result, which means that the user who has completed at least one communication before and the intelligent dialogue robot establish a conversation again to learn more about the relevant details of the product. Then obtain the set of associated questions from the first knowledge graph library as in the above steps, but obtain the corresponding third current associated question set from the second local knowledge graph library according to the entity name corresponding to the first output result, and set the The third current associated question set is sent to the client.
  • users who have a preliminary understanding of product popularization knowledge can be directed to obtain more professional knowledge from the second knowledge graph library, without the need to understand product popularization knowledge again, thereby improving the efficiency and accuracy of information acquisition by users.
  • step S108 it further includes:
  • the intent recognition keyword set is a step of acquiring the current entity name corresponding to each intent recognition keyword in the intent recognition keyword set, so as to form the current entity name set.
  • the target user portrait is not bound with the first output result, indicating that the user is establishing a session with the intelligent dialogue robot for the first time.
  • the target user portrait can be updated as the current user portrait, and the execution is returned to Step S103. In this way, corresponding information is recommended for users with different knowledge mastery degrees, which can effectively improve the efficiency and accuracy of information acquisition by users.
  • step S109 it further includes:
  • corresponding summary information is obtained based on the first current related question set, the second current related question set, and the third current related question set.
  • the current association problem set, the second current association problem set, and the third current association problem set are obtained by hashing, for example, by using the sha256s algorithm. Uploading summary information to the blockchain ensures its security and fairness and transparency to users.
  • the user equipment can download the summary information from the blockchain, so as to verify whether the first current set of related questions, the second set of current related questions and the third current set of related questions have been tampered with.
  • the blockchain referred to in this example is a new application mode of computer technology such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm.
  • Blockchain essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information to verify its Validity of information (anti-counterfeiting) and generation of the next block.
  • the blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
  • the method realizes that users with different knowledge mastery levels recommend corresponding information according to the corresponding knowledge graph, which can effectively improve the efficiency and accuracy of information acquisition by users.
  • the embodiments of the present application further provide an intelligent replying device based on a multi-knowledge graph, and the multi-knowledge graph-based intelligent replying device is used to execute any of the foregoing embodiments of the multi-knowledge graph-based intelligent replying method.
  • FIG. 3 is a schematic block diagram of an intelligent replying device based on a multi-knowledge graph provided by an embodiment of the present application.
  • the multi-knowledge graph-based intelligent answering device 100 may be configured in a server.
  • the intelligent replying device 100 based on multiple knowledge graphs includes: a knowledge graph database establishment unit 101 , a connection establishment unit 102 , a current entity identification unit 103 , a first question set sending unit 104 , and a second question set sending unit 105 , a result binding unit 106 , a target user portrait acquisition unit 107 , a target user portrait judgment unit 108 , and a third question set sending unit 109 .
  • the knowledge graph library establishing unit 101 is configured to receive and save the first knowledge graph library and the second knowledge graph library uploaded by the business server; wherein, the first knowledge graph library includes a plurality of first sub-knowledge graphs, each A sub-knowledge graph corresponds to several entity names, and there is an association relationship between the entities corresponding to the entity names; the second knowledge graph library includes a plurality of second sub-knowledge graphs, and each second sub-knowledge graph corresponds to several entities name, and there is an association relationship between the entities corresponding to the entity name.
  • the server receives the first knowledge graph library and the second knowledge graph library uploaded by the service server, and stores the user portraits corresponding to multiple users, these data can be used for the client and the intelligent dialogue robot In the established dialogue, recommend questions or knowledge points.
  • the connection establishing unit 102 is configured to establish a connection with the user terminal and acquire the current user portrait corresponding to the user terminal if the intelligent session connection instruction sent by the user terminal is detected.
  • the server After a certain client (for example, denoted as client A) establishes a communication connection with the server and opens an intelligent session, the server establishes a connection with the client, and obtains the current user portrait corresponding to the client .
  • client A for example, denoted as client A
  • the server establishes a connection with the client, and obtains the current user portrait corresponding to the client .
  • the current user portrait can be understood as the basic portrait feature of the user.
  • the process of obtaining the user portrait corresponding to the user terminal is as follows:
  • the user unique identification code corresponding to the user terminal is obtained, and the current user portrait corresponding to the user terminal is obtained from the local user portrait library according to the user unique identification code.
  • the server in order to obtain the user portrait of the client more accurately, after the client establishes a connection with the server, the server obtains the user's unique identification code (such as a phone number, ID number, etc.) corresponding to the client. At the same time, according to the user's unique identification code in the local user portrait library of the server, the current user portrait uniquely corresponding to the user terminal can be obtained.
  • the user's unique identification code such as a phone number, ID number, etc.
  • the current entity recognition unit 103 is configured to receive the dialogue text sent by the user terminal, perform intention recognition on the dialogue text to obtain an intention recognition keyword set, and obtain the current entity corresponding to each intention recognition keyword in the intention recognition keyword set. Entity names to make up the current set of entity names.
  • the user terminal when the user terminal establishes a connection with the server and starts a dialogue with the intelligent dialogue robot, it is necessary to preliminarily analyze the user's intention according to the received dialogue text. Based on the conversation topics that the current user may be generating, extract the entities of the knowledge graph associated in this conversation process (for example, the two entities of child + health). Since it is the first time for the client to establish a session with the intelligent dialogue robot, the sub-knowledge in the demand-stimulation stage can be called in the first knowledge map library based on the core entities extracted from this dialogue flow (such as the child + health in the above example). Atlas.
  • the current entity identification unit 103 includes:
  • a word segmentation unit used to segment the dialogue text through a word segmentation model based on probability statistics to obtain a word segmentation result corresponding to the dialogue text;
  • the keyword extraction unit is used for extracting keywords that do not exceed a preset ranking threshold after the word segmentation results are sorted in descending order of frequency-inverse text frequency index through a word frequency-inverse text frequency index model to form an intent-identifying keyword set.
  • the dialogue text is segmented based on the probability and statistical word segmentation model
  • C C1C2...Cm
  • C is the Chinese character string to be segmented
  • W W1W2...Wn
  • W is the cut
  • Wa, Wb, ..., Wk are all possible segmentation schemes of C.
  • the word segmentation model based on probability statistics can find the target word string W, so that W satisfies: P(W
  • C) MAX(P(Wa
  • the word string W obtained by the above word segmentation model is the word string with the largest estimated probability.
  • the dialogue text can be segmented based on the probability and statistics word segmentation model, so as to obtain a word segmentation result corresponding to the dialogue text.
  • the word frequency-inverse text frequency index model is used to extract the keywords in the word segmentation results.
  • the word frequency-inverse text frequency index model is the TF-IDF model, and TF-IDF is the abbreviation of Term Frequency-Inverse Document Frequency. Extracting the keywords that are located before the preset ranking value after the word segmentation results are sorted in descending order of frequency-inverse text frequency index, to form an intent-identifying keyword set. Through the word frequency-inverse text frequency index model, the core keywords in the text can be accurately extracted without manual reading.
  • the current entity identification unit 103 further includes:
  • a word forest obtaining unit used to call the pre-stored entity name library and thesaurus dictionary, and obtain the word forest corresponding to the thesaurus dictionary
  • the current entity name set obtaining unit is configured to obtain the current entity names corresponding to each intent identification keyword in the entity name database according to the word forest, so as to form a current entity name set.
  • the server locally pre-stores an entity name library composed of entity names in the first knowledge graph library and the sub-knowledge graphs included in the second knowledge graph library, and also stores a database for judging words and A thesaurus dictionary of similarity between words.
  • all the words included in the synonym words are organized in one or several tree structures (this tree or trees are recorded as the word forest), and the two words that need to be judged for the similarity of the words are found in the word forest. node, the path length of the two nodes can be used as the semantic distance between the two words (which can also be understood as the word similarity).
  • the intent recognition keyword set includes two intent keywords of child and health, and the word with the closest path length to the child in the word forest is parenting.
  • the intent keyword of child can be converted is the current entity name of parenting; in the same way, the word with the closest path length to health in the word forest is health big data.
  • the intention keyword of health can be converted into health big data.
  • the first question set sending unit 104 is configured to obtain the corresponding first current associated question set in the local first knowledge graph library according to each current entity name in the current entity name set, and send the first current associated question set to the user terminal; wherein the number of association question subsets included in the first current association question set is the same as the number of current entity names included in the current entity name set, and each current entity name corresponds to the first A subset of contextual questions in a current set of contextual questions.
  • the local first knowledge graph library of each current entity name in the current entity name set can be used at this time. Obtain the corresponding first current related question set from , so as to guide users to consult health big data + parenting related questions.
  • the intelligent dialogue robot mainly calls the sub-knowledge map corresponding to the entity of parenting in the first knowledge map database, and also calls the sub-knowledge map corresponding to the entity of health big data in the first knowledge map database, and obtains the above-mentioned sub-knowledge After the graph, the respective associated questions corresponding to the above sub-knowledge graphs are called to form a first first current associated question set, and the first current associated question set is sent to the user terminal.
  • the second question set sending unit 105 is configured to obtain the name of the first target entity corresponding to the first question click instruction and obtain the the first associated entity name set corresponding to the first target entity name, obtain the corresponding second current associated problem set in the local first knowledge graph library according to the first associated entity name in the first associated entity name set, and set the The second current association question set is sent to the user terminal; wherein the number of association question subsets included in the second current association question set is the same as the number of first association entity names included in the first association entity name set. The numbers are the same, and each first associated entity name corresponds to one of the associated problem subsets in the second current associated problem set.
  • the user terminal can manually click to select one of the most interesting questions to generate the first question Click Command. For example, if a user clicks on a breast-related question under the entity name of health big data (specifically, breast health big data in 2019), the key entity name (mammary gland) in this question is extracted, and then according to the entity name of breast, and The name of the entity (health big data) to which the currently viewing (2019 breast health big data) is associated is related to another entity (breast hyperplasia + female cancer) according to the association relationship between the entities in the first knowledge atlas ), and at the same time show the related questions below these two related entities (how to avoid breast hyperplasia in post-80s women, and what are the diseases with high incidence of female cancer?).
  • entity name of health big data specifically, breast health big data in 2019
  • the name of the entity (health big data) to which the currently viewing (2019 breast health big data) is associated is related to another entity (breast hyperplasia + female cancer) according to the association relationship between
  • the two associated entities in the above example are composed of a first associated entity name set corresponding to the name of the first target entity. After the first associated entity name set is obtained, the corresponding entity can be obtained in the local first knowledge graph library of the server.
  • the second current associated question set is sent to the user terminal, so as to guide the user to inquire about breast hyperplasia + female cancer-related questions.
  • the result binding unit 106 is configured to acquire the second target entity name corresponding to the second question click instruction and add the completion flag as For the first output result, the first output result and the current user portrait are mapped one-to-one to bind.
  • the user terminal can manually click to select one of the most interesting questions to generate the second question Click Command. For example, if the user finally clicks on the diseases with high incidence of female cancer, according to this related question, it is extracted that the user A has completed the demand stimulation stage. At this time, a "completed scientific knowledge of women's cancer" is output as the first output result.
  • This first data result can be added as a user label to the current user portrait corresponding to the user, or the first data result can be added to the user's current user portrait.
  • the output result is mapped one-to-one with the current user portrait for binding.
  • the target user portrait acquisition unit 107 is configured to acquire a target user portrait corresponding to another intelligent conversation connection instruction if another intelligent conversation connection instruction sent by the user terminal is detected.
  • the server when the server detects another intelligent session connection instruction sent by the client, it may be that the previous user A and the intelligent dialogue robot establish a session again to learn more about the relevant details of the product (that is, the user It has already gone through the demand stimulation stage and entered the concept introduction stage), it is also possible that another user B other than user A has established a conversation with the intelligent dialogue robot for the first time to understand the popular science knowledge of the product (that is, the user has not yet experienced the demand stimulation stage), At this time, the corresponding target user portrait can be acquired according to the connection instruction with another intelligent session. Then, it is determined whether the target user portrait corresponds to a first output result, so as to determine whether the user establishes a conversation with the intelligent dialogue robot for the first time.
  • the target user portrait judgment unit 108 is configured to judge whether the target user portrait corresponds to the binding of the first output result.
  • judging whether the target user portrait corresponds to the first output result is to judge whether the user establishes a session with the intelligent dialogue robot for the first time, so that the first output result can be called more accurately according to the user's situation in the subsequent steps.
  • the third question set sending unit 109 is configured to, if the target user portrait is bound with a first output result, obtain a corresponding third current associated question set in the local second knowledge graph library according to the first output result , and send the third current associated question set to the client.
  • the target user portrait is bound with the first output result, which means that the user who has completed at least one communication before and the intelligent dialogue robot establish a conversation again to learn more about the relevant details of the product. Then obtain the set of associated questions from the first knowledge graph library as in the above steps, but obtain the corresponding third current associated question set from the second local knowledge graph library according to the entity name corresponding to the first output result, and set the The third current associated question set is sent to the client.
  • users who have a preliminary understanding of product popularization knowledge can be directed to obtain more professional knowledge from the second knowledge graph library, without the need to understand product popularization knowledge again, thereby improving the efficiency and accuracy of information acquisition by users.
  • the multi-knowledge graph-based intelligent reply apparatus 100 further includes:
  • the return control unit is configured to update the target user portrait as the current user portrait if the target user portrait is not bound with the first output result, return the execution to receive the dialogue text sent by the user terminal, and respond to the dialogue A step of performing intent recognition on the text to obtain an intent recognition keyword set, and acquiring the current entity names corresponding to each intent recognition keyword in the intent recognition keyword set, so as to form a current entity name set.
  • the target user portrait is not bound with the first output result, indicating that the user is establishing a session with the intelligent dialogue robot for the first time.
  • the target user portrait can be updated as the current user portrait, and the execution is returned to Receive the dialogue text sent by the user terminal, perform intention recognition on the dialogue text to obtain an intention recognition keyword set, and obtain the current entity name corresponding to each intention recognition keyword in the intention recognition keyword set, so as to form the current entity name set of steps.
  • corresponding information is recommended for users with different knowledge mastery degrees, which can effectively improve the efficiency and accuracy of information acquisition by users.
  • the intelligent replying device 100 based on the multi-knowledge graph further includes:
  • An uploading unit configured to upload the first current related problem set, the second current related problem set and the third current related problem set to the blockchain.
  • corresponding summary information is obtained based on the first current related question set, the second current related question set, and the third current related question set.
  • the current association problem set, the second current association problem set, and the third current association problem set are obtained by hashing, for example, by using the sha256s algorithm. Uploading summary information to the blockchain ensures its security and fairness and transparency to users.
  • the user equipment can download the summary information from the blockchain, so as to verify whether the first current set of related questions, the second set of current related questions and the third current set of related questions have been tampered with.
  • the blockchain referred to in this example is a new application mode of computer technology such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm.
  • Blockchain essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information to verify its Validity of information (anti-counterfeiting) and generation of the next block.
  • the blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
  • the device implements the recommendation of corresponding information according to the corresponding knowledge graph for users with different knowledge mastery degrees, which can effectively improve the efficiency and accuracy of information acquisition by the users.
  • the above-mentioned multi-knowledge graph-based intelligent replying apparatus can be implemented in the form of a computer program, and the computer program can be executed on a computer device as shown in FIG. 4 .
  • FIG. 4 is a schematic block diagram of a computer device provided by an embodiment of the present application.
  • the computer device 500 is a server, and the server may be an independent server or a server cluster composed of multiple servers.
  • the computer device 500 includes a processor 502 , a memory and a network interface 505 connected through a system bus 501 , wherein the memory may include a non-volatile storage medium 503 and an internal memory 504 .
  • the nonvolatile storage medium 503 can store an operating system 5031 and a computer program 5032 .
  • the processor 502 can execute the intelligent reply method based on the multi-knowledge graph.
  • the processor 502 is used to provide computing and control capabilities to support the operation of the entire computer device 500 .
  • the internal memory 504 provides an environment for running the computer program 5032 in the non-volatile storage medium 503.
  • the processor 502 can execute the intelligent answering method based on the multi-knowledge graph.
  • the network interface 505 is used for network communication, such as providing transmission of data information.
  • the network interface 505 is used for network communication, such as providing transmission of data information.
  • FIG. 4 is only a block diagram of a partial structure related to the solution of the present application, and does not constitute a limitation on the computer device 500 to which the solution of the present application is applied.
  • the specific computer device 500 may include more or fewer components than shown, or combine certain components, or have a different arrangement of components.
  • the processor 502 is configured to run the computer program 5032 stored in the memory, so as to implement the multi-knowledge graph-based intelligent reply method disclosed in the embodiment of the present application.
  • the embodiment of the computer device shown in FIG. 4 does not constitute a limitation on the specific structure of the computer device, and in other embodiments, the computer device may include more or less components than those shown in the figure, Either some components are combined, or different component arrangements.
  • the computer device may only include a memory and a processor. In such an embodiment, the structures and functions of the memory and the processor are the same as those of the embodiment shown in FIG. 4 , and details are not repeated here.
  • the processor 502 may be a central processing unit (Central Processing Unit, CPU), and the processor 502 may also be other general-purpose processors, digital signal processors (Digital Signal Processor, DSP), Application Specific Integrated Circuit (ASIC), Field-Programmable Gate Array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc.
  • the general-purpose processor can be a microprocessor or the processor can also be any conventional processor or the like.
  • a computer-readable storage medium may be a non-volatile computer-readable storage medium, or a volatile computer-readable storage medium.
  • the computer-readable storage medium stores a computer program, wherein when the computer program is executed by the processor, the multi-knowledge graph-based intelligent reply method disclosed in the embodiments of the present application is implemented.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Mathematical Physics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Human Computer Interaction (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A multi-knowledge graph based intelligent reply method and apparatus, a computer device and a storage medium, related to big data and knowledge graphs. The method comprises: if another intelligent session connection instruction sent by a client is detected, obtaining a target user portrait corresponding to another intelligent session connection instruction (S107); determining whether the target user portrait is correspondingly bound with a first output result or not (S108); and if a first output result is bound in the target user portrait, obtaining a corresponding third current association problem set from a local second knowledge graph library according to the first output result, and sending the third current association problem set to the client (S109). According to the method, the association problem set is called in different knowledge graph libraries according to key entity names corresponding to the user portraits, corresponding information is correspondingly recommended to the users with different knowledge mastering degrees, and the information acquisition efficiency and accuracy of the users can be effectively improved.

Description

基于多知识图谱的智能答复方法、装置、计算机设备及存储介质Intelligent reply method, device, computer equipment and storage medium based on multi-knowledge graph
本申请要求于2020年12月25日提交中国专利局、申请号为202011562101.X,发明名称为“基于多知识图谱的智能答复方法、装置及计算机设备的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application filed on December 25, 2020, with the application number of 202011562101.X, and the title of the invention is "Intelligent answering method, device and computer equipment based on multi-knowledge graph, the entire content of which is Incorporated herein by reference.
技术领域technical field
本申请涉及大数据的知识图谱领域,尤其涉及一种基于多知识图谱的智能答复方法、装置、计算机设备及存储介质。The present application relates to the field of knowledge graphs of big data, and in particular, to an intelligent reply method, apparatus, computer equipment and storage medium based on multi-knowledge graphs.
背景技术Background technique
目前,在服务器中所搭载的传统对话机器人一般按应用场景分为闲聊机器人、任务(垂直领域)机器人、问答(QA)机器人。发明人意识到上述常见的对话机器人在与用户进行智能问答的过程中,主要存在关键词匹配准确率较低、语法分析缺乏上下文、端到端生成可控性差、知识图谱仅基于简单事实、语义理解仅局限封闭领域等问题。At present, traditional dialogue robots carried in servers are generally classified into chat robots, task (vertical) robots, and question answering (QA) robots according to application scenarios. The inventor realized that in the process of intelligent question-answering with users, the above-mentioned common dialogue robots mainly have low keyword matching accuracy, lack of context in grammatical analysis, poor controllability of end-to-end generation, knowledge graphs based only on simple facts and semantics. Understanding issues such as confined areas only.
发明内容SUMMARY OF THE INVENTION
本申请实施例提供了一种基于多知识图谱的智能答复方法、装置、计算机设备及存储介质,旨在解决现有技术中对话机器人在与用户进行智能问答的过程中,主要存在关键词匹配准确率较低、语法分析缺乏上下文、语义理解仅局限封闭领域的问题。The embodiments of the present application provide an intelligent answering method, device, computer equipment and storage medium based on multi-knowledge graphs, aiming to solve the problem of accurate keyword matching in the process of intelligent questioning and answering with users in the prior art. Low rate, lack of context for grammatical analysis, and semantic understanding limited to closed domains.
第一方面,本申请实施例提供了一种基于多知识图谱的智能答复方法,其包括:In a first aspect, an embodiment of the present application provides an intelligent reply method based on a multi-knowledge graph, which includes:
若检测到用户端发送的智能会话连接指令,与所述用户端建立连接,并获取与所述用户端对应的当前用户画像;If an intelligent session connection instruction sent by the client is detected, a connection is established with the client, and a current user portrait corresponding to the client is obtained;
接收用户端发送的对话文本,对所述对话文本进行意图识别以得到意图识别关键词集,获取与所述意图识别关键词集中各意图识别关键词分别对应的当前实体名称,以组成当前实体名称集;Receive the dialogue text sent by the user terminal, perform intention recognition on the dialogue text to obtain an intention recognition keyword set, and obtain the current entity name corresponding to each intention recognition keyword in the intention recognition keyword set, so as to form the current entity name set;
根据所述当前实体名称集中各当前实体名称在本地的第一知识图谱库中获取对应的第一当前关联问题集,将所述第一当前关联问题集发送至用户端;其中,所述第一当前关联问题集中包括的关联问题子集的个数与所述当前实体名称集中包括的当前实体名称的个数相同,且每一当前实体名称对应所述第一当前关联问题集中的其中一个关联问题子集;According to the current entity names in the current entity name set, the corresponding first current associated question set is obtained in the local first knowledge graph library, and the first current associated question set is sent to the client; wherein the first current associated question set is The number of association question subsets included in the current association question set is the same as the number of current entity names included in the current entity name set, and each current entity name corresponds to one of the association questions in the first current association question set Subset;
若检测用户端发送的与第一当前关联问题集对应的第一问题点击指令,获取与所述第一问题点击指令对应的第一目标实体名称,获取所述第一目标实体名称对应的第一关联实体名称集,根据所述第一关联实体名称集中各第一关联实体名称在本地的第一知识图谱库中获取对应的第二当前关联问题集,将所述第二当前关联问题集发送至用户端;其中,所述第二当前关联问题集中包括的关联问题子集的个数与所述第一关联实体名称集中包括的第一关联实体名称的个数相同,且每一第一关联实体名称对应所述第二当前关联问题集中的其中一个关 联问题子集;以及If the first question click instruction corresponding to the first current associated question set sent by the client is detected, the first target entity name corresponding to the first question click instruction is obtained, and the first target entity name corresponding to the first target entity name is obtained. A set of associated entity names, obtaining a corresponding second current associated problem set in the local first knowledge graph library according to the names of each first associated entity in the first associated entity name set, and sending the second current associated problem set to The user terminal; wherein, the number of association question subsets included in the second current association question set is the same as the number of first association entity names included in the first association entity name set, and each first association entity a name corresponding to one of the subset of contextual questions in the second current set of contextual questions; and
若检测用户端发送的与第二当前关联问题集对应的第二问题点击指令,获取与所述第二问题点击指令对应的第二目标实体名称增加完成标识以作为第一输出结果,将所述第一输出结果与所述当前用户画像进行一一映射以绑定。If the second question click instruction corresponding to the second current associated question set sent by the user terminal is detected, the second target entity name corresponding to the second question click instruction is acquired and the completion flag is added as the first output result, and the The first output result and the current user portrait are mapped one-to-one for binding.
第二方面,本申请实施例提供了一种基于多知识图谱的智能答复装置,其包括:In a second aspect, an embodiment of the present application provides an intelligent replying device based on a multi-knowledge graph, which includes:
连接建立单元,用于若检测到用户端发送的智能会话连接指令,与所述用户端建立连接,并获取与所述用户端对应的当前用户画像;a connection establishment unit, configured to establish a connection with the user terminal and obtain a current user portrait corresponding to the user terminal if an intelligent session connection instruction sent by the user terminal is detected;
当前实体识别单元,用于接收用户端发送的对话文本,对所述对话文本进行意图识别以得到意图识别关键词集,获取与所述意图识别关键词集中各意图识别关键词分别对应的当前实体名称,以组成当前实体名称集;The current entity recognition unit is used to receive the dialogue text sent by the user terminal, perform intention recognition on the dialogue text to obtain an intention recognition keyword set, and obtain the current entity corresponding to each intention recognition keyword in the intention recognition keyword set. name, to make up the current set of entity names;
第一问题集发送单元,用于根据所述当前实体名称集中各当前实体名称在本地的第一知识图谱库中获取对应的第一当前关联问题集,将所述第一当前关联问题集发送至用户端;其中,所述第一当前关联问题集中包括的关联问题子集的个数与所述当前实体名称集中包括的当前实体名称的个数相同,且每一当前实体名称对应所述第一当前关联问题集中的其中一个关联问题子集;The first question set sending unit is configured to obtain the corresponding first current associated question set in the local first knowledge graph library according to each current entity name in the current entity name set, and send the first current associated question set to User terminal; wherein, the number of association question subsets included in the first current association question set is the same as the number of current entity names included in the current entity name set, and each current entity name corresponds to the first one of the subset of related questions in the current set of related questions;
第二问题集发送单元,用于若检测用户端发送的与第一当前关联问题集对应的第一问题点击指令,获取与所述第一问题点击指令对应的第一目标实体名称,获取所述第一目标实体名称对应的第一关联实体名称集,根据所述第一关联实体名称集中各第一关联实体名称在本地的第一知识图谱库中获取对应的第二当前关联问题集,将所述第二当前关联问题集发送至用户端;其中,所述第二当前关联问题集中包括的关联问题子集的个数与所述第一关联实体名称集中包括的第一关联实体名称的个数相同,且每一第一关联实体名称对应所述第二当前关联问题集中的其中一个关联问题子集;以及The second question set sending unit is configured to acquire the name of the first target entity corresponding to the first question click instruction and obtain the The first associated entity name set corresponding to the first target entity name, according to each first associated entity name in the first associated entity name set, obtain the corresponding second current associated problem set in the local first knowledge graph library, The second current associated question set is sent to the client; wherein, the number of associated question subsets included in the second current associated question set and the number of first associated entity names included in the first associated entity name set are the same, and each first associated entity name corresponds to one of the associated problem subsets in the second current associated problem set; and
结果绑定单元,用于若检测用户端发送的与第二当前关联问题集对应的第二问题点击指令,获取与所述第二问题点击指令对应的第二目标实体名称增加完成标识以作为第一输出结果,将所述第一输出结果与所述当前用户画像进行一一映射以绑定。The result binding unit is configured to acquire the second target entity name addition completion identifier corresponding to the second question click instruction corresponding to the second question click instruction as the first an output result, the first output result and the current user portrait are mapped one-to-one to bind.
第三方面,本申请实施例又提供了一种计算机设备,其包括存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现以下步骤:In a third aspect, an embodiment of the present application further provides a computer device, which includes a memory, a processor, and a computer program stored on the memory and executable on the processor, where the processor executes the computer The program implements the following steps:
若检测到用户端发送的智能会话连接指令,与所述用户端建立连接,并获取与所述用户端对应的当前用户画像;If an intelligent session connection instruction sent by the client is detected, a connection is established with the client, and a current user portrait corresponding to the client is obtained;
接收用户端发送的对话文本,对所述对话文本进行意图识别以得到意图识别关键词集,获取与所述意图识别关键词集中各意图识别关键词分别对应的当前实体名称,以组成当前实体名称集;Receive the dialogue text sent by the user terminal, perform intention recognition on the dialogue text to obtain an intention recognition keyword set, and obtain the current entity name corresponding to each intention recognition keyword in the intention recognition keyword set, so as to form the current entity name set;
根据所述当前实体名称集中各当前实体名称在本地的第一知识图谱库中获取对应的第一当前关联问题集,将所述第一当前关联问题集发送至用户端;其中,所述第一当前关联问题集中包括的关联问题子集的个数与所述当前实体名称集中包括的当前实体名称的个数相同, 且每一当前实体名称对应所述第一当前关联问题集中的其中一个关联问题子集;According to the current entity names in the current entity name set, the corresponding first current associated question set is obtained in the local first knowledge graph library, and the first current associated question set is sent to the client; wherein the first current associated question set is The number of association question subsets included in the current association question set is the same as the number of current entity names included in the current entity name set, and each current entity name corresponds to one of the association questions in the first current association question set Subset;
若检测用户端发送的与第一当前关联问题集对应的第一问题点击指令,获取与所述第一问题点击指令对应的第一目标实体名称,获取所述第一目标实体名称对应的第一关联实体名称集,根据所述第一关联实体名称集中各第一关联实体名称在本地的第一知识图谱库中获取对应的第二当前关联问题集,将所述第二当前关联问题集发送至用户端;其中,所述第二当前关联问题集中包括的关联问题子集的个数与所述第一关联实体名称集中包括的第一关联实体名称的个数相同,且每一第一关联实体名称对应所述第二当前关联问题集中的其中一个关联问题子集;以及If the first question click instruction corresponding to the first current associated question set sent by the client is detected, the first target entity name corresponding to the first question click instruction is obtained, and the first target entity name corresponding to the first target entity name is obtained. A set of associated entity names, obtaining a corresponding second current associated problem set in the local first knowledge graph library according to the names of each first associated entity in the first associated entity name set, and sending the second current associated problem set to The user terminal; wherein, the number of association question subsets included in the second current association question set is the same as the number of first association entity names included in the first association entity name set, and each first association entity a name corresponding to one of the subset of contextual questions in the second current set of contextual questions; and
若检测用户端发送的与第二当前关联问题集对应的第二问题点击指令,获取与所述第二问题点击指令对应的第二目标实体名称增加完成标识以作为第一输出结果,将所述第一输出结果与所述当前用户画像进行一一映射以绑定。If the second question click instruction corresponding to the second current associated question set sent by the user terminal is detected, the second target entity name corresponding to the second question click instruction is acquired and the completion flag is added as the first output result, and the The first output result and the current user portrait are mapped one-to-one for binding.
第四方面,本申请实施例还提供了一种计算机可读存储介质,其中所述计算机可读存储介质存储有计算机程序,所述计算机程序当被处理器执行时使所述处理器执行以下操作:In a fourth aspect, embodiments of the present application further provide a computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, and when executed by a processor, the computer program causes the processor to perform the following operations :
若检测到用户端发送的智能会话连接指令,与所述用户端建立连接,并获取与所述用户端对应的当前用户画像;If an intelligent session connection instruction sent by the client is detected, a connection is established with the client, and a current user portrait corresponding to the client is obtained;
接收用户端发送的对话文本,对所述对话文本进行意图识别以得到意图识别关键词集,获取与所述意图识别关键词集中各意图识别关键词分别对应的当前实体名称,以组成当前实体名称集;Receive the dialogue text sent by the user terminal, perform intention recognition on the dialogue text to obtain an intention recognition keyword set, and obtain the current entity name corresponding to each intention recognition keyword in the intention recognition keyword set, so as to form the current entity name set;
根据所述当前实体名称集中各当前实体名称在本地的第一知识图谱库中获取对应的第一当前关联问题集,将所述第一当前关联问题集发送至用户端;其中,所述第一当前关联问题集中包括的关联问题子集的个数与所述当前实体名称集中包括的当前实体名称的个数相同,且每一当前实体名称对应所述第一当前关联问题集中的其中一个关联问题子集;According to the current entity names in the current entity name set, the corresponding first current associated question set is obtained in the local first knowledge graph library, and the first current associated question set is sent to the client; wherein the first current associated question set is The number of association question subsets included in the current association question set is the same as the number of current entity names included in the current entity name set, and each current entity name corresponds to one of the association questions in the first current association question set Subset;
若检测用户端发送的与第一当前关联问题集对应的第一问题点击指令,获取与所述第一问题点击指令对应的第一目标实体名称,获取所述第一目标实体名称对应的第一关联实体名称集,根据所述第一关联实体名称集中各第一关联实体名称在本地的第一知识图谱库中获取对应的第二当前关联问题集,将所述第二当前关联问题集发送至用户端;其中,所述第二当前关联问题集中包括的关联问题子集的个数与所述第一关联实体名称集中包括的第一关联实体名称的个数相同,且每一第一关联实体名称对应所述第二当前关联问题集中的其中一个关联问题子集;以及If the first question click instruction corresponding to the first current associated question set sent by the client is detected, the first target entity name corresponding to the first question click instruction is obtained, and the first target entity name corresponding to the first target entity name is obtained. A set of associated entity names, obtaining a corresponding second current associated problem set in the local first knowledge graph library according to the names of each first associated entity in the first associated entity name set, and sending the second current associated problem set to The user terminal; wherein, the number of association question subsets included in the second current association question set is the same as the number of first association entity names included in the first association entity name set, and each first association entity a name corresponding to one of the subset of contextual questions in the second current set of contextual questions; and
若检测用户端发送的与第二当前关联问题集对应的第二问题点击指令,获取与所述第二问题点击指令对应的第二目标实体名称增加完成标识以作为第一输出结果,将所述第一输出结果与所述当前用户画像进行一一映射以绑定。If the second question click instruction corresponding to the second current associated question set sent by the user terminal is detected, the second target entity name corresponding to the second question click instruction is acquired and the completion flag is added as the first output result, and the The first output result and the current user portrait are mapped one-to-one for binding.
本申请实施例提供了一种基于多知识图谱的智能答复方法、装置、计算机设备及存储介质,判断用户是否初次与智能对话机器人建立会话,从而从不同的知识图谱库中调用关联问题集,对于不同知识掌握程度的用户对应推荐相应信息,能有效提高用户获取信息的效率和准确率。The embodiments of the present application provide an intelligent answering method, device, computer equipment and storage medium based on multiple knowledge graphs, to determine whether a user establishes a session with an intelligent dialogue robot for the first time, so as to call associated question sets from different knowledge graph libraries. Users with different levels of knowledge can recommend corresponding information accordingly, which can effectively improve the efficiency and accuracy of information acquisition by users.
附图说明Description of drawings
为了更清楚地说明本申请实施例技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to explain the technical solutions of the embodiments of the present application more clearly, the following briefly introduces the accompanying drawings used in the description of the embodiments. For those of ordinary skill, other drawings can also be obtained from these drawings without any creative effort.
图1为本申请实施例提供的基于多知识图谱的智能答复方法的应用场景示意图;1 is a schematic diagram of an application scenario of an intelligent reply method based on a multi-knowledge graph provided by an embodiment of the present application;
图2为本申请实施例提供的基于多知识图谱的智能答复方法的流程示意图;FIG. 2 is a schematic flowchart of an intelligent reply method based on a multi-knowledge graph provided by an embodiment of the present application;
图3为本申请实施例提供的基于多知识图谱的智能答复装置的示意性框图;3 is a schematic block diagram of an intelligent replying device based on a multi-knowledge graph provided by an embodiment of the present application;
图4为本申请实施例提供的计算机设备的示意性框图。FIG. 4 is a schematic block diagram of a computer device provided by an embodiment of the present application.
具体实施方式Detailed ways
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, not all of the embodiments. Based on the embodiments in the present application, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present application.
应当理解,当在本说明书和所附权利要求书中使用时,术语“包括”和“包含”指示所描述特征、整体、步骤、操作、元素和/或组件的存在,但并不排除一个或多个其它特征、整体、步骤、操作、元素、组件和/或其集合的存在或添加。It is to be understood that, when used in this specification and the appended claims, the terms "comprising" and "comprising" indicate the presence of the described features, integers, steps, operations, elements and/or components, but do not exclude one or The presence or addition of a number of other features, integers, steps, operations, elements, components, and/or sets thereof.
还应当理解,在此本申请说明书中所使用的术语仅仅是出于描述特定实施例的目的而并不意在限制本申请。如在本申请说明书和所附权利要求书中所使用的那样,除非上下文清楚地指明其它情况,否则单数形式的“一”、“一个”及“该”意在包括复数形式。It should also be understood that the terminology used in the specification of the application herein is for the purpose of describing particular embodiments only and is not intended to limit the application. As used in this specification and the appended claims, the singular forms "a," "an," and "the" are intended to include the plural unless the context clearly dictates otherwise.
还应当进一步理解,在本申请说明书和所附权利要求书中使用的术语“和/或”是指相关联列出的项中的一个或多个的任何组合以及所有可能组合,并且包括这些组合。It should also be further understood that, as used in this specification and the appended claims, the term "and/or" refers to and including any and all possible combinations of one or more of the associated listed items .
请参阅图1和图2,图1为本申请实施例提供的基于多知识图谱的智能答复方法的应用场景示意图;图2为本申请实施例提供的基于多知识图谱的智能答复方法的流程示意图,该基于多知识图谱的智能答复方法应用于服务器中,该方法通过安装于服务器中的应用软件进行执行。Please refer to FIG. 1 and FIG. 2. FIG. 1 is a schematic diagram of an application scenario of the multi-knowledge graph-based intelligent reply method provided by the embodiment of the application; FIG. 2 is a schematic flowchart of the multi-knowledge graph-based intelligent reply method provided by the embodiment of the application. , the multi-knowledge graph-based intelligent reply method is applied in the server, and the method is executed by the application software installed in the server.
如图2所示,该方法包括步骤S101~S109。As shown in FIG. 2, the method includes steps S101-S109.
S101、接收并保存业务服务器上传的第一知识图谱库和第二知识图谱库;其中,所述第一知识图谱库中包括多个第一子知识图谱,每一第一子知识图谱对应若干个实体名称,且实体名称对应的实体之间存在关联关系;所述第二知识图谱库中包括多个第二子知识图谱,每一第二子知识图谱对应若干个实体名称,且实体名称对应的实体之间存在关联关系。S101. Receive and save a first knowledge graph library and a second knowledge graph library uploaded by a business server; wherein, the first knowledge graph library includes a plurality of first sub-knowledge graphs, and each first sub-knowledge graph corresponds to several entity name, and there is an association relationship between entities corresponding to the entity name; the second knowledge graph library includes a plurality of second sub-knowledge graphs, each second sub-knowledge graph corresponds to several entity names, and the entity name corresponds to There are associations between entities.
在本实施例中,为了更清楚的理解本申请的技术方案,下面对所涉及到的终端进行详细介绍。其中,本申请是在服务器的角度描述技术方案。In this embodiment, in order to understand the technical solutions of the present application more clearly, the involved terminals are introduced in detail below. Among them, the present application describes the technical solution from the perspective of the server.
一是业务服务器,在业务服务器中,研发人员可以将预先构建的第一知识图谱库和第二知识图谱库上传至服务器。一旦上述知识图谱库存在更新的情况,直接由业务服务器上传至 服务器进行更新即可。例如,以用户从服务器中获取产品知识或信息为例(产品可以是保险产品、理财产品、电子产品、体育用具产品等),第一知识图谱库中包括的多个第一子知识图谱对应是由用户针对一些产品的第一了解阶段(可以理解为需求激发阶段,用户更注重产品科普知识的获取)所关注问题的答复文本的集合,在第一知识图谱库中包括的多个第一子知识图谱是以产品相关的科普介绍为主;第二知识图谱库中包括的多个第二子知识图谱对应是由用户针对一些产品的第二了解阶段(可以理解为观念导入阶段,用户更注重产品的细节参数的获取)所关注问题的答复文本的集合,在第二知识图谱库中包括的多个第二子知识图谱是以产品相关的细节参数。One is the business server. In the business server, the R&D personnel can upload the pre-built first knowledge graph library and the second knowledge graph library to the server. Once the above-mentioned knowledge graph library is updated, it can be directly uploaded from the business server to the server for updating. For example, taking the user acquiring product knowledge or information from the server as an example (products can be insurance products, wealth management products, electronic products, sports equipment products, etc.), the multiple first sub-knowledge maps included in the first knowledge map database correspond to A collection of answer texts for questions concerned by users in the first understanding stage of some products (which can be understood as the demand stimulation stage, where users pay more attention to the acquisition of product popularization knowledge), and multiple first subsections included in the first knowledge graph library. The knowledge graph is mainly based on product-related popular science introduction; the multiple second sub-knowledge graphs included in the second knowledge graph base correspond to the second understanding stage of some products by the user (it can be understood as the concept introduction stage, users pay more attention to The acquisition of detailed parameters of the product) is a collection of answer texts for the concerned question, and the plurality of second sub-knowledge graphs included in the second knowledge graph base are the detailed parameters related to the product.
二是服务器,通过服务器可以存储由业务服务器上传的第一知识图谱库和第二知识图谱库,而且还存储了多个用户对应的用户画像,这些数据均作为服务器中部署的智能对话机器人的后台数据。其中用户画像中的标签关键词和上述两个知识图谱库中的实体名称存在的关系包括完全不相同、近似、及相同的关系。通过与用户对应的标签关键词可以在第一知识图谱库或第二知识图谱库中获取对应的关联问题集并发送至用户端。The second is the server. The server can store the first knowledge graph library and the second knowledge graph library uploaded by the business server, and also store the user portraits corresponding to multiple users. These data are used as the background of the intelligent dialogue robot deployed in the server. data. The relationship between the tag keywords in the user portrait and the entity names in the above two knowledge graph bases includes completely different, similar, and identical relationships. A corresponding set of associated questions can be obtained from the first knowledge graph library or the second knowledge graph library through the tag keywords corresponding to the user and sent to the user terminal.
三是用户端,与服务器通讯连接的用户端可以是一个也可以是多个,每一用户端在服务器中存储有对应的用户画像。若某一用户端与服务器中的智能对话机器人建立对话时,在服务器中可以获取到该用户端使用者的用户画像。The third is the client. There may be one or more clients connected to the server. Each client stores a corresponding user portrait in the server. If a certain client establishes a dialogue with the intelligent dialogue robot in the server, the user portrait of the user of the client can be obtained in the server.
当服务器接收了由业务服务器上传的第一知识图谱库和第二知识图谱库,以及存储了多个用户对应的用户画像后,这些数据可以用于用户端与智能对话机器人建立的对话中,进行问题或知识点的推荐。After the server receives the first knowledge graph library and the second knowledge graph library uploaded by the business server, and stores the user portraits corresponding to multiple users, these data can be used in the dialog established between the user end and the intelligent dialog robot to conduct Recommendations for questions or knowledge points.
S102、若检测到用户端发送的智能会话连接指令,与所述用户端建立连接,并获取与所述用户端对应的当前用户画像。S102. If an intelligent session connection instruction sent by the user terminal is detected, establish a connection with the user terminal, and acquire a current user portrait corresponding to the user terminal.
在本实施例中,当某一用户端(例如记为用户端A)与服务器建立通讯连接并开启智能会话后,服务器与该用户端建立连接,并获取与所述用户端对应的当前用户画像。其中,当前用户画像可以理解为该用户的基础画像特征。In this embodiment, after a certain client (for example, denoted as client A) establishes a communication connection with the server and opens an intelligent session, the server establishes a connection with the client, and obtains the current user portrait corresponding to the client . Among them, the current user portrait can be understood as the basic portrait feature of the user.
具体实施时,在获取用户端对应的用户画像的过程如下:During specific implementation, the process of obtaining the user portrait corresponding to the user terminal is as follows:
获取所述用户端对应的用户唯一识别码,根据所述用户唯一识别码在本地的用户画像库中获取与所述用户端对应的当前用户画像。The user unique identification code corresponding to the user terminal is obtained, and the current user portrait corresponding to the user terminal is obtained from the local user portrait library according to the user unique identification code.
在本实施例中,为了更准确的获取用户端的用户画像,是在用户端与服务器建立连接后,由服务器获取该用户端对应的用户唯一识别码(如电话号码、身份证号等),此时根据所述用户唯一识别码在服务器本地的用户画像库中,可以获取到与所述用户端唯一对应的当前用户画像。In this embodiment, in order to obtain the user portrait of the client more accurately, after the client establishes a connection with the server, the server obtains the user's unique identification code (such as a phone number, ID number, etc.) corresponding to the client. At the same time, according to the user's unique identification code in the local user portrait library of the server, the current user portrait uniquely corresponding to the user terminal can be obtained.
S103、接收用户端发送的对话文本,对所述对话文本进行意图识别以得到意图识别关键词集,获取与所述意图识别关键词集中各意图识别关键词分别对应的当前实体名称,以组成当前实体名称集。S103. Receive the dialogue text sent by the user terminal, perform intention recognition on the dialogue text to obtain an intention recognition keyword set, and acquire the current entity names corresponding to each intention recognition keyword in the intention recognition keyword set, so as to form a current entity name. Entity name set.
在本实施例中,当用户端与服务器建立连接并与开启了与智能对话机器人之间的对话时,需根据所接收的对话文本初步分析用户意图。基于当前用户可能正在产生的对话主题,抽取 本次对话过程中关联到的知识图谱的实体(例如孩子+健康这两个实体)。由于此时用户端是初次与智能对话机器人建立会话,此时可以基于本次对话流抽取的核心实体(如上述举例的孩子+健康),在第一知识图谱库中调用需求激发阶段的子知识图谱。In this embodiment, when the user terminal establishes a connection with the server and starts a dialogue with the intelligent dialogue robot, it is necessary to preliminarily analyze the user's intention according to the received dialogue text. Based on the conversation topics that the current user may be generating, extract the entities of the knowledge graph associated with this conversation process (such as the two entities of child + health). Since it is the first time for the client to establish a session with the intelligent dialogue robot, the sub-knowledge in the demand-stimulation stage can be called in the first knowledge map library based on the core entities extracted from this dialogue flow (such as the child + health in the above example). Atlas.
在一实施例中,步骤S103中对所述对话文本进行意图识别以得到意图识别关键词集的步骤包括:In one embodiment, the step of performing intent recognition on the dialogue text in step S103 to obtain an intent recognition keyword set includes:
将所述对话文本通过基于概率统计分词模型进行分词,得到与所述对话文本对应的分词结果;The dialogue text is subjected to word segmentation based on a probabilistic and statistical word segmentation model, and a word segmentation result corresponding to the dialogue text is obtained;
通过词频-逆文本频率指数模型,抽取所述分词结果按频-逆文本频率指数降序排序后未超出预设的排名阈值的关键词,以组成意图识别关键词集。Through the word frequency-inverse text frequency index model, keywords that do not exceed the preset ranking threshold after the word segmentation results are sorted in descending order by the frequency-inverse text frequency index are extracted to form an intent identification keyword set.
在本实施例中,通过基于概率统计分词模型对对话文本进行分词时,例如令C=C1C2...Cm,C是待切分的汉字串,令W=W1W2...Wn,W是切分的结果,Wa,Wb,……,Wk是C的所有可能的切分方案。那么,基于概率统计分词模型就是能够找到目的词串W,使得W满足:P(W|C)=MAX(P(Wa|C),P(Wb|C)...P(Wk|C))的分词模型,上述分词模型得到的词串W即估计概率为最大之词串。通过基于概率统计分词模型即可对所述对话文本进行分词,从而得到与所述对话文本对应的分词结果。In this embodiment, when the dialogue text is segmented based on the probability and statistical word segmentation model, for example, let C=C1C2...Cm, C is the Chinese character string to be segmented, and W=W1W2...Wn, W is the cut As a result of the segmentation, Wa, Wb, ..., Wk are all possible segmentation schemes of C. Then, the word segmentation model based on probability statistics can find the target word string W, so that W satisfies: P(W|C)=MAX(P(Wa|C), P(Wb|C)...P(Wk|C) ) word segmentation model, the word string W obtained by the above word segmentation model is the word string with the largest estimated probability. The dialogue text can be segmented based on the probability and statistics word segmentation model, so as to obtain a word segmentation result corresponding to the dialogue text.
之后使用词频-逆文本频率指数模型抽取分词结果中的关键词,其中词频-逆文本频率指数模型即TF-IDF模型,TF-IDF是Term Frequency–Inverse Document Frequency的简写。抽取所述分词结果按频-逆文本频率指数降序排序后位于预设的排名值之前的关键词,以组成意图识别关键词集。通过词频-逆文本频率指数模型能准确提取文本中的核心关键词,无需人工阅读后提取。After that, the word frequency-inverse text frequency index model is used to extract the keywords in the word segmentation results. The word frequency-inverse text frequency index model is the TF-IDF model, and TF-IDF is the abbreviation of Term Frequency-Inverse Document Frequency. Extracting the keywords that are located before the preset ranking value after the word segmentation results are sorted in descending order of frequency-inverse text frequency index, to form an intent-identifying keyword set. The word frequency-inverse text frequency index model can accurately extract the core keywords in the text without manual reading.
在一实施例中,步骤S103中获取与所述意图识别关键词集中各意图识别关键词分别对应的当前实体名称,以组成当前实体名称集的步骤包括:In one embodiment, in step S103, the step of obtaining the current entity names corresponding to each intent identification keyword in the intent identification keyword set, so as to form the current entity name set, includes:
调用预先存储的实体名称库和同义词词典,并获取与所述同义词词典对应的词林;Call the pre-stored entity name library and thesaurus dictionary, and obtain the word forest corresponding to the thesaurus dictionary;
根据所述词林在所述实体名称库中获取各意图识别关键词分别对应的当前实体名称,以组成当前实体名称集。According to the word forest, the current entity names corresponding to each intent identification keyword are obtained from the entity name database, so as to form a current entity name set.
在本实施例中,由于服务器的本地预先存储了由第一知识图谱库和第二知识图谱库中包括的个子知识图谱中的实体名称组成的实体名称库,而且还存储了用于判断词与词之间的相似度的同义词词典。其中,同义词词中包括的所有词组织在一棵或几棵树结构中(这一棵树或多棵树记为词林),在词林中找到需判断词语相似度的两个词语分别对应的节点,两个节点的路径长度即可作为两个词语之间的语义距离(也可以理解为词语相似度)。In this embodiment, because the server locally pre-stores an entity name library composed of entity names in the first knowledge graph library and the sub-knowledge graphs included in the second knowledge graph library, and also stores a database for judging words and A thesaurus dictionary of similarity between words. Among them, all the words included in the synonym words are organized in one or several tree structures (this tree or trees are recorded as the word forest), and the two words that need to be judged for the similarity of the words are found in the word forest. node, the path length of the two nodes can be used as the semantic distance between the two words (which can also be understood as the word similarity).
例如,所述意图识别关键词集中包括小孩和健康这两个意图关键词,在词林中获取到与小孩之间的路径长度最近的词语是育儿,此时可以将小孩这一意图关键词转换为育儿这一当前实体名称;同理,在词林中获取到与健康之间的路径长度最近的词语是健康大数据,此时可以将健康这一意图关键词转换为健康大数据。For example, the intent recognition keyword set includes two intent keywords of child and health, and the word with the closest path length to the child in the word forest is parenting. At this time, the intent keyword of child can be converted is the current entity name of parenting; in the same way, the word with the closest path length to health in the word forest is health big data. At this time, the intention keyword of health can be converted into health big data.
通过上述同义词的转换关系,确保了能根据用户的对话能在知识图谱库中检索到对应的实体名称,从而可准确的向用户推送信息。Through the conversion relationship of the above synonyms, it is ensured that the corresponding entity name can be retrieved in the knowledge graph database according to the user's dialogue, so that information can be accurately pushed to the user.
S104、根据所述当前实体名称集中各当前实体名称在本地的第一知识图谱库中获取对应的第一当前关联问题集,将所述第一当前关联问题集发送至用户端;其中,所述第一当前关联问题集中包括的关联问题子集的个数与所述当前实体名称集中包括的当前实体名称的个数相同,且每一当前实体名称对应所述第一当前关联问题集中的其中一个关联问题子集。S104. Acquire a corresponding first current associated question set in a local first knowledge graph library according to each current entity name in the current entity name set, and send the first current associated question set to the client; wherein, the The number of association question subsets included in the first current association question set is the same as the number of current entity names included in the current entity name set, and each current entity name corresponds to one of the first current association question sets. Associated problem subsets.
在本实施例中,当获取了用户意图对应的当前实体名称集后(例如上述举例的育儿+健康大数据),此时可以根据当前实体名称集中各当前实体名称在本地的第一知识图谱库中获取对应的第一当前关联问题集,以引导用户去咨询健康大数据+育儿相关的问题。智能对话机器人此时主要调用育儿这一实体在第一知识图谱库中对应的子知识图谱,还调用健康大数据这一实体在第一知识图谱库中对应的子知识图谱,获取了上述子知识图谱后调用与上述子知识图谱的分别对应的关联问题,以组成第一第一当前关联问题集,将所述第一当前关联问题集发送至用户端。In this embodiment, after the current entity name set corresponding to the user's intention is obtained (such as the parenting + health big data in the above example), the local first knowledge graph library of each current entity name in the current entity name set can be used at this time. Obtain the corresponding first current related question set from , so as to guide users to consult health big data + parenting related questions. At this time, the intelligent dialogue robot mainly calls the sub-knowledge map corresponding to the entity of parenting in the first knowledge map database, and also calls the sub-knowledge map corresponding to the entity of health big data in the first knowledge map database, and obtains the above-mentioned sub-knowledge After the graph, the respective associated questions corresponding to the above sub-knowledge graphs are called to form a first first current associated question set, and the first current associated question set is sent to the user terminal.
S105、若检测用户端发送的与第一当前关联问题集对应的第一问题点击指令,获取与所述第一问题点击指令对应的第一目标实体名称,获取所述第一目标实体名称对应的第一关联实体名称集,根据所述第一关联实体名称集中各第一关联实体名称在本地的第一知识图谱库中获取对应的第二当前关联问题集,将所述第二当前关联问题集发送至用户端;其中,所述第二当前关联问题集中包括的关联问题子集的个数与所述第一关联实体名称集中包括的第一关联实体名称的个数相同,且每一第一关联实体名称对应所述第二当前关联问题集中的其中一个关联问题子集。S105. If detecting the first question click instruction corresponding to the first current set of related questions sent by the user terminal, obtain the first target entity name corresponding to the first question click instruction, and obtain the first target entity name corresponding to the first question click instruction. The first set of associated entity names, according to the name of each first associated entity in the first associated entity name set, the corresponding second current associated problem set is obtained in the local first knowledge graph library, and the second current associated problem set is Send to the user terminal; wherein, the number of association question subsets included in the second current association question set is the same as the number of first association entity names included in the first association entity name set, and each first association The associated entity name corresponds to one of the associated problem subsets in the second current associated problem set.
在本实施例中,当用户端在与智能对话机器人之间的对话框中查看了服务器所发送的第一当前关联问题集后,可以手动点击选择其中一个最感兴趣的问题以生成第一问题点击指令。例如用户点击了健康大数据这个实体名称下面的乳腺相关问题(具体如2019年乳腺健康大数据),则抽取到了这个问题中的关键实体名称(乳腺),接下来会根据乳腺这个实体名称,以及当前正在查看的(2019年乳腺健康大数据)所关联到的(健康大数据)这个实体名称,根据第一知识图谱库中实体之间的关联关系而关联到另外的实体(乳腺增生+女性癌症),同时展示以这两个关联实体下面的相关问题(80后女性如何避免得乳腺增生,女性癌症高发疾病有哪些?)。In this embodiment, after viewing the first current associated question set sent by the server in the dialog box with the intelligent dialogue robot, the user terminal can manually click to select one of the most interesting questions to generate the first question Click Command. For example, if a user clicks on a breast-related question under the entity name of health big data (specifically, breast health big data in 2019), the key entity name (mammary gland) in this question is extracted, and then according to the entity name of breast, and The name of the entity (health big data) to which the currently viewing (2019 breast health big data) is associated is related to another entity (breast hyperplasia + female cancer) according to the association relationship between the entities in the first knowledge atlas ), and at the same time show the related questions below these two related entities (how to avoid breast hyperplasia in post-80s women, and what are the diseases with high incidence of female cancer?).
上述举例的两个关联实体组成的是与所述第一目标实体名称对应的第一关联实体名称集,在获取了第一关联实体名称集后可以在服务器本地的第一知识图谱库中获取对应的第二当前关联问题集,将所述第二当前关联问题集发送至用户端,以引导用户去咨询乳腺增生+女性癌症相关的问题。The two associated entities in the above example are composed of the first associated entity name set corresponding to the first target entity name. After the first associated entity name set is obtained, the corresponding entity can be obtained in the local first knowledge graph library of the server. The second current associated question set is sent to the user terminal, so as to guide the user to inquire about breast hyperplasia + female cancer-related questions.
S106、若检测用户端发送的与第二当前关联问题集对应的第二问题点击指令,获取与所述第二问题点击指令对应的第二目标实体名称增加完成标识以作为第一输出结果,将所述第一输出结果与所述当前用户画像进行一一映射以绑定。S106. If the second question click instruction corresponding to the second current associated question set sent by the user terminal is detected, and the second target entity name addition completion identifier corresponding to the second question click instruction is obtained as the first output result, and the The first output result and the current user portrait are mapped one-to-one for binding.
在本实施例中,当用户端在与智能对话机器人之间的对话框中查看了服务器所发送的第二当前关联问题集后,可以手动点击选择其中一个最感兴趣的问题以生成第二问题点击指令。例如,用户最终点击了女性癌症高发疾病有哪些,则根据这个相关问题,抽取到该用户A已 经完成了需求激发阶段。此时输出一个“已完成女性癌症的科普知识了解”作为第一输出结果,这一第一数据结果可以作为一个用户标签增加至该用户对应的当前用户画像中,或者是可以将所述第一输出结果与所述当前用户画像进行一一映射以绑定。In this embodiment, after viewing the second current associated question set sent by the server in the dialog box with the intelligent dialogue robot, the user terminal can manually click to select one of the most interesting questions to generate the second question Click Command. For example, if the user finally clicks on the diseases with high incidence of female cancer, according to this related question, it is extracted that the user A has completed the demand stimulation stage. At this time, a "completed scientific knowledge of women's cancer" is output as the first output result. This first data result can be added as a user label to the current user portrait corresponding to the user, or the first data result can be added to the user's current user portrait. The output result is mapped one-to-one with the current user portrait for binding.
S107、若检测到用户端发送的另一智能会话连接指令,获取与另一智能会话连接指令对应的目标用户画像。S107. If another intelligent session connection instruction sent by the user terminal is detected, acquire a target user portrait corresponding to the other intelligent session connection instruction.
在本实施例中,当服务器检测到用户端发送的另一智能会话连接指令,此时可能是之前的用户A与智能对话机器人再次建立会话以更多的了解产品的相关细节(也就是该用户已经经历过需求激发阶段而进入观念导入阶段),也有可能是非用户A的另一用户B与智能对话机器人初次建立会话以了解产品的科普知识(也即该用户还未经历过需求激发阶段),此时可以根据与另一智能会话连接指令获取对应的目标用户画像。之后判断该目标用户画像是否对应一个第一输出结果,从而实现判断用户是否初次与智能对话机器人建立会话。In this embodiment, when the server detects another intelligent session connection instruction sent by the client, it may be that the previous user A and the intelligent dialogue robot establish a session again to learn more about the relevant details of the product (that is, the user It has already gone through the demand stimulation stage and entered the concept introduction stage), it is also possible that another user B other than user A has established a conversation with the intelligent dialogue robot for the first time to understand the popular science knowledge of the product (that is, the user has not yet experienced the demand stimulation stage), At this time, the corresponding target user portrait can be obtained according to the connection instruction with another intelligent session. Then, it is determined whether the target user portrait corresponds to a first output result, so as to determine whether the user establishes a conversation with the intelligent dialogue robot for the first time.
S108、判断所述目标用户画像中是否对应绑定第一输出结果。S108: Determine whether the target user portrait corresponds to the binding of the first output result.
在本实施例中,判断所述目标用户画像中是否对应绑定第一输出结果,即是判断用户是否初次与智能对话机器人建立会话,这样能在后续步骤中更精准的根据用户情况调用第一知识图谱库或是第二知识图谱库。In this embodiment, judging whether the target user portrait corresponds to the first output result is to judge whether the user establishes a session with the intelligent dialogue robot for the first time, so that the first output result can be called more accurately according to the user's situation in the subsequent steps. The knowledge graph base or the second knowledge graph base.
S109、若所述目标用户画像中绑定有第一输出结果,根据所述第一输出结果在本地的第二知识图谱库中获取对应的第三当前关联问题集,将所述第三当前关联问题集发送至用户端。S109. If the target user portrait is bound with a first output result, obtain a corresponding third current associated question set in the local second knowledge graph library according to the first output result, and associate the third current associated question set with the first output result. The question set is sent to the client.
在本实施例中,所述目标用户画像中绑定有第一输出结果,表示是之前完成过至少一次沟通的用户与智能对话机器人再次建立会话以更多的了解产品的相关细节,此时不再像上述步骤中一样从第一知识图谱库获取关联问题集,而是根据所述第一输出结果对应的实体名称在本地的第二知识图谱库中获取对应的第三当前关联问题集,将所述第三当前关联问题集发送至用户端。通过这一方式,可以将已初步了解过产品科普知识的用户导向从第二知识图谱库中获取更专业的知识,无需再次了解产品的科普知识,从而提高用户获取信息的效率和准确率。In this embodiment, the target user portrait is bound with the first output result, which means that the user who has completed at least one communication before and the intelligent dialogue robot establish a conversation again to learn more about the relevant details of the product. Then obtain the set of associated questions from the first knowledge graph library as in the above steps, but obtain the corresponding third current associated question set from the second local knowledge graph library according to the entity name corresponding to the first output result, and set the The third current associated question set is sent to the client. In this way, users who have a preliminary understanding of product popularization knowledge can be directed to obtain more professional knowledge from the second knowledge graph library, without the need to understand product popularization knowledge again, thereby improving the efficiency and accuracy of information acquisition by users.
在一实施例中,步骤S108之后还包括:In one embodiment, after step S108, it further includes:
若所述目标用户画像中未绑定有第一输出结果,将所述目标用户画像更新作为所述当前用户画像,返回执行接收用户端发送的对话文本,对所述对话文本进行意图识别以得到意图识别关键词集,获取与所述意图识别关键词集中各意图识别关键词分别对应的当前实体名称,以组成当前实体名称集的步骤。If the target user portrait is not bound with the first output result, update the target user portrait as the current user portrait, return the execution to receive the dialog text sent by the client, and perform intent recognition on the dialog text to obtain The intent recognition keyword set is a step of acquiring the current entity name corresponding to each intent recognition keyword in the intent recognition keyword set, so as to form the current entity name set.
在本实施例中,所述目标用户画像中未绑定有第一输出结果,表示用户是初次与智能对话机器人建立会话,此时可以将该目标用户画像更新作为所述当前用户画像,返回执行步骤S103。通过这一方式,对于不同知识掌握程度的用户对应推荐相应信息,能有效提高用户获取信息的效率和准确率。In this embodiment, the target user portrait is not bound with the first output result, indicating that the user is establishing a session with the intelligent dialogue robot for the first time. At this time, the target user portrait can be updated as the current user portrait, and the execution is returned to Step S103. In this way, corresponding information is recommended for users with different knowledge mastery degrees, which can effectively improve the efficiency and accuracy of information acquisition by users.
在一实施例中,步骤S109之后还包括:In one embodiment, after step S109, it further includes:
将所述第一当前关联问题集、所述第二当前关联问题集及所述第三当前关联问题集均上传至区块链中。Uploading the first current set of related questions, the second set of current related questions, and the third set of current related questions to the blockchain.
在本实施例中,基于所述第一当前关联问题集、所述第二当前关联问题集及所述第三当前关联问题集得到对应的摘要信息,具体来说,摘要信息由所述第一当前关联问题集、所述第二当前关联问题集及所述第三当前关联问题集进行散列处理得到,比如利用sha256s算法处理得到。将摘要信息上传至区块链可保证其安全性和对用户的公正透明性。In this embodiment, corresponding summary information is obtained based on the first current related question set, the second current related question set, and the third current related question set. The current association problem set, the second current association problem set, and the third current association problem set are obtained by hashing, for example, by using the sha256s algorithm. Uploading summary information to the blockchain ensures its security and fairness and transparency to users.
用户设备可以从区块链中下载得该摘要信息,以便查证所述第一当前关联问题集、所述第二当前关联问题集及所述第三当前关联问题集是否被篡改。本示例所指区块链是分布式数据存储、点对点传输、共识机制、加密算法等计算机技术的新型应用模式。区块链(Blockchain),本质上是一个去中心化的数据库,是一串使用密码学方法相关联产生的数据块,每一个数据块中包含了一批次网络交易的信息,用于验证其信息的有效性(防伪)和生成下一个区块。区块链可以包括区块链底层平台、平台产品服务层以及应用服务层等。The user equipment can download the summary information from the blockchain, so as to verify whether the first current set of related questions, the second set of current related questions and the third current set of related questions have been tampered with. The blockchain referred to in this example is a new application mode of computer technology such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm. Blockchain, essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information to verify its Validity of information (anti-counterfeiting) and generation of the next block. The blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
该方法实现了对于不同知识掌握程度的用户根据对应的知识图谱推荐相应信息,能有效提高用户获取信息的效率和准确率。The method realizes that users with different knowledge mastery levels recommend corresponding information according to the corresponding knowledge graph, which can effectively improve the efficiency and accuracy of information acquisition by users.
本申请实施例还提供一种基于多知识图谱的智能答复装置,该基于多知识图谱的智能答复装置用于执行前述基于多知识图谱的智能答复方法的任一实施例。具体地,请参阅图3,图3是本申请实施例提供的基于多知识图谱的智能答复装置的示意性框图。该基于多知识图谱的智能答复装置100可以配置于服务器中。The embodiments of the present application further provide an intelligent replying device based on a multi-knowledge graph, and the multi-knowledge graph-based intelligent replying device is used to execute any of the foregoing embodiments of the multi-knowledge graph-based intelligent replying method. Specifically, please refer to FIG. 3 , which is a schematic block diagram of an intelligent replying device based on a multi-knowledge graph provided by an embodiment of the present application. The multi-knowledge graph-based intelligent answering device 100 may be configured in a server.
如图3所示,基于多知识图谱的智能答复装置100包括:知识图谱库建立单元101、连接建立单元102、当前实体识别单元103、第一问题集发送单元104、第二问题集发送单元105、结果绑定单元106、目标用户画像获取单元107、目标用户画像判断单元108、第三问题集发送单元109。As shown in FIG. 3 , the intelligent replying device 100 based on multiple knowledge graphs includes: a knowledge graph database establishment unit 101 , a connection establishment unit 102 , a current entity identification unit 103 , a first question set sending unit 104 , and a second question set sending unit 105 , a result binding unit 106 , a target user portrait acquisition unit 107 , a target user portrait judgment unit 108 , and a third question set sending unit 109 .
知识图谱库建立单元101,用于接收并保存业务服务器上传的第一知识图谱库和第二知识图谱库;其中,所述第一知识图谱库中包括多个第一子知识图谱,每一第一子知识图谱对应若干个实体名称,且实体名称对应的实体之间存在关联关系;所述第二知识图谱库中包括多个第二子知识图谱,每一第二子知识图谱对应若干个实体名称,且实体名称对应的实体之间存在关联关系。The knowledge graph library establishing unit 101 is configured to receive and save the first knowledge graph library and the second knowledge graph library uploaded by the business server; wherein, the first knowledge graph library includes a plurality of first sub-knowledge graphs, each A sub-knowledge graph corresponds to several entity names, and there is an association relationship between the entities corresponding to the entity names; the second knowledge graph library includes a plurality of second sub-knowledge graphs, and each second sub-knowledge graph corresponds to several entities name, and there is an association relationship between the entities corresponding to the entity name.
在本实施例中,当服务器接收了由业务服务器上传的第一知识图谱库和第二知识图谱库,以及存储了多个用户对应的用户画像后,这些数据可以用于用户端与智能对话机器人建立的对话中,进行问题或知识点的推荐。In this embodiment, after the server receives the first knowledge graph library and the second knowledge graph library uploaded by the service server, and stores the user portraits corresponding to multiple users, these data can be used for the client and the intelligent dialogue robot In the established dialogue, recommend questions or knowledge points.
连接建立单元102,用于若检测到用户端发送的智能会话连接指令,与所述用户端建立连接,并获取与所述用户端对应的当前用户画像。The connection establishing unit 102 is configured to establish a connection with the user terminal and acquire the current user portrait corresponding to the user terminal if the intelligent session connection instruction sent by the user terminal is detected.
在本实施例中,当某一用户端(例如记为用户端A)与服务器建立通讯连接并开启智能会话后,服务器与该用户端建立连接,并获取与所述用户端对应的当前用户画像。其中,当前用户画像可以理解为该用户的基础画像特征。In this embodiment, after a certain client (for example, denoted as client A) establishes a communication connection with the server and opens an intelligent session, the server establishes a connection with the client, and obtains the current user portrait corresponding to the client . Among them, the current user portrait can be understood as the basic portrait feature of the user.
具体实施时,在获取用户端对应的用户画像的过程如下:During specific implementation, the process of obtaining the user portrait corresponding to the user terminal is as follows:
获取所述用户端对应的用户唯一识别码,根据所述用户唯一识别码在本地的用户画像库中获取与所述用户端对应的当前用户画像。The user unique identification code corresponding to the user terminal is obtained, and the current user portrait corresponding to the user terminal is obtained from the local user portrait library according to the user unique identification code.
在本实施例中,为了更准确的获取用户端的用户画像,是在用户端与服务器建立连接后,由服务器获取该用户端对应的用户唯一识别码(如电话号码、身份证号等),此时根据所述用户唯一识别码在服务器本地的用户画像库中,可以获取到与所述用户端唯一对应的当前用户画像。In this embodiment, in order to obtain the user portrait of the client more accurately, after the client establishes a connection with the server, the server obtains the user's unique identification code (such as a phone number, ID number, etc.) corresponding to the client. At the same time, according to the user's unique identification code in the local user portrait library of the server, the current user portrait uniquely corresponding to the user terminal can be obtained.
当前实体识别单元103,用于接收用户端发送的对话文本,对所述对话文本进行意图识别以得到意图识别关键词集,获取与所述意图识别关键词集中各意图识别关键词分别对应的当前实体名称,以组成当前实体名称集。The current entity recognition unit 103 is configured to receive the dialogue text sent by the user terminal, perform intention recognition on the dialogue text to obtain an intention recognition keyword set, and obtain the current entity corresponding to each intention recognition keyword in the intention recognition keyword set. Entity names to make up the current set of entity names.
在本实施例中,当用户端与服务器建立连接并与开启了与智能对话机器人之间的对话时,需根据所接收的对话文本初步分析用户意图。基于当前用户可能正在产生的对话主题,抽取本次对话过程中关联到的知识图谱的实体(例如孩子+健康这两个实体)。由于此时用户端是初次与智能对话机器人建立会话,此时可以基于本次对话流抽取的核心实体(如上述举例的孩子+健康),在第一知识图谱库中调用需求激发阶段的子知识图谱。In this embodiment, when the user terminal establishes a connection with the server and starts a dialogue with the intelligent dialogue robot, it is necessary to preliminarily analyze the user's intention according to the received dialogue text. Based on the conversation topics that the current user may be generating, extract the entities of the knowledge graph associated in this conversation process (for example, the two entities of child + health). Since it is the first time for the client to establish a session with the intelligent dialogue robot, the sub-knowledge in the demand-stimulation stage can be called in the first knowledge map library based on the core entities extracted from this dialogue flow (such as the child + health in the above example). Atlas.
在一实施例中,当前实体识别单元103包括:In one embodiment, the current entity identification unit 103 includes:
分词单元,用于将所述对话文本通过基于概率统计分词模型进行分词,得到与所述对话文本对应的分词结果;A word segmentation unit, used to segment the dialogue text through a word segmentation model based on probability statistics to obtain a word segmentation result corresponding to the dialogue text;
关键词抽取单元,用于通过词频-逆文本频率指数模型,抽取所述分词结果按频-逆文本频率指数降序排序后未超出预设的排名阈值的关键词,以组成意图识别关键词集。The keyword extraction unit is used for extracting keywords that do not exceed a preset ranking threshold after the word segmentation results are sorted in descending order of frequency-inverse text frequency index through a word frequency-inverse text frequency index model to form an intent-identifying keyword set.
在本实施例中,通过基于概率统计分词模型对对话文本进行分词时,例如令C=C1C2...Cm,C是待切分的汉字串,令W=W1W2...Wn,W是切分的结果,Wa,Wb,……,Wk是C的所有可能的切分方案。那么,基于概率统计分词模型就是能够找到目的词串W,使得W满足:P(W|C)=MAX(P(Wa|C),P(Wb|C)...P(Wk|C))的分词模型,上述分词模型得到的词串W即估计概率为最大之词串。通过基于概率统计分词模型即可对所述对话文本进行分词,从而得到与所述对话文本对应的分词结果。In this embodiment, when the dialogue text is segmented based on the probability and statistical word segmentation model, for example, let C=C1C2...Cm, C is the Chinese character string to be segmented, and W=W1W2...Wn, W is the cut As a result of the segmentation, Wa, Wb, ..., Wk are all possible segmentation schemes of C. Then, the word segmentation model based on probability statistics can find the target word string W, so that W satisfies: P(W|C)=MAX(P(Wa|C), P(Wb|C)...P(Wk|C) ) word segmentation model, the word string W obtained by the above word segmentation model is the word string with the largest estimated probability. The dialogue text can be segmented based on the probability and statistics word segmentation model, so as to obtain a word segmentation result corresponding to the dialogue text.
之后使用词频-逆文本频率指数模型抽取分词结果中的关键词,其中词频-逆文本频率指数模型即TF-IDF模型,TF-IDF是Term Frequency–Inverse Document Frequency的简写。抽取所述分词结果按频-逆文本频率指数降序排序后位于预设的排名值之前的关键词,以组成意图识别关键词集。通过词频-逆文本频率指数模型能准确提取文本中的核心关键词,无需人工阅读后提取。After that, the word frequency-inverse text frequency index model is used to extract the keywords in the word segmentation results. The word frequency-inverse text frequency index model is the TF-IDF model, and TF-IDF is the abbreviation of Term Frequency-Inverse Document Frequency. Extracting the keywords that are located before the preset ranking value after the word segmentation results are sorted in descending order of frequency-inverse text frequency index, to form an intent-identifying keyword set. Through the word frequency-inverse text frequency index model, the core keywords in the text can be accurately extracted without manual reading.
在一实施例中,当前实体识别单元103还包括:In one embodiment, the current entity identification unit 103 further includes:
词林获取单元,用于调用预先存储的实体名称库和同义词词典,并获取与所述同义词词典对应的词林;a word forest obtaining unit, used to call the pre-stored entity name library and thesaurus dictionary, and obtain the word forest corresponding to the thesaurus dictionary;
当前实体名称集获取单元,用于根据所述词林在所述实体名称库中获取各意图识别关键词分别对应的当前实体名称,以组成当前实体名称集。The current entity name set obtaining unit is configured to obtain the current entity names corresponding to each intent identification keyword in the entity name database according to the word forest, so as to form a current entity name set.
在本实施例中,由于服务器的本地预先存储了由第一知识图谱库和第二知识图谱库中包括的个子知识图谱中的实体名称组成的实体名称库,而且还存储了用于判断词与词之间的相似度的同义词词典。其中,同义词词中包括的所有词组织在一棵或几棵树结构中(这一棵树 或多棵树记为词林),在词林中找到需判断词语相似度的两个词语分别对应的节点,两个节点的路径长度即可作为两个词语之间的语义距离(也可以理解为词语相似度)。In this embodiment, because the server locally pre-stores an entity name library composed of entity names in the first knowledge graph library and the sub-knowledge graphs included in the second knowledge graph library, and also stores a database for judging words and A thesaurus dictionary of similarity between words. Among them, all the words included in the synonym words are organized in one or several tree structures (this tree or trees are recorded as the word forest), and the two words that need to be judged for the similarity of the words are found in the word forest. node, the path length of the two nodes can be used as the semantic distance between the two words (which can also be understood as the word similarity).
例如,所述意图识别关键词集中包括小孩和健康这两个意图关键词,在词林中获取到与小孩之间的路径长度最近的词语是育儿,此时可以将小孩这一意图关键词转换为育儿这一当前实体名称;同理,在词林中获取到与健康之间的路径长度最近的词语是健康大数据,此时可以将健康这一意图关键词转换为健康大数据。For example, the intent recognition keyword set includes two intent keywords of child and health, and the word with the closest path length to the child in the word forest is parenting. At this time, the intent keyword of child can be converted is the current entity name of parenting; in the same way, the word with the closest path length to health in the word forest is health big data. At this time, the intention keyword of health can be converted into health big data.
通过上述同义词的转换关系,确保了能根据用户的对话能在知识图谱库中检索到对应的实体名称,从而可准确的向用户推送信息。Through the conversion relationship of the above synonyms, it is ensured that the corresponding entity name can be retrieved in the knowledge graph database according to the user's dialogue, so that information can be accurately pushed to the user.
第一问题集发送单元104,用于根据所述当前实体名称集中各当前实体名称在本地的第一知识图谱库中获取对应的第一当前关联问题集,将所述第一当前关联问题集发送至用户端;其中,所述第一当前关联问题集中包括的关联问题子集的个数与所述当前实体名称集中包括的当前实体名称的个数相同,且每一当前实体名称对应所述第一当前关联问题集中的其中一个关联问题子集。The first question set sending unit 104 is configured to obtain the corresponding first current associated question set in the local first knowledge graph library according to each current entity name in the current entity name set, and send the first current associated question set to the user terminal; wherein the number of association question subsets included in the first current association question set is the same as the number of current entity names included in the current entity name set, and each current entity name corresponds to the first A subset of contextual questions in a current set of contextual questions.
在本实施例中,当获取了用户意图对应的当前实体名称集后(例如上述举例的育儿+健康大数据),此时可以根据当前实体名称集中各当前实体名称在本地的第一知识图谱库中获取对应的第一当前关联问题集,以引导用户去咨询健康大数据+育儿相关的问题。智能对话机器人此时主要调用育儿这一实体在第一知识图谱库中对应的子知识图谱,还调用健康大数据这一实体在第一知识图谱库中对应的子知识图谱,获取了上述子知识图谱后调用与上述子知识图谱的分别对应的关联问题,以组成第一第一当前关联问题集,将所述第一当前关联问题集发送至用户端。In this embodiment, after the current entity name set corresponding to the user's intention is obtained (such as the parenting + health big data in the above example), the local first knowledge graph library of each current entity name in the current entity name set can be used at this time. Obtain the corresponding first current related question set from , so as to guide users to consult health big data + parenting related questions. At this time, the intelligent dialogue robot mainly calls the sub-knowledge map corresponding to the entity of parenting in the first knowledge map database, and also calls the sub-knowledge map corresponding to the entity of health big data in the first knowledge map database, and obtains the above-mentioned sub-knowledge After the graph, the respective associated questions corresponding to the above sub-knowledge graphs are called to form a first first current associated question set, and the first current associated question set is sent to the user terminal.
第二问题集发送单元105,用于若检测用户端发送的与第一当前关联问题集对应的第一问题点击指令,获取与所述第一问题点击指令对应的第一目标实体名称,获取所述第一目标实体名称对应的第一关联实体名称集,根据所述第一关联实体名称集中各第一关联实体名称在本地的第一知识图谱库中获取对应的第二当前关联问题集,将所述第二当前关联问题集发送至用户端;其中,所述第二当前关联问题集中包括的关联问题子集的个数与所述第一关联实体名称集中包括的第一关联实体名称的个数相同,且每一第一关联实体名称对应所述第二当前关联问题集中的其中一个关联问题子集。The second question set sending unit 105 is configured to obtain the name of the first target entity corresponding to the first question click instruction and obtain the the first associated entity name set corresponding to the first target entity name, obtain the corresponding second current associated problem set in the local first knowledge graph library according to the first associated entity name in the first associated entity name set, and set the The second current association question set is sent to the user terminal; wherein the number of association question subsets included in the second current association question set is the same as the number of first association entity names included in the first association entity name set. The numbers are the same, and each first associated entity name corresponds to one of the associated problem subsets in the second current associated problem set.
在本实施例中,当用户端在与智能对话机器人之间的对话框中查看了服务器所发送的第一当前关联问题集后,可以手动点击选择其中一个最感兴趣的问题以生成第一问题点击指令。例如用户点击了健康大数据这个实体名称下面的乳腺相关问题(具体如2019年乳腺健康大数据),则抽取到了这个问题中的关键实体名称(乳腺),接下来会根据乳腺这个实体名称,以及当前正在查看的(2019年乳腺健康大数据)所关联到的(健康大数据)这个实体名称,根据第一知识图谱库中实体之间的关联关系而关联到另外的实体(乳腺增生+女性癌症),同时展示以这两个关联实体下面的相关问题(80后女性如何避免得乳腺增生,女性癌症高发疾病有哪些?)。In this embodiment, after viewing the first current associated question set sent by the server in the dialog box with the intelligent dialogue robot, the user terminal can manually click to select one of the most interesting questions to generate the first question Click Command. For example, if a user clicks on a breast-related question under the entity name of health big data (specifically, breast health big data in 2019), the key entity name (mammary gland) in this question is extracted, and then according to the entity name of breast, and The name of the entity (health big data) to which the currently viewing (2019 breast health big data) is associated is related to another entity (breast hyperplasia + female cancer) according to the association relationship between the entities in the first knowledge atlas ), and at the same time show the related questions below these two related entities (how to avoid breast hyperplasia in post-80s women, and what are the diseases with high incidence of female cancer?).
上述举例的两个关联实体组成的是与所述第一目标实体名称对应的第一关联实体名称集, 在获取了第一关联实体名称集后可以在服务器本地的第一知识图谱库中获取对应的第二当前关联问题集,将所述第二当前关联问题集发送至用户端,以引导用户去咨询乳腺增生+女性癌症相关的问题。The two associated entities in the above example are composed of a first associated entity name set corresponding to the name of the first target entity. After the first associated entity name set is obtained, the corresponding entity can be obtained in the local first knowledge graph library of the server. The second current associated question set is sent to the user terminal, so as to guide the user to inquire about breast hyperplasia + female cancer-related questions.
结果绑定单元106,用于若检测用户端发送的与第二当前关联问题集对应的第二问题点击指令,获取与所述第二问题点击指令对应的第二目标实体名称增加完成标识以作为第一输出结果,将所述第一输出结果与所述当前用户画像进行一一映射以绑定。The result binding unit 106 is configured to acquire the second target entity name corresponding to the second question click instruction and add the completion flag as For the first output result, the first output result and the current user portrait are mapped one-to-one to bind.
在本实施例中,当用户端在与智能对话机器人之间的对话框中查看了服务器所发送的第二当前关联问题集后,可以手动点击选择其中一个最感兴趣的问题以生成第二问题点击指令。例如,用户最终点击了女性癌症高发疾病有哪些,则根据这个相关问题,抽取到该用户A已经完成了需求激发阶段。此时输出一个“已完成女性癌症的科普知识了解”作为第一输出结果,这一第一数据结果可以作为一个用户标签增加至该用户对应的当前用户画像中,或者是可以将所述第一输出结果与所述当前用户画像进行一一映射以绑定。In this embodiment, after viewing the second current associated question set sent by the server in the dialog box with the intelligent dialogue robot, the user terminal can manually click to select one of the most interesting questions to generate the second question Click Command. For example, if the user finally clicks on the diseases with high incidence of female cancer, according to this related question, it is extracted that the user A has completed the demand stimulation stage. At this time, a "completed scientific knowledge of women's cancer" is output as the first output result. This first data result can be added as a user label to the current user portrait corresponding to the user, or the first data result can be added to the user's current user portrait. The output result is mapped one-to-one with the current user portrait for binding.
目标用户画像获取单元107,用于若检测到用户端发送的另一智能会话连接指令,获取与另一智能会话连接指令对应的目标用户画像。The target user portrait acquisition unit 107 is configured to acquire a target user portrait corresponding to another intelligent conversation connection instruction if another intelligent conversation connection instruction sent by the user terminal is detected.
在本实施例中,当服务器检测到用户端发送的另一智能会话连接指令,此时可能是之前的用户A与智能对话机器人再次建立会话以更多的了解产品的相关细节(也就是该用户已经经历过需求激发阶段而进入观念导入阶段),也有可能是非用户A的另一用户B与智能对话机器人初次建立会话以了解产品的科普知识(也即该用户还未经历过需求激发阶段),此时可以根据与另一智能会话连接指令获取对应的目标用户画像。之后判断该目标用户画像是否对应一个第一输出结果,从而实现判断用户是否初次与智能对话机器人建立会话。In this embodiment, when the server detects another intelligent session connection instruction sent by the client, it may be that the previous user A and the intelligent dialogue robot establish a session again to learn more about the relevant details of the product (that is, the user It has already gone through the demand stimulation stage and entered the concept introduction stage), it is also possible that another user B other than user A has established a conversation with the intelligent dialogue robot for the first time to understand the popular science knowledge of the product (that is, the user has not yet experienced the demand stimulation stage), At this time, the corresponding target user portrait can be acquired according to the connection instruction with another intelligent session. Then, it is determined whether the target user portrait corresponds to a first output result, so as to determine whether the user establishes a conversation with the intelligent dialogue robot for the first time.
目标用户画像判断单元108,用于判断所述目标用户画像中是否对应绑定第一输出结果。The target user portrait judgment unit 108 is configured to judge whether the target user portrait corresponds to the binding of the first output result.
在本实施例中,判断所述目标用户画像中是否对应绑定第一输出结果,即是判断用户是否初次与智能对话机器人建立会话,这样能在后续步骤中更精准的根据用户情况调用第一知识图谱库或是第二知识图谱库。In this embodiment, judging whether the target user portrait corresponds to the first output result is to judge whether the user establishes a session with the intelligent dialogue robot for the first time, so that the first output result can be called more accurately according to the user's situation in the subsequent steps. The knowledge graph base or the second knowledge graph base.
第三问题集发送单元109,用于若所述目标用户画像中绑定有第一输出结果,根据所述第一输出结果在本地的第二知识图谱库中获取对应的第三当前关联问题集,将所述第三当前关联问题集发送至用户端。The third question set sending unit 109 is configured to, if the target user portrait is bound with a first output result, obtain a corresponding third current associated question set in the local second knowledge graph library according to the first output result , and send the third current associated question set to the client.
在本实施例中,所述目标用户画像中绑定有第一输出结果,表示是之前完成过至少一次沟通的用户与智能对话机器人再次建立会话以更多的了解产品的相关细节,此时不再像上述步骤中一样从第一知识图谱库获取关联问题集,而是根据所述第一输出结果对应的实体名称在本地的第二知识图谱库中获取对应的第三当前关联问题集,将所述第三当前关联问题集发送至用户端。通过这一方式,可以将已初步了解过产品科普知识的用户导向从第二知识图谱库中获取更专业的知识,无需再次了解产品的科普知识,从而提高用户获取信息的效率和准确率。In this embodiment, the target user portrait is bound with the first output result, which means that the user who has completed at least one communication before and the intelligent dialogue robot establish a conversation again to learn more about the relevant details of the product. Then obtain the set of associated questions from the first knowledge graph library as in the above steps, but obtain the corresponding third current associated question set from the second local knowledge graph library according to the entity name corresponding to the first output result, and set the The third current associated question set is sent to the client. In this way, users who have a preliminary understanding of product popularization knowledge can be directed to obtain more professional knowledge from the second knowledge graph library, without the need to understand product popularization knowledge again, thereby improving the efficiency and accuracy of information acquisition by users.
在一实施例中,基于多知识图谱的智能答复装置100还包括:In one embodiment, the multi-knowledge graph-based intelligent reply apparatus 100 further includes:
返回控制单元,用于若所述目标用户画像中未绑定有第一输出结果,将所述目标用户画 像更新作为所述当前用户画像,返回执行接收用户端发送的对话文本,对所述对话文本进行意图识别以得到意图识别关键词集,获取与所述意图识别关键词集中各意图识别关键词分别对应的当前实体名称,以组成当前实体名称集的步骤。The return control unit is configured to update the target user portrait as the current user portrait if the target user portrait is not bound with the first output result, return the execution to receive the dialogue text sent by the user terminal, and respond to the dialogue A step of performing intent recognition on the text to obtain an intent recognition keyword set, and acquiring the current entity names corresponding to each intent recognition keyword in the intent recognition keyword set, so as to form a current entity name set.
在本实施例中,所述目标用户画像中未绑定有第一输出结果,表示用户是初次与智能对话机器人建立会话,此时可以将该目标用户画像更新作为所述当前用户画像,返回执行接收用户端发送的对话文本,对所述对话文本进行意图识别以得到意图识别关键词集,获取与所述意图识别关键词集中各意图识别关键词分别对应的当前实体名称,以组成当前实体名称集的步骤。通过这一方式,对于不同知识掌握程度的用户对应推荐相应信息,能有效提高用户获取信息的效率和准确率。In this embodiment, the target user portrait is not bound with the first output result, indicating that the user is establishing a session with the intelligent dialogue robot for the first time. At this time, the target user portrait can be updated as the current user portrait, and the execution is returned to Receive the dialogue text sent by the user terminal, perform intention recognition on the dialogue text to obtain an intention recognition keyword set, and obtain the current entity name corresponding to each intention recognition keyword in the intention recognition keyword set, so as to form the current entity name set of steps. In this way, corresponding information is recommended for users with different knowledge mastery degrees, which can effectively improve the efficiency and accuracy of information acquisition by users.
在一实施例中,基于多知识图谱的智能答复装置100还包括:In an embodiment, the intelligent replying device 100 based on the multi-knowledge graph further includes:
上链单元,用于将所述第一当前关联问题集、所述第二当前关联问题集及所述第三当前关联问题集均上传至区块链中。An uploading unit, configured to upload the first current related problem set, the second current related problem set and the third current related problem set to the blockchain.
在本实施例中,基于所述第一当前关联问题集、所述第二当前关联问题集及所述第三当前关联问题集得到对应的摘要信息,具体来说,摘要信息由所述第一当前关联问题集、所述第二当前关联问题集及所述第三当前关联问题集进行散列处理得到,比如利用sha256s算法处理得到。将摘要信息上传至区块链可保证其安全性和对用户的公正透明性。In this embodiment, corresponding summary information is obtained based on the first current related question set, the second current related question set, and the third current related question set. The current association problem set, the second current association problem set, and the third current association problem set are obtained by hashing, for example, by using the sha256s algorithm. Uploading summary information to the blockchain ensures its security and fairness and transparency to users.
用户设备可以从区块链中下载得该摘要信息,以便查证所述第一当前关联问题集、所述第二当前关联问题集及所述第三当前关联问题集是否被篡改。本示例所指区块链是分布式数据存储、点对点传输、共识机制、加密算法等计算机技术的新型应用模式。区块链(Blockchain),本质上是一个去中心化的数据库,是一串使用密码学方法相关联产生的数据块,每一个数据块中包含了一批次网络交易的信息,用于验证其信息的有效性(防伪)和生成下一个区块。区块链可以包括区块链底层平台、平台产品服务层以及应用服务层等。The user equipment can download the summary information from the blockchain, so as to verify whether the first current set of related questions, the second set of current related questions and the third current set of related questions have been tampered with. The blockchain referred to in this example is a new application mode of computer technology such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm. Blockchain, essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information to verify its Validity of information (anti-counterfeiting) and generation of the next block. The blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
该装置实现了对于不同知识掌握程度的用户根据对应的知识图谱推荐相应信息,能有效提高用户获取信息的效率和准确率。The device implements the recommendation of corresponding information according to the corresponding knowledge graph for users with different knowledge mastery degrees, which can effectively improve the efficiency and accuracy of information acquisition by the users.
上述基于多知识图谱的智能答复装置可以实现为计算机程序的形式,该计算机程序可以在如图4所示的计算机设备上运行。The above-mentioned multi-knowledge graph-based intelligent replying apparatus can be implemented in the form of a computer program, and the computer program can be executed on a computer device as shown in FIG. 4 .
请参阅图4,图4是本申请实施例提供的计算机设备的示意性框图。该计算机设备500是服务器,服务器可以是独立的服务器,也可以是多个服务器组成的服务器集群。Please refer to FIG. 4 , which is a schematic block diagram of a computer device provided by an embodiment of the present application. The computer device 500 is a server, and the server may be an independent server or a server cluster composed of multiple servers.
参阅图4,该计算机设备500包括通过系统总线501连接的处理器502、存储器和网络接口505,其中,存储器可以包括非易失性存储介质503和内存储器504。Referring to FIG. 4 , the computer device 500 includes a processor 502 , a memory and a network interface 505 connected through a system bus 501 , wherein the memory may include a non-volatile storage medium 503 and an internal memory 504 .
该非易失性存储介质503可存储操作系统5031和计算机程序5032。该计算机程序5032被执行时,可使得处理器502执行基于多知识图谱的智能答复方法。The nonvolatile storage medium 503 can store an operating system 5031 and a computer program 5032 . When the computer program 5032 is executed, the processor 502 can execute the intelligent reply method based on the multi-knowledge graph.
该处理器502用于提供计算和控制能力,支撑整个计算机设备500的运行。The processor 502 is used to provide computing and control capabilities to support the operation of the entire computer device 500 .
该内存储器504为非易失性存储介质503中的计算机程序5032的运行提供环境,该计算机程序5032被处理器502执行时,可使得处理器502执行基于多知识图谱的智能答复方法。The internal memory 504 provides an environment for running the computer program 5032 in the non-volatile storage medium 503. When the computer program 5032 is executed by the processor 502, the processor 502 can execute the intelligent answering method based on the multi-knowledge graph.
该网络接口505用于进行网络通信,如提供数据信息的传输等。本领域技术人员可以理 解,图4中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备500的限定,具体的计算机设备500可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。The network interface 505 is used for network communication, such as providing transmission of data information. Those skilled in the art can understand that the structure shown in FIG. 4 is only a block diagram of a partial structure related to the solution of the present application, and does not constitute a limitation on the computer device 500 to which the solution of the present application is applied. The specific computer device 500 may include more or fewer components than shown, or combine certain components, or have a different arrangement of components.
其中,所述处理器502用于运行存储在存储器中的计算机程序5032,以实现本申请实施例公开的基于多知识图谱的智能答复方法。The processor 502 is configured to run the computer program 5032 stored in the memory, so as to implement the multi-knowledge graph-based intelligent reply method disclosed in the embodiment of the present application.
本领域技术人员可以理解,图4中示出的计算机设备的实施例并不构成对计算机设备具体构成的限定,在其他实施例中,计算机设备可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。例如,在一些实施例中,计算机设备可以仅包括存储器及处理器,在这样的实施例中,存储器及处理器的结构及功能与图4所示实施例一致,在此不再赘述。Those skilled in the art can understand that the embodiment of the computer device shown in FIG. 4 does not constitute a limitation on the specific structure of the computer device, and in other embodiments, the computer device may include more or less components than those shown in the figure, Either some components are combined, or different component arrangements. For example, in some embodiments, the computer device may only include a memory and a processor. In such an embodiment, the structures and functions of the memory and the processor are the same as those of the embodiment shown in FIG. 4 , and details are not repeated here.
应当理解,在本申请实施例中,处理器502可以是中央处理单元(Central Processing Unit,CPU),该处理器502还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现成可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。其中,通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。It should be understood that, in this embodiment of the present application, the processor 502 may be a central processing unit (Central Processing Unit, CPU), and the processor 502 may also be other general-purpose processors, digital signal processors (Digital Signal Processor, DSP), Application Specific Integrated Circuit (ASIC), Field-Programmable Gate Array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. Wherein, the general-purpose processor can be a microprocessor or the processor can also be any conventional processor or the like.
在本申请的另一实施例中提供计算机可读存储介质。该计算机可读存储介质可以为非易失性的计算机可读存储介质,也可以是易失性的计算机可读存储介质。该计算机可读存储介质存储有计算机程序,其中计算机程序被处理器执行时实现本申请实施例公开的基于多知识图谱的智能答复方法。In another embodiment of the present application, a computer-readable storage medium is provided. The computer-readable storage medium may be a non-volatile computer-readable storage medium, or a volatile computer-readable storage medium. The computer-readable storage medium stores a computer program, wherein when the computer program is executed by the processor, the multi-knowledge graph-based intelligent reply method disclosed in the embodiments of the present application is implemented.
以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到各种等效的修改或替换,这些修改或替换都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以权利要求的保护范围为准。The above are only specific embodiments of the present application, but the protection scope of the present application is not limited thereto. Any person skilled in the art can easily think of various equivalents within the technical scope disclosed in the present application. Modifications or substitutions shall be covered by the protection scope of this application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims (20)

  1. 一种基于多知识图谱的智能答复方法,其中,包括:An intelligent reply method based on multi-knowledge graph, including:
    若检测到用户端发送的智能会话连接指令,与所述用户端建立连接,并获取与所述用户端对应的当前用户画像;If an intelligent session connection instruction sent by the client is detected, a connection is established with the client, and a current user portrait corresponding to the client is obtained;
    接收用户端发送的对话文本,对所述对话文本进行意图识别以得到意图识别关键词集,获取与所述意图识别关键词集中各意图识别关键词分别对应的当前实体名称,以组成当前实体名称集;Receive the dialogue text sent by the user terminal, perform intention recognition on the dialogue text to obtain an intention recognition keyword set, and obtain the current entity name corresponding to each intention recognition keyword in the intention recognition keyword set, so as to form the current entity name set;
    根据所述当前实体名称集中各当前实体名称在本地的第一知识图谱库中获取对应的第一当前关联问题集,将所述第一当前关联问题集发送至用户端;其中,所述第一当前关联问题集中包括的关联问题子集的个数与所述当前实体名称集中包括的当前实体名称的个数相同,且每一当前实体名称对应所述第一当前关联问题集中的其中一个关联问题子集;According to the current entity names in the current entity name set, the corresponding first current associated question set is obtained in the local first knowledge graph library, and the first current associated question set is sent to the client; wherein the first current associated question set is The number of association question subsets included in the current association question set is the same as the number of current entity names included in the current entity name set, and each current entity name corresponds to one of the association questions in the first current association question set Subset;
    若检测用户端发送的与第一当前关联问题集对应的第一问题点击指令,获取与所述第一问题点击指令对应的第一目标实体名称,获取所述第一目标实体名称对应的第一关联实体名称集,根据所述第一关联实体名称集中各第一关联实体名称在本地的第一知识图谱库中获取对应的第二当前关联问题集,将所述第二当前关联问题集发送至用户端;其中,所述第二当前关联问题集中包括的关联问题子集的个数与所述第一关联实体名称集中包括的第一关联实体名称的个数相同,且每一第一关联实体名称对应所述第二当前关联问题集中的其中一个关联问题子集;以及If the first question click instruction corresponding to the first current associated question set sent by the client is detected, the first target entity name corresponding to the first question click instruction is obtained, and the first target entity name corresponding to the first target entity name is obtained. A set of associated entity names, obtaining a corresponding second current associated problem set in the local first knowledge graph library according to the names of each first associated entity in the first associated entity name set, and sending the second current associated problem set to The user terminal; wherein, the number of association question subsets included in the second current association question set is the same as the number of first association entity names included in the first association entity name set, and each first association entity a name corresponding to one of the subset of contextual questions in the second current set of contextual questions; and
    若检测用户端发送的与第二当前关联问题集对应的第二问题点击指令,获取与所述第二问题点击指令对应的第二目标实体名称增加完成标识以作为第一输出结果,将所述第一输出结果与所述当前用户画像进行一一映射以绑定。If the second question click instruction corresponding to the second current associated question set sent by the user terminal is detected, the second target entity name corresponding to the second question click instruction is acquired and the completion flag is added as the first output result, and the The first output result and the current user portrait are mapped one-to-one for binding.
  2. 根据权利要求1所述的基于多知识图谱的智能答复方法,其中,还包括:The intelligent reply method based on multi-knowledge graph according to claim 1, wherein, further comprising:
    若检测到用户端发送的另一智能会话连接指令,获取与另一智能会话连接指令对应的目标用户画像;If another intelligent session connection instruction sent by the user terminal is detected, obtain the target user portrait corresponding to the other intelligent session connection instruction;
    判断所述目标用户画像中是否对应绑定第一输出结果;以及Judging whether the target user portrait corresponds to the binding first output result; and
    若所述目标用户画像中绑定有第一输出结果,根据所述第一输出结果在本地的第二知识图谱库中获取对应的第三当前关联问题集,将所述第三当前关联问题集发送至用户端。If the target user portrait is bound with the first output result, obtain the corresponding third current associated question set in the local second knowledge graph library according to the first output result, and assign the third current associated question set sent to the client.
  3. 根据权利要求1所述的基于多知识图谱的智能答复方法,其中,还包括:The intelligent reply method based on multi-knowledge graph according to claim 1, wherein, further comprising:
    接收并保存业务服务器上传的第一知识图谱库和第二知识图谱库;其中,所述第一知识图谱库中包括多个第一子知识图谱,每一第一子知识图谱对应若干个实体名称,且实体名称对应的实体之间存在关联关系;所述第二知识图谱库中包括多个第二子知识图谱,每一第二子知识图谱对应若干个实体名称,且实体名称对应的实体之间存在关联关系。Receive and save the first knowledge graph library and the second knowledge graph library uploaded by the business server; wherein, the first knowledge graph library includes a plurality of first sub-knowledge graphs, and each first sub-knowledge graph corresponds to several entity names , and there is an association relationship between the entities corresponding to the entity names; the second knowledge graph library includes a plurality of second sub-knowledge graphs, each second sub-knowledge graph corresponds to several entity names, and the entity name corresponding to the entity name There is a relationship between them.
  4. 根据权利要求1所述的基于多知识图谱的智能答复方法,其中,还包括:The intelligent reply method based on multi-knowledge graph according to claim 1, wherein, further comprising:
    若所述目标用户画像中未绑定有第一输出结果,将所述目标用户画像更新作为所述当前用户画像,返回执行接收用户端发送的对话文本,对所述对话文本进行意图识别以得到意图识别关键词集,获取与所述意图识别关键词集中各意图识别关键词分别对应的当前实体名称, 以组成当前实体名称集的步骤。If the target user portrait is not bound with the first output result, update the target user portrait as the current user portrait, return the execution to receive the dialog text sent by the client, and perform intent recognition on the dialog text to obtain The intent recognition keyword set, the step of acquiring the current entity names corresponding to each intent recognition keyword in the intent recognition keyword set, so as to form the current entity name set.
  5. 根据权利要求1所述的基于多知识图谱的智能答复方法,其中,所述获取与所述用户端对应的当前用户画像,包括:The multi-knowledge graph-based intelligent reply method according to claim 1, wherein the acquiring the current user portrait corresponding to the user terminal comprises:
    获取所述用户端对应的用户唯一识别码,根据所述用户唯一识别码在本地的用户画像库中获取与所述用户端对应的当前用户画像。The user unique identification code corresponding to the user terminal is obtained, and the current user portrait corresponding to the user terminal is obtained from the local user portrait library according to the user unique identification code.
  6. 根据权利要求1所述的基于多知识图谱的智能答复方法,其中,所述对所述对话文本进行意图识别以得到意图识别关键词集,包括:The multi-knowledge graph-based intelligent reply method according to claim 1, wherein the performing intention recognition on the dialogue text to obtain the intention recognition keyword set, comprising:
    将所述对话文本通过基于概率统计分词模型进行分词,得到与所述对话文本对应的分词结果;The dialogue text is subjected to word segmentation based on a probabilistic and statistical word segmentation model, and a word segmentation result corresponding to the dialogue text is obtained;
    通过词频-逆文本频率指数模型,抽取所述分词结果按频-逆文本频率指数降序排序后未超出预设的排名阈值的关键词,以组成意图识别关键词集。Through the word frequency-inverse text frequency index model, keywords that do not exceed the preset ranking threshold after the word segmentation results are sorted in descending order by the frequency-inverse text frequency index are extracted to form an intent identification keyword set.
  7. 根据权利要求1所述的基于多知识图谱的智能答复方法,其中,所述获取与所述意图识别关键词集中各意图识别关键词分别对应的当前实体名称,以组成当前实体名称集,包括:The multi-knowledge graph-based intelligent reply method according to claim 1, wherein the acquiring the current entity name corresponding to each intent identification keyword in the intent identification keyword set respectively, to form the current entity name set, comprises:
    调用预先存储的实体名称库和同义词词典,并获取与所述同义词词典对应的词林;Call the pre-stored entity name library and thesaurus dictionary, and obtain the word forest corresponding to the thesaurus dictionary;
    根据所述词林在所述实体名称库中获取各意图识别关键词分别对应的当前实体名称,以组成当前实体名称集。According to the word forest, the current entity names corresponding to each intent identification keyword are obtained from the entity name database, so as to form a current entity name set.
  8. 根据权利要求2所述的基于多知识图谱的智能答复方法,其中,所述若所述目标用户画像中绑定有第一输出结果,根据所述第一输出结果在本地的第二知识图谱库中获取对应的第三当前关联问题集,将所述第三当前关联问题集发送至用户端之后,还包括:The intelligent reply method based on multi-knowledge graphs according to claim 2, wherein, if the target user portrait is bound with a first output result, a local second knowledge graph library according to the first output result After obtaining the corresponding third current set of related questions from , and after sending the third current set of related questions to the client, it also includes:
    将所述第一当前关联问题集、所述第二当前关联问题集及所述第三当前关联问题集均上传至区块链中。Uploading the first current set of related questions, the second set of current related questions, and the third set of current related questions to the blockchain.
  9. 根据权利要求7所述的基于多知识图谱的智能答复方法,其中,所述根据所述词林在所述实体名称库中获取各意图识别关键词分别对应的当前实体名称,以组成当前实体名称集,包括:The intelligent reply method based on multi-knowledge graph according to claim 7, wherein the current entity name corresponding to each intent identification keyword is obtained in the entity name database according to the word forest, so as to form the current entity name set, including:
    在所述词林中获取与各意图识别关键词之间分别对应的语义距离为最小值的目标词语,组成当前实体名称集。In the word forest, target words whose semantic distances corresponding to each intent recognition keyword are the smallest are obtained to form a current entity name set.
  10. 一种基于多知识图谱的智能答复装置,其中,包括:An intelligent replying device based on a multi-knowledge graph, comprising:
    连接建立单元,用于若检测到用户端发送的智能会话连接指令,与所述用户端建立连接,并获取与所述用户端对应的当前用户画像;a connection establishment unit, configured to establish a connection with the user terminal and obtain a current user portrait corresponding to the user terminal if an intelligent session connection instruction sent by the user terminal is detected;
    当前实体识别单元,用于接收用户端发送的对话文本,对所述对话文本进行意图识别以得到意图识别关键词集,获取与所述意图识别关键词集中各意图识别关键词分别对应的当前实体名称,以组成当前实体名称集;The current entity recognition unit is used to receive the dialogue text sent by the user terminal, perform intention recognition on the dialogue text to obtain an intention recognition keyword set, and obtain the current entity corresponding to each intention recognition keyword in the intention recognition keyword set. name, to make up the current set of entity names;
    第一问题集发送单元,用于根据所述当前实体名称集中各当前实体名称在本地的第一知识图谱库中获取对应的第一当前关联问题集,将所述第一当前关联问题集发送至用户端;其中,所述第一当前关联问题集中包括的关联问题子集的个数与所述当前实体名称集中包括的当前实体名称的个数相同,且每一当前实体名称对应所述第一当前关联问题集中的其中一个 关联问题子集;The first question set sending unit is configured to obtain the corresponding first current associated question set in the local first knowledge graph library according to each current entity name in the current entity name set, and send the first current associated question set to User terminal; wherein, the number of association question subsets included in the first current association question set is the same as the number of current entity names included in the current entity name set, and each current entity name corresponds to the first one of the subset of related questions in the current set of related questions;
    第二问题集发送单元,用于若检测用户端发送的与第一当前关联问题集对应的第一问题点击指令,获取与所述第一问题点击指令对应的第一目标实体名称,获取所述第一目标实体名称对应的第一关联实体名称集,根据所述第一关联实体名称集中各第一关联实体名称在本地的第一知识图谱库中获取对应的第二当前关联问题集,将所述第二当前关联问题集发送至用户端;其中,所述第二当前关联问题集中包括的关联问题子集的个数与所述第一关联实体名称集中包括的第一关联实体名称的个数相同,且每一第一关联实体名称对应所述第二当前关联问题集中的其中一个关联问题子集;以及The second question set sending unit is configured to acquire the name of the first target entity corresponding to the first question click instruction and obtain the The first associated entity name set corresponding to the first target entity name, according to each first associated entity name in the first associated entity name set, obtain the corresponding second current associated problem set in the local first knowledge graph library, The second current associated question set is sent to the client; wherein, the number of associated question subsets included in the second current associated question set and the number of first associated entity names included in the first associated entity name set are the same, and each first associated entity name corresponds to one of the associated problem subsets in the second current associated problem set; and
    结果绑定单元,用于若检测用户端发送的与第二当前关联问题集对应的第二问题点击指令,获取与所述第二问题点击指令对应的第二目标实体名称增加完成标识以作为第一输出结果,将所述第一输出结果与所述当前用户画像进行一一映射以绑定。The result binding unit is configured to acquire the second target entity name addition completion identifier corresponding to the second question click instruction corresponding to the second question click instruction as the first an output result, the first output result and the current user portrait are mapped one-to-one to bind.
  11. 一种计算机设备,包括存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,其中,所述处理器执行所述计算机程序时实现以下步骤:A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor implements the following steps when executing the computer program:
    若检测到用户端发送的智能会话连接指令,与所述用户端建立连接,并获取与所述用户端对应的当前用户画像;If an intelligent session connection instruction sent by the client is detected, a connection is established with the client, and a current user portrait corresponding to the client is obtained;
    接收用户端发送的对话文本,对所述对话文本进行意图识别以得到意图识别关键词集,获取与所述意图识别关键词集中各意图识别关键词分别对应的当前实体名称,以组成当前实体名称集;Receive the dialogue text sent by the user terminal, perform intention recognition on the dialogue text to obtain an intention recognition keyword set, and obtain the current entity name corresponding to each intention recognition keyword in the intention recognition keyword set, so as to form the current entity name set;
    根据所述当前实体名称集中各当前实体名称在本地的第一知识图谱库中获取对应的第一当前关联问题集,将所述第一当前关联问题集发送至用户端;其中,所述第一当前关联问题集中包括的关联问题子集的个数与所述当前实体名称集中包括的当前实体名称的个数相同,且每一当前实体名称对应所述第一当前关联问题集中的其中一个关联问题子集;According to the current entity names in the current entity name set, the corresponding first current associated question set is obtained in the local first knowledge graph library, and the first current associated question set is sent to the client; wherein the first current associated question set is The number of association question subsets included in the current association question set is the same as the number of current entity names included in the current entity name set, and each current entity name corresponds to one of the association questions in the first current association question set Subset;
    若检测用户端发送的与第一当前关联问题集对应的第一问题点击指令,获取与所述第一问题点击指令对应的第一目标实体名称,获取所述第一目标实体名称对应的第一关联实体名称集,根据所述第一关联实体名称集中各第一关联实体名称在本地的第一知识图谱库中获取对应的第二当前关联问题集,将所述第二当前关联问题集发送至用户端;其中,所述第二当前关联问题集中包括的关联问题子集的个数与所述第一关联实体名称集中包括的第一关联实体名称的个数相同,且每一第一关联实体名称对应所述第二当前关联问题集中的其中一个关联问题子集;以及If the first question click instruction corresponding to the first current associated question set sent by the client is detected, the first target entity name corresponding to the first question click instruction is obtained, and the first target entity name corresponding to the first target entity name is obtained. A set of associated entity names, obtaining a corresponding second current associated problem set in the local first knowledge graph library according to the names of each first associated entity in the first associated entity name set, and sending the second current associated problem set to The user terminal; wherein, the number of association question subsets included in the second current association question set is the same as the number of first association entity names included in the first association entity name set, and each first association entity a name corresponding to one of the subset of contextual questions in the second current set of contextual questions; and
    若检测用户端发送的与第二当前关联问题集对应的第二问题点击指令,获取与所述第二问题点击指令对应的第二目标实体名称增加完成标识以作为第一输出结果,将所述第一输出结果与所述当前用户画像进行一一映射以绑定。If the second question click instruction corresponding to the second current associated question set sent by the user terminal is detected, the second target entity name corresponding to the second question click instruction is acquired and the completion flag is added as the first output result, and the The first output result and the current user portrait are mapped one-to-one for binding.
  12. 根据权利要求11所述的计算机设备,其中,还包括:The computer device of claim 11, further comprising:
    若检测到用户端发送的另一智能会话连接指令,获取与另一智能会话连接指令对应的目标用户画像;If another intelligent session connection instruction sent by the user terminal is detected, obtain the target user portrait corresponding to the other intelligent session connection instruction;
    判断所述目标用户画像中是否对应绑定第一输出结果;以及Judging whether the target user portrait corresponds to the binding first output result; and
    若所述目标用户画像中绑定有第一输出结果,根据所述第一输出结果在本地的第二知识图谱库中获取对应的第三当前关联问题集,将所述第三当前关联问题集发送至用户端。If the target user portrait is bound with the first output result, obtain the corresponding third current associated question set in the local second knowledge graph library according to the first output result, and assign the third current associated question set sent to the client.
  13. 根据权利要求11所述的计算机设备,其中,还包括:The computer device of claim 11, further comprising:
    接收并保存业务服务器上传的第一知识图谱库和第二知识图谱库;其中,所述第一知识图谱库中包括多个第一子知识图谱,每一第一子知识图谱对应若干个实体名称,且实体名称对应的实体之间存在关联关系;所述第二知识图谱库中包括多个第二子知识图谱,每一第二子知识图谱对应若干个实体名称,且实体名称对应的实体之间存在关联关系。Receive and save the first knowledge graph library and the second knowledge graph library uploaded by the business server; wherein, the first knowledge graph library includes a plurality of first sub-knowledge graphs, and each first sub-knowledge graph corresponds to several entity names , and there is an association relationship between the entities corresponding to the entity names; the second knowledge graph library includes a plurality of second sub-knowledge graphs, each second sub-knowledge graph corresponds to several entity names, and the entity name corresponding to the entity name There is a relationship between them.
  14. 根据权利要求11所述的计算机设备,其中,还包括:The computer device of claim 11, further comprising:
    若所述目标用户画像中未绑定有第一输出结果,将所述目标用户画像更新作为所述当前用户画像,返回执行接收用户端发送的对话文本,对所述对话文本进行意图识别以得到意图识别关键词集,获取与所述意图识别关键词集中各意图识别关键词分别对应的当前实体名称,以组成当前实体名称集的步骤。If the target user portrait is not bound with the first output result, update the target user portrait as the current user portrait, return the execution to receive the dialog text sent by the client, and perform intent recognition on the dialog text to obtain The intent recognition keyword set is a step of acquiring the current entity name corresponding to each intent recognition keyword in the intent recognition keyword set, so as to form the current entity name set.
  15. 根据权利要求11所述的计算机设备,其中,所述获取与所述用户端对应的当前用户画像,包括:The computer device according to claim 11, wherein the acquiring the current user portrait corresponding to the client comprises:
    获取所述用户端对应的用户唯一识别码,根据所述用户唯一识别码在本地的用户画像库中获取与所述用户端对应的当前用户画像。The user unique identification code corresponding to the user terminal is obtained, and the current user portrait corresponding to the user terminal is obtained from the local user portrait library according to the user unique identification code.
  16. 根据权利要求11所述的计算机设备,其中,所述对所述对话文本进行意图识别以得到意图识别关键词集,包括:The computer device according to claim 11, wherein the performing intention recognition on the dialogue text to obtain the intention recognition keyword set comprises:
    将所述对话文本通过基于概率统计分词模型进行分词,得到与所述对话文本对应的分词结果;The dialogue text is subjected to word segmentation based on a probabilistic and statistical word segmentation model, and a word segmentation result corresponding to the dialogue text is obtained;
    通过词频-逆文本频率指数模型,抽取所述分词结果按频-逆文本频率指数降序排序后未超出预设的排名阈值的关键词,以组成意图识别关键词集。Through the word frequency-inverse text frequency index model, keywords that do not exceed the preset ranking threshold after the word segmentation results are sorted in descending order by the frequency-inverse text frequency index are extracted to form an intent identification keyword set.
  17. 根据权利要求11所述的计算机设备,其中,所述获取与所述意图识别关键词集中各意图识别关键词分别对应的当前实体名称,以组成当前实体名称集,包括:The computer device according to claim 11, wherein the obtaining the current entity names corresponding to the intent identification keywords in the intent identification keyword set respectively, to form the current entity name set, comprises:
    调用预先存储的实体名称库和同义词词典,并获取与所述同义词词典对应的词林;Call the pre-stored entity name library and thesaurus dictionary, and obtain the word forest corresponding to the thesaurus dictionary;
    根据所述词林在所述实体名称库中获取各意图识别关键词分别对应的当前实体名称,以组成当前实体名称集。According to the word forest, the current entity names corresponding to each intent identification keyword are obtained from the entity name database, so as to form a current entity name set.
  18. 根据权利要求12所述的计算机设备,其中,所述若所述目标用户画像中绑定有第一输出结果,根据所述第一输出结果在本地的第二知识图谱库中获取对应的第三当前关联问题集,将所述第三当前关联问题集发送至用户端之后,还包括:The computer device according to claim 12, wherein, if the target user portrait is bound with a first output result, obtain a corresponding third output result in a local second knowledge graph library according to the first output result The current associated question set, after the third current associated question set is sent to the client, further includes:
    将所述第一当前关联问题集、所述第二当前关联问题集及所述第三当前关联问题集均上传至区块链中。Uploading the first current set of related questions, the second set of current related questions, and the third set of current related questions to the blockchain.
  19. 根据权利要求17所述的计算机设备,其中,所述根据所述词林在所述实体名称库中获取各意图识别关键词分别对应的当前实体名称,以组成当前实体名称集,包括:The computer device according to claim 17, wherein the acquiring current entity names corresponding to each intent identification keyword in the entity name database according to the word forest, to form a current entity name set, comprising:
    在所述词林中获取与各意图识别关键词之间分别对应的语义距离为最小值的目标词语,组成当前实体名称集。In the word forest, target words whose semantic distances corresponding to each intent recognition keyword are the smallest are obtained to form a current entity name set.
  20. 一种计算机可读存储介质,其中,所述计算机可读存储介质存储有计算机程序,所述计算机程序当被处理器执行以下操作:A computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, and the computer program is executed by a processor to perform the following operations:
    若检测到用户端发送的智能会话连接指令,与所述用户端建立连接,并获取与所述用户端对应的当前用户画像;If an intelligent session connection instruction sent by the client is detected, a connection is established with the client, and a current user portrait corresponding to the client is obtained;
    接收用户端发送的对话文本,对所述对话文本进行意图识别以得到意图识别关键词集,获取与所述意图识别关键词集中各意图识别关键词分别对应的当前实体名称,以组成当前实体名称集;Receive the dialogue text sent by the user terminal, perform intention recognition on the dialogue text to obtain an intention recognition keyword set, and obtain the current entity name corresponding to each intention recognition keyword in the intention recognition keyword set, so as to form the current entity name set;
    根据所述当前实体名称集中各当前实体名称在本地的第一知识图谱库中获取对应的第一当前关联问题集,将所述第一当前关联问题集发送至用户端;其中,所述第一当前关联问题集中包括的关联问题子集的个数与所述当前实体名称集中包括的当前实体名称的个数相同,且每一当前实体名称对应所述第一当前关联问题集中的其中一个关联问题子集;According to the current entity names in the current entity name set, the corresponding first current associated question set is obtained in the local first knowledge graph library, and the first current associated question set is sent to the client; wherein the first current associated question set is The number of association question subsets included in the current association question set is the same as the number of current entity names included in the current entity name set, and each current entity name corresponds to one of the association questions in the first current association question set Subset;
    若检测用户端发送的与第一当前关联问题集对应的第一问题点击指令,获取与所述第一问题点击指令对应的第一目标实体名称,获取所述第一目标实体名称对应的第一关联实体名称集,根据所述第一关联实体名称集中各第一关联实体名称在本地的第一知识图谱库中获取对应的第二当前关联问题集,将所述第二当前关联问题集发送至用户端;其中,所述第二当前关联问题集中包括的关联问题子集的个数与所述第一关联实体名称集中包括的第一关联实体名称的个数相同,且每一第一关联实体名称对应所述第二当前关联问题集中的其中一个关联问题子集;以及If the first question click instruction corresponding to the first current associated question set sent by the client is detected, the first target entity name corresponding to the first question click instruction is obtained, and the first target entity name corresponding to the first target entity name is obtained. A set of associated entity names, obtaining a corresponding second current associated problem set in the local first knowledge graph library according to the names of each first associated entity in the first associated entity name set, and sending the second current associated problem set to The user terminal; wherein, the number of association question subsets included in the second current association question set is the same as the number of first association entity names included in the first association entity name set, and each first association entity a name corresponding to one of the subset of contextual questions in the second current set of contextual questions; and
    若检测用户端发送的与第二当前关联问题集对应的第二问题点击指令,获取与所述第二问题点击指令对应的第二目标实体名称增加完成标识以作为第一输出结果,将所述第一输出结果与所述当前用户画像进行一一映射以绑定。If the second question click instruction corresponding to the second current associated question set sent by the user terminal is detected, the second target entity name corresponding to the second question click instruction is acquired and the completion flag is added as the first output result, and the The first output result and the current user portrait are mapped one-to-one for binding.
PCT/CN2021/091291 2020-12-25 2021-04-30 Multi-knowledge graph based intelligent reply method and apparatus, computer device and storage medium WO2022134421A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011562101.X 2020-12-25
CN202011562101.XA CN112650854B (en) 2020-12-25 2020-12-25 Intelligent reply method and device based on multiple knowledge graphs and computer equipment

Publications (1)

Publication Number Publication Date
WO2022134421A1 true WO2022134421A1 (en) 2022-06-30

Family

ID=75362972

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/091291 WO2022134421A1 (en) 2020-12-25 2021-04-30 Multi-knowledge graph based intelligent reply method and apparatus, computer device and storage medium

Country Status (2)

Country Link
CN (1) CN112650854B (en)
WO (1) WO2022134421A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115412640A (en) * 2022-11-02 2022-11-29 北京北投智慧城市科技有限公司 Call center information processing method and device based on knowledge graph
CN115810422A (en) * 2022-12-07 2023-03-17 深圳市宁远科技股份有限公司 Internet intelligent automatic diagnosis and treatment response system based on AI technology
CN116821287A (en) * 2023-08-28 2023-09-29 湖南创星科技股份有限公司 Knowledge graph and large language model-based user psychological portrait system and method
CN117633254A (en) * 2024-01-26 2024-03-01 武汉大学 Knowledge-graph-based map retrieval user portrait construction method and system

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112650854B (en) * 2020-12-25 2022-09-27 平安科技(深圳)有限公司 Intelligent reply method and device based on multiple knowledge graphs and computer equipment
CN113314236A (en) * 2021-06-01 2021-08-27 南华大学 Intelligent question-answering system for hypertension
CN115146075B (en) * 2022-07-11 2023-03-10 中科雨辰科技有限公司 Data processing system for acquiring knowledge graph

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108519998A (en) * 2018-03-07 2018-09-11 北京云知声信息技术有限公司 The problem of knowledge based collection of illustrative plates bootstrap technique and device
US20190205726A1 (en) * 2018-01-02 2019-07-04 International Business Machines Corporation Cognitive conversational agent for providing personalized insights on-the-fly
US20200004875A1 (en) * 2018-06-29 2020-01-02 International Business Machines Corporation Query expansion using a graph of question and answer vocabulary
CN110929016A (en) * 2019-12-10 2020-03-27 北京爱医生智慧医疗科技有限公司 Intelligent question and answer method and device based on knowledge graph
CN112650854A (en) * 2020-12-25 2021-04-13 平安科技(深圳)有限公司 Intelligent reply method and device based on multiple knowledge graphs and computer equipment

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104408639A (en) * 2014-10-22 2015-03-11 百度在线网络技术(北京)有限公司 Multi-round conversation interaction method and system
CN105068661B (en) * 2015-09-07 2018-09-07 百度在线网络技术(北京)有限公司 Man-machine interaction method based on artificial intelligence and system
US20190164064A1 (en) * 2017-11-27 2019-05-30 Shanghai Xiaoi Robot Technology Co., Ltd. Question and answer interaction method and device, and computer readable storage medium
CN110046238B (en) * 2019-03-29 2024-03-26 华为技术有限公司 Dialogue interaction method, graphic user interface, terminal equipment and network equipment
CN110377715A (en) * 2019-07-23 2019-10-25 天津汇智星源信息技术有限公司 Reasoning type accurate intelligent answering method based on legal knowledge map
CN110704596B (en) * 2019-09-29 2023-03-31 北京百度网讯科技有限公司 Topic-based conversation method and device and electronic equipment
CN111026886B (en) * 2019-12-26 2023-05-02 成都航天科工大数据研究院有限公司 Multi-round dialogue processing method for professional scene
CN111339781B (en) * 2020-02-10 2023-05-30 科大讯飞华南人工智能研究院(广州)有限公司 Intention recognition method, device, electronic equipment and storage medium
CN111400479B (en) * 2020-04-14 2023-05-23 支付宝(杭州)信息技术有限公司 Method and device for identifying problems aiming at multi-round dialogue
CN111552880B (en) * 2020-04-30 2023-06-30 杭州网易再顾科技有限公司 Knowledge graph-based data processing method and device, medium and electronic equipment

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190205726A1 (en) * 2018-01-02 2019-07-04 International Business Machines Corporation Cognitive conversational agent for providing personalized insights on-the-fly
CN108519998A (en) * 2018-03-07 2018-09-11 北京云知声信息技术有限公司 The problem of knowledge based collection of illustrative plates bootstrap technique and device
US20200004875A1 (en) * 2018-06-29 2020-01-02 International Business Machines Corporation Query expansion using a graph of question and answer vocabulary
CN110929016A (en) * 2019-12-10 2020-03-27 北京爱医生智慧医疗科技有限公司 Intelligent question and answer method and device based on knowledge graph
CN112650854A (en) * 2020-12-25 2021-04-13 平安科技(深圳)有限公司 Intelligent reply method and device based on multiple knowledge graphs and computer equipment

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115412640A (en) * 2022-11-02 2022-11-29 北京北投智慧城市科技有限公司 Call center information processing method and device based on knowledge graph
CN115810422A (en) * 2022-12-07 2023-03-17 深圳市宁远科技股份有限公司 Internet intelligent automatic diagnosis and treatment response system based on AI technology
CN115810422B (en) * 2022-12-07 2023-12-29 深圳市维康致远科技有限公司 Internet intelligent automatic diagnosis and treatment response system based on AI technology
CN116821287A (en) * 2023-08-28 2023-09-29 湖南创星科技股份有限公司 Knowledge graph and large language model-based user psychological portrait system and method
CN116821287B (en) * 2023-08-28 2023-11-17 湖南创星科技股份有限公司 Knowledge graph and large language model-based user psychological portrait system and method
CN117633254A (en) * 2024-01-26 2024-03-01 武汉大学 Knowledge-graph-based map retrieval user portrait construction method and system
CN117633254B (en) * 2024-01-26 2024-04-05 武汉大学 Knowledge-graph-based map retrieval user portrait construction method and system

Also Published As

Publication number Publication date
CN112650854B (en) 2022-09-27
CN112650854A (en) 2021-04-13

Similar Documents

Publication Publication Date Title
WO2022134421A1 (en) Multi-knowledge graph based intelligent reply method and apparatus, computer device and storage medium
US9582757B1 (en) Scalable curation system
CN106960030B (en) Information pushing method and device based on artificial intelligence
US10630798B2 (en) Artificial intelligence based method and apparatus for pushing news
CN110837550A (en) Knowledge graph-based question and answer method and device, electronic equipment and storage medium
CN110597963B (en) Expression question-answering library construction method, expression search device and storage medium
CN109032381B (en) Input method and device based on context, storage medium and terminal
CN109345282A (en) A kind of response method and equipment of business consultation
CN109857846B (en) Method and device for matching user question and knowledge point
CN106874253A (en) Recognize the method and device of sensitive information
CN111310436A (en) Text processing method and device based on artificial intelligence and electronic equipment
CN112925898B (en) Question-answering method and device based on artificial intelligence, server and storage medium
CN111694940A (en) User report generation method and terminal equipment
US11019012B2 (en) File sending in instant messaging application
CN110147433A (en) A kind of text template extracting method based on dictionary tree
CN111552797B (en) Name prediction model training method and device, electronic equipment and storage medium
CN112632261A (en) Intelligent question and answer method, device, equipment and storage medium
CN110781676B (en) Text processing method, device and system, electronic equipment and storage medium
CN111552798A (en) Name information processing method and device based on name prediction model and electronic equipment
WO2020000752A1 (en) Counterfeit mobile application program determination method and system
CN112966076A (en) Intelligent question and answer generating method and device, computer equipment and storage medium
CN117112727A (en) Large language model fine tuning instruction set construction method suitable for cloud computing service
WO2021000400A1 (en) Hospital guide similar problem pair generation method and system, and computer device
CN108810640B (en) Television program recommendation method
CN110209804B (en) Target corpus determining method and device, storage medium and electronic device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21908432

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21908432

Country of ref document: EP

Kind code of ref document: A1