WO2022141875A1 - User intention recognition method and apparatus, device, and computer-readable storage medium - Google Patents

User intention recognition method and apparatus, device, and computer-readable storage medium Download PDF

Info

Publication number
WO2022141875A1
WO2022141875A1 PCT/CN2021/084250 CN2021084250W WO2022141875A1 WO 2022141875 A1 WO2022141875 A1 WO 2022141875A1 CN 2021084250 W CN2021084250 W CN 2021084250W WO 2022141875 A1 WO2022141875 A1 WO 2022141875A1
Authority
WO
WIPO (PCT)
Prior art keywords
intent
preset
success rate
node
intention
Prior art date
Application number
PCT/CN2021/084250
Other languages
French (fr)
Chinese (zh)
Inventor
李志韬
王健宗
程宁
吴天博
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2022141875A1 publication Critical patent/WO2022141875A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation or dialogue systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3346Query execution using probabilistic model
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology

Definitions

  • the present application belongs to the technical field of intelligent decision-making, and in particular, relates to a method, apparatus, device and computer-readable storage medium for identifying user intent.
  • the dialogue system is a human-computer interaction system based on natural language.
  • Intent recognition is an important part of the human-computer interaction system. It converts the content of the user's dialogue into a way that the computer can understand.
  • the recognized intent will directly affect the robot's next sentence. Whether what is said is relevant to what the user expresses, and whether the customer is satisfied.
  • intent recognition mainly includes two parts: intent detection and extraction of semantic slots.
  • the traditional methods of intent recognition have ranged from Hidden Markov Model (HMM), conditional random fields (CRF), and Support Vector Machine (SVM) to more popular in the past decade. Both the convolutional neural network and the recurrent neural network have good experimental results.
  • HMM Hidden Markov Model
  • CRF conditional random fields
  • SVM Support Vector Machine
  • the traditional intent recognition is to use the classifier to select the intent with the highest probability as the final intent.
  • some intent recognition errors may be caused by occasional speech recognition errors. Data, and accurately determine the user's target intention is an urgent problem to be solved at present.
  • One of the purposes of the embodiments of the present application is to provide a method, device, device and computer-readable storage medium for identifying user intent, aiming to solve the technical problem that the intent of the voice data input by the user can be accurately determined according to the voice data of the user .
  • a first aspect of the embodiments of the present application provides a method for identifying user intent, the method comprising:
  • each of the preset intent tags determines a preset number of candidate intent tags from a plurality of the preset intent tags
  • the candidate intent label with the highest dialogue success rate is determined as the intent label of the voice data input by the user.
  • a second aspect of the embodiments of the present application provides a user intent identification device, the user intent identification device includes an acquisition module, a generation module, a screening module, a first determination module, a second determination module, and a third determination module, wherein:
  • the obtaining module is used to obtain text information corresponding to the voice data input by the user;
  • the generating module is configured to input the text information into a preset intent classification model to obtain output probabilities of a plurality of preset intent labels used to represent voice intents;
  • the screening module configured to determine a preset number of candidate intent tags from a plurality of the preset intent tags according to the output probability of each of the preset intent tags;
  • the first determining module is configured to determine the dialog success rate of each intent node in the preset intent knowledge graph, wherein the preset intent knowledge graph is generated according to historical dialog data;
  • the second determining module configured to determine the dialogue success rate of each of the candidate intent tags according to the dialogue success rate of each of the intent nodes
  • the third determining module is configured to determine the candidate intent label with the highest dialogue success rate as the intent label of the voice data input by the user.
  • a third aspect of the embodiments of the present application provides a computer device, including: a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor executes the computer program When realized:
  • each of the preset intent tags determines a preset number of candidate intent tags from a plurality of the preset intent tags
  • the candidate intent label with the highest dialogue success rate is determined as the intent label of the voice data input by the user.
  • a fourth aspect of the embodiments of the present application provides a computer-readable storage medium, where the computer-readable storage medium stores a computer program, and the computer program is executed by a processor to implement:
  • each of the preset intent tags determines a preset number of candidate intent tags from a plurality of the preset intent tags
  • the candidate intent label with the highest dialogue success rate is determined as the intent label of the voice data input by the user.
  • a fifth aspect of the embodiments of the present application further provides a computer program product, when the computer program product is run on a computer device, the computer device can implement in real time:
  • each of the preset intent tags determines a preset number of candidate intent tags from a plurality of the preset intent tags
  • the candidate intent label with the highest dialogue success rate is determined as the intent label of the voice data input by the user.
  • the embodiments of the present application include the following advantages:
  • the output probabilities of multiple preset intent labels used to represent the voice intent are obtained;
  • the output probability of the preset intent label determine a preset number of candidate intent labels from multiple preset intent labels; then determine the dialog success rate of each intent node in the preset intent knowledge graph;
  • the dialog success rate determines the dialog success rate of each candidate intent label; and the candidate intent label with the highest dialog success rate is determined as the intent label of the speech data input by the user.
  • the method can obtain the output probability of multiple preset intent labels through the preset intent classification model. Combined with the output probability of multiple preset intent labels and the dialogue success rate of each intent node in the preset intent knowledge graph, it can accurately Identify the user's target intent label.
  • FIG. 1 is a schematic flowchart of steps of a method for identifying user intent provided by an embodiment of the present application
  • FIG. 2 is a schematic block diagram of a preset intent classification model provided by an embodiment of the present application.
  • FIG. 3 is a schematic flow chart of sub-steps of the method for identifying user intent in FIG. 1;
  • FIG. 4 is a schematic diagram of a scenario of a knowledge graph of a preset intent provided by an embodiment of the present application
  • FIG. 5 is a schematic block diagram of an apparatus for identifying user intent according to an embodiment of the present application.
  • FIG. 6 is a schematic block diagram of sub-modules of the device for identifying user intent in FIG. 5;
  • FIG. 7 is a schematic structural block diagram of a computer device according to an embodiment of the present application.
  • Embodiments of the present application provide a method, apparatus, device, and computer-readable storage medium for identifying user intent.
  • the method for identifying the user intent can be applied to a terminal device, and the terminal device can be an electronic device such as a mobile phone, a tablet computer, a notebook computer, a desktop computer, a personal digital assistant, and a wearable device.
  • FIG. 1 is a schematic flowchart of steps of a method for identifying user intent provided by an embodiment of the present application.
  • the method for identifying user intent includes steps S101 to S105.
  • Step S101 Acquire text information corresponding to the voice data input by the user, and input the text information into a preset intent classification model to obtain output probabilities of multiple preset intent labels representing the voice intent.
  • the preset intent classification model is a pre-trained model, and the preset intent classification model includes a plurality of neural network layers, and the neural network layers at least include at least one of the following: a vector extraction layer, a delay neural network layer, and a ReLU layer , residual network layer, summation layer, recurrent neural network layer, dropout layer and Solfmaxlayer layer.
  • the text information corresponding to the voice data input by the user is obtained, the text information is input into the vector extraction layer, multiple word vectors are obtained, and the multiple word vectors are input into the delay neural network layer , extract multiple word vector features, and input multiple word vector features to the ReLU layer to process multiple word vector features, reduce the gradient disappearance of word vector features, and obtain semantic label vectors.
  • Input word vectors to the summation layer to obtain a plurality of preliminary intent label vectors input a plurality of the preliminary intent label vectors to the recurrent neural network layer, obtain a plurality of candidate intent label vectors, and input the plurality of candidate intent label vectors to dropout layer to obtain multiple preset intent label vectors, and input multiple intent label vectors to the Solfmaxlayer layer to obtain the output probability of the preset intent labels.
  • the vector extraction layer can be selected according to the actual situation.
  • the vector extraction layer is a Word2Vec model
  • the delay neural network layer and the ReLU layer also include a residual network layer.
  • the residual network layer makes the delay neural network
  • the parameter processing of the layer and the ReLU layer is more accurate.
  • the dropout layer can prevent the candidate intent label vector from overfitting and improve the accuracy of the output intent label vector.
  • the training method of the preset intent classification model may be: acquiring sample text information, labeling the sample text information according to the category identifier corresponding to the output probability of the preset intent label, so as to construct sample data, and based on the sample data, neural
  • the network model is iteratively trained until the neural network mode converges, thereby obtaining the preset intent classification model.
  • the above-mentioned neural network models include convolutional neural network models, cyclic neural network models, and cyclic convolutional neural network models. Of course, other network models can also be used for training to obtain a preset intent classification model, which is not specifically limited in this application.
  • the text information corresponding to the voice data input by the user is acquired, and the text information is input into the preset intent classification model to obtain the output probabilities of multiple preset intent labels.
  • the output probability of multiple preset intent labels can be accurately and quickly determined, which greatly improves the user experience.
  • the method of acquiring the text information corresponding to the voice data input by the user is: acquiring the voice input by the user, and inputting the voice into a preset voice recognition model to obtain the text information.
  • the preset speech recognition model is a pre-trained neural network model, which is not specifically limited in this application.
  • the text information corresponding to the voice data transmitted by other devices is obtained, and the text information corresponding to the voice data input by the user is obtained. It can be understood that there are other ways of acquiring the text information corresponding to the voice data of the user data, which is not specifically limited in this application.
  • Step S102 according to the output probability of each of the preset intent tags, determine a preset number of candidate intent tags from a plurality of the preset intent tags.
  • the candidate intent label is the intent label whose intent is closer to the user's intent.
  • the plurality of preset intent tags are sorted in descending order of output probability to obtain an intent tag queue; the preset intent tags are sequentially selected from the intent tag queue until a preset intent tag is obtained. the number of said candidate intent labels.
  • the preset intent label may be set according to the actual situation, which is not specifically limited in this application. For example, the preset intent label may be set to 5.
  • the output probability of the preset intent tag 1 is 10%
  • the output probability of the preset intent tag 2 is 20%
  • the output probability of the preset intent tag 3 is 5%
  • the output probability of the preset intent tag 4 is 12%.
  • the output probability of preset intent tag 5 is 7%
  • the output probability of preset intent tag 6 is 18%
  • the output probability of preset intent tag 7 is 25%
  • the output probability of preset intent tag 8 is 14%
  • the output probability of the preset intent label 9 is 4%
  • the output probability of the preset intent label 10 is 28%. According to the probability of each preset intent label, the 10 intent labels are sorted in descending order to obtain the intent labels.
  • the queue is [preset intent tag 10, preset intent tag 7, preset intent tag 2, preset intent tag 6, preset intent tag 8, preset intent tag 4, preset intent tag 1, preset intent tag 5 , preset intent tag 3, preset intent tag 9], the number of acquired preset intent tags is 5, select the first 5 candidate intent tags from the intent tag queue, and obtain the candidate intent tags as preset intent tags 10, preset intent tags Preset intent tag 7, preset intent tag 2, preset intent tag 6, and preset intent tag 8. By sorting the preset intent labels, candidate intent labels can be quickly selected.
  • Step S103 determining the dialog success rate of each intent node in the preset intent knowledge graph, wherein the preset intent knowledge graph is generated according to historical dialog data.
  • the preset intent knowledge graph is generated according to historical dialogue data. Specifically, all historical dialog data is collected, the historical dialog data is classified, and the associated dialog data is associated to obtain the preset intent knowledge graph.
  • a preset intent knowledge graph is obtained, wherein the preset intent knowledge graph is generated according to historical dialogue data; the success of the intent node corresponding to each intent node in the preset intent knowledge graph is rate as the dialog success rate for each intent node.
  • the dialog success rate of each intent node can be accurately determined by the preset intent knowledge graph.
  • step S103 includes sub-steps S1031 to S1034 .
  • Sub-step S1031 Acquire multiple flow paths of each intention node from the preset intention knowledge graph, and count the number of the multiple flow paths, wherein each flow path includes multiple intention nodes.
  • each flow path includes multiple intent nodes.
  • the preset intent knowledge graph includes intent node a, intent node b, intent node c, intent node d, intent node e, intent node f, and intent node g, and the flow of intent node a.
  • the path includes the flow path where intent node a connects intent node b and intent node c, intent node a connects intent node b connects intent node e connects intent node f, and intent node a connects intent node b connects intent node e connects intent node.
  • intent node a connects intent node b connects intent node e connects intent node g's flow path
  • intent node a connects intent node b connects intent node e connects intent node f
  • the flow path and intent node a connects intent node d
  • the flow path connecting intention node g, the flow path of intention node b includes the flow path connecting intention node b to intention node c, the flow path connecting intention node b to intention node e and the flow path of intention node b connecting intention node e to the intention node
  • the flow path of g is connected to the flow path of intention node b and the flow path of intention node e is connected to the flow path of intention node a.
  • the flow path of intention node e includes the flow path of intention node e connecting intention node a, and the flow path and intention of intention node e connecting intention node f.
  • Node e is connected to the flow path of intention node g
  • the flow path of intention node d includes the flow path of intention node connected to intention node g
  • intention node c and intention node f have no flow path.
  • the number of paths of the flow path of the intention node a is 5, the number of paths of the flow path of the intention node b is 4, and the number of paths of the flow path of the intention node c is 0
  • the number of paths of the flow path of the intent node d is 1, the number of paths of the flow path of the intention node e is 3, the number of paths of the flow path of the intention node f is 0, and the number of paths of the flow path of the intention node g is 0
  • the number of paths is 0.
  • Sub-step S1032 Determine the flow path whose attribute identifier of the last intent node is the preset attribute identifier as a successful flow path.
  • the attribute identifier of the intent node is a keyword set according to the actual situation, for example, keywords such as time, place, and event.
  • the flow path identified by the preset attribute is determined as the flow path that has an attribute of the last intent node in the flow path.
  • the preset attribute identifier is time, and if the last intent node in the flow path is a flow path of time, the flow path is a successful flow path.
  • Sub-step S1033 Count the number of successful flow paths among the multiple flow paths of each intention node.
  • the number of paths of the multiple flow paths of each intent node is determined according to the preset intent knowledge graph.
  • the attribute identifier of the last intent node in the multiple flow paths of each intent node is queried from the preset intent knowledge graph as g.
  • the number of where the attribute of the last intent node in the multiple flow paths of the intent node a is 2, and the attribute of the last intent node in the multiple flow paths of the intent node b is the number of g is 1 , the attribute identifier of the last intent node in the multiple flow paths of the intent node c is 0, the number of the attribute identifier of the last intent node in the multiple flow paths of the intent node d is 1, and the intent node e
  • the attribute identifier of the last intent node in the multiple flow paths is 1, the attribute identifier of the last intent node in the multiple flow paths of the intent node f is 0, and the number of intent node g is multiple flow paths
  • the number of successful circulations of intent node a is 2
  • the number of successful circulations of intent node b is 1
  • the number of successful circulations of intent node c is obtained.
  • the number of successful circulations obtained by intent node d is 1
  • the number of successful circulations obtained by intent node e is 1
  • the number of successful circulations obtained by intent node f is 0, and the number of successful circulations obtained by intent node f is 1 Second-rate.
  • Sub-step S1034 Calculate the percentage of the number of successful circulation paths in the multiple circulation paths of each intention node to the number of all circulation paths, and use the calculated percentage as the dialogue success rate of each intention node.
  • the number of successful circulation of intent node a is 2, the number of successful circulation of intent node b is 1, the number of successful circulation of intent node c is 0, the number of successful circulation of intent node d is 1, and the number of successful circulation of intent node e1 2 times, the number of successful circulation of intention node f is 0, the number of successful circulation of intention node g is 0, the number of paths of the circulation path of intention node a is 5, and the number of paths of the circulation path of intention node b is 4
  • the number of paths for the flow path of the intent node c is 0, the number of paths for the flow path of the intention node d is 1, the number of paths of the flow path of the intention node e is 3, and the number of paths of the flow path of the intention node f
  • the number of paths is 0, and the number of paths of the flow path of intention node g is 0.
  • the number of successful flow of intention node a accounts for 40% of the corresponding number of paths
  • the number of successful flow of intention node b accounts for 40% of the corresponding number of paths.
  • 25% of the number of paths the number of successful circulation of intention node c accounts for 0% of the number of corresponding paths
  • the number of successful circulation of intention node d accounts for 100% of the number of corresponding paths
  • the number of successful circulation of intention node e accounts for the corresponding number of paths.
  • the number of successful circulation of intention node f accounts for 0% of the number of corresponding paths
  • the number of successful circulation of intention node g accounts for 100% of the number of corresponding paths
  • the number of intention nodes a, b, c , d, e, f, and g’s successful flow times account for the number of corresponding paths.
  • the dialog success rate for determining intent node d is 100%
  • the dialog success rate for determining intent node e is 33.3%
  • the dialog success rate for determining intent node f is 0%
  • the dialog success rate for determining intent node g is 100%.
  • Step S104 Determine the dialog success rate of each candidate intent tag according to the dialog success rate of each intent node.
  • the success rate of the candidate intent label is between 0 and 100%, and the larger the candidate intent label, the higher the probability of the successful dialogue of the candidate intent label.
  • the dialog success rate of each intent node and the preset intent label corresponding to each intent node are mapped to obtain the dialog success rate of each preset intent label;
  • the tags are mapped to each other, and the dialogue success rate of the mapped preset intent tags is used as the dialogue success rate of the candidate intent tags.
  • Step S105 Determine the candidate intent tag with the highest dialogue success rate as the intent tag of the voice data input by the user.
  • the target intent label is the intent label closest to the user's intent.
  • a plurality of candidate intent tags are sorted according to the dialogue success rate of each candidate intent tag to obtain a candidate intent tag queue, and the candidate intent tag with the highest dialogue success rate is selected from the candidate intent tag queue as the user's target.
  • Intent label The candidate intent labels are sorted by the success rate and the candidate intent label with the highest success rate is selected as the user's target intent label, which greatly improves the accuracy of determining the user's intent.
  • the dialog success rate of candidate intent label 1 is 50%
  • the dialog success rate of candidate intent label 2 is 25%
  • the dialog success rate of candidate intent label 3 is 15%
  • the dialog success rate of candidate intent label 4 is 60%.
  • the dialogue success rate of candidate intent label 5 is 40%
  • the candidate intent label 1, candidate intent label 2, candidate intent label 3, candidate intent label 4 and candidate intent label 5 are sorted according to the conversation success rate of candidate intent labels, and we get
  • the candidate intent label queue is, [candidate intent label 4, candidate intent label 1, candidate intent label 5, candidate intent label 2, candidate intent label 3], from the candidate intent label queue, select the candidate intent label 4 with the highest dialogue success rate as the The user's goal intent label.
  • the user intent recognition method obtains the output probability of multiple preset intent labels used to represent the voice intent by acquiring the text information corresponding to the voice data input by the user, and inputting the text information into the preset intent classification model. ; Then according to the output probability of each preset intent tag, determine a preset number of candidate intent tags from multiple preset intent tags; then determine the dialogue success rate of each intent node in the preset intent knowledge graph; The dialog success rate of each intent node determines the dialog success rate of each candidate intent label; and the candidate intent label with the highest dialog success rate is determined as the intent label of the speech data input by the user.
  • the method can obtain the output probability of multiple preset intent labels through the preset intent classification model. Combined with the output probability of multiple preset intent labels and the dialogue success rate of each intent node in the preset intent knowledge graph, it can accurately Identify the user's target intent label.
  • FIG. 5 is a schematic block diagram of an apparatus for recognizing user intent according to an embodiment of the present application.
  • the user intent recognition apparatus 200 includes an acquisition module 210 , a generation module 220 , a screening module 230 , a first determination module 240 , a second determination module 250 and a third determination module 260 , wherein,
  • the obtaining module 210 is configured to obtain text information corresponding to the voice data input by the user.
  • the generating module 220 is configured to input the text information into a preset intent classification model to obtain output probabilities of multiple preset intent labels used to represent the voice intent.
  • the screening module 230 is configured to determine a preset number of candidate intent tags from a plurality of the preset intent tags according to the output probability of each preset intent tag.
  • the first determining module 240 is configured to determine the dialog success rate of each intent node in the preset intent knowledge graph, wherein the preset intent knowledge graph is generated according to historical dialog data.
  • the second determination module 250 is configured to determine the dialogue success rate of each of the candidate intent tags according to the dialogue success rate of each of the intent nodes.
  • the third determining module 260 is configured to determine the candidate intent label with the highest dialogue success rate as the intent label of the voice data input by the user.
  • the screening module 230 further includes the following sub-modules:
  • the sorting sub-module is used for sorting a plurality of the preset intent tags according to the descending output probability to obtain an intent tag queue.
  • a selection sub-module configured to sequentially select the preset intent tags from the intent tag queue until a preset number of the candidate intent tags are obtained.
  • the first determining module 240 further includes the following sub-modules:
  • the knowledge graph acquisition sub-module is used to acquire the preset intent knowledge graph.
  • a sub-module is set for taking the success rate of the intent node corresponding to each intent node in the preset intent knowledge graph as the dialog success rate of each intent node.
  • the first determination module 240 includes an acquisition sub-module 241, a statistics module 242, a determination sub-module 243 and a calculation module 244, wherein:
  • the acquisition sub-module 241 is configured to acquire multiple flow paths of each intent node from the preset intent knowledge graph.
  • the statistics module 242 is configured to count the number of the multiple flow paths, wherein each flow path includes multiple intent nodes.
  • the determining submodule 243 is configured to determine the flow path whose attribute identifier of the last intent node is the preset attribute identifier as a successful flow path.
  • the statistics module 242 is further configured to count the number of successful flow paths in the multiple flow paths of each intention node.
  • the calculation module 244 is configured to calculate the percentage of the number of successful circulation paths in the multiple circulation paths of each intention node to the number of all circulation paths, and use the calculated percentage as the dialogue success rate of each intention node.
  • the second determining module 250 further includes the following sub-modules:
  • the first mapping sub-module is configured to map the dialogue success rate of each of the intent nodes and the preset intent labels corresponding to each of the intent nodes to obtain the dialog success rate of each preset intent label.
  • the second mapping submodule is configured to map the candidate intent tag with the preset intent tag, and use the mapped dialog success rate of the preset intent tag as the dialog success rate of the candidate intent tag.
  • the preset intent classification model includes a vector extraction layer, a time-delay neural network layer, a ReLU layer, a residual network layer, a summation layer, a recurrent neural network layer, a dropout layer, and a Solfmaxlayer layer; the generating Module 220 also includes the following sub-modules:
  • the first input sub-module is used for inputting the text information to the vector extraction layer to obtain a plurality of word vectors.
  • the second input sub-module is used for inputting a plurality of the word vectors to the time delay neural network layer, and extracting a plurality of word vector features.
  • the third input sub-module is used for inputting a plurality of the word vector features to the ReLU layer to obtain a semantic label vector.
  • the fourth input sub-module is configured to input the semantic label vector and a plurality of the word vectors into the summation layer to obtain a plurality of preliminary intent label vectors.
  • the fifth input sub-module is used for inputting a plurality of the preliminary intent label vectors to the recurrent neural network layer to obtain a plurality of candidate intent label vectors.
  • the seventh input sub-module is used for inputting a plurality of the candidate intent label vectors to the dropout layer to obtain a plurality of preset intent label vectors.
  • the eighth input sub-module is used for inputting a plurality of the intent label vectors to the Solfmaxlayer layer to obtain the output probability of the preset intent label.
  • the apparatuses provided by the above embodiments may be implemented in the form of a computer program, and the computer program may be executed on the computer device as shown in FIG. 7 .
  • FIG. 7 is a schematic block diagram of the structure of a computer device according to an embodiment of the present application.
  • the computer device includes a processor, a memory, and a network interface connected by a system bus, wherein the memory may include a non-volatile storage medium and an internal memory.
  • the nonvolatile storage medium can store operating systems and computer programs.
  • the computer program includes program instructions that, when executed, can cause the processor to execute any method for identifying user intent.
  • This network interface is used for communication.
  • FIG. 7 is only a block diagram of a part of the structure related to the solution of the present application, and does not constitute a limitation on the computer equipment to which the solution of the present application is applied. Include more or fewer components than shown in the figures, or combine certain components, or have a different arrangement of components.
  • the bus is, for example, an I2C (Inter-integrated Circuit) bus
  • the memory can be a Flash chip, a read-only memory (ROM, Read-Only Memory) magnetic disk, an optical disk, a U disk or a mobile hard disk, etc.
  • the processor can be Central Processing Unit (CPU)
  • the processor can also be other general-purpose processors, digital signal processors (Digital Signal Processors, DSPs), application specific integrated circuits (Application Specific Integrated Circuits, ASICs), field programmable gates Field-Programmable Gate Array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc.
  • the general-purpose processor can be a microprocessor or the processor can also be any conventional processor or the like.
  • a computer device includes a memory, a processor, and a computer program stored in the memory and executable on the processor, and the processor implements when executing the computer program:
  • each of the preset intent tags determines a preset number of candidate intent tags from a plurality of the preset intent tags
  • the candidate intent label with the highest dialogue success rate is determined as the intent label of the voice data input by the user.
  • the processor when executing the computer program, implements:
  • the preset intent tags are sequentially selected from the intent tag queue until a preset number of the candidate intent tags are obtained.
  • the processor when executing the computer program, implements:
  • the success rate of the intent node corresponding to each intent node in the preset intent knowledge graph is taken as the dialog success rate of each intent node.
  • the processor when executing the computer program, implements:
  • each intention node from the preset intention knowledge graph, and count the number of the multiple flow paths, wherein each flow path includes multiple intention nodes;
  • the processor when executing the computer program, implements:
  • the candidate intent tag is mapped with the preset intent tag, and the dialog success rate of the mapped preset intent tag is used as the dialog success rate of the candidate intent tag.
  • the preset intent classification model includes a vector extraction layer, a delay neural network layer, a ReLU layer, a residual network layer, a summation layer, a recurrent neural network layer, a dropout layer, and a Solfmaxlayer layer; the processing When the computer executes the computer program, it realizes:
  • a plurality of the intent label vectors are input to the Solfmaxlayer layer to obtain the output probability of the preset intent label.
  • Embodiments of the present application further provide a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, the computer program includes program instructions, and the method implemented when the program instructions are executed may refer to this document Various embodiments of methods for identifying user intent are claimed. specific:
  • each of the preset intent tags determines a preset number of candidate intent tags from a plurality of the preset intent tags
  • the candidate intent label with the highest dialogue success rate is determined as the intent label of the voice data input by the user.
  • the computer program when executed by the processor, further implements:
  • the preset intent tags are sequentially selected from the intent tag queue until a preset number of the candidate intent tags are obtained.
  • the computer program when executed by the processor, further implements:
  • the success rate of the intent node corresponding to each intent node in the preset intent knowledge graph is taken as the dialog success rate of each intent node.
  • the computer program when executed by the processor, further implements:
  • each intention node from the preset intention knowledge graph, and count the number of the multiple flow paths, wherein each flow path includes multiple intention nodes;
  • the computer program when executed by the processor, further implements:
  • the candidate intent tag is mapped with the preset intent tag, and the dialog success rate of the mapped preset intent tag is used as the dialog success rate of the candidate intent tag.
  • the preset intent classification model includes a vector extraction layer, a time-delay neural network layer, a ReLU layer, a residual network layer, a summation layer, a recurrent neural network layer, a dropout layer, and a Solfmaxlayer layer; the computer When the program is executed by the processor, it also implements:
  • a plurality of the intent label vectors are input to the Solfmaxlayer layer to obtain the output probability of the preset intent label.
  • the computer-readable storage medium may be an internal storage unit of the computer device described in the foregoing embodiments, such as a hard disk or a memory of the computer device.
  • the computer-readable storage medium may be non-volatile or volatile.
  • the computer-readable storage medium may also be an external storage device of the computer device, such as a plug-in hard disk equipped on the computer device, a smart memory card (Smart Media Card, SMC), a secure digital (Secure Digital, SD) ) card, Flash Card, etc.

Abstract

The present invention relates to the technical field of intelligent decision making. Provided are a user intention recognition method and apparatus, a device, and a computer-readable storage medium. The method comprises: obtaining text information corresponding to speech data input by a user, and inputting the text information into a preset intention classification model to obtain output probabilities of a plurality of preset intention labels for representing speech intentions (S101); determining a preset number of candidate intention labels from the plurality of preset intention labels according to the output probability of each preset intention label (S102); determining a dialogue success rate of each intention node in a preset intention knowledge graph; determining a dialogue success rate of each candidate intention label according to the dialogue success rate of each intention node (S104); and determining the candidate intention label having the highest dialogue success rate as an intention label of the speech data input by the user (S105). A target intention label of a user is determined by combining the preset intention classification model with the dialogue success rate of each intention node in the preset intention knowledge graph.

Description

用户意图识别方法、装置、设备及计算机可读存储介质User intent recognition method, apparatus, device, and computer-readable storage medium
本申请要求于2020年12月30日在中国专利局提交的、申请号为202011631344.4、发明名称为“用户意图识别方法、装置、设备及计算机可读存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application with the application number 202011631344.4 and the invention title "User Intent Recognition Method, Apparatus, Equipment and Computer-readable Storage Medium", which was filed in the China Patent Office on December 30, 2020. The entire contents of this application are incorporated by reference.
技术领域technical field
本申请属于智能决策技术领域,尤其涉及一种用户意图识别方法、装置、设备及计算机可读存储介质。The present application belongs to the technical field of intelligent decision-making, and in particular, relates to a method, apparatus, device and computer-readable storage medium for identifying user intent.
背景技术Background technique
对话系统是是一种基于自然语言的人机交互系统,意图识别是人机交互系统的重要部分,它将用户对话的内容转化成计算机可以理解的方式,识别的意图会直接影响到机器人下一句说的内容是否和用户表达的相关,是否让客户满意。其中,意图识别主要包括两个部分:意图检测和提取语义槽。传统上做意图识别的方法从隐形马尔科夫模型(Hidden Markov Model,HMM)、条件随机场(conditional random fields,CRF)、支持向量机(Support Vector Machine,SVM),而到近十年比较流行的卷积神经网络和循环神经网络,都有不错的实验结果。但是,发明人意识到,这些模型只能在较少的上下文输入和大规模训练语料上才能得到比较好的效果。而且,传统的意图识别是通过分类器选择概率最大的那个意图作为最终意图,这样的结果在实际应用场景中,会因为偶尔的语音识别错误导致一些意图的识别错误,因此,如何根据用户的语音数据,准确的确定用户的目标意图是目前亟待解决的问题。The dialogue system is a human-computer interaction system based on natural language. Intent recognition is an important part of the human-computer interaction system. It converts the content of the user's dialogue into a way that the computer can understand. The recognized intent will directly affect the robot's next sentence. Whether what is said is relevant to what the user expresses, and whether the customer is satisfied. Among them, intent recognition mainly includes two parts: intent detection and extraction of semantic slots. The traditional methods of intent recognition have ranged from Hidden Markov Model (HMM), conditional random fields (CRF), and Support Vector Machine (SVM) to more popular in the past decade. Both the convolutional neural network and the recurrent neural network have good experimental results. However, the inventors realized that these models can only get better results with less contextual input and large-scale training corpus. Moreover, the traditional intent recognition is to use the classifier to select the intent with the highest probability as the final intent. In practical application scenarios, some intent recognition errors may be caused by occasional speech recognition errors. Data, and accurately determine the user's target intention is an urgent problem to be solved at present.
技术问题technical problem
本申请实施例的目的之一在于:提供一种用户意图识别方法、装置、设备及计算机可读存储介质,旨在解决根据用户的语音数据,可准确的确定用户输入的语音数据意图的技术问题。One of the purposes of the embodiments of the present application is to provide a method, device, device and computer-readable storage medium for identifying user intent, aiming to solve the technical problem that the intent of the voice data input by the user can be accurately determined according to the voice data of the user .
技术解决方案technical solutions
为解决上述技术问题,本申请实施例采用的技术方案是:In order to solve the above-mentioned technical problems, the technical solutions adopted in the embodiments of the present application are:
本申请实施例的第一方面提供了一种用户意图识别方法,所述方法包括:A first aspect of the embodiments of the present application provides a method for identifying user intent, the method comprising:
获取用户输入的语音数据对应的文本信息,并将所述文本信息输入至预设意图分类模型,得到用于表示语音意图的多个预设意图标签的输出概率;acquiring text information corresponding to the voice data input by the user, and inputting the text information into a preset intent classification model to obtain output probabilities of multiple preset intent labels used to represent voice intent;
根据每个所述预设意图标签的输出概率,从多个所述预设意图标签中确定预设数量个候选意图标签;According to the output probability of each of the preset intent tags, determine a preset number of candidate intent tags from a plurality of the preset intent tags;
确定预设意图知识图谱中的每个意图节点的对话成功率,其中,所述预设意图知识图谱是根据历史对话数据生成的;determining the dialog success rate of each intent node in the preset intent knowledge graph, wherein the preset intent knowledge graph is generated according to historical dialog data;
根据每个所述意图节点的对话成功率确定每个所述候选意图标签的对话成功率;Determine the dialog success rate of each of the candidate intent labels according to the dialog success rate of each of the intent nodes;
将对话成功率最高的候选意图标签确定为所述用户输入的语音数据的意图标签。The candidate intent label with the highest dialogue success rate is determined as the intent label of the voice data input by the user.
本申请实施例的第二方面提供了一种用户意图识别装置,所述用户意图识别装置包括获取模块、生成模块、筛选模块、第一确定模块、第二确定模块和第三确定模块,其中:A second aspect of the embodiments of the present application provides a user intent identification device, the user intent identification device includes an acquisition module, a generation module, a screening module, a first determination module, a second determination module, and a third determination module, wherein:
所述获取模块,用于获取用户输入的语音数据对应的文本信息;The obtaining module is used to obtain text information corresponding to the voice data input by the user;
所述生成模块,用于将所述文本信息输入至预设意图分类模型,得到用于表示语音意图的多个预设意图标签的输出概率;The generating module is configured to input the text information into a preset intent classification model to obtain output probabilities of a plurality of preset intent labels used to represent voice intents;
所述筛选模块,用于根据每个所述预设意图标签的输出概率,从多个所述预设意图标签中确定预设数量个候选意图标签;the screening module, configured to determine a preset number of candidate intent tags from a plurality of the preset intent tags according to the output probability of each of the preset intent tags;
所述第一确定模块,用于确定预设意图知识图谱中的每个意图节点的对话成功率,其中,所述预设意图知识图谱是根据历史对话数据生成的;The first determining module is configured to determine the dialog success rate of each intent node in the preset intent knowledge graph, wherein the preset intent knowledge graph is generated according to historical dialog data;
所述第二确定模块,用于根据每个所述意图节点的对话成功率确定每个所述候选意图标签的对话成功率;the second determining module, configured to determine the dialogue success rate of each of the candidate intent tags according to the dialogue success rate of each of the intent nodes;
所述第三确定模块,用于将对话成功率最高的候选意图标签确定为所述用户输入的语 音数据的意图标签。The third determining module is configured to determine the candidate intent label with the highest dialogue success rate as the intent label of the voice data input by the user.
本申请实施例的第三方面提供了一种计算机设备,包括:存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现:A third aspect of the embodiments of the present application provides a computer device, including: a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor executes the computer program When realized:
获取用户输入的语音数据对应的文本信息,并将所述文本信息输入至预设意图分类模型,得到用于表示语音意图的多个预设意图标签的输出概率;acquiring text information corresponding to the voice data input by the user, and inputting the text information into a preset intent classification model to obtain output probabilities of multiple preset intent labels used to represent voice intent;
根据每个所述预设意图标签的输出概率,从多个所述预设意图标签中确定预设数量个候选意图标签;According to the output probability of each of the preset intent tags, determine a preset number of candidate intent tags from a plurality of the preset intent tags;
确定预设意图知识图谱中的每个意图节点的对话成功率,其中,所述预设意图知识图谱是根据历史对话数据生成的;determining the dialog success rate of each intent node in the preset intent knowledge graph, wherein the preset intent knowledge graph is generated according to historical dialog data;
根据每个所述意图节点的对话成功率确定每个所述候选意图标签的对话成功率;Determine the dialog success rate of each of the candidate intent labels according to the dialog success rate of each of the intent nodes;
将对话成功率最高的候选意图标签确定为所述用户输入的语音数据的意图标签。The candidate intent label with the highest dialogue success rate is determined as the intent label of the voice data input by the user.
本申请实施例的第四方面提供了一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行时实现:A fourth aspect of the embodiments of the present application provides a computer-readable storage medium, where the computer-readable storage medium stores a computer program, and the computer program is executed by a processor to implement:
获取用户输入的语音数据对应的文本信息,并将所述文本信息输入至预设意图分类模型,得到用于表示语音意图的多个预设意图标签的输出概率;acquiring text information corresponding to the voice data input by the user, and inputting the text information into a preset intent classification model to obtain output probabilities of multiple preset intent labels used to represent voice intent;
根据每个所述预设意图标签的输出概率,从多个所述预设意图标签中确定预设数量个候选意图标签;According to the output probability of each of the preset intent tags, determine a preset number of candidate intent tags from a plurality of the preset intent tags;
确定预设意图知识图谱中的每个意图节点的对话成功率,其中,所述预设意图知识图谱是根据历史对话数据生成的;determining the dialog success rate of each intent node in the preset intent knowledge graph, wherein the preset intent knowledge graph is generated according to historical dialog data;
根据每个所述意图节点的对话成功率确定每个所述候选意图标签的对话成功率;Determine the dialog success rate of each of the candidate intent labels according to the dialog success rate of each of the intent nodes;
将对话成功率最高的候选意图标签确定为所述用户输入的语音数据的意图标签。The candidate intent label with the highest dialogue success rate is determined as the intent label of the voice data input by the user.
本申请实施例的第五方面还提供了一种计算机程序产品,当所述计算机程序产品在计算机设备上运行时,使得所述计算机设备行时实现:A fifth aspect of the embodiments of the present application further provides a computer program product, when the computer program product is run on a computer device, the computer device can implement in real time:
获取用户输入的语音数据对应的文本信息,并将所述文本信息输入至预设意图分类模型,得到用于表示语音意图的多个预设意图标签的输出概率;acquiring text information corresponding to the voice data input by the user, and inputting the text information into a preset intent classification model to obtain output probabilities of multiple preset intent labels used to represent voice intent;
根据每个所述预设意图标签的输出概率,从多个所述预设意图标签中确定预设数量个候选意图标签;According to the output probability of each of the preset intent tags, determine a preset number of candidate intent tags from a plurality of the preset intent tags;
确定预设意图知识图谱中的每个意图节点的对话成功率,其中,所述预设意图知识图谱是根据历史对话数据生成的;determining the dialog success rate of each intent node in the preset intent knowledge graph, wherein the preset intent knowledge graph is generated according to historical dialog data;
根据每个所述意图节点的对话成功率确定每个所述候选意图标签的对话成功率;Determine the dialog success rate of each of the candidate intent labels according to the dialog success rate of each of the intent nodes;
将对话成功率最高的候选意图标签确定为所述用户输入的语音数据的意图标签。The candidate intent label with the highest dialogue success rate is determined as the intent label of the voice data input by the user.
有益效果beneficial effect
与现有技术相比,本申请实施例包括以下优点:Compared with the prior art, the embodiments of the present application include the following advantages:
本申请实施例,通过获取用户输入的语音数据对应的文本信息,并将文本信息输入至预设意图分类模型,得到用于表示语音意图的多个预设意图标签的输出概率;然后根据每个预设意图标签的输出概率,从多个预设意图标签中确定预设数量个候选意图标签;之后确定预设意图知识图谱中的每个意图节点的对话成功率;再根据每个意图节点的对话成功率确定每个候选意图标签的对话成功率;并将对话成功率最高的候选意图标签确定为用户输入的语音数据的意图标签。本方法通过预设意图分类模型可以得到多个预设意图标签的输出概率,结合多个预设意图标签的输出概率和预设意图知识图谱中的每个意图节点的对话成功率,可以准确的确定用户的目标意图标签。In this embodiment of the present application, by acquiring the text information corresponding to the voice data input by the user, and inputting the text information into the preset intent classification model, the output probabilities of multiple preset intent labels used to represent the voice intent are obtained; The output probability of the preset intent label, determine a preset number of candidate intent labels from multiple preset intent labels; then determine the dialog success rate of each intent node in the preset intent knowledge graph; The dialog success rate determines the dialog success rate of each candidate intent label; and the candidate intent label with the highest dialog success rate is determined as the intent label of the speech data input by the user. The method can obtain the output probability of multiple preset intent labels through the preset intent classification model. Combined with the output probability of multiple preset intent labels and the dialogue success rate of each intent node in the preset intent knowledge graph, it can accurately Identify the user's target intent label.
附图说明Description of drawings
为了更清楚地说明本申请实施例中的技术方案,下面将对实施例或示范性技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附 图获得其它的附图。In order to illustrate the technical solutions in the embodiments of the present application more clearly, the following briefly introduces the accompanying drawings that are used in the description of the embodiments or exemplary technologies. Obviously, the drawings in the following description are only for the present application. In some embodiments, for those of ordinary skill in the art, other drawings can also be obtained according to these drawings without any creative effort.
图1为本申请实施例提供的一种用户意图识别方法的步骤流程示意图;1 is a schematic flowchart of steps of a method for identifying user intent provided by an embodiment of the present application;
图2为本申请实施例提供的预设意图分类模型的一示意性框图;2 is a schematic block diagram of a preset intent classification model provided by an embodiment of the present application;
图3为图1中的用户意图识别方法的子步骤流程示意图;3 is a schematic flow chart of sub-steps of the method for identifying user intent in FIG. 1;
图4为本申请实施例提供的预设意图知识图谱的一场景示意图;FIG. 4 is a schematic diagram of a scenario of a knowledge graph of a preset intent provided by an embodiment of the present application;
图5为本申请实施例提供的一种用户意图识别装置的示意性框图;FIG. 5 is a schematic block diagram of an apparatus for identifying user intent according to an embodiment of the present application;
图6为图5中的用户意图识别装置的子模块的示意性框图;6 is a schematic block diagram of sub-modules of the device for identifying user intent in FIG. 5;
图7为本申请实施例提供的一种计算机设备的结构示意性框图。FIG. 7 is a schematic structural block diagram of a computer device according to an embodiment of the present application.
本申请目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。The realization, functional characteristics and advantages of the purpose of the present application will be further described with reference to the accompanying drawings in conjunction with the embodiments.
本发明的实施方式Embodiments of the present invention
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, not all of the embodiments. Based on the embodiments in the present application, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present application.
附图中所示的流程图仅是示例说明,不是必须包括所有的内容和操作/步骤,也不是必须按所描述的顺序执行。例如,有的操作/步骤还可以分解、组合或部分合并,因此实际执行的顺序有可能根据实际情况改变。The flowcharts shown in the figures are for illustration only, and do not necessarily include all contents and operations/steps, nor do they have to be performed in the order described. For example, some operations/steps can also be decomposed, combined or partially combined, so the actual execution order may be changed according to the actual situation.
本申请实施例提供一种用户意图识别方法、装置、设备及计算机可读存储介质。其中,该用户意图识别方法可应用于终端设备中,该终端设备可以是手机、平板电脑、笔记本电脑、台式电脑、个人数字助理和穿戴式设备等电子设备。Embodiments of the present application provide a method, apparatus, device, and computer-readable storage medium for identifying user intent. Wherein, the method for identifying the user intent can be applied to a terminal device, and the terminal device can be an electronic device such as a mobile phone, a tablet computer, a notebook computer, a desktop computer, a personal digital assistant, and a wearable device.
下面结合附图,对本申请的一些实施方式作详细说明。在不冲突的情况下,下述的实施例及实施例中的特征可以相互组合。Some embodiments of the present application will be described in detail below with reference to the accompanying drawings. The embodiments described below and features in the embodiments may be combined with each other without conflict.
请参照图1,图1为本申请实施例提供的一种用户意图识别方法的步骤流程示意图。Please refer to FIG. 1 , which is a schematic flowchart of steps of a method for identifying user intent provided by an embodiment of the present application.
如图1所示,该用户意图识别方法包括步骤S101至步骤S105。As shown in FIG. 1 , the method for identifying user intent includes steps S101 to S105.
步骤S101、获取用户输入的语音数据对应的文本信息,并将所述文本信息输入至预设意图分类模型,得到用于表示语音意图的多个预设意图标签的输出概率。Step S101: Acquire text information corresponding to the voice data input by the user, and input the text information into a preset intent classification model to obtain output probabilities of multiple preset intent labels representing the voice intent.
其中,预设意图分类模型为预先训练好的模型,该预设意图分类模型包括多个神经网络层,该神经网络层至少包括以下至少一种:向量提取层、时延神经网络层、ReLU层、残差网络层、加和层、循环神经网络层、dropout层和Solfmaxlayer层。The preset intent classification model is a pre-trained model, and the preset intent classification model includes a plurality of neural network layers, and the neural network layers at least include at least one of the following: a vector extraction layer, a delay neural network layer, and a ReLU layer , residual network layer, summation layer, recurrent neural network layer, dropout layer and Solfmaxlayer layer.
具体地,如图2所示,获取用户的用户输入的语音数据对应的文本信息,将该文本信息输入至向量提取层,得到多个词向量,将多个词向量输入至时延神经网络层,提取到多个词向量特征,将多个词向量特征输入至ReLU层,以对多个词向量特征进行处理,减少词向量特征的梯度消失,得到语义标签向量,将该语义标签向量和多个词向量输入至加和层,得到多个初步意图标签向量,将多个该初步意图标签向量输入至循环神经网络层,得到多个候选意图标签向量,将多个候选意图标签向量输入至dropout层,得到多个预设意图标签向量,将多个意图标签向量输入至Solfmaxlayer层,得到预设意图标签的输出概率。Specifically, as shown in FIG. 2, the text information corresponding to the voice data input by the user is obtained, the text information is input into the vector extraction layer, multiple word vectors are obtained, and the multiple word vectors are input into the delay neural network layer , extract multiple word vector features, and input multiple word vector features to the ReLU layer to process multiple word vector features, reduce the gradient disappearance of word vector features, and obtain semantic label vectors. Input word vectors to the summation layer to obtain a plurality of preliminary intent label vectors, input a plurality of the preliminary intent label vectors to the recurrent neural network layer, obtain a plurality of candidate intent label vectors, and input the plurality of candidate intent label vectors to dropout layer to obtain multiple preset intent label vectors, and input multiple intent label vectors to the Solfmaxlayer layer to obtain the output probability of the preset intent labels.
需要说明的是,向量提取层可以根据实际情况进行选择,例如,该向量提取层为Word2Vec模型,时延神经网络层和ReLU层还包括残差网络层,该残差网络层使时延神经网络层和ReLU层的参数处理更加准确,通过该dropout层可以防止候选意图标签向量过拟合化,提高输出的意图标签向量的准确性。It should be noted that the vector extraction layer can be selected according to the actual situation. For example, the vector extraction layer is a Word2Vec model, and the delay neural network layer and the ReLU layer also include a residual network layer. The residual network layer makes the delay neural network The parameter processing of the layer and the ReLU layer is more accurate. The dropout layer can prevent the candidate intent label vector from overfitting and improve the accuracy of the output intent label vector.
其中,该预设意图分类模型的训练方式可以为:获取样本文本信息,根据预设意图标签的输出概率对应的类别标识对该样本文本信息进行标注,以构建样本数据,基于该样本数据对神经网络模型进行迭代训练,直到神经网络模式收敛,从而得到该预设意图分类模型。上述神经网络模型包括卷积神经网络模型、循环神经网络模型和循环卷积神经网络模型,当然也可以采用其他网络模型训练得到预设意图分类模型,本申请对此不作具体限定。The training method of the preset intent classification model may be: acquiring sample text information, labeling the sample text information according to the category identifier corresponding to the output probability of the preset intent label, so as to construct sample data, and based on the sample data, neural The network model is iteratively trained until the neural network mode converges, thereby obtaining the preset intent classification model. The above-mentioned neural network models include convolutional neural network models, cyclic neural network models, and cyclic convolutional neural network models. Of course, other network models can also be used for training to obtain a preset intent classification model, which is not specifically limited in this application.
在一实施例中,获取用户输入的语音数据对应的文本信息,并将该文本信息输入至预 设意图分类模型,得到多个预设意图标签的输出概率。通过该预设意图分类模型准确且快速的确定多个预设意图标签的输出概率,极大地提高了用户的使用体验。In one embodiment, the text information corresponding to the voice data input by the user is acquired, and the text information is input into the preset intent classification model to obtain the output probabilities of multiple preset intent labels. Through the preset intent classification model, the output probability of multiple preset intent labels can be accurately and quickly determined, which greatly improves the user experience.
在一实施例中,获取用户输入的语音数据对应的文本信息的方式为:获取用户输入的语音,并将该语音输入至预设语音识别模型,得到文本信息。其中,该预设语音识别模型为预先训练好的神经网络模型,本申请对此不做具体限定。在另一些实施例中,获取其他设备传输的语音数据对应的文本信息,得到用户输入的语音数据对应的文本信息。可以理解的是,还有其他获取用户数据语音数据对应的文本信息的方式,本申请对此不做具体限定。In one embodiment, the method of acquiring the text information corresponding to the voice data input by the user is: acquiring the voice input by the user, and inputting the voice into a preset voice recognition model to obtain the text information. The preset speech recognition model is a pre-trained neural network model, which is not specifically limited in this application. In other embodiments, the text information corresponding to the voice data transmitted by other devices is obtained, and the text information corresponding to the voice data input by the user is obtained. It can be understood that there are other ways of acquiring the text information corresponding to the voice data of the user data, which is not specifically limited in this application.
步骤S102、根据每个所述预设意图标签的输出概率,从多个所述预设意图标签中确定预设数量个候选意图标签。Step S102 , according to the output probability of each of the preset intent tags, determine a preset number of candidate intent tags from a plurality of the preset intent tags.
其中,候选意图标签为意图越接近用户意图的意图标签。Among them, the candidate intent label is the intent label whose intent is closer to the user's intent.
在一实施例中,对多个所述预设意图标签按照输出概率的从大到小进行排序,得到意图标签队列;依次从所述意图标签队列选择所述预设意图标签,直至得到预设数量个所述候选意图标签。其中,预设意图标签可以根据实际情况进行设置,本申请对此不做具体限定,例如,预设意图标签可以设置为5。通过排列预设意图标签的意图标签队列,然后根据概率大小进行选择候选意图标签,可以提高选择候选意图标签的准确度和效率。In one embodiment, the plurality of preset intent tags are sorted in descending order of output probability to obtain an intent tag queue; the preset intent tags are sequentially selected from the intent tag queue until a preset intent tag is obtained. the number of said candidate intent labels. The preset intent label may be set according to the actual situation, which is not specifically limited in this application. For example, the preset intent label may be set to 5. By arranging the intent tag queue of preset intent tags, and then selecting candidate intent tags according to the probability, the accuracy and efficiency of selecting candidate intent tags can be improved.
示例性的,预设意图标签1的输出概率为10%、预设意图标签2的输出概率为20%、预设意图标签3的输出概率为5%、预设意图标签4的输出概率为12%、预设意图标签5的输出概率为7%、预设意图标签6的输出概率为18%、预设意图标签7的输出概率为25%、预设意图标签8的输出概率为14%、预设意图标签9的输出概率为4%和预设意图标签10的输出概率为28%,根据每个预设意图标签的概率对该10个意图标签从大到小依次进行排序,得到意图标签队列为[预设意图标签10、预设意图标签7、预设意图标签2、预设意图标签6、预设意图标签8、预设意图标签4、预设意图标签1、预设意图标签5、预设意图标签3、预设意图标签9],获取预设意图标签的数量为5,从该意图标签队列中选择前5个候选意图标签,得到候选意图标签为预设意图标签10、预设意图标签7、预设意图标签2、预设意图标签6和预设意图标签8。通过对预设意图标签进行排序,可以快速的选出候选意图标签。Exemplarily, the output probability of the preset intent tag 1 is 10%, the output probability of the preset intent tag 2 is 20%, the output probability of the preset intent tag 3 is 5%, and the output probability of the preset intent tag 4 is 12%. %, the output probability of preset intent tag 5 is 7%, the output probability of preset intent tag 6 is 18%, the output probability of preset intent tag 7 is 25%, the output probability of preset intent tag 8 is 14%, The output probability of the preset intent label 9 is 4% and the output probability of the preset intent label 10 is 28%. According to the probability of each preset intent label, the 10 intent labels are sorted in descending order to obtain the intent labels. The queue is [preset intent tag 10, preset intent tag 7, preset intent tag 2, preset intent tag 6, preset intent tag 8, preset intent tag 4, preset intent tag 1, preset intent tag 5 , preset intent tag 3, preset intent tag 9], the number of acquired preset intent tags is 5, select the first 5 candidate intent tags from the intent tag queue, and obtain the candidate intent tags as preset intent tags 10, preset intent tags Preset intent tag 7, preset intent tag 2, preset intent tag 6, and preset intent tag 8. By sorting the preset intent labels, candidate intent labels can be quickly selected.
步骤S103、确定预设意图知识图谱中的每个意图节点的对话成功率,其中,所述预设意图知识图谱是根据历史对话数据生成的。Step S103 , determining the dialog success rate of each intent node in the preset intent knowledge graph, wherein the preset intent knowledge graph is generated according to historical dialog data.
其中,预设意图知识图谱是根据历史对话数据生成的,具体地,收集所有的历史对话数据,对历史对话数据进行分类,以及将相关联的对话数据进行关联,得到预设意图知识图谱。The preset intent knowledge graph is generated according to historical dialogue data. Specifically, all historical dialog data is collected, the historical dialog data is classified, and the associated dialog data is associated to obtain the preset intent knowledge graph.
在一实施例中,获取预设意图知识图谱,其中,该预设意图知识图谱是根据历史对话数据生成的;将所述预设意图知识图谱中与每个意图节点相对应的意图节点的成功率作为每个意图节点的对话成功率。通过预设意图知识图谱可以准确的确定每个意图节点的对话成功率。In one embodiment, a preset intent knowledge graph is obtained, wherein the preset intent knowledge graph is generated according to historical dialogue data; the success of the intent node corresponding to each intent node in the preset intent knowledge graph is rate as the dialog success rate for each intent node. The dialog success rate of each intent node can be accurately determined by the preset intent knowledge graph.
在一实施例中,如图3所示,步骤S103包括子步骤S1031至子步骤S1034。In one embodiment, as shown in FIG. 3 , step S103 includes sub-steps S1031 to S1034 .
子步骤S1031、从所述预设意图知识图谱中获取每个意图节点的多个流转路径,并统计所述多个流转路径的数量,其中,每个流转路径包括多个意图节点。Sub-step S1031: Acquire multiple flow paths of each intention node from the preset intention knowledge graph, and count the number of the multiple flow paths, wherein each flow path includes multiple intention nodes.
其中,每个流转路径包括多个意图节点。Wherein, each flow path includes multiple intent nodes.
示例性的,如图4所示,该预设意图知识图谱包括意图节点a、意图节点b、意图节点c、意图节点d、意图节点e、意图节点f和意图节点g,意图节点a的流转路径包括意图节点a连接意图节点b连接意图节点c的流转路径、意图节点a连接意图节点b连接意图节点e连接意图节点f的流转路径、意图节点a连接意图节点b连接意图节点e连接意图节点a的流转路径、意图节点a连接意图节点b连接意图节点e连接意图节点g的流转路径、意图节点a连接意图节点b连接意图节点e连接意图节点f的流转路径和意图节点a连接意图 节点d连接意图节点g流转路径,意图节点b的流转路径包括意图节点b连接意图节点c的流转路径、意图节点b连接意图节点e连接意图节点f的流转路径、意图节点b连接意图节点e连接意图节点g的流转路径和意图节点b连接意图节点e连接意图节点a的流转路径,意图节点e的流转路径包括意图节点e连接意图节点a的流转路径、意图节点e连接意图节点f的流转路径和意图节点e连接意图节点g的流转路径,意图节点d的流转路径包括意图节点连接意图节点g的流转路径,意图节点c和意图节点f没有流转路径。Exemplarily, as shown in FIG. 4 , the preset intent knowledge graph includes intent node a, intent node b, intent node c, intent node d, intent node e, intent node f, and intent node g, and the flow of intent node a. The path includes the flow path where intent node a connects intent node b and intent node c, intent node a connects intent node b connects intent node e connects intent node f, and intent node a connects intent node b connects intent node e connects intent node. A's flow path, intent node a connects intent node b connects intent node e connects intent node g's flow path, intent node a connects intent node b connects intent node e connects intent node f The flow path and intent node a connects intent node d The flow path connecting intention node g, the flow path of intention node b includes the flow path connecting intention node b to intention node c, the flow path connecting intention node b to intention node e and the flow path of intention node b connecting intention node e to the intention node The flow path of g is connected to the flow path of intention node b and the flow path of intention node e is connected to the flow path of intention node a. The flow path of intention node e includes the flow path of intention node e connecting intention node a, and the flow path and intention of intention node e connecting intention node f. Node e is connected to the flow path of intention node g, the flow path of intention node d includes the flow path of intention node connected to intention node g, and intention node c and intention node f have no flow path.
示例性的,如图4所示,意图节点a的流转路径的路径条数为5条,意图节点b的流转路径的路径条数为4条,意图节点c的流转路径的路径条数为0条,意图节点d的流转路径的路径条数为1条,意图节点e的流转路径的路径条数为3条,意图节点f的流转路径的路径条数为0条,意图节点g的流转路径的路径条数为0条。Exemplarily, as shown in FIG. 4 , the number of paths of the flow path of the intention node a is 5, the number of paths of the flow path of the intention node b is 4, and the number of paths of the flow path of the intention node c is 0 The number of paths of the flow path of the intent node d is 1, the number of paths of the flow path of the intention node e is 3, the number of paths of the flow path of the intention node f is 0, and the number of paths of the flow path of the intention node g is 0 The number of paths is 0.
子步骤S1032、将最后一个意图节点的属性标识为预设属性标识的流转路径确定为流转成功路径。Sub-step S1032: Determine the flow path whose attribute identifier of the last intent node is the preset attribute identifier as a successful flow path.
其中,意图节点的属性标识为根据实际情况设置的关键词,例如,时间,地点和事件等等关键词。The attribute identifier of the intent node is a keyword set according to the actual situation, for example, keywords such as time, place, and event.
在一实施例中,将流转路径中最后一个意图节点的属性为为预设属性标识的流转路径确定为流转成功路径。例如,预设属性标识为时间,若流转路径中最后一个意图节点为时间的流转路径,则该流转路径为流转成功路径。In one embodiment, the flow path identified by the preset attribute is determined as the flow path that has an attribute of the last intent node in the flow path. For example, the preset attribute identifier is time, and if the last intent node in the flow path is a flow path of time, the flow path is a successful flow path.
子步骤S1033、统计每个所述意图节点的多个流转路径中的流转成功路径的数量。Sub-step S1033: Count the number of successful flow paths among the multiple flow paths of each intention node.
具体地,根据预设意图知识图谱确定每个意图节点的多个流转路径的路径条数。Specifically, the number of paths of the multiple flow paths of each intent node is determined according to the preset intent knowledge graph.
示例性的,如图4所示,当意图节点的预设属性标识为g时,从预设意图知识图谱中查询每个意图节点的多个流转路径中的最后一个意图节点的属性标识为g的数量,其中,意图节点a多个流转路径中的最后一个意图节点的属性标识为g的数量为2,意图节点b多个流转路径中的最后一个意图节点的属性标识为g的数量为1,意图节点c多个流转路径中的最后一个意图节点的属性标识为g的数量为0,意图节点d多个流转路径中的最后一个意图节点的属性标识为g的数量为1,意图节点e多个流转路径中的最后一个意图节点的属性标识为g的数量为1,意图节点f多个流转路径中的最后一个意图节点的属性标识为g的数量为0,意图节点g多个流转路径中的最后一个意图节点的属性标识为g的数量为1,因此,得到意图节点a的流转成功次数为2次,得到意图节点b的流转成功次数为1次,得到意图节点c的流转成功次数为0次,得到意图节点d的流转成功次数为1次,得到意图节点e的流转成功次数为1次,得到意图节点f的流转成功次数为0次,得到意图节点f的流转成功次数为1次。Exemplarily, as shown in FIG. 4 , when the preset attribute identifier of the intent node is g, the attribute identifier of the last intent node in the multiple flow paths of each intent node is queried from the preset intent knowledge graph as g. The number of , where the attribute of the last intent node in the multiple flow paths of the intent node a is 2, and the attribute of the last intent node in the multiple flow paths of the intent node b is the number of g is 1 , the attribute identifier of the last intent node in the multiple flow paths of the intent node c is 0, the number of the attribute identifier of the last intent node in the multiple flow paths of the intent node d is 1, and the intent node e The attribute identifier of the last intent node in the multiple flow paths is 1, the attribute identifier of the last intent node in the multiple flow paths of the intent node f is 0, and the number of intent node g is multiple flow paths The attribute identifier of the last intent node in , the number of g is 1. Therefore, the number of successful circulations of intent node a is 2, the number of successful circulations of intent node b is 1, and the number of successful circulations of intent node c is obtained. is 0 times, the number of successful circulations obtained by intent node d is 1, the number of successful circulations obtained by intent node e is 1, the number of successful circulations obtained by intent node f is 0, and the number of successful circulations obtained by intent node f is 1 Second-rate.
子步骤S1034、计算每个所述意图节点的多个流转路径中的流转成功路径的数量占所有流转路径数量的百分比,并将计算得到百分比作为每个意图节点的对话成功率。Sub-step S1034: Calculate the percentage of the number of successful circulation paths in the multiple circulation paths of each intention node to the number of all circulation paths, and use the calculated percentage as the dialogue success rate of each intention node.
在一实施例中,确定每个意图节点的流转成功次数占对应的路径条数的百分比;将每个意图节点的流转成功次数占对应的路径条数的百分比,确定为预设意图知识图谱中的每个意图节点的对话成功率。In one embodiment, determine the percentage of the successful circulation times of each intent node to the corresponding number of paths; determine the percentage of the successful circulation times of each intent node to the corresponding number of paths as a percentage in the preset intent knowledge graph. The dialog success rate for each intent node of .
示例性的,意图节点a的流转成功次数2次,意图节点b的流转成功次数1次,意图节点c的流转成功次数0次,意图节点d的流转成功次数1次,意图节点e1的流转成功次数2次,意图节点f的流转成功次数0次,意图节点g的流转成功次数0次,意图节点a的流转路径的路径条数为5条,意图节点b的流转路径的路径条数为4条,意图节点c的流转路径的路径条数为0条,意图节点d的流转路径的路径条数为1条,意图节点e的流转路径的路径条数为3条,意图节点f的流转路径的路径条数为0条,意图节点g的流转路径的路径条数为0条,得到意图节点a的流转成功次数占对应的路径条数的40%,意图节点b的流转成功次数占对应的路径条数的25%,意图节点c的流转成功次数占对应的路径条数的0%,意图节点d的流转成功次数占对应的路径条数的100%,意图节点e的流转成功次数占对应的路径条数的33.3%,意图节点f的流转成功次数占对应的路径条数的0%, 意图节点g的流转成功次数占对应的路径条数的100%,根据意图节点a、b、c、d、e、f和g的流转成功次数占对应的路径条数,确定意图节点a的对话成功率为40%,确定意图节点b的对话成功率为25%,确定意图节点c的对话成功率为0%,确定意图节点d的对话成功率为100%,确定意图节点e的对话成功率为33.3%,确定意图节点f的对话成功率为0%,确定意图节点g的对话成功率为100%。Exemplarily, the number of successful circulation of intent node a is 2, the number of successful circulation of intent node b is 1, the number of successful circulation of intent node c is 0, the number of successful circulation of intent node d is 1, and the number of successful circulation of intent node e1 2 times, the number of successful circulation of intention node f is 0, the number of successful circulation of intention node g is 0, the number of paths of the circulation path of intention node a is 5, and the number of paths of the circulation path of intention node b is 4 The number of paths for the flow path of the intent node c is 0, the number of paths for the flow path of the intention node d is 1, the number of paths of the flow path of the intention node e is 3, and the number of paths of the flow path of the intention node f The number of paths is 0, and the number of paths of the flow path of intention node g is 0. The number of successful flow of intention node a accounts for 40% of the corresponding number of paths, and the number of successful flow of intention node b accounts for 40% of the corresponding number of paths. 25% of the number of paths, the number of successful circulation of intention node c accounts for 0% of the number of corresponding paths, the number of successful circulation of intention node d accounts for 100% of the number of corresponding paths, and the number of successful circulation of intention node e accounts for the corresponding number of paths. 33.3% of the number of paths, the number of successful circulation of intention node f accounts for 0% of the number of corresponding paths, the number of successful circulation of intention node g accounts for 100% of the number of corresponding paths, according to the number of intention nodes a, b, c , d, e, f, and g’s successful flow times account for the number of corresponding paths. Determine the success rate of the conversation of intent node a is 40%, determine the success rate of conversation of intent node b is 25%, and determine the success rate of conversation of intent node c. 0%, the dialog success rate for determining intent node d is 100%, the dialog success rate for determining intent node e is 33.3%, the dialog success rate for determining intent node f is 0%, and the dialog success rate for determining intent node g is 100%.
步骤S104、根据每个所述意图节点的对话成功率确定每个所述候选意图标签的对话成功率。Step S104: Determine the dialog success rate of each candidate intent tag according to the dialog success rate of each intent node.
其中,候选意图标签的成功率处于0到100%之间,候选意图标签越大,候选意图标签的对话成功的概率越高。Among them, the success rate of the candidate intent label is between 0 and 100%, and the larger the candidate intent label, the higher the probability of the successful dialogue of the candidate intent label.
在一实施例中,将每个意图节点的对话成功率和每个意图节点对应的预设意图标签的相映射,得到每个预设意图标签的对话成功率;将候选意图标签与预设意图标签相映射,并将相映射的预设意图标签的对话成功率作为候选意图标签的对话成功率。In one embodiment, the dialog success rate of each intent node and the preset intent label corresponding to each intent node are mapped to obtain the dialog success rate of each preset intent label; The tags are mapped to each other, and the dialogue success rate of the mapped preset intent tags is used as the dialogue success rate of the candidate intent tags.
步骤S105、将对话成功率最高的候选意图标签确定为所述用户输入的语音数据的意图标签。Step S105: Determine the candidate intent tag with the highest dialogue success rate as the intent tag of the voice data input by the user.
其中,目标意图标签为最接近用户意图的意图标签。Among them, the target intent label is the intent label closest to the user's intent.
在一实施例中,根据每个候选意图标签的对话成功率对多个候选意图标签进行排序,得到候选意图标签队列,从该候选意图标签队列选取对话成功率最大的候选意图标签作为用户的目标意图标签。通过成功率对候选意图标签进行排序并选取成功率最大的候选意图标签作为用户的目标意图标签,极大地提高了确定用户意图的准确性。In one embodiment, a plurality of candidate intent tags are sorted according to the dialogue success rate of each candidate intent tag to obtain a candidate intent tag queue, and the candidate intent tag with the highest dialogue success rate is selected from the candidate intent tag queue as the user's target. Intent label. The candidate intent labels are sorted by the success rate and the candidate intent label with the highest success rate is selected as the user's target intent label, which greatly improves the accuracy of determining the user's intent.
示例性的,候选意图标签1的对话成功率为50%,候选意图标签2的对话成功率为25%,候选意图标签3的对话成功率为15%,候选意图标签4的对话成功率为60%,候选意图标签5的对话成功率为40%,根据候选意图标签对话成功率对候选意图标签1、候选意图标签2、候选意图标签3、候选意图标签4和候选意图标签5进行排序,得到候选意图标签队列为,[候选意图标签4、候选意图标签1、候选意图标5、候选意图标签2、候选意图标签3],从该候选意图标签队列选取对话成功率最大的候选意图标签4作为用户的目标意图标签。Exemplarily, the dialog success rate of candidate intent label 1 is 50%, the dialog success rate of candidate intent label 2 is 25%, the dialog success rate of candidate intent label 3 is 15%, and the dialog success rate of candidate intent label 4 is 60%. %, the dialogue success rate of candidate intent label 5 is 40%, and the candidate intent label 1, candidate intent label 2, candidate intent label 3, candidate intent label 4 and candidate intent label 5 are sorted according to the conversation success rate of candidate intent labels, and we get The candidate intent label queue is, [candidate intent label 4, candidate intent label 1, candidate intent label 5, candidate intent label 2, candidate intent label 3], from the candidate intent label queue, select the candidate intent label 4 with the highest dialogue success rate as the The user's goal intent label.
上述实施例提供的用户意图识别方法,通过获取用户输入的语音数据对应的文本信息,并将文本信息输入至预设意图分类模型,得到用于表示语音意图的多个预设意图标签的输出概率;然后根据每个预设意图标签的输出概率,从多个预设意图标签中确定预设数量个候选意图标签;之后确定预设意图知识图谱中的每个意图节点的对话成功率;再根据每个意图节点的对话成功率确定每个候选意图标签的对话成功率;并将对话成功率最高的候选意图标签确定为用户输入的语音数据的意图标签。本方法通过预设意图分类模型可以得到多个预设意图标签的输出概率,结合多个预设意图标签的输出概率和预设意图知识图谱中的每个意图节点的对话成功率,可以准确的确定用户的目标意图标签。The user intent recognition method provided by the above embodiment obtains the output probability of multiple preset intent labels used to represent the voice intent by acquiring the text information corresponding to the voice data input by the user, and inputting the text information into the preset intent classification model. ; Then according to the output probability of each preset intent tag, determine a preset number of candidate intent tags from multiple preset intent tags; then determine the dialogue success rate of each intent node in the preset intent knowledge graph; The dialog success rate of each intent node determines the dialog success rate of each candidate intent label; and the candidate intent label with the highest dialog success rate is determined as the intent label of the speech data input by the user. The method can obtain the output probability of multiple preset intent labels through the preset intent classification model. Combined with the output probability of multiple preset intent labels and the dialogue success rate of each intent node in the preset intent knowledge graph, it can accurately Identify the user's target intent label.
请参阅图5,图5为本申请实施例提供的一种用户意图识别装置的示意性框图。Please refer to FIG. 5. FIG. 5 is a schematic block diagram of an apparatus for recognizing user intent according to an embodiment of the present application.
如图5所示,用户意图识别装置200包括获取模块210、生成模块220、筛选模块230、第一确定模块240、第二确定模块250和第三确定模块260,其中,As shown in FIG. 5 , the user intent recognition apparatus 200 includes an acquisition module 210 , a generation module 220 , a screening module 230 , a first determination module 240 , a second determination module 250 and a third determination module 260 , wherein,
所述获取模块210,用于获取用户输入的语音数据对应的文本信息。The obtaining module 210 is configured to obtain text information corresponding to the voice data input by the user.
所述生成模块220,用于将所述文本信息输入至预设意图分类模型,得到用于表示语音意图的多个预设意图标签的输出概率。The generating module 220 is configured to input the text information into a preset intent classification model to obtain output probabilities of multiple preset intent labels used to represent the voice intent.
所述筛选模块230,用于根据每个所述预设意图标签的输出概率,从多个所述预设意图标签中确定预设数量个候选意图标签。The screening module 230 is configured to determine a preset number of candidate intent tags from a plurality of the preset intent tags according to the output probability of each preset intent tag.
所述第一确定模块240,用于确定预设意图知识图谱中的每个意图节点的对话成功率,其中,所述预设意图知识图谱是根据历史对话数据生成的。The first determining module 240 is configured to determine the dialog success rate of each intent node in the preset intent knowledge graph, wherein the preset intent knowledge graph is generated according to historical dialog data.
所述第二确定模块250,用于根据每个所述意图节点的对话成功率确定每个所述候选意图标签的对话成功率。The second determination module 250 is configured to determine the dialogue success rate of each of the candidate intent tags according to the dialogue success rate of each of the intent nodes.
所述第三确定模块260,用于将对话成功率最高的候选意图标签确定为所述用户输入的语音数据的意图标签。The third determining module 260 is configured to determine the candidate intent label with the highest dialogue success rate as the intent label of the voice data input by the user.
在一实施例中,所述筛选模块230还包括如下子模块:In one embodiment, the screening module 230 further includes the following sub-modules:
排序子模块,用于对多个所述预设意图标签按照输出概率的从大到小进行排序,得到意图标签队列。The sorting sub-module is used for sorting a plurality of the preset intent tags according to the descending output probability to obtain an intent tag queue.
选择子模块,用于依次从所述意图标签队列选择所述预设意图标签,直至得到预设数量个所述候选意图标签。A selection sub-module, configured to sequentially select the preset intent tags from the intent tag queue until a preset number of the candidate intent tags are obtained.
在一实施例中,所述第一确定模块240还包括如下子模块:In one embodiment, the first determining module 240 further includes the following sub-modules:
知识图谱获取子模块,用于获取所述预设意图知识图谱。The knowledge graph acquisition sub-module is used to acquire the preset intent knowledge graph.
设置子模块,用于将所述预设意图知识图谱中与每个意图节点相对应的意图节点的成功率作为每个意图节点的对话成功率。A sub-module is set for taking the success rate of the intent node corresponding to each intent node in the preset intent knowledge graph as the dialog success rate of each intent node.
在一实施例中,如图6所示,所述第一确定模块240包括获取子模块241、统计模块242、确定子模块243和计算模块244,其中:In one embodiment, as shown in FIG. 6 , the first determination module 240 includes an acquisition sub-module 241, a statistics module 242, a determination sub-module 243 and a calculation module 244, wherein:
所述获取子模块241,用于从所述预设意图知识图谱中获取每个意图节点的多个流转路径。The acquisition sub-module 241 is configured to acquire multiple flow paths of each intent node from the preset intent knowledge graph.
所述统计模块242,用于统计所述多个流转路径的数量,其中,每个流转路径包括多个意图节点。The statistics module 242 is configured to count the number of the multiple flow paths, wherein each flow path includes multiple intent nodes.
所述确定子模块243,用于将最后一个意图节点的属性标识为预设属性标识的流转路径确定为流转成功路径。The determining submodule 243 is configured to determine the flow path whose attribute identifier of the last intent node is the preset attribute identifier as a successful flow path.
所述统计模块242,还用于统计每个所述意图节点的多个流转路径中的流转成功路径的数量。The statistics module 242 is further configured to count the number of successful flow paths in the multiple flow paths of each intention node.
所述计算模块244,用于计算每个所述意图节点的多个流转路径中的流转成功路径的数量占所有流转路径数量的百分比,并将计算得到百分比作为每个意图节点的对话成功率。The calculation module 244 is configured to calculate the percentage of the number of successful circulation paths in the multiple circulation paths of each intention node to the number of all circulation paths, and use the calculated percentage as the dialogue success rate of each intention node.
在一实施例中,所述第二确定模块250还包括如下子模块:In one embodiment, the second determining module 250 further includes the following sub-modules:
第一映射子模块,用于将每个所述意图节点的对话成功率和每个所述意图节点对应的预设意图标签的相映射,得到每个预设意图标签的对话成功率。The first mapping sub-module is configured to map the dialogue success rate of each of the intent nodes and the preset intent labels corresponding to each of the intent nodes to obtain the dialog success rate of each preset intent label.
第二映射子模块,用于将所述候选意图标签与所述预设意图标签相映射,并将相映射的所述预设意图标签的对话成功率作为所述候选意图标签的对话成功率。The second mapping submodule is configured to map the candidate intent tag with the preset intent tag, and use the mapped dialog success rate of the preset intent tag as the dialog success rate of the candidate intent tag.
在一实施例中,所述预设意图分类模型包括向量提取层、时延神经网络层、ReLU层、残差网络层、加和层、循环神经网络层、dropout层和Solfmaxlayer层;所述生成模块220还包括如下子模块:In one embodiment, the preset intent classification model includes a vector extraction layer, a time-delay neural network layer, a ReLU layer, a residual network layer, a summation layer, a recurrent neural network layer, a dropout layer, and a Solfmaxlayer layer; the generating Module 220 also includes the following sub-modules:
第一输入子模块,用于将所述文本信息输入至所述向量提取层,得到多个词向量。The first input sub-module is used for inputting the text information to the vector extraction layer to obtain a plurality of word vectors.
第二输入子模块,用于将多个所述词向量输入至所述时延神经网络层,提取到多个词向量特征。The second input sub-module is used for inputting a plurality of the word vectors to the time delay neural network layer, and extracting a plurality of word vector features.
第三输入子模块,用于将多个所述词向量特征输入至所述ReLU层,得到语义标签向量。The third input sub-module is used for inputting a plurality of the word vector features to the ReLU layer to obtain a semantic label vector.
第四输入子模块,用于将所述语义标签向量和多个所述词向量输入至所述加和层,得到多个初步意图标签向量。The fourth input sub-module is configured to input the semantic label vector and a plurality of the word vectors into the summation layer to obtain a plurality of preliminary intent label vectors.
第五输入子模块,用于将多个所述初步意图标签向量输入至所述循环神经网络层,得到多个候选意图标签向量。The fifth input sub-module is used for inputting a plurality of the preliminary intent label vectors to the recurrent neural network layer to obtain a plurality of candidate intent label vectors.
第七输入子模块,用于将多个所述候选意图标签向量输入至所述dropout层,得到多个预设意图标签向量。The seventh input sub-module is used for inputting a plurality of the candidate intent label vectors to the dropout layer to obtain a plurality of preset intent label vectors.
第八输入子模块,用于将多个所述意图标签向量输入至所述Solfmaxlayer层,得到预设意图标签的输出概率。The eighth input sub-module is used for inputting a plurality of the intent label vectors to the Solfmaxlayer layer to obtain the output probability of the preset intent label.
上述实施例提供的装置可以实现为一种计算机程序的形式,该计算机程序可以在如图7所示的计算机设备上运行。The apparatuses provided by the above embodiments may be implemented in the form of a computer program, and the computer program may be executed on the computer device as shown in FIG. 7 .
请参阅图7,图7为本申请实施例提供的一种计算机设备的结构示意性框图。Please refer to FIG. 7 , which is a schematic block diagram of the structure of a computer device according to an embodiment of the present application.
如图7所示,该计算机设备包括通过系统总线连接的处理器、存储器和网络接口,其中,存储器可以包括非易失性存储介质和内存储器。As shown in FIG. 7, the computer device includes a processor, a memory, and a network interface connected by a system bus, wherein the memory may include a non-volatile storage medium and an internal memory.
非易失性存储介质可存储操作系统和计算机程序。该计算机程序包括程序指令,该程序指令被执行时,可使得处理器执行任意一种用户意图识别方法。The nonvolatile storage medium can store operating systems and computer programs. The computer program includes program instructions that, when executed, can cause the processor to execute any method for identifying user intent.
该网络接口用于通信。本领域技术人员可以理解,图7中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备的限定,具体的计算机设备可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。This network interface is used for communication. Those skilled in the art can understand that the structure shown in FIG. 7 is only a block diagram of a part of the structure related to the solution of the present application, and does not constitute a limitation on the computer equipment to which the solution of the present application is applied. Include more or fewer components than shown in the figures, or combine certain components, or have a different arrangement of components.
应当理解的是,该总线比如为I2C(Inter-integrated Circuit)总线,存储器可以是Flash芯片、只读存储器(ROM,Read-Only Memory)磁盘、光盘、U盘或移动硬盘等,处理器可以是中央处理单元(Central Processing Unit,CPU),该处理器还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现场可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。其中,通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。It should be understood that the bus is, for example, an I2C (Inter-integrated Circuit) bus, the memory can be a Flash chip, a read-only memory (ROM, Read-Only Memory) magnetic disk, an optical disk, a U disk or a mobile hard disk, etc., and the processor can be Central Processing Unit (CPU), the processor can also be other general-purpose processors, digital signal processors (Digital Signal Processors, DSPs), application specific integrated circuits (Application Specific Integrated Circuits, ASICs), field programmable gates Field-Programmable Gate Array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. Wherein, the general-purpose processor can be a microprocessor or the processor can also be any conventional processor or the like.
其中,在一个实施例中,一种计算机设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现:Wherein, in one embodiment, a computer device includes a memory, a processor, and a computer program stored in the memory and executable on the processor, and the processor implements when executing the computer program:
获取用户输入的语音数据对应的文本信息,并将所述文本信息输入至预设意图分类模型,得到用于表示语音意图的多个预设意图标签的输出概率;acquiring text information corresponding to the voice data input by the user, and inputting the text information into a preset intent classification model to obtain output probabilities of multiple preset intent labels used to represent voice intent;
根据每个所述预设意图标签的输出概率,从多个所述预设意图标签中确定预设数量个候选意图标签;According to the output probability of each of the preset intent tags, determine a preset number of candidate intent tags from a plurality of the preset intent tags;
确定预设意图知识图谱中的每个意图节点的对话成功率,其中,所述预设意图知识图谱是根据历史对话数据生成的;determining the dialog success rate of each intent node in the preset intent knowledge graph, wherein the preset intent knowledge graph is generated according to historical dialog data;
根据每个所述意图节点的对话成功率确定每个所述候选意图标签的对话成功率;Determine the dialog success rate of each of the candidate intent labels according to the dialog success rate of each of the intent nodes;
将对话成功率最高的候选意图标签确定为所述用户输入的语音数据的意图标签。The candidate intent label with the highest dialogue success rate is determined as the intent label of the voice data input by the user.
在一个实施例中,所述处理器执行所述计算机程序时实现:In one embodiment, the processor, when executing the computer program, implements:
对多个所述预设意图标签按照输出概率的从大到小进行排序,得到意图标签队列;Sorting a plurality of the preset intent tags according to the output probability in descending order to obtain an intent tag queue;
依次从所述意图标签队列选择所述预设意图标签,直至得到预设数量个所述候选意图标签。The preset intent tags are sequentially selected from the intent tag queue until a preset number of the candidate intent tags are obtained.
在一个实施例中,所述处理器执行所述计算机程序时实现:In one embodiment, the processor, when executing the computer program, implements:
获取所述预设意图知识图谱;obtaining the preset intent knowledge graph;
将所述预设意图知识图谱中与每个意图节点相对应的意图节点的成功率作为每个意图节点的对话成功率。The success rate of the intent node corresponding to each intent node in the preset intent knowledge graph is taken as the dialog success rate of each intent node.
在一个实施例中,所述处理器执行所述计算机程序时实现:In one embodiment, the processor, when executing the computer program, implements:
从所述预设意图知识图谱中获取每个意图节点的多个流转路径,并统计所述多个流转路径的数量,其中,每个流转路径包括多个意图节点;Obtain multiple flow paths of each intention node from the preset intention knowledge graph, and count the number of the multiple flow paths, wherein each flow path includes multiple intention nodes;
将最后一个意图节点的属性标识为预设属性标识的流转路径确定为流转成功路径;Determining the flow path with the attribute identification of the last intent node as the preset attribute identification as the successful flow path;
统计每个所述意图节点的多个流转路径中的流转成功路径的数量;Count the number of successful circulation paths in the multiple circulation paths of each of the intent nodes;
计算每个所述意图节点的多个流转路径中的流转成功路径的数量占所有流转路径数量的百分比,并将计算得到百分比作为每个意图节点的对话成功率。Calculate the percentage of the number of successful flow paths in the multiple flow paths of each intention node to the number of all flow paths, and use the calculated percentage as the dialogue success rate of each intention node.
在一个实施例中,所述处理器执行所述计算机程序时实现:In one embodiment, the processor, when executing the computer program, implements:
将每个所述意图节点的对话成功率和每个所述意图节点对应的预设意图标签的相映射,得到每个预设意图标签的对话成功率;Mapping the dialogue success rate of each of the intent nodes and the preset intent labels corresponding to each of the intent nodes to obtain the dialog success rate of each preset intent label;
将所述候选意图标签与所述预设意图标签相映射,并将相映射的所述预设意图标签的对话成功率作为所述候选意图标签的对话成功率。The candidate intent tag is mapped with the preset intent tag, and the dialog success rate of the mapped preset intent tag is used as the dialog success rate of the candidate intent tag.
在一个实施例中,所述预设意图分类模型包括向量提取层、时延神经网络层、ReLU层、残差网络层、加和层、循环神经网络层、dropout层和Solfmaxlayer层;所述处理器执行所述计算机程序时实现:In one embodiment, the preset intent classification model includes a vector extraction layer, a delay neural network layer, a ReLU layer, a residual network layer, a summation layer, a recurrent neural network layer, a dropout layer, and a Solfmaxlayer layer; the processing When the computer executes the computer program, it realizes:
将所述文本信息输入至所述向量提取层,得到多个词向量;Inputting the text information to the vector extraction layer to obtain a plurality of word vectors;
将多个所述词向量输入至所述时延神经网络层,提取到多个词向量特征;inputting a plurality of the word vectors into the delay neural network layer, and extracting a plurality of word vector features;
将多个所述词向量特征输入至所述ReLU层,得到语义标签向量;Inputting a plurality of the word vector features to the ReLU layer to obtain a semantic label vector;
将所述语义标签向量和多个所述词向量输入至所述加和层,得到多个初步意图标签向量;Inputting the semantic label vector and a plurality of the word vectors to the summation layer to obtain a plurality of preliminary intent label vectors;
将多个所述初步意图标签向量输入至所述循环神经网络层,得到多个候选意图标签向量;inputting a plurality of the preliminary intent label vectors into the recurrent neural network layer to obtain a plurality of candidate intent label vectors;
将多个所述候选意图标签向量输入至所述dropout层,得到多个预设意图标签向量;Inputting a plurality of the candidate intent label vectors to the dropout layer to obtain a plurality of preset intent label vectors;
将多个所述意图标签向量输入至所述Solfmaxlayer层,得到预设意图标签的输出概率。A plurality of the intent label vectors are input to the Solfmaxlayer layer to obtain the output probability of the preset intent label.
需要说明的是,所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,上述描述计算机设备的具体工作过程,可以参考前述用户意图识别方法实施例中的对应过程,在此不再赘述。It should be noted that those skilled in the art can clearly understand that, for the convenience and brevity of description, for the specific working process of the computer device described above, reference may be made to the corresponding process in the foregoing embodiment of the method for identifying user intent, which is not repeated here. Repeat.
本申请实施例还提供一种计算机可读存储介质,所述计算机可读存储介质上存储有计算机程序,所述计算机程序中包括程序指令,所述程序指令被执行时所实现的方法可参照本申请用户意图识别方法的各个实施例。具体的:Embodiments of the present application further provide a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, the computer program includes program instructions, and the method implemented when the program instructions are executed may refer to this document Various embodiments of methods for identifying user intent are claimed. specific:
一种计算机可读存储介质,所述计算机可读存储介质上存储有计算机程序,其中所述计算机程序被处理器执行时实现:A computer-readable storage medium on which a computer program is stored, wherein the computer program is implemented when executed by a processor:
获取用户输入的语音数据对应的文本信息,并将所述文本信息输入至预设意图分类模型,得到用于表示语音意图的多个预设意图标签的输出概率;acquiring text information corresponding to the voice data input by the user, and inputting the text information into a preset intent classification model to obtain output probabilities of multiple preset intent labels used to represent voice intent;
根据每个所述预设意图标签的输出概率,从多个所述预设意图标签中确定预设数量个候选意图标签;According to the output probability of each of the preset intent tags, determine a preset number of candidate intent tags from a plurality of the preset intent tags;
确定预设意图知识图谱中的每个意图节点的对话成功率,其中,所述预设意图知识图谱是根据历史对话数据生成的;determining the dialog success rate of each intent node in the preset intent knowledge graph, wherein the preset intent knowledge graph is generated according to historical dialog data;
根据每个所述意图节点的对话成功率确定每个所述候选意图标签的对话成功率;Determine the dialog success rate of each of the candidate intent labels according to the dialog success rate of each of the intent nodes;
将对话成功率最高的候选意图标签确定为所述用户输入的语音数据的意图标签。The candidate intent label with the highest dialogue success rate is determined as the intent label of the voice data input by the user.
在一实施例中,所述计算机程序被处理器执行时还实现:In one embodiment, the computer program, when executed by the processor, further implements:
对多个所述预设意图标签按照输出概率的从大到小进行排序,得到意图标签队列;Sorting a plurality of the preset intent tags according to the output probability in descending order to obtain an intent tag queue;
依次从所述意图标签队列选择所述预设意图标签,直至得到预设数量个所述候选意图标签。The preset intent tags are sequentially selected from the intent tag queue until a preset number of the candidate intent tags are obtained.
在一实施例中,所述计算机程序被处理器执行时还实现:In one embodiment, the computer program, when executed by the processor, further implements:
获取所述预设意图知识图谱;obtaining the preset intent knowledge graph;
将所述预设意图知识图谱中与每个意图节点相对应的意图节点的成功率作为每个意图节点的对话成功率。The success rate of the intent node corresponding to each intent node in the preset intent knowledge graph is taken as the dialog success rate of each intent node.
在一实施例中,所述计算机程序被处理器执行时还实现:In one embodiment, the computer program, when executed by the processor, further implements:
从所述预设意图知识图谱中获取每个意图节点的多个流转路径,并统计所述多个流转路径的数量,其中,每个流转路径包括多个意图节点;Obtain multiple flow paths of each intention node from the preset intention knowledge graph, and count the number of the multiple flow paths, wherein each flow path includes multiple intention nodes;
将最后一个意图节点的属性标识为预设属性标识的流转路径确定为流转成功路径;Determining the flow path with the attribute identification of the last intent node as the preset attribute identification as the successful flow path;
统计每个所述意图节点的多个流转路径中的流转成功路径的数量;Count the number of successful circulation paths in the multiple circulation paths of each of the intent nodes;
计算每个所述意图节点的多个流转路径中的流转成功路径的数量占所有流转路径数量的百分比,并将计算得到百分比作为每个意图节点的对话成功率。Calculate the percentage of the number of successful flow paths in the multiple flow paths of each intention node to the number of all flow paths, and use the calculated percentage as the dialogue success rate of each intention node.
在一实施例中,所述计算机程序被处理器执行时还实现:In one embodiment, the computer program, when executed by the processor, further implements:
将每个所述意图节点的对话成功率和每个所述意图节点对应的预设意图标签的相映射,得到每个预设意图标签的对话成功率;Mapping the dialogue success rate of each of the intent nodes and the preset intent labels corresponding to each of the intent nodes to obtain the dialog success rate of each preset intent label;
将所述候选意图标签与所述预设意图标签相映射,并将相映射的所述预设意图标签的对话成功率作为所述候选意图标签的对话成功率。The candidate intent tag is mapped with the preset intent tag, and the dialog success rate of the mapped preset intent tag is used as the dialog success rate of the candidate intent tag.
在一实施例中,所述预设意图分类模型包括向量提取层、时延神经网络层、ReLU层、残差网络层、加和层、循环神经网络层、dropout层和Solfmaxlayer层;所述计算机程序被处理器执行时还实现:In one embodiment, the preset intent classification model includes a vector extraction layer, a time-delay neural network layer, a ReLU layer, a residual network layer, a summation layer, a recurrent neural network layer, a dropout layer, and a Solfmaxlayer layer; the computer When the program is executed by the processor, it also implements:
将所述文本信息输入至所述向量提取层,得到多个词向量;Inputting the text information to the vector extraction layer to obtain a plurality of word vectors;
将多个所述词向量输入至所述时延神经网络层,提取到多个词向量特征;inputting a plurality of the word vectors into the delay neural network layer, and extracting a plurality of word vector features;
将多个所述词向量特征输入至所述ReLU层,得到语义标签向量;Inputting a plurality of the word vector features to the ReLU layer to obtain a semantic label vector;
将所述语义标签向量和多个所述词向量输入至所述加和层,得到多个初步意图标签向量;Inputting the semantic label vector and a plurality of the word vectors to the summation layer to obtain a plurality of preliminary intent label vectors;
将多个所述初步意图标签向量输入至所述循环神经网络层,得到多个候选意图标签向量;inputting a plurality of the preliminary intent label vectors into the recurrent neural network layer to obtain a plurality of candidate intent label vectors;
将多个所述候选意图标签向量输入至所述dropout层,得到多个预设意图标签向量;Inputting a plurality of the candidate intent label vectors to the dropout layer to obtain a plurality of preset intent label vectors;
将多个所述意图标签向量输入至所述Solfmaxlayer层,得到预设意图标签的输出概率。A plurality of the intent label vectors are input to the Solfmaxlayer layer to obtain the output probability of the preset intent label.
其中,所述计算机可读存储介质可以是前述实施例所述的计算机设备的内部存储单元,例如所述计算机设备的硬盘或内存。所述计算机可读存储介质可以是非易失性,也可以是易失性。所述计算机可读存储介质也可以是所述计算机设备的外部存储设备,例如所述计算机设备上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。The computer-readable storage medium may be an internal storage unit of the computer device described in the foregoing embodiments, such as a hard disk or a memory of the computer device. The computer-readable storage medium may be non-volatile or volatile. The computer-readable storage medium may also be an external storage device of the computer device, such as a plug-in hard disk equipped on the computer device, a smart memory card (Smart Media Card, SMC), a secure digital (Secure Digital, SD) ) card, Flash Card, etc.
应当理解,在此本申请说明书中所使用的术语仅仅是出于描述特定实施例的目的而并不意在限制本申请。如在本申请说明书和所附权利要求书中所使用的那样,除非上下文清楚地指明其它情况,否则单数形式的“一”、“一个”及“该”意在包括复数形式。It should be understood that the terms used in the specification of the present application herein are for the purpose of describing particular embodiments only and are not intended to limit the present application. As used in this specification and the appended claims, the singular forms "a," "an," and "the" are intended to include the plural unless the context clearly dictates otherwise.
还应当理解,在本申请说明书和所附权利要求书中使用的术语“和/或”是指相关联列出的项中的一个或多个的任何组合以及所有可能组合,并且包括这些组合。需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者系统不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者系统所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、方法、物品或者系统中还存在另外的相同要素。It will also be understood that, as used in this specification and the appended claims, the term "and/or" refers to and including any and all possible combinations of one or more of the associated listed items. It should be noted that, herein, the terms "comprising", "comprising" or any other variation thereof are intended to encompass non-exclusive inclusion, such that a process, method, article or system comprising a series of elements includes not only those elements, It also includes other elements not expressly listed or inherent to such a process, method, article or system. Without further limitation, an element qualified by the phrase "comprising a..." does not preclude the presence of additional identical elements in the process, method, article or system that includes the element.
上述本申请实施例序号仅仅为了描述,不代表实施例的优劣。以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到各种等效的修改或替换,这些修改或替换都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以权利要求的保护范围为准。The above-mentioned serial numbers of the embodiments of the present application are only for description, and do not represent the advantages or disadvantages of the embodiments. The above are only specific embodiments of the present application, but the protection scope of the present application is not limited thereto. Any person skilled in the art can easily think of various equivalents within the technical scope disclosed in the present application. Modifications or substitutions shall be covered by the protection scope of this application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims (20)

  1. 一种用户意图识别方法,其中,所述方法包括:A method for identifying user intent, wherein the method comprises:
    获取用户输入的语音数据对应的文本信息,并将所述文本信息输入至预设意图分类模型,得到用于表示语音意图的多个预设意图标签的输出概率;acquiring text information corresponding to the voice data input by the user, and inputting the text information into a preset intent classification model to obtain output probabilities of multiple preset intent labels used to represent voice intent;
    根据每个所述预设意图标签的输出概率,从多个所述预设意图标签中确定预设数量个候选意图标签;According to the output probability of each of the preset intent tags, determine a preset number of candidate intent tags from a plurality of the preset intent tags;
    确定预设意图知识图谱中的每个意图节点的对话成功率,其中,所述预设意图知识图谱是根据历史对话数据生成的;determining the dialog success rate of each intent node in the preset intent knowledge graph, wherein the preset intent knowledge graph is generated according to historical dialog data;
    根据每个所述意图节点的对话成功率确定每个所述候选意图标签的对话成功率;Determine the dialog success rate of each of the candidate intent labels according to the dialog success rate of each of the intent nodes;
    将对话成功率最高的候选意图标签确定为所述用户输入的语音数据的意图标签。The candidate intent label with the highest dialogue success rate is determined as the intent label of the voice data input by the user.
  2. 如权利要求1所述的用户意图识别方法,其中,所述根据每个所述预设意图标签的输出概率,从多个所述预设意图标签中确定预设数量个候选意图标签,包括:The method for identifying user intent according to claim 1, wherein determining a preset number of candidate intent tags from a plurality of preset intent tags according to an output probability of each preset intent tag, comprising:
    对多个所述预设意图标签按照输出概率的从大到小进行排序,得到意图标签队列;Sorting a plurality of the preset intent tags according to the output probability in descending order to obtain an intent tag queue;
    依次从所述意图标签队列选择所述预设意图标签,直至得到预设数量个所述候选意图标签。The preset intent tags are sequentially selected from the intent tag queue until a preset number of the candidate intent tags are obtained.
  3. 如权利要求1所述的用户意图识别方法,其中,所述确定预设意图知识图谱中的每个意图节点的对话成功率,包括:The method for identifying user intent according to claim 1, wherein the determining the dialog success rate of each intent node in the preset intent knowledge graph comprises:
    获取所述预设意图知识图谱;obtaining the preset intent knowledge graph;
    将所述预设意图知识图谱中与每个意图节点相对应的意图节点的成功率作为每个意图节点的对话成功率。The success rate of the intent node corresponding to each intent node in the preset intent knowledge graph is taken as the dialog success rate of each intent node.
  4. 如权利要求1所述的用户意图识别方法,其中,所述确定预设意图知识图谱中的每个意图节点的对话成功率,包括:The method for identifying user intent according to claim 1, wherein the determining the dialog success rate of each intent node in the preset intent knowledge graph comprises:
    从所述预设意图知识图谱中获取每个意图节点的多个流转路径,并统计所述多个流转路径的数量,其中,每个流转路径包括多个意图节点;Obtain multiple flow paths of each intention node from the preset intention knowledge graph, and count the number of the multiple flow paths, wherein each flow path includes multiple intention nodes;
    将最后一个意图节点的属性标识为预设属性标识的流转路径确定为流转成功路径;Determining the flow path with the attribute identification of the last intent node as the preset attribute identification as the successful flow path;
    统计每个所述意图节点的多个流转路径中的流转成功路径的数量;Count the number of successful circulation paths in the multiple circulation paths of each of the intent nodes;
    计算每个所述意图节点的多个流转路径中的流转成功路径的数量占所有流转路径数量的百分比,并将计算得到百分比作为每个意图节点的对话成功率。Calculate the percentage of the number of successful flow paths in the multiple flow paths of each intention node to the number of all flow paths, and use the calculated percentage as the dialogue success rate of each intention node.
  5. 如权利要求1-4任一项所述的用户意图识别方法,其中,所述根据每个所述意图节点的对话成功率确定每个所述候选意图标签的对话成功率,包括:The method for identifying user intent according to any one of claims 1-4, wherein the determining the dialog success rate of each of the candidate intent labels according to the dialog success rate of each of the intent nodes comprises:
    将每个所述意图节点的对话成功率和每个所述意图节点对应的预设意图标签的相映射,得到每个预设意图标签的对话成功率;Mapping the dialogue success rate of each of the intent nodes and the preset intent labels corresponding to each of the intent nodes to obtain the dialog success rate of each preset intent label;
    将所述候选意图标签与所述预设意图标签相映射,并将相映射的所述预设意图标签的对话成功率作为所述候选意图标签的对话成功率。The candidate intent tag is mapped with the preset intent tag, and the dialog success rate of the mapped preset intent tag is used as the dialog success rate of the candidate intent tag.
  6. 如权利要求1-4任一项所述的用户意图识别方法,其中,所述预设意图分类模型包括向量提取层、时延神经网络层、ReLU层、残差网络层、加和层、循环神经网络层、dropout层和Solfmaxlayer层;所述将所述文本信息输入至预设意图分类模型,得到N个预设意图标签的输出概率,包括:The user intent identification method according to any one of claims 1-4, wherein the preset intent classification model comprises a vector extraction layer, a time-delay neural network layer, a ReLU layer, a residual network layer, a summation layer, a loop A neural network layer, a dropout layer and a Solfmaxlayer layer; the text information is input into the preset intent classification model to obtain the output probabilities of N preset intent labels, including:
    将所述文本信息输入至所述向量提取层,得到多个词向量;Inputting the text information to the vector extraction layer to obtain a plurality of word vectors;
    将多个所述词向量输入至所述时延神经网络层,提取到多个词向量特征;inputting a plurality of the word vectors into the delay neural network layer, and extracting a plurality of word vector features;
    将多个所述词向量特征输入至所述ReLU层,得到语义标签向量;Inputting a plurality of the word vector features to the ReLU layer to obtain a semantic label vector;
    将所述语义标签向量和多个所述词向量输入至所述加和层,得到多个初步意图标签向量;Inputting the semantic label vector and a plurality of the word vectors to the summation layer to obtain a plurality of preliminary intent label vectors;
    将多个所述初步意图标签向量输入至所述循环神经网络层,得到多个候选意图标签向 量;Inputting a plurality of the preliminary intent label vectors to the recurrent neural network layer to obtain a plurality of candidate intent label vectors;
    将多个所述候选意图标签向量输入至所述dropout层,得到多个预设意图标签向量;Inputting a plurality of the candidate intent label vectors to the dropout layer to obtain a plurality of preset intent label vectors;
    将多个所述意图标签向量输入至所述Solfmaxlayer层,得到预设意图标签的输出概率。A plurality of the intent label vectors are input to the Solfmaxlayer layer to obtain the output probability of the preset intent label.
  7. 一种用户意图识别装置,其中,所述用户意图识别装置包括获取模块、生成模块、筛选模块、第一确定模块、第二确定模块和第三确定模块,其中:A user intention identification device, wherein the user intention identification device includes an acquisition module, a generation module, a screening module, a first determination module, a second determination module and a third determination module, wherein:
    所述获取模块,用于获取用户输入的语音数据对应的文本信息;The obtaining module is used to obtain text information corresponding to the voice data input by the user;
    所述生成模块,用于将所述文本信息输入至预设意图分类模型,得到用于表示语音意图的多个预设意图标签的输出概率;The generating module is configured to input the text information into a preset intent classification model to obtain output probabilities of multiple preset intent labels used to represent voice intents;
    所述筛选模块,用于根据每个所述预设意图标签的输出概率,从多个所述预设意图标签中确定预设数量个候选意图标签;the screening module, configured to determine a preset number of candidate intent tags from a plurality of the preset intent tags according to the output probability of each of the preset intent tags;
    所述第一确定模块,用于确定预设意图知识图谱中的每个意图节点的对话成功率,其中,所述预设意图知识图谱是根据历史对话数据生成的;The first determining module is configured to determine the dialog success rate of each intent node in the preset intent knowledge graph, wherein the preset intent knowledge graph is generated according to historical dialog data;
    所述第二确定模块,用于根据每个所述意图节点的对话成功率确定每个所述候选意图标签的对话成功率;the second determining module, configured to determine the dialogue success rate of each of the candidate intent tags according to the dialogue success rate of each of the intent nodes;
    所述第三确定模块,用于将对话成功率最高的候选意图标签确定为所述用户输入的语音数据的意图标签。The third determining module is configured to determine the candidate intent label with the highest dialogue success rate as the intent label of the voice data input by the user.
  8. 如权利要求7所述的用户意图识别装置,其中,所述筛选模块还包括如下子模块:The device for identifying user intentions as claimed in claim 7, wherein the screening module further comprises the following sub-modules:
    排序子模块,用于对多个所述预设意图标签按照输出概率的从大到小进行排序,得到意图标签队列;a sorting sub-module, configured to sort a plurality of the preset intent tags according to the descending output probability to obtain an intent tag queue;
    选择子模块,用于依次从所述意图标签队列选择所述预设意图标签,直至得到预设数量个所述候选意图标签。A selection sub-module, configured to sequentially select the preset intent tags from the intent tag queue until a preset number of the candidate intent tags are obtained.
  9. 如权利要求7所述的用户意图识别装置,其中,所述第一确定模块还包括如下子模块:The device for identifying user intent according to claim 7, wherein the first determining module further comprises the following sub-modules:
    知识图谱获取子模块,用于获取所述预设意图知识图谱;a knowledge graph acquisition sub-module for acquiring the preset intent knowledge graph;
    设置子模块,用于将所述预设意图知识图谱中与每个意图节点相对应的意图节点的成功率作为每个意图节点的对话成功率。A sub-module is set for taking the success rate of the intent node corresponding to each intent node in the preset intent knowledge graph as the dialog success rate of each intent node.
  10. 如权利要求7所述的用户意图识别装置,其中,所述第一确定模块包括获取子模块、统计模块、确定子模块和计算模块,其中:The user intention identification device according to claim 7, wherein the first determination module comprises an acquisition sub-module, a statistics module, a determination sub-module and a calculation module, wherein:
    所述获取子模块,用于从所述预设意图知识图谱中获取每个意图节点的多个流转路径;The obtaining submodule is used to obtain multiple flow paths of each intention node from the preset intention knowledge graph;
    所述统计模块,用于统计所述多个流转路径的数量,其中,每个流转路径包括多个意图节点;the statistics module, configured to count the number of the multiple flow paths, wherein each flow path includes multiple intent nodes;
    所述确定子模块,用于将最后一个意图节点的属性标识为预设属性标识的流转路径确定为流转成功路径;The determining submodule is used to determine the flow path whose attribute identifier of the last intent node is the preset attribute identifier as the successful flow path;
    所述统计模块,还用于统计每个所述意图节点的多个流转路径中的流转成功路径的数量;The statistics module is further configured to count the number of successful circulation paths in the multiple circulation paths of each of the intention nodes;
    所述计算模块,用于计算每个所述意图节点的多个流转路径中的流转成功路径的数量占所有流转路径数量的百分比,并将计算得到百分比作为每个意图节点的对话成功率。The calculation module is configured to calculate the percentage of the number of successful circulation paths in the multiple circulation paths of each intention node to the number of all circulation paths, and use the calculated percentage as the dialogue success rate of each intention node.
  11. 如权利要求7-11任一项所述的用户意图识别装置,其中,所述第二确定模块还包括如下子模块:The user intention identification device according to any one of claims 7-11, wherein the second determining module further comprises the following sub-modules:
    第一映射子模块,用于将每个所述意图节点的对话成功率和每个所述意图节点对应的预设意图标签的相映射,得到每个预设意图标签的对话成功率;a first mapping submodule, configured to map the dialogue success rate of each of the intent nodes and the preset intent labels corresponding to each of the intent nodes to obtain the dialog success rate of each preset intent label;
    第二映射子模块,用于将所述候选意图标签与所述预设意图标签相映射,并将相映射的所述预设意图标签的对话成功率作为所述候选意图标签的对话成功率。The second mapping submodule is configured to map the candidate intent tag with the preset intent tag, and use the mapped dialog success rate of the preset intent tag as the dialog success rate of the candidate intent tag.
  12. 如权利要求7-11任一项所述的用户意图识别装置,其中,所述预设意图分类模型包括向量提取层、时延神经网络层、ReLU层、残差网络层、加和层、循环神经网络层、dropout层和Solfmaxlayer层;所述生成模块还包括如下子模块:The user intent recognition device according to any one of claims 7-11, wherein the preset intent classification model comprises a vector extraction layer, a time-delay neural network layer, a ReLU layer, a residual network layer, a summation layer, a loop Neural network layer, dropout layer and Solfmaxlayer layer; the generation module also includes the following sub-modules:
    第一输入子模块,用于将所述文本信息输入至所述向量提取层,得到多个词向量;a first input submodule, for inputting the text information into the vector extraction layer to obtain a plurality of word vectors;
    第二输入子模块,用于将多个所述词向量输入至所述时延神经网络层,提取到多个词向量特征;The second input sub-module is used for inputting a plurality of the word vectors to the time delay neural network layer, and extracting a plurality of word vector features;
    第三输入子模块,用于将多个所述词向量特征输入至所述ReLU层,得到语义标签向量;The third input sub-module is used to input a plurality of the word vector features to the ReLU layer to obtain a semantic label vector;
    第四输入子模块,用于将所述语义标签向量和多个所述词向量输入至所述加和层,得到多个初步意图标签向量;a fourth input sub-module, configured to input the semantic label vector and a plurality of the word vectors into the summation layer to obtain a plurality of preliminary intent label vectors;
    第五输入子模块,用于将多个所述初步意图标签向量输入至所述循环神经网络层,得到多个候选意图标签向量;a fifth input sub-module, configured to input a plurality of the preliminary intent label vectors into the recurrent neural network layer to obtain a plurality of candidate intent label vectors;
    第七输入子模块,用于将多个所述候选意图标签向量输入至所述dropout层,得到多个预设意图标签向量;a seventh input sub-module, configured to input a plurality of the candidate intent label vectors to the dropout layer to obtain a plurality of preset intent label vectors;
    第八输入子模块,用于将多个所述意图标签向量输入至所述Solfmaxlayer层,得到预设意图标签的输出概率。The eighth input sub-module is used for inputting a plurality of the intent label vectors to the Solfmaxlayer layer to obtain the output probability of the preset intent label.
  13. 一种计算机设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现:A computer device comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, the processor executing the computer program to achieve:
    获取用户输入的语音数据对应的文本信息,并将所述文本信息输入至预设意图分类模型,得到用于表示语音意图的多个预设意图标签的输出概率;acquiring text information corresponding to the voice data input by the user, and inputting the text information into a preset intent classification model to obtain output probabilities of multiple preset intent labels used to represent voice intent;
    根据每个所述预设意图标签的输出概率,从多个所述预设意图标签中确定预设数量个候选意图标签;According to the output probability of each of the preset intent tags, determine a preset number of candidate intent tags from a plurality of the preset intent tags;
    确定预设意图知识图谱中的每个意图节点的对话成功率,其中,所述预设意图知识图谱是根据历史对话数据生成的;determining the dialog success rate of each intent node in the preset intent knowledge graph, wherein the preset intent knowledge graph is generated according to historical dialog data;
    根据每个所述意图节点的对话成功率确定每个所述候选意图标签的对话成功率;Determine the dialog success rate of each of the candidate intent labels according to the dialog success rate of each of the intent nodes;
    将对话成功率最高的候选意图标签确定为所述用户输入的语音数据的意图标签。The candidate intent label with the highest dialogue success rate is determined as the intent label of the voice data input by the user.
  14. 根据权利要求13所述的计算机设备,其中,所述处理器执行所述计算机程序时还实现:The computer device of claim 13, wherein the processor, when executing the computer program, further implements:
    对多个所述预设意图标签按照输出概率的从大到小进行排序,得到意图标签队列;Sorting a plurality of the preset intent tags according to the output probability in descending order to obtain an intent tag queue;
    依次从所述意图标签队列选择所述预设意图标签,直至得到预设数量个所述候选意图标签。The preset intent tags are sequentially selected from the intent tag queue until a preset number of the candidate intent tags are obtained.
  15. 根据权利要求13所述的计算机设备,其中,所述处理器执行所述计算机程序时还实现:The computer device of claim 13, wherein the processor, when executing the computer program, further implements:
    获取所述预设意图知识图谱;obtaining the preset intent knowledge graph;
    将所述预设意图知识图谱中与每个意图节点相对应的意图节点的成功率作为每个意图节点的对话成功率。The success rate of the intent node corresponding to each intent node in the preset intent knowledge graph is taken as the dialog success rate of each intent node.
  16. 根据权利要求13所述的计算机设备,其中,所述处理器执行所述计算机程序时还实现:The computer device of claim 13, wherein the processor, when executing the computer program, further implements:
    从所述预设意图知识图谱中获取每个意图节点的多个流转路径,并统计所述多个流转路径的数量,其中,每个流转路径包括多个意图节点;Obtain multiple flow paths of each intention node from the preset intention knowledge graph, and count the number of the multiple flow paths, wherein each flow path includes multiple intention nodes;
    将最后一个意图节点的属性标识为预设属性标识的流转路径确定为流转成功路径;Determining the flow path with the attribute identification of the last intent node as the preset attribute identification as the successful flow path;
    统计每个所述意图节点的多个流转路径中的流转成功路径的数量;Count the number of successful circulation paths in the multiple circulation paths of each of the intent nodes;
    计算每个所述意图节点的多个流转路径中的流转成功路径的数量占所有流转路径数量的百分比,并将计算得到百分比作为每个意图节点的对话成功率。Calculate the percentage of the number of successful flow paths in the multiple flow paths of each intention node to the number of all flow paths, and use the calculated percentage as the dialogue success rate of each intention node.
  17. 一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,其中,所述计算机程序被处理器执行时实现:A computer-readable storage medium storing a computer program, wherein the computer program is executed by a processor to realize:
    获取用户输入的语音数据对应的文本信息,并将所述文本信息输入至预设意图分类模型,得到用于表示语音意图的多个预设意图标签的输出概率;acquiring text information corresponding to the voice data input by the user, and inputting the text information into a preset intent classification model to obtain output probabilities of multiple preset intent labels used to represent voice intent;
    根据每个所述预设意图标签的输出概率,从多个所述预设意图标签中确定预设数量个 候选意图标签;According to the output probability of each of the preset intent tags, determine a preset number of candidate intent tags from a plurality of the preset intent tags;
    确定预设意图知识图谱中的每个意图节点的对话成功率,其中,所述预设意图知识图谱是根据历史对话数据生成的;determining the dialog success rate of each intent node in the preset intent knowledge graph, wherein the preset intent knowledge graph is generated according to historical dialog data;
    根据每个所述意图节点的对话成功率确定每个所述候选意图标签的对话成功率;Determine the dialog success rate of each of the candidate intent labels according to the dialog success rate of each of the intent nodes;
    将对话成功率最高的候选意图标签确定为所述用户输入的语音数据的意图标签。The candidate intent label with the highest dialogue success rate is determined as the intent label of the voice data input by the user.
  18. 根据权利要求17所述的计算机可读存储介质,其中,所述计算机程序被处理器执行时还实现:The computer-readable storage medium of claim 17, wherein the computer program, when executed by the processor, further implements:
    对多个所述预设意图标签按照输出概率的从大到小进行排序,得到意图标签队列;Sorting a plurality of the preset intent tags according to the output probability in descending order to obtain an intent tag queue;
    依次从所述意图标签队列选择所述预设意图标签,直至得到预设数量个所述候选意图标签。The preset intent tags are sequentially selected from the intent tag queue until a preset number of the candidate intent tags are obtained.
  19. 根据权利要求17所述的计算机可读存储介质,其中,所述计算机程序被处理器执行时还实现:The computer-readable storage medium of claim 17, wherein the computer program, when executed by the processor, further implements:
    获取所述预设意图知识图谱;obtaining the preset intent knowledge graph;
    将所述预设意图知识图谱中与每个意图节点相对应的意图节点的成功率作为每个意图节点的对话成功率。The success rate of the intent node corresponding to each intent node in the preset intent knowledge graph is taken as the dialog success rate of each intent node.
  20. 根据权利要求17所述的计算机可读存储介质,其中,所述计算机程序被处理器执行时还实现:The computer-readable storage medium of claim 17, wherein the computer program, when executed by the processor, further implements:
    从所述预设意图知识图谱中获取每个意图节点的多个流转路径,并统计所述多个流转路径的数量,其中,每个流转路径包括多个意图节点;Obtain multiple flow paths of each intention node from the preset intention knowledge graph, and count the number of the multiple flow paths, wherein each flow path includes multiple intention nodes;
    将最后一个意图节点的属性标识为预设属性标识的流转路径确定为流转成功路径;Determining the flow path with the attribute identification of the last intent node as the preset attribute identification as the successful flow path;
    统计每个所述意图节点的多个流转路径中的流转成功路径的数量;Count the number of successful circulation paths in the multiple circulation paths of each of the intent nodes;
    计算每个所述意图节点的多个流转路径中的流转成功路径的数量占所有流转路径数量的百分比,并将计算得到百分比作为每个意图节点的对话成功率。Calculate the percentage of the number of successful flow paths in the multiple flow paths of each intention node to the number of all flow paths, and use the calculated percentage as the dialogue success rate of each intention node.
PCT/CN2021/084250 2020-12-30 2021-03-31 User intention recognition method and apparatus, device, and computer-readable storage medium WO2022141875A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011631344.4 2020-12-30
CN202011631344.4A CN112732882A (en) 2020-12-30 2020-12-30 User intention identification method, device, equipment and computer readable storage medium

Publications (1)

Publication Number Publication Date
WO2022141875A1 true WO2022141875A1 (en) 2022-07-07

Family

ID=75608457

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/084250 WO2022141875A1 (en) 2020-12-30 2021-03-31 User intention recognition method and apparatus, device, and computer-readable storage medium

Country Status (2)

Country Link
CN (1) CN112732882A (en)
WO (1) WO2022141875A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113377969B (en) * 2021-08-16 2021-11-09 中航信移动科技有限公司 Intention recognition data processing system
CN113593533B (en) * 2021-09-10 2023-05-02 平安科技(深圳)有限公司 Method, device, equipment and medium for jumping flow node based on intention recognition
CN114860912B (en) * 2022-05-20 2023-08-29 马上消费金融股份有限公司 Data processing method, device, electronic equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9720981B1 (en) * 2016-02-25 2017-08-01 International Business Machines Corporation Multiple instance machine learning for question answering systems
CN110377911A (en) * 2019-07-23 2019-10-25 中国工商银行股份有限公司 Intension recognizing method and device under dialogue frame
CN111695352A (en) * 2020-05-28 2020-09-22 平安科技(深圳)有限公司 Grading method and device based on semantic analysis, terminal equipment and storage medium
CN111897935A (en) * 2020-07-30 2020-11-06 北京文思海辉金信软件有限公司 Knowledge graph-based dialogical path selection method and device and computer equipment
CN111949787A (en) * 2020-08-21 2020-11-17 平安国际智慧城市科技股份有限公司 Automatic question-answering method, device, equipment and storage medium based on knowledge graph

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9720981B1 (en) * 2016-02-25 2017-08-01 International Business Machines Corporation Multiple instance machine learning for question answering systems
CN110377911A (en) * 2019-07-23 2019-10-25 中国工商银行股份有限公司 Intension recognizing method and device under dialogue frame
CN111695352A (en) * 2020-05-28 2020-09-22 平安科技(深圳)有限公司 Grading method and device based on semantic analysis, terminal equipment and storage medium
CN111897935A (en) * 2020-07-30 2020-11-06 北京文思海辉金信软件有限公司 Knowledge graph-based dialogical path selection method and device and computer equipment
CN111949787A (en) * 2020-08-21 2020-11-17 平安国际智慧城市科技股份有限公司 Automatic question-answering method, device, equipment and storage medium based on knowledge graph

Also Published As

Publication number Publication date
CN112732882A (en) 2021-04-30

Similar Documents

Publication Publication Date Title
CN107818781B (en) Intelligent interaction method, equipment and storage medium
WO2022141875A1 (en) User intention recognition method and apparatus, device, and computer-readable storage medium
CN109241524B (en) Semantic analysis method and device, computer-readable storage medium and electronic equipment
CN109241255B (en) Intention identification method based on deep learning
WO2020244073A1 (en) Speech-based user classification method and device, computer apparatus, and storage medium
US11526663B2 (en) Methods, apparatuses, devices, and computer-readable storage media for determining category of entity
US11727211B2 (en) Systems and methods for colearning custom syntactic expression types for suggesting next best correspondence in a communication environment
WO2020057413A1 (en) Junk text identification method and device, computing device and readable storage medium
US11790174B2 (en) Entity recognition method and apparatus
WO2022048194A1 (en) Method, apparatus and device for optimizing event subject identification model, and readable storage medium
US11604925B1 (en) Architecture for gazetteer-augmented named entity recognition
CN111046656A (en) Text processing method and device, electronic equipment and readable storage medium
CN111274797A (en) Intention recognition method, device and equipment for terminal and storage medium
CN111353021B (en) Intention recognition method and device, electronic device and medium
CN110414005B (en) Intention recognition method, electronic device and storage medium
WO2022141876A1 (en) Word embedding-based search method, apparatus and device, and storage medium
CN113326702A (en) Semantic recognition method and device, electronic equipment and storage medium
CN114398968B (en) Method and device for labeling similar customer-obtaining files based on file similarity
CN108763202A (en) Method, apparatus, equipment and the readable storage medium storing program for executing of the sensitive text of identification
CN112036186A (en) Corpus labeling method and device, computer storage medium and electronic equipment
WO2021185113A1 (en) Data analysis method based on multiple analysis tasks and electronic device
CN110502755B (en) Character string recognition method based on fusion model and computer storage medium
CN110874408B (en) Model training method, text recognition device and computing equipment
Korpusik et al. Dialogue state tracking with convolutional semantic taggers
CN114841143A (en) Voice room quality evaluation method and device, equipment, medium and product thereof

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21912645

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21912645

Country of ref document: EP

Kind code of ref document: A1