CN109582773A - Intelligent answer matching process and device - Google Patents

Intelligent answer matching process and device Download PDF

Info

Publication number
CN109582773A
CN109582773A CN201811448785.3A CN201811448785A CN109582773A CN 109582773 A CN109582773 A CN 109582773A CN 201811448785 A CN201811448785 A CN 201811448785A CN 109582773 A CN109582773 A CN 109582773A
Authority
CN
China
Prior art keywords
subject information
value
history
history subject
real
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811448785.3A
Other languages
Chinese (zh)
Other versions
CN109582773B (en
Inventor
熊立志
李千
李伟文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Aiwen Technology Co Ltd
Original Assignee
Shenzhen Aiwen Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Aiwen Technology Co Ltd filed Critical Shenzhen Aiwen Technology Co Ltd
Priority to CN201811448785.3A priority Critical patent/CN109582773B/en
Publication of CN109582773A publication Critical patent/CN109582773A/en
Application granted granted Critical
Publication of CN109582773B publication Critical patent/CN109582773B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Abstract

The present embodiments relate to technical field of information processing, in particular to a kind of intelligent answer matching process and device.This method can realize multi-angular analysis and the processing to question information by convolutional neural networks model, machine learning model, and then filter out qualified history subject information, and calculate the score value of these history subject informations, dual judgement is carried out based on score value to obtain the question and answer pair for meeting decision condition, it so, it is possible the quality for the question and answer pair that raising acquires.

Description

Intelligent answer matching process and device
Technical field
The present embodiments relate to technical field of information processing, in particular to a kind of intelligent answer matching process and Device.
Background technique
The earliest realization conception of question answering system can trace back to turing test.In order to which whether test machine has mankind's intelligence Can, turing test requires computer that can answer a series of problems proposed by mankind tester in 5 minutes, and it is reached more than 30% answer allows tester to be mistakenly considered the mankind and is answered.With the development of the relevant technologies such as artificial intelligence, natural language processing, Variation for different data shapes also derives different types of question answering system.But existing question answering system is directed to user The new problem processing mode of proposition is single, and flexibility is not high, and then causes the answer matched of low quality.
Summary of the invention
In view of this, the present invention provides a kind of intelligent answer matching process and devices.
The embodiment of the invention provides a kind of intelligent answer matching process, applied to the service connecting with client communication End, which comprises
The question information that the client is sent is received, the real-time subject information of the question information is extracted;
Convolutional neural networks model is constructed, all history subject informations in presetting database are refreshing to the convolution is inputted It is trained through network model;
The multiple history subject informations to match with the real-time subject information are found out from the presetting database, In, the first similarity between each history subject information and the real-time subject information in the multiple history subject information Value is less than first threshold;The multiple history subject information is inputted into the convolutional neural networks model of training completion to calculate State the second similarity value between each history subject information and the real-time subject information in multiple history subject informations;Base In multiple second similarity values being calculated, preset quantity is chosen from the multiple history subject information according to setting sequence A history subject information;
According to the building of the multiple history subject information and training machine learning model, by the real-time subject information and institute It states the preset quantity history subject information input machine learning model and the preset quantity history theme letter is calculated The score value of each history subject information in breath;
Judge whether the maximum value in the multiple score values being calculated reaches setting value, if the maximum value reaches described Setting value calculates the editing distance value between the corresponding history subject information of the maximum value and the real-time subject information;
Judge whether the editing distance value is less than second threshold, if the editing distance value is less than the second threshold, The question and answer pair for obtaining the corresponding history subject information of the maximum value, by the question and answer to being sent to the client.
Optionally, the method also includes:
If the maximum value does not reach the setting value, searches whether to exist and match with the real-time subject information The corresponding client of user's portrait;If it exists, the question information is sent to the client found out so that the visitor found out Family end is answered by the question information.
Optionally, based on multiple second similarity values being calculated, according to setting sequence from the multiple history theme The step of preset quantity history subject information is chosen in information, comprising:
Multiple second similarity values being calculated are ranked up according to sequence from high to low;
Obtain the corresponding history subject information of the second similarity value of the forward setting quantity that sorts.
Optionally, according to the building of the multiple history subject information and training machine learning model the step of, comprising:
For each history subject information in the multiple history subject information, obtains in the history subject information and include Label characteristics value, characteristic of division value and similarity characteristic value;
Building machine learning model, the multiple labels that will acquire are established based on gradient boosted tree and algorithm with regress analysis method Characteristic value, multiple characteristic of division values and multiple similarity characteristic values input the machine learning model and are trained.
Optionally, the real-time subject information and the preset quantity history subject information are inputted into the machine learning The step of score value of each history subject information in the preset quantity history subject information is calculated in model, packet It includes:
Obtain the label characteristics value, characteristic of division value and similarity characteristic value of the real-time subject information;
Label characteristics value, the classification for obtaining each history subject information in the preset quantity history subject information are special Value indicative and similarity characteristic value;
By the label characteristics value of the real-time subject information, characteristic of division value and similarity characteristic value and the present count Measure the label characteristics value, characteristic of division value and the input of similarity characteristic value of each history subject information in a history subject information Each history subject information in the preset quantity history subject information is calculated in the machine learning model that training is completed Score value.
The embodiment of the invention also provides a kind of intelligent answer coalignments, applied to the service connecting with client communication End, described device include:
Real-time subject information extraction module, the question information sent for receiving the client extract the enquirement letter The real-time subject information of breath;
Convolutional neural networks model construction module, for constructing convolutional neural networks model, by the institute in presetting database There is history subject information to be trained to the convolutional neural networks model is inputted;
Subject information screening module matches for finding out from the presetting database with the real-time subject information Multiple history subject informations, wherein each history subject information in the multiple history subject information and the real-time master The first similarity value inscribed between information is less than first threshold;The multiple history subject information is inputted into the convolution that training is completed Neural network model is believed with calculating each history subject information in the multiple history subject information and the real-time theme The second similarity value between breath;Based on multiple second similarity values being calculated, gone through according to setting sequence from the multiple Preset quantity history subject information is chosen in history subject information;
Score value computing module will for constructing simultaneously training machine learning model according to the multiple history subject information The real-time subject information and the preset quantity history subject information input machine learning model are calculated described The score value of each history subject information in preset quantity history subject information;
First judgment module, for judging whether the maximum value in the multiple score values being calculated reaches setting value, if The maximum value reaches the setting value, calculate the corresponding history subject information of the maximum value and the real-time subject information it Between editing distance value;
Second judgment module, for judging whether the editing distance value is less than second threshold, if the editing distance value Less than the second threshold, the question and answer pair of the corresponding history subject information of the maximum value are obtained, by the question and answer to being sent to The client.
Optionally, the first judgment module is also used to:
If the maximum value does not reach the setting value, searches whether to exist and match with the real-time subject information The corresponding client of user's portrait;If it exists, the question information is sent to the client found out so that the visitor found out Family end is answered by the question information.
Optionally, the subject information screening module is in the following manner based on multiple second similarities being calculated Value, chooses preset quantity history subject information according to setting sequence from the multiple history subject information:
Multiple second similarity values being calculated are ranked up according to sequence from high to low;
Obtain the corresponding history subject information of the second similarity value of the forward setting quantity that sorts.
Optionally, the score value computing module is constructed and is instructed according to the multiple history subject information in the following manner Practice machine learning model:
For each history subject information in the multiple history subject information, obtains in the history subject information and include Label characteristics value, characteristic of division value and similarity characteristic value;
Building machine learning model, the multiple labels that will acquire are established based on gradient boosted tree and algorithm with regress analysis method Characteristic value, multiple characteristic of division values and multiple similarity characteristic values input the machine learning model and are trained.
Optionally, the score value computing module is in the following manner by the real-time subject information and the preset quantity A history subject information inputs the machine learning model and each of described preset quantity history subject information is calculated The score value of history subject information:
Obtain the label characteristics value, characteristic of division value and similarity characteristic value of the real-time subject information;
Label characteristics value, the classification for obtaining each history subject information in the preset quantity history subject information are special Value indicative and similarity characteristic value;
By the label characteristics value of the real-time subject information, characteristic of division value and similarity characteristic value and the present count Measure the label characteristics value, characteristic of division value and the input of similarity characteristic value of each history subject information in a history subject information Each history subject information in the preset quantity history subject information is calculated in the machine learning model that training is completed Score value.
The embodiment of the invention also provides a kind of server-side, including memory, processor and storage are on a memory simultaneously The computer program that can be run on a processor, the processor realize above-mentioned intelligent answer when executing the computer program Matching process.
The embodiment of the invention also provides a kind of computer readable storage medium, the readable storage medium storing program for executing includes computer Program, the server-side computer program controls the readable storage medium storing program for executing when running where execute above-mentioned intelligent answer matching Method.
Beneficial effect
Intelligent answer matching process and device provided in an embodiment of the present invention, be primarily based in presetting database all goes through History subject information is trained the convolutional neural networks model of building, the convolutional neural networks model then finished by training The second similarity value of each history subject information to match with real-time subject information is calculated, and is pressed based on the second similarity value Preset quantity history theme letter is chosen from the multiple history subject informations to match with real-time subject information according to setting sequence Breath calculates the score value to preset quantity history subject information using machine learning model, is finally carried out based on score value double Major punishment is fixed to obtain the question and answer pair for meeting decision condition, in this way, can be real by convolutional neural networks model, machine learning model Now to the multi-angular analysis of question information and processing, and then improve the quality of the question and answer pair acquired.
Further, it searches matched user according to real-time subject information to draw a portrait corresponding client, and by question information It is sent to the client, improves the flexibility to question information processing, and the client can be guaranteed for the question information The quality of given answer.
Detailed description of the invention
In order to illustrate the technical solution of the embodiments of the present invention more clearly, below will be to needed in the embodiment attached Figure is briefly described, it should be understood that the following drawings illustrates only certain embodiments of the present invention, therefore is not construed as pair The restriction of range for those of ordinary skill in the art without creative efforts, can also be according to this A little attached drawings obtain other relevant attached drawings.
Fig. 1 is a kind of block diagram of server-side 10 provided by the embodiment of the present invention.
Fig. 2 is a kind of flow chart of intelligent answer matching process provided by the embodiment of the present invention.
Fig. 3 is the schematic diagram for the sub-step that step S24 shown in Fig. 2 includes in an embodiment.
Fig. 4 is a kind of module frame chart of intelligent answer coalignment 20 provided by the embodiment of the present invention.
Icon:
10- server-side;11- memory;12- processor;13- network module;
20- intelligent answer coalignment;The real-time subject information extraction module of 21-;22- convolutional neural networks model construction mould Block;23- subject information screening module;24- score value computing module;25- first judgment module;The second judgment module of 26-.
Specific embodiment
In order to make the object, technical scheme and advantages of the embodiment of the invention clearer, below in conjunction with the embodiment of the present invention In attached drawing, technical scheme in the embodiment of the invention is clearly and completely described, it is clear that described embodiment only It is a part of the embodiments of the present invention, instead of all the embodiments.The present invention being usually described and illustrated herein in the accompanying drawings The component of embodiment can be arranged and be designed with a variety of different configurations.
Therefore, the detailed description of the embodiment of the present invention provided in the accompanying drawings is not intended to limit below claimed The scope of the present invention, but be merely representative of selected embodiment of the invention.Based on the embodiments of the present invention, this field is common Technical staff's every other embodiment obtained without creative efforts belongs to the model that the present invention protects It encloses.
It should also be noted that similar label and letter indicate similar terms in following attached drawing, therefore, once a certain Xiang Yi It is defined in a attached drawing, does not then need that it is further defined and explained in subsequent attached drawing.
Inventor further investigation reveals that, existing question answering system for user propose new problem processing mode it is single, flexibly Property is not high, and then causes the answer matched of low quality.
Defect present in the above scheme in the prior art, is that inventor is obtaining after practicing and carefully studying As a result, therefore, the solution that the discovery procedure of the above problem and the hereinafter embodiment of the present invention are proposed regarding to the issue above Scheme all should be the contribution that inventor makes the present invention in process of the present invention.
Based on the studies above, the embodiment of the invention provides a kind of intelligent answer matching process and devices, pass through convolution mind It can be realized multi-angular analysis and processing to question information through network model, machine learning model, and then guarantee to acquire Question and answer pair quality.
Fig. 1 shows a kind of block diagram of server-side 10 provided by the embodiment of the present invention.In the embodiment of the present invention Server-side 10 have data storage, transmission, processing function, as shown in Figure 1, server-side 10 includes: memory 11, processor 12, network module 13 and intelligent answer coalignment 20.
It is directly or indirectly electrically connected between memory 11, processor 12 and network module 13, to realize the biography of data Defeated or interaction.It is electrically connected for example, these elements can be realized from each other by one or more communication bus or signal wire. Intelligent answer coalignment 20 is stored in memory 11, the intelligent answer coalignment 20 includes at least one can be with software Or the form of firmware (firmware) is stored in the software function module in the memory 11, the processor 12 passes through operation The intelligent answer coalignment 20 being stored in the software program and module, such as the embodiment of the present invention in memory 11, from And perform various functions application and data processing, that is, realize the intelligent answer matching process in the embodiment of the present invention.
Wherein, the memory 11 may be, but not limited to, random access memory (Random Access Memory, RAM), read-only memory (Read Only Memory, ROM), programmable read only memory (Programmable Read-Only Memory, PROM), erasable read-only memory (Erasable Programmable Read-Only Memory, EPROM), Electricallyerasable ROM (EEROM) (Electric Erasable Programmable Read-Only Memory, EEPROM) etc.. Wherein, memory 11 is for storing program, and the processor 12 executes described program after receiving and executing instruction.
The processor 12 may be a kind of IC chip, the processing capacity with data.Above-mentioned processor 12 It can be general processor, including central processing unit (Central Processing Unit, CPU), network processing unit (Network Processor, NP) etc..It may be implemented or execute each method, step disclosed in the embodiment of the present invention and patrol Collect block diagram.General processor can be microprocessor or the processor is also possible to any conventional processor etc..
Network module 13 is used to establish the communication connection between server-side 10 and other communication terminal devices by network, real The transmitting-receiving operation of existing network signal and data.Above-mentioned network signal may include wireless signal or wire signal.
It is appreciated that structure shown in FIG. 1 is only to illustrate, server-side 10 may also include than shown in Fig. 1 more or more Few component, or with the configuration different from shown in Fig. 1.Each component shown in Fig. 1 can use hardware, software or its group It closes and realizes.
The embodiment of the present invention also provides a kind of computer readable storage medium, and the readable storage medium storing program for executing includes computer journey Sequence.Server-side 10 computer program controls the readable storage medium storing program for executing when running where executes following intelligent answer matching Method.
Fig. 2 shows a kind of flow charts of intelligent answer matching process provided by the embodiment of the present invention.The method has Method and step defined in the process of pass is applied to server-side 10, can be realized by the processor 12.It below will be to shown in Fig. 2 Detailed process be described in detail:
In the present embodiment, server-side 10 is connect with a client communication, and server-side 10 includes a presetting database, this is pre- If being stored with multiple question and answer pair in database, these question and answer are to user's offer by 10 place platform of server-side, wherein Mei Gewen It answers questions and is corresponding with a history subject information, for example, the history subject information in presetting database is M1~Mn
Step S21 receives the question information that client is sent, extracts the real-time subject information of question information.
Server-side 10 carries out keyword extraction to the question information received, and then obtains the corresponding real-time master of question information Inscribe information Mt, for question and answer matching later.
Step S22 constructs convolutional neural networks model, and all history subject informations in presetting database roll up input Product neural network model is trained.
Convolutional neural networks (Convolutional Neural Networks, CNN) model is constructed, by M1~MnInput CNN model is trained.
Step S23 finds out the multiple history subject informations to match with real-time subject information from presetting database, will The convolutional neural networks that multiple history subject information input training are completed are gone through with calculating each of multiple history subject informations The second similarity value between history subject information and real-time subject information is pressed based on multiple second similarity values being calculated Preset quantity history subject information is chosen from multiple history subject informations according to setting sequence.
For convenient for subsequent explanation, it is assumed that with MtThe multiple history subject informations to match are M1~M20, wherein M1~M20In Each history subject information and MtBetween the first similarity value be less than first threshold, first threshold can be according to the actual situation Setting, the first similarity value are cosine similarity value.
Optionally, server-side 10 can use search engine and find out from presetting database and real-time subject information phase The multiple history subject informations matched.
In the present embodiment, using presetting database as first database, the database that search engine is inquired as Second database will complete the CNN model of training as third database.
By M1~M20Input training completes amount CNN model to calculate M1~M20In each history subject information and Mt Between the second similarity value, multiple second similarity values being calculated are ranked up according to sequence from high to low, and The corresponding history subject information of the second similarity value of forward setting quantity that sorts is obtained, for example, set quantity as five, acquisition First five history subject information are as follows: M3、M1、M2、M8And M13
Step S24, according to the building of multiple history subject informations and training machine learning model, by real-time subject information and in advance If quantity history subject information input machine learning model is calculated each of preset quantity history subject information and goes through The score value of history subject information.
Fig. 3 is please referred to, is enumerated in the present embodiment by step S241, step S242, step S243 and step S244 One of implementation of step S24.
Step S241 believes for each history theme in the multiple history subject informations to match with real-time subject information Breath, obtains the label characteristics value, characteristic of division value and similarity characteristic value for including in the history subject information.
Extract M1~M20In each history subject information label characteristics value, characteristic of division value and similarity characteristic value.
Step S242 establishes building machine learning model based on gradient boosted tree and algorithm with regress analysis method, will acquire to obtain Multiple label characteristics values, multiple characteristic of division values and multiple similarity characteristic values input machine learning model be trained.
Wherein, gradient boosted tree (Gradient Boosting Decision Tree, GBDT) has good non-linear Capability of fitting and robustness, and algorithm with regress analysis method (Logistic Regression, LR) can be adapted for continuity and classification Property independent variable, and be easy to use and explain.
It can be between history subject information and real-time subject information based on GBDT and the LR machine learning model built Correlation is further analyzed and scores, and improves the accuracy and reliability of analysis.
Step S243 obtains the label characteristics value, characteristic of division value and similarity characteristic value of real-time subject information, obtains pre- If label characteristics value, characteristic of division value and the similarity feature of each history subject information in quantity history subject information Value.
M is obtained respectivelyt、M3、M1、M2、M8And M13Label characteristics value, characteristic of division value and similarity characteristic value.
Step S244 by the label characteristics value of real-time subject information, characteristic of division value and similarity characteristic value and is preset The label characteristics value of each history subject information in quantity history subject information, characteristic of division value and similarity characteristic value are defeated Enter the machine learning model that training is completed and each history subject information in preset quantity history subject information is calculated Score value.
By Mt、M3、M1、M2、M8And M13Label characteristics value, characteristic of division value and similarity characteristic value input machine learning Model calculates M3、M1、M2、M8And M13Respective score value.
Step S25, judges whether the maximum value in the multiple score values being calculated reaches setting value.
For example, M3、M1、M2、M8And M13Respective score value is respectively as follows: grade3、grade1、grade2、grade8With grade13
Assuming that maximum value is grade13If grade13Reach setting value (being assumed to be 80 points), turns to step S26.
If grade13Do not reach setting value, turns to step S29.
Step S26 calculates the editing distance value between the corresponding history subject information of maximum value and real-time subject information, sentences Whether disconnected editing distance value is less than second threshold.
Calculate M13M betweentBetween editing distance value.
If editing distance value is less than second threshold (being set according to actual conditions), step S27 is turned to.
If editing distance value is not less than second threshold, step S28 is turned to.
Step S27 obtains the question and answer pair of the corresponding history subject information of maximum value, by question and answer to being sent to client.
Obtain M13Corresponding question and answer pair, by question and answer to being sent to client so that user checks.
Step S28 is modified acquisition correction result to maximum value, and correction result input machine learning model is carried out two Secondary training.
It is appreciated that the machine learning model after can use second training is to M3、M1、M2、M8And M13It re-starts and comments Score value calculates, then repeatedly step S25 and step S26.
Step S29 finds out client corresponding with user's portrait that real-time subject information matches, question information is sent out It send to the client found out so that the client found out answers question information.
If grade13Do not reach setting value, shows M3、M1、M2、M8And M13The quality of mostly corresponding question and answer pair is difficult to and Mt Matching, server-side 10 can be according to M at this timetLabel characteristics value, characteristic of division value etc. is found out and MtMatched user's portrait institute is right The client answered is sent to the client found out by and by question information so that the client's single pair question information found out carries out It answers.
In this way, being difficult to find out and M in server-side 10tThe high question and answer clock synchronization of matching precision, can be to the special of question information The characteristics such as industry degree and region are analyzed, and then find out relatively suitable target user, determine question information by these mesh It marks user to answer, to improve the professional and validity of question answering.
Optionally, the question information and answer form question and answer to being stored in first database, to realize to the first number According to the update in library.
It is appreciated that first database, the second database and third database have the function of real-time update, so, it is possible It improves to the subsequent accuracy for puing question to matching answer, and then guarantees the quality of the answer matched.
On the basis of the above, described as shown in figure 4, the embodiment of the invention provides a kind of intelligent answer coalignment 20 Intelligent answer coalignment 20 includes: real-time subject information extraction module 21, convolutional neural networks model construction module 22, theme Information sifting module 23, score value computing module 24, first judgment module 25 and the second judgment module 26.
Real-time subject information extraction module 21, the question information sent for receiving the client extract the enquirement The real-time subject information of information.
Since real-time subject information extraction module 21 is similar with the realization principle of step S21 in Fig. 2, do not make herein more More explanations.
Convolutional neural networks model construction module 22 will be in presetting database for constructing convolutional neural networks model All history subject informations are trained to the convolutional neural networks model is inputted.
Since convolutional neural networks model construction module 22 is similar with the realization principle of step S22 in Fig. 2, herein not Make more explanations.
Subject information screening module 23, for being found out from the presetting database and the real-time subject information phase The multiple history subject informations matched, wherein each history subject information in the multiple history subject information and it is described in real time The first similarity value between subject information is less than first threshold;The multiple history subject information is inputted into the volume that training is completed Product neural network model is to calculate each history subject information in the multiple history subject information and the real-time theme The second similarity value between information;Based on multiple second similarity values being calculated, according to setting sequence from the multiple Preset quantity history subject information is chosen in history subject information.
Since subject information screening module 23 is similar with the realization principle of step S23 in Fig. 2, do not say more herein It is bright.
Score value computing module 24, for constructing simultaneously training machine learning model according to the multiple history subject information, The real-time subject information and the preset quantity history subject information are inputted into the machine learning model, institute is calculated State the score value of each history subject information in preset quantity history subject information.
Since score value computing module 24 is similar with the realization principle of step S24 in Fig. 2, do not say more herein It is bright.
First judgment module 25, for judging whether the maximum value in the multiple score values being calculated reaches setting value, If the maximum value reaches the setting value, the corresponding history subject information of the maximum value and the real-time subject information are calculated Between editing distance value.
Since first judgment module 25 and step S25, step S26 in Fig. 2 are similar with the realization principle of step S29, Do not illustrate more herein.
Second judgment module 26, for judging whether the editing distance value is less than second threshold, if the editing distance Value is less than the second threshold, the question and answer pair of the corresponding history subject information of the maximum value is obtained, by the question and answer to transmission To the client.
Since the second judgment module 26 and step S26, step S27 in Fig. 2 are similar with the realization principle of step S28, Do not illustrate more herein.
To sum up, intelligent answer matching process and device provided by the embodiment of the present invention, by convolutional neural networks model, Machine learning model can be realized multi-angular analysis and processing to question information, be matched by being searched according to real-time subject information User draw a portrait corresponding client, and question information is sent to the client, improved to the flexible of question information processing Property, it so, it is possible the quality for improving the question and answer pair acquired.
In several embodiments provided by the embodiment of the present invention, it should be understood that disclosed device and method, it can also To realize by another way.Device and method embodiment described above is only schematical, for example, in attached drawing Flow chart and block diagram show that the devices of multiple embodiments according to the present invention, method and computer program product are able to achieve Architecture, function and operation.In this regard, each box in flowchart or block diagram can represent module, a program A part of section or code, a part of the module, section or code include that one or more is patrolled for realizing defined Collect the executable instruction of function.It should also be noted that in some implementations as replacement, function marked in the box It can occur in a different order than that indicated in the drawings.For example, two continuous boxes can actually be held substantially in parallel Row, they can also be executed in the opposite order sometimes, and this depends on the function involved.It is also noted that block diagram and/or The combination of each box in flow chart and the box in block diagram and or flow chart, can the function as defined in executing or dynamic The dedicated hardware based system made is realized, or can be realized using a combination of dedicated hardware and computer instructions.
In addition, each functional module in each embodiment of the present invention can integrate one independent portion of formation together Point, it is also possible to modules individualism, an independent part can also be integrated to form with two or more modules.
It, can be with if the function is realized and when sold or used as an independent product in the form of software function module It is stored in a computer readable storage medium.Based on this understanding, technical solution of the present invention is substantially in other words The part of the part that contributes to existing technology or the technical solution can be embodied in the form of software products, the meter Calculation machine software product is stored in a storage medium, including some instructions are used so that a computer equipment (can be a People's computer, server-side 10 or the network equipment etc.) execute all or part of step of each embodiment the method for the present invention Suddenly.And storage medium above-mentioned includes: USB flash disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), deposits at random The various media that can store program code such as access to memory (RAM, Random Access Memory), magnetic or disk. It should be noted that, in this document, the terms "include", "comprise" or its any other variant are intended to the packet of nonexcludability Contain, so that the process, method, article or equipment for including a series of elements not only includes those elements, but also including Other elements that are not explicitly listed, or further include for elements inherent to such a process, method, article, or device. In the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that including the element Process, method, article or equipment in there is also other identical elements.
The foregoing is only a preferred embodiment of the present invention, is not intended to restrict the invention, for the skill of this field For art personnel, the invention may be variously modified and varied.All within the spirits and principles of the present invention, made any to repair Change, equivalent replacement, improvement etc., should all be included in the protection scope of the present invention.

Claims (10)

1. a kind of intelligent answer matching process, which is characterized in that applied to the server-side being connect with client communication, the method Include:
The question information that the client is sent is received, the real-time subject information of the question information is extracted;
Convolutional neural networks model is constructed, by all history subject informations in presetting database to the input convolutional Neural net Network model is trained;
The multiple history subject informations to match with the real-time subject information are found out from the presetting database, wherein The first similarity value between each history subject information and the real-time subject information in the multiple history subject information Less than first threshold;It is described to calculate that the multiple history subject information is inputted into the convolutional neural networks model that training is completed The second similarity value between each history subject information and the real-time subject information in multiple history subject informations;It is based on Multiple second similarity values being calculated choose preset quantity according to setting sequence from the multiple history subject information History subject information;
According to the building of the multiple history subject information and training machine learning model, by the real-time subject information and described pre- If quantity history subject information inputs the machine learning model and is calculated in the preset quantity history subject information Each history subject information score value;
Judge whether the maximum value in the multiple score values being calculated reaches setting value, if the maximum value reaches the setting Value, calculates the editing distance value between the corresponding history subject information of the maximum value and the real-time subject information;
Judge whether the editing distance value is less than second threshold, if the editing distance value is less than the second threshold, obtains The question and answer pair of the corresponding history subject information of the maximum value, by the question and answer to being sent to the client.
2. intelligent answer matching process according to claim 1, which is characterized in that the method also includes:
If the maximum value does not reach the setting value, search whether there is the user to match with the real-time subject information It draws a portrait corresponding client;If it exists, the question information is sent to the client found out so that the client found out The question information is answered.
3. intelligent answer matching process according to claim 1, which is characterized in that based on multiple second phases being calculated Like angle value, the step of choosing preset quantity history subject information from the multiple history subject information according to setting sequence, Include:
Multiple second similarity values being calculated are ranked up according to sequence from high to low;
Obtain the corresponding history subject information of the second similarity value of the forward setting quantity that sorts.
4. intelligent answer matching process according to claim 1, which is characterized in that according to the multiple history subject information The step of constructing simultaneously training machine learning model, comprising:
For each history subject information in the multiple history subject information, the mark for including in the history subject information is obtained Sign characteristic value, characteristic of division value and similarity characteristic value;
Building machine learning model, the multiple label characteristics that will acquire are established based on gradient boosted tree and algorithm with regress analysis method Value, multiple characteristic of division values and multiple similarity characteristic values input the machine learning model and are trained.
5. intelligent answer matching process according to claim 4, which is characterized in that by the real-time subject information and described Preset quantity history subject information inputs the machine learning model and the preset quantity history subject information is calculated In each history subject information score value the step of, comprising:
Obtain the label characteristics value, characteristic of division value and similarity characteristic value of the real-time subject information;
Obtain label characteristics value, the characteristic of division value of each history subject information in the preset quantity history subject information With similarity characteristic value;
By the label characteristics value of the real-time subject information, characteristic of division value and similarity characteristic value and the preset quantity Label characteristics value, characteristic of division value and the input training of similarity characteristic value of each history subject information in history subject information Each history subject information that the machine learning model of completion is calculated in the preset quantity history subject information is commented Score value.
6. a kind of intelligent answer coalignment, which is characterized in that applied to the server-side being connect with client communication, described device Include:
Real-time subject information extraction module, the question information sent for receiving the client, extracts the question information Real-time subject information;
Convolutional neural networks model construction module goes through all in presetting database for constructing convolutional neural networks model History subject information is trained to the convolutional neural networks model is inputted;
Subject information screening module, for found out from the presetting database match with the real-time subject information it is more A history subject information, wherein each history subject information and the real-time theme in the multiple history subject information are believed The first similarity value between breath is less than first threshold;The multiple history subject information is inputted into the convolutional Neural that training is completed Network model with calculate each history subject information in the multiple history subject information and the real-time subject information it Between the second similarity value;Based on multiple second similarity values being calculated, according to setting sequence from the multiple history master It inscribes and chooses preset quantity history subject information in information;
Score value computing module will be described for constructing simultaneously training machine learning model according to the multiple history subject information Real-time subject information and the preset quantity history subject information input the machine learning model and described preset are calculated The score value of each history subject information in quantity history subject information;
First judgment module, for judging whether the maximum value in the multiple score values being calculated reaches setting value, if described Maximum value reaches the setting value, calculates between the corresponding history subject information of the maximum value and the real-time subject information Editing distance value;
Second judgment module, for judging whether the editing distance value is less than second threshold, if the editing distance value is less than The second threshold obtains the question and answer pair of the corresponding history subject information of the maximum value, and the question and answer are described to being sent to Client.
7. intelligent answer coalignment according to claim 6, which is characterized in that the first judgment module is also used to:
If the maximum value does not reach the setting value, search whether there is the user to match with the real-time subject information It draws a portrait corresponding client;If it exists, the question information is sent to the client found out so that the client found out The question information is answered.
8. intelligent answer coalignment according to claim 6, which is characterized in that the subject information screening module passes through Following manner is selected from the multiple history subject information based on multiple second similarity values being calculated, according to setting sequence Take preset quantity history subject information:
Multiple second similarity values being calculated are ranked up according to sequence from high to low;
Obtain the corresponding history subject information of the second similarity value of the forward setting quantity that sorts.
9. intelligent answer coalignment according to claim 6, which is characterized in that the score value computing module by with Under type is according to the building of the multiple history subject information and training machine learning model:
For each history subject information in the multiple history subject information, the mark for including in the history subject information is obtained Sign characteristic value, characteristic of division value and similarity characteristic value;
Building machine learning model, the multiple label characteristics that will acquire are established based on gradient boosted tree and algorithm with regress analysis method Value, multiple characteristic of division values and multiple similarity characteristic values input the machine learning model and are trained.
10. intelligent answer coalignment according to claim 9, which is characterized in that the score value computing module passes through The real-time subject information and the preset quantity history subject information are inputted the machine learning model meter by following manner It calculates and obtains the score value of each history subject information in the preset quantity history subject information:
Obtain the label characteristics value, characteristic of division value and similarity characteristic value of the real-time subject information;
Obtain label characteristics value, the characteristic of division value of each history subject information in the preset quantity history subject information With similarity characteristic value;
By the label characteristics value of the real-time subject information, characteristic of division value and similarity characteristic value and the preset quantity Label characteristics value, characteristic of division value and the input training of similarity characteristic value of each history subject information in history subject information Each history subject information that the machine learning model of completion is calculated in the preset quantity history subject information is commented Score value.
CN201811448785.3A 2018-11-29 2018-11-29 Intelligent question-answer matching method and device Active CN109582773B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811448785.3A CN109582773B (en) 2018-11-29 2018-11-29 Intelligent question-answer matching method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811448785.3A CN109582773B (en) 2018-11-29 2018-11-29 Intelligent question-answer matching method and device

Publications (2)

Publication Number Publication Date
CN109582773A true CN109582773A (en) 2019-04-05
CN109582773B CN109582773B (en) 2020-11-27

Family

ID=65923775

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811448785.3A Active CN109582773B (en) 2018-11-29 2018-11-29 Intelligent question-answer matching method and device

Country Status (1)

Country Link
CN (1) CN109582773B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110096580A (en) * 2019-04-24 2019-08-06 北京百度网讯科技有限公司 A kind of FAQ dialogue method, device and electronic equipment
CN110347786A (en) * 2019-06-11 2019-10-18 深圳追一科技有限公司 A kind of tuning method and system of semantic model
CN110413961A (en) * 2019-06-21 2019-11-05 平安国际智慧城市科技股份有限公司 The method, apparatus and computer equipment of text scoring are carried out based on disaggregated model
CN111611363A (en) * 2020-05-13 2020-09-01 湖南福米信息科技有限责任公司 Intelligent response robot system based on user image and machine learning
CN112860962A (en) * 2021-02-10 2021-05-28 北京百度网讯科技有限公司 Method, device, electronic equipment and medium for processing question information

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103902733A (en) * 2014-04-18 2014-07-02 北京大学 Information retrieval method based on interrogative extension
US20160342895A1 (en) * 2015-05-21 2016-11-24 Baidu Usa Llc Multilingual image question answering
CN106649760A (en) * 2016-12-27 2017-05-10 北京百度网讯科技有限公司 Question type search work searching method and question type search work searching device based on deep questions and answers
CN107145573A (en) * 2017-05-05 2017-09-08 上海携程国际旅行社有限公司 The problem of artificial intelligence customer service robot, answers method and system
CN107562836A (en) * 2017-06-07 2018-01-09 北京航空航天大学 Method is recommended based on the answerer of topic model and machine learning
CN108052588A (en) * 2017-12-11 2018-05-18 浙江大学城市学院 A kind of construction method of the document automatically request-answering system based on convolutional neural networks
CN108090174A (en) * 2017-12-14 2018-05-29 北京邮电大学 A kind of robot answer method and device based on system function syntax
CN108829719A (en) * 2018-05-07 2018-11-16 中国科学院合肥物质科学研究院 The non-true class quiz answers selection method of one kind and system

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103902733A (en) * 2014-04-18 2014-07-02 北京大学 Information retrieval method based on interrogative extension
US20160342895A1 (en) * 2015-05-21 2016-11-24 Baidu Usa Llc Multilingual image question answering
CN106649760A (en) * 2016-12-27 2017-05-10 北京百度网讯科技有限公司 Question type search work searching method and question type search work searching device based on deep questions and answers
CN107145573A (en) * 2017-05-05 2017-09-08 上海携程国际旅行社有限公司 The problem of artificial intelligence customer service robot, answers method and system
CN107562836A (en) * 2017-06-07 2018-01-09 北京航空航天大学 Method is recommended based on the answerer of topic model and machine learning
CN108052588A (en) * 2017-12-11 2018-05-18 浙江大学城市学院 A kind of construction method of the document automatically request-answering system based on convolutional neural networks
CN108090174A (en) * 2017-12-14 2018-05-29 北京邮电大学 A kind of robot answer method and device based on system function syntax
CN108829719A (en) * 2018-05-07 2018-11-16 中国科学院合肥物质科学研究院 The non-true class quiz answers selection method of one kind and system

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
RONGQIANG MA 等: "Hybrid answer selection model for non-factoid question answering", 《ASIAN LANGUAGE PROCESSING》 *
张栋 等: "基于问题与答案联合表示学习的半监督问题分类方法", 《中文信息学报》 *
荣光辉 等: "基于深度学习的问答匹配方法", 《计算机应用》 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110096580A (en) * 2019-04-24 2019-08-06 北京百度网讯科技有限公司 A kind of FAQ dialogue method, device and electronic equipment
CN110347786A (en) * 2019-06-11 2019-10-18 深圳追一科技有限公司 A kind of tuning method and system of semantic model
CN110347786B (en) * 2019-06-11 2021-01-05 深圳追一科技有限公司 Semantic model tuning method and system
CN110413961A (en) * 2019-06-21 2019-11-05 平安国际智慧城市科技股份有限公司 The method, apparatus and computer equipment of text scoring are carried out based on disaggregated model
CN110413961B (en) * 2019-06-21 2021-02-09 平安国际智慧城市科技股份有限公司 Method and device for text scoring based on classification model and computer equipment
CN111611363A (en) * 2020-05-13 2020-09-01 湖南福米信息科技有限责任公司 Intelligent response robot system based on user image and machine learning
CN112860962A (en) * 2021-02-10 2021-05-28 北京百度网讯科技有限公司 Method, device, electronic equipment and medium for processing question information

Also Published As

Publication number Publication date
CN109582773B (en) 2020-11-27

Similar Documents

Publication Publication Date Title
CN109582773A (en) Intelligent answer matching process and device
CN109271401B (en) Topic searching and correcting method and device, electronic equipment and storage medium
Shovon et al. An Approach of Improving Students Academic Performance by using k means clustering algorithm and Decision tree
WO2017167069A1 (en) Resume assessment method and apparatus
CN106649760A (en) Question type search work searching method and question type search work searching device based on deep questions and answers
CN105117460A (en) Learning resource recommendation method and system
CN108121702B (en) Method and system for evaluating and reading mathematical subjective questions
CN105303179A (en) Fingerprint identification method and fingerprint identification device
CN109582969A (en) Methodology for Entities Matching, device and electronic equipment
CN108052504B (en) Structure analysis method and system for mathematic subjective question answer result
CN109273023B (en) Data evaluation method, device and equipment and readable storage medium
CN107301164B (en) Semantic analysis method and device for mathematical formula
CN103488790A (en) Polychronic time sequence similarity analysis method based on weighting BORDA counting method
CN112100464B (en) Question-answering community expert recommendation method and system combining dynamic interest and professional knowledge
CN115798022B (en) Artificial intelligence recognition method based on feature extraction
CN111611781B (en) Data labeling method, question answering device and electronic equipment
CN111597305B (en) Entity marking method, entity marking device, computer equipment and storage medium
CN112836994A (en) Big data visualization method for online remote education and online education server
CN110852064B (en) Key sentence extraction method and device
CN111541911A (en) Video detection method and device, storage medium and electronic device
CN109064578A (en) A kind of attendance system and method based on cloud service
CN114971425B (en) Database information monitoring method, device, equipment and storage medium
CN116228361A (en) Course recommendation method, device, equipment and storage medium based on feature matching
CN113313470B (en) Employment type assessment method and system based on big data
CN111708890B (en) Search term determining method and related device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant