CN107330120A - Inquire answer method, inquiry answering device and computer-readable recording medium - Google Patents
Inquire answer method, inquiry answering device and computer-readable recording medium Download PDFInfo
- Publication number
- CN107330120A CN107330120A CN201710575024.3A CN201710575024A CN107330120A CN 107330120 A CN107330120 A CN 107330120A CN 201710575024 A CN201710575024 A CN 201710575024A CN 107330120 A CN107330120 A CN 107330120A
- Authority
- CN
- China
- Prior art keywords
- inquiry
- correlation
- inquiry message
- candidate
- user
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/36—Creation of semantic tools, e.g. ontology or thesauri
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3329—Natural language query formulation or dialogue systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Artificial Intelligence (AREA)
- Human Computer Interaction (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The present invention provides a kind of inquiry answer method, inquiry answering device and computer-readable recording medium, and the inquiry answer method includes:Semantic processes step (S101), carries out semantic processes, with the user view of the inquiry purpose of reaction of formation inquiry message and for the retrieval information used in being retrieved according to inquiry message to the inquiry message that user inputs;Searching step (S102), based on the retrieval information, carries out the data retrieval based on participle from database, obtains the list of candidate's solid data;Sequence step (S103), based on the degree of correlation between candidate's solid data and user view, processing is ranked up to candidate's solid data;And first result determine step (S104), candidate's solid data will in list with the highest degree of correlation is defined as the response result for user's query information.
Description
Technical field
The present invention relates to the inquiry answer method based on semantic understanding, inquiry answering device and computer-readable storage medium
Matter.
Background technology
At present, fuzzy semantics understand be in information retrieval and semantic analysis one it is very universal the problem of, if can not be very
The good identification that semanteme is carried out to it, the result of return may not be very much the result that user wants greatly.Phonetic entry is just turning into more
Carry out more universal interactive mode, although have benefited from the lifting of computing capability and the accumulation of mass data, the use of deep learning is big
Width reduces identification error rate, but still has 4%-5% error rate, and the field occured frequently in some neologisms is particularly acute, and this is just
So that fuzzy semantics, which understand, seems critically important.Still further aspect, due to information huge explosion, man memory power is limited, when many
Waiting possibly can not accurately say whole information, and this also causes fuzzy semantics to understand a necessary part as system.
In view of the above-mentioned problems, application publication number proposes a kind of fuzzy inspection of entity for CN106294875A Chinese patent application
Rope method and system, but this method is relatively simple, does not account for the factor of phonetic error correction etc, it is difficult to solve current Vague language
The problem of reason and good sense solution.
Separately have, application publication number proposes crucial in a kind of network search procedure for CN101206673A Chinese patent application
The intelligent correction system and method for word.The system is applied on internet platform, set up language model, corresponding dictionary and
Data directory database, calculates sound character error and fuzzy matching calculates morphological pattern error correction, and degree of correlation filtering and sequence are carried out to result,
Obtain immediate several results.This method is to be used for web search, it is impossible to the fuzzy search suitable for many wheel dialogues, it is impossible to
The error correction of solution fuzzy phoneme, it is impossible to the problem of solving state transition in many wheel dialogues, it is impossible to solve retrieval result in the absence of optimal
In the case of error correction, how to deal with and be defined when also not to coming to nothing, also influence of the error correction result to display, such as
Prompt message etc..
The content of the invention
In view of above mentioned problem of the prior art have developed the present invention.The present invention is intended to provide one kind can carry out Vague language
The system and method for reason and good sense solution, as user because the fuzzy expression of speech intonation, mistake or sending inaccurate the problems such as do not remember clearly
During true instruction, system remains to make correct semantic understanding and smoothly completes information retrieval on this basis.It is applied to institute
The scene of error correction is obscured the need for having, includes the fuzzy semantics error correction in the semantic error correction of web search, and many wheel dialogues.
The first aspect of the present invention provides a kind of inquiry answer method based on semantic understanding, the inquiry answer method bag
Include:Semantic processes step (S101), carries out semantic processes, with the inquiry of reaction of formation inquiry message to the inquiry message that user inputs
Ask the user view of purpose and for the retrieval information used in being retrieved according to inquiry message;Searching step (S102), is based on
The retrieval information, carries out the data retrieval based on participle from database, obtains the list of candidate's solid data;Sequence step
(S103), based on the degree of correlation between candidate's solid data and user view, processing is ranked up to candidate's solid data;And
First result determines step (S104), will have candidate's solid data of the highest degree of correlation in list, is defined as asking for user
Ask the response result of information.
The second aspect of the present invention provides a kind of inquiry answering device based on semantic understanding, the inquiry answering device bag
Include:Semantic processing unit (1101), carries out semantic processes, with the inquiry of reaction of formation inquiry message to the inquiry message that user inputs
Ask the user view of purpose and for the retrieval information used in being retrieved according to inquiry message;Retrieval unit (1102), is based on
The retrieval information, carries out the data retrieval based on participle from database, obtains the list of candidate's solid data;Sequencing unit
(1103), based on the degree of correlation between candidate's solid data and user view, processing is ranked up to candidate's solid data;And
First result determining unit (1104), will have candidate's solid data of the highest degree of correlation in list, be defined as asking for user
Ask the response result of information.
The third aspect of the present invention provides a kind of inquiry response system (100) based on semantic understanding, and the system includes
User terminal (1001) and the server (1002) being connected with user terminal, the user terminal include:Input receiving unit
(10011) inquiry message of user's input, is received;Semantic processing unit (10012), semantic processes are carried out to inquiry message, with
The user view of the inquiry purpose of reaction of formation inquiry message and for the retrieval information used in being retrieved according to inquiry message;
Transmitting element (10013), inquiry message, the user view of the inquiry message and retrieval information are sent in the way of associated
Server, and the response result for inquiry message is received from server, the server includes:Receiving unit (10021), from
User terminal receives inquiry message and the user view associated with the inquiry message and retrieval information;Retrieval unit (10022),
Based on the retrieval information, the data retrieval based on participle is carried out from database, the list of candidate's solid data is obtained;Sequence
Unit (10023), based on the degree of correlation between candidate's solid data and user view, is ranked up to candidate's solid data;With
And result determining unit (10024), will there is candidate's solid data of the highest degree of correlation in list, be defined as being directed to user's query
The response result of information, and response result is sent to user terminal.
The fourth aspect of the present invention provides a kind of computer-readable recording medium, and it stores computer program, the calculating
Machine program is when being executed by processor, the step of realization includes according to above-mentioned inquiry answer method.
, also can be smooth even if the inquiry message of input error the problems such as due to user's vagueness in memory according to the present invention
Completion information retrieval so that user results in the closer retrieval result of intention with user.In addition, even in the absence of
In the case of optimal retrieval result, error correction can be also carried out, and provide a user the result after error correction.
Brief description of the drawings
In order to illustrate more clearly of the technical scheme in the embodiment of the present application, make required in being described below to embodiment
Accompanying drawing is briefly described, it should be apparent that, drawings in the following description are only some implementations described in the application
Example, on the premise of not paying creative work, can also be according to these accompanying drawings for this area or those of ordinary skill
Obtain other accompanying drawings.
Fig. 1 is the figure for the hardware construction for showing the inquiry answering device in the present invention.
Fig. 2 is the flow chart for illustrating inquiry answer method according to a first embodiment of the present invention.
Fig. 3 is the flow chart for the semantic processes step for illustrating the inquiry answer method according to the present invention.
Fig. 4 is the flow chart for the sequence step for illustrating the inquiry answer method according to the present invention.
Fig. 5 is the block diagram for the software configuration for illustrating the inquiry answering device according to first embodiment.
Fig. 6 is the flow chart for illustrating inquiry answer method according to a second embodiment of the present invention.
Fig. 7 is the block diagram for the software configuration for illustrating the inquiry answering device according to second embodiment.
Fig. 8 is the flow chart for illustrating inquiry answer method according to the preferred embodiment of the invention.
Fig. 9 is the block diagram for the software configuration for illustrating the inquiry answering device according to preferred embodiment.
Figure 10 is the schematic diagram for illustrating the inquiry response system of the present invention.
Embodiment
Hereinafter describe embodiments of the invention in detail with reference to the accompanying drawings.It should be appreciated that following embodiments and unawareness
The figure limitation present invention, also, on the means solved the problems, such as according to the present invention, it is not absolutely required to be retouched according to following embodiments
The whole combinations for each side stated.For simplicity, to identical structure division or step, identical has been used to mark or mark
Number, and the description thereof will be omitted.
[hardware configuration of inquiry answering device]
Fig. 1 is the figure for the hardware construction for showing the inquiry answering device in the present invention.In the present embodiment, with smart phone
Description is provided as the example of inquiry answering device.Although it is noted that illustrating smart phone in the present embodiment as inquiry
Ask answering device 1000, but it is clear that not limited to this, inquiry answering device of the invention can be mobile terminal (smart mobile phone,
Intelligent watch, Intelligent bracelet, music player devices), notebook computer, tablet personal computer, PDA (personal digital assistant), fax dress
Put, printer or be with inquiry answering internet device (such as digital camera, refrigerator, television set)
Etc. various devices.
First, the hardware configuration of the block diagram description inquiry answering device 1000 (2000,3000) of reference picture 1.In addition, at this
Following construction is described as example in embodiment, but the inquiry answering device of the present invention is not limited to the construction shown in Fig. 1.
Inquiry answering device 1000 includes input interface 101, CPU 102, the ROM being connected to each other via system bus
103rd, RAM 105, storage device 106, output interface 104, communication unit 107 and short-distance wireless communication unit 108 and display
Unit 109.Input interface 101 is
For via such as microphone, button, button or touch-screen operating unit (not shown) receive from user input data and
The interface of operational order.It note that the display unit 109 being described later on and operating unit can be at least partly integrated, also,
For example, it may be carrying out picture output in same picture and receiving the construction of user's operation.
CPU 102 is system control unit, and generally comprehensively answering device 1000 is inquired in control.In addition, for example,
CPU 102 carries out the display control of the display unit 109 of inquiry answering device 1000.It is all that the storages of ROM 103 CPU 102 is performed
The fixed data of such as tables of data and control program and operating system (OS) program.In the present embodiment, stored in ROM 103
Each control program, for example, under the OS stored in ROM 103 management, carry out at such as scheduling, task switching and interruption
The software of reason etc. performs control.
RAM 105 is constructed such as SRAM (static RAM), DRAM as needing stand-by power supply.This
In the case of, RAM 105 can store the significant data of control variable of program etc. in a non-volatile manner.In addition, RAM 105
Working storage and main storage as CPU 102.
The model of the storage training in advance of storage device 106 is (for example, word error correction mode, physical model, Rank models, semanteme
Model etc.), for the database retrieved and for performing according to application program of inquiry answer method of the present invention etc..
It note that database here can also be stored in the external device (ED) of such as server.In addition, storage device 106 stores all
Such as it is used for the information transmission/receiving control program for being transmitted/receiving via communication unit 107 and communicator (not shown)
Various programs, and various information that these programs are used.In addition, storage device 106 also stores inquiry answering device 1000
Configuration information, inquire the management data etc. of answering device 1000.
Output interface 104 is for being controlled the display picture with display information and application program to display unit 109
The interface in face.Display unit 109 is for example constructed by LCD (liquid crystal display).Have such as by being arranged on display unit 109
The soft keyboard of the key of numerical value enter key, mode setting button, decision key, cancel key and power key etc., can receive single via display
The input from user of member 109.
Inquire answering device 100 via communication unit 107 for example, by radio communications such as Wi-Fi (Wireless Fidelity) or bluetooth
Method, data communication is performed with external device (ED) (not shown).
In addition, inquiry answering device 1000 can also via short-distance wireless communication unit 108, in short-range with
External device (ED) etc. carries out wireless connection and performs data communication.And short-distance wireless communication unit 108 by with communication unit
107 different communication means are communicated.It is, for example, possible to use its communication range is shorter than the communication means of communication unit 107
Bluetooth Low Energy (BLE) as short-distance wireless communication unit 108 communication means.In addition, being used as short-distance wireless communication list
The communication means of member 108, for example, it is also possible to perceive (Wi-Fi Aware) using NFC (near-field communication) or Wi-Fi.
[first embodiment]
[according to the inquiry answer method of first embodiment]
It can be stored according to the inquiry answer method of the present invention by inquiring that the CPU 102 of answering device 1000 is read
ROM 103 or control program on storage device 106 or via communication unit 107 from passing through network and inquiry answering device
The webserver (not shown) of 1000 connections and the control program downloaded are realized.
, it is necessary to first preparation model and database before the inquiry answer method according to the present invention is carried out.Idiographic flow is such as
Under:
(1) crawl of related data:The crawl of solid data and the crawl of associated data such as label etc., its
In, solid data refers to the entity in certain field (such as video field), such as film " private savings of husbands ", " the Mi months pass ", and
Label is exactly the word for describing the entity:Such as " social forest ", " love ".
(2) training of model:Word error correcting model:The mapping of the pinyin table and fuzzy phoneme of word is set up, passes through the instruction to language material
Practice, calculate the transition probability model between the probabilistic model of word and word;Physical model:Using language model to including solid data
Language material be trained, it is established that identification entity model;Rank models:Pass through ready data and feature extraction, training
Into GBDT decision-tree model;Semantic model:By language model and training corpus, the model of semanteme can be extracted by being trained to.
Sample needed for model above training process, can be crawled from public network.
(3) the index storage of data:To the field modeling, based on existing data and model, be processed into be available for retrieval and
The structural data and storage of semantic understanding.
Next, being illustrated with reference to Fig. 2 to Fig. 4 to inquiry answer method according to a first embodiment of the present invention.Wherein,
Fig. 2 is the flow chart for illustrating inquiry answer method according to a first embodiment of the present invention;Fig. 3 is to illustrate the inquiry according to the present invention
The flow chart of the semantic processes step of answer method;Fig. 4 is the sequence step for illustrating the inquiry answer method according to the present invention
Flow chart.
As shown in Fig. 2 first, in semantic processes step S101, language is carried out to the inquiry message (query) that user inputs
Justice processing, with the user view of the inquiry purpose of reaction of formation inquiry message and for used in being retrieved according to inquiry message
Retrieve information.Preferably, as shown in figure 3, semantic processes step S101 further comprises:User view identification step S1011 is right
Inquiry message carries out user view identification, obtains the user view corresponding to inquiry message;Entity recognition step S1012, passes through
The physical model of training in advance, identifies solid data from inquiry message;And semantic understanding step S1013, by advance
The semantic model of training, carries out semantic understanding, to obtain retrieval information to inquiry message.Here, inquiry message be user for example
By the text message of input through keyboard, by changing the text envelope that user is for example generated by the voice messaging of microphone input
One in the text message of the text message and the text combination for being converted into user speech information of breath and user's input
Kind.For example, user can input inquiry message " I will see The Shawshank Redemption ", now, pass through Entity recognition step, Ke Yicong
Entity " The Shawshank Redemption " is identified in the inquiry message, by semantic understanding step, semantic reason is carried out to the inquiry message
Solution, can obtain retrieval information, retrieval information here using the intelligible slot value pair of computer form, for example " title=
The Shawshank Redemption ".
Next, in searching step S102, based on the retrieval information, the data based on participle are carried out from database
Retrieval, obtains the list of candidate's solid data.Here, first by the slot value obtained in semantic understanding step S1013 to conversion
Into the sentence that can be retrieved (for example, " title=The Shawshank Redemption " is converted into " film title:The Shawshank Redemption "),
Then retrieval request is sent with returning result list to database., can be according to pre-prepd participle mould in retrieving
Type carries out participle to the value (such as " The Shawshank Redemption ") in retrieval information, and in the preparation of database, also can be in storehouse
Each solid data carries out participle and is indexed with falling sequence, and the result of matching is found out this makes it possible to the result by participle
Come.It is based on the advantage that participle is retrieved, even if the inquiry message of user's input error due to vagueness in memory is (for example
" cucurbit baby brother "), by inquiry message participle it is " cucurbit baby " and " brother " by participle model, can be also examined from database
Rope goes out desired result (such as " Calabash Brothers ");Or, user may input incomplete inquiry message (such as " Xiao Shenke
Redeem "), by participle model by inquiry message participle be " Xiao Shenke " and " redeeming ", the phase can be also retrieved from database
The result (such as " The Shawshank Redemption ") of prestige.
Next, in sequence step S103, based on the degree of correlation between candidate's solid data and user view, to candidate
Solid data is ranked up processing.Preferably, as shown in figure 4, sequence step S103 further comprises:Relatedness computation step
S1031, the degree of correlation between candidate's solid data and user view is calculated according to GBDT models;And relevancy ranking step
S1032, based on the degree of correlation calculated, is ranked up using Rank models to candidate's solid data.Here, candidate's reality is being calculated
During the degree of correlation between volume data and user view, first by context state, entity static information (such as label, name,
Classification etc.), the multidate information (such as distance of temperature, marking and current time) of entity calculate characteristic value, then will be all
Characteristic value calculate the last degree of correlation by pre-prepd GBDT models.Here, the characteristic value of static information is logical
The matching degree for the inquiry message that information is inputted with user is crossed come what is calculated, this matching degree can pass through phonetic (including fuzzy phoneme)
Editing distance, the editing distance of word, semantic editing distance etc. determine that and the characteristic value of multidate information can be by certain
Formula calculate.
Finally, in the first result determines step S104, will there is candidate's solid data of the highest degree of correlation in list, really
It is set to the response result for user's query information.Here it is possible to by display unit 109, by the time with the highest degree of correlation
Solid data is selected as optimal result and returns to user.
Inquiry answer method according to a first embodiment of the present invention, by the way that based on retrieval information, base is carried out from database
In the data retrieval of participle, the list of candidate's solid data is obtained, and based on the phase between candidate's solid data and user view
Guan Du, processing is ranked up to candidate's solid data, can obtain following technique effect:Even if a. due to user's vagueness in memory or
Input error and input incomplete inquiry message, can also retrieve preferable result;B. allow users to obtain with using
The closer retrieval result of intention at family.
[according to the software configuration of the inquiry answering device of first embodiment]
Fig. 5 is the block diagram for the software configuration for illustrating the inquiry answering device according to first embodiment.As shown in figure 5, inquiry
Answering device 1000 includes semantic processing unit 1101, retrieval unit 1102, the result of sequencing unit 1103 and first and determines list
Member 1104.
Specifically, semantic processing unit 1101 includes:User view recognition unit 11011, is used inquiry message
Family intention assessment, obtains the user view corresponding to inquiry message;Entity recognition unit 11012, passes through the entity of training in advance
Model, identifies solid data from inquiry message;And semantic understanding unit 11013, by the semantic model of training in advance,
Semantic understanding is carried out to inquiry message, to obtain retrieval information.Retrieval unit 1102 is based on the retrieval information, from database
The data retrieval based on participle is carried out, the list of candidate's solid data is obtained.Sequencing unit 1103 includes:Correlation calculating unit
11031, the degree of correlation between candidate's solid data and user view is calculated according to GBDT models;And relevancy ranking unit
11032, based on the degree of correlation calculated, candidate's solid data is ranked up.First result determining unit 1104, by list
Candidate's solid data with the highest degree of correlation, is defined as the response result for user's query information.
[second embodiment]
[according to the inquiry answer method of second embodiment]
Inquiry answer method according to a second embodiment of the present invention is illustrated with reference to Fig. 6.Wherein, Fig. 6 is example
Show the flow chart of inquiry answer method according to a second embodiment of the present invention.
As shown in fig. 6, according to the inquiry answer method of second embodiment and the inquiry answer method according to first embodiment
Difference is, adds the first judgment step S204, the second judgment step S205 and the second result and determines step S206.
Specifically, in the first judgment step S204, according to similarity distance, the list obtained in step s 103 is calculated
In there is first degree of correlation between the candidate's solid data and inquiry message of the highest degree of correlation, and whether judge first degree of correlation
Less than first threshold.Here, the different attribute of solid data is equivalent to the different slots of semantic understanding, and the inquiry of attribute and user
Ask that the degree of correlation of information is determined by similarity distance, similarity distance here include the editor of phonetic (including fuzzy phoneme) away from
Editing distance from, the editing distance of word and semanteme etc., wherein, the editing distance of word is for example because font is close, unisonance is different
Situations such as word, few word multiword and produce.If first degree of correlation is less than first threshold (being "Yes" in step S204), then it represents that from
Differing greatly between the desired result of optimal result and user retrieved in database, at this moment, processing proceed to the second knot
Fruit determines step S206, and the solid data identified in Entity recognition step S1012 is defined as into response result and returned to
User so that it is not anticipated that in the case of result, user can also obtain preferable result in database.For example, with
Family inputs inquiry message " I will see dear Interpreter Officer " in step S101, and does not have the film in database, at this moment
The entity " dear Interpreter Officer " identified in Entity recognition step S1012 can be returned to user.
On the other hand, if first degree of correlation is more than or equal to first threshold (in step S204 be "No"), handle into
Row is to the second judgment step S205, to judge whether first degree of correlation is more than Second Threshold.If first degree of correlation is more than second
Threshold value (being "Yes" in step S205), then it represents that the optimal result retrieved from database is consistent with the desired result of user,
And handle and proceed to step S104, by the optimal result, be defined as the response result for user's query information.So that
User results in satisfied response result.
On the other hand, if first degree of correlation is not more than Second Threshold (being "No" in step S205), then it represents that from data
Difference is still suffered between the desired result of optimal result and user retrieved in storehouse, at this moment, processing proceeds to step S206, with
The solid data identified in Entity recognition step S1012 is defined as response result and returns to user.
It is advance in training, checking and the performance tested according to model for note that above first threshold and Second Threshold
Determine, to ensure in the performance recalled with had in accuracy rate.
In addition, in above-mentioned second judgment step S205, if having candidate's solid data of the highest degree of correlation in list
First degree of correlation between inquiry message is not more than Second Threshold, can also determine whether there is the second high correlation in list
Whether the degree of correlation between the candidate's solid data and inquiry message of degree is more than Second Threshold, and be judged as the situation of "Yes"
Under, proceed to step S104.It can so avoid leading to miss optimal response knot due to sequencing errors in step s 103
Really.In the case where not appreciably affecting processing speed, preceding N that can be in step S205 successively in calculations list is (for example, N=
3) degree of correlation between the candidate's solid data and inquiry message of position.
Inquiry answer method according to a second embodiment of the present invention, by calculating the phase between optimal result and inquiry message
Guan Du, carrys out the response result that certainly directional user returns, can obtain following technique effect:So that without pre- in database
In the case of phase result, user can also obtain preferable result.
[according to the software configuration of the inquiry answering device of second embodiment]
Fig. 7 is the block diagram for the software configuration for illustrating the inquiry answering device according to second embodiment.As shown in fig. 7, according to
The difference of the inquiry answering device 2000 of second embodiment and the inquiry answering device 1000 according to first embodiment is, increases
First judging unit 1204, the second result determining unit 1206 and the second judging unit 1205.
Specifically, the first judging unit is according to candidate's entity number in similarity distance calculations list with the highest degree of correlation
According to first degree of correlation between inquiry message, and judge whether first degree of correlation is less than first threshold.Second result determines single
Member recognizes the Entity recognition unit in the case where first judging unit judges that first degree of correlation is less than first threshold
The solid data gone out, is defined as response result.Second judging unit, judges whether first degree of correlation is more than Second Threshold, wherein,
In the case where second judging unit judges that first degree of correlation is more than Second Threshold, the first result determining unit will have
There is candidate's solid data of the highest degree of correlation, be defined as response result, and wherein, the similarity distance includes the editor of phonetic
At least one of editing distance of distance, the editing distance of word and semanteme.
[preferred embodiment]
[according to the inquiry answer method of preferred embodiment]
Inquiry answer method according to the preferred embodiment of the invention is illustrated with reference to Fig. 8.Fig. 8 is to illustrate basis
The flow chart of the inquiry answer method of the preferred embodiment of the present invention.
As shown in figure 8, according to the inquiry answer method and the inquiry answer method according to first embodiment of preferred embodiment
Difference be, add pretreatment and error correction step S301.
Specifically, in pretreatment and error correction step S301, inquiry message is pre-processed, and by instructing in advance
Experienced word error correcting model, correction process is carried out to the inquiry message by pretreatment.Here, the pretreatment includes believing inquiry
The deletion of the stop words and spoken word that are included in breath and the capital and small letter of letter and number included in inquiry message is changed
Deng.For example, when user input inquiry message in include some colloquial words when, carry out semantic processes step S101 it
It is preceding, it is necessary to remove these colloquial words.For example, in the feelings that the inquiry message that user inputs is " I will see dear diplomat "
Under condition, colloquial word " I will see " can be deleted by pretreatment first.Then, will be pre- by the word error correcting model of training in advance
Inquiry message " dear diplomat " after processing is corrected as " dear Interpreter Officer ".Next, to by pretreatment and error correction
Inquiry message after processing carries out subsequent treatment.In addition, user is also possible to the inquiry message of input error due to pronunciation mistake,
For example in the case where the inquiry message that user inputs is " Xiao Shengke's redeems ", entangled by the fuzzy phoneme in word correction process
It is wrong, additionally it is possible to be corrected as " The Shawshank Redemption ".
According to the inquiry answer method of preferred embodiment by being carried out before semantic processes are carried out at pretreatment and error correction
Reason, can be corrected to the inquiry message that user inputs, so as to improve the accuracy of later retrieval.
[according to the software configuration of the inquiry answering device of preferred embodiment]
Fig. 9 is the block diagram for the software configuration for illustrating the inquiry answering device according to preferred embodiment.As shown in figure 9, according to
The difference of the inquiry answering device 3000 of preferred embodiment and the inquiry answering device 1000 according to first embodiment is, increases
Pretreatment and error correction unit 1301.
Specifically, pretreatment and error correction unit 1301 are pre-processed to inquiry message, and pass through training in advance
Word error correcting model, correction process is carried out to the inquiry message by pretreatment.
In addition, present invention also offers a kind of inquiry response system based on semantic understanding.Figure 10 is to illustrate the present invention
The schematic diagram of inquiry response system.As shown in Figure 10, inquiry response system 100 includes user terminal 1001 and server 1002,
User terminal 1001 is connected with server 1002 via network 1003, and network 1003 can be cable network or wireless network.
User terminal 1001 includes input receiving unit 10011, semantic processing unit 10012 and transmitting element 10013.Clothes
Business device 1002 includes receiving unit 10021, retrieval unit 10022, sequencing unit 10023 and result determining unit 10024.
Specifically, in user terminal 1001, input receiving unit 10011 receives the inquiry message of user's input;Language
Adopted processing unit 10012 carries out semantic processes to inquiry message, with the user view of the inquiry purpose of reaction of formation inquiry message
With for the retrieval information used in being retrieved according to inquiry message;Transmitting element 10013 is by inquiry message, the inquiry message
User view and retrieval information are sent to server in the way of associated, and receive the response for inquiry message from server
As a result.
On the other hand, in server 1002, receiving unit 10021 from user terminal receive inquiry message and with the inquiry
The associated user view of information and retrieval information;Retrieval unit 10022 is based on the retrieval information, and base is carried out from database
In the data retrieval of participle, the list of candidate's solid data is obtained;Sequencing unit 10023 is based on candidate's solid data and anticipated with user
The degree of correlation between figure, is ranked up to candidate's solid data;As a result determining unit 10024 will have the highest degree of correlation in list
Candidate's solid data, be defined as the response result for user's query information, and response result is sent to user terminal.
Although with reference to exemplary embodiment, invention has been described above, above-described embodiment is only to illustrate this hair
Bright technical concepts and features, it is not intended to limit the scope of the present invention.It is all to be done according to spirit of the invention
Any equivalent variations or modification, should all be included within the scope of the present invention.
Claims (20)
1. a kind of inquiry answer method based on semantic understanding, the inquiry answer method includes:
Semantic processes step (S101), carries out semantic processes, with reaction of formation inquiry message to the inquiry message that user inputs
Inquire the user view of purpose and for the retrieval information used in being retrieved according to inquiry message;
Searching step (S102), based on the retrieval information, carries out the data retrieval based on participle from database, obtains candidate
The list of solid data;
Sequence step (S103), based on the degree of correlation between candidate's solid data and user view, is carried out to candidate's solid data
Sequence is handled;And
First result determines step (S104), will have candidate's solid data of the highest degree of correlation in list, is defined as using
The response result of family inquiry message.
2. inquiry answer method according to claim 1, wherein, the semantic processes step (S101) includes:
User view identification step (S1011), user view identification is carried out to inquiry message, obtains the use corresponding to inquiry message
Family is intended to;
Entity recognition step (S1012), by the physical model of training in advance, identifies solid data from inquiry message;With
And
Semantic understanding step (S1013), by the semantic model of training in advance, semantic understanding is carried out to inquiry message, to obtain
Retrieve information.
3. inquiry answer method according to claim 2, the inquiry answer method the sequence step (S103) it
Also include afterwards:
First judgment step (S204), according to candidate's solid data in similarity distance calculations list with the highest degree of correlation with asking
First degree of correlation between information is asked, and judges whether first degree of correlation is less than first threshold;And
Second result determines step (S206), judges that first degree of correlation is less than the feelings of first threshold in first judgment step
Under condition, the solid data that will be identified in the Entity recognition step is defined as response result.
4. inquiry answer method according to claim 3, the inquiry answer method is after the described first determination step
Also include:
Second judgment step (S205), judges whether first degree of correlation is more than Second Threshold,
Wherein, in the case of judging that first degree of correlation is more than Second Threshold in second judgment step, in first knot
Fruit is determined in step, by candidate's solid data with the highest degree of correlation, is defined as response result, and
Wherein, in the editing distance of editing distance of the similarity distance including phonetic, the editing distance of word and semanteme at least
One.
5. inquiry answer method according to any one of claim 1 to 4, wherein, the sequence step (S103) includes:
Relatedness computation step (S1031), the degree of correlation between candidate's solid data and user view is calculated according to GBDT models;
And
Relevancy ranking step (S1032), based on the degree of correlation calculated, is ranked up to candidate's solid data.
6. inquiry answer method according to any one of claim 1 to 4, the inquiry answer method is at the semanteme
Also include before reason step (S101):
Pretreatment and error correction step (S301), are pre-processed to inquiry message, and by the word error correcting model of training in advance,
Correction process is carried out to the inquiry message by pretreatment.
7. inquiry answer method according to claim 6, the pretreatment includes the stop words to being included in inquiry message
The capital and small letter of deletion with spoken word and the letter and number to being included in inquiry message is changed.
8. inquiry answer method according to any one of claim 1 to 4, wherein, the retrieval information uses slot value pair
Form.
9. inquiry answer method according to any one of claim 1 to 4, the inquiry message is the text that user inputs
Information, by change voice messaging that user inputs and text message that the text message generated and user input with that will use
One kind in the text message for the text combination that family voice messaging is converted into.
10. a kind of inquiry answering device based on semantic understanding, the inquiry answering device includes:
Semantic processing unit (1101), carries out semantic processes, with reaction of formation inquiry message to the inquiry message that user inputs
Inquire the user view of purpose and for the retrieval information used in being retrieved according to inquiry message;
Retrieval unit (1102), based on the retrieval information, carries out the data retrieval based on participle from database, obtains candidate
The list of solid data;
Sequencing unit (1103), based on the degree of correlation between candidate's solid data and user view, is carried out to candidate's solid data
Sequence is handled;And
First result determining unit (1104), will have candidate's solid data of the highest degree of correlation, is defined as using in list
The response result of family inquiry message.
11. inquiry answering device according to claim 10, wherein, the semantic processing unit includes:
User view recognition unit (11011), user view identification is carried out to inquiry message, obtains the use corresponding to inquiry message
Family is intended to;
Entity recognition unit (11012), by the physical model of training in advance, identifies solid data from inquiry message;With
And
Semantic understanding unit (11013), by the semantic model of training in advance, semantic understanding is carried out to inquiry message, to obtain
Retrieve information.
12. inquiry answering device according to claim 11, the inquiry answering device also includes:
First judging unit (1204), according to candidate's solid data in similarity distance calculations list with the highest degree of correlation with asking
Ask first degree of correlation between information;And
Second result determining unit (1206), judges that first degree of correlation is less than the situation of first threshold in first judging unit
Under, the solid data that the Entity recognition unit is identified is defined as response result.
13. inquiry answering device according to claim 12, the inquiry answering device also includes:
Second judging unit (1205), judges whether first degree of correlation is more than Second Threshold,
Wherein, in the case where second judging unit judges that first degree of correlation is more than Second Threshold, first result is true
Candidate's solid data with the highest degree of correlation is defined as response result by order member, and
Wherein, in the editing distance of editing distance of the similarity distance including phonetic, the editing distance of word and semanteme at least
One.
14. the inquiry answering device according to any one of claim 10 to 13, wherein, the sequencing unit includes:
Correlation calculating unit (11031), the degree of correlation between candidate's solid data and user view is calculated according to GBDT models;
And
Relevancy ranking unit (11032), based on the degree of correlation calculated, is ranked up to candidate's solid data.
15. the inquiry answering device according to any one of claim 10 to 13, the inquiry answering device also includes:
Pretreatment and error correction unit (1301), are pre-processed to inquiry message, and by the word error correcting model of training in advance,
Correction process is carried out to the inquiry message by pretreatment.
16. inquiry answering device according to claim 15, the pretreatment includes the deactivation to being included in inquiry message
The deletion of word and spoken word and the capital and small letter of letter and number included in inquiry message is changed.
17. the inquiry answering device according to any one of claim 10 to 13, wherein, the retrieval information uses slot value
To form.
18. the inquiry answering device according to any one of claim 10 to 13, the inquiry message is what user inputted
Text message, by change voice messaging that user inputs and text message that the text message generated and user input with
One kind in the text message for the text combination that user speech information is converted into.
19. a kind of inquiry response system (100) based on semantic understanding, the system includes user terminal (1001) and and user
The server (1002) of terminal connection,
The user terminal includes:
Receiving unit (10011) is inputted, receives the inquiry message of user's input;
Semantic processing unit (10012), carries out semantic processes, with the inquiry purpose of reaction of formation inquiry message to inquiry message
User view and for the retrieval information used in being retrieved according to inquiry message;
Transmitting element (10013), inquiry message, the user view of the inquiry message and retrieval information are sent out in the way of associated
Server is given, and the response result for inquiry message is received from server,
The server includes:
Receiving unit (10021), inquiry message and the user view associated with the inquiry message and inspection are received from user terminal
Rope information;
Retrieval unit (10022), based on the retrieval information, the data retrieval based on participle is carried out from database, is waited
Select the list of solid data;
Sequencing unit (10023), based on the degree of correlation between candidate's solid data and user view, is carried out to candidate's solid data
Sequence;And
As a result determining unit (10024), will have candidate's solid data of the highest degree of correlation in list, be defined as asking for user
The response result of information is asked, and response result is sent to user terminal.
20. a kind of computer-readable recording medium, it stores computer program, and the computer program is being executed by processor
When, realize the step of inquiry answer method according to any one of claim 1 to 9 includes.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710575024.3A CN107330120B (en) | 2017-07-14 | 2017-07-14 | Inquire answer method, inquiry answering device and computer readable storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710575024.3A CN107330120B (en) | 2017-07-14 | 2017-07-14 | Inquire answer method, inquiry answering device and computer readable storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107330120A true CN107330120A (en) | 2017-11-07 |
CN107330120B CN107330120B (en) | 2018-09-18 |
Family
ID=60226783
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710575024.3A Active CN107330120B (en) | 2017-07-14 | 2017-07-14 | Inquire answer method, inquiry answering device and computer readable storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107330120B (en) |
Cited By (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108021554A (en) * | 2017-11-14 | 2018-05-11 | 无锡小天鹅股份有限公司 | Audio recognition method, device and washing machine |
CN109597993A (en) * | 2018-11-30 | 2019-04-09 | 深圳前海微众银行股份有限公司 | Sentence analysis processing method, device, equipment and computer readable storage medium |
CN109800407A (en) * | 2017-11-15 | 2019-05-24 | 腾讯科技(深圳)有限公司 | Intension recognizing method, device, computer equipment and storage medium |
CN110334347A (en) * | 2019-06-27 | 2019-10-15 | 腾讯科技(深圳)有限公司 | Information processing method, relevant device and storage medium based on natural language recognition |
WO2019214679A1 (en) * | 2018-05-09 | 2019-11-14 | 华为技术有限公司 | Entity search method, related device and computer storage medium |
CN110456339A (en) * | 2019-08-12 | 2019-11-15 | 四川九洲电器集团有限责任公司 | A kind of inquiry, answer method and device, computer storage medium, electronic equipment |
CN110457423A (en) * | 2019-06-24 | 2019-11-15 | 平安科技(深圳)有限公司 | A kind of knowledge mapping entity link method, apparatus, computer equipment and storage medium |
CN110647987A (en) * | 2019-08-22 | 2020-01-03 | 腾讯科技(深圳)有限公司 | Method and device for processing data in application program, electronic equipment and storage medium |
CN110737756A (en) * | 2018-07-03 | 2020-01-31 | 百度在线网络技术(北京)有限公司 | Method, apparatus, device and medium for determining a response to user input data |
CN110765342A (en) * | 2019-09-12 | 2020-02-07 | 竹间智能科技(上海)有限公司 | Information query method and device, storage medium and intelligent terminal |
CN110858216A (en) * | 2018-08-13 | 2020-03-03 | 株式会社日立制作所 | Dialogue method, dialogue system, and storage medium |
CN111295708A (en) * | 2017-12-07 | 2020-06-16 | 三星电子株式会社 | Speech recognition apparatus and method of operating the same |
CN111417924A (en) * | 2017-11-23 | 2020-07-14 | 三星电子株式会社 | Electronic device and control method thereof |
CN111538894A (en) * | 2020-06-19 | 2020-08-14 | 腾讯科技(深圳)有限公司 | Query feedback method and device, computer equipment and storage medium |
CN111597808A (en) * | 2020-04-24 | 2020-08-28 | 北京百度网讯科技有限公司 | Instrument panel drawing processing method and device, electronic equipment and storage medium |
CN112396481A (en) * | 2019-08-13 | 2021-02-23 | 北京京东尚科信息技术有限公司 | Offline product information transmission method, system, electronic device, and storage medium |
CN112527819A (en) * | 2020-12-08 | 2021-03-19 | 北京百度网讯科技有限公司 | Address book information retrieval method and device, electronic equipment and storage medium |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090089277A1 (en) * | 2007-10-01 | 2009-04-02 | Cheslow Robert D | System and method for semantic search |
CN104050256A (en) * | 2014-06-13 | 2014-09-17 | 西安蒜泥电子科技有限责任公司 | Initiative study-based questioning and answering method and questioning and answering system adopting initiative study-based questioning and answering method |
CN104598445A (en) * | 2013-11-01 | 2015-05-06 | 腾讯科技(深圳)有限公司 | Automatic question-answering system and method |
-
2017
- 2017-07-14 CN CN201710575024.3A patent/CN107330120B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090089277A1 (en) * | 2007-10-01 | 2009-04-02 | Cheslow Robert D | System and method for semantic search |
CN104598445A (en) * | 2013-11-01 | 2015-05-06 | 腾讯科技(深圳)有限公司 | Automatic question-answering system and method |
CN104050256A (en) * | 2014-06-13 | 2014-09-17 | 西安蒜泥电子科技有限责任公司 | Initiative study-based questioning and answering method and questioning and answering system adopting initiative study-based questioning and answering method |
Non-Patent Citations (1)
Title |
---|
邹超: "基于深度学习的中文代词消解及其在问答系统中的应用", 《中国优秀硕士学位论文全文数据库信息科技辑》 * |
Cited By (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108021554A (en) * | 2017-11-14 | 2018-05-11 | 无锡小天鹅股份有限公司 | Audio recognition method, device and washing machine |
CN109800407A (en) * | 2017-11-15 | 2019-05-24 | 腾讯科技(深圳)有限公司 | Intension recognizing method, device, computer equipment and storage medium |
CN111417924B (en) * | 2017-11-23 | 2024-01-09 | 三星电子株式会社 | Electronic device and control method thereof |
CN111417924A (en) * | 2017-11-23 | 2020-07-14 | 三星电子株式会社 | Electronic device and control method thereof |
CN111295708A (en) * | 2017-12-07 | 2020-06-16 | 三星电子株式会社 | Speech recognition apparatus and method of operating the same |
US11636143B2 (en) | 2018-05-09 | 2023-04-25 | Huawei Technologies Co., Ltd. | Entity search method, related device, and computer storage medium |
WO2019214679A1 (en) * | 2018-05-09 | 2019-11-14 | 华为技术有限公司 | Entity search method, related device and computer storage medium |
US11238050B2 (en) | 2018-07-03 | 2022-02-01 | Baidu Online Network Technology (Beijing) Co., Ltd. | Method and apparatus for determining response for user input data, and medium |
CN110737756A (en) * | 2018-07-03 | 2020-01-31 | 百度在线网络技术(北京)有限公司 | Method, apparatus, device and medium for determining a response to user input data |
CN110858216B (en) * | 2018-08-13 | 2023-06-06 | 株式会社日立制作所 | Dialogue method, dialogue system, and storage medium |
CN110858216A (en) * | 2018-08-13 | 2020-03-03 | 株式会社日立制作所 | Dialogue method, dialogue system, and storage medium |
CN109597993A (en) * | 2018-11-30 | 2019-04-09 | 深圳前海微众银行股份有限公司 | Sentence analysis processing method, device, equipment and computer readable storage medium |
CN110457423A (en) * | 2019-06-24 | 2019-11-15 | 平安科技(深圳)有限公司 | A kind of knowledge mapping entity link method, apparatus, computer equipment and storage medium |
CN110334347A (en) * | 2019-06-27 | 2019-10-15 | 腾讯科技(深圳)有限公司 | Information processing method, relevant device and storage medium based on natural language recognition |
CN110334347B (en) * | 2019-06-27 | 2024-06-28 | 腾讯科技(深圳)有限公司 | Information processing method based on natural language recognition, related equipment and storage medium |
CN110456339A (en) * | 2019-08-12 | 2019-11-15 | 四川九洲电器集团有限责任公司 | A kind of inquiry, answer method and device, computer storage medium, electronic equipment |
CN112396481A (en) * | 2019-08-13 | 2021-02-23 | 北京京东尚科信息技术有限公司 | Offline product information transmission method, system, electronic device, and storage medium |
CN110647987A (en) * | 2019-08-22 | 2020-01-03 | 腾讯科技(深圳)有限公司 | Method and device for processing data in application program, electronic equipment and storage medium |
CN110765342A (en) * | 2019-09-12 | 2020-02-07 | 竹间智能科技(上海)有限公司 | Information query method and device, storage medium and intelligent terminal |
CN111597808A (en) * | 2020-04-24 | 2020-08-28 | 北京百度网讯科技有限公司 | Instrument panel drawing processing method and device, electronic equipment and storage medium |
CN111597808B (en) * | 2020-04-24 | 2023-07-25 | 北京百度网讯科技有限公司 | Instrument panel drawing processing method and device, electronic equipment and storage medium |
CN111538894A (en) * | 2020-06-19 | 2020-08-14 | 腾讯科技(深圳)有限公司 | Query feedback method and device, computer equipment and storage medium |
CN112527819A (en) * | 2020-12-08 | 2021-03-19 | 北京百度网讯科技有限公司 | Address book information retrieval method and device, electronic equipment and storage medium |
CN112527819B (en) * | 2020-12-08 | 2024-06-04 | 北京百度网讯科技有限公司 | Address book information retrieval method and device, electronic equipment and storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN107330120B (en) | 2018-09-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107330120B (en) | Inquire answer method, inquiry answering device and computer readable storage medium | |
US11062270B2 (en) | Generating enriched action items | |
CN107430616B (en) | Interactive reformulation of voice queries | |
WO2020140635A1 (en) | Text matching method and apparatus, storage medium and computer device | |
Schalkwyk et al. | “Your word is my command”: Google search by voice: A case study | |
US10460720B2 (en) | Generation of language understanding systems and methods | |
US9176941B2 (en) | Text inputting method, apparatus and system based on a cache-based language model and a universal language model | |
KR20190131065A (en) | Detect Mission Changes During Conversation | |
US10242672B2 (en) | Intelligent assistance in presentations | |
US11482223B2 (en) | Systems and methods for automatically determining utterances, entities, and intents based on natural language inputs | |
US20230185834A1 (en) | Data manufacturing frameworks for synthesizing synthetic training data to facilitate training a natural language to logical form model | |
KR20190000776A (en) | Information inputting method | |
US20230214579A1 (en) | Intelligent character correction and search in documents | |
CN116501960B (en) | Content retrieval method, device, equipment and medium | |
US10885275B2 (en) | Phrase placement for optimizing digital page | |
US20230185799A1 (en) | Transforming natural language to structured query language based on multi-task learning and joint training | |
CN113342948A (en) | Intelligent question and answer method and device | |
CN107424612A (en) | Processing method, device and machine readable media | |
US20210312138A1 (en) | System and method for handling out of scope or out of domain user inquiries | |
KR20220109238A (en) | Device and method for providing recommended sentence related to utterance input of user | |
US20230376700A1 (en) | Training data generation to facilitate fine-tuning embedding models | |
CN112149403A (en) | Method and device for determining confidential text | |
WO2023200519A1 (en) | Editing files using a pattern-completion engine | |
US20200175476A1 (en) | Job identification for optimizing digital page | |
RU2759090C1 (en) | Method for controlling a dialogue and natural language recognition system in a platform of virtual assistants |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20200727 Address after: 518000 Nanshan District science and technology zone, Guangdong, Zhejiang Province, science and technology in the Tencent Building on the 1st floor of the 35 layer Patentee after: TENCENT TECHNOLOGY (SHENZHEN) Co.,Ltd. Address before: 100029, Beijing, Chaoyang District new East Street, building No. 2, -3 to 25, 101, 8, 804 rooms Patentee before: Tricorn (Beijing) Technology Co.,Ltd. |
|
TR01 | Transfer of patent right |