CN107330120A - Inquire answer method, inquiry answering device and computer-readable recording medium - Google Patents

Inquire answer method, inquiry answering device and computer-readable recording medium Download PDF

Info

Publication number
CN107330120A
CN107330120A CN201710575024.3A CN201710575024A CN107330120A CN 107330120 A CN107330120 A CN 107330120A CN 201710575024 A CN201710575024 A CN 201710575024A CN 107330120 A CN107330120 A CN 107330120A
Authority
CN
China
Prior art keywords
inquiry
correlation
inquiry message
candidate
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710575024.3A
Other languages
Chinese (zh)
Other versions
CN107330120B (en
Inventor
陈华荣
亓超
王卓然
马宇驰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Triangle Animal (beijing) Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Triangle Animal (beijing) Technology Co Ltd filed Critical Triangle Animal (beijing) Technology Co Ltd
Priority to CN201710575024.3A priority Critical patent/CN107330120B/en
Publication of CN107330120A publication Critical patent/CN107330120A/en
Application granted granted Critical
Publication of CN107330120B publication Critical patent/CN107330120B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation or dialogue systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Artificial Intelligence (AREA)
  • Human Computer Interaction (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention provides a kind of inquiry answer method, inquiry answering device and computer-readable recording medium, and the inquiry answer method includes:Semantic processes step (S101), carries out semantic processes, with the user view of the inquiry purpose of reaction of formation inquiry message and for the retrieval information used in being retrieved according to inquiry message to the inquiry message that user inputs;Searching step (S102), based on the retrieval information, carries out the data retrieval based on participle from database, obtains the list of candidate's solid data;Sequence step (S103), based on the degree of correlation between candidate's solid data and user view, processing is ranked up to candidate's solid data;And first result determine step (S104), candidate's solid data will in list with the highest degree of correlation is defined as the response result for user's query information.

Description

Inquire answer method, inquiry answering device and computer-readable recording medium
Technical field
The present invention relates to the inquiry answer method based on semantic understanding, inquiry answering device and computer-readable storage medium Matter.
Background technology
At present, fuzzy semantics understand be in information retrieval and semantic analysis one it is very universal the problem of, if can not be very The good identification that semanteme is carried out to it, the result of return may not be very much the result that user wants greatly.Phonetic entry is just turning into more Carry out more universal interactive mode, although have benefited from the lifting of computing capability and the accumulation of mass data, the use of deep learning is big Width reduces identification error rate, but still has 4%-5% error rate, and the field occured frequently in some neologisms is particularly acute, and this is just So that fuzzy semantics, which understand, seems critically important.Still further aspect, due to information huge explosion, man memory power is limited, when many Waiting possibly can not accurately say whole information, and this also causes fuzzy semantics to understand a necessary part as system.
In view of the above-mentioned problems, application publication number proposes a kind of fuzzy inspection of entity for CN106294875A Chinese patent application Rope method and system, but this method is relatively simple, does not account for the factor of phonetic error correction etc, it is difficult to solve current Vague language The problem of reason and good sense solution.
Separately have, application publication number proposes crucial in a kind of network search procedure for CN101206673A Chinese patent application The intelligent correction system and method for word.The system is applied on internet platform, set up language model, corresponding dictionary and Data directory database, calculates sound character error and fuzzy matching calculates morphological pattern error correction, and degree of correlation filtering and sequence are carried out to result, Obtain immediate several results.This method is to be used for web search, it is impossible to the fuzzy search suitable for many wheel dialogues, it is impossible to The error correction of solution fuzzy phoneme, it is impossible to the problem of solving state transition in many wheel dialogues, it is impossible to solve retrieval result in the absence of optimal In the case of error correction, how to deal with and be defined when also not to coming to nothing, also influence of the error correction result to display, such as Prompt message etc..
The content of the invention
In view of above mentioned problem of the prior art have developed the present invention.The present invention is intended to provide one kind can carry out Vague language The system and method for reason and good sense solution, as user because the fuzzy expression of speech intonation, mistake or sending inaccurate the problems such as do not remember clearly During true instruction, system remains to make correct semantic understanding and smoothly completes information retrieval on this basis.It is applied to institute The scene of error correction is obscured the need for having, includes the fuzzy semantics error correction in the semantic error correction of web search, and many wheel dialogues.
The first aspect of the present invention provides a kind of inquiry answer method based on semantic understanding, the inquiry answer method bag Include:Semantic processes step (S101), carries out semantic processes, with the inquiry of reaction of formation inquiry message to the inquiry message that user inputs Ask the user view of purpose and for the retrieval information used in being retrieved according to inquiry message;Searching step (S102), is based on The retrieval information, carries out the data retrieval based on participle from database, obtains the list of candidate's solid data;Sequence step (S103), based on the degree of correlation between candidate's solid data and user view, processing is ranked up to candidate's solid data;And First result determines step (S104), will have candidate's solid data of the highest degree of correlation in list, is defined as asking for user Ask the response result of information.
The second aspect of the present invention provides a kind of inquiry answering device based on semantic understanding, the inquiry answering device bag Include:Semantic processing unit (1101), carries out semantic processes, with the inquiry of reaction of formation inquiry message to the inquiry message that user inputs Ask the user view of purpose and for the retrieval information used in being retrieved according to inquiry message;Retrieval unit (1102), is based on The retrieval information, carries out the data retrieval based on participle from database, obtains the list of candidate's solid data;Sequencing unit (1103), based on the degree of correlation between candidate's solid data and user view, processing is ranked up to candidate's solid data;And First result determining unit (1104), will have candidate's solid data of the highest degree of correlation in list, be defined as asking for user Ask the response result of information.
The third aspect of the present invention provides a kind of inquiry response system (100) based on semantic understanding, and the system includes User terminal (1001) and the server (1002) being connected with user terminal, the user terminal include:Input receiving unit (10011) inquiry message of user's input, is received;Semantic processing unit (10012), semantic processes are carried out to inquiry message, with The user view of the inquiry purpose of reaction of formation inquiry message and for the retrieval information used in being retrieved according to inquiry message; Transmitting element (10013), inquiry message, the user view of the inquiry message and retrieval information are sent in the way of associated Server, and the response result for inquiry message is received from server, the server includes:Receiving unit (10021), from User terminal receives inquiry message and the user view associated with the inquiry message and retrieval information;Retrieval unit (10022), Based on the retrieval information, the data retrieval based on participle is carried out from database, the list of candidate's solid data is obtained;Sequence Unit (10023), based on the degree of correlation between candidate's solid data and user view, is ranked up to candidate's solid data;With And result determining unit (10024), will there is candidate's solid data of the highest degree of correlation in list, be defined as being directed to user's query The response result of information, and response result is sent to user terminal.
The fourth aspect of the present invention provides a kind of computer-readable recording medium, and it stores computer program, the calculating Machine program is when being executed by processor, the step of realization includes according to above-mentioned inquiry answer method.
, also can be smooth even if the inquiry message of input error the problems such as due to user's vagueness in memory according to the present invention Completion information retrieval so that user results in the closer retrieval result of intention with user.In addition, even in the absence of In the case of optimal retrieval result, error correction can be also carried out, and provide a user the result after error correction.
Brief description of the drawings
In order to illustrate more clearly of the technical scheme in the embodiment of the present application, make required in being described below to embodiment Accompanying drawing is briefly described, it should be apparent that, drawings in the following description are only some implementations described in the application Example, on the premise of not paying creative work, can also be according to these accompanying drawings for this area or those of ordinary skill Obtain other accompanying drawings.
Fig. 1 is the figure for the hardware construction for showing the inquiry answering device in the present invention.
Fig. 2 is the flow chart for illustrating inquiry answer method according to a first embodiment of the present invention.
Fig. 3 is the flow chart for the semantic processes step for illustrating the inquiry answer method according to the present invention.
Fig. 4 is the flow chart for the sequence step for illustrating the inquiry answer method according to the present invention.
Fig. 5 is the block diagram for the software configuration for illustrating the inquiry answering device according to first embodiment.
Fig. 6 is the flow chart for illustrating inquiry answer method according to a second embodiment of the present invention.
Fig. 7 is the block diagram for the software configuration for illustrating the inquiry answering device according to second embodiment.
Fig. 8 is the flow chart for illustrating inquiry answer method according to the preferred embodiment of the invention.
Fig. 9 is the block diagram for the software configuration for illustrating the inquiry answering device according to preferred embodiment.
Figure 10 is the schematic diagram for illustrating the inquiry response system of the present invention.
Embodiment
Hereinafter describe embodiments of the invention in detail with reference to the accompanying drawings.It should be appreciated that following embodiments and unawareness The figure limitation present invention, also, on the means solved the problems, such as according to the present invention, it is not absolutely required to be retouched according to following embodiments The whole combinations for each side stated.For simplicity, to identical structure division or step, identical has been used to mark or mark Number, and the description thereof will be omitted.
[hardware configuration of inquiry answering device]
Fig. 1 is the figure for the hardware construction for showing the inquiry answering device in the present invention.In the present embodiment, with smart phone Description is provided as the example of inquiry answering device.Although it is noted that illustrating smart phone in the present embodiment as inquiry Ask answering device 1000, but it is clear that not limited to this, inquiry answering device of the invention can be mobile terminal (smart mobile phone, Intelligent watch, Intelligent bracelet, music player devices), notebook computer, tablet personal computer, PDA (personal digital assistant), fax dress Put, printer or be with inquiry answering internet device (such as digital camera, refrigerator, television set) Etc. various devices.
First, the hardware configuration of the block diagram description inquiry answering device 1000 (2000,3000) of reference picture 1.In addition, at this Following construction is described as example in embodiment, but the inquiry answering device of the present invention is not limited to the construction shown in Fig. 1.
Inquiry answering device 1000 includes input interface 101, CPU 102, the ROM being connected to each other via system bus 103rd, RAM 105, storage device 106, output interface 104, communication unit 107 and short-distance wireless communication unit 108 and display Unit 109.Input interface 101 is For via such as microphone, button, button or touch-screen operating unit (not shown) receive from user input data and The interface of operational order.It note that the display unit 109 being described later on and operating unit can be at least partly integrated, also, For example, it may be carrying out picture output in same picture and receiving the construction of user's operation.
CPU 102 is system control unit, and generally comprehensively answering device 1000 is inquired in control.In addition, for example, CPU 102 carries out the display control of the display unit 109 of inquiry answering device 1000.It is all that the storages of ROM 103 CPU 102 is performed The fixed data of such as tables of data and control program and operating system (OS) program.In the present embodiment, stored in ROM 103 Each control program, for example, under the OS stored in ROM 103 management, carry out at such as scheduling, task switching and interruption The software of reason etc. performs control.
RAM 105 is constructed such as SRAM (static RAM), DRAM as needing stand-by power supply.This In the case of, RAM 105 can store the significant data of control variable of program etc. in a non-volatile manner.In addition, RAM 105 Working storage and main storage as CPU 102.
The model of the storage training in advance of storage device 106 is (for example, word error correction mode, physical model, Rank models, semanteme Model etc.), for the database retrieved and for performing according to application program of inquiry answer method of the present invention etc.. It note that database here can also be stored in the external device (ED) of such as server.In addition, storage device 106 stores all Such as it is used for the information transmission/receiving control program for being transmitted/receiving via communication unit 107 and communicator (not shown) Various programs, and various information that these programs are used.In addition, storage device 106 also stores inquiry answering device 1000 Configuration information, inquire the management data etc. of answering device 1000.
Output interface 104 is for being controlled the display picture with display information and application program to display unit 109 The interface in face.Display unit 109 is for example constructed by LCD (liquid crystal display).Have such as by being arranged on display unit 109 The soft keyboard of the key of numerical value enter key, mode setting button, decision key, cancel key and power key etc., can receive single via display The input from user of member 109.
Inquire answering device 100 via communication unit 107 for example, by radio communications such as Wi-Fi (Wireless Fidelity) or bluetooth Method, data communication is performed with external device (ED) (not shown).
In addition, inquiry answering device 1000 can also via short-distance wireless communication unit 108, in short-range with External device (ED) etc. carries out wireless connection and performs data communication.And short-distance wireless communication unit 108 by with communication unit 107 different communication means are communicated.It is, for example, possible to use its communication range is shorter than the communication means of communication unit 107 Bluetooth Low Energy (BLE) as short-distance wireless communication unit 108 communication means.In addition, being used as short-distance wireless communication list The communication means of member 108, for example, it is also possible to perceive (Wi-Fi Aware) using NFC (near-field communication) or Wi-Fi.
[first embodiment]
[according to the inquiry answer method of first embodiment]
It can be stored according to the inquiry answer method of the present invention by inquiring that the CPU 102 of answering device 1000 is read ROM 103 or control program on storage device 106 or via communication unit 107 from passing through network and inquiry answering device The webserver (not shown) of 1000 connections and the control program downloaded are realized.
, it is necessary to first preparation model and database before the inquiry answer method according to the present invention is carried out.Idiographic flow is such as Under:
(1) crawl of related data:The crawl of solid data and the crawl of associated data such as label etc., its In, solid data refers to the entity in certain field (such as video field), such as film " private savings of husbands ", " the Mi months pass ", and Label is exactly the word for describing the entity:Such as " social forest ", " love ".
(2) training of model:Word error correcting model:The mapping of the pinyin table and fuzzy phoneme of word is set up, passes through the instruction to language material Practice, calculate the transition probability model between the probabilistic model of word and word;Physical model:Using language model to including solid data Language material be trained, it is established that identification entity model;Rank models:Pass through ready data and feature extraction, training Into GBDT decision-tree model;Semantic model:By language model and training corpus, the model of semanteme can be extracted by being trained to. Sample needed for model above training process, can be crawled from public network.
(3) the index storage of data:To the field modeling, based on existing data and model, be processed into be available for retrieval and The structural data and storage of semantic understanding.
Next, being illustrated with reference to Fig. 2 to Fig. 4 to inquiry answer method according to a first embodiment of the present invention.Wherein, Fig. 2 is the flow chart for illustrating inquiry answer method according to a first embodiment of the present invention;Fig. 3 is to illustrate the inquiry according to the present invention The flow chart of the semantic processes step of answer method;Fig. 4 is the sequence step for illustrating the inquiry answer method according to the present invention Flow chart.
As shown in Fig. 2 first, in semantic processes step S101, language is carried out to the inquiry message (query) that user inputs Justice processing, with the user view of the inquiry purpose of reaction of formation inquiry message and for used in being retrieved according to inquiry message Retrieve information.Preferably, as shown in figure 3, semantic processes step S101 further comprises:User view identification step S1011 is right Inquiry message carries out user view identification, obtains the user view corresponding to inquiry message;Entity recognition step S1012, passes through The physical model of training in advance, identifies solid data from inquiry message;And semantic understanding step S1013, by advance The semantic model of training, carries out semantic understanding, to obtain retrieval information to inquiry message.Here, inquiry message be user for example By the text message of input through keyboard, by changing the text envelope that user is for example generated by the voice messaging of microphone input One in the text message of the text message and the text combination for being converted into user speech information of breath and user's input Kind.For example, user can input inquiry message " I will see The Shawshank Redemption ", now, pass through Entity recognition step, Ke Yicong Entity " The Shawshank Redemption " is identified in the inquiry message, by semantic understanding step, semantic reason is carried out to the inquiry message Solution, can obtain retrieval information, retrieval information here using the intelligible slot value pair of computer form, for example " title= The Shawshank Redemption ".
Next, in searching step S102, based on the retrieval information, the data based on participle are carried out from database Retrieval, obtains the list of candidate's solid data.Here, first by the slot value obtained in semantic understanding step S1013 to conversion Into the sentence that can be retrieved (for example, " title=The Shawshank Redemption " is converted into " film title:The Shawshank Redemption "), Then retrieval request is sent with returning result list to database., can be according to pre-prepd participle mould in retrieving Type carries out participle to the value (such as " The Shawshank Redemption ") in retrieval information, and in the preparation of database, also can be in storehouse Each solid data carries out participle and is indexed with falling sequence, and the result of matching is found out this makes it possible to the result by participle Come.It is based on the advantage that participle is retrieved, even if the inquiry message of user's input error due to vagueness in memory is (for example " cucurbit baby brother "), by inquiry message participle it is " cucurbit baby " and " brother " by participle model, can be also examined from database Rope goes out desired result (such as " Calabash Brothers ");Or, user may input incomplete inquiry message (such as " Xiao Shenke Redeem "), by participle model by inquiry message participle be " Xiao Shenke " and " redeeming ", the phase can be also retrieved from database The result (such as " The Shawshank Redemption ") of prestige.
Next, in sequence step S103, based on the degree of correlation between candidate's solid data and user view, to candidate Solid data is ranked up processing.Preferably, as shown in figure 4, sequence step S103 further comprises:Relatedness computation step S1031, the degree of correlation between candidate's solid data and user view is calculated according to GBDT models;And relevancy ranking step S1032, based on the degree of correlation calculated, is ranked up using Rank models to candidate's solid data.Here, candidate's reality is being calculated During the degree of correlation between volume data and user view, first by context state, entity static information (such as label, name, Classification etc.), the multidate information (such as distance of temperature, marking and current time) of entity calculate characteristic value, then will be all Characteristic value calculate the last degree of correlation by pre-prepd GBDT models.Here, the characteristic value of static information is logical The matching degree for the inquiry message that information is inputted with user is crossed come what is calculated, this matching degree can pass through phonetic (including fuzzy phoneme) Editing distance, the editing distance of word, semantic editing distance etc. determine that and the characteristic value of multidate information can be by certain Formula calculate.
Finally, in the first result determines step S104, will there is candidate's solid data of the highest degree of correlation in list, really It is set to the response result for user's query information.Here it is possible to by display unit 109, by the time with the highest degree of correlation Solid data is selected as optimal result and returns to user.
Inquiry answer method according to a first embodiment of the present invention, by the way that based on retrieval information, base is carried out from database In the data retrieval of participle, the list of candidate's solid data is obtained, and based on the phase between candidate's solid data and user view Guan Du, processing is ranked up to candidate's solid data, can obtain following technique effect:Even if a. due to user's vagueness in memory or Input error and input incomplete inquiry message, can also retrieve preferable result;B. allow users to obtain with using The closer retrieval result of intention at family.
[according to the software configuration of the inquiry answering device of first embodiment]
Fig. 5 is the block diagram for the software configuration for illustrating the inquiry answering device according to first embodiment.As shown in figure 5, inquiry Answering device 1000 includes semantic processing unit 1101, retrieval unit 1102, the result of sequencing unit 1103 and first and determines list Member 1104.
Specifically, semantic processing unit 1101 includes:User view recognition unit 11011, is used inquiry message Family intention assessment, obtains the user view corresponding to inquiry message;Entity recognition unit 11012, passes through the entity of training in advance Model, identifies solid data from inquiry message;And semantic understanding unit 11013, by the semantic model of training in advance, Semantic understanding is carried out to inquiry message, to obtain retrieval information.Retrieval unit 1102 is based on the retrieval information, from database The data retrieval based on participle is carried out, the list of candidate's solid data is obtained.Sequencing unit 1103 includes:Correlation calculating unit 11031, the degree of correlation between candidate's solid data and user view is calculated according to GBDT models;And relevancy ranking unit 11032, based on the degree of correlation calculated, candidate's solid data is ranked up.First result determining unit 1104, by list Candidate's solid data with the highest degree of correlation, is defined as the response result for user's query information.
[second embodiment]
[according to the inquiry answer method of second embodiment]
Inquiry answer method according to a second embodiment of the present invention is illustrated with reference to Fig. 6.Wherein, Fig. 6 is example Show the flow chart of inquiry answer method according to a second embodiment of the present invention.
As shown in fig. 6, according to the inquiry answer method of second embodiment and the inquiry answer method according to first embodiment Difference is, adds the first judgment step S204, the second judgment step S205 and the second result and determines step S206.
Specifically, in the first judgment step S204, according to similarity distance, the list obtained in step s 103 is calculated In there is first degree of correlation between the candidate's solid data and inquiry message of the highest degree of correlation, and whether judge first degree of correlation Less than first threshold.Here, the different attribute of solid data is equivalent to the different slots of semantic understanding, and the inquiry of attribute and user Ask that the degree of correlation of information is determined by similarity distance, similarity distance here include the editor of phonetic (including fuzzy phoneme) away from Editing distance from, the editing distance of word and semanteme etc., wherein, the editing distance of word is for example because font is close, unisonance is different Situations such as word, few word multiword and produce.If first degree of correlation is less than first threshold (being "Yes" in step S204), then it represents that from Differing greatly between the desired result of optimal result and user retrieved in database, at this moment, processing proceed to the second knot Fruit determines step S206, and the solid data identified in Entity recognition step S1012 is defined as into response result and returned to User so that it is not anticipated that in the case of result, user can also obtain preferable result in database.For example, with Family inputs inquiry message " I will see dear Interpreter Officer " in step S101, and does not have the film in database, at this moment The entity " dear Interpreter Officer " identified in Entity recognition step S1012 can be returned to user.
On the other hand, if first degree of correlation is more than or equal to first threshold (in step S204 be "No"), handle into Row is to the second judgment step S205, to judge whether first degree of correlation is more than Second Threshold.If first degree of correlation is more than second Threshold value (being "Yes" in step S205), then it represents that the optimal result retrieved from database is consistent with the desired result of user, And handle and proceed to step S104, by the optimal result, be defined as the response result for user's query information.So that User results in satisfied response result.
On the other hand, if first degree of correlation is not more than Second Threshold (being "No" in step S205), then it represents that from data Difference is still suffered between the desired result of optimal result and user retrieved in storehouse, at this moment, processing proceeds to step S206, with The solid data identified in Entity recognition step S1012 is defined as response result and returns to user.
It is advance in training, checking and the performance tested according to model for note that above first threshold and Second Threshold Determine, to ensure in the performance recalled with had in accuracy rate.
In addition, in above-mentioned second judgment step S205, if having candidate's solid data of the highest degree of correlation in list First degree of correlation between inquiry message is not more than Second Threshold, can also determine whether there is the second high correlation in list Whether the degree of correlation between the candidate's solid data and inquiry message of degree is more than Second Threshold, and be judged as the situation of "Yes" Under, proceed to step S104.It can so avoid leading to miss optimal response knot due to sequencing errors in step s 103 Really.In the case where not appreciably affecting processing speed, preceding N that can be in step S205 successively in calculations list is (for example, N= 3) degree of correlation between the candidate's solid data and inquiry message of position.
Inquiry answer method according to a second embodiment of the present invention, by calculating the phase between optimal result and inquiry message Guan Du, carrys out the response result that certainly directional user returns, can obtain following technique effect:So that without pre- in database In the case of phase result, user can also obtain preferable result.
[according to the software configuration of the inquiry answering device of second embodiment]
Fig. 7 is the block diagram for the software configuration for illustrating the inquiry answering device according to second embodiment.As shown in fig. 7, according to The difference of the inquiry answering device 2000 of second embodiment and the inquiry answering device 1000 according to first embodiment is, increases First judging unit 1204, the second result determining unit 1206 and the second judging unit 1205.
Specifically, the first judging unit is according to candidate's entity number in similarity distance calculations list with the highest degree of correlation According to first degree of correlation between inquiry message, and judge whether first degree of correlation is less than first threshold.Second result determines single Member recognizes the Entity recognition unit in the case where first judging unit judges that first degree of correlation is less than first threshold The solid data gone out, is defined as response result.Second judging unit, judges whether first degree of correlation is more than Second Threshold, wherein, In the case where second judging unit judges that first degree of correlation is more than Second Threshold, the first result determining unit will have There is candidate's solid data of the highest degree of correlation, be defined as response result, and wherein, the similarity distance includes the editor of phonetic At least one of editing distance of distance, the editing distance of word and semanteme.
[preferred embodiment]
[according to the inquiry answer method of preferred embodiment]
Inquiry answer method according to the preferred embodiment of the invention is illustrated with reference to Fig. 8.Fig. 8 is to illustrate basis The flow chart of the inquiry answer method of the preferred embodiment of the present invention.
As shown in figure 8, according to the inquiry answer method and the inquiry answer method according to first embodiment of preferred embodiment Difference be, add pretreatment and error correction step S301.
Specifically, in pretreatment and error correction step S301, inquiry message is pre-processed, and by instructing in advance Experienced word error correcting model, correction process is carried out to the inquiry message by pretreatment.Here, the pretreatment includes believing inquiry The deletion of the stop words and spoken word that are included in breath and the capital and small letter of letter and number included in inquiry message is changed Deng.For example, when user input inquiry message in include some colloquial words when, carry out semantic processes step S101 it It is preceding, it is necessary to remove these colloquial words.For example, in the feelings that the inquiry message that user inputs is " I will see dear diplomat " Under condition, colloquial word " I will see " can be deleted by pretreatment first.Then, will be pre- by the word error correcting model of training in advance Inquiry message " dear diplomat " after processing is corrected as " dear Interpreter Officer ".Next, to by pretreatment and error correction Inquiry message after processing carries out subsequent treatment.In addition, user is also possible to the inquiry message of input error due to pronunciation mistake, For example in the case where the inquiry message that user inputs is " Xiao Shengke's redeems ", entangled by the fuzzy phoneme in word correction process It is wrong, additionally it is possible to be corrected as " The Shawshank Redemption ".
According to the inquiry answer method of preferred embodiment by being carried out before semantic processes are carried out at pretreatment and error correction Reason, can be corrected to the inquiry message that user inputs, so as to improve the accuracy of later retrieval.
[according to the software configuration of the inquiry answering device of preferred embodiment]
Fig. 9 is the block diagram for the software configuration for illustrating the inquiry answering device according to preferred embodiment.As shown in figure 9, according to The difference of the inquiry answering device 3000 of preferred embodiment and the inquiry answering device 1000 according to first embodiment is, increases Pretreatment and error correction unit 1301.
Specifically, pretreatment and error correction unit 1301 are pre-processed to inquiry message, and pass through training in advance Word error correcting model, correction process is carried out to the inquiry message by pretreatment.
In addition, present invention also offers a kind of inquiry response system based on semantic understanding.Figure 10 is to illustrate the present invention The schematic diagram of inquiry response system.As shown in Figure 10, inquiry response system 100 includes user terminal 1001 and server 1002, User terminal 1001 is connected with server 1002 via network 1003, and network 1003 can be cable network or wireless network.
User terminal 1001 includes input receiving unit 10011, semantic processing unit 10012 and transmitting element 10013.Clothes Business device 1002 includes receiving unit 10021, retrieval unit 10022, sequencing unit 10023 and result determining unit 10024.
Specifically, in user terminal 1001, input receiving unit 10011 receives the inquiry message of user's input;Language Adopted processing unit 10012 carries out semantic processes to inquiry message, with the user view of the inquiry purpose of reaction of formation inquiry message With for the retrieval information used in being retrieved according to inquiry message;Transmitting element 10013 is by inquiry message, the inquiry message User view and retrieval information are sent to server in the way of associated, and receive the response for inquiry message from server As a result.
On the other hand, in server 1002, receiving unit 10021 from user terminal receive inquiry message and with the inquiry The associated user view of information and retrieval information;Retrieval unit 10022 is based on the retrieval information, and base is carried out from database In the data retrieval of participle, the list of candidate's solid data is obtained;Sequencing unit 10023 is based on candidate's solid data and anticipated with user The degree of correlation between figure, is ranked up to candidate's solid data;As a result determining unit 10024 will have the highest degree of correlation in list Candidate's solid data, be defined as the response result for user's query information, and response result is sent to user terminal.
Although with reference to exemplary embodiment, invention has been described above, above-described embodiment is only to illustrate this hair Bright technical concepts and features, it is not intended to limit the scope of the present invention.It is all to be done according to spirit of the invention Any equivalent variations or modification, should all be included within the scope of the present invention.

Claims (20)

1. a kind of inquiry answer method based on semantic understanding, the inquiry answer method includes:
Semantic processes step (S101), carries out semantic processes, with reaction of formation inquiry message to the inquiry message that user inputs Inquire the user view of purpose and for the retrieval information used in being retrieved according to inquiry message;
Searching step (S102), based on the retrieval information, carries out the data retrieval based on participle from database, obtains candidate The list of solid data;
Sequence step (S103), based on the degree of correlation between candidate's solid data and user view, is carried out to candidate's solid data Sequence is handled;And
First result determines step (S104), will have candidate's solid data of the highest degree of correlation in list, is defined as using The response result of family inquiry message.
2. inquiry answer method according to claim 1, wherein, the semantic processes step (S101) includes:
User view identification step (S1011), user view identification is carried out to inquiry message, obtains the use corresponding to inquiry message Family is intended to;
Entity recognition step (S1012), by the physical model of training in advance, identifies solid data from inquiry message;With And
Semantic understanding step (S1013), by the semantic model of training in advance, semantic understanding is carried out to inquiry message, to obtain Retrieve information.
3. inquiry answer method according to claim 2, the inquiry answer method the sequence step (S103) it Also include afterwards:
First judgment step (S204), according to candidate's solid data in similarity distance calculations list with the highest degree of correlation with asking First degree of correlation between information is asked, and judges whether first degree of correlation is less than first threshold;And
Second result determines step (S206), judges that first degree of correlation is less than the feelings of first threshold in first judgment step Under condition, the solid data that will be identified in the Entity recognition step is defined as response result.
4. inquiry answer method according to claim 3, the inquiry answer method is after the described first determination step Also include:
Second judgment step (S205), judges whether first degree of correlation is more than Second Threshold,
Wherein, in the case of judging that first degree of correlation is more than Second Threshold in second judgment step, in first knot Fruit is determined in step, by candidate's solid data with the highest degree of correlation, is defined as response result, and
Wherein, in the editing distance of editing distance of the similarity distance including phonetic, the editing distance of word and semanteme at least One.
5. inquiry answer method according to any one of claim 1 to 4, wherein, the sequence step (S103) includes:
Relatedness computation step (S1031), the degree of correlation between candidate's solid data and user view is calculated according to GBDT models; And
Relevancy ranking step (S1032), based on the degree of correlation calculated, is ranked up to candidate's solid data.
6. inquiry answer method according to any one of claim 1 to 4, the inquiry answer method is at the semanteme Also include before reason step (S101):
Pretreatment and error correction step (S301), are pre-processed to inquiry message, and by the word error correcting model of training in advance, Correction process is carried out to the inquiry message by pretreatment.
7. inquiry answer method according to claim 6, the pretreatment includes the stop words to being included in inquiry message The capital and small letter of deletion with spoken word and the letter and number to being included in inquiry message is changed.
8. inquiry answer method according to any one of claim 1 to 4, wherein, the retrieval information uses slot value pair Form.
9. inquiry answer method according to any one of claim 1 to 4, the inquiry message is the text that user inputs Information, by change voice messaging that user inputs and text message that the text message generated and user input with that will use One kind in the text message for the text combination that family voice messaging is converted into.
10. a kind of inquiry answering device based on semantic understanding, the inquiry answering device includes:
Semantic processing unit (1101), carries out semantic processes, with reaction of formation inquiry message to the inquiry message that user inputs Inquire the user view of purpose and for the retrieval information used in being retrieved according to inquiry message;
Retrieval unit (1102), based on the retrieval information, carries out the data retrieval based on participle from database, obtains candidate The list of solid data;
Sequencing unit (1103), based on the degree of correlation between candidate's solid data and user view, is carried out to candidate's solid data Sequence is handled;And
First result determining unit (1104), will have candidate's solid data of the highest degree of correlation, is defined as using in list The response result of family inquiry message.
11. inquiry answering device according to claim 10, wherein, the semantic processing unit includes:
User view recognition unit (11011), user view identification is carried out to inquiry message, obtains the use corresponding to inquiry message Family is intended to;
Entity recognition unit (11012), by the physical model of training in advance, identifies solid data from inquiry message;With And
Semantic understanding unit (11013), by the semantic model of training in advance, semantic understanding is carried out to inquiry message, to obtain Retrieve information.
12. inquiry answering device according to claim 11, the inquiry answering device also includes:
First judging unit (1204), according to candidate's solid data in similarity distance calculations list with the highest degree of correlation with asking Ask first degree of correlation between information;And
Second result determining unit (1206), judges that first degree of correlation is less than the situation of first threshold in first judging unit Under, the solid data that the Entity recognition unit is identified is defined as response result.
13. inquiry answering device according to claim 12, the inquiry answering device also includes:
Second judging unit (1205), judges whether first degree of correlation is more than Second Threshold,
Wherein, in the case where second judging unit judges that first degree of correlation is more than Second Threshold, first result is true Candidate's solid data with the highest degree of correlation is defined as response result by order member, and
Wherein, in the editing distance of editing distance of the similarity distance including phonetic, the editing distance of word and semanteme at least One.
14. the inquiry answering device according to any one of claim 10 to 13, wherein, the sequencing unit includes:
Correlation calculating unit (11031), the degree of correlation between candidate's solid data and user view is calculated according to GBDT models; And
Relevancy ranking unit (11032), based on the degree of correlation calculated, is ranked up to candidate's solid data.
15. the inquiry answering device according to any one of claim 10 to 13, the inquiry answering device also includes:
Pretreatment and error correction unit (1301), are pre-processed to inquiry message, and by the word error correcting model of training in advance, Correction process is carried out to the inquiry message by pretreatment.
16. inquiry answering device according to claim 15, the pretreatment includes the deactivation to being included in inquiry message The deletion of word and spoken word and the capital and small letter of letter and number included in inquiry message is changed.
17. the inquiry answering device according to any one of claim 10 to 13, wherein, the retrieval information uses slot value To form.
18. the inquiry answering device according to any one of claim 10 to 13, the inquiry message is what user inputted Text message, by change voice messaging that user inputs and text message that the text message generated and user input with One kind in the text message for the text combination that user speech information is converted into.
19. a kind of inquiry response system (100) based on semantic understanding, the system includes user terminal (1001) and and user The server (1002) of terminal connection,
The user terminal includes:
Receiving unit (10011) is inputted, receives the inquiry message of user's input;
Semantic processing unit (10012), carries out semantic processes, with the inquiry purpose of reaction of formation inquiry message to inquiry message User view and for the retrieval information used in being retrieved according to inquiry message;
Transmitting element (10013), inquiry message, the user view of the inquiry message and retrieval information are sent out in the way of associated Server is given, and the response result for inquiry message is received from server,
The server includes:
Receiving unit (10021), inquiry message and the user view associated with the inquiry message and inspection are received from user terminal Rope information;
Retrieval unit (10022), based on the retrieval information, the data retrieval based on participle is carried out from database, is waited Select the list of solid data;
Sequencing unit (10023), based on the degree of correlation between candidate's solid data and user view, is carried out to candidate's solid data Sequence;And
As a result determining unit (10024), will have candidate's solid data of the highest degree of correlation in list, be defined as asking for user The response result of information is asked, and response result is sent to user terminal.
20. a kind of computer-readable recording medium, it stores computer program, and the computer program is being executed by processor When, realize the step of inquiry answer method according to any one of claim 1 to 9 includes.
CN201710575024.3A 2017-07-14 2017-07-14 Inquire answer method, inquiry answering device and computer readable storage medium Active CN107330120B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710575024.3A CN107330120B (en) 2017-07-14 2017-07-14 Inquire answer method, inquiry answering device and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710575024.3A CN107330120B (en) 2017-07-14 2017-07-14 Inquire answer method, inquiry answering device and computer readable storage medium

Publications (2)

Publication Number Publication Date
CN107330120A true CN107330120A (en) 2017-11-07
CN107330120B CN107330120B (en) 2018-09-18

Family

ID=60226783

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710575024.3A Active CN107330120B (en) 2017-07-14 2017-07-14 Inquire answer method, inquiry answering device and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN107330120B (en)

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108021554A (en) * 2017-11-14 2018-05-11 无锡小天鹅股份有限公司 Audio recognition method, device and washing machine
CN109597993A (en) * 2018-11-30 2019-04-09 深圳前海微众银行股份有限公司 Sentence analysis processing method, device, equipment and computer readable storage medium
CN109800407A (en) * 2017-11-15 2019-05-24 腾讯科技(深圳)有限公司 Intension recognizing method, device, computer equipment and storage medium
CN110334347A (en) * 2019-06-27 2019-10-15 腾讯科技(深圳)有限公司 Information processing method, relevant device and storage medium based on natural language recognition
WO2019214679A1 (en) * 2018-05-09 2019-11-14 华为技术有限公司 Entity search method, related device and computer storage medium
CN110456339A (en) * 2019-08-12 2019-11-15 四川九洲电器集团有限责任公司 A kind of inquiry, answer method and device, computer storage medium, electronic equipment
CN110457423A (en) * 2019-06-24 2019-11-15 平安科技(深圳)有限公司 A kind of knowledge mapping entity link method, apparatus, computer equipment and storage medium
CN110647987A (en) * 2019-08-22 2020-01-03 腾讯科技(深圳)有限公司 Method and device for processing data in application program, electronic equipment and storage medium
CN110737756A (en) * 2018-07-03 2020-01-31 百度在线网络技术(北京)有限公司 Method, apparatus, device and medium for determining a response to user input data
CN110765342A (en) * 2019-09-12 2020-02-07 竹间智能科技(上海)有限公司 Information query method and device, storage medium and intelligent terminal
CN110858216A (en) * 2018-08-13 2020-03-03 株式会社日立制作所 Dialogue method, dialogue system, and storage medium
CN111295708A (en) * 2017-12-07 2020-06-16 三星电子株式会社 Speech recognition apparatus and method of operating the same
CN111417924A (en) * 2017-11-23 2020-07-14 三星电子株式会社 Electronic device and control method thereof
CN111538894A (en) * 2020-06-19 2020-08-14 腾讯科技(深圳)有限公司 Query feedback method and device, computer equipment and storage medium
CN111597808A (en) * 2020-04-24 2020-08-28 北京百度网讯科技有限公司 Instrument panel drawing processing method and device, electronic equipment and storage medium
CN112396481A (en) * 2019-08-13 2021-02-23 北京京东尚科信息技术有限公司 Offline product information transmission method, system, electronic device, and storage medium
CN112527819A (en) * 2020-12-08 2021-03-19 北京百度网讯科技有限公司 Address book information retrieval method and device, electronic equipment and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090089277A1 (en) * 2007-10-01 2009-04-02 Cheslow Robert D System and method for semantic search
CN104050256A (en) * 2014-06-13 2014-09-17 西安蒜泥电子科技有限责任公司 Initiative study-based questioning and answering method and questioning and answering system adopting initiative study-based questioning and answering method
CN104598445A (en) * 2013-11-01 2015-05-06 腾讯科技(深圳)有限公司 Automatic question-answering system and method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090089277A1 (en) * 2007-10-01 2009-04-02 Cheslow Robert D System and method for semantic search
CN104598445A (en) * 2013-11-01 2015-05-06 腾讯科技(深圳)有限公司 Automatic question-answering system and method
CN104050256A (en) * 2014-06-13 2014-09-17 西安蒜泥电子科技有限责任公司 Initiative study-based questioning and answering method and questioning and answering system adopting initiative study-based questioning and answering method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
邹超: "基于深度学习的中文代词消解及其在问答系统中的应用", 《中国优秀硕士学位论文全文数据库信息科技辑》 *

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108021554A (en) * 2017-11-14 2018-05-11 无锡小天鹅股份有限公司 Audio recognition method, device and washing machine
CN109800407A (en) * 2017-11-15 2019-05-24 腾讯科技(深圳)有限公司 Intension recognizing method, device, computer equipment and storage medium
CN111417924B (en) * 2017-11-23 2024-01-09 三星电子株式会社 Electronic device and control method thereof
CN111417924A (en) * 2017-11-23 2020-07-14 三星电子株式会社 Electronic device and control method thereof
CN111295708A (en) * 2017-12-07 2020-06-16 三星电子株式会社 Speech recognition apparatus and method of operating the same
US11636143B2 (en) 2018-05-09 2023-04-25 Huawei Technologies Co., Ltd. Entity search method, related device, and computer storage medium
WO2019214679A1 (en) * 2018-05-09 2019-11-14 华为技术有限公司 Entity search method, related device and computer storage medium
US11238050B2 (en) 2018-07-03 2022-02-01 Baidu Online Network Technology (Beijing) Co., Ltd. Method and apparatus for determining response for user input data, and medium
CN110737756A (en) * 2018-07-03 2020-01-31 百度在线网络技术(北京)有限公司 Method, apparatus, device and medium for determining a response to user input data
CN110858216B (en) * 2018-08-13 2023-06-06 株式会社日立制作所 Dialogue method, dialogue system, and storage medium
CN110858216A (en) * 2018-08-13 2020-03-03 株式会社日立制作所 Dialogue method, dialogue system, and storage medium
CN109597993A (en) * 2018-11-30 2019-04-09 深圳前海微众银行股份有限公司 Sentence analysis processing method, device, equipment and computer readable storage medium
CN110457423A (en) * 2019-06-24 2019-11-15 平安科技(深圳)有限公司 A kind of knowledge mapping entity link method, apparatus, computer equipment and storage medium
CN110334347A (en) * 2019-06-27 2019-10-15 腾讯科技(深圳)有限公司 Information processing method, relevant device and storage medium based on natural language recognition
CN110334347B (en) * 2019-06-27 2024-06-28 腾讯科技(深圳)有限公司 Information processing method based on natural language recognition, related equipment and storage medium
CN110456339A (en) * 2019-08-12 2019-11-15 四川九洲电器集团有限责任公司 A kind of inquiry, answer method and device, computer storage medium, electronic equipment
CN112396481A (en) * 2019-08-13 2021-02-23 北京京东尚科信息技术有限公司 Offline product information transmission method, system, electronic device, and storage medium
CN110647987A (en) * 2019-08-22 2020-01-03 腾讯科技(深圳)有限公司 Method and device for processing data in application program, electronic equipment and storage medium
CN110765342A (en) * 2019-09-12 2020-02-07 竹间智能科技(上海)有限公司 Information query method and device, storage medium and intelligent terminal
CN111597808A (en) * 2020-04-24 2020-08-28 北京百度网讯科技有限公司 Instrument panel drawing processing method and device, electronic equipment and storage medium
CN111597808B (en) * 2020-04-24 2023-07-25 北京百度网讯科技有限公司 Instrument panel drawing processing method and device, electronic equipment and storage medium
CN111538894A (en) * 2020-06-19 2020-08-14 腾讯科技(深圳)有限公司 Query feedback method and device, computer equipment and storage medium
CN112527819A (en) * 2020-12-08 2021-03-19 北京百度网讯科技有限公司 Address book information retrieval method and device, electronic equipment and storage medium
CN112527819B (en) * 2020-12-08 2024-06-04 北京百度网讯科技有限公司 Address book information retrieval method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN107330120B (en) 2018-09-18

Similar Documents

Publication Publication Date Title
CN107330120B (en) Inquire answer method, inquiry answering device and computer readable storage medium
US11062270B2 (en) Generating enriched action items
CN107430616B (en) Interactive reformulation of voice queries
WO2020140635A1 (en) Text matching method and apparatus, storage medium and computer device
Schalkwyk et al. “Your word is my command”: Google search by voice: A case study
US10460720B2 (en) Generation of language understanding systems and methods
US9176941B2 (en) Text inputting method, apparatus and system based on a cache-based language model and a universal language model
KR20190131065A (en) Detect Mission Changes During Conversation
US10242672B2 (en) Intelligent assistance in presentations
US11482223B2 (en) Systems and methods for automatically determining utterances, entities, and intents based on natural language inputs
US20230185834A1 (en) Data manufacturing frameworks for synthesizing synthetic training data to facilitate training a natural language to logical form model
KR20190000776A (en) Information inputting method
US20230214579A1 (en) Intelligent character correction and search in documents
CN116501960B (en) Content retrieval method, device, equipment and medium
US10885275B2 (en) Phrase placement for optimizing digital page
US20230185799A1 (en) Transforming natural language to structured query language based on multi-task learning and joint training
CN113342948A (en) Intelligent question and answer method and device
CN107424612A (en) Processing method, device and machine readable media
US20210312138A1 (en) System and method for handling out of scope or out of domain user inquiries
KR20220109238A (en) Device and method for providing recommended sentence related to utterance input of user
US20230376700A1 (en) Training data generation to facilitate fine-tuning embedding models
CN112149403A (en) Method and device for determining confidential text
WO2023200519A1 (en) Editing files using a pattern-completion engine
US20200175476A1 (en) Job identification for optimizing digital page
RU2759090C1 (en) Method for controlling a dialogue and natural language recognition system in a platform of virtual assistants

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20200727

Address after: 518000 Nanshan District science and technology zone, Guangdong, Zhejiang Province, science and technology in the Tencent Building on the 1st floor of the 35 layer

Patentee after: TENCENT TECHNOLOGY (SHENZHEN) Co.,Ltd.

Address before: 100029, Beijing, Chaoyang District new East Street, building No. 2, -3 to 25, 101, 8, 804 rooms

Patentee before: Tricorn (Beijing) Technology Co.,Ltd.

TR01 Transfer of patent right