WO2018227930A1 - Method and device for intelligently prompting answers - Google Patents

Method and device for intelligently prompting answers Download PDF

Info

Publication number
WO2018227930A1
WO2018227930A1 PCT/CN2017/118746 CN2017118746W WO2018227930A1 WO 2018227930 A1 WO2018227930 A1 WO 2018227930A1 CN 2017118746 W CN2017118746 W CN 2017118746W WO 2018227930 A1 WO2018227930 A1 WO 2018227930A1
Authority
WO
WIPO (PCT)
Prior art keywords
answer
semantic
question
feature
similarity
Prior art date
Application number
PCT/CN2017/118746
Other languages
French (fr)
Chinese (zh)
Inventor
王经委
张杰伟
张霄
贺坚
程涛远
Original Assignee
百度在线网络技术(北京)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 百度在线网络技术(北京)有限公司 filed Critical 百度在线网络技术(北京)有限公司
Publication of WO2018227930A1 publication Critical patent/WO2018227930A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation or dialogue systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/01Customer relationship services
    • G06Q30/015Providing customer assistance, e.g. assisting a customer within a business location or via helpdesk
    • G06Q30/016After-sales

Definitions

  • the present disclosure generally relates to the field of computer technologies, and in particular, to a method and apparatus for intelligently prompting answers.
  • the customer and the company's customer service communicate through web pages, communication software, and telephone. For example, the customer asks questions about the company's products or business, and the customer service gives answers to customer questions.
  • the customer asks questions about the company's products or business
  • the customer service gives answers to customer questions.
  • the company needs to conduct regular business training, and the cost is high.
  • Some systems are known to add a quick reply function to the above problems.
  • the customer service can set common answers in the system in advance, and when the customer asks the same question, they can choose the pre-set answer.
  • the drawback of this method is that it needs the customer service to judge which answer the customer question is matched with.
  • the preset number of answers is small, it is difficult to meet the demand.
  • the preset number of answers is large, the customer service is difficult to quickly find the matching answer and cannot respond quickly. Therefore, existing systems and methods can only assist with small, simple problems.
  • an embodiment of the present application provides a method for intelligently prompting an answer, including:
  • An information feature of the problem Q is analyzed and extracted, the information feature including a lexical feature S 2 (Q), a syntactic feature S 4 (Q), a semantic lexical feature S 3 (Q), and a semantic syntactic feature S 5 (Q);
  • each class information is calculated between Q and Q of the problem I library have problems similarity, similarity Sim is obtained and the problem Q Q I library have the problem of (Q, Q I), wherein the library have Contains the question Q i and the corresponding answer;
  • the problem library problem Q i is sorted according to the similarity Sim(Q, Q i ), and the answers of several question database questions Q i with the highest ranking result are provided to the user.
  • the method for intelligently prompting an answer includes: the problem library includes a public problem library and a private problem library; and the similarity of the questions in the question and question library Sim(Q, Q i ) is prioritized by the high priority and the private question. The principles are sorted; in response to the final answer determined by the user based on the provided answer; the question Q and the final answer are combined and added to the private question base.
  • the embodiment of the present application further provides an apparatus for intelligently prompting an answer, including:
  • a feature extraction unit configured to analyze and extract an information feature of the question Q, the information feature including a lexical feature S 2 (Q), a syntax feature S 4 (Q), a semantic lexical feature S 3 (Q), and a semantic syntax feature S 5 (Q);
  • Similarity similarity calculation unit wherein the configuration information for each class is calculated between Q and the problem library have problems of Q I, Q of the problem is obtained and the similarity Sim library have the problem of I Q (Q, Q i ), where the question library contains the question Q i and the corresponding answer;
  • the answer screening unit is configured to sort the problem library problem Q i according to the similarity Sim(Q, Q i ), and provide the answer of the plurality of problem library questions Q i with the highest ranking result to the user.
  • the embodiment of the present application further provides an apparatus, including:
  • One or more processors and memories are One or more processors and memories;
  • the memory includes instructions executable by the one or more processors to cause the one or more processors to perform the method of intelligent prompt answers provided in accordance with various embodiments of the present application.
  • an embodiment of the present application further provides a computer readable storage medium storing a computer program, the computer program causing a computer to perform a method for answering an intelligent prompt according to embodiments of the present application.
  • the method for intelligently prompting answers provided by the embodiments of the present application can obtain the similarity between the problem and the problem database problem by calculating the similarity degree of each type of information feature between the problem and the problem library problem, and can realize intelligent and accurate analysis of the problem; Sorting similarity helps to quickly filter the answers to the questions, so that the customer service can quickly and accurately answer customer questions, greatly improving the response speed of the customer service and the accuracy of the response;
  • the method for intelligent prompt answer provided by some embodiments of the present application further ensures the accuracy and consistency of the answer by configuring the public problem database and the private question, and satisfies the flexibility and personalization of the answer, and effectively improves the service of the customer service. quality.
  • FIG. 1 illustrates an exemplary flow chart of a method of intelligent prompting an answer in accordance with an embodiment of the present application
  • FIG. 2 is a schematic structural diagram of an apparatus for answering an intelligent prompt provided by an embodiment of the present application
  • FIG. 3 shows a schematic structural view of a device suitable for implementing an embodiment of the present application.
  • FIG. 1 illustrates an exemplary flow chart of a method of intelligently prompting an answer in accordance with an embodiment of the present application.
  • the method of intelligently prompting answers includes:
  • Step S10 analyzing and extracting information features of the question Q, the information features including lexical features S 2 (Q), syntactic features S 4 (Q), semantic lexical features S 3 (Q), and semantic syntactic features S 5 (Q).
  • the information features including lexical features S 2 (Q), syntactic features S 4 (Q), semantic lexical features S 3 (Q), and semantic syntactic features S 5 (Q).
  • step S10 may be, but is not limited to, implemented as follows:
  • the semantic lexical feature S 3 (Q) is adjacently combined to obtain a semantic syntactic feature S 5 (Q).
  • the participle sequence S 1 (Q) is subjected to part-of-speech tagging, and obtaining the lexical feature S 2 (Q) can be obtained as follows:
  • the lexical feature S 2 (Q) is the part of the part of speech tagging.
  • the lexical feature S 2 (Q) is semantically annotated according to the semantic dictionary, and the obtained semantic lexical feature S 3 (Q) can be obtained as follows:
  • the semantic lexical feature S 3 (Q) is the original sequence.
  • the above method for analyzing and extracting the feature information feature has four levels of lexical features S 2 (Q), syntactic features S 4 (Q), semantic lexical features S 3 (Q) and semantic syntactic features S 5 (Q) of the problem sentence.
  • the information is analyzed and extracted to lay the foundation for the accuracy of subsequent similarity calculations.
  • S 5 (Q) can achieve the same technical effect.
  • Step S20 the degree of similarity between information characterizing each class and the Q calculation library have problems Q i, Q obtain similarity Sim problems and problems of libraries question I Q (Q, Q i), wherein the library have Contains the question Q i and the corresponding answer.
  • the similarity Sim (S j (Q), S j (Q i )) of each type of information feature between the problem Q and the problem Q i in the problem library may be calculated as:
  • the similarity Sim(Q,Q i ) of the problem Q and the problem Q i in the problem library can be calculated as follows:
  • Com (S j (Q) , S j (Q i)) is S j (Q) and S j (Q i) of the common element number, Num (S j (Q) , S j (Q i))
  • is a smoothing parameter
  • the smoothing parameter ⁇ is configured according to experience and can be adjusted in the subsequent feedback based on the effect of the smart answer prompt. Set smoothing parameter ⁇ , avoid Com (S j (Q), S j (Q i)) and Num (S j (Q), S j (Q i)) resulting in little difference similarity Sim (S j (Q), S j (Q i )) calculation is inaccurate, effectively improving the discrimination between the two to improve the accuracy of the calculation.
  • the similarity Sim(S j (Q), S j (Q i )) of each type of information feature between the problem Q and the problem Q i in the problem library is calculated by the above formula (1), and the lexical method is used.
  • the information of the four levels of the feature S 2 (Q), the syntactic feature S 4 (Q), the semantic lexical feature S 3 (Q) and the semantic syntactic feature S 5 (Q) are combined, and the problem is calculated by the above formula (2).
  • library have a problem Q i of the similarity Sim (Q, Q i), thus greatly improving the accuracy and reliability of Q and Q i question bank similarity calculation.
  • the similarity of question Q to question Q i can be calculated for all questions Q i in the problem library, followed by subsequent operations. Such an embodiment can traverse all of the questions Q i in the problem library so as not to miss the appropriate answer.
  • the similarity of the problem Q to the partial problem may be calculated for only some of the questions in the problem library for subsequent processing. In such an embodiment, when the problem library is large, the candidate problem can be selected by preliminary screening (for example, keyword matching, etc.), thereby reducing the amount of calculation and further improving the response speed.
  • the method provided by the present application is not limited to the calculation methods of the above formulas (1) and (2), and more different calculation methods may be configured to calculate between the problem Q and the problem Q i in the problem library.
  • the similarity Sim (S j (Q), S j (Q i )) obtains the similarity of the problem and the problem in the problem library Sim(Q, Q i ), and the same technical effect can be achieved.
  • Step S30 Sorting the problem library problem Q i according to the similarity Sim(Q, Q i ), and providing the answer of the plurality of problem library questions Q i with the highest ranking result to the user.
  • the problem according to the similarity Sim (Q, Q i) of the library have problems Q I sort order is the result of the most forward arsenals Q I, which is the most similar to the problem of Q Therefore, its corresponding answer is provided to the user as the closest answer for reference.
  • the sorting and screening based on the similarity Sim(Q, Q i ) reduces the time for obtaining the prompt answer, and greatly improves the response speed of the reply.
  • the problem library comprises a public problem library and a private problem library, wherein the public problem library is composed of questions and standard answers in the collection business scope of the company, and the private problem library is customized for each user according to their own needs.
  • the problem library problem Q i is sorted according to the similarity high priority and private problem prioritization order, and some of the top results (for example, 3, 5, or 10) problem libraries are sorted.
  • the answer to question Q i is provided to the user.
  • the user determines the final answer as the answer to question Q; the question Q and the final answer are combined and added to the private question base.
  • the reasonable sorting and screening rules further help to obtain the optimal prompt answer quickly and accurately, thereby further improving the speed and accuracy of the question answer.
  • the user for example, customer service
  • the public problem database and the private problem database the accuracy and consistency of the reply are ensured, and the flexibility and personalization of the reply are realized.
  • Private problem libraries can be used only by users themselves or only by groups of users with permissions.
  • FIG. 2 is a schematic structural diagram of an apparatus for providing an intelligent prompt answer according to an embodiment of the present application.
  • the device of the smart cue answer shown in FIG. 2 may correspond to any of the methods previously described in connection with FIG.
  • the present application provides an apparatus for intelligently prompting an answer, including:
  • the feature extraction unit 10 is configured to analyze and extract information features of the question Q, including lexical features S 2 (Q), syntactic features S 4 (Q), semantic lexical features S 3 (Q), and semantic syntactic features S 5 (Q);
  • Similarity calculating unit 20 is configured to feature information of the similarity between each type of calculation and the library have problems Q Q I, Q and the problem is obtained similarity Sim library have the problem of I Q (Q, Q I ), where the problem library contains the question Q i and the corresponding answer;
  • the answer screening unit 30 configured according to the similarity of the question database question Q i sort, the sorting result answer forwardmost Problems arsenals Q i provided to the user.
  • the feature extraction unit 10 may be configured to analyze and extract the information features of the question Q as follows:
  • the semantic lexical feature S 3 (Q) is adjacently combined to obtain a semantic syntactic feature S 5 (Q).
  • the participle sequence S 1 (Q) is subjected to part-of-speech tagging, and obtaining the lexical feature S 2 (Q) can be obtained as follows:
  • the lexical feature S 2 (Q) is semantically annotated according to the semantic dictionary, and the obtained semantic lexical feature S 3 (Q) can be obtained as follows:
  • the semantic lexical feature S 3 (Q) is the original sequence.
  • the feature extraction unit 10 analyzes the extraction process for the question Q "How to improve my keyword ranking":
  • the feature extraction unit 10 may also analyze and extract the lexical features S 2 (Q), syntactic features S 4 (Q), and semantic lexical features S 3 (Q) of the problem Q according to actual needs. And the semantic syntactic feature S 5 (Q) can achieve the same technical effect.
  • the similarity calculation unit 20 may be configured to calculate the similarity Sim (S j (Q), S j (Q i )) of each type of information feature according to the following formula (1).
  • the similarity Sim(Q,Q i ) of the problem Qi in the problem and problem library can then be calculated according to the following formula (2):
  • the smoothing parameter ⁇ is configured according to experience and can be adjusted in the subsequent feedback based on the effect of the smart answer prompt. Setting the smoothing parameter ⁇ can avoid the similarity Sim (S j ) caused by the difference between Com(S j (Q), S j (Q i )) and Num(S j (Q), S j (Q i )) (Q), S j (Q i )) calculation is inaccurate, effectively improving the discrimination between the two to improve the accuracy of the calculation.
  • the similarity calculation unit 20 is configured to calculate the similarity Sim (S j (Q), S j (Q) of each type of information feature between the problem Q and the problem Q i in the problem library by the above formula (1). i )), and combines the information of the lexical feature S 2 (Q), the syntactic feature S 4 (Q), the semantic lexical feature S 3 (Q) and the semantic syntactic feature S 5 (Q), through the above formula (2) Calculate the similarity Sim(Q,Q i ) of the problem Q i in the problem and problem library, thereby greatly improving the accuracy and reliability of the similarity calculation of the problem Q and the problem library Q i .
  • the similarity calculation unit 20 can be used to calculate the similarity between the problem Q and the problem Q i for all the questions Q i in the problem library, and then perform subsequent operations. Such an embodiment can traverse all of the questions Q i in the problem library so as not to miss the appropriate answer.
  • the similarity calculation unit 20 can be used to calculate the similarity of the problem Q to the partial problem for only a part of the problem in the problem library for subsequent processing. In such an embodiment, when the problem library is large, the candidate problem can be selected by preliminary screening (for example, keyword matching, etc.), thereby reducing the amount of calculation and further improving the response speed.
  • the similarity calculation unit 20 provided by the present application is not limited to the calculation method by the above formulas (1), (2), and more different calculation methods can be configured to separately calculate the problem Q and the problem library.
  • the similarity of the information features Sim(S j (Q), S j (Q i )) yields the similarity of the problem and the problem in the problem library Sim(Q, Q i ), which can achieve the same technical effect.
  • the screening unit 30 answer for (Q, Q i) of the library have questions Q i are sorted according to the similarity Sim, sort result forwardmost library problems question Q i, i.e. The problem closest to question Q, and thus its corresponding answer is provided to the user as a reference for the closest answer.
  • the answer screening unit 30 performs sorting and screening based on the similarity Sim(Q, Q i ), which reduces the time for obtaining the prompt answer, and greatly improves the response speed of the reply.
  • the device for intelligently prompting the answer further includes:
  • the problem library maintenance unit 40 is configured to classify the problem library into a public question library and a private question library, and to add the question Q and the final answer combination to the private question in response to the final answer determined by the user based on the provided answer.
  • Library the public problem database is composed of questions and standard answers in the company's business scope, and the private problem database is customized for each user according to their own needs.
  • the answer screening unit 30 is configured to sort the problem library problem Q i according to the similarity order of high priority and private question priority, and select the top result (for example, 3, 5, or 10) questions for the question library Q i are provided to the user.
  • the answer screening unit 30 further helps to obtain the optimal prompt answer quickly and accurately through reasonable sorting and screening rules, thereby further improving the speed and accuracy of the question answer.
  • the user for example, customer service
  • the problem library maintenance unit 40 the public problem library and the private problem library are configured, which not only ensures the accuracy and consistency of the reply, but also realizes the flexibility and personalization of the reply. Private problem libraries can be used only by users themselves or only by groups of users with permissions.
  • FIG. 3 shows a schematic structural view of a device suitable for implementing an embodiment of the present application.
  • the device 300 includes a central processing unit (CPU) 301 that can be loaded from a program stored in a read only memory (ROM) 302 or a program loaded from a storage portion 308 into a random access memory (RAM) 303. Perform various appropriate actions and processes.
  • ROM read only memory
  • RAM random access memory
  • various programs and data required for the operation of the device 300 are also stored.
  • the CPU 301, the ROM 302, and the RAM 303 are connected to each other through a bus 304.
  • An input/output (I/O) interface 305 is also coupled to bus 304.
  • the following components are connected to the I/O interface 305: an input portion 306 including a keyboard, a mouse, etc.; an output portion 307 including, for example, a cathode ray tube (CRT), a liquid crystal display (LCD), and the like, and a storage portion 308 including a hard disk or the like. And a communication portion 309 including a network interface card such as a LAN card, a modem, or the like. The communication section 309 performs communication processing via a network such as the Internet.
  • Driver 310 is also connected to I/O interface 305 as needed.
  • a removable medium 311 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory or the like is mounted on the drive 310 as needed so that a computer program read therefrom is installed into the storage portion 308 as needed.
  • an embodiment of the present disclosure includes a computer program product comprising a computer program tangibly embodied on a machine readable medium, the computer program comprising program code for performing the method of FIG.
  • the computer program can be downloaded and installed from the network via the communication portion 309, and/or installed from the removable medium 311.
  • each block of the flowchart or block diagrams can represent a module, a program segment, or a portion of code that includes one or more logic for implementing the specified.
  • Functional executable instructions can also occur in a different order than that illustrated in the drawings. For example, two successively represented blocks may in fact be executed substantially in parallel, and they may sometimes be executed in the reverse order, depending upon the functionality involved.
  • each block of the block diagrams and/or flowcharts, and combinations of blocks in the block diagrams and/or flowcharts can be implemented in a dedicated hardware-based system that performs the specified function or operation. Or it can be implemented by a combination of dedicated hardware and computer instructions.
  • the units or modules described in the embodiments of the present application may be implemented by software or by hardware.
  • the described unit or module can also be provided in the processor.
  • the names of these units or modules do not in any way constitute a limitation on the unit or module itself.
  • the present application further provides a computer readable storage medium, which may be a computer readable storage medium included in the apparatus described in the foregoing embodiment, or may exist separately, not A computer readable storage medium that is assembled into the device.
  • a computer readable storage medium stores one or more programs that are used by one or more processors to perform the methods described herein.

Abstract

A method and device for intelligently prompting answers. The method comprises: analyzing and extracting information features of a question Q, the information features comprising lexical features S2 (Q), syntactic features S4 (Q), semantic lexical features S3 (Q), and semantic syntactic features S5 (Q) (S10); computing the similarity of each type of information features between the question Q and questions Qi in a question bank, and obtaining similarity Sim (Q, Qi) between the question Q and the questions Qi in the question bank, wherein the question bank contains the questions Qi and corresponding answers (S20); and sorting the questions Qi in the question bank according to similarity Sim (Q, Qi), and providing the answers to some questions Qi in the question bank which are among the most front of the sorted results to a user (S30). According to the method, intelligent and accurate analyses for the questions can be achieved, the corresponding answers to the questions are quickly screened out and obtained, and therefore the response speed and accuracy for answering the questions are greatly improved.

Description

智能提示答案的方法及装置Method and device for intelligent prompt answer
相关申请的交叉引用Cross-reference to related applications
本申请要求百度在线网络技术(北京)有限公司于2017年6月15日提交的、发明名称为“智能提示答案的方法及装置”的、中国专利申请号“201710452312.X”的优先权。This application claims the priority of the Chinese patent application number "201710452312.X" submitted by Baidu Online Network Technology (Beijing) Co., Ltd. on June 15, 2017, entitled "Method and Apparatus for Answering Intelligent Tips".
技术领域Technical field
本公开一般涉及计算机技术领域,具体涉及一种智能提示答案的方法及装置。The present disclosure generally relates to the field of computer technologies, and in particular, to a method and apparatus for intelligently prompting answers.
背景技术Background technique
随着电子商务平台的兴起,越来越多的行业开始与电子商务相结合。相比传统销售渠道的面对面直接沟通,如何同客户高效专业地进行线上沟通,对于提升公司的线上销售额具有十分重要的作用。With the rise of e-commerce platforms, more and more industries are beginning to combine with e-commerce. Compared with traditional sales channels, face-to-face direct communication, how to communicate effectively and professionally with customers online, is very important to enhance the company's online sales.
客服系统中,客户与公司客服通过网页、通讯软件、电话等方式沟通交流。例如:客户对公司产品或业务提出问题,客服针对客户问题给出答案。但由于客服人员对公司业务知识的局限性和差异性,难以保证所给答案的准确性和一致性。为弥补客服人员在业务知识上的不足,公司需要经常对其进行业务培训,成本较高。In the customer service system, the customer and the company's customer service communicate through web pages, communication software, and telephone. For example, the customer asks questions about the company's products or business, and the customer service gives answers to customer questions. However, due to the limitations and differences of customer service personnel's business knowledge, it is difficult to guarantee the accuracy and consistency of the answers given. In order to make up for the lack of customer service personnel in the business knowledge, the company needs to conduct regular business training, and the cost is high.
已知一些系统针对上述问题增加了快捷回复功能。利用该功能,客服可以预先在系统里设置常用回答,当客户提出相同问题时,可以选择预先设置好的答案。该方法的缺陷在于,需要客服判断客户问题是和哪个答案匹配,当预设置的答案数量较少时难以满足需求,当预设置的答案数量较多时客服难以快速找到匹配的答案,无法快速响应。因此,现有系统和方法只能辅助处理少量、简单的问题。Some systems are known to add a quick reply function to the above problems. With this function, the customer service can set common answers in the system in advance, and when the customer asks the same question, they can choose the pre-set answer. The drawback of this method is that it needs the customer service to judge which answer the customer question is matched with. When the preset number of answers is small, it is difficult to meet the demand. When the preset number of answers is large, the customer service is difficult to quickly find the matching answer and cannot respond quickly. Therefore, existing systems and methods can only assist with small, simple problems.
发明内容Summary of the invention
鉴于现有技术中的上述缺陷或不足,期望提供一种高效灵活且智能的提示答案的方法。In view of the above-mentioned drawbacks or deficiencies in the prior art, it is desirable to provide an efficient, flexible and intelligent method of prompting answers.
第一方面,本申请实施例提供了一种智能提示答案的方法,包括:In a first aspect, an embodiment of the present application provides a method for intelligently prompting an answer, including:
分析并提取问题Q的信息特征,该信息特征包括词法特征S 2(Q)、句法特征S 4(Q)、语义词法特征S 3(Q)和语义句法特征S 5(Q); An information feature of the problem Q is analyzed and extracted, the information feature including a lexical feature S 2 (Q), a syntactic feature S 4 (Q), a semantic lexical feature S 3 (Q), and a semantic syntactic feature S 5 (Q);
计算该问题Q和问题库中问题Q i之间的每一类信息特征的相似度,获得该问题Q和问题库中问题Q i的相似度Sim(Q,Q i),其中该问题库中包含问题Q i以及对应的答案; 以及 Wherein each class information is calculated between Q and Q of the problem I library have problems similarity, similarity Sim is obtained and the problem Q Q I library have the problem of (Q, Q I), wherein the library have Contains the question Q i and the corresponding answer;
根据该相似度Sim(Q,Q i)对问题库问题Q i进行排序,将排序结果最靠前的若干问题库问题Q i的答案提供给用户。 The problem library problem Q i is sorted according to the similarity Sim(Q, Q i ), and the answers of several question database questions Q i with the highest ranking result are provided to the user.
在一些实施例中,该智能提示答案的方法包括:问题库包括公共问题库和私有问题库;按照问题和问题库中问题的相似度Sim(Q,Q i)高优先和私有问题优先的排序原则进行排序;响应于用户基于所提供的答案确定的最终答案;将该问题Q和最终答案组合,添加至私有问题库。 In some embodiments, the method for intelligently prompting an answer includes: the problem library includes a public problem library and a private problem library; and the similarity of the questions in the question and question library Sim(Q, Q i ) is prioritized by the high priority and the private question. The principles are sorted; in response to the final answer determined by the user based on the provided answer; the question Q and the final answer are combined and added to the private question base.
第二方面,本申请实施例还提供了一种智能提示答案的装置,包括:In a second aspect, the embodiment of the present application further provides an apparatus for intelligently prompting an answer, including:
特征提取单元,配置用于分析并提取问题Q的信息特征,该信息特征包括词法特征S 2(Q)、句法特征S 4(Q)、语义词法特征S 3(Q)和语义句法特征S 5(Q); A feature extraction unit configured to analyze and extract an information feature of the question Q, the information feature including a lexical feature S 2 (Q), a syntax feature S 4 (Q), a semantic lexical feature S 3 (Q), and a semantic syntax feature S 5 (Q);
相似度计算单元,配置用于计算该问题Q和问题库中问题Q i之间的每一类信息特征的相似度,获得该问题Q和问题库中问题Q i的相似度Sim(Q,Q i),其中该问题库中包含问题Q i以及对应的答案;以及 Similarity similarity calculation unit, wherein the configuration information for each class is calculated between Q and the problem library have problems of Q I, Q of the problem is obtained and the similarity Sim library have the problem of I Q (Q, Q i ), where the question library contains the question Q i and the corresponding answer;
答案筛选单元,配置用于根据该相似度Sim(Q,Q i)对问题库问题Q i进行排序,将排序结果最靠前的若干问题库问题Q i的答案提供给用户。 The answer screening unit is configured to sort the problem library problem Q i according to the similarity Sim(Q, Q i ), and provide the answer of the plurality of problem library questions Q i with the highest ranking result to the user.
第三方面,本申请实施例还提供了一种设备,包括:In a third aspect, the embodiment of the present application further provides an apparatus, including:
一个或多个处理器和存储器;One or more processors and memories;
其中,存储器包含可由该一个或多个处理器执行的指令以使得该一个或多个处理器执行根据本申请各实施例提供的智能提示答案的方法。Therein, the memory includes instructions executable by the one or more processors to cause the one or more processors to perform the method of intelligent prompt answers provided in accordance with various embodiments of the present application.
第四方面,本申请实施例还提供了一种存储有计算机程序的计算机可读存储介质,该计算机程序使计算机执行根据本申请各实施例提供的智能提示答案的方法。In a fourth aspect, an embodiment of the present application further provides a computer readable storage medium storing a computer program, the computer program causing a computer to perform a method for answering an intelligent prompt according to embodiments of the present application.
本申请实施例提供的智能提示答案的方法,通过计算问题和问题库问题之间每一类信息特征的相似度,得到问题和问题库问题的相似度,能够实现对问题智能准确地分析;根据相似度排序,有助于快速筛选得到问题对应的答案,从而辅助客服能够快速准确地答复客户问题,大大提高了客服的响应速度及答复的准确性;The method for intelligently prompting answers provided by the embodiments of the present application can obtain the similarity between the problem and the problem database problem by calculating the similarity degree of each type of information feature between the problem and the problem library problem, and can realize intelligent and accurate analysis of the problem; Sorting similarity helps to quickly filter the answers to the questions, so that the customer service can quickly and accurately answer customer questions, greatly improving the response speed of the customer service and the accuracy of the response;
本申请一些实施例提供的智能提示答案的方法进一步通过配置公共问题库和私有问题,既保证了答案的准确性和一致性,又满足了回答的灵活性和个性化,有效提高了客服的服务质量。The method for intelligent prompt answer provided by some embodiments of the present application further ensures the accuracy and consistency of the answer by configuring the public problem database and the private question, and satisfies the flexibility and personalization of the answer, and effectively improves the service of the customer service. quality.
附图说明DRAWINGS
通过阅读参照以下附图所作的对非限制性实施例所作的详细描述,本申请的其它特征、目的和优点将会变得更明显:Other features, objects, and advantages of the present application will become more apparent from the detailed description of the accompanying drawings.
图1示出了根据本申请实施例的智能提示答案的方法的示例性流程图;FIG. 1 illustrates an exemplary flow chart of a method of intelligent prompting an answer in accordance with an embodiment of the present application;
图2示出了本申请一实施例提供的智能提示答案的装置的结构示意图;以及2 is a schematic structural diagram of an apparatus for answering an intelligent prompt provided by an embodiment of the present application;
图3示出了适于用来实现本申请实施例的设备的结构示意图。FIG. 3 shows a schematic structural view of a device suitable for implementing an embodiment of the present application.
具体实施方式detailed description
下面结合附图和实施例对本申请作进一步的详细说明。可以理解的是,此处所描述的具体实施例仅仅用于解释相关发明,而非对该发明的限定。另外还需要说明的是,为了便于描述,附图中仅示出了与发明相关的部分。The present application will be further described in detail below with reference to the accompanying drawings and embodiments. It is understood that the specific embodiments described herein are merely illustrative of the invention, rather than the invention. It should also be noted that, for the convenience of description, only parts related to the invention are shown in the drawings.
需要说明的是,在不冲突的情况下,本申请中的实施例及实施例中的特征可以相互组合。下面将参考附图并结合实施例来详细说明本申请。It should be noted that the embodiments in the present application and the features in the embodiments may be combined with each other without conflict. The present application will be described in detail below with reference to the accompanying drawings.
图1示出了根据本申请实施例的智能提示答案的方法的示例性流程图。FIG. 1 illustrates an exemplary flow chart of a method of intelligently prompting an answer in accordance with an embodiment of the present application.
如图1所示,该智能提示答案的方法包括:As shown in Figure 1, the method of intelligently prompting answers includes:
步骤S10:分析并提取问题Q的信息特征,该信息特征包括词法特征S 2(Q)、句法特征S 4(Q)、语义词法特征S 3(Q)和语义句法特征S 5(Q)。 Step S10: analyzing and extracting information features of the question Q, the information features including lexical features S 2 (Q), syntactic features S 4 (Q), semantic lexical features S 3 (Q), and semantic syntactic features S 5 (Q).
具体地,在本实施例中,步骤S10可以但不限于按照如下方式实现:Specifically, in this embodiment, step S10 may be, but is not limited to, implemented as follows:
对问题Q进行分词,获得分词序列S 1(Q); Segmenting the problem Q to obtain the word segmentation sequence S 1 (Q);
对分词序列S 1(Q)进行词性标注,获得词法特征S 2(Q); Performing part-of-speech tagging on the segmentation word sequence S 1 (Q) to obtain the lexical feature S 2 (Q);
根据语义词典对词法特征S 2(Q)进行语义标注,获得语义词法特征S 3(Q); Semantic annotation of the lexical feature S 2 (Q) according to the semantic dictionary, obtaining the semantic lexical feature S 3 (Q);
对词法特征S 2(Q)进行相邻组合,获得句法特征S 4(Q);以及 Performing adjacent combinations on the lexical feature S 2 (Q) to obtain a syntactic feature S 4 (Q);
对语义词法特征S 3(Q)进行相邻组合,获得语义句法特征S 5(Q)。 The semantic lexical feature S 3 (Q) is adjacently combined to obtain a semantic syntactic feature S 5 (Q).
优选地,对分词序列S 1(Q)进行词性标注,获得词法特征S 2(Q)可以按照如下方式得到: Preferably, the participle sequence S 1 (Q) is subjected to part-of-speech tagging, and obtaining the lexical feature S 2 (Q) can be obtained as follows:
对分词序列S 1(Q)进行词性标注,并保留其中的名词、动词、形容词和疑问词,得到词性标注序列; To perform part-of-speech tagging on the segmentation sequence S 1 (Q), and retain the nouns, verbs, adjectives and interrogative words, to obtain the part-of-speech tagging sequence;
词法特征S 2(Q)为该词性标注序列。 The lexical feature S 2 (Q) is the part of the part of speech tagging.
优选地,根据语义词典对词法特征S 2(Q)进行语义标注,获得语义词法特征S 3(Q)可以按照如下方式得到: Preferably, the lexical feature S 2 (Q) is semantically annotated according to the semantic dictionary, and the obtained semantic lexical feature S 3 (Q) can be obtained as follows:
查找语义词典,将词法特征S 2(Q)中登录在语义词典中的词语转换为对应的义原,并将未登录在语义词典中的词语作为义原保留,从而将词法特征S 2(Q)转换为义原序列; Finding a semantic dictionary, converting words registered in the semantic dictionary in the lexical feature S 2 (Q) into corresponding meanings, and retaining words not registered in the semantic dictionary as meanings, thereby lexical features S 2 (Q ) converted to a sequence of senses;
语义词法特征S 3(Q)为该义原序列。 The semantic lexical feature S 3 (Q) is the original sequence.
例如,对问题Q“怎么提高我的关键词排行”,分析提取过程如下:For example, for question Q "How to improve my keyword rankings", the analysis extraction process is as follows:
1.问句分词,得到对应的分词序列S 1(Q):怎么/提高/我/的/关键词/排行; 1. Question segmentation, get the corresponding word segmentation sequence S 1 (Q): how / improve / me / / keyword / ranking;
2.词性标注,并保留名词、动词、形容词和疑问词,得到词法特征S 2(Q),也就是词性标注序列:怎么(疑问词)/提高(动词)/关键词(名词)/排行(动词); 2. part of speech tagging, and retain nouns, verbs, adjectives and interrogative words, get lexical features S 2 (Q), that is, part of speech tagging sequence: how (question word) / improve (verb) / keyword (noun) / ranking ( verb);
3.查找语义词典,将词法特征S 2(Q)中登录在语义词典中的词语转换为对应的义原,并将未登录在语义词典中的词语作为义原保留,从而将词法特征S 2(Q)转换为义原序列,也就是语义词法特征S 3(Q):Ka35B0(怎么)/Ie12A0(提高)/关键词(关键词)/Dd07A0(排行),其中“关键词”未登录在语义词典中,则作为义原保留; 3. Find the semantic dictionary, convert the words registered in the semantic dictionary in the lexical feature S 2 (Q) into the corresponding semantics, and retain the words not registered in the semantic dictionary as the original, thereby the lexical feature S 2 (Q) is converted to the original sequence, that is, the semantic lexical feature S 3 (Q): Ka35B0 (how) / Ie12A0 (improvement) / keyword (keyword) / Dd07A0 (rank), where "keyword" is not registered in In the semantic dictionary, it is reserved as a Yiyuan;
4.对词法特征S 2(Q)进行相邻组合,得到句法特征S 4(Q):<怎么,提高>/<提高,关键词>/<关键词,排行>; 4. morphological characteristics S 2 (Q) for the adjacent combinations, syntactic features give S 4 (Q): <how to improve> / <increased keyword> / <keyword ranking>;
5.对语义词法特征S 3(Q)进行相邻组合,得到语义句法特征S 5(Q):<Ka35B0,Ie12A0>/<Ie12A0,关键词>/<关键词,Dd07A0>。 5. Perform adjacent combination on the semantic lexical feature S 3 (Q) to obtain the semantic syntax feature S 5 (Q): <Ka35B0, Ie12A0>/<Ie12A0, keyword>/<keyword, Dd07A0>.
上述分析并提取问题信息特征的方法,将问题语句的词法特征S 2(Q)、句法特征S 4(Q)、语义词法特征S 3(Q)和语义句法特征S 5(Q)四个层面的信息进行分析并提取,为后续相似度计算的准确性奠定了基础。 The above method for analyzing and extracting the feature information feature has four levels of lexical features S 2 (Q), syntactic features S 4 (Q), semantic lexical features S 3 (Q) and semantic syntactic features S 5 (Q) of the problem sentence. The information is analyzed and extracted to lay the foundation for the accuracy of subsequent similarity calculations.
在更多实施例中,还可以根据实际需求采用不同的方法分析并提取问题Q的词法特征S 2(Q)、句法特征S 4(Q)、语义词法特征S 3(Q)和语义句法特征S 5(Q),可实现相同的技术效果。 In more embodiments, different methods may be used to analyze and extract the lexical features S 2 (Q), syntactic features S 4 (Q), semantic lexical features S 3 (Q) and semantic syntactic features of the problem Q according to actual needs. S 5 (Q) can achieve the same technical effect.
步骤S20:计算问题Q和问题库中问题Q i之间的每一类信息特征的相似度,获得问题Q和问题库中问题Q i的相似度Sim(Q,Q i),其中问题库中包含问题Q i以及对应的答案。 Step S20: the degree of similarity between information characterizing each class and the Q calculation library have problems Q i, Q obtain similarity Sim problems and problems of libraries question I Q (Q, Q i), wherein the library have Contains the question Q i and the corresponding answer.
具体地,在本实施例中,问题Q和问题库中问题Q i之间每一类信息特征的相似度Sim(S j(Q),S j(Q i))的计算方式可以为: Specifically, in this embodiment, the similarity Sim (S j (Q), S j (Q i )) of each type of information feature between the problem Q and the problem Q i in the problem library may be calculated as:
Figure PCTCN2017118746-appb-000001
Figure PCTCN2017118746-appb-000001
基于上述每一类信息特征的相似度,问题Q和问题库中问题Q i的相似度Sim(Q,Q i)的计算方式继而可以为: Based on the similarity of each type of information feature described above, the similarity Sim(Q,Q i ) of the problem Q and the problem Q i in the problem library can be calculated as follows:
Figure PCTCN2017118746-appb-000002
Figure PCTCN2017118746-appb-000002
其中,Com(S j(Q),S j(Q i))为S j(Q)和S j(Q i)的公共元素数量,Num(S j(Q),S j(Q i))为S j(Q)和S j(Q i)中的最大元素数量,σ为平滑参数,元素为词语、词语组合、义原或义 原组合,j=2、3、4或5。 Wherein, Com (S j (Q) , S j (Q i)) is S j (Q) and S j (Q i) of the common element number, Num (S j (Q) , S j (Q i)) For the maximum number of elements in S j (Q) and S j (Q i ), σ is a smoothing parameter, and the element is a word, a word combination, a sense or a combination of sense, j=2, 3, 4 or 5.
平滑参数σ根据经验配置,可以在后续根据智能答案提示效果的反馈进行调整。设置平滑参数σ,可以避免Com(S j(Q),S j(Q i))和Num(S j(Q),S j(Q i))相差不大而导致的相似度Sim(S j(Q),S j(Q i))计算不准确,有效提高两者的区分度从而提高计算的准确性。 The smoothing parameter σ is configured according to experience and can be adjusted in the subsequent feedback based on the effect of the smart answer prompt. Set smoothing parameter σ, avoid Com (S j (Q), S j (Q i)) and Num (S j (Q), S j (Q i)) resulting in little difference similarity Sim (S j (Q), S j (Q i )) calculation is inaccurate, effectively improving the discrimination between the two to improve the accuracy of the calculation.
本实施例中,通过上述公式(1)计算问题Q和问题库中问题Q i之间每一类信息特征的相似度Sim(S j(Q),S j(Q i)),并将词法特征S 2(Q)、句法特征S 4(Q)、语义词法特征S 3(Q)和语义句法特征S 5(Q)四个层面的信息融合起来,通过上述公式(2)计算得到问题和问题库中问题Q i的相似度Sim(Q,Q i),从而大大提高了问题Q和问题库Q i相似度计算的准确性和可靠性。 In this embodiment, the similarity Sim(S j (Q), S j (Q i )) of each type of information feature between the problem Q and the problem Q i in the problem library is calculated by the above formula (1), and the lexical method is used. The information of the four levels of the feature S 2 (Q), the syntactic feature S 4 (Q), the semantic lexical feature S 3 (Q) and the semantic syntactic feature S 5 (Q) are combined, and the problem is calculated by the above formula (2). library have a problem Q i of the similarity Sim (Q, Q i), thus greatly improving the accuracy and reliability of Q and Q i question bank similarity calculation.
在一些实施例中,可以针对问题库中的所有问题Q i,计算问题Q与问题Q i的相似度,再进行后续操作。这种实施例可以遍历问题库中的所有问题Q i,以免漏掉合适的答案。在另一些实施例中,可以只针对问题库中的部分问题,计算问题Q与该部分问题的相似度,以供后续处理。在这种实施例中,当问题库较大时,可以先通过初步的筛选(例如,关键词匹配等)来选择候选问题,从而降低计算量,进一步提高响应速度。 In some embodiments, the similarity of question Q to question Q i can be calculated for all questions Q i in the problem library, followed by subsequent operations. Such an embodiment can traverse all of the questions Q i in the problem library so as not to miss the appropriate answer. In other embodiments, the similarity of the problem Q to the partial problem may be calculated for only some of the questions in the problem library for subsequent processing. In such an embodiment, when the problem library is large, the candidate problem can be selected by preliminary screening (for example, keyword matching, etc.), thereby reducing the amount of calculation and further improving the response speed.
在更多实施例中,本申请提供的方法不局限于上述公式(1)、(2)的计算方法,可配置更多的不同的计算方式分别计算问题Q和问题库中问题Q i之间的每一类信息特征的相似度Sim(S j(Q),S j(Q i))、问题和问题库中问题的相似度Sim(Q,Q i),只要通过每一类信息特征的相似度Sim(S j(Q),S j(Q i))得到问题和问题库中问题的相似度Sim(Q,Q i),即可实现同样的技术效果。 In more embodiments, the method provided by the present application is not limited to the calculation methods of the above formulas (1) and (2), and more different calculation methods may be configured to calculate between the problem Q and the problem Q i in the problem library. The similarity of each type of information feature Sim(S j (Q), S j (Q i )), the problem and the similarity of the problem in the problem library Sim(Q,Q i ), as long as each type of information feature The similarity Sim (S j (Q), S j (Q i )) obtains the similarity of the problem and the problem in the problem library Sim(Q, Q i ), and the same technical effect can be achieved.
步骤S30:根据相似度Sim(Q,Q i)对问题库问题Q i进行排序,将排序结果最靠前的若干问题库问题Q i的答案提供给用户。 Step S30: Sorting the problem library problem Q i according to the similarity Sim(Q, Q i ), and providing the answer of the plurality of problem library questions Q i with the highest ranking result to the user.
具体地,在本实施例中,根据相似度Sim(Q,Q i)对问题库中问题Q i进行排序,排序结果最靠前的问题库问题Q i,也就是与问题Q最相近的问题,因而其对应的答案作为最相近的答案提供给用户以作参考。 Particular problem, in the present embodiment, the problem according to the similarity Sim (Q, Q i) of the library have problems Q I sort order is the result of the most forward arsenals Q I, which is the most similar to the problem of Q Therefore, its corresponding answer is provided to the user as the closest answer for reference.
本实施例中,基于相似度Sim(Q,Q i)进行排序筛选,减少了获取提示答案的时间,大大提高了答复的响应速度。 In this embodiment, the sorting and screening based on the similarity Sim(Q, Q i ) reduces the time for obtaining the prompt answer, and greatly improves the response speed of the reply.
优选地,问题库包括公共问题库和私有问题库,其中,公共问题库为收集公司业务范围内问题和标准答案构成,私有问题库为每个用户根据自身需求自定义形成。Preferably, the problem library comprises a public problem library and a private problem library, wherein the public problem library is composed of questions and standard answers in the collection business scope of the company, and the private problem library is customized for each user according to their own needs.
进一步优选地,按照相似度高优先和私有问题优先的排序原则对对问题库问题Q i 进行排序,并将排序结果最靠前的若干(例如,3个,5个,或10个)问题库问题Q i的答案提供给用户。 Further preferably, the problem library problem Q i is sorted according to the similarity high priority and private problem prioritization order, and some of the top results (for example, 3, 5, or 10) problem libraries are sorted. The answer to question Q i is provided to the user.
进一步优选地,基于排序结果最靠前的若干问题库问题Qi的答案,用户确定最终答案作为问题Q的答复;将问题Q和该最终答案组合,添加至私有问题库。Further preferably, based on the answers to the top question number questions Qi of the ranking result, the user determines the final answer as the answer to question Q; the question Q and the final answer are combined and added to the private question base.
上述实施例中,通过合理的排序和筛选规则,进一步有助于快速准确地得到最优提示答案,从而进一步提高了问题答复的速度和准确性。用户(例如,客服)可以从提供的提示答案中,挑选合适的答案直接回答给客户,也可以在提示答案的基础上进行修改后间接回答给客户。进一步通过配置公共问题库和私有问题库,既保证了答复的准确性和一致性,又实现了答复的灵活性和个性化。私有问题库可以仅限用户自己使用,或者仅限有权限的用户群组使用。In the above embodiment, the reasonable sorting and screening rules further help to obtain the optimal prompt answer quickly and accurately, thereby further improving the speed and accuracy of the question answer. The user (for example, customer service) can directly answer the customer's answer by selecting the appropriate answer from the provided prompt answer, or indirectly after answering the prompt answer. Further, by configuring the public problem database and the private problem database, the accuracy and consistency of the reply are ensured, and the flexibility and personalization of the reply are realized. Private problem libraries can be used only by users themselves or only by groups of users with permissions.
图2示出了本申请一实施例提供的智能提示答案的装置的结构示意图。图2所示的智能提示答案的装置可对应执行前文结合图1描述的任一方法。FIG. 2 is a schematic structural diagram of an apparatus for providing an intelligent prompt answer according to an embodiment of the present application. The device of the smart cue answer shown in FIG. 2 may correspond to any of the methods previously described in connection with FIG.
如图2所示,在本实施例中,本申请提供一种智能提示答案的装置,包括:As shown in FIG. 2, in this embodiment, the present application provides an apparatus for intelligently prompting an answer, including:
特征提取单元10,配置用于分析并提取问题Q的信息特征,该信息特征包括词法特征S 2(Q)、句法特征S 4(Q)、语义词法特征S 3(Q)和语义句法特征S 5(Q); The feature extraction unit 10 is configured to analyze and extract information features of the question Q, including lexical features S 2 (Q), syntactic features S 4 (Q), semantic lexical features S 3 (Q), and semantic syntactic features S 5 (Q);
相似度计算单元20,配置用于计算问题Q和问题库中问题Q i之间的每一类信息特征的相似度,获得问题Q和问题库中问题Q i的相似度Sim(Q,Q i),其中问题库中包含问题Q i以及对应的答案;以及 Similarity calculating unit 20 is configured to feature information of the similarity between each type of calculation and the library have problems Q Q I, Q and the problem is obtained similarity Sim library have the problem of I Q (Q, Q I ), where the problem library contains the question Q i and the corresponding answer;
答案筛选单元30,配置用于根据相似度对问题库问题Q i进行排序,将排序结果最靠前的若干问题库问题Q i的答案提供给用户。 The answer screening unit 30, configured according to the similarity of the question database question Q i sort, the sorting result answer forwardmost Problems arsenals Q i provided to the user.
具体地,在本实施例中,特征提取单元10可以配置用于按如下分析并提取问题Q的信息特征:Specifically, in the present embodiment, the feature extraction unit 10 may be configured to analyze and extract the information features of the question Q as follows:
对问题Q进行分词,获得分词序列S 1(Q); Segmenting the problem Q to obtain the word segmentation sequence S 1 (Q);
对分词序列S 1(Q)进行词性标注,获得词法特征S 2(Q); Performing part-of-speech tagging on the segmentation word sequence S 1 (Q) to obtain the lexical feature S 2 (Q);
根据语义词典对词法特征S 2(Q)进行语义标注,获得语义词法特征S 3(Q); Semantic annotation of the lexical feature S 2 (Q) according to the semantic dictionary, obtaining the semantic lexical feature S 3 (Q);
对词法特征S 2(Q)进行相邻组合,获得句法特征S 4(Q);以及 Performing adjacent combinations on the lexical feature S 2 (Q) to obtain a syntactic feature S 4 (Q);
对语义词法特征S 3(Q)进行相邻组合,获得语义句法特征S 5(Q)。 The semantic lexical feature S 3 (Q) is adjacently combined to obtain a semantic syntactic feature S 5 (Q).
优选地,对分词序列S 1(Q)进行词性标注,获得词法特征S 2(Q)可以按照如下方式得到: Preferably, the participle sequence S 1 (Q) is subjected to part-of-speech tagging, and obtaining the lexical feature S 2 (Q) can be obtained as follows:
对分词序列S 1(Q)进行词性标注,并保留其中的名词、动词、形容词和疑问词,得到词性标注序列; To perform part-of-speech tagging on the segmentation sequence S 1 (Q), and retain the nouns, verbs, adjectives and interrogative words, to obtain the part-of-speech tagging sequence;
词法特征S 2(Q)为该词性标注序列。 Lexical wherein S 2 (Q) for the part of speech tagging sequence.
优选地,根据语义词典对词法特征S 2(Q)进行语义标注,获得语义词法特征S 3(Q)可以按照如下方式得到: Preferably, the lexical feature S 2 (Q) is semantically annotated according to the semantic dictionary, and the obtained semantic lexical feature S 3 (Q) can be obtained as follows:
查找语义词典,将词法特征S 2(Q)中登录在语义词典中的词语转换为对应的义原,并将未登录在语义词典中的词语作为义原保留,从而将词法特征S 2(Q)转换为义原序列; Finding a semantic dictionary, converting words registered in the semantic dictionary in the lexical feature S 2 (Q) into corresponding meanings, and retaining words not registered in the semantic dictionary as meanings, thereby lexical features S 2 (Q ) converted to a sequence of senses;
语义词法特征S 3(Q)为该义原序列。 The semantic lexical feature S 3 (Q) is the original sequence.
例如,特征提取单元10对问题Q“怎么提高我的关键词排行”,分析提取过程如下:For example, the feature extraction unit 10 analyzes the extraction process for the question Q "How to improve my keyword ranking":
1.问句分词,得到对应的分词序列S 1(Q):怎么/提高/我/的/关键词/排行; 1. Question segmentation, get the corresponding word segmentation sequence S 1 (Q): how / improve / me / / keyword / ranking;
2.词性标注,并保留名词、动词、形容词和疑问词,得到词法特征S 2(Q),也就是词性标注序列:怎么(疑问词)/提高(动词)/关键词(名词)/排行(动词); 2. part of speech tagging, and retain nouns, verbs, adjectives and interrogative words, get lexical features S 2 (Q), that is, part of speech tagging sequence: how (question word) / improve (verb) / keyword (noun) / ranking ( verb);
3.查找语义词典,将词法特征S 2(Q)中登录在语义词典中的词语转换为对应的义原,并将未登录在语义词典中的词语作为义原保留,从而将词法特征S 2(Q)转换为义原序列,也就是语义词法特征S 3(Q):Ka35B0(怎么)/Ie12A0(提高)/关键词(关键词)/Dd07A0(排行),其中“关键词”未登录在语义词典中,则作为义原保留; 3. Find the semantic dictionary, convert the words registered in the semantic dictionary in the lexical feature S 2 (Q) into the corresponding semantics, and retain the words not registered in the semantic dictionary as the original, thereby the lexical feature S 2 (Q) is converted to the original sequence, that is, the semantic lexical feature S 3 (Q): Ka35B0 (how) / Ie12A0 (improvement) / keyword (keyword) / Dd07A0 (rank), where "keyword" is not registered in In the semantic dictionary, it is reserved as a Yiyuan;
4.对词法特征S 2(Q)进行相邻组合,得到句法特征S 4(Q):<怎么,提高>/<提高,关键词>/<关键词,排行>; 4. Perform adjacent combination on the lexical feature S 2 (Q) to obtain the syntactic feature S 4 (Q): <how, improve>/<improvement, keyword>/<keyword, ranking>;
5.对语义词法特征S 3(Q)进行相邻组合,得到语义句法特征S 5(Q):<Ka35B0,Ie12A0>/<Ie12A0,关键词>/<关键词,Dd07A0>。 5. Perform adjacent combination on the semantic lexical feature S 3 (Q) to obtain the semantic syntax feature S 5 (Q): <Ka35B0, Ie12A0>/<Ie12A0, keyword>/<keyword, Dd07A0>.
上述分析并提取问题信息特征的方法,将问题语句的词法特征S 2(Q)、句法特征S 4(Q)、语义词法特征S 3(Q)和语义句法特征S 5(Q)四个层面的信息进行分析并提取,为后续相似度计算的准确性奠定了基础。 The method of analyzing and extracting information features above problem, the problem statement lexical feature S 2 (Q), characterized in syntactic S 4 (Q), characterized in lexical semantic S 3 (Q) and Semantic Syntax wherein S 5 (Q) four levels The information is analyzed and extracted to lay the foundation for the accuracy of subsequent similarity calculations.
在更多实施例中,特征提取单元10还可以根据实际需求采用不同的方法分析并提取问题Q的词法特征S 2(Q)、句法特征S 4(Q)、语义词法特征S 3(Q)和语义句法特征S 5(Q),可实现相同的技术效果。 In more embodiments, the feature extraction unit 10 may also analyze and extract the lexical features S 2 (Q), syntactic features S 4 (Q), and semantic lexical features S 3 (Q) of the problem Q according to actual needs. And the semantic syntactic feature S 5 (Q) can achieve the same technical effect.
具体地,在本实施例中,相似度计算单元20可以用于按照下式(1)计算每一类信息特征的相似度Sim(S j(Q),S j(Q i)) Specifically, in the present embodiment, the similarity calculation unit 20 may be configured to calculate the similarity Sim (S j (Q), S j (Q i )) of each type of information feature according to the following formula (1).
Figure PCTCN2017118746-appb-000003
Figure PCTCN2017118746-appb-000003
基于上述每一类信息特征的相似度,继而可以按照下式(2)计算问题和问题库中问题Qi的相似度Sim(Q,Q i): Based on the similarity of each type of information feature described above, the similarity Sim(Q,Q i ) of the problem Qi in the problem and problem library can then be calculated according to the following formula (2):
Figure PCTCN2017118746-appb-000004
Figure PCTCN2017118746-appb-000004
其中,Com(S j(Q),S j(Q i))为S j(Q)和S j(Q i)的公共元素数量,Num(S j(Q),S j(Q i))为S j(Q)和S j(Q i)中的最大元素数量,σ为平滑参数,元素为词语、词语组合、义原或义原组合,j=2、3、4或5。 Where Com(S j (Q), S j (Q i )) is the number of common elements of S j (Q) and S j (Q i ), Num(S j (Q), S j (Q i )) the maximum number of elements S j (Q) and S j (Q i), σ is a smoothing parameter, elements words, combinations of words, the original sense, or a combination of the original sense, j = 2,3,4 or 5.
平滑参数σ根据经验配置,可以在后续根据智能答案提示效果的反馈进行调整。设置平滑参数σ,可以避免Com(S j(Q),S j(Q i))和Num(S j(Q),S j(Q i))相差不大而导致的相似度Sim(S j(Q),S j(Q i))计算不准确,有效提高两者的区分度从而提高计算的准确性。 The smoothing parameter σ is configured according to experience and can be adjusted in the subsequent feedback based on the effect of the smart answer prompt. Setting the smoothing parameter σ can avoid the similarity Sim (S j ) caused by the difference between Com(S j (Q), S j (Q i )) and Num(S j (Q), S j (Q i )) (Q), S j (Q i )) calculation is inaccurate, effectively improving the discrimination between the two to improve the accuracy of the calculation.
本实施例中,相似度计算单元20用于通过上述公式(1)计算问题Q和问题库中问题Q i之间每一类信息特征的相似度Sim(S j(Q),S j(Q i)),并将词法特征S 2(Q)、句法特征S 4(Q)、语义词法特征S 3(Q)和语义句法特征S 5(Q)四个层面的信息融合起来,通过上述公式(2)计算得到问题和问题库中问题Q i的相似度Sim(Q,Q i),从而大大提高了问题Q和问题库Q i相似度计算的准确性和可靠性。 In this embodiment, the similarity calculation unit 20 is configured to calculate the similarity Sim (S j (Q), S j (Q) of each type of information feature between the problem Q and the problem Q i in the problem library by the above formula (1). i )), and combines the information of the lexical feature S 2 (Q), the syntactic feature S 4 (Q), the semantic lexical feature S 3 (Q) and the semantic syntactic feature S 5 (Q), through the above formula (2) Calculate the similarity Sim(Q,Q i ) of the problem Q i in the problem and problem library, thereby greatly improving the accuracy and reliability of the similarity calculation of the problem Q and the problem library Q i .
在一些实施例中,相似度计算单元20可以用于针对问题库中的所有问题Q i,计算问题Q与问题Q i的相似度,再进行后续操作。这种实施例可以遍历问题库中的所有问题Q i,以免漏掉合适的答案。在另一些实施例中,相似度计算单元20可以用于只针对问题库中的部分问题,计算问题Q与该部分问题的相似度,以供后续处理。在这种实施例中,当问题库较大时,可以先通过初步的筛选(例如,关键词匹配等)来选择候选问题,从而降低计算量,进一步提高响应速度。 In some embodiments, the similarity calculation unit 20 can be used to calculate the similarity between the problem Q and the problem Q i for all the questions Q i in the problem library, and then perform subsequent operations. Such an embodiment can traverse all of the questions Q i in the problem library so as not to miss the appropriate answer. In other embodiments, the similarity calculation unit 20 can be used to calculate the similarity of the problem Q to the partial problem for only a part of the problem in the problem library for subsequent processing. In such an embodiment, when the problem library is large, the candidate problem can be selected by preliminary screening (for example, keyword matching, etc.), thereby reducing the amount of calculation and further improving the response speed.
在更多实施例中,本申请提供的相似度计算单元20不局限于通过上述公式(1)、(2)的计算方法,可配置更多的不同的计算方式分别计算问题Q和问题库中问题Q i之间的每一类信息特征的相似度Sim(S j(Q),S j(Q i))、问题和问题库中问题的相似度Sim(Q,Q i),只要通过每一类信息特征的相似度Sim(S j(Q),S j(Q i))得到问题和问题库中问题的相似度Sim(Q,Q i),即可实现同样的技术效果。 In more embodiments, the similarity calculation unit 20 provided by the present application is not limited to the calculation method by the above formulas (1), (2), and more different calculation methods can be configured to separately calculate the problem Q and the problem library. The similarity of each type of information feature between questions Q i (S j (Q), S j (Q i )), the similarity of the problem and the problem in the problem library Sim(Q, Q i ), as long as each pass The similarity of the information features Sim(S j (Q), S j (Q i )) yields the similarity of the problem and the problem in the problem library Sim(Q, Q i ), which can achieve the same technical effect.
具体地,在本实施例中,答案筛选单元30,用于根据相似度Sim(Q,Q i)对问题库中问题Q i进行排序,排序结果最靠前的问题库问题Q i,也就是与问题Q最相近的问题,因而其对应的答案作为最相近的答案提供给用户以作参考。 In particular, in the present embodiment, the screening unit 30 answer for (Q, Q i) of the library have questions Q i are sorted according to the similarity Sim, sort result forwardmost library problems question Q i, i.e. The problem closest to question Q, and thus its corresponding answer is provided to the user as a reference for the closest answer.
本实施例中,答案筛选单元30基于相似度Sim(Q,Q i)进行排序筛选,减少了获取提示答案的时间,大大提高了答复的响应速度。 In this embodiment, the answer screening unit 30 performs sorting and screening based on the similarity Sim(Q, Q i ), which reduces the time for obtaining the prompt answer, and greatly improves the response speed of the reply.
优选地,智能提示答案的装置还包括:Preferably, the device for intelligently prompting the answer further includes:
问题库维护单元40,配置用于将问题库分类为公共问题库和私有问题库,以及响应于用户基于所提供的答案确定的最终答案,将问题Q和所述最终答案组合,添加至私有问题库。其中,公共问题库为收集公司业务范围内问题和标准答案构成,私有问题库为每个用户根据自身需求自定义形成。The problem library maintenance unit 40 is configured to classify the problem library into a public question library and a private question library, and to add the question Q and the final answer combination to the private question in response to the final answer determined by the user based on the provided answer. Library. Among them, the public problem database is composed of questions and standard answers in the company's business scope, and the private problem database is customized for each user according to their own needs.
进一步优选地,该答案筛选单元30,配置用于按照相似度高优先和私有问题优先的排序原则对对问题库问题Q i进行排序,并将排序结果最靠前的若干(例如,3个,5个,或10个)问题库问题Q i的答案提供给用户。 Further preferably, the answer screening unit 30 is configured to sort the problem library problem Q i according to the similarity order of high priority and private question priority, and select the top result (for example, 3, 5, or 10) questions for the question library Q i are provided to the user.
上述实施例中,答案筛选单元30通过合理的排序和筛选规则,进一步有助于快速准确地得到最优提示答案,从而进一步提高了问题答复的速度和准确性。用户(例如,客服)可以从提供的提示答案中,挑选合适的答案直接回答给客户,也可以在提示答案的基础上进行修改后间接回答给客户。进一步通过问题库维护单元40,配置公共问题库和私有问题库,既保证了答复的准确性和一致性,又实现了答复的灵活性和个性化。私有问题库可以仅限用户自己使用,或者仅限有权限的用户群组使用。In the above embodiment, the answer screening unit 30 further helps to obtain the optimal prompt answer quickly and accurately through reasonable sorting and screening rules, thereby further improving the speed and accuracy of the question answer. The user (for example, customer service) can directly answer the customer's answer by selecting the appropriate answer from the provided prompt answer, or indirectly after answering the prompt answer. Further, through the problem library maintenance unit 40, the public problem library and the private problem library are configured, which not only ensures the accuracy and consistency of the reply, but also realizes the flexibility and personalization of the reply. Private problem libraries can be used only by users themselves or only by groups of users with permissions.
图3示出了适于用来实现本申请实施例的设备的结构示意图。FIG. 3 shows a schematic structural view of a device suitable for implementing an embodiment of the present application.
如图3所示,设备300包括中央处理单元(CPU)301,其可以根据存储在只读存储器(ROM)302中的程序或者从存储部分308加载到随机访问存储器(RAM)303中的程序而执行各种适当的动作和处理。在RAM 303中,还存储有设备300操作所需的各种程序和数据。CPU 301、ROM 302以及RAM 303通过总线304彼此相连。输入/输出(I/O)接口305也连接至总线304。As shown in FIG. 3, the device 300 includes a central processing unit (CPU) 301 that can be loaded from a program stored in a read only memory (ROM) 302 or a program loaded from a storage portion 308 into a random access memory (RAM) 303. Perform various appropriate actions and processes. In the RAM 303, various programs and data required for the operation of the device 300 are also stored. The CPU 301, the ROM 302, and the RAM 303 are connected to each other through a bus 304. An input/output (I/O) interface 305 is also coupled to bus 304.
以下部件连接至I/O接口305:包括键盘、鼠标等的输入部分306;包括诸如阴极射线管(CRT)、液晶显示器(LCD)等以及扬声器等的输出部分307;包括硬盘等的存储部分308;以及包括诸如LAN卡、调制解调器等的网络接口卡的通信部分309。通信部分309经由诸如因特网的网络执行通信处理。驱动器310也根据需要连接至I/O接口305。可拆卸介质311,诸如磁盘、光盘、磁光盘、半导体存储器等等,根据需要安装在驱动器310上,以便于从其上读出的计算机程序根据需要被安装入存储部分308。The following components are connected to the I/O interface 305: an input portion 306 including a keyboard, a mouse, etc.; an output portion 307 including, for example, a cathode ray tube (CRT), a liquid crystal display (LCD), and the like, and a storage portion 308 including a hard disk or the like. And a communication portion 309 including a network interface card such as a LAN card, a modem, or the like. The communication section 309 performs communication processing via a network such as the Internet. Driver 310 is also connected to I/O interface 305 as needed. A removable medium 311 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory or the like is mounted on the drive 310 as needed so that a computer program read therefrom is installed into the storage portion 308 as needed.
特别地,根据本公开的实施例,上文参考图1描述的过程可以被实现为计算机软件程序。例如,本公开的实施例包括一种计算机程序产品,其包括有形地包含在机器可读介质上的计算机程序,该计算机程序包含用于执行图1的方法的程序代码。在这样的实施例中,该计算机程序可以通过通信部分309从网络上被下载和安装,和/或从可拆卸介质311被安装。In particular, according to an embodiment of the present disclosure, the process described above with reference to FIG. 1 may be implemented as a computer software program. For example, an embodiment of the present disclosure includes a computer program product comprising a computer program tangibly embodied on a machine readable medium, the computer program comprising program code for performing the method of FIG. In such an embodiment, the computer program can be downloaded and installed from the network via the communication portion 309, and/or installed from the removable medium 311.
附图中的流程图和框图,图示了按照本发明各种实施例的系统、方法和计算机程 序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,所述模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products in accordance with various embodiments of the present invention. In this regard, each block of the flowchart or block diagrams can represent a module, a program segment, or a portion of code that includes one or more logic for implementing the specified. Functional executable instructions. It should also be noted that in some alternative implementations, the functions noted in the blocks may also occur in a different order than that illustrated in the drawings. For example, two successively represented blocks may in fact be executed substantially in parallel, and they may sometimes be executed in the reverse order, depending upon the functionality involved. It is also noted that each block of the block diagrams and/or flowcharts, and combinations of blocks in the block diagrams and/or flowcharts, can be implemented in a dedicated hardware-based system that performs the specified function or operation. Or it can be implemented by a combination of dedicated hardware and computer instructions.
描述于本申请实施例中所涉及到的单元或模块可以通过软件的方式实现,也可以通过硬件的方式来实现。所描述的单元或模块也可以设置在处理器中。这些单元或模块的名称在某种情况下并不构成对该单元或模块本身的限定。The units or modules described in the embodiments of the present application may be implemented by software or by hardware. The described unit or module can also be provided in the processor. The names of these units or modules do not in any way constitute a limitation on the unit or module itself.
作为另一方面,本申请还提供了一种计算机可读存储介质,该计算机可读存储介质可以是上述实施例中所述装置中所包含的计算机可读存储介质;也可以是单独存在,未装配入设备中的计算机可读存储介质。计算机可读存储介质存储有一个或者一个以上程序,所述程序被一个或者一个以上的处理器用来执行描述于本申请的方法。In another aspect, the present application further provides a computer readable storage medium, which may be a computer readable storage medium included in the apparatus described in the foregoing embodiment, or may exist separately, not A computer readable storage medium that is assembled into the device. A computer readable storage medium stores one or more programs that are used by one or more processors to perform the methods described herein.
以上描述仅为本申请的较佳实施例以及对所运用技术原理的说明。本领域技术人员应当理解,本申请中所涉及的发明范围,并不限于上述技术特征的特定组合而成的技术方案,同时也应涵盖在不脱离所述发明构思的情况下,由上述技术特征或其等同特征进行任意组合而形成的其它技术方案。例如上述特征与本申请中公开的(但不限于)具有类似功能的技术特征进行互相替换而形成的技术方案。The above description is only a preferred embodiment of the present application and a description of the principles of the applied technology. It should be understood by those skilled in the art that the scope of the invention referred to in the present application is not limited to the specific combination of the above technical features, and should also be covered by the above technical features without departing from the inventive concept. Other technical solutions formed by any combination of their equivalent features. For example, the above features are combined with the technical features disclosed in the present application, but are not limited to the technical features having similar functions.

Claims (17)

  1. 一种智能提示答案的方法,其特征在于,所述方法包括:A method for intelligently prompting an answer, characterized in that the method comprises:
    分析并提取问题Q的信息特征,所述信息特征包括词法特征S 2(Q)、句法特征S 4(Q)、语义词法特征S 3(Q)和语义句法特征S 5(Q); Analyzing and extracting information features of the question Q, the information features including lexical features S 2 (Q), syntactic features S 4 (Q), semantic lexical features S 3 (Q), and semantic syntactic features S 5 (Q);
    计算所述问题Q和问题库中问题Q i之间的每一类信息特征的相似度,获得所述问题Q和问题库中问题Q i的相似度Sim(Q,Q i),其中所述问题库中包含问题Q i以及对应的答案;以及 Similarity information characteristic of each class is calculated between Q and the library have problems question Q i, Q to obtain the similarity Sim problems and problems of libraries question I Q (Q, Q i), wherein said The problem library contains the question Q i and the corresponding answer;
    根据所述相似度Sim(Q,Q i)对问题库问题Q i进行排序,将排序结果最靠前的若干问题库问题Q i的答案提供给用户。 The problem library problem Q i is sorted according to the similarity Sim(Q, Q i ), and the answers of several question database questions Q i with the highest ranking result are provided to the user.
  2. 根据权利要求1所述的智能提示答案的方法,其特征在于,所述分析并提取问题Q的信息特征,包括:The method for intelligently prompting an answer according to claim 1, wherein the analyzing and extracting information features of the question Q comprises:
    对所述问题Q进行分词,获得分词序列S 1(Q); Performing word segmentation on the problem Q to obtain a word segmentation sequence S 1 (Q);
    对所述分词序列S 1(Q)进行词性标注,获得词法特征S 2(Q); Performing part-of-speech tagging on the segmentation sequence S 1 (Q) to obtain a lexical feature S 2 (Q);
    根据语义词典对所述词法特征S 2(Q)进行语义标注,获得语义词法特征S 3(Q); Semanticly labeling the lexical feature S 2 (Q) according to a semantic dictionary to obtain a semantic lexical feature S 3 (Q);
    对所述词法特征S 2(Q)进行相邻组合,获得句法特征S 4(Q);以及 Performing adjacent combinations on the lexical feature S 2 (Q) to obtain a syntactic feature S 4 (Q);
    对所述语义词法特征S 3(Q)进行相邻组合,获得语义句法特征S 5(Q)。 The semantic lexical feature S 3 (Q) is adjacently combined to obtain a semantic syntactic feature S 5 (Q).
  3. 根据权利要求2所述的智能提示答案的方法,其特征在于,所述对分词序列S 1(Q)进行词性标注,获得词法特征S 2(Q),包括: The method for intelligently prompting an answer according to claim 2, wherein the participle word sequence S 1 (Q) is subjected to part-of-speech tagging, and the lexical feature S 2 (Q) is obtained, including:
    对所述分词序列S 1(Q)进行词性标注,并保留其中的名词、动词、形容词和疑问词,得到词性标注序列; Performing part-of-speech tagging on the segmentation sequence S 1 (Q), and retaining nouns, verbs, adjectives and interrogative words therein to obtain a part-of-speech tagging sequence;
    所述词法特征S 2(Q)为所述词性标注序列。 The lexical feature S 2 (Q) is the part of speech tagging sequence.
  4. 根据权利要求2所述的智能提示答案的方法,其特征在于,所述根据语义词典对所述词法特征S 2(Q)进行语义标注,获得语义词法特征S 3(Q),包括: The smart-answer method according to claim 2, wherein said lexical feature according to the semantic lexicon S 2 (Q) semantic annotation, to obtain semantic morphological characteristics S 3 (Q), comprising:
    查找语义词典,将所述词法特征S 2(Q)中登录在所述语义词典中的词语转换为对应的义原,并将未登录在所述语义词典中的词语作为义原保留,从而将所述词法特征S 2(Q)转换为义原序列; Searching a semantic dictionary, converting words registered in the semantic dictionary in the lexical feature S 2 (Q) into corresponding semantics, and retaining words not registered in the semantic dictionary as meanings, thereby Converting the lexical feature S 2 (Q) to a sense sequence;
    所述语义词法特征S 3(Q)为所述义原序列。 The semantic lexical feature S 3 (Q) is the semantic sequence.
  5. 根据权利要求1-4任一项所述的智能提示答案的方法,其特征在于,按照下式计算所述每一类信息特征的相似度Sim(S j(Q),S j(Q i)): The method for intelligently prompting an answer according to any one of claims 1 to 4, characterized in that the similarity Sim (S j (Q), S j (Q i ) of each type of information feature is calculated according to the following formula ):
    Figure PCTCN2017118746-appb-100001
    Figure PCTCN2017118746-appb-100001
    其中,Com(S j(Q),S j(Q i))为S j(Q)和S j(Q i)的公共元素数量,Num(S j(Q),S j(Q i))为 S j(Q)和S j(Q i)中的最大元素数量,σ为平滑参数,元素为词语、词语组合、义原或义原组合,j=2、3、4或5。 Where Com(S j (Q), S j (Q i )) is the number of common elements of S j (Q) and S j (Q i ), Num(S j (Q), S j (Q i )) For the maximum number of elements in S j (Q) and S j (Q i ), σ is a smoothing parameter, and the element is a word, a word combination, a sense or a combination of sense, j=2, 3, 4 or 5.
  6. 根据权利要求1-5任一项所述的智能提示答案的方法,其特征在于,按照下式计算所述问题和问题库中问题Q i的相似度Sim(Q,Q i): The method for intelligently prompting an answer according to any one of claims 1 to 5, characterized in that the similarity Sim(Q, Q i ) of the problem Q i in the problem and the problem library is calculated according to the following formula:
    Figure PCTCN2017118746-appb-100002
    Figure PCTCN2017118746-appb-100002
    其中,Sim(S j(Q),S j(Q i))为每一类信息特征的相似度,j=2、3、4或5。 Where, Sim(S j (Q), S j (Q i )) is the similarity of each type of information feature, j=2, 3, 4 or 5.
  7. 根据权利要求1-6任一项所述的智能提示答案的方法,其特征在于,所述问题库包括公共问题库和私有问题库。The method of intelligently prompting an answer according to any one of claims 1 to 6, wherein the problem library comprises a public problem library and a private problem library.
  8. 根据权利要求1-7任一项所述的智能提示答案的方法,其特征在于,所述根据所述相似度对问题库问题Q i进行排序,将排序结果最靠前的若干问题库问题Q i的答案提供给用户,包括: The method for intelligently answering an answer according to any one of claims 1 to 7, wherein the problem library problem Q i is sorted according to the similarity, and the problem database Q with the highest ranking result is Q. The answer to i is provided to the user, including:
    按照相似度高优先和私有问题优先的排序原则进行排序。Sort by the order of similarity high priority and private problem first.
  9. 根据权利要求1-8任一项所述的智能提示答案的方法,其特征在于,所述方法还包括:The method of intelligently prompting an answer according to any one of claims 1-8, wherein the method further comprises:
    响应于用户基于所提供的答案确定的最终答案;Responding to the final answer determined by the user based on the provided answer;
    将所述问题Q和所述最终答案组合,添加至所述私有问题库。The question Q and the final answer are combined and added to the private question base.
  10. 一种智能提示答案的装置,其特征在于,所述装置包括:A device for intelligently prompting an answer, characterized in that the device comprises:
    特征提取单元,配置用于分析并提取问题Q的信息特征,所述信息特征包括词法特征S 2(Q)、句法特征S 4(Q)、语义词法特征S 3(Q)和语义句法特征S 5(Q); A feature extraction unit configured to analyze and extract information features of the question Q, the information features including lexical features S 2 (Q), syntactic features S 4 (Q), semantic lexical features S 3 (Q), and semantic syntactic features S 5 (Q);
    相似度计算单元,配置用于计算所述问题Q和问题库中问题Q i之间的每一类信息特征的相似度,获得所述问题Q和问题库中问题Q i的相似度Sim(Q,Q i),其中所述问题库中包含问题Q i以及对应的答案;以及 Similarity similarity calculation unit, wherein the configuration information for each class is calculated between Q and the problem library have problems of Q I, Q is obtained and the problem of the similarity Sim library have the problem of I Q (Q , Q i ), wherein the problem library contains the question Q i and the corresponding answer;
    答案筛选单元,配置用于根据所述相似度Sim(Q,Q i)对问题库问题Q i进行排序,将排序结果最靠前的若干问题库问题Q i的答案提供给用户。 Answer screening unit, configured according to the similarity Sim (Q, Q i) Q i arsenals problem sorted, the sorted result answer forwardmost Problems arsenals Q i provided to the user.
  11. 根据权利要求10所述的智能提示答案的装置,其特征在于,所述特征提取单元配置用于按如下分析并提取问题Q的信息特征:The apparatus for intelligently answering an answer according to claim 10, wherein the feature extraction unit is configured to analyze and extract information features of the question Q as follows:
    对所述问题Q进行分词,获得分词序列S 1(Q); Performing word segmentation on the problem Q to obtain a word segmentation sequence S 1 (Q);
    对所述分词序列S 1(Q)进行词性标注,获得词法特征S 2(Q); The word sequence S 1 (Q) part of speech tagging, to obtain morphological characteristics S 2 (Q);
    根据语义词典对所述词法特征S 2(Q)进行语义标注,获得语义词法特征S 3(Q); Semanticly labeling the lexical feature S 2 (Q) according to a semantic dictionary to obtain a semantic lexical feature S 3 (Q);
    对所述词法特征S 2(Q)进行相邻组合,获得句法特征S 4(Q);以及 The morphological characteristics of S 2 (Q) for the adjacent composition obtained syntactic features S 4 (Q); and
    对所述语义词法特征S 3(Q)进行相邻组合,获得语义句法特征S 5(Q)。 The semantic lexical feature S 3 (Q) is adjacently combined to obtain a semantic syntactic feature S 5 (Q).
  12. 根据权利要求10-11任一项所述的智能提示答案的装置,其特征在于,所述相似度计算单元配置用于按照下式计算所述每一类信息特征的相似度Sim(S j(Q),S j(Q i)): The apparatus for intelligently answering an answer according to any one of claims 10 to 11, wherein the similarity calculation unit is configured to calculate a similarity Sim (S j ( Q), S j (Q i )):
    Figure PCTCN2017118746-appb-100003
    Figure PCTCN2017118746-appb-100003
    其中,Com(S j(Q),S j(Q i))为S j(Q)和S j(Q i)的公共元素数量,Num(S j(Q),S j(Q i))为S j(Q)和S j(Q i)中的最大元素数量,σ为平滑参数,元素为词语、词语组合、义原或义原组合,j=2、3、4或5。 Wherein, Com (S j (Q) , S j (Q i)) is S j (Q) and S j (Q i) of the common element number, Num (S j (Q) , S j (Q i)) For the maximum number of elements in S j (Q) and S j (Q i ), σ is a smoothing parameter, and the element is a word, a word combination, a sense or a combination of sense, j=2, 3, 4 or 5.
  13. 根据权利要求10-12任一项所述的智能提示答案的装置,其特征在于,所述相似度计算单元配置用于按照下式计算所述问题和问题库中问题Q i的相似度Sim(Q,Q i): The apparatus for intelligently prompting an answer according to any one of claims 10 to 12, wherein the similarity calculation unit is configured to calculate a similarity Sim of the problem Q i in the problem and the problem library according to the following formula ( Q,Q i ):
    Figure PCTCN2017118746-appb-100004
    Figure PCTCN2017118746-appb-100004
    其中,Sim(S j(Q),S j(Q i))为每一类信息特征的相似度,j=2、3、4或5。 Where, Sim(S j (Q), S j (Q i )) is the similarity of each type of information feature, j=2, 3, 4 or 5.
  14. 根据权利要求10-13任一项所述的智能提示答案的装置,其特征在于,所述装置还包括:The device for intelligently prompting an answer according to any one of claims 10 to 13, wherein the device further comprises:
    问题库维护单元,配置用于将所述问题库分类为公共问题库和私有问题库,以及响应于用户基于所提供的答案确定的最终答案,将所述问题Q和所述最终答案组合,添加至所述私有问题库。a problem library maintenance unit configured to classify the problem library into a public question library and a private question library, and to combine the question Q and the final answer in response to a final answer determined by the user based on the provided answer, adding To the private problem library.
  15. 根据权利要求10-14任一项所述的智能提示答案的装置,其特征在于,所述答案筛选单元配置用于按照相似度高优先和私有问题优先的排序原则进行排序。The apparatus for intelligently prompting an answer according to any one of claims 10 to 14, wherein the answer screening unit is configured to sort according to a ranking principle of similarity high priority and private question priority.
  16. 一种设备,其特征在于,所述设备包括:A device, characterized in that the device comprises:
    一个或多个处理器;One or more processors;
    存储装置,用于存储一个或多个程序,a storage device for storing one or more programs,
    当所述一个或多个程序被所述一个或多个处理器执行,使得所述一个或多个处理器实现如权利要求1-9中任一所述的方法。The one or more programs are executed by the one or more processors such that the one or more processors implement the method of any of claims 1-9.
  17. 一种计算机可读存储介质,其上存储有计算机程序,其特征在于,该程序被处理器执行时实现如权利要求1-9中任一所述的方法。A computer readable storage medium having stored thereon a computer program, wherein the program, when executed by a processor, implements the method of any of claims 1-9.
PCT/CN2017/118746 2017-06-15 2017-12-26 Method and device for intelligently prompting answers WO2018227930A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710452312.XA CN107436916B (en) 2017-06-15 2017-06-15 Intelligent answer prompting method and device
CN201710452312X 2017-06-15

Publications (1)

Publication Number Publication Date
WO2018227930A1 true WO2018227930A1 (en) 2018-12-20

Family

ID=60458402

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/118746 WO2018227930A1 (en) 2017-06-15 2017-12-26 Method and device for intelligently prompting answers

Country Status (2)

Country Link
CN (1) CN107436916B (en)
WO (1) WO2018227930A1 (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107436916B (en) * 2017-06-15 2021-04-27 百度在线网络技术(北京)有限公司 Intelligent answer prompting method and device
CN109033265B (en) * 2018-07-09 2020-10-16 苏州浪潮智能软件有限公司 Man-machine interaction method and device
CN110209768B (en) * 2019-05-31 2021-08-10 中国联合网络通信集团有限公司 Question processing method and device for automatic question answering
CN110647614B (en) * 2019-08-01 2023-05-23 平安科技(深圳)有限公司 Intelligent question-answering method, device, medium and electronic equipment
CN111931498B (en) * 2020-08-14 2024-03-01 中国工商银行股份有限公司 User online question processing method and system based on complexity analysis
CN112581954B (en) * 2020-12-01 2023-08-04 杭州九阳小家电有限公司 High-matching voice interaction method and intelligent device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101286161A (en) * 2008-05-28 2008-10-15 华中科技大学 Intelligent Chinese request-answering system based on concept
CN104503998A (en) * 2014-12-05 2015-04-08 百度在线网络技术(北京)有限公司 Type identifying method and device aiming at query sentence of user
CN104598445A (en) * 2013-11-01 2015-05-06 腾讯科技(深圳)有限公司 Automatic question-answering system and method
CN104850539A (en) * 2015-05-28 2015-08-19 宁波薄言信息技术有限公司 Natural language understanding method and travel question-answering system based on same
CN107436916A (en) * 2017-06-15 2017-12-05 百度在线网络技术(北京)有限公司 The method and device of intelligent prompt answer

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101373532A (en) * 2008-07-10 2009-02-25 昆明理工大学 FAQ Chinese request-answering system implementing method in tourism field
CN102456060A (en) * 2010-10-28 2012-05-16 株式会社日立制作所 Information processing device and information processing method
CN103810218B (en) * 2012-11-14 2018-06-08 北京百度网讯科技有限公司 A kind of automatic question-answering method and device based on problem cluster
US9666098B2 (en) * 2013-02-15 2017-05-30 Voxy, Inc. Language learning systems and methods
CN103577556B (en) * 2013-10-21 2017-01-18 北京奇虎科技有限公司 Device and method for obtaining association degree of question and answer pair
CN103902652A (en) * 2014-02-27 2014-07-02 深圳市智搜信息技术有限公司 Automatic question-answering system
CN103970729B (en) * 2014-04-29 2016-08-24 河海大学 A kind of multi-threaded extracting method based on semantic category
CN104573028B (en) * 2015-01-14 2019-01-25 百度在线网络技术(北京)有限公司 Realize the method and system of intelligent answer

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101286161A (en) * 2008-05-28 2008-10-15 华中科技大学 Intelligent Chinese request-answering system based on concept
CN104598445A (en) * 2013-11-01 2015-05-06 腾讯科技(深圳)有限公司 Automatic question-answering system and method
CN104503998A (en) * 2014-12-05 2015-04-08 百度在线网络技术(北京)有限公司 Type identifying method and device aiming at query sentence of user
CN104850539A (en) * 2015-05-28 2015-08-19 宁波薄言信息技术有限公司 Natural language understanding method and travel question-answering system based on same
CN107436916A (en) * 2017-06-15 2017-12-05 百度在线网络技术(北京)有限公司 The method and device of intelligent prompt answer

Also Published As

Publication number Publication date
CN107436916A (en) 2017-12-05
CN107436916B (en) 2021-04-27

Similar Documents

Publication Publication Date Title
WO2018227930A1 (en) Method and device for intelligently prompting answers
CN108829893B (en) Method and device for determining video label, storage medium and terminal equipment
US10102254B2 (en) Confidence ranking of answers based on temporal semantics
WO2020108608A1 (en) Search result processing method, device, terminal, electronic device, and storage medium
US10831762B2 (en) Extracting and denoising concept mentions using distributed representations of concepts
US10146862B2 (en) Context-based metadata generation and automatic annotation of electronic media in a computer network
Gupta et al. A survey of text question answering techniques
US11514235B2 (en) Information extraction from open-ended schema-less tables
CN108280114B (en) Deep learning-based user literature reading interest analysis method
Fang et al. Word-of-mouth understanding: Entity-centric multimodal aspect-opinion mining in social media
US20210407499A1 (en) Automatically generating conference minutes
CN112667794A (en) Intelligent question-answer matching method and system based on twin network BERT model
CN111831802B (en) Urban domain knowledge detection system and method based on LDA topic model
US11023503B2 (en) Suggesting text in an electronic document
WO2019169858A1 (en) Searching engine technology based data analysis method and system
US11188819B2 (en) Entity model establishment
US11699034B2 (en) Hybrid artificial intelligence system for semi-automatic patent infringement analysis
US10558692B2 (en) Minimum coordination passage scoring
WO2020237872A1 (en) Method and apparatus for testing accuracy of semantic analysis model, storage medium, and device
US20150293901A1 (en) Utilizing Temporal Indicators to Weight Semantic Values
CN111309916B (en) Digest extracting method and apparatus, storage medium, and electronic apparatus
CN113157887A (en) Knowledge question-answering intention identification method and device and computer equipment
CN109992723B (en) User interest tag construction method based on social network and related equipment
CN111222032A (en) Public opinion analysis method and related equipment
CN116090450A (en) Text processing method and computing device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17913142

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 27.03.2020)

122 Ep: pct application non-entry in european phase

Ref document number: 17913142

Country of ref document: EP

Kind code of ref document: A1