WO2020119063A1 - Expert knowledge recommendation method and apparatus, computer device, and storage medium - Google Patents

Expert knowledge recommendation method and apparatus, computer device, and storage medium Download PDF

Info

Publication number
WO2020119063A1
WO2020119063A1 PCT/CN2019/092507 CN2019092507W WO2020119063A1 WO 2020119063 A1 WO2020119063 A1 WO 2020119063A1 CN 2019092507 W CN2019092507 W CN 2019092507W WO 2020119063 A1 WO2020119063 A1 WO 2020119063A1
Authority
WO
WIPO (PCT)
Prior art keywords
expert
vector
semantic
target
keyword
Prior art date
Application number
PCT/CN2019/092507
Other languages
French (fr)
Chinese (zh)
Inventor
吴壮伟
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2020119063A1 publication Critical patent/WO2020119063A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri

Definitions

  • This application relates to the field of artificial intelligence technology, and in particular, to a method, device, computer equipment, and storage medium for recommending expert knowledge.
  • the existing intelligent online customer service can only answer simple questions. For highly professional questions (such as legal issues, professional technical problems in the high-tech field), the existing intelligent online customer service cannot provide accurate answers.
  • the embodiments of the present application provide an expert knowledge recommendation method, device, computer equipment, and storage medium, which are designed to solve the user's professional problems in the prior art and need to be consulted to obtain a reply. Answers obtained by a person who edits manually cannot obtain a timely response to the consultation, and the professional degree of the answer is greatly limited by the professional knowledge of the respondent.
  • the embodiment of the present application provides an expert knowledge recommendation method, which includes: receiving the uploaded consultation questions to be answered, segmenting and extracting the consultation questions to be answered, and obtaining consultation with the pending answers
  • the semantic network vector corresponding to the question includes: calculating the similarity between the semantic network vector and the semantic vector included in the pre-built reply answer library to obtain the similarity between the reply answer library and the semantic network vector is greater than the preset similarity
  • the semantic vector of degree threshold as the target semantic vector; obtain the expert list corresponding to the target semantic vector; and obtain the heat value of each expert in the expert list, and sort the semantic vectors in descending order according to the heat value of the expert to obtain the ranking
  • After the semantic vector obtain the semantic vector ranked before the preset first ranking value in the sorted semantic vector to obtain the filtered semantic vector; obtain the corresponding semantic vector in the filtered semantic vector in the reply answer library
  • To obtain expert knowledge recommendation information and send the expert knowledge recommendation information to the uploader corresponding to the consultation question to be answered.
  • an embodiment of the present application provides an expert knowledge recommendation device, which includes: a consultation question acquisition unit for receiving the uploaded consultation questions to be answered, and segmenting and extracting the consultation questions to be answered, Obtain a semantic network vector corresponding to the query question to be answered; a target semantic vector acquisition unit for calculating the similarity between the semantic network vector and the semantic vector included in the pre-built reply answer database to obtain a reply answer database
  • the semantic vector whose similarity between the semantic network vector and the semantic network vector is greater than a preset similarity threshold is used as the target semantic vector;
  • the expert list obtaining unit is used to obtain the expert list corresponding to the target semantic vector; and the sorting unit is used To obtain the heat value of each expert in the expert list, sort the semantic vectors in descending order according to the heat value of the experts to obtain a sorted semantic vector, and obtain the ranking of the sorted semantic vector before the preset first ranking value Semantic vectors to obtain the filtered semantic vectors; consultation reply unit, used to obtain the corresponding response content of each semantic vector
  • an embodiment of the present application further provides a computer device, including a memory, a processor, and a computer program stored on the memory and executable on the processor, the processor executing the computer
  • the program implements the expert knowledge recommendation method described in the first aspect above.
  • an embodiment of the present application also provides a computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the processor causes the processor to execute the first On the one hand, the expert knowledge recommendation method.
  • FIG. 1 is a schematic diagram of an application scenario of an expert knowledge recommendation method provided by an embodiment of this application.
  • FIG. 2 is a schematic flowchart of an expert knowledge recommendation method provided by an embodiment of the application
  • FIG. 3 is a schematic diagram of a sub-process of a method for recommending expert knowledge provided by an embodiment of the present application
  • FIG. 4 is a schematic diagram of another sub-process of the expert knowledge recommendation method provided by the embodiment of the present application.
  • FIG. 5 is a schematic block diagram of an expert knowledge recommendation device provided by an embodiment of this application.
  • FIG. 6 is a schematic block diagram of a subunit of an expert knowledge recommendation device provided by an embodiment of this application.
  • FIG. 7 is a schematic block diagram of another subunit of an expert knowledge recommendation device provided by an embodiment of this application.
  • FIG. 8 is a schematic block diagram of a computer device provided by an embodiment of the present application.
  • FIG. 1 is a schematic diagram of an application scenario of an expert knowledge recommendation method provided by an embodiment of the present application
  • FIG. 2 is a schematic flowchart of an expert knowledge recommendation method provided by an embodiment of the present application.
  • the expert knowledge recommendation method is applied to In the server, this method is executed by the application software installed in the server.
  • the method includes steps S110-S150.
  • the consultation question to be answered can be converted into a quantized multidimensional row vector or multidimensional column vector, At this time, the approximate question and its answer can be searched in the pre-built reply answer database according to the consultation question to be answered, and the obtained reply is more accurate.
  • step S110 includes:
  • S111 Segment the consultation question to be answered by a probability-based word segmentation model to obtain a word segmentation result corresponding to the consultation question to be answered;
  • S113 Acquire a target word vector corresponding to each keyword information in the target keyword set
  • the word segmentation process of the query question to be answered through a probability-based word segmentation model is as follows:
  • the word segmentation model based on probability statistics can find the target word string W, so that W satisfies: P(W
  • C) MAX(P(Wa
  • Word segmentation model the word string W obtained by the above word segmentation model is the word string whose estimated probability is the largest. which is:
  • all candidate words w1, w2, ..., wi, ..., wn are taken from left to right; the probability value P(wi) of each candidate word is found in the dictionary, and Record all the left neighbor words of each candidate word; calculate the cumulative probability of each candidate word, and compare to get the best left neighbor word of each candidate word; if the current word wn is the last word of the string S, and the cumulative probability P (wn) is the largest, then wn is the end word of S; from wn, in accordance with the order from right to left, the best left neighbor of each word is output in turn, that is, the word segmentation result of S.
  • the word segmentation results are extracted through the word frequency-inverse text frequency index model (ie, TF-IDF model, TF-IDF is the abbreviation of Term, Frequency-Inverse Document Frequency)
  • TF-IDF is the abbreviation of Term, Frequency-Inverse Document Frequency
  • the keyword information before the preset second ranking value is used as the target keyword set.
  • the keyword information before the preset ranking value in the word segmentation result is extracted through the TF-IDF model, as follows:
  • IDF i lg[total number of documents in the corpus/(number of documents containing the participle+1)];
  • the denominator is larger, and the inverse document frequency is smaller and closer to 0. The reason why the denominator is increased by 1 is to avoid the denominator being 0 (that is, all documents do not contain the word).
  • TF-IDF is directly proportional to the number of occurrences of a word in the document, and inversely proportional to the number of occurrences of the word in the entire language. Therefore, automatically extracting keywords is to calculate the TF-IDF value of each participle of the document, and then arrange them in descending order, and take the top N words as the keyword list of the document.
  • the target word vector corresponding to each keyword in the target keyword set can be correspondingly acquired.
  • the word vector corresponding to the keyword information is obtained based on a pre-constructed vocabulary table query.
  • the process of acquiring the word vector is called word2vec, and its function is to convert words in natural language into dense vectors that can be understood by the computer.
  • a corpus that is, a vocabulary
  • AA, BB, CC, and DD where AA, BB, CC, and DD represent a Chinese word
  • the words are converted into discrete individual symbols through One-Hot Encoder (one-hot code), and then converted into low-dimensional continuous values, that is, dense vectors, through Word2Vec dimensionality reduction, and words with similar meanings will be mapped To a similar location in vector space.
  • One-Hot Encoder one-hot code
  • the weight corresponding to each target word vector can be obtained.
  • the consultation questions to be answered can be obtained.
  • Corresponding semantic network vector The specific calculation formula is as follows:
  • Vector refers to the semantic network vector corresponding to the query question to be answered
  • Word_Embedding(kwi) is the target word vector i
  • ⁇ i is the weight corresponding to the target word vector i.
  • each piece of answer data in the reply answer database is in the format of expert name, answer content, keyword combination, and semantic vector; where the keyword combination is the answer content, extracted by the TF-IDF model
  • the top N keywords are combined to form a keyword combination (where N is a custom-set value in the server, such as setting N equal to the second ranking value +1);
  • the semantic vector is each keyword corresponding to the content of the answer, and
  • the weight corresponding to each keyword is calculated, and the calculation process is the same as obtaining the semantic network vector corresponding to the query question to be answered.
  • the data of the target answer library is as follows:
  • the reply answer database is pre-built, when the semantic network vector corresponding to the consultation question to be answered is obtained, the similarity calculation of the semantic network vector and the semantic vector included in the pre-built reply answer database can be performed In order to obtain the answer content with high correlation with the semantic network vector, based on the semantic network matching question and the answer in the question database, it is more able to identify similar questions and improve the quality of question answering.
  • step S120 includes:
  • the target keyword set corresponding to the question to be answered is obtained before the semantic network vector corresponding to the question to be answered.
  • the keyword combination including the target keyword set in the reply answer library may be obtained by comparing the target keyword set with the keyword combination in the reply answer library to obtain a keyword matching result .
  • the similarity between the semantic vector corresponding to each keyword combination in the keyword matching result and the semantic network vector is calculated.
  • it can be calculated by the following formula:
  • a and b represent two vectors respectively
  • is the angle between vector a and vector b.
  • the similarity threshold is set to 0.5. It can be seen that through the above process, it is possible to quickly screen and obtain the answer content with a high correlation with the semantic network vector.
  • the target semantic vector is obtained, that is, the answer content that is highly relevant to the query question to be answered is obtained, each answer content corresponds to a piece of answer data in the answer answer library, and each answer data is corresponding to An expert name, so after acquiring the target semantic vector, the name of the expert corresponding to each semantic vector included in the target semantic vector can be obtained, thereby forming an expert list.
  • S140 Obtain the heat value of each expert in the expert list, sort the semantic vectors in descending order according to the heat value of the experts to obtain a sorted semantic vector, and obtain the ranking in the sorted semantic vector before the preset first ranking value To obtain the semantic vector after filtering.
  • each expert in the expert list corresponding to the target semantic vector may be statistically calculated to rank the heat value
  • the content of the answer of the top expert is regarded as the content of priority recommendation, that is, the answer of the expert trusted by the user can be more recommended, and the accuracy is improved.
  • obtaining the heat value of each expert in the expert list in step S140 includes:
  • the total number of times the articles of each expert in the expert list are cited can be used as the heat value of the expert. If the expert publishes multiple articles, the total number of times that each article is cited in the multiple articles is summed to obtain the expert's heat value.
  • obtaining the heat value of each expert in the expert list in step S140 includes:
  • a directed social network structure of experts can be constructed, where the subject is the name of each expert, and the directed side refers to, Expert A quotes expert B's article, then expert A points to expert B.
  • the directed boundary value is the reference value with time decay factor.
  • value k represents the heat value of expert k in the expert list
  • the reference value between other experts i and expert k in the expert list is
  • the publication time of the article of expert k in the expert list is T 0
  • the citation time of the article of expert k in the expert list cited by other experts i is T
  • is the preset adjustment parameter (such as setting the adjustment parameter to 0.5).
  • a semantic vector ranked before the preset first ranking value in the sorted semantic vector is obtained (For example, the top 10 semantic vectors are selected, and the first ranking value is set to 11), and the answer content corresponding to the semantic vectors to obtain expert knowledge recommendation information, which is pushed to the uploading end corresponding to the question to be answered,
  • the pushed expert knowledge recommendation information includes sorting, answer content, and expert name.
  • the pushed expert knowledge recommendation information is as follows:
  • the uploading end corresponding to the query question to be answered can obtain a highly reliable reply content.
  • This method adopts semantic recognition technology to recommend expert answers trusted by users to improve the accuracy of recommendations, and based on semantic network matching questions and answers in the answer answer library, it can identify similar questions and improve the quality of question answers.
  • An embodiment of the present application further provides an expert knowledge recommendation device, which is used to execute any of the foregoing embodiments of the expert knowledge recommendation method.
  • FIG. 5 is a schematic block diagram of an expert knowledge recommendation device provided by an embodiment of the present application.
  • the expert knowledge recommendation device 100 can be configured in a server.
  • the expert knowledge recommendation device 100 includes a consultation question acquisition unit 110, a target semantic vector acquisition unit 120, an expert list acquisition unit 130, a sorting unit 140, and a consultation reply unit 150.
  • the consultation question obtaining unit 110 is configured to receive the uploaded consultation questions to be answered, perform word segmentation and keyword extraction on the consultation questions to be answered, and obtain a semantic network vector corresponding to the consultation questions to be answered.
  • the consultation question obtaining unit 110 includes:
  • the word segmentation unit 111 is configured to perform word segmentation on the consultation question to be answered based on a probability statistical word segmentation model to obtain a word segmentation result corresponding to the consultation question to be answered;
  • the keyword extraction unit 112 is used to extract the keyword information located before the preset second ranking value in the word segmentation result through the word frequency-inverse text frequency index model as the target keyword set;
  • a target word vector acquiring unit 113 configured to acquire a target word vector corresponding to each keyword information in the target keyword set;
  • the semantic network vector obtaining unit 114 is configured to obtain a semantic network vector corresponding to the question to be answered according to each target word vector and the weight corresponding to each target word vector.
  • the target semantic vector acquiring unit 120 is configured to calculate the similarity between the semantic network vector and the semantic vector included in the pre-built reply answer database, and obtain that the similarity between the reply answer database and the semantic network vector is greater than the Set the semantic vector of the similarity threshold as the target semantic vector.
  • the target semantic vector acquisition unit 120 includes:
  • a target keyword set obtaining unit 121 configured to obtain a target keyword set corresponding to the semantic network vector
  • the keyword comparison unit 122 is used for comparing the target keyword set with the keyword combination in the reply answer library, and acquiring the keyword combination including the target keyword set in the reply answer library, to Get keyword matching results;
  • the similarity set acquisition unit 123 is used to calculate the cosine of the angle between the semantic vector corresponding to each keyword combination in the keyword matching result and the semantic network vector to obtain each key in the keyword matching result
  • the similarity between the word combination and the semantic network vector is used as a similarity set
  • the target similarity set acquisition unit 124 is configured to acquire a similarity greater than the similarity threshold in the similarity set to obtain a target similarity set;
  • the target similarity set parsing unit 125 is used to obtain a semantic vector corresponding to each similarity in the target similarity set as a target semantic vector.
  • the expert list obtaining unit 130 is configured to obtain an expert list corresponding to the target semantic vector.
  • the sorting unit 140 is configured to obtain the heat value of each expert in the expert list, sort the semantic vectors in descending order according to the heat value of the experts to obtain the sorted semantic vector, and obtain the ranked position in the semantic vector after the sorting in the preset A semantic vector before the ranking value to obtain the filtered semantic vector.
  • the sorting unit 140 is further used to:
  • the sorting unit 140 is further used to:
  • the consultation reply unit 150 is used to obtain the corresponding response content of each semantic vector in the filtered semantic vector in the reply answer library to obtain expert knowledge recommendation information, and send the expert knowledge recommendation information to the consultation question to be answered The corresponding upload end.
  • the device uses semantic recognition technology to recommend expert answers trusted by users to improve the accuracy of recommendations, and based on semantic network matching questions and answers in the answer answer library, it can identify similar questions and improve the quality of question answers.
  • the above expert knowledge recommendation device may be implemented in the form of a computer program, and the computer program may run on a computer device as shown in FIG. 8.
  • FIG. 8 is a schematic block diagram of a computer device provided by an embodiment of the present application.
  • the computer device 500 is a server.
  • the server may be an independent server or a server cluster composed of multiple servers.
  • the computer device 500 includes a processor 502, a memory, and a network interface 505 connected through a system bus 501, where the memory may include a non-volatile storage medium 503 and an internal memory 504.
  • the non-volatile storage medium 503 can store an operating system 5031 and a computer program 5032.
  • the processor 502 can execute the expert knowledge recommendation method.
  • the processor 502 is used to provide computing and control capabilities and support the operation of the entire computer device 500.
  • the internal memory 504 provides an environment for the operation of the computer program 5032 in the non-volatile storage medium 503.
  • the processor 502 can cause the processor 502 to perform an expert knowledge recommendation method.
  • the network interface 505 is used for network communication, such as the transmission of data information.
  • the network interface 505 is used for network communication, such as the transmission of data information.
  • FIG. 8 is only a block diagram of a part of the structure related to the solution of the present application, and does not constitute a limitation on the computer device 500 to which the solution of the present application is applied.
  • the specific computer device 500 may include more or less components than shown in the figure, or combine certain components, or have a different arrangement of components.
  • the processor 502 is used to run the computer program 5032 stored in the memory to implement the expert knowledge recommendation method of the embodiment of the present application.
  • the embodiment of the computer device shown in FIG. 8 does not constitute a limitation on the specific configuration of the computer device.
  • the computer device may include more or fewer components than shown in the figure. Or combine certain components, or arrange different components.
  • the computer device may include only a memory and a processor. In such an embodiment, the structures and functions of the memory and the processor are consistent with the embodiment shown in FIG. 8 and will not be repeated here.
  • the processor 502 may be a central processing unit (Central Processing Unit, CPU), and the processor 502 may also be other general-purpose processors, digital signal processors (Digital Signal Processor, DSP), Application specific integrated circuit (Application Specific Integrated Circuit, ASIC), ready-made programmable gate array (Field-Programmable Gate Array, FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc.
  • the general-purpose processor may be a microprocessor or the processor may be any conventional processor.
  • a computer-readable storage medium may be a non-volatile computer-readable storage medium.
  • the computer-readable storage medium stores a computer program, where the computer program is executed by a processor to implement the expert knowledge recommendation method of the embodiments of the present application.
  • the storage medium may be an internal storage unit of the foregoing device, such as a hard disk or a memory of the device.
  • the storage medium may also be an external storage device of the device, such as a plug-in hard disk equipped on the device, a smart memory card (Smart) Card (SMC), a secure digital (SD) card, or a flash memory card (Flash Card) etc.
  • the storage medium may also include both an internal storage unit of the device and an external storage device.

Abstract

The present application discloses an expert knowledge recommendation method and apparatus, a computer device, and a storage medium. Said method comprises: upon reception of an uploaded counseling question to be replied to, acquiring a semantic network vector corresponding thereto; performing a retrieval in a constructed answer reply database to obtain semantic vectors of which the similarity with the semantic network vector is greater than a similarity threshold as target semantic vectors and a list of experts corresponding thereto; acquiring the popularity of each expert in the list of experts, ranking, per popularity, the semantic vectors in a descending order and taking semantic vectors ranked prior to a first rank value to obtain filtered semantic vectors; and acquiring, from the filtered semantic vectors, corresponding expert knowledge recommendation information, and sending same to an uploading end corresponding to said counseling question.

Description

专家知识推荐方法、装置、计算机设备及存储介质Expert knowledge recommendation method, device, computer equipment and storage medium
本申请要求于2018年12月11日提交中国专利局、申请号为201811510416.2、申请名称为“专家知识推荐方法、装置、计算机设备及存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application requires the priority of the Chinese patent application submitted to the Chinese Patent Office on December 11, 2018, with the application number 201811510416.2 and the application name "expert knowledge recommendation method, device, computer equipment and storage medium", the entire content of which is cited by reference Incorporated in this application.
技术领域Technical field
本申请涉及人工智能技术领域,尤其涉及一种专家知识推荐方法、装置、计算机设备及存储介质。This application relates to the field of artificial intelligence technology, and in particular, to a method, device, computer equipment, and storage medium for recommending expert knowledge.
背景技术Background technique
目前,当用户有专业问题需咨询以得到回复时,一般是在咨询平台上发布问题后,由回答者人工编辑得到回复答案。现有的智能在线客服只能答复简单的问题,对于专业性强的问题(如法律问题,高新技术领域的专业技术问题)现有的智能在线客服无法反馈准确的答复。At present, when a user has a professional question to be consulted in order to get a reply, generally, after posting the question on the consulting platform, the answerer manually edits to get the reply answer. The existing intelligent online customer service can only answer simple questions. For highly professional questions (such as legal issues, professional technical problems in the high-tech field), the existing intelligent online customer service cannot provide accurate answers.
发明内容Summary of the invention
本申请实施例提供了一种专家知识推荐方法、装置、计算机设备及存储介质,旨在解决现有技术中用户有专业问题需咨询以得到回复时,是在自选平台发布后等待以得到由答复者人工编辑得到的回复答案,无法及时的得到咨询回复,而且回复答案的专业程度极大程度上受到答复者专业知识限制的问题。The embodiments of the present application provide an expert knowledge recommendation method, device, computer equipment, and storage medium, which are designed to solve the user's professional problems in the prior art and need to be consulted to obtain a reply. Answers obtained by a person who edits manually cannot obtain a timely response to the consultation, and the professional degree of the answer is greatly limited by the professional knowledge of the respondent.
第一方面,本申请实施例提供了一种专家知识推荐方法,其包括:接收所上传的待答复咨询问题,将所述待答复咨询问题进行分词和关键词抽取,得到与所述待答复咨询问题对应的语义网络向量;将所述语义网络向量与预先构建的回复答案库中所包括的语义向量进行相似度计算,得到回复答案库中与所述语义网络向量之间相似度大于预设相似度阈值的语义向量,以作为目标语义向量;获取所述目标语义向量对应的专家列表;以及获取所述专家列表中每一专家的热度值,根据专家的热度值对语义向量进行降序排序得到排序后语义向量,获取所述排序后语义向量中排名位于预设的第一排名值之前的语义向量,以得到筛选后语义向量;获取所述筛选后语义向量中各语义向量在回复答案库中对 应的回答内容以得到专家知识推荐信息,将所述专家知识推荐信息发送至与所述待答复咨询问题对应的上传端。In the first aspect, the embodiment of the present application provides an expert knowledge recommendation method, which includes: receiving the uploaded consultation questions to be answered, segmenting and extracting the consultation questions to be answered, and obtaining consultation with the pending answers The semantic network vector corresponding to the question; calculating the similarity between the semantic network vector and the semantic vector included in the pre-built reply answer library to obtain the similarity between the reply answer library and the semantic network vector is greater than the preset similarity The semantic vector of degree threshold as the target semantic vector; obtain the expert list corresponding to the target semantic vector; and obtain the heat value of each expert in the expert list, and sort the semantic vectors in descending order according to the heat value of the expert to obtain the ranking After the semantic vector, obtain the semantic vector ranked before the preset first ranking value in the sorted semantic vector to obtain the filtered semantic vector; obtain the corresponding semantic vector in the filtered semantic vector in the reply answer library To obtain expert knowledge recommendation information, and send the expert knowledge recommendation information to the uploader corresponding to the consultation question to be answered.
第二方面,本申请实施例提供了一种专家知识推荐装置,其包括:咨询问题获取单元,用于接收所上传的待答复咨询问题,将所述待答复咨询问题进行分词和关键词抽取,得到与所述待答复咨询问题对应的语义网络向量;目标语义向量获取单元,用于将所述语义网络向量与预先构建的回复答案库中所包括的语义向量进行相似度计算,得到回复答案库中与所述语义网络向量之间相似度大于预设相似度阈值的语义向量,以作为目标语义向量;专家列表获取单元,用于获取所述目标语义向量对应的专家列表;以及排序单元,用于获取所述专家列表中每一专家的热度值,根据专家的热度值对语义向量进行降序排序得到排序后语义向量,获取所述排序后语义向量中排名位于预设的第一排名值之前的语义向量,以得到筛选后语义向量;咨询回复单元,用于获取所述筛选后语义向量中各语义向量在回复答案库中对应的回答内容以得到专家知识推荐信息,将所述专家知识推荐信息发送至与所述待答复咨询问题对应的上传端。In a second aspect, an embodiment of the present application provides an expert knowledge recommendation device, which includes: a consultation question acquisition unit for receiving the uploaded consultation questions to be answered, and segmenting and extracting the consultation questions to be answered, Obtain a semantic network vector corresponding to the query question to be answered; a target semantic vector acquisition unit for calculating the similarity between the semantic network vector and the semantic vector included in the pre-built reply answer database to obtain a reply answer database The semantic vector whose similarity between the semantic network vector and the semantic network vector is greater than a preset similarity threshold is used as the target semantic vector; the expert list obtaining unit is used to obtain the expert list corresponding to the target semantic vector; and the sorting unit is used To obtain the heat value of each expert in the expert list, sort the semantic vectors in descending order according to the heat value of the experts to obtain a sorted semantic vector, and obtain the ranking of the sorted semantic vector before the preset first ranking value Semantic vectors to obtain the filtered semantic vectors; consultation reply unit, used to obtain the corresponding response content of each semantic vector in the filtered semantic vectors in the reply answer database to obtain expert knowledge recommendation information, and recommend the expert knowledge recommendation information Send to the uploader corresponding to the question to be answered.
第三方面,本申请实施例又提供了一种计算机设备,其包括存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现上述第一方面所述的专家知识推荐方法。In a third aspect, an embodiment of the present application further provides a computer device, including a memory, a processor, and a computer program stored on the memory and executable on the processor, the processor executing the computer The program implements the expert knowledge recommendation method described in the first aspect above.
第四方面,本申请实施例还提供了一种计算机可读存储介质,其中所述计算机可读存储介质存储有计算机程序,所述计算机程序当被处理器执行时使所述处理器执行上述第一方面所述的专家知识推荐方法。According to a fourth aspect, an embodiment of the present application also provides a computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the processor causes the processor to execute the first On the one hand, the expert knowledge recommendation method.
附图说明BRIEF DESCRIPTION
为了更清楚地说明本申请实施例技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly explain the technical solutions of the embodiments of the present application, the following will briefly introduce the drawings used in the description of the embodiments. Obviously, the drawings in the following description are some embodiments of the present application. Ordinary technicians can obtain other drawings based on these drawings without creative work.
图1为本申请实施例提供的专家知识推荐方法的应用场景示意图;1 is a schematic diagram of an application scenario of an expert knowledge recommendation method provided by an embodiment of this application;
图2为本申请实施例提供的专家知识推荐方法的流程示意图;2 is a schematic flowchart of an expert knowledge recommendation method provided by an embodiment of the application;
图3为本申请实施例提供的专家知识推荐方法的子流程示意图;3 is a schematic diagram of a sub-process of a method for recommending expert knowledge provided by an embodiment of the present application;
图4为本申请实施例提供的专家知识推荐方法的另一子流程示意图;4 is a schematic diagram of another sub-process of the expert knowledge recommendation method provided by the embodiment of the present application;
图5为本申请实施例提供的专家知识推荐装置的示意性框图;5 is a schematic block diagram of an expert knowledge recommendation device provided by an embodiment of this application;
图6为本申请实施例提供的专家知识推荐装置的子单元示意性框图;6 is a schematic block diagram of a subunit of an expert knowledge recommendation device provided by an embodiment of this application;
图7为本申请实施例提供的专家知识推荐装置的另一子单元示意性框图;7 is a schematic block diagram of another subunit of an expert knowledge recommendation device provided by an embodiment of this application;
图8为本申请实施例提供的计算机设备的示意性框图。8 is a schematic block diagram of a computer device provided by an embodiment of the present application.
具体实施方式detailed description
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the embodiments of the present application will be described clearly and completely below with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, but not all of the embodiments. Based on the embodiments in the present application, all other embodiments obtained by a person of ordinary skill in the art without creative work fall within the protection scope of the present application.
应当理解,当在本说明书和所附权利要求书中使用时,术语“包括”和“包含”指示所描述特征、整体、步骤、操作、元素和/或组件的存在,但并不排除一个或多个其它特征、整体、步骤、操作、元素、组件和/或其集合的存在或添加。It should be understood that when used in this specification and the appended claims, the terms "including" and "comprising" indicate the presence of described features, wholes, steps, operations, elements, and/or components, but do not exclude one or The presence or addition of multiple other features, wholes, steps, operations, elements, components, and/or collections thereof.
还应当理解,在此本申请说明书中所使用的术语仅仅是出于描述特定实施例的目的而并不意在限制本申请。如在本申请说明书和所附权利要求书中所使用的那样,除非上下文清楚地指明其它情况,否则单数形式的“一”、“一个”及“该”意在包括复数形式。It should also be understood that the terminology used in the description of this application is for the purpose of describing particular embodiments only and is not intended to limit this application. As used in the specification of the present application and the appended claims, unless the context clearly indicates otherwise, the singular forms "a", "an", and "the" are intended to include the plural forms.
还应当进一步理解,在本申请说明书和所附权利要求书中使用的术语“和/或”是指相关联列出的项中的一个或多个的任何组合以及所有可能组合,并且包括这些组合。It should also be further understood that the term "and/or" used in the specification of the present application and the appended claims refers to any and all possible combinations of one or more of the associated listed items and includes these combinations .
请参阅图1和图2,图1是本申请实施例提供的专家知识推荐方法的应用场景示意图,图2是本申请实施例提供的专家知识推荐方法的流程示意图,该专家知识推荐方法应用于服务器中,该方法通过安装于服务器中的应用软件进行执行。Please refer to FIGS. 1 and 2, FIG. 1 is a schematic diagram of an application scenario of an expert knowledge recommendation method provided by an embodiment of the present application, and FIG. 2 is a schematic flowchart of an expert knowledge recommendation method provided by an embodiment of the present application. The expert knowledge recommendation method is applied to In the server, this method is executed by the application software installed in the server.
如图2所示,该方法包括步骤S110~S150。As shown in FIG. 2, the method includes steps S110-S150.
S110、接收所上传的待答复咨询问题,将所述待答复咨询问题进行分词和关键词抽取,得到与所述待答复咨询问题对应的语义网络向量。S110. Receive the uploaded consultation questions to be answered, extract the consultation questions to be segmented and extract keywords, and obtain a semantic network vector corresponding to the consultation questions to be answered.
在本实施例中,当用户需在线咨询问题时,在用户终端(即待答复咨询问题对应的上传端)上编辑好待答复咨询问题,并将待答复咨询问题上传至服务器,服务器接收所述待答复咨询问题后,对所述待答复咨询问题进行语义识别, 得到与所述待答复咨询问题对应的语义网络向量。由于服务器中无法直接理解待答复咨询问题的含义,但是针对待答复咨询问题进行分词和关键词抽取后,即可将所述待答复咨询问题转化为一个量化后的多维行向量或多维列向量,此时即可根据所述待答复咨询问题在预先构建的回复答案库中搜索近似问题及其答案,所得到的回复更准确。In this embodiment, when a user needs to consult a question online, edit the question to be answered on the user terminal (ie, the uploading end corresponding to the question to be answered), and upload the question to be answered to the server, and the server receives the After the consultation question is to be answered, semantic recognition is performed on the consultation question to be answered, and a semantic network vector corresponding to the consultation question to be answered is obtained. Since the server cannot directly understand the meaning of the consultation question to be answered, but after word segmentation and keyword extraction for the consultation question to be answered, the consultation question to be answered can be converted into a quantized multidimensional row vector or multidimensional column vector, At this time, the approximate question and its answer can be searched in the pre-built reply answer database according to the consultation question to be answered, and the obtained reply is more accurate.
在一实施例中,如图3所示,步骤S110包括:In an embodiment, as shown in FIG. 3, step S110 includes:
S111、将所述待答复咨询问题通过基于概率统计分词模型进行分词,得到与所述待答复咨询问题对应的分词结果;S111: Segment the consultation question to be answered by a probability-based word segmentation model to obtain a word segmentation result corresponding to the consultation question to be answered;
S112、通过词频-逆文本频率指数模型,抽取所述分词结果中位于预设的第二排名值之前的关键词信息以作为目标关键词集合;S112. Using the word frequency-inverse text frequency index model, extract the keyword information in the word segmentation result before the preset second ranking value as the target keyword set;
S113、获取所述目标关键词集合中每一关键词信息对应的目标词向量;S113: Acquire a target word vector corresponding to each keyword information in the target keyword set;
S114、根据每一目标词向量,及每一目标词向量对应的权重,获取与待答复咨询问题对应的语义网络向量。S114. Acquire a semantic network vector corresponding to the consultation question to be answered according to each target word vector and the weight corresponding to each target word vector.
在本实施例中,将所述待答复咨询问题通过基于概率统计分词模型进行分词过程如下:In this embodiment, the word segmentation process of the query question to be answered through a probability-based word segmentation model is as follows:
例如,令C=C1C2...Cm,C是待切分的汉字串,令W=W1W2...Wn,W是切分的结果,Wa,Wb,……,Wk是C的所有可能的切分方案。那么,基于概率统计分词模型就是能够找到目的词串W,使得W满足:P(W|C)=MAX(P(Wa|C),P(Wb|C)...P(Wk|C))的分词模型,上述分词模型得到的词串W即估计概率为最大之词串。即:For example, let C=C1C2...Cm, C be the Chinese character string to be segmented, let W=W1W2...Wn, W be the result of segmentation, Wa, Wb,..., Wk are all possible of C Split the plan. Then, the word segmentation model based on probability statistics can find the target word string W, so that W satisfies: P(W|C)=MAX(P(Wa|C), P(Wb|C)...P(Wk|C) ) Word segmentation model, the word string W obtained by the above word segmentation model is the word string whose estimated probability is the largest. which is:
对一个待分词的子串S,按照从左到右的顺序取出全部候选词w1、w2、…、wi、…、wn;在词典中查出每个候选词的概率值P(wi),并记录每个候选词的全部左邻词;计算每个候选词的累计概率,同时比较得到每个候选词的最佳左邻词;如果当前词wn是字串S的尾词,且累计概率P(wn)最大,则wn就是S的终点词;从wn开始,按照从右到左顺序,依次将每个词的最佳左邻词输出,即S的分词结果。For a substring S to be segmented, all candidate words w1, w2, ..., wi, ..., wn are taken from left to right; the probability value P(wi) of each candidate word is found in the dictionary, and Record all the left neighbor words of each candidate word; calculate the cumulative probability of each candidate word, and compare to get the best left neighbor word of each candidate word; if the current word wn is the last word of the string S, and the cumulative probability P (wn) is the largest, then wn is the end word of S; from wn, in accordance with the order from right to left, the best left neighbor of each word is output in turn, that is, the word segmentation result of S.
获取了与所述待答复咨询问题对应的分词结果后,再通过词频-逆文本频率指数模型(即TF-IDF模型,TF-IDF是Term Frequency–Inverse Document Frequency的简写),抽取所述分词结果中位于预设的第二排名值之前的关键词 信息以作为目标关键词集合。通过TF-IDF模型抽取所述分词结果中位于预设的排名值之前的关键词信息,具体如下:After obtaining the word segmentation results corresponding to the consultation questions to be answered, the word segmentation results are extracted through the word frequency-inverse text frequency index model (ie, TF-IDF model, TF-IDF is the abbreviation of Term, Frequency-Inverse Document Frequency) The keyword information before the preset second ranking value is used as the target keyword set. The keyword information before the preset ranking value in the word segmentation result is extracted through the TF-IDF model, as follows:
1)计算分词结果中每一分词i的词频,记为TF i1) Calculate the word frequency of each participle i in the participle result, and record it as TF i ;
2)计算分词结果中每一分词i的逆文档频率IDF i2) Calculate the inverse document frequency IDF i of each participle i in the word segmentation result;
在计算每一分词i的逆文档频率IDFi时,需要一个语料库(与分词过程中的字典类似),用来模拟语言的使用环境;When calculating the inverse document frequency IDFi of each word segmentation i, a corpus (similar to a dictionary in the word segmentation process) is needed to simulate the language usage environment;
逆文档频率IDF i=lg[语料库的文档总数/(包含该分词的文档数+1)]; Inverse document frequency IDF i =lg[total number of documents in the corpus/(number of documents containing the participle+1)];
如果一个词越常见,那么分母就越大,逆文档频率就越小越接近0。分母之所以要加1,是为了避免分母为0(即所有文档都不包含该词)。If a word is more common, then the denominator is larger, and the inverse document frequency is smaller and closer to 0. The reason why the denominator is increased by 1 is to avoid the denominator being 0 (that is, all documents do not contain the word).
3)根据TF i*IDF i计算分词结果中每一分词i对应的词频-逆文本频率指数TF-IDFi; 3) Calculate the word frequency-inverse text frequency index TF-IDFi corresponding to each participle i in the word segmentation result according to TF i *IDF i ;
显然,TF-IDF与一个词在文档中的出现次数成正比,与该词在整个语言中的出现次数成反比。所以,自动提取关键词即是计算出文档的每个分词的TF-IDF值,然后按降序排列,取排在前N位的词作为文档的关键词列表。Obviously, TF-IDF is directly proportional to the number of occurrences of a word in the document, and inversely proportional to the number of occurrences of the word in the entire language. Therefore, automatically extracting keywords is to calculate the TF-IDF value of each participle of the document, and then arrange them in descending order, and take the top N words as the keyword list of the document.
4)将分词结果中每一分词对应的词频-逆文本频率指数按降序排序,取排名位于预设的排名值之前(例如预设的排名值为21)的分词组成与所述待答复咨询问题对应的目标关键词集合。4) Sort the word frequency-inverse text frequency index corresponding to each word segmentation in the word segmentation results in descending order, and take the word segment composition ranked before the preset ranking value (for example, the preset ranking value is 21) and the question to be answered Corresponding target keyword set.
获取与所述待答复咨询问题对应的目标关键词集合后,即可对应获取目标关键词集合中每一关键词对应的目标词向量。其中,获取关键词信息对应的词向量是基于预先构建的词汇表查询得到,词向量的获取过程称为word2vec,作用就是将自然语言中的字词转为计算机可以理解的稠密向量。例如,在语料库(也即词汇表)中,AA、BB、CC、DD(其中AA、BB、CC、DD代表一个中文词)各对应一个向量,向量中只有一个值为1,其余都为0。即先通过One-Hot Encoder(独热码)将字词转为离散的单独的符号,再通过Word2Vec降维转化为低维度的连续值,也就是稠密向量,并且其中意思相近的词将被映射到向量空间中相近的位置。After acquiring the target keyword set corresponding to the query question to be answered, the target word vector corresponding to each keyword in the target keyword set can be correspondingly acquired. Among them, the word vector corresponding to the keyword information is obtained based on a pre-constructed vocabulary table query. The process of acquiring the word vector is called word2vec, and its function is to convert words in natural language into dense vectors that can be understood by the computer. For example, in a corpus (that is, a vocabulary), AA, BB, CC, and DD (where AA, BB, CC, and DD represent a Chinese word) each correspond to a vector. Only one value in the vector is 1, and the rest are 0. . That is, the words are converted into discrete individual symbols through One-Hot Encoder (one-hot code), and then converted into low-dimensional continuous values, that is, dense vectors, through Word2Vec dimensionality reduction, and words with similar meanings will be mapped To a similar location in vector space.
最后,根据分词结果中每一关键词的词频,能获取与每一目标词向量对应的权重,此时根据每一目标词向量,及每一目标词向量对应的权重,获取与待答复咨询问题对应的语义网络向量。具体计算的公式如下:Finally, according to the word frequency of each keyword in the word segmentation result, the weight corresponding to each target word vector can be obtained. At this time, according to each target word vector and the weight corresponding to each target word vector, the consultation questions to be answered can be obtained. Corresponding semantic network vector. The specific calculation formula is as follows:
Figure PCTCN2019092507-appb-000001
Figure PCTCN2019092507-appb-000001
其中,Vector指的是与待答复咨询问题对应的语义网络向量,Word_Embedding(kwi)为目标词向量i,ω i是目标词向量i对应的权重。通过上述过程即可将所述待答复咨询问题转化为一个多维行向量或多维列向量,实现了对所述待答复咨询问题的量化转化。 Among them, Vector refers to the semantic network vector corresponding to the query question to be answered, Word_Embedding(kwi) is the target word vector i, and ω i is the weight corresponding to the target word vector i. Through the above process, the consultation question to be answered can be converted into a multi-dimensional row vector or multi-dimensional column vector, and the quantitative conversion of the consultation question to be answered can be realized.
S120、将所述语义网络向量与预先构建的回复答案库中所包括的语义向量进行相似度计算,得到回复答案库中与所述语义网络向量之间相似度大于预设相似度阈值的语义向量,以作为目标语义向量。S120. Calculate the similarity between the semantic network vector and the semantic vector included in the pre-built reply answer library to obtain a semantic vector whose similarity between the reply answer library and the semantic network vector is greater than a preset similarity threshold , As the target semantic vector.
在本实施例中,回复答案库中每一条答案数据均是以专家姓名、回答内容、关键词组合与语义向量的保存格式;其中,关键词组合是对回答内容,通过TF-IDF模型抽取出排名前N位的关键词以组成关键词组合(其中,N为在服务器中自定义设置的值,例如设置N等于第二排名值+1);语义向量是回答内容对应的各关键词、及各关键词对应的权重计算得到,计算过程与获取所述待答复咨询问题对应的语义网络向量相同。例如目标答案库的数据如下:In this embodiment, each piece of answer data in the reply answer database is in the format of expert name, answer content, keyword combination, and semantic vector; where the keyword combination is the answer content, extracted by the TF-IDF model The top N keywords are combined to form a keyword combination (where N is a custom-set value in the server, such as setting N equal to the second ranking value +1); the semantic vector is each keyword corresponding to the content of the answer, and The weight corresponding to each keyword is calculated, and the calculation process is the same as obtaining the semantic network vector corresponding to the query question to be answered. For example, the data of the target answer library is as follows:
序号Serial number 专家姓名Expert name 回答内容Answer content 关键词组合Keyword combination 语义向量Semantic vector
11 AA1AA1 B1B2B3B4B1B2B3B4 B1+B2B1+B2 [C1C2……C3][C1C2……C3]
22 AA2AA2 B1B2B4B7B1B2B4B7 B2+B7B2+B7 [C4C5……C6][C4C5...C6]
……...
NN AANAAN B3B4B7B9B3B4B7B9 B3+B9B3+B9 [C7C8……C9][C7C8...C9]
由于预先构建了所述回复答案库,当获取了与待答复咨询问题对应的语义网络向量后,即可将所述语义网络向量与预先构建的回复答案库中所包括的语义向量进行相似度计算,从而得到与所述语义网络向量相关度较高的回答内容,基于语义网络匹配问题和问题库中的答案,更加能识别出相似问题,提高问题回答的质量。Since the reply answer database is pre-built, when the semantic network vector corresponding to the consultation question to be answered is obtained, the similarity calculation of the semantic network vector and the semantic vector included in the pre-built reply answer database can be performed In order to obtain the answer content with high correlation with the semantic network vector, based on the semantic network matching question and the answer in the question database, it is more able to identify similar questions and improve the quality of question answering.
在一实施例中,如图4所示,步骤S120包括:In an embodiment, as shown in FIG. 4, step S120 includes:
S121、获取所述语义网络向量对应的目标关键词集合;S121. Acquire a target keyword set corresponding to the semantic network vector;
S122、将所述目标关键词集合与所述回复答案库中的关键词组合进行比对,获取所述回复答案库中包括所述目标关键词集合的关键词组合,以得到关键词匹配结果;S122. Compare the target keyword set with the keyword combination in the reply answer library, and obtain a keyword combination including the target keyword set in the reply answer library, to obtain a keyword matching result;
S123、计算关键词匹配结果中每一关键词组合对应的语义向量与所述语义网络向量之间夹角的余弦值,以得到所述关键词匹配结果中每一关键词组合与所述语义网络向量之间的相似度,以作为相似度集合;S123. Calculate the cosine of the angle between the semantic vector corresponding to each keyword combination in the keyword matching result and the semantic network vector to obtain each keyword combination and the semantic network in the keyword matching result The similarity between vectors is used as a similarity set;
S124、获取所述相似度集合中大于所述相似度阈值的相似度,以得到目标相似度集合;S124: Obtain a similarity degree in the similarity degree set that is greater than the similarity threshold value to obtain a target similarity degree set;
S125、获取所述目标相似度集合中每一相似度对应的语义向量,以作为目标语义向量。S125. Acquire a semantic vector corresponding to each similarity in the target similarity set as a target semantic vector.
在本实施例中,在与待答复咨询问题对应的语义网络向量之前,是获取了与待答复咨询问题对应的目标关键词集合,此时为了提高获取待答复咨询问题的回答内容的检索效率,可先通过将所述目标关键词集合与所述回复答案库中的关键词组合进行比对,获取所述回复答案库中包括所述目标关键词集合的关键词组合,以得到关键词匹配结果。In this embodiment, before the semantic network vector corresponding to the question to be answered, the target keyword set corresponding to the question to be answered is obtained. In this case, in order to improve the retrieval efficiency of obtaining the answer content of the question to be answered, The keyword combination including the target keyword set in the reply answer library may be obtained by comparing the target keyword set with the keyword combination in the reply answer library to obtain a keyword matching result .
之后再在关键词匹配结果对应的多条答案数据中,计算关键词匹配结果中每一关键词组合对应的语义向量与所述语义网络向量之间的相似度。在计算向量之间的相似度时,可通过如下公式计算:Then, in multiple answer data corresponding to the keyword matching result, the similarity between the semantic vector corresponding to each keyword combination in the keyword matching result and the semantic network vector is calculated. When calculating the similarity between vectors, it can be calculated by the following formula:
Figure PCTCN2019092507-appb-000002
Figure PCTCN2019092507-appb-000002
其中,a和b分别代表两个向量,θ为向量a和向量b之间的夹角。具体实施时,设置相似度阈值为0.5。可见,通过上述过程即可快速筛选得到与所述语义网络向量相关度较高的回答内容。Among them, a and b represent two vectors respectively, θ is the angle between vector a and vector b. In specific implementation, the similarity threshold is set to 0.5. It can be seen that through the above process, it is possible to quickly screen and obtain the answer content with a high correlation with the semantic network vector.
S130、获取所述目标语义向量对应的专家列表。S130. Acquire an expert list corresponding to the target semantic vector.
在本实施例中,获取了目标语义向量,即获取了与待答复咨询问题相关度较高的回答内容,每一回答内容均对应回复答案库中的一条答案数据,而每一答案数据是对应一个专家姓名,故在获取了所述目标语义向量后,即可对应获取所述目标语义向量包括的各语义向量一一对应的专家姓名,从而组成专家列表。In this embodiment, the target semantic vector is obtained, that is, the answer content that is highly relevant to the query question to be answered is obtained, each answer content corresponds to a piece of answer data in the answer answer library, and each answer data is corresponding to An expert name, so after acquiring the target semantic vector, the name of the expert corresponding to each semantic vector included in the target semantic vector can be obtained, thereby forming an expert list.
S140、获取所述专家列表中每一专家的热度值,根据专家的热度值对语义向量进行降序排序得到排序后语义向量,获取所述排序后语义向量中排名位于预设的第一排名值之前的语义向量,以得到筛选后语义向量。S140: Obtain the heat value of each expert in the expert list, sort the semantic vectors in descending order according to the heat value of the experts to obtain a sorted semantic vector, and obtain the ranking in the sorted semantic vector before the preset first ranking value To obtain the semantic vector after filtering.
在本实施例中,为了对所述待答复咨询问题对应的上传端推送更值得信赖 的回答内容,可以对所述目标语义向量对应的专家列表中各专家分别统计计算热度值,将热度值排名靠前的专家的回答内容作为优先推荐的回答内容,即更加能推荐出用户所信任的专家答案,提高准确性。In this embodiment, in order to push more reliable answer content to the uploader corresponding to the query question to be answered, each expert in the expert list corresponding to the target semantic vector may be statistically calculated to rank the heat value The content of the answer of the top expert is regarded as the content of priority recommendation, that is, the answer of the expert trusted by the user can be more recommended, and the accuracy is improved.
在一实施例中,作为计算热度值的第一实施例,所述步骤S140中获取所述专家列表中每一专家的热度值,包括:In an embodiment, as the first embodiment for calculating the heat value, obtaining the heat value of each expert in the expert list in step S140 includes:
根据专家列表中每一专家的文章被引用累计总次数,以对应得到每一专家的热度值。According to the total number of citations of each expert's article in the expert list, to obtain the heat value of each expert.
即作为计算热度值的第一实施例,可以根据专家列表中每一专家的文章被引用累计总次数,作为专家的热度值。若该专家发表了多篇文章,则其多篇文章中每一文章被引用累计总次数求和,即可得到该专家的热度值。That is, as the first embodiment for calculating the heat value, the total number of times the articles of each expert in the expert list are cited can be used as the heat value of the expert. If the expert publishes multiple articles, the total number of times that each article is cited in the multiple articles is summed to obtain the expert's heat value.
在一实施例中,作为计算热度值的第二实施例,所述步骤S140中获取所述专家列表中每一专家的热度值,包括:In an embodiment, as a second embodiment for calculating the heat value, obtaining the heat value of each expert in the expert list in step S140 includes:
根据预设的引用值模型获取所述专家列表中每一专家的文章被引用的引用值之和,以得到与所述专家列表中每一专家对应的热度值;其中,所述专家列表为
Figure PCTCN2019092507-appb-000003
其中value k表示所述专家列表中专家k的热度值,其他专家i与所述专家列表中专家k之间的引用值为
Figure PCTCN2019092507-appb-000004
所述专家列表中专家k的文章发表时间是T 0,其他专家i引用所述专家列表中专家k的文章的引用时间为T,λ为预设的调节参数。
Obtaining the sum of the cited values of the articles of each expert in the expert list according to a preset reference value model to obtain the heat value corresponding to each expert in the expert list; wherein, the expert list is
Figure PCTCN2019092507-appb-000003
Where value k represents the heat value of expert k in the expert list, and the reference value between other experts i and expert k in the expert list is
Figure PCTCN2019092507-appb-000004
The publication time of the article of expert k in the expert list is T 0 , the citation time of the article of expert k in the expert list cited by other experts i is T, and λ is the preset adjustment parameter.
即作为计算热度值的第二实施例,在计算专家列表中每一专家的热度值时,可构建专家的有向社交网络结构,其中,主体是各个专家名称,有向的边指的是,专家A引用了专家B的文章,则专家A指向专家B,有向的边值为带有时间衰退因子的引用值,在计算某一专家的热度值时,计算公式如下:That is, as a second embodiment for calculating the heat value, when calculating the heat value of each expert in the expert list, a directed social network structure of experts can be constructed, where the subject is the name of each expert, and the directed side refers to, Expert A quotes expert B's article, then expert A points to expert B. The directed boundary value is the reference value with time decay factor. When calculating the heat value of an expert, the calculation formula is as follows:
Figure PCTCN2019092507-appb-000005
Figure PCTCN2019092507-appb-000005
其中,value k表示所述专家列表中专家k的热度值,其他专家i与所述专家列表中专家k之间的引用值为
Figure PCTCN2019092507-appb-000006
所述专家列表中专家k的文章发表时间是T 0,其他专家i引用所述专家列表中专家k的文章的引用时间为T,λ为预设的调节参数(如设置调节参数为0.5)。
Where value k represents the heat value of expert k in the expert list, and the reference value between other experts i and expert k in the expert list is
Figure PCTCN2019092507-appb-000006
The publication time of the article of expert k in the expert list is T 0 , the citation time of the article of expert k in the expert list cited by other experts i is T, and λ is the preset adjustment parameter (such as setting the adjustment parameter to 0.5).
S150、获取所述筛选后语义向量中各语义向量在回复答案库中对应的回答内容以得到专家知识推荐信息,将所述专家知识推荐信息发送至与所述待答复 咨询问题对应的上传端。S150. Acquire corresponding answer content of each semantic vector in the filtered semantic vector in the reply answer database to obtain expert knowledge recommendation information, and send the expert knowledge recommendation information to the uploading end corresponding to the consultation question to be answered.
在本实施中,计算了专家列表中每一专家的热度值后,并且以此作为排序得到排序后语义向量,获取所述排序后语义向量中排名位于预设的第一排名值之前的语义向量(例如选择出排名前10的语义向量,第一排名值即设置为11),及与语义向量对应的回答内容以得到专家知识推荐信息,推送至与所述待答复咨询问题对应的上传端,所推送的专家知识推荐信息包括了排序、回答内容、专家姓名。例如,所推送的专家知识推荐信息如下:In this implementation, after calculating the heat value of each expert in the expert list, and using this as a sort to obtain a sorted semantic vector, a semantic vector ranked before the preset first ranking value in the sorted semantic vector is obtained (For example, the top 10 semantic vectors are selected, and the first ranking value is set to 11), and the answer content corresponding to the semantic vectors to obtain expert knowledge recommendation information, which is pushed to the uploading end corresponding to the question to be answered, The pushed expert knowledge recommendation information includes sorting, answer content, and expert name. For example, the pushed expert knowledge recommendation information is as follows:
序号Serial number 专家姓名Expert name 回答内容Answer content
11 AA1AA1 B1B2B3B4B1B2B3B4
22 AA2AA2 B1B2B4B7B1B2B4B7
……...
1010 AA10AA10 B3B4B7B9B3B4B7B9
通过上述形式的专家知识推荐信息,所述待答复咨询问题对应的上传端即可得到可信度较高的答复内容。Through the above-mentioned expert knowledge recommendation information, the uploading end corresponding to the query question to be answered can obtain a highly reliable reply content.
该方法采用语义识别技术能推荐出用户所信任的专家答案,提高推荐的准确性,而且基于语义网络匹配问题和回复答案库中的答案,更加能识别出相似问题,提高问题回答的质量。This method adopts semantic recognition technology to recommend expert answers trusted by users to improve the accuracy of recommendations, and based on semantic network matching questions and answers in the answer answer library, it can identify similar questions and improve the quality of question answers.
本申请实施例还提供一种专家知识推荐装置,该专家知识推荐装置用于执行前述专家知识推荐方法的任一实施例。具体地,请参阅图5,图5是本申请实施例提供的专家知识推荐装置的示意性框图。该专家知识推荐装置100可以配置于服务器中。An embodiment of the present application further provides an expert knowledge recommendation device, which is used to execute any of the foregoing embodiments of the expert knowledge recommendation method. Specifically, please refer to FIG. 5, which is a schematic block diagram of an expert knowledge recommendation device provided by an embodiment of the present application. The expert knowledge recommendation device 100 can be configured in a server.
如图5所示,专家知识推荐装置100包括咨询问题获取单元110、目标语义向量获取单元120、专家列表获取单元130、排序单元140、咨询回复单元150。As shown in FIG. 5, the expert knowledge recommendation device 100 includes a consultation question acquisition unit 110, a target semantic vector acquisition unit 120, an expert list acquisition unit 130, a sorting unit 140, and a consultation reply unit 150.
咨询问题获取单元110,用于接收所上传的待答复咨询问题,将所述待答复咨询问题进行分词和关键词抽取,得到与所述待答复咨询问题对应的语义网络向量。The consultation question obtaining unit 110 is configured to receive the uploaded consultation questions to be answered, perform word segmentation and keyword extraction on the consultation questions to be answered, and obtain a semantic network vector corresponding to the consultation questions to be answered.
在一实施例中,如图6所示,咨询问题获取单元110包括:In an embodiment, as shown in FIG. 6, the consultation question obtaining unit 110 includes:
分词单元111,用于将所述待答复咨询问题通过基于概率统计分词模型进行分词,得到与所述待答复咨询问题对应的分词结果;The word segmentation unit 111 is configured to perform word segmentation on the consultation question to be answered based on a probability statistical word segmentation model to obtain a word segmentation result corresponding to the consultation question to be answered;
关键词抽取单元112,用于通过词频-逆文本频率指数模型,抽取所述分词 结果中位于预设的第二排名值之前的关键词信息以作为目标关键词集合;The keyword extraction unit 112 is used to extract the keyword information located before the preset second ranking value in the word segmentation result through the word frequency-inverse text frequency index model as the target keyword set;
目标词向量获取单元113,用于获取所述目标关键词集合中每一关键词信息对应的目标词向量;A target word vector acquiring unit 113, configured to acquire a target word vector corresponding to each keyword information in the target keyword set;
语义网络向量获取单元114,用于根据每一目标词向量,及每一目标词向量对应的权重,获取与待答复咨询问题对应的语义网络向量。The semantic network vector obtaining unit 114 is configured to obtain a semantic network vector corresponding to the question to be answered according to each target word vector and the weight corresponding to each target word vector.
目标语义向量获取单元120,用于将所述语义网络向量与预先构建的回复答案库中所包括的语义向量进行相似度计算,得到回复答案库中与所述语义网络向量之间相似度大于预设相似度阈值的语义向量,以作为目标语义向量。The target semantic vector acquiring unit 120 is configured to calculate the similarity between the semantic network vector and the semantic vector included in the pre-built reply answer database, and obtain that the similarity between the reply answer database and the semantic network vector is greater than the Set the semantic vector of the similarity threshold as the target semantic vector.
在一实施例中,如图7所示,目标语义向量获取单元120包括:In an embodiment, as shown in FIG. 7, the target semantic vector acquisition unit 120 includes:
目标关键词集合获取单元121,用于获取所述语义网络向量对应的目标关键词集合;A target keyword set obtaining unit 121, configured to obtain a target keyword set corresponding to the semantic network vector;
关键词比较单元122,用于将所述目标关键词集合与所述回复答案库中的关键词组合进行比对,获取所述回复答案库中包括所述目标关键词集合的关键词组合,以得到关键词匹配结果;The keyword comparison unit 122 is used for comparing the target keyword set with the keyword combination in the reply answer library, and acquiring the keyword combination including the target keyword set in the reply answer library, to Get keyword matching results;
相似度集合获取单元123,用于计算关键词匹配结果中每一关键词组合对应的语义向量与所述语义网络向量之间夹角的余弦值,以得到所述关键词匹配结果中每一关键词组合与所述语义网络向量之间的相似度,以作为相似度集合;The similarity set acquisition unit 123 is used to calculate the cosine of the angle between the semantic vector corresponding to each keyword combination in the keyword matching result and the semantic network vector to obtain each key in the keyword matching result The similarity between the word combination and the semantic network vector is used as a similarity set;
目标相似度集合获取单元124,用于获取所述相似度集合中大于所述相似度阈值的相似度,以得到目标相似度集合;The target similarity set acquisition unit 124 is configured to acquire a similarity greater than the similarity threshold in the similarity set to obtain a target similarity set;
目标相似度集合解析单元125,用于获取所述目标相似度集合中每一相似度对应的语义向量,以作为目标语义向量。The target similarity set parsing unit 125 is used to obtain a semantic vector corresponding to each similarity in the target similarity set as a target semantic vector.
专家列表获取单元130,用于获取所述目标语义向量对应的专家列表。The expert list obtaining unit 130 is configured to obtain an expert list corresponding to the target semantic vector.
排序单元140,用于获取所述专家列表中每一专家的热度值,根据专家的热度值对语义向量进行降序排序得到排序后语义向量,获取所述排序后语义向量中排名位于预设的第一排名值之前的语义向量,以得到筛选后语义向量。The sorting unit 140 is configured to obtain the heat value of each expert in the expert list, sort the semantic vectors in descending order according to the heat value of the experts to obtain the sorted semantic vector, and obtain the ranked position in the semantic vector after the sorting in the preset A semantic vector before the ranking value to obtain the filtered semantic vector.
在一实施例中,作为计算热度值的第一实施例,所述排序单元140还用于:In an embodiment, as the first embodiment for calculating the heat value, the sorting unit 140 is further used to:
根据专家列表中每一专家的文章被引用累计总次数,以对应得到每一专家的热度值。According to the total number of citations of each expert's article in the expert list, to obtain the heat value of each expert.
在一实施例中,作为计算热度值的第二实施例,所述排序单元140还用于:In an embodiment, as a second embodiment for calculating the heat value, the sorting unit 140 is further used to:
根据预设的引用值模型获取所述专家列表中每一专家的文章被引用的引用 值之和,以得到与所述专家列表中每一专家对应的热度值;其中,所述专家列表为
Figure PCTCN2019092507-appb-000007
其中value k表示所述专家列表中专家k的热度值,其他专家i与所述专家列表中专家k之间的引用值为
Figure PCTCN2019092507-appb-000008
所述专家列表中专家k的文章发表时间是T 0,其他专家i引用所述专家列表中专家k的文章的引用时间为T,λ为预设的调节参数。
Obtaining the sum of the cited values of the articles of each expert in the expert list according to a preset reference value model to obtain the heat value corresponding to each expert in the expert list; wherein, the expert list is
Figure PCTCN2019092507-appb-000007
Where value k represents the heat value of expert k in the expert list, and the reference value between other experts i and expert k in the expert list is
Figure PCTCN2019092507-appb-000008
The publication time of the article of expert k in the expert list is T 0 , the citation time of the article of expert k in the expert list cited by other experts i is T, and λ is the preset adjustment parameter.
咨询回复单元150,用于获取所述筛选后语义向量中各语义向量在回复答案库中对应的回答内容以得到专家知识推荐信息,将所述专家知识推荐信息发送至与所述待答复咨询问题对应的上传端。The consultation reply unit 150 is used to obtain the corresponding response content of each semantic vector in the filtered semantic vector in the reply answer library to obtain expert knowledge recommendation information, and send the expert knowledge recommendation information to the consultation question to be answered The corresponding upload end.
该装置采用语义识别技术能推荐出用户所信任的专家答案,提高推荐的准确性,而且基于语义网络匹配问题和回复答案库中的答案,更加能识别出相似问题,提高问题回答的质量。The device uses semantic recognition technology to recommend expert answers trusted by users to improve the accuracy of recommendations, and based on semantic network matching questions and answers in the answer answer library, it can identify similar questions and improve the quality of question answers.
上述专家知识推荐装置可以实现为计算机程序的形式,该计算机程序可以在如图8所示的计算机设备上运行。The above expert knowledge recommendation device may be implemented in the form of a computer program, and the computer program may run on a computer device as shown in FIG. 8.
请参阅图8,图8是本申请实施例提供的计算机设备的示意性框图。该计算机设备500是服务器。其中,服务器可以是独立的服务器,也可以是多个服务器组成的服务器集群。Please refer to FIG. 8, which is a schematic block diagram of a computer device provided by an embodiment of the present application. The computer device 500 is a server. The server may be an independent server or a server cluster composed of multiple servers.
参阅图8,该计算机设备500包括通过系统总线501连接的处理器502、存储器和网络接口505,其中,存储器可以包括非易失性存储介质503和内存储器504。该非易失性存储介质503可存储操作系统5031和计算机程序5032。该计算机程序5032被执行时,可使得处理器502执行专家知识推荐方法。该处理器502用于提供计算和控制能力,支撑整个计算机设备500的运行。该内存储器504为非易失性存储介质503中的计算机程序5032的运行提供环境,该计算机程序5032被处理器502执行时,可使得处理器502执行专家知识推荐方法。该网络接口505用于进行网络通信,如提供数据信息的传输等。本领域技术人员可以理解,图8中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备500的限定,具体的计算机设备500可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。Referring to FIG. 8, the computer device 500 includes a processor 502, a memory, and a network interface 505 connected through a system bus 501, where the memory may include a non-volatile storage medium 503 and an internal memory 504. The non-volatile storage medium 503 can store an operating system 5031 and a computer program 5032. When the computer program 5032 is executed, the processor 502 can execute the expert knowledge recommendation method. The processor 502 is used to provide computing and control capabilities and support the operation of the entire computer device 500. The internal memory 504 provides an environment for the operation of the computer program 5032 in the non-volatile storage medium 503. When the computer program 5032 is executed by the processor 502, the processor 502 can cause the processor 502 to perform an expert knowledge recommendation method. The network interface 505 is used for network communication, such as the transmission of data information. Those skilled in the art can understand that the structure shown in FIG. 8 is only a block diagram of a part of the structure related to the solution of the present application, and does not constitute a limitation on the computer device 500 to which the solution of the present application is applied. The specific computer device 500 may include more or less components than shown in the figure, or combine certain components, or have a different arrangement of components.
其中,所述处理器502用于运行存储在存储器中的计算机程序5032,以实现本申请实施例的专家知识推荐方法。Wherein, the processor 502 is used to run the computer program 5032 stored in the memory to implement the expert knowledge recommendation method of the embodiment of the present application.
本领域技术人员可以理解,图8中示出的计算机设备的实施例并不构成对计算机设备具体构成的限定,在其他实施例中,计算机设备可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。例如,在一些实施例中,计算机设备可以仅包括存储器及处理器,在这样的实施例中,存储器及处理器的结构及功能与图8所示实施例一致,在此不再赘述。Those skilled in the art can understand that the embodiment of the computer device shown in FIG. 8 does not constitute a limitation on the specific configuration of the computer device. In other embodiments, the computer device may include more or fewer components than shown in the figure. Or combine certain components, or arrange different components. For example, in some embodiments, the computer device may include only a memory and a processor. In such an embodiment, the structures and functions of the memory and the processor are consistent with the embodiment shown in FIG. 8 and will not be repeated here.
应当理解,在本申请实施例中,处理器502可以是中央处理单元(Central Processing Unit,CPU),该处理器502还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现成可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。其中,通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。It should be understood that in the embodiment of the present application, the processor 502 may be a central processing unit (Central Processing Unit, CPU), and the processor 502 may also be other general-purpose processors, digital signal processors (Digital Signal Processor, DSP), Application specific integrated circuit (Application Specific Integrated Circuit, ASIC), ready-made programmable gate array (Field-Programmable Gate Array, FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. The general-purpose processor may be a microprocessor or the processor may be any conventional processor.
在本申请的另一实施例中提供计算机可读存储介质。该计算机可读存储介质可以为非易失性的计算机可读存储介质。该计算机可读存储介质存储有计算机程序,其中计算机程序被处理器执行时实现本申请实施例的专家知识推荐方法。In another embodiment of the present application, a computer-readable storage medium is provided. The computer-readable storage medium may be a non-volatile computer-readable storage medium. The computer-readable storage medium stores a computer program, where the computer program is executed by a processor to implement the expert knowledge recommendation method of the embodiments of the present application.
所述存储介质可以是前述设备的内部存储单元,例如设备的硬盘或内存。所述存储介质也可以是所述设备的外部存储设备,例如所述设备上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。进一步地,所述存储介质还可以既包括所述设备的内部存储单元也包括外部存储设备。The storage medium may be an internal storage unit of the foregoing device, such as a hard disk or a memory of the device. The storage medium may also be an external storage device of the device, such as a plug-in hard disk equipped on the device, a smart memory card (Smart) Card (SMC), a secure digital (SD) card, or a flash memory card (Flash Card) etc. Further, the storage medium may also include both an internal storage unit of the device and an external storage device.
所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,上述描述的设备、装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art can clearly understand that, for the convenience and conciseness of the description, the specific working processes of the devices, devices, and units described above can refer to the corresponding processes in the foregoing method embodiments, and are not repeated here.
以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到各种等效的修改或替换,这些修改或替换都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以权利要求的保护范围为准。The above is only the specific implementation of this application, but the scope of protection of this application is not limited to this, any person skilled in the art can easily think of various equivalents within the technical scope disclosed in this application Modifications or replacements, these modifications or replacements should be covered within the scope of protection of this application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims (20)

  1. 一种专家知识推荐方法,包括:An expert knowledge recommendation method, including:
    接收所上传的待答复咨询问题,将所述待答复咨询问题进行分词和关键词抽取,得到与所述待答复咨询问题对应的语义网络向量;Receiving the uploaded consultation questions to be answered, extracting the word consultation and keyword extraction of the consultation questions to be answered, and obtaining a semantic network vector corresponding to the consultation questions to be answered;
    将所述语义网络向量与预先构建的回复答案库中所包括的语义向量进行相似度计算,得到回复答案库中与所述语义网络向量之间相似度大于预设相似度阈值的语义向量,以作为目标语义向量;Calculating the similarity between the semantic network vector and the semantic vector included in the pre-built reply answer library to obtain a semantic vector whose similarity between the reply answer library and the semantic network vector is greater than a preset similarity threshold, to As the target semantic vector;
    获取所述目标语义向量对应的专家列表;Obtaining an expert list corresponding to the target semantic vector;
    获取所述专家列表中每一专家的热度值,根据专家的热度值对语义向量进行降序排序得到排序后语义向量,获取所述排序后语义向量中排名位于预设的第一排名值之前的语义向量,以得到筛选后语义向量;以及Obtaining the heat value of each expert in the expert list, sorting the semantic vectors in descending order according to the heat value of the experts to obtain the sorted semantic vector, and obtaining the semantics in the sorted semantic vector ranked before the preset first ranking value Vector to get the filtered semantic vector; and
    获取所述筛选后语义向量中各语义向量在回复答案库中对应的回答内容以得到专家知识推荐信息,将所述专家知识推荐信息发送至与所述待答复咨询问题对应的上传端。Obtain corresponding response content of each semantic vector in the filtered semantic vector in the reply answer library to obtain expert knowledge recommendation information, and send the expert knowledge recommendation information to the uploading end corresponding to the consultation question to be answered.
  2. 根据权利要求1所述的专家知识推荐方法,其中,所述将所述待答复咨询问题进行分词和关键词抽取,得到与所述待答复咨询问题对应的语义网络向量,包括:The expert knowledge recommendation method according to claim 1, wherein the word segmentation and keyword extraction of the consultation question to be answered to obtain a semantic network vector corresponding to the consultation question to be answered include:
    将所述待答复咨询问题通过基于概率统计分词模型进行分词,得到与所述待答复咨询问题对应的分词结果;Segmenting the consultation question to be answered by a probability-based word segmentation model to obtain a word segmentation result corresponding to the consultation question to be answered;
    通过词频-逆文本频率指数模型,抽取所述分词结果中位于预设的第二排名值之前的关键词信息以作为目标关键词集合;Using the word frequency-inverse text frequency index model, extract the keyword information in the word segmentation result before the preset second ranking value as the target keyword set;
    获取所述目标关键词集合中每一关键词信息对应的目标词向量;Acquiring a target word vector corresponding to each keyword information in the target keyword set;
    根据每一目标词向量,及每一目标词向量对应的权重,获取与待答复咨询问题对应的语义网络向量。According to each target word vector and the weight corresponding to each target word vector, a semantic network vector corresponding to the consultation question to be answered is obtained.
  3. 根据权利要求2所述的专家知识推荐方法,其中,所述将所述语义网络向量与预先构建的回复答案库中所包括的语义向量进行相似度计算,得到回复答案库中与所述语义网络向量之间相似度大于预设相似度阈值的语义向量,以作为目标语义向量,包括:The expert knowledge recommendation method according to claim 2, wherein the similarity calculation is performed between the semantic network vector and the semantic vector included in the pre-built reply answer library to obtain the reply network and the semantic network The semantic vector whose similarity between the vectors is greater than the preset similarity threshold is used as the target semantic vector, including:
    获取所述语义网络向量对应的目标关键词集合;Acquiring the target keyword set corresponding to the semantic network vector;
    将所述目标关键词集合与所述回复答案库中的关键词组合进行比对,获取所述回复答案库中包括所述目标关键词集合的关键词组合,以得到关键词匹配结果;Comparing the target keyword set with the keyword combination in the reply answer library, and acquiring the keyword combination including the target keyword set in the reply answer library to obtain a keyword matching result;
    计算关键词匹配结果中每一关键词组合对应的语义向量与所述语义网络向量之间夹角的余弦值,以得到所述关键词匹配结果中每一关键词组合与所述语义网络向量之间的相似度,以作为相似度集合;Calculating the cosine of the angle between the semantic vector corresponding to each keyword combination in the keyword matching result and the semantic network vector to obtain the relationship between each keyword combination and the semantic network vector in the keyword matching result Between the similarities, as a set of similarities;
    获取所述相似度集合中大于所述相似度阈值的相似度,以得到目标相似度集合;Obtaining a similarity in the similarity set that is greater than the similarity threshold to obtain a target similarity set;
    获取所述目标相似度集合中每一相似度对应的语义向量,以作为目标语义向量。Acquire a semantic vector corresponding to each similarity in the target similarity set as a target semantic vector.
  4. 根据权利要求1所述的专家知识推荐方法,其中,所述获取所述专家列表中每一专家的热度值,包括:根据专家列表中每一专家的文章被引用累计总次数,以对应得到每一专家的热度值。The expert knowledge recommendation method according to claim 1, wherein the obtaining the heat value of each expert in the expert list comprises: accumulating the total number of times that the articles of each expert in the expert list are cited to obtain the corresponding An expert's heat value.
  5. 根据权利要求1所述的专家知识推荐方法,其中,所述获取所述专家列表中每一专家的热度值,包括:The expert knowledge recommendation method according to claim 1, wherein the acquiring the heat value of each expert in the expert list includes:
    根据预设的引用值模型获取所述专家列表中每一专家的文章被引用的引用值之和,以得到与所述专家列表中每一专家对应的热度值;其中,所述专家列表为
    Figure PCTCN2019092507-appb-100001
    其中value k表示所述专家列表中专家k的热度值,其他专家i与所述专家列表中专家k之间的引用值为
    Figure PCTCN2019092507-appb-100002
    所述专家列表中专家k的文章发表时间是T 0,其他专家i引用所述专家列表中专家k的文章的引用时间为T,λ为预设的调节参数。
    Obtaining the sum of the cited values of the articles of each expert in the expert list according to a preset reference value model to obtain the heat value corresponding to each expert in the expert list; wherein, the expert list is
    Figure PCTCN2019092507-appb-100001
    Where value k represents the heat value of expert k in the expert list, and the reference value between other experts i and expert k in the expert list is
    Figure PCTCN2019092507-appb-100002
    The publication time of the article of expert k in the expert list is T 0 , the citation time of the article of expert k in the expert list cited by other experts i is T, and λ is the preset adjustment parameter.
  6. 根据权利要求2所述的专家知识推荐方法,其中,所述通过词频-逆文本频率指数模型,抽取所述分词结果中位于预设的第二排名值之前的关键词信息以作为目标关键词集合,包括:The expert knowledge recommendation method according to claim 2, wherein the keyword information before the preset second ranking value in the word segmentation result is extracted as the target keyword set by the word frequency-inverse text frequency index model ,include:
    计算分词结果中每一分词的词频;Calculate the word frequency of each participle in the word segmentation result;
    计算分词结果中每一分词的逆文档频率;Calculate the inverse document frequency of each word segmentation in the word segmentation result;
    根据词频*逆文档频率计算分词结果中每一分词对应的词频-逆文本频率指数;Calculate the word frequency-inverse text frequency index corresponding to each participle in the word segmentation result according to the word frequency*inverse document frequency;
    将分词结果中每一分词对应的词频-逆文本频率指数按降序排序,取排名位于预设的第二排名值之前的分词组成与所述待答复咨询问题对应的目标关键词集合。The word frequency-inverse text frequency index corresponding to each word segmentation in the word segmentation result is sorted in descending order, and the word segmentation ranked before the preset second ranking value is used to form a target keyword set corresponding to the query question to be answered.
  7. 根据权利要求5所述的专家知识推荐方法,其中,所述根据预设的引用值模型获取所述专家列表中每一专家的文章被引用的引用值之和,以得到与所述专家列表中每一专家对应的热度值之前,还包括:The expert knowledge recommendation method according to claim 5, wherein the acquisition of the sum of citation values of the articles of each expert in the expert list is obtained according to a preset reference value model to obtain the expert list Before the heat value corresponding to each expert, it also includes:
    构建专家的有向社交网络结构;其中,有向社交网络结构中主体是各专家名称,有向的边值为专家之间带有时间衰退因子的引用值;专家列表中专家k的文章发表时间是T 0,其他专家i引用所述专家列表中专家k的文章的引用时间为T,其他专家i与所述专家列表中专家k之间的引用值为
    Figure PCTCN2019092507-appb-100003
    λ为预设的调节参数。
    Construct an expert's directed social network structure; where the subject is the name of each expert in the directed social network structure, and the directed boundary value is the reference value with a time decay factor between experts; the time of publication of expert k's article in the expert list Is T 0 , the citation time of the article of other expert i citing expert k in the expert list is T, and the reference value between other expert i and expert k in the expert list is
    Figure PCTCN2019092507-appb-100003
    λ is the preset adjustment parameter.
  8. 一种专家知识推荐装置,其中,包括:An expert knowledge recommendation device, which includes:
    咨询问题获取单元,用于接收所上传的待答复咨询问题,将所述待答复咨询问题进行分词和关键词抽取,得到与所述待答复咨询问题对应的语义网络向量;The consultation question obtaining unit is configured to receive the uploaded consultation questions to be answered, and perform word segmentation and keyword extraction on the consultation questions to be answered, to obtain a semantic network vector corresponding to the consultation questions to be answered;
    目标语义向量获取单元,用于将所述语义网络向量与预先构建的回复答案库中所包括的语义向量进行相似度计算,得到回复答案库中与所述语义网络向量之间相似度大于预设相似度阈值的语义向量,以作为目标语义向量;A target semantic vector acquiring unit, configured to calculate the similarity between the semantic network vector and the semantic vector included in the pre-built reply answer library to obtain a similarity between the reply answer library and the semantic network vector greater than a preset The semantic vector of the similarity threshold is used as the target semantic vector;
    专家列表获取单元,用于获取所述目标语义向量对应的专家列表;An expert list obtaining unit, configured to obtain an expert list corresponding to the target semantic vector;
    排序单元,用于获取所述专家列表中每一专家的热度值,根据专家的热度值对语义向量进行降序排序得到排序后语义向量,获取所述排序后语义向量中排名位于预设的第一排名值之前的语义向量,以得到筛选后语义向量;以及The sorting unit is used to obtain the heat value of each expert in the expert list, sort the semantic vectors in descending order according to the heat value of the experts to obtain the sorted semantic vector, and obtain the ranked first position in the semantic vector after sorting The semantic vector before the ranking value to get the filtered semantic vector; and
    咨询回复单元,用于获取所述筛选后语义向量中各语义向量在回复答案库中对应的回答内容以得到专家知识推荐信息,将所述专家知识推荐信息发送至与所述待答复咨询问题对应的上传端。The consultation reply unit is used to obtain the corresponding response content of each semantic vector in the filtered semantic vector in the reply answer database to obtain expert knowledge recommendation information, and send the expert knowledge recommendation information to correspond to the consultation question to be answered Uploader.
  9. 根据权利要求8所述的专家知识推荐装置,其中,所述咨询问题获取单元,包括:The expert knowledge recommendation device according to claim 8, wherein the consultation question acquisition unit includes:
    分词单元,用于将所述待答复咨询问题通过基于概率统计分词模型进行分 词,得到与所述待答复咨询问题对应的分词结果;The word segmentation unit is used to segment the consultation question to be answered based on a probability statistical word segmentation model to obtain a word segmentation result corresponding to the consultation question to be answered;
    关键词抽取单元,用于通过词频-逆文本频率指数模型,抽取所述分词结果中位于预设的第二排名值之前的关键词信息以作为目标关键词集合;The keyword extraction unit is used to extract keyword information before the preset second ranking value in the word segmentation result through the word frequency-inverse text frequency index model as the target keyword set;
    目标词向量获取单元,用于获取所述目标关键词集合中每一关键词信息对应的目标词向量;A target word vector acquiring unit, configured to acquire a target word vector corresponding to each keyword information in the target keyword set;
    语义网络向量获取单元,用于根据每一目标词向量,及每一目标词向量对应的权重,获取与待答复咨询问题对应的语义网络向量。The semantic network vector acquisition unit is used to acquire the semantic network vector corresponding to the question to be answered according to each target word vector and the weight corresponding to each target word vector.
  10. 根据权利要求9所述的专家知识推荐装置,其中,所述目标语义向量获取单元,包括:The expert knowledge recommendation device according to claim 9, wherein the target semantic vector acquisition unit includes:
    目标关键词集合获取单元,用于获取所述语义网络向量对应的目标关键词集合;A target keyword set acquisition unit, configured to acquire a target keyword set corresponding to the semantic network vector;
    关键词比较单元,用于将所述目标关键词集合与所述回复答案库中的关键词组合进行比对,获取所述回复答案库中包括所述目标关键词集合的关键词组合,以得到关键词匹配结果;The keyword comparison unit is used to compare the target keyword set with the keyword combination in the reply answer library, and obtain the keyword combination including the target keyword set in the reply answer library to obtain Keyword matching results;
    相似度集合获取单元,用于计算关键词匹配结果中每一关键词组合对应的语义向量与所述语义网络向量之间夹角的余弦值,以得到所述关键词匹配结果中每一关键词组合与所述语义网络向量之间的相似度,以作为相似度集合;The similarity set acquisition unit is used to calculate the cosine value of the angle between the semantic vector corresponding to each keyword combination in the keyword matching result and the semantic network vector to obtain each keyword in the keyword matching result Combining the similarity between the semantic network vector as a similarity set;
    目标相似度集合获取单元,用于获取所述相似度集合中大于所述相似度阈值的相似度,以得到目标相似度集合;A target similarity set acquisition unit, configured to acquire a similarity greater than the similarity threshold in the similarity set to obtain a target similarity set;
    目标相似度集合解析单元,用于获取所述目标相似度集合中每一相似度对应的语义向量,以作为目标语义向量。The target similarity set parsing unit is used to obtain a semantic vector corresponding to each similarity in the target similarity set as a target semantic vector.
  11. 一种计算机设备,包括存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,其中,所述处理器执行所述计算机程序时实现以下步骤:A computer device includes a memory, a processor, and a computer program stored on the memory and executable on the processor, where the processor implements the following steps when executing the computer program:
    接收所上传的待答复咨询问题,将所述待答复咨询问题进行分词和关键词抽取,得到与所述待答复咨询问题对应的语义网络向量;Receiving the uploaded consultation questions to be answered, extracting the word consultation and keyword extraction of the consultation questions to be answered, and obtaining a semantic network vector corresponding to the consultation questions to be answered;
    将所述语义网络向量与预先构建的回复答案库中所包括的语义向量进行相似度计算,得到回复答案库中与所述语义网络向量之间相似度大于预设相似度阈值的语义向量,以作为目标语义向量;Calculating the similarity between the semantic network vector and the semantic vector included in the pre-built reply answer library to obtain a semantic vector whose similarity between the reply answer library and the semantic network vector is greater than a preset similarity threshold, to As the target semantic vector;
    获取所述目标语义向量对应的专家列表;Obtaining an expert list corresponding to the target semantic vector;
    获取所述专家列表中每一专家的热度值,根据专家的热度值对语义向量进行降序排序得到排序后语义向量,获取所述排序后语义向量中排名位于预设的第一排名值之前的语义向量,以得到筛选后语义向量;以及Obtaining the heat value of each expert in the expert list, sorting the semantic vectors in descending order according to the heat value of the experts to obtain the sorted semantic vector, and obtaining the semantics in the sorted semantic vector ranked before the preset first ranking value Vector to get the filtered semantic vector; and
    获取所述筛选后语义向量中各语义向量在回复答案库中对应的回答内容以得到专家知识推荐信息,将所述专家知识推荐信息发送至与所述待答复咨询问题对应的上传端。Obtain corresponding response content of each semantic vector in the filtered semantic vector in the reply answer library to obtain expert knowledge recommendation information, and send the expert knowledge recommendation information to the uploading end corresponding to the consultation question to be answered.
  12. 根据权利要求11所述的计算机设备,其中,所述将所述待答复咨询问题进行分词和关键词抽取,得到与所述待答复咨询问题对应的语义网络向量,包括:The computer device according to claim 11, wherein the word segmentation and keyword extraction of the consultation question to be answered to obtain a semantic network vector corresponding to the consultation question to be answered include:
    将所述待答复咨询问题通过基于概率统计分词模型进行分词,得到与所述待答复咨询问题对应的分词结果;Segmenting the consultation question to be answered by a probability-based word segmentation model to obtain a word segmentation result corresponding to the consultation question to be answered;
    通过词频-逆文本频率指数模型,抽取所述分词结果中位于预设的第二排名值之前的关键词信息以作为目标关键词集合;Using the word frequency-inverse text frequency index model, extract the keyword information in the word segmentation result before the preset second ranking value as the target keyword set;
    获取所述目标关键词集合中每一关键词信息对应的目标词向量;Acquiring a target word vector corresponding to each keyword information in the target keyword set;
    根据每一目标词向量,及每一目标词向量对应的权重,获取与待答复咨询问题对应的语义网络向量。According to each target word vector and the weight corresponding to each target word vector, a semantic network vector corresponding to the consultation question to be answered is obtained.
  13. 根据权利要求12所述的计算机设备,其中,所述将所述语义网络向量与预先构建的回复答案库中所包括的语义向量进行相似度计算,得到回复答案库中与所述语义网络向量之间相似度大于预设相似度阈值的语义向量,以作为目标语义向量,包括:The computer device according to claim 12, wherein the similarity calculation is performed between the semantic network vector and a semantic vector included in a pre-built reply answer library to obtain a relationship between the semantic network vector and the semantic network vector The semantic vector whose inter-similarity is greater than the preset similarity threshold is used as the target semantic vector, including:
    获取所述语义网络向量对应的目标关键词集合;Acquiring the target keyword set corresponding to the semantic network vector;
    将所述目标关键词集合与所述回复答案库中的关键词组合进行比对,获取所述回复答案库中包括所述目标关键词集合的关键词组合,以得到关键词匹配结果;Comparing the target keyword set with the keyword combination in the reply answer library, and acquiring the keyword combination including the target keyword set in the reply answer library to obtain a keyword matching result;
    计算关键词匹配结果中每一关键词组合对应的语义向量与所述语义网络向量之间夹角的余弦值,以得到所述关键词匹配结果中每一关键词组合与所述语义网络向量之间的相似度,以作为相似度集合;Calculating the cosine of the angle between the semantic vector corresponding to each keyword combination in the keyword matching result and the semantic network vector to obtain the relationship between each keyword combination and the semantic network vector in the keyword matching result Between the similarities, as a set of similarities;
    获取所述相似度集合中大于所述相似度阈值的相似度,以得到目标相似度集合;Obtaining a similarity in the similarity set that is greater than the similarity threshold to obtain a target similarity set;
    获取所述目标相似度集合中每一相似度对应的语义向量,以作为目标语义 向量。The semantic vector corresponding to each similarity in the target similarity set is obtained as the target semantic vector.
  14. 根据权利要求11所述的计算机设备,其中,所述获取所述专家列表中每一专家的热度值,包括:根据专家列表中每一专家的文章被引用累计总次数,以对应得到每一专家的热度值。The computer device according to claim 11, wherein the obtaining the heat value of each expert in the expert list comprises: accumulating a total number of times that the articles of each expert in the expert list are cited to obtain each expert correspondingly 'S heat value.
  15. 根据权利要求11所述的计算机设备,其中,所述获取所述专家列表中每一专家的热度值,包括:The computer device according to claim 11, wherein the acquiring the heat value of each expert in the expert list comprises:
    根据预设的引用值模型获取所述专家列表中每一专家的文章被引用的引用值之和,以得到与所述专家列表中每一专家对应的热度值;其中,所述专家列表为
    Figure PCTCN2019092507-appb-100004
    其中value k表示所述专家列表中专家k的热度值,其他专家i与所述专家列表中专家k之间的引用值为
    Figure PCTCN2019092507-appb-100005
    所述专家列表中专家k的文章发表时间是T 0,其他专家i引用所述专家列表中专家k的文章的引用时间为T,λ为预设的调节参数。
    Obtaining the sum of the cited values of the articles of each expert in the expert list according to a preset reference value model to obtain the heat value corresponding to each expert in the expert list; wherein, the expert list is
    Figure PCTCN2019092507-appb-100004
    Where value k represents the heat value of expert k in the expert list, and the reference value between other experts i and expert k in the expert list is
    Figure PCTCN2019092507-appb-100005
    The publication time of the article of expert k in the expert list is T 0 , the citation time of the article of expert k in the expert list cited by other experts i is T, and λ is the preset adjustment parameter.
  16. 根据权利要求12所述的计算机设备,其中,所述通过词频-逆文本频率指数模型,抽取所述分词结果中位于预设的第二排名值之前的关键词信息以作为目标关键词集合,包括:The computer device according to claim 12, wherein the keyword information before the preset second ranking value in the word segmentation result is extracted by the word frequency-inverse text frequency index model as the target keyword set, including :
    计算分词结果中每一分词的词频;Calculate the word frequency of each participle in the word segmentation result;
    计算分词结果中每一分词的逆文档频率;Calculate the inverse document frequency of each word segmentation in the word segmentation result;
    根据词频*逆文档频率计算分词结果中每一分词对应的词频-逆文本频率指数;Calculate the word frequency-inverse text frequency index corresponding to each participle in the word segmentation result according to the word frequency*inverse document frequency;
    将分词结果中每一分词对应的词频-逆文本频率指数按降序排序,取排名位于预设的第二排名值之前的分词组成与所述待答复咨询问题对应的目标关键词集合。The word frequency-inverse text frequency index corresponding to each word segmentation in the word segmentation result is sorted in descending order, and the word segmentation ranked before the preset second ranking value is used to form a target keyword set corresponding to the query question to be answered.
  17. 根据权利要求15所述的计算机设备,其中,所述根据预设的引用值模型获取所述专家列表中每一专家的文章被引用的引用值之和,以得到与所述专家列表中每一专家对应的热度值之前,还包括:The computer device according to claim 15, wherein the acquisition of the sum of citation values of the articles of each expert in the expert list is obtained according to a preset reference value model to obtain Before the heat value corresponding to the expert, it also includes:
    构建专家的有向社交网络结构;其中,有向社交网络结构中主体是各专家名称,有向的边值为专家之间带有时间衰退因子的引用值;专家列表中专家k 的文章发表时间是T 0,其他专家i引用所述专家列表中专家k的文章的引用时间为T,其他专家i与所述专家列表中专家k之间的引用值为
    Figure PCTCN2019092507-appb-100006
    λ为预设的调节参数。
    Construct an expert's directed social network structure; where the subject is the name of each expert in the directed social network structure, and the directed boundary value is the reference value with time decay factor between experts; the publication time of the expert k's article in the expert list Is T 0 , the citation time of the article of other expert i citing expert k in the expert list is T, and the reference value between other expert i and expert k in the expert list is
    Figure PCTCN2019092507-appb-100006
    λ is the preset adjustment parameter.
  18. 一种计算机可读存储介质,其中,所述计算机可读存储介质存储有计算机程序,所述计算机程序当被处理器执行时使所述处理器执行以下操作:A computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, which when executed by a processor causes the processor to perform the following operations:
    接收所上传的待答复咨询问题,将所述待答复咨询问题进行分词和关键词抽取,得到与所述待答复咨询问题对应的语义网络向量;Receiving the uploaded consultation questions to be answered, extracting the word consultation and keyword extraction of the consultation questions to be answered, and obtaining a semantic network vector corresponding to the consultation questions to be answered;
    将所述语义网络向量与预先构建的回复答案库中所包括的语义向量进行相似度计算,得到回复答案库中与所述语义网络向量之间相似度大于预设相似度阈值的语义向量,以作为目标语义向量;Calculating the similarity between the semantic network vector and the semantic vector included in the pre-built reply answer library to obtain a semantic vector whose similarity between the reply answer library and the semantic network vector is greater than a preset similarity threshold, to As the target semantic vector;
    获取所述目标语义向量对应的专家列表;Obtaining an expert list corresponding to the target semantic vector;
    获取所述专家列表中每一专家的热度值,根据专家的热度值对语义向量进行降序排序得到排序后语义向量,获取所述排序后语义向量中排名位于预设的第一排名值之前的语义向量,以得到筛选后语义向量;以及Obtaining the heat value of each expert in the expert list, sorting the semantic vectors in descending order according to the heat value of the experts to obtain the sorted semantic vector, and obtaining the semantics in the sorted semantic vector ranked before the preset first ranking value Vector to get the filtered semantic vector; and
    获取所述筛选后语义向量中各语义向量在回复答案库中对应的回答内容以得到专家知识推荐信息,将所述专家知识推荐信息发送至与所述待答复咨询问题对应的上传端。Obtain corresponding response content of each semantic vector in the filtered semantic vector in the reply answer library to obtain expert knowledge recommendation information, and send the expert knowledge recommendation information to the uploading end corresponding to the consultation question to be answered.
  19. 根据权利要求18所述的计算机可读存储介质,其中,所述将所述待答复咨询问题进行分词和关键词抽取,得到与所述待答复咨询问题对应的语义网络向量,包括:The computer readable storage medium according to claim 18, wherein the word segmentation and keyword extraction of the consultation question to be answered to obtain a semantic network vector corresponding to the consultation question to be answered include:
    将所述待答复咨询问题通过基于概率统计分词模型进行分词,得到与所述待答复咨询问题对应的分词结果;Segmenting the consultation question to be answered by a probability-based word segmentation model to obtain a word segmentation result corresponding to the consultation question to be answered;
    通过词频-逆文本频率指数模型,抽取所述分词结果中位于预设的第二排名值之前的关键词信息以作为目标关键词集合;Using the word frequency-inverse text frequency index model, extract the keyword information in the word segmentation result before the preset second ranking value as the target keyword set;
    获取所述目标关键词集合中每一关键词信息对应的目标词向量;Acquiring a target word vector corresponding to each keyword information in the target keyword set;
    根据每一目标词向量,及每一目标词向量对应的权重,获取与待答复咨询问题对应的语义网络向量。According to each target word vector and the weight corresponding to each target word vector, a semantic network vector corresponding to the consultation question to be answered is obtained.
  20. 根据权利要求19所述的计算机可读存储介质,其中,所述将所述语义网络向量与预先构建的回复答案库中所包括的语义向量进行相似度计算,得到回复答案库中与所述语义网络向量之间相似度大于预设相似度阈值的语义向量, 以作为目标语义向量,包括:The computer-readable storage medium according to claim 19, wherein the similarity calculation is performed on the semantic network vector and the semantic vector included in the pre-built reply answer database to obtain the semantics in the reply answer database A semantic vector whose similarity between network vectors is greater than a preset similarity threshold is used as the target semantic vector, including:
    获取所述语义网络向量对应的目标关键词集合;Acquiring the target keyword set corresponding to the semantic network vector;
    将所述目标关键词集合与所述回复答案库中的关键词组合进行比对,获取所述回复答案库中包括所述目标关键词集合的关键词组合,以得到关键词匹配结果;Comparing the target keyword set with the keyword combination in the reply answer library, and acquiring the keyword combination including the target keyword set in the reply answer library to obtain a keyword matching result;
    计算关键词匹配结果中每一关键词组合对应的语义向量与所述语义网络向量之间夹角的余弦值,以得到所述关键词匹配结果中每一关键词组合与所述语义网络向量之间的相似度,以作为相似度集合;Calculating the cosine of the angle between the semantic vector corresponding to each keyword combination in the keyword matching result and the semantic network vector to obtain the relationship between each keyword combination and the semantic network vector in the keyword matching result Between the similarities, as a set of similarities;
    获取所述相似度集合中大于所述相似度阈值的相似度,以得到目标相似度集合;Obtaining a similarity in the similarity set that is greater than the similarity threshold to obtain a target similarity set;
    获取所述目标相似度集合中每一相似度对应的语义向量,以作为目标语义向量。Acquire a semantic vector corresponding to each similarity in the target similarity set as a target semantic vector.
PCT/CN2019/092507 2018-12-11 2019-06-24 Expert knowledge recommendation method and apparatus, computer device, and storage medium WO2020119063A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201811510416.2A CN109325132A (en) 2018-12-11 2018-12-11 Expertise recommended method, device, computer equipment and storage medium
CN201811510416.2 2018-12-11

Publications (1)

Publication Number Publication Date
WO2020119063A1 true WO2020119063A1 (en) 2020-06-18

Family

ID=65256609

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/092507 WO2020119063A1 (en) 2018-12-11 2019-06-24 Expert knowledge recommendation method and apparatus, computer device, and storage medium

Country Status (2)

Country Link
CN (1) CN109325132A (en)
WO (1) WO2020119063A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112100464A (en) * 2020-10-14 2020-12-18 济南大学 Question-answering community expert recommendation method and system combining dynamic interest and professional knowledge

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109325132A (en) * 2018-12-11 2019-02-12 平安科技(深圳)有限公司 Expertise recommended method, device, computer equipment and storage medium
CN109783516A (en) * 2019-02-19 2019-05-21 北京奇艺世纪科技有限公司 A kind of query statement retrieval answering method and device
CN111723231B (en) * 2019-03-20 2023-10-17 北京百舸飞驰科技有限公司 Question prediction method and device
CN110175224B (en) * 2019-06-03 2022-09-30 安徽大学 Semantic link heterogeneous information network embedding-based patent recommendation method and device
CN110347823A (en) * 2019-06-06 2019-10-18 平安科技(深圳)有限公司 Voice-based user classification method, device, computer equipment and storage medium
CN112650829A (en) * 2019-10-11 2021-04-13 阿里巴巴集团控股有限公司 Customer service processing method and device
CN111259041A (en) * 2020-02-26 2020-06-09 山东理工大学 Scientific and technological expert resource virtualization and semantic reasoning retrieval method
CN111309270B (en) * 2020-03-13 2021-04-27 清华大学 Persistent memory key value storage system
CN111611387B (en) * 2020-05-28 2023-07-21 深圳市华云中盛科技股份有限公司 Civil case consultation method and device, computer equipment and storage medium
CN111813898A (en) * 2020-08-28 2020-10-23 北京智源人工智能研究院 Expert recommendation method, device and equipment based on semantic search and storage medium
CN112163075A (en) * 2020-09-27 2021-01-01 北京乐学帮网络技术有限公司 Information recommendation method and device, computer equipment and storage medium
CN113032578B (en) * 2021-03-23 2022-12-06 平安科技(深圳)有限公司 Information pushing method and device based on hotspot event and computer equipment
CN113506639A (en) * 2021-03-23 2021-10-15 崔剑虹 Intelligent medical theme interaction method and system based on big data
CN113032530B (en) * 2021-04-26 2022-05-27 朗动信息咨询(上海)有限公司 Big data acquisition and analysis-based consultation service system
CN115809755B (en) * 2023-02-02 2023-06-27 广东工业大学 Carbon emission accounting method, equipment and storage medium based on semantic recognition
CN116739003A (en) * 2023-06-01 2023-09-12 中国南方电网有限责任公司 Intelligent question-answering implementation method and device for power grid management, electronic equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102184225A (en) * 2011-05-09 2011-09-14 北京奥米时代生物技术有限公司 Method for searching preferred expert information in question-answering system
CN102479202A (en) * 2010-11-26 2012-05-30 卓望数码技术(深圳)有限公司 Recommendation system based on domain expert
CN107609096A (en) * 2017-09-11 2018-01-19 武汉科技大学 A kind of intelligent lawyer's expert responses method
CN108153876A (en) * 2017-12-26 2018-06-12 爱因互动科技发展(北京)有限公司 Intelligent answer method and system
US20180181673A1 (en) * 2016-12-28 2018-06-28 Beijing Baidu Netcom Science And Technology Co., Ltd. Answer searching method and device based on deep question and answer
CN109325132A (en) * 2018-12-11 2019-02-12 平安科技(深圳)有限公司 Expertise recommended method, device, computer equipment and storage medium

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106095872A (en) * 2016-06-07 2016-11-09 北京高地信息技术有限公司 Answer sort method and device for Intelligent Answer System
CN107980130A (en) * 2017-11-02 2018-05-01 深圳前海达闼云端智能科技有限公司 It is automatic to answer method, apparatus, storage medium and electronic equipment
CN108038209A (en) * 2017-12-18 2018-05-15 深圳前海微众银行股份有限公司 Answer system of selection, device and computer-readable recording medium

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102479202A (en) * 2010-11-26 2012-05-30 卓望数码技术(深圳)有限公司 Recommendation system based on domain expert
CN102184225A (en) * 2011-05-09 2011-09-14 北京奥米时代生物技术有限公司 Method for searching preferred expert information in question-answering system
US20180181673A1 (en) * 2016-12-28 2018-06-28 Beijing Baidu Netcom Science And Technology Co., Ltd. Answer searching method and device based on deep question and answer
CN107609096A (en) * 2017-09-11 2018-01-19 武汉科技大学 A kind of intelligent lawyer's expert responses method
CN108153876A (en) * 2017-12-26 2018-06-12 爱因互动科技发展(北京)有限公司 Intelligent answer method and system
CN109325132A (en) * 2018-12-11 2019-02-12 平安科技(深圳)有限公司 Expertise recommended method, device, computer equipment and storage medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112100464A (en) * 2020-10-14 2020-12-18 济南大学 Question-answering community expert recommendation method and system combining dynamic interest and professional knowledge
CN112100464B (en) * 2020-10-14 2022-09-02 济南大学 Question-answering community expert recommendation method and system combining dynamic interest and professional knowledge

Also Published As

Publication number Publication date
CN109325132A (en) 2019-02-12

Similar Documents

Publication Publication Date Title
WO2020119063A1 (en) Expert knowledge recommendation method and apparatus, computer device, and storage medium
WO2020073534A1 (en) Pushing method and apparatus based on re-clustering, and computer device and storage medium
US20210209182A1 (en) Systems and methods for improved web searching
US10970346B2 (en) Systems and methods for paragraph-based document searching
WO2019091026A1 (en) Knowledge base document rapid search method, application server, and computer readable storage medium
US20100241647A1 (en) Context-Aware Query Recommendations
WO2011037603A1 (en) Searching for information based on generic attributes of the query
CN112035599B (en) Query method and device based on vertical search, computer equipment and storage medium
Nguyen et al. A math-aware search engine for math question answering system
CN112100396B (en) Data processing method and device
CN110990533A (en) Method and device for determining standard text corresponding to query text
US10482390B2 (en) Information discovery system
WO2021226840A1 (en) Hot news intention recognition method, apparatus and device and readable storage medium
CN112632261A (en) Intelligent question and answer method, device, equipment and storage medium
WO2021051934A1 (en) Method and apparatus for extracting key contract term on basis of artificial intelligence, and storage medium
Bounabi et al. A comparison of text classification methods using different stemming techniques
CN111325033B (en) Entity identification method, entity identification device, electronic equipment and computer readable storage medium
TW202001621A (en) Corpus generating method and apparatus, and human-machine interaction processing method and apparatus
CN111339424B (en) Method, device and equipment for searching based on keywords and storage medium
WO2013107031A1 (en) Method, device and system for determining video quality parameter based on comment
US9552415B2 (en) Category classification processing device and method
US20120059786A1 (en) Method and an apparatus for matching data network resources
CN105653553B (en) Word weight generation method and device
WO2021000400A1 (en) Hospital guide similar problem pair generation method and system, and computer device
CN108810640B (en) Television program recommendation method

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19895599

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19895599

Country of ref document: EP

Kind code of ref document: A1