CN108304587B

CN108304587B - Community question-answering platform answer sorting method

Info

Publication number: CN108304587B
Application number: CN201810186972.2A
Authority: CN
Inventors: 陈恩红; 刘淇; 金斌斌; 赵洪科; 童世炜
Original assignee: University of Science and Technology of China USTC
Current assignee: University of Science and Technology of China USTC
Priority date: 2018-03-07
Filing date: 2018-03-07
Publication date: 2020-10-27
Anticipated expiration: 2038-03-07
Also published as: CN108304587A

Abstract

The invention discloses a community question-answering platform answer sorting method, which can solve the answer sorting problem by fully utilizing rich metadata (such as a theme, a timestamp and text content of question answers) of the question and the answer; meanwhile, the answer ordering question is answered by using an enhanced attention-based recurrent neural network (EARNN) model, and more information is used compared with the traditional model. The results of the prediction are improved to some extent by a plurality of evaluation indexes.

Description

Community question-answering platform answer sorting method

Technical Field

The invention relates to the field of machine learning and question-answering systems, in particular to a community question-answering platform answer sequencing method.

Background

The community question-answering platform, such as hundredth knowledge, dog search question and the like, provides an online question-answering platform for Internet users to help people quickly obtain high-quality answers to daily or professional questions. As community questions and answers become more popular, questions present in the platform gradually emerge, one of which is that answers are of varying quality, and the experience of users on the platform is greatly affected by the low-quality answers. Although these platforms have proposed the mechanism of "like" at present, for those new questions and answers, the number of like is still unstable due to the short time of occurrence, and the quality of answer is not reflected. Therefore, how to effectively measure the quality of the answer is a research problem which needs to be solved urgently by the community question-answering platform.

Around this research question, researchers have proposed a number of ways, of which "answer ranking" is an effective way to help users quickly pick out high quality answers from a range of answers of varying quality. Relevant research mainly focuses on lexical, syntactic or semantic matching between questions and answers, and ignores the positive influence of metadata information such as topics and timestamps on answer sorting questions.

Disclosure of Invention

The invention aims to provide a community question-answering platform answer sorting method, which can solve answer sorting problems by fully utilizing rich metadata (such as topics, timestamps and text contents of question answers) of questions and answers.

The purpose of the invention is realized by the following technical scheme:

a community question-answering platform answer sorting method comprises the following steps:

crawling a certain amount of data from a community question and answer platform website, wherein the crawled data for a question comprises the following data: the text content of the question, the subject to which the question belongs, the text content of a series of answers corresponding to the question, the timestamp of each answer and the number of praise for each answer;

constructing an enhanced attention mechanism recurrent neural network model based on the text content of each crawled question, the topic to which the question belongs and the text content of a series of answers corresponding to the question, and then combining the time stamp of each answer to perform ranking score of the quality of the relevant answer; training an enhanced attention mechanism recurrent neural network model by combining the ranking scores with a preset time-sensitive objective function and using a problem-dependent pair-wise training strategy;

for a new question and a series of answers thereof, a series of examples are constructed by utilizing the text content of the new question, the subject to which the new question belongs, the text content of a series of answers corresponding to the new question and the timestamp of each answer, and are sequentially input into the trained enhanced attention mechanism recurrent neural network model, so that a series of ranking scores are obtained, and corresponding answers are ranked from front to back according to the ranking scores.

According to the technical scheme provided by the invention, the answer sorting problem is carried out by using an enhanced attention-machine recurrent neural network (EARNN), and compared with the traditional model, more information is used. The results of the prediction are improved to some extent by a plurality of evaluation indexes.

Drawings

In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings needed to be used in the description of the embodiments are briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on the drawings without creative efforts.

Fig. 1 is a flowchart of a community question-answering platform answer sorting method according to an embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention are clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments of the present invention without making any creative effort, shall fall within the protection scope of the present invention.

The embodiment of the invention provides a community question-answering platform answer sorting method, which mainly comprises the following steps as shown in figure 1:

step 1, crawling a certain amount of data from a community question-and-answer platform website, wherein the crawled data for a question comprises the following steps: the text content of the question, the subject to which the question belongs, the text content of a series of answers corresponding to the question, the timestamp of each answer, and the number of praise for each answer.

Step 2, constructing an enhanced attention mechanism recurrent neural network model based on the text content of each crawled question, the topic to which the question belongs and the text content of a series of answers corresponding to the question, and then combining the time stamp of each answer to perform ranking score of the quality of the relevant answer; the enhanced attention-mechanism recurrent neural network model is trained in conjunction with the ranking scores and a preset time-sensitive objective function and using a problem-dependent pair-wise training strategy.

And 3, for a new question and a series of answers thereof, constructing a series of examples by using the text content of the new question, the subject to which the new question belongs, the text content of the series of answers corresponding to the new question and the time stamp of each answer, sequentially inputting the examples into the trained enhanced attention mechanism recurrent neural network model to obtain a series of ranking scores, and ranking the corresponding answers from front to back according to the ranking scores.

For ease of understanding, the above-described process is described in detail below.

1. And (4) crawling of data.

In the embodiment of the invention, a certain amount of data is crawled from a community question-and-answer platform website, and the data crawled for a question comprises the following steps: the text content of the question, the topic to which the question belongs (possibly multiple topics), the text content of a series of answers to the question, the timestamp of each answer, and the number of praise for each answer.

2. And (4) preprocessing data.

Preprocessing the crawled data before constructing an enhanced attention mechanism cyclic neural network model to ensure the effect of the model; the pretreatment mainly comprises the following steps:

1) and removing the questions and answers with the number of words less than the set number in the text content.

In the embodiment of the invention, some questions and answers with lower quality need to be removed, and the questions and answers with the number of words smaller than the set number in the text content are generally considered to be of lower quality; illustratively, the set number here may be 10.

2) The questions and answers with the number of praise exceeding the preset range in a period of time are removed.

In the embodiment of the invention, if the wave band of the praise number exceeds the preset range in a period of time, the praise number is considered not to be stable yet, and the data are biased to the model evaluation, so that the question and the answer of which the praise number is not stable yet are removed.

3) The text content of the remaining questions and answers after the two steps is subjected to word segmentation processing, and the data of each question is changed into: the word segmentation result of the text content of the question, the subject to which the question belongs, the word segmentation result of the content of each answer, the timestamp of each answer, and the number of praise of each answer; the praise number of each answer is used for verifying the quality of the model, and the rest information is used as the input of the model for later evaluation of the quality of each answer.

3. And constructing an enhanced attention mechanism recurrent neural network model.

The method for constructing the enhanced attention mechanism recurrent neural network model comprises four parts: an input layer, a long-short term memory network layer, an attention layer and an evaluation layer.

1) An input layer: regarding an answer, the answer is considered to be composed of a plurality of sentences, each sentence being composed of a plurality of words; regarding a corresponding question, the question is considered to be composed of a sentence, and the sentence is composed of a plurality of words; regarding the topic to which the question belongs, the topic is considered to be composed of a plurality of words; using Word Embedding technology to represent words appearing in the text by a vector with a fixed length, so that each Word appearing in the text content of the question, the text content of the answer and the theme is replaced by a K-dimensional vector; the vector sequence TQ of the text content for the hypothesis problem consists of N vectors, denoted TQ ═ x₁,x₂,…,x_N},x_p∈R^KP ═ 1,2,. cndot, N; assuming that a vector sequence TA of text content of an answer consists of M sentences each consisting of D vectors, TA ═ s₁,s₂,…,s_M}，s_m＝{y_m1,y_m2,...,y_mD},y_md∈R^KM1, 2, M, D1, 2, D; let the topic TC consist of C vectors, denoted TC ═ z₁,z₂,...,z_C},z_q∈R^KQ is 1,2. N, M, D, C is not a fixed value and will vary from input instance to input instance.

2) Long-short term memory network layer: modeling the vector sequences in question and answer using two long-short term memory networks LSTM _ Q and LSTM _ A for the vector sequence TQ of the question and one answer vector sequence TA, respectively, and using the last cell vector in LSTM _ Q for initialization of the cell vector in LSTM _ A; then, a vector sequence of questions and answers passing through the long-term and short-term memory network is obtained, wherein the vector sequence is MQ and MA, and each vector in the vector sequence MQ and MA contains context semantic information.

In the embodiment of the invention, the vector sequence TQ for the problem is { x ═ x₁,x₂,...,x_NN updates the cell vector sequence c ═ c { c } in LSTM _ Q over time t ═ 1,2₁,c₂,...,c_NGet hidden vector sequence h ═ h₁,h₂,...,h_NThe calculation method is as follows:

i_t＝σ(W_xix_t+W_hih_t-1+b_i)；

f_t＝σ(W_xfx_t+W_hfh_t-1+b_f)；

c_t＝f_t·c_t-1+i_t·τ(W_xcx_t+W_hch_t-1+b_c)；

o_t＝σ(W_xox_t+W_hoh_t-1+b_o)；

h_t＝o_t·τ(c_t)；

the numerical value of the time corresponds to the sequence number of the vector in the vector series TQ of the problem one by one, that is, the first time t is 1, and the vector x with the sequence number of 1 in the TQ corresponds to₁(ii) a In particular, cell vector c₀And hidden vector h₀Initializing to a zero vector; i.e. i_t,f_t,o_tAn input gate, a forgetting gate and an output gate respectively; sigma (·), tau (·) is sigmoid (·), tanh (·) nonlinear activation function, respectively; is an element multiply operation; { W_xi,W_hi,W_xf,W_hf,W_xc,W_hc,W_xo,W_hoIs a modelParameter matrix to be optimized, { b_i,b_f,b_c,b_oIs the parameter vector to be optimized in the model.

Similarly, for each sentence in the answer, the vector sequence s_m＝{y_m1,y_m2,...,y_mDD updates the cell vector sequence c ' ═ { c ' in LSTM _ a over time t ' ═ 1,2.₁,c'₂,...,c'_DAnd obtaining an implicit vector sequence h'_m＝{h'_m1,h'_m2,...,h'_mD}. In particular, to model the relationship between questions and answers, the resulting hidden vector h'_mThe cell vector c 'may be varied depending on the question'₀＝c_NIs implicit to vector h'_m0Initialized to a zero vector. Similarly, the value at this time also corresponds to the sequence number of the vector in each sentence vector sequence.

From this layer, the corresponding number of hidden vectors h and h 'can be derived from the vector sequences TQ and TA of questions and answers'_mAlthough the number of hidden vectors and the dimension and input of the vectors remain unchanged, the hidden vectors contain context semantic information, wherein the hidden vector sequence h is also the vector sequence MQ and the hidden vector sequence h'_mI.e. the vector sequence MA.

3) Attention layer: the vector sequence MQ of the question and the vector sequence MA of an answer can be effectively interacted by using a sentence-level attention mechanism to obtain a vector of the question and a vector of the answer; or, based on sentence-level attention mechanism, a deeper word-level attention mechanism can be designed to change the topic from the vector sequence TC to a vector FC; and then fusing the vector FC of the subject, the vector sequence MQ of the question and the vector sequence MA of an answer to finally obtain a vector of the question and a vector of the answer.

In the embodiment of the invention, during specific implementation, a sentence-level attention mechanism or a word-level attention mechanism can be used for processing MQ and MA to finally obtain corresponding vectors. For the sake of distinction, in the following description, a sentence-level attention mechanism is used to obtain a vector of questions and a vector of answersCorresponding notation is FQ₁、FA₁(ii) a Obtaining the vector of the question and the corresponding vector of one answer by using the word level attention mechanism and recording the result as FQ₂、FA₂。

The following is a detailed description of the sentence-level attention mechanism and the word-level attention mechanism.

a. The process of interacting the vector sequence MQ of the question with the vector sequence MA of one answer with the sentence-level attention mechanism to obtain the vector of the question and the vector of one answer is as follows:

for the vector sequence MQ of the problem, it is expressed as a K-dimensional vector FQ using an averaging pooling (averaging) operation₁：

Wherein MQ_pA vector representing the p-th word in the sequence of vectors MQ.

For the answer vector sequence MA, firstly, each sentence in the answer vector sequence MA is represented by a K-dimensional vector by using an average pooling operation, and several semantic representations r'_mM1, 2., M; then, a distance function is used to calculate their respective attention scores α'_mM1, 2, M, and then using a weighted average to obtain a vector representation FA of the answer₁：

Wherein, MA_mdThe vector representing the d word in the m sentence in the vector sequence MA, f (-) represents the cosine similarity function.

b. Changing the theme from a vector sequence TC into a vector FC by using a word-level attention mechanism; then fusing the vector FC of the subject, the vector sequence MQ of the question and the vector sequence MA of an answer to finally obtain the vector of the question and the vector of the answer as follows:

subject TC ═ z for a given problem₁,z₂,...,z_CUsing an averaging poolThe quantization operation transforms it into a fixed-length vector FC:

after the vector FC is obtained, the word-level attention mechanism uses it to calculate a score for the attention of each word in the question and answer;

for the vector sequence MQ of the question, which consists of a series of semantically represented vectors, we use the transformation matrix W to compute the distance between the vector of each word in the question and the vector FC of the topic, and then to compute the attention score β for the p-th word in the question using the softmax operation_p(ii) a Resulting vector FQ of the problem₂Expressed as:

wherein MQ_p、MQ_iCorrespondingly, the p-th and i-th vectors in the vector sequence MQ are represented, and h (a, b) ═ a^TWb is used to calculate the distance of different space vectors, where (a, b) corresponds to the above equation (FC, MQ)_p)、(FC,MQ_i)。

For the vector sequence MA of answers, an attention score β is calculated for the vector of the d word in the m sentence of the answer using a similar method as described above_mdAnd obtaining a vector representation r of the mth sentence_m(ii) a Then using something like a'_m、FA₁To calculate the attention score alpha of the mth sentence in the word-level attention mechanism_mVector representation of sum answer FA₂：

Wherein, MA_md、MA_mlCorresponding sequence of representation vectors MAnd (B) vectors of the d and l words in the m sentence in the A.

In the embodiment of the invention, the sentence-level attention mechanism mainly utilizes the vector sequence MQ of the question and the vector sequence MA of one answer to distinguish the importance degree of each sentence in the answer, and the word-level attention mechanism captures the deep semantic relationship among the theme, the question and the answer by further utilizing the additional information of the theme TC on the basis of the sentence-level attention mechanism, and further distinguishes the importance of words and sentences in the question and the answer.

By the above method, a vector representation of the question and answer, in which FQ₁、FA₁、FQ₂、FA₂∈R^K。

4) Evaluation layer: and calculating answer deep language matching scores by combining the vectors of the questions and the vectors of one answer, and taking time effects into consideration by combining the time stamps of the answers so as to obtain the ranking scores.

In the embodiment of the invention, firstly, the vector of the question and the vector of an answer are combined to calculate the matching score of the answer deep layer language

The formula is as follows:

wherein, σ (·), τ (·) are sigmoid (·), tanh (·) nonlinear activation functions respectively;

representing a splicing operation; { W₁,W₂Is the parameter matrix to be optimized in the model, { b₁,b₂The parameter vector to be optimized in the model is used as the parameter vector; in the above formula, FQ_y、FA_yThe correlation vector in which y is 1 or 2 and represents the above calculation formula may be a calculation result of the sentence-level attention mechanism or a calculation result of the word-level attention mechanism.

Combining the time stamp T of the answerThe effect is also taken into consideration to obtain a ranking score

The formula is as follows:

wherein, T₀The timestamp representing the first answer, H is a hyper-parameter.

4. And training model parameters.

The step is mainly to train all parameter matrixes or vectors in the enhanced attention mechanism recurrent neural network model established in the previous step, including { W }_xi,W_hi,W_xf,W_hf,W_xc,W_hc,W_xo,W_ho}、{b_i,b_f,b_c,b_o}, conversion matrices W, { W₁,W₂And { b }and₁,b₂}. Specifically, the evaluation result is combined with a preset time-sensitive objective function, and an enhanced attention mechanism recurrent neural network model is trained by using a problem-dependent pair-wise training strategy.

For each question Q, two answers A are extracted from the corresponding series of answers⁺And A^-Wherein A is⁺Has a praise number greater than A^-Thus constituting a triplet (Q, A)⁺，A^-)；

A random gradient descent (SGD) algorithm is used to minimize the time-sensitive objective function:

L＝max(0,m+S(Q,A^-)-S(Q,A⁺))；

wherein, S (Q, A)⁺) And S (Q, A)^-) The corresponding ranking scores represent the two answers A + and A-in question Q.

In addition, in the training process, the whole data set can be divided into a training set and a test set according to the proportion of 4:1, the training set is used for optimizing the parameters of the model, and the test set is used for measuring the quality of the final model.

5. Predicting value of a series of answers to a new question

This step essentially predicts the value of a series of answers to a new question and ranks the answers according to the magnitude of the predicted value (i.e., the ranking score).

In the embodiment of the invention, a new question X is utilized, a theme B corresponding to the new question X, and a series of text contents A ═ A of answers₁,A₂,...,A_GAnd a timestamp T ═ T for each answer₁,T₂,...,T_GConstruction of a series of examples (X, B, A)_g,T_g) G is more than or equal to 1 and less than or equal to G; inputting the examples into a trained enhanced attention mechanism recurrent neural network model in sequence to obtain a series of ranking scores

Sorting the corresponding answers in a front-to-back manner according to the sorting scores; that is, the higher the ranking score, the higher the quality of the corresponding answer is considered, and the rank is relatively top.

According to the scheme of the embodiment of the invention, the deep semantic relation between the questions and the answers is captured by utilizing the fusion of various metadata, so that the key points of the questions and the answers can be effectively distinguished, the ordering of the answers is realized, and readers are helped to quickly find valuable and attractive answers.

Through the above description of the embodiments, it is clear to those skilled in the art that the above embodiments can be implemented by software, and can also be implemented by software plus a necessary general hardware platform. With this understanding, the technical solutions of the embodiments can be embodied in the form of a software product, which can be stored in a non-volatile storage medium (which can be a CD-ROM, a usb disk, a removable hard disk, etc.), and includes several instructions for enabling a computer device (which can be a personal computer, a server, or a network device, etc.) to execute the methods according to the embodiments of the present invention.

The above description is only for the preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present invention are included in the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims

1. A community question-answering platform answer sorting method is characterized by comprising the following steps:

for a new question and a series of answers thereof, constructing a series of examples by using the text content of the new question, the subject to which the new question belongs, the text content of a series of answers corresponding to the new question and the timestamp of each answer, sequentially inputting the examples into a trained enhanced attention mechanism recurrent neural network model so as to obtain a series of ranking scores, and ranking corresponding answers from front to back according to the ranking scores;

the method for constructing the enhanced attention mechanism recurrent neural network model comprises the following four parts: an input layer, a long-short term memory network layer, an attention layer and an evaluation layer;

an input layer: regarding an answer, the answer is considered to be composed of a plurality of sentences, each sentence being composed of a plurality of words; regarding a corresponding question, the question is considered to be composed of a sentence, and the sentence is composed of a plurality of words; for the subject to which the question belongs, the subject is considered to be composed ofA plurality of words; using Word Embedding technology to represent words appearing in the text by a vector with a fixed length, so that each Word appearing in the text content of the question, the text content of the answer and the theme is replaced by a K-dimensional vector; the vector sequence TQ of the text content of the question consists of N vectors, denoted TQ ═ x₁,x₂,...,x_N},x_p∈R^KP ═ 1,2,. cndot, N; a vector sequence TA of text content of an answer consists of M sentences each consisting of D vectors, then TA ═ s₁,s₂,...,s_M}，s_m＝{y_m1,y_m2,...,y_mD},y_md∈R^KM1, 2, M, D1, 2, D; the topic TC consists of C vectors, denoted TC ═ z₁,z₂,...,z_C},z_q∈R^K,q＝1,2,...,C；

Long-short term memory network layer: modeling the vector sequences in question and answer using two long-short term memory networks LSTM _ Q and LSTM _ A for the vector sequence TQ of the question and one answer vector sequence TA, respectively, and using the last cell vector in LSTM _ Q for initialization of the cell vector in LSTM _ A; then obtaining vector sequences of the questions and answers after passing through the long-term and short-term memory network, wherein the vector sequences are MQ and MA respectively, and each vector in the vector sequences MQ and MA contains context semantic information;

attention layer: interacting the vector sequence MQ of the question with the vector sequence MA of an answer by using a sentence-level attention mechanism to obtain the vector FQ of the question₁And a vector of answers FA₁(ii) a Or changing the theme from the vector sequence TC into a vector FC by using a word-level attention mechanism; then fusing the vector FC of the subject, the vector sequence MQ of the question and the vector sequence MA of an answer to finally obtain the vector FQ of the question₂And a vector of answers FA₂；

Evaluation layer: and calculating answer deep language matching scores by combining the vectors of the questions and the vectors of one answer, and taking time effects into consideration by combining the time stamps of the answers so as to obtain the ranking scores.

2. The method for sorting answers of a community question-answering platform according to claim 1, wherein a step of preprocessing crawled data is further included before constructing the enhanced attention mechanism recurrent neural network model, and the step includes:

removing questions and answers with the number of words smaller than the set number in the text content;

removing questions and answers with praise number exceeding a preset range in a wave band within a period of time;

the remaining questions and the text content of the answers are participled, and the data for each question becomes: the word segmentation result of the text content of the question, the subject to which the question belongs, the word segmentation result of the content of each answer, the timestamp of each answer, and the number of praise of each answer; the praise number of each answer is used for verifying the quality of the model, and the rest information is used as the input of the model for later evaluation of the quality of each answer.

3. The method as claimed in claim 1, wherein the method comprises the steps of,

vector sequence for problem TQ ═ x₁,x₂,...,x_NN updates the cell vector sequence c ═ c { c } in LSTM _ Q over time t ═ 1,2₁,c₂,...,c_NH and hidden vector sequence h ═ h₁,h₂,...,h_NThe calculation method is as follows:

i_t＝σ(W_xix_t+W_hih_t-1+b_i)；

f_t＝σ(W_xfx_t+W_hfh_t-1+b_f)；

c_t＝f_t·c_t-1+i_t·τ(W_xcx_t+W_hch_t-1+b_c)；

o_t＝σ(W_xox_t+W_hoh_t-1+b_o)；

h_t＝o_t·τ(c_t)；

where the value of the time corresponds one-to-one to the sequence number of the vector in the vector sequence TQ of the problem, i_t,f_t,o_tAn input gate, a forgetting gate and an output gate respectively; cell vector c₀And hidden vector h₀Initializing to a zero vector; sigma (·), tau (·) is sigmoid (·), tanh (·) nonlinear activation function, respectively; is an element multiply operation; { W_xi,W_hi,W_xf,W_hf,W_xc,W_hc,W_xo,W_hoIs the parameter matrix to be optimized in the model, { b_i,b_f,b_c,b_oIs the parameter vector to be optimized in the model;

vector sequence s for each sentence in the answer_m＝{y_m1,y_m2,...,y_mDD updates the cell vector sequence c ' ═ { c ' in LSTM _ a over time t ' ═ 1,2.₁,c'₂,...,c'_DAnd obtaining an implicit vector sequence h'_m＝{h'_m1,h'_m2,...,h'_mD}; let cell vector c'₀＝c_NIs implicit to vector h'_m0Initialized to a zero vector.

4. The method as claimed in claim 1, wherein the method comprises the steps of,

the process of interacting the vector sequence MQ of the question with the vector sequence MA of one answer with the sentence-level attention mechanism to obtain the vector of the question and the vector of one answer is as follows:

for the vector sequence MQ of the problem, it is expressed as a K-dimensional vector FQ using an average pooling operation₁：

Wherein MQ_pA vector representing the p-th word in the vector sequence MQ;

for the vector sequence of answers MA, each of the vector sequence of answers MA is first processed using an averaging pooling operationThe sentences are represented by a vector of K dimension to obtain several semantic representations r'_mM1, 2., M; then, a distance function is used to calculate their respective attention scores α'_mM1, 2, M, and then using a weighted average to obtain a vector representation FA of the answer₁：

Wherein, MA_mdA vector representing the d word in the m sentence in the vector sequence MA, wherein f (-) represents a cosine similarity function;

changing the theme from a vector sequence TC into a vector FC by using a word-level attention mechanism; then fusing the vector FC of the subject, the vector sequence MQ of the question and the vector sequence MA of an answer to finally obtain the vector of the question and the vector of the answer as follows:

subject TC ═ z for a given problem₁,z₂,...,z_CIt is transformed into a fixed-length vector FC using an average pooling operation:

for a vector sequence MQ of the question, which is composed of a series of semantically represented vectors, the distance between the vector of each word in the question and the vector FC of the topic is calculated using the transformation matrix W, after which the attention score β for the p-th word in the question is calculated using the softmax operation_p(ii) a Resulting vector FQ of the problem₂Expressed as:

wherein MQ_p、MQ_iCorresponding representation of the p, i-th vector, h in the vector sequence MQ(a,b)＝a^TWb is used to calculate the distance of different space vectors, where (a, b) corresponds to the above equation (FC, MQ)_p)、(FC,MQ_i)；

For the vector sequence MA of answers, an attention score β is calculated for the vector of the d word in the m sentence of the answer_mdAnd obtaining a vector representation r of the mth sentence_m(ii) a Then the attention score alpha of the mth sentence in the attention mechanism of the word level is calculated_mVector of sum answers FA₂：

Wherein, MA_md、MA_mlThe corresponding vector represents the d, l word in the m sentence in the vector sequence MA.

5. The method as claimed in claim 1, wherein the method comprises the steps of,

computing answer deep language matching scores combining a question vector and an answer vector

The formula is as follows:

y is 1 or 2;

representing a splicing operation; { W₁,W₂Is the parameter matrix to be optimized in the model, { b₁,b₂The parameter vector to be optimized in the model is used as the parameter vector;

combining with the time stamp T of the answer, the time effect is also taken into consideration to obtain the ranking score

The formula is as follows:

6. The method for sorting answers of a community question-answering platform according to claims 1,2, 3, 4 or 5, wherein a training process of the enhanced attention mechanism recurrent neural network model by combining the evaluation result with a preset time-sensitive objective function and using a question-dependent pair-wise training strategy is as follows:

A random gradient descent algorithm is used to minimize the time-sensitive objective function:

L＝max(0,m+S(Q,A^-)-S(Q,A⁺))；

wherein, S (Q, A)⁺) And S (Q, A)^-) Corresponding two answers A in the presentation question Q⁺And A^-The ranking score of (1).

7. The method as claimed in claim 1,2, 3, 4 or 5, wherein a new question X is used, the topic B of the new question X, and the text content a ═ a of a series of answers₁,A₂,...,A_GAnd a timestamp T ═ T for each answer₁,T₂,...,T_GConstruction of a series of examples (X, B, A)_g,T_g) G is more than or equal to 1 and less than or equal to G; these examples are to be construed as beingSequentially inputting the data into a trained enhanced attention mechanism cyclic neural network model so as to obtain a series of sequencing scores

And sorting the corresponding answers in a front-to-back mode according to the size of the sorting score.