CN112069399A - Personalized search system based on interactive matching - Google Patents

Personalized search system based on interactive matching

Info

Publication number: CN112069399A
Application number: CN202010861245.9A
Authority: CN (China)
Prior art keywords: matching, vector, document, user, personalized
Legal status: Granted (the legal status is an assumption and is not a legal conclusion; Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed)
Other languages: Chinese (zh)
Other versions: CN112069399B (en)
Inventors: 窦志成, 邴庆禹
Current assignee: Renmin University of China (the listed assignees may be inaccurate; Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list)
Original assignee: Renmin University of China
Application filed by Renmin University of China; priority to CN202010861245.9A; application granted and published as CN112069399B
Current status: Active; anticipated expiration

Classifications

    • G06F 16/9535: Information retrieval; retrieval from the web; querying, e.g. by the use of web search engines; search customisation based on user profiles and personalisation
    • G06F 40/284: Handling natural language data; natural language analysis; lexical analysis, e.g. tokenisation or collocates
    • G06N 3/045: Neural networks; architecture, e.g. interconnection topology; combinations of networks
    • G06N 3/08: Neural networks; learning methods
    • Y02D 10/00: Energy efficient computing, e.g. low power processors, power management or thermal management


Abstract

The invention realizes a personalized search system based on interactive matching, in the field of artificial intelligence, comprising a system input module, a personalized search module based on interactive matching, and an output module. The personalized search module operates in four steps: bottom-layer matching modeling of the user's search history, calculation of attention weights, generation of the user-interest matching vector, and personalized re-ranking. The model interacts each historical query of the user with the candidate document at the word level, uses an attention mechanism to reduce the influence of irrelevant information in the search history, and fuses the weighted matching signals with a convolutional neural network, thereby generating a final interest matching vector for each document and obtaining a more accurate interest matching score. This addresses a weakness of existing representation-based methods, in which the quality of the ranking result depends on the vector-construction model and the process of constructing the vector may omit useful information.

Description

Personalized search system based on interactive matching
Technical Field
The invention relates to the field of artificial intelligence, and in particular to a personalized search system based on interactive matching.
Background
Personalizing search results with the user's historical information has proven effective in improving the quality of search rankings. A personalized search algorithm first models the user's interests from information such as the user's historical behavior; when computing the matching score it considers not only the relevance between the query and the document but also the degree of match between the document and the user's interests, thereby customizing the result list to the needs of different users. A user interest model can be built from many information sources, such as the user's location, retrieval patterns, browsing history, and search history. In recent years, researchers have introduced deep learning into personalized ranking models, strengthening the models' semantic understanding of text and achieving good results on personalized re-ranking of search results. Deep ranking algorithms can be divided into representation-based matching and interaction-based matching. In a representation-based ranking algorithm, semantic vector representations of the query and the document are learned separately and then matched against each other; the core of such an algorithm is learning the semantic vector representations. An interaction-based algorithm instead lets the query and the document interact first, at the finer-grained word level, capturing more complete matching signals before fusing them into a matching score; its core is how to process the matching signals and fuse them into a score.
Almost all existing personalized search algorithms first compute an interest representation vector for the user, in various ways, from the user's historical behavior, and then let it interact with the representation vector of each candidate document to obtain a personalized matching score; this is an approach based on representation matching. Methods of this type obtain the matching signal between the document and the user's interests at the granularity of the whole document: the candidate document and the user's interests are each converted into a representation vector, the two vectors are then matched, and the emphasis falls on constructing the representation layer. Under such vector-representation-based methods, the quality of the ranking result depends on the quality of the vector-construction model, and the construction process may omit useful information, such as the word-level text and interaction information between queries and documents, thereby degrading the personalized ranking result.
Disclosure of Invention
Therefore, the invention provides a personalized search system based on interactive matching, comprising an input module, a personalized search module based on interactive matching, and an output module.
The input module reads the user's query history and the candidate documents, standardizes their format, and feeds them into the personalized search module based on interactive matching.
The operation of the personalized search module based on interactive matching is divided into four steps:
Step one: bottom-layer matching modeling of the user's search history. A bottom-layer matching model is built from the user's historical search information, and each historical query is interacted with the candidate document word by word to obtain detailed bottom-layer matching signals.
Step two: calculation of the attention weights. An attention mechanism is introduced, and the matching signals are weighted according to how much the different query records in the user's search history contribute to the current query.
Step three: generation of the user-interest matching vector. A convolutional neural network extracts features from the weighted matching signals to generate the final matching vector between the document and the user's interests.
Step four: personalized re-ranking. A personalized score for each candidate document is computed from the user-interest matching vector obtained in step three, a relevance score is computed from the click feature vector, and the sum of the two serves as the final document matching score for personalized re-ranking.
The output module outputs the document matching scores and the personalized re-ranking result.
The bottom-layer matching modeling step of the user's search history is implemented as follows. Let the user's historical query list be {q_1, q_2, q_3, …, q_n} (where n ≥ 3 and n is an integer) and let the current candidate document be d. For each historical query-candidate document pair <q_i, d>, both texts are first mapped word by word into word vectors, represented with a word2vec model; after processing, q_i is expressed as a group of word vectors {qw_1, qw_2, qw_3, …, qw_x} and d as {dw_1, dw_2, dw_3, …, dw_y}. Every vector in one group is interacted pairwise with every vector in the other to obtain the word matching matrix T of <q_i, d>, each element of which is:

T_{i,j} = cos(qw_i, dw_j)

where T_{i,j} is the element in row i and column j of T, qw_i is the word vector of the i-th word of the historical query, and dw_j is the word vector of the j-th word of the candidate document (1 ≤ i ≤ x, 1 ≤ j ≤ y; i, j, x, y are integers); the matching value of the two words is computed with the cosine function. As in the K-NRM model, K RBF kernels are applied to each row of the matching matrix to obtain a K-dimensional feature vector

K(T_i) = {K_1(T_i), K_2(T_i), …, K_K(T_i)}

where each RBF kernel is given by:

K_k(T_i) = Σ_{j=1}^{y} exp( -(T_{i,j} - μ_k)^2 / (2σ_k^2) )

Here K_k(T_i) is the value produced by the k-th RBF kernel for the i-th row of T, and its range is between 0 and y. μ_k and σ_k are hyperparameters, with the μ values taken uniformly from -1 to 1. The logarithms of the feature vectors of all rows of the matching matrix are then summed as the final bottom-layer matching result of historical query q_i with the candidate document:

v_i = Σ_{r=1}^{x} log K(T_r)

The bottom-layer matching vectors computed from the user's historical search information are denoted {v_1, v_2, v_3, …, v_n}; the same procedure yields the fine-grained matching vector v of the current query with the candidate document.
The attention-weight calculation step is implemented as follows. Using the fine-grained matching vector v of the current query q and candidate document d, an attention weight is computed for the bottom-layer matching vector of each historical query record:

e_i = g(v, v_i)

α_i = exp(e_i) / Σ_{j=1}^{n} exp(e_j)

where g is a multilayer perceptron with tanh as its activation function and α_i is the weight computed by the attention layer for bottom-layer matching vector v_i. The weighted bottom-layer matching vector is:

V_i = α_i · v_i

The weighted fine-grained matching vectors corresponding to the user's historical queries are {V_1, V_2, V_3, …, V_n}.
The user-interest matching vector generation step is implemented as follows. The weighted fine-grained matching vectors {V_1, V_2, V_3, …, V_n} are concatenated column-wise into a matching feature matrix M = [V_1, V_2, V_3, …, V_n] ∈ R^{K×n}, and 100 convolution kernels are convolved with M to obtain a three-dimensional tensor A ∈ R^{100×(K-2)×(n-2)}, each element of which is:

A_{t,i,j} = ReLU( f_t ⊙ M_{i-1:i+1, j-1:j+1} + b_t )

where t is an integer from 1 to 100, b_t is the t-th element of the bias vector b ∈ R^{100}, f_t is the t-th 3×3 convolution kernel, M_{i-1:i+1, j-1:j+1} is the submatrix of M spanning rows i-1 to i+1 and columns j-1 to j+1, and ⊙ multiplies the elements at corresponding positions of the two matrices and sums all the products. The convolutional layer uses ReLU as its activation function. After the convolutional layer, the pooling layer applies max pooling over the second and third dimensions of the tensor A to obtain a 100-dimensional vector I, whose t-th element is:

I_t = max_{i,j} A_{t,i,j}

The output vector I is the final user-interest matching vector.
The convolution kernels are of size 3×3, so the search history of each user must contain at least 3 queries.
The personalized re-ranking step is implemented as follows. The matching score score(d|I) of the candidate document with the user's interests is obtained by feeding the interest matching vector I through a multilayer perceptron; the relevance score score(d|q) of the candidate document with the current query is computed by a multilayer perceptron from the click count, the original click position, and the click entropy. The interest matching score score(d|I) and the relevance score score(d|q) are added to give the final score of the candidate document, and the original document list is re-ranked by this score to produce the final personalized ranking.
In computing the relevance scores of candidate documents with respect to the current query, training uses the LambdaRank algorithm: the clicked document is taken as a relevant sample and the remaining documents as irrelevant samples, and a relevant document d_i and an irrelevant document d_j are selected to form document pairs over which the loss is computed. The loss function also incorporates how strongly swapping the order of a document pair affects the evaluation metric MAP, using this as the pair's weight; that is, pairs with a larger difference (a large change in MAP after swapping) receive a larger weight. The loss is the cross entropy between the actual and predicted probabilities multiplied by the change in the MAP evaluation metric:

L = Δ · ( -p̄_{ij} log p_{ij} - (1 - p̄_{ij}) log(1 - p_{ij}) )

where Δ is the change in the MAP metric after documents d_i and d_j exchange positions, p̄_{ij} is the actual probability that document d_i is more relevant than document d_j, and p_{ij} is the predicted probability, computed as:

p_{ij} = 1 / (1 + exp(-(score(d_i) - score(d_j))))
the technical effects to be realized by the invention are as follows:
(1) a model idea based on interactive matching is introduced, a text is not converted into a unique integral expression vector, and the historical query of a user is interacted with a candidate document at a word level to obtain a more accurate and complete matching signal.
(2) An attention mechanism is introduced, and corresponding matching signals are weighted according to the contribution degree of different historical queries to current matching, so that the influence of irrelevant information in search history is reduced.
(3) Feature extraction is carried out on the weighted matching signals by using a convolutional neural network, and a final interest matching vector of the document is generated, so that more accurate interest matching scores are obtained.
Drawings
FIG. 1 is a framework of a personalized search module based on interactive matching;
Detailed Description
The following is a preferred embodiment of the present invention, further described with reference to the accompanying drawings; the present invention is not, however, limited to this embodiment.
In order to achieve the above object, the present invention provides a personalized search system based on interactive matching.
The system comprises an input module, a personalized search module based on interactive matching, and an output module. The input module reads the user's query history and candidate documents, standardizes their format, and inputs them to the personalized search module based on interactive matching; the output module outputs the document matching scores and the personalized re-ranking result.
The personalized search module based on interactive matching processes the bottom-layer matching signals with a convolutional neural network to obtain the final interest matching result of each candidate document.
The personalized search module based on interactive matching considers the word-level matching signals between the historical queries in the user's behavior history and the candidate document. Let the user's historical query list be {q_1, q_2, q_3, …, q_n} and let d be the current candidate document. First, the user's search log is processed by an interaction-based K-NRM model to obtain, for each historical query q_i (1 ≤ i ≤ n), a fine-grained matching vector v_i with the candidate document d, as well as the fine-grained matching vector v of the current query q with d. Then, considering that the user's interests change dynamically and that individual queries can be somewhat incidental, different queries in the search history contribute differently to the current query. According to each historical query's contribution to the current query, a multilayer perceptron weights the matching vectors {v_1, v_2, v_3, …, v_n} produced by the K-NRM model, giving the weighted list {V_1, V_2, V_3, …, V_n}. A convolutional neural network then processes these vectors to derive the matching vector between the candidate document and the user's interests. Finally, the interest matching score and the relevance score of the current candidate document are computed from the interest matching vector and the click feature vector respectively, and added to give the final document matching score:

score(d) = score(d|I) + score(d|q)

where score(d|I) is the matching score of the current candidate document with the user's search interests and score(d|q) is the relevance score of the current candidate document with respect to the current query.
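As a concrete illustration, the additive score combination above can be sketched in Python. The linear scoring heads here are simplified stand-ins, assumed for illustration, for the multilayer perceptrons the text describes:

```python
import numpy as np

def final_score(I, click_features, w_interest, w_click):
    """Hedged sketch of score(d) = score(d|I) + score(d|q): two scoring
    heads (here plain linear layers, not the patent's MLPs) map the
    interest matching vector I and the click feature vector to scalar
    scores, which are simply added."""
    score_interest = float(w_interest @ I)          # stands in for score(d|I)
    score_relevance = float(w_click @ click_features)  # stands in for score(d|q)
    return score_interest + score_relevance

rng = np.random.default_rng(3)
I = rng.normal(size=100)               # user-interest matching vector (100-dim)
clicks = np.array([3.0, 1.0, 0.5])     # click count, original position, click entropy
s = final_score(I, clicks, rng.normal(size=100), rng.normal(size=3))
```

Candidate documents would then be sorted by `s` in descending order to produce the personalized ranking.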
The framework of the personalized search module based on interactive matching is shown in FIG. 1 and divides into the following four parts along the processing flow:
Step one: bottom-layer matching modeling of the user's search history. A bottom-layer matching model is built from the user's historical search information, and each historical query is interacted with the candidate document word by word to obtain detailed bottom-layer matching signals.
Step two: calculation of the attention weights. An attention mechanism is introduced, and the matching signals are weighted according to how much the different query records in the user's search history contribute to the current query.
Step three: generation of the user-interest matching vector. A convolutional neural network extracts features from the weighted matching signals to generate the final matching vector between the document and the user's interests.
Step four: personalized re-ranking. A personalized score for each candidate document is computed from the interest matching vector, a relevance score is computed from the click feature vector, and the sum of the two serves as the final document matching score for personalized re-ranking.
Bottom-layer matching modeling step of the user's search history:
The user's search history provides rich information for capturing the user's search interests. Most existing algorithms model the user's interests from historical behavior to obtain an interest vector representing the user's search preferences, which then interacts with the document vector. Here a K-NRM framework is adopted: for each user U, a bottom-layer matching model is built from U's historical search information, and each historical query in the search history is interactively matched with the candidate document at the bottom layer.
Let the user's historical query list be {q_1, q_2, q_3, …, q_n} and let d be the current candidate document. For each historical query-candidate document pair <q_i, d>, both texts are first mapped word by word into word vectors, represented with a word2vec model. After processing, q_i is expressed as a group of word vectors {qw_1, qw_2, qw_3, …, qw_x} and d as {dw_1, dw_2, dw_3, …, dw_y}. Every vector in one group is interacted pairwise with every vector in the other to obtain the matching matrix T of <q_i, d>, each element of which is:

T_{i,j} = cos(qw_i, dw_j)

where T_{i,j} is the element in row i and column j of T, qw_i is the word vector of the i-th word of the historical query, and dw_j is the word vector of the j-th word of the candidate document (1 ≤ i ≤ x, 1 ≤ j ≤ y); the matching value of the two words is computed with the cosine function.
As the above shows, row i of the matching matrix represents the matching signal between the i-th word of the historical query and the candidate document. As in the K-NRM model, K RBF kernels are applied to each row of the matching matrix to obtain a K-dimensional feature vector

K(T_i) = {K_1(T_i), K_2(T_i), …, K_K(T_i)}

where each RBF kernel is given by:

K_k(T_i) = Σ_{j=1}^{y} exp( -(T_{i,j} - μ_k)^2 / (2σ_k^2) )

Here K_k(T_i) is the value produced by the k-th RBF kernel for the i-th row of T; its range is between 0 and y, and μ_k and σ_k are hyperparameters. In the K-NRM model used here, the cosine similarity of two vectors lies between -1 and 1, so the μ values are taken uniformly from -1 to 1. The logarithms of the feature vectors of all rows of the matching matrix are then summed as the final bottom-layer matching result of historical query q_i with the candidate document:

v_i = Σ_{r=1}^{x} log K(T_r)

Each historical query q_i thus has a K-dimensional matching vector with the current candidate document: the fine-grained matching vector v_i of q_i and d. The same procedure computes the fine-grained matching vector v of the current query q and candidate document d. At this point we have obtained the bottom-layer matching vectors computed from the user's historical search information, denoted {v_1, v_2, v_3, …, v_n}.
Calculation step of the attention weights:
Because the user's search interests and search patterns change dynamically, and individual queries can be somewhat incidental, the different query records in a user's search history influence the current query to different degrees. Based on this observation, this step introduces an attention mechanism and further refines each bottom-layer matching vector according to how much each historical query contributes to the current matching.
The previous step produced the bottom-layer matching vectors {v_1, v_2, v_3, …, v_n} computed from the user's historical search information. In this step, an attention weight is computed for the bottom-layer matching vector of each historical query record, based on the fine-grained matching vector v of the current query q and candidate document d. The attention layer takes as input the vectors {v_1, v_2, v_3, …, v_n} and v, and computes:

e_i = g(v, v_i)

α_i = exp(e_i) / Σ_{j=1}^{n} exp(e_j)

where g(·) is a multilayer perceptron with tanh as its activation function and α_i is the weight computed by the attention layer for bottom-layer matching vector v_i. The weighted bottom-layer matching vector is:

V_i = α_i · v_i

According to how much information each historical query contributes to the current matching, the attention layer pays more attention to the bottom-layer matching vectors of the more contributive historical queries, yielding bottom-layer matching information weighted by contribution. At this point we have the weighted fine-grained matching vectors {V_1, V_2, V_3, …, V_n} corresponding to the user's historical queries.
Generation of the user-interest matching vector:
The weighted fine-grained matching vectors {V_1, V_2, V_3, …, V_n} are concatenated column-wise into a matching feature matrix M = [V_1, V_2, V_3, …, V_n] ∈ R^{K×n}. The traditional approach applies max pooling or average pooling directly on the matching feature matrix to obtain the user-interest matching vector. However, since a user's search history may contain a large number of records, pooling directly over the matching feature matrix may omit useful information, such as the relationships between the bottom-layer matching vectors of adjacent historical queries.
To compensate for this deficiency, this step convolves M with 100 convolution kernels f_1, f_2, …, f_100 of size 3×3, obtaining a three-dimensional tensor A ∈ R^{100×(K-2)×(n-2)}, each element of which is:

A_{t,i,j} = ReLU( f_t ⊙ M_{i-1:i+1, j-1:j+1} + b_t )

where 1 ≤ t ≤ 100, b_t is the t-th element of the bias vector b ∈ R^{100}, f_t is the t-th 3×3 convolution kernel, M_{i-1:i+1, j-1:j+1} is the submatrix of M spanning rows i-1 to i+1 and columns j-1 to j+1, and ⊙ multiplies the elements at corresponding positions of the two matrices and sums all the products. Because the convolutional layer uses 3×3 kernels, the search history of each user must contain at least 3 query records. In other words, the model does not support users with fewer than three historical queries: too few records cannot provide enough information to extract the user's search interests, and in that case personalized re-ranking would interfere with accurate scoring of the documents. The convolutional layer uses the ReLU activation function, which is cheaper to compute than activations such as sigmoid and avoids the vanishing-gradient problem.
After the convolutional layer, the pooling layer applies max pooling over the second and third dimensions of the three-dimensional tensor A to obtain a 100-dimensional vector I, whose t-th element is:

I_t = max_{i,j} A_{t,i,j}

The pooling layer performs further feature extraction on the matching feature tensor A, and the output vector I is the final user-interest matching vector.
Personalized reordering step
Since the score of a candidate document consists of two parts: a matching score of the candidate documents to the user interests and a relevance score to the current query. The matching score (d | I) of the candidate document and the user interest is obtained by training an interest matching vector I through a multi-layer perceptron; and the relevance score (d | q) of the candidate document and the current query is obtained through calculation of a multi-layer perception computer according to three click features, namely the click times, the original click position and the click entropy. And adding the interest matching score (d | I) and the correlation score (d | q) to obtain a final score of the candidate document, and reordering the original document list according to the score to obtain a final personalized sorting result.
The Lambdannk algorithm is selected for training, the clicked document is used as a related document sample, other documents are used as unrelated samples, and a related document d is selectediAnd an irrelevant document djDocument pairs are constructed to calculate losses. The loss function is obtained by multiplying the cross entropy between the actual probability and the prediction probability by the change value of the MAP evaluation index, and the calculation formula is as follows:
Figure BDA0002648218730000092
where Δ is the change value of the MAP evaluation index; P̄_ij denotes the actual probability that document d_i is more relevant than document d_j, and p_ij its predicted probability; P̄_ji denotes the actual probability that document d_j is more relevant than document d_i, and p_ji its predicted probability. The predicted probability p_ij is calculated by the following formula:
p_ij = 1 / (1 + exp(−(s_i − s_j)))

where s_i and s_j are the final scores of documents d_i and d_j.
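The pairwise loss described above can be sketched as follows. Here s_i and s_j stand for the model's final scores of d_i and d_j, and the ΔMAP value is an illustrative number; in training it would be recomputed for each document pair:

```python
import math

def pair_loss(s_i, s_j, delta_map, p_bar_ij=1.0):
    """Pairwise loss: cross entropy between the target probability P_bar_ij and the
    predicted probability p_ij, weighted by delta_map (the change in MAP when
    d_i and d_j swap positions)."""
    p_ij = 1.0 / (1.0 + math.exp(-(s_i - s_j)))  # predicted prob. that d_i ranks above d_j
    p_bar_ji = 1.0 - p_bar_ij
    ce = -p_bar_ij * math.log(p_ij) - p_bar_ji * math.log(1.0 - p_ij)
    return delta_map * ce

# d_i is clicked (relevant), d_j is not, so the target probability P_bar_ij = 1
loss = pair_loss(s_i=2.0, s_j=0.5, delta_map=0.12)
```

Note how the ΔMAP factor realises the weighting described in claim 7: the larger the MAP change caused by swapping the pair, the larger the gradient contributed by that pair.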
Finally, the resulting personalized ranking is passed to the output module for external output.

Claims (7)

1. An interactive matching-based personalized search system, characterized in that: the system comprises an input module, a personalized search module based on interactive matching, and an output module;
the input module is used for reading the user's query history and the candidate documents, standardizing the document formats, and inputting them into the personalized search module based on interactive matching;
the operation process of the personalized search module based on interactive matching is divided into four steps:
step one: a bottom-layer matching modeling step for the user search history, namely establishing a bottom-layer matching model using the user's historical search information, and interacting the user's historical queries with the candidate documents word by word to obtain detailed bottom-layer matching signals;
step two: an attention weight calculation step, namely introducing an attention mechanism and weighting the matching signals of the different query records in the user search history according to their contribution to the current query;
step three: a user interest matching vector generation step, namely extracting features from the weighted matching signals with a convolutional neural network to generate the final matching vector of the document and the user interest;
step four: a personalized reordering step, namely calculating the personalized score of each candidate document from the user interest matching vector obtained in the user interest matching vector generation step, calculating its relevance score from the click feature vector, and performing personalized reordering with the sum of the two as the final document matching score;
and the output module outputs the document matching scores and the personalized reordering result.
2. The personalized search system based on interactive matching as claimed in claim 1, wherein: the bottom-layer matching modeling step of the user search history is implemented as follows: define the user's historical query list as {q_1, q_2, q_3, …, q_n}, where n is an integer with n ≥ 3, and let the current candidate document be d; for each historical query–candidate document pair <q_i, d>, first map both into word vectors word by word, using the word2vec model as the word-vector representation; after processing, q_i is expressed as a group of word vectors {qw_1, qw_2, qw_3, …, qw_x} and d as {dw_1, dw_2, dw_3, …, dw_y}; the two groups of word vectors interact pairwise to obtain the word matching matrix T of <q_i, d>, in which each element is:
Ti,j=cos(qwi,dwj)
where T_{i,j} is the element in the i-th row and j-th column of matrix T, qw_i is the word vector of the i-th word in the historical query, and dw_j is the word vector of the j-th word in the candidate document, with 1 ≤ i ≤ x, 1 ≤ j ≤ y and i, j, x, y integers; the matching values are calculated with the cosine function; following the K-NRM model, K RBF kernels are applied to each row of the matching matrix to obtain a K-dimensional feature vector
K(T_i) = [K_1(T_i), K_2(T_i), …, K_K(T_i)]
The corresponding formula of the RBF kernel is as follows:
K_k(T_i) = Σ_{j=1}^{y} exp( −(T_{i,j} − μ_k)² / (2σ_k²) )
where K_k(T_i) is the value of the i-th row of the matching matrix T after processing by the k-th RBF kernel, ranging between 0 and y; μ_k and σ_k are hyper-parameters, with the μ_k taken uniformly from −1 to 1; the logarithms of the feature vectors corresponding to the rows of the matching matrix are then taken and summed as the final bottom-layer matching result of the historical query q_i and the candidate document:
v_i = Σ_{r=1}^{x} log K(T_r)
The bottom-layer matching vectors calculated from the user's historical search information are denoted {v_1, v_2, v_3, …, v_n}, and v denotes the fine-grained matching vector of the current query and the candidate document.
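The claim's cosine word-matching matrix and RBF kernel pooling can be sketched for one historical query as follows. For illustration this assumes K = 11 kernels with means μ_k spaced evenly on [−1, 1], a single σ = 0.1 shared by all kernels, and random vectors in place of word2vec embeddings:

```python
import numpy as np

def knrm_match(query_vecs, doc_vecs, mus, sigma=0.1):
    """Cosine word-by-word matching matrix T, RBF kernel pooling per query word,
    then a log-sum over query words, giving a K-dimensional matching vector."""
    q = query_vecs / np.linalg.norm(query_vecs, axis=1, keepdims=True)
    d = doc_vecs / np.linalg.norm(doc_vecs, axis=1, keepdims=True)
    T = q @ d.T                                          # T[i, j] = cos(qw_i, dw_j)
    # K_k(T_i) = sum_j exp(-(T_ij - mu_k)^2 / (2 sigma^2))
    Kmat = np.exp(-((T[:, :, None] - mus[None, None, :]) ** 2)
                  / (2 * sigma ** 2)).sum(axis=1)        # shape (x, K)
    return np.log(np.clip(Kmat, 1e-10, None)).sum(axis=0)  # v_i, one value per kernel

rng = np.random.default_rng(2)
mus = np.linspace(-1.0, 1.0, 11)                         # kernel means, uniform on [-1, 1]
v_i = knrm_match(rng.standard_normal((4, 50)),           # 4 query words, 50-dim vectors
                 rng.standard_normal((20, 50)), mus)     # 20 document words
```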
3. The personalized search system based on interactive matching as claimed in claim 2, wherein: the attention weight calculation step is implemented as follows: using the fine-grained matching vector v of the current query q and the candidate document d, calculate an attention weight value for the bottom-layer matching vector of each historical query record:
ei=g(v,vi)
α_i = exp(e_i) / Σ_{j=1}^{n} exp(e_j)
where g is a multilayer perceptron with tanh as the activation function, and α_i is the weight value calculated by the attention layer for the bottom-layer matching vector v_i; the weighted bottom-layer matching vector is:
V_i = α_i · v_i
The weighted fine-grained matching vectors corresponding to the user's historical queries are {V_1, V_2, V_3, …, V_n}.
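A sketch of this attention weighting. The claim does not specify how g combines v and v_i, so this illustration assumes concatenation before the tanh layer, with randomly initialised weights:

```python
import numpy as np

def attention_weights(v, hist_vs, w1, b1, w2):
    """e_i = g(v, v_i) with a tanh MLP g; alpha = softmax(e); V_i = alpha_i * v_i."""
    e = np.array([np.tanh(np.concatenate([v, vi]) @ w1 + b1) @ w2 for vi in hist_vs])
    alpha = np.exp(e - e.max())
    alpha /= alpha.sum()                      # softmax over the n history queries
    return alpha, alpha[:, None] * hist_vs    # weights alpha_i and weighted vectors V_i

rng = np.random.default_rng(3)
K, n = 11, 5
v = rng.standard_normal(K)                    # matching vector of the current query
hist = rng.standard_normal((n, K))            # v_1 ... v_n for the history queries
w1, b1, w2 = rng.standard_normal((2 * K, 8)), rng.standard_normal(8), rng.standard_normal(8)
alpha, V = attention_weights(v, hist, w1, b1, w2)
```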
4. The personalized search system based on interactive matching as claimed in claim 3, wherein: the user interest matching vector generation step is implemented as follows: the weighted fine-grained matching vectors {V_1, V_2, V_3, …, V_n} are concatenated column-wise into a matching feature matrix M, where M = [V_1, V_2, V_3, …, V_n] ∈ R^(K×n); 100 convolution kernels are applied to the matching feature matrix M to obtain a three-dimensional tensor A ∈ R^(100×(K−2)×(n−2)), each element of which is:
A_{t,i,j} = ReLU( f_t ⊗ M_{i−1:i+1, j−1:j+1} + b_t )
where t is an integer from 1 to 100; b_t is the t-th element of the bias vector b ∈ R^100; f_t is the t-th 3 × 3 convolution kernel; M_{i−1:i+1, j−1:j+1} is the sub-matrix of the matching feature matrix M from row i−1 to row i+1 and from column j−1 to column j+1; and ⊗ multiplies the elements at corresponding positions of the two matrices and sums all the products; the convolution layer uses the ReLU function as its activation function; after the convolution layer, the pooling layer applies max-pooling to the second and third dimensions of the three-dimensional tensor A to obtain a 100-dimensional vector I, where I_t is the t-th element of the vector I:
I_t = max_{1 ≤ i ≤ K−2, 1 ≤ j ≤ n−2} A_{t,i,j}
the output vector I is the final user interest matching vector.
5. The personalized search system based on interactive matching of claim 4, wherein: the convolution kernels are of size 3 × 3, and the search history of each user contains at least 3 historical query records.
6. The personalized search system based on interactive matching of claim 5, wherein: the personalized reordering step is implemented as follows: the matching score score(d|I) of the candidate document and the user interest is obtained by feeding the interest matching vector I through a multi-layer perceptron; the relevance score score(d|q) of the candidate document and the current query is calculated by a multi-layer perceptron from the click count, the original click position, and the click entropy; the interest matching score score(d|I) and the relevance score score(d|q) are added to obtain the final score of the candidate document, and the original document list is reordered by this score to obtain the final personalized ranking result.
7. The personalized search system based on interactive matching of claim 6, wherein: in calculating the relevance scores of the candidate documents and the current query, training is performed with the LambdaRank algorithm: the clicked document is used as the relevant document sample and the other documents as irrelevant samples; a relevant document d_i and an irrelevant document d_j form a document pair to calculate the loss function; the degree to which swapping the order of the document pair affects the MAP evaluation index is also introduced into the loss function as a weight, i.e. the larger the change in MAP after swapping, the larger the difference between the documents and the larger the weight given to the swap; the loss function is the cross entropy between the actual probability and the predicted probability multiplied by the change value of the MAP evaluation index:
L = Δ · ( −P̄_ij · log p_ij − P̄_ji · log p_ji )

p_ij = 1 / (1 + exp(−(s_i − s_j)))
where Δ is the change value of the MAP evaluation index after documents d_i and d_j swap positions, P̄_ij denotes the actual probability that document d_i is more relevant than document d_j, and p_ij denotes the predicted probability.
CN202010861245.9A 2020-08-25 2020-08-25 Personalized search system based on interaction matching Active CN112069399B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010861245.9A CN112069399B (en) 2020-08-25 2020-08-25 Personalized search system based on interaction matching


Publications (2)

Publication Number Publication Date
CN112069399A true CN112069399A (en) 2020-12-11
CN112069399B CN112069399B (en) 2023-06-02

Family

ID=73658899

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010861245.9A Active CN112069399B (en) 2020-08-25 2020-08-25 Personalized search system based on interaction matching

Country Status (1)

Country Link
CN (1) CN112069399B (en)


Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107291871A (en) * 2017-06-15 2017-10-24 北京百度网讯科技有限公司 Matching degree appraisal procedure, equipment and the medium of many domain informations based on artificial intelligence
CN107957993A (en) * 2017-12-13 2018-04-24 北京邮电大学 The computational methods and device of english sentence similarity
US20180349477A1 (en) * 2017-06-06 2018-12-06 Facebook, Inc. Tensor-Based Deep Relevance Model for Search on Online Social Networks
US20190114511A1 (en) * 2017-10-16 2019-04-18 Illumina, Inc. Deep Learning-Based Techniques for Training Deep Convolutional Neural Networks
CN111125538A (en) * 2019-12-31 2020-05-08 中国人民大学 Searching method for enhancing personalized retrieval effect by using entity information
CN111177357A (en) * 2019-12-31 2020-05-19 中国人民大学 Memory neural network-based conversational information retrieval method
CN111310023A (en) * 2020-01-15 2020-06-19 中国人民大学 Personalized search method and system based on memory network


Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Chenyan Xiong et al.: "End-to-End Neural Ad-hoc Ranking with Kernel Pooling", Research and Development in Information Retrieval *
Zhou Yujia et al.: "Dynamic Personalized Search Algorithm Based on Recurrent Neural Network and Attention Mechanism", Chinese Journal of Computers *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113987155A (en) * 2021-11-25 2022-01-28 中国人民大学 Session type retrieval method integrating knowledge graph and large-scale user logs
CN113987155B (en) * 2021-11-25 2024-03-26 中国人民大学 Conversational retrieval method integrating knowledge graph and large-scale user log
CN114357231A (en) * 2022-03-09 2022-04-15 城云科技(中国)有限公司 Text-based image retrieval method and device and readable storage medium
CN114357231B (en) * 2022-03-09 2022-06-28 城云科技(中国)有限公司 Text-based image retrieval method and device and readable storage medium
CN117851444A (en) * 2024-03-07 2024-04-09 北京谷器数据科技有限公司 Advanced searching method based on semantic understanding
CN117851444B (en) * 2024-03-07 2024-06-04 北京谷器数据科技有限公司 Advanced searching method based on semantic understanding


Similar Documents

Publication Publication Date Title
CN109299396B (en) Convolutional neural network collaborative filtering recommendation method and system fusing attention model
CN110188358B (en) Training method and device for natural language processing model
CN110717098B (en) Meta-path-based context-aware user modeling method and sequence recommendation method
CN110929164A (en) Interest point recommendation method based on user dynamic preference and attention mechanism
CN111737578B (en) Recommendation method and system
CN110232122A (en) A kind of Chinese Question Classification method based on text error correction and neural network
CN112328900A (en) Deep learning recommendation method integrating scoring matrix and comment text
CN112800344B (en) Deep neural network-based movie recommendation method
CN112884551B (en) Commodity recommendation method based on neighbor users and comment information
CN112527993B (en) Cross-media hierarchical deep video question-answer reasoning framework
Zhou et al. Interpretable duplicate question detection models based on attention mechanism
CN110222838B (en) Document sorting method and device, electronic equipment and storage medium
CN112069399B (en) Personalized search system based on interaction matching
Sadr et al. Convolutional neural network equipped with attention mechanism and transfer learning for enhancing performance of sentiment analysis
CN111178986B (en) User-commodity preference prediction method and system
Dinov et al. Black box machine-learning methods: Neural networks and support vector machines
CN116976505A (en) Click rate prediction method of decoupling attention network based on information sharing
Jiang et al. An intelligent recommendation approach for online advertising based on hybrid deep neural network and parallel computing
CN115270752A (en) Template sentence evaluation method based on multilevel comparison learning
CN113342950B (en) Answer selection method and system based on semantic association
Kuo et al. An application of differential evolution algorithm-based restricted Boltzmann machine to recommendation systems
Alcin et al. OMP-ELM: orthogonal matching pursuit-based extreme learning machine for regression
CN116910375A (en) Cross-domain recommendation method and system based on user preference diversity
Ganguly et al. Evaluating CNN architectures using attention mechanisms: Convolutional Block Attention Module, Squeeze, and Excitation for image classification on CIFAR10 dataset
Krishnan et al. Optimization assisted convolutional neural network for sentiment analysis with weighted holoentropy-based features

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant