CN114048386A - Search method integrating personalization and diversification based on deep learning - Google Patents


Info

Publication number
CN114048386A
CN114048386A (application number CN202111391539.0A; granted publication CN114048386B)
Authority
CN
China
Prior art keywords
query
user
diversity
document
representing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202111391539.0A
Other languages
Chinese (zh)
Other versions
CN114048386B (en)
Inventor
Zhicheng Dou
Shuting Wang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Renmin University of China
Original Assignee
Renmin University of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Renmin University of China filed Critical Renmin University of China
Priority to CN202111391539.0A priority Critical patent/CN114048386B/en
Publication of CN114048386A publication Critical patent/CN114048386A/en
Application granted granted Critical
Publication of CN114048386B publication Critical patent/CN114048386B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9535 Search customisation based on user profiles and personalisation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00 Energy efficient computing, e.g. low power processors, power management or thermal management

Abstract

The invention realizes a deep-learning-based search method that integrates personalization and diversification, in the field of intelligent search. The method comprises three main steps: calculating the weights of general diversity and personalized diversity, dynamically modeling diversity, and aggregating the individual score results. Its core is a deep-learning-based search model fusing personalization and diversification (the DFSPD model), which divides diversity into personalized diversity and general diversity so that the influence of personalization and of diversification on the result is considered simultaneously, and results that better satisfy the user are provided under different conditions. The method is highly flexible: the user profile generator, the word embedding matrix and the personalized weighting process can all be replaced, and accuracy is greatly improved.

Description

Search method integrating personalization and diversification based on deep learning
Technical Field
The invention relates to the technical field of web search, and in particular to a search method fusing personalization and diversification based on deep learning.
Background
Ambiguity of the search statement is an important factor affecting user satisfaction with search engine results, and there are two main disambiguation approaches: personalized search and search result diversification. Personalized search typically builds models that learn the user's interests and preferences from the search history in order to clarify the user's specific intent in the current situation, thereby providing documents that match the user's interests. From early personalization based on manual features, such as click features and word-frequency features, to learning more complex and abstract relevance features with various deep learning models, such as RNNs, GANs and Transformers, personalized search has always been dedicated to mining more accurate, fine-grained user interest features. However, by neglecting the diversity of results and the inherent diversity of user interests, it suffers from redundant results and incomplete capture of user interests; moreover, when a user has no historical data, the model cannot perform well, i.e. the "cold start" problem. Search result diversification instead aims to return document sets that are sufficiently diverse, on the premise of content relevance, so as to cover the different intents of different users as far as possible and thereby achieve disambiguation. Early diversification models typically designed fixed formulas that computed diversity from content similarity or sub-topic coverage, and greedily selected locally optimal documents to compose the returned results. The development of machine learning has enabled diversification models to learn semantic features and compute diversity more reasonably. The limitation of diversification is that it cannot accurately extract the user's specific intent, so it cannot generate the optimal document ranking, and the search results contain many documents the current user is not interested in. An algorithm combining personalization and diversification can combine the advantages of both and compensate for their respective shortcomings, thereby providing users with more satisfactory search results. At present, research in this field is scarce: most models achieve personalized diversification by introducing personalization factors into traditional diversification algorithms, and recent work has realized a static fusion of personalization and diversification with a structured SVM model.
The existing work on fusing personalization and diversification mainly has the following problems:
(1) Under different conditions, the emphasis placed on personalization and on diversification should differ. For example, when a new user uses a search engine, or when the user searches for a question that differs greatly from the history, the model cannot learn useful information from the search history, so the emphasis should be placed on diversification to improve the probability of satisfying the user; conversely, the emphasis should be placed on personalization. Most existing methods either simply introduce a personalization factor of user information into a diversification model, or consider the fusion of personalization and diversification statically, without dynamically weighting the two.
(2) The user's interests are inherently diverse, so even when personalization is emphasized, the diversity of user intents should also be emphasized to prevent incomplete capture of the user's intent.
(3) Deep-learning-based models can learn feature information at the level of high-order semantics, whereas most existing fusion models are based on manual features or traditional machine learning methods; adopting deep learning therefore greatly improves accuracy.
Disclosure of Invention
Therefore, the invention first provides a search method fusing personalization and diversification based on deep learning. The user's historical query data, the current query and the candidate document set are input, and the method is realized through three steps. Step one: based on the HRNN model, learn the user's initial long-term and short-term profiles L_0 and S_0 from the long-term history H_l and the short-term history H_s, where H_l = {{q_1, D_1}, …, {q_n, D_n}}, n represents the total number of queries contained in the previous sessions, H_s = {{q_{n+1}, D_{n+1}}, …, {q_{n+m}, D_{n+m}}}, m is the number of queries already searched in the current session, q_n represents the nth query issued by the user, and D_n is the initial candidate document set returned by the ad-hoc search engine for query q_n; the matching degree between the query vector and the user profiles is then used to learn the weights, where div(d|q^v) represents the general diversity of document d, div(d|S) and div(d|L) represent the personalized diversity of the document, and r_S, r_L are used as the weights of personalized diversity to dynamically control its influence on the scoring result. Step two: construct an RRNN model; using a greedy algorithm, select the locally optimal document d* at each step based on the current comprehensive score score(d) of the documents in the remaining candidate set; then, according to the virtual sub-topic representation of that document, learn its influence on the sub-topic representations of the user's long-term and short-term profiles and of the query, model this influence with a reset-gate structure, and update q^v_{t-1}, S_{t-1}, L_{t-1} to q^v_t, S_t, L_t, where q^v_t, S_t, L_t respectively denote the vector representations of the query statement, the user's short-term profile and the user's long-term profile after forgetting the sub-topics covered by the t selected documents, used for the score calculation of the next document selection, and q^v_0, S_0, L_0 denote the initial query and user-profile vector representations. This finally realizes dynamic modeling of document diversity. Step three: aggregate the individual score results; the scores are the basis for re-ranking the candidate documents, thereby realizing a search-result re-ranking algorithm fusing personalization and diversification. The overall document scoring function is:
score(d) = P(d|q, D, S, u) = P(d|q^v, S, L) = φ(div(d|q^v), r_S·div(d|S), r_L·div(d|L), rel(d, q))
where div(d|q^v) represents the general diversity of document d, div(d|S) and div(d|L) represent the personalized diversity of the document, r_S and r_L are used as the weights of personalized diversity to dynamically control its influence on the score, rel(d, q) represents the relevance score of the query and the document computed from additional manual features, and φ is an aggregation function implemented with a multilayer perceptron.
The general-diversity weight and the personalized-diversity weight are calculated as follows. Word embeddings are learned using word2vec, and the vector representations q' and d' of the query and of documents are then obtained by a TF-IDF-weighted summation of their word embeddings; the user's initial long-term and short-term profiles L'_0, S'_0 are learned with the HRNN structure.
The HRNN structure first uses a first-layer RNN that takes all of the user's query records as input, treating the end of each session as a period node, to construct the user's interest representation in each session:
s^1_{m,n} = RNN^1(s^1_{m,n-1}, [q_{m,n}; d_{m,n}])
where subscripts m and n denote the nth query record in the mth session, superscript 1 denotes the first-layer RNN structure, and q_{m,n}, d_{m,n} denote the corresponding input query and the average of its relevant documents; the user's short-term profile can then be represented as:
S'_0 = s^1_{M,n_M}
where M denotes the current (Mth) session and n_M is the number of searches the user has performed in that session;
The long-term profile is constructed from the entire history excluding the current session record. First, each session representation of the user's history is taken as input, and a second-layer RNN structure is applied to learn the user's interest representation in each period:
s^2_m = RNN^2(s^2_{m-1}, s^1_{m,n_m})
where m denotes the mth session and n_m is the number of searches the user performed in that session. Then the similarity α_m between the current query q' and each period's interest representation s^2_m is used as the weight of that period's interest, and the weighted sum is taken as the user's long-term interest profile:
α'_m = sim(q', s^2_m)
α_m = exp(α'_m) / Σ_j exp(α'_j)
L'_0 = Σ_m α_m · s^2_m
A convolutional neural network conv is introduced to learn a sequence of virtual sub-topic representations from the original representation of the query:
q^v = [q_1, q_2, …, q_c] = conv(q')
where q_i denotes the ith sub-topic representation of the query and c denotes the number of sub-topics the query contains. Similar sub-topic representations d, L_0, S_0 of the document and of the user's long-term and short-term profiles can be obtained:
d = [d_1, d_2, …, d_c] = conv(d')
L_0 = [L_1, L_2, …, L_c] = conv(L'_0)
S_0 = [S_1, S_2, …, S_c] = conv(S'_0)
Furthermore, the KNRM model is introduced. Let t_q = [t_{q1}, …, t_{qn}] and t_d = [t_{d1}, …, t_{dm}] denote the word sequences of the query and of the document, where t_{qi} is the ith word-embedding vector of the query, t_{dj} is the jth word-embedding vector of the document, and n and m denote the numbers of words in the current query and document respectively. KNRM constructs a translation matrix M from the cosine similarity between each pair of query and document words:
M_ij = cos_similarity(t_{qi}, t_{dj})
KNRM then uses k Gaussian kernels to learn multi-dimensional features on the translation matrix, and computes the similarity of the query and the document with a multilayer perceptron; the specific calculation is:
F_K(t_q, t_d) = φ(f_1(M), …, f_o(M), …, f_k(M))
f_o(M) = Σ_i log Σ_j exp(−(M_ij − u_o)² / (2σ_o²))
where u_o and σ_o, the mean and standard deviation of the oth Gaussian kernel, are manually set hyper-parameters; the weights can then be expressed as:
r_S = F_K(q^v_0, S_0), r_L = F_K(q^v_0, L_0)
the reset gate is constructed in the following manner:
Figure BDA0003368696580000045
wherein the content of the first and second substances,
Figure BDA0003368696580000046
show that
Figure BDA0003368696580000047
Spliced together, WqIs a learnable mapping parameter, will
Figure BDA0003368696580000048
The mapping is a real number, σ (·) representing the activation function.
The activation function is a tanh function.
The update function of the reset gate is:
q^v_{t+1} = q^v_t ⊙ R^q_t + q^v_t
The total score is computed with a learnable matrix W_a that aggregates the individual diversity and relevance scores:
score(d) = φ(s) = tanh(s^T W_a)
where s denotes the vector formed by the individual sub-scores and φ denotes a multilayer perceptron with tanh as the activation function, with s = [div(d|q^v), r_S·div(d|S), r_L·div(d|L), rel(d, q)], where q^v, S, L are the latest q^v_t, S_t, L_t with the superscript t omitted; d, q^v, S, L all denote the corresponding virtual sub-topic sequences, and the scores of general diversity and personalized diversity are computed with the KNRM structure:
div(d|q^v) = F_K(d, q^v)
div(d|S) = F_K(d, S)
div(d|L) = F_K(d, L)
rel(d, q) = φ(f(d, q))
f(d, q) denotes the sequence of additional manual features of the document and the query; the individual manual features are fused with a multilayer perceptron φ.
The loss function of the method is constructed from the binary classification (cross-entropy) loss:
L = − Σ_{q∈Q} Σ_{o∈Q_o} [ y_o log p̂_o + (1 − y_o) log(1 − p̂_o) ]
where Q represents all queries in the training set and Q_o represents all sample pairs corresponding to query q; y_o is the sample label: when (C, d_1) can satisfy the user's demand better than (C, d_2), y_o = 1, and otherwise y_o = 0; p̂_o represents the model-predicted probability that (C, d_1) is better than (C, d_2). Since the sample pairs are written in the pairwise form (C, d_1, d_2), p̂_o is computed by the following formula:
p̂_o = p(d_1 ≻ d_2 | C) = sigmoid(score(d_1) − score(d_2))
Finally, the model is optimized step by step with the Adam optimizer.
The technical effects to be realized by the invention are as follows:
a search method is realized and has the following characteristics:
(1) and dynamically learning personalized and diversified weights according to the matching degree of the query proposed by the user and the user description portrait.
(2) Diversity is divided into personalized diversity, which captures the inherent diversity of user interests, and general diversity, which deals with cold start and the problem of useless historical information.
(3) Virtual subtopic representations of learnable users, query sentences and documents are constructed, so that the displayed subtopic representations which take the diversity among the documents into consideration are learnt, and the subtopic representations with wider coverage are learnt.
(4) And dynamically modeling the diversity of the documents by utilizing a neural network, and constructing personalized and diversified branches according to the long-term and short-term description portrait of the user and the query statement.
Drawings
FIG. 1 is a search model architecture based on deep learning that incorporates personalization and diversification;
FIG. 2 is a graphical representation of the reset gate structure.
Detailed Description
The following is a preferred embodiment of the present invention, described further with reference to the accompanying drawings; the present invention is not, however, limited to this embodiment.
The invention provides a search method fusing personalization and diversification based on deep learning. The method takes as its core a deep-learning-based search model fusing personalization and diversification, the DFSPD (Dynamic Fusion framework of Search result Personalization and Diversification) model, which divides diversity into personalized diversity and general diversity so as to consider the influence of personalization and of diversification on the result simultaneously. Specifically, the user's long-term and short-term profiles are learned based on the HRNN model, the personalized and diversified weights are learned from the matching degree between the query vector and the user profiles, and an RRNN model is constructed to dynamically model document diversity. Virtual sub-topics are introduced to take the sub-topic distribution into account explicitly, the similarity of sub-topic sequences is computed with a KNRM-like structure, and finally the individual sub-scores are fused with a multilayer perceptron.
Suppose that for a user u, the historical data contains a long-term history H_l and a short-term history H_s. The former comprises the interaction behavior in previous sessions, H_l = {{q_1, D_1}, …, {q_n, D_n}}, where n represents the total number of queries contained in the previous sessions; the latter contains the queries and candidate document lists in the current session, H_s = {{q_{n+1}, D_{n+1}}, …, {q_{n+m}, D_{n+m}}}, where m is the number of queries already searched in the current session. The DFSPD model first learns the user's initial long-term and short-term profiles L_0 and S_0 from H_l and H_s based on the HRNN model. When the user submits a query q, the ad-hoc search engine returns an initial candidate document set D = {d_1, d_2, …}, and our model greedily selects the locally optimal document d* at each step based on the document's current comprehensive score score(d). After the locally optimal document is selected, the model learns, from the virtual sub-topic representation of that document, its influence on the sub-topic representations of the user's long-term and short-term profiles and of the query, models this influence with the reset-gate structure we designed, and updates q^v_{t-1}, S_{t-1}, L_{t-1} to q^v_t, S_t, L_t. For convenience, we hereinafter sometimes abbreviate q^v_t, S_t, L_t as q^v, S, L, and we use q^v_0, S_0, L_0 to emphasize the initial query and user-profile vector representations, i.e. the state when no document has been selected. The overall document scoring function is:
score(d) = P(d|q, D, S, u) = P(d|q^v, S, L) = φ(div(d|q^v), r_S·div(d|S), r_L·div(d|L), rel(d, q))
where div(d|q^v) represents the general diversity of document d, and div(d|S), div(d|L) represent the personalized diversity of the document; we use r_S, r_L as the weights of personalized diversity to dynamically control its influence on the score. rel(d, q) represents the relevance score of the query and document computed from additional manual features. φ is an aggregation function, which we implement using multilayer perceptrons (MLPs). The model structure is shown in FIG. 1.
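For concreteness, the greedy re-ranking loop described above can be sketched as follows in Python; composite_score and update_states are hypothetical stand-ins for the scoring function score(d) and the RRNN state update defined below, not names used by the patent.

def rerank(candidates, qv, S, L, composite_score, update_states):
    # Greedy selection: repeatedly pick the locally optimal document d*
    # under the current comprehensive score, then let q^v, S, L forget
    # the sub-topics that d* already covers (the RRNN update).
    ranking, remaining = [], list(candidates)
    while remaining:
        best = max(remaining, key=lambda d: composite_score(d, qv, S, L))  # d*
        ranking.append(best)
        remaining.remove(best)
        qv, S, L = update_states(best, qv, S, L)  # q^v_{t-1},S_{t-1},L_{t-1} -> q^v_t,S_t,L_t
    return ranking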
The inputs to the model include the user's historical query data, the current query, and the set of candidate documents. The model mainly comprises the following steps:
1. Calculate the weights of general diversity and personalized diversity.
2. Dynamically model diversity.
3. Aggregate the individual score results.
Calculating weights for general diversity and personalized diversity:
As stated earlier, personalization and diversification should be given different weights in different situations; more specifically, these weights are learned from the similarity between the current query and the user profiles. Meanwhile, a convolutional neural network is used to learn virtual sub-topic representations of the user profiles, the query and the documents, so that the matching degree between them is considered at the sub-topic level.
For the modeling of documents and query statements, we use word2vec to learn word embeddings and then obtain their vector representations q' and d' by a TF-IDF-weighted summation of the words. We learn the user's initial long-term and short-term profiles L'_0, S'_0 using the HRNN structure.
The idea of the HRNN architecture is to use a hierarchical RNN model to build the user's long-term and short-term profiles from the search history. First, a first-layer RNN takes all of the user's query records as input, treating the end of each session as a period node, to construct the user's interest representation in each session:
s^1_{m,n} = RNN^1(s^1_{m,n-1}, [q_{m,n}; d_{m,n}])
where subscripts m and n denote the nth query record in the mth session, superscript 1 denotes the first-layer RNN structure, and q_{m,n}, d_{m,n} denote the corresponding input query and the average of its relevant documents. Let the current session be the Mth session, in which the user has already performed n_M searches; the user's short-term profile can then be represented as:
S'_0 = s^1_{M,n_M}
The long-term profile is constructed from the entire history (excluding the current session record). First, each session representation of the user's history is taken as input, and a second-layer RNN structure is applied to learn the user's interest representation in each period:
s^2_m = RNN^2(s^2_{m-1}, s^1_{m,n_m})
where we assume the user performed n_m searches in the mth session. Then the similarity α_m between the current query q' and each period's interest representation s^2_m is used as the weight of that period's interest, and the weighted sum is taken as the user's long-term interest profile:
α'_m = sim(q', s^2_m)
α_m = exp(α'_m) / Σ_j exp(α'_j)
L'_0 = Σ_m α_m · s^2_m
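The hierarchy can be sketched in PyTorch as below; the choice of GRU cells, the dot-product similarity for α_m and its softmax normalization are assumptions about details the text leaves open.

import torch
import torch.nn as nn
import torch.nn.functional as F

class HRNN(nn.Module):
    def __init__(self, dim):
        super().__init__()
        self.rnn1 = nn.GRU(2 * dim, dim, batch_first=True)  # over [q_{m,n}; d_{m,n}]
        self.rnn2 = nn.GRU(dim, dim, batch_first=True)      # over session-end states

    def forward(self, sessions, q_cur):
        # sessions: list of (n_m, 2*dim) tensors, one per session (current session last);
        # q_cur: (dim,) representation q' of the current query.
        ends = []
        for sess in sessions:
            out, _ = self.rnn1(sess.unsqueeze(0))   # s^1_{m,1}, ..., s^1_{m,n_m}
            ends.append(out[0, -1])                 # keep s^1_{m,n_m}
        S0 = ends[-1]                               # short-term profile S'_0
        s2, _ = self.rnn2(torch.stack(ends[:-1]).unsqueeze(0))
        s2 = s2[0]                                  # period interests s^2_m
        alpha = F.softmax(s2 @ q_cur, dim=0)        # alpha_m from sim(q', s^2_m)
        L0 = (alpha.unsqueeze(1) * s2).sum(dim=0)   # long-term profile L'_0
        return S0, L0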
To obtain fine-grained sub-topics with wider coverage, a convolutional neural network conv is introduced to learn a sequence of virtual sub-topic representations from the original representation of the query:
q^v = [q_1, q_2, …, q_c] = conv(q')
where q_i denotes the ith sub-topic representation of the query and c denotes the number of sub-topics the query contains. Similar sub-topic representations of the documents and of the user's long-term and short-term profiles can be obtained: d, L_0, S_0.
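One plausible realization of conv is sketched below; treating the input representation as a one-channel signal and using c output channels is an assumption, since the patent does not specify the convolution's layout.

import torch
import torch.nn as nn

class SubTopicConv(nn.Module):
    # Turns one representation vector (q', d', L'_0 or S'_0) into a
    # sequence of c virtual sub-topic vectors of the same dimension.
    def __init__(self, c=5, kernel=3):
        super().__init__()
        self.conv = nn.Conv1d(1, c, kernel_size=kernel, padding=kernel // 2)

    def forward(self, x):                 # x: (dim,)
        y = self.conv(x.view(1, 1, -1))   # (1, c, dim)
        return y[0]                       # [q_1, ..., q_c]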
To learn the finer-grained matching between q^v and S_0, L_0 from multiple dimensions, we introduce the KNRM model. KNRM is a common ad-hoc model, i.e. it ranks documents only by content similarity, without considering personalization or diversification. The input of the conventional KNRM is a query and a document, both represented by the corresponding sequences of word-embedding vectors. We use t_q = [t_{q1}, …, t_{qn}] and t_d = [t_{d1}, …, t_{dm}] to denote the word sequences of the query and the document, where t_{qi} is the ith word-embedding vector of the query, t_{dj} is the jth word-embedding vector of the document, and n and m denote the numbers of words in the current query and document respectively. KNRM constructs a translation matrix M from the cosine similarity between each pair of query and document words:
M_ij = cos_similarity(t_{qi}, t_{dj})
KNRM then uses k Gaussian kernels to learn multi-dimensional features on the translation matrix, and computes the similarity of the query and the document with a multilayer perceptron (MLP); the specific calculation is:
F_K(t_q, t_d) = φ(f_1(M), …, f_o(M), …, f_k(M))
f_o(M) = Σ_i log Σ_j exp(−(M_ij − u_o)² / (2σ_o²))
where u_o and σ_o, the mean and standard deviation of the oth Gaussian kernel, are manually set hyper-parameters. We apply the KNRM idea of similarity to compute the similarity of sub-topic sequences, obtaining the weights:
r_S = F_K(q^v_0, S_0), r_L = F_K(q^v_0, L_0)
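The kernel-pooling step follows directly from the formulas above; a sketch (the MLP φ over the k pooled features is left as a separate linear layer):

import torch

def knrm_features(tq, td, mus, sigmas):
    # tq: (n, dim) and td: (m, dim) embedding (or sub-topic) sequences.
    # Returns the k kernel features f_1(M), ..., f_k(M).
    tq = tq / tq.norm(dim=1, keepdim=True).clamp_min(1e-9)
    td = td / td.norm(dim=1, keepdim=True).clamp_min(1e-9)
    M = tq @ td.T                                      # cosine translation matrix
    feats = []
    for u, s in zip(mus, sigmas):                      # one Gaussian kernel each
        k = torch.exp(-((M - u) ** 2) / (2 * s ** 2))
        feats.append(torch.log(k.sum(dim=1).clamp_min(1e-10)).sum())
    return torch.stack(feats)

F_K is then a small MLP over these k features, and the same routine applied to sub-topic sequences yields r_S and r_L.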
Dynamic modeling of diversity:
When the current locally optimal document d* has been selected, we want the query and the user profiles to forget the aspects related to d*, so that the subsequent document selections emphasize the angles that d* has not yet covered. Based on this assumption, we construct a reset gate:
R^q_t = σ([q^v_t ; d*] W_q)
where [q^v_t ; d*] denotes splicing q^v_t and d* together, and W_q is a learnable mapping parameter that maps [q^v_t ; d*] to a real number. σ(·) denotes the activation function; the tanh function is used here.
The structure is shown graphically in FIG. 2.
Using the reset gate alone has several drawbacks: 1. There are usually dozens of candidate documents; the tanh function maps real numbers into (−1, 1), and multiplying many numbers whose absolute values are smaller than 1 yields a value with a very small absolute value, which may deprive the vector of expressive power in later steps and also cause problems such as vanishing gradients. 2. After browsing a certain number of documents, the user may forget what was seen before and may therefore want to click a document similar to previously browsed ones. Inspired by ResNet, our update function is as follows:
q^v_{t+1} = q^v_t ⊙ R^q_t + q^v_t
This structure not only alleviates the vanishing-gradient problem but also gives the model a chance to remember the earlier feature distribution. S_t and L_t are updated in the same way. We call this RNN structure, with the reset gate as its unit, RRNN.
Aggregating the individual score results:
MMR (Maximal Marginal Relevance) is a classical structure for balancing the relevance score and the diversity score. It combines the two scores linearly by introducing a balance factor λ. However, λ is a manually set hyper-parameter and is in most cases not the optimal choice. To find a more appropriate balance factor, we use a learnable matrix W_a to aggregate the individual diversity and relevance scores into the final score:
score(d) = φ(s) = tanh(s^T W_a)
where s represents the vector formed by the sub-scores and φ represents a multilayer perceptron with tanh as the activation function, s = [div(d|q^v), r_S·div(d|S), r_L·div(d|L), rel(d, q)]. Here, for brevity, we omit the superscript t denoting the latest features; d, q^v, S, L all represent the corresponding virtual sub-topic sequences. The scores of general diversity and personalized diversity are computed with the KNRM structure:
div(d|q^v) = F_K(d, q^v)
div(d|S) = F_K(d, S)
div(d|L) = F_K(d, L)
rel(d, q) = φ(f(d, q))
We use f(d, q) to represent the sequence of additional manual features of the document and the query, and fuse the individual manual features with a multilayer perceptron φ to compute the relevance score.
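A minimal sketch of this learnable aggregation, with W_a modeled as a 4-to-1 linear map over the four sub-scores (an assumption about its shape):

import torch
import torch.nn as nn

W_a = nn.Linear(4, 1, bias=False)  # plays the role of the learnable matrix W_a

def total_score(div_q, div_S, div_L, rel, r_S, r_L):
    # s = [div(d|q^v), r_S*div(d|S), r_L*div(d|L), rel(d,q)]
    s = torch.stack([div_q, r_S * div_S, r_L * div_L, rel])
    return torch.tanh(W_a(s)).squeeze()   # score(d) = tanh(s^T W_a)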
Training and optimizing:
In previous personalization work, researchers assumed that clicked documents reflect only the relevance of the documents to the user and the query. However, user behavior is complex: a redundant relevant document may not be clicked because the user's information need in that respect has already been satisfied. That is, user behavior reflects not only document relevance but also document diversity. Therefore, the document clicked by the user at each step is regarded as a positive sample that satisfies the user's need in terms of both relevance and diversity, and an unclicked one as a negative sample. One sample pair of our model can be represented as (r_1, r_2), where r_1, r_2 are two document rankings, so our model is apparently a list-pair model; however, the difference between the two rankings lies only in the last document, so the sample pair can also be written as (C, d_1, d_2), where C represents the set of already selected documents and d_1, d_2 represent the last documents of the two rankings. Let d_1 be the positive sample and d_2 the negative sample; we can then reduce our model to a C-conditioned pairwise method and use the binary classification loss to construct the loss function of our model:
L = − Σ_{q∈Q} Σ_{o∈Q_o} [ y_o log p̂_o + (1 − y_o) log(1 − p̂_o) ]
where Q represents all queries in the training set and Q_o represents all sample pairs corresponding to query q. y_o is the sample label: when (C, d_1) can satisfy the user's demand better than (C, d_2), y_o = 1, and otherwise y_o = 0; p̂_o represents the model-predicted probability that (C, d_1) is better than (C, d_2). Because our sample pairs can be written in the pairwise form (C, d_1, d_2), we can compute p̂_o by the following formula:
p̂_o = p(d_1 ≻ d_2 | C) = sigmoid(score(d_1) − score(d_2))
Finally, the model is optimized step by step with the Adam optimizer.
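A sketch of the per-pair loss; taking p̂ as the sigmoid of the score difference is our assumption of one standard pairwise choice, since the text only states that p̂ is computed from (C, d_1, d_2).

import torch
import torch.nn.functional as F

def pair_loss(score_pos, score_neg):
    # Binary cross-entropy for one (C, d1, d2) pair with label y_o = 1:
    # p_hat = sigmoid(score(d1) - score(d2)).
    p_hat = torch.sigmoid(score_pos - score_neg)
    return F.binary_cross_entropy(p_hat, torch.ones_like(p_hat))

# Training step sketch: opt = torch.optim.Adam(model.parameters());
# loss.backward(); opt.step()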

Claims (6)

1. A search method fusing personalization and diversification based on deep learning, characterized in that: the user's historical query data, the current query and the candidate document set are input, and the method is realized through three steps. Step one: based on the HRNN model, learn the user's initial long-term and short-term profiles L_0 and S_0 from the long-term history H_l and the short-term history H_s, where H_l = {{q_1, D_1}, …, {q_n, D_n}}, n represents the total number of queries contained in the previous sessions, H_s = {{q_{n+1}, D_{n+1}}, …, {q_{n+m}, D_{n+m}}}, m is the number of queries already searched in the current session, q_n represents the nth query issued by the user, and D_n is the initial candidate document set returned by the ad-hoc search engine for query q_n; further use the matching degree between the query vector and the user profiles, where div(d|q^v) represents the general diversity of document d, div(d|S) and div(d|L) represent the personalized diversity of the document, and r_S, r_L are used as the weights of personalized diversity to dynamically control its influence on the score. Step two: construct an RRNN model; using a greedy algorithm, select the locally optimal document d* at each step according to the current comprehensive score score(d) of the documents in the remaining candidate set; then, according to the virtual sub-topic representation of that document, learn its influence on the sub-topic representations of the user's long-term and short-term profiles and of the query, model this influence with a reset-gate structure, and update q^v_{t-1}, S_{t-1}, L_{t-1} to q^v_t, S_t, L_t, where q^v_t, S_t, L_t respectively denote the vector representations of the query statement, the user's short-term profile and the user's long-term profile after forgetting the sub-topics covered by the t selected documents, used for the score calculation of the next document selection, and q^v_0, L_0, S_0 denote the initial query and user-profile vector representations; finally, dynamic modeling of document diversity is realized. Step three: aggregate the individual score results, the scores being the basis for re-ranking the candidate documents, and output the re-ranked search results.
2. The deep-learning-based search method fusing personalization and diversification according to claim 1, wherein: the general-diversity weight and the personalized-diversity weight are calculated as follows. Word embeddings are learned using word2vec, and the vector representations q', d' are obtained by a TF-IDF-weighted summation of the word embeddings; the initial long-term and short-term profiles L'_0, S'_0 are learned with the HRNN structure.
The HRNN structure first uses a first-layer RNN that takes all of the user's query records as input, treating the end of each session as a period node, to construct the user's interest representation in each session:
s^1_{m,n} = RNN^1(s^1_{m,n-1}, [q_{m,n}; d_{m,n}])
where subscripts m and n denote the nth query record in the mth session, superscript 1 denotes the first-layer RNN structure, and q_{m,n}, d_{m,n} denote the corresponding input query and the average of its relevant documents; the user's short-term profile can be represented as:
S'_0 = s^1_{M,n_M}
where M denotes the current (Mth) session and n_M is the number of searches the user has performed in that session.
The long-term profile is constructed from the entire history excluding the current session record. First, each session representation of the user's history is taken as input, and a second-layer RNN structure is applied to learn the user's interest representation in each period:
s^2_m = RNN^2(s^2_{m-1}, s^1_{m,n_m})
where m denotes the mth session and n_m is the number of searches the user performed in that session. Then the similarity α_m between the current query q' and each period's interest representation s^2_m is used as the weight of that period's interest, and the weighted sum is taken as the user's long-term interest profile:
α'_m = sim(q', s^2_m)
α_m = exp(α'_m) / Σ_j exp(α'_j)
L'_0 = Σ_m α_m · s^2_m
A convolutional neural network conv is introduced to learn a sequence of virtual sub-topic representations from the original representation of the query:
q^v = [q_1, q_2, …, q_c] = conv(q')
where q_i denotes the ith sub-topic representation of the query and c denotes the number of sub-topics the query contains; the sub-topic representations d, L_0, S_0 of the document and of the user's long-term and short-term profiles are obtained likewise:
d = [d_1, d_2, …, d_c] = conv(d')
L_0 = [L_1, L_2, …, L_c] = conv(L'_0)
S_0 = [S_1, S_2, …, S_c] = conv(S'_0)
Furthermore, the KNRM model is introduced. Let t_q = [t_{q1}, …, t_{qn}] and t_d = [t_{d1}, …, t_{dm}] denote the word sequences of the query and of the document, where t_{qi} is the ith word-embedding vector of the query, t_{dj} is the jth word-embedding vector of the document, and n and m denote the numbers of words in the current query and document respectively; KNRM constructs a translation matrix M from the cosine similarity between each pair of query and document words:
M_ij = cos_similarity(t_{qi}, t_{dj})
KNRM then uses k Gaussian kernels to learn multi-dimensional features on the translation matrix, and computes the similarity of the query and the document with a multilayer perceptron; the specific calculation is:
F_K(t_q, t_d) = φ(f_1(M), …, f_o(M), …, f_k(M))
f_o(M) = Σ_i log Σ_j exp(−(M_ij − u_o)² / (2σ_o²))
where u_o and σ_o, the mean and standard deviation of the oth Gaussian kernel, are manually set hyper-parameters; the weights can then be expressed as:
r_S = F_K(q^v_0, S_0), r_L = F_K(q^v_0, L_0)
3. The deep-learning-based search method fusing personalization and diversification according to claim 2, wherein: the reset gate is constructed in the following manner:
R^q_t = σ([q^v_t ; d*] W_q)
where [q^v_t ; d*] denotes splicing q^v_t and d* together, W_q is a learnable mapping parameter that maps [q^v_t ; d*] to a real number, and σ(·) represents the activation function.
4. The deep learning-based search method integrating personalization and diversification as claimed in claim 3, wherein: the activation function is a tanh function.
5. The deep-learning-based search method fusing personalization and diversification according to claim 4, wherein: the update function of the reset gate is:
q^v_{t+1} = q^v_t ⊙ R^q_t + q^v_t
The total score is computed with a learnable matrix W_a that aggregates the individual diversity and relevance scores:
score(d) = φ(s) = tanh(s^T W_a)
where s denotes the vector formed by the individual sub-scores and φ denotes a multilayer perceptron with tanh as the activation function, s = [div(d|q^v), r_S·div(d|S), r_L·div(d|L), rel(d, q)], where q^v, S, L are the latest q^v_t, S_t, L_t with the superscript t omitted; d, q^v, S, L all denote the corresponding virtual sub-topic sequences, and the scores of general diversity and personalized diversity are computed with the KNRM structure:
div(d|q^v) = F_K(d, q^v)
div(d|S) = F_K(d, S)
div(d|L) = F_K(d, L)
rel(d, q) = φ(f(d, q))
f(d, q) denotes the sequence of additional manual features of the document and the query; the individual manual features are fused with a multilayer perceptron φ, and rel(d, q) represents the relevance score of the query and document computed from the additional manual features.
6. The deep-learning-based search method fusing personalization and diversification according to claim 5, wherein: the loss function of the method is constructed from the binary classification loss:
L = − Σ_{q∈Q} Σ_{o∈Q_o} [ y_o log p̂_o + (1 − y_o) log(1 − p̂_o) ]
where Q represents all queries in the training set and Q_o represents all sample pairs corresponding to query q; y_o is the sample label: when (C, d_1) satisfies the user's demand better than (C, d_2), y_o = 1, and otherwise y_o = 0; p̂_o represents the model-predicted probability that (C, d_1) is better than (C, d_2). Since the sample pairs are written in the pairwise form (C, d_1, d_2), p̂_o is computed by the following formula:
p̂_o = p(d_1 ≻ d_2 | C) = sigmoid(score(d_1) − score(d_2))
Finally, the model is optimized step by step with the Adam optimizer.
CN202111391539.0A 2021-11-23 2021-11-23 Deep learning-based personalized and diversified search method Active CN114048386B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111391539.0A CN114048386B (en) 2021-11-23 2021-11-23 Deep learning-based personalized and diversified search method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111391539.0A CN114048386B (en) 2021-11-23 2021-11-23 Deep learning-based personalized and diversified search method

Publications (2)

Publication Number Publication Date
CN114048386A true CN114048386A (en) 2022-02-15
CN114048386B CN114048386B (en) 2024-04-09

Family

ID=80211164

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111391539.0A Active CN114048386B (en) 2021-11-23 2021-11-23 Deep learning-based personalized and diversified search method

Country Status (1)

Country Link
CN (1) CN114048386B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023243796A1 (en) * 2022-06-17 2023-12-21 Samsung Electronics Co., Ltd. Method and system for personalising machine learning models

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20110023304A (en) * 2009-08-31 2011-03-08 서울시립대학교 산학협력단 Method and system of configuring user profile based on a concept network and personalized query expansion system using the same
CN105068661A (en) * 2015-09-07 2015-11-18 百度在线网络技术(北京)有限公司 Man-machine interaction method and system based on artificial intelligence
US20190188295A1 (en) * 2017-12-15 2019-06-20 Accenture Global Solutions Limited Cognitive searches based on deep-learning neural networks
CN111241407A (en) * 2020-01-21 2020-06-05 中国人民大学 Personalized search method based on reinforcement learning
CN112182387A (en) * 2020-09-29 2021-01-05 中国人民大学 Personalized search algorithm for enhancing time information
CN112182373A (en) * 2020-09-25 2021-01-05 中国人民大学 Context expression learning-based personalized search method

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20110023304A (en) * 2009-08-31 2011-03-08 서울시립대학교 산학협력단 Method and system of configuring user profile based on a concept network and personalized query expansion system using the same
CN105068661A (en) * 2015-09-07 2015-11-18 百度在线网络技术(北京)有限公司 Man-machine interaction method and system based on artificial intelligence
US20190188295A1 (en) * 2017-12-15 2019-06-20 Accenture Global Solutions Limited Cognitive searches based on deep-learning neural networks
CN111241407A (en) * 2020-01-21 2020-06-05 中国人民大学 Personalized search method based on reinforcement learning
CN112182373A (en) * 2020-09-25 2021-01-05 中国人民大学 Context expression learning-based personalized search method
CN112182387A (en) * 2020-09-29 2021-01-05 中国人民大学 Personalized search algorithm for enhancing time information

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Guan Rui; Ding Jiaman; Jia Lianyin; You Jinguo; Jiang Ying: "Diversified document ranking algorithm based on reinforcement learning", Computer Engineering and Science, no. 09, 15 September 2020 (2020-09-15) *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023243796A1 (en) * 2022-06-17 2023-12-21 Samsung Electronics Co., Ltd. Method and system for personalising machine learning models

Also Published As

Publication number Publication date
CN114048386B (en) 2024-04-09

Similar Documents

Publication Publication Date Title
CN109299396B (en) Convolutional neural network collaborative filtering recommendation method and system fusing attention model
CN109829104B (en) Semantic similarity based pseudo-correlation feedback model information retrieval method and system
US5619709A (en) System and method of context vector generation and retrieval
CN105659225B (en) Use the query expansion and inquiry-document matches of path constrained random migration
CN111581510A (en) Shared content processing method and device, computer equipment and storage medium
CN110555112A (en) interest point recommendation method based on user positive and negative preference learning
JP2020510910A (en) Machine learning method and apparatus for ranking network nodes after using a network with software agents at network nodes
León et al. Applying logistic regression to relevance feedback in image retrieval systems
US20190286978A1 (en) Using natural language processing and deep learning for mapping any schema data to a hierarchical standard data model (xdm)
El Mohadab et al. Predicting rank for scientific research papers using supervised learning
CN112182373B (en) Sexualization search method based on context representation learning
US20220172040A1 (en) Training a machine-learned model based on feedback
CN114358657B (en) Post recommendation method and device based on model fusion
CN112182387A (en) Personalized search algorithm for enhancing time information
CN108920521A (en) User's portrait-item recommendation system and method based on pseudo- ontology
CN116108267A (en) Recommendation method and related equipment
Ngueilbaye et al. SDLER: stacked dedupe learning for entity resolution in big data era
CN111651661A (en) Image-text cross-media retrieval method
Guo et al. Matching visual features to hierarchical semantic topics for image paragraph captioning
CN114048386A (en) Search method integrating personalization and diversification based on deep learning
Alper et al. Hybrid course recommendation system design for a real-time student automation application
CN113535949A (en) Multi-mode combined event detection method based on pictures and sentences
CN114036262B (en) Graph-based search result diversification method
Chang et al. Two-channel hierarchical attention mechanism model for short text classification
Xuan et al. Cooperative hierarchical Dirichlet processes: Superposition vs. maximization

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant