CN105930400A - Markov decision process model based session search method - Google Patents

Markov decision process model based session search method Download PDF

Info

Publication number
CN105930400A
CN105930400A CN201610237174.9A CN201610237174A CN105930400A CN 105930400 A CN105930400 A CN 105930400A CN 201610237174 A CN201610237174 A CN 201610237174A CN 105930400 A CN105930400 A CN 105930400A
Authority
CN
China
Prior art keywords
document
degree
association
inquiry
session
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610237174.9A
Other languages
Chinese (zh)
Other versions
CN105930400B (en
Inventor
刘峰
朱荣鑫
唐丹丹
张苏可
丁霄汉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing University
Original Assignee
Nanjing University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing University filed Critical Nanjing University
Priority to CN201610237174.9A priority Critical patent/CN105930400B/en
Publication of CN105930400A publication Critical patent/CN105930400A/en
Application granted granted Critical
Publication of CN105930400B publication Critical patent/CN105930400B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/3349Reuse of stored results of previous queries

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a Markov decision process model based session search method. The method comprises the following steps of 1) in a preparation stage, crawling sufficient web pages and obtaining a corpus complete set C, wherein C is a result after screening crawled web page sets, and each web page correspondingly contains a document d; and performing spontaneous session search and recording a search process by a trainer to obtain training data, and generating a training document; 2) in a training stage, preprocessing the data, and making statistics on a degree of association between words t required to be used in a retrieval stage and the document d, wherein the degree of association includes Ps(t/d) using Dirichlet smoothing and Pus(t/d) not using the Dirichlet smoothing; and 3) in the retrieval stage, receiving a query statement qi currently input by a user; calculating a degree of association between each document d in the corpus complete set C and the current query statement qi through a formula; calculating a degree of association between each document d and search of the whole session; and returning first N documents with high degree of association.

Description

A kind of session searching method based on markov decision process model
Technical field
The present invention relates to the information retrieval of dialogue-based search, particularly relate to a kind of based on Markovian decision The session searching method of process model.
Background technology
Session search is a kind of realizing alternately by search engine and user during search sessions The search technique of information retrieval.When searching for the relevant information of a theme, user can be the most continuous Ground change inquiry content, until finding its information needed.In session search, user can returning according to search engine Returning result and adjust inquiry content, search engine is then as the feedback of user, in order to improve session search Result.The rule that in session search, the change pattern of user's inquiry is the most unified, search engine is difficult to accurately obtain The intention of user, thus session search is a challenging information retrieval task.Inquire about currently with study The method of change mainly has:
(1) classify based on different types of search, such as embody, summarize, conversion, or Slight change, then performs retrieving.
(2) another kind of method is inquiry to be mapped as semantic figure represent, as body or inquiry log are formed Inquiry flow graph, then research inquiry Changing Pattern in the drawings.
But, first method depends on complete inquiry record, and such data are often difficult to obtain. Ontological in second method maps very challenging, and this likely introduces coarse intermediate data, and damages The accuracy of bad search.So, although these methods may apply to, in information retrieval task, such as inquire about Standardization, Associate Inquire etc., it can be difficult to directly apply to session search.
In order to make full use of, session search be inquired about the information of change, invent a kind of based on Ma Er herein The session searching method of section husband decision process model, can effectively strengthen the effect of session search.
Summary of the invention
Goal of the invention: the invention provides the information retrieval based on markov decision process model of a kind of novelty Method.The statement q of inquiry can be made up of multiple word t ∈ q every time.Can utilize and inquire about between adjacent inquiry The change of statement, and the return result of previous retrieval strengthen session search accuracy.First the method obtains Take corpus, then training dataset, generate Training document, then to data prediction, finally receive inquiry Statement, returns Optimizing Queries ground result.
A kind of session searching method based on markov decision process model, the method comprises the steps:
1) preparatory stage
A) crawl abundant webpage, obtain corpus complete or collected works C.C is that the collections of web pages crawled is after screening Result.Corresponding document d, C={d in each webpagei}
B) trainer carries out spontaneous session and searches for and record its process to obtain training data (training data Including the inquiry related to, the change of inquiry, document that user clicks in the result that search engine returns and point thereof Hit the time of staying etc.), generate Training document
C) preparatory stage is terminated
2) training stage
A) data prediction, needs the P used in statistics retrieval phases(t | d) and Pus(t|d)
B) analyzing step 1-b) the middle Training document generated, including the information of multiple sessions
C) reading a session in intensified learning, a session includes that one or many user is to Search Results The information of operation
D) read information that Search Results operates by a user in session (include the inquiry related to, inquiry Change, the document of click, the time of staying etc. of click), and thus update PusThe value of (t | d)
E) step d) is repeated until conversation end
F) step c) is repeated, d), e) until all sessions are all processed complete
G) training stage is terminated
3) retrieval phase
A) the query statement q that user is currently entered is receivedi
B) each document d and current queries q is calculated by formulaiThe degree of association
C) degree of association of each document d and whole session is calculated
D) front N piece document (present invention takes 10) that the degree of association is high is returned
E) repeat a), b), c), d) until user terminates inquiry
F) retrieval phase is terminated
Wherein said step 1-b) described in training file
1) defining one group of theme (topics), trainer spontaneously carries out specific search according to the description of theme. Search engine uses the Yahoo BOSS APIs (Build your Own Search Service) that Yahoo provides
2) system records user and searching system is mutual.Including the inquiry related to, the change of inquiry, click on Document, the time of staying etc. of click
3) training file is generated
4) terminate
Wherein said step 2-b) described in data prediction:
1) calculateSmooth with using Di Li CrayAs word The initial value of language and document associations degree, wherein (t, d) is the number of times that occurs in document d of word t to #, and P (t | C) is T occurs in the number of times in complete or collected works C, the length that | d | is the document, and μ is the parameter of Di Li Cray method, this 5000 it are set in bright
2) terminate
Wherein said step 2-f) described in renewal PusThe process of the value of (t | d):
1) if the most mutual, P is not the most changedusThe value of (t | d)
2) if not the most mutual, if the most mutual inquiry content is qi, in the most mutual front inquiry Hold for qi-1, make qthemeFor qiAnd qi-1Longest common subsequence, then+Δ q=qi-qtheme,-Δ q= qi-1-qtheme.Update Pus(t | d) it is divided into weights constant, reduce weights and increase the situation of weights
A) situation constant with document d degree of association weights for word t.ForAnd the situation of t ∈-Δ q, Search engine does not change its weights
B) situation of word t and document d degree of association weights is reduced.When inquiry changes, no matter+Δ q or -Δ q, if the result set D that the search occurring in last time returnsi-1In, the weights of these words will be reduced. Pus(t | d) it is the contribution to the correlation between current queries and document to be assessed of the word t acquiescence.And due to word Language t is already present in document sets Di-1In, in order to embody novelty degree, word t is in document sets Di-1Middle appearance Frequency is the highest, and weights just subtract the most.Therefore, for (t ∈+Δ q ort ∈-Δ q) and t ∈ Di-1, have Equation below:
log P u s ( t | d ) n e w = ( 1 - P u s ( t | d i - 1 * ) ) log P u s ( t | d )
Use logarithmic function is to prevent numerical underflows herein.
Wherein determineProcess:
I. will be to qi-1Search returns front ten fragments of result with satisfied click as effective Search Results, It is designated asSo-called satisfied click refers to that the time of staying is more than 30s on the document clicked on
Ii. for all of Search ResultsFind out textual association degree and inquire about q with last timei-1Maximum 'sI.e.
d i - 1 * = argmax d k ∈ D i - 1 e ( q i - 1 | d k )
Wherein
Iii. calculateValue;With 2-a method
Iv. terminate to determineProcess
C) situation of word t and document d degree of association weights is increased.
I. when for a word increased and the result set D that do not appears in last inquiryi-1In, this Bright will increase the weights of this word according to the frequency of anti-document.ForThere are following public affairs Formula:
log Pus(t|d)new=(1+idf (t)) log Pus(t|d)
Wherein:
Idf (t) is the frequency of anti-document, is defined as:Wherein D is that search engine returns All number of documents.DWIt it is the number of documents that in D, t occurs.
Ii. for t ∈ qtheme, it is also desirable to increase weights, because descriptor is typically the topic in a session Class or everyday words, be not everyday expressions in whole complete or collected works.Therefore, idf (t) and inapplicable herein.The present invention The inverse operation of frequency occurred with word t previously maximum return document,Replace idf (t). Formula is as follows:
log P u s ( t | d ) n e w = ( 1 + ( 1 - P u s ( t | d i - 1 * ) ) ) log P u s ( t | d )
3) terminate to update PusThe process of the value of (t | d)
Wherein said step 3-b) described in document d with inquiry qiCalculation of relationship degree:
1) if i=1, the i.e. user first time in this session inquires about
Then degree of association Score (q1, d)=logP (q1|d)
If i > 1, the then degree of association:
S c o r e ( q i , d ) = log P ( q i | d ) + α Σ t ∈ q t h e m e [ 1 - P ( t | d i - 1 * ) ] log P u s ( t | d ) - β Σ t ∈ + Δ q t ∈ d i - 1 * P u s ( t | d i - 1 * ) log P u s ( t | d ) + ∈ Σ t ∈ + Δ q t ∉ d i - 1 * i d f ( t ) log P u s ( t | d ) - δ Σ t ∈ - Δ q P u s ( t | d i - 1 * ) log P u s ( t | d )
Wherein P (qi| d) it is instant income,Ps(t | d) by step 2-b) try to achieve.
α, β, ∈, δ are the discount factor of each type action.α=2.15 are obtained according to all previous experiment, β=1.75, ∈=0.07, δ=0.42
2) process calculating document d with the degree of association of inquiry qi is terminated
Wherein said step 3-c) described in the calculation of relationship degree of document d and session search:
1 ) - - - Score s e s s i o n ( q n , d ) = Σ i = 1 n γ n - i S c o r e ( q i , d )
Wherein Score (q1, d)=logP (q1|d)
γ in MDP model is discount factor, and the present invention takes γ=0.8.Repeat in view of user is necessarily dissatisfied Inquiry between inquiry and Search Results, the discount factor repeated between inquiry is set to 0 by the present invention.Formula is such as Under:
2) terminate
A kind of generally applicable decision model of markov decision process (MDP).MDP is that decision process is correlated with All agencies set up state space S and motion space A.The action of agency affects conditions Ambient so that state Making uncertain conversion, the feedback of action also can affect the Action Selection of agency.In the present invention, inquiry q modeling For state S, the adjustment of word and document associations degree weights will be tired out by search engine as action A, search engine The document that long-pending financial value is high returns to user as Search Results.Inquiry before user affects Search Results, with Time Search Results can affect the decision-making that user inquires about next time.This process does not stop iteration, until poll-final.
Beneficial effect, the invention provides the information retrieval based on markov decision process model of a kind of novelty Method.The statement q of inquiry can be made up of multiple word t ∈ q every time.Can utilize and inquire about between adjacent inquiry The change of statement, and the return result of previous retrieval strengthen session search accuracy.First the method obtains Take corpus, then training dataset, generate Training document, then to data prediction, finally receive inquiry Statement, returns Optimizing Queries ground result.The inventive method is not rely on complete inquiry record, can directly answer Search for for session.The information inquiring about change in session search can be made full use of, can effectively strengthen session and search The effect of rope.
Accompanying drawing explanation
The workflow diagram of Fig. 1 summation present invention;
The session searching method workflow diagram based on markov decision process model of Fig. 2 present invention;
Fig. 3 generates the workflow diagram of training file;
Fig. 4 training stage workflow diagram;
Fig. 5 updates P according to interaction contentusThe value workflow diagram of (t | d);
Fig. 6 information retrieval stage workflow diagram.
Detailed description of the invention
The present invention is described in detail below in conjunction with the accompanying drawings.
The present invention is session searching method based on markov decision process model, it is intended that when improving information retrieval Accuracy, provide the user useful and satisfied information.As it is shown in figure 1, describe the process of the present invention Journey.The present invention first obtains corpus and training data, then pre-processes data, then performs specifically to look into Ask retrieval, finally export result.
In the present invention, process is divided into three phases, preparatory stage, training stage and retrieval phase, such as Fig. 2 Shown in.The present invention's it is critical that optimizes word and the weights of document associations degree according to training data, and Query script adjusts word and the weights of document associations degree in real time, and calculates document and the session inquiry degree of association Method.
Step 2-0 is the initial state of the session searching method based on markov decision process model of the present invention;
Preparatory stage includes step 2-1,2-2;
Step 2-1 obtains corpus, crawls abundant webpage;
Step 2-2 obtains training data, generates Training document, and detailed step is shown in Fig. 4;
Training stage includes step 2-3, step 2-4, step 2-5;
Step 2-3 data prediction, calculates and associates angle value between word t and document d;
Step 2-4 reads Training document, reads the interactive information in session;
Step 2-5 updates P according to interaction contentusThe value of (t | d);
Retrieval phase includes step 2-6, step 2-7;
Step 2-6 receives the query statement of user's input, calculates the degree of association of each document and whole session;
Step 2-7, according to the degree of association of document Yu inquiry, returns the top n document that the degree of association is high, and the present invention takes N is 10;
Step 2-8 is done state.
Fig. 3 is the detailed description of the process generating training file.
Step 3-0 is the beginning state generating training file;
Step 3-1, according to corpus complete or collected works C, defines one group of theme, and trainer is carried out according to the description of theme Search;
Step 3-2 record user and searching system mutual.Including the inquiry related to, the change of inquiry, click on Document, the time of staying etc. of click;
Step 3-3 generates training file;
Step 3-4 is the done state of the process generating training file,
Fig. 4 is the detailed description for the training stage.
Step 4-0 is to start training step;
Data are pre-processed by step 4-1, calculateSmooth with using Di Li CrayAnd preserve, # (t, d) is the number of times that occurs in document d of word t, P (t | C) be T occurs in the number of times in complete or collected works C, the length that | d | is the document, and μ is the parameter of Di Li Cray method, this 5000 it are set in bright;
Step 4-2 reads training file;
Step 4-3 reads a session;
It is mutual that step 4-4 is taken out in session;
Step 4-5 judges whether terminating alternately in session, if terminating to forward step 4-6 to, if do not terminated Forward step 4-4 to;
Step 4-6 updates P according to interaction contentusThe value of (t | d), detailed step is shown in Fig. 5;
Whether the session in step 4-7 training of judgement file is read and is terminated, if all training terminates then to forward to Step 4-8, if otherwise forwarding step 4-3 to;
Step 4-8 is training stage done state.
Fig. 5 is to update PusThe detailed description of the value of (t | d).
Step 5-0 is to update PusThe beginning state of the value of (t | d);
Step 5-1 judges whether to belong to the most mutual, belongs to, forwards step 5-2 to, otherwise forwards step 5-3 to;
Step 5-2 belongs to the most mutual, and weights are constant, forward step 5-7 to;
Step 5-3 is not belonging to the most mutual, it is judged that t ∈+Δ q or t ∈-Δ q.Belong to, forward step 5-4 to, Otherwise forward step 5-5 to.Determine+Δ q ,-Δ q, qthemeMethod: set the most mutual inquiry content as qi, Last mutual inquiry content is qi-1, make qthemeFor qiAnd qi-1Longest common subsequence, + Δ q=qi-qtheme,-Δ q=qi-1-qtheme
Step 5-4 changes when inquiry, though+Δ q or-Δ q, as long as occurring in the Search Results D of last timei-1 In, the weights of these words will be reduced, pass directly to step 5-7 afterwards.Therefore, for (t ∈+Δ q or t ∈ -Δq)and t∈Di-1, there is an equation below:
log P u s ( t | d ) n e w = ( 1 - P u s ( t | d i - 1 * ) ) log P u s ( t | d )
Use logarithmic function is to prevent numerical underflows herein.P (t | d) occurred in complete or collected works C by calculating t Probability try to achieve.Wherein, wherein determineJourney:
1) using the click of front ten fragments returning result and satisfaction as effective Search Results, it is designated as The definition of satisfied click is that the document time of staying clicked on is more than 30s.
2) for all of Search ResultsFind out textual association degree and inquire about q with last timei-1? BigI.e.
d i - 1 * = argmax d k ∈ D i - 1 e P ( q i - 1 | d k )
Wherein
3) according to t andSearchValue.
If step 5-5 t is unsatisfactory for (t ∈+Δ q or t ∈-Δ q) and t ∈ Di-1, it is judged that t ∈+Δ qOr t is ∈ qthemeIf meeting, forwarding step 5-6 to, otherwise forward step 5-2 to;
Step 5-6 meets t ∈+Δ qOr t is ∈ qtheme, increase weights, the most directly turn To step 5-7.
1) for t ∈+Δ qThere is an equation below:
logPus(t|d)new=(1+idf (t)) logPus(t|d)
2) for t ∈ qtheme, it is also desirable to increase weights, owing to descriptor is typically the topic in a session Class or everyday words, be not everyday expressions in whole complete or collected works.Therefore, idf (t) and inapplicable herein.The present invention The inverse operation of the frequency that middle word t previously maximum return document occurs,Replace idf (t). Formula is as follows:
log P u s ( t | d ) n e w = ( 1 + ( 1 - P u s ( t | d i - 1 * ) ) ) log P u s ( t | d )
Wherein: idf (t) is inverse text index, and formula is:Wherein D is that search engine returns Whole webpage numbers, DWIt it is the webpage number of t appearance.
Step 5-7 is for updating PusThe done state of the value of (t | d).
Fig. 6 is information retrieval stage workflow diagram
Step 6-0 is the beginning state in information retrieval stage;
Step 6-1 receives the query statement of user's input;
Step 6-2 calculates each document d and current queries q by formulaiThe degree of association;
Step 6-3 calculates the search before consideration user of the degree of association of each document d and whole session to search The impact of result;
Step 6-4 returns the document high with the degree of association of whole session;
Step 6-5 judges whether user's inquiry terminates, and terminates then to perform step 6-1, otherwise performs step 6-6;
Step 6-6 is the done state in information retrieval stage.
Technological means disclosed in the present invention program is not limited only to the technological means disclosed in above-mentioned embodiment, also Including the technical scheme being made up of above technical characteristic.It should be pointed out that, for the art For those of ordinary skill, under the premise without departing from the principles of the invention, it is also possible to make some improvements and modifications, These improvements and modifications are also considered as protection scope of the present invention.

Claims (3)

1. a session searching method based on markov decision process model, it is characterised in that include as follows Step:
1) preparatory stage
A) crawl abundant webpage, obtain corpus complete or collected works C;C is that the collections of web pages crawled is after screening Result;To having document d, then C={d in each webpagei};
B) trainer carries out spontaneous session and searches for and record its process to obtain training data (training data Including the inquiry related to, the change of inquiry, document that user clicks in the result that search engine returns and point thereof Hit the time of staying etc.), generate Training document;
C) preparatory stage is terminated;
2) training stage
A) data prediction, needs the degree of association between word t and the document d used in statistics retrieval phase, P is smoothed including using Di Li Crays(t | d) and do not use the smooth P of Di Li Crayus(t|d);
B) analyzing step 1-b) the middle Training document generated, including the information of session;
C) reading a session in Training document, a session includes that one or many user is to Search Results The information of operation;
D) (including the inquiry related to, inquiry is more to read in session the information that Search Results operates by user Change, the document of click, the time of staying etc. of click), and thus update PusThe value of (t | d);
E) step d) is repeated until conversation end;
F) step c) is repeated, d), e) until all sessions are all processed complete;
G) training stage is terminated;
3) retrieval phase
A) the query statement q that user is currently entered is receivedi
B) each document d and current queries q in corpus complete or collected works C is calculated by formulaiThe degree of association;
C) degree of association of each document d and whole session search is calculated;
D) front N piece document (present invention takes 10) that the degree of association is high is returned;
E) step a) is repeated, b), c), d) until user terminates inquiry;
F) retrieval phase is terminated;
Wherein said step 2-a) described in data prediction:
1) calculateSmooth with using Di Li CrayAs word The initial value of language t and the document d degree of association, wherein # (t, d) is the number of times that occurs in document d of word t, P (t | C) is the number of times that t occurs in corpus complete or collected works C, and | d | is the length of document d, and μ is Di Li Cray side The parameter of method, is set to 5000 in the present invention;
2) terminate;
Wherein said step 2-d) described in more neologism t and document d between the degree of association, i.e. update Pus(t|d) The process of value:
1) if inquiry is mutual for the first time, P is not the most changedusThe value of (t | d);
2) mutual, if the most mutual inquiry content is q if not inquiry for the first timei, the most mutual front looking into Inquiry content is qi-1, make qthemeFor qiAnd qi-1Longest common subsequence ,+Δ q=qi-qtheme,-Δ q= qi-1-qtheme;To PusThe renewal of (t | d) is divided into weights constant, reduces weights and increases the situation of weights;
A) situation constant with the weights of the document d degree of association for word t;ForAnd the feelings of t ∈-Δ q Condition, search engine does not change its weights;
B) situation of word t and document d degree of association weights is reduced;When inquiry changes, whether+Δ q is also It is-Δ q, as long as occurring in document sets D of the Search Results of last timei-1In, the weights of these words will be reduced; Pus(t | d) it is the word t contribution to the correlation between current queries and document to be assessed;And due to word t It is already present in document sets Di-1In, in order to embody novelty degree, word t is in document sets Di-1The frequency of middle appearance The highest, weights just subtract the most;Therefore, for (t ∈+Δ q or t ∈-Δ q) and t ∈ Di-1, just like Lower formula:
log P u s ( t | d ) n e w = ( 1 - P u s ( t | d i - 1 * ) ) log P u s ( t | d )
Use logarithmic function is to prevent numerical underflows herein;
Wherein determineProcess:
I. will be to qi-1Search returns front ten fragments of result with satisfied click as effective Search Results, It is designated asSo-called satisfied click refers to that the time of staying is more than 30s on the document clicked on;
Ii. for all of document searching resultFind out textual association degree and inquire about q with last timei-1 MaximumI.e.
d i - 1 * = argmax d k ∈ D i - 1 e P ( q i - 1 | d k )
Wherein
Iii. calculateValue;With 2-a method
Iv. terminate to determineProcess;
C) situation of word t and document d degree of association weights is increased;
I. when for a word increased and the result set D that do not appears in last inquiryi-1In, this The bright middle weights by proportional for the frequency according to anti-document these words of increase;If one in a lot of documents Common word, in order to ensure increasing the word of a preference when, it is to avoid increase too much;For t∈+Δq andThere is an equation below:
log Pus(t|d)new=(1+idf (t)) log Pus(t|d)
Wherein:
Idf (t) is the frequency of anti-document, is defined as:Wherein D is that search engine returns All numbers of document;DWIt it is the number of documents that t occurs in D;
Ii. for t ∈ qtheme, also increase weights, the topic class that is typically in a session due to descriptor or Everyday words, is not everyday expressions in whole complete or collected works;Therefore, idf (t) and inapplicable herein;The present invention uses The inverse operation of the frequency that word t previously maximum return document occurs,Replace idf (t);Public Formula is as follows:
log P u s ( t | d ) n e w = ( 1 + ( 1 - P u s ( t | d i - 1 * ) ) ) log P u s ( t | d )
3) terminate to update PusThe process of the value of (t | d).
Session searching method the most according to claim 1, it is characterised in that wherein said step 3-b) described Document d with inquiry qiCalculation of relationship degree:
1) if i=1, the i.e. user first time in this session inquires about
Then degree of association Score (q1, d)=logP (q1|d)
If i > 1, the then degree of association:
S c o r e ( q i , d ) = log P ( q i | d ) + α Σ t ∈ q t h e m e [ 1 - P ( t | d i - 1 * ) ] log P u s ( t | d ) - β Σ t ∈ + Δ q t ∈ d i - 1 * P u s ( t | d i - 1 * ) log P u s ( t | d ) + ∈ Σ t ∈ + Δ q t ∉ d i - 1 * i d f ( t ) log P u s ( t | d ) - δ Σ t ∈ - Δ q P u s ( t | d i - 1 * ) log P u s ( t | d )
Wherein P (qi| d) it is instant income,Ps(t | d) by step 2-a) try to achieve;
α, β, ∈, δ are the discount factor of each type action;α=2.15 are obtained according to all previous experiment, β=1.75, ∈=0.07, δ=0.42;
2) process calculating document d with the degree of association of inquiry qi is terminated.
Session searching method the most according to claim 1, it is characterised in that wherein said step 3-c) described The calculating of document d and the session search degree of association:
1)
Wherein Score (q1, d)=logP (q1|d)
γ in MDP model is discount factor, and the present invention takes γ=0.8;Repeat in view of user is necessarily dissatisfied Inquiry between inquiry and Search Results, the discount factor repeated between inquiry is set to 0 by the present invention;Formula is such as Under:
2) terminate.
CN201610237174.9A 2016-04-15 2016-04-15 A kind of session searching method based on markov decision process model Active CN105930400B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610237174.9A CN105930400B (en) 2016-04-15 2016-04-15 A kind of session searching method based on markov decision process model

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610237174.9A CN105930400B (en) 2016-04-15 2016-04-15 A kind of session searching method based on markov decision process model

Publications (2)

Publication Number Publication Date
CN105930400A true CN105930400A (en) 2016-09-07
CN105930400B CN105930400B (en) 2019-10-11

Family

ID=56839103

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610237174.9A Active CN105930400B (en) 2016-04-15 2016-04-15 A kind of session searching method based on markov decision process model

Country Status (1)

Country Link
CN (1) CN105930400B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107729512A (en) * 2017-10-20 2018-02-23 南京大学 A kind of session searching method based on part Observable markov decision process model
CN109117475A (en) * 2018-07-02 2019-01-01 武汉斗鱼网络科技有限公司 A kind of method and relevant device of text rewriting
CN109241243A (en) * 2018-08-30 2019-01-18 清华大学 Candidate documents sort method and device
CN109783709A (en) * 2018-12-21 2019-05-21 昆明理工大学 A kind of sort method based on Markovian decision process and k- arest neighbors intensified learning
CN111241407A (en) * 2020-01-21 2020-06-05 中国人民大学 Personalized search method based on reinforcement learning

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080021708A1 (en) * 1999-11-12 2008-01-24 Bennett Ian M Speech recognition system interactive agent
CN102117321A (en) * 2010-01-06 2011-07-06 微软公司 Automated discovery aggregation and organization of subject area discussions
CN102262661A (en) * 2011-07-18 2011-11-30 南京大学 Web page access forecasting method based on k-order hybrid Markov model
CN103425710A (en) * 2012-05-25 2013-12-04 北京百度网讯科技有限公司 Subject-based searching method and device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080021708A1 (en) * 1999-11-12 2008-01-24 Bennett Ian M Speech recognition system interactive agent
CN102117321A (en) * 2010-01-06 2011-07-06 微软公司 Automated discovery aggregation and organization of subject area discussions
CN102262661A (en) * 2011-07-18 2011-11-30 南京大学 Web page access forecasting method based on k-order hybrid Markov model
CN103425710A (en) * 2012-05-25 2013-12-04 北京百度网讯科技有限公司 Subject-based searching method and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
肖哲: "基于MarKov链的Web访问序列挖掘算法研究", 《中国优秀硕士学位论文全文数据库》 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107729512A (en) * 2017-10-20 2018-02-23 南京大学 A kind of session searching method based on part Observable markov decision process model
CN109117475A (en) * 2018-07-02 2019-01-01 武汉斗鱼网络科技有限公司 A kind of method and relevant device of text rewriting
CN109241243A (en) * 2018-08-30 2019-01-18 清华大学 Candidate documents sort method and device
CN109783709A (en) * 2018-12-21 2019-05-21 昆明理工大学 A kind of sort method based on Markovian decision process and k- arest neighbors intensified learning
CN109783709B (en) * 2018-12-21 2023-03-28 昆明理工大学 Sorting method based on Markov decision process and k-nearest neighbor reinforcement learning
CN111241407A (en) * 2020-01-21 2020-06-05 中国人民大学 Personalized search method based on reinforcement learning
CN111241407B (en) * 2020-01-21 2023-07-28 中国人民大学 Personalized search method based on reinforcement learning

Also Published As

Publication number Publication date
CN105930400B (en) 2019-10-11

Similar Documents

Publication Publication Date Title
CN110442760B (en) Synonym mining method and device for question-answer retrieval system
CN110633409B (en) Automobile news event extraction method integrating rules and deep learning
CN103064945B (en) Based on the Situational searching method of body
US9824124B2 (en) Method and apparatus for web ad matching
CN105930400A (en) Markov decision process model based session search method
CN104750844B (en) Text eigenvector based on TF-IGM generates method and apparatus and file classification method and device
US9171078B2 (en) Automatic recommendation of vertical search engines
CN106649818A (en) Recognition method and device for application search intentions and application search method and server
US20210365517A1 (en) Method for Training Fusion Ordering Model, Search Ordering Method, Electronic Device and Storage Medium
CN105045875B (en) Personalized search and device
US20090265290A1 (en) Optimizing ranking functions using click data
CN107169001A (en) A kind of textual classification model optimization method based on mass-rent feedback and Active Learning
US20110004573A1 (en) Identifying training documents for a content classifier
CN107818105A (en) The recommendation method and server of application program
CN106204156A (en) A kind of advertisement placement method for network forum and device
CN101609450A (en) Web page classification method based on training set
US20090132515A1 (en) Method and Apparatus for Performing Multi-Phase Ranking of Web Search Results by Re-Ranking Results Using Feature and Label Calibration
CN104933239A (en) Hybrid model based personalized position information recommendation system and realization method therefor
CN110489523B (en) Fine-grained emotion analysis method based on online shopping evaluation
CN108717433A (en) A kind of construction of knowledge base method and device of programming-oriented field question answering system
CN103678576A (en) Full-text retrieval system based on dynamic semantic analysis
CN110134799B (en) BM25 algorithm-based text corpus construction and optimization method
CN110175585B (en) Automatic correcting system and method for simple answer questions
CN108182186B (en) Webpage sorting method based on random forest algorithm
CN111125538B (en) Searching method for enhancing personalized retrieval effect by utilizing entity information

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant