CN106294656A - A kind of map locating keyword is to the method for relevant issues - Google Patents

A kind of map locating keyword is to the method for relevant issues Download PDF

Info

Publication number
CN106294656A
CN106294656A CN201610631777.7A CN201610631777A CN106294656A CN 106294656 A CN106294656 A CN 106294656A CN 201610631777 A CN201610631777 A CN 201610631777A CN 106294656 A CN106294656 A CN 106294656A
Authority
CN
China
Prior art keywords
word
cps
inquiry
key word
descriptor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610631777.7A
Other languages
Chinese (zh)
Other versions
CN106294656B (en
Inventor
黄浩
颜钱
李宗鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing Yuanfeng Intelligent Technology Co.,Ltd.
Original Assignee
Wuhan University WHU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan University WHU filed Critical Wuhan University WHU
Priority to CN201610631777.7A priority Critical patent/CN106294656B/en
Publication of CN106294656A publication Critical patent/CN106294656A/en
Application granted granted Critical
Publication of CN106294656B publication Critical patent/CN106294656B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/3332Query translation
    • G06F16/3334Selection or weighting of terms from queries, including natural language queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation or dialogue systems

Abstract

The invention discloses a kind of map locating keyword method to relevant issues;First crawl problem information, then extraction key word of the inquiry and the descriptor of problem, select candidate question set and closeCPS q , forCPS q In each problem, calculate the degree of correlation of itself and key word of the inquiry, calculate the comprehensive score of this problem by structure degree of correlation and pouplarity, and right according to score order from high to lowCPS q In problem be ranked up being gatheredRP, subsequently by calculatingRPCosine similarity between middle problem selects the composition set of representational problem from all kinds of Similar ProblemsFP, final updatingFPIn the comprehensive score of each problem, and right according to mark order from high to lowFPIn problem be ranked up, return sequence after problem setFPAs the problem relevant to key word of the inquiry;The present invention can directly obtain the problem relevant to user's key word of the inquiry and answer, thus is more fully understood from user's request, it is thus achieved that preferably search experience.

Description

A kind of map locating keyword is to the method for relevant issues
Technical field
The invention belongs to technical field of information retrieval, particularly relate to a kind of map locating keyword side to relevant issues Method.
Background technology
Along with the development of web2.0, community-based question and answer website (community based question Answering sites, writes a Chinese character in simplified form CQA) become more popular, increasing people is by proposing problem at CQA and answering a question Carry out Knowledge Sharing.Inquire about information needed in a search engine with respect to keyword, the problem on CQA can be brighter True expression user's request, and each problem of CQA is furnished an answer by many users behind, and mark one Good answer, such that it is able to better meet the information retrieval demand of user.By depositing of CQA such a high quality information platform , the problem that the key word of the inquiry that user provides is mapped on CQA, it is possible not only to provide the user answer, but also permissible Deeply understand user's request and as clear and definite problem to serve web search result.
When the problem being mapped on CQA by the key word of the inquiry that user provides, some key word of the inquiry can not be all Being contained in problem, so being accomplished by establishing a standard, the dependency between key word of the inquiry and problem being judged. Meanwhile, after obtaining multiple problems relevant to key word of the inquiry, it should provide the problem that can react user's request accurately. Further, owing to problems a lot of on CQA have similarity, in order to meet the many demands of user, similar problem can be returned For same category, therefrom select representational problem, without all showing.
Summary of the invention
In order to solve above-mentioned technical problem, the invention provides a kind of map locating keyword method to relevant issues. For given information requirement, people can directly propose problem or select the word being correlated with to inquire about from problem, claim this A little words are the theme word.By key word of the inquiry and descriptor of all the problems are analyzed, obtain and key word of the inquiry phase The candidate's problem closed, being then passed through is ranked up candidate's problem and classifies accurately obtains what user's key word of the inquiry was mapped Problem.
The technical solution adopted in the present invention is: the method for a kind of map locating keyword to relevant issues, including following Step:
Step 1: carry out problem on CQA and crawl, and record each problem generic, obtains being made up of N number of problem Problem set PS, remembers PS={P1,P2,...,PN, for each problem P in set PSj, by the POS of a standard Tagger program extracts noun phrase therein, then combines its generic word and obtains the descriptor set of correspondence PTSj;For the key word of the inquiry q of n word composition, remember q={w1,w2,...,wn, calculate each word w in qiDescriptor Score Tgrade (wi), and by score more than threshold θtWord add descriptor set corresponding to q;Described θt∈[0,1]; If the descriptor set of certain problem comprises the descriptor set of key word of the inquiry, then this problem is added the time of key word of the inquiry Select problem set CPSq, otherwise this problem is considered as the problem unrelated with key word of the inquiry, does not considers;Each word w in qiMain Epigraph score Tgrade (wi) computing formula be:
T g r a d e ( w i ) = Σ j = 1 N T i m e s ( w i | PTS j ) Σ j = 1 N p t i m e s ( w i | P j ) , ( i = 1 , 2 , ... , n )
Wherein, n is the number of words that key word of the inquiry q comprises;wiIt it is the word in q;N is to comprise in problem set PS Number of questions;Times(wi|PTSj) it is word wiEach problem P in set PSjCorresponding descriptor set PTSjIn Occurrence number;ptimes(wi|Pj) it is word wiEach problem P in set PSjIn occurrence number.
Step 2: for set CPSqIn each problem PcIf, PcWith the degree of correlation between key word of the inquiry q is more Height, the information retrieval demand that more likely accurate response user is current, so can be by the relevant journey of problem to key word of the inquiry Spend as the important reference selecting final problem set, set of computations CPSqIn each problem PcWith inquiry key The degree of correlation of word q, corresponding result uses Cor (Pc, q) represent, Cor (Pc, being specifically calculated as q):
C o r ( P c , q ) = Π i = 1 n ( λ × c t i m e s ( w i | P c ) l e n g t h ( P c ) + ( 1 - λ ) Σ j = 1 N p t i m e s ( w i | P j ) Σ k = 1 n Σ j = 1 N p t i m e s ( w k | P j ) ) , ( c = 1 , 2 , ... , N c )
Wherein, NcIt is that candidate question set closes CPSqThe number of questions comprised;N is the number of words that key word of the inquiry q comprises; wiIt it is the word in q;ctimes(wi|Pc) it is word wiAt set CPSqIn each problem PcIn occurrence number; length(Pc) it is set CPSqIn each problem PcThe word number comprised;N is the problem comprised in problem set PS Number;ptimes(wi|Pj) it is word wiEach problem P in set PSjIn occurrence number;λ (λ ∈ (0,1)) is for giving Fixed inhibitive factor;Described λ ∈ (0,1).
Step 3: construct a figure G, set CPSqMiddle problem is as node, and each problem is corresponding to figure G's One node;Time initial, figure G only comprises node, and any two node VmAnd VnBetween the most there is not limit;Then for figure G In any two node VmAnd Vn, it is assumed that it is corresponding to set CPSqIn problem PmAnd Pn, calculate PmAnd PnDescriptor cover Lid rate Cover (Pm,Pn), if Cover is (Pm,Pn) more than given threshold θc, then there is node VmTo node VnA limit, otherwise Node VmWith node VnThe most there is not limit;Described θc∈[0,1];Wherein descriptor coverage rate Cover (Pm,Pn) computing formula For:
Wherein PTSmFor problem PmDescriptor set;||PTSm| | represent set PTSmIn element number cos (Pm,Pn) It it is the cosine similarity of two problems;α is given inhibitive factor;Described α ∈ (0,1).
Step 4: for set CPSqIn each problem PcIf being accessed for number of times the most, then show that this problem is more Welcome, more it is likely to be the problem corresponding to current keyword query, thus the pouplarity of problem is final as selecting One important reference of problem set, uses Wel (Pc) represent set CPSqIn each problem PcPouplarity, Wel(Pc) be specifically calculated as:
W e l ( P c ) = 1 N c + d Σ v ∈ a d j ( P c ) W e l ( v ) deg ( v ) , ( c = 1 , 2 , ... , N c )
Wherein, NcIt is that candidate question set closes CPSqThe number of questions comprised;adj(Pc) in representative graph G with each problem Pc The node set being connected;V is set adj (PcA node in);Deg (v) is the degree of node v;D (d ∈ (0,1)) gives Inhibitive factor;
Step 5: for set CPSqIn each problem Pc, combine its pouplarity and the phase with key word of the inquiry Pass degree, calculates the comprehensive score Grade (P of each problemc), according to comprehensive score order from big to small to CPSqIn Problem is ranked up, the problem set RP after being sorted;Comprehensive score Grade (Pc) be specifically calculated as:
Grade(Pc)=log (Cor (Pc,q))+log(Wel(Pc)), (c=1,2 ..., Nc)
Wherein NcIt is that candidate question set closes CPSqThe number of questions comprised;Cor(Pc, q) it is each problem PcClose with inquiry The degree of correlation of key word q;Wel(Pc) it is each problem PcPouplarity;
Step 6: initialize a null set FP, adds the first problem in RP FP, selects in RP surplus the most successively Remaining each problem Pr, calculate PrThe cosine similarity csim of each problem with in FP, the cosine similarity that record is maximum Problem P in maxcsim and corresponding FPf, by PrMark Grade (Pr) add Grade (Pf) to upper, if maxcsim is less than simultaneously Given threshold θs, then by PrAdd FP, if maxcsim is more than given threshold θs, described θs∈ [0,1], then it is assumed that problem PrWith PfSimilar, and record and problem PfSimilar problem number Nfq
Step 7: update each problem P in FP settComprehensive score, and according to mark from big to small suitable after updating Problem sequence in ordered pair FP, returns the set FP after sequence.The formula updating score is:
G r a d e ( P t ) N e w = G r a d e ( P t ) O l d N t q
Wherein Grade (Pt)OldFor problem P each in FPtRenewal before mark;NtqIt is and each problem PtSimilar Number of questions;Grade(Pt)NewIt it is each problem P in FPtRenewal after mark.
Preferably, in step 1, θt∈[0.3,0.9]。
Preferably, in step 2, λ ∈ (0.2,0.9).
Preferably, in step 3, α ∈ (0.1,1), θc∈[0.3,0.9]。
Preferably, in step 4, d ∈ (0.1,1).
Preferably, in step 6, θs∈[0.2,0.9]。
User's key word of the inquiry is mapped as clear and definite problem by the present invention, can the high quality information on CQA be integrated into Search engine, and the problem relevant to user's key word of the inquiry and answer can be directly obtained, thus be more fully understood from User's request, it is thus achieved that preferably search experience.
Accompanying drawing explanation
The flow chart of Fig. 1: the embodiment of the present invention.
Fig. 2: first three problem result figure of score rank in RP set in the embodiment of the present invention;
Fig. 3: first three problem result figure of score rank in FP set after sequence in the embodiment of the present invention;
Fig. 4: in the embodiment of the present invention, key word of the inquiry is inquired about in a search engine and obtained first three result figure of ranking.
Detailed description of the invention
Understand and implement the present invention for the ease of those of ordinary skill in the art, below in conjunction with the accompanying drawings and embodiment is to this Bright it is described in further detail, it will be appreciated that embodiment described herein is merely to illustrate and explains the present invention, not For limiting the present invention.
The invention provides a kind of map locating keyword method to relevant issues.For given key word of the inquiry, Map relative problem.
Asking for an interview Fig. 1, the present invention comprises the following steps:
Step 1: at Yahoo!The upper selection of Answers " iPod " classification carries out problem and crawls, and obtains problem set PS, remembers PS ={ P1,P2,...,PN}.For each problem P in set PSj, extract it by the POS tagger program of a standard In noun phrase, then in conjunction with " iPod " obtain correspondence descriptor set PTSj
Given key word of the inquiry q=" iPod downloaded videos ", calculate each word w in qiDescriptor Score Tgrade (wi), and by score more than threshold θtWord add descriptor set corresponding to q;Described θt∈[0,1]; If the descriptor set of certain problem comprises the descriptor set of key word of the inquiry, then this problem is added the time of key word of the inquiry Select problem set CPSq, otherwise this problem is considered as the problem unrelated with key word of the inquiry, does not considers;Each word w in qiMain Epigraph score Tgrade (wi) computing formula be:
T g r a d e ( w i ) = Σ j = 1 N T i m e s ( w i | PTS j ) Σ j = 1 N p t i m e s ( w i | P j ) , ( i = 1 , 2 , ... , n )
Wherein, n is the number of words that key word of the inquiry q comprises;wiIt it is the word in q;N is to comprise in problem set PS Number of questions;Times(wi|PTSj) it is word wiEach problem P in set PSjCorresponding descriptor set PTSjIn Occurrence number;ptimes(wi|Pj) it is word wiEach problem P in set PSjIn occurrence number.
Step 2: for set CPSqIn each problem PcIf, PcWith the degree of correlation between key word of the inquiry q is more Height, the information retrieval demand that more likely accurate response user is current, so can be by the relevant journey of problem to key word of the inquiry Spend as the important reference selecting final problem set, set of computations CPSqIn each problem PcWith inquiry key The degree of correlation of word q, corresponding result uses Cor (Pc, q) represent, Cor (Pc, being specifically calculated as q):
C o r ( P c , q ) = Π i = 1 n ( λ × t i m e s ( w i | P c ) l e n g t h ( P c ) ( 1 - λ ) Σ j = 1 N t i m e s ( w i | P j ) Σ k = 1 n Σ j = 1 N t i m e s ( w k | P j ) ) , ( c = 1 , 2 , ... , N c )
Wherein, NcIt is that candidate question set closes CPSqThe number of questions comprised;N is the number of words that key word of the inquiry q comprises; wiIt it is the word in q;ctimes(wi|Pc) it is word wiAt set CPSqIn each problem PcIn occurrence number; length(Pc) it is set CPSqIn each problem PcThe word number comprised;N is the problem comprised in problem set PS Number;ptimes(wi|Pj) it is word wiEach problem P in set PSjIn occurrence number;λ (λ ∈ (0,1)) is for giving Fixed inhibitive factor;Described λ ∈ (0,1).
Step 3: construct a figure G, set CPSqMiddle problem is as node, and each problem is corresponding to figure G's One node;Time initial, figure G only comprises node, and any two node VmAnd VnBetween the most there is not limit;Then for figure G In any two node VmAnd Vn, it is assumed that it is corresponding to set CPSqIn problem PmAnd Pn, calculate PmAnd PnDescriptor cover Lid rate Cover (Pm,Pn), if Cover is (Pm,Pn) more than given threshold θc, then there is node VmTo node VnA limit, otherwise Node VmWith node VnThe most there is not limit;Described θc∈[0,1];Wherein descriptor coverage rate Cover (Pm,Pn) computing formula For:
Wherein PTSmFor problem PmDescriptor set;||PTSm| | represent set PTSmIn element number cos (Pm,Pn) It it is the cosine similarity of two problems;α is given inhibitive factor;Described α ∈ (0,1).
Step 4: for set CPSqIn each problem PcIf being accessed for number of times the most, then show that this problem is more Welcome, more it is likely to be the problem corresponding to current keyword query, thus the pouplarity of problem is final as selecting One important reference of problem set, uses Wel (Pc) represent set CPSqIn each problem PcPouplarity, Wel(Pc) be specifically calculated as:
W e l ( P c ) = 1 N c + d Σ v ∈ a d j ( P c ) W e l ( v ) deg ( v ) , ( c = 1 , 2 , ... , N c )
Wherein, NcIt is that candidate question set closes CPSqThe number of questions comprised;adj(Pc) in representative graph G with each problem Pc The node set being connected;V is set adj (PcA node in);Deg (v) is the degree of node v;D (d ∈ (0,1)) gives Inhibitive factor;
Step 5: for set CPSqIn each problem Pc, combine its pouplarity and the phase with key word of the inquiry Pass degree, calculates the comprehensive score Grade (P of each problemc), according to comprehensive score order from big to small to CPSqIn Problem is ranked up, the problem set RP after being sorted.RP set in score rank first three problem as shown in Figure 2.Comprehensively Property score Grade (Pc) be specifically calculated as
Grade(Pc)=log (Cor (Pc,q))+log(Wel(Pc)), (c=1,2 ..., Nc)
Wherein NcIt is that candidate question set closes CPSqThe number of questions comprised;Cor(Pc, q) it is each problem PcClose with inquiry The degree of correlation of key word q;Wel(Pc) it is each problem PcPouplarity;
Step 6: initialize a null set FP, adds the first problem in RP FP, selects in RP surplus the most successively Remaining each problem Pr, calculate PrThe cosine similarity csim of each problem with in FP, the cosine similarity that record is maximum Problem P in maxcsim and corresponding FPf, by PrMark Grade (Pr) add Grade (Pf) to upper, if maxcsim is less than simultaneously Given threshold θs, then by PrAdd FP, if maxcsim is more than given threshold θs, described θs∈ [0,1], then it is assumed that problem PrWith PfSimilar, and record and problem PfSimilar problem number Nfq
Step 7: update each problem P in FP settComprehensive score, and according to mark from big to small suitable after updating In ordered pair FP problem sequence, after sequence FP set in comprehensive score rank first three problem as shown in Figure 3.Inquiry key Word inquire about in a search engine obtain ranking first three result as shown in Figure 4.Return the set FP after sequence.Update the public affairs of score Formula is:
G r a d e ( P t ) N e w = G r a d e ( P t ) O l d N t q
Wherein Grade (Pt)OldFor problem P each in FPtRenewal before mark;NtqIt is and each problem PtSimilar Number of questions;Grade(Pt)NewIt it is each problem P in FPtRenewal after mark.
User's key word of the inquiry is mapped as clear and definite problem by the present invention, is possible not only to provide the user answer, but also Can deeply understand user's request and as clear and definite problem to serve web search result, thus obtain and preferably search Cable body is tested.
In step 1, θt∈[0.3,0.9]。
In step 2, λ ∈ (0.2,0.9).
In step 3, α ∈ (0.1,1), θc∈[0.3,0.9]。
In step 4, d ∈ (0.1,1).
In step 6, θs∈[0.2,0.9]。
It should be appreciated that this specification is the part elaborated belongs to prior art.
It should be appreciated that the above-mentioned description for preferred embodiment is more detailed, can not therefore be considered this The restriction of invention patent protection scope, those of ordinary skill in the art, under the enlightenment of the present invention, is weighing without departing from the present invention Profit requires under the ambit protected, it is also possible to make replacement or deformation, within each falling within protection scope of the present invention, this The bright scope that is claimed should be as the criterion with claims.

Claims (6)

1. a map locating keyword is to the method for relevant issues, it is characterised in that comprise the following steps:
Step 1: carry out problem on CQA and crawl, and record each problem generic, obtains problem set PS, remembers PS={P1, P2,...,PN, for each problem P in set PSi, extract name therein by the POS tagger program of a standard Word phrase, then combines its generic word and obtains the descriptor set PTS of correspondencei;Inquiry for n word composition is closed Key word q, remembers q={w1,w2,...,wn, calculate each word w in qiDescriptor score Tgrade (wi), and score is more than Threshold θtt∈ [0,1]) word add descriptor set corresponding to q;Close if the descriptor set of certain problem comprises inquiry The descriptor set of key word, then the candidate question set that this problem adds key word of the inquiry closes CPSq;Each word w in qiTheme Word score Tgrade (wi) computing formula be:
T g r a d e ( w i ) = Σ j = 1 N T i m e s ( w i | PTS j ) Σ j = 1 N t i m e s ( w i | P j ) , ( i = 1 , 2 , ... , n )
Wherein n is the number of words that q comprises;wiIt it is the word in q;N is the number of questions comprised in PS;Times(wi|PTSj) It is wiAt PTSjIn occurrence number;times(wi|Pj) it is wiAt PjIn occurrence number;
Step 2: for set CPSqIn each problem PcIf, PcWith degree of correlation between key word of the inquiry q is the highest, The information retrieval demand that more likely accurate response user is current, uses Cor (Pc, q) represent PcRelevant to key word of the inquiry q Degree, Cor (Pc, being specifically calculated as q):
C o r ( P c , q ) = Π i = 1 n ( λ × t i m e s ( w i | P c ) l e n g t h ( P c ) + ( 1 - λ ) Σ j = 1 N t i m e s ( w i | P j ) Σ k = 1 n Σ j = 1 N t i m e s ( w k | P j ) ) , ( c = 1 , 2 , ... , N c )
Wherein NcIt is CPSqThe number of questions comprised;N is the number of words that q comprises;wiIt it is the word in q;times(wi|Pc) be wiAt PcIn occurrence number;length(Pc) it is PcThe word number comprised;N is the number of questions comprised in PS;λ(λ∈(0, 1)) for given inhibitive factor;
Step 3: construct a figure G, CPS will be gatheredqIn each problem as figure G node, then a set of computations CPSqIn any two problem PiAnd PjDescriptor coverage rate Cover (Pi,Pj), if Cover is (Pi,Pj) more than given threshold value θcc∈ [0,1]), then there is PiTo PjA limit;Wherein descriptor coverage rate Cover (Pi,Pj) computing formula be:
Wherein PTSiFor problem PiDescriptor set;||PTSi| | represent set PTSiIn element number cos (Pi,Pj) it is two The cosine similarity of individual problem;α (α ∈ (0,1)) is given inhibitive factor;
Step 4: for set CPSqIn each problem PcIf being accessed for number of times the most, then show that this problem is more by joyous Meet, be more likely to be the problem corresponding to current keyword query, use Wel (Pc) represent PcPouplarity, Wel (Pc) Be specifically calculated as:
W e l ( P c ) = 1 N c + d Σ v ∈ a d j ( P c ) W e l ( v ) deg ( v ) , ( c = 1 , 2 , ... , N c )
Wherein NcIt is CPSqThe number of questions comprised;adj(Pc) for figure G in PcThe node set being connected;V is set adj (Pc) In a node;Deg (v) is the degree of node v;The inhibitive factor that d (d ∈ (0,1)) is given;
Step 5: for set CPSqIn each problem Pc, combine its pouplarity and the relevant journey to key word of the inquiry Degree, calculates the comprehensive score Grade (P of each problemc), according to comprehensive score order from big to small to CPSqIn problem It is ranked up, the problem set RP after being sorted;Comprehensive score Grade (Pc) be specifically calculated as
Grade(Pc)=log (Cor (Pc|q))+log(Wel(Pc))
Wherein Cor (Pc| q) it is PcDegree of correlation with q;Wel(Pc) it is PcPouplarity;
Step 6: initialize a null set FP, adds the first problem in RP FP, selects in RP remaining the most successively Each problem Pr, calculate PrThe cosine similarity csim of each problem with in FP, cosine similarity maxcsim that record is maximum and Problem P in corresponding FPf, by PrMark Grade (Pf) add Grade (Pf) to upper, if maxcsim is less than given threshold θ simultaneouslyss∈ [0,1]), then by PrAdd FP, otherwise it is assumed that problem PrAnd PfSimilar, and record and problem PfSimilar problem number Nfq
Step 7: update the comprehensive score of each problem in FP set, and according to the order from big to small of the mark after updating to FP In problem sequence, return sequence after set FP;The formula updating score is:
G r a d e ( P f ) N e w = G r a d e ( P f ) O l d N f q
Wherein Grade (Pf)OldFor problem P in FPsRenewal before mark;NfqIt is and PfSimilar number of questions;Grade (Pf)NewIt it is problem P in FPfRenewal after mark.
A kind of map locating keyword the most according to claim 1 is to the method for relevant issues, it is characterised in that: in step In 1, θt∈[0.3,0.9]。
A kind of map locating keyword the most according to claim 1 is to the method for relevant issues, it is characterised in that: in step In 2, λ ∈ (0.2,0.9).
A kind of map locating keyword the most according to claim 1 is to the method for relevant issues, it is characterised in that: in step In 3, α ∈ (0.1,1), θc∈[0.3,0.9]。
A kind of map locating keyword the most according to claim 1 is to the method for relevant issues, it is characterised in that: in step In 4, d ∈ (0.1,1).
A kind of map locating keyword the most according to claim 1 is to the method for relevant issues, it is characterised in that: in step In 6, θs∈[0.2,0.9]。
CN201610631777.7A 2016-08-04 2016-08-04 A kind of method of map locating keyword to relevant issues Active CN106294656B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610631777.7A CN106294656B (en) 2016-08-04 2016-08-04 A kind of method of map locating keyword to relevant issues

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610631777.7A CN106294656B (en) 2016-08-04 2016-08-04 A kind of method of map locating keyword to relevant issues

Publications (2)

Publication Number Publication Date
CN106294656A true CN106294656A (en) 2017-01-04
CN106294656B CN106294656B (en) 2019-03-19

Family

ID=57664941

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610631777.7A Active CN106294656B (en) 2016-08-04 2016-08-04 A kind of method of map locating keyword to relevant issues

Country Status (1)

Country Link
CN (1) CN106294656B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103885966A (en) * 2012-12-20 2014-06-25 北京京东尚科信息技术有限公司 Question and answer interaction method and system of electronic commerce transaction platform
CN104899273A (en) * 2015-05-27 2015-09-09 东南大学 Personalized webpage recommendation method based on topic and relative entropy
CN105447080A (en) * 2015-11-05 2016-03-30 华建宇通科技(北京)有限责任公司 Query completion method in community ask-answer search
CN105786794A (en) * 2016-02-05 2016-07-20 青岛理工大学 Question-answer pair search method and community question-answer search system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103885966A (en) * 2012-12-20 2014-06-25 北京京东尚科信息技术有限公司 Question and answer interaction method and system of electronic commerce transaction platform
CN104899273A (en) * 2015-05-27 2015-09-09 东南大学 Personalized webpage recommendation method based on topic and relative entropy
CN105447080A (en) * 2015-11-05 2016-03-30 华建宇通科技(北京)有限责任公司 Query completion method in community ask-answer search
CN105786794A (en) * 2016-02-05 2016-07-20 青岛理工大学 Question-answer pair search method and community question-answer search system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
徐振: "面向问答社区的问题类型敏感的答案摘要算法研究", 《中国优秀硕士论文全文数据库 信息科技辑》 *
杨海天: "社区问答系统中问句检索技术的研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Also Published As

Publication number Publication date
CN106294656B (en) 2019-03-19

Similar Documents

Publication Publication Date Title
US8156120B2 (en) Information retrieval using user-generated metadata
CN104935963B (en) A kind of video recommendation method based on timing driving
Eirinaki et al. Web path recommendations based on page ranking and markov models
US9171078B2 (en) Automatic recommendation of vertical search engines
CA2767838C (en) Progressive filtering of search results
CN107688606A (en) The acquisition methods and device of a kind of recommendation information, electronic equipment
US20100318537A1 (en) Providing knowledge content to users
Popescu et al. Mining social media to create personalized recommendations for tourist visits
US10019513B1 (en) Weighted answer terms for scoring answer passages
CN110443571A (en) The method, device and equipment of knowledge based map progress resume assessment
Priyogi Preference elicitation strategy for conversational recommender system
CN101111837A (en) Search processing with automatic categorization of queries
Kantor et al. Capturing human intelligence in the net
US20110238491A1 (en) Suggesting keyword expansions for advertisement selection
CN105761154B (en) A kind of socialization recommended method and device
CN110147494A (en) Information search method, device, storage medium and electronic equipment
US20160098397A1 (en) Dynamic summary generator
CN103294670B (en) A kind of searching method and system based on vocabulary
CN111506596A (en) Information retrieval method, information retrieval device, computer equipment and storage medium
CN103842997A (en) Searching for and creating an adaptive content
Yu et al. “An Image is Worth a Thousand Features”: Scalable Product Representations for In-Session Type-Ahead Personalization
KR101955920B1 (en) Search method and apparatus using property language
CN105978729A (en) System and method for pushing mobile phone information based on user surfing log and position
Jamjuntra et al. Social network user identification
CN112417174A (en) Data processing method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20190617

Address after: 212000 298 South Xu Road, hi tech Zone, Zhenjiang, Jiangsu

Patentee after: Zhenjiang Jiaoze Information Technology Co., Ltd.

Address before: 430072 Wuhan University, Luojia mountain, Wuchang District, Wuhan, Hubei

Patentee before: Wuhan University

TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20211015

Address after: 210042 rooms 601 and 605, 6 / F, building 5, Juhui Park, phase II, Xuzhuang high tech Zone, 108 Xuanwu Avenue, Xuanwu District, Nanjing, Jiangsu Province

Patentee after: Nanjing Yuanfeng Intelligent Technology Co.,Ltd.

Address before: 212000 298 South Xu Road, hi tech Zone, Zhenjiang, Jiangsu

Patentee before: ZHENJIANG JIAOZE INFORMATION TECHNOLOGY Co.,Ltd.

TR01 Transfer of patent right