CN103309900A - Personalized multidimensional document sequencing method and system - Google Patents

Personalized multidimensional document sequencing method and system Download PDF

Info

Publication number
CN103309900A
CN103309900A CN201210069568XA CN201210069568A CN103309900A CN 103309900 A CN103309900 A CN 103309900A CN 201210069568X A CN201210069568X A CN 201210069568XA CN 201210069568 A CN201210069568 A CN 201210069568A CN 103309900 A CN103309900 A CN 103309900A
Authority
CN
China
Prior art keywords
document
user
vector
parameter vector
ordering
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201210069568XA
Other languages
Chinese (zh)
Inventor
祁勇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201210069568XA priority Critical patent/CN103309900A/en
Publication of CN103309900A publication Critical patent/CN103309900A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a personalized multidimensional document sequencing method and a personalized multidimensional document sequencing system. The method comprises the following steps of iteratively calculating a sequencing value of each document under each characteristic according to a link relationship between the documents and parameter vectors of the documents; acquiring a group of documents according to a searching condition submitted by a query user, and calculating a personalized sequencing value of each document in the group according to a sequencing vector of the document in the group and a query vector set by the query user; transmitting the documents of the group to the query user according to the magnitudes of the personalized sequencing values, wherein the parameter vectors of the documents are updated according to a parameter vector updating algorithm. The invention also discloses the personalized multidimensional document sequencing system. According to the method, the precision ratio of a search engine and the anti-cheating capability of a webpage ranking algorithm can be improved.

Description

A kind of document ordering method and system of personalized various dimensions
Technical field
The present invention relates to internet arena, relate in particular to a kind of document ordering method and system of personalized various dimensions.
Background technology
Search engine is to utilize information retrieval technique to carry out large-scale collecting web page, index, ordering, and the application program of webpage being presented to inquiring user according to ranking results.Along with the rapid growth of network information, search engine has more and more become the indispensable instrument that people obtain the network information.
The core technology of search engine is sort algorithm.The most effectively sort algorithm is super link analysis algorithm, for example the PageRank algorithm of Google.Although existing search engine has commercially been obtained huge success, they also have very large room for improvement.It is the web page interlinkage relation that is made up according to its subjective desire by the Web page maker on the link analysis technological essence, although it has fully reflected Web page maker's individual preference and the understanding that web page interlinkage is concerned, it does not but reflect the user-user's of search engine individual preference.Because it is normally different to the Assessment of Important of same webpage with the user with different hobbies to be engaged in different industries, and the super link analysis algorithm such as PageRank can't carry out this differentiation, they can only provide unique page rank to each webpage, therefore, the design of super link analysis algorithm is defective.Feasible solution is that the individualized feature in conjunction with the user improves Search Results, so that the rank of each webpage not only relies on the linking relationship between the webpage, and relies on the user's who submits search inquiry to individualized feature.The analysis showed that, by the individualized feature of user and document, can improve the precision ratio of search engine, reduced the user to the scanning of invalid information and browse.
But there is a following difficult problem in the individualized feature that obtains on the internet user and webpage.The firstth, the automatic acquisition problem of customized information.It is estimated has nearly 1,000,000,000,000 webpages and 2,000,000,000 users on the present internet, safeguard that webpage and user's individualized feature is unpractical by hand.How the individualized feature of automatic acquisition user and webpage is a huge challenge to personalized search.The secondth, the replacement problem of customized information.As time goes on, the personal information such as user's hobby, work place, the industry of being engaged in and education degree can change, but it is difficult requiring all users to upgrade in real time its customized information.The 3rd is the semantic difference problem of customized information.Fill in the website such as the user who has and to like Mozart, filling in of having to like classical music, possible these two users' hobby is identical, but since the difference of literal expression therefore be difficult to they are effectively sorted out.The 4th is the problem of completeness of customized information.The personal information that the user fills in the website is usually simpler.Normally liking music in the description on hobby one hurdle, play baseball or reading etc. such as the user, is difficult and require the user to describe out meticulously all sidedly its interested field.
In sum, how to obtain webpage and user's individualized feature, and come the optimizing webpage rank algorithm according to webpage and user's individualized feature, expecting the webpage sought so that search engine can filter and filter out the user more effectively in the webpage of magnanimity, is a problem demanding prompt solution.
Summary of the invention
Problem in view of above-mentioned prior art existence, the object of the present invention is to provide a kind of document ordering method and system of personalized various dimensions, by automatic acquisition document and user's individualized feature, improve the searching order algorithm, improve the precision ratio of search engine.
According to above-described purpose, the present invention proposes a kind of document ordering method of personalized various dimensions, it is characterized in that, domain features collection K={1 is set in server, 2 ..., L}, and carry out following steps:
Obtain a plurality of documents, form document sets D={1,2 ..., M}; At least contain two document subsets, wherein document sets among the described document sets D
Figure BSA00000685602000021
In each document contain at least one link and point to document among the described document sets D, document sets
Figure BSA00000685602000022
In each document pointed to by the contained link of a document among the described document sets S at least; And S ∪ E=D, S ∩ E ≠ Φ;
Each document among the described document sets D is provided with ordering vector and parameter vector; If the ordering vector K of document m (m ∈ D) p(m)=[PR (m, 1), PR (m, 2) ..., PR (m, k) ..., PR (m, L)], wherein said PR (m, k) is illustrated in the lower ranking value of described document m in described document sets D of feature k (k ∈ K); If the parameter vector K of described document m d(m)=(dw M1, dw M2..., dw Mk..., dw ML), wherein said dw MkThe degree of correlation that represents described document m and feature k (k ∈ K); Described K d(m) upgraded by the parameter vector update algorithm;
Upgrade the ordering vector of each document among the described document sets D; The ordering vector update algorithm is as follows: the ranking value of any one the document m among the described document sets D under feature k (k ∈ K), be linked to described document m among the described document sets S each document under described feature k ranking value and described document sets S in be linked to the function of the degree of correlation of each document of described document m and described feature k;
According to described ordering vector, the document among the described document sets D is processed.
Compared with prior art, the inventive method is by the signal of user's access document (such as webpage), automatically upgrade document and user's parameter vector, and according to document and user's parameter vector Algorithms for Page Ranking is optimized, improved the precision ratio of search engine.In addition, owing to used document and user's individualized feature technology, so this patent method has improved the anti-cheating ability on page rank.
Description of drawings
Fig. 1 is the parameter vector method for expressing of each document in document sets D;
Fig. 2 is for collecting the parameter vector method for expressing of each user among the U the user;
Fig. 3 is the ordering vector method for expressing of each document in document sets D;
Fig. 4 is document and user's parameter vector update algorithm process flow diagram;
Fig. 5 is a kind of process flow diagram of document ordering method of personalized various dimensions;
Fig. 6 is a kind of structural drawing of document ordering system of personalized various dimensions.
Embodiment
By reference to the accompanying drawings the inventive method is described in further detail.
The specific embodiments explanation of this patent method comprises following components.At first, the method for numbering serial of specification documents, user and domain features, document and user's parameter vector method for expressing and the ordering vector method for expressing of document; Then, Algorithms for Page Ranking based on parameter vector is described; Again, specification documents and user's parameter vector update algorithm; At last, introduce a kind of document ordering method and system of personalized various dimensions.
The method for numbering serial of specification documents, user and domain features at first.Described document is often referred to the Web webpage, and it obtains on the internet by the reptile program.The form of expression of described document comprises text, video, music and picture etc.Be numbered by the URL address to the Web webpage that obtains, we have just obtained document sets D={1, and 2 ..., M}, wherein M is the document number.Equally, the user on the internet also has unique identification, such as user account number, phone number, Cookie identification code, IP address, Email address and instant communication number etc.We pool together having one group of user of uniquely identified on the internet, and form the user and collect U={1,2 ..., N}, wherein N is user's number.
Document and user's common area feature is also carried out Unified number, form feature set K={1,2 ..., L}, wherein L is Characteristic Number.Domain features represents the attribute of user and document, for example news, finance and economics, science, music, military affairs and physical culture etc.Described feature, the attribute that can describe document can be described again user's attribute.A document has " science " feature, and the specification documents content is relevant with " science "; And a user has " science " feature, illustrates that the user likes the content relevant with " science ".
The below introduces the method for expressing of document and user's parameter vector.It is similar to the vectorial expression method of the vector space model that Gerard Salton proposes, namely with the base unit of characteristic item as document or user's feature.The parameter vector that represents document with the set of the degree of correlation of document and each feature; The parameter vector that represents the user with the set of the degree of correlation of user and each feature.
Fig. 1 is the parameter vector method for expressing of each document in document sets D.The parameter vector of any one document i in document sets D (i ∈ D) is set to K d(i)=(dw I1, dw I2..., dw Ik..., dw IL), wherein said dw IkThe degree of correlation that represents described document i and feature k (k ∈ K), dw Ik∈ [a, b], a and b are nonnegative constant.In addition, the degree of correlation of k feature of each document among the described document sets D is pooled together, form a vector, be called k the document column vector (dw of document sets D 1k, dw 2k..., dw Ik..., dw Mk).
Fig. 2 is for collecting the parameter vector method for expressing of each user among the U the user.The parameter vector that collects any one user j among the U (j ∈ U) the user is set to K u(j)=(uw J1, uw J2..., uw Jk..., uw JL), wherein said uw JkThe degree of correlation that represents described user j and feature k (k ∈ K), uw Jk∈ [a, b], a and b are nonnegative constant.In addition, the degree of correlation that described user is collected k the feature of each user among the U pools together, and forms a vector, is called k user's column vector (uw that the user collects U 1k, uw 2k..., uw Jk..., uw Nk).
The described degree of correlation is a real number value, the close relation degree of each feature among its expression document or user and the feature set K.If document or user are related with musical features more related a little less with sports feature, we just say that the degree of correlation of the document or user and musical features is high, and are low with the degree of correlation of sports feature.In addition, have correlativity between some feature, therefore for example physics and mechanics can reduce by reducing correlativity between the feature dimension of feature set K when feature selecting, reduce the demand to the server stores space, improves efficiency of algorithm.Some feature needn't directly be listed in the feature set, because the relatedness computation that the degree of correlation of these features can be by one or several further feature among the feature set K out.
The following describes the Algorithms for Page Ranking based on parameter vector.The PageRank algorithm of standard can represent with following formula.
PR ( m ) = 1 - d M + d Σ i ∈ T PR ( i ) C ( i ) - - - ( 1 )
Wherein, set
Figure BSA00000685602000052
For the chain of webpage m (m ∈ D) enters the webpage set, C (i) goes out webpage quantity for the chain of webpage i (i ∈ T); D is damping factor, and it represents that the user visits the probability of described webpage m by the link of other webpage, and 1-d represents that the user directly accesses the probability of described webpage m by keying in the URL address, and d ∈ (0,1); The webpage sorting value PageRank of PR (m) the described webpage m of expression in described document sets D, M represents the webpage quantity among the document sets D.In addition, the initial ranking value of each webpage is made as 1/M.
The shortcoming of the PageRank algorithm of standard is that on the internet each webpage only has a unique webpage sorting value.And in fact, be engaged in different industries different often to the Assessment of Important of same webpage with the user with different hobbies.Therefore, need to improve existing sort algorithm.
Fig. 3 is the ordering vector method for expressing of each document in document sets D.In order to realize personalized search, we expand the conventional P ageRank value of webpage, and the one dimension ranking value PR (i) with any one the document i among the described document sets D expands to the ordering vector based on the multidimensional of domain features.If the ordering vector K of document i (i ∈ D) p(i)=[PR (i, 1), PR (i, 2) ..., PR (i, k) ..., PR (i, L)], the ranking value of wherein said PR (i, k) the described document i of expression under feature k (k ∈ K).Therefore, described formula (1) can be transformed into following ordering vector update algorithm:
PR ( m , k ) = 1 - d M + d Σ i ∈ T PR ( i , k ) · dw ik C ( i ) - - - ( 2 )
Wherein, set
Figure BSA00000685602000054
For the chain of webpage m (m ∈ D) enters the webpage set; D is damping factor, and it represents that the user visits the probability of described webpage m by the link of other webpages, and 1-d represents that the user directly accesses the probability of described webpage m by keying in the URL address, and d ∈ (0,1); The webpage sorting value of PR (m, k) the described webpage m of expression under feature k (k ∈ K).C (i) is a kind of in following two kinds of define methods: the first, C (i) is that the chain of webpage i (i ∈ T) goes out webpage quantity; The second, C (i) is one and sets constant, for example to each i ∈ T, C (i)=1 is set.In addition, for each webpage m ∈ D and each feature k ∈ K, establish the initial ranking value PR (m, k) of webpage=1/M.
Using described formula (2) before, need to make some pre-service to document sets D, in order to reject the webpage of grade sinking (rank sink) and grade leakage (rank leak).So-called grade sinking refers to that one group of webpage that interlinks does not all point to the link of any webpage outside this group; So-called grade is revealed and is referred to the webpage link of not going out.In described document sets D, solve grade sinking problem by adding the back link that a sensing chain enters webpage; Eliminate the impact that the grade leakage brings by rejecting the webpage that produces the grade leakage.After finishing iterative algorithm, more disallowable webpage is recovered, give their ranking value by described formula (2).
Described formula (2) can calculate under each feature k ∈ K the ranking value PR (m, k) of any one the document m among the described document sets D by iterative manner.Stopping criterion for iteration is that the ordering vector of document satisfies ∑ in twice iteration I ∈ D|| K p N+1(i)-K p n(i) || reach the setting constant less than default penalties or iterations, wherein n is iterative steps.In an application example of described formula (2), after per step iteration, need to Sorted list vector [PR (and 1, k), PR (2, k) ..., PR (M, k)] carry out normalized (normalization), to guarantee the convergence of described formula (2).
In described formula (2), except needs were known the web page interlinkage topological structure in advance, the step of a key obtained the parameter vector of each document (being webpage) exactly.
The following describes the parameter vector K of document i d(i)=(dw I1, dw I2..., dw Ik..., dw IL) update method.At first, by the parameter vector initial value method to set up of three example declaratives documents and certain customers, then, specification documents and user's parameter vector update algorithm.If document or user's parameter vector is not set up initial value, the default null vector that is made as of its parameter vector initial value.
Example 1 is the method that the parameter vector initial value of document i (i ∈ D) manually is set.Feature sum L=5 for example is set, and feature set K=(science, finance and economics, education, music, physical culture) arranges K d(i)=(dw I1, dw I2, dw I3, dw I4, dw I5)=(0,0.00032,0,0.00059,0).Be document i with the degree of correlation of " finance and economics " feature be 0.00032, with the degree of correlation of " music " feature be 0.00059, with the degree of correlation of further feature be zero.Equally, use similar approach that customer parameter vector K can be set u(j) initial value.
Example 2 is methods that the parameter vector initial value of user j (j ∈ U) is set.One group of collection of document by described user j submission
Figure BSA00000685602000071
The parameter vector of described document r (r ∈ H) is K d(r)=(dw R1, dw R2..., dw RL), therefore, for each k ∈ K, uw is set Jk=(σ 1/ s) ∑ (r ∈ H)[dw Rk/ (∑ (k ∈ K)Dw Rk)], wherein s is the element number of described set H, σ 1For setting constant.Use similarity method, described user j also can collect described user selects one group of user to calculate the parameter vector initial value of described user j among the U.
Example 3 is a kind of methods that the parameter vector initial value of document is set.Catalogue is a kind of special document, and corresponding document code is arranged.We suppose that the document under the same directory has some identical feature, and the document under for example physical culture catalogue is all relevant with physical culture.If document i (i ∈ D) is a document under the catalogue n (n ∈ D), then the parameter vector initial value of described document i is decided by the parameter vector of described catalogue n, namely for each k ∈ K, dw is set Ik2Dw Nk, σ wherein 2Be constant.
Fig. 4 is document and user's parameter vector update algorithm process flow diagram.Described parameter vector update algorithm comprises following concrete steps:
S11. collect certain customers' parameters vector initial value among the U for partial document among the document sets D and user;
S12. receive the signal that any one user j (j ∈ U) accesses any one document i (i ∈ D), described signal comprises the sign of described document i and described user j at least;
S13. according to the sign of described document i, read the parameter vector K of described document i d(i)=(dw I1, dw I2..., dw Ik..., dw IL), wherein said dw IkThe degree of correlation that represents described document i and feature k (k ∈ K);
S14. according to the sign of described user j, read the parameter vector K of described user j u(j)=(uw J1, uw J2..., uw Jk..., uw JL), wherein said uw JkThe degree of correlation that represents described user j and feature k (k ∈ K);
S15. upgrade the parameter vector of described document i and described user j:
K d *(i)=function1[K d(i),K u(j)]
K u *(j)=function2[K d(i),K u(j)]
Wherein, described K d(i) and described K d *(i) represent respectively to upgrade parameter vector front and the rear described document i of renewal, described K u(j) and described K u *(j) represent respectively to upgrade parameter vector front and the rear described user j of renewal, described K d *(i)=(dw I1 *, dw I2, *..., dw Ik *..., dw IL *), described K u *(j)=(uw J1 *, uw J2 *..., uw Jk *..., uw JL *); Described function1 represents K d *(i) be K d(i) and K u(j) function, described function2 represents K u *(j) be K d(i) and K u(j) function.
In the described method of Fig. 4, execute after the described step S15, also comprise and upgrade described K d(i) and described K u(j) step namely arranges K d(i)=K d *(i) and K u(j)=K u *(j).
In the described method of Fig. 4, described signal is a kind of in the following several types: T=1 represents that described user j clicks the link of described document i, T=2 represents that described user j keys in the address of described document i, T=3 represents that described user j arranges label to described document i, T=4 represents that the described document i of described user j is set to bookmark, T=5 represent the described document i of described user j be set to like (as the Like of the types of facial makeup in Beijing operas and Google+1), T=6 represents that described user j comments on described document i, and T=7 represents that described user j transmits described document i.In an application example of the described method of Fig. 4, described signal gathers from the Web daily record.Described Web daily record comprises server log (server log), error log (error log) and Cookie daily record etc.
In an application example of the described method of Fig. 4, described method satisfies K d *(i) 〉=K d(i) and K u *(j) 〉=K u(j).Inequality K wherein d *(i) 〉=K d(i) implication is for any k ∈ K, and dw is arranged Ik *〉=dw IkInequality K u *(j) 〉=K u(j) implication is for any k ∈ K, and uw is arranged Jk *〉=uw Jk
In an application example of the described method of Fig. 4, described K d *(i) component of a vector dw Ik *Described uw JkIncreasing function, be ∑ (k ∈ K)Uw JkSubtraction function; Described K u *(j) component of a vector uw Jk *Described dw IkIncreasing function, be ∑ (k ∈ K)Dw IkSubtraction function.
In an application example of the described method of Fig. 4, need to be for each feature k ∈ K, respectively to document column vector (dw 1k, dw 2k..., dw Ik..., dw Mk) and user's column vector (uw 1k, uw 2k..., uw Jk..., uw Nk) carry out normalized (normalization).The executive condition of described normalized comprises a kind of in the following condition at least: (1). carry out described parameter vector update algorithm and reach set point number; (2). start according to the default time.The concrete application example of described method for normalizing is as follows:
Example 1: to document column vector (dw 1k, dw 2k..., dw Ik..., dw Nk) carry out the method for normalized, comprise for each i ∈ D dw is set Ik=dw Ik/ (∑ (t ∈ D)Dw Tk); To user's column vector (uw 1k, uw 2k..., uw Jk..., uw Nk) carry out the method for normalized, comprise for each j ∈ U uw is set Jk=uw Jk/ (∑ (t ∈ U)Uw Tk).
Example 2: at document column vector (dw 1k, dw 2k..., dw Ik..., dw Mk) in randomly draw R data and they sorted, obtain set { s 1, s 2..., s R, and s 1<s 2<...<s RFor each i ∈ D, if s m≤ dw Ik≤ s M+1, dw then is set Ik=g (s m), g (s wherein m) be increasing function, g (s m) ∈ [a, b], a and b are nonnegative constant.Same method can be to user's column vector (uw 1k, uw 2k..., uw Jk..., uw Nk) carry out normalized.
Application example 1.
This is an application example of the described parameter vector update algorithm of Fig. 4, and it upgrades the parameter vector K of described document i (i ∈ D) with following specific algorithm d *(i) and the parameter vector K of described user j (j ∈ U) u *(j):
Dw Ik *=dw Ik+ λ 1(i, j, T) f 1[K u(j)] (for each
Figure BSA00000685602000091
)
Uw Jk *=uw Jk+ λ 2(i, j, T) f 2[K d(i)] (for each )
Wherein, described λ 1(i, j, T) is under the type T of described signal, and described user j is to the influence coefficient of described document i; Described λ 2(i, j, T) is under the type T of described signal, and described document i is to the influence coefficient of described user j.Described DK iThe parameter vector K by described document i d(i)=(dw I1, dw I2..., dw Ik..., dw IL) in the P of numerical value maximum iThe set that the corresponding feature of individual component forms, described UK jThe parameter vector K by described user j u(j)=(uw J1, uw J2..., uw Jk..., uw JL) in the Q of numerical value maximum jThe set that the corresponding feature of individual component forms, P iAnd Q jBe setup parameter.I=30 for example, P 30=3, DK 30={ literature, computing machine, biology }; J=265, Q 265=2, UK 265={ science, history }.Described dw IkWith described dw Ik *Represent respectively to upgrade parameter vector K front and the rear described document i of renewal d(i) k component, described uw JkWith described uw Jk *Represent respectively to upgrade parameter vector K front and the rear described user j of renewal u(j) k component.Described specific algorithm carries out following assignment after also being included in and carrying out described specific algorithm, namely for each k ∈ DK i, uw is set Jk=uw Jk *For each k ∈ UK j, dw is set Ik=dw Ik *
In described application example 1, described specific algorithm can be further defined to: for each k ∈ DK i, satisfy uw Jk *〉=uw JkFor each k ∈ UK j, satisfy dw Ik *〉=dw Ik
In described application example 1, described f 1[K u(j)] be the parameter vector K of described user j u(j) function, described f 2[K d(i)] be the parameter vector K of described document i d(i) function.Described f 1[K uAnd described f (j)] 2[K d(i)] concrete methods of realizing comprises:
Example 1: described f 1[K u(j)] be described uw JkIncreasing function, be ∑ (k ∈ K)Uw JkSubtraction function; Described f 2[K d(i)] be described dw IkIncreasing function, be ∑ (k ∈ K)Dw IkSubtraction function.
Example 2:f 1[K u(j)]=σ 3Uw Jk/ (∑ (k ∈ K)Uw Jk), f 2[K d(i)]=σ 4Dw Ik/ (∑ (k ∈ K)Dw Ik), σ wherein 3And σ 4For setting constant.
Example 3:f 1[K u(j)]=σ 5Uw Jk, f 2[K d(i)]=σ 6Dw Ik, σ wherein 5And σ 6For setting constant.
Example 4:f 1[[K u(j)]=σ 7{ 1/[1+exp (uw Jk)], f 2[K d(i)]=σ 8{ 1/[1+exp (dw Ik)], σ wherein 7And σ 8For setting constant.
In described application example 1, described λ 1(i, j, T) and described λ 2The concrete methods of realizing of (i, j, T) comprising:
Example 1: described λ 1(i, j, T) and described λ 2(i, j, T) is respectively the function of mathematical distance sim (i, j) between the parameter vector of described document i and described user j.λ for example 1(i, j, T)=c 1Sim (i, j), λ 2(i, j, T)=c 2Sim (i, j), wherein sim (i, j)=|| K d(i), K u(j) ||=[∑ k(dw IkUw Jk)]/{ [∑ k(dw Ik) 2] 1/2[∑ k(uw Jk) 2] 1/2, c 1And c 2For setting constant.The implication of this example is that the mathematical distance between document and user's the parameter vector is larger, and their similarity degree is higher, so they the scale-up factor of " ballot " is larger each other.
Example 2: described λ 1(i, j, T)=u 1(j) d 1(i), described λ 2(i, j, T)=u 2(j) d 2(i), u wherein 1(j) whether the parameter vector of expression user j can be used for upgrading the parameter vector of document sets D document, d 1(i) whether the parameter vector of expression document i can be upgraded u by the parameter vector that the user collect user among the U 2(j) whether the parameter vector of expression user j can be upgraded d by the parameter vector of document among the document sets D 2(i) whether the parameter vector of expression document i can be used for upgrading the parameter vector that the user collects U user.u 1(j), u 2(j), d 1(i) and d 2(i) be setup parameter, their value is 0 or 1.1 representative is that 0 representative is no.The implication of this example is for preventing malicious attack, and some is through the document (or user) of reliability certification, and its parameter vector can not be upgraded by other user's (or document) parameter vector; And some document (or user) is not owing to passing through reliability certification, and its parameter vector can not upgrade other user's (or document) parameter vector.
Example 3: described λ 1(i, j, T)=s 1(T), described λ 2(i, j, T)=s 2(T).Wherein said T is the type of user's access document signal, described s 1(T) and described s 2(T) be the function of described T.
Example 4: the Combination of Methods with above-mentioned example 1~3 generates described λ 1(i, j, T) and λ 2(i, j, T).For example
λ 1(i,j,T)={c 1·sim(i,j)}·{u 1(j)·d 1(i)}·s 1(T)
λ 2(i,j,T)={c 2·sim(i,j)}·{u 2(j)·d 2(i)}·s 2(T)。
Example 5: described λ 1(i, j, T) and described λ 2(i, j, T) is for setting constant.
In described application example 1, need to be for each feature k ∈ K, respectively to document column vector (dw 1k, dw 2k..., dw Ik..., dw Mk) and user's column vector (uw 1k, uw 2k..., uw Jk..., uw Nk) carry out normalized.The executive condition of normalized comprises a kind of in the following condition at least: (1). carry out described concrete parameter vector update algorithm and reach set point number, (2). start according to the default time.
Application example 2
This is a concrete applicating example of described application example 1.For convenience of illustration,let us suppose that two users and three documents on the internet, and each user and each document all have two features, and namely the user collects U={1,2}, document sets D={1,2,3}, feature set K={1,2}.User 1 and user's 2 parameter vector is respectively (uw 11, uw 12) and (uw 21, uw 22), the parameter vector of document 1, document 2 and document 3 is respectively (dw 11, dw 12), (dw 21, dw 22) and (dw 31, dw 32).Uw wherein 21The degree of correlation that represents described user 2 and feature 1; Dw 32Represent the degree of correlation of described document 3 and feature 2, the subscript define method of other degree of correlation is similar with it.
Supposed to receive the signal of the described document 3 of described user's 2 access, and signal type T=1, then according to following parameter vector update algorithm, upgrade described document 3 and described user's 2 parameter vector:
dw 31 *=dw 311(2,3,1){uw 21/(uw 21+uw 22)}
dw 32 *=dw 321(2,3,1){uw 22/(uw 21+uw 22)}
uw 21 *=uw 212(2,3,1){dw 31/(dw 31+dw 32)}
uw 22 *=uw 222(2,3,1){dw 32/(dw 31+dw 32)}
Wherein, λ 1(2,3,1) is illustrated under the signal type T=1, the influence coefficient of 2 pairs of described documents 3 of described user; λ 2(2,3,1) is illustrated under the signal type T=1,3 couples of described users' 2 of described document influence coefficient.λ for example 1(2,3,1)=c 1Sim (2,3) s 1(1); λ 2(2,3,1)=c 2Sim (2,3) s 2(1).S wherein 1(1)=s 2(1)=1.5; Mathematical distance between described sim (2,3) the described document 3 of expression and described user's 2 the parameter vector, i.e. sim (2,3)=(uw 21Dw 31+ uw 22Dw 32)/{ [(uw 21) 2+ (uw 22) 2] 1/2[(dw 31) 2+ (dw 32) 2] 1/2; c 1And c 2For setting constant.
After executing above-mentioned algorithm, upgrade described document 3 and described user's 2 parameter vector, dw namely is set 31=dw 31 *, dw 32=dw 32 *, uw 21=uw 21 *And uw 22=uw 22 *
After executing the parameter vector update algorithm, to user's column vector (uw 11, uw 21) and (uw 12, uw 22) carry out normalized, and to document column vector (dw 11, dw 21, dw 31) and (dw 12, dw 22, dw 32) carry out normalized:
Uw 11=uw 11/ (uw 11+ uw 21), uw 21=uw 21/ (uw 11+ uw 21); (to feature k=1)
Uw 12=uw 12/ (uw 12+ uw 22), uw 22=uw 22/ (uw 12+ uw 22); (to feature k=2)
dw 11=dw 11/(dw 11+dw 21+dw 31),dw 21=dw 21/(dw 11+dw 21+dw 31),
Dw 31=dw 31/ (dw 11+ dw 21+ dw 31); (to feature k=1)
dw 12=dw 12/(dw 12+dw 22+dw 32),dw 22=dw 22/(dw 12+dw 22+dw 32),
Dw 32=dw 32/ (dw 12+ dw 22+ dw 32); (to feature k=2).
Fig. 5 is a kind of document ordering method flow diagram of personalized various dimensions.This is an integrated application example of Extraordinary document ordering method, comprises the steps:
S10. use described parameter vector update algorithm for many times, upgrade the parameter vector of document among the described document sets D and the parameter vector that described user collects user among the U; Concrete methods of realizing comprises the step S11 to S15 among Fig. 4;
S20., the ordering vector initial value of each document among the described document sets D is set;
S30. use described ordering vector update algorithm, iteration is updated under each feature the ranking value of each document among the described document sets D, namely upgrades the ordering vector of each document among the described document sets D;
S40. user n (n ∈ D) arranges query vector, and submits search condition to; In described search condition, extract search key; Described search condition comprises all information that the user submits in search dialogue;
S50. one group of document Q that retrieval and described search key mate in described document sets D;
S60. according to the ordering vector of each document among described query vector and the described one group of document Q, calculate the personalized ordering value UR (i, n) of each document among described one group of document Q; Described UR (i, n) expression is based on the personalized ordering value of the described document i (i ∈ Q) of the query vector of described user n;
S70. according to described personalized ordering value UR (i, n), described one group of document Q is sorted, and according to ranking results described one group of document Q is sent to described user n.
In the described method of Fig. 5, establish the query vector K of user n s(n)=(sw N1, sw N2..., sw Nk..., sw NL), sw wherein NkExpression is queried the degree of correlation of document and feature k (k ∈ K), sw Nk∈ [a, b], a and b are default nonnegative constant.This query vector K s(n) following several method to set up is arranged.The firstth, in feature set K, select feature by described user n, and it is arranged the feature degree of correlation, sw for example is set N2=0.00023, sw N6=0.00061, other component of a vector is 0.The secondth, give described query vector K the parameter vector assignment of described user n s(n).The 3rd is the sign S that described user n submits one group of user or document to n=..., r ... }.When
Figure BSA00000685602000131
The time, described user r (r ∈ S n) parameter vector be (uw R1, uw R2..., uw RL), therefore the query vector of described user n is made as: for each feature k ∈ K, sw Nk=(σ 9/ s) ∑ (r ∈ Sn)[uw Rk/ (∑ (k ∈ K)Uw Rk)]; When
Figure BSA00000685602000132
The time, described document r (r ∈ S n) parameter vector be (dw R1, dw R2..., dw RL), therefore the query vector of described user n is set to: for each feature k ∈ K, sw Nk=(σ 10/ s) ∑ (r ∈ Sn)[dw Rk/ (∑ (k ∈ K)Dw Rk)]; When
Figure BSA00000685602000133
The time, the query vector of described user n is made as: for each feature k ∈ K, sw Nk=(σ 9/ s) ∑ (r ∈ Sn ∩ U)[uw Rk/ (∑ (k ∈ K)Uw Rk)]+(σ 10/ s) ∑ (r ∈ Sn ∩ D)[dw Rk/ (∑ (k ∈ K)Dw Rk)], wherein s is described S set nElement number, σ 9And σ 10For setting constant.
In the described method of Fig. 5, described personalized ordering value UR (i, n) has two kinds of computing method.
First method is the query vector K according to described user n s(n) and the ordering vector of described document i (i ∈ Q), calculate the personalized ordering value UR (i, n) of described document i, namely
UR ( i , n ) = Σ k = 1 L PR ( i , k ) · sw nk
Wherein, described PR (i, k) is illustrated in the lower ranking value of described document i in described document sets D of feature k (k ∈ K), sw NkExpression is queried the degree of correlation of document and feature k (k ∈ K).
Second method is that described UR (i, n) is defined as mathematical distance between the parameter vector of the query vector of described user n and the document i among described one group of document Q.If the query vector of described user n is K s(n)=(sw N1, sw N2..., sw NL), the parameter vector K of described document i (i ∈ Q) d(i)=(dw I1, dw I2..., dw IL), then:
UR(i,n)=||K s(n),K d(i)||=[∑ k(sw nk·dw ik)]/{[∑ k(sw nk) 2] 1/2·[∑ k(dw ik) 2] 1/2}。
Fig. 6 is a kind of structural drawing of document ordering system of personalized various dimensions.Described document ordering system 200 comprises following functional module:
The document user characteristics arranges module 211: obtain a plurality of Web web document by the reptile program, form document sets D={1,2, ..., M}, wherein each document is provided with ordering vector and parameter vector, and described ordering vector and described parameter vector are stored in the document database 220; Obtain a plurality of users on the internet, form the user and collect U={1,2 ..., N}, wherein each user is provided with parameter vector, and it is stored in the customer data base 230; Domain features collection K={1 is set, 2 ..., L}, and it is stored in the property data base 240;
Document and user's initial value arrange module 212: be the vector of the partial document parameters among described document sets D initial value, be each the document setup ordering vector initial value among the described document sets D, and described parameter vector initial value and described ordering vector initial value are stored in the document database 220; For the user collects certain customers parameters vector initial value among the U, and it is stored in the customer data base 230; Be not set up user and the document of parameter vector initial value, its default parameter vector initial value is made as null vector;
User's access document signal acquisition module 213: be used for gathering the signal that any one user j (j ∈ U) (102) accesses any one document i (i ∈ D), wherein comprise at least the sign of described user j (102) and described document i, described signal storage is in Web log database 250; The signal of the described document i of described user j (102) access, to be sent at least one application server, described application server comprises portal site server 301, social networking service device 302, search engine server 303 and instant communication server 304;
Document and customer parameter vector update module 214: according to the sign of the described document i that in described user's access document signal acquisition module 213, obtains and described user j (102), in described document database 220, read the parameter vector of described document i, in described customer data base 230, read the parameter vector of user j; Then by the parameter vector update algorithm, upgrade the parameter vector of described document i and described user j (102); According to the described document i after upgrading and the parameter vector of described user j (102), upgrade respectively described document database 220 and described customer data base 230 at last;
Document ordering vector update module 215: according to the parameter vector of the relation of the document links among the described document sets D and each document, iteration is updated under each feature the ranking value of each document among the described document sets D, then upgrades described document database 220 with the described ranking value after upgrading;
User's enquiry module 216: at first, the user arranges query vector and submits search condition to; Then, in described search condition, extract search key, and in described document sets D, retrieve the one group of document Q that mates with described search key; Afterwards, according to the ordering vector of each document among described query vector and the described one group of document Q, calculate the personalized ordering value of each document among described one group of document Q, and according to described personalized ordering value described one group of document Q is sorted; At last, according to described ranking results described one group of document Q is sent in batches the user of submit Query.
The above application example only is better application example of the present invention, is not to limit protection scope of the present invention.

Claims (12)

1. the document ordering method of personalized various dimensions is characterized in that, domain features collection K={1 is set in server, 2 ..., L}, and carry out following steps:
Obtain a plurality of documents, form document sets D={1,2 ..., M}; At least contain two document subsets, wherein document sets among the described document sets D
Figure FSA00000685601900011
In each document contain at least one link and point to document among the described document sets D, document sets
Figure FSA00000685601900012
In each document pointed to by the contained link of a document among the described document sets S at least; And S ∪ E=D, S ∩ E ≠ Φ;
Each document among the described document sets D is provided with ordering vector and parameter vector; If the ordering vector K of document m (m ∈ D) p(m)=[PR (m, 1), PR (m, 2) ..., PR (m, k) ..., PR (m, L)], wherein said PR (m, k) is illustrated in the lower ranking value of described document m in described document sets D of feature k (k ∈ K); If the parameter vector K of described document m d(m)=(dw M1, dw M2..., dw Mk..., dw ML), wherein said dw MkThe degree of correlation that represents described document m and feature k (k ∈ K); Described K d(m) upgraded by the parameter vector update algorithm;
Upgrade the ordering vector of each document among the described document sets D; The ordering vector update algorithm is as follows: the ranking value of any one the document m among the described document sets D under feature k (k ∈ K), be linked to described document m among the described document sets S each document under described feature k ranking value and described document sets S in be linked to the function of the degree of correlation of each document of described document m and described feature k;
According to described ordering vector, the document among the described document sets D is processed.
2. method according to claim 1 is characterized in that, repeatedly uses described ordering vector update algorithm, the ordering vector of each document among the described document sets D is carried out iteration upgrade.
3. method according to claim 1, it is characterized in that, described ordering vector update algorithm is further comprising the steps of, and namely the chain according to each document among the described document sets S goes out number, adjusts the ranking value of each document under each feature k (k ∈ K) among the described document sets S.
4. method according to claim 1 is characterized in that, described parameter vector update algorithm comprises the steps: at first, obtains a plurality of users, form the user and collect U={1, and 2 ..., N} establishes the parameter vector K of user j (j ∈ U) u(j)=(uw J1, uw J2..., uw Jk..., uw JL), wherein said uw JkThe degree of correlation that represents described user j and feature k (k ∈ K); Then, be the vector of the partial document parameters among described document sets D initial value, and be that described user collects the certain customers' parameters vector initial value among the U; At last, repeatedly carry out following steps:
Receive the signal that any one user j (j ∈ U) accesses any one document i (i ∈ D), described signal comprises the sign of described document i and described user j at least;
According to the sign of described document i, read the parameter vector K of described document i d(i);
According to the sign of described user j, read the parameter vector K of described user j u(j);
Upgrade the parameter vector of described document i and described user j:
K d *(i)=function1[K d(i),K u(j)]
K u *(j)=function2[K d(i),K u(j)]
Wherein said K d(i) and described K d *(i) represent respectively to upgrade parameter vector front and the rear described document i of renewal, described K u(j) and described K u *(j) represent respectively to upgrade parameter vector front and the rear described user j of renewal.
5. method according to claim 4 is characterized in that, in described parameter vector update algorithm, satisfies K d *(i) 〉=K d(i) and K u *(j) 〉=K u(j).
6. method according to claim 4 is characterized in that, establishes described K d *(i)=(dw I1 *, dw I2, *..., dw Ik *..., dw IL *), described K u *(j)=(uw J1, uw J2 *..., uw Jk *..., uw JL *), described dw then Ik *Described uw JkIncreasing function, be ∑ (k ∈ K)Uw JkSubtraction function; Described uw Jk *Described dw IkIncreasing function, be ∑ (k ∈ K)Dw IkSubtraction function.
7. method according to claim 4, it is characterized in that, described signal is a kind of in the following several types, comprises that described user j clicks the link of described document i, described user j and keys in the address of described document i, described user j and to described document i label, the described document i of described user j are set and are set to that bookmark, the described document i of described user j are set to like, described user j comments on described document i and described user j transmits described document i.
8. method according to claim 4 is characterized in that, the parameter vector K of described document i d *(i) and the parameter vector K of described user j u *(j) concrete update algorithm is as follows:
Dw Ik *=dw Ik+ λ 1(i, j, T) f 1[K u(j)] (for each
Figure FSA00000685601900021
)
Uw Jk *=uw Jk+ λ 2(i, j, T) f 2[K d(i)] (for each
Figure FSA00000685601900022
)
λ wherein 1(i, j, T) is under the type T of described signal, and described user j is to the influence coefficient of described document i; λ 2(i, j, T) is under the type T of described signal, and described document i is to the influence coefficient of described user j; Described DK iThe parameter vector K by described document i d(i)=(dw I1, dw I2..., dw Ik..., dw IL) in the P of numerical value maximum iThe set that the corresponding feature of individual component forms, described UK jThe parameter vector K by described user j u(j)=(uw J1, uw J2..., uw Jk..., uw JL) in the Q of numerical value maximum jThe set that the corresponding feature of individual component forms, P iAnd Q jBe setup parameter; Described dw IkWith described dw Ik *Represent respectively to upgrade parameter vector K front and the rear described document i of renewal d(i) k component, described uw JkWith described uw Jk *Represent respectively to upgrade parameter vector K front and the rear described user j of renewal u(j) k component.
9. method according to claim 8 is characterized in that, described λ 1(i, j, T) and described λ 2(i, j, T) is respectively the function of the mathematical distance between the parameter vector of the parameter vector of described document i and described user j.
10. method according to claim 4 is characterized in that, for each feature k ∈ K, to user's column vector (uw 1k, uw 2k..., uw Jk..., uw Nk) carry out normalized; And for each feature k ∈ K, to document column vector (dw 1k, dw 2k..., dw Ik..., dw Mk) carry out normalized.
11. method according to claim 1 is characterized in that, described method comprises the one by one document ordering application example of property, and its step is as follows:
Repeatedly use described parameter vector update algorithm, upgrade the parameter vector of document among the described document sets D and the parameter vector that described user collects user among the U;
The ordering vector initial value of each document among the described document sets D is set;
Use described ordering vector update algorithm, iteration is updated under each feature the ranking value of each document among the described document sets D;
User n (n ∈ D) arranges query vector and submits search condition to; In described search condition, extract search key;
One group of document Q of retrieval and described search key coupling in described document sets D;
According to the ordering vector of each document among described query vector and the described one group of document Q, calculate the personalized ordering value of each document among described one group of document Q;
According to described personalized ordering value, described one group of document Q is sorted, and according to ranking results described one group of document Q is sent to described user n.
12. the document ordering system of personalized various dimensions is characterized in that, comprises following functional module:
The document user characteristics arranges module: obtain a plurality of Web webpages by the reptile program, form document sets D={1,2, ..., M}, wherein each document is provided with ordering vector and parameter vector, and described ordering vector and described parameter vector are stored in the document database; Obtain a plurality of users on the internet, form the user and collect U={1,2 ..., N}, wherein each user is provided with parameter vector, and it is stored in the customer data base; Domain features collection K={1 is set, 2 ..., L}, and it is stored in the property data base;
Document and user's initial value arrange module: be the vector of the partial document parameters among described document sets D initial value, be each the document setup ordering vector initial value among the described document sets D, and described parameter vector initial value and described ordering vector initial value are stored in document database; For the user collects certain customers parameters vector initial value among the U, and it is stored in customer data base; Be not set up user and the document of parameter vector initial value, the default null vector that is made as of its parameter vector initial value;
User's access document signal acquisition module: be used for gathering the signal that any one user j (j ∈ U) accesses any one document i (i ∈ D), wherein comprise at least the sign of described user j and described document i, described signal storage is in the Web log database; Described user j accesses the signal of described document i, will be sent at least one application server, and described application server comprises portal site server, social networking service device, search engine server and instant communication server;
Document and customer parameter vector update module: according to the described document i that obtains in described user's access document signal acquisition module and the sign of described user j, in described document database, read the parameter vector of described document i and the parameter vector that in described customer data base, reads user j; Then upgrade the parameter vector of described document i and described user j by the parameter vector update algorithm;
Document ordering vector update module: according to the ordering vector initial value of each document among the described document sets D and the parameter vector of each document, iteration is updated under each feature the ranking value of each document among the described document sets D, and upgrades described document database with the described ranking value after upgrading;
User's enquiry module: at first, the user arranges query vector and submits search condition to, and described enquiry module extracts search key in described search condition; Then, one group of document Q of retrieval and described search key coupling in described document sets D; Afterwards, according to the ordering vector of each document among described query vector and the described one group of document Q, calculate the personalized ordering value of each document among described one group of document Q; At last, according to described personalized ordering value described one group of document Q is sorted, and the user who described one group of document Q is sent in batches submit Query according to described ranking results.
CN201210069568XA 2012-03-06 2012-03-06 Personalized multidimensional document sequencing method and system Pending CN103309900A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210069568XA CN103309900A (en) 2012-03-06 2012-03-06 Personalized multidimensional document sequencing method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210069568XA CN103309900A (en) 2012-03-06 2012-03-06 Personalized multidimensional document sequencing method and system

Publications (1)

Publication Number Publication Date
CN103309900A true CN103309900A (en) 2013-09-18

Family

ID=49135137

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210069568XA Pending CN103309900A (en) 2012-03-06 2012-03-06 Personalized multidimensional document sequencing method and system

Country Status (1)

Country Link
CN (1) CN103309900A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103324648A (en) * 2012-03-20 2013-09-25 祁勇 Method and system for acquiring individuation characteristics of users and documents
CN103390008A (en) * 2012-05-08 2013-11-13 祁勇 Method and system for acquiring personalized features of user
CN103886039A (en) * 2014-03-10 2014-06-25 百度在线网络技术(北京)有限公司 Optimization method and device with searching
CN105159932A (en) * 2015-08-07 2015-12-16 南车青岛四方机车车辆股份有限公司 Data retrieving and sorting system and method

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101388025A (en) * 2008-10-09 2009-03-18 浙江大学 Semantic web object ordering method based on Pagerank
US20100106719A1 (en) * 2008-10-23 2010-04-29 Debora Donato Context-sensitive search
CN101770521A (en) * 2010-03-11 2010-07-07 东华大学 Focusing relevancy ordering method for vertical search engine
CN102253971A (en) * 2011-06-14 2011-11-23 南京信息工程大学 PageRank method based on quick similarity

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101388025A (en) * 2008-10-09 2009-03-18 浙江大学 Semantic web object ordering method based on Pagerank
US20100106719A1 (en) * 2008-10-23 2010-04-29 Debora Donato Context-sensitive search
CN101770521A (en) * 2010-03-11 2010-07-07 东华大学 Focusing relevancy ordering method for vertical search engine
CN102253971A (en) * 2011-06-14 2011-11-23 南京信息工程大学 PageRank method based on quick similarity

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
姜鑫维 等: "Topic PageRank-一种基于主题的搜索引擎", 《计算机技术与发展》 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103324648A (en) * 2012-03-20 2013-09-25 祁勇 Method and system for acquiring individuation characteristics of users and documents
CN103390008A (en) * 2012-05-08 2013-11-13 祁勇 Method and system for acquiring personalized features of user
CN103390008B (en) * 2012-05-08 2018-09-28 六六鱼信息科技(上海)有限公司 A kind of method and system obtaining user individual feature
CN103886039A (en) * 2014-03-10 2014-06-25 百度在线网络技术(北京)有限公司 Optimization method and device with searching
CN103886039B (en) * 2014-03-10 2018-01-19 百度在线网络技术(北京)有限公司 Using the optimization method and device of retrieval
CN105159932A (en) * 2015-08-07 2015-12-16 南车青岛四方机车车辆股份有限公司 Data retrieving and sorting system and method
CN105159932B (en) * 2015-08-07 2019-06-21 中车青岛四方机车车辆股份有限公司 A kind of data retrieval engine and ordering system and method

Similar Documents

Publication Publication Date Title
KR101463974B1 (en) Big data analysis system for marketing and method thereof
US7844592B2 (en) Ontology-content-based filtering method for personalized newspapers
US9152722B2 (en) Augmenting online content with additional content relevant to user interest
TWI493367B (en) Progressive filtering search results
US8612435B2 (en) Activity based users' interests modeling for determining content relevance
KR100645608B1 (en) Server of providing information search service using visited uniform resource locator log, and method thereof
CN105488233A (en) Reading information recommendation method and system
US7818341B2 (en) Using scenario-related information to customize user experiences
US9864768B2 (en) Surfacing actions from social data
CN102982042A (en) Personalization content recommendation method and platform and system
CN101206674A (en) Enhancement type related search system and method using commercial articles as medium
US20110320442A1 (en) Systems and Methods for Semantics Based Domain Independent Faceted Navigation Over Documents
US9245010B1 (en) Extracting and leveraging knowledge from unstructured data
CN103309869A (en) Method and system for recommending display keyword of data object
KR100671077B1 (en) Server, Method and System for Providing Information Search Service by Using Sheaf of Pages
CN103309900A (en) Personalized multidimensional document sequencing method and system
KR101866411B1 (en) Method for providing document recommandation information, and device using the same
CN103514237B (en) A kind of method and system obtaining user and Document personalization feature
CN104050203A (en) Method for acquiring personalized characteristics of webpages and users
CN103324648A (en) Method and system for acquiring individuation characteristics of users and documents
JP6421146B2 (en) Information processing system, information processing apparatus, program
CN103544190A (en) Method and system for acquiring personalized features of users and documents
Samar et al. The strange case of reproducibility versus representativeness in contextual suggestion test collections
Chunrong et al. Design and Implementation of Reading System Based on Improved Personalized Recommendation Hybrid Algorithm
Suruliandi et al. VALIDATING THE PERFORMANCE OF PERSONALIZATION TECHNIQUES IN SEARCH ENGINE.

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20130918

WD01 Invention patent application deemed withdrawn after publication