CN103279535A - Method for recommending potential partners for patentee - Google Patents

Method for recommending potential partners for patentee

Info

Publication number
CN103279535A
Authority
CN
China
Prior art keywords
patentee
affiliate
research field
ipc
core
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 201310215189
Other languages
Chinese (zh)
Inventor
彭智勇
李蓉蓉
张厚望
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan University WHU
Original Assignee
Wuhan University WHU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan University WHU
Priority to CN 201310215189
Publication of CN103279535A
Legal status: Pending

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention relates to a method for recommending potential partners to a patentee. For each subclass-level IPC (International Patent Classification) code covered by the patentee's patents, the method first checks whether the patentee's relative patent-count ranking under that code exceeds a set threshold, and accordingly assigns the field represented by the code to either the core or the non-core research fields of the patentee. In the core research fields, other patentees with complementary technology are recommended; in the non-core research fields, other patentees whose technology is similar to that of the patentee's existing partners are recommended. The method, proposed here for the first time, addresses two problems: computing the degree of technical complementarity between the given patentee and other patentees in the core research fields, and computing a predicted cooperation strength between the given patentee and other patentees in the non-core research fields, using the existing partners as intermediaries. According to the applicants, the method recommends potential partners effectively.

Description

A method for recommending potential partners for a patentee
Technical field
The present invention relates to methods for finding potential partners for a patentee, and in particular to a method for recommending potential partners to a patentee.
Background art
Since the end of World War II, countries around the world have vigorously developed their economies and their science and technology. The two have gradually merged, the economic significance of science and technology keeps growing, and global economic activity is shifting from a material economy to a knowledge economy. Society has accordingly entered a fast-changing, knowledge-driven era.
At the same time, information management has become increasingly important to economic development. In today's rapidly developing knowledge economy, the core competitiveness of an enterprise generally stems from technological innovation, and only an enterprise that keeps innovating avoids being eliminated by the market. Enterprises that used to compete with each other are now cooperating on intellectual property, and intellectual property cooperation has become an important organizational form of technological innovation that is increasingly recognized and valued.
Enterprises' demand for finding partners keeps growing, and it is expected to continue rising for some time to come.
For competitive intelligence, and for technical competitive intelligence in particular, the most important source of information is patents, so patent analysis is naturally part of the information analysis carried out in competitive intelligence. Since it was first proposed, patent information analysis has mainly covered three areas: discovering and analyzing potential competitors, discovering potential partners, and industry-level patent analysis together with monitoring of industry change. Starting from the analysis of patent text and proceeding through a series of analysis and processing steps, an enterprise can ultimately obtain competitive intelligence that includes an analysis of competitors' existing partners and the discovery of potential partners.
Automatic analysis by computer software not only saves manpower and greatly improves the efficiency of patent analysis, it can also mine implicit associations from massive patent data and present the results intuitively through a friendly interface.
Summary of the invention
The invention addresses technical problems in the prior art. Based on the assumption that a patentee will continue to do research in the fields where it already works, it provides a method that classifies each research field of the patentee as a core research field or a non-core research field, and then recommends potential partners separately from the set of core research fields and the set of non-core research fields.
The above technical problem is mainly solved by the following technical scheme.
A method for recommending potential partners for a patentee, characterized in that it is based on the following definitions: a patent document set D = {d_1, d_2, ..., d_n} and a patentee set P = {P_1, P_2, ..., P_m}, where d_i denotes the document content of patent i and P_j denotes the j-th patentee. According to the given patentee's relative patent-count rank under each IPC code that its patents involve, the field represented by that IPC code is classified as a core or a non-core research field of the given patentee. In the core research fields, patentees with complementary technology are recommended; in the non-core research fields, patentees whose technology is similar to that of the given patentee's existing partners are recommended. The method comprises a step of judging whether a research field is core or non-core, a step of recommending potential partners in the given patentee's core research fields, and a step of recommending potential partners in the given patentee's non-core research fields, specifically as follows.
Step of judging whether a research field is a core or a non-core research field:
Step 1.1: analyze the subclass-level IPC codes to which the patents of the given patentee q belong, obtaining the IPC set S_1.
Step 1.2: for each IPC code in S_1, count the number of patents held by every patentee under that code, and rank the patentees in descending order of patent count.
Step 1.3: from the ranking of step 1.2, compute the relative patent-count rank R(q) of q in the field represented by that IPC code. If R(q) exceeds a set threshold, the field is a core research field of q; otherwise it is a non-core research field of q.
Step 1.4: after every IPC code in S_1 has been analyzed, obtain the core research field set S_2 and the non-core research field set S_3 of q.
Step of recommending potential partners in the given patentee's core research fields:
Step 2.1: analyze the patentees under each IPC code in S_2. If the patentee field of one of q's patent documents lists other patentees besides q, each such patentee is regarded as having a cooperative relationship with q; this gives the set P_1 of q's existing partners in the core research fields. Among the other patentees, those whose patent count does not exceed a set threshold are filtered out, and the remaining patentees form the core research field candidate set C_1.
Step 2.2: for each patentee v in C_1, collect the patent documents of v and q under each IPC code in S_2, vectorize the documents, compute document similarities, and cluster the documents by similarity.
Step 2.3: from the clustering result of step 2.2, compute the degree of technical complementarity between q and v.
Step 2.4: from the result of step 2.3, and also taking into account v's R&D strength, number of inventors and number of partners under S_2, compute the likelihood that v becomes a partner of q.
Step 2.5: sort the results of step 2.4 in descending order and take the top K patentees as q's potential partners in the core research fields.
Step of recommending potential partners in the given patentee's non-core research fields:
Step 3.1: analyze the patentees under each IPC code in S_3. If the patentee field of one of q's patent documents lists other patentees besides q, each such patentee is regarded as having a cooperative relationship with q; this gives the set P_2 of q's existing partners in the non-core research fields. The other patentees form the non-core research field candidate set C_2.
Step 3.2: for each patentee k in P_2, compute the cooperation strength between k and q.
Step 3.3: for each patentee c in C_2, compute the technical similarity between c and each patentee in P_2.
Step 3.4: from the cooperation strengths computed in step 3.2 and the technical similarities computed in step 3.3, compute the predicted cooperation strength between c and q.
Step 3.5: from the result of step 3.4, and also taking into account c's R&D strength, number of inventors and number of partners under S_3, compute the likelihood that c becomes a partner of q.
Step 3.6: sort the results of step 3.5 in descending order and take the top K patentees as q's potential partners in the non-core research fields.
In the above method, steps 1.1, 1.2 and 1.4 are carried out as stated above, and step 1.3 is carried out as follows.
From the ranking of step 1.2, the relative patent-count rank R(q) of q in the field represented by the IPC code is computed as:
$$R(q) = \frac{\mathrm{Rank}(q)}{N}$$ (Formula one)
where Rank(q) is the rank of q and N is the number of patentees under that IPC code.
If R(q) exceeds the set threshold, the field is a core research field of q; otherwise it is a non-core research field of q.
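As an illustration of steps 1.2 to 1.4, the following is a minimal Python sketch of Formula one and the core/non-core split. The function names, the tie-breaking rule, and the example IPC codes in the usage note are assumptions made for illustration. The source does not state in which direction Rank(q) runs, so the comparison with the threshold is also an assumption and is marked as such in the code.

```python
def relative_rank(counts, q):
    """Formula one: R(q) = Rank(q) / N for a single IPC code.

    counts maps each patentee to its patent count under that code.
    """
    # Rank 1 goes to the largest patent count; ties are broken by name
    # (the tie-breaking rule is an assumption, the source gives none).
    ordered = sorted(counts, key=lambda p: (-counts[p], p))
    return (ordered.index(q) + 1) / len(ordered)


def split_fields(counts_by_ipc, q, threshold):
    """Return (core, non_core): the IPC sets S_2 and S_3 for patentee q."""
    core, non_core = set(), set()
    for ipc, counts in counts_by_ipc.items():
        # ASSUMPTION: "exceeds the threshold" is read here as "ranks within
        # the top `threshold` fraction". Flip the comparison for a literal
        # reading of R(q) > threshold.
        if relative_rank(counts, q) <= threshold:
            core.add(ipc)
        else:
            non_core.add(ipc)
    return core, non_core
```

For example, with hypothetical counts `{"G06F": {"Q": 10, "A": 5, "B": 1, "C": 1}}`, patentee Q ranks first of four, so R(Q) = 0.25 and, under the assumed reading with a threshold of 0.5, G06F is placed in the core set.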
In the above method, step 2.1 is carried out as follows.
Analyze the patentees under each IPC code in S_2. If the patentee field of one of q's patent documents lists other patentees besides q, each such patentee is regarded as having a cooperative relationship with q; this gives the set P_1 of q's existing partners in the core research fields. Among the other patentees, those whose patent count does not exceed a set threshold are filtered out, and the remaining patentees form the core research field candidate set C_1.
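Step 2.1 can be sketched as below. This is only an illustration under stated assumptions: each patent is reduced to the set of names in its patentee field, the function and parameter names are hypothetical, and existing partners are assumed to be excluded from the candidate set (the source says only that "other patentees" are screened).

```python
def partners_and_candidates(patents, q, min_patents):
    """Build P_1 (existing partners of q) and C_1 (candidates).

    patents: list of sets, each holding the patentee names of one patent
             under the IPC codes in S_2.
    min_patents: patentees with a count not exceeding this are dropped.
    """
    partners = set()
    counts = {}
    for owners in patents:
        for p in owners:
            counts[p] = counts.get(p, 0) + 1
        if q in owners:
            # Every co-listed patentee counts as an existing partner of q.
            partners |= owners - {q}
    # ASSUMPTION: existing partners are not candidates themselves.
    candidates = {
        p for p, n in counts.items()
        if p != q and p not in partners and n > min_patents
    }
    return partners, candidates
```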
In the above method, step 2.2 comprises the following substeps.
Step 2.21: collect the patent documents of v and q into the set D_qv = {d_1, d_2, ..., d_n}.
Step 2.22: represent each patent document as a vector in a vector space, as follows.
Each patent d_i in D_qv is represented by a vector over a set of keywords. First, all patent documents are segmented into words (Chinese word segmentation). Stop words are then removed using a custom or public stop-word dictionary. For each remaining term, its weight in the document is computed as:
$$w(t_j, d_i) = \frac{tf(t_j, d_i) \times \log(N/n_{t_j} + 0.01)}{\sqrt{\sum_{t_j \in d_i} \left[ tf(t_j, d_i) \times \log(N/n_{t_j} + 0.01) \right]^2}}$$ (Formula two)
where w(t_j, d_i) is the weight of term t_j in document d_i, tf(t_j, d_i) is the frequency of t_j in d_i, N is the total number of patents in D_qv, n_{t_j} is the number of patent documents in D_qv in which t_j appears, and the denominator is a normalization factor.
Finally, each patent document is represented by the vector of its term weights,
$$d_i = (w_{i1}, w_{i2}, \ldots, w_{in})$$
where w_ij is the weight of term t_j in patent document d_i.
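A minimal Python sketch of the weighting in Formula two follows. It assumes the documents have already been segmented and stripped of stop words, and it uses the natural logarithm, since the source does not name the base. The function name `tfidf_vectors` is hypothetical.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return (vocab, vectors) with weights as in Formula two.

    docs: list of token lists (already segmented, stop words removed).
    """
    n_docs = len(docs)
    # n_t: number of documents in which each term appears.
    df = Counter(term for doc in docs for term in set(doc))
    vocab = sorted(df)
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        # Numerator of Formula two; natural log is an assumption.
        raw = [tf[t] * math.log(n_docs / df[t] + 0.01) for t in vocab]
        # Denominator: Euclidean norm of the unnormalized weights.
        norm = math.sqrt(sum(x * x for x in raw))
        vectors.append([x / norm if norm else 0.0 for x in raw])
    return vocab, vectors
```

Because of the normalization factor, every non-empty document vector has unit length, which keeps long and short patents comparable.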
Step 2.23: using the vector representation of the documents, compute the similarity between every pair of patent documents, as follows.
For any two patent documents d_i and d_j, the similarity is measured by the cosine of the angle between their vectors:
$$sim(d_i, d_j) = \frac{\sum_{l=1}^{n} w_l(d_i) \times w_l(d_j)}{\sqrt{\left( \sum_{l=1}^{n} w_l^2(d_i) \right) \times \left( \sum_{l=1}^{n} w_l^2(d_j) \right)}}$$ (Formula three)
where w_l(d_i) is the weight of the l-th term in document d_i and w_l(d_j) is the weight of the l-th term in document d_j.
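Formula three is the standard cosine similarity. A short sketch, assuming both vectors are equal-length lists of term weights (the function name is hypothetical):

```python
import math


def cosine_similarity(u, v):
    """Formula three: cosine of the angle between two weight vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u) * sum(b * b for b in v))
    # An all-zero vector has no direction; treat its similarity as 0.
    return dot / norm if norm else 0.0
```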
Step 2.24: cluster the patent texts according to the similarities computed in step 2.23. Each resulting cluster is one technical topic, and together the clusters form the topic set S_qv.
In the above method, step 2.3 comprises the following substeps.
Step 2.31: from the clustering result of step 2.24, compute the complementarity of q and v on any two distinct technical topics i and j:
$$g(q, v, i, j) = \begin{cases} \dfrac{n_{qi} - n_{vi}}{sum_i} \times \dfrac{n_{vj} - n_{qj}}{sum_j} & \text{if } (n_{qi} - n_{vi}) \times (n_{vj} - n_{qj}) > 0 \\ 0 & \text{otherwise} \end{cases}$$ (Formula four)
where n_qi is the number of patents of q in topic i, n_vi is the number of patents of v in topic i, sum_i is the total number of patents in topic i, n_qj is the number of patents of q in topic j, n_vj is the number of patents of v in topic j, and sum_j is the total number of patents in topic j.
Step 2.32: from the results of step 2.31, compute the degree of technical complementarity between q and v:
$$C(q, v) = \sum_{i, j \in S_{qv}} g(q, v, i, j)$$ (Formula five)
where S_qv is the topic set formed by clustering, and i and j are two different technical topics.
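Formulas four and five can be sketched as follows. Two assumptions are made and should be checked against the intended method: since D_qv holds only the patents of q and v, the topic total sum_i is taken to be n_qi + n_vi; and the sum in Formula five runs over unordered topic pairs (g is symmetric in i and j, so summing ordered pairs would simply double the value). The function name is hypothetical.

```python
from itertools import combinations


def complementarity(q_counts, v_counts):
    """Formula five: C(q, v), summed over unordered topic pairs.

    q_counts, v_counts: dict mapping topic -> number of patents of q
    (respectively v) that fell into that topic cluster.
    """
    topics = sorted(set(q_counts) | set(v_counts))
    total = 0.0
    for i, j in combinations(topics, 2):
        a = q_counts.get(i, 0) - v_counts.get(i, 0)  # n_qi - n_vi
        b = v_counts.get(j, 0) - q_counts.get(j, 0)  # n_vj - n_qj
        if a * b > 0:  # condition of Formula four
            # ASSUMPTION: sum_i = n_qi + n_vi (only q and v are in D_qv).
            sum_i = q_counts.get(i, 0) + v_counts.get(i, 0)
            sum_j = q_counts.get(j, 0) + v_counts.get(j, 0)
            total += (a / sum_i) * (b / sum_j)
    return total
```

Two patentees with identical topic profiles score 0, while a pair in which each is strong exactly where the other is absent scores 1 per topic pair.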
In the above method, step 2.4 comprises the following substeps.
Step 2.41: collect the number and filing times of v's patents under S_2, and from them compute the candidate patentee's R&D strength in the field:
$$P(v) = \sum_{d \in S} e^{-\frac{t - t_d}{\lambda}}$$ (Formula six)
where S is the set of v's patent documents under S_2, t_d is the filing time of patent document d, t is the current time, and λ is an adjustable parameter.
Step 2.42: count v's number of inventors and number of partners under S_2.
Step 2.43: compute the likelihood that v becomes a partner of q:
$$Candidate(v) = \alpha\, C(q, v) + \beta\, P(v) + \gamma\, n_1 + \tau\, n_2$$ (Formula seven)
where α, β, γ and τ are adjustable parameters, v is the candidate patentee, n_1 is v's number of inventors under S_2, and n_2 is v's number of partners under S_2.
In the above method, step 2.5 is carried out as follows.
Sort the results of step 2.43 in descending order and take the top K patentees as q's potential partners in the core research fields.
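Steps 2.4 and 2.5 can be sketched as three small functions. Time is expressed in years here, which is an assumption (the source gives no unit for t, t_d or λ), and all function and parameter names are hypothetical.

```python
import math


def rd_strength(filing_years, now, lam):
    """Formula six: recent patents contribute more than old ones."""
    return sum(math.exp(-(now - t_d) / lam) for t_d in filing_years)


def candidate_score(c_qv, p_v, n_inventors, n_partners,
                    alpha, beta, gamma, tau):
    """Formula seven: weighted sum of the four factors."""
    return alpha * c_qv + beta * p_v + gamma * n_inventors + tau * n_partners


def top_k(scores, k):
    """Step 2.5: the K patentees with the highest score, best first."""
    return sorted(scores, key=scores.get, reverse=True)[:k]
```

A patent filed in the current year contributes exactly 1 to P(v), and one filed λ years ago contributes 1/e, so λ controls how quickly older work is discounted.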
In the above method, step 3.1 is carried out as follows.
Analyze the patentees under each IPC code in S_3. If the patentee field of one of q's patent documents lists other patentees besides q, each such patentee is regarded as having a cooperative relationship with q; this gives the set P_2 of q's existing partners in the non-core research fields. The other patentees form the non-core research field candidate set C_2.
In the above method, step 3.2 is carried out as follows.
Collect the set S_qk of patent documents under S_3 whose patent rights are jointly held by q and a patentee k in P_2, and compute their cooperation strength:
$$R(q, k) = \sum_{d \in S_{qk}} e^{-\frac{t - t_d}{\lambda}}$$ (Formula eight)
where t is the current time, t_d is the filing time of patent document d in S_qk, and λ is an adjustable parameter.
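Formula eight applies the same time decay as Formula six, restricted to jointly held patents. A minimal sketch, assuming each patent is given as a pair of (set of patentee names, filing year); the data layout and names are hypothetical.

```python
import math


def cooperation_strength(patents, q, k, now, lam):
    """Formula eight: decayed count of patents co-owned by q and k.

    patents: list of (owners, filing_year) pairs under the IPC codes in S_3.
    """
    return sum(
        math.exp(-(now - year) / lam)
        for owners, year in patents
        if q in owners and k in owners
    )
```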
In the above method, step 3.3 comprises the following substeps.
Step 3.31: represent each patentee as a vector, following the idea of text vectorization, that is, as an n-dimensional vector over keywords. All patent documents of c under S_3 are treated as a single super document d_c representing c, and likewise all patent documents of k under S_3 are treated as a super document d_k representing k. The super documents of c and of all patentees in the partner set (printed as C_3 in the source, presumably P_2) form the document set D_sim = {d_1, d_2, ..., d_n}. Each document d_i in D_sim is represented by a vector over a set of keywords. First, the documents are segmented into words (Chinese word segmentation). Stop words are then removed using a custom or public stop-word dictionary. For each remaining term, its weight in the document is computed as:
$$w(t_j, d_i) = \frac{tf(t_j, d_i) \times \log(N/n_{t_j} + 0.01)}{\sqrt{\sum_{t_j \in d_i} \left[ tf(t_j, d_i) \times \log(N/n_{t_j} + 0.01) \right]^2}}$$ (Formula two)
where w(t_j, d_i) is the weight of term t_j in document d_i, tf(t_j, d_i) is the frequency of t_j in d_i, N is the total number of documents in D_sim, n_{t_j} is the number of documents in D_sim in which t_j appears, and the denominator is a normalization factor.
Finally, each document is represented by the vector of its term weights, d_i = (w_i1, w_i2, ..., w_in), where w_ij is the weight of term t_j in document d_i.
Step 3.32: for any two documents d_i and d_j, the similarity is measured by the cosine of the angle between their vectors:
$$sim(d_i, d_j) = \frac{\sum_{l=1}^{n} w_l(d_i) \times w_l(d_j)}{\sqrt{\left( \sum_{l=1}^{n} w_l^2(d_i) \right) \times \left( \sum_{l=1}^{n} w_l^2(d_j) \right)}}$$ (Formula three)
where w_l(d_i) is the weight of the l-th term in document d_i and w_l(d_j) is the weight of the l-th term in document d_j.
In the above method, step 3.4 is carried out as follows.
The cooperation strength between c and q is predicted from the technical similarity between c and k and the cooperation strength between k and q:
$$P(q, c) = \max_{k \in P_2} \left( R(q, k) \times sim(k, c) \right)$$ (Formula nine)
where q is the given patentee, k is a partner of q, c is the candidate, P_2 is the set of q's existing partners, R(q, k) is the cooperation strength between q and k, and sim(k, c) is the technical similarity between k and c.
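Formula nine takes the best product over all existing partners. A minimal sketch, assuming R(q, k) and sim(k, c) have already been computed and are passed in as dictionaries keyed by partner name (a hypothetical layout):

```python
def predicted_strength(coop, sim_to_c):
    """Formula nine: max over partners k of R(q, k) * sim(k, c).

    coop: dict partner k -> R(q, k)
    sim_to_c: dict partner k -> sim(k, c)
    """
    # With no existing partners there is nothing to mediate; return 0.0.
    return max(
        (coop[k] * sim_to_c.get(k, 0.0) for k in coop),
        default=0.0,
    )
```

Taking the maximum means a candidate needs to resemble only one strongly connected partner to score well; a weak tie with a near-identical partner can still lose to a strong tie with a moderately similar one.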
In the above method, step 3.5 comprises the following substeps.
Step 3.51: collect the number and filing times of c's patents under S_3, and from them compute the candidate patentee's R&D strength in the field:
$$P(c) = \sum_{d \in S} e^{-\frac{t - t_d}{\lambda}}$$ (Formula six)
where S is the set of c's patent documents under S_3, t_d is the filing time of patent document d, t is the current time, and λ is an adjustable parameter.
Step 3.52: count c's number of inventors and number of partners under S_3.
Step 3.53: compute the likelihood that c becomes a partner of q, using Formula seven applied to c:
$$Candidate(c) = \alpha\, C(q, c) + \beta\, P(c) + \gamma\, n_1 + \tau\, n_2$$ (Formula seven)
where α, β, γ and τ are adjustable parameters, c is the candidate patentee, n_1 is c's number of inventors under S_3, and n_2 is c's number of partners under S_3. The source prints the first term as the complementarity term of Formula seven; because step 3.5 builds on the result of step 3.4, this term is presumably meant to be the predicted cooperation strength P(q, c) of Formula nine.
In the above method, step 3.6 is carried out as follows.
Sort the results of step 3.53 in descending order and take the top K patentees as q's potential partners in the non-core research fields.
The invention therefore has the following advantages: 1. it proposes, for the first time, a method for discovering potential partners for a patentee; 2. it solves the problem of computing the technical complementarity between a patentee and other patentees in the patentee's core research fields; 3. it solves the problem of computing a predicted cooperation strength between a patentee and other patentees in the patentee's non-core research fields, using the existing partners as intermediaries; 4. it recommends potential partners effectively.
Description of drawings
Fig. 1 is a flowchart of the potential partner recommendation for a patentee according to the invention.
Embodiment
The technical scheme of the invention is described in further detail below by way of an embodiment and with reference to the accompanying drawing.
Embodiment:
The potential affiliate's recommend method of a kind of patentee is characterized in that, based on definition: patent file set D={d 1, d 2..., d n, the patentee gathers P={P 1, P 2..., P m; Wherein, d iThe document content of expression patent i; P jRepresent j patentee; According to the relative patent quantity rank of given patentee under each IPC classification number that its patent relates to, the field of IPC classification number representative is classified as given patentee's core research field or non-core research field, the patentee of core research field recommended technology complementation, recommend the patentee similar to its existing affiliate's technology in non-core research field; Comprise and judge that research field is the step in core realm or non-core field, core research field given patentee is the step that given patentee recommends potential affiliate: and be the step that given patentee recommends potential affiliate in given patentee's non-core research field, specifically comprise:
Judge that research field is the step of core research field or non-core research field:
Step 1.1 is analyzed other IPC of group level under the patent of given patentee q, obtains the IPC S set 1
Step 1.2, pair set S 1In each IPC, the patent quantity of adding up all patentees under this IPC, by the descending sort of patent quantity to patentee's rank;
Step 1.3, calculate q at the relative patent quantity rank R (q) in this IPC representative field according to the ranking result of step 1.2, if R (q) surpasses setting threshold, then this field is the core research field of q, if R (q) does not surpass setting threshold, then this field is the non-core research field of q;
Step 1.4, pair set S 1In after each IPC analyzes, obtain the core research field S set of q 2With non-core research field S set 3
Core research field given patentee is the step that given patentee recommends potential affiliate:
Step 2.1 is analyzed S set 2In patentee under each IPC, if patentee one hurdle of the patent file of q not only exists q to also have other patentees, then this patentee is considered as having cooperative relationship with q, the existing affiliate of core research field who obtains q gathers P 1, screening patent quantity is no more than other patentees of setting threshold, and remaining patentee constitutes core research field candidate and gathers C 1
Step 2.2, pair set C 1In each patentee v, collect it and q in S set 2In patent file under each IPC, with the document vectorization, calculate the document similarity, according to similarity with clustering documents;
Step 2.3 according to the cluster result of step 2.2, is calculated the technology complementation degree of q and v;
Step 2.4 according to the result of calculation of step 2.3, is taken all factors into consideration v in S set 2Under research and development strength, inventor's quantity, affiliate's quantity, calculate the possibility that v becomes the q affiliate;
Step 2.5, to the result of calculation of step 2.4, descending sort is got preceding K patentee as the potential affiliate of core research field of q;
Non-core research field given patentee is the step that given patentee recommends potential affiliate:
Step 3.1 is analyzed S set 3In patentee under each IPC, if patentee one hurdle of the patent file of q not only exists q to also have other patentees, then this patentee is considered as having cooperative relationship with q, the existing affiliate of non-core research field who obtains q gathers P 2, other patentees constitute non-core research field candidate and gather C 2
Step 3.2, pair set P 2In each patentee k, calculate the cooperative relationship intensity of k and q;
Step 3.3, pair set C 2In each patentee c, calculate he with the set P 2In each patentee's technology similarity;
Step 3.4 is according to the cooperative relationship intensity of step 3.2 calculating and the c and set P of step 3.3 calculating 2In each patentee's technology similarity, calculate the cooperative relationship prediction of strength value of c and q;
Step 3.5 according to the result of calculation of step 3.4, is taken all factors into consideration c in S set 3Under research and development strength, inventor's quantity, affiliate's quantity, calculate the possibility that c becomes the q affiliate;
Step 3.6, to the result of calculation of step 3.5, descending sort is got preceding K patentee as the potential affiliate in non-core field of q.
The potential affiliate's recommend method of above-mentioned a kind of patentee, described step 1.1 concrete operation method is:
Analyze affiliated other IPC of group level of patent of q, obtain the IPC S set 1
The potential affiliate's recommend method of above-mentioned a kind of patentee, described step 1.2 concrete operation method is:
Pair set S 1In each IPC, the patent quantity of adding up all patentees under this IPC, by the descending sort of patent quantity to patentee's rank.
The potential affiliate's recommend method of above-mentioned a kind of patentee, described step 1.3 concrete operation method is:
Calculate q at the relative patent quantity rank R (q) in this IPC representative field according to the ranking result of step 1.2, its calculating formula is:
R ( q ) = Rank ( q ) N Formula one;
Wherein, Rank (q) is the rank of q, and N is all the patentee's quantity under this IPC;
If R (q) surpasses setting threshold, then this field is the core research field of q, if R (q) does not surpass setting threshold, then this field is the non-core research field of q.
The potential affiliate's recommend method of above-mentioned a kind of patentee, described step 1.4 concrete operation method is:
Pair set S 1In after each IPC analyzes, obtain the core research field S set of q 2With non-core research field S set 3
The potential affiliate's recommend method of above-mentioned a kind of patentee, described step 2.1 concrete operation method is:
Analyze S set 2In patentee under each IPC, if patentee one hurdle of the patent file of q not only exists q to also have other patentees, then this patentee is considered as having cooperative relationship with q, the existing affiliate of core research field who obtains q gathers P 1, screening patent quantity is no more than other patentees of setting threshold, and remaining patentee constitutes core research field candidate and gathers C 1
The potential affiliate's recommend method of above-mentioned a kind of patentee, described step 2.2 comprises following substep:
Step 2.21 is collected the patent file of v and q and is gathered D Qv={ d 1, d 2..., d n;
Step 2.22, patent file is represented that with the space vector form concrete operation method is as follows:
For patent file set D Qv={ d 1, d 2..., d nIn any patent d i, utilize the space vector of one group of keyword to represent; Its process is, at first all patent files carried out Chinese word segmentation, removes stop words in the document according to self-defined or public stop words dictionary then, for removing lexical item behind the stop words, calculates the weight of each lexical item in document, and its calculating formula is:
w ( t j , d i ) = tf ( t j , d i ) × log ( N / n t j + 0.01 ) Σ t j ∈ d i [ tf ( t j , d i ) × log ( N / n t j ) + 0.01 ] 2 Formula two;
Wherein, w (t j, d i) be lexical item t jAt text d iIn weight, and tf (t j, d i) be word t jAt text d iIn word frequency, N is patent set D QvThe sum of middle patent, n are patent set D QvThe patent file number of lexical item occurs, denominator is normalized factor;
At last, represent each piece patent file with the space vector of each lexical item correspondence, be expressed as Wherein certain is worth w IjBe lexical item t jAt patent file d iIn weight;
Step 2.23 is represented according to the space vector of document, calculates the similarity between the patent file in twos, and concrete operation method is:
To any two patent file d iAnd d j, use included angle cosine between the vector of its correspondence to come the similarity of measurement, its formula is:
sim ( d i , d j ) = Σ l = 1 n w l ( d i ) × w l ( d j ) ( Σ l = 1 n w l 2 ( d i ) ) × ( Σ l = 1 n w l 2 ( d j ) ) Formula three;
W wherein l(d i) be that l lexical item is at document d iIn weight, w l(d j) be that l lexical item is at document d jIn weight;
Step 2.24, according to the similarity that step 2.23 is calculated, to the patent text cluster, formed each class i.e. a technical theme, produces the theme S set Qv
The potential affiliate's recommend method of above-mentioned a kind of patentee, described step 2.3 comprises following substep:
Step 2.31, according to the cluster result of step 2.24, calculate q and the complementary degree of v on any two technical theme i inequality and j:
g ( q , v , i , j ) = n qi - n vi sum i × n vj - n qj sum j if ( n qi - n vi ) × ( n vj - n qj ) > 0 0 else Formula four;
Wherein, n QiBe the patent quantity of q in theme i, n ViBe the patent quantity of v in theme i, sum iBe the theme patent sum in the i, n QjBe the patent quantity of q in theme j, n VjBe the patent quantity of v in theme j, sum jThe interior patent sum of j is the theme;
Step 2.32, according to the result of calculation of step 2.31, calculate the technology complementation degree between q and v:
C ( q , v ) = Σ i , j ∈ S qv g ( q , v , i , j ) Formula five;
Wherein, S QvBe the theme set that cluster forms, i, j are two different technical themes.
In the above method for recommending potential partners for a patentee, step 2.4 comprises the following sub-steps:
Step 2.41: count the patents of v under set $S_2$ and their filing dates, and compute the candidate patentee's R&D strength in the field from the patent count and the filing dates:
$$P(v) = \sum_{d \in S} e^{-\frac{t - t_d}{\lambda}}$$ (Formula six);
where S is the set of patent documents of v under set $S_2$, $t_d$ is the filing date of patent document d, t is the current time, and $\lambda$ is an adjustable parameter;
Step 2.42: count the number of inventors and the number of partners of v under set $S_2$;
Step 2.43: compute the likelihood that v becomes a partner of q:
$$\mathrm{Candidate}(v) = \alpha\, C(q, v) + \beta\, P(v) + \gamma\, n_1 + \tau\, n_2$$ (Formula seven);
where $\alpha$, $\beta$, $\gamma$ and $\tau$ are adjustable parameters, v is the candidate patentee, $n_1$ is the number of inventors of v under set $S_2$, and $n_2$ is the number of partners of v under set $S_2$.
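The time-decayed R&D strength of Formula six and the weighted score of Formula seven can be sketched in Python as below. The sketch assumes filing dates and the current time are expressed in years; the text does not fix a time unit, and the parameter values shown are arbitrary placeholders rather than recommended settings.

```python
import math

def rd_strength(filing_years, now, lam):
    """Formula six: each patent contributes exp(-(t - t_d) / lambda)."""
    return sum(math.exp(-(now - t_d) / lam) for t_d in filing_years)

def candidate_score(c_qv, p_v, n1, n2, alpha, beta, gamma, tau):
    """Formula seven: weighted sum of complementarity, R&D strength,
    inventor count n1 and partner count n2."""
    return alpha * c_qv + beta * p_v + gamma * n1 + tau * n2

# Hypothetical candidate v with two patents, filed in 2013 and 2008.
p_v = rd_strength([2013, 2008], now=2013, lam=5.0)
score = candidate_score(c_qv=0.45, p_v=p_v, n1=3, n2=2,
                        alpha=1.0, beta=1.0, gamma=0.1, tau=0.1)
print(p_v, score)
```

A patent filed at the current time contributes 1, and older patents contribute progressively less; a larger $\lambda$ slows this decay.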
In the above method for recommending potential partners for a patentee, step 2.5 is specifically performed as follows:
Sort the results of step 2.43 in descending order and take the top K patentees as the potential partners of q in its core research fields.
In the above method for recommending potential partners for a patentee, step 3.1 is specifically performed as follows:
Analyze the patentees under each IPC in set $S_3$; if the patentee field of a patent document of q lists other patentees in addition to q, those patentees are regarded as having a cooperative relationship with q, which yields the set $P_2$ of q's existing partners in the non-core research fields; the other patentees form the non-core research field candidate set $C_2$.
In the above method for recommending potential partners for a patentee, step 3.2 is specifically performed as follows:
Collect the set $S_{qk}$ of patent documents under set $S_3$ in which q and a patentee k in set $P_2$ jointly hold the patent right, then compute their cooperation strength:
$$R(q, k) = \sum_{d \in S_{qk}} e^{-\frac{t - t_d}{\lambda}}$$ (Formula eight);
where t is the current time, $t_d$ is the filing date of patent document d in set $S_{qk}$, and $\lambda$ is an adjustable parameter.
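A minimal Python sketch of the cooperation strength follows. It assumes the decay term has the same form as the R&D strength formula, that is, exp(-(t - t_d) / λ) with λ in the denominator, and that each patent record is a pair of the patentees listed on the patent and its filing year. The record layout, the names and the use of years as the time unit are assumptions for illustration.

```python
import math
from collections import defaultdict

def cooperation_strengths(patents, q, now, lam):
    """Cooperation strength R(q, k) for every co-owner k of q.

    patents: iterable of (assignees, filing_year), where assignees is the
    set of patentees listed on the patent. Only patents that list q
    together with at least one other patentee contribute.
    """
    strengths = defaultdict(float)
    for assignees, t_d in patents:
        if q in assignees:
            for k in assignees:
                if k != q:
                    strengths[k] += math.exp(-(now - t_d) / lam)
    return dict(strengths)

# Hypothetical records under the non-core IPCs of patentee "Q".
patents = [
    ({"Q", "K"}, 2013),
    ({"Q", "K"}, 2011),
    ({"Q"}, 2012),        # sole ownership: no cooperation
    ({"K", "Z"}, 2013),   # does not involve Q
]
r = cooperation_strengths(patents, "Q", now=2013, lam=2.0)
print(r)
```

Recent joint filings dominate the result, so a partner with one current joint patent can outrank one with several old ones.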
In the above method for recommending potential partners for a patentee, step 3.3 comprises the following sub-steps:
Step 3.31: represent each patentee as a vector, following the idea of text vectorization, using an n-dimensional vector of keywords. The set of all patent documents of c under set $S_3$ and the set of all patent documents of k under set $S_3$ are each treated as one super-document, $d_c$ and $d_k$, representing c and k respectively. The super-documents of c and of all patentees in set $C_3$ form the document set $D_{sim} = \{d_1, d_2, \ldots, d_n\}$. Any patent $d_i$ in $D_{sim}$ is represented by the space vector of a group of keywords, as follows: first perform Chinese word segmentation on the patent documents, then remove stop words using a custom or public stop-word dictionary, and finally compute the weight in the document of each term that remains after stop-word removal:
$$w(t_j, d_i) = \frac{\mathrm{tf}(t_j, d_i) \times \log(N / n_{t_j} + 0.01)}{\sqrt{\sum_{t_j \in d_i} \left[ \mathrm{tf}(t_j, d_i) \times \log(N / n_{t_j} + 0.01) \right]^2}}$$ (Formula two);
where $w(t_j, d_i)$ is the weight of term $t_j$ in text $d_i$, $\mathrm{tf}(t_j, d_i)$ is the frequency of term $t_j$ in text $d_i$, N is the total number of patents in the patent set $D_{sim}$, $n_{t_j}$ is the number of patent documents in $D_{sim}$ in which the term occurs, and the denominator is a normalization factor;
Finally, each patent document is represented by the space vector of its terms, in which each component $w_{ij}$ is the weight of term $t_j$ in patent document $d_i$;
Step 3.32: for any two patent documents $d_i$ and $d_j$, the similarity is measured by the cosine of the angle between their corresponding vectors:
$$\mathrm{sim}(d_i, d_j) = \frac{\sum_{l=1}^{n} w_l(d_i) \times w_l(d_j)}{\sqrt{\left(\sum_{l=1}^{n} w_l^2(d_i)\right) \times \left(\sum_{l=1}^{n} w_l^2(d_j)\right)}}$$ (Formula three);
where $w_l(d_i)$ is the weight of the $l$-th term in document $d_i$, and $w_l(d_j)$ is the weight of the $l$-th term in document $d_j$.
In the above method for recommending potential partners for a patentee, step 3.4 is specifically performed as follows:
Based on the technical similarity between c and k and the cooperation strength between k and q, predict the cooperation strength between c and q:
$$P(q, c) = \max_{k \in P_2} \left( R(q, k) \times \mathrm{sim}(k, c) \right)$$ (Formula nine);
where q is the given patentee, k is a partner of q, c is the candidate, $P_2$ is the set of existing partners of q, $R(q, k)$ is the cooperation strength between q and k, and $\mathrm{sim}(k, c)$ is the technical similarity between k and c.
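Formula nine takes, over all existing partners k, the largest product of cooperation strength and technical similarity. A minimal Python sketch, assuming both quantities have already been computed and are stored in dictionaries keyed by partner; the function name, the sample values and the choice to return 0 when q has no existing partner are assumptions of this example.

```python
def predicted_strength(r_q, sim_to_c):
    """Formula nine: max over k in P2 of R(q, k) * sim(k, c).

    r_q:      dict partner k -> cooperation strength R(q, k)
    sim_to_c: dict partner k -> technical similarity sim(k, c)
    """
    return max((r * sim_to_c.get(k, 0.0) for k, r in r_q.items()),
               default=0.0)  # assumption: no partners means no prediction

# Hypothetical values for two existing partners of q and one candidate c.
r_q = {"K1": 2.0, "K2": 0.5}
sim_to_c = {"K1": 0.25, "K2": 0.75}
print(predicted_strength(r_q, sim_to_c))  # max(2.0 * 0.25, 0.5 * 0.75)
```

Here the candidate is most similar to K2, but the stronger tie between q and K1 decides the result.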
In the above method for recommending potential partners for a patentee, step 3.5 comprises the following sub-steps:
Step 3.51: count the patents of c under set $S_3$ and their filing dates, and compute the candidate patentee's R&D strength in the field from the patent count and the filing dates:
$$P(c) = \sum_{d \in S} e^{-\frac{t - t_d}{\lambda}}$$ (Formula six);
Step 3.52: count the number of inventors and the number of partners of c under set $S_3$;
Step 3.53: compute the likelihood that c becomes a partner of q:
$$\mathrm{Candidate}(c) = \alpha\, P(q, c) + \beta\, P(c) + \gamma\, n_1 + \tau\, n_2$$ (Formula seven);
where $\alpha$, $\beta$, $\gamma$ and $\tau$ are adjustable parameters, c is the candidate patentee, $P(q, c)$ is the predicted cooperation strength computed in step 3.4, $P(c)$ is the R&D strength computed in step 3.51, $n_1$ is the number of inventors of c under set $S_3$, and $n_2$ is the number of partners of c under set $S_3$.
In the above method for recommending potential partners for a patentee, step 3.6 is specifically performed as follows:
Sort the results of step 3.53 in descending order and take the top K patentees as the potential partners of q in its non-core research fields.
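The final selection is a plain top-K ranking by score. A minimal Python sketch using the standard-library `heapq.nlargest`; the candidate names and scores are made up for illustration.

```python
import heapq

def top_k(scores, k):
    """Return the k (patentee, score) pairs with the highest scores,
    ordered from highest to lowest."""
    return heapq.nlargest(k, scores.items(), key=lambda item: item[1])

# Hypothetical candidate scores.
scores = {"A": 1.95, "B": 0.4, "C": 2.6}
print(top_k(scores, 2))
```

For small candidate sets, sorting the whole list in descending order and slicing the first K entries gives the same result.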
It should be noted that the present invention has the following main beneficial effects: first, the proposed method for recommending potential partners for a patentee fills a gap in research on potential-partner recommendation for patentees; second, the proposed method is effective, as explained below:
The present invention is based on the assumption that a patentee will continue its research in its existing technical fields. It first classifies each of the patentee's research fields as a core research field or a non-core research field. It takes into account that, in a core research field, the patentee seeks partners who can help it in that technical field, and that, in a non-core research field, the patentee seeks partners whose technology is similar to that of its existing partners. The likelihood that the given patentee will cooperate with each potential partner is estimated separately for the two cases.
The specific embodiments described herein merely illustrate the spirit of the present invention. Those skilled in the art may make various modifications or additions to the described embodiments, or substitute them in similar ways, without departing from the spirit of the invention or exceeding the scope defined by the appended claims.

Claims (16)

1. A method for recommending potential partners for a patentee, characterized in that it is based on the following definitions: a patent document set $D = \{d_1, d_2, \ldots, d_n\}$ and a patentee set $P = \{P_1, P_2, \ldots, P_m\}$, where $d_i$ denotes the document content of patent i and $P_j$ denotes the j-th patentee; according to the given patentee's relative patent-count rank under each IPC classification code to which its patents relate, the field represented by the IPC classification code is classified as a core research field or a non-core research field of the given patentee; in a core research field, patentees with complementary technology are recommended, and in a non-core research field, patentees whose technology is similar to that of the given patentee's existing partners are recommended; the method comprises a step of judging whether a research field is a core or a non-core research field, a step of recommending potential partners for the given patentee in its core research fields, and a step of recommending potential partners for the given patentee in its non-core research fields, specifically:
The step of judging whether a research field is a core research field or a non-core research field:
Step 1.1: analyze the subclass-level IPCs to which the patents of the given patentee q belong, obtaining the IPC set $S_1$;
Step 1.2: for each IPC in set $S_1$, count the patents of all patentees under this IPC, and rank the patentees in descending order of patent count;
Step 1.3: from the ranking result of step 1.2, compute q's relative patent-count rank R(q) in the field represented by this IPC; if R(q) exceeds the set threshold, the field is a core research field of q; if R(q) does not exceed the set threshold, the field is a non-core research field of q;
Step 1.4: after every IPC in set $S_1$ has been analyzed, obtain the core research field set $S_2$ and the non-core research field set $S_3$ of q;
The step of recommending potential partners for the given patentee in its core research fields:
Step 2.1: analyze the patentees under each IPC in set $S_2$; if the patentee field of a patent document of q lists other patentees in addition to q, those patentees are regarded as having a cooperative relationship with q, which yields the set $P_1$ of q's existing partners in the core research fields; the other patentees whose patent count does not exceed a set threshold are filtered out, and the remaining patentees form the core research field candidate set $C_1$;
Step 2.2: for each patentee v in set $C_1$, collect the patent documents of v and q under each IPC in set $S_2$, vectorize the documents, compute the document similarities, and cluster the documents according to similarity;
Step 2.3: according to the clustering result of step 2.2, compute the technical complementarity of q and v;
Step 2.4: according to the result of step 2.3, and taking into account the R&D strength, number of inventors and number of partners of v under set $S_2$, compute the likelihood that v becomes a partner of q;
Step 2.5: sort the results of step 2.4 in descending order and take the top K patentees as the potential partners of q in its core research fields;
The step of recommending potential partners for the given patentee in its non-core research fields:
Step 3.1: analyze the patentees under each IPC in set $S_3$; if the patentee field of a patent document of q lists other patentees in addition to q, those patentees are regarded as having a cooperative relationship with q, which yields the set $P_2$ of q's existing partners in the non-core research fields; the other patentees form the non-core research field candidate set $C_2$;
Step 3.2: for each patentee k in set $P_2$, compute the cooperation strength between k and q;
Step 3.3: for each patentee c in set $C_2$, compute its technical similarity to each patentee in set $P_2$;
Step 3.4: from the cooperation strengths computed in step 3.2 and the technical similarities between c and each patentee in set $P_2$ computed in step 3.3, compute the predicted cooperation strength between c and q;
Step 3.5: according to the result of step 3.4, and taking into account the R&D strength, number of inventors and number of partners of c under set $S_3$, compute the likelihood that c becomes a partner of q;
Step 3.6: sort the results of step 3.5 in descending order and take the top K patentees as the potential partners of q in its non-core research fields.
2. The method for recommending potential partners for a patentee according to claim 1, characterized in that step 1.1 is specifically performed as follows:
Analyze the subclass-level IPCs to which the patents of q belong, obtaining the IPC set $S_1$.
3. The method for recommending potential partners for a patentee according to claim 1, characterized in that step 1.2 is specifically performed as follows:
For each IPC in set $S_1$, count the patents of all patentees under this IPC, and rank the patentees in descending order of patent count.
4. The method for recommending potential partners for a patentee according to claim 1, characterized in that step 1.3 is specifically performed as follows:
From the ranking result of step 1.2, compute q's relative patent-count rank R(q) in the field represented by this IPC:
$$R(q) = \frac{\mathrm{Rank}(q)}{N}$$ (Formula one);
where $\mathrm{Rank}(q)$ is the rank of q, and N is the number of all patentees under this IPC;
If R(q) exceeds the set threshold, the field is a core research field of q; if R(q) does not exceed the set threshold, the field is a non-core research field of q.
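The relative rank of Formula one can be sketched in Python as below, assuming the patent counts of all patentees under one IPC are available as a dictionary. Rank 1 is given to the patentee with the most patents, as in step 1.2, so a smaller R(q) corresponds to a stronger position. Ties are broken by input order, which is an assumption of this sketch. The comparison with the threshold is left to the caller, to be applied as stated in the claim.

```python
def relative_rank(patent_counts, q):
    """Formula one: Rank(q) / N under a single IPC.

    patent_counts: dict patentee -> number of patents under this IPC.
    Patentees are ranked in descending order of patent count.
    """
    ranked = sorted(patent_counts, key=lambda p: patent_counts[p],
                    reverse=True)
    rank = ranked.index(q) + 1      # 1-based rank of q
    return rank / len(ranked)       # N = number of patentees under the IPC

# Hypothetical patent counts under one IPC subclass.
counts = {"Q": 50, "A": 120, "B": 10, "C": 5}
print(relative_rank(counts, "Q"))   # second of four patentees
```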
5. The method for recommending potential partners for a patentee according to claim 1, characterized in that step 1.4 is specifically performed as follows:
After every IPC in set $S_1$ has been analyzed, obtain the core research field set $S_2$ and the non-core research field set $S_3$ of q.
6. The method for recommending potential partners for a patentee according to claim 1, characterized in that step 2.1 is specifically performed as follows:
Analyze the patentees under each IPC in set $S_2$; if the patentee field of a patent document of q lists other patentees in addition to q, those patentees are regarded as having a cooperative relationship with q, which yields the set $P_1$ of q's existing partners in the core research fields; the other patentees whose patent count does not exceed a set threshold are filtered out, and the remaining patentees form the core research field candidate set $C_1$.
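The derivation of the existing-partner set and the candidate set in step 2.1 can be sketched in Python as follows. The sketch assumes each patent under the core IPCs is reduced to the set of names in its patentee field, and that existing partners are themselves excluded from the candidate set; the function name, the parameter `min_patents` and the sample data are hypothetical.

```python
def partners_and_candidates(patents, q, min_patents):
    """Split the patentees under the core IPCs into existing partners of q
    and recommendation candidates.

    patents: list of sets, each holding the patentees named on one patent.
    A patentee co-listed with q on any patent is an existing partner.
    Other patentees are kept as candidates only if their patent count
    exceeds min_patents.
    """
    partners = set()
    counts = {}
    for assignees in patents:
        for p in assignees:
            counts[p] = counts.get(p, 0) + 1
        if q in assignees:
            partners |= assignees - {q}
    candidates = {p for p, n in counts.items()
                  if p != q and p not in partners and n > min_patents}
    return partners, candidates

# Hypothetical patentee fields of seven patents.
patents = [{"Q", "K"}, {"Q"}, {"A"}, {"A"}, {"A"}, {"B"}, {"K"}]
partners, candidates = partners_and_candidates(patents, "Q", min_patents=1)
print(partners, candidates)
```

In this sample, K is an existing partner, B is dropped for having only one patent, and A remains as the sole candidate.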
7. The method for recommending potential partners for a patentee according to claim 1, characterized in that step 2.2 comprises the following sub-steps:
Step 2.21: collect the patent document set $D_{qv} = \{d_1, d_2, \ldots, d_n\}$ of v and q;
Step 2.22: represent the patent documents in space-vector form, as follows:
Any patent $d_i$ in the patent document set $D_{qv} = \{d_1, d_2, \ldots, d_n\}$ is represented by the space vector of a group of keywords; the process is to first perform Chinese word segmentation on all patent documents, then remove stop words using a custom or public stop-word dictionary, and finally compute the weight in the document of each term that remains after stop-word removal:
$$w(t_j, d_i) = \frac{\mathrm{tf}(t_j, d_i) \times \log(N / n_{t_j} + 0.01)}{\sqrt{\sum_{t_j \in d_i} \left[ \mathrm{tf}(t_j, d_i) \times \log(N / n_{t_j} + 0.01) \right]^2}}$$ (Formula two);
where $w(t_j, d_i)$ is the weight of term $t_j$ in text $d_i$, $\mathrm{tf}(t_j, d_i)$ is the frequency of term $t_j$ in text $d_i$, N is the total number of patents in the patent set $D_{qv}$, $n_{t_j}$ is the number of patent documents in $D_{qv}$ in which the term occurs, and the denominator is a normalization factor;
Finally, each patent document is represented by the space vector of its terms, in which each component $w_{ij}$ is the weight of term $t_j$ in patent document $d_i$;
Step 2.23: based on the space-vector representation of the documents, compute the pairwise similarity between the patent documents, as follows:
For any two patent documents $d_i$ and $d_j$, the similarity is measured by the cosine of the angle between their corresponding vectors:
$$\mathrm{sim}(d_i, d_j) = \frac{\sum_{l=1}^{n} w_l(d_i) \times w_l(d_j)}{\sqrt{\left(\sum_{l=1}^{n} w_l^2(d_i)\right) \times \left(\sum_{l=1}^{n} w_l^2(d_j)\right)}}$$ (Formula three);
where $w_l(d_i)$ is the weight of the $l$-th term in document $d_i$, and $w_l(d_j)$ is the weight of the $l$-th term in document $d_j$;
Step 2.24: cluster the patent texts according to the similarities computed in step 2.23; each resulting cluster is one technical topic, which yields the topic set $S_{qv}$.
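The term weighting of Formula two can be sketched in Python as below. The sketch assumes the documents have already been segmented into terms and stripped of stop words, and it uses the natural logarithm; because of the normalization in the denominator, the choice of logarithm base does not change the resulting weights. The function name and the sample tokens are hypothetical.

```python
import math
from collections import Counter

def tfidf_vectors(docs):
    """Normalized term weights per Formula two.

    docs: list of token lists (already segmented, stop words removed).
    Returns one dict term -> weight per document.
    """
    n_docs = len(docs)
    # n_t: number of documents in which each term occurs
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        raw = {t: f * math.log(n_docs / df[t] + 0.01) for t, f in tf.items()}
        norm = math.sqrt(sum(x * x for x in raw.values()))
        vectors.append({t: (x / norm if norm else 0.0)
                        for t, x in raw.items()})
    return vectors

# Hypothetical token lists for two patent documents.
docs = [["battery", "anode", "battery"], ["battery", "cathode"]]
vectors = tfidf_vectors(docs)
print(vectors[0])
```

In this sample, "battery" occurs in both documents and therefore receives a much smaller weight than "anode", even though it appears twice in the first document.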
8. The method for recommending potential partners for a patentee according to claim 1, characterized in that step 2.3 comprises the following sub-steps:
Step 2.31: according to the clustering result of step 2.24, compute the complementarity of q and v on any two distinct technical topics i and j:
$$g(q, v, i, j) = \begin{cases} \dfrac{n_{qi} - n_{vi}}{\mathrm{sum}_i} \times \dfrac{n_{vj} - n_{qj}}{\mathrm{sum}_j} & \text{if } (n_{qi} - n_{vi}) \times (n_{vj} - n_{qj}) > 0 \\ 0 & \text{otherwise} \end{cases}$$ (Formula four);
where $n_{qi}$ is the number of patents of q in topic i, $n_{vi}$ is the number of patents of v in topic i, $\mathrm{sum}_i$ is the total number of patents in topic i, $n_{qj}$ is the number of patents of q in topic j, $n_{vj}$ is the number of patents of v in topic j, and $\mathrm{sum}_j$ is the total number of patents in topic j;
Step 2.32: according to the result of step 2.31, compute the technical complementarity between q and v:
$$C(q, v) = \sum_{i, j \in S_{qv}} g(q, v, i, j)$$ (Formula five);
where $S_{qv}$ is the topic set formed by clustering, and i and j are two different technical topics.
9. The method for recommending potential partners for a patentee according to claim 1, characterized in that step 2.4 comprises the following sub-steps:
Step 2.41: count the patents of v under set $S_2$ and their filing dates, and compute the candidate patentee's R&D strength in the field from the patent count and the filing dates:
$$P(v) = \sum_{d \in S} e^{-\frac{t - t_d}{\lambda}}$$ (Formula six);
where S is the set of patent documents of v under set $S_2$, $t_d$ is the filing date of patent document d, t is the current time, and $\lambda$ is an adjustable parameter;
Step 2.42: count the number of inventors and the number of partners of v under set $S_2$;
Step 2.43: compute the likelihood that v becomes a partner of q:
$$\mathrm{Candidate}(v) = \alpha\, C(q, v) + \beta\, P(v) + \gamma\, n_1 + \tau\, n_2$$ (Formula seven);
where $\alpha$, $\beta$, $\gamma$ and $\tau$ are adjustable parameters, v is the candidate patentee, $n_1$ is the number of inventors of v under set $S_2$, and $n_2$ is the number of partners of v under set $S_2$.
10. The method for recommending potential partners for a patentee according to claim 9, characterized in that step 2.5 is specifically performed as follows:
Sort the results of step 2.43 in descending order and take the top K patentees as the potential partners of q in its core research fields.
11. The method for recommending potential partners for a patentee according to claim 1, characterized in that step 3.1 is specifically performed as follows:
Analyze the patentees under each IPC in set $S_3$; if the patentee field of a patent document of q lists other patentees in addition to q, those patentees are regarded as having a cooperative relationship with q, which yields the set $P_2$ of q's existing partners in the non-core research fields; the other patentees form the non-core research field candidate set $C_2$.
12. The method for recommending potential partners for a patentee according to claim 1, characterized in that step 3.2 is specifically performed as follows:
Collect the set $S_{qk}$ of patent documents under set $S_3$ in which q and a patentee k in set $P_2$ jointly hold the patent right, then compute their cooperation strength:
$$R(q, k) = \sum_{d \in S_{qk}} e^{-\frac{t - t_d}{\lambda}}$$ (Formula eight);
where t is the current time, $t_d$ is the filing date of patent document d in set $S_{qk}$, and $\lambda$ is an adjustable parameter.
13. The method for recommending potential partners for a patentee according to claim 1, characterized in that step 3.3 comprises the following sub-steps:
Step 3.31: represent each patentee as a vector, following the idea of text vectorization, using an n-dimensional vector of keywords. The set of all patent documents of c under set $S_3$ and the set of all patent documents of k under set $S_3$ are each treated as one super-document, $d_c$ and $d_k$, representing c and k respectively. The super-documents of c and of all patentees in set $C_3$ form the document set $D_{sim} = \{d_1, d_2, \ldots, d_n\}$. Any patent $d_i$ in $D_{sim}$ is represented by the space vector of a group of keywords, as follows: first perform Chinese word segmentation on the patent documents, then remove stop words using a custom or public stop-word dictionary, and finally compute the weight in the document of each term that remains after stop-word removal:
$$w(t_j, d_i) = \frac{\mathrm{tf}(t_j, d_i) \times \log(N / n_{t_j} + 0.01)}{\sqrt{\sum_{t_j \in d_i} \left[ \mathrm{tf}(t_j, d_i) \times \log(N / n_{t_j} + 0.01) \right]^2}}$$ (Formula two);
where $w(t_j, d_i)$ is the weight of term $t_j$ in text $d_i$, $\mathrm{tf}(t_j, d_i)$ is the frequency of term $t_j$ in text $d_i$, N is the total number of patents in the patent set $D_{sim}$, $n_{t_j}$ is the number of patent documents in $D_{sim}$ in which the term occurs, and the denominator is a normalization factor;
Finally, each patent document is represented by the space vector of its terms, in which each component $w_{ij}$ is the weight of term $t_j$ in patent document $d_i$;
Step 3.32: for any two patent documents $d_i$ and $d_j$, the similarity is measured by the cosine of the angle between their corresponding vectors:
$$\mathrm{sim}(d_i, d_j) = \frac{\sum_{l=1}^{n} w_l(d_i) \times w_l(d_j)}{\sqrt{\left(\sum_{l=1}^{n} w_l^2(d_i)\right) \times \left(\sum_{l=1}^{n} w_l^2(d_j)\right)}}$$ (Formula three);
where $w_l(d_i)$ is the weight of the $l$-th term in document $d_i$, and $w_l(d_j)$ is the weight of the $l$-th term in document $d_j$.
14. The method for recommending potential partners for a patentee according to claim 1, characterized in that step 3.4 is specifically performed as follows:
Based on the technical similarity between c and k and the cooperation strength between k and q, predict the cooperation strength between c and q:
$$P(q, c) = \max_{k \in P_2} \left( R(q, k) \times \mathrm{sim}(k, c) \right)$$ (Formula nine);
where q is the given patentee, k is a partner of q, c is the candidate, $P_2$ is the set of existing partners of q, $R(q, k)$ is the cooperation strength between q and k, and $\mathrm{sim}(k, c)$ is the technical similarity between k and c.
15. The method for recommending potential partners for a patentee according to claim 1, characterized in that step 3.5 comprises the following sub-steps:
Step 3.51: count the patents of c under set $S_3$ and their filing dates, and compute the candidate patentee's R&D strength in the field from the patent count and the filing dates:
$$P(c) = \sum_{d \in S} e^{-\frac{t - t_d}{\lambda}}$$ (Formula six);
Step 3.52: count the number of inventors and the number of partners of c under set $S_3$;
Step 3.53: compute the likelihood that c becomes a partner of q:
$$\mathrm{Candidate}(c) = \alpha\, P(q, c) + \beta\, P(c) + \gamma\, n_1 + \tau\, n_2$$ (Formula seven);
where $\alpha$, $\beta$, $\gamma$ and $\tau$ are adjustable parameters, c is the candidate patentee, $P(q, c)$ is the predicted cooperation strength computed in step 3.4, $P(c)$ is the R&D strength computed in step 3.51, $n_1$ is the number of inventors of c under set $S_3$, and $n_2$ is the number of partners of c under set $S_3$.
16. The method for recommending potential partners for a patentee according to claim 15, characterized in that step 3.6 is specifically performed as follows:
Sort the results of step 3.53 in descending order and take the top K patentees as the potential partners of q in its non-core research fields.
CN 201310215189 2013-05-31 2013-05-31 Method for recommending potential partners for patentee Pending CN103279535A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 201310215189 CN103279535A (en) 2013-05-31 2013-05-31 Method for recommending potential partners for patentee

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 201310215189 CN103279535A (en) 2013-05-31 2013-05-31 Method for recommending potential partners for patentee

Publications (1)

Publication Number Publication Date
CN103279535A true CN103279535A (en) 2013-09-04

Family

ID=49062054

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 201310215189 Pending CN103279535A (en) 2013-05-31 2013-05-31 Method for recommending potential partners for patentee

Country Status (1)

Country Link
CN (1) CN103279535A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109308331A (en) * 2018-07-25 2019-02-05 厦门海汇运通软件有限公司 A kind of recommended method and device of patent transaction
CN109829634A (en) * 2019-01-18 2019-05-31 北京工业大学 A kind of adaptive patent Research Team, colleges and universities recognition methods
CN109829634B (en) * 2019-01-18 2021-02-26 北京工业大学 Self-adaptive college patent and scientific research team identification method
CN109918420A (en) * 2019-03-18 2019-06-21 重庆摩托车(汽车)知识产权信息中心 A kind of rival's recommended method, server
CN109918420B (en) * 2019-03-18 2019-12-13 重庆摩托车(汽车)知识产权信息中心 Competitor recommendation method and server
CN111553583A (en) * 2020-04-24 2020-08-18 广东电网有限责任公司 Cooperative operator matching method and device for audit task
CN113362015A (en) * 2021-05-10 2021-09-07 北京大学 Patent data-based cooperative institution recommendation method and system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20130904
