CN105931122A - Ranking method of magic research communities in academic social network - Google Patents

Ranking method of magic research communities in academic social network Download PDF

Info

Publication number
CN105931122A
CN105931122A CN201610224907.5A CN201610224907A CN105931122A CN 105931122 A CN105931122 A CN 105931122A CN 201610224907 A CN201610224907 A CN 201610224907A CN 105931122 A CN105931122 A CN 105931122A
Authority
CN
China
Prior art keywords
community
research
research community
prime
worker
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN201610224907.5A
Other languages
Chinese (zh)
Inventor
张忠宝
苏森
王亚坤
徐嘉帅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing University of Posts and Telecommunications
Original Assignee
Beijing University of Posts and Telecommunications
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Posts and Telecommunications filed Critical Beijing University of Posts and Telecommunications
Priority to CN201610224907.5A priority Critical patent/CN105931122A/en
Publication of CN105931122A publication Critical patent/CN105931122A/en
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/01Social networking

Abstract

The invention discloses a ranking method of magic research communities in an academic social network and belongs to the data mining technical field. According to the ranking method, a time window is given, the magic research communities are found in the academic social network, wherein the magic research communities are attractive research communities which gradually become popular. The method specifically includes the steps of research community internal feature calculation, external feature FE calculation and ranking. The method of the invention can assist researchers to better understand and grasp current research trends and hotspots. According to the ranking method, hypotheses in existing researches can be broken through, namely, one person is only located at one community at one time point; and the characteristics of the magic research communities are extracted, and a unified algorithm is designed to rank potential popularity degrees of the research communities.

Description

The sort method of magic power research community in a kind of academic social networks
Technical field
The present invention is applied to find magic power research community in academic social networks, belongs to data mining technology field.
Background technology
In recent years, social network-i i-platform (such as: Facebook and Twitter) quickly grew.Meanwhile, social network analysis Also the extensive concern of academia is received.Academic network, as ingredient very important in social networks, also becomes research The research emphasis of personnel, sees list of references [1] J.Tang, J.Zhang, L.Yao, J.Li, L.Zhang, and Z.Su, “Arnetminer:extraction and mining of academic social networks,”in Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining.ACM,2008,pp. 990 998. lists of references [2] J.Tang, R.Jin, and J.Zhang, " A topic modeling approach and its integration into the random walk framework for academic search,”in Data Mining Eighth IEEE International Conference on.IEEE,2008,pp.1055–1060。
List of references [3] (G.Wang, Y.Zhao, X.Shi, and P.S.Yu, " Magnet community identification on social networks,”in Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining.ACM, 2012, pp.588 596.) propose the concept of magic power community and it It is applied to IT company and the sequence of financial company's captivation.This list of references [3] is intended to find the community that certain is popular, assumes simultaneously Being independent of each other between community, a people at a time can be only in a community.But in a lot of reality scenes (as Academic social networks), if the research worker of research identical content is regarded as a community, then find that those are the newest, The popular community of following meeting rather than those the most popular communities are more valuable.
Summary of the invention
It is an object of the invention to help research worker, particular without the research worker of experience, grind existing from the angle of macroscopic view The development studying carefully community has recognized, and helps research worker preferably to select the research topic of oneself.The present invention provides a kind of academic society Handing over the sort method of magic power research community in network, in described sort method, a given time window, at academic social networks Middle discovery magic power research community.Described magic power research community does not refer to those the most popular communities, but those are gradually The attractive research community come into vogue.The present invention is suitable for the application such as information retrieval and community's recommendation.
The sort method of magic power research community in the academic social networks that the present invention provides, specifically includes following steps:
The first step, research community internal feature calculates;
Select novelty degree as research community internal feature FC, jth research community CjNovel degree NjRefer to the list of community's theme Word frequency change from time window s to s+1, uses Nj∈FCRepresent;Research community CjNovel degree NjBe equivalent under theme every The sum of individual word novelty degree;
Research community CjNovel degree NjCalculate in the following manner:
N j = Σ v = 1 | W | CW j v · l o g | C | δ C W v · δ W v ( s + 1 ) δ W v ( s ) + δ W v ( s + 1 ) - - - ( 1 )
Wherein,Represent and comprise word WvResearch communityNumber,WithRepresent word W respectivelyvAt time window s With the number of times occurred in s+1;| W | is word WvQuantity, v=1,2 ..., | W |;| C | is to study community in research community network Quantity;
Second step, research community surface FECalculate;
Given research worker RiIn the community of time window s and s+1, distribution is respectivelyWithCalculation and Study personnel Ri Total transfer amount, Calculation and Study personnel RiFrom research community Cj'To research community CjTransfer amountWith transfer amount as matrix Element obtains transfer matrixTo each research worker RiTransfer matrixIt is added, obtains final transfer matrix T;
3rd step, sequence;
Weighted Directed Graph G=(C, E, the F of given research community networkC,FE), to arbitrary Cj∈ C, definition research community CjInhale Draw other research community Cj'Ability u of research worker attentionjj′:
u jj ′ = α · T jj ′ T O j + ( 1 - α ) · N j - - - ( 2 )
Wherein, α is weight parameter,It is research community CjTo research community Cj'Transfer matrix Tjj'Transposition, OjIt is to grind Study carefully community CjSize;
For arbitrary research community Cj∈ C, it is propagated attention and studies community C to otherj'Ability be defined as:
v jj ′ = T jj ′ O j . - - - ( 3 )
Based on formula (2) and formula (3), for two scores of each research community definition: front score PS and negative score NS; Front score weighs the captivation of research community from the angle attracting attention, and negative score is weighed from the angle propagating attention and ground Study carefully the captivation of community, for research community CjFront score PSjWith negative score NSjIt is defined as follows:
PS j = Σ j ′ = 1 | C | 1 | Z | u jj ′ NS j ′ NS j = Σ j ′ = 1 | C | 1 | Z ′ | v jj ′ PS j ′ - - - ( 4 )
Wherein,WithIt it is normalization factor;
Based on the two score, study community CjCaptivation grade AjCalculated by equation below:
Aj=PSj-NSj (5)。
It is an advantage of the current invention that:
(1) present invention proposes for the first time, how to find potential attractive research community, the most just in academic social networks Being the problem of magic power research community, the present invention can help research worker to be best understood from and hold current research tendency and focus;
(2) it is during the present invention breaches existing research it is assumed that i.e. one people is only in a community a moment;
(3) present invention has extracted the feature of magic power research community, and devises the unified algorithm potential popular journey to research community Degree is ranked up.
Accompanying drawing explanation
Figure 1A and Figure 1B is the inventive method with existing algorithm Indegree, PageRank and MIM respectively at two different numbers According to the comparison schematic diagram on collection.
Fig. 2 A and Fig. 2 B is that related algorithm is illustrated with HotRank, TrandRank contrast on two different pieces of information collection respectively Figure.
Fig. 3 is community network G=(C, E, a F given in embodimentC,FE) schematic diagram.
Detailed description of the invention
Below in conjunction with drawings and Examples, the present invention is described in further detail.
The present invention provides the sort method of magic power research community in a kind of academic social networks, is that one is found to have captivation research society The method (Attractive Research Community Ranking is called for short ARTRank) in district.Described attractive research society District is also referred to as magic power research community, present invention AiThe captivation grade of the magic power research community described in expression, Ai∈ A, A are institute Charm and study the captivation class set of community.Given research community network G=(C, E, FC, FE), then magic power research society Captivation class set A in district is defined as A=f (FC,FE), A is the internal feature F of research community networkCWith surface FE Copula.Definition research community CjWith research community Cj′Captivation grade be respectively AjAnd Aj', then research community CjRatio Research community Cj′More attractive and if only if Aj> Aj'.So finding the problem of magic power research community is a sequencing problem.
Research community network is defined as Weighted Directed Graph G=(C, E, F by the present inventionC,FE), wherein C represents research community Cj's Set, each research community CjBeing the research worker of one group of polymerization, E represents the set of link between research community, represents research people Member's transfer between research community.FCRepresent the internal feature of each research community, FEOutside for link between expression research community Portion's feature.
Present invention RiRepresent i-th research worker, use Latent Dirichlet Allocation (LDA) (list of references [4]: D. M.Blei,A.Y.Ng,and M.I.Jordan,“Latent dirichlet allocation,”the Journal of machine Learning Research, vol.3, pp.993 1022,2003. list of references [5]: T.L.Griffiths and M.Steyvers, " Finding scientific topics,”Proceedings of the National Academy ofSciences,vol.101,no.suppl 1,pp. 5228 5235,2004.) research worker is clustered by model.By cluster, each research worker can be expressed as one and lead Probability distribution in topic (research community).The result of LDA model can be expressed as two matrixes, respectively matrix RC and square Battle array CW:
Matrix RC is the matrix of | R | × | C |, and | R | is the quantity of the research worker in research community network, and | C | is research community Network is studied the quantity of community, is also the research community total in the set of research community.RCijIt it is research worker RiBelong to research Community CjProbability.I=1,2 ..., | R |, j=1,2 ..., | C |.
Matrix C W is the matrix of | C | × | W |, and | W | is word WvQuantity, v=1,2 ..., | W |.CWjvIt is word WvAssign Give research community CjProbability.
Based on above-mentioned definition, the invention provides the sort method of magic power research community in a kind of academic social networks, including grinding Study carefully community's internal feature calculating, surface calculates and the sequence of grade captivation, specifically comprises the following steps that
The first step, research community internal feature calculates.
The present invention selects novelty degree as research community internal feature FC.Jth research community CjNovel degree NjRefer to community master The frequency change from time window s to s+1 of the word of topic, uses Nj∈FCRepresent.The concept source of the novel degree of research community is in society Hand over the concept of the topic detection that happens suddenly in network event detection.In event detection, when the word the most more frequency of certain event topic When occurring, then this event is defined as accident and (sees list of references [6]: Q.Diao, J.Jiang, F.Zhu, and numerously E.-P.Lim,“Finding bursty topics from microblogs,”in Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics:Long Papers-Volume 1.Association for Computational Linguistics,2012,pp.536–544.).One research community having novel subject matter can attract to grind more Study carefully the attention of personnel.The definition of the novel degree according to research community, studies community CjNovel degree NjBe equivalent under theme every The sum of individual word novelty degree.
First, word WvResearch community C should be able to be represented welljTheme.Use word WvIt is assigned to study community Cj's Probability, namely CWjv, represent word WvTo research community CjImportance.Wherein CWjvObtained by LDA model. But this single standard is inadequate, such as word " network " occurs in multiple research communities, and word " social " only exists Some research community therein occurs, it is clear that than " network ", " social " can preferably represent that this studies community.So, this Invention uses IDF (inverse document frequency) value to weigh word W as supplementingvTo research community CjImportant Degree.According further to the definition of the novel degree of research community, word WvFrequency change can image study community CjNovelty Degree.Analyzed by above, study community CjNovel degree NjCan calculate in the following manner:
N j = Σ v = 1 | W | CW j v · l o g | C | δ C W v · δ W v ( s + 1 ) δ W v ( s ) + δ W v ( s + 1 ) - - - ( 1 )
Wherein,Represent and comprise word WvResearch communityNumber,WithRepresent word W respectivelyvAt time window s With the number of times occurred in s+1.
Second step, research community surface FECalculate.
Research worker transfer between research community reflects the development trend of research community, and the present invention divides the community of research worker From research community C in clothj'To research community CjChange be defined as research worker from research community Cj'To research community CjTurn Move.Given research worker RiIn the community of time window s and s+1, distribution is respectivelyWithGenerally research worker Ri Attention can from some research community transfer to other research community.Assume for research worker Ri, research community obtains Transfer amount come from those research communities losing concern in proportion.After the transfer matrix calculating each research worker, The transfer matrix of all research worker is added, and just obtains the transfer matrix between final research community, is research community outside special Levy FE
The generation method of described transfer matrix includes: (1) Calculation and Study personnel RiTotal transfer amount, (2) Calculation and Study personnel RiFrom Research community Cj'To research community CjTransfer amountTransfer matrix is obtained for matrix element with transfer amountAnd (3) To each research worker RiTransfer matrixIt is added, obtains final transfer matrix T.Concrete methods of realizing is as follows:
3rd step, sequence.
Weighted Directed Graph G=(C, E, the F of given research community networkC,FE), to arbitrary Cj∈ C, definition research community CjInhale Draw other research community Cj'Ability u of research worker attentionjj:
u jj ′ = α · T jj ′ T O j + ( 1 - α ) · N j - - - ( 2 )
Wherein, α is weight parameter,It is research community CjTo research community Cj'Transfer matrix Tjj'Transposition, OjIt is to grind Study carefully community CjSize, say, that the sort method of the present invention is to research community magnitude.Such a popular grinding Study carefully community and will obtain little contribution from other research communities, and the tribute that communities obtain is studied, from other, by attractive research community Offering a lot, because popular research community is the biggest, and attractive research community is the most relatively small.With Sample, when the ability of Calculation and Study Community Communication attention, it is also considered that research community size, to avoid finding those the least Research community.It is to say, for arbitrary research community Cj∈ C, it is propagated attention and studies community C to otherj'Energy Power can be defined as:
v jj ′ = T jj ′ O j . - - - ( 3 )
Based on formula (2) and formula (3), for two scores of each research community definition: front score PS and negative score NS. Front score weighs the captivation of research community from the angle attracting attention, and negative score is weighed from the angle propagating attention and ground Study carefully the captivation of community.For research community CjFront score PSjWith negative score NSjIt is defined as follows:
PS j = Σ j ′ = 1 | C | 1 | Z | u jj ′ NS j ′ NS j = Σ j ′ = 1 | C | 1 | Z ′ | v jj ′ PS j ′ - - - ( 4 )
Wherein,WithIt it is normalization factor.One research community has high front score PS and means that it studies community from other Obtain a lot of contributions, had high negative score NS on the contrary and mean that it is that a lot of contributions has been done by other research communities.One Can there be high front score PS and high negative score NS in individual popular research community simultaneously, and attractive research community has High front score PS and low negative score NS.Based on the two score, study community CjCaptivation grade AjPermissible Calculated by equation below:
Aj=PSj-NSj (5)
According to formula (4) and formula (5), the detailed step of described research community captivation grade sort method is as follows:
Embodiment:
The data set used in the present embodiment is from ArnetMiner (list of references [1], [2]).
Computer science data set: this data set comprises 2 relevant with computer science, 084,055 paper, and every paper comprises Exercise question, author, deliver time and summary etc..Author, exercise question and summary letter is extracted from the data of 2005 to 2010 years Breath, then obtains and 2005-2006, the data that two time windows of 2007-2008 are relevant, respectively contains 47565 works The relevant information of person.
Data fields conferencing data collection: this data set is the subset of computer science data set, these data take from nine top-level meetings (SIGMOD, KDD, VLDB, SIGIR, ICDE, CIKM, WWW, ICDM and WSDM), by pretreatment, Two time windows of 2005-2006 and 2007-2008 remain the information of 2399 authors respectively.
The present embodiment uses LDA model to carry out community's detection.On computer science data set, if number of topics | C | is 300, That is 300 research communities are found.On data fields conferencing data collection, if number of topics | C | is 50, namely want Find 50 research communities.Two Dirichlet hyper parameter in LDA model are set toWith 0.01.
The sort method that the present invention proposes compares with following four method.
Control methods 1: in-degree (Indegree): Indegree weighs the captivation of research community only by the amount of proceeding to of research worker.
Control methods 2:PageRank (sees list of references [7]: L.Page, S.Brin, R.Motwani, and T.Winograd, " The Pagerank citation ranking:Bringing order to the web. " 1999.): PageRank research worker between community A kind of ballot is regarded in transfer as, when weighing community's captivation, it not only allows for the quantity of transfer, it is also contemplated that the quality of transfer.
Control methods 3: magic power community discovery model (MIM) (sees list of references [3]): this model uses one based on PageRank Optimization Framework weigh the captivation of community, in an experiment, in this model that community's novelty degree factor is joined.
Control methods 4: trend sequence (TrendRank): TrendRank uses linear regression method, utilizes topic keyword every year Shared ratio generates the Trendline (list of references [8]: A.Hoonlor, B.K.Szymanski, and M. of theme popularity change J.Zaki,“Trends in computer science research,”Communications of the ACM,vol.56,no.10,pp. 74–83,2013.).With with time window 2007-2008,2009,2010 relevant data carry out trend sequence.Instinctively, more Attractive community more should have a good development trend in the near future.In the present embodiment using TrendRank as The standard of experiment.
The present embodiment uses recommends intensity (recommendation intensity) (to see list of references [9]: R.Hampel and M. Hauck,“Towards an effective use of audio conferencing in distance language courses,”Language Learning&Technology, vol.8, no.1, pp.66 82,2004.) as estimating standard, it is defined as follows:
R I ( L i ) @ k = 2 - | O r - O g | k L i ∈ T r e n d R a n k 0 L i ∉ T r e n d R a n k - - - ( 6 )
In formula (6), L is the sorted lists of front k the research community that certain sort method generates.OrIt is LiIn L Sorting position, OgIt is LiSorting position in TrendRank.This also implies that, if LiOccur in TrendRank's In the list of front k, and its sorting position is the most accurate, then LiRecommendation the highest will be obtained Intensity value.So the recommendation intensity of front k the sequence of L can be defined as:
R I ( L ) @ k = Σ L i ∈ L R I ( L i ) @ k - - - ( 7 )
Table 1 illustrates by front 15 research communities of PageRank, MIM, ARTRank and Trendrank sequence.According to Word keyword in theme is that these themes label, and the community adding bold Italic occurs from Trendrank sequence front 15 Community.Owing to the result of Indegree and PageRank is closely similar, so table 1 omits the result of Indegree.
It will be seen that the ARTRank method that the present invention provides has the biggest advantage than relevant method from table 1, ARTRank Front the 15 of sorted lists have 9 to occur in Trendrank, and MIM and PageRank only 5 and 3 respectively.Ratio It is the most attractive community in TrendRank as studied community " cloud ", only occurs in the sequence of 15 before ARTRank In result, and before PageRank and MIM, the ranking results of 15 does not occur.On data fields conferencing data collection, Have also been obtained similar result, owing to space limits, omit herein.
Except accuracy, the present invention also compares these methods quantized result under recommendation intensity, as Shown in Figure 1A and Figure 1B.ARTRank is substantially better than additive method, because InDegree only considered the quantity of transfer, this Sample it can tend to find those popular communities.Although PageRank considers the architectural feature of network, it can be looked for equally To the community that those are popular.Some is similar, this is because the present invention all ensures that often for the result of MIM and the result of the method for the present invention The amount of proceeding to of individual community wants the unnecessary amount of producing.But, the effect of ARTRank is still good than MIM because MIM be based on PageRank's, its purpose is the community finding a certain class popular, this target obviously and the present invention to find new gradually The target of popular research community is different.
15 research community sequence before table 1 computer science data set
It is an object of the present invention to find those attractive research communities rather than community that those are popular.Then, this Bright related algorithm and community temperature sort algorithm (HotRank) compare, and HotRank is based on the time of community's theme What community was ranked up by intensity (sees list of references [10]: X.Wang, C.Zhai, and D.Roth, " Understanding evolution of research themes:a probabilistic generative model for citations,”in Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining.ACM, 2013, pp.1115 1123. lists of references [11]: D.Zhou, X.Ji, H.Zha, and C.L.Giles, " Topic evolution and social interactions:how authors effect research,”in Proceedings of the 15th ACM international conference on Information and knowledge management.ACM,2006,pp.248–257.)。
Fig. 2 A and Fig. 2 B gives result of the comparison, and wherein the data of HotRank take from 2009.It will be seen that Indegree With the ranking results of PageRank and HotRank closer to, and and the ranking results difference of TrendRank far, say, that this Two algorithms are more likely to find popular community.But the sequence to community's temperature, the trend of community development can not be reflected, But because awfully hot community starting to go down hill still can in temperature sorts ranking the highest.The ranking results of ARTRank Then the most consistent with TrendRank, say, that the ranking results of ARTRank more can reflect the trend of community development.
Present invention could apply in academic social networks, research on utilization personnel are at the intercommunal transfer of difference research, Yi Jiyan Study carefully the subject novel degree of community, the captivation grade of research community is ranked up.ARTRank algorithm is with research worker At the article of different year as input, with study community captivation grade ranking for output.Such as, Fig. 3 gives one Embodiment.Community network in Fig. 3 is at the article content of different year by research worker, shifts square according to novelty degree and community Battle array algorithm generates, and the size of circle represents the size of community, and the value of square frame represents the novel degree of community, and the value on limit represents research The transfer amount of personnel.By the sort method of the present invention, obtain studying the captivation ranking of community: 4,1,2,3.Obviously grind Study carefully community 4 and have the theme of novelty, while attracting a lot of research worker, have again little people to leave, simultaneously the scale of community The least, it is clear that to be new and the most popular research community to be looked in the present embodiment.

Claims (3)

1. the sort method of magic power research community in an academic social networks, it is characterised in that comprise the steps:
The first step, research community internal feature calculates;
Select novelty degree as research community internal feature FC, jth research community CjNovel degree NjRefer to the list of community's theme Word frequency change from time window s to s+1, uses Nj∈FCRepresent;Research community CjNovel degree NjBe equivalent under theme every The sum of individual word novelty degree;
Research community CjNovel degree NjCalculate in the following manner:
N j = Σ v = 1 | W | CW j v · log | C | δ C W v · δ W v ( s + 1 ) δ W v ( s ) + δ W v ( s + 1 ) - - - ( 1 )
Wherein,Represent and comprise word WvResearch communityNumber,WithRepresent word W respectivelyvAt time window s With the number of times occurred in s+1;| W | is word WvQuantity, v=1,2 ..., | W |;| C | is to study community in research community network Quantity;
Second step, research community surface FECalculate;
Given research worker RiIn the community of time window s and s+1, distribution is respectivelyWithCalculation and Study personnel Ri Total transfer amount, Calculation and Study personnel RiFrom research community Cj'To research community CjTransfer amountWith transfer amount as matrix Element obtains transfer matrixTo each research worker RiTransfer matrixIt is added, obtains final transfer matrix T;
3rd step, sequence;
Weighted Directed Graph G=(C, E, the F of given research community networkC,FE), to arbitrary Cj∈ C, definition research community CjInhale Draw other research community Cj'Ability u of research worker attentionjj':
u jj ′ = α · T jj ′ T O j + ( 1 - α ) · N j - - - ( 2 )
Wherein, α is weight parameter,It is research community CjTo research community Cj'Transfer matrix Tjj'Transposition, OjIt is to grind Study carefully community CjSize;
For arbitrary research community Cj∈ C, it is propagated attention and studies community C to otherj'Ability be defined as:
v jj ′ = T jj ′ O j . - - - ( 3 )
Based on formula (2) and formula (3), for two scores of each research community definition: front score PS and negative score NS; Front score weighs the captivation of research community from the angle attracting attention, and negative score is weighed from the angle propagating attention and ground Study carefully the captivation of community, for research community CjFront score PSjWith negative score NSjIt is defined as follows:
{ PS j = Σ j ′ = 1 | C | 1 | Z | u jj ′ NS j ′ NS j = Σ j ′ = 1 | C | 1 | Z ′ | v jj ′ PS j ′ - - - ( 4 )
Wherein,WithIt it is normalization factor;
Based on the two score, study community CjCaptivation grade AjCalculated by equation below:
Aj=PSj-NSj (5)。
The sort method of magic power research community in a kind of academic social networks the most according to claim 1, it is characterised in that second The concrete calculation procedure of transfer matrix T final in step is as follows:
Step 1, each research worker Ri, i=1,2 ..., | R |, perform step 2-11;
Step 2, make research worker RiTotal transfer amount Q=0,
Step 3, for each research community Cj∈ C, performs step 4;
Step 4, given research worker RiIn the community of time window s and s+1, distribution is respectivelyWithIfPerform step 5;
Step 5, update total transfer amount
Step 6, for each research community Cj∈ C, performs step 7-10;
If step 7Perform step 8;
Step 8, for each research community Cj'∈ C, performs step 9;
If step 9Perform step 10;
Step 10, research worker RiFrom research community Cj' to CjTransfer amount:
T j ′ j ( R i ) = RC i j ( s + 1 ) - RC i j ( s ) Q · | RC ij ′ ( s + 1 ) - RC ij ′ ( s ) | ;
Step 11, to the transfer amount of each research worker Ri as transfer matrixElement, and by all of transfer matrix Add and, obtain final transfer matrix T.
The sort method of magic power research community in a kind of academic social networks the most according to claim 1, it is characterised in that the 3rd The detailed step studying community's captivation grade sort method in step is as follows:
Step 1, a given positive number ξ > 0, front score and negative score for each research community compose initial value PS respectively0With NS0
Step 2, (1,1,1,1) ∈ R|C|It is assigned to PS0;R|C|Represent that numerical value dimension is the vector space of | C |;
Step 3, (1,1,1,1) ∈ R|C|It is assigned to NS0
Step 4, repeated execution of steps 5 to step 9;
Step 5, use NSk-1Update PSk;Formula (4);
Step 6, use PSk-1Update NSk;Formula (4);
Step 7, standardization PSk
Step 8, standardization NSk
Step 9, k increase by 1;
Step 10, until meet condition | PSk-PSk-1| < ξ, and | NSk-NSk-1| < ξ.
CN201610224907.5A 2016-04-12 2016-04-12 Ranking method of magic research communities in academic social network Withdrawn CN105931122A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610224907.5A CN105931122A (en) 2016-04-12 2016-04-12 Ranking method of magic research communities in academic social network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610224907.5A CN105931122A (en) 2016-04-12 2016-04-12 Ranking method of magic research communities in academic social network

Publications (1)

Publication Number Publication Date
CN105931122A true CN105931122A (en) 2016-09-07

Family

ID=56838114

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610224907.5A Withdrawn CN105931122A (en) 2016-04-12 2016-04-12 Ranking method of magic research communities in academic social network

Country Status (1)

Country Link
CN (1) CN105931122A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108664661A (en) * 2018-05-22 2018-10-16 武汉理工大学 A kind of scientific paper recommendation method based on frequent theme collection preference

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108664661A (en) * 2018-05-22 2018-10-16 武汉理工大学 A kind of scientific paper recommendation method based on frequent theme collection preference
CN108664661B (en) * 2018-05-22 2021-08-17 武汉理工大学 Academic paper recommendation method based on frequent theme set preference

Similar Documents

Publication Publication Date Title
Panchenko et al. Russe: The first workshop on russian semantic similarity
Van Gysel et al. Unsupervised, efficient and semantic expertise retrieval
Wang et al. Studies on a multidimensional public opinion network model and its topic detection algorithm
Lin et al. Detecting fake news articles
CN106844665B (en) Thesis recommendation method based on reference relation distributed expression
Shekarpour et al. Keyword query expansion on linked data using linguistic and semantic features
Fengmei et al. FSFP: Transfer learning from long texts to the short
Gupta et al. Continuous space models for CLIR
CN107832319B (en) Heuristic query expansion method based on semantic association network
Qin et al. Feature selection for text classification based on part of speech filter and synonym merge
Pereira et al. Crisis Event Social Media Summarization with GPT-3 and Neural Reranking
Jia et al. Collaborative filtering recommendation of online learning resources based on knowledge association model
Kang et al. A short texts matching method using shallow features and deep features
CN105931122A (en) Ranking method of magic research communities in academic social network
Wang et al. A novel ontology analysis tool
Xu et al. Generating risk maps for evolution analysis of societal risk events
Ho et al. Information diffusion on complex networks: a novel approach based on topic modeling and pretopology theory
Niebler et al. Learning Word Embeddings from Tagging Data: A methodological comparison.
Hasanuzzaman et al. Hultech at the ntcir-11 temporalia task: Ensemble learning for temporal query intent classification
Mary et al. Jen-Ton: A framework to enhance the accuracy of aspect level sentiment analysis in big data
Chumachenko et al. Dynamics of topic formation and quantitative analysis of hot trends in physical science
Ban et al. CICPV: A new academic expert search model
Wang et al. Knowledge graph-based semantic ranking for efficient semantic query
Misale et al. A survey on recommendation system for technical paper reviewer assignment
Eykens et al. Clustering social sciences and humanities publications: Can word and document embeddings improve cluster quality

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication
WW01 Invention patent application withdrawn after publication

Application publication date: 20160907