CN103345581B

CN103345581B - Based on online from the Dynamic Network Analysis system and method for center model

Info

Publication number: CN103345581B
Application number: CN201310280241.1A
Authority: CN
Inventors: 李武军; 王灏; 过敏意
Original assignee: Shanghai Jiaotong University
Current assignee: Shanghai Jiaotong University
Priority date: 2013-07-04
Filing date: 2013-07-04
Publication date: 2016-12-28
Anticipated expiration: 2033-07-04
Also published as: CN103345581A

Abstract

The invention discloses a kind of based on online from the Dynamic Network Analysis system and method for center model, this system at least includes: object function sets up module, dynamically on the basis of center model, to need parameter beta and the topic proportion omegab of study_kObject function is set up as variable；The minimization of object function module, after a new events or a series of new events occur, utilizes alternative projection algorithm alternately to update parameter vector β and this topic proportion omegab of the study of these needs_k, it is thus achieved that the optimal solution of object function, the present invention is by being modeled with model parameter the topic feature of time-varying, so that the accuracy that model elapses prediction over time will not decline.

Description

Based on online from the Dynamic Network Analysis system and method for center model

Technical field

The present invention, about a kind of Dynamic Network Analysis system and method, particularly relates to one based on online from center model Dynamic Network Analysis system and method.

Background technology

Analysis of network, particularly Dynamic Network Analysis (Dynamic Network Analysis, i.e. DNA) are including society Science has seemed more and more important with biology in interior many fields.Although having had now many about dynamic network The work analyzed, but wherein most otherwise only focus on the large-scale data under the thickest fine granularity exactly, otherwise be exactly Only focus on the analysis of fine particle size in a network the least.In recent years, it is thus proposed that dynamically from center model (Dynamic Egocentric Model, i.e. DEM), this model is based on multivariate counting process and successfully to fine particle size Large-scale time-varying citation network be modeled.In general, in DEM original text, there is the mutation of two DEM: one the most right Chain feature is modeled, and chain feature is modeled with topic feature (text message) by another simultaneously.Due to the latter's Accuracy is easier to obtain, unless specifically indicated, in the present invention far above the text message of the former and an article DEM refer to the latter.Hereinafter DEM is simply introduced:

N is the sum of nodes (article).DEM attempt by each node i (i=1,2 ..., n) upper placement one Individual counting process N_iT () is to be modeled dynamic network.Wherein N_iT () represents tiring out of t deadline of " event " in node i Meter frequency.Here the definition of " event " context to be depended on.Such as, in citation network, one " event " can be right Answer and once quote.

Although the full probability of these counting processes can be maximized, release the model of a continuous time, but for drawing For network, it is clear that estimate that the parameter that those are relevant to time varying statistics can be more real by the method maximizing inclined probability Border.So DEM attempts to maximize the likelihood of following whole network:

L (β) = Π_{e = 1}^{m} \frac{\exp (β^{T} s_{i_{e}} (t_{e}))}{Σ_{i = 1}^{n} Y_{i} (t_{e}) \exp (β^{T} s_{i_{e}} (t_{e}))}, - - - (1)

Wherein m is the total degree quoting event, and e is the index every time quoting event, i_eExpression is cited in event e Article, t_eThe time that expression event e occurs, Y_iT the worthwhile node i of () exists at time t is for 1, is otherwise 0.s_i(t_e) represent Node i is at time t_eCharacteristic vector.β is the parameter vector needing study.

s_i(t_eVector in) can be divided into two classes.One class is referred to as " chain feature (statistic) ", another kind of referred to as " topic Feature ".8 chain features are had, including three preferential attachment statistics, three triangle in DEM Statistic and two out-path statistics.Additionally every article is extracted 50 also by the summary of article is run LDA Topic feature.More specifically, it is assumed that at time t_eThe article newly arrived is i, can be special with the topic of any existing article j calculated as below Levy:

Wherein θ_iRepresenting the topic ratio of article i, o is that the element between vector is multiplied item by item.

From the foregoing, it will be observed that s_i(t_e) be a vector containing 58 features, the most front 8 features are chain feature, after 50 Individual for topic feature.Accordingly, β is the parameter vector of a length of 58.

But, although during the prediction of dynamic network, DEM can be dynamically updated node and (represent literary composition in original text Chapter) chain feature, but the DEM parameter beta that learns out and topic feature θ_iBut it is fixing during prediction.Therefore, DEM As time goes on, it was predicted that accuracy can seriously decline, because actually topic feature and parameter all should be as Time change.Such as, one of chain feature of model is that the in-degree by certain time point node is (article cited secondary Number), As time goes on, the citation times of an article can become more and more, number of references in the most whole data set Distribution change as well as the time, such as a result, the parameter of this feature corresponding, even other parameters, also Should change.It addition, about topic feature, although at first sight, the topic feature of an article can change over time can Can seem somewhat inconceivable, because from the point of view of usually, the word of an article delivered all will not change over time, But, quote many articles of this article the most constantly in change.Therefore, reference information is combined with content of text information Determine that the topic feature of an article is more reasonable.Such as, one can in the 1950's about the article of neutral net Can be considered as and psychology or height correlation biology, but in today, it is the most more likely divided into about machine The article of device study, because there being more and more article delivered to refer to it in decades.It follows that the topic of an article Feature of course can change over time, simply the varying in size of amplitude.Due to cannot be to the parameter of time-varying and topic Parameter model, DEM and cannot well dynamic network be modeled accurately so that the accuracy predicted can along with time Between and decline.

Summary of the invention

For the deficiency overcoming above-mentioned prior art to exist, the purpose of the present invention is to provide one based on online from center die The Dynamic Network Analysis system and method for type, it is by being modeled with model parameter the topic feature of time-varying, so that Model elapses the accuracy of prediction over time and will not decline.

For reaching above and other purpose, the present invention propose a kind of based on online from the Dynamic Network Analysis system of center model System, at least includes:

Object function sets up module, dynamic on the basis of center model, to need parameter beta and the topic ratio of study ω_kObject function is set up as variable；

The minimization of object function module, after a new events or a series of new events occur, utilizes alternating projection to calculate Method alternately updates parameter vector β and this topic proportion omegab of these needs study_k, it is thus achieved that the optimal solution of object function.

Further, this object function is:

\begin{matrix} \min i m i z e & - \log L (β, ω) + λ Σ_{k = 1}^{n} | | ω_{k} - θ_{k} | |_{2}^{2} \end{matrix}

Wherein ω_kIt is the new topic ratio of node k to be learned, θ_kIt is topic ratio current for node k, Represent ω_kIn each element be non-negative, 1 is the vector that an element is all 1, these Restriction is used for ensureing ω_kIn all elements be all non-negative and element and be 1, λ is one and controls weight between two items Hyper parameter.

Further, this minimization of object function module includes:

β parameter more new module, uses Newton method undated parameter to need the parameter beta of study after fixing topic proportion omegab；

Topic ratio more new module 111, in actualite ratio θ after fixing Beta_kOn the basis of, minimize this target letter Number is to obtain the topic proportion omegab after updating_k。

Further, one is updated after this β parameter more new module and this topic ratio more new module quote event at every q time Secondary.

This β parameter more new module is after fixing ω, and the object function needing the parameter beta of study is as follows:

L (β) = Π_{e = x}^{x + q - 1} \frac{\exp (β^{T} s_{i_{e}} (t_{e}))}{Σ_{i = 1}^{n} Y_{i} (t_{e}) \exp (β^{T} s_{i} (t_{e}))},

First event during wherein x is mini-batch, q is the event number in mini-batch, and mini-batch is The event sets of accumulation.

Further, this topic ratio more new module the most only updates the topic proportion omegab of an article_k, updating ω_k Time, the topic ratio { ω of other articles_i| i ≠ k} keeps constant.

Further, the object function that this topic ratio more new module need to optimize is:

- l o g (Π_{i = 1}^{p} \frac{α_{i} \exp (a_{i}^{T} ω_{k})}{A_{i} + α_{i} \exp (a_{i}^{T} ω_{k})} Π_{u = p + 1}^{q} \frac{C_{u}}{B_{u} + γ_{u} \exp (b_{u}^{T} ω_{k})}) + λ | | ω_{k} - θ_{k} | |_{2}^{2},

Wherein,

α_{i} = \exp (β_{l}^{T} s_{k}^{l} (t_{e_{i}})),

γ_{u} = \exp (β_{l}^{T} s_{k}^{l} (t_{e_{u}})),

A_{i} = \underset{j &NotEqual; k}{Σ} Y_{j} (t_{e_{i}}) \exp (β^{T} s_{j} (t_{e_{i}})),

B_{u} = \underset{j &NotEqual; k}{Σ} Y_{j} (t_{e_{u}}) \exp (β^{T} s_{j} (t_{e_{u}})),

Further, this topic ratio more new module obtains approximate gradient, root according to the object function local derviation that need to optimize The approximate objective function of object function is obtained according to approximate gradient.

For reaching above-mentioned and other purpose, the present invention also provide for a kind of based on online from the Dynamic Network Analysis of center model Method, comprises the steps:

Step one, dynamic on the basis of center model, to need parameter vector β and the topic proportion omegab of study_kAs Variable sets up object function；

Step 2, after a new events or a series of new events occur, utilizing alternative projection algorithm alternately to update should Need parameter vector and the topic ratio of study, it is thus achieved that the optimal solution of object function.

Further, this object function is:

\begin{matrix} \min i m i z e & - \log L (β, ω) + λ Σ_{k = 1}^{n} | | ω_{k} - θ_{k} | |_{2}^{2} \end{matrix}

Wherein ω_kIt is the new topic ratio of node k to be learned, θ_kIt is topic ratio current for node k, Represent ω_kIn each element be non-negative, 1 is the vector that an element is all 1, and these restrictions are used for ensureing ω_kIn all elements be all non-negative and element and be 1, λ is one and controls the hyper parameter of weight between two items.

Further, this step 2 comprises the steps:

Step 1.1 uses Newton method undated parameter to need the parameter beta of study after fixing topic proportion omegab；

Step 1.2 after fixing Beta in actualite ratio θ_kOn the basis of, minimize this object function with obtain update after Topic proportion omegab_k；

Repeat step 1.1 and step 1.2 until meeting end condition.

Further, this step 2 updates once after quoting event at every q time.

Further, this step 1.1 is after fixing ω, and the object function needing the parameter beta of study is as follows:

L (β) = Π_{e = x}^{x + q - 1} \frac{\exp (β^{T} s_{i_{e}} (t_{e}))}{Σ_{i = 1}^{n} Y_{i} (t_{e}) \exp (β^{T} s_{i} (t_{e}))},

Further, this step 1.2 the most only updates the topic proportion omegab of an article_k, updating ω_kTime, other articles Topic ratio { ω_i| i ≠ k} keeps constant.

Further, in step 1.2, the object function that need to optimize is:

- l o g (Π_{i = 1}^{p} \frac{α_{i} \exp (a_{i}^{T} ω_{k})}{A_{i} + α_{i} \exp (a_{i}^{T} ω_{k})} Π_{u = p + 1}^{q} \frac{C_{u}}{B_{u} + γ_{u} \exp (b_{u}^{T} ω_{k})}) + λ | | ω_{k} - θ_{k} | |_{2}^{2},

Wherein,

α_{i} = \exp (β_{l}^{T} s_{k}^{l} (t_{e_{i}})),

γ_{u} = \exp (β_{l}^{T} s_{k}^{l} (t_{e_{u}})),

A_{i} = \underset{j &NotEqual; k}{Σ} Y_{j} (t_{e_{i}}) \exp (β^{T} s_{j} (t_{e_{i}})),

B_{u} = \underset{j &NotEqual; k}{Σ} Y_{j} (t_{e_{u}}) \exp (β^{T} s_{j} (t_{e_{u}})),

Further, in step 1.2, this object function local derviation that need to optimize is obtained approximate gradient, according to approximation ladder Degree obtains the approximate objective function of object function.

Compared with prior art, the present invention a kind of based on online from the Dynamic Network Analysis system and method for center model with The dynamic network of time-varying is modeled, by regularized learning algorithm model parameter over time and topic feature so that the present invention gram The shortcoming having taken DEM, it is to avoid the problem of the accuracy rate degradation over time that DEM exists.

Accompanying drawing explanation

Fig. 1 be the present invention a kind of based on online from the system architecture diagram of the Dynamic Network Analysis system of center model；

Fig. 2 be the present invention a kind of based on online from the flow chart of steps of the Dynamic Network Analysis method of center model；

Fig. 3 is the thin portion flow chart of steps of the step 202 of Fig. 2；

Fig. 4 is the Comparison of experiment results schematic diagram of the present invention.

Detailed description of the invention

Below by way of specific instantiation accompanying drawings embodiments of the present invention, those skilled in the art can Further advantage and effect of the present invention is understood easily by content disclosed in the present specification.The present invention also can be different by other Instantiation implemented or applied, the every details in this specification also can based on different viewpoints and application, without departing substantially from Various modification and change is carried out under the spirit of the present invention.

Fig. 1 be the present invention a kind of based on online from the system architecture diagram of the Dynamic Network Analysis system of center model.Such as Fig. 1 Shown in, present invention one, at least includes: object function is built from the Dynamic Network Analysis system of center model (OEM) based on online Formwork erection group 10 and the minimization of object function module 11.

Wherein, object function sets up module 10 dynamically on the basis of center model, with need the parameter vector β of study with Topic proportion omegab_kObject function is set up as variable.

Although intactly can learn from the set of whole article LDA (Latent Dirichlet allocation, Three layers of bayesian probability model), if but it is clear that the LDA model being directly used in line can consuming time very.Cause This, in the present invention, learn topic ratio θ again after first fixing topic.Because in citation network, even if some articles itself Topic changes over time than regular meeting, main topic be relatively stablize constant, so it is rational for doing so.

It should be noted that, in embodiments of the present invention, it is only necessary to update all in the time long every one Topic.From experiment it can be seen that do so still can reach good accuracy.

Therefore, in present pre-ferred embodiments, object function is:

\begin{matrix} \min i m i z e & - \log L (β, ω) + λ Σ_{k = 1}^{n} | | ω_{k} - θ_{k} | |_{2}^{2} \end{matrix}

Wherein ω_kIt is the new topic ratio of node k to be learned, θ_kIt is topic ratio current for node k,L The definition of (β, ω) is identical with the L (β) in the formula of DEM (1), except here all (β being noticed L as variable with topic ratio (β, ω) is different from L (β), and in L (β), only β is variable and ω is constant).Represent ω_kIn each unit Element is all non-negative, and 1 is the vector that an element is all 1, and these restrictions are used for ensureing ω_kIn all elements be all non-negative And element and be 1.λ is one and controls the hyper parameter of weight between two items.

The minimization of object function module 11, after a new events or a series of new events occur, utilizes alternating projection Algorithm (altemating projection) alternately updates the parameter vector β needing study and topic ratio, it is thus achieved that object function Optimal solution.

When a new events or a series of new events are observed, after the Section 2 in formula (2) can ensure to update Topic proportion omegab_kWill not be apart from current topic ratio θ_kThe most remote.In addition, the present invention uses old β to come more as initial value New β.

It is obvious that the optimization problem of formula (2) is not to combine convex to (β, ω).But may certify that this target Function is when a variable is fixed, and is convex about another one variable.Then the present invention devises an alternating projection calculation Method (altemating projection) is to find out the optimal solution of object function.Specifically, the minimization of object function module 11 Farther include: β parameter more new module 110 and topic ratio more new module 111, wherein, β parameter more new module 110, Yu Gu Using Newton method undated parameter to need the parameter beta of study after determining topic proportion omegab, that initialize is current β；Topic ratio More new module 111, in actualite ratio θ after fixing Beta_kOn the basis of, minimize after object function updates with acquisition Topic proportion omegab_k.β parameter more new module 110 and topic ratio more new module 111 generally require and are repeated several times by until meeting termination bar Part.

It should be noted that, each new article i occurs, can add it and use utilization after in former citation network at once β parameter more new module 110 and topic ratio more new module 111 are until restraining.But, this is for large-scale citation network It it is quite time-consuming.Therefore, in the present invention it is possible to just start after waiting new article to run up to some to update.This Mini-batch skill is possible not only to save the calculating time, and can reduce effect of noise.Therefore preferably in the present invention In embodiment, β parameter more new module 110 and topic ratio more new module 111 update once after quoting event at every q time rather than every Update once after secondary event.Q is set to about 1500 in an experiment

Specifically, β parameter more new module 110 is after fixing ω, and the object function needing the parameter beta of study is as follows:

L (β) = Π_{e = x}^{x + q - 1} \frac{\exp (β^{T} s_{i_{e}} (t_{e}))}{Σ_{i = 1}^{n} Y_{i} (t_{e}) \exp (β^{T} s_{i} (t_{e}))},

First event during wherein x is mini-batch, q is the event number in mini-batch.

In order to avoid update β time travel through all before event of quoting, the present invention used one training window so that The smaller subset considering to quote in event is had only to during training parameter β.If the width of training window is W_t(1≤W_t≤ Q), β can be learnt by optimizing following formula:

L_{w} (β) = Π_{e = x + q - W_{t}}^{x + q - 1} \frac{\exp (β^{T} s_{i_{e}} (t_{e}))}{Σ_{i = 1}^{n} Y_{i} (t_{e}) \exp (β^{T} s_{i} (t_{e}))} .

And the present invention can also cache the chain feature of each node to reduce computation burden further, as DEM institute Do.

The time will be extremely expended, topic ratio more new module owing to disposably updating all topic ratios in ω The 111 topic proportion omegab the most only updating an article_k, updating ω_kTime, the topic ratio { ω of other articles_i|i≠k} Keep constant.If in the mini-batch that size is q, node k is quoting event e₁, e₂..., e_pIn be cited and At time e_p+1, e_p+2..., e_qIt is not cited and (notices that the time of e2 generation is not necessarily at e_p+2Before, although the former subscript Little compared with the latter),

Here, the object function f (ω optimized is needed_k) it is:

- l o g (Π_{i = 1}^{p} \frac{α_{i} \exp (a_{i}^{T} ω_{k})}{A_{i} + α_{i} \exp (a_{i}^{T} ω_{k})} Π_{u = p + 1}^{q} \frac{C_{u}}{B_{u} + γ_{u} \exp (b_{u}^{T} ω_{k})}) + λ | | ω_{k} - θ_{k} | |_{2}^{2}, - - - (3)

Wherein

α_{i} = \exp (β_{l}^{T} s_{k}^{l} (t_{e_{i}})),

γ_{u} = \exp (β_{l}^{T} s_{k}^{l} (t_{e_{u}})),

A_{i} = \underset{j &NotEqual; k}{Σ} Y_{j} (t_{e_{i}}) \exp (β^{T} s_{j} (t_{e_{i}})),

B_{u} = \underset{j &NotEqual; k}{Σ} Y_{j} (t_{e_{u}}) \exp (β^{T} s_{j} (t_{e_{u}})),

Here, β_lComprise front 8 elements (correspond to chain feature) of parameter beta, β_tComprise rear 50 units of parameter beta Element (corresponding is topic feature), θ_iIt is to quote event e_iThe topic ratio of person who quote,It is to quote event e_iIn joint The chain feature (front 8 features) of some k, C_uIt it is one and ω_kUnrelated constant.

The single order of formula (3) is as follows with second order local derviation:

\begin{matrix} \frac{\partial f}{\partial ω_{k}} = - Σ_{i = 1}^{p} a_{i} + Σ_{i = 1}^{p} \frac{a_{i} α_{i} \exp (a_{i}^{T} ω_{k})}{A_{i} + α_{i} \exp (a_{i}^{T} ω_{k})} \\ + Σ_{u = p + 1}^{q} \frac{b_{u} γ_{u} \exp (b_{u}^{T} ω_{k})}{B_{u} + γ_{u} \exp (b_{u}^{T} ω_{k})} \\ + 2 λ (ω_{k} - θ_{k}), \end{matrix} - - - (4)

\begin{matrix} \frac{\partial^{2} f}{\partial ω_{k}^{2}} = Σ_{i = 1}^{p} \frac{A_{i} α_{i} a_{i} a_{i}^{T} \exp (a_{i}^{T} ω_{k})}{{(A_{i} + α_{i} \exp (a_{i}^{T} ω_{k}))}^{2}} \\ + Σ_{u = p + 1}^{q} \frac{B_{u} γ_{u} b_{u} b_{u}^{T} \exp (b_{u}^{T} ω_{k})}{{(B_{u} + γ_{u} \exp (b_{u}^{T} ω_{k}))}^{2}} + 2 λ I, \end{matrix}

Wherein I is unit matrix.

Can be seen that Hessian matrix normal Wishart distribution (PD) from formula above, therefore the function of (3) is convex.At this point it is possible to Solver is directly used to find globally optimal solution.

It is also preferred that the left in formula (4), A_iIt is much larger thanWithAnd p is in each batch The most relatively small.In like manner, B_uIt is much larger thanWithAnd (q-p) is the most relatively small.Therefore, in (4) Second will be much smaller than other two with Section 3.This means the ladder that can leave out less two to obtain an approximation Degree:

\frac{\partial f}{\partial ω_{k}} \approx - Σ_{i = 1}^{p} a_{i} + 2 λ (ω_{k} - θ_{k}) .

Based on approximate gradient above, the approximate objective function of (2) can be recovered:

\begin{matrix} \min i m i z e & - Σ_{i = 1}^{p} a_{i}^{T} ω_{k} + λ Σ_{k = 1}^{n} | | ω_{k} - θ_{k} | |_{2}^{2} \end{matrix}

The mutation of (5) this OEM is referred to as " approximation OEM " (approximative OEM) by the present invention, and by original OEM is referred to as " full OEM " (full OEM).In an experiment it appeared that approximation OEM can reach with expire the most close for OEM accuracy and Need the most a lot of time.

Fig. 2 be the present invention a kind of based on online from the flow chart of steps of the Dynamic Network Analysis method of center model.Such as Fig. 2 Shown in, the present invention a kind of based on online from the Dynamic Network Analysis method of center model, comprise the steps:

Step 201, dynamic on the basis of center model, to need parameter vector β and the topic proportion omegab of study_kAs Variable sets up object function.

In step 201, the object function of foundation is:

Step 202, after a new events or a series of new events occur, utilizes alternative projection algorithm (alternating projection) alternately updates the parameter vector β needing study and topic ratio, it is thus achieved that object function Optimal solution.

It is obvious that the optimization problem of formula (2) is not to combine convex to (β, ω).But may certify that this target Function is when a variable is fixed, and is convex about another one variable.Then the present invention devises an alternating projection calculation Method (alternating projection) is to find out the optimal solution of object function.More specifically, in each iteration, we fix In two variablees one and update another.Specifically, step 202 farther includes following steps (as shown in Figure 3):

Step 301, online β step (online β step): use Newton method undated parameter β after fixing ω, initialize It is current β；

Step 302, online topic step (online topic step): in actualite ratio θ after fixing Beta_kBasis On, minimize formula (2) to obtain the topic proportion omegab after updating_k。

Said process needs to be repeated several times by until meeting end condition.

It should be noted that, each new article i occurs, can add it and use utilization after in former citation network at once β parameter more new module 110 and topic ratio more new module 111 are until restraining.But, this is for large-scale citation network It it is quite time-consuming.Therefore, in the present invention it is possible to just start after waiting new article to run up to some to update.This Mini-batch skill is possible not only to save the calculating time, and can reduce effect of noise.Therefore preferably in the present invention In embodiment, update once after quoting event every q time rather than update once after each event.It is left that q is set to 1500 in an experiment Right

In online β step, after fixing ω, the object function needing the parameter beta of study is as follows:

L (β) = Π_{e = x}^{x + q - 1} \frac{\exp (β^{T} s_{i_{e}} (t_{e}))}{Σ_{i = 1}^{n} Y_{i} (t_{e}) \exp (β^{T} s_{i} (t_{e}))},

L_{w} (β) = Π_{e = x + q - W_{t}}^{x + q - 1} \frac{\exp (β^{T} s_{i_{e}} (t_{e}))}{Σ_{i = 1}^{n} Y_{i} (t_{e}) \exp (β^{T} s_{i} (t_{e}))} .

The time will be extremely expended owing to disposably updating all topic ratios in ω, in online topic step, if Count an algorithm alternately to update ω.More specifically, the most only update the topic proportion omegab of an article_k, updating ω_kTime, the topic ratio { ω of other articles_i| i ≠ k} keeps constant.If in the mini-batch that size is q, joint Point k is quoting event e₁, e₂..., e_pIn be cited and at time e_p+1, e_p+2..., e_qIt is not cited and (notes what e2 occurred Time is not necessarily at e_p+2Before, although the former subscript is little compared with the latter).

Need exist for the object function f (ω optimized_k) it is:

- l o g (Π_{i = 1}^{p} \frac{α_{i} \exp (a_{i}^{T} ω_{k})}{A_{i} + α_{i} \exp (a_{i}^{T} ω_{k})} Π_{u = p + 1}^{q} \frac{C_{u}}{B_{u} + γ_{u} \exp (b_{u}^{T} ω_{k})}) + λ | | ω_{k} - θ_{k} | |_{2}^{2}, - - - (3)

Wherein

α_{i} = \exp (β_{l}^{T} s_{k}^{l} (t_{e_{i}})),

γ_{u} = \exp (β_{l}^{T} s_{k}^{l} (t_{e_{u}})),

A_{i} = \underset{j &NotEqual; k}{Σ} Y_{j} (t_{e_{i}}) \exp (β^{T} s_{j} (t_{e_{i}})),

Here, β_lComprise front 8 elements (correspond to chain feature) of parameter beta, β_tComprise rear 50 units of parameter beta Element (corresponding is topic feature), θ i is to quote event e_iThe topic ratio of person who quote,It is to quote event e_iIn joint The chain feature (front 8 features) of some k, C_uIt it is one and ω_kUnrelated constant.

Wherein I is unit matrix.

It is preferred that in formula (4), A_iIt is much larger thanWithAnd p is in each batch The most relatively small.In like manner, B_uIt is much larger thanWithAnd (q-p) is the most relatively small.Therefore, in (4) Second will be much smaller than other two with Section 3.This means the ladder that can leave out less two to obtain an approximation Degree:

\frac{\partial f}{\partial ω_{k}} \approx - Σ_{i = 1}^{p} a_{i} + 2 λ (ω_{k} - θ_{k}) .

\begin{matrix} \min i m i z e & - Σ_{i = 1}^{p} a_{i}^{T} ω_{k} + λ Σ_{k = 1}^{n} | | ω_{k} - θ_{k} | |_{2}^{2} \end{matrix}

Owing in each iteration, the algorithm of study ensures that the value of object function always declines, and target function value is total Being greater than equal to 0, therefore the present invention is convergence.

Below by by the DEM of prior art and the OEM of the present invention being applied to two citation networks and comparing two moulds The experimental result of type illustrates the progressive of the present invention, the most also analyzes the differentiation of article topic ratio.

1, data set

Citation network analysis is one of most important application in Dynamic Network Analysis, the present invention test in, be two Data set arXiv-TH and arXiv-PH of individual citation network.Two data sets be all from arXiv (http: // Snap.stanford.edu/data) crawl.The main information of data set is shown in Table 1.

Table 1 data set information

ArXiv-TH data set is the series of articles theoretical about high-energy physics.The scope of time be 1993 to 1997 Year, this data set has the highest temporal analytical density (being accurate to millisecond).ArXiv-PH data set is about high-energy physics phenomenon Series of articles, time range is 1993 to 1997, and the time is accurate to every day.Due to the temporal analytical density in data set The highest, it can be assumed that every new article all the different time join in network and also the most at the same time in may have More than one quotes event.As previous joint is mentioned, mono-batch ground of a batch updates topic ratio and parameter.More Body ground, data set is divided into mini-batch one by one, comprises in a period of time in each mini-batch by the present invention Middle generation quote the time.Timestamp number in mini-batch each for arXiv-TH is 100, and for arXiv-PH is 20.The corresponding event number with each mini-batch is about 1500.

2, baseline

In an experiment, the performance of following 4 models is compared:

(1) DEM: the DEM having 8 chain features and 50 topic features originally.Notice that original DEM is not online (online), parameter and topic feature are fixing after training.

(2) OEM-β: the OEM only walked with online β, in this model, β can update in time but topic feature will not.

(3) OEM-full: with the full OEM of online β step with topic step, topic feature and parameter all can change over time Become, employ object function (2).

(4) OEM-appr: with the OEM of online β step with approximation topic step, topic feature and parameter all can change over time Become, employ object function (5).

3, evaluating standard

Similar with DEM, the present invention evaluates and tests model above by following three standards:

(1) average test log-likelihood (Average held-out log-likelihood): in each test Quote and i.e. can obtain after event takes log to the likelihoodL (β) in formula (1) testing log-likelihood.Will be all The test of test event be log-likelihood's and divided by the sum of event in this batch, i.e. can obtain average test log-likelihood.This numerical value is the highest, then explanation test accuracy is the highest.

(2) recall rate@K (Recall of top-K recommendation list): recall rate here is defined as K The individual most probable ratio quoting truly generation in event.Here K is a cut-off (cut-point).

(3) the regular ranking of average test (Average held-out normalized rank): the most each quote thing The ranking (rank) of part refers to this and quotes the physical location in the recommendation list sorted.This ranking is divided by possible Quote the ranking after the sum of event i.e. obtains normalization (normalize).This numerical value is the lowest, represents that estimated performance is the best.

4, result and analysis

Such as DEM, each data set is divided into three parts by the present invention: establishment stage, training stage and test phase.Set up Stage, typically its time range can be longer to alleviate truncation effect primarily to set up the statistic of citation network (the front time of quoting in 1993 does not appears in data set) also avoids bias.In the training stage, we train initially Model parameter and topic feature.In order to more comprehensively show and the estimated performance of comparison model, test phase ratio here Longer.Test phase is divided into 24 batch.Notice that statistic (chain feature) is all in training stage with test phase Can dynamically change.The size of data (representing with quoting event number) in each stage is as shown in table 2.

The foundation of table 2 data set, training, the segmentation of test phase

In order to reduce further OEM training with test time, only randomly selected in each batch a part time Event of quoting between is to optimize the topic ratio of article.Such as when optimizing the topic ratio of article i, arrive at the 1st batch After reaching, randomly select 10% person who quote (citer) of (being referred to as citer percentage ratio by 10% here, the most as the same) rather than complete Portion person who quote.This can to a certain degree speed-up computation.In OEM, if hyper parameter λ=0.1, if citer percentage ratio is 10%, remove Non-other explanation.The impact of model can be illustrated in ensuing experiment by hyper parameter citer percentage ratio with λ.

The details of the test process of OEM is as follows.First with the data of establishment stage and training stage train one initial OEM.The most now this initial OEM is equivalent to DEM.Then evaluate and test this model (to notice in the estimated performance of Batch 1 We do not use the data of Batch 1 when training).It is extra training number by the data absorption of Batch 1 the most again According to and update parameter and the feature of OEM.Then followed by using this OEM updated present to predict Batch 2.Thus may be used See, before testing some batch, be not used for training by the data of this batch.Therefore the result tested can be true Extensive/the predictive ability of ground reflection OEM.

Fig. 4 (a) and (b) are the average test log-likelihood of all models.Owing to initial OEM Yu DEM is Valency, it can be seen that all of model performance when test b atch 1 is all identical.But, As time goes on, The estimated performance of DEM can seriously decline, and each mutation of OEM then will not.Such as, from Fig. 4 (a) it can be seen that DEM Log-likelihood declines fairly obvious over time, and OEM-β simply drops to-8.97 from-8.24.OEM-full's is pre- Survey ability has exceeded above two models, and the scope of log-likelihood is-7.89 to-8.38.OEM-appr is then from-8.24 Drop to-8.56.

Fig. 4 (a) and (b) are the average test log-likelihood that event is quoted in test.(c) and (d) front K recommendation list In recall rate.E () and (f) are the regular ranking of average test.Owing to all of model is after establishment stage with training stage Initial parameter is identical, and they are identical in the performance of the 1st test batch.This from (a) to (f) it can be seen that.(g) with H () is to develop at the topic that the 8001st and the 8005th time point is two the article collection being cited.In order to prevent the mixed of image Disorderly, we only depict front several topics that ratio is the highest.

Fig. 4 (c) and (d) are the recall rates in front K recommendation list, K value 250.It appeared that DEM, OEM-β and OEM- The performance of appr declines the most over time, however OEM-full without.Although the estimated performance of OEM-appr as well as Time declines, but its performance is still significantly more than DEM.The performance of OEM-β is similar with DEM, the most undesirable.This means The quantity of information of topic feature is the biggest, and it is not nearly enough for being simply updated β.Note can also obtaining when K takes other values similar Result, not discussed here owing to length limits.

Fig. 4 (e) and (f) are the regular rankings of average test.It appeared that the performance of DEM Yu OEM-β cannot over time and Improve.OEM-full with OEM-appr is the most permissible.Notice that rank value is the lowest and mean that predictive ability is the highest.With above phase Seemingly, the undesirable effect of OEM-β further illustrates the renewal of the topic feature importance to this evaluating standard.Because more Arriving batch below, the event number of quoting of candidate can be the most, if by absolute ranking, the performance of DEM actually along with time Between and decline.But mbox{OEM-full} be but possible to prevent the decline of performance, even coming from the angle of absolute ranking See.This is consistent with the result of (d) with Fig. 4 (a), (b), (c).

Table 3 compares the calculation consumption of OEM and approximation OEM.As seen from table, although approximation OEM is than full OEM estimated performance slightly Difference, but but save the time of 50%.

The calculating time (second) of OEM-full Yu OEM-appr during table 3 λ=0.1

Table 4 citer percentage ratio is average test log-likelihood when 10%

Average test log-likelihood during table 5 λ=0.1

In order to study the hyper parameter (citer percentage ratio and the λ) impact on estimated performance, the present invention uses arXiv-TH data Collect and calculate citer percentage ratio and take the average test log-likelihood of all test batch during different value with λ.Result Refer to table 4 and table 5.As shown in Table 4,0.1 is the optimal value of λ.As can be seen from Table 5 after citer percentage ratio is more than 10%, in advance Survey performance is less along with the raising of citer percentage ratio, and time loss has greatly increased, it means that selection 10% is Citer percentage ratio is rational.

Sum it up, model OEM is for these hyper parameter insensitive.

The present invention have selected 2 article set from arXiv-TH data set and carrys out the topic differentiation of expository writing chapter.In order to keep away Exempt from the chaotic topic ratio to each article set to be averaged, figure only depicts average topic ratio.Due to topic number altogether There are 50, only have selected the topic that the ratio accounted for is maximum.Specifically, S is made_t={ r₁, r₂..., r_lRepresent and drawn at time t Article set (article in same set is quoted with an article).It is then article set S_tFlat All topic vectors.Here have selected S₈₀₀₁With S₈₀₀₅As the example of explanation, such as Fig. 4 (g) and (h).

Knowable to Fig. 4 (g), the ratio of topic 7 is (i.e.) with the ratio of topic 46 (i.e.) it is as time decline 's.But the ratio of topic 15Ratio with topic 44The most contrary.One explanation is that this is the 8001st The article set that individual time point is cited is originally about certain physical sub-field, but As time goes on, these The value of article is found that by the researcher in other sub-fields.After being refer to enough times by the article in other sub-fields again, The topic of this article set starts from topic in talk (topic 7 and topic 46) to new topic (topic 15 and topic 44) transfer.With The thing of sample can occur in the field (frontier) such as the field such as statistics, psychology (former field) and machine learning above.? Article set (the S that 8005 time points are cited₈₀₀₅) topic develop similar with the 8001st time point, such as Fig. 4 (h) institute Show.

In sum, the present invention a kind of based on online from the Dynamic Network Analysis system and method for center model with to time-varying Dynamic network be modeled, by regularized learning algorithm model parameter over time and topic feature so that instant invention overcomes DEM Shortcoming, it is to avoid the problem of the accuracy rate degradation over time that DEM exists, the experiment knot on two truthful data collection Fruit shows, the present invention can reach the most considerable estimated performance in actual applications.

Although the experiment of the present invention is only limitted to article citation network, as described in DEM, the present invention is readily adaptable for use in other classes The network of type, the present invention is not limited.

The principle of above-described embodiment only illustrative present invention and effect thereof, not for limiting the present invention.Any Above-described embodiment all can be modified under the spirit and the scope of the present invention and change by skilled person.Therefore, The scope of the present invention, should be as listed by claims.

Claims

1., based on online from a Dynamic Network Analysis system for center model, at least include:

Object function sets up module, dynamic on the basis of center model, to need parameter beta and the topic proportion omegab of study_kAs Variable sets up object function；

The minimization of object function module, after a new events or a series of new events occur, utilizes alternative projection algorithm to hand over For parameter vector β and this topic proportion omegab of updating the study of these needs_k, it is thus achieved that the optimal solution of object function；Wherein, this target letter Number is:

\min imize - \log L (β, ω) + λ Σ_{k = 1}^{n} {| | ω_{k} - θ_{k} | |}_{2}^{2}

Subject to: ω_k>=0,1^Tω_k=1,

Wherein ω_kIt is the new topic ratio of node k to be learned, θ_kIt is topic ratio current for node k,ω_k >=0 represents ω_kIn each element be non-negative, 1 is the vector that an element is all 1, and these restrictions are used for ensureing ω_k In all elements be all non-negative and element and be 1, λ is one and controls the hyper parameter of weight between two items, and n is network The sum of interior joint.

A kind of based on online from the Dynamic Network Analysis system of center model, it is characterised in that should The minimization of object function module includes:

Topic ratio more new module, in actualite ratio θ after fixing Beta_kOn the basis of, minimize this object function to obtain Topic proportion omegab after renewal_k。

A kind of based on online from the Dynamic Network Analysis system of center model, it is characterised in that: should β parameter more new module and this topic ratio more new module update once after quoting event at every q time.

A kind of based on online from the Dynamic Network Analysis system of center model, it is characterised in that should β parameter more new module is after fixing ω, and the object function needing the parameter beta of study is as follows:

L (β) = Π_{e = x}^{x + q - 1} \frac{\exp (β^{T} s_{i_{e}} (t_{e}))}{Σ_{i = 1}^{n} Y_{i} (t_{e}) \exp (β^{T} s_{i} (t_{e}))},

First event during wherein x is mini-batch, q is the event number in mini-batch, and mini-batch is accumulation Event sets, e is the index every time quoting event, i_eRepresent at the node that event e is formed, t_eExpression event e occur time Between, Y_i(t_e) represent that e quotes event when occurring, whether node i is present in network, there is i.e. Y_i(t_e)=1, otherwise Y_i (t_e)=0, S_i(t_e) represent that node i is at time t_eCharacteristic vector, β be need study parameter vector, β^TIt it is parameter vector Transposition.

A kind of based on online from the Dynamic Network Analysis system of center model, it is characterised in that: should Topic ratio more new module the most only updates the topic proportion omegab of an article_k, updating ω_kTime, the topic ratio of other articles Example { ω_i| i ≠ k} keeps constant.

6., based on online from a Dynamic Network Analysis method for center model, comprise the steps:

Step one, dynamic on the basis of center model, to need parameter vector β and the topic proportion omegab of study_kBuild as variable Vertical object function；

Step 2, after a new events or a series of new events occur, utilizes alternative projection algorithm alternately to update these needs The parameter vector of study and topic ratio, it is thus achieved that the optimal solution of object function；Wherein, this object function is:

\min imize - \log L (β, ω) + λ Σ_{k = 1}^{n} {| | ω_{k} - θ_{k} | |}_{2}^{2}

Subject to: ω_k>=0,1^Tω_k=1,

A kind of based on online from the Dynamic Network Analysis method of center model, it is characterised in that should Step 2 comprises the steps:

Step 1.2 after fixing Beta in actualite ratio θ_kOn the basis of, minimize after this object function updates with acquisition Topic proportion omegab_k；

Repeat step 1.1 and step 1.2 until meeting end condition.

A kind of based on online from the Dynamic Network Analysis method of center model, it is characterised in that: should Step 2 updates once after quoting event at every q time.

A kind of based on online from the Dynamic Network Analysis method of center model, it is characterised in that: should Step 1.1 is after fixing ω, and the object function needing the parameter beta of study is as follows:

L (β) = Π_{e = x}^{x + q - 1} \frac{\exp (β^{T} s_{i_{e}} (t_{e}))}{Σ_{i = 1}^{n} Y_{i} (t_{e}) \exp (β^{T} s_{i} (t_{e}))},

A kind of based on online from the Dynamic Network Analysis method of center model, it is characterised in that: This step 1.2 the most only updates the topic proportion omegab of an article_k, updating ω_kTime, the topic ratio { ω of other articles_i|i ≠ k} keeps constant.