KR102269932B1

KR102269932B1 - Method for identifying drug target molecules based on influence analysis of molecules in signaling networks

Info

Publication number: KR102269932B1
Application number: KR1020180154390A
Authority: KR
Inventors: 조광현; 이대원
Original assignee: 한국과학기술원
Priority date: 2018-12-04
Filing date: 2018-12-04
Publication date: 2021-06-28
Also published as: KR20200067481A

Abstract

신호전달네트워크의 출력노드의 제어를 위해 제어되어야 하는 제어표적노드의 결정방법을 공개한다. 이 방법은, 제1신호전달네트워크에 포함된 복수 개의 노드들 각각의 제1출력노드에 대한 인플루언스를 계산하여 상기 복수 개의 노드들 각각에 대응하는 복수 개의 인플루언스들을 산출하는 인플루언스 산출단계, 및 상기 복수 개의 인플루언스들의 값을 기초로 상기 제1출력노드에 대하여 미리 설정되어 있는 제어목표에 따라 상기 복수 개의 노드들 중 일부를 선택하여 상기 제1출력노드의 제어를 위한 제어표적노드로서 결정하는 제어표적노드 결정단계를 포함한다. 이때, 상기 복수 개의 노드들 중 임의의 제1노드의 상기 제1출력노드에 대한 인플루언스는 상기 제1노드의 활성도 변화가 상기 제1출력노드의 활성도의 변화에 미치는 영향의 정도를 나타낸다.A method of determining a control target node to be controlled for controlling an output node of a signaling network is disclosed. In this method, the influence of each of a plurality of nodes included in the first signaling network is calculated for a first output node to calculate a plurality of influences corresponding to each of the plurality of nodes. a calculation step, and a control for controlling the first output node by selecting some of the plurality of nodes according to a control target preset for the first output node based on the values of the plurality of influences and a control target node determining step of determining as a target node. In this case, the influence of any first node among the plurality of nodes on the first output node indicates the degree of influence of the change in the activity of the first node on the change in the activity of the first output node.

Description

Method for identifying drug target molecules based on influence analysis of molecules in signaling networks

본 발명은 컴퓨터를 이용한 정보처리 기술에 관한 것이며, 특히 세포 내 분자들 간의 신호전달에 의해 모델링되는 신호전달네트워크의 신호흐름에 관한 정보를 분석하는 기술에 관한 것이다.The present invention relates to information processing technology using a computer, and more particularly, to a technology for analyzing information about the signal flow of a signal transduction network modeled by signal transduction between molecules in a cell.

본 발명은 발명자가 제시한 새로운 개념들을 이용하여 실시될 수 있다. 따라서 이하, 상술한 새로운 개념들을 설명한다. 이 챕터에서 설명된 내용 중 모든 내용이 종래기술로서 취급되어서는 안 된다.The present invention can be practiced using the novel concepts presented by the inventor. Accordingly, the new concepts described above will be described below. Not all content described in this chapter should be treated as prior art.

도 1은 본 발명에서 다루는 신호전달네트워크에서의 생화학 반응(biochemical reaction) 및 유향부호링크(directed signed link)의 개념에 대하여 나타낸 것이다.1 shows the concept of a biochemical reaction and a directed signed link in the signaling network dealt with in the present invention.

본 명세서에서, '신호전달네트워크'는 다양한 신호전달분자 간의 상호작용으로 구성된 네트워크를 의미한다. 한 개의 신호전달네트워크에는 복수 개의 링크들이 포함될 수 있다. 이때, 각각의 링크는 방향성이 있는 상기 유향부호링크일 수 있다. 따라서 상기 신호전달네트워크 내에서 선택된 두 개의 노드(=신호전달분자)는 유향부호링크로 연결될 수 있다. 도 2는 복잡한 신호전달네트워크의 구조의 예를 도식화하여 제시한 것이다.As used herein, the 'signal transduction network' refers to a network composed of interactions between various signal transduction molecules. A single signaling network may include a plurality of links. In this case, each link may be the directional code link having a direction. Therefore, two selected nodes (=signaling molecules) in the signaling network can be connected by a directed code link. Figure 2 schematically presents an example of the structure of a complex signaling network.

세포 외부에서 특정 단백질이나 화합물이 세포막수용체(receptor)에 결합하거나 세포 내부에서 유전자 돌연변이나 약물에 의해 특정 신호전달분자의 기능이 영향을 받는 경우, 일련의 생화학 반응이 일어나면서 결과적으로 세포의 중요한 생리적 기능이 변하게 된다. 이때 발생하는 일련의 생화학 반응을 신호전달(signal transduction, signaling)이라고 지칭할 수 있다. When a specific protein or compound binds to a cell membrane receptor outside the cell, or the function of a specific signaling molecule is affected by a gene mutation or a drug inside the cell, a series of biochemical reactions occur, resulting in important physiological functions of the cell. function will change. A series of biochemical reactions occurring at this time may be referred to as signal transduction (signaling).

도 1의 (a)에 나타낸 생화학 반응은 단순히 인산기 결합 촉매에 의한 단백질의 활성화이지만, 이와 같은 생화학 반응이 연쇄적으로 일어나 세포의 중요한 기능이 변화되는 경우 이를 '신호전달'의 관점에서 해석할 수 있다. 즉, 활성화 된 단백질A가 단백질B에게 '신호를 전달한다'라고 볼 수 있으며 이를 도 1의 (a)의 우측에 나타낸 바와 같이 개념적으로 추상화하여 '유향부호링크'로 나타낼 수 있다. The biochemical reaction shown in FIG. 1 (a) is simply the activation of a protein by a phosphate group-bonding catalyst, but when such a biochemical reaction occurs in a chain and an important function of a cell is changed, it can be interpreted in terms of 'signal transduction'. have. That is, it can be seen that the activated protein A 'transmits a signal' to protein B, which can be conceptually abstracted as shown on the right side of FIG. 1 (a) and expressed as a 'direction link'.

생화학 반응에서는 비활성화형도 함께 나타내지만 (e.g., 인산기 P가 붙지 않은 단백질B) 유향부호링크에서는 주로 활성화형만 나타내게 된다. 신호전달에 관련된 수많은 생화학 반응은 유향부호링크의 형태로 정리하여 통합할 수 있으며, 그 결과로서 도 2에 나타낸 것과 같은 복잡한 신호전달네트워크의 구조를 얻을 수 있다. In the biochemical reaction, the inactivated form is also shown (e.g., protein B not attached to the phosphate group P), but only the activated form is mainly shown in the directed link. Numerous biochemical reactions related to signal transduction can be organized and integrated in the form of a directed link, and as a result, a complex signal transduction network structure as shown in FIG. 2 can be obtained.

본 발명의 일 양상에서 신호전달의 기본적인 형태는, 활성화 된 신호전달분자가 다른 신호전달분자를 활성화시키는 생화학 반응이다. 예를 들어, 단백질A 및 단백질B를 각각 상술한 신호전달분자의 예로 삼을 수 있다. 이때, 도 1의 (a)의 좌측에 나타낸 바와 같이, 활성화 된 단백질A는 ATP를 이용하여 단백질B에 인산기(phosphate group)가 결합하는 반응을 촉매함으로써 단백질B를 활성화 시킨다. 따라서 보통, 단백질B의 활성도는 활성화 된 단백질B (e.g., 단백질B의 인산화 형; phosphorylated form of protein B)의 농도로서 측정할 수 있다. 이와 같이 활성화 된 단백질은 다른 단백질의 활성화를 촉매하거나 세포핵으로 들어가 세포 생리에 중요한 유전자들의 발현을 활성화시키게 된다.In one aspect of the present invention, the basic form of signaling is a biochemical reaction in which an activated signaling molecule activates another signaling molecule. For example, protein A and protein B may be used as examples of the aforementioned signaling molecules, respectively. At this time, as shown on the left side of FIG. 1 (a), the activated protein A activates the protein B by catalyzing a reaction in which a phosphate group binds to the protein B using ATP. Therefore, usually, protein B activity can be measured as the concentration of activated protein B (e.g., phosphorylated form of protein B). The activated protein catalyzes the activation of other proteins or enters the cell nucleus to activate the expression of genes important for cell physiology.

상기 유향부호링크는 상기 신호전달네트워크 내에 두 개의 노드들 간의 관계를 나타내는 링크이다. 한 개의 유향부호링크에는 방향 및 부호에 관한 정보가 매핑될 수 있다. 제1유향부호링크의 소스는 상기 제1유향부호링크의 출발노드이며, 상기 제1유향부호링크의 타겟은 상기 제1유향부호링크의 도착노드이다. 즉, 유향부호링크의 방향은 소스로부터 타겟을 향하도록 정해진다. The directed code link is a link representing a relationship between two nodes in the signaling network. Information on a direction and a sign may be mapped to one directed code link. A source of the first directed link is a departure node of the first directed link, and a target of the first directed link is an arrival node of the first directed link. That is, the direction of the directed code link is determined from the source to the target.

도 1의 (a)의 우측에 나타낸 것과 같이, 유향부호링크의 부호는 소스가 타겟을 조절하는 방식에 따라 결정되는데, 상기 신호전달네트워크에서는 주로 두 가지 방식이 존재한다. 제1방식은 활성(activation; +) 방식이고, 제2방식은 억제(inactivation; -) 방식이다.As shown on the right side of Fig. 1 (a), the code of the directed code link is determined according to the method of the source controlling the target, and there are mainly two methods in the signaling network. The first mode is an activation (+) mode, and the second mode is an inactivation (-) mode.

상기 유향부호링크와 관련하여, 예컨대 실제 생화학 반응을 촉매하는 촉매 단백질을 상기 유향부호링크의 '원천(source)'으로, 촉매의 대상이 되는 단백질을 상기 유향부호링크의 '표적(target)'이라고 부를 수 있다. With respect to the directional code link, for example, a catalytic protein that catalyzes an actual biochemical reaction is referred to as a 'source' of the directional code link, and a protein to be catalyzed is referred to as a 'target' of the directional code link. can call

도 1의 (a)의 우측을 살펴보면, 표시된 각 유향부호링크에 대하여 노드A는 상기 유향부호링크의 원천 노드이고 노드B는 상기 유향부호링크의 표적 노드이다. 본 명세서에서 상기 원천 노드는 소스(source) 노드라고 지칭하고, 상기 표적 노드는 타겟(target) 노드라고 지칭할 수도 있다.Referring to the right side of FIG. 1A, for each indicated coded link, Node A is a source node of the directional coded link and Node B is a target node of the directional coded link. In this specification, the source node may be referred to as a source node, and the target node may also be referred to as a target node.

상기 소스 노드는, 유향부호링크의 출발점에 해당되는 노드이다. 상기 신호전달네트워크에서 소스 노드는 유향부호링크의 조절 방식에 따라 타겟 노드의 활성도를 증가시키거나 감소시키게 된다.The source node is a node corresponding to the starting point of the directed coded link. In the signaling network, the source node increases or decreases the activity of the target node according to the directed code link control method.

상기 타겟 노드는, 유향부호링크의 도착점에 해당되는 노드이다. 상기 신호전달네트워크에서 상기 타겟 노드는 상기 소스 노드에 의해 그 활성도가 변하게 된다.The target node is a node corresponding to the destination point of the directed coded link. In the signaling network, the activity of the target node is changed by the source node.

상기 신호전달분자는 활성화 된 상태(활성 상태)로 존재할 수 있으며 활성화 되었을 때 활성화되기 이전에는 관찰할 수 없는 다양한 기능을 수행할 수 있게 된다. 예를 들어, 활성화 된 신호전달분자는 다른 종류의 신호전달분자를 활성화 시킬 수 있으며, 혹은 유전자의 발현을 촉진하여 세포 생리에 변화를 가져올 수도 있다. The signaling molecule may exist in an activated state (active state), and when activated, it may perform various functions that cannot be observed before activation. For example, activated signaling molecules can activate other types of signaling molecules, or they can promote gene expression to bring about changes in cell physiology.

따라서 상기 활성도는, 활성 상태에 있는 신호전달분자의 '정도(extent)' 또는 '양(quantity)'를 나타내는 개념으로서, 다양하게 정의할 수 있다. 예를 들어, 활성도를 활성 상태에 있는 신호전달분자의 세포 내 농도(e.g., 몰농도(M))로 정의할 수 있으며, 또는 특정한 물리적 단위 없이 추상적인 변수(a variable with arbitrary unit)로도 정의할 수 있다.Accordingly, the activity is a concept representing the 'extent' or 'quantity' of the signaling molecule in the active state, and can be defined in various ways. For example, activity can be defined as an intracellular concentration (eg, molarity (M)) of a signaling molecule in an active state, or as an abstract variable (a variable with arbitrary unit) without a specific physical unit. can

예컨대 상기 활성도가 세포 내 농도로 정의되는 경우 상기 활성도는 0 이상의 값을 가질 수 있다. For example, when the activity is defined as an intracellular concentration, the activity may have a value of 0 or more.

예컨대 상기 활성도가 상기 세포 내 농도의 로그값으로 정의되는 경우 상기 활성도는 음수를 포함하는 실수값을 가질 수 있다. 이 경우 특별히 상기 활성도를 '로그활성도'라고 부를 수 있지만, 단순히 '활성도'라고 통칭할 수도 있다.For example, when the activity is defined as a log value of the intracellular concentration, the activity may have a real value including a negative number. In this case, in particular, the activity may be referred to as 'log activity', but may simply be referred to as 'activity'.

'신호(signal)'라는 용어는 분석하고자 하는 생체 시스템에 따라 다르게 해석될 수 있는 개념이다. 그런데 생물학적 신호전달 관점에서는 주로 "신호 전달이 가능한 형태의 신호전달분자의 농도"를 '신호의 강도' 또는 '신호의 세기'로 해석하거나 간주할 수 있다. The term 'signal' is a concept that can be interpreted differently depending on the biological system to be analyzed. However, from the viewpoint of biological signal transduction, it can be mainly interpreted or regarded as “concentration of signal transduction molecules capable of signal transduction” as 'signal strength' or 'signal strength'.

본 발명의 일 관점에 따르면, 신호전달의 생화학 반응식을 상기 유향부호링크로 추상화한 경우, 신호의 개념을 신호전달분자의 농도에 직접 대응시키는 대신에 '활성도(activity)'라는 좀 더 추상화 된 개념으로 다룰 수 있다. 즉, 유향부호링크를 통해 어떤 노드(=소스 노드)의 활성도(=신호)가 다른 노드(=타겟 노드)의 활성도(=신호)에 영향을 준다고 간주할 수 있으며, 이와 같이 하나의 신호가 다른 신호에 영향을 주는 현상을 '신호가 흐른다' 라고 표현할 수 있다. 다시 말해, 원천 노드로부터 표적 노드로 신호가 흐른다고 할 수 있다. According to one aspect of the present invention, when the biochemical reaction equation of signal transduction is abstracted with the directed link, the more abstract concept of 'activity' instead of directly corresponding the concept of signal to the concentration of signaling molecules can be dealt with as That is, it can be considered that the activity (=signal) of one node (=source node) affects the activity (=signal) of another node (=target node) through the directed code link, and in this way, one signal A phenomenon that affects a signal can be expressed as 'a signal flows'. In other words, it can be said that a signal flows from the source node to the target node.

본 발명에 따르면, 각각의 노드에 대하여 각각 서로 구분되는 신호가 매핑될 수 있다. 즉, 제1노드에 대하여 제1신호가 매핑되어 정의될 수 있으며, 상기 제1신호는 상기 제1노드의 활성도인 제1활성도를 나타낸 값일 수 있다. 또한, 제2노드에 대하여 제2신호가 매핑되어 정의될 수 있으며, 상기 제2신호는 상기 제2노드의 활성도인 제2활성도를 나타낸 값일 수 있다. According to the present invention, signals that are distinguished from each other may be mapped to each node. That is, the first signal may be defined by being mapped to the first node, and the first signal may be a value representing the first activity, which is the activity level of the first node. In addition, a second signal may be mapped to the second node and defined, and the second signal may be a value representing a second activity, which is the activity level of the second node.

즉, 본 발명에 다루는 신호전달네트워크에는 복수 개의 '신호'가 정의될 수 있다. 그리고 상기 각 신호는 각 노드마다 서로 다르게 정의된다. That is, a plurality of 'signals' may be defined in the signaling network covered by the present invention. In addition, each signal is defined differently for each node.

따라서 본 발명에서, 제1노드에 대응하는 신호인 제1신호는 상기 제1노드의 활성도인 것으로 정의될 수 있다.Therefore, in the present invention, the first signal corresponding to the first node may be defined as the activity of the first node.

상술한 바와 같이, 상기 신호전달네트워크에서 활성도는 특정 신호전달분자가 신호를 전달하기 위해 활성화 된 정도를 의미한다. 특정 분자의 활성도가 다른 분자의 활성도의 변화에 영향을 미치는데, 이는 특정 분자의 신호가 다른 분자의 신호를 높여주거나 낮춰주기 위해 '흘러간다' 라고 표현할 수 있다. As described above, activity in the signal transduction network refers to the degree to which a specific signaling molecule is activated to transmit a signal. The activity of a specific molecule affects the change in the activity of other molecules, which can be expressed as 'flowing' in order to increase or decrease the signal of another molecule.

본 발명에서 제시하는 '신호가 흐른다'라는 개념은, '제1신호의 크기의 변화가, 제2신호의 크기의 변화를 유발한다' 라는 개념으로 해석될 수도 있다. 또한 상술한 "원천 노드로부터 표적 노드로 신호가 흐른다"라는 것의 의미는 "원천 노드에 대응하는 신호(=원천 노드의 활성도)의 변화가 표적 노드의 신호(=표적 노드의 활성도)의 변화에 영향을 준다"라는 것을 의미할 수도 있다.The concept of 'a signal flows' presented in the present invention may be interpreted as a concept that 'a change in the magnitude of the first signal causes a change in the magnitude of the second signal'. In addition, the meaning of "a signal flows from the source node to the target node" as described above means "a change in the signal corresponding to the source node (= activity of the source node) affects the change in the signal of the target node (= activity of the target node)" It can also mean "to give".

본 발명에서 '신호흐름'(signal flow)은 어떤 신호전달분자의 활성도가 다른 신호전달분자의 분자의 활성도에 미치는 현상 또는 영향을 의미할 수 있다. 상기 신호흐름은 도 1의 (b)에 나타낸 바와 같이 정의되거나 또는 후술하는 수식 5와 같이 정의될 수 있다. In the present invention, 'signal flow' may refer to a phenomenon or an effect that the activity of one signaling molecule has on the activity of another signaling molecule. The signal flow may be defined as shown in (b) of FIG. 1 or may be defined as in Equation 5 to be described later.

즉, 도 1의 (b)에 나타낸 F_BA는 노드A로부터 노드B로의 신호흐름을 나타내는 기호이다. 본 발명에서, 신호흐름(F_BA)의 크기는 소스 노드인 노드A의 활성도의 크기에 비례하고, 또한 노드A를 소스 노드로 갖고 노드 B를 타겟 노드로 갖는 유향부호링크(L_BA)에 매핑되는 가중치(W_BA)에 비례하는 것으로 정의될 수 있다. 상기 가중치(W_BA)는 유향부호링크(L_BA)에 매핑되는 것으로 간주할 수도 있고, 또는 상기 신호흐름(F_BA)에 매핑되는 것으로 간주할 수도 있다. 이하, 상기 유향부호링크에 매핑되는 가중치를 유향부호링크의 가중치라고 지칭할 수도 있다. That is, F _BA shown in FIG. 1B is a symbol indicating the signal flow from the node A to the node B. In the present invention, _{the magnitude of the signal flow (F BA} ) is proportional to the magnitude of the activity of the node A, which is a source node, and is mapped _{to a directed code link (L BA ) having node A as a source node and node B as a target node.} It can be defined as being proportional to the weight W _{BA .} The weight W _BA may be regarded as being mapped to the directed coded link L _BA or may be regarded as being mapped to the signal flow F _BA . Hereinafter, a weight mapped to the directional coded link may be referred to as a directional coded link weight.

이와 같이, 상기 신호흐름(F_BA)은 유향부호링크의 가중치(W_BA)와 소스의 로그활성도(x_A)의 곱으로 정의된다(F_BA= W_BA × x_A). 도 1의 (c)에 나타낸 바와 같이, 이와 같은 신호흐름의 유형(type)은 링크의 가중치와 소스의 로그활성도의 부호 조합에 따라 4가지 존재할 수 있다. 이때, 상기 신호흐름의 부호는 유향부호링크의 부호와 다를 수 있다. 예를 들어, 소스의 로그활성도는 음의 값을 갖는 반면 유향부호링크가 양의 값을 가지면(즉, 타겟을 활성화 시켜주는 링크이면) 신호흐름은 음의 값을 갖게 된다.Thus, the signal flow (F _BA) is defined as the product of a directed link code weight (W _BA) and the activity log _(A x) of the source (F _BA _BA = W × x _A). As shown in (c) of FIG. 1 , there may be four types of such a signal flow according to the sign combination of the link weight and the log activity of the source. In this case, the code of the signal flow may be different from that of the directed code link. For example, the log activity of the source has a negative value, whereas if the directed code link has a positive value (ie, a link that activates the target), the signal flow has a negative value.

상술한 바와 같이, 소스의 로그활성도가 음의 값을 갖는다는 것은, 예컨대 소스 노드를 나타내는(representing) 활성 상태에 있는 신호전달분자의 세포 내 농도의 로그값 또는 농도로부터 추정되는 값의 로그값이 음의 값을 갖는다는 것을 의미할 수 있다.As described above, that the log activity of the source has a negative value means that, for example, the log value of the intracellular concentration of the signaling molecule in the active state representing the source node or the log value of the value estimated from the concentration is It may mean that it has a negative value.

본 발명은, 상기 신호전달네트워크의 구조정보는 주어져 있지만, 상기 가중치의 구체적인 값이 주어지지 않은 상황에서 상기 가중치들의 값을 추정하는 방법에 관한 것이다.The present invention relates to a method for estimating the values of the weights in a situation in which structural information of the signaling network is given but specific values of the weights are not given.

도 2에 나타낸 바와 같이, 상기 신호전달네트워크는 복수 개의 노드들, 및 복수 개의 유향부호링크들을 요소로 갖는 모델을 이용하여 모델링될 수 있다. 상기 신호전달네트워크의 각 요소의 값은 생물학적 실험데이터를 이용하여 결정할 수 있다. 즉, 상기 신호전달네트워크의 각 요소의 값을 나타내는 파라미터를 정의하고, 상기 파라미터의 값을 상기 실험데이터를 이용한 트레이닝 과정을 거쳐 피팅(fitting)할 수 있다. 상기 트레이닝을 위한 상기 실험데이터가 충분히 제공된다면, 상기 트레이닝을 통해 관심 있는 신호전달분자(=노드)의 활성도(=신호)를 결정할 수 있다.As shown in FIG. 2 , the signaling network can be modeled using a model having a plurality of nodes and a plurality of directed code links as elements. The value of each element of the signal transduction network may be determined using biological experimental data. That is, it is possible to define a parameter indicating a value of each element of the signal transmission network, and fit the parameter value through a training process using the experimental data. If the experimental data for the training is sufficiently provided, the activity (=signal) of the signaling molecule of interest (=node) can be determined through the training.

그러나 상기 복수 개의 노드들 및 복수 개의 유향부호링크들로 구성되는 토폴로지는 많이 연구되어 있지만, 생물학적 연구결과에 따른 상기 트레이닝을 위한 데이터가 부족하다는 문제가 있다.However, although the topology composed of the plurality of nodes and the plurality of directed link links has been studied a lot, there is a problem in that data for the training according to the biological research result is insufficient.

본 발명과 직접적 연관이 없더라도, 본 발명의 이해를 도울 수 있는 관련된 정보들은, 대한민국 특허출원번호 10-2012-0098296, Attractor Landscape Analysis Reveals Feedback Loops in the p53 Network That Control the Cellular Response to DNA Damage (www.SCIENCESIGNALING.org 20 November 2012 Vol 5 Issue 251 ra83), 및 Control and Coordination in Biochemical Networks (IEEE Control Systems Magazine, August 2004) 등에 공개되어 있다.Although not directly related to the present invention, related information that can help understand the present invention is provided in Korean Patent Application No. 10-2012-0098296, Attractor Landscape Analysis Reveals Feedback Loops in the p53 Network That Control the Cellular Response to DNA Damage (www .SCIENCESIGNALING.org 20 November 2012 Vol 5 Issue 251 ra83), and Control and Coordination in Biochemical Networks (IEEE Control Systems Magazine, August 2004).

지금까지 설명한 내용은 본 발명의 이해를 돕기 위한 배경지식으로서 제공한 것이며, 위 내용들 모두가 본 출원시점 이전에 공개된 기술인 것으로 인정될 수 없다.The contents described so far are provided as background information to help the understanding of the present invention, and all of the above contents cannot be recognized as being disclosed prior to the time of the present application.

본 발명에서는 신호전달네트워크의 출력노드의 제어를 위해 제어되어야 하는 제어표적노드를 결정하는 방법을 제공하고자 한다. 구체적으로는 임의의 제1노드의 활성도의 변화가 다른 제2노드의 활성도의 변화에 미치는 영향으로 정의되는 인플루언스를 이용하여 상기 제어 표적노드를 결정하는 방법을 제공하고자 한다. 여기서 상기 제어표적노드는 약물표적이 되는 노드를 의미한다. An object of the present invention is to provide a method for determining a control target node to be controlled in order to control an output node of a signal transmission network. Specifically, an object of the present invention is to provide a method for determining the control target node using influence, which is defined as the effect that a change in the activity of an arbitrary first node has on a change in the activity of another second node. Here, the control target node means a node that is a drug target.

본 발명에서는 생물학적 실험데이터가 결핍된 상황에서도, 상기 신호전달네트워크에 속한 관심 있는 신호전달분자(=노드)의 활성도(=신호)를 결정할 수 있도록, 상기 신호전달네트워크의 각 유향부호링크의 가중치를 추정하는 기술을 제공하고자 한다. In the present invention, in order to determine the activity (=signal) of a signaling molecule of interest (=node) belonging to the signaling network even in a situation in which biological experimental data is lacking, the weight of each directed code link of the signaling network is determined. We want to provide an estimation technique.

상기 가중치가 결정되면, 상기 신호흐름도 결정될 수 있다. 따라서 본 발명에서는, 생물학적 실험데이터가 결핍된 상황에서도, 상기 신호흐름을 추정하는 기술을 제공하고자 한다.When the weight is determined, the signal flow may also be determined. Therefore, in the present invention, it is an object to provide a technique for estimating the signal flow even in a situation in which biological experimental data is lacking.

본 발명의 일 관점에 따라 제공되는 알고리즘에서는, 특정 노드의 활성도(또는 신호)가 수식 1과 같이 활성도의 곱에 의해 결정되는 것으로 가정한다.In the algorithm provided according to an aspect of the present invention, it is assumed that the activity (or signal) of a specific node is determined by the product of the activity as shown in Equation 1.

[수식 1][Formula 1]

여기에서 a_i(t)∈R는 시간 t일 때 노드 i의 활성도, a_b(i)∈R는 노드 i의 기저활성도 (또는 기저신호)를 나타내며, α∈R는 하이퍼 파라미터이다. 수식 1의 양변에 로그를 취하면 수식 2와 같이 선형 차분방정식으로 좀 더 단순하게 나타낼 수 있다.Here, a _i (t) ∈ R denotes the activity of node i at time t, a _b (i) ∈ R denotes the basal activity (or base signal) of node i, and α ∈ R is a hyperparameter. By taking the logarithm of both sides of Equation 1, it can be expressed more simply as a linear difference equation as shown in Equation 2.

[수식 2][Formula 2]

여기에서 x는 log(a)∈R^N을 나타내고, b는 log(a_b)∈R^N을 나타내고, 그리고 W∈R^NㅧN 는 가중치행렬(weight matrix)을 나타낸다. 상기 신호전달네트워크가 시간에 따라 안정화된 상태, 즉 시간에 따라 상기 각 신호의 변화가 없는 정상 상태(steady-state)를 가정하면, 수식 2로부터 정상 상태에서의 x의 값을 구할 수 있다. 정상 상태에서의 x의 값은 아래 수식 3에 의해 산출될 수 있다.Where x is the log (a) shows a ∈R ^N, b is log (a _b) shows a ∈R ^N, ^{N ㅧ} W∈R and ^N represents the weight matrix (weight matrix). Assuming that the signal transmission network is in a stable state over time, that is, a steady-state in which there is no change in each signal over time, the value of x in a steady state can be obtained from Equation 2 . The value of x in a steady state can be calculated by Equation 3 below.

[수식 3][Equation 3]

여기에서 x_s∈R^N는 정상 상태일 때의 x의 값을 나타낸다. Here, x _s ∈R ^N represents the value of x in a steady state.

본 발명의 일 관점에 따라 제공되는 알고리즘은 유향부호링크의 가중치를 신호전달네트워크의 구조 정보만을 이용하여 계산함으로써 신호흐름을 추정하고, 주요 신호전달분자의 활성도 변화의 방향을 예측하는 것을 목표로 한다. 따라서 가중치를 네트워크의 구조 정보만을 이용하여 추정하는 것은 중요한 과정 중 하나이며 이는 수식 4와 같다.The algorithm provided according to an aspect of the present invention aims to estimate the signal flow by calculating the weight of the directed code link using only the structural information of the signaling network, and predict the direction of change in the activity of major signaling molecules. . Therefore, estimating the weight using only the structure information of the network is one of the important processes, which is shown in Equation 4.

[수식 4][Equation 4]

여기에서 A∈R^NㅧN 는 인접행렬이며, D_in와 D_out는 대각행렬로서 각각 i번째 노드의 입력링크수 (in-degree; 인디그리)와 출력링크수(out-degree; 아웃디그리) 값을 원소로 갖는다. i번째 노드의 입력링크수는 인접행렬 A의 i번째 행의 값을 모두 더함으로써 구할 수 있으며 (D_in), i번째 노드의 출력링크수는 인접행렬 A의 i번째 열의 값을 모두 더함으로써 구할 수 있다 (D_out).Here, A∈R ^N ㅧN is an adjacency matrix, and D _in and D _out are diagonal matrices, respectively, the number of input links (in-degree; in-degree) and number of output links (out-degree) of the i-th node. has a value as an element. The number of input links of the i-th node can be obtained by adding all the values of the i-th row of the adjacent matrix A (D _in ), and the number of output links of the i-th node can be obtained by adding all the values of the i-th column of the adjacent matrix A can (D _out ).

본 발명의 일 관점에 따라 제공되는 알고리즘은 세포막수용체를 통해 전달되는 입력 신호(i.e., input signal)나 특정 신호분자의 섭동(주로 약물에 의한 저해)에 의해 변화되는 신호흐름 F를 추정하고 정상상태에서의 생체 분자의 활성도를 예측하는 것을 목표로 한다. 신호의 흐름을 나타내는 행렬 F의 (i, j) 원소인 F _ij 는 도 1의 (b) 및 수식 5에 나타낸 바와 같이 링크의 가중치와 로그활성도의 곱으로 나타낼 수 있다.The algorithm provided according to one aspect of the present invention estimates the signal flow F changed by the input signal (ie, input signal) transmitted through the cell membrane receptor or the perturbation of a specific signal molecule (mainly inhibition by a drug), and the steady state It aims to predict the activity of biomolecules in F _{ij, which} is the (i, j) element of the matrix F representing the signal flow, can be expressed as the product of the link weight and log activity, as shown in FIG. 1 (b) and Equation 5.

[수식 5][Equation 5]

신호흐름을 나타내는 행렬 F는 시간 t에 따라 변할 수 있다. 왜냐하면 수식 5에서와 같이 F는 x(t)에 의존적이기 때문이다. 이때, W는 시간에 따라 변하지 않는다고 가정한다. The matrix F representing the signal flow may change with time t. This is because, as in Equation 5, F is dependent on x(t). In this case, it is assumed that W does not change with time.

그러나 정상상태에서는 x가 시간에 따라 변하지 않으며 그에 따라 F 또한 시간에 따라 변하지 않게 된다. F는 양 또는 음의 값을 가질 수 있는데 이는 x가 활성도 a의 로그값이기 때문이다. 즉, 음의 부호를 갖는 유향부호링크에서 신호 원천 노드의 활성도가 감소하는 경우 결과적으로 신호흐름 F_ij는 양의 값을 갖게 된다. 양의 값을 갖는 F는 결과적으로 표적 신호전달분자의 활성도를 높여주는 역할을 하게 된다. 유향부호링크의 가중치와 원천 노드의 활성도에 의해 결정되는 신호흐름의 부호는 유향부호링크의 부호와 다를 수 있다.However, in the steady state, x does not change with time, and therefore F does not change with time. F can be positive or negative because x is the logarithm of activity a. That is, when the activity of the signal source node decreases in the directed code link having a negative sign, as a result, the signal flow F _ij has a positive value. As a result, F having a positive value plays a role in increasing the activity of the target signaling molecule. The sign of the signal flow determined by the weight of the directed code link and the activity of the source node may be different from the sign of the directed code link.

본 발명의 일 관점에 따라, 복수 개의 노드들 및 복수 개의 링크들을 포함하여 정의되는 신호전달네트워크에 있어서, 상기 각각의 링크에 매핑되는 가중치를 산출하는 방법을 제공할 수 있다. 이 방법은, 컴퓨팅장치가, 상기 각 링크에 연결된 소스 노드 및 타겟 노드에 관한 정보, 및 상기 각 링크에 매핑된 가중치의 부호에 관한 정보를 포함하는, 상기 신호전달네트워크의 구조정보를 획득하는 단계; 및 상기 컴퓨팅장치가, 상기 구조정보를 이용하여 상기 각 링크에 매핑된 가중치의 값을 생성하는 가중치 생성단계를 포함한다. According to one aspect of the present invention, in a signaling network defined including a plurality of nodes and a plurality of links, a method for calculating a weight mapped to each link may be provided. The method includes the steps of: obtaining, by a computing device, structural information of the signaling network, including information about a source node and a target node connected to each link, and information about a sign of a weight mapped to each link ; and a weight generation step of generating, by the computing device, a value of a weight mapped to each link by using the structure information.

이때, 상기 가중치 생성단계는, 상기 각 링크에 대하여, 상기 링크에 매핑되는 가중치의 값을, 상기 링크의 소스 노드를 소스로 하는 제1세트의 링크들의 개수인 제1개수, 및 상기 링크의 타겟 노드를 타겟으로 하는 제2세트의 링크들의 개수인 제2개수의 대푯값으로 결정하는 단계를 포함할 수 있다. In this case, the weight generating step includes, for each link, a value of a weight mapped to the link, a first number that is the number of a first set of links using a source node of the link as a source, and a target of the link determining the node as a representative value of a second number that is the number of links in a second set of targets.

이때, 상기 가중치 생성단계는, 상기 각 링크에 대하여, 상기 링크에 매핑되는 가중치의 값을, 상기 링크의 소스 노드를 소스로 하는 제1세트의 타겟 노드들의 개수인 제1개수, 및 상기 링크의 타겟 노드를 타겟으로 하는 제2세트의 소스 노드들의 개수인 제2개수의 대푯값으로 결정하는 단계를 포함할 수 있다. In this case, the weight generation step includes, for each link, a value of a weight mapped to the link, a first number that is the number of target nodes in a first set using a source node of the link as a source, and The method may include determining the target node as a representative value of a second number that is the number of source nodes of a second set as a target.

이때, 상기 대푯값은 상기 제1개수 및 상기 제2개수의 기하평균의 역수인 것일 수 있다.In this case, the representative value may be a reciprocal of a geometric mean of the first number and the second number.

이때, 상기 가중치 생성단계는, 상기 각 링크에 대하여, 상기 링크에 매핑되는 가중치의 값을, 상기 링크의 소스 노드를 소스로 하는 제1세트의 링크들의 개수인 제1개수의 역수인 값으로 결정하거나, 또는 상기 링크의 타겟 노드를 타겟으로 하는 제2세트의 링크들의 개수인 제2개수의 역수인 값으로 결정하는 단계;를 포함할 수 있다.In this case, in the weight generation step, for each link, a value of a weight mapped to the link is determined as a value that is an inverse of a first number that is the number of a first set of links using a source node of the link as a source. or, determining the target node of the link as a value that is an inverse number of a second number that is the number of links in the second set as a target.

이때, 상기 가중치 생성단계는, 상기 각 링크에 대하여, 상기 링크에 매핑되는 가중치의 값을, 상기 링크의 소스 노드를 소스로 하는 제1세트의 타겟 노드들의 개수인 제1개수의 역수인 값으로 결정하거나, 또는 상기 링크의 타겟 노드를 타겟으로 하는 제2세트의 소스 노드들의 개수인 제2개수의 역수인 값으로 결정하는 단계;를 포함할 수 있다.In this case, in the weight generation step, for each link, the value of the weight mapped to the link is set to a value that is the reciprocal of a first number that is the number of target nodes in a first set using the source node of the link as a source. or determining the target node of the link as a value that is the reciprocal of a second number that is the number of source nodes of a second set as a target.

이때, 상기 가중치 생성단계는, 상기 구조정보를 이용하여 인접행렬을 생성하는 단계; 상기 인접행렬로부터 대각행렬을 생성하는 단계; 및 상기 인접행렬과 상기 대각행렬을 행렬곱 연산하여 생성한 가중치 행렬을 이용하여 상기 각 링크에 매핑된 가중치의 값을 산출하는 단계를 포함할 수 있다.In this case, the weight generating step may include: generating an adjacency matrix using the structure information; generating a diagonal matrix from the adjacency matrix; and calculating a weight value mapped to each link using a weight matrix generated by performing a matrix multiplication operation on the adjacent matrix and the diagonal matrix.

이때, 상기 방법은, 상기 생성된 각 가중치의 값을 이용하여 각 노드의 활성도에 관한 값을 산출하는 활성도 산출단계를 더 포함할 수 있다.In this case, the method may further include an activity calculation step of calculating a value related to the activity of each node by using the generated value of each weight.

이때, 상기 방법은, 상기 각각의 링크들에 대하여 산출된 가중치와 상기 각각의 노드들에 대하여 산출된 활성도를 이용하여, 상기 신호전달네트워크의 신호흐름을 결정하는 단계를 더 포함할 수 있다.In this case, the method may further include determining a signal flow of the signaling network by using the calculated weight for each of the links and the calculated activity for each of the nodes.

본 발명의 다른 관점에 따라, 복수 개의 노드들 및 복수 개의 링크들을 포함하여 정의되는 신호전달네트워크에 있어서, 상기 각각의 링크에 매핑되는 가중치를 산출하는 방법을 실행하는 프로그램이 기록된 컴퓨터로 읽을 수 있는 논-트랜지토리(non-transitory) 기록매체가 제공될 수 있다. 이때, 상기 프로그램은, 컴퓨팅장치로 하여금, 상기 각 링크에 연결된 소스 노드 및 타겟 노드에 관한 정보, 및 상기 각 링크에 매핑된 가중치의 부호에 관한 정보를 포함하는, 상기 신호전달네트워크의 구조정보를 획득하는 단계; 및 상기 구조정보를 이용하여 상기 각 링크에 매핑된 가중치의 값을 생성하는 가중치 생성단계;를 실행하도록 되어 있다.According to another aspect of the present invention, in a signal delivery network defined including a plurality of nodes and a plurality of links, a program executing a method of calculating a weight mapped to each link can be read by a computer in which is recorded. A non-transitory recording medium may be provided. In this case, the program causes the computing device to obtain structural information of the signaling network, including information on the source node and the target node connected to each link, and information on the sign of the weight mapped to each link. obtaining; and a weight generation step of generating a weight value mapped to each link by using the structure information.

본 발명의 또 다른 관점에 따라, 복수 개의 노드들 및 복수 개의 링크들을 포함하여 정의되는 신호전달네트워크에 있어서, 상기 각각의 링크에 매핑되는 가중치를 산출하는 방법을 수행하도록 되어 있으며, 처리부를 포함하는 컴퓨팅장치가 제공될 수 있다. 이때, 상기 처리부가, 상기 각 링크에 연결된 소스 노드 및 타겟 노드에 관한 정보, 및 상기 각 링크에 매핑된 가중치의 부호에 관한 정보를 포함하는, 상기 신호전달네트워크의 구조정보를 획득하고, 그리고 상기 구조정보를 이용하여 상기 각 링크에 매핑된 가중치의 값을 생성하도록 되어 있다. According to another aspect of the present invention, in a signaling network defined including a plurality of nodes and a plurality of links, a method of calculating a weight mapped to each link is performed, comprising a processing unit A computing device may be provided. At this time, the processing unit obtains structural information of the signaling network, including information about a source node and a target node connected to each link, and information about a sign of a weight mapped to each link, and the A value of a weight mapped to each link is generated using the structure information.

이때, 상기 컴퓨팅장치는, 저장부 또는 통신부를 더 포함하며, 상기 신호전달네트워크의 구조정보는 상기 저장부에 저장되어 있는 정보로부터 획득되거나, 또는 상기 통신부를 통하여 상기 컴퓨팅장치의 외부에 있는 장치로부터 획득된 것일 수 있다.In this case, the computing device further includes a storage unit or a communication unit, and the structure information of the signal transmission network is obtained from information stored in the storage unit, or from an external device of the computing device through the communication unit. may have been obtained.

본 발명의 또 다른 관점에 따라, 단백질을 나타내는 복수 개의 노드들 및 복수 개의 링크들을 포함하여 정의되는 신호전달네트워크에 있어서, 특정 단백질의 활성도에 관한 정보를 산출하는 단백질 활성도 산출방법이 제공될 수 있다. 이 방법은, 컴퓨팅장치가, 상기 각 링크에 연결된 소스 노드 및 타겟 노드에 관한 정보, 및 상기 각 링크에 매핑된 가중치의 부호에 관한 정보를 포함하는, 상기 신호전달네트워크의 구조정보를 획득하는 단계; 상기 컴퓨팅장치가, 상기 구조정보를 이용하여 상기 각 링크에 매핑된 가중치의 값을 생성하는 가중치 생성단계; 및 상기 컴퓨팅장치가, 상기 생성된 각 가중치의 값을 이용하여, 제1노드가 나타내는 제1단백질의 활성도의 변화에 따라, 제2노드가 나타내는 제2단백질의 활성도의 변화에 관한 정보를 산출하는 단계를 포함한다. According to another aspect of the present invention, in a signal transduction network defined including a plurality of nodes and a plurality of links representing a protein, a method for calculating protein activity for calculating information about the activity of a specific protein may be provided. . The method includes the steps of: obtaining, by a computing device, structural information of the signaling network, including information about a source node and a target node connected to each link, and information about a sign of a weight mapped to each link ; a weight generation step of generating, by the computing device, a value of a weight mapped to each link by using the structure information; And the computing device, using the generated value of each weight, according to the change in the activity of the first protein indicated by the first node, calculating information about the change in the activity of the second protein indicated by the second node includes steps.

이때, 상기 제1단백질의 활성도의 변화는 상기 제1단백질을 타겟으로 하는 제1약품의 투여에 따라 변화하는 상기 제1단백질의 기저활성도이고, 상기 제2노드가 나타내는 제2단백질의 활성도의 변화에 관한 정보는, 상기 제1단백질의 활성도의 변화율을 나타낼 수 있다.In this case, the change in the activity of the first protein is the basal activity of the first protein that changes according to the administration of the first drug targeting the first protein, and the change in the activity of the second protein indicated by the second node The information about may represent the rate of change of the activity of the first protein.

본 발명의 일 관점에 따라, 신호전달네트워크의 출력노드의 제어를 위해 제어되어야 하는 제어표적노드의 결정방법을 제공할 수 있다. 이 결정방법은, 제1신호전달네트워크에 포함된 복수 개의 노드들 각각의 제1출력노드에 대한 인플루언스를 계산하여, 상기 복수 개의 노드들 각각에 대응하는 복수 개의 인플루언스들을 산출하는 인플루언스 산출단계; 및 상기 복수 개의 인플루언스들의 값을 기초로, 상기 제1출력노드에 대하여 미리 설정되어 있는 제어목표에 따라, 상기 복수 개의 노드들 중 일부를 선택하여 상기 제1출력노드의 제어를 위한 제어표적노드로서 결정하는 제어표적노드 결정단계;를 포함한다. 그리고 상기 복수 개의 노드들 중 임의의 제1노드의 상기 제1출력노드에 대한 인플루언스는, 상기 제1노드의 활성도 변화가 상기 제1출력노드의 활성도의 변화에 미치는 영향의 정도를 나타낸다. According to one aspect of the present invention, it is possible to provide a method of determining a control target node to be controlled for controlling an output node of a signaling network. This determination method calculates the influence on a first output node of each of a plurality of nodes included in the first signal delivery network, and calculates an influence of a plurality of influences corresponding to each of the plurality of nodes. Luance calculation step; and a control target for controlling the first output node by selecting some of the plurality of nodes according to a control target preset for the first output node based on the values of the plurality of influences. and a control target node determining step of determining as a node. In addition, the influence of any first node among the plurality of nodes on the first output node indicates the degree of influence of the change in the activity of the first node on the change in the activity of the first output node.

이때, 상기 결정방법은, 상기 인플루언스 산출단계 및 상기 제어표적노드 결정단계 사이에, 상기 복수 개의 노드들 각각으로부터 상기 제1출력노드까지의 SPLO(Shortest Path Length to Output)를 계산하여, 상기 복수 개의 노드들 각각에 대응하는 복수 개의 SPLO들을 산출하는 단계를 더 포함할 수 있다. 그리고 이때, 상기 제어표적노드 결정단계는, 상기 복수 개의 노드들을 상기 SPLO의 값을 기준으로 복수 개의 SPLO 그룹들로 분류하는 단계; 및 상기 복수 개의 SPLO 그룹들 각각에 대하여 독립적으로, 상기 각 SPLO 그룹에 속한 노드들 중 적어도 일부를 선택하여 상기 제1출력노드의 제어를 위한 제어표적노드로서 결정하는 그룹별 결정단계;를 포함할 수 있다. 그리고 임의의 상기 노드로부터 상기 제1출력노드까지의 상기 SPLO는, 상기 임의의 노드로부터 상기 제1출력노드까지 도달하기 위하여, 상기 임의의 노드와 상기 제1출력노드를 서로 연결하는 한 개 이상의 링크들로부터 선택될 수 있는 한 개 이상의 후보경로들 중, 각 후보경로에 포함된 링크의 개수가 가장 작은 최단경로에 포함된 상기 링크의 수를 나타낼 수 있다.In this case, the determining method includes calculating a Shortest Path Length to Output (SPLO) from each of the plurality of nodes to the first output node between the influence calculation step and the control target node determination step, The method may further include calculating a plurality of SPLOs corresponding to each of the plurality of nodes. And in this case, the step of determining the control target node includes: classifying the plurality of nodes into a plurality of SPLO groups based on the value of the SPLO; and independently for each of the plurality of SPLO groups, a group-by-group determination step of selecting at least some of the nodes belonging to each SPLO group and determining as a control target node for controlling the first output node; can and the SPLO from the arbitrary node to the first output node is one or more links connecting the arbitrary node and the first output node to each other to reach the first output node from the arbitrary node. Among one or more candidate paths that can be selected from among the one or more candidate paths, the number of links included in the shortest path having the smallest number of links included in each candidate path may be indicated.

이때, 상기 신호전달네트워크는, 복수 개의 생체 분자 간의 상호작용을 모델링하는 네트워크로서, 방향성이 있는 복수 개의 링크들, 및 상기 링크들에 의해 연결되는 복수 개의 노드들을 포함하여 정의될 수 있다. 그리고 상기 각각의 노드는 생체분자를 나타낼 수 있다.In this case, the signal transduction network is a network that models the interaction between a plurality of biomolecules, and may be defined including a plurality of directional links and a plurality of nodes connected by the links. And each node may represent a biomolecule.

이때, 상기 제1신호전달네트워크에 포함된 임의의 노드의 상기 활성도는, 상기 임의의 노드가 나타내는 활성화된 생체분자의 양을 나타낼 수 있다. In this case, the activity of any node included in the first signal transduction network may indicate the amount of activated biomolecules represented by the arbitrary node.

이때, 상기 제1출력노드에 대하여 미리 설정되어 있는 제어목표는 상기 제1출력노드의 활성도를 증가시키거나 감소시키는 것일 수 있다. In this case, the control target preset for the first output node may be to increase or decrease the activity of the first output node.

이때, 상기 제1출력노드는, 세포의 사멸을 나타내는 노드, 세포의 증식을 나타내는 노드, 및 세포의 전이를 나타내는 노드 중 어느 하나일 수 있다.In this case, the first output node may be any one of a node representing cell death, a node representing cell proliferation, and a node representing cell metastasis.

이때, 상기 제1노드의 활성도 변화는 상기 제1노드의 기저활성도의 변화(∂b_j)이며, 상기 제1노드의 상기 제1출력노드에 대한 인플루언스(S_ij)는 다음의 식 11을 만족할 수 있다. [식 11] S_ij = ∂x_i/ ∂b_j. At this time, the change in the activity of the first node is the change (∂b _j _{) of the basal activity of the first node, and the influence (S ij} ) of the first node to the first output node is the following Equation 11 can be satisfied [Equation 11] S _ij = ∂x _i / ∂b _j.

본 발명의 다른 관점에 따라 신호전달네트워크의 출력노드의 제어를 위해 제어되어야 하는 제어표적노드의 결정방법이 제공될 수 있다. 이 결정방법은, 제1신호전달네트워크에 포함된 복수 개의 노드들 각각의 제1출력노드에 대한 인플루언스를 계산하여, 상기 복수 개의 노드들 각각에 대응하는 복수 개의 인플루언스들을 산출하는 단계; 상기 복수 개의 노드들 각각으로부터 상기 제1출력노드까지의 SPLO를 계산하여, 상기 복수 개의 노드들 각각에 대응하는 복수 개의 SPLO들을 산출하는 단계; 상기 복수 개의 노드들을 상기 SPLO의 값을 기준으로 복수 개의 SPLO 그룹들로 분류하는 단계; 및 상기 복수 개의 SPLO 그룹들 각각에 대하여 독립적으로, 상기 각 SPLO 그룹에 속한 노드들 중 적어도 일부를 선택하여 상기 제1출력노드의 제어를 위한 제어표적노드로서 결정하는 단계;를 포함한다. 이때, 상기 복수 개의 노드들 중 임의의 제1노드의 임의의 상기 출력노드에 대한 인플루언스는, 상기 제1노드의 활성도 변화가 상기 임의의 출력노드의 활성도의 변화(∂x_i)에 미치는 영향의 정도이다. According to another aspect of the present invention, a method for determining a control target node to be controlled for controlling an output node of a signaling network may be provided. The determining method includes calculating an influence for a first output node of each of a plurality of nodes included in the first signal delivery network, and calculating a plurality of influences corresponding to each of the plurality of nodes. ; calculating an SPLO from each of the plurality of nodes to the first output node, and calculating a plurality of SPLOs corresponding to each of the plurality of nodes; classifying the plurality of nodes into a plurality of SPLO groups based on the value of the SPLO; and independently for each of the plurality of SPLO groups, selecting at least some of the nodes belonging to each of the SPLO groups and determining them as a control target node for controlling the first output node. At this time, the influence of the arbitrary first node among the plurality of nodes on the arbitrary output node is the effect of the change in the activity of the first node on the change in the activity of the arbitrary output node (∂x _{i )} is the degree of

본 발명의 또 다른 관점에 따라 신호전달네트워크의 출력노드의 제어를 위해 제어되어야 하는 제어표적노드의 결정방법을 제공할 수 있다. 상기 결정방법은, 제1신호전달네트워크에 포함된 복수 개의 출력노드들 각각에 대하여: 상기 제1신호전달네트워크에 포함된 복수 개의 노드들 각각의 상기 출력노드에 대한 인플루언스를 계산하여, 상기 복수 개의 노드들 각각에 대응하는 복수 개의 인플루언스들을 산출하고, 상기 복수 개의 노드들 각각으로부터 상기 출력노드까지의 SPLO를 계산하여, 상기 복수 개의 노드들 각각에 대응하는 복수 개의 SPLO들을 산출하고, 상기 복수 개의 노드들을 상기 SPLO의 값을 기준으로 복수 개의 SPLO 그룹들로 분류하고, 그리고 상기 복수 개의 SPLO 그룹들 각각에 대하여 독립적으로, 상기 각 SPLO 그룹에 속한 노드들 중 적어도 일부를 선택하여 상기 출력노드의 제어를 위한 제어표적노드로서 결정하도록 되어 있는, 출력노드별 제어표적 후보집합 결정 단계를 포함한다. 그리고 상기 결정방법은, 상기 복수 개의 출력노드들 각각에 대하여 결정된 상기 제어표적노드들의 교집합을 기초로 상기 제1출력노드의 제어를 위한 최종 제어표적노드를 결정하는 제어표적노드 결정단계;를 포함한다. 이때, 상기 복수 개의 노드들 중 임의의 제1노드의 임의의 상기 출력노드에 대한 인플루언스는, 상기 제1노드의 활성도 변화가 상기 임의의 출력노드의 활성도의 변화(∂x_i)에 미치는 영향의 정도이다. According to another aspect of the present invention, it is possible to provide a method for determining a control target node to be controlled for controlling an output node of a signaling network. The determining method includes: for each of a plurality of output nodes included in the first signaling network: calculating an influence on the output node of each of a plurality of nodes included in the first signaling network, calculating a plurality of influences corresponding to each of a plurality of nodes, calculating an SPLO from each of the plurality of nodes to the output node, and calculating a plurality of SPLOs corresponding to each of the plurality of nodes; Classifying the plurality of nodes into a plurality of SPLO groups based on the value of the SPLO, and independently selecting at least some of the nodes belonging to each SPLO group for each of the plurality of SPLO groups and outputting the output and determining a control target candidate set for each output node, configured to be determined as a control target node for control of the node. And the determining method includes a control target node determining step of determining a final control target node for controlling the first output node based on the intersection of the control target nodes determined for each of the plurality of output nodes. . At this time, the influence of the arbitrary first node among the plurality of nodes on the arbitrary output node is the effect of the change in the activity of the first node on the change in the activity of the arbitrary output node (∂x _{i )} is the degree of

이때, 임의의 상기 노드로부터 임의의 상기 출력노드까지의 상기 SPLO는, 상기 임의의 노드로부터 상기 임의의 출력노드까지 도달하기 위하여, 상기 임의의 노드와 상기 임의의 출력노드를 서로 연결하는 한 개 이상의 링크들로부터 선택될 수 있는 한 개 이상의 후보경로들 중, 각 후보경로에 포함된 링크의 개수가 가장 작은 최단경로에 포함된 상기 링크의 수를 나타낼 수 있다.At this time, the SPLO from the arbitrary node to the arbitrary output node is at least one connecting the arbitrary node and the arbitrary output node to each other in order to reach the arbitrary output node from the arbitrary node. Among one or more candidate paths that can be selected from the links, the number of links included in the shortest path having the smallest number of links included in each candidate path may be indicated.

이때, 상기 복수 개의 노드들 중 임의의 제1노드의 상기 제1출력노드에 대한 인플루언스는, 상기 임의의 제1노드로부터 상기 제1출력노드에 이르는 상기 제1신호전달네트워크의 모든 경로에서, 각 경로에 포함된 링크에 할당된 가중치들의 곱을 합하는 연산을 이용하여 산출될 수 있다.At this time, the influence of any first node among the plurality of nodes on the first output node is in all paths of the first signaling network from the arbitrary first node to the first output node, It may be calculated using an operation of summing the products of weights assigned to links included in each path.

이때, 상기 각 경로에 포함된 링크에 할당된 가중치들은, 컴퓨팅장치가, 상기 각 링크에 연결된 소스 노드 및 타겟 노드에 관한 정보, 및 상기 각 링크에 매핑된 가중치의 부호에 관한 정보를 포함하는, 상기 신호전달네트워크의 구조정보를 획득하는 단계; 및 상기 컴퓨팅장치가, 상기 구조정보를 이용하여 상기 각 링크에 매핑된 가중치의 값을 생성하는 가중치 생성단계;를 포함하는, 신호전달네트워크에 관한 정보를 생성하는 방법을 이용하여 생성된 것일 수 있다.In this case, the weights assigned to the links included in each path include, by the computing device, information on a source node and a target node connected to each link, and information on a sign of a weight mapped to each link, obtaining structural information of the signaling network; and a weight generation step of generating, by the computing device, a value of a weight mapped to each link by using the structure information; .

이때, 상기 가중치 생성단계는, 상기 구조정보를 이용하여 인접행렬을 생성하는 단계; 상기 인접행렬로부터 대각행렬을 생성하는 단계; 및 상기 인접행렬과 상기 대각행렬을 행렬곱 연산하여 생성한 가중치 행렬을 이용하여 상기 각 링크에 매핑된 가중치의 값을 산출하는 단계를 포함할 수 있다. In this case, the weight generating step may include: generating an adjacency matrix using the structure information; generating a diagonal matrix from the adjacency matrix; and calculating a weight value mapped to each link using a weight matrix generated by performing a matrix multiplication operation on the adjacent matrix and the diagonal matrix.

본 발명의 일 관점에 따라, 생체분자들 간의 상호작용으로 구성된 생체분자네트워크에서 특정한 생체분자인 출력생체분자의 활성도의 제어를 위해 약물의 표적이 되어야 하는 생체분자를 결정하는 방법이 제공될 수 있다. 이 방법은, 상기 생체분자네트워크에 포함된 복수 개의 생체분자들 각각의 상기 출력생체분자에 대한 인플루언스를 계산하여, 상기 복수 개의 생체분자들 각각에 대응하는 복수 개의 인플루언스들을 산출하는 인플루언스 산출단계; 및 상기 복수 개의 인플루언스들의 값을 기초로, 상기 출력생체분자에 대하여 미리 설정되어 있는 제어목표에 따라, 상기 복수 개의 생체분자들 중 일부를 선택하여 상기 출력셍체분자의 제어를 위한 약물표적 생체분자로서 결정하는 약물표적 생체분자 결정단계;를 포함한다. 그리고 이때, 상기 복수 개의 생체분자들 중 임의의 제1생체분자의 상기 출력생체분자에 대한 인플루언스는, 상기 제1생체분자의 활성도 변화가 상기 출력생체분자의 활성도의 변화에 미치는 영향의 정도이다. 여기서, 상기 '복수 개의 생체분자들 각각'은 집합적인 개념으로서, 상술한 신호절단네트워크의 각 노드에 대응할 수 있다.According to one aspect of the present invention, there may be provided a method for determining a biomolecule that should be a target of a drug in order to control the activity of an output biomolecule that is a specific biomolecule in a biomolecular network composed of interactions between biomolecules. . In this method, the influence of each of a plurality of biomolecules included in the biomolecule network is calculated on the output biomolecule to calculate a plurality of influences corresponding to each of the plurality of biomolecules. Luance calculation step; and a drug target biomolecule for controlling the output biomolecule by selecting some of the plurality of biomolecules according to a control target preset for the output biomolecule based on the values of the plurality of influences and a drug target biomolecule determination step of determining as a molecule. And in this case, the influence of any first biomolecule among the plurality of biomolecules on the output biomolecule is the degree to which the change in the activity of the first biomolecule affects the change in the activity of the output biomolecule. . Here, the 'each of the plurality of biomolecules' is a collective concept and may correspond to each node of the signal cutting network described above.

상기 생체분자들 간의 상호작용으로 구성된 생체분자네트워크는 상술한 신호전달네트워크로서 모델링될 수 있다. 상기 특정한 생체분자인 출력생체분자는 상술한 출력노드를 의미할 수 있다. 상기 약물의 표적이 되어야 하는 생체분자는 상술한 제어표적노드를 의미할 수 있다.The biomolecule network composed of the interactions between the biomolecules can be modeled as the aforementioned signal transduction network. The output biomolecule, which is the specific biomolecule, may refer to the above-described output node. The biomolecule to be the target of the drug may refer to the aforementioned control target node.

본 발명에 따르면, 신호전달네트워크의 출력노드의 제어를 위해 제어되어야 하는 제어표적노드(=약물표적노드)를 결정하는 방법을 제공할 수 있다. 구체적으로는 임의의 제1노드의 활성도의 변화가 다른 제2노드의 활성도의 변화에 미치는 영향으로 정의되는 인플루언스를 이용하여 상기 제어 표적노드를 결정하는 방법을 제공할 수 있다.According to the present invention, it is possible to provide a method for determining a control target node (=drug target node) to be controlled for the control of an output node of a signaling network. Specifically, it is possible to provide a method of determining the control target node by using influence defined as an effect of a change in the activity of an arbitrary first node on a change in the activity of another second node.

본 발명에 따르면, 생물학적 실험데이터가 결핍된 상황에서도, 상기 신호전달네트워크의 각 유향부호링크의 가중치를 추정할 수 있으며, 그 결과 상기 신호흐름을 추정할 수 있다. According to the present invention, even in a situation in which biological experimental data is lacking, the weight of each directed code link of the signaling network can be estimated, and as a result, the signal flow can be estimated.

그 결과 정의된 상기 신호전달네트워크를 이용하여, 상기 신호전달네트워크에 속한 관심 있는 신호전달분자(=노드)의 활성도(=신호)를 결정할 수 있도록 하는 기반 기술이 제공될 수 있다.As a result, by using the defined signaling network, a base technology capable of determining the activity (=signal) of a signaling molecule of interest (=node) belonging to the signaling network can be provided.

본 발명에 따르면 시간과 비용이 많이 요구되는 동역학 실험 데이터 없이 복잡한 신호전달네트워크의 구조 정보만을 이용하여 신약 표적 후보(drug target candidate)의 발굴이 가능하다. According to the present invention, it is possible to discover drug target candidates using only structural information of a complex signal transduction network without kinetic experimental data requiring a lot of time and money.

본 발명은 네트워크의 구조 정보만을 활용하여 비선형 동역학에 대한 실험적 데이터 없이 약물 표적이 될 만한 제어표적을 우선적으로 선별할 수 있다는 장점이 있다. 의생명과학 관련 연구자가 경험, 지식, 통찰력 등을 바탕으로 임의의 약물 표적을 선택하는 것과는 달리, 본 발명을 활용하면 오랜 시간 축적되어 온 방대한 양의 신호전달네트워크의 구조 정보에 대한 정량적 분석을 통해 약물 표적 후보를 우선적으로 선별할 수 있다.The present invention has an advantage in that it is possible to preferentially select a control target that can be a drug target without experimental data on nonlinear kinetics by using only the structure information of the network. Unlike a biomedical science-related researcher who selects an arbitrary drug target based on experience, knowledge, insight, etc., the present invention enables quantitative analysis of the structural information of a vast amount of signal transduction network that has been accumulated over a long period of time. Drug target candidates may be preferentially selected.

도 1은 본 발명에서 다루는 신호전달네트워크에서의 생화학 반응 및 유향부호링크의 개념에 대하여 나타낸 것이다.
도 2는 복잡한 신호전달네트워크의 구조의 예를 도식화하여 제시한 것이다.
도 3은 본 발명의 일 실시예에 따라 제공되는 신호전달네트워크에 관한 정보를 생성하는 방법을 실행하는 컴퓨팅장치의 구조를 나타낸 것이다.
도 4는 본 발명의 일 실시예에 따라 제공되는 신호전달네트워크에 관한 정보를 생성하는 방법을 설명하는 순서도이다.
도 5는 본 발명의 일 실시예에 따라 정의되는 신호전달네트워크의 구조의 예를 나타낸 다이어그램이다.
도 6은 본 발명의 일 실시예에 따라 제공되는 구조정보를 포함하는 테이블의 예를 나타낸 것이다.
도 7은 도 6에 제시한 테이블에 의해 제공되는 상기 구조정보를 이용하여 생성한 인접행렬을 나타낸 것이다.
도 8은 본 발명의 일 실시예에 따라 생성되는 대각행렬의 일 예를 나타낸 것이다.
도 9는 본 발명의 일 실시예에 따라 생성되는 가중치 행렬의 일 예를 나타낸 것이다.
도 10은 3-노드 케스케이드(cascade) 구조를 갖는 신호전달네트워크의 예를 나타낸다.
도 11은 3개의 경로를 갖는 피드포워드(feed-forward) 구조를 갖는 신호전달네트워크의 예를 나타낸다.
도 12는 3-노드 음성피드백루프(negative feedback loop) 구조를 갖는 신호전달네트워크의 예를 나타낸다.
도 13은 3-노드 음성피드백루프에 대한 루프 언롤링(unrolling) 구조를 갖는 신호전달네트워크의 예를 나타낸다.
도 14는 EMT 조절 신호전달네트워크를 나타낸다.
도 15 및 도 16은 도 14에 제시한 EMT 조절 네트워크에 대한 인플루언스 계산 결과의 예를 나타낸 것이다.
도 17은 본 발명의 일 실시예에 따라 신호전달네트워크에서의 제어표적을 결정하는 방법을 나타낸 순서도이다.
도 18은 본 발명의 일 실시예에 따른 제어표적노드의 선별 과정을 나타낸 순서도이다.
도 19는 본 발명의 다른 실시예에 따른 제어표적노드의 선별 과정을 나타낸 순서도이다.1 shows the concept of a biochemical reaction and a directed code link in the signaling network dealt with in the present invention.
Figure 2 schematically presents an example of the structure of a complex signaling network.
3 is a diagram illustrating a structure of a computing device executing a method for generating information on a signaling network provided according to an embodiment of the present invention.
4 is a flowchart illustrating a method of generating information on a signaling network provided according to an embodiment of the present invention.
5 is a diagram illustrating an example of the structure of a signaling network defined according to an embodiment of the present invention.
6 shows an example of a table including structure information provided according to an embodiment of the present invention.
FIG. 7 shows an adjacency matrix generated using the structure information provided by the table shown in FIG. 6 .
8 shows an example of a diagonal matrix generated according to an embodiment of the present invention.
9 illustrates an example of a weight matrix generated according to an embodiment of the present invention.
10 shows an example of a signaling network having a 3-node cascade structure.
11 shows an example of a signaling network having a feed-forward structure having three paths.
12 shows an example of a signaling network having a three-node negative feedback loop structure.
13 shows an example of a signaling network having a loop unrolling structure for a 3-node voice feedback loop.
14 shows an EMT control signaling network.
15 and 16 show examples of influence calculation results for the EMT control network shown in FIG. 14 .
17 is a flowchart illustrating a method of determining a control target in a signaling network according to an embodiment of the present invention.
18 is a flowchart illustrating a process of selecting a control target node according to an embodiment of the present invention.
19 is a flowchart illustrating a process of selecting a control target node according to another embodiment of the present invention.

이하, 본 발명의 실시예를 첨부한 도면을 참고하여 설명한다. 그러나 본 발명은 본 명세서에서 설명하는 실시예에 한정되지 않으며 여러 가지 다른 형태로 구현될 수 있다. 본 명세서에서 사용되는 용어는 실시예의 이해를 돕기 위한 것이며, 본 발명의 범위를 한정하고자 의도된 것이 아니다. 또한, 이하에서 사용되는 단수 형태들은 문구들이 이와 명백히 반대의 의미를 나타내지 않는 한 복수 형태들도 포함한다.Hereinafter, embodiments of the present invention will be described with reference to the accompanying drawings. However, the present invention is not limited to the embodiments described herein and may be implemented in various other forms. The terminology used in this specification is intended to help the understanding of the embodiments, and is not intended to limit the scope of the present invention. Also, singular forms used hereinafter include plural forms unless the phrases clearly indicate the opposite.

도 3은 본 발명의 일 실시예에 따라 제공되는 신호전달네트워크에 관한 정보를 생성하는 방법을 실행하는 컴퓨팅장치의 구조를 나타낸 것이다.3 is a diagram illustrating a structure of a computing device executing a method for generating information on a signaling network provided according to an embodiment of the present invention.

컴퓨팅장치(100)는 CPU 등으로 이루어진 처리부(101), 저장부(102), 및 통신부(103)를 포함할 수 있다.The computing device 100 may include a processing unit 101 made of a CPU or the like, a storage unit 102 , and a communication unit 103 .

이하 설명하는 신호전달네트워크에 관한 정보를 생성하는 방법에 포함되는 각 단계들은 상기 컴퓨팅장치(100)에 의해 실행될 수 있다.Each of the steps included in the method for generating information about the signal transmission network to be described below may be executed by the computing device 100 .

도 4는 본 발명의 일 실시예에 따라 제공되는 신호전달네트워크에 관한 정보를 생성하는 방법을 설명하는 순서도이다.4 is a flowchart illustrating a method of generating information on a signaling network provided according to an embodiment of the present invention.

도 5는 본 발명의 일 실시예에 따라 정의되는 신호전달네트워크의 구조의 예를 나타낸 다이어그램이다.5 is a diagram illustrating an example of the structure of a signaling network defined according to an embodiment of the present invention.

이하 도 3 내지 도 5를 함께 참조하여 설명한다.Hereinafter, it will be described with reference to FIGS. 3 to 5 together.

<가중치의 산출><Calculation of weights>

신호전달네트워크(1)는 복수 개의 노드(2)들 및 복수 개의 링크(3)들을 포함하여 정의될 수 있다. The signaling network 1 may be defined to include a plurality of nodes 2 and a plurality of links 3 .

각 노드(2)는 생물이 세포 내에 존재하는 분자를 나타낼 수 있다. 상기 분자는 상술한 신호전달분자일 수 있다. 각 노드(2)를 나타내는 원 안에 표기된 숫자는 해당 노드의 노드 식별자이다.Each node 2 may represent a molecule in which an organism exists in a cell. The molecule may be the aforementioned signal transduction molecule. The number indicated in the circle representing each node 2 is the node identifier of the corresponding node.

각 링크(3)는 상술한 유향부호링크의 형태를 가질 수 있다. 각 링크(3)는 상기 소스 노드와 상기 타겟 노드 간의 상호작용을 나타낸 것일 수 있다.Each link 3 may have the form of the above-described directional code link. Each link 3 may represent an interaction between the source node and the target node.

각 링크(3)는 화살표 모양의 지시선으로 상징되는 제1타입(31)과, 못머리 모양의 지시선으로 상징되는 제2타입(32)으로 구분될 수 있다. 상기 제1타입(31)은 소스 노드의 신호의 증가가 타겟 노드의 신호의 증가를 활성화하는 타입이며, 상기 제2타입(32)은 소스 노드의 신호의 증가가 타겟 노드의 신호의 증가를 억제하는 타입일 수 있다.Each link 3 may be divided into a first type 31 symbolized by an arrow-shaped leader line and a second type 32 symbolized by a nail head-shaped leader line. The first type (31) is a type in which an increase in a signal at a source node activates an increase in a signal at a target node, and in the second type (32), an increase in a signal at a source node suppresses an increase in a signal at the target node. It may be of the type that

상기 각 링크에는 가중치가 부여될 수 있다. 상기 가중치는 양의 값 또는 음의 값을 갖는 실수일 수 있다.A weight may be assigned to each link. The weight may be a real number having a positive value or a negative value.

본 발명의 일 실시예는 상기 가중치의 구체적인 값을 결정하는 것을 목적으로 할 수 있다.An embodiment of the present invention may aim to determine a specific value of the weight.

본 발명의 일 실시예에 따라 제공되는 신호전달네트워크에 관한 정보를 생성하는 방법은 아래 단계들을 포함할 수 있다. A method for generating information on a signaling network provided according to an embodiment of the present invention may include the following steps.

단계(S10)에서, 컴퓨팅장치(100)의 처리부(101)는, 상기 신호전달네트워크의 구조정보를 상기 저장부(102)로부터 획득하거나, 또는 상기 통신부(103)를 통해 외부의 저장장치 또는 서버와 같은 외부의 다른 네트워크 엔티티로부터 획득할 수 있다.In step S10 , the processing unit 101 of the computing device 100 obtains the structure information of the signal transmission network from the storage unit 102 , or an external storage device or server through the communication unit 103 . It can be obtained from other network entities outside such as

상기 구조정보는, 상기 각 링크(3)에 연결된 두 개의 노드(2)들인 소스 노드 및 타겟 노드에 관한 정보, 및 상기 각 링크(3)에 매핑된 가중치의 부호에 관한 정보를 포함할 수 있다. 그러나 상기 구조정보는 상기 매핑된 가중치의 구체적인 값은 포함하지 않는다.The structure information may include information about a source node and a target node, which are two nodes 2 connected to each link 3 , and information about a sign of a weight mapped to each link 3 . . However, the structure information does not include a specific value of the mapped weight.

도 6은 본 발명의 일 실시예에 따라 제공되는 구조정보를 포함하는 테이블의 예를 나타낸 것이다.6 shows an example of a table including structure information provided according to an embodiment of the present invention.

상기 테이블(50)의 서로 다른 레코드(행)를 구분하는 키(Key)는 각 링크(3)의 식별자일 수 있다.A key for distinguishing different records (rows) of the table 50 may be an identifier of each link 3 .

상기 테이블(50)에는, 상기 각 링크의 소스 노드의 식별자(Ns)에 관한 제1필드, 상기 각 링크의 타입에 관한 제2필드, 및 상기 각 링크의 타겟 노드의 식별자(Nt)에 관한 제3필드가 포함되어 있을 수 있다. In the table 50, a first field relating to the identifier (Ns) of the source node of each link, a second field relating to the type of each link, and a first field relating to the identifier (Nt) of the target node of each link It can contain 3 fields.

상기 제2필드에는 해당 레코드의 링크가 상기 제1타입(21)인 경우에는 이를 나타내는 값이 저장되고, 해당 레코드의 링크가 상기 제2타입(22)인 경우에는 이를 나타내는 값이 저장될 수 있다.In the second field, when the link of the corresponding record is the first type (21), a value indicating this may be stored, and when the link of the corresponding record is the second type (22), a value indicating this may be stored. .

가중치 생성단계(S20)에서, 컴퓨팅장치(100)의 처리부(101)는, 상기 구조정보를 이용하여 상기 각 링크(3)에 매핑된 가중치의 값을 생성할 수 있다.In the weight generation step S20 , the processing unit 101 of the computing device 100 may generate a weight value mapped to each link 3 using the structure information.

본 발명의 제1실시예에서, 상기 가중치 생성단계(S20)는, 상기 링크(3)에 매핑되는 가중치의 값을, 상기 링크(3)의 소스 노드를 소스로 하는 제1세트의 링크들의 개수인 제1개수, 및 상기 링크의 타겟 노드를 타겟으로 하는 제2세트의 링크들의 개수인 제2개수의 대푯값으로 결정하는 단계(S201)를 포함할 수 있다. 상기 단계(S201)는 각 링크(3)마다 실행될 수 있다. In the first embodiment of the present invention, in the weight generation step ( S20 ), the value of the weight mapped to the link 3 is the number of links in the first set using the source node of the link 3 as a source. and determining ( S201 ) as a representative value of the first number and the second number, which is the number of links in a second set targeting the target node of the link. The step S201 may be executed for each link 3 .

예컨대 도 5에 나타낸 신호전달네트워크(1)에서, 상기 단계(S201)를 제2링크(3, L2)에 대하여 수행할 수 있다. 이때, 상기 제2링크(3, L2)에 매핑되는 가중치의 값을, 상기 제2링크(3, L2)의 소스 노드인 노드(N2)를 소스로 하는 제1세트의 링크들(L2)의 개수인 제1개수(=1), 및 상기 제2링크(3, L2)의 타겟 노드인 노드(N3)를 타겟으로 하는 제2세트의 링크들(L2, L5)의 개수인 제2개수(=2)의 대푯값으로 결정할 수 있다. 이때, 상기 대푯값은 예컨대 상기 제1개수 및 상기 제2개수의 기하평균일 수 있다. For example, in the signaling network 1 shown in FIG. 5 , the step S201 may be performed for the second links 3 and L2. In this case, the value of the weight mapped to the second link (3, L2) is the value of the first set of links (L2) whose source is the node (N2), which is the source node of the second link (3, L2). A first number (=1) that is the number, and a second number that is the number of the second set of links (L2, L5) targeting a node (N3) that is a target node of the second link (3, L2) ( =2) can be determined as a representative value. In this case, the representative value may be, for example, a geometric average of the first number and the second number.

본 발명의 제2실시예에서, 상기 가중치 생성단계(S20)는, 상기 링크(3)에 매핑되는 가중치의 값을, 상기 링크(3)의 소스 노드를 소스로 하는 제1세트의 타겟 노드들의 개수인 제1개수, 및 상기 링크의 타겟 노드를 타겟으로 하는 제2세트의 소스 노드들의 개수인 제2개수의 대푯값으로 결정하는 단계(S202)를 포함할 수 있다. 상기 단계(S202)는 각 링크(3)마다 실행될 수 있다. In the second embodiment of the present invention, the weight generation step ( S20 ) includes the value of the weight mapped to the link 3 , of a first set of target nodes using the source node of the link 3 as a source. and determining ( S202 ) a representative value of a first number that is the number and a second number that is the number of source nodes of a second set targeting the target node of the link. The step S202 may be executed for each link 3 .

예컨대 도 5에 나타낸 신호전달네트워크(1)에서, 상기 단계(S202)를 제2링크(3, L2)에 대하여 수행할 수 있다. 이때, 상기 제2링크(3, L2)에 매핑되는 가중치의 값을, 상기 제2링크(3, L2)의 소스 노드인 노드(N2)를 소스로 하는 제1세트의 타겟 노드들(N3)의 개수인 제1개수(=1), 및 상기 제2링크(3, L2)의 타겟 노드인 노드(N3)를 타겟으로 하는 제2세트의 소스 노드들(N2, N4)의 개수인 제2개수(=2)의 대푯값으로 결정할 수 있다. 이때, 상기 대푯값은 예컨대 상기 제1개수 및 상기 제2개수의 기하평균일 수 있다. For example, in the signaling network 1 shown in FIG. 5 , the step S202 may be performed for the second links 3 and L2. In this case, the first set of target nodes N3 using the value of the weight mapped to the second link 3 and L2 as the source node N2 as the source node of the second link 3 and L2 as a source. A first number (=1) that is the number of , and a second number that is the number of the second set of source nodes N2 and N4 targeting the node N3 that is the target node of the second link 3 and L2 It can be determined as a representative value of the number (=2). In this case, the representative value may be, for example, a geometric average of the first number and the second number.

본 발명의 제3실시예에서, 상기 가중치 생성단계(S20)는, 상기 링크(3)에 매핑되는 가중치의 값을, 상기 링크(3)의 소스 노드를 소스로 하는 제1세트의 링크들의 개수인 제1개수의 역수인 값으로 결정하거나, 또는 상기 링크(3)의 타겟 노드를 타겟으로 하는 제2세트의 링크들의 개수인 제2개수의 역수인 값으로 결정하는 단계(S203)을 포함할 수 있다. In the third embodiment of the present invention, in the weight generation step ( S20 ), the value of the weight mapped to the link 3 is the number of links in the first set using the source node of the link 3 as a source. determining as a value that is the reciprocal of the first number, or determining the target node of the link 3 as a value that is the reciprocal of a second number that is the number of links in a second set that targets the target node of the link 3 ( S203 ). can

예컨대 도 5에 나타낸 신호전달네트워크(1)에서, 상기 단계(S203)를 제2링크(3, L2)에 대하여 수행할 수 있다. 이때, 상기 제2링크(3, L2)에 매핑되는 가중치의 값을, 상기 제2링크(3, L2)의 소스 노드인 노드(N2)를 소스로 하는 제1세트의 링크들(L2)의 개수인 제1개수(=1)의 역수인 값(=1)으로 결정하거나, 또는 상기 제2링크(3, L2)의 타겟 노드인 노드(N3)를 타겟으로 하는 제2세트의 링크들(L2, L5)의 개수인 제2개수(=2)의 역수인 값(=1/2)으로 결정할 수 있다.For example, in the signaling network 1 shown in FIG. 5, the step S203 may be performed for the second links 3 and L2. In this case, the value of the weight mapped to the second link (3, L2) is the value of the first set of links (L2) whose source is the node (N2), which is the source node of the second link (3, L2). A second set of links (=1) that is determined as a value (=1) that is the reciprocal of the first number (=1), or that targets a node N3 that is a target node of the second link (3, L2) ( It can be determined as a value (=1/2) that is the reciprocal of the second number (=2) that is the number of L2 and L5).

본 발명의 제4실시예에서, 상기 가중치 생성단계(S20)는, 상기 링크(3)에 매핑되는 가중치의 값을, 상기 링크(3)의 소스 노드를 소스로 하는 제1세트의 타겟 노드들의 개수인 제1개수의 역수인 값으로 결정하거나, 또는 상기 링크(3)의 타겟 노드를 타겟으로 하는 제2세트의 소스 노드들의 개수인 제2개수의 역수인 값으로 결정하는 단계(S204)을 포함할 수 있다. In the fourth embodiment of the present invention, the weight generating step ( S20 ) includes the value of the weight mapped to the link 3 , of the first set of target nodes using the source node of the link 3 as a source. The step of determining as a value that is the reciprocal of the first number that is the number, or determining the target node of the link 3 as a value that is the reciprocal of the second number that is the number of the second set of source nodes targeting the target node (S204) may include

예컨대 도 5에 나타낸 신호전달네트워크(1)에서, 상기 단계(S204)를 제2링크(3, L2)에 대하여 수행할 수 있다. 이때, 상기 제2링크(3, L2)에 매핑되는 가중치의 값을, 상기 제2링크(3, L2)의 소스 노드인 노드(N2)를 소스로 하는 제1세트의 타겟 노드들(N3)의 개수인 제1개수(=1)의 역수인 값(=1)으로 결정하거나, 또는 상기 제2링크(3, L2)의 타겟 노드인 노드(N3)를 타겟으로 하는 제2세트의 소스 노드들(N2, N4)의 개수인 제2개수(=2)의 역수인 값(=1/2)으로 결정할 수 있다.For example, in the signaling network 1 shown in FIG. 5 , the step S204 may be performed for the second links 3 and L2. In this case, the first set of target nodes N3 using the value of the weight mapped to the second link 3 and L2 as the source node N2 as the source node of the second link 3 and L2 as a source. The second set of source nodes is determined as a value (=1) that is the inverse of the first number (=1) that is the number of , or targets the node N3, which is the target node of the second link 3, L2. It can be determined as a value (=1/2) that is the reciprocal of the second number (=2) that is the number of the items N2 and N4.

본 발명의 제5실시예에서, 상기 가중치 생성단계(S20)는, 상기 구조정보를 이용하여 인접행렬(A)을 생성하는 단계(S211); 상기 인접행렬(A)로부터 대각행렬(D)을 생성하는 단계(S212); 및 상기 인접행렬(A)과 상기 대각행렬(D)을 가중치 생성 방식(S201~S204)에 따라 행렬곱 연산하여 생성한 가중치 행렬(W)을 이용하여 상기 각 링크에 매핑된 가중치의 값을 산출하는 단계(S213)를 포함할 수 있다.In the fifth embodiment of the present invention, the weight generating step (S20) includes: generating an adjacency matrix (A) using the structure information (S211); generating a diagonal matrix (D) from the adjacent matrix (A) (S212); and calculating the weight value mapped to each link using the weight matrix W generated by performing matrix multiplication operation on the adjacency matrix A and the diagonal matrix D according to the weight generation method (S201 to S204). It may include a step (S213) of doing.

상기 제5실시예의 인접행렬(A)를 생성하는 단계(S211)의 구체적인 방법은 도 7을 이용하여 설명한다.A specific method of generating the adjacency matrix A of the fifth embodiment ( S211 ) will be described with reference to FIG. 7 .

도 7은 도 6에 제시한 테이블(50)에 의해 제공되는 상기 구조정보를 이용하여 생성한 인접행렬(A)을 나타낸 것이다.FIG. 7 shows an adjacency matrix A generated using the structure information provided by the table 50 shown in FIG. 6 .

상기 인접행렬(A)의 행의 개수와 열의 개수는 각각, 상기 신호전달네트워크(1)에 포함된 노드들의 개수와 동일하다. 도 5 내지 도 7의 예에서, 상기 신호전달네트워크(1)에 포함된 노드들의 개수는 4이므로, 상기 인접행렬(A)은 4×4의 정방행렬이 된다.The number of rows and the number of columns of the adjacent matrix A are the same as the number of nodes included in the signal transmission network 1, respectively. 5 to 7, since the number of nodes included in the signaling network 1 is 4, the adjacency matrix A becomes a 4x4 square matrix.

상기 인접행렬(A)의 모든 요소는 '0'의 값으로 초기화될 수 있다.All elements of the adjacency matrix A may be initialized to a value of '0'.

그 다음, 상기 인접행렬(A)에 포함된 요소들 중, 상기 신호전달네트워크(1)에 포함된 링크의 개수(N)와 동일한 N개의 요소들의 값을 +1 또는 -1의 값으로 치환한다.Then, among the elements included in the adjacency matrix A, the values of N elements equal to the number N of links included in the signaling network 1 are replaced with values of +1 or -1. .

이때, +1 또는 -1의 값으로 치환되는 요소의 위치(s, t)는 각 링크의 소스 노드의 식별자 인덱스(s) 및 타겟 노드의 식별자 인덱스(t)에 의해 결정된다. 그리고 상기 치환되는 값은 각 링크의 타입에 의해 결정된다.In this case, the position (s, t) of the element replaced with a value of +1 or -1 is determined by the identifier index (s) of the source node of each link and the identifier index (t) of the target node. And the substituted value is determined by the type of each link.

예컨대, 도 6의 테이블(50)에서, 링크ID가 L1인 제1링크(L1)의 소스 노드의 식별자 인덱스(s)는 1이고, 상기 제1링크(L1)의 타겟 노드의 식별자 인덱스(t)는 2이다. 따라서 도 7에 나타낸 인접행렬(A) 중 제s열(=제1열) 및 제t행(=제2행)이 교차하는 위치의 요소를 +1 또는 -1로 치환한다. 이때, 상기 치환되는 값은, 상기 제1링크(L1)의 링크타입이 제1타입이므로 +1로 치환된다.For example, in the table 50 of FIG. 6 , the identifier index s of the source node of the first link L1 whose link ID is L1 is 1, and the identifier index t of the target node of the first link L1 is ) is 2. Accordingly, in the adjacent matrix A shown in FIG. 7 , an element at a position where the s-th column (= first column) and the t-th row (= second row) intersect is substituted with +1 or -1. In this case, the substituted value is substituted with +1 because the link type of the first link L1 is the first type.

또 다른 예로서, 도 6의 테이블(50)에서, 링크ID가 L3인 제3링크(L3)의 소스 노드의 식별자 인덱스(s)는 3이고, 상기 제3링크(L3)의 타겟 노드의 식별자 인덱스(t)는 1이다. 따라서 도 7에 나타낸 인접행렬(A) 중 제s열(=제3열) 및 제t행(=제1행)이 교차하는 위치의 요소를 +1 또는 -1로 치환한다. 이때, 상기 치환되는 값은, 상기 제3링크(L3)의 링크타입이 제2타입이므로 -1로 치환된다.As another example, in the table 50 of FIG. 6 , the identifier index s of the source node of the third link L3 whose link ID is L3 is 3, and the identifier of the target node of the third link L3 is The index (t) is 1. Accordingly, in the adjacent matrix A shown in FIG. 7, an element at a position where the s-th column (= third column) and the t-th row (= first row) intersect is substituted with +1 or -1. In this case, the substituted value is substituted with -1 because the link type of the third link L3 is the second type.

이하, 상기 제5실시예의 대각행렬(D)을 생성하는 단계(S212)의 예를 설명한다. 도 7에 나타낸 인접행렬(A)에 의해 생성되는 대각행렬(D)는 도 8과 같이 주어진다는 점을, 상기 수식 4의 아래쪽에 설명한 대각행렬의 정의로부터 쉽게 이해할 수 있다. 대각행렬(D)은 입력 대각행렬(Din)과 출력 대각행렬(Dout)로 구성될 수 있다.Hereinafter, an example of the step S212 of generating the diagonal matrix D of the fifth embodiment will be described. The fact that the diagonal matrix D generated by the adjacency matrix A shown in FIG. 7 is given as shown in FIG. 8 can be easily understood from the definition of the diagonal matrix described below in Equation 4 above. The diagonal matrix D may include an input diagonal matrix Din and an output diagonal matrix Dout.

도 8은 본 발명의 일 실시예에 따라 생성되는 대각행렬의 일 예를 나타낸 것이다.8 shows an example of a diagonal matrix generated according to an embodiment of the present invention.

이하, 제5실시예의 가중치의 값을 산출하는 단계(S213)의 예를 설명한다. 상기 가중치 생성 단계(S201)에서 대푯값이 상기 제1개수 및 상기 제2개수의 기하평균의 역이 되도록 도 7에 나타낸 인접행렬(A)과 도 8에 나타낸 대각행렬(D)의 행렬곱을 연산함으로써(수식 4 참고), 도 9에 예시한 가중치 행렬(W)를 얻을 수 있다. 이때, 예컨대, 상기 가중치 행렬(W)의 W₂₁ 부분은 제1링크(L1)의 가중치 값을 나타낸다.Hereinafter, an example of the step S213 of calculating the weight value of the fifth embodiment will be described. By calculating the matrix product of the adjacency matrix A shown in FIG. 7 and the diagonal matrix D shown in FIG. 8 so that the representative value is the inverse of the geometric mean of the first number and the second number in the weight generation step S201 (refer to Equation 4), the weight matrix W illustrated in FIG. 9 can be obtained. In this case, for example, the W ₂₁ part of the weight matrix W represents the weight value of the first link L1.

도 9는 본 발명의 일 실시예에 따라 생성되는 가중치 행렬의 일 예를 나타낸 것이다.9 illustrates an example of a weight matrix generated according to an embodiment of the present invention.

본 발명의 일 실시예에서, 가중치 행렬을 생성하기 위하여, 링크에 연결된 노드의 디그리로 나눠줌으로써 (또는 인디그리와 아웃디그리의 기하평균값으로 나눠줌으로써) 가중치를 생성하는 프로세스를 이용하는 경우, 상기 프로세스를 링크 웨이트 노말라이제이션(link weight normalization)이라고 부를 수 있다. 가중치를 생성하는 과정 그 자체가 링크 웨이트를 노말라이제이션하는 과정은 아니지만, 링크 웨이트를 노말라이제이션하는 과정을 통해 가중치를 생성할 수 있다.In one embodiment of the present invention, when using a process of generating weights by dividing by degrees of nodes connected to a link (or by dividing by geometric mean values of Indigri and Outdegree) to generate a weight matrix, the above process is This can be called link weight normalization. Although the process of generating the weight itself is not a process of normalizing the link weight, the weight can be generated through the process of normalizing the link weight.

상기 단계(S223)를 실행하여 생성된 가중치 행렬은 상기 단계(S213)를 실행하여 생성된 가중치 행렬과 동일할 수 있다.The weight matrix generated by executing step S223 may be the same as the weight matrix generated by executing step S213.

<인플루언스><Influence>

본 발명의 일 실시예에서는 상기 가중치 행렬을 더 활용할 수 있다.In an embodiment of the present invention, the weight matrix may be further utilized.

본 발명에서 '인플루언스(influence)'라는 용어는 신호전달네트워크에서 특정 출력노드(output)가 지정돼 있을 때 어떤 다른 노드의 활성도 변화가 신호흐름을 통해 상기 출력노드의 활성도에 미치는 영향의 정도를 나타내며, 상기 인플루언스(S_ij)라는 측정치(measure)는 수식 6과 같이 정의된다.In the present invention, the term 'influence' refers to the degree of influence of a change in the activity of any other node on the activity of the output node through the signal flow when a specific output node is designated in the signal delivery network. , and the _{measure of the influence (S ij} ) is defined as in Equation 6.

[수식 6][Equation 6]

S_ij = ∂x_i/ ∂b_j= (∂x_i/ ∂x_j)·(∂x_j/ ∂b_j)S _ij = ∂x _i / ∂b _j = (∂x _i / ∂x _j )·(∂x _j / ∂b _j )

여기서 x_i은 i이라는 인덱스를 갖는 제i노드의 활성도를 의미하고, b_j은 j이라는 인덱스를 갖는 제j노드(j)의 기저활성도를 나타낸다. 상기 제i노드를 출력노드라고 지칭할 수 있다.Here, x _i denotes the activity of the i-th node having the index i, and b _j denotes the basal activity of the j-th node j having the index j. The i-th node may be referred to as an output node.

따라서 인플루언스(S_ij)는 제j노드(j)의 기저활성도의 변화에 따른 제i노드의 활성도의 변화율을 나타내는 값이다.Therefore, the influence (S _ij ) is a value representing the rate of change in the activity of the i-th node according to the change in the basal activity of the j-th node (j).

수학적으로, 인플루언스(S_ij)는 제j노드(j)의 기저활성도 b_j의 변화에 대한 제i노드(i)의 활성도(x_i)의 변화율(rate of change) 또는 편미분을 의미한다. 예를 들어, 인플루언스는 돌연변이나 약물로 인해 어떤 신호전달분자(제j노드(j))의 기저활성도가 변할 때 세포의 사멸이나 증식을 조절하는 단백질(출력노드인 제i노드(i))의 활성도가 얼마나 변하는지(i.e., 변화율)를 나타낸다.Mathematically, the influence (S _ij ) means the rate of change or partial derivative of _{the activity (x i} ) of the i-th node (i) with respect to the change in _{the basal activity b j} of the j-th node (j). . For example, influence is a protein that regulates cell death or proliferation when the basal activity of a certain signaling molecule (j-node (j)) is changed due to mutation or drug (output node, i-node (i)) It indicates how much the activity of (ie, rate of change) changes.

인플루언스의 값은 부호 관점 및 크기 관점에서 해석할 수 있다. 출력노드인 제i노드(i)에 대한 어떤 제j노드(j)의 인플루언스(S_ij)가 음수인 경우, 제j노드(j)의 활성도가 증가하면 출력노드인 제i노드(i)의 활성도는 감소한다는 것을 의미한다. 반대로 인플루언스(S_ij)가 양수인 경우, 제j노드(j)의 활성도가 증가하면 출력노드인 제i노드(i)의 활성도는 증가한다는 것을 의미한다. 예를 들어, 어떤 신호전달 단백질을 저해하여 활성도를 감소시킴으로써 세포 증식에 관련된 단백질의 활성도를 감소시키고자 한다면, 세포증식 관련 단백질에 대해 양의 인플루언스를 갖는 신호전달 단백질을 선택해야 하는 것이다. 그리고 인플루언스의 크기가 클수록 출력노드에 미치는 영향이 크다는 것을 의미하기 때문에 세포상태를 제어하고자 하는 목표에 따라 인플루언스의 부호를 결정하고, 인플루언스의 크기에 따라 제어표적 노드들의 우선순위(priority) 또는 순위(rank)를 매길 수 있다.The value of influence can be interpreted in terms of sign and magnitude. _{If the influence (S ij} ) of a certain j-th node (j) on the output node i-th node (i) is negative, if the activity of the j-th node (j) increases, the output node i-th node (i) ) means that the activity decreases. Conversely, when the influence (S _ij ) is a positive number, it means that when the activity of the j-th node (j) increases, the activity of the i-th node (i), which is an output node, increases. For example, if it is desired to reduce the activity of a protein related to cell proliferation by inhibiting a certain signaling protein to decrease its activity, a signaling protein having a positive influence on the cell proliferation related protein should be selected. And since it means that the larger the size of the influence, the greater the effect on the output node. Therefore, the sign of the influence is determined according to the goal to control the cell state, and the priority of the control target nodes according to the size of the influence. (priority) or rank (rank) can be assigned.

제j노드(j)를 타겟으로 하는 약품을 투여했을 때에, 약품 타겟 노드인 제j노드(j)의 기저활성도(b_j)가 변화할 것이다. 그래서 상기 약품을 투여함에 따라 제j노드(j)의 기저활성도(b_j)가 변화하고, 그 결과 상기 출력노드인 제i노드(i)의 활성도가 어떻게 변하는지를 관찰할 수 있다.When a drug targeting the j-th node j is administered, the basal activity b _j of the j-th node j, which is the drug target node, will change. So, one can observe how the activity of the j-th node (j) changes in the basal activity (b _j), and the result of the i-node (i) the output node as the administration of the drug varies.

<가중치를 이용한 인플루언스의 산출방법 1><Method of calculating influence using weights 1>

본 발명에서는, 각 노드의 활성도를 추정하기 위하여 상기 정의한 인플루언스의 개념을 적용한다. 우선 아래의 수식 7을 이용한다.In the present invention, the concept of influence defined above is applied to estimate the activity of each node. First, use Equation 7 below.

[수식 7][Equation 7]

여기서 x∈R^N, W∈R^N×N, α∈R 이다. where ^{x∈R N} , W∈R ^N×N , α ∈R .

수식 7이 수식 2와 다른 점은 (1-α) 항 대신에 β (β∈R)를 도입한 것인데, 이는 기존 수식을 좀 더 일반화하기 위해 수식 7에서 신호흐름의 영향을 나타내는 Wx(t)와 기저활성도의 영향을 나타내는 b의 가중치 합에 사용되는 가중치의 합 α+β가 반드시 1이라는 제약 조건을 완화한 것이다 (반면, 수식 2에서는 α값이 정해지면 자동으로 β는 1-α로 결정된다). 하지만 보통 α+β=1이 되도록 α와 β를 설정하는 것이 알고리즘의 해를 구하는 수치해석의 안전성(stability)을 보장한다. The difference between Equation 7 and Equation 2 is that β ( β ∈R) is introduced instead of the (1-α ) term, which is Wx(t), which represents the effect of signal flow in Equation 7 to make the existing equation more general. The constraint that the sum of weights α + β used in the sum of weights of b and b indicating the effect of basal activity is always 1 is relaxed. (On the other hand, in Equation 2, when the value of α is determined, β is automatically determined as 1- α. do). However, in general , setting α and β so that α + β = 1 guarantees the stability of the numerical analysis that solves the algorithm.

수식 7의 x(t)에 대하여 인플루언스 행렬 S는 수식 8과 같이 유도된다.For x(t) in Equation 7, the influence matrix S is derived as Equation 8.

[수식 8][Equation 8]

본 실시예에서는 정상상태(steady-state)일 때의 인플루언스 S(∞)를 이용한다. 실제로 S(∞)를 구하기 위한 수치해석 과정에서는 행렬 놈(norm), ||S(t+1) - S(t)||의 값이 특정 값(e.g., tol=1e^-6) 이하로 수렴할 때의 S(t)의 값을 S(∞)의 근사값으로 사용한다.In this embodiment, the influence S(∞) in the steady-state is used. In fact, in the numerical analysis process to obtain S(∞), the values of the matrix norm, ||S(t+1) - S(t)||, converge below a ^{specific value (eg, tol=1e -6 ).} Use the value of S(t) as an approximation of S(∞).

이하, 도 10을 참조하여 인플루언스를 계산하는 방법을 설명한다. 도 10은 인플루언스의 값을 계산하는 개념을 설명하기 위한 것이다.Hereinafter, a method of calculating the influence will be described with reference to FIG. 10 . 10 is for explaining the concept of calculating the value of influence.

도 10은 3-노드 케스케이드(cascade) 구조를 갖는 신호전달네트워크의 예를 나타낸다. 10 shows an example of a signaling network having a 3-node cascade structure.

도 10은 세 개의 노드가 차례로 연결된 신호전달 케스케이드(cascade)로서 가장 단순한 구조 중 하나이다. 노드 x₁의 노드 x₂에 대한 인플루언스(S₂₁)는 수식 9와 같이 계산된다. 수식 9에서 b₁은 노드 x₁의 기저 활성도를 나타내며 b₂은 노드 x₂의 기저 활성도를 나타낸다.10 is one of the simplest structures as a signaling cascade in which three nodes are sequentially connected. The influence (S ₂₁ ) of the node x _{1 on} the node x ₂ is calculated as in Equation 9. In Equation 9, b ₁ represents the basal activity of the node x ₁ _{and b 2} represents the basal activity of the node x ₂ .

[수식 9][Equation 9]

한 단계 더 나아가, 노드 x₁의 노드 x₃에 대한 인플루언스(S₃₁)는 수식 10과 같다.Going one step further, the influence (S ₃₁ _{) of the node x 1 on} the node x ₃ is as in Equation 10.

[수식 10][Equation 10]

이하, 도 11을 참조하여 인플루언스를 계산하는 방법을 설명한다. 도 11은 인플루언스의 값을 계산하는 개념을 설명하기 위한 것이다.Hereinafter, a method of calculating the influence will be described with reference to FIG. 11 . 11 is for explaining the concept of calculating the value of influence.

도 11은 3개의 경로를 갖는 피드포워드(feed-forward) 구조를 갖는 신호전달네트워크의 예를 나타낸다. 11 shows an example of a signaling network having a feed-forward structure having three paths.

도 11은 노드 x₁부터 x₆까지 3개의 경로를 갖는 피드포워드 구조이며, 이에 대한 인플루언스 계산은 수식 11과 같다.11 is _{a feedforward structure having three paths from a node x 1} to x ₆ , and the influence calculation therefor is as shown in Equation 11.

[수식 11][Equation 11]

도 10과 도 11의 구조에 대한 인플루언스 계산 수식에서 우리가 알 수 있는 것은, 인플루언스(S_ij)는 소스 노드 j로부터 출력노드 i에 이르는 모든 경로에서 각 경로 상의 링크 가중치들의 곱(ΠW)을 합한 결과라는 것이다. 예를 들어, 도 11의 S₆₁은 가중치 W의 곱으로 이루어진 항을 3개 갖고 있는데, 각 항은 도 11의 피드포워드 상의 각 경로에 대응되는 것을 알 수 있다. 즉, 예컨대 W₆₄·W₄₂·W₂₁는 경로 x₁ - x₂ - x₄ - x₆에 대응된다.What we can know from the influence calculation formulas for the structures of FIGS. 10 and 11 is that the influence (S _ij ) is the product of the link weights on each path in all paths from the source node j to the output node i ( It is the result of summing ΠW). For example, S ₆₁ of FIG. 11 has three terms made up of a product of a weight W, and it can be seen that each term corresponds to each path on the feedforward of FIG. 11 . That is, for example, W ₆₄ ·W ₄₂ ·W ₂₁ corresponds to the path x ₁ - x ₂ - x ₄ - x _{6 .}

이하, 도 12 및 도 13을 참조하여 인플루언스를 계산하는 방법을 설명한다. 도 12 및 도 13은 인플루언스의 값을 계산하는 개념을 설명하기 위한 것이다.Hereinafter, a method of calculating the influence will be described with reference to FIGS. 12 and 13 . 12 and 13 are for explaining the concept of calculating an influence value.

도 12는 3-노드 음성피드백루프(negative feedback loop) 구조를 갖는 신호전달네트워크의 예를 나타낸다.12 shows an example of a signaling network having a three-node negative feedback loop structure.

그리고 도 13은 3-노드 음성피드백루프에 대한 루프 언롤링(unrolling) 구조를 갖는 신호전달네트워크의 예를 나타낸다. 도 13에서, t는 언롤링 정도를 결정하게 된다. And FIG. 13 shows an example of a signaling network having a loop unrolling structure for a 3-node voice feedback loop. In FIG. 13 , t determines the degree of unrolling.

도 12의 음성피드백루프 구조에 대한 인플루언스 계산은 케스케이드나 피드포워드 구조보다는 약간 더 복잡하다. 이와 같은 구조는 루프(loop) 또는 싸이클(cycle)을 포함하고 있기 때문에 경로를 무한하게 고려할 수 있다. 도 12에서 노드 중복을 허용했을 때 노드 1으로부터 노드 3에 이르는 경로는 싸이클로 인해 무한하게 많아지게 된다. 정상상태일 때 인플루언스는 t → ∞ 로 가정하여 구하게 되는데 이는 루프 언롤링(loop unrolling) 관점에서 무한한 길이의 경로에 대한 인플루언스를 계산하는 것으로 생각할 수 있다(도 13 참고). 하지만 인플루언스 계산을 위한 실제 수치해석에서는 ||S(t+1) - S(t)||의 값이 매우 작아지므로 인플루언스 계산을 위한 루프의 반복횟수를 어떤 자연수 T로 한정할 수 있다. 가중치 W와 파라미터 α 및 β의 값에 따라 인플루언스 계산이 발산하는 경우도 있는데 이와 같은 경우 인플루언스 계산을 위한 루프의 반복횟수를 특정한 값의 자연수인 T _L 로 지정해 줄 수도 있다. 경험적으로 W는 상술한 가중치 추정 알고리즘으로 구하며, α = 0.9 및 β = 0.1로 정할 수도 있다.The influence calculation for the voice feedback loop structure of FIG. 12 is slightly more complicated than the cascade or feedforward structure. Since such a structure includes a loop or a cycle, the path can be considered infinitely. When node redundancy is allowed in FIG. 12 , paths from node 1 to node 3 become infinitely numerous due to the cycle. In a steady state, the influence is obtained by assuming t → ∞, which can be considered as calculating the influence for a path of infinite length from the perspective of loop unrolling (refer to FIG. 13). However, in actual numerical analysis for influence calculation, the value of ||S(t+1) - S(t)|| becomes very small, so the number of iterations of the loop for influence calculation can be limited to a certain natural number T. have. In some cases, the influence calculation diverges depending on the weight W and the values of the parameters α and β . In this case, the number of repetitions of the loop for the influence calculation may be designated as T _{L, which is a natural number of a specific value.} Empirically, W is obtained by the above-described weight estimation algorithm, and α = 0.9 and β = 0.1 may be set.

<가중치를 이용한 인플루언스의 산출방법 2><Method of calculating influence using weights 2>

인플루언스(S_ij)를 구하는 제1실시예에서, 인플루언스(S_ij)는 수식 12와과 같이 주어질 수 있다. In the first embodiment of obtaining the influence (S _ij _{), the influence (S ij} ) may be given as in Equation (12).

[수식 12][Equation 12]

S = (1-α)*(I + αW + α ² W ²+ ... + α ^t ^-1 W ^t ^-1) S = (1- α )*( I + αW + α ² W ² + ... + α ^t ^-1 W ^t ^-1 )

수식 12는 수식 8에서 β=1-α 인 경우를 나타낸다.Equation 12 shows the case where β = 1- α in Equation 8.

수식 12에서 α는 사용자가 지정하여 입력할 수 있는 하이퍼파라미터이다. 그리고 W는 상술한 가중치 행렬을 나타낸다. 따라서 상기 가중치를 이용하여 상기 인플루언스 값의 행렬(S∈R^N×N)을 산출할 수 있다. 그 결과 제j노드(j)를 타겟으로 하는 약품을 투여했을 때에, 상기 출력노드인 제i노드의 활성도가 어떻게 변하는지를 추정할 수 있다. In Equation 12, α is a hyperparameter that the user can specify and input. And W denotes the above-described weight matrix. Accordingly, the matrix of the influence values (S∈R ^N×N ) may be calculated using the weights. As a result, when a drug targeting the j-th node (j) is administered, it can be estimated how the activity of the i-th node, which is the output node, changes.

<인플루언스를 이용한 활성도의 산출><Calculation of activity using influence>

단계(S30)에서는, 상기 상술한 가중치 W_ij 와 상기 수식 7을 이용하여 상술한 인플루언스(S_ij)의 값을 계산할 수 있다. 그리고 상기 계산된 인플루언스(S_ij)의 값을 이용하여 각 노드(i)의 활성도(x_i)에 관한 값을 산출할 수 있다. In step S30 , the value of the above-described influence S _ij may be calculated _{using the above-described weight W ij and Equation 7 above.} In addition, a value related to the activity (x _i ) of each node (i) may be calculated using the calculated value of the influence (S _{ij ).}

단계(S30)에서, 상기 각 노드의 활성도에 관한 값을 산출하기 위하여 수식 2 또는 수식 7을 사용할 수 있다.In step S30, Equation 2 or Equation 7 may be used to calculate a value related to the activity of each node.

단계(S40)에서, 상기 각각의 링크(3)들에 대하여 산출된 가중치(W_ij)와 상기 각각의 노드(2)들에 대하여 산출된 활성도(x_i)를 이용하여, 상기 신호전달네트워크의 신호흐름을 결정할 수 있다.In step S40, using the weight (W _ij _{) calculated for each of the links (3) and the activity (x i} ) calculated for each of the nodes (2), signal flow can be determined.

<SI-플롯(SI-plot)을 활용한 제어표적 발굴><Excavation of control targets using SI-plot>

본 발명자들은 상기 설명한 인플루언스와 더불어 출력노드에 대한 최단경로길이(the shortest path length to output, 이하 SPLO)를 함께 분석하면, 세포의 증식이나 사멸을 제어할 수 있는 효과적인 제어표적을 발굴할 수 있다는 것을 알게 되었다. By analyzing the shortest path length to output (hereinafter, SPLO) of the output node together with the influence described above, the present inventors can discover effective control targets that can control the proliferation or death of cells. found out that

도 14는 EMT 조절 신호전달네트워크를 나타낸다.14 shows an EMT control signaling network.

상기 EMT 조절 신호전달네트워크는 EMT 노드를 유일한 출력으로 갖는다. The EMT control signaling network has an EMT node as its only output.

도 15 및 도 16은 도 14에 제시한 EMT 조절 네트워크에 대한 인플루언스 계산 결과의 예를 나타낸 것이다.15 and 16 show examples of influence calculation results for the EMT control network shown in FIG. 14 .

도 15는 도 14에 EMT 조절 신호전달네트워크에 대한 인플루언스 계산 결과이다. 15 is an influence calculation result for the EMT control signaling network in FIG. 14 .

도 15에서 가로축은 상기 EMT 조절 네트워크의 노드들 중 EMT 노드에 대한 인플루언스 값을 나타낸 것이고, 세로축은 EMT 조절 신호전달네트워크의 각 노드의 식별자를 나타낸 것이다. 예컨대 TWIST1 노드의 EMT 노드에 대한 인플루언스는 1에서 2 사이의 값을 갖고, EGFR 노드의 EMT 노드에 대한 인플루언스는 0에서 1 사이의 값을 갖고, E-Cadhorin 노드의 EMT 노드에 대한 인플루언스는 약 -3의 값을 갖는다는 것을 확인할 수 있다. 인플루언스 값은 실수(real number)일 수 있다.In FIG. 15 , the horizontal axis indicates an influence value for an EMT node among the nodes of the EMT control network, and the vertical axis indicates an identifier of each node of the EMT control signaling network. For example, the influence on the EMT node of the TWIST1 node has a value between 1 and 2, the influence on the EMT node of the EGFR node has a value between 0 and 1, and the influence on the EMT node of the E-Cadhorin node. It can be seen that s has a value of about -3. The influence value may be a real number.

도 15에서는 인플루언스 계산 결과를 인플루언스값을 기준으로 하는 내림차순으로 정렬하였다. 도 15에 나타낸 막대그래프 옆의 숫자는 EMT 출력노드에 대한 최단경로길이(SPLO)를 나타낸다. 최단경로는 자연수일 수 있다. 참고로 인플루언스와 SPLO의 피어슨 상관계수는 -0.753이다. In FIG. 15, the influence calculation results are arranged in descending order based on the influence value. The number next to the bar graph shown in FIG. 15 indicates the shortest path length (SPLO) for the EMT output node. The shortest path may be a natural number. For reference, the Pearson correlation coefficient between influence and SPLO is -0.753.

본 발명에서 제1노드로부터 제2노드까지의 최단경로길이(Shortest Path Length, SPL)는, 상기 제1노드로부터 제2노드까지 도달하기 위하여, 제1노드와 제2노드를 서로 연결하는 한 개 이상의 링크들로부터 선택될 수 있는 한 개 이상의 경로들 중, 각 경로에 포함된 링크의 개수가 가장 작은 최단경로에 포함된 상기 링크의 수를 나타낸다. In the present invention, the Shortest Path Length (SPL) from the first node to the second node is one connecting the first node and the second node to each other in order to reach the second node from the first node. Among one or more paths that can be selected from the above links, the number of links included in the shortest path having the smallest number of links included in each path is indicated.

도 16은 인플루언스를 SPLO와 함께 표시한 것으로서 이하 SI-플롯(SPLO-Influence plot)이라고 지칭할 수 있다. 16 shows influence together with SPLO, which may be referred to as an SI-plot (SPLO-Influence plot) below.

SI-플롯은 가로축과 세로축을 갖는다. 상기 가로축은 주어진 노드로부터 EMT 노드까지의 SPLO의 값을 나타낸 것이고, 상기 세로축은 상기 주어진 노드의 EMT 노드에 대한 인플루언스 값을 나타낸 것이다. 상기 가로축은 자연수를 나타낼 수 있으며, 상기 세로축은 실수값을 나타낼 수 있다. 가로축에서 우측으로 갈수록 SPLO의 값이 커지도록 설정하였으며, 세로축에서 위쪽으로 갈수록 인플루언스의 값이 커지도록 설정하였다.The SI-plot has a horizontal axis and a vertical axis. The horizontal axis represents the SPLO value from the given node to the EMT node, and the vertical axis represents the influence value of the given node to the EMT node. The horizontal axis may represent a natural number, and the vertical axis may represent a real number. The value of SPLO was set to increase from the horizontal axis to the right, and the value of influence was set to increase from the vertical axis to the top.

제1 SI-플롯이 제1출력노드에 관한 것이라면, 상기 제1 SI-플롯과 다른 제2 SI-플롯은 상기 제1출력노드와는 다른 제2출력노드에 관한 것일 수 있다. 예컨대, 도 16은 EMT 노드에 관한 SI-플롯이다.If the first SI-plot relates to a first output node, the second SI-plot different from the first SI-plot may relate to a second output node different from the first output node. For example, FIG. 16 is an SI-plot for an EMT node.

도 16에 나타낸 EMT 노드에 관한 SI-플롯에는, EMT 노드를 제외한 다른 특정 노드를 나타내는 정보가 표시될 수 있다. 예컨대 도 16의 SI-플롯에는 TWIST1 노드가 표시될 수 있으며, 그 표시 위치는 TWIST1 노드로부터 EMT 노드까지의 SPLO 및 TWIST1 노드의 EMT 노드에 대한 인플루언스 값(S_{EMT_TWIST1})에 대응하는 위치이다.In the SI-plot for the EMT node shown in FIG. 16 , information indicating other specific nodes other than the EMT node may be displayed. For example, the TWIST1 node may be displayed in the SI-plot of FIG. 16 , and the displayed position is a position corresponding to the SPLO from the TWIST1 node to the EMT node and the influence value (S _{EMT_TWIST1} ) of the TWIST1 node for the EMT node.

상술한 설명으로부터 상기 제1출력노드에 대한 제1 SI-플롯에는 상기 제1출력노드를 제외한 다른 노드들이 플롯될 수 있다는 점을 이해할 수 있다. 또한 상기 제1 SI-플롯의 세로축은 주어진 임의의 노드에 대한 상기 제1출력노드에 대한 인플루언스 값을 나타내며, 가로축은 상기 주어진 임의의 노드로부터 상기 제1출력노드까지의 SPLO를 나타낸다.From the above description, it can be understood that nodes other than the first output node may be plotted in the first SI-plot for the first output node. In addition, the vertical axis of the first SI-plot represents the influence value of the first output node for a given arbitrary node, and the horizontal axis represents the SPLO from the given arbitrary node to the first output node.

도 16에서 노드의 이름에 밑줄이 추가되어 표시된 노드(삼각형으로 표시된 노드)는, 자세한 로직 정보를 기반으로 구축한 불리언 모델에 대해 본 발명자들이 단일 저해(single inhibition) 또는 이중 저해(dual inhibition)의 모든 조건에서 탐색하여 찾은 '제어표적노드(control target node)'이다. In FIG. 16 , the node indicated by adding an underscore to the name of the node (the node indicated by a triangle) is the result of single inhibition or dual inhibition by the present inventors for a Boolean model built based on detailed logic information. It is a 'control target node' found by searching under all conditions.

도 16으로부터 인플루언스의 절대값이 큰 경우 SPLO는 작은 경향이 있다는 것을 쉽게 알 수 있다(피어슨 상관계수 -0.753). 이는 출력노드(도 16에서는 EMT 노드가 출력노드임)에 가까울수록 영향력이 크다는 것을 의미하며 상식적으로 이해하기 쉬운 결과라고 볼 수 있다. It can be easily seen from FIG. 16 that when the absolute value of influence is large, the SPLO tends to be small (Pearson's correlation coefficient -0.753). This means that the closer to the output node (the EMT node is the output node in FIG. 16), the greater the influence, and it can be seen as a result that is easy to understand by common sense.

하지만 이와 같은 경향을 따르지 않는 노드들이 있다. 예를 들어, ZEB2는 SPLO=2이지만 동일한 SPLO를 갖는 노드들보다 인플루언스가 상대적으로 작다. 반대로, NOTCH(SPLO=5)나 RAS(SPLO=6)의 경우 동일한 SPLO를 갖는 노드들보다 인플루언스가 상대적으로 크다.However, there are nodes that do not follow this trend. For example, ZEB2 has SPLO=2 but has relatively less influence than nodes with the same SPLO. Conversely, in the case of NOTCH (SPLO = 5) or RAS (SPLO = 6), the influence is relatively larger than that of nodes having the same SPLO.

이하 도 16을 참조하여 설명한다.It will be described below with reference to FIG. 16 .

출력노드(EMT 노드)에 가까울수록 인플루언스가 반드시 크지 않은 경우를 분석하기 위하여 본 발명자들은 도 16과 같이 인플루언스를 SPLO와 함께 표시해보았다(SI-플롯). 그 결과, 본 발명자들이 불리언 시뮬레이션 분석 및 생물학 실험 검증을 통해 발굴한 '제어표적노드'들이 SI-플롯에서 각 SPLO 값에 대하여 최상단에 위치하여 있다는 것을 발견하였다. 또한 실시예에 다라서는 상기 발굴된 '제어표적노드'들이 왼쪽 위에서부터 오른쪽 아래 방향으로 향하는 대각선 상에 분포하는 경향이 있다는 것을 발견하였다. 이는 인플루언스의 절대값이 크지 않아도 동일 SPLO 그룹 내에서 인플루언스가 크면 제어표적노드가 될 가능성이 높다는 것을 의미한다.In order to analyze the case where the closer to the output node (EMT node) the influence is not necessarily large, the present inventors displayed the influence together with SPLO as shown in FIG. 16 (SI-plot). As a result, it was found that the 'control target nodes' discovered by the present inventors through Boolean simulation analysis and biological experiment verification were located at the top of each SPLO value in the SI-plot. In addition, according to the embodiment, it was found that the discovered 'control target nodes' tend to be distributed in a diagonal direction from the upper left to the lower right. This means that even if the absolute value of influence is not large, if the influence is large within the same SPLO group, it is highly likely to become a control target node.

즉, 제어관점에서 인플루언스의 절대값이 중요하지만 각 SPLO 그룹 내에서 인플루언스의 상대적 크기도 중요하다는 것을 의미한다. 본 발명자들은 이와 같은 경향을 혈액암 세포 관련 신호전달 네트워크(Zanudo et al. PLoS Comp. Biol. 11(4), e1004193) 및 다양한 규모의 대장암 신호전달네트워크(Fumia et al. PLoS ONE, 8(7), e69008, Cho et al. BMC Sys. Biol., 10(1), 96)에 대해서도 확인하였다. This means that the absolute value of influence is important from a control point of view, but the relative size of influence within each SPLO group is also important. The present inventors analyzed this trend in the blood cancer cell-related signaling network (Zanudo et al. PLoS Comp. Biol. 11(4), e1004193) and the colorectal cancer signaling network of various scales (Fumia et al. PLoS ONE, 8 ( 7), e69008, Cho et al., BMC Sys. Biol., 10(1), 96).

앞서 살펴본 결과를 근거로 본 발명에서 제어표적을 발굴하는 방법은, 인플루언스와 SPLO를 계산하여 SI-플롯을 작성한 이후 각 SPLO 그룹에서 인플루언스가 높은 노드들을 제어표적으로 선택하는 방법이다. 본 발명에서 제어표적을 발굴하는 방법을 정리하면 도 17에 나타낸 순서도와 같다.The method of discovering a control target in the present invention based on the results discussed above is a method of selecting nodes with high influence in each SPLO group as a control target after creating an SI-plot by calculating the influence and SPLO. The method of excavating the control target in the present invention is summarized as the flowchart shown in FIG. 17 .

<단일 출력노드에 대한 신호흐름 제어표적 결정 방법><Method of determining signal flow control target for single output node>

도 17은 본 발명의 일 실시예에 따라 신호전달네트워크에서의 제어표적을 결정하는 방법을 나타낸 순서도이다.17 is a flowchart illustrating a method of determining a control target in a signaling network according to an embodiment of the present invention.

단계(S110)에서, 컴퓨팅 장치는, 제1신호전달네트워크 및 제1신호전달네트워크의 제1출력노드를 입력으로 받는다.In step S110 , the computing device receives the first signaling network and the first output node of the first signaling network as inputs.

단계(S120)에서, 상기 컴퓨팅 장치는, 상기 제1신호전달네트워크에 포함된 임의의 노드의 상기 제1출력노드에 대한 인플루언스를 계산하여, 상기 제1출력노드에 대한 한 개 이상 또는 복수 개의 인플루언스 값을 산출한다.In step S120, the computing device calculates the influence on the first output node of any node included in the first signal delivery network, and one or more or a plurality of the first output nodes Calculate the influence value of dogs.

단계(S130)에서, 상기 컴퓨팅 장치는, 상기 각 임의의 노드들로부터의 상기 제1출력노드에 대한 복수 개의 SPLO들을 산출한다.In step S130, the computing device calculates a plurality of SPLOs for the first output node from each of the arbitrary nodes.

단계(S140)에서, 상기 컴퓨팅 장치는, 상기 산출한 복수 개의 인플루언스 값들 및 상기 복수 개의 SPLO들을 기초로 상기 제1출력노드에 대한 제1 SI-플롯을 생성한다. 이때 상기 제1 SI-플롯을 생성한다는 것은 상기 제1 SI-플롯의 생성에 필요한 데이터를 생성하는 것일 수 있다.In step S140 , the computing device generates a first SI-plot for the first output node based on the calculated plurality of influence values and the plurality of SPLOs. In this case, generating the first SI-plot may mean generating data necessary for generating the first SI-plot.

제1출력노드에 대하여 상술한 단계(S110) 내지 단계(S140)을 이용하여 제1 SI-플롯을 생성할 수 있다는 점을 이해할 수 있다. 또한 상기 제1출력노드와는 다른 제2출력노드에 대하여 상기 단계(S110) 내지 단계(S140)를 적용하면 상기 제1 SI-플롯과는 다른 제2 SI-플롯을 생성할 수 있다는 점을 이해할 수 있다.It can be understood that the first SI-plot can be generated using the above-described steps S110 to S140 with respect to the first output node. Also, it will be understood that a second SI-plot different from the first SI-plot can be generated by applying the steps S110 to S140 for a second output node different from the first output node can

단계(S150)에서, 상기 제1 SI-플롯에서 SPLO 그룹별로 제어목표에 따라 노드를 선택한다. 선택된 노드들은 '제어표적노드'라고 지칭될 수 있다. In step S150, a node is selected according to a control target for each SPLO group in the first SI-plot. The selected nodes may be referred to as 'control target nodes'.

이때 각각의 상기 SPLO 그룹은 제1 SI-플롯의 가로축의 값에 의해 구별될 수 있다. 즉, 제1 SI-플롯 내에 플롯되는 노드들 중 서로 동일한 SPLO를 갖는 노드들은 동일한 SPLO 그룹에 속한 것으로 간주될 수 있다. 또한, 제1 SI-플롯 내에 플롯되는 제1노드의 SPLO와 제2노드의 SPLO가 서로 다르다면, 상기 제1노드와 상기 제2노드는 서로 다른 SPLO 그룹에 속한 것으로 간주될 수 있다.In this case, each of the SPLO groups may be distinguished by the value of the horizontal axis of the first SI-plot. That is, nodes having the same SPLO as each other among the nodes plotted in the first SI-plot may be regarded as belonging to the same SPLO group. In addition, if the SPLO of the first node and the SPLO of the second node plotted in the first SI-plot are different from each other, the first node and the second node may be regarded as belonging to different SPLO groups.

여기서 상기 제어표적노드를 선택하는 기준은 SPLO 그룹별로 독립적으로 설정될 수 있다. Here, the criterion for selecting the control target node may be independently set for each SPLO group.

예를 들어, 제1 SI-플롯 상에 플롯되는 임의의 특정 노드의 활성도를 낮춤으로써 상기 제1출력노드의 활성도를 낮추는 것이 제어목표라면, 각 SPLO 그룹에서 인플루언스의 부호는 양(+)이며, 상기 인플루언스의 절대값이 미리 설정한 임계값보다 큰 노드를 선택하여 상기 제어표적노드로 결정할 수 있다. 정량적으로 "인플루언스의 절대값이 미리 설정한 임계값보다 큰 노드"는 사용자가 정의한 인플루언스의 한계점(threshold)보다 큰 인플루언스를 갖는 노드인 것으로 정의할 수 있다. 예를 들어, "각 SPLO 그룹 내의 상위 3개" 혹은 "각 SPLO 그룹 내의 상위 5%"와 같이 정의할 수 있다.For example, if the control goal is to lower the activity of the first output node by lowering the activity of a specific node plotted on the first SI-plot, the sign of the influence in each SPLO group is positive (+). , it is possible to select a node having an absolute value of the influence greater than a preset threshold and determine it as the control target node. Quantitatively, "a node having an absolute value of influence greater than a preset threshold" may be defined as a node having an influence greater than a user-defined threshold of influence. For example, it can be defined as “top 3 within each SPLO group” or “top 5% within each SPLO group”.

다른 예로서, 제1 SI-플롯 상에 플롯되는 임의의 특정 노드의 활성도를 낮춤으로써 상기 제1출력노드의 활성도를 증가시키는 것이 제어목표라면, 위와는 다른 기준으로 상기 '제어표적노드'를 선택할 수 있다.As another example, if the control goal is to increase the activity of the first output node by lowering the activity of a specific node plotted on the first SI-plot, the 'control target node' may be selected based on a different criterion than the above. can

<복수 출력노드에 대한 신호흐름 제어표적 결정 방법><Method of determining signal flow control target for multiple output nodes>

여러 개의 출력노드가 존재하는 신호전달네트워크의 제어는 출력노드가 유일한 네트워크의 제어와 크게 다르지 않다. 각 출력노드 별로 SI-플롯을 구한 후 제어목표에 맞게 제어표적을 선택한다. The control of a signaling network with multiple output nodes is not much different from the control of a network with only one output node. After obtaining an SI-plot for each output node, a control target is selected according to the control target.

예를 들어, 어떤 신호전달네트워크가 사멸(apoptosis)을 나타내는 제1출력노드, 증식(proliferation)을 나타내는 제2출력노드, 및 전이(metastasis)를 나타내는 제3출력노드와 같이 3개의 출력노드를 갖는 경우, 보통 제어의 목표는 약물 저해를 통해 사멸 출력노드의 활성도는 증가시키고 증식 출력노드 및 전이 출력노드의 활성도는 감소시키는 것이 된다. For example, a signaling network may have three output nodes, such as a first output node indicating apoptosis, a second output node indicating proliferation, and a third output node indicating metastasis. In this case, the goal of normal control is to increase the activity of the death output node and decrease the activity of the proliferative output node and the transition output node through drug inhibition.

이와 같은 경우 사멸 출력노드에 대해 음의 인플루언스를 가지면서 증식 출력노드 및 전이 출력노드에 대해서는 양의 인플루언스를 갖는 노드를 '제어표적노드'로서 결정해야 한다. In this case, a node that has negative influence on the death output node and positive influence on the proliferation output node and transition output node should be determined as the 'control target node'.

도 18은 본 발명의 일 실시예에 따른 제어표적노드의 선별 과정을 나타낸 순서도이다.18 is a flowchart illustrating a process of selecting a control target node according to an embodiment of the present invention.

단계(S210)에서, 사멸 출력노드에 대한 SI-플롯인 제1 SI-플롯, 증식 출력노드에 대한 SI-플롯인 제2 SI-플롯, 전이 출력노드에 대한 SI-플롯인 제3 SI-플롯을 생성하기 위한 정보를 생성한다.In step S210, a first SI-plot that is an SI-plot for a dead output node, a second SI-plot that is an SI-plot for a proliferating output node, and a third SI-plot that is an SI-plot for a transitional output node information to create

단계(S220)에서, 상기 제1 SI-플롯에서 '제어표적노드'들을 선택하여 상기 선택된 '제어표적노드'들로 구성되는 제1제어표적 후보집합을 생성한다. 이 때 상기 '제어표적노드'들 각각의 인플루언스는 음의 부호를 가지며 그 크기(절대값)는 상기 제1 SI-플롯의 각 SPLO 그룹에서 상위권에 속한다. In step S220, 'control target nodes' are selected from the first SI-plot to generate a first control target candidate set including the selected 'control target nodes'. At this time, the influence of each of the 'control target nodes' has a negative sign and its magnitude (absolute value) belongs to a higher rank in each SPLO group of the first SI-plot.

단계(S230)에서, 상기 제2 SI-플롯에서 '제어표적노드'들을 선택하여 상기 선택된 '제어표적노드'들로 구성되는 제2제어표적 후보집합을 생성한다. 이 때 상기 '제어표적노드'들 각각의 인플루언스는 양의 부호를 가지며 그 크기(절대값)는 상기 제2 SI-플롯의 각 SPLO 그룹에서 상위권에 속한다.In step S230, 'control target nodes' are selected from the second SI-plot to generate a second control target candidate set including the selected 'control target nodes'. At this time, the influence of each of the 'control target nodes' has a positive sign, and its magnitude (absolute value) belongs to a higher rank in each SPLO group of the second SI-plot.

단계(S240)에서, 상기 제3 SI-플롯에서 '제어표적노드'들을 선택하여 상기 선택된 '제어표적노드'들로 구성되는 제3제어표적 후보집합을 생성한다. 이 때 상기 '제어표적노드'들 각각의 인플루언스는 양의 부호를 가지며 그 크기(절대값)는 상기 제3 SI-플롯의 각 SPLO 그룹에서 상위권에 속한다.In step S240, 'control target nodes' are selected from the third SI-plot to generate a third control target candidate set including the selected 'control target nodes'. At this time, the influence of each of the 'control target nodes' has a positive sign, and its magnitude (absolute value) belongs to a higher rank in each SPLO group of the third SI-plot.

상기 단계(S220), 단계(S230), 및 단계(S240)에서, "상위권에 속한다"는 것은, 해당하는 인플루언스의 크기가 미리 결정된 임계값 이상이라는 것을 의미할 수 있다.In steps S220 , S230 , and S240 , “belonging to the upper rank” may mean that the size of the corresponding influence is greater than or equal to a predetermined threshold.

단계(S250)에서, 상기 제1제어표적 후보집합, 상기 제2제어표적 후보집합, 및 상기 제3제어표적 후보집합의 교집합으로부터 한 개 이상의 '최종 제어표적노드'를 선택할 수 있다. 상기 '최종 제어표적노드'는 최종적으로 약물 표적이 될 수 있다.In step S250, one or more 'final control target nodes' may be selected from the intersection of the first control target candidate set, the second control target candidate set, and the third control target candidate set. The 'final control target node' may finally become a drug target.

상기 단계(S210~S250)의 과정을 일반화하면 도 19에 나타낸 순서도와 같이 표현할 수 있다.If the process of the steps S210 to S250 is generalized, it can be expressed as in the flowchart shown in FIG. 19 .

도 19는 본 발명의 다른 실시예에 따른 제어표적노드의 선별 과정을 나타낸 순서도이다.19 is a flowchart illustrating a process of selecting a control target node according to another embodiment of the present invention.

단계(S310)에서, 특정 신호전달네트워크에서 정의된 복수 개의 출력노드들 각각에 대하여 SI-플롯을 생성하여 복수 개의 SI-플롯들을 생성한다. In step S310, an SI-plot is generated for each of a plurality of output nodes defined in a specific signaling network to generate a plurality of SI-plots.

단계(S320)에서, 상기 각각의 SI-플롯에 대하여 상술한 단일 출력노드에 대한 신호흐름 제어표적 결정 방법을 적용하여, 상기 각각의 SI-플롯의 출력노드 별로 설정된 제어목표에 따라 각 출력노드 별로 제어표적 후보집합을 구한다. 출력노드가 N개인 경우 상기 제어표적 후보집합도 N개 생성될 수 있다.In step S320, for each SI-plot, the signal flow control target determination method for a single output node is applied to each output node according to the control target set for each output node of each SI-plot. Obtain a control target candidate set. When there are N output nodes, N number of control target candidate sets may also be generated.

단계(S330)에서, 상기 제어표적 후보집합들의 교집합을 구하고, 상기 교집합에 포함되어 있는 노드들 중 하나 이상을 선택하여 최종 제어표적노드를 결정할 수 있다.In step S330, the intersection of the control target candidate sets may be obtained, and one or more of the nodes included in the intersection may be selected to determine the final control target node.

상술한 본 발명의 실시예들을 이용하여, 본 발명의 기술 분야에 속하는 자들은 본 발명의 본질적인 특성에서 벗어나지 않는 범위 내에 다양한 변경 및 수정을 용이하게 실시할 수 있을 것이다. 특허청구범위의 각 청구항의 내용은 본 명세서를 통해 이해할 수 있는 범위 내에서 인용관계가 없는 다른 청구항에 결합될 수 있다.By using the above-described embodiments of the present invention, those skilled in the art will be able to easily implement various changes and modifications within the scope without departing from the essential characteristics of the present invention. The content of each claim in the claims may be combined with other claims without reference within the scope that can be understood through this specification.

Claims

A method for determining a control target node to be controlled for controlling an output node of a signal transmission network, the method comprising:
The computing device calculates the influence on the first output node of each of the plurality of nodes included in the first signal transmission network, and calculates the influence of the plurality of influences corresponding to each of the plurality of nodes. word calculation step; and
The computing device controls the first output node by selecting some of the plurality of nodes according to a control target preset for the first output node based on the values of the plurality of influences A control target node determining step of determining as a control target node for;
includes,
The influence of any first node among the plurality of nodes on the first output node is a degree of an effect that a change in the activity of the first node has on a change in the activity of the first output node,
The change in the activity of the first node is the change in the basal activity of the first node (∂b _j ),
The influence of the first node on the first output node (S _ij ) satisfies Equation 1 ([Equation 1] S _ij = ∂x _i / ∂b _j ),
Determination method of control target node.

According to claim 1,
The computing device calculates a Shortest Path Length to Output (SPLO) from each of the plurality of nodes to the first output node between the step of calculating the influence and the step of determining the control target node, and the plurality of Further comprising the step of calculating a plurality of SPLOs corresponding to each of the nodes,
The control target node determination step is,
classifying, by the computing device, the plurality of nodes into a plurality of SPLO groups based on the value of the SPLO; and
A group-by-group determination step in which the computing device independently selects at least some of the nodes belonging to each SPLO group for each of the plurality of SPLO groups and determines as a control target node for controlling the first output node ;
includes,
The SPLO from any of the nodes to the first output node includes one or more links connecting the arbitrary node and the first output node to each other to reach the first output node from the arbitrary node. represents the number of links included in the shortest path with the smallest number of links included in each candidate path among one or more candidate paths that can be selected from
Determination method of control target node.

According to claim 1,
The signal transduction network is a network that models interactions between a plurality of biomolecules, and is defined by including a plurality of directional links and a plurality of nodes connected by the links,
Each node represents a biomolecule,
Determination method of control target node.

The method according to claim 1, wherein the activity of any node included in the first signaling network represents the amount of activated biomolecules represented by the arbitrary node.

The method according to claim 1, wherein the control target preset for the first output node is to increase or decrease the activity of the first output node.

The method according to claim 1, wherein the first output node is any one of a node indicating cell death, a node indicating proliferation of a cell, and a node indicating cell metastasis.

delete

A method for determining a control target node to be controlled for controlling an output node of a signal transmission network, the method comprising:
calculating, by a computing device, an influence on a first output node of each of a plurality of nodes included in a first signal delivery network, and calculating a plurality of influences corresponding to each of the plurality of nodes;
calculating, by the computing device, an SPLO from each of the plurality of nodes to the first output node, and calculating, by the computing device, a plurality of SPLOs corresponding to each of the plurality of nodes;
classifying, by the computing device, the plurality of nodes into a plurality of SPLO groups based on the value of the SPLO; and
determining, by the computing device, as a control target node for controlling the first output node by independently selecting at least some of the nodes belonging to each SPLO group for each of the plurality of SPLO groups;
includes,
The influence of any first node among the plurality of nodes on any output node is the degree of influence of the change in the activity of the first node on the change in activity (∂x _{i ) of the arbitrary output node.} ego,
The change in the activity of the first node is the change in the basal activity of the first node (∂b _j ),
The influence of the first node on the first output node (S _ij ) satisfies Equation 1 ([Equation 1] S _ij = ∂x _i / ∂b _j ),
Determination method of control target node.

A method for determining a control target node to be controlled for controlling an output node of a signal transmission network, the method comprising:
For each of the plurality of output nodes included in the first signaling network:
The computing device calculates the influence on the output node of each of the plurality of nodes included in the first signal delivery network, and calculates the plurality of influences corresponding to each of the plurality of nodes,
The computing device calculates an SPLO from each of the plurality of nodes to the output node, and calculates a plurality of SPLOs corresponding to each of the plurality of nodes,
The computing device classifies the plurality of nodes into a plurality of SPLO groups based on the value of the SPLO, and
so that the computing device independently selects at least some of the nodes belonging to each SPLO group for each of the plurality of SPLO groups and determines as a control target node for controlling the output node.
determining a set of control target candidates for each output node; and
a control target node determining step of determining, by the computing device, a final control target node for controlling the first output node based on the intersection of the control target nodes determined for each of the plurality of output nodes;
includes,
The influence of any first node among the plurality of nodes on any output node is the degree of influence of the change in the activity of the first node on the change in activity (∂x _{i ) of the arbitrary output node.} ego,
The change in the activity of the first node is the change in the basal activity of the first node (∂b _j ),
The influence of the first node on the first output node (S _ij ) satisfies Equation 1 ([Equation 1] S _ij = ∂x _i / ∂b _j ),
Determination method of control target node.

10. The method of claim 9, wherein the SPLO from any of the nodes to any of the output nodes connects the any node and the any output node to each other to reach from the any node to the any output node. A method of determining a control target node, indicating the number of links included in the shortest path having the smallest number of links included in each candidate path among one or more candidate paths that can be selected from one or more links.

The method of claim 1 , wherein the influence of the first output node of any first node of the plurality of nodes on the first output node is that of the first signaling network from the arbitrary first node to the first output node. A method of determining a control target node, which is calculated using an operation that sums the products of weights assigned to links included in each path in all paths.

12. The method of claim 11,
The weights assigned to the links included in each path are,
obtaining, by the computing device, structural information of the signaling network including information about a source node and a target node connected to each link, and information about a sign of a weight mapped to each link; and
a weight generation step of generating, by the computing device, a value of a weight mapped to each link by using the structure information;
containing,
Which is generated using a method for generating information about the signaling network,
Determination method of control target node.

13. The method of claim 12,
The weight generation step is
generating, by the computing device, an adjacency matrix using the structure information;
generating, by the computing device, a diagonal matrix from the adjacent matrix; and
calculating, by the computing device, a weight value mapped to each link using a weight matrix generated by performing a matrix multiplication operation on the adjacent matrix and the diagonal matrix
containing,
Determination method of control target node.

A method for determining a biomolecule that should be a target of a drug in order to control the activity of an output biomolecule, which is a specific biomolecule, in a biomolecule network composed of interactions between biomolecules, the method comprising:
The computing device calculates the influence on the output biomolecule of each of the plurality of biomolecules included in the biomolecule network, and calculates the influence of a plurality of influences corresponding to each of the plurality of biomolecules. Luance calculation step; and
The computing device selects some of the plurality of biomolecules according to a preset control target for the output biomolecule based on the values of the plurality of influences to control the output biomolecule. a drug target biomolecule determination step of determining as a drug target biomolecule for
includes,
The influence of any first biomolecule among the plurality of biomolecules on the output biomolecule is a degree to which a change in the activity of the first biomolecule affects a change in the activity of the output biomolecule;
The change in the activity of the first node is the change in the basal activity of the first node (∂b _j ),
The influence of the first node on the first output node (S _ij ) satisfies Equation 1 ([Equation 1] S _ij = ∂x _i / ∂b _j ),
A method for determining a drug target biomolecule.