JP2010113413A

JP2010113413A - Apparatus, method and program for analyzing gene control network

Info

Publication number: JP2010113413A
Application number: JP2008283456A
Authority: JP
Inventors: Go Asakawa; 剛浅川; Takeshi Hamada; 雄浜田
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2008-11-04
Filing date: 2008-11-04
Publication date: 2010-05-20

Abstract

<P>PROBLEM TO BE SOLVED: To provide an apparatus, network and program for analyzing a gene control network, which may improve reliability. <P>SOLUTION: The expression amount for each measurement time for a plurality of genes in a target cell is acquired. The type of expression suppressing mechanism for a gene in a complementary sequence is classified based on the behavioral pattern of the expression amount for each measurement time corresponding to the gene and on presence of overlapping of the same chromosome in a complementary sequence for each gene in complementary sequence in the target cell. <P>COPYRIGHT: (C)2010,JPO&INPIT

Description

本発明は遺伝子制御ネットワーク解析装置、遺伝子制御ネットワーク解析方法及び遺伝子制御ネットワーク解析プログラムに関し、遺伝子解析技術などの分野に属するものである。 The present invention relates to a gene regulatory network analysis apparatus, a gene regulatory network analysis method, and a gene regulatory network analysis program, and belongs to the field of gene analysis technology and the like.

従来、ＮＡＴｓ（Natural Antisense Transcripts）と呼ばれるコンセプトがある（例えば非特許文献１参照）。このＮＡＴｓは、一方の１本鎖ＤＮＡのコーディング領域と、他方の１本鎖ＤＮＡのコーディング領域との相補配列がオーバーラップする部分をもち、一方の１本鎖ＤＮＡのコーディング領域の発現が、他方の１本鎖ＤＮＡのコーディング領域の発現によって抑制されるというものである。非特許文献１では、ケースＡ〜Ｄとして定義されている。
Michal Lapidot & Yitzhak Pilpel(Weizmann Institute ofSience,Rehovot,Israel)、Genome-wide natural antisense transcription:coupling itsregulation to its different regulatory mechanisms、 EMBO reports、 EUROPEANMOLECULAR BIOLOGY ORGANIZATION、Submitted 28 March 2006;accepted 18 October 2006、p.1216-1222 Conventionally, there is a concept called NATs (Natural Antisense Transcripts) (see Non-Patent Document 1, for example). These NATs have a portion where the complementary sequence of the coding region of one single-stranded DNA and the coding region of the other single-stranded DNA overlap, and the expression of the coding region of one single-stranded DNA is It is suppressed by the expression of the coding region of the single-stranded DNA. In Non-Patent Document 1, it is defined as cases A to D.
Michal Lapidot & Yitzhak Pilpel (Weizmann Institute of Sience, Rehovot, Israel), Genome-wide natural antisense transcription: coupling itsregulation to its different regulatory mechanisms, EMBO reports, EUROPEANMOLECULAR BIOLOGY ORGANIZATION, Submitted 28 March 2006; accepted 18 October 2006, p.1216 -1222

ところで、上記非特許文献では、ケースＣは実証されておらず仮説として存在するものである。また、一方の１本鎖ＤＮＡのコーディング領域と、他方の１本鎖ＤＮＡのコーディング領域との相補配列がオーバーラップする部分の発現量が示してあるが、これは推測に過ぎない。 By the way, in the said nonpatent literature, case C is not proved but exists as a hypothesis. Moreover, although the expression level of the part which the complementary sequence of the coding region of one single stranded DNA and the coding region of the other single stranded DNA overlap is shown, this is only speculation.

相補配列部分における遺伝子発現量から、遺伝子の発現と抑制との制御に関するネットワークを構築することができれば、該ネットワークの信頼性が向上するものと考えられる。 If a network related to the control of gene expression and suppression can be constructed from the gene expression level in the complementary sequence portion, it is considered that the reliability of the network is improved.

本発明は以上の点を考慮してなされたもので、信頼性を向上し得る遺伝子制御ネットワーク解析装置、遺伝子制御ネットワーク解析方法及び遺伝子制御ネットワーク解析プログラムを提案しようとするものである。 The present invention has been made in consideration of the above points, and intends to propose a gene regulatory network analyzer, a gene regulatory network analysis method, and a gene regulatory network analysis program capable of improving reliability.

かかる課題を解決するため本発明は、遺伝子制御ネットワーク解析装置であって、標的細胞における複数の遺伝子に対する測定時間ごとの発現量を取得する取得部と、標的細胞において相補配列となる遺伝子ごとに、当該遺伝子に対応する測定時間ごとの発現量の挙動パターンと、相補配列における同じ染色体でのオーバーラップの有無とに応じて、相補配列となる遺伝子に対する発現抑制メカニズムの種を分類する分類部とを有する。 In order to solve such a problem, the present invention is a gene regulatory network analyzer, an acquisition unit for acquiring the expression level for each of a plurality of genes in a target cell, and for each gene that is a complementary sequence in the target cell, A classification unit that classifies the species of the expression suppression mechanism for the gene serving as the complementary sequence according to the behavior pattern of the expression level for each measurement time corresponding to the gene and the presence or absence of overlap in the same chromosome in the complementary sequence. Have.

また本発明は、遺伝子制御ネットワーク解析方法であって、標的細胞における複数の遺伝子に対する測定時間ごとの発現量を取得する取得ステップと、標的細胞において相補配列となる遺伝子ごとに、当該遺伝子に対応する測定時間ごとの発現量の挙動パターンと、相補配列における同じ染色体でのオーバーラップの有無とに応じて、相補配列となる遺伝子に対する発現抑制メカニズムの種を分類する分類ステップとを有する。 The present invention also relates to a gene regulatory network analysis method, wherein an acquisition step of acquiring an expression level for each of a plurality of genes in a target cell is measured, and each gene that is a complementary sequence in the target cell corresponds to the gene. A classification step of classifying the species of the expression suppression mechanism for the gene serving as the complementary sequence according to the behavior pattern of the expression level for each measurement time and the presence or absence of overlap in the same chromosome in the complementary sequence.

また本発明は、遺伝子制御ネットワーク解析プログラムであって、コンピュータに対して、標的細胞における複数の遺伝子に対する測定時間ごとの発現量を取得すること、標的細胞において相補配列となる遺伝子ごとに、当該遺伝子に対応する測定時間ごとの発現量の挙動パターンと、相補配列における同じ染色体でのオーバーラップの有無とに応じて、相補配列となる遺伝子に対する発現抑制メカニズムの種を分類することを実行させる。 The present invention also relates to a gene regulatory network analysis program, wherein a computer obtains the expression level for a plurality of genes in a target cell at each measurement time, and each gene that becomes a complementary sequence in the target cell Classifying the species of the expression suppression mechanism for the gene serving as the complementary sequence according to the behavior pattern of the expression level for each measurement time corresponding to and the presence or absence of overlap in the same chromosome in the complementary sequence.

本発明では、構造レベルで発現抑制関係をもつ可能性を有する遺伝子（相補配列となる遺伝子）について、その遺伝子における実際の発現量の挙動パターンと、同じ染色体でのオーバーラップの有無とから、実際の機序レベルで発現抑制メカニズムの種を分類することができる。 In the present invention, a gene having a possibility of having an expression-inhibiting relationship at the structural level (a gene serving as a complementary sequence) is actually determined from the behavior pattern of the actual expression level in the gene and the presence or absence of overlap in the same chromosome. The mechanism of expression suppression mechanism can be classified at the mechanism level.

このように本発明は、発現と抑制の相互関係を実際の機序レベルを経て自動的かつ網羅的に解析することができるため、遺伝子制御ネットワーク解析に関して信頼性を向上することができる。 As described above, the present invention can automatically and comprehensively analyze the interrelationship between expression and repression through the actual mechanism level, and thus can improve the reliability of gene regulatory network analysis.

なお、引用文献１に記載のＮＡＴｓは、特定の生物の特定部分だけ実証されたものであるが、この実証されたメカニズムが多岐にわたる各種生物にもあるのか、またある場合にはどの細胞のどの部分にあるのかということを確実に情報として得ることができるようになる点で、実際の機序レベルを経て自動的かつ網羅的に解析できるということは非常に有用となるであろう。 The NATs described in Cited Document 1 have been proved only in a specific part of a specific organism, but whether this proven mechanism exists in a wide variety of organisms, and in some cases, in which cell It will be very useful to be able to automatically and comprehensively analyze through the actual mechanism level in that it can be surely obtained as information whether it is in the part.

以下図面について本発明の一実施の形態を詳述する。 Hereinafter, an embodiment of the present invention will be described in detail with reference to the drawings.

（１）遺伝子制御ネットワーク解析システムの全体構成
図１において、本実施の形態による遺伝子制御ネットワーク解析システム１の全体構成を示す。この遺伝子制御ネットワーク解析システム１は、ゲノムサーバ２と、蛍光強度測定装置３と、遺伝子制御ネットワーク解析装置４とによって構成される。 (1) Overall Configuration of Gene Regulatory Network Analysis System FIG. 1 shows the overall configuration of a gene regulatory network analysis system 1 according to this embodiment. The gene regulatory network analysis system 1 includes a genome server 2, a fluorescence intensity measuring device 3, and a gene regulatory network analyzing device 4.

ゲノムサーバ２には、真核生物又は原核生物における各種生物のゲノムに関するデータベースが保持される。ゲノムサーバ２は、インターネット等のコンピュータネットワークを通じてアクセスされる装置の要求に応じたゲノム情報を、該装置に提供し得るようになされている。 The genome server 2 holds a database relating to genomes of various organisms in eukaryotes or prokaryotes. The genome server 2 can provide genome information corresponding to a request of a device accessed through a computer network such as the Internet to the device.

蛍光強度測定装置３は、測定ステージを有し、該測定ステージには核酸チップＣＰがセットされる。核酸チップＣＰは、標的生物における全遺伝子に対応する核酸プローブが配される基盤である。 The fluorescence intensity measuring device 3 has a measurement stage, and a nucleic acid chip CP is set on the measurement stage. The nucleic acid chip CP is a base on which nucleic acid probes corresponding to all genes in the target organism are arranged.

この核酸チップＣＰでは、例えば図２に示すように、核酸プローブに（長波線で示す部分）対して、標的生物の細胞から抽出され、標識物質（黒丸で示す部分）を付加された標的核酸（短波線で示す部分）が与えられ、相補鎖の形成反応（以下、これをハイブリダイゼーションとも呼ぶ）が行われる。 In this nucleic acid chip CP, for example, as shown in FIG. 2, a target nucleic acid (part indicated by a black circle) extracted from a cell of a target organism and added with a labeling substance (part indicated by a black circle) is added to a nucleic acid probe (part indicated by a long wavy line). (Part indicated by a short wave line) is given, and a complementary strand formation reaction (hereinafter also referred to as hybridization) is performed.

核酸プローブは、特定の遺伝子を検出するための探り針（検出子）として機能する１本鎖のヌクレオチドである。一般に、核酸プローブは、特定の遺伝子全体の配列に相補対となるヌクレオチドではなく、当該遺伝子において特異的とされる部分配列を含む複数のプローブ断片（以下、これをプローブセットとも呼ぶ）としてデザインされる。具体的には、１８〜６０[mer]程度のＤＮＡ(deoxyribonucleic acid) 断片、ｃＤＮＡ(complementary DNA) 断片又はＰＮＡ(peptide nucleic acid)などにデザインされる。 A nucleic acid probe is a single-stranded nucleotide that functions as a probe (detector) for detecting a specific gene. In general, a nucleic acid probe is designed as a plurality of probe fragments (hereinafter also referred to as a probe set) including partial sequences that are specific to the gene rather than nucleotides that are complementary to the entire sequence of a specific gene. The Specifically, it is designed as a DNA (deoxyribonucleic acid) fragment, cDNA (complementary DNA) fragment or PNA (peptide nucleic acid) of about 18 to 60 [mer].

一方、標的核酸は、核酸プローブとのハイブリダイゼーション対象とされる１本鎖のヌクレオチドである。一般に、標的核酸は、ｍＲＮＡ（pre-ｍＲＮＡを含む）又はその断片そのものが用いられるのではなく、該ｍＲＮＡ又はその断片を逆転写酵素により変換したものが用いられる。 On the other hand, the target nucleic acid is a single-stranded nucleotide to be hybridized with a nucleic acid probe. In general, mRNA (including pre-mRNA) or a fragment thereof is not used as the target nucleic acid, but a mRNA obtained by converting the mRNA or a fragment thereof with a reverse transcriptase is used.

他方、標識物質は、一般に、ビオチンまたはＦＩＴＣ(fluorescein isothiocyanate)等の蛍光色素とされるが、これに限定されるものではなく、例えば放射性同位元素等としてもよい。 On the other hand, the labeling substance is generally a fluorescent dye such as biotin or FITC (fluorescein isothiocyanate), but is not limited thereto, and may be a radioisotope, for example.

蛍光強度測定装置３（図１）は、測定指令が与えられた場合、測定ステージにセットされる核酸チップＣＰに対して、標的核酸に付加される標識物質の励起光を照射する。図２に示したように、核酸チップＣＰの核酸プローブが標的核酸と相補鎖を形成している場合、該標的核酸に付加された標識物質が励起光により発光する。この発光量は、標的核酸と核酸プローブとの相補鎖の形成量と相関があり、該核酸プローブと相補鎖が形成される標的核酸の量が多いほど発光量が強くなる。 When a measurement command is given, the fluorescence intensity measurement device 3 (FIG. 1) irradiates the nucleic acid chip CP set on the measurement stage with excitation light of a labeling substance added to the target nucleic acid. As shown in FIG. 2, when the nucleic acid probe of the nucleic acid chip CP forms a complementary strand with the target nucleic acid, the labeling substance added to the target nucleic acid emits light by excitation light. The amount of luminescence correlates with the amount of complementary strand formed between the target nucleic acid and the nucleic acid probe, and the amount of luminescence increases as the amount of target nucleic acid that forms a complementary strand with the nucleic acid probe increases.

蛍光強度測定装置３は、励起光を照射した核酸チップＣＰから、該核酸チップに配される全遺伝子に対応する核酸プローブでの発光量をセンサＣＥにより読み取る。そして蛍光強度測定装置３は、例えば図３に示すように、読み取った各遺伝子の発光量ＥＭ_ｉ（ｉ＝１、２、……、ｍ）に遺伝子固有の識別子ｇ_ｉを割り当て、これら組み合わせに読取時刻やチップ番号等の付加情報ＴＭを付加したデータ（以下、これを測定データと呼ぶ）を出力するようになされている。 The fluorescence intensity measuring device 3 reads, from the nucleic acid chip CP irradiated with the excitation light, the amount of light emitted from the nucleic acid probes corresponding to all the genes arranged on the nucleic acid chip by the sensor CE. The fluorescence intensity measuring device 3, for example, as shown in FIG. 3, the light emission quantity of each gene was read _{EM i (i = 1,2, ......} , m) allocates a gene unique identifier g _i in, the combination Data to which additional information TM such as reading time and chip number is added (hereinafter referred to as measurement data) is output.

遺伝子制御ネットワーク解析装置４は、ゲノムサーバ２からゲノム情報を取得し、蛍光強度測定装置３から相補鎖形成量データを取得する。そして遺伝子制御ネットワーク解析装置４は、これらゲノム情報と、相補鎖形成量データとを用いて、標的細胞において相補配列となる遺伝子に対する発現抑制メカニズムの種を分類するようになされている。 The gene control network analysis device 4 acquires genome information from the genome server 2 and acquires complementary strand formation amount data from the fluorescence intensity measurement device 3. Then, the gene regulatory network analyzer 4 classifies the species of the expression suppression mechanism for the gene that becomes a complementary sequence in the target cell using these genome information and complementary strand formation amount data.

（２）遺伝子制御ネットワーク解析装置の構成
次に、遺伝子制御ネットワーク解析装置４の構成について説明する。この遺伝子制御ネットワーク解析装置４は、図４に示すように、該遺伝子制御ネットワーク解析装置４全体の制御を司るＣＰＵ(Central Processing Unit)１０に対して各種ハードウェアを接続することにより構成される。 (2) Configuration of Gene Regulatory Network Analyzer Next, the configuration of the gene regulatory network analyzer 4 will be described. As shown in FIG. 4, the gene regulatory network analyzer 4 is configured by connecting various hardware to a CPU (Central Processing Unit) 10 that controls the entire gene regulatory network analyzer 4.

具体的にはＲＯＭ(Read Only Memory)１１、ＣＰＵ１０のワークメモリとなるＲＡＭ(Random Access Memory)１２、操作部１３、記憶部１４、インターフェース１５及び表示部１６がバス１７を介して接続される。 Specifically, a ROM (Read Only Memory) 11, a RAM (Random Access Memory) 12 serving as a work memory of the CPU 10, an operation unit 13, a storage unit 14, an interface 15, and a display unit 16 are connected via a bus 17.

ＲＯＭ１１には、遺伝子制御ネットワーク解析処理を実行するためのプログラム（以下、これを遺伝子制御ネットワーク解析プログラムとも呼ぶ）が格納され、またインターフェイス１５は、ゲノムサーバ２及び蛍光強度測定装置３に対して個々に設けられる。 The ROM 11 stores a program for executing gene control network analysis processing (hereinafter also referred to as gene control network analysis program), and the interface 15 is individually connected to the genome server 2 and the fluorescence intensity measuring device 3. Is provided.

ＣＰＵ１０は、ＲＯＭ１１に格納された遺伝子制御ネットワーク解析プログラムをＲＡＭ１２に展開した場合、該遺伝子制御ネットワーク解析プログラムに基づいて記憶部１４、インターフェース１５及び表示部１６を適宜制御し、遺伝子制御ネットワーク解析処理を実行するようになされている。 When the gene control network analysis program stored in the ROM 11 is expanded in the RAM 12, the CPU 10 appropriately controls the storage unit 14, the interface 15 and the display unit 16 based on the gene control network analysis program, and performs gene control network analysis processing. It is made to run.

（３）遺伝子制御ネットワーク解析プログラムに基づくＣＰＵの処理内容
遺伝子制御ネットワーク解析プログラムをＲＡＭ１２に展開したＣＰＵ１０は、機能的には、図４に示したように、遺伝子発現量取得部２１、ゲノム情報取得部２２、発現量読出部２３、バイナリ化部２４及び分類部２５の各処理部に分類することができる。 (3) Processing contents of the CPU based on the gene regulatory network analysis program The CPU 10, which has expanded the gene regulatory network analysis program in the RAM 12, functionally obtains the gene expression level acquisition unit 21, genome information acquisition as shown in FIG. The processing unit can be classified into a unit 22, an expression level reading unit 23, a binarization unit 24, and a classification unit 25.

遺伝子発現量取得部２１は、例えば図５に示すように、継代した標的細胞から所定期間ごとに抽出される標的核酸と、核酸チップＣＰ_ｊ（ｊ＝１、２、……、ｎ）における核酸プローブとの相補鎖の形成量に基づいて、標的生物における細胞で発現される遺伝子ごとに測定時間ｔ_１〜ｔ_ｎ当たりの遺伝子発現量ＧＥ_１〜ＧＥ_ｍを取得する。 For example, as shown in FIG. 5, the gene expression level acquisition unit 21 uses a target nucleic acid extracted from a passaged target cell every predetermined period and a nucleic acid chip CP _j (j = 1, 2,..., N). Based on the amount of complementary strand formed with the nucleic acid probe, gene expression levels GE _{1 to} GE _m per measurement time t _{1 to} t _n are obtained for each gene expressed in cells in the target organism.

ここで、この遺伝子発現量取得部２１における処理の具体例を１つ挙げる。 Here, one specific example of the processing in the gene expression level acquisition unit 21 is given.

遺伝子発現量取得部２１は、ターゲットとされる細胞の生物種を、操作部１３を用いて入力させ、該入力された生物種に応じた核酸チップの会社名及びその製品番号を内部又はインターネット上のデータベースから検索する。 The gene expression level acquisition unit 21 inputs the target cell species using the operation unit 13, and inputs the company name and product number of the nucleic acid chip corresponding to the input species on the inside or on the Internet. Search from the database.

そして遺伝子発現量取得部２１は、検索した核酸チップの会社名及びその製品番号を表示部１６に提示するとともに、標的細胞におけるｍＲＮＡをある時間間隔で測定すべきこと及びその測定条件を提示する。 The gene expression level acquisition unit 21 presents the company name and product number of the searched nucleic acid chip on the display unit 16, and presents that mRNA in the target cell should be measured at certain time intervals and the measurement conditions.

具体的には、継代した標的細胞から標的核酸を抽出すべき時間間隔や、各時間帯で抽出される標的核酸を核酸チップの核酸プローブとハイブリダイゼーションする際に守るべき温度や湿度等の各種環境条件の範囲などの測定条件を提示する。このように遺伝子発現量取得部２１は、測定上の条件を具体的に提示することで測定上のばらつきを抑えさせ、測定精度を高め得るようになされている。 Specifically, the time interval at which the target nucleic acid should be extracted from the passaged target cells, and the temperature and humidity to be protected when the target nucleic acid extracted at each time zone is hybridized with the nucleic acid probe of the nucleic acid chip Present measurement conditions such as a range of environmental conditions. As described above, the gene expression level acquisition unit 21 can suppress measurement variations by specifically presenting measurement conditions, and can increase measurement accuracy.

また遺伝子発現量取得部２１は、核酸チップの会社名及びその製品番号等を提示した時点から蛍光強度測定装置３から出力される測定データの待ち受けを開始する。そして遺伝子発現量取得部２１は、測定データを受けるごとに、該測定データに示される読取時刻ＴＭ（図３）と、その１つ前に読み取られた測定データに示される読取時刻ＴＭとの差を算出する。 In addition, the gene expression level acquisition unit 21 starts waiting for measurement data output from the fluorescence intensity measuring device 3 from the time when the company name of the nucleic acid chip and its product number are presented. The gene expression level acquisition unit 21 receives the difference between the reading time TM (FIG. 3) indicated in the measurement data and the reading time TM indicated in the measurement data read immediately before the measurement data. Is calculated.

この算出結果が、測定条件とされる時間間隔を基準として設定される上限閾値と下限閾値との許容範囲外となる場合、遺伝子発現量取得部２１は、最初から測定しなおすべき旨を表示部１６を用いて視覚的に通知する。これにより遺伝子発現量取得部２１は、標的細胞における標的核酸量の推移を正確に取得し得るようになされている。なお、測定条件とされる時間間隔が狭いほど、標的生物の細胞で発現される遺伝子発現量の推移が詳細になることは理解できるであろう。 When this calculation result is outside the allowable range between the upper threshold and the lower threshold set based on the time interval set as the measurement condition, the gene expression level acquisition unit 21 displays that the measurement should be performed again from the beginning. 16 is used for visual notification. Thereby, the gene expression level acquisition unit 21 can accurately acquire the transition of the target nucleic acid level in the target cell. It will be understood that the narrower the time interval, which is the measurement condition, is, the more detailed the transition of the gene expression level expressed in the cells of the target organism.

一方、算出結果が許容範囲内となる場合、遺伝子発現量取得部２１は、測定データに含まれる各遺伝子の発光量を遺伝子発現量に変換した後、該測定データを記憶部１４に記憶させる。 On the other hand, when the calculation result falls within the allowable range, the gene expression level acquisition unit 21 stores the measurement data in the storage unit 14 after converting the luminescence level of each gene included in the measurement data into the gene expression level.

遺伝子発現量は、標的核酸と核酸プローブとの相補鎖の形成量に相関する発光量から、統計学的手法によりバックグラウンド（局所的な物理的影響）等を排除したものであり、標的細胞内において発現している遺伝子を示す量として信頼性が高いものとなる。 The gene expression level is the amount of luminescence correlated with the amount of complementary strand formed between the target nucleic acid and the nucleic acid probe, and the background (local physical effect), etc., is excluded by a statistical method. It is highly reliable as an amount indicating the gene expressed in.

例えば、Affymetrix社のＭＡＳ(Micro Array Suite)と呼ばれるデータ解析ソフトウェアを用いた場合、発光量の割合として遺伝子発現量が算出される。 For example, when data analysis software called MAS (Micro Array Suite) manufactured by Affymetrix is used, the gene expression level is calculated as a ratio of the luminescence level.

ちなみに、このＭＡＳのバージョン５を、ある１つの遺伝子に対応する核酸プローブに着目して簡単に説明する。ＭＡＳ５では、（１）プローブセット（当該遺伝子において特異的とされる塩基配列を含む複数のプローブ断片）での発光量から、局所的な物理的影響（バックグランド）が排除される。（２）各プローブ断片（パーフェクトマッチプローブと呼ばれる）の発光量が、当該プローブ断片と対応する断片コントロール（ミスマッチプローブと呼ばれる）との差に応じて適宜補正される。（３）各プローブ断片の発光量が対数変換により遺伝子発現量として算出される。 Incidentally, this MAS version 5 will be briefly described by focusing on a nucleic acid probe corresponding to a certain gene. In MAS5, (1) the local physical influence (background) is excluded from the amount of light emitted from the probe set (a plurality of probe fragments including a base sequence specific to the gene). (2) The light emission amount of each probe fragment (referred to as a perfect match probe) is appropriately corrected according to the difference between the probe fragment and the corresponding fragment control (referred to as a mismatch probe). (3) The amount of luminescence of each probe fragment is calculated as a gene expression level by logarithmic conversion.

Ｉ．Ｓ．Ｋｏｈａｎｅ／Ａ．Ｔ．Ｋｈｏ／Ａ．Ｊ．Ｂｕｔｔｅ星田有人著、統合ゲノミクスのためのマイクロアレイデータアナリシス、シュプリンガー・ジャパン出版、ｐ．５８−７４を参照されたい。 I. S. Kohane / A. T.A. Kho / A. J. et al. Butte Yoshi Hoshida, Microarray Data Analysis for Integrated Genomics, Springer Japan Publishing, p. 58-74.

以上のように、遺伝子発現量取得部２１における処理の具体例を挙げたが、該処理は、この例に限定されるものではない。 As mentioned above, although the specific example of the process in the gene expression level acquisition part 21 was given, this process is not limited to this example.

ゲノム情報取得部２２は、所定の更新期間ごとに、記憶部１４に記憶されるｃｉｓ−ＮＡＴｓデータベース及びｔｒａｎｓ−ＮＡＴｓデータベースを更新する。 The genome information acquisition unit 22 updates the cis-NATS database and the trans-NATS database stored in the storage unit 14 every predetermined update period.

このｃｉｓ−ＮＡＴｓデータベースは、一方の１本鎖ＤＮＡのコーディング領域と、他方の１本鎖ＤＮＡのコーディング領域とが相補配列となり、かつその配列の一部がオーバーラップする遺伝子を、生物及び細胞の種別単位で対応付けしたものである。また、ｔｒａｎｓ−ＮＡＴｓデータベースは、ある染色体上のコーディング領域と、別の染色体上のコーディング領域とで相補配列となる遺伝子を、生物及び細胞の種別単位で対応付けしたものである。 This cis-NATS database contains genes in which the coding region of one single-stranded DNA and the coding region of the other single-stranded DNA are complementary sequences, and a part of the sequence overlaps. These are associated with each type. The trans-NATS database is a database in which genes that are complementary sequences in a coding region on one chromosome and a coding region on another chromosome are associated with each other in units of organisms and cells.

ここで、このゲノム情報取得部２２における処理の具体例を１つ挙げる。 Here, one specific example of processing in the genome information acquisition unit 22 will be given.

ゲノム情報取得部２２は、更新期間を経過した場合、インターフェース１５を用いてゲノムサーバ２へ自動的にアクセスし、該ゲノムサーバ２と相互認証する。そしてゲノム情報取得部２２は、ゲノムサーバ２との相互認証が成立した場合、該ゲノムサーバ２に対してｃｉｓ−ＮＡＴｓデータベース又はｔｒａｎｓ−ＮＡＴｓデータベースが更新されたか否かを確認する。 When the update period has elapsed, the genome information acquisition unit 22 automatically accesses the genome server 2 using the interface 15 and performs mutual authentication with the genome server 2. When the mutual authentication with the genome server 2 is established, the genome information acquisition unit 22 confirms whether or not the cis-NATS database or the trans-NATS database is updated with respect to the genome server 2.

ここでゲノム情報取得部２２は、更新されたことを確認した場合、記憶部１４に記憶されるｃｉｓ−ＮＡＴｓデータベースと、ｔｒａｎｓ−ＮＡＴｓデータベースとの一方又は双方を更新する。 Here, when it is confirmed that the genome information acquisition unit 22 has been updated, the genome information acquisition unit 22 updates one or both of the cis-NATS database and the trans-NATS database stored in the storage unit 14.

ちなみに、このときゲノム情報取得部２２は、操作部１３により指定されたゲノム情報があった場合、該ゲノム情報を記憶部１４に記憶する。ゲノム情報としては、例えば標的生物において発現され得る全ての成熟ｍＲＮＡ及びそれらの前駆ｍＲＮＡ（ｐｒｅ−ｍＲＮＡ）の塩基配列や、イントロン及びエキソン等がある。 Incidentally, at this time, when there is genome information designated by the operation unit 13, the genome information acquisition unit 22 stores the genome information in the storage unit 14. Examples of genomic information include the base sequences of all mature mRNAs and their precursor mRNAs (pre-mRNA) that can be expressed in the target organism, introns and exons.

以上のように、ゲノム情報取得部２２における処理の具体例を挙げたが、該処理は、この例に限定されるものではない。 As mentioned above, although the specific example of the process in the genome information acquisition part 22 was given, this process is not limited to this example.

発現量読出部２３は、記憶部１４に記憶されるｃｉｓ−ＮＡＴｓデータベース及びｔｒａｎｓ−ＮＡＴｓデータベースにおける相補配列となる遺伝子のすべてを認識する。ちなみに、ｃｉｓ−ＮＡＴｓデータベースにおける遺伝子は、上述したように、一方の１本鎖ＤＮＡのコーディング領域と、他方の１本鎖ＤＮＡのコーディング領域とがオーバーラップする状態で相補配列となるものである。また、ｔｒａｎｓ−ＮＡＴｓデータベースにおける遺伝子は、ある染色体上のコーディング領域と、別の染色体上のコーディング領域とで相補配列となるものである。 The expression level reading unit 23 recognizes all the genes that are complementary sequences in the cis-NATS database and the trans-NATS database stored in the storage unit 14. Incidentally, the gene in the cis-NATS database is a complementary sequence in a state where the coding region of one single-stranded DNA overlaps with the coding region of the other single-stranded DNA, as described above. The gene in the trans-NATS database is a complementary sequence between a coding region on one chromosome and a coding region on another chromosome.

そして発現量読出部２３は、各相補配列の遺伝子に対応する測定時間ｔ_１〜ｔ_ｎごとの遺伝子発現量ＧＥ（図５）を示すデータを記憶部１４から読み出し、当該データをバイナリ化部２４に与える。 Then, the expression level reading unit 23 reads data indicating the gene expression level GE (FIG. 5) for each measurement time t _{1 to} t _n corresponding to the gene of each complementary sequence from the storage unit 14, and the data is converted into a binarization unit 24. To give.

なお、発現量読出部２３は、各相補配列に対応する測定時間ｔ_１〜ｔ_ｎごとの遺伝子発現量ＧＥ（図５）を示すデータが記憶部１４に記憶されていない場合、その旨を、表示部１６を用いて視覚的に通知するようになされている。 It should be noted that the expression level reading section 23, when data indicating a gene expression amount GE of each measurement time t ₁ ~t _n for each complementary sequence (Fig. 5) is not stored in the storage unit 14, to that effect, The display unit 16 is used for visual notification.

バイナリ化部２４は、発現量読出部２３から、各相補配列に対応する測定時間ｔ_１〜ｔ_ｎごとの遺伝子発現量ＧＥ_１〜ＧＥ_ｍ（図５）を示すデータが与えられた場合、当該データを用いて、相補配列の各配列における遺伝子発現量の挙動をバイナリ化する。 The binarization unit 24 receives the data indicating the gene expression levels GE _{1 to} GE _m (FIG. 5) for each measurement time t _{1 to} t _n corresponding to each complementary sequence from the expression level reading unit 23. Using the data, the behavior of the gene expression level in each sequence of the complementary sequence is binarized.

すなわち、バイナリ化部２４は、遺伝子発現量ＧＥごとに、観測点ｔ_ｎの時間経過方向における正又は負の差をとってバイナリ列（以下、これを発現量バイナリ列とも呼ぶ）を生成する。各観測点ｔ_ｎにおける遺伝子発現量ＧＥが図５の場合、発現量バイナリ列は、図６に示すようになる。 That is, the binarizing unit 24 generates a binary string (hereinafter also referred to as an expression level binary string) by taking a positive or negative difference in the time lapse direction of the observation point t _n for each gene expression level GE. If gene expression levels GE at each observation point t _n is 5, the expression level binary string is as shown in FIG.

ちなみに、図６における各遺伝子発現量ＧＥの値は便宜的に示したものであり、実際の数値ではない。なお、遺伝子発現量のバイナリ化に関しては、既に本発明者らによって提案されており、詳細は２００８−２１３１１２を参照されたい。 Incidentally, the value of each gene expression level GE in FIG. 6 is shown for convenience and is not an actual numerical value. Note that the present inventors have already proposed binarization of gene expression levels, and refer to 2008-213112 for details.

ここで、図７又は図８に示すように、ある配列の遺伝子発現量ＧＥＸと、該配列に相補となる配列の遺伝子発現量ＧＥＹとにおける挙動が同一または略同一にある場合、図９に示すように、発現量バイナリ列における正負のパターンは同一となる。 Here, as shown in FIG. 7 or FIG. 8, when the behavior of the gene expression level GEX of a certain sequence and the gene expression level GEY of a sequence complementary to the sequence are the same or substantially the same, it is shown in FIG. In this way, the positive and negative patterns in the expression level binary string are the same.

一方、図１０又は図１１に示すように、ある配列の遺伝子発現量ＧＥＸと、該配列に相補となる配列の遺伝子発現量ＧＥＹとにおける挙動が一定の逆相関関係またはおおよその逆相関関係にある場合、図１２に示すように、発現量バイナリ列における正負のパターンは反対となる。 On the other hand, as shown in FIG. 10 or FIG. 11, the behavior of the gene expression level GEX of a certain sequence and the gene expression level GEY of a sequence complementary to the sequence has a constant inverse correlation or an approximate inverse correlation. In this case, as shown in FIG. 12, the positive and negative patterns in the expression amount binary string are opposite.

分類部２５は、標的細胞において相補配列となる遺伝子に対応する発現量バイナリ列と、該相補配列における同じ染色体でのオーバーラップの有無とに応じて、相補配列となる遺伝子に対する発現抑制メカニズムの種を分類する。 The classification unit 25 is a seed of an expression suppression mechanism for a gene serving as a complementary sequence according to an expression amount binary string corresponding to the gene serving as a complementary sequence in the target cell and the presence or absence of overlap in the same chromosome in the complementary sequence. Classify.

すなわち分類部２５は、相補配列の各配列に対応する発現量バイナリ列の正負パターンが同一又は反対となる場合、当該相補配列の遺伝子間には、発現と抑制の作用相関があるものとして検出する。 That is, when the positive / negative pattern of the expression amount binary string corresponding to each sequence of the complementary sequence is the same or opposite, the classification unit 25 detects that there is an action correlation between expression and suppression between the genes of the complementary sequence. .

そして分類部２５は、作用相関があるとした相補配列の遺伝子が同じ染色体上の一部でオーバーラップするものであるか否かを、ｃｉｓＮＡＴｓデータベース及びｔｒａｎｓＮＡＴｓデータベースに基づいて調べる。 Then, the classification unit 25 checks whether or not the genes of complementary sequences that have an action correlation overlap with each other on a part of the same chromosome based on the cisNATS database and the transNATS database.

ここで、発現と抑制の作用相関にある相補配列が「同じ染色体上でオーバーラップ」し、該相補配列における発現量バイナリ列が「反対」の正負パターンである場合、このことは、上記非特許文献におけるケースＡまたはケースＤのｃｉｓＮＡＴｓに相当することを意味する。この場合、分類部２５は、相補配列がケースＡまたはケースＤのメカニズムの関係にあることを示す情報を生成し、これを、遺伝子制御ネットワーク解析用のデータベースに登録する。 Here, when complementary sequences that are in an action correlation between expression and suppression are “overlapping on the same chromosome”, and the expression amount binary string in the complementary sequence is an “opposite” positive / negative pattern, this is It means that it corresponds to cisNATS of case A or case D in the literature. In this case, the classification unit 25 generates information indicating that the complementary sequence has a mechanism relationship of case A or case D, and registers this information in the database for gene regulatory network analysis.

また、発現と抑制の作用相関にある相補配列が「同じ染色体上でオーバーラップ」し、該相補配列における発現量バイナリ列が「同一」の正負パターンである場合、このことは、上記非特許文献におけるケースＣのｃｉｓＮＡＴｓに相当することを意味する。この場合、分類部２５は、相補配列がケースＣのメカニズムの関係にあることを示す情報を生成し、これを、遺伝子制御ネットワーク解析用のデータベースに登録する。 In addition, when complementary sequences that are in an action relationship between expression and suppression are “overlapping on the same chromosome” and the expression amount binary string in the complementary sequence is “identical”, this is the above-mentioned non-patent document. It corresponds to cisNATS of case C in FIG. In this case, the classification unit 25 generates information indicating that the complementary sequence is in the relationship of the mechanism of case C, and registers this information in the database for gene regulatory network analysis.

さらに、発現と抑制の作用相関にある相補配列が「オーバーラップ」しない場合、このことは、該相補配列における発現量バイナリ列が「同一」又は「反対」の正負パターンのいずれであっても、ｔｒａｎｓＮＡＴｓに相当することを意味する。この場合、分類部２５は、相補配列がｔｒａｎｓＮＡＴｓのメカニズムの関係にあることを示す情報を生成し、これを、遺伝子制御ネットワーク解析用のデータベースに登録する。 Furthermore, when the complementary sequence in the action correlation between the expression and the suppression does not “overlap”, this means that even if the expression binary string in the complementary sequence is “same” or “opposite” Means equivalent to transNATs. In this case, the classification unit 25 generates information indicating that the complementary sequence has a transNATS mechanism relationship, and registers this in the database for gene regulatory network analysis.

具体的にこの遺伝子制御ネットワーク解析用のデータベースには、例えば、標的生物の名称、相補となる各配列に関する情報（当該配列をもつ染色体番号や、当該配列部分における遺伝子名や、当該配列における発現量バイナリ列等）、遺伝子制御メカニズムの種（ケースＡ、ケースＣ、ｔｒａｎｓＮＡＴｓ）に割り当てられるコード（識別子）などとして登録される。ちなみに、名称には通称又は略称が含まれる。 Specifically, this database for gene control network analysis includes, for example, the name of the target organism, information about each complementary sequence (chromosome number having the sequence, gene name in the sequence portion, expression level in the sequence A binary string or the like), a code (identifier) assigned to a gene control mechanism seed (case A, case C, transNATs), or the like. Incidentally, common names or abbreviations are included in the names.

このようにして分類部２５は、相補配列となる遺伝子に対する発現抑制メカニズムの種を、当該相補配列における「同一染色体上でのオーバーラップの有無」と、「遺伝子発現量の簡易的な挙動パターン」とに応じて分類することができる。 In this way, the classification unit 25 determines the seeds of the expression suppression mechanism for the gene serving as the complementary sequence, “presence / absence of overlap on the same chromosome” and “simple behavior pattern of gene expression level” in the complementary sequence. And can be classified according to.

したがって、この遺伝子制御ネットワーク解析装置４では、実証済みの遺伝子制御メカニズム（ケースＡまたはケースＤのｃｉｓＮＡＴｓ）と、仮説段階の遺伝子制御メカニズム（ケースＣのｃｉｓＮＡＴｓ又はｔｒａｎｓＮＡＴｓ）とのうち、該仮説段階の遺伝子制御メカニズムを、遺伝子制御ネットワーク解析用のデータベースから抹消する等といったように、その後のデータベースの編集を簡易化できる。 Therefore, in this gene regulatory network analysis apparatus 4, among the proven gene regulatory mechanisms (case NAT or cis NATs in case D) and hypothetical stage gene regulatory mechanisms (case C cisNATS or transNATs), Subsequent editing of the database can be simplified, such as deleting the gene regulatory mechanism from the database for gene regulatory network analysis.

また所定の表示ソフトを用いて、例えば図１３に示すように、データベースに登録された遺伝子の相互作用関係を表示部１６に表示する場合、当該発現と抑制の作用相関が実証段階にあるものであるか、仮説段階にあるものであるかを、色分け等により直感的に区別することが可能となる。 Further, when the interaction relationship between genes registered in the database is displayed on the display unit 16 using predetermined display software, for example, as shown in FIG. 13, the action correlation between the expression and the suppression is in the verification stage. It is possible to intuitively distinguish whether it is in the hypothesis stage by color coding or the like.

（４）遺伝子制御ネットワーク解析処理手順
次に、遺伝子制御ネットワーク解析処理手順を、図１４に示すフローチャートを用いて説明する。 (4) Gene Regulatory Network Analysis Processing Procedure Next, the gene regulatory network analysis processing procedure will be described using the flowchart shown in FIG.

ＣＰＵ１０は、例えば電源投入時をトリガーとしてこの遺伝子制御ネットワーク解析処理を開始し、ステップＳＰ１に進んで、遺伝子制御ネットワークの解析要求を待ち受ける。ここで、遺伝子制御ネットワークの解析要求を受けた場合、ＣＰＵ１０は、ステップＳＰ２に進んで、標的細胞における観測点ごとの遺伝子発現量が既に取得されているか否かを確認する。 The CPU 10 starts this gene regulatory network analysis process, for example, when the power is turned on, and proceeds to step SP1 to wait for a gene regulatory network analysis request. Here, when the analysis request of the gene regulatory network is received, the CPU 10 proceeds to step SP2 and confirms whether or not the gene expression level for each observation point in the target cell has already been acquired.

ここで、当該遺伝子発現量が取得されていない場合、ＣＰＵ１０は、ステップＳＰ３に進んで、遺伝子発現量取得部２１として機能し、蛍光強度測定装置３から、標的生物における各観測点ｔ_１〜ｔ_ｎの遺伝子発現量ＧＥ_１〜ＧＥ_ｍを取得した後（図５）、次のステップＳＰ４に進む。これに対して当該遺伝子発現量が取得されている場合、ＣＰＵ１０は、ステップＳＰ３を経ずにステップＳＰ４に進む。 Here, when the gene expression level is not acquired, the CPU 10 proceeds to step SP3 to function as the gene expression level acquisition unit 21, and from the fluorescence intensity measurement device 3, each observation point t _{1 to} t in the target organism. After acquiring _n gene expression levels GE _{1 to} GE _m (FIG. 5), the process proceeds to the next step SP4. On the other hand, when the gene expression level is acquired, the CPU 10 proceeds to step SP4 without passing through step SP3.

ＣＰＵ１０は、ステップＳＰ４において、ｃｉｓ−ＮＡＴｓデータベース及びｔｒａｎｓ−ＮＡＴｓデータベースが既に取得されているか否かを確認する。 In step SP4, the CPU 10 checks whether or not the cis-NATS database and the trans-NATS database have already been acquired.

ここで、ゲノム情報が取得されていない場合、ＣＰＵ１０は、ステップＳＰ５に進んで、ゲノム情報取得部２２として機能し、ゲノムサーバ２からｃｉｓ−ＮＡＴｓデータベース及びｔｒａｎｓ−ＮＡＴｓデータベースを取得した後、次のステップＳＰ６に進む。 If the genome information is not acquired, the CPU 10 proceeds to step SP5, functions as the genome information acquisition unit 22, acquires the cis-NATS database and the trans-NATS database from the genome server 2, and then Proceed to step SP6.

これに対してｃｉｓ−ＮＡＴｓデータベース及びｔｒａｎｓ−ＮＡＴｓデータベースが取得されている場合、ＣＰＵ１０は、ステップＳＰ５を経ずにステップＳＰ６に進む。 On the other hand, when the cis-NATS database and the trans-NATS database have been acquired, the CPU 10 proceeds to step SP6 without passing through step SP5.

ＣＰＵ１０は、ステップＳＰ６において、発現量読出部２３として機能し、記憶部１４に記憶されるｃｉｓ−ＮＡＴｓデータベース及びｔｒａｎｓ−ＮＡＴｓデータベースを参照し、相補配列となる遺伝子同士を探索する。 In step SP6, the CPU 10 functions as the expression level reading unit 23 and searches for genes that are complementary sequences by referring to the cis-NATS database and the trans-NATS database stored in the storage unit 14.

ここで、１以上の相補配列となる遺伝子が検出された場合、このことは、標的細胞においてＮＡＴｓ機能が存在する可能性があることを意味する。この場合、ＣＰＵ１０は、ステップＳＰ７に進んで、相補配列の遺伝子ごとに、当該遺伝子発現量の挙動（発現量バイナリ列（図６））を生成する。 Here, when a gene serving as one or more complementary sequences is detected, this means that NATs function may exist in the target cell. In this case, the CPU 10 proceeds to step SP7 and generates the behavior of the gene expression level (expression level binary string (FIG. 6)) for each gene of the complementary sequence.

そしてＣＰＵ１０は、ステップＳＰ８に進んで、発現量バイナリ列のパターンと、該作用相関もつ相補配列における同じ染色体でのオーバーラップの有無とに応じて、相補配列に対する発現と抑制のメカニズムの種を分類（遺伝子制御ネットワーク解析用のデータベースに登録）した後、この遺伝子制御解析処理を終了する。 Then, the CPU 10 proceeds to step SP8 to classify the species of the mechanism of expression and suppression for the complementary sequence according to the pattern of the expression level binary string and the presence or absence of overlap in the same chromosome in the complementary sequence having the action correlation. After (registered in the database for gene regulatory network analysis), this gene regulatory analysis process is terminated.

一方、ステップＳＰ６において１以上の相補配列となる遺伝子が検出されない場合、ＣＰＵ１０は、ステップＳＰ９に進んで、標的生物では発現と抑制の相互関係をもつ遺伝子がない（ＮＡＴｓ機能をもたない）旨を視覚的に通知した後、この遺伝子制御ネットワーク解析処理を終了する。 On the other hand, if a gene that becomes one or more complementary sequences is not detected in step SP6, the CPU 10 proceeds to step SP9, and the target organism has no gene that has a correlation between expression and suppression (has no NATs function). After the visual notification, the gene regulatory network analysis process is terminated.

このようにしてＣＰＵ１０は、遺伝子制御ネットワーク解析処理を実行し、相補配列となる遺伝子ごとに発現抑制メカニズムの種を分類することができるようになされている。なお、各ステップの順序はこの図１４に示すものに限定されるものではない。 In this way, the CPU 10 can execute the gene regulatory network analysis process and classify the species of the expression suppression mechanism for each gene to be a complementary sequence. Note that the order of the steps is not limited to that shown in FIG.

（５）動作及び効果
以上の構成において、この遺伝子制御ネットワーク解析装置４は、標的細胞において相補配列となる遺伝子ごとに、当該遺伝子に対応する測定時間ｔ_１〜ｔ_ｎごとの発現量ＧＥの挙動パターンと、該遺伝子における同じ染色体でのオーバーラップの有無とに応じてＮＡＴｓの種を分類する。 (5) Operation and Effect In the above configuration, the gene regulatory network analyzer 4 performs the behavior of the expression level GE for each measurement time t _{1 to} t _n corresponding to the gene for each gene that is a complementary sequence in the target cell. The NATs species are classified according to the pattern and the presence or absence of overlap in the same chromosome in the gene.

したがってこの遺伝子制御ネットワーク解析装置４では、構造レベルで発現抑制関係をもつ可能性を有する遺伝子（相補配列となる遺伝子）について、その遺伝子における実際の発現量の挙動パターンと、同じ染色体でのオーバーラップの有無とから、実際の機序レベルで発現抑制メカニズムの種を裏付け（証明）し分類することができる。 Therefore, in this gene regulatory network analysis device 4, with respect to a gene having a possibility of having an expression suppression relationship at the structure level (a gene serving as a complementary sequence), the behavior pattern of the actual expression level in the gene and the overlap in the same chromosome Based on the presence / absence of phenotypes, the species of the expression suppression mechanism can be supported (proven) and classified at the actual mechanism level.

この結果この遺伝子制御ネットワーク解析装置４では、発現と抑制の相互関係を実際の機序レベルを経て自動的かつ網羅的に解析することができるため、遺伝子制御ネットワーク解析に関して信頼性を向上することができる。 As a result, the gene regulatory network analyzer 4 can automatically and comprehensively analyze the correlation between expression and repression through the actual mechanism level, so that the reliability of gene regulatory network analysis can be improved. it can.

また、この遺伝子制御ネットワーク解析装置４では、発現量ＧＥの挙動パターンを生成する手法として、観測時間の時間経過方向における遺伝子発現量ＧＥの正又は負の差をとってバイナリ列を生成する手法が採用される（図６）。 Moreover, in this gene control network analysis apparatus 4, as a method for generating a behavior pattern of the expression level GE, a method for generating a binary string by taking a positive or negative difference in the gene expression level GE in the time passage direction of the observation time is used. Adopted (FIG. 6).

従来、遺伝子を分類するという要請があるがあるものの、遺伝子発現量の時間的挙動がどのようなときにいかなるカテゴリに分類するのかという一般的な基準がないということが現状であった。しかしながら、この遺伝子制御ネットワーク解析装置４では、各観測時間の発現量を、当該観測時間の時間推移に沿って「上がった」もしくは「下がった」とする挙動だけに着目した２値の系列（バイナリ列）に変換することで、該バイナリ列がとり得るパターン（正負パターン）を必然的に得ることができる。 Conventionally, although there has been a request for classifying genes, there is no general standard for when to classify the gene expression level in what category. However, in this gene regulatory network analyzer 4, a binary series (binary) that focuses only on the behavior that the amount of expression at each observation time is “up” or “down” along the time transition of the observation time. By converting into (column), a pattern (positive / negative pattern) that can be taken by the binary string can be inevitably obtained.

またこの遺伝子制御ネットワーク解析装置４は、各遺伝子を、その発現量の時間的挙動から簡易にパターン化することができることができるため、数万個もある遺伝子からとり得る全ての組み合わせを比較する場合に効率的となる。 In addition, since the gene regulatory network analyzer 4 can easily pattern each gene from the temporal behavior of its expression level, when comparing all possible combinations from tens of thousands of genes. Become more efficient.

また、この遺伝子制御ネットワーク解析装置４では、ＮＡＴｓの種として、実証済みの遺伝子制御メカニズム（ケースＡまたはケースＤのｃｉｓＮＡＴｓ）と、仮説段階の遺伝子制御メカニズム（ケースＣのｃｉｓＮＡＴｓ又はｔｒａｎｓＮＡＴｓ）とに細分する。 Further, in this gene regulatory network analysis device 4, the NATs species are subdivided into proven gene control mechanisms (case NAT or cis NATs) and hypothetical gene control mechanisms (case NAT cisNATS or transNATSs). To do.

したがって、この遺伝子制御ネットワーク解析装置４では、その後の仮説の立証等の研究動向に応じて、ネットワーク構築の編集を容易化することができる。 Therefore, in this gene regulatory network analysis apparatus 4, editing of network construction can be facilitated according to research trends such as subsequent verification of hypotheses.

またこの遺伝子制御ネットワーク解析装置４は、標的生物の細胞で発現される遺伝子における発現量を単に取得するのではなく、該発現量の測定間隔が、設定される上限閾値と下限閾値との範囲となる当該発現量を、選択的に取得するようになされている。 The gene regulatory network analyzer 4 does not simply acquire the expression level in the gene expressed in the cells of the target organism, but the measurement interval of the expression level is a range between a set upper threshold and a lower threshold. The expression level is obtained selectively.

したがって遺伝子制御ネットワーク解析装置４では、実際の機序レベルで裏付けする際の判定基準となる推移量を、一定の基準のもとで取得するため、多岐にわたる標的生物の発現と抑制の相互関係を、同等の基準で発現と抑制の作用相関があるか否かを判定することができる。 Therefore, since the gene regulatory network analyzer 4 obtains the transition amount as a criterion for supporting at the actual mechanism level under a certain standard, the correlation between expression and suppression of a wide variety of target organisms is obtained. It is possible to determine whether or not there is an action correlation between expression and suppression based on equivalent criteria.

以上の構成によれば、標的生物における発現と抑制の相互関係を実際の機序レベルを経て自動的かつ網羅的に解析することができるようにしたことにより、信頼性を向上し得る遺伝子制御ネットワーク解析装置４を実現し得る。 According to the above configuration, the gene regulatory network that can improve the reliability by enabling the automatic and comprehensive analysis of the correlation between expression and suppression in the target organism through the actual mechanism level. The analysis device 4 can be realized.

（６）他の実施の形態
上述の実施の形態では、蛍光強度測定装置３を用いて細胞中の標的核酸の遺伝子発現量を示す発現量データが取得された。しかしながら、取得態様はこの実施の形態に限定されるものではない。例えば、標的細胞で発現される各ｍＲＮＡを抽出し、これらをリアルタイムＰＣＲ(Polymerase Chain Reaction)を用いて、一定量に増殖することにより直接的に取得するといった態様も適用可能である。 (6) Other Embodiments In the above-described embodiment, the expression level data indicating the gene expression level of the target nucleic acid in the cell is acquired using the fluorescence intensity measurement device 3. However, the acquisition mode is not limited to this embodiment. For example, an embodiment in which each mRNA expressed in the target cell is extracted and directly obtained by proliferating to a certain amount using real-time PCR (Polymerase Chain Reaction) is also applicable.

また例えば、所定の機関において蛍光強度測定装置やリアルタイムＰＣＲを用いて測定され、その結果得られるｍＲＮＡの量を示すデータを格納したデータ格納媒体から取得するといった態様も適用可能である。この態様を適用した場合、例えば遠方となる複数の実験場所で得られたものから、当該実験場所で平均的な指標値として、ｍＲＮＡの量を示すデータを格納するといったことが可能となる。 Further, for example, it is also possible to apply a mode in which data is obtained from a data storage medium that stores data indicating the amount of mRNA obtained as a result of measurement using a fluorescence intensity measuring device or real-time PCR in a predetermined organization. When this aspect is applied, for example, data indicating the amount of mRNA can be stored as an average index value at a plurality of experimental locations that are far away.

なお、データ格納媒体としては、例えばフレキシブルディスク、ＣＤ−ＲＯＭ（Compact Disk-Read Only Memory）、ＤＶＤ（Digital Versatile Disc）等のパッケージメディアや、データが一時的若しくは永続的に格納される半導体メモリや磁気ディスク等がある。またこれらデータ格納媒体からデータを取得する方法としては、ローカルエリアネットワークやインターネット、ディジタル衛星放送等の有線又は無線の通信媒体を利用することができる。 Examples of the data storage medium include package media such as a flexible disk, CD-ROM (Compact Disk-Read Only Memory), and DVD (Digital Versatile Disc), a semiconductor memory in which data is temporarily or permanently stored, There are magnetic disks. In addition, as a method for acquiring data from these data storage media, a wired or wireless communication medium such as a local area network, the Internet, or digital satellite broadcasting can be used.

また、読取量として、上述の実施の形態では光学的に発光量を読み取るようにした。しかしながら、読取量はこの実施の形態に限定されるものではない。例えば、電磁学的に電気量又はインピーダンス量等を適用することもできる。要は、所定の物理量を読み取るセンサによって読み取られた読取量を適用することができる。なお、核酸チップＣＰとしては、例えば、Affymetorix社製、スタンフォード型等を適用することができ、これら以外のものを適用することもできる。 Further, as the reading amount, in the above-described embodiment, the light emission amount is optically read. However, the reading amount is not limited to this embodiment. For example, an electromagnetic quantity or an impedance quantity can be applied electromagnetically. In short, a reading amount read by a sensor that reads a predetermined physical quantity can be applied. In addition, as the nucleic acid chip CP, for example, a Stanford type manufactured by Affymetorix can be applied, and a chip other than these can also be applied.

また、読取場所として、上述の実施の形態では核酸チップＣＰとした。しかしながら、読取場所はこれに限定されるものではない。例えば、組織切片を適用することができ、これ以外の場所も適用することができる。 In addition, the reading place is the nucleic acid chip CP in the above-described embodiment. However, the reading place is not limited to this. For example, a tissue section can be applied and other locations can also be applied.

また、遺伝子発現量の演算手法として、上述の実施の形態ではＭＡＳを適用した。しかしながら、演算手法はこれに限定されるものではない。例えば、ＲＭＡ等の既知のデータ解析ソフトウェア、その他の新規の演算手法を適用することができる。要は、センサから読み取られた、相補鎖の形成量を示すデータを統計学的手法を用いて補正するものであればよい。 Further, MAS is applied as a method for calculating the gene expression level in the above-described embodiment. However, the calculation method is not limited to this. For example, known data analysis software such as RMA or other new calculation methods can be applied. In short, it is sufficient if data indicating the amount of complementary strand formed read from the sensor is corrected using a statistical method.

また上述の実施の形態では、各観測点（ｔ_１〜ｔ_ｎ（図６参照））における挙動パターンに発現と抑制の作用相関がある場合として、観測時間の時間経過方向における遺伝子発現量ＧＥの正又は負の差をとったバイナリ列（発現量バイナリ列）の符号が完全に同一又は反対となるパターンとなる場合とされた。しかしながら、必ずしも、完全に同一又は反対となる場合でなくてもよい。すなわち、符号の１つあるいは２つが相違していても、発現と抑制の作用相関がある場合とすることができる。この符号の相違数は、バイナリ列が長い（バイナリ列を構成する符号数が多い）ほど、大きく設定することができる。 Further, in the above-described embodiment, _{assuming that} the behavior pattern at each observation point (t _{1 to} t _n (see FIG. 6)) has an action correlation between expression and suppression, the gene expression level GE in the time course direction of the observation time It was set as the case where it became a pattern from which the code | symbol of the binary sequence (expression amount binary sequence) which took the positive or negative difference became completely the same or opposite. However, it does not necessarily have to be completely the same or opposite. That is, even if one or two of the codes are different, there can be a case where there is an action correlation between expression and suppression. The number of code differences can be set larger as the binary string is longer (the number of codes constituting the binary string is larger).

また、バイナリ列（発現量バイナリ列）のパターンに代えて、別のものを採用してもよい。例えば、比較対象とされる各観測点（ｔ_１〜ｔ_ｎ（図６参照））における発現量の相関係数を算出し、この相関係数が発現と抑制の作用相関があるとすべき最低値以上となる場合とすることができる。 Further, instead of a binary string (expression binary string) pattern, another pattern may be adopted. For example, the correlation coefficient of the expression level at each observation point (t _{1 to} t _n (see FIG. 6)) to be compared is calculated, and this correlation coefficient should be the minimum that should have an action correlation between expression and suppression. It can be the case where it becomes more than a value.

また別例として、発現量の総測定時間長（ｔ_ｎ−ｔ_１）に対する、比較される発現量に一定の比例関係がある推移部分の時間長の割合を算出し、この割合が発現と抑制の作用相関があるとすべき最低値以上となる場合とすることができる。 As another example, the ratio of the time length of the transition portion having a certain proportional relationship to the expression level to be compared with the total measurement time length (t _n −t ₁ ) of the expression level is calculated, and this ratio is expressed and suppressed. It can be set as the case where it becomes more than the minimum value which should be there.

また別例として、発現量における推移量（図６におけるｔ_１〜ｔ_ｎ）を形状パターンとして捉えてマッチングし、該形状パターンの重なりの程度が最も大きくなる場合の重なり量と角度が、発現と抑制の作用相関があるとすべき最低値以上となる場合とすることができる。 As another example, the transition amount (t _{1 to} t _n in FIG. 6) of the expression level is matched as a shape pattern and matched, and the overlap amount and angle when the degree of overlap of the shape pattern is the largest are expressed and It can be set as the case where it becomes more than the minimum value which should have an action correlation of suppression.

また上述の実施の形態では、非特許文献におけるケースＡまたはケースＤのｃｉｓＮＡＴｓと、ケースＣのｃｉｓＮＡＴｓと、ｔｒａｎｓＮＡＴｓとが分類要素とされた。しかしながら、これら以外の発現抑制メカニズムを分類要素とするようにしてもよい。 In the above-described embodiment, cisNATSs of case A or case D, cisNATSs of case C, and transNATSs in the non-patent literature are used as classification elements. However, expression suppression mechanisms other than these may be used as classification elements.

本発明は、遺伝子実験、医薬の創製又は患者の経過観察などのバイオ産業上において利用可能である。 The present invention can be used in the bio-industry such as genetic experiments, creation of medicines, or patient follow-up.

本実施の形態による遺伝子制御ネットワーク解析システムの全体構成を示す略線図である。It is a basic diagram which shows the whole structure of the gene regulatory network analysis system by this Embodiment. ハイブリダイゼーションの説明に供する略線図である。It is an approximate line figure used for explanation of hybridization. 測定データの構造例を示す略線図である。It is a basic diagram which shows the structural example of measurement data. 遺伝子制御ネットワーク解析装置の構成を示すブロック図である。It is a block diagram which shows the structure of a gene regulatory network analyzer. 標的生物における単位時間当たりの遺伝子発現量の取得の説明に供する略線図である。It is a basic diagram with which it uses for description of acquisition of the gene expression level per unit time in a target organism. 遺伝子発現量のバイナリ化の説明に供する略線図である。It is an approximate line figure used for explanation of binarization of gene expression level. 発現と抑制の関係をもつと推定された遺伝子ペアの各遺伝子発現量の増減推移に一定の比例関係がある場合（１）を示すグラフである。It is a graph which shows (1) when there exists a fixed proportional relationship in the increase / decrease transition of each gene expression level of the gene pair estimated to have the relationship between expression and suppression. 発現と抑制の関係をもつと推定された遺伝子ペアの各遺伝子発現量の増減推移に一定の比例関係がある場合（２）を示すグラフである。It is a graph which shows (2) when there exists a fixed proportional relationship in the increase / decrease transition of each gene expression level of the gene pair estimated to have the relationship between expression and suppression. 遺伝子発現量の挙動が同一又は略同一の関係にある場合の発現量バイナリ列を示す略線図である。It is a basic diagram which shows an expression level binary string when the behavior of a gene expression level has the same or substantially the same relationship. 発現と抑制の関係をもつと推定された遺伝子ペアの各遺伝子発現量の増減推移におおよその比例関係がある場合（１）を示すグラフである。It is a graph which shows (1) when there exists an approximate proportional relationship in the increase-and-decrease transition of each gene expression level of the gene pair estimated to have the relationship between expression and suppression. 発現と抑制の関係をもつと推定された遺伝子ペアの各遺伝子発現量の増減推移におおよその比例関係がある場合（２）を示すグラフである。It is a graph which shows (2) when there exists an approximate proportional relationship in the increase-and-decrease transition of each gene expression level of the gene pair estimated to have the relationship between expression and suppression. 遺伝子発現量の挙動が反対又は略反対の関係にある場合の発現量バイナリ列を示す略線図である。It is a basic diagram which shows an expression level binary string when the behavior of a gene expression level is opposite or substantially opposite. 遺伝子制御ネットワークの表示例を示す略線図である。It is a basic diagram which shows the example of a display of a gene regulatory network. 遺伝子制御ネットワーク解析処理手順を示すフローチャートである。It is a flowchart which shows a gene regulatory network analysis processing procedure.

Explanation of symbols

１……遺伝子制御ネットワーク解析システム、２……ゲノムサーバ、３……蛍光強度測定装置、４……遺伝子制御ネットワーク解析装置、１０……ＣＰＵ、１１……ＲＯＭ、１２……ＲＡＭ、１３……操作部、１４……記憶部、１５……インターフェイス、１６……表示部、２１……遺伝子発現量取得部、２２……ゲノム情報取得部、２３……発現量読出部、２４……バイナリ化部、２５……分類部。 DESCRIPTION OF SYMBOLS 1 ... Gene control network analysis system, 2 ... Genome server, 3 ... Fluorescence intensity measuring device, 4 ... Gene control network analysis device, 10 ... CPU, 11 ... ROM, 12 ... RAM, 13 ... Operation unit, 14 ... storage unit, 15 ... interface, 16 ... display unit, 21 ... gene expression level acquisition unit, 22 ... genome information acquisition unit, 23 ... expression level reading unit, 24 ... binarization Part, 25 ... classification part.

Claims

An acquisition unit that acquires the expression level for each measurement time for a plurality of genes in the target cell;
For each gene that becomes a complementary sequence in the target cell, a complementary sequence depending on the behavior pattern of the expression level for each measurement time corresponding to the gene and the presence or absence of overlap in the same chromosome in the gene that becomes the complementary sequence A gene regulatory network analysis apparatus comprising: a classification unit that classifies species of an expression suppression mechanism for a gene to be expressed.

For each gene that becomes a complementary sequence in the target cell, further comprising a generating unit that generates a binary string by taking a positive or negative difference in the expression level in the direction of passage of the observation time,
The classification part
The gene control according to claim 1, wherein the species of expression suppression mechanism for the gene serving as the complementary sequence is classified according to the pattern of the binary string and the presence or absence of overlap in the same chromosome in the gene serving as the complementary sequence. Network analysis device.

The classification part
The gene regulatory network analysis apparatus according to claim 2, wherein the NATs are classified according to the binary string pattern and the presence or absence of overlap in the same chromosome in the gene serving as the complementary sequence.

The classification part
4. The gene regulatory network analysis according to claim 3, which is classified into at least proven NATs and hypothetical NATs according to the binary string pattern and the presence or absence of overlap in the same chromosome in the gene serving as the complementary sequence. apparatus.

The classification part
The gene regulatory network analyzer according to claim 3, wherein the gene is classified into at least cis-NATS and trans-NATS according to the binary string pattern and the presence or absence of overlap in the same chromosome in the gene serving as the complementary sequence. .

The acquisition unit
The gene control network analysis apparatus according to claim 3, wherein the expression level is measured so that the expression level measurement interval in the gene expressed in the cells of the target organism falls within a range between a set upper threshold and a lower threshold.

An acquisition step of acquiring an expression level for each measurement time for a plurality of genes in the target cell;
For each gene that becomes a complementary sequence in the target cell, a complementary sequence depending on the behavior pattern of the expression level for each measurement time corresponding to the gene and the presence or absence of overlap in the same chromosome in the gene that becomes the complementary sequence A gene regulatory network analysis method comprising: a classification step of classifying species of an expression suppression mechanism for a gene to be expressed.

Against the computer,
Obtaining the expression level of each gene for a plurality of genes in a target cell;
For each gene that becomes a complementary sequence in the target cell, a complementary sequence depending on the behavior pattern of the expression level for each measurement time corresponding to the gene and the presence or absence of overlap in the same chromosome in the gene that becomes the complementary sequence A gene regulatory network analysis program that classifies the species of the expression suppression mechanism for the target gene.