JP2006294014A

JP2006294014A - Analysis program, protein chip, method for manufacturing protein chip and antibody cocktail

Info

Publication number: JP2006294014A
Application number: JP2006072392A
Authority: JP
Inventors: Norie Araki; 令江荒木; Atsunobu Tokushige; 篤信徳重; Susumu Takahashi; 進高橋
Original assignee: Kumamoto Technology and Industry Foundation
Current assignee: Kumamoto Technology and Industry Foundation
Priority date: 2005-03-16
Filing date: 2006-03-16
Publication date: 2006-10-26

Abstract

<P>PROBLEM TO BE SOLVED: To efficiently proceed with proteomics analysis. <P>SOLUTION: Raw data including at least an electrophoretic image and a mass spectroscopic signal pattern by a mass analyzer having various different characteristics obtained by a proteome analysis method using tissues, cells and cultured cells derived from patients and disease model animals showing various clinical conditions, a protein ID identified based on the raw data and quantitative data are stored in a local DB and a proteomics DB. The proteomics analysis system converts the pieces of data stored in the proteomics DB into one language, fuses them together, retrieves other opened DBs accessible via a network to extract a specific protein group and a specific intracellular signal network. The clinical conditions are analyzed and information to the most sure medical treatment and drug design is obtained by predicting a function from these obtained results and performing verification by a unique protein chip. <P>COPYRIGHT: (C)2007,JPO&INPIT

Description

本発明は、病態プロテオミクスによる様々な疾患対象の生物学的な個性、各時点に於ける病態・進行状況等をタンパク質レベルで詳細且つ正確に解析することにより、疾患発症メカニズムや各患者に最も適した合理的な治療薬や予防法を確立することに最適な解析プログラムに関する。特に、この情報処理システムは、医薬、臨床検査、創薬などの分野で利用することに適する。 The present invention is most suitable for the disease onset mechanism and each patient by analyzing the biological personality of various disease subjects by pathological proteomics, and the detailed and accurate analysis at the protein level of the pathological condition and progress at each time point. It is related to an analysis program that is optimal for establishing reasonable therapeutic drugs and prevention methods. In particular, this information processing system is suitable for use in fields such as medicine, clinical examination, and drug discovery.

さらに本発明は、プロテインチップ、並びにそれを用いた病態の進行状況の診断方法や薬剤感受性の判定方法に関する。 Furthermore, the present invention relates to a protein chip and a method for diagnosing the progress of a disease state and a method for determining drug sensitivity using the protein chip.

これまでのプロテオーム解析の概念は、適当な網羅的なタンパク質解析装置あるいはＤＮＡアレイ法などの遺伝子レベルにおける解析法にて分離・解析・検索されたタンパク質について、主にディファレンシャルディスプレイによる検出とデータベース検索による同定を行って，スタンドアロン的にターゲットタンパク質を絞り出し、様々な疾患における病態・進行状況等をタンパク質レベルで探索する方法が一般的であった。 The concept of proteome analysis so far is mainly based on differential display detection and database search for proteins that have been separated, analyzed, and searched by appropriate comprehensive protein analyzers or DNA array methods such as DNA array methods. A common method is to identify and target proteins in a stand-alone manner, and to search for pathological conditions and progress in various diseases at the protein level.

また，プロテオミクス研究において２次元電気泳動を用いたディファレンシャルディスプレイ解析やＬＣ−ショットガン法用いたディファレンシャルディスプレイ解析は、それぞれの結果のみではすべての組織細胞内のタンパク質情報を網羅することができない。 In addition, differential display analysis using two-dimensional electrophoresis and differential display analysis using LC-shot gun method in proteomics research cannot cover protein information in all tissue cells only by each result.

さらに、現在の解析に用いられるペプチドのアミノ酸配列情報を得ることの出来る質量分析計においては、ペプチド種によっては同定が得意不得意分野を有する特徴をもつ様々なイオン化法（ＭＡＬＤＩ法やＥＳＩ法など）や異なる分離・検出法を有しており、一種類の質量分析計による分析方法のみや、一つの解析方法論ではすべての組織・細胞内のタンパク質を網羅できないことから、複数の解析方法、質量分析器を用いて解析する必要がある。組織・細胞内のタンパク質を網羅的に解析において分析装置から出力される結果は、大量に出力され、画像データやＨＴＭＬなどのインターネットファイルといった多種多様のデータタイプがあり、これらの異なる言語を用いてハイスループットで網羅的解析をすることは、実現には大きな障壁であった。 Furthermore, in mass spectrometers that can obtain the amino acid sequence information of peptides used in the current analysis, various ionization methods (MALDI method, ESI method, etc.) have characteristics that are notable for identification depending on the peptide type. ) And different separation / detection methods, and only one analysis method using one mass spectrometer, or a single analysis methodology cannot cover all tissues and cells. It is necessary to analyze using an analyzer. The results output from analyzers in comprehensive analysis of proteins in tissues and cells are output in large quantities, and there are a wide variety of data types such as image data and Internet files such as HTML. Using these different languages Comprehensive analysis with high throughput was a major barrier to realization.

従来の技術では、複数の解析装置データを統合して解析することができないため，プロテオミクスで最も重要とされる細胞内タンパク質ネットワークや翻訳後修飾、細胞内タンパク質の分解産物などの相互作用機能を再現することが困難であり、さらに、トランスクリプトーム研究で用いられるＤＮＡアレイ解析、リアルタイムＰＣＲ解析などによって得られるｍＲＮＡ情報は、タンパク質情報にも大きく関連しているが、同一サンプルからのプロテオミクス情報とｍＲＮＡ情報を統合して解析することが困難であるため、研究者の経験的知識と長時間の文献検索による調査により試行錯誤的に導出されていた。 With conventional technology, data from multiple analyzers cannot be integrated and analyzed, thus reproducing the most important functions of proteomics such as intracellular protein networks, post-translational modifications, and intracellular protein degradation products. In addition, mRNA information obtained by DNA array analysis, real-time PCR analysis, etc. used in transcriptome research is greatly related to protein information, but proteomic information and mRNA from the same sample Since it was difficult to integrate and analyze information, it was derived on a trial and error basis by investigating researcher's empirical knowledge and long-term literature search.

しかも、分子を探索した結果を得た後、再度生体内での再現性を確認する必要がある。 Moreover, after obtaining the results of searching for molecules, it is necessary to confirm reproducibility in vivo again.

また、特定の疾病へのかかりやすさ、医薬に対する副作用などに関与する所定遺伝子変異の検出／診断などの用途を開発するには、どのプローブを搭載すれば何が分かるかと言うコンテンツを整備する必要があり、単一ではなく複数のプローブの組み合わせにより初めて解析できる事象も多い。 In addition, in order to develop applications such as detection / diagnosis of specific gene mutations that are related to susceptibility to specific diseases and side effects on medicines, it is necessary to prepare content that indicates what probe is installed. There are many events that can be analyzed for the first time by a combination of multiple probes instead of a single one.

他方、非常に予後の悪い腫瘍として脳腫瘍があげられる。本発明者らは、病態発症メカニズム、治療法、治療・予防薬、予後を予測する臨床マーカー
の開発を目的として、種々の脳腫瘍のニューロプロテオミクスの手法を用いた解析を行っている（たとえば、非特許文献１、非特許文献２）。予後の悪い悪性グリオーマにおいて抗癌剤に感受性を持つものとして、乏突起神経膠腫(anaplastic origodendroglioma/astrosytoma:AOG)が知られている。一般的に知られるAOGの興味ある所見としては、第１染色体短腕上（1p36）及び第１９染色体長腕上（19q13）に片アレル欠失（Loss
of Heterozygosity ：LOH）を有しているタイプのもの(グレード２から３において60％以上)と、1p, 19qLOHを示さないタイプのものに分類できる。両者は病理学的には全く同様の所見を示すが、LOHを有するグループは抗癌剤療法や放射線治療に高感受性で一般的に予後がよく、逆にLOHを持たないタイプは、これらの療法に非感受性で非常に予後が悪いという特徴を持っている。現在までのところ、分子レベルで両タイプの違いを示すマーカー蛋白質はもとより、予後の違いの原因となる分子群や薬剤耐性のメカニズムに関する情報は、全く報告されていない。
「Proteomic analysis of human brain identifies -enolase as anovel autoantigen in Hashimoto's encephalopathy」、Ochi H、Horiuchi I、Araki N他著、FEBSLetters、528(1-3)、pp197-202、２００２年「脳腫瘍の病態プロテオミクスで見えてくる新たなシグナル伝達経路」、荒木令江著、“プロテオミクスでみえてくる生命機能の新たなメカニズム” 実験医学 vol23(7)、羊土社、pp1050-1058、２００５年 On the other hand, a brain tumor is mentioned as a tumor with a very poor prognosis. The present inventors have conducted analysis using various neuroproteomic techniques for brain tumors for the purpose of developing pathological mechanisms, therapeutic methods, therapeutic / preventive drugs, and clinical markers for predicting prognosis (for example, non-proteomics). Patent Document 1, Non-Patent Document 2). An oligoplastic glioma (anaplastic origodendroglioma / astrosytoma: AOG) is known to be sensitive to anticancer agents in malignant gliomas with a poor prognosis. An interesting finding of the commonly known AOG is the loss of a single allele on the short arm of chromosome 1 (1p36) and the long arm of chromosome 19 (19q13) (Loss
of Heterozygosity: LOH) (60% or more in grades 2 to 3) and those not showing 1p, 19qLOH. Although both have the same pathological findings, the group with LOH is highly sensitive to anticancer drug therapy and radiotherapy and generally has a good prognosis, while the type without LOH is not sensitive to these therapies. It is characterized by sensitivity and very poor prognosis. To date, there has been no report on the molecular proteins that cause differences in prognosis, as well as on the marker proteins that show the difference between the two types at the molecular level, and on the mechanism of drug resistance.
"Proteomic analysis of human brain identifies -enolase as anovel autoantigen in Hashimoto's encephalopathy", Ochi H, Horiuchi I, Araki N et al., FEBSLetters, 528 (1-3), pp197-202, 2002 “New signal transduction pathways revealed by pathological proteomics in brain tumors” by Rekie Araki, “New mechanisms of vital functions seen in proteomics” Experimental Medicine vol23 (7), Yodosha, pp1050-1058, 2005 Year

上述したように、従来のプロテオーム解析の概念は、２Ｄ−ＰＡＧＥや２Ｄ−ＬＣなどで分離したタンパク質やペプチドについて、種々の質量分析装置によりペプチドマスフィンガープリント法やシークエンスタグ法などでデータベース検索することにより、病態に関わるタンパク質の同定と定量を行うものである。しかしながら、分析装置の解析は個々の装置によって異なるため，得られたデータはそれぞれ独立しており，複数の装置の解析データを統合して評価することは困難である。このため，複数の装置によって異なる概念で解析を行い、当該タンパク質のより多くの情報を得たとしても、それを有機的に結びつけて解釈することは困難であり，本来の網羅的解析を実行するに至っていない。 As described above, the concept of conventional proteome analysis is to search the database for proteins and peptides separated by 2D-PAGE, 2D-LC, etc. using various mass spectrometers using the peptide mass fingerprint method or sequence tag method. By this, identification and quantification of proteins involved in pathological conditions are performed. However, since the analysis of the analysis device differs depending on the individual device, the obtained data is independent, and it is difficult to evaluate the analysis data of a plurality of devices in an integrated manner. For this reason, even if more than one piece of information is analyzed using different devices and more information about the protein is obtained, it is difficult to organically link it and interpret it. It has not reached.

したがって、プロテオミクス解析に要求される翻訳後修飾や、細胞内タンパク質の分解産物などの相互作用機能の評価や再現性を確認する際に困難を伴う。また、新しい概念の手法や装置が開発された場合でもそれらのデータを結びつけて解析することが困難である。 Therefore, there are difficulties in evaluating post-translational modifications required for proteomics analysis and the evaluation and reproducibility of interaction functions such as degradation products of intracellular proteins. In addition, even when a new conceptual method or apparatus is developed, it is difficult to combine and analyze the data.

本発明は、プロテオミクス解析を、効率的に進めることができる解析プログラムを提供することを目的とする。 An object of this invention is to provide the analysis program which can advance a proteomics analysis efficiently.

さらに本発明は、上記したプロテオミクス解析システムおよび方法により得られた分子情報を利用して、疾患の進行状況や薬剤感受性などのモニタリングに使用可能な分子群をリストアップして、患者組織・細胞や血清蛋白質のｎａｔｕｒａｌ微小生体組織・細胞タンパクチップ上における上記分子群の発現パターンをプロファイリングし、データベース化して、より確実な病態の進行状況や薬剤感受性などをモニタリングするシステム及び方法を提供することを解決すべき課題とした。 Furthermore, the present invention uses molecular information obtained by the above-mentioned proteomic analysis system and method to list molecular groups that can be used for monitoring disease progress, drug sensitivity, etc. Solved the problem of providing a system and method for profiling the expression pattern of the above-mentioned molecular groups on a natural micro-biological tissue / cell protein chip of a serum protein and creating a database to monitor the progress of the disease state and drug sensitivity more reliably. It was an issue that should be done.

本発明は、プロテオミクス研究分野で使用されている様々な分析装置、解析ソフトウェアから得られる大量かつ多種多様のデータタイプのタンパク質情報データ、および、ゲノミクスやトランスクリプトーム研究分野から導き出されたタンパク質分子データをプロテオミクスデータベースに統合集中管理し、データマイニングにより機能分析した結果を後機能分析データベースに格納することにより、研究者の経験や自動計算分類された統合化タンパク質情報と実験から得られた信頼性の高いＲＡＷデータを区別し、データの信頼性を確保しながら分子機能分析も大量に、また以前よりも短期間に進めることが可能となる。 The present invention relates to a variety of analytical devices used in the field of proteomics research, protein information data of a large variety of data types obtained from analysis software, and protein molecular data derived from the field of genomics and transcriptome research. Is integrated into the proteomics database, and the results of functional analysis by data mining are stored in the post-functional analysis database. It is possible to distinguish high RAW data and perform molecular function analysis in a large amount and in a shorter time than before while ensuring data reliability.

本発明の目的は、各種病態を示す患者や疾患モデル動物由来組織、細胞および培養細胞を用いてプロテオーム解析方法によって得られた、電気泳動画像、および、各種異なる特徴をもつ質量分析器による質量分析シグナルパターンを少なくとも含む、生データ、並びに、当該生デーに基づいて同定したタンパク質ＩＤ、および、定量的データを格納したデータベースを備えたコンピュータにおいて、前記コンピュータに、
データベースに格納されたデータを同一言語に変換し、融合させるステップと、
ネットワークを介してアクセス可能な公開された他のデータベースを検索するステップと、
特異的タンパク質群を抽出し、これらの機能を予測し病態を解析するステップと、を実行させることを特徴とする解析プログラムにより達成される。 An object of the present invention is to provide an electrophoretic image obtained by a proteome analysis method using tissues, cells and cultured cells derived from patients and disease model animals exhibiting various pathological conditions, and mass spectrometry using mass spectrometers having various characteristics. In a computer comprising a raw data including at least a signal pattern, a protein ID identified based on the raw data, and a database storing quantitative data, the computer includes:
Converting and merging data stored in the database into the same language;
Searching other publicly accessible databases accessible over the network;
It is achieved by an analysis program characterized in that a specific protein group is extracted, and these functions are predicted to analyze a disease state.

ある実施態様では、複数のＬＣ−ＭＳショットガン法を適用して得た、ＭＳスペクトルデータを格納する、ＭＳスペクトルデータベースを備えたコンピュータにおいて、前記コンピュータに、
タンパク質を同定するステップと、
前記タンパク質の同定により得られた解析結果を、データベースに格納するステップと、
データベースに格納された解析結果と、タンパク相互機能解析システムおよび生物情報統合プラットフォームとを、ネットワークを介しリンクさせて融合するステップと、を実行させる。 In one embodiment, in a computer with an MS spectrum database storing MS spectrum data obtained by applying a plurality of LC-MS shotgun methods, the computer includes:
Identifying a protein;
Storing the analysis result obtained by the identification of the protein in a database;
The step of merging the analysis result stored in the database with the protein interaction analysis system and the biological information integration platform by linking them via a network is executed.

別の実施態様では、論理的スペクトルを格納したデータベースを備えたコンピュータにおいて、前記コンピュータに、
安定同位体標識ＬＣ−ＭＳショットガン法を適用して得た生データを、既存の条件にて作成したペプチドピークリストと、前記データベースに格納された理論的スペクトルとの間の類似性に基づいて、ペプチドを同定するステップと、
安定同位体標識されたペアのピークの強度差から、前記ペプチドを含むタンパク質含有相対量を決定するステップと、
特定の組織・細胞に特異的に出現したタンパク質群を分類するステップと、を実行させる。 In another embodiment, in a computer comprising a database storing logical spectra, said computer includes:
The raw data obtained by applying the stable isotope-labeled LC-MS shotgun method is based on the similarity between the peptide peak list created under the existing conditions and the theoretical spectrum stored in the database. Identifying a peptide; and
Determining a relative amount of protein containing the peptide from the difference in intensity of the pair of stable isotope-labeled pairs;
And classifying a protein group that specifically appears in a specific tissue / cell.

さらに他の実施態様においては、ＤＮＡアレイ、および／または、リアルタイムＰＣＲを含む、異なる性質の分析によるｍＲＮＡ定量的発現解析データ、および、２次元電気泳動、ＬＣ−ＥＳＩ−ＭＳ／ＭＳショットガン法、および／または、ＬＣ—ＭＡＬＤＩ−ＭＳ／ＭＳショットガン法を含む、異なる手法を用いたプロテオミクスによる定量的データを格納したデータベースを備えたコンピュータにおいて、前記コンピュータに、
前記データベースに格納されたデータのそれぞれについて、ディファレンシャルディスプレイを含む、同一のアルゴリズムによる解析を施すステップを実行させる。 In yet another embodiment, mRNA quantitative expression analysis data by analysis of different properties, including DNA arrays and / or real-time PCR, and two-dimensional electrophoresis, LC-ESI-MS / MS shotgun method, And / or a computer comprising a database storing quantitative data by proteomics using different techniques, including the LC — MALDI-MS / MS shotgun method,
Each of the data stored in the database is subjected to an analysis using the same algorithm including a differential display.

より好ましい実施態様においては、さらに、前記コンピュータが、
前記データベースを異なる他のデータベースとリンクされ、前記コンピュータに
前記データベースに格納された、前記解析を施された結果である解析データについて、前記リンクされた異なる他のデータベースに格納された情報に基づいて解析するステップを実行させる。 In a more preferred embodiment, the computer further comprises:
The analysis data, which is the result of the analysis performed by linking the database with another different database and stored in the database on the computer, is based on the information stored in the other linked different database. Run the step to analyze.

さらに他の実施態様においては、２Ｄ―ＤＩＧＥを含む画像データであって、スポットの情報を含む画像データ、および、翻訳後修飾構造を含むタンパク質情報を格納したデータベースを備えたコンピュータにおいて、前記コンピュータに、
前記画像データ中のスポットと、前記データベースからの翻訳後修飾構造を含むタンパク質情報とを関連付け、当該関連付けを、前記データベースに格納するステップを実行させる。 In still another embodiment, in a computer comprising image data including 2D-DIGE, including image data including spot information and protein information including post-translational modification structures, the computer includes ,
The step of associating the spot in the image data with the protein information including the post-translational modification structure from the database and storing the association in the database is executed.

好ましい実施態様においては、前記コンピュータに、作成した統合データベースの情報を分析装置の解析プラットフォームにリバースインプットし、再解析するステップを実行させる
また、より好ましい実施態様においては、ｍＲＮＡは、ＤＮＡアレイおよび／またはリアルタイムＰＣＲ、可溶化タンパク質は、２Ｄ−ＤＩＧＥを含む蛍光標識２次元電気泳動および／またはＩＣＡＴ、ｉＴＲＡＱを含む安定同位体標識法のＬＣ−ＭＳショットガン法を用いて、同時進行で、方式の定量的ディファレンシャルディスプレイによるデータを一元化して解析する。 In a preferred embodiment, the computer is caused to reversely input the information of the created integrated database to the analysis platform of the analyzer and execute a reanalysis step. In a more preferred embodiment, the mRNA is a DNA array and / or Alternatively, real-time PCR, solubilized proteins can be performed simultaneously using a fluorescence-labeled two-dimensional electrophoresis including 2D-DIGE and / or a stable isotope labeling LC-MS shotgun method including iTRAQ. Centralize and analyze data from quantitative differential displays.

また、別の実施態様においては、分子ネットワークを記憶したデータベースを備えたコンピュータにおいて、前記コンピュータに、
分子統合データに基づいて、前記データベース中の分子ネットワークを検するステップと、
病態で特異的に活性化あるいは特異的に失活しているなどの病態組織・細胞特異的シグナルカスケードを抽出するステップと、を実行させる。 In another embodiment, in a computer having a database storing a molecular network, the computer includes:
Examining a molecular network in the database based on molecular integration data;
Extracting a pathological tissue / cell-specific signal cascade such as specifically activated or specifically deactivated in a pathological condition.

さらに別の好ましい実施態様においては、異なる特徴を持つ質量分析器による質量分析シグナルパターンを含む生データに基づいて同定したＡｃｃｅｓｓｉｏｎＮｏ、および、Ｉｎｔｅｎｓｉｔｙ比を少なくとも格納したデータベースを備えたコンピュータにおいて、前記コンピュータに、
前記データベースに格納された、異なる質量分析器に基づく、同一のＡｃｃｅｓｓｉｏｎＮｏに関するＩｎｔｅｎｓｉｔｙ比を抽出するステップと、
前記Ｉｎｔｅｎｓｉｔｙ比の値を統合し、統合した値を、前記ＡｃｃｅｓｓｉｏｎＮｏと関連付けて、前記データベースに記憶するステップと、を実行させる。 In yet another preferred embodiment, a computer comprising an Accession No identified based on raw data including mass spectrometry signal patterns from mass analyzers having different characteristics, and a database storing at least the intensity ratio, the computer includes: ,
Extracting an intensity ratio for the same Accession No. based on different mass analyzers stored in the database;
Integrating the values of the intensity ratio, and storing the integrated values in the database in association with the AccessionNo.

より好ましくは、さらに、前記データベースが、ＤＮＡアレイに由来するデータに基づくＡｃｃｅｓｓｉｏｎＮｏ、および、Ｉｎｔｅｎｓｉｔｙ比を備え、
前記データベースに格納された、同一のＡｃｃｅｓｓｉｏｎＮｏに関する統合されたＩｎｔｅｎｓｉｔｙ比、および、ＤＮＡアレイに関するＩｎｔｅｎｓｉｔｙ比を抽出するステップと、
前記Ｉｎｔｅｎｓｉｔｙ比の値をノーマライズして、ノーマライズされた値を、前記ＡｃｃｅｓｓｉｏｎＮｏと関連付けて、前記データベースに記憶するステップと、を実行させる。 More preferably, the database further includes an Accession No based on data derived from the DNA array, and an Intensity ratio,
Extracting an integrated intensity ratio for the same Accession No and an intensity ratio for a DNA array stored in the database;
And normalizing the intensity ratio value and storing the normalized value in association with the AccessionNo in the database.

また、別の好ましい実施態様においては、２Ｄ−ＤＩＧＥを含む画像データであって、スポットの情報を含む画像データを格納したデータベースを備えたコンピュータにおいて、前記コンピュータに、
前記スポットのそれぞれの情報、および、前記コンピュータに接続されたデータベースであって、前記既知のタンパク質のスポットの情報を格納した他のデータベースを参照して、前記スポットの情報と、前記既知のタンパク質のスポットの情報がマッチする場合に、当該スポットの情報として、前記既知のタンパク質のスポットの情報を与え、前記データベースに格納するステップと、
マッチしない場合に、質量分析計による解析を含む他の手法により取得し、前記データベースに格納した、前記スポットの第２の情報に基づいて、前記スポットにかかるタンパク質を同定するステップと、を実行させる。 In another preferred embodiment, in a computer comprising a database storing image data including 2D-DIGE and including spot information, the computer includes:
Each spot information and a database connected to the computer, the other information database storing the known protein spot information, the spot information and the known protein information When the spot information matches, giving the spot information of the known protein as the spot information, and storing in the database,
A step of identifying a protein applied to the spot based on the second information of the spot, which is obtained by another method including analysis by a mass spectrometer and stored in the database when there is no match .

好ましい実施態様においては、前記前記第２の情報に基づいてタンパク質を同定するステップは、前記コンピュータに接続されたさらに他のデータベースであって、前記他の手法により同定されたタンパク質の情報が格納されたデータベースを検索するステップを含む。 In a preferred embodiment, the step of identifying a protein based on the second information is a further database connected to the computer, and stores information of the protein identified by the other method. Searching the database.

また、本発明は、病態組織・細胞の特異的な部位より、ｍＲＮＡおよびタンパク質を同時に同じ可溶化組織から調製する方法を提供する。 The present invention also provides a method for simultaneously preparing mRNA and protein from the same solubilized tissue from specific sites of pathological tissues / cells.

さらに、本発明は、生体組織や細胞から抽出したタンパク質をそのまま微少二次元電気泳動（２ＤＥ）に供与することを特徴とする方法であって、非常に小さなプロテイン２Ｄマップを提供する。更に作成された２Ｄマップを電気的にプローブに転写して固相化することを特徴とする、組織細胞内で合成されたすべてのタンパク質を網羅したｎａｔｕｒａｌな微小生体組織細胞タンパクチップ提供する。 Furthermore, the present invention provides a very small protein 2D map, which is characterized in that a protein extracted from a living tissue or cell is directly supplied to micro two-dimensional electrophoresis (2DE). Furthermore, the present invention provides a natural micro-tissue cell protein chip that covers all proteins synthesized in tissue cells, characterized in that the prepared 2D map is electrically transferred to a probe and immobilized.

また、ｎａｔｕｒａｌ微小生体組織細胞タンパクチップ上のすべてのタンパク質や糖鎖やリン酸化などの翻訳後修飾タンパク質を、Ｃｙなどの蛍光色素によってあらかじめ標識する、あるいは特異的な蛍光染試薬で染色して、微少チップ上のすべてのタンパク質スポット情報を画像化する方法を提供する。 In addition, all proteins on the natural micro-tissue cell protein chip, post-translationally modified proteins such as sugar chains and phosphorylation are pre-labeled with a fluorescent dye such as Cy, or stained with a specific fluorescent staining reagent, A method for imaging all protein spot information on a microchip is provided.

さらに、ｎａｔｕｒａｌ微小生体組織・細胞タンパクチップを用いて、各種病態をしめる患者血清と反応させて、患者血清中の自己抗体に対する抗原を検出する方法、および、これらの結果からチップ上の患者自己抗体陽性スポット群のプロファイリングを行い、データベース化して、病態の進行状況や薬剤感受性などをモニタリングする方法を提供する。 Furthermore, a method for detecting antigens against autoantibodies in patient sera by reacting with patient sera showing various pathological conditions using natural micro-tissue / cell protein chips, and patient autoantibodies on the chip based on these results Providing a method for profiling positive spots and creating a database to monitor the progress of disease states and drug sensitivity.

また、本発明は、たとえば、上記請求項１、２、３、８、９などによって得られた分子情報から進行状況や薬剤感受性などのモニタリングに使用可能な分子群をリストアップして、これらの分子に対する抗体カクテルを用いて、患者組織・細胞や血清タンパク質のｎａｔｕｒａｌ微小生体組織・細胞タンパクチップ上の抗体反応陽性パターンをプロファイリングし、データベース化して、より確実な病態の進行状況や薬剤感受性などのモニタリングに供与する方法を提供する。 In addition, the present invention lists molecular groups that can be used for monitoring progress, drug sensitivity, etc. from the molecular information obtained by, for example, the above claims 1, 2, 3, 8, 9, etc. Using antibody cocktails against molecules, we can profile antibody reaction positive patterns on natural micro-biological tissues / cell protein chips of patient tissues / cells and serum proteins, and create a database for more accurate pathological progress and drug sensitivity. Provide a method to provide for monitoring.

また、本発明の他の目的は、生体組織又は細胞から抽出した蛋白質を蛍光色素で標識して得られた蛋白質試料をストリップ上で一次元目の等電点電気泳動に供し、次いで等電点電気泳動後のストリップを還元処理及びSDS化した後、二次元目のSDS-ポリアクリルアミド電気泳動に供することにより得られる、生体組織又は細胞内で合成された全蛋白質を網羅した微小サイズのプロテインチップにより達成される。 Another object of the present invention is to subject a protein sample obtained by labeling a protein extracted from living tissue or cells with a fluorescent dye to first-dimensional isoelectric focusing on a strip, and then isoelectric focusing. A micro-sized protein chip that covers all proteins synthesized in living tissue or cells after reduction and SDS conversion of the electrophoretic strip, followed by SDS-polyacrylamide electrophoresis in the second dimension. Is achieved.

好ましい実施態様においては、プロテインチップは、１０cm×１０cm以下のサイズを有する。 In a preferred embodiment, the protein chip has a size of 10 cm × 10 cm or less.

また、別の好ましい実施態様において、プロテインチップは、蛋白質試料が電気泳動ゲルからメンブレンに転写されて固相化されている。 In another preferred embodiment, the protein chip is immobilized on a protein sample transferred from an electrophoresis gel to a membrane.

好ましい実施態様において、プロテインチップの製造方法は、（１）生体組織又は細胞から抽出した蛋白質を蛍光色素で標識して得られた蛋白質試料をストリップ上で一次元目の等電点電気泳動に供する工程、及び、（２）等電点電気泳動後のストリップを還元処理及びSDS化した後、二次元目のSDS-ポリアクリルアミド電気泳動に供する工程を含む。 In a preferred embodiment, the method for producing a protein chip comprises (1) subjecting a protein sample obtained by labeling a protein extracted from a biological tissue or cell with a fluorescent dye to first-dimensional isoelectric focusing on a strip. And (2) a step of reducing the strip after isoelectric focusing and converting it to SDS, and then subjecting it to SDS-polyacrylamide electrophoresis in the second dimension.

また、別の好ましい実施態様において、プロテインチップの製造方法は、（１）生体組織又は細胞から抽出した蛋白質を蛍光色素で標識して得られた蛋白質試料をストリップ上で一次元目の等電点電気泳動に供する工程、（２）等電点電気泳動後のストリップを還元処理及びSDS化した後、二次元目のSDS-ポリアクリルアミド電気泳動に供する工程、及び（３）蛋白質試料をSDS-ポリアクリルアミド電気泳動後の電気泳動ゲルからメンブレンに転写する工程、を含む。 In another preferred embodiment, the method for producing a protein chip comprises (1) a protein sample obtained by labeling a protein extracted from a biological tissue or cell with a fluorescent dye, and a first-dimensional isoelectric point on the strip. Step for subjecting to electrophoresis, (2) Step for subjecting strip after isoelectric focusing to reduction treatment and SDS, and then subjecting it to second-dimensional SDS-polyacrylamide electrophoresis, and (3) SDS-poly for protein samples A step of transferring from the electrophoresis gel after acrylamide electrophoresis to the membrane.

別の好ましい実施態様において、該蛋白質試料をプロテインチップ上における蛋白質スポット情報として画像化する方法は、上記プロテインチップ上の蛋白質試料に標識されている蛍光色素に由来する蛍光を検出することを含む。 In another preferred embodiment, the method of imaging the protein sample as protein spot information on a protein chip includes detecting fluorescence derived from a fluorescent dye labeled on the protein sample on the protein chip.

また、本発明においては、（１）請求項１、２、３、８又は９に記載のプログラムをコンピュータに実行させることによって得られた分子情報から病態の進行状況又は薬剤感受性のモニタリングに使用可能な分子群をリストアップして、これらの分子に対する抗体カクテルを用意する工程、（２）請求項１４から１６の何れかに記載のプロテインチップに上記抗体カクテルを接触させる工程、及び（３）プロテインチップ上の抗体反応陽性パターンをプロファイリングし、データベース化する工程を含む、病態の進行状況又は薬剤感受性をモニタリングする方法が提供される。 In the present invention, (1) it can be used for monitoring the progress of a disease state or drug sensitivity from molecular information obtained by causing a computer to execute the program according to claim 1, 2, 3, 8, or 9. Preparing a group of molecules and preparing an antibody cocktail against these molecules, (2) contacting the antibody cocktail with the protein chip according to any one of claims 14 to 16, and (3) a protein A method for monitoring the progress of a disease state or drug sensitivity, comprising profiling antibody reaction positive patterns on a chip and creating a database is provided.

また、本発明においては、（１）抗GFAP抗体、抗CD44抗体、抗p53抗体、抗prothrombin抗体、抗α1-antitrypsin抗体、抗ubiqutine
activating enzyme E1抗体、抗Erk抗体、抗p-Tyr抗体、及び抗zyxin抗体から選択される少なくとも２種類以上の抗体を含む抗体カクテルを用意する工程、（２）脳腫瘍患者の生体組織又は細胞から抽出した蛋白質を蛍光色素で標識して得られた蛋白質試料をストリップ上で一次元目の等電点電気泳動に供し、次いで等電点電気泳動後のストリップを還元処理及びSDS化した後、二次元目のSDS-ポリアクリルアミド電気泳動に供することにより得られる、生体組織又は細胞内で合成された全蛋白質を網羅した微小サイズのプロテインチップに上記抗体カクテルを接触させる工程、及び（３）プロテインチップ上の抗体反応陽性パターンをプロファイリングし、データベース化する工程を含む、脳腫瘍の病態の進行状況又は薬剤感受性をモニタリングする方法が提供される。 In the present invention, (1) anti-GFAP antibody, anti-CD44 antibody, anti-p53 antibody, anti-prothrombin antibody, anti-α1-antitrypsin antibody, anti-ubiqutine
activating enzyme E1 antibody, anti-Erk antibody, anti-p-Tyr antibody, and a step of preparing an antibody cocktail containing at least two kinds of antibodies selected from anti-zyxin antibodies, (2) extraction from biological tissue or cells of brain tumor patients The protein sample obtained by labeling the obtained protein with a fluorescent dye is subjected to first-dimension isoelectric focusing on the strip, and then the strip after isoelectric focusing is reduced and converted to SDS. Contacting the antibody cocktail with a micro-sized protein chip covering all proteins synthesized in living tissue or cells obtained by subjecting to eye SDS-polyacrylamide electrophoresis, and (3) on the protein chip A method for monitoring the progress of a disease state or drug sensitivity of a brain tumor, comprising the steps of profiling a positive antibody reaction pattern of the Provided.

さらに、本発明においては、請求項２１に記載の方法において脳腫瘍の病態の進行状況又は薬剤感受性をモニタリングするために使用される抗体カクテルであって、抗GFAP抗体、抗CD44抗体、抗p53抗体、抗prothrombin抗体、抗α1-antitrypsin抗体、抗ubiqutine
activating wnzyme E1抗体、抗Erk抗体、抗p-Tyr抗体、及び抗zyxin抗体から選択される少なくとも２種類以上の抗体を含む抗体カクテルが提供される。 Furthermore, in the present invention, an antibody cocktail used in the method of claim 21 for monitoring the progress of a disease state or drug sensitivity of a brain tumor, comprising an anti-GFAP antibody, an anti-CD44 antibody, an anti-p53 antibody, Anti-prothrombin antibody, anti-α1-antitrypsin antibody, anti-ubiqutine
An antibody cocktail comprising at least two antibodies selected from activating wnzyme E1 antibody, anti-Erk antibody, anti-p-Tyr antibody, and anti-zyxin antibody is provided.

本発明によれば、プロテオミクス解析を、効率的に進めることができる解析システムおよび方法を提供することが可能となる。 ADVANTAGE OF THE INVENTION According to this invention, it becomes possible to provide the analysis system and method which can advance proteomics analysis efficiently.

さらに本発明によれば、上記したプロテオミクス解析システムおよび方法により得られた分子情報を利用して、疾患の進行状況や薬剤感受性などのモニタリングに使用可能な分子群をリストアップして、患者組織・細胞や血清蛋白質のｎａｔｕｒａｌ微小生体組織・細胞タンパクチップ上における上記分子群の発現パターンをプロファイリングし、データベース化して、より確実な病態の進行状況や薬剤感受性などをモニタリングするシステム及び方法を提供することが可能となる。 Furthermore, according to the present invention, by utilizing the molecular information obtained by the above-described proteomic analysis system and method, a list of molecular groups that can be used for monitoring disease progress, drug sensitivity, etc. is listed, To provide a system and method for profiling the expression pattern of the above-mentioned molecular groups on cells and serum protein natural micro-biological tissues / cell protein chips, creating a database, and monitoring the progress of disease states and drug sensitivity more reliably. Is possible.

以下、添付図面を参照して、本発明の実施の形態について説明する。図１は、本発明の実施の形態にかかるプロテオミクス解析システムの全体構成を示すブロックダイヤグラムである。図１に示すように、プロテオミクス解析システムは、プロテオミクスベースの処理装置群１２と、ゲノミクスベースの処理装置群１４とが設けられている。また、プロテオミクス解析システムは、それぞれの処理装置群に設けられたデータベース（ＤＢ）からの情報のうち所定の情報を記憶するプロテオミクスＤＢ１６、機能分類ＤＢ１８、プロテオミクスＤＢ１６へのデータ格納などを制御する統合解析システム２０および機能分類ＤＢ１８へのデータ格納などを制御するデータマイニングシステム２２を有する。これらの間は、バス、ＬＡＮ或いはＷＡＮなどのデータ通信路２６で相互に接続されている。また、国際ＤＢ２４との間でもデータ送受信が可能となっている。また、図示しないが、プロテオミクス解析システムは、表示装置と、マウスやキーボードを含む入力装置を備え、システムに含まれる種々のＤＢから読み出されたデータに基づく画像を表示することができ、また、オペレータからの指示や入力を受け付けて、必要な処理を実行することができる。 Embodiments of the present invention will be described below with reference to the accompanying drawings. FIG. 1 is a block diagram showing the overall configuration of a proteomic analysis system according to an embodiment of the present invention. As shown in FIG. 1, the proteomic analysis system includes a proteomics-based processing device group 12 and a genomics-based processing device group 14. The proteomics analysis system also performs integrated analysis for controlling data storage in the proteomics DB 16, function classification DB 18, and proteomics DB 16 that stores predetermined information among information from databases (DBs) provided in each processing device group. The system 20 has a data mining system 22 that controls data storage in the function classification DB 18. These are connected to each other via a data communication path 26 such as a bus, LAN, or WAN. In addition, data can be transmitted / received to / from the international DB 24. Although not shown, the proteomics analysis system includes a display device and an input device including a mouse and a keyboard, and can display images based on data read from various DBs included in the system. Necessary processing can be executed in response to an instruction or input from the operator.

本実施の形態において、タンパク質のデータベースとして一般に公開されたものを、「国際ＤＢ」と総称している。国際ＤＢ２４として、「ＳｗｉｓｓｎＰｌｏｔ」、「ＭＳＤＢ」、「ＧｅｎｅＢａｎｋ」などを利用することができる。 In the present embodiment, the publicly disclosed database of proteins is collectively referred to as “international DB”. As the international DB 24, “SwissnPlot”, “MSDB”, “GeneBank” and the like can be used.

図２は、プロテオミクスベースの処理装置群の構成をより詳細に示すブロックダイヤグラムである。第１の質量分析計３０〜第３の質量分析計３４は、それぞれ、ＭＡＬＤＩ−ＴＯＦ−ＴＯＦやＬＣ−ＭＳ／ＭＳなど、といった個々のタンパク質およびペプチドのマススペクトルを計測する装置である。第１の質量分析計３０〜第３の質量分析計３４にそれぞれ接続された、制御・解析部４０〜４４は、対応する質量分析計に対して、計測のための制御パラメータを送信するタンパク質計測のためのコントローラとしての機能、および、生データ（ＲＡＷデータ）の測定ポイントやスレッシュホールドを指定して生データから正規な信号を得るための解析機能を有している。 FIG. 2 is a block diagram showing the configuration of the proteomics-based processing device group in more detail. The first mass spectrometer 30 to the third mass spectrometer 34 are apparatuses that measure mass spectra of individual proteins and peptides such as MALDI-TOF-TOF and LC-MS / MS, respectively. The control / analysis units 40 to 44 connected to the first mass spectrometer 30 to the third mass spectrometer 34 respectively transmit protein control parameters for measurement to the corresponding mass spectrometer. And a function for analysis for obtaining a normal signal from the raw data by specifying a measurement point and a threshold of the raw data (RAW data).

スペクトルデータからタンパク質を導出する手法として、公開されている国際ＤＢなどの公共データベースの情報、或いは、インハウスで格納される独自のＤＢ（たとえば、後述するローカルＤＢ）が利用される。第１のローカルＤＢ５０〜第３のローカルＤＢ５４は、それぞれ、スペクトルの生データや、解析で得たデータを含むデータファイルを格納する機能を有する。なお、大量のデータを格納可能とするため、制御・解析部４０〜４４と別体に設けても良い。 As a method for deriving a protein from spectrum data, public database information such as an international DB or a unique DB (for example, a local DB described later) stored in-house is used. Each of the first local DB 50 to the third local DB 54 has a function of storing raw data of a spectrum and a data file including data obtained by analysis. In order to be able to store a large amount of data, it may be provided separately from the control / analysis units 40 to 44.

２Ｄ−ＤＩＧＥ測定装置３６は、プロテオーム解析を目的としたタンパク質の２次元電気泳動測定計で、タンパク質の分子量や等電点の特性からゲル上に分離したタンパク質の２次元位置からタンパク質を同定するために用いられる。２Ｄ−ＤＩＧＥ測定装置３６に接続された制御・解析部４６は、２Ｄ−ＤＩＧＥ測定計の制御、および、ゲルパターンの画像をデジタル画像として取り込み、画像処理後に解析データを出力する。第４のローカルＤＢ５６はゲルパターン画像や解析データを格納する。 The 2D-DIGE measuring device 36 is a protein two-dimensional electrophoresis measurement device for proteomic analysis, and identifies a protein from the two-dimensional position of the protein separated on the gel from the characteristics of the molecular weight and isoelectric point of the protein. Used for. A control / analysis unit 46 connected to the 2D-DIG measurement device 36 takes in the image of the control of the 2D-DIG measurement meter and the gel pattern as a digital image, and outputs analysis data after image processing. The fourth local DB 56 stores gel pattern images and analysis data.

図３は、ゲノミクスベースの処理装置群の構成をより詳細に示すブロックダイヤグラムである。マイクロダイセクション測定装置６０は、人の組織細胞の解析に供されるべき重要部位を選択的に紫外線レーザーで切りとり、ＤＮＡ、ＲＮＡタンパク質を採取する。マイクロダイセクション測定装置６０に接続された制御・解析部７０は、細胞切片を採取する機構の制御や、細胞の画像解析を実行する。得られた画像データや解析結果は、第５のローカルＤＢ８０に格納される。 FIG. 3 is a block diagram showing the configuration of the genomics-based processing device group in more detail. The microdissection measuring device 60 selectively cuts out important sites to be used for analysis of human tissue cells with an ultraviolet laser and collects DNA and RNA proteins. The control / analysis unit 70 connected to the microdissection measurement apparatus 60 executes control of a mechanism for collecting a cell slice and cell image analysis. The obtained image data and analysis results are stored in the fifth local DB 80.

ＤＮＡアレイは、ガラス製、ナイロン製またはプラスチック製の基板上に、特定の核酸配列の組み合わせを一定のパターンでスポットしたものである。ＤＮＡアレイ測定装置６２は、ＤＮＡアレイを使用して標識ＤＮＡまたはＲＮＡサンプルとハイブリダイゼーションさせることで機能的ゲノム解析を行う。ＤＮＡアレイ測定装置６２に接続された制御・解析部７２はＤＮＡアレイ測定装置６２の制御およびスポットＤＮＡの解析を行い、解析データを出力する。ＤＮＡアレイの画像データや解析データは、第６のローカルＤＢ８２に格納される。 A DNA array is obtained by spotting a specific combination of nucleic acid sequences in a fixed pattern on a glass, nylon or plastic substrate. The DNA array measurement device 62 performs functional genome analysis by hybridizing with a labeled DNA or RNA sample using a DNA array. A control / analysis unit 72 connected to the DNA array measuring device 62 controls the DNA array measuring device 62 and analyzes the spot DNA, and outputs analysis data. Image data and analysis data of the DNA array are stored in the sixth local DB 82.

リアルタイムＰＣＲ測定装置６４は、ＰＣＲの増幅量をリアルタイムでモニターし、ＤＮＡを解析する装置である。制御・解析部７４は、接続されたリアルタイムＰＣＲ測定装置６４の制御およびＤＮＡ解析を行う。第７のローカルＤＢ８４は、その解析結果を格納する。 The real-time PCR measurement device 64 is a device that monitors the amplification amount of PCR in real time and analyzes DNA. The control / analysis unit 74 controls the connected real-time PCR measurement device 64 and performs DNA analysis. The seventh local DB 84 stores the analysis result.

図１に表示されたプロテオミクスＤＢ１６は、第１のローカルＤＢ５０〜第７のローカルＤＢ８４と、通信路２６で接続され、これらＤＢ中の生データや解析データを、集結させる記憶を有する。ここで、集結とは、それぞれのローカルＤＢ中の生データや解析データの全部或いは一部を、プロテオミクスＤＢ１６が記憶するような構成により実現しても良いし、或いは、生データや解析データのそれぞれについて、ローカルＤＢ中の格納位置を示すポインタを、プロテオミクスＤＢ１６が記憶するような構成により実現しても良い。 The proteomics DB 16 displayed in FIG. 1 is connected to the first local DB 50 to the seventh local DB 84 via the communication path 26, and has a memory for collecting the raw data and analysis data in these DBs. Here, the collection may be realized by a configuration in which the proteomics DB 16 stores all or part of the raw data and analysis data in each local DB, or each of the raw data and analysis data. For the above, a pointer indicating the storage position in the local DB may be realized by a configuration in which the proteomics DB 16 stores the pointer.

統合解析システム２０は、上述したプロテオミクスベースの処理装置群１２やゲノミクスベースの処理装置群１４の制御・解析部が実行する解析を実施できるようになっており、プロテオミクスＤＢ１６に蓄積された生データ（ＲＡＷデータ）を解析し、解析結果をプロテオミクスＤＢ１６に格納することができる。また、統合解析システム２０は、プロテオミクス解析システム内で得られる全てのデータの配備アドレス（ローカルＤＢやプロテオミクスＤＢ１６中のアドレス）や、測定データの来歴等を管理するラボラトリー情報管理システムＬＩＭＳ（ＬａｂｏｒａｔｏｒｙＩｎｆｏｒｍａｔｉｏｎＭａｎａｇｅｍｅｎｔＳｙｓｔｅｍ）の機能も併せ持つ。 The integrated analysis system 20 can perform the analysis executed by the control / analysis unit of the above-described proteomics-based processing device group 12 or genomics-based processing device group 14, and the raw data stored in the proteomics DB 16 ( RAW data) and the analysis result can be stored in the proteomics DB 16. In addition, the integrated analysis system 20 is a laboratory information management system LIMS (Laboratory Information Management) that manages the deployment addresses of all data obtained in the proteomic analysis system (addresses in the local DB and the proteomics DB 16), the history of measurement data, and the like. It also has a System) function.

データマイニングシステム２２は、プロテオミクスＤＢに蓄積された膨大な量のデータから、意味のある相関関係やパターンを発見するデータマイニングを実行することができる。データマイニングシステム２２のプログラムに含まれるデータマイニングアルゴリズムにより、カテゴリー分類や機能分類等された結果は、機能分類データベースＤＢ１８に格納される。データマイングに使用されたデータは、統合解析システム２０のＬＩＭＳ機能によって常にモニターできる。 The data mining system 22 can execute data mining to find meaningful correlations and patterns from a huge amount of data stored in the proteomics DB. The result of category classification, function classification, and the like by the data mining algorithm included in the program of the data mining system 22 is stored in the function classification database DB18. The data used for data mining can always be monitored by the LIMS function of the integrated analysis system 20.

このように構成されたプロテオミクス解析システムにて実行される処理の例について以下に説明する。なお、図面においては、第１のローカルＤＢ〜第５のローカルＤＢを、それぞれ、ローカルＤＢ１〜ローカルＤＢ５とも表記している。 An example of processing executed by the proteomic analysis system configured as described above will be described below. In the drawing, the first local DB to the fifth local DB are also expressed as local DB1 to local DB5, respectively.

図４は、プロテオミクスベースの処理装置群１２を用いたデータプロファイリング方法のうち、安定同位体標識法を用いたＬＣ−ショットガン法、および、２次元電気泳動を用いたディファレンシャルディスプレイ法の概略を示す図である。 FIG. 4 shows an outline of LC − a shotgun method using a stable isotope labeling method and a differential display method using two-dimensional electrophoresis among data profiling methods using a proteomics-based processor group 12. FIG.

第３の質量分析計３４および制御・解析部４４を利用して、複数のサンプルを各種の異なる分子量を持つ同位体標識試薬や、同位体を含むアミノ酸の「ｉｎｖｉｖｏ標識法」などで標識し、同時進行で定量的差異解析をＬＣ−ＭＳ／ＭＳを用いたショットガン法などで行い、同定、および、定量データを、第３のローカルＤＢ５４に格納する。その一方、２Ｄ−ＤＩＧＥ測定装置４６および制御・解析部４６を利用して、同じ複数のサンプルを、二次元電気泳動を用いたディファレンシャルディスプレイ解析法などを用いて、タンパク質スポットの解析結果（等電点、分子量、定量的、タンパク質、翻訳後修飾などの同定結果）を、その画像データを含めて第４のローカルＤＢ５６に格納する。これらのデータとさらに補足的なデータを加え、一つのデータベース（プロテオミクスＤＢ１６）に統合し、プロテオミクス統合データベースとする。 Using the third mass spectrometer 34 and the control / analysis unit 44, a plurality of samples are labeled with various isotope labeling reagents having different molecular weights or “in vivo labeling method” of amino acids containing isotopes. Quantitative difference analysis is simultaneously performed by a shotgun method using LC-MS / MS, and identification and quantitative data are stored in the third local DB 54. On the other hand, by using the 2D-DIGE measurement device 46 and the control / analysis unit 46, the same plurality of samples are analyzed using a differential display analysis method using two-dimensional electrophoresis, etc. The identification results such as point, molecular weight, quantitative, protein, post-translational modification) are stored in the fourth local DB 56 including the image data. These data and additional data are added and integrated into one database (proteomics DB 16) to form a proteomics integrated database.

ＬＣ―ＭＳ／ＭＳを用いた手法について、この手法では、２つのペアの標識ペプチドで、クロマトグラフ形式で計測された後、ＭＳ／ＭＳ解析によりタンパク質同定処理が実行される。図５は、タンパク質同定処理を示すフローチャートである。この処理の結果、ＩＣＡＴ法により「Ｌｉｇｈｔ」と「Ｈｅａｖｙ」で示されたインテンシティ（発現強度）情報が得られ、第３のローカルＤＢ５４に格納される。図４および図５の処理については、後により詳しく説明する。 Regarding the technique using LC-MS / MS, in this technique, two pairs of labeled peptides are measured in a chromatographic format, and then a protein identification process is executed by MS / MS analysis. FIG. 5 is a flowchart showing the protein identification process. As a result of this processing, intensity (expression intensity) information indicated by “Light” and “Heavy” is obtained by the ICAT method and stored in the third local DB 54. The processing of FIGS. 4 and 5 will be described in detail later.

制御・解析部４４によるノイズ処理やセントロイド処理が施された後のピークリストやタンパク質情報も、第３のローカルＤＢ５４およびプロテオミクスＤＢ１６の双方に格納される。或いは、第３のローカルＤＢ５４にのみ、情報を格納し、プロテオミクスＤＢ１６には、第３のローカルＤＢ５４中に格納された場所のポインタ情報を格納しておき、検索などの際に、プロテオミクスＤＢ１６を経由して、同位体標識法によって解析されたデータをバックトラッキングできるように構成しても良い。これら情報の、ローカルＤＢへの格納やプロテオミクスＤＢへの格納は、制御・解析部４４が実行しても良いし、或いは、統合解析システム２０が実行しても良い。 The peak list and protein information after the noise processing and centroid processing by the control / analysis unit 44 are also stored in both the third local DB 54 and the proteomics DB 16. Alternatively, information is stored only in the third local DB 54, and pointer information of the location stored in the third local DB 54 is stored in the proteomics DB 16, and the search is performed via the proteomics DB 16. Then, the data analyzed by the isotope labeling method may be configured to be backtracked. The storage of these pieces of information in the local DB or the proteomics DB may be executed by the control / analysis unit 44 or may be executed by the integrated analysis system 20.

二次元電気泳動を用いたディファレンシャルディスプレイ解析の場合も同様である。制御・解析部４６は、正常細胞と疾患細胞を染色あるいは標識試薬色により分けたそれぞれの二次元画像プロファイルをパターン合成した画像、スポットアドレス、スポット上のタンパク質情報を、元の画像データとともに、第４のローカルＤＢ５６に格納する。制御・解析部４６或いは統合解析システム２０は、第４のローカルＤＢ５６に格納されたデータをプロテオミクスＤＢ１６にも格納することで、ダブルバッファリングする。或いは、プロテオミクスＤＢ１６には、第４のローカルＤＢ５６に格納した場所のポインタ情報を格納する。 The same applies to differential display analysis using two-dimensional electrophoresis. The control / analysis unit 46 dyes normal cells and diseased cells or pattern-synthesizes each two-dimensional image profile divided by the labeling reagent color, the spot address, and protein information on the spot together with the original image data. 4 in the local DB 56. The control / analysis unit 46 or the integrated analysis system 20 performs double buffering by storing the data stored in the fourth local DB 56 in the proteomics DB 16. Alternatively, the proteomics DB 16 stores the pointer information of the location stored in the fourth local DB 56.

以下、タンパク質同定処理（図５）について、より詳細に説明する。図５に示すように、質量分析計（たとえば、第３の質量分析計３４）および制御・解析部（たとえば、制御・解析部４４）は、サンプルから得られたタンパク質をトリプシンなどで分解処理を施し、質量分析に供して、ＭＳスペクトルを得る。次いで、制御・解析部４４は、ＭＳスペクトルの生データ（ＲＡＷデータ）がサンプルから得られたときに、サンプルに対するＭＳテーブルを、ローカルＤＢ（たとえば、第３のローカルＤＢ５４）に格納する。また、制御・解析部４４は、オペレータの入力などにしたがってスレッシュホールドを設定し、ピークリスト（ＰｅａｋＬｉｓｔ）を生成する。生成されたピークリストファイルも、第３のローカルＤＢ５４に格納される。 Hereinafter, the protein identification process (FIG. 5) will be described in more detail. As shown in FIG. 5, the mass spectrometer (for example, the third mass spectrometer 34) and the control / analysis unit (for example, the control / analysis unit 44) decompose the protein obtained from the sample with trypsin or the like. And subjected to mass spectrometry to obtain an MS spectrum. Next, when the raw data (RAW data) of the MS spectrum is obtained from the sample, the control / analysis unit 44 stores the MS table for the sample in a local DB (for example, the third local DB 54). In addition, the control / analysis unit 44 sets a threshold in accordance with an operator's input or the like, and generates a peak list (Peak List). The generated peak list file is also stored in the third local DB 54.

次いで、制御・解析部４４は、オペレータの入力などにしたがって国際ＤＢ２４を選択する。制御・解析部４４は、ピークリストキャリブレーションが施されたデータに基づき、選択された国際ＤＢを用いて、ＲＭＦ／シーケンスタグなどを検索し、タンパク質を定量的に翻訳後修飾も含めて同定する。 Next, the control / analysis unit 44 selects the international DB 24 in accordance with an operator input or the like. The control / analysis unit 44 searches the RMF / sequence tag etc. using the selected international DB based on the data subjected to the peak list calibration, and quantitatively identifies the protein including the post-translational modification. .

制御・解析部４４は、同定したタンパク質情報とペプチドテーブルとを、ローカルＤＢ（第３のローカルＤＢ５４）に格納する。また、制御・解析部４４或いは統合解析システム２０は、第３のローカルＤＢ５４に格納した情報と、同様の情報を、プロテオミクスＤＢ１６に格納する。或いは、制御・解析部４４或いは統合解析システム２０は、プロテオミクスＤＢ１６に、第３のローカルＤＢ５４中、タンパク質同定結果を格納した場所を示すポインタ情報を格納しても良い。この、制御・解析部４４や統合解析システム２０が、プロテオミクスＤＢ１６のポインタ情報から、第３のローカルＤＢ５４中のＭＳテーブル、ピークリストファイル、国際ＤＢを用いたタンパク質検索結果などをバックトラッキングすることができる。 The control / analysis unit 44 stores the identified protein information and the peptide table in the local DB (third local DB 54). Further, the control / analysis unit 44 or the integrated analysis system 20 stores the same information as the information stored in the third local DB 54 in the proteomics DB 16. Alternatively, the control / analysis unit 44 or the integrated analysis system 20 may store pointer information indicating the location where the protein identification result is stored in the third local DB 54 in the proteomics DB 16. The control / analysis unit 44 and the integrated analysis system 20 may backtrack the MS search result in the third local DB 54, the peak list file, the protein search result using the international DB, and the like from the pointer information of the proteomics DB 16. it can.

以下、本実施の形態にかかるデータ保管管理について、補足的に説明する。前述したように、質量分析計から取得したＭＳスペクトルは、ノイズ処理やピーク検出のためのセントロイド処理が施された後、ピークリストファイルとして出力され、ローカルＤＢ中に格納される。また、ローカルＤＢには、大容量のＬＣ−ＭＳ／ＭＳの測定生データ（バイナリーデータファイル）も格納される。統合解析システムで生データの処理ができる場合には、当該生データのバイナリーデータファイルを、プロテオミクスＤＢに転送して、そこに格納する。また、最終結果を導出した実験データや解析内容を管理するために、生データのバイナリーデータファイルがどのホストにあるかを把握する。生データのバイナリーファイルから取り出すピークリストファイルや、ピークリストをもとにＭａｓｃｏｔデータベースのような公共データベースを使用して導き出したタンパク質情報等は、通常、生データのバイナリーデータを格納したホスト上に配備されるので、解析結果が、ローカルＤＢに格納されていたり、プロテオミクスＤＢに格納されたりして、最終結論を導きだした解析結果がどこにあるかの管理が容易ではない。そこで、本実施の形態においては、複数箇所に点在するデータの整合性を取るように、図２１に示すような、データ保管管理テーブルをもつ。データ保管管理テーブルには、たとえば、タンパク質を導き出すために用いたデータファイルの名称とそのファイルを格納している場所を特定できる情報（ホスト名、ＩＰアドレス、データベース名、ホルダ名）が格納される。 The data storage management according to this embodiment will be supplementarily described below. As described above, the MS spectrum acquired from the mass spectrometer is subjected to noise processing and centroid processing for peak detection, and then output as a peak list file and stored in the local DB. The local DB also stores large-capacity LC-MS / MS measurement raw data (binary data file). When raw data can be processed by the integrated analysis system, the binary data file of the raw data is transferred to the proteomics DB and stored therein. In addition, in order to manage the experimental data from which the final results are derived and the contents of analysis, it knows which host has the binary data file of raw data. The peak list file extracted from the raw data binary file and the protein information derived from the peak list using a public database such as the Mascot database are usually deployed on the host that stores the raw data binary data. Therefore, it is not easy to manage where the analysis result is stored in the local DB or stored in the proteomics DB and where the analysis result that led to the final conclusion is located. Therefore, in the present embodiment, a data storage management table as shown in FIG. 21 is provided so as to maintain the consistency of data scattered at a plurality of locations. The data storage management table stores, for example, information (host name, IP address, database name, holder name) that can specify the name of the data file used to derive the protein and the location where the file is stored. .

本発明は、上記統合データベース上で、同時進行で複数の定量的ディファレンシャルディスプレイやＤＮＡアレイ法を行うことを特徴とする方法であって、得られた各種データをプロテオミクスデータベースとして格納し、格納された各種データを基にタンパク質を同定及び定量し、同一のプラットフォームに乗せるための言語に変換して統合化し、タンパク質相互機能解析ソフトウェア及び、国際ネットワークデータベース等から特異的に出現したタンパク質の機能を予測し病態をシステマチックに解析することを特徴とするアルゴリズムを提供する。 The present invention is a method characterized by performing a plurality of quantitative differential displays and a DNA array method simultaneously on the integrated database, wherein the obtained various data are stored as a proteomics database and stored. Proteins are identified and quantified based on various data, converted to a language for integration on the same platform, integrated, and protein functions that appear specifically from protein interactive function analysis software and international network databases are predicted. An algorithm characterized by systematic analysis of pathological conditions is provided.

次に、ゲノミクスベースの処理装置群によるデータプロファイリングの一例を、図６を参照して説明する。マイクロダイセクション装置６０および制御・解析部７０が、正常細胞と疾患細胞を、ダイセクションを用いて得たｍＲＮＡ情報を、検体画像情報とともに、第５のローカルＤＢ８０に格納する。同様に、リアルタイムＰＣＲ測定装置６４および制御・解析部７４が、リアルタイムＰＣＲを用いた遺伝子発現解析を行い、ＰＣＲ画像情報およびｍＲＮＡ発現解析情報を、第７のローカルＤＢ８４に格納する。 Next, an example of data profiling by a genomics-based processor group will be described with reference to FIG. The microdissection device 60 and the control / analysis unit 70 store normal cells and diseased cells in the fifth local DB 80 together with the sample image information, with the mRNA information obtained using the dissection. Similarly, the real-time PCR measurement device 64 and the control / analysis unit 74 perform gene expression analysis using real-time PCR, and store PCR image information and mRNA expression analysis information in the seventh local DB 84.

また、ＤＮＡアレイ測定装置６２および制御・解析部７２が、ＤＮＡアレイを用いた遺伝子発現解析を実行する際に、制御・解析部７２は、オペレータの入力などにしたがって、国際ＤＢ２４を選択し、発現解析によるｍＲＮＡ検索を実行し、検索結果のＲＮＡ情報、タンパク質分子情報、ＤＮＡアレイ画像データを、第６のローカルＤＢ８２に格納する。 Further, when the DNA array measuring device 62 and the control / analysis unit 72 execute gene expression analysis using the DNA array, the control / analysis unit 72 selects the international DB 24 in accordance with an operator input or the like, An mRNA search by analysis is executed, and the RNA information, protein molecule information, and DNA array image data of the search result are stored in the sixth local DB 82.

プロテオミクスベースの処理装置群１２に含まれるローカルＤＢのデータ格納およびプロテオミクスＤＢ１６へのデータ格納と同様に、ゲノミクスベースの処理装置群１４により得られたタンパク質分子情報は、ローカルＤＢおよびプロテオミクスＤＢ１６の双方に格納されても良い。あるいは、プロテオミクスＤＢ１６には、ファイルの格納場所を示すポインタ情報のみを格納しても良い。 Similar to the storage of data in the local DB and the storage of data in the proteomics DB 16 included in the proteomics-based processor group 12, the protein molecule information obtained by the genomics-based processor group 14 is stored in both the local DB and the proteomics DB 16. It may be stored. Alternatively, only the pointer information indicating the file storage location may be stored in the proteomics DB 16.

また、本発明においては、病態組織の特異的なダイセクションを行い、ｍＲＮＡとタンパク質を同時に同じ可溶化組織から調製し、ｍＲＮＡについては、ＤＮＡアレイ、可溶化タンパク質については、二次元電気泳動（たとえば蛍光標識法２Ｄ−ＤＩＧＥ法など）および安定同位体標識（ｃＩＣＡＴ法やｉＴＲＡＱ法など）ＬＣ−ＭＳショットガン法（たとえば、ＭＡＬＤＩ−ＴＯＦ−ＴＯＦ、ＬＣ−ＥＳＩ／ＭＡＬＤＩ−ＱＱ−ＴＯＦ、トリプル四重極型ＬＣ／ＭＳ／ＭＳ、イオントラップ型その他ハイブリッド型ＬＣ／ＭＳ／ＭＳやＦＴ−ＭＡＳなど）を用いて、同時進行で並列的に異種解析の複数のディファレンシャルディスプレイを行うことができる（請求項４参照）。また、これらの解析の結果データに基づいて、プロテオミクスＤＢ１６やタンパク相互機能解析ソフトウェアにリンクさせて融合した解析アルゴリズムや、特定の組織器官に特異的に出現したタンパク質を自動分類するデータマイニングアルゴリズム（請求項６参照）により特異的なタンパク質分子の抽出、タンパク質機能解析および病態プロテオミクス解析が可能となる。 In the present invention, a specific dissection of pathological tissue is performed, and mRNA and protein are simultaneously prepared from the same solubilized tissue. For mRNA, DNA arrays and solubilized proteins are subjected to two-dimensional electrophoresis (for example, Fluorescent labeling method 2D-DIGE method, etc.) and stable isotope labeling (cICAT method, iTRAQ method, etc.) LC-MS shotgun method (for example, MALDI-TOF-TOF, LC-ESI / MALDI-QQ-TOF, triple quadruple) Using a polar LC / MS / MS, an ion trap type or other hybrid type LC / MS / MS, FT-MAS, etc., a plurality of differential displays for heterogeneous analysis can be performed simultaneously in parallel. 4). In addition, based on the results of these analyzes, analysis algorithms fused and linked to proteomics DB 16 and protein interaction analysis software, and data mining algorithms that automatically classify proteins that specifically appear in specific tissue organs (claims) (See Item 6), specific protein molecule extraction, protein function analysis and pathological proteomic analysis can be performed.

本実施の形態かかるデータマイニング処理の概略を説明する。従来、タンパク質レベルとｍＲＮＡレベルとの間では、検出された分子間でどのような相互関係にあるかを関連付けすることができなかった。しかしながら、図８に示すように、分子シグナルネットワークの枠組みに当てはめた場合、分子シグナルリンケージパスウエイが見えてくる。図８において、右側のタンパク質レベルの分子シグナルネットワークと、左側のｍＲＮＡレベルの分子シグナリングネットワークとを、重ね合わせてみると、中央の「タンパク質ａｎｄｍＲＮＡ」に示すように、これらを重ね合わせることで、類似性を見出すことができる。 An outline of the data mining process according to this embodiment will be described. Conventionally, it has not been possible to correlate the correlation between detected molecules between protein level and mRNA level. However, as shown in FIG. 8, when applied to the framework of the molecular signal network, the molecular signal linkage pathway becomes visible. In FIG. 8, when the molecular signal network at the protein level on the right side and the molecular signaling network at the mRNA level on the left side are overlapped, as shown in the “protein and mRNA” in the center, they are similar by overlapping. You can find sex.

図８に示す、タンパク質およびｍＲＮＡレベルで上昇する分子シグナルネットワークとは、病態組織・細胞サンプル中で、タンパク質、および、ｍＲＮＡの両方で上昇しているシグナルネットワークを抽出する手法である。たとえば、タンパク質とｍＲＮＡの発現が一致しなくとも、ｍＲＮＡが近未来的にタンパク質として発現すると仮定し、両者の活性化分子ネットワークを合流させることで、ある特定のシグナルキャスケードを抽出することができる。 The molecular signal network that rises at the protein and mRNA level shown in FIG. 8 is a technique for extracting a signal network that rises in both protein and mRNA in a diseased tissue / cell sample. For example, even if the expression of protein and mRNA does not match, it is assumed that mRNA will be expressed as a protein in the near future, and a specific signal cascade can be extracted by joining both activated molecular networks.

このように、分子シグナルネットワークを得ることで、図８において、「Ａ」で示す最上流の分子や最下流の分子の抽出、治療や創薬などのターゲットとなるネットワークの情報も得られることになる。 In this way, by obtaining a molecular signal network, in FIG. 8, it is possible to obtain information on a target network such as extraction of the most upstream molecule or the most downstream molecule indicated by “A”, treatment or drug discovery. Become.

その一方、第４のローカルＤＢ５６などに記憶された、２Ｄ−ＤＩＧＥなどの電気泳動像は、個々のタンパク質の特異的性質に位置づけられるが，これらは画像データであるため，他の数値データとの統合が難しいが，画像データと、プロテオミクスＤＢ１６、機能分類ＤＢ１８、ローカルＤＢの数値データとをリンクさせることによって、データを一元化することができる（請求項６参照）。たとえば、ローカルＤＢ５６に記憶された画像データ中、あるスポットの複数の座標（たとえば矩形を表す４つの座標）で画定される領域と、データベースの種別（たとえば、プロテオミクスＤＢ、機能分類ＤＢなど）や、当該データベース中の格納場所を示す情報とを、関連付けて記憶しておくことにより実現することができる。機能分類ＤＢ１８およびプロテオミクスＤＢ１６を使用することにより、図９に示すような疾患プロテオミクスデータの連携が可能となる。本例では、２Ｄ−ＤＩＧＥのプロファイル画像のスポットを選択すればスポットのタンパク質情報を国際データベースから検索して表示できる。上述したように、スポットの領域の座標と、国際データベースの情報との関連付けを示す情報を、ローカルＤＢ（たとえば、第４のローカルＤＢ５６）に記憶しておけば良い。 On the other hand, electrophoretic images such as 2D-DIGE stored in the fourth local DB 56 and the like are positioned in specific properties of individual proteins, but these are image data. Although integration is difficult, data can be unified by linking image data with numerical data of the proteomics DB 16, the function classification DB 18, and the local DB (see claim 6). For example, in the image data stored in the local DB 56, an area defined by a plurality of coordinates of a spot (for example, four coordinates representing a rectangle), a database type (for example, a proteomics DB, a function classification DB, etc.), This can be realized by storing the information indicating the storage location in the database in association with each other. By using the function classification DB 18 and the proteomics DB 16, it is possible to link disease proteomics data as shown in FIG. In this example, if a spot in the profile image of 2D-DAGE is selected, the protein information of the spot can be retrieved from the international database and displayed. As described above, information indicating the association between the coordinates of the spot area and the information in the international database may be stored in the local DB (for example, the fourth local DB 56).

また、プロテオミクス解析システムにて獲得した、測定ＭＳデータや測定ｎａｎｏＬＣデータを、データマイニングシステム２２がヴィジュアルに表示する。このことによってタンパク質同定の確認の労力が大幅に改善できる。 The data mining system 22 displays the measurement MS data and measurement nanoLC data acquired by the proteomics analysis system visually. This can greatly improve the confirmation effort of protein identification.

図９について、追加的に説明する。プロテオミクス解析システムの表示装置に表示された２次元電気泳動によるプロファイル画像（２Ｄ−ＤＩＧＥや翻訳後修飾などの特殊染色や、抗体反応によるスポットを含む）の、あるタンパク質スポットを指定してクリックすると、そのタンパク質の名前や、各種国際ＤＢ（ＮＣＢＩ、ｓｗｉｓｓｓｐｌｏｔ、ｇｅｎｅｂａｎｋなど）のアクセッション番号やこれらのＤＢに直接リンクさせることができる。これは、上述したように、スポットの領域と、アクセッション番号などとを関連付けて、プロテオミクスＤＢ１６などに記憶しておくことで実現できる。また、ゲルから抽出して得られたペプチドの質量分析によるシグナルの生データ（Ｍ／Ｚやインテンシティ、ＭＳ／ＭＳ情報、翻訳後修飾、分子量、等電点、アミノ酸配列など）や、安定同位体を用いた差異解析の生データ、同じ材料を用いて２次元電気泳動以外の方法（ＬＣショットガンなど）から得られた差異解析の結果などをリンクさせる。さらには、より詳しい翻訳後修飾や立体構造や分子機能に細胞内局在などの情報、病態に関連する情報や、その分子の阻害・活性化剤などと関連する情報、その分子の抗体を使った細胞内局在の画像データベースや、その分子の異常による疾患の病態病理画像（ＭＲＩや組織切片の形態学的画像等）にもリンクでき、解析の材料となった組織や細胞に関するアトラス的要素を含んでいる。この情報を元に、この２次元電気泳動像をＰＶＤＦ膜などに転写した組織・細胞Ｎａｔｕｒａｌプロテインチップなどの開発に繋げることができる。 FIG. 9 will be additionally described. When a certain protein spot in the profile image (including 2D-DIGE and post-translational modification special staining and antibody reaction spots) displayed on the display device of the proteomics analysis system is specified and clicked, The name of the protein, the accession number of various international DBs (NCBI, Swiss slot, gene bank, etc.) and these DBs can be linked directly. As described above, this can be realized by associating the spot area with the accession number or the like and storing them in the proteomics DB 16 or the like. In addition, raw signal data (M / Z, intensity, MS / MS information, post-translational modification, molecular weight, isoelectric point, amino acid sequence, etc.) by mass spectrometry of peptides obtained by extraction from gels, stable isotopes Link the raw data of the difference analysis using the body, the result of the difference analysis obtained from a method other than two-dimensional electrophoresis (LC shotgun, etc.) using the same material. In addition, use more detailed post-translational modifications, information on 3D structures and molecular functions such as intracellular localization, information related to pathological conditions, information related to inhibitors and activators of the molecule, and the antibody of the molecule. Intracellular localization image database and pathological pathology images of diseases caused by abnormalities in the molecule (MRI, morphological images of tissue sections, etc.) Is included. Based on this information, it is possible to develop a tissue / cell Natural protein chip or the like obtained by transferring this two-dimensional electrophoresis image to a PVDF membrane or the like.

たとえば、図１９は、組織・細胞Ｎａｔｕｒａｌプロテインチップの補足説明する図である。この「組織・細胞Ｎａｔｕｒａｌプロテインチップ」は、本発明者らにより独自に命名したものである。二次元電気泳動による脳組織サンプル組織・細胞Ｎａｔｕｒａｌ
プロテインチップを用いた自己抗原の解析例である。組織・細胞を２次元電気泳動によって分離し、ＰＣＤＦ膜などのプローブに転写することによって、約１００００個ほどのタンパク質スポットが転写されたＮａｔｕｒａｌ
プロテインチップが作成できる。これに、患者の血清を反応させ、自己抗体が反応するスポットを図９で示すデータベース上から同定してくることができる。 For example, FIG. 19 is a diagram for explaining a supplement of a tissue / cell Natural protein chip. The “tissue / cell Natural protein chip” is uniquely named by the present inventors. Brain tissue sample tissue and cell natural by two-dimensional electrophoresis
It is an example of analysis of self-antigen using a protein chip. Natural that about 10,000 protein spots were transcribed by separating tissues and cells by two-dimensional electrophoresis and transferring them to a probe such as a PCDF membrane
Protein chips can be created. The patient's serum can be reacted with this, and the spot where the autoantibody reacts can be identified from the database shown in FIG.

図２０は、Ｎａｔｕｒａｌプロテインチップによる患者血清中の自己抗体反応性プロテインスポットのプロファイリングを示す図である。図１９に示す手法によって、作成されたＮａｔｕｒａｌプロテインチップを用いて同定された分子の例を示す。橋本脳症、多発性硬化症、筋萎縮性側索硬化症の患者血清からＮａｔｕｒａｌプロテインチップを用いて自己抗体ターゲットタンパク質の同定とプロファイリングを行った例。これによって、その患者の病態のプロファイリングが可能となる。これを小型化することによって、簡便なＮａｔｕｒａｌプロテインチップとして臨床検査などに供することができる。 FIG. 20 is a diagram showing profiling of autoantibody-reactive protein spots in patient serum using a Natural protein chip. The example of the molecule | numerator identified using the method shown in FIG. 19 using the produced Natural protein chip | tip is shown. Example of identification and profiling of autoantibody target protein using Natural protein chip from sera of patients with Hashimoto encephalopathy, multiple sclerosis, and amyotrophic lateral sclerosis. This allows profiling of the patient's condition. By miniaturizing this, it can be used for clinical examinations as a simple Natural protein chip.

また、本発明において、統合データベースが、複数の解析データを一元化し，本来の意味の網羅的解析を可能にする，これによって、単一装置の解析で検出できなかった，あるいは解釈できなかった微小データがクローズアップされ，再度検証する必要性が生じる可能性があるが，これを可能にするため，統合データベースで解析したデータを個々の解析ソフトウェアに戻し、再度解析を行うため処理アルゴリズムを提供する（請求項７参照）。 Further, in the present invention, the integrated database unifies a plurality of analysis data and enables an exhaustive analysis of the original meaning, thereby making it possible to detect a minute amount that could not be detected or interpreted by the analysis of a single device. Data may be closed up and need to be verified again. To make this possible, the data analyzed in the integrated database is returned to the individual analysis software, and a processing algorithm is provided to perform the analysis again. (See claim 7).

また、ある特定の腫瘍関連遺伝子産物の細胞内機能を解析することを特徴とするプロテオミクス解析法にあっては、ターゲットとする遺伝子が欠失，変異している特定の細胞のＤＮＡアレイ法や安定同位体標識法を用いた蛋白プロファイルを作成し、正常細胞とディファレンシャルディスプレイを行うが、変化しているタンパク質の定量解析を可能にするため多元データの定量的ディファレンシャルディスプレイを一元化するアルゴリズムを提供する。以下に述べるように、異なる機器（たとえば質量分析計）で測定すると、それぞれのシグナル情報が異なるパラメータで出てくる。そこで、これらを統一の言語に変換する作業（すぐなる強度のインターナルコントロールを用いた絶対定量化、ペプチド分子量（Ｍ／Ｚ）の表記の統一化、同定分子のアクセッションの統一化などによって一元化を図る。 Also, in the proteomic analysis method characterized by analyzing the intracellular function of a specific tumor-related gene product, the DNA array method or stability of specific cells in which the target gene is deleted or mutated Produces a protein profile using the isotope labeling method and performs differential display with normal cells, but provides an algorithm that unifies the quantitative differential display of multi-factor data to enable quantitative analysis of changing proteins. As described below, when measured with different instruments (for example, a mass spectrometer), each signal information comes out with different parameters. Therefore, work to convert them into a unified language (unification by absolute quantification using immediate internal control, standardization of peptide molecular weight (M / Z), accession of identified molecules, etc.) Plan.

複数台の質量分析計（たとえば、図２に示す第１の質量分析計３０〜第３の質量分析計３４）から同定されるタンパク質は、装置の感度や生データ（ＲＡＷデータ）による条件設定パラメータ（スレッシュホールド設定や公共データベース設定など）によってそれぞれの質量分析結果から同定されるタンパク質は完全に一致はしないのが一般的である。 Proteins identified from a plurality of mass spectrometers (for example, the first mass spectrometer 30 to the third mass spectrometer 34 shown in FIG. 2) are condition setting parameters based on the sensitivity of the apparatus and raw data (RAW data). In general, proteins identified from each mass spectrometry result (threshold setting, public database setting, etc.) do not completely match.

図７は、本実施の形態にかかる第１の質量分析計３０〜第３の質量分析計３４のそれぞれから得られたタンパク質を分類したベン図である。質量分析計２つ以上に属するタンパク質Ｐ＋は陽性タンパク質として分類し、インテンシティとペアで機能分類ＤＢに保存される。 FIG. 7 is a Venn diagram in which proteins obtained from each of the first mass spectrometer 30 to the third mass spectrometer 34 according to the present embodiment are classified. The protein P + belonging to two or more mass spectrometers is classified as a positive protein and stored in the function classification DB in pairs with intensity.

図７において、たとえば、ｃＩＣＡＴ（同位体標識法１）と、ＭＡＬＤＩ−ＴＯＦ−ＴＯＦ（質量分析計１）との組み合わせで行った場合、ｃＩＣＡＴ（同位体標識法１）とＥＳＩ−ＱＱ−ＴＯＦ（質量分析計２）の組み合わせで行った場合、さらには、ｉＴＲＡＱ(同位体標識法２)とＥＳＩ−ＱＱＱ(質量分析計３)の組み合わせで行った場合を用いて、全く同じサンプルのディファレンシャルディスプレイ（たとえば、癌組織と正常の比較において癌組織で特異的に上昇する分子群）のデータを統合マイニングしている。このような場合に、それぞれの解析法で解析可能な分子種に得意不得意があるため、図７のようにすべての方法論で共通して同定される同一の分子、２つの組み合わせのみで共通して同定される分子、一つの方法論で特異的に同定される分子（これには同定の際の擬陽性も含まれる場合が多い）群が存在する。これら分子群において、最も信頼性がある分子群はすべての方法論で共通に同定された分子群であり、その次に信頼性があるのは２つ以上の方法論で共通して同定された分子群である。一つの方法論のみで同定されたものについては、もう一度質量分析の生データ（ＲＡＷデータ）に帰って、その信頼性を確認する必要があるが、明かな陽性所見も含まれる。つまり、これらの方法論を組み合わせてのすべてを統合することで、はじめてすべての分子群が網羅される。 In FIG. 7, for example, when cICAT (isotope labeling method 1) and MALDI-TOF-TOF (mass spectrometer 1) are combined, cICAT (isotope labeling method 1) and ESI-QQ-TOF ( When using a combination of mass spectrometers 2), and using a combination of iTRAQ (isotope labeling method 2) and ESI-QQQ (mass spectrometer 3), a differential display of exactly the same sample ( For example, integrated mining is performed on data of molecular groups that specifically increase in cancer tissue in comparison with cancer tissue. In such a case, since the molecular species that can be analyzed by the respective analysis methods are not good, the same molecule identified in common in all methodologies as shown in FIG. There are groups of molecules that are identified in this way, molecules that are specifically identified by a single methodology (this often includes false positives at the time of identification). Among these groups of molecules, the most reliable group of molecules is a group of molecules commonly identified in all methodologies, followed by the group of molecules identified in common by two or more methodologies. It is. For those identified with only one methodology, it is necessary to return to the mass spectrometry raw data (RAW data) to confirm its reliability, but it also includes clear positive findings. In other words, all the molecular groups are covered only by combining all of these methodologies.

質量分析計１台のみでしか同定できなかったタンパク質は、偽陽性タンパク質Ｐ＃も含まれているため、偽陽性タンパク質Ｐ＃のようなケースは、解析ソフトウェアの条件設定パラメータを調整して陽性かどうか再度検索する必要がある。その手順を、図１０の処理フローにて説明する。 Proteins that could only be identified with a single mass spectrometer include false positive protein P #. Therefore, is a case like false positive protein P # positive by adjusting the condition setting parameters of the analysis software? Please search again. The procedure will be described with reference to the processing flow of FIG.

図１０は、機能分類データベースにある同定タンパク質の分類に際して行ったデータ処理が正しいものか遡って確認するためのアルゴリズムを示している。 FIG. 10 shows an algorithm for confirming retrospectively whether the data processing performed when classifying the identified protein in the function classification database is correct.

図１０の例では、同定したタンパク質を、データマイニングを行いクラスター分類し、マイニング結果判定が判定基準以下のとき、本実施の形態にかかるリターンマッチングマイニングが行えるようにする。データマイニングにより判定基準が満足されるケースがあれば、機能分類ＤＢ１８にプロファイリングする。また、再検索にあたって生データのレベル（ＲＡＷレベル）のデータを獲得した場合は、ローカルＤＢやプロテオミクスＤＢ１６にも追加格納する。なお、リターンマッチングマイニングについては後述する。 In the example of FIG. 10, the identified proteins are subjected to data mining and cluster classification, and when the mining result determination is equal to or less than the criterion, return matching mining according to the present embodiment can be performed. If there is a case where the determination criteria are satisfied by data mining, profiling is performed in the function classification DB 18. Further, when the data of the raw data level (RAW level) is acquired in the re-search, it is additionally stored in the local DB and the proteomics DB 16. Return matching mining will be described later.

図１０の例について、より詳細に説明する。図１に示すデータマイニングシステム２２は、各質量分析計等から導出されたタンパク質情報から可能なかぎり意味のある結果を、マイニング手法をもって導出するが、意味のある結果が無いか少ない場合、結論を導き出した経過のフィルタパラメータ情報の調整を行い、再度データマイニングを行う。図１０（リターンマッチングマイニングアルゴリズム）は、この再度行う過程をフローによって示している。 The example of FIG. 10 will be described in more detail. The data mining system 22 shown in FIG. 1 derives a meaningful result as much as possible from protein information derived from each mass spectrometer or the like by a mining technique. The filter parameter information of the derived process is adjusted, and data mining is performed again. FIG. 10 (return matching mining algorithm) shows the process of performing this again by a flow.

フロー中のボックス１「同定したタンパク質の確認判定」は、再度データマイニングを行うようにデータマイニングシステム２２に対して、オペレータが入力装置を操作して指示を与えたときに入るアルゴリズムエントリー部である。 Box 1 “Confirmation determination of identified protein” in the flow is an algorithm entry part that is entered when the operator gives an instruction by operating the input device to the data mining system 22 to perform data mining again. .

フロー中のボックス２「同定した質量分析計の設定条件を変え、タンパク質を再検索」では、図２に示す質量分析計３０〜３４に接続された制御・解析部４０〜４４が、スレッシュホールド設定、公共データベース設定など入力パラメータの条件を、以前設定した内容から変更して、タンパク質を再検索する。スレッシュホールド値を変更することによって、ピークリストの内容がかわり、以前では発見できなかったタンパク質が同定できる。また、ある国際ＤＢ（たとえば、公共データベースＡ）には、ピークリストに関連する情報が無いために発見できなかったタンパク質が、他の国際ＤＢ（たとえば、公共データベースＢ）には登録されていてこれを使用することで解決することがある。 In box 2 “Change the setting conditions of the identified mass spectrometer and search for the protein again” in the flow, the control / analysis units 40 to 44 connected to the mass spectrometers 30 to 34 shown in FIG. Change the input parameter conditions such as public database settings from the previously set contents, and search for proteins again. By changing the threshold value, the contents of the peak list change, and proteins that could not be discovered before can be identified. In addition, a protein that could not be found because there is no information related to the peak list in one international DB (for example, public database A) is registered in another international DB (for example, public database B). May be solved by using.

フロー中のボックス３「タンパク質は他の質量分析計で同定されたものに最も多く合致するまでマイニング」するステップにおいては、図２に示す３つの質量分析計３０〜３４にて個々に解析して出力されたタンパク質のトータルがマキシマムになるように、設定パラメータチューニングを行い、タンパク質情報を導出する。また、このステップでは、偽陽性タンパク質の検出も可能である。 In the step of box 3 “Minning until the protein most closely matches those identified by other mass spectrometers” in the flow, the three mass spectrometers 30 to 34 shown in FIG. The parameter information is tuned to derive the protein information so that the total of the output protein becomes the maximum. In this step, false positive proteins can also be detected.

フロー中のボックス４「最適タンパク質同定ができればローカルＤＢにレコード」するステップにおいては、再検索したスレッシュホールド設定、公共データベース設定など入力パラメータの条件とともに、解析結果を、測定や解析を実行した質量分析計および制御・解析部に関連するローカルＤＢに格納する。 In step 4 “Record to local DB if optimal protein identification is possible” in the flow step, mass spectrometry that measured and analyzed the analysis results together with the input parameter conditions such as threshold settings and public database settings that were re-searched Store in the local DB related to the meter and the control / analysis unit.

フロー中のボックス５「プロテオミクスＤＢにおいてレコード先を追加する」ステップにおいては、ローカルＤＢに格納した解析結果と設定パラメータが、プロテオミクスＤＢにおいて参照できるように必要な処理が実行される。 In the box 5 “add record destination in proteomics DB” step in the flow, necessary processing is performed so that the analysis result and the setting parameter stored in the local DB can be referred to in the proteomics DB.

フロー中のボックス６「機能分類ＤＢに再同定した結果をプロファイルする」ステップにおいては、リターンマッチングマイニングした結果、意味のある機能解析結果が導出できた場合、機能分類ＤＢ１８にプロファイルしている。 In the box 6 “profiling the result of re-identification in the function classification DB” in the flow, if a meaningful function analysis result can be derived as a result of return matching mining, the result is profiled in the function classification DB 18.

機能分類ＤＢ１８には、初期検索時にプロファイルした各種データの他にリターンマッチングアルゴリズムを走らせたときに得たＲＡＷデータや解析データもプロファイリングされていく。 In the function classification DB 18, RAW data and analysis data obtained by running the return matching algorithm are profiled in addition to various data profiled at the time of the initial search.

リターンマッチングマイニングアルゴリズムについて、さらに、図２２、図２３を参照して補足的に説明する。同定したタンパク質結果が思うように出てこない場合に、結論を導き出すために使用した公共データベースの内容や、データフィルタリングに使用した閾値が最適値でなかったために生じたのではないかと判断するケースがある。図１０に示すリターンマッチングアルゴリズムは、図２２のレベル３まで戻って、タンパク質の再検索を行った場合の処理フローである。再検索で得たプロテオミクス結果は、図３４に示すような構造体のトラッキングテーブルに格納される（つまり、ＤＢに記憶される）。同一階層に複数個登録されていれば、登録日時の新しいものが優先されるが、過去の結果も削除しない限り使用できる。 The return matching mining algorithm will be further described with reference to FIGS. 22 and 23. FIG. When the identified protein results do not come out as expected, there are cases where the contents of the public database used to draw conclusions and the threshold used for data filtering were judged to have occurred because they were not optimal. is there. The return matching algorithm shown in FIG. 10 is a processing flow when the protein is searched again after returning to level 3 in FIG. The proteomic result obtained by the re-search is stored in the structure tracking table as shown in FIG. 34 (that is, stored in the DB). If a plurality of items are registered in the same hierarchy, the one with the new registration date is given priority, but it can be used as long as past results are not deleted.

バックトラッキングは、図２３のトラッキングテーブルに基づいて実現することができる。なお、図２２は、プロテオミクス結果を得るまでのデータの流れを示している。実験や解析を試行錯誤で行う場合に、どこまで遡って行う必要があるかという要求に対して対応できるようにデータを管理する。本実施の形態では、レベルを、たとえば、４段階に分類し、レベル別にデータを管理するトラッキングテーブルを作成する。 Backtracking can be realized based on the tracking table of FIG. FIG. 22 shows a data flow until a proteomic result is obtained. When experimentation and analysis are performed by trial and error, the data is managed so as to be able to respond to the request of how far back to perform. In this embodiment, for example, the levels are classified into four stages, and a tracking table for managing data for each level is created.

タンパク質発現情報を用いて分子間ネットワークによる解析する場合、タンパク質のインテンシティの扱いに注意を要する。 When analyzing protein expression information using an intermolecular network, attention should be paid to the handling of protein intensity.

質量分析結果は、３つのグループのタンパク質を同定するが共通領域（図１１のベン図における領域Ａ）に含まれるタンパク質が同じインテンシティを持てば問題ないが、図１２の下に示す表のように異なる場合、補正式を使用して何らかのノーマライズ処理が必要となる。 The results of mass spectrometry identify three groups of proteins, but there is no problem if the proteins contained in the common region (region A in the Venn diagram of FIG. 11) have the same intensity, but as shown in the table at the bottom of FIG. If they are different, some normalization processing is required using the correction formula.

図１１および図１２について、説明する。個々の質量分析計で解析した結果、リファレンスに対するインテンシティは、絶対量でないため比較のためのノーマライズ処理が必要である。図９は、第１の質量分析計１（Ｍｇ１）、第２の質量分析計（Ｍｇ２）、第３の質量分析計（Ｍｇ３）が同じ病態サンプルからタンパク質を同定した結果、共通のタンパク質「Ｐｒｏｔｅｉｎ１」、「Ｐｒｏｔｅｉｎ２」、「Ｐｒｏｔｅｉｎ３」を得たときのノーマライズ処理計算を示している。質量分析計１で求めた「Ｐｒｏｔｅｉｎ１」、「Ｐｒｏｔｅｉｎ２」、「Ｐｒｏｔｅｉｎ３」のインテンシティの和をとってインテンシティトータル（Ｍｇ１＿ＩＴ）とする。同様に他の２台のインテンシティトータル「Ｍｇ２＿ＩＴ」、「Ｍｇ３＿ＩＴ」を求め、その算術平均値をとった式が、図１１に示すノーマライズ用補正式である。この式で求まったノーマライズ補正係数ＮＩは、３つのタンパク質にて求めたので１タンパク質単位では、α＝ＮＩ／３がノーマライズ補正係数となる。この係数αで導出したタンパク質のインテンシティを割ることによって発見したタンパク質の全体が比較できる。 11 and 12 will be described. As a result of analysis by individual mass spectrometers, the intensity with respect to the reference is not an absolute amount, and therefore a normalizing process for comparison is necessary. FIG. 9 shows that the first mass spectrometer 1 (Mg1), the second mass spectrometer (Mg2), and the third mass spectrometer (Mg3) identify proteins from the same disease state sample. ”,“ Protein 2 ”, and“ Protein 3 ”, the normalization calculation is shown. The sum of the intensities of “Protein 1”, “Protein 2”, and “Protein 3” obtained by the mass spectrometer 1 is taken as the intensity total (Mg1_IT). Similarly, the normalization correction equation shown in FIG. 11 is obtained by calculating the other two intensity totals “Mg2_IT” and “Mg3_IT” and taking the arithmetic average value thereof. Since the normalization correction coefficient NI obtained by this equation is obtained for three proteins, α = NI / 3 is a normalization correction coefficient for one protein unit. By dividing the intensity of the protein derived by this coefficient α, the whole proteins found can be compared.

同定タンパク質のインテンシティはノーマライズ係数ＮＩで除算され統一される。そのタンパク質ＩＤとインテンシティをインテンシティインターバルでグルーピングして分子シグナルリンケージ解析ソフトウェア（たとえば、Ｋｅｙｍｏｌｎｅｔ）にインポートすれば、図８に示す分子シグナルリンケージのタンパク質レベルネットワークが構築される。図８において、○で示す分子は、ある病態にある組織細胞を「ｐｒｏｔｅｏｍｉｃｓ」タンパク質レベル、または、「ｔｒａｎｓｃｒｉｐｔｏｍｅｍＲＮＡ」レベルによるディファレンシャルディスプレイ解析にて特異的に上昇、あるいは、減少、あるいは、翻訳後を修飾を受けるものとして検出された分子群で、ネットワーク解析によって相互リンクするものを矢印で示している。 The intensity of the identified protein is divided by the normalization coefficient NI and unified. If the protein ID and intensity are grouped by intensity interval and imported into molecular signal linkage analysis software (for example, Keymolnet), the protein level network of molecular signal linkage shown in FIG. 8 is constructed. In FIG. 8, a molecule indicated by ○ indicates that tissue cells in a certain pathological state are specifically increased, decreased, or translated after differential display analysis at the “proteins” protein level or “transscriptome mRNA” level. A group of molecules detected as being subjected to modification, and those linked by network analysis are indicated by arrows.

また、ゲノミクス・トランスクリプトームレベルで獲得したｍＲＮＡもプロテオミクスの時と同様にインテンシティのノーマライズを行い、ｍＲＮＡレベルの分子シグナルリンケージ解析ソフトウェアにインポートすれば、図８に示すｍＲＮＡレベルネットワークが構築される。 In addition, mRNA obtained at the genomics / transcriptome level is normalized in the same manner as in proteomics, and imported into the molecular signal linkage analysis software at the mRNA level, the mRNA level network shown in FIG. 8 is constructed. .

また，本実施の形態にかかるプロテオミクス解析システムは，異種解析装置から出力されるデータを統合して一元化処理し、網羅的解析に供することができる。したがって、新たな解析装置が開発され、或いは、新たな解析手法が開発された場合でも、出力されたデータを受け入れ、これを統合データベースに組み込むことが可能となる。 In addition, the proteomic analysis system according to the present embodiment can unify the data output from the heterogeneous analysis apparatus and perform a centralized process, which can be used for comprehensive analysis. Therefore, even when a new analysis device is developed or a new analysis method is developed, the output data can be accepted and incorporated into the integrated database.

本発明においては、以下のような６つの段階を経ることで、本発明を用いて知得した情報を臨床データとして適用している。
（１）質量分析計により取得したタンパク質のデータや、二次元電気泳動など他の手法を用いて取得したタンパク質データを一元化して、データベースに格納すること。
（２）格納されたデータを可視化することで画像解析を容易とすること。
（３）上記（１）、（２）のデータ（プロテオノミクスデータ）と、ＤＮＡアレイのデータとをマージすること。
（４）上記データやその可視化した画像に基づいて、重要シグナル、分子群を抽出すること。
（５）上記（１）〜（４）に基づいてプロテインチップを作製し検証すること。
（６）検証結果に基づいて、臨床データとして適用すること。 In the present invention, information obtained using the present invention is applied as clinical data through the following six stages.
(1) The protein data acquired by the mass spectrometer and the protein data acquired by using other methods such as two-dimensional electrophoresis are unified and stored in the database.
(2) To facilitate image analysis by visualizing stored data.
(3) Merging the data of (1) and (2) (proteomic data) with the data of the DNA array.
(4) Extracting important signals and molecular groups based on the data and the visualized images.
(5) Producing and verifying a protein chip based on (1) to (4) above.
(6) Apply as clinical data based on the verification results.

上記項目（１）は、図４、５，７，１０，１１，１２，１３，１５，２１，２２および２３を参照して説明した事項に関連する。また、項目（２）〜（４）は、それぞれ、図４，８，１３，１４，２１，２２および２３を参照して説明した事項、図６，８，１３，１６，１７，２１，２２および２３を参照して説明した事項、並びに、図８，１３，１７，１８，２１，２２および２３を参照して説明した事項に関連する。また、項目（５）のプロテインチップについては、図４，９，１４，１８，１９，２１，２２および２３を参照して説明した事項に関連する。 The item (1) relates to the matter described with reference to FIGS. 4, 5, 7, 10, 11, 12, 13, 15, 21, 22, and 23. Items (2) to (4) are items described with reference to FIGS. 4, 8, 13, 14, 21, 22 and 23, respectively, and FIGS. 6, 8, 13, 16, 17, 21, 22. And related to the matters described with reference to FIGS. 8, 13, 17, 18, 21, 22, and 23. In addition, the protein chip of item (5) relates to the matter described with reference to FIGS. 4, 9, 14, 18, 19, 21, 22, and 23.

以下、項目（１）、（３）について補足的に説明する。 Hereinafter, the items (1) and (3) will be supplementarily described.

上述したように、同じタンパク質を質量分析器で分析した場合であっても、装置によってそのデータ値は異なるのが一般的である。図２４（ａ）、（ｂ）は、あるタンパク質を、異なる２つの質量分析器で測定した結果を示す図である。この質量分析器による測定結果のデータは、ＤＢ（たとえば、図２に示す第１のローカルＤＢ５０など）に格納されている。図２４（ａ）、（ｂ）の例では、比（Ｉｎｔｅｎｓｉｔｙ比）が２つあるものとしたが、これに限定されず、比は３つ以上存在する場合もある。 As described above, even when the same protein is analyzed by a mass spectrometer, the data value is generally different depending on the apparatus. FIGS. 24A and 24B are diagrams showing the results of measuring a certain protein with two different mass analyzers. Data of measurement results obtained by the mass analyzer is stored in a DB (for example, the first local DB 50 shown in FIG. 2). In the examples of FIGS. 24A and 24B, there are two ratios (Intensity ratios). However, the ratio is not limited to this, and there may be three or more ratios.

比は、通常、安定同位体修飾試薬（ＩＣＡＴ、ＩＴＲＡＱ）を用いて、複数の比較したいサンプルを混合して解析し、得られた同定ペプチドのＩｎｔｅｎｓｉｔｙ比をタンパク質レベルで統合して平均したものを、タンパク質の「ａｃｃｅｓｓｉｏｎＮｏ」で標記する。上記「ａｃｃｅｓｓｉｏｎＮｏ」は、「ｓｗｉｓｓｐｌｏｔ−国際タンパク質データベース」に基づくものである。図２５は、図２４（ａ）、（ｂ）に示すデータを、図２６に示すデータ統合処理により統合して得たデータの例を示す図であり、図２６は、データ統合処理の一例を示すフローチャートである。 The ratio is usually obtained by mixing and analyzing a plurality of samples to be compared using stable isotope modifying reagents (ICAT, ITRAQ), and integrating the intensity ratio of the obtained identification peptides at the protein level and averaging them. , Marked with “accessionNo” of the protein. The above “accession No” is based on the “swiss plot-international protein database”. 25 is a diagram showing an example of data obtained by integrating the data shown in FIGS. 24A and 24B by the data integration processing shown in FIG. 26. FIG. 26 shows an example of the data integration processing. It is a flowchart to show.

図２６に示すように、データ統合処理においては、統合解析システム２０が、ローカルＤＢに記憶された、図２４（ａ）、（ｂ）に示すようなデータを読み込む（ステップ２６０１）。以下、Ｍ１［ｉ］を、質量分析計２による測定結果を記憶したローカルＤＢ中の第ｉ行目のデータレコード、Ｍ２［ｉ］を、質量分析計２による測定結果を記憶したローカルＤＢ中の第ｉ行目のデータレコードとする。また、Ｍｐ［ｉ］．ａｃｃｅｓｓｉｏｎを、第ｉ行のデータレコードにおける「ＡｃｃｅｓｓｉｏｎＮｏ」、Ｍｐ［ｉ］．ｒａｔｉｏｎ１を、第ｉ行のデータレコードにおける「ｒａｔｉｏ１」、Ｍｐ［ｉ］．ｒａｔｉｏｎ２を、第ｉ行のデータレコードにおける「ｒａｔｉｏ２」とする。この例では、ｐ＝１または２である。 As shown in FIG. 26, in the data integration process, the integrated analysis system 20 reads data as shown in FIGS. 24A and 24B stored in the local DB (step 2601). Hereinafter, M1 [i] is a data record of the i-th row in the local DB storing the measurement result by the mass spectrometer 2, and M2 [i] is a data record in the local DB storing the measurement result by the mass spectrometer 2. The data record in the i-th row is assumed. Mp [i]. accession, “AccessionNo”, Mp [i]. relation1 is “ratio1”, Mp [i]. Let relation2 be “ratio2” in the data record in the i-th row. In this example, p = 1 or 2.

次いで、ループにおけるパラメータが初期化され、終了条件が設定される（ステップ２６０２〜ステップ２６０６）。パラメータｊがＭ２の配列数、つまり、質量分析計２による測定結果のレコード数を超えない限り（ステップ２６０７でノー(No)）、質量分析計１および質量分析計２に比の平均を求める。ここでは、１つのレコードには比として、「Ｒａｔｉｏ１」および「Ｒａｔｉｏ２」が存在するため、２つの比の平均
（Ｍ１［ｉ］．ｒａｔｉｏ１＋Ｍ２［ｊ］．ｒａｔｉｏ１）／２
（Ｍ１［ｉ］．ｒａｔｉｏ２＋Ｍ２［ｊ］．ｒａｔｉｏ２）／２
が算出される（ステップ２６０９）。 Next, parameters in the loop are initialized, and end conditions are set (steps 2602 to 2606). Unless the parameter j exceeds the number of M2 arrays, that is, the number of records of the measurement result by the mass spectrometer 2 (No in step 2607), the average of ratios is obtained for the mass spectrometer 1 and the mass spectrometer 2. Here, since “Ratio1” and “Ratio2” exist as a ratio in one record, the average of the two ratios (M1 [i] .ratio1 + M2 [j] .ratio1) / 2
(M1 [i] .ratio2 + M2 [j] .ratio2) / 2
Is calculated (step 2609).

この算出された平均Ｒ［ｋ］．ｒａｔｉｏ１およびＲ［ｋ］．ｒａｔｉｏ２には、アクセッションＲ［ｋ］．ａｃｃｅｓｓｉｏｎとして、Ｍ１［ｉ］．ａｃｃｅｓｓｉｏｎが与えられる（ステップ２６０８参照）。 This calculated average R [k]. ratio1 and R [k]. In the ratio 2, accession R [k]. As accession, M1 [i]. An accession is given (see step 2608).

このようにして得られたＲ［ｋ］．ａｃｃｅｓｓｉｏｎ、Ｒ［ｋ］．ｒａｔｉｏ１およびＲ［ｋ］．ｒａｔｉｏ２が、統合化されたレコードとして、データベース（たとえば、プロテオミクスＤＢ１６）に格納される。これにより、図２５に示すように、たとえば、（ＡｃｃｅｓｓｉｏｎＮｏ＝Ｐ１０８０９、Ｐ１１１４２、Ｐ６１２４７）について、「Ｉｎｔｅｎｓｉｔｙ比」が平均化されたようなデータを得ることができる。 R [k]. accession, R [k]. ratio1 and R [k]. ratio2 is stored in a database (for example, proteomics DB 16) as an integrated record. As a result, as shown in FIG. 25, for example, for (AccessionNo = P10809, P11142, P61247), data in which the “Intensity ratio” is averaged can be obtained.

ＤＮＡアレイとタンパク質についても、同様な統合化が可能である。ＤＮＡアレイにはそれぞれのアレイ特有の「ＡｃｃｅｓｓｉｏｎＮｏ」が付与されている。本実施の形態では、これをタンパク質の「ＡｃｃｅｓｓｉｏｎＮｏ」と統合化させる。タンパク質のデータと、ＤＮＡアレイのデータからの抽出およびノーマライズは以下の手法により実現される。 Similar integration is possible for DNA arrays and proteins. A DNA array is given an “Accession No” unique to each array. In the present embodiment, this is integrated with the “Accession No” of the protein. Extraction and normalization from protein data and DNA array data is realized by the following method.

図２７（ａ）は、図２４（ａ）、（ｂ）に示すタンパク質のデータに、図２６に示す統合化処理を施した結果を示す図であり、図２５と同様である。また、図２７（ｂ）は、ローカルＤＢ（たとえば、第６のローカルＤＢ８２）に記憶された、あるＤＮＡアレイのデータを示す図である。また、図２９は、図２７（ａ）、（ｂ）に示すようなデータを統合する処理の例を示すフローチャートである。図２８は、データ統合処理による処理結果を示す図である。 FIG. 27A is a diagram showing the result of the integration process shown in FIG. 26 performed on the protein data shown in FIGS. 24A and 24B, and is the same as FIG. FIG. 27B is a diagram showing data of a certain DNA array stored in the local DB (for example, the sixth local DB 82). FIG. 29 is a flowchart showing an example of processing for integrating data as shown in FIGS. 27 (a) and 27 (b). FIG. 28 is a diagram illustrating a processing result by the data integration processing.

図２９に示すように、統合解析システム２０は、タンパク質についてのデータレコードと、ＤＮＡアレイについてのデータレコードを、それぞれＤＢから読み出す（ステップ２９０１）。統合解析システム２０は、同一の「ＡｃｃｅｓｓｉｏｎＮｏ」のレコードを抽出して（ステップ２９０２）、その「Ｉｎｔｅｎｓｉｔｙ比」をノーマライズする。ノーマライズは、以下のように実現される。 As shown in FIG. 29, the integrated analysis system 20 reads a data record for a protein and a data record for a DNA array from the DB, respectively (step 2901). The integrated analysis system 20 extracts the same “AccessionNo” record (step 2902) and normalizes the “Intensity ratio”. Normalization is realized as follows.

活性化分子群：統合解析システム２０は、「Ｉｎｔｅｎｓｉｔｙ比」として得られた値を比較して（ステップ２９０３、２９０４）、タンパク質およびＤＮＡアレイの双方で「１」よりも大きい場合には、双方の値を平均化し（ステップ２９０５）、平均値を当該「ＡｃｃｅｓｓｉｏｎＮｏ」に関するレコードの統合化された比として、ＤＢ（たとえば、プロテオミクスＤＢ１６）に記憶する。比較の結果、タンパク質に関する値のみが「１」より大きい場合には、統合解析システム２０は、タンパク質の値を採用し、これを統合化された比としてＤＢに記憶する（ステップ２９０６）。その一方、ＤＮＡアレイの値のみが「１」より大きい場合には、統合解析システム２０は、ＤＮＡアレイの値を採用し、これを統合化された比としてＤＢに記憶する（ステップ２９０７）。 Activated molecule group: The integrated analysis system 20 compares the values obtained as “Intensity ratio” (steps 2903, 2904), and if both are larger than “1” in both the protein and DNA arrays, The values are averaged (step 2905), and the average value is stored in the DB (eg, proteomics DB 16) as an integrated ratio of records relating to the “AccessionNo”. If only the value related to the protein is larger than “1” as a result of the comparison, the integrated analysis system 20 adopts the value of the protein and stores it in the DB as an integrated ratio (step 2906). On the other hand, when only the value of the DNA array is larger than “1”, the integrated analysis system 20 adopts the value of the DNA array and stores it in the DB as an integrated ratio (step 2907).

不活性分子群：統合解析システム２０は、「Ｉｎｔｅｎｓｉｔｙ比」として得られた値を比較して（ステップ２９０８、２９０９）、タンパク質およびＤＮＡアレイの双方で「１」よりも小さい場合には、双方の値を平均化し（ステップ２９１０）、平均値を当該「ＡｃｃｅｓｓｉｏｎＮｏ」に関するレコードの統合化された比として、ＤＢに記憶する。比較の結果、タンパク質に関する値のみが「１」より小さい場合には、統合解析システム２０は、タンパク質の値を採用し、これを統合化された比としてＤＢに記憶する（ステップ２９１１）。その一方、ＤＮＡアレイの値のみが「１」より小さい場合には、統合解析システム２０は、ＤＮＡアレイの値を採用し、これを統合化された比としてＤＢに記憶する（ステップ２９１２）。 Inactive molecule group: The integrated analysis system 20 compares the values obtained as the “Intensity ratio” (steps 2908 and 2909), and when both of the protein and the DNA array are smaller than “1”, both The values are averaged (step 2910), and the average value is stored in the DB as an integrated ratio of records relating to the “AccessionNo”. As a result of the comparison, if only the value relating to the protein is smaller than “1”, the integrated analysis system 20 adopts the value of the protein and stores it in the DB as an integrated ratio (step 2911). On the other hand, when only the value of the DNA array is smaller than “1”, the integrated analysis system 20 adopts the value of the DNA array and stores it in the DB as an integrated ratio (step 2912).

このようにして、同一の「ＡｃｃｅｓｓｉｏｎＮｏ」が付されたタンパク質のデータおよびＤＮＡアレイのデータについて、「Ｉｎｔｅｎｓｉｔｙ比」を統合化して、これをＤＢに格納しておくことができる。図８や図１７では、ｍＲＮＡレベルの分子の発現変化とタンパク質レベルでの発現変化について、それぞれ、シグナルリンケージを解析してこれをマージしている。上述したノーマライズを適用することにより、タンパク質のデータおよびＤＮＡアレイのデータのマージを自動化することが可能となる。 In this way, for the protein data and DNA array data to which the same “AccessionNo” is attached, the “Intensity ratio” can be integrated and stored in the DB. In FIG. 8 and FIG. 17, the signal linkage is analyzed and merged with respect to the change in the expression of the molecule at the mRNA level and the change in the expression at the protein level. By applying the above normalization, it is possible to automate the merging of protein data and DNA array data.

図３０に示すように、統合解析システム２０（或いは、制御・解析部４６であっても良い。以下同様）が、第４のローカルＤＢ５６などに記憶された、２Ｄ−ＤＩＧＥによるデータに基づくスポットリストを生成し（ステップ３００１）、ＤＢ（たとえば、国際ＤＢ２４）に格納された既知のスポット情報とマッチングする（ステップ３００２）。既知のスポット情報として、たとえば、ＳｗｉｓｓＰｌｏｔによる標準的なデータを利用することができる。次いで、統合解析システム２０は、タンパク質のスポットが既知であるか否か、つまり、参照スポットとマッチしたか否かを判断する（ステップ３００３）。スポットは、２次元電気泳動で分離された多数の個別のタンパク質であり、既知のスポット情報には、当該スポットの泳動パターンから得た等電点、分子量およびＩｎｔｅｎｓｉｔｙ（タンパク質量）が含まれる。したがって、スポットがマッチしたということは、等電点と分子量によって決定される２次元電気泳動の移動度が一致したことを意味している。 As shown in FIG. 30, the integrated analysis system 20 (or may be the control / analysis unit 46. The same applies hereinafter) is a spot list based on 2D-DIGE data stored in the fourth local DB 56 or the like. Is generated (step 3001) and matched with known spot information stored in the DB (for example, the international DB 24) (step 3002). For example, standard data by SwissPlot can be used as the known spot information. Next, the integrated analysis system 20 determines whether or not the protein spot is known, that is, whether or not it matches the reference spot (step 3003). A spot is a large number of individual proteins separated by two-dimensional electrophoresis, and known spot information includes an isoelectric point, a molecular weight, and an intensity (protein amount) obtained from the migration pattern of the spot. Therefore, the fact that the spots are matched means that the mobility of two-dimensional electrophoresis determined by the isoelectric point and the molecular weight is matched.

ステップ３００３でイエス(Yes)と判断された場合、参照スポットのタンパク質の名前や登録番号がわかれば、ＳｗｉｓｓＰｌｏｔを利用して、その情報が取得される（ステップ３００４）。取得された情報は、たとえば、プロテオミクスＤＢ１６に格納することができる。 If it is determined as Yes in step 3003, if the protein name or registration number of the reference spot is known, the information is acquired using SwissPlot (step 3004). The acquired information can be stored in, for example, the proteomics DB 16.

その一方、タンパク質スポットが既知ではなかった場合（ステップ３００３でノー(No)）
には、質量分析計による分析を行い、そのデータを取得する（ステップ３００５〜ステップ３００７）。質量分析によって得られたデータのそれぞれは、タンパク質をトリプシンで分解して得た複数のペプチドの質量を測定した値であり、そのペプチドの質量の情報から、ペプチドのアミノ酸シークエンスが推測される。統合解析システム２０は、複数のペプチドのアミノ酸シークエンスを組み合わせて、国際ＤＢ２４を検索してタンパク質を同定する（ステップ３００８）。 On the other hand, when the protein spot is not known (No in step 3003)
First, an analysis by a mass spectrometer is performed, and the data is acquired (step 3005 to step 3007). Each of the data obtained by mass spectrometry is a value obtained by measuring the mass of a plurality of peptides obtained by degrading a protein with trypsin, and the amino acid sequence of the peptide is estimated from information on the mass of the peptide. The integrated analysis system 20 combines the amino acid sequences of a plurality of peptides and searches the international DB 24 to identify proteins (step 3008).

質量分析計の個性により、それぞれのペプチドのアミノ酸までの同定ができないものがある。そこで、複数の質量分析計で測定して、測定結果を統合し、タンパク質同定を有利に行うことに意義がある。この際に、ペプチドレベルでデータを一元化して最終データを得るのと、タンパク質レベルでデータを得る方法があるが、本実施の形態では、ペプチドレベルでデータを一元化することで、最終タンパク質の同定を有効に行っている。 Depending on the individuality of the mass spectrometer, there are some that cannot identify amino acids of each peptide. Therefore, it is meaningful to perform protein identification advantageously by measuring with a plurality of mass spectrometers, integrating the measurement results. At this time, there is a method of obtaining the final data by unifying the data at the peptide level and a method of obtaining the data at the protein level. In this embodiment, the identification of the final protein is performed by unifying the data at the peptide level. Is going to be effective.

図８や図１７では、ｍＲＮＡレベルの分子の発現変化とタンパク質レベルでの発現変化について、それぞれ、シグナルリンケージを求め、求められたシグナルリンケージを解析してこれをマージした例を示している。これについて、図３１を参照して補足する。まず、正常組織と病態組織のｍＲＮＡを取り出し、ＤＮＡアレイを用いて、ｍＲＮＡの発現レベルを取得する（ステップ３１０１）。このｍＲＮＡの発現レベルのデータ（たとえば、スポットのＩｎｔｅｎｓｉｔｙ）は、第６のローカルＤＢ８２に格納される。 FIG. 8 and FIG. 17 show examples in which the signal linkage is obtained for the expression change of the molecule at the mRNA level and the expression change at the protein level, and the obtained signal linkage is analyzed and merged. This will be supplemented with reference to FIG. First, mRNA of normal tissue and pathological tissue is taken out, and the expression level of mRNA is obtained using a DNA array (step 3101). The mRNA expression level data (for example, spot intensity) is stored in the sixth local DB 82.

ＤＮＡアレイから得られた元データは、固有のＡｃｃｅｓｓｉｏｎＮｏで分子名が表示されており、したがって、プロテオミクスＤＢ１６のデータとマージするためには、ＳｗｉｓｓＰｒｏｔなどの国際ＤＢのＡｃｃｓｅｓｓｉｏｎＮｏに変換する必要がある。統合解析システム２０は、ＤＮＡアレイから得られた個々の分子のｍＲＮＡの発現レベルと、それぞれの分子の固有のＡｃｃｅｓｓｉｏｎＮｏを、ＳｗｉｓｓＰｒｏｔのような国際ＤＢのＡｃｃｅｓｓｉｏｎＮｏに変更したものとを含むリストを生成する（ステップ３１０２）。 In the original data obtained from the DNA array, the molecular name is displayed with a unique AccessNo. Therefore, in order to merge with the data in the proteomic DB 16, it is necessary to convert it into an Accession No of an international DB such as SwissProt. The integrated analysis system 20 generates a list including the mRNA expression level of each molecule obtained from the DNA array and the unique Access No of each molecule changed to the Access No of an international DB such as SwissProt. (Step 3102).

また、統合解析システム２０は、ゲノム分子シグナルネットワークのタンパク質ノードに、発現したタンパク質レベルを入力する（ステップ３１０３）。 Further, the integrated analysis system 20 inputs the expressed protein level to the protein node of the genome molecule signal network (step 3103).

その一方、タンパク質についても、正常組織と病態組織の発現レベルをＭＳレベルから導出し（ステップ３１０４）、ＭＳやＭＳ／ＭＳを用いて、たとえば、Ｍａｓｃｏｔサーチにより、タンパク質を検索してその情報を取得する（ステップ３１０５）。取得されたタンパク質の情報は、たとえば、第１のローカルＤＢ５０などに格納され得る。統合解析システム２０は、ＤＢに格納されたタンパク質の情報を取り出して、プロテオミクス用シグナルネットワークのタンパク質ノードに、発現したタンパク質レベルを入力する（ステップ３１０６）。このようにして、ｍＲＮＡの発現レベルに基づくシグナルネットワーク、および、タンパク質の発現レベルに基づくシグナルネットワークを得ることができる。 On the other hand, for protein, the expression level of normal tissue and pathological tissue is derived from the MS level (step 3104), and the information is obtained by searching for the protein by, for example, Mascot search using MS or MS / MS. (Step 3105). The acquired protein information can be stored in the first local DB 50, for example. The integrated analysis system 20 extracts the protein information stored in the DB, and inputs the expressed protein level to the protein node of the signal network for proteomics (step 3106). In this way, a signal network based on the expression level of mRNA and a signal network based on the expression level of the protein can be obtained.

統合解析システム２０は、シグナルネットワークを重ね合わせる（ステップ３１０７）。ここで、統合解析システム２０は、図２９に示す処理を実行して、ｍＲＮＡレベルの分子の発現変化およびタンパク質レベルでの発現変化をマージすることができる。つまり、対応するＡｃｃｅｓｓｉｏｎＮｏのＩｎｔｅｎｓｉｔｙ比を参照して、ノーマライズすることでマージし、マージした新たな値（ノーマライズされたＩｎｔｅｎｓｉｔｙ比）をＤＢ（たとえばプロテオミクスＤＢ１６）に格納することができる。 The integrated analysis system 20 superimposes the signal networks (step 3107). Here, the integrated analysis system 20 can execute the processing shown in FIG. 29 to merge the expression change of the molecule at the mRNA level and the expression change at the protein level. That is, by referring to the intensity ratio of the corresponding AccessionNo, it is possible to merge by normalizing, and store the merged new value (normalized intensity ratio) in a DB (for example, proteomics DB 16).

次に、上述した項目（２）格納されたデータを可視化することで画像解析を容易とすること、および、（４）上記データやその可視化した画像に基づいて、重要シグナル、分子群を抽出することについて補足的に説明する。図３２（ａ）は、２Ｄ−ＤＩＧＥの画像から得られ、ＤＢ（たとえば、第4のローカルＤＢ５６）に格納されたタンパク質の情報の例を示す図である。図３２（ａ）の例では、Ｐｒｏｔｅｉｎ１−＊は、正常なＤＩＧＥから得たタンパク質（スポット）をそれぞれ表し、Ｐｒｏｔｅｉｎ２−＊は、病態のＤＩＧＥから得たタンパク質（スポット）をそれぞれ表している。また、図３２（ａ）において、「Ｍｒ」は分子量、「ｐｌ」は等電点、「Ｄ」は濃度である。データマイニングに先立って、統合解析システムは、ＤＢに格納されたタンパク質データを参照して、それぞれのタンパク質の間のユークリッド尺度を算出する。ユークリッド尺度Ｕは、たとえば、以下の式にしたがって求められる。 Next, the above item (2) facilitates image analysis by visualizing the stored data, and (4) extracts important signals and molecular groups based on the data and the visualized images. This will be explained supplementarily. FIG. 32A is a diagram illustrating an example of protein information obtained from a 2D-DIGE image and stored in a DB (for example, the fourth local DB 56). In the example of FIG. 32 (a), Protein1- * represents a protein (spot) obtained from normal DIGE, and Protein2- * represents a protein (spot) obtained from DIGE in a pathological state. In FIG. 32A, “Mr” is the molecular weight, “pl” is the isoelectric point, and “D” is the concentration. Prior to data mining, the integrated analysis system refers to protein data stored in the DB and calculates a Euclidean measure between the respective proteins. The Euclidean scale U is obtained according to the following formula, for example.

Ｕ＝｛（Ｍｒ_ｉ−Ｍｒ_ｊ）^２＋（ｐｌ_ｉ−ｐｌ_ｊ）^２＋（Ｄ_ｉ−Ｄ_ｊ）^２｝^１／２
図３２（ｂ）は、求められたユークリッド尺度を示す距離テーブルである。 _{_{^{_{U = {(Mr i -Mr j}}}} ) 2 + (pl i -pl j) 2 + (D i -D j) 2} 1/2
FIG. 32B is a distance table showing the obtained Euclidean scale.

データマイニングに際して、統合解析システム２０は、たとえば、以下の処理を実行して、特異なタンパク質を見出す。
（１）距離テーブルの横第１列に着目して、距離が最小となるペアを探す。図３２（ｂ）の例では、「Ｐｒｏｔｅｉｎ１−１」との距離が最小となる（最も距離が近い）プロテインが見出される。たとえば、「Ｐｒｏｔｅｉ１−１」と「Ｐｒｏｔｅｉｎ２−１」との間が最小値となれば、（Ｐｒｏｔｅｉ１−１，Ｐｒｏｔｅｉｎ２−１）というペアが作られ、ペアのユークリッド尺度（ここでは尺度の小さいほうを採用する）が決定される。
（２）次いで、距離テーブルの横第２列に着目して、距離が最小となるペアを探す。たとえば、「Ｐｒｏｔｅｉｎ１−２」と「Ｐｒｏｔｅｉｎ２−２」との間が最小値となれば、（Ｐｒｏｔｅｉｎ１−２，Ｐｒｏｔｅｉｎ２−２）というペアができ、そのペアのユークリッド尺度が決定される。
（３）このような処理を繰り返して、最後の行まで進むことにより、全てのペアおよび当該ペアの尺度が決定される。
（４）上述した処理により決定されたペアは、横列での比較であるため、他のタンパク質と最小のペアになっているとは限らない。そこで、たとえば、（１）における（Ｐｒｏｔｅｉｎ１−１，Ｐｒｏｔｅｉｎ２−１）について、その一方と、その他の「Ｐｒｏｔｅｉｎ＊−＊」との間のユークリッド尺度を比較して、その他の「Ｐｒｏｔｅｉｎ＊−＊」の中から、ユークリッド尺度が最小となる「Ｐｒｏｔｅｉｎｘ−ｘ」を検索する。 At the time of data mining, the integrated analysis system 20 executes, for example, the following processing to find a specific protein.
(1) Focusing on the first horizontal row of the distance table, a pair having a minimum distance is searched for. In the example of FIG. 32 (b), the protein with the shortest distance (the closest distance) to “Protein 1-1” is found. For example, if the value between “Protei1-1” and “Protein2-1” is the minimum value, a pair of (Protei1-1, Protein2-1) is created, and the pair of Euclidean measures (here, the smaller one) To be adopted).
(2) Next, paying attention to the second horizontal row of the distance table, a pair with the smallest distance is searched. For example, if a value between “Protein 1-2” and “Protein 2-2” becomes the minimum value, a pair of (Protein 1-2, Protein 2-2) is formed, and the Euclidean scale of the pair is determined.
(3) By repeating such processing and proceeding to the last row, all pairs and the scale of the pair are determined.
(4) Since the pair determined by the above-described processing is a comparison in a row, it is not always the minimum pair with other proteins. Therefore, for example, for (Protein1-1, Protein2-1) in (1), the Euclidean scale between one and the other “Protein *-*” is compared, and the other “Protein *-*” is compared. Search for “Proteinx-x” that minimizes the Euclidean scale.

検索された３番目の「Ｐｒｏｔｅｉｎｘ−ｘ」を含むペアよりも、（Ｐｒｏｔｅｉｎ１−１，Ｐｒｏｔｅｉｎ２−１）のペアのほうが、そのユークリッド尺度が小さければ、（Ｐｒｏｔｅｉｎ１−１，Ｐｒｏｔｅｉｎ２−１，Ｐｒｏｔｅｉｎｘ−ｘ）を新たにグループ化し、グループ数３とする。このグループ化により、「Ｐｒｏｔｅｉｎｘ−ｘ」と対を成していたタンパク質は、「Ｐｒｏｔｅｉｎｘ−ｘ」とのペアを解消され、孤立する。 If the Euclidean scale of the pair of (Protein1-1, Protein2-1) is smaller than the pair including the third searched “Proteinx-x”, (Protein1-1, Protein2-1, Proteinx-x) ) Is newly grouped, and the number of groups is three. By this grouping, the protein paired with “Proteinx-x” is released from the pair with “Proteinx-x” and is isolated.

このようにして、デンドログラム（樹状図）の生成に類した処理により、グループ数２或いは３となるグループ化を繰り返すことにより、どのペア或いはグループにも属さないタンパク質が見出される。 In this way, a protein that does not belong to any pair or group is found by repeating the grouping with the group number 2 or 3 by processing similar to the generation of a dendrogram (dendrogram).

図３３は、上記特異タンパク質の抽出を概略的に示した図である。たとえば、正常組織の２Ｄ−ＤＩＧＥ画像には、２つのタンパク質「Ｐｒｏｔｅｉｎ１−１」、「Ｐｒｏｔｅｉｎ１−２」のスポットが存在し、病態組織の２Ｄ−ＤＩＧＥ画像には、３つのタンパク質「Ｐｒｏｔｅｉｎ２−１」〜「Ｐｒｏｔｅｉｎ２−３」のスポットが存在すると考える。これらのスポットの情報は、ＤＢに格納されている。統合解析システム２０の処理によって、デンドログラムに示すように、（Ｐｒｏｔｅｉｎ１−１，Ｐｒｏｔｅｉｎ２−１）というグループ数２のペア、および、（Ｐｒｏｔｅｉｎ１−２，Ｐｒｏｔｅｉｎ２−２）というグループ数２のペアが生成される一方、「Ｐｒｏｔｅｉｎ２−３」がペアないしグループにも属さないと判断すると、統合解析システム２０は、「Ｐｒｏｔｅｉｎ２−３」が特異タンパク質であると判断し、その情報を、ＤＢ（たとえば、プロテオミクスＤＢ１６）に格納する。 FIG. 33 is a diagram schematically showing the extraction of the specific protein. For example, in the 2D-DIGE image of a normal tissue, there are two protein “Protein 1-1” and “Protein 1-2” spots, and in the 2D-DIGE image of a diseased tissue, three proteins “Protein 2-1”. ~ Consider that there is a spot of “Protein 2-3”. Information on these spots is stored in the DB. As shown in the dendrogram, the integrated analysis system 20 generates a pair of (Protein1-1, Protein2-1) and a group of 2 (Protein1-2, Protein2-2). On the other hand, if it is determined that “Protein2-3” does not belong to a pair or group, the integrated analysis system 20 determines that “Protein2-3” is a specific protein, and the information is stored in a DB (for example, proteomics). DB16).

さらに、統合解析システム２０は、ＤＢに格納されている既知のスポット情報を参照して、位置のスポットの位置および濃度と、特異タンパク質と判断されたタンパク質の位置および濃度とを比較して、一致すれば、その特異タンパク質が同定される。これにより、同定されたタンパク質の情報を、ＤＢに追加することができる。 Further, the integrated analysis system 20 refers to the known spot information stored in the DB, compares the position and concentration of the position spot with the position and concentration of the protein determined to be a specific protein, and matches them. Then, the specific protein is identified. Thereby, the information of the identified protein can be added to DB.

本発明の解析は、図１３に示すようなプロトコールに従って、４段階のステップで行っている。まず、病態サンプルとして脳腫瘍（glioma:AOG）の例を示すが、腫瘍組織を外科的手術によって切除したのち、病理学的検査を行い、AOGと認められるものに関して、1p,
19q LOHの有無（抗癌剤感受性・非感受性）による分類を行う。もし、正常な組織が腫瘍サンプルの近隣から切除されている場合は、これをコントロールとして分類する。これらのサンプルに関して、２種類のプロテオミクスによる解析、及び同一のサンプルを用いたDNAチップによる解析を行う。プロテオーム解析の方法論として、2次元電気泳動法による2D-DIGE法、電気泳動を用いないLC-MASSによるcICAT法およびiTRAQ法を同時進行で行って、蛋白質レベルで差異のあった分子群を質量分析解析によって同定する。又、同時に、DNAチップで得られたmRNAレベルのデータをそれぞれ比較し、これら得られたすべてのデータを統合し、分子解析ソフトを用いてネットワーク解析に供す。また、すべてのデータをデータベース化する。これらの統合データから分子クラスター解析、ネットワーク解析を含めた種々の機能解析を行い、最も重要である分子群を抽出して検証した後、臨床マーカー、治療、予防、創薬開発などへの応用へ繋げていく。本ストラテジーにおいて、抽出された分子群で特に特徴的な蛋白質群、およびリン酸化のような翻訳後修飾構造に対する抗体をカクテル化し、以下に示す独自に開発したnatural
protein chipを用いてプロファイリングをおこなった。 The analysis of the present invention is performed in four steps according to a protocol as shown in FIG. First, an example of a brain tumor (glioma: AOG) is shown as a pathological sample. After surgical removal of the tumor tissue, pathological examination is performed.
Classification based on the presence or absence of 19q LOH (anticancer drug sensitive / insensitive). If normal tissue is removed from the vicinity of the tumor sample, it is classified as a control. For these samples, analysis by two types of proteomics and analysis by a DNA chip using the same sample are performed. As a methodology for proteome analysis, 2D-DIGE method using two-dimensional electrophoresis, cICAT method using LC-MASS without electrophoresis, and iTRAQ method are performed simultaneously, and mass analysis of molecular groups that differ at the protein level is performed. Identify by analysis. At the same time, the mRNA level data obtained with the DNA chip is compared, and all the obtained data are integrated and used for network analysis using molecular analysis software. In addition, all data is made into a database. After performing various functional analyzes including molecular cluster analysis and network analysis from these integrated data, and extracting and verifying the most important molecular groups, application to clinical markers, treatment, prevention, drug development, etc. Connect. In this strategy, the naturally-developed natural proteins shown below were cocktailed with the proteins that are particularly characteristic of the extracted molecules and the antibodies against post-translational modification structures such as phosphorylation.
Profiling was performed using protein chip.

本発明によれば、生体組織又は細胞内で合成された全蛋白質を網羅した微小サイズのプロテインチップが提供される。本発明の上記プロテインチップは、生体組織又は細胞から抽出した蛋白質を蛍光色素で標識して得られた蛋白質試料をストリップ上で一次元目の等電点電気泳動に供し、次いで等電点電気泳動後のストリップを還元処理及びSDS化した後、二次元目のSDS-ポリアクリルアミド電気泳動に供することにより得られることを特徴とするものである。ここで言う微小サイズとは、１０cm×１０cm以下のサイズを有するものが好ましく、例えば５cm×５cm程度のものを挙げることができる。本発明のプロテインチップは、上記したような微小サイズのチップ上に、生体組織又は細胞内で合成された全蛋白質が網羅されていることを特徴とする。等電点電気泳動及びSDS-ポリアクリルアミド電気泳動の条件は、上記したように生体組織又は細胞内で合成された全蛋白質が微小なチップ上に網羅される限りは、特に限定されるものではないが、以下の実施例に記載されている条件又はそれに準ずる条件を当業者であれば適宜採用することができる。 ADVANTAGE OF THE INVENTION According to this invention, the protein chip of the micro size which covered all the protein synthesize | combined in the biological tissue or the cell is provided. In the protein chip of the present invention, a protein sample obtained by labeling a protein extracted from biological tissue or cells with a fluorescent dye is subjected to first-dimensional isoelectric focusing on a strip, and then isoelectric focusing It is characterized in that it is obtained by subjecting the subsequent strip to reduction treatment and SDS, and then subjecting it to a second-dimensional SDS-polyacrylamide electrophoresis. The minute size mentioned here is preferably a size having a size of 10 cm × 10 cm or less, for example, a size of about 5 cm × 5 cm. The protein chip of the present invention is characterized in that all the proteins synthesized in living tissue or cells are covered on the above-described micro-sized chip. The conditions of isoelectric focusing and SDS-polyacrylamide electrophoresis are not particularly limited as long as all proteins synthesized in living tissue or cells are covered on a microchip as described above. However, those skilled in the art can appropriately adopt the conditions described in the following examples or conditions equivalent thereto.

また、本発明のプロテインチップは、二次元電気泳動後のゲルをそのまま用いてもよいが、好ましくは、ゲル中の蛋白質試料をメンブレンに転写して固相化したものを使用することが好ましい。ここで用いるメンブレンの種類は特に限定されず、PVDFメンブレンなどを用いることができる。本発明では、特に低蛍光性、タンパク質低吸着性で、高分解能を有するものを用いることが好ましい。 In addition, the protein chip of the present invention may use the gel after two-dimensional electrophoresis as it is, but it is preferable to use a protein sample in which the protein sample in the gel is transferred to a membrane and immobilized. The type of membrane used here is not particularly limited, and a PVDF membrane or the like can be used. In the present invention, it is particularly preferable to use one having low fluorescence, low protein adsorption and high resolution.

また、本発明のプロテインチップにおいては、生体組織又は細胞から抽出した蛋白質は、CyDye (Cy 5等)等の蛍光色素で標識されている。従って、この蛍光色素に由来する蛍光を検出することによって、プロテインチップ上の蛋白質を、プロテインチップ上における蛋白質スポット情報として画像化することが可能である。 In the protein chip of the present invention, a protein extracted from a living tissue or cell is labeled with a fluorescent dye such as CyDye (Cy 5 or the like). Therefore, by detecting the fluorescence derived from the fluorescent dye, it is possible to image the protein on the protein chip as protein spot information on the protein chip.

さらに、本発明においては、本明細書中上記した本発明によるプロテオミクス解析方法によって得られた分子情報から病態の進行状況又は薬剤感受性のモニタリングに使用可能な分子群をリストアップして、これらの分子に対する抗体カクテルを用意することができる。次いで、本明細書中上記したプロテインチップに上記抗体カクテルを接触させ、プロテインチップ上の抗体反応陽性パターンをプロファイリングし、データベース化することができる。これにより、病態の進行状況又は薬剤感受性をモニタリングすることが可能になる。 Furthermore, in the present invention, a molecular group that can be used for monitoring the progress of a disease state or drug sensitivity is listed from the molecular information obtained by the proteomic analysis method according to the present invention described above in the present specification, and these molecules are listed. An antibody cocktail against can be prepared. Next, the antibody cocktail can be brought into contact with the protein chip described above in this specification, and the antibody reaction positive pattern on the protein chip can be profiled and databased. This makes it possible to monitor the progress of the disease state or drug sensitivity.

本発明の一例としては、抗GFAP抗体、抗CD44抗体、抗p53抗体、抗prothrombin抗体、抗α1-antitrypsin抗体、抗ubiqutine
activating wnzyme E1抗体、抗Erk抗体、抗p-Tyr抗体、及び抗zyxin抗体から選択される少なくとも２種類以上の抗体を含む抗体カクテルを用いて、脳腫瘍患者の生体組織又は細胞から抽出した蛋白質を固定化した微小サイズのプロテインチップに上記抗体カクテルを接触させて、プロテインチップ上の抗体反応陽性パターンをプロファイリングし、データベース化することができる。これにより、脳腫瘍の病態の進行状況又は薬剤感受性をモニタリングすることが可能である。 Examples of the present invention include anti-GFAP antibody, anti-CD44 antibody, anti-p53 antibody, anti-prothrombin antibody, anti-α1-antitrypsin antibody, anti-ubiqutine
Activating wnzyme E1 antibody, anti-Erk antibody, anti-p-Tyr antibody, and antibody cocktail containing at least two kinds of antibodies selected from anti-zyxin antibodies, and immobilizing proteins extracted from living tissue or cells of brain tumor patients The antibody cocktail can be brought into contact with a micronized protein chip and the antibody reaction positive pattern on the protein chip can be profiled and databased. Thereby, it is possible to monitor the progress of the disease state or drug sensitivity of the brain tumor.

本発明者らは、鋭意研究の結果、下記の手段により新規解析法の実証実験を行い、上記課題を解決し、目的を達成することを見出した。すなわち、本発明では、生体組織や細胞から抽出したタンパク質をそのまま微小二次元電気泳動（２Ｄ）に供与して非常に小さなプロテイン２Ｄマップを作成することを特徴とし、これを電気的にプローブに転写して固相化することによって組織細胞内で合成されたすべてのタンパク質を網羅した生体組織タンパクチップを提供する。 As a result of diligent research, the present inventors have conducted a verification experiment of a novel analysis method by the following means, and found that the above problems are solved and the object is achieved. That is, the present invention is characterized in that a protein extracted from a living tissue or cell is directly supplied to micro two-dimensional electrophoresis (2D) to create a very small protein 2D map, which is electrically transferred to a probe. Thus, a biological tissue protein chip that covers all proteins synthesized in tissue cells by solid-phase formation is provided.

また、生体組織タンパクチップを用いて、抗体や、生体タンパク質や合成ペプチドや薬剤などの結合する分子を簡便に探索することを可能とする。 In addition, using a biological tissue protein chip, it is possible to easily search for a molecule to which an antibody, a biological protein, a synthetic peptide, a drug, or the like binds.

さらに、生体組織タンパクチップを用いて、一連の結果を得た後に、もう一度生体内で再現性を確認することなしに、翻訳後修飾や、細胞内タンパク質の分解産物などの相互作用機能を再現することを可能とすることを特徴とする。 Furthermore, after obtaining a series of results using a biological tissue protein chip, it is possible to reproduce interaction functions such as post-translational modifications and intracellular protein degradation products without confirming reproducibility in vivo. It is possible to do that.

一般に、タンパクチップに要求される相互作用の再現性の精度が増加すればするほど、細胞内タンパク質の分解産物の相互作用の再現は重要であり、本発明は上述の事情に基づいてなされたもので、その目的は人工的に合成されたタンパクチップを完全に生体組織タンパクチップに置き換えるばかりでなく、プロテオミクスを利用した病態研究、即ちゲノム研究に対応したタンパク質レベルでの細胞組織の網羅的研究に優れた方法を提供することを特徴とする。 In general, the more accurate the reproducibility of interaction required for a protein chip, the more important it is to reproduce the interaction of degradation products of intracellular proteins, and the present invention has been made based on the above circumstances. The purpose is not only to replace artificially synthesized protein chips completely with biological tissue protein chips, but also to pathological research using proteomics, that is, comprehensive research of cellular tissues at the protein level corresponding to genomic research. It is characterized by providing an excellent method.

本実証実験では、自己免疫性疾患を伴う原因不明の脳炎患者血清中に含まれる自己抗体の抗原タンパク質を検索する目的で、正常ヒト脳組織可溶化タンパク質のプロテオームマップの作成と、同時にそのタンパク質の一部を二次元電気泳動後、ＰＶＤＦ膜に転写することを特徴とし、請求項１記載の、生体組織タンパクチップを作成し、患者血清による免疫ブロッティングによって、血清に含まれる自己抗体に反応性のターゲットタンパク質のスクリーニング、同定を行った。 In this demonstration experiment, we created a proteome map of a normal human brain tissue solubilized protein, and at the same time, to search for the antigen protein of an autoantibody contained in the serum of an encephalitis patient with unknown cause associated with an autoimmune disease. A part of the tissue is transferred to a PVDF membrane after two-dimensional electrophoresis, and the biological tissue protein chip according to claim 1 is prepared and reactive to autoantibodies contained in serum by immunoblotting with patient serum. The target protein was screened and identified.

本実証実験において、今日までに発見されている自己抗体ターゲットタンパク質以外に、少なくとも７個のターゲットが検出された。 In this demonstration experiment, at least 7 targets were detected in addition to the autoantibody target protein discovered to date.

具体的には、自己免疫性疾患を伴う原因不明の脳炎は、中枢神経の炎症性脱髄を特徴とする多発性硬化症（ｍｕｌｔｉｐｌｅｓｃｌｅｒｏｓｉｓ、ＭＳ）を代表として、種々の報告があり、その自己免疫性と病態発症メカニズムの関連性が注目されているのにも関わらず、その患者自己抗体と神経系組織細胞の抗原となりうるターゲットタンパク質の網羅的解析は一般的ではない。本実験では、正常ヒト大脳組織可溶化タンパク質のプロテオームマップの作成と並行して同時にそのタンパク質の一部を二次元電気泳動後ＰＶＤＦ膜に転写し、ヒト自己抗体を含む患者血清による免疫ブロッティングによって自己抗体に反応性のターゲットタンパク質のスクリーニング・同定を行った。 Specifically, unexplained encephalitis associated with autoimmune diseases has been reported in various ways, such as multiple sclerosis (MS), characterized by inflammatory demyelination of the central nervous system. Despite the focus on the relationship between immunity and the pathogenesis mechanism, exhaustive analysis of the target protein that can be an antigen of the patient's autoantibody and nervous system tissue cells is not common. In this experiment, in parallel with the creation of a proteome map of a normal human cerebral tissue solubilized protein, a part of the protein was simultaneously transferred to a PVDF membrane after two-dimensional electrophoresis, and self-immunized by immunoblotting with patient serum containing human autoantibodies. Screening and identification of target proteins reactive with antibodies were performed.

本方式では、今日までに、ＭＳ患者の自己抗体に関しては、ミエリン蛋白がそのターゲットとして発見されていたが、上述の実験において、少なくとも７個のターゲットが検出され、更に、筋輪萎縮性側索硬化症（ａｍｙｏｔｒｏｐｈｉｃｌａｔｅｒａｌｓｃｒｅｏｓｉｓ，ＡＬＳ）の患者血清においても、4つの自己抗体反応性のタンパク質が大脳白質及び灰白質組織に存在していることを発見した。 In this system, to date, myelin protein has been discovered as the target for autoantibodies in MS patients. However, in the above experiment, at least 7 targets were detected, and the muscle ring atrophic lateral cord was detected. It was also discovered that four autoantibody-reactive proteins were present in cerebral white matter and gray matter tissue in the sera of patients with sclerosis (amyotrophic lateral sclerosis, ALS).

更に、最近、抗神経抗体の存在を示唆されている橋本脳症の患者血清に於いても、３つのターゲットタンパク質を検出し、等電点の異なる３つの「ａｌｆａ−ｅｎｏｌａｓｅ」が抗原として同定された。橋本脳症、脳症を示さない橋本病、自己免疫疾患有無における脳炎患者、及び正常ヒト血清合計１００人の解析の結果、ステロイド治療に感受性の橋本脳症患者１００％にａｌｆａ−ｅｎｏｌａｓｅに対する自己抗体が検出され、早期ステロイド治療のマーカーとしての可能性が示唆された。 Furthermore, recently in the sera of patients with Hashimoto's encephalopathy, which has been suggested to have anti-neuroantibodies, three target proteins were detected, and three “alfa-enolases” with different isoelectric points were identified as antigens. . As a result of analysis of Hashimoto encephalopathy, Hashimoto's disease without encephalopathy, encephalitis patients with or without autoimmune disease, and 100 normal human sera, autoantibodies against afa-enolase were detected in 100% of Hashimoto encephalopathy patients sensitive to steroid treatment This suggests that it may be used as a marker for early steroid therapy.

また、翻訳後修飾を含む構造が抗原となっていることが明らかであることから、このスクリーニング法は従来の大腸菌によるｃＤＮＡライブラリー発現スクリーニング（ＳＥＬＥＸ）法に比較して、特に有効であることが判明した。 In addition, since it is clear that the structure including post-translational modification is an antigen, this screening method is particularly effective compared to the conventional cDNA library expression screening (SELEX) method using E. coli. found.

さらに、癌のプロテオーム解析実験に於いては、種々の、脳腫瘍の発症メカニズムの解明、マーカー
開発などを目的とし、特に「ａｎａｐｌａｓｔｉｃｏｒｉｇｏｄｅｎｄｒｏｇｌｉｏｍａ」に注目し脳腫瘍の解析を行った。 Furthermore, in the proteome analysis experiment of cancer, for the purpose of elucidating various onset mechanisms of brain tumors, developing markers, etc., brain tumors were analyzed with particular attention to “anaplastic origo dendro gloma”.

「ａｎａｐｌａｓｔｉｃｏｒｉｇｏｄｅｎｄｒｏｇｌｉｏｍａ」は、一般的に知られる所見として、第１染色体短腕、および、第１９染色体長腕に典型的な片アレルの欠失ＬＯＨを有しているタイプのものと、ＬＯＨを示さないタイプのものがあり、病理学的には全く同様の所見を示すが、ＬＯＨを有するタイプは抗ガン剤や放射線治療に感受性で非常に予後がよく、逆にＬＯＨを持たないタイプは、抗がん剤に非感受性で非常に予後が悪いという特徴を持つ。 The "anaplastic origo dendro glioma" is generally known to have a type having a short allelic deletion LOH typical of the short arm of chromosome 1 and the long arm of chromosome 19, LOH There is a type that does not show the same, and pathologically shows exactly the same findings, but the type with LOH is sensitive to anticancer drugs and radiotherapy and has a very good prognosis, and conversely, the type without LOH It is insensitive to anticancer drugs and has a very poor prognosis.

現在までのところ、分子レベルでこれらの違いを示すマーカータンパク質はもとより、予後の違いの原因となる分子群や薬剤耐性のメカニズムに関する情報は、全く報告されていない。そこで、これらの脳腫瘍サンプルを用いて以下のような解析を行った。 To date, there has been no information on the molecular groups that cause these prognostic differences and the mechanisms of drug resistance, as well as marker proteins that show these differences at the molecular level. Therefore, the following analysis was performed using these brain tumor samples.

脳腫瘍組織をＬＯＨの有無、抗癌剤感受性による分類を行い、これらのサンプルからタンパク質とｍＲＮＡとを同時に抽出して、２種類のプロテオミクスによる解析、および、ＤＮＡチップによるディファレンシャル解析を行った。差異のあったすべての分子群を同定後、蛋白およびｍＲＮＡ発現レベルのデータを比較検討し、分子ネットワーク解析ソフトウェアを用いてシグナルリンケージ解析に供した（図１３参照）。 Brain tumor tissues were classified according to the presence or absence of LOH and anticancer drug sensitivity, and proteins and mRNA were simultaneously extracted from these samples, and analyzed by two types of proteomics and differential analysis using a DNA chip. After identifying all the molecular groups with differences, protein and mRNA expression level data were compared and subjected to signal linkage analysis using molecular network analysis software (see FIG. 13).

以下、図１３について説明する。図１３は、病態サンプルから分子統合情報を得るためのディファレンシャルディスプレイのストラテジーを説明する図である。脳腫瘍組織を外科的手術によって切除したのち、病理学的観点からグレードや進行状況を解析、「ｇｅｎｅｔｉｃｍｕｔａｔｉｏｎ」、
欠損、欠失などの有無、抗癌剤感受性、非感受性による分類等を行う。これらのサンプルよりタンパク質とｍＲＮＡとを同時に抽出して、数種類のプロテオミクスによるディファレンシャル解析（蛍光試薬標識法、安定同位体標識法、２次元電気泳動法、ＬＣショットガン法等）、および、ＤＮＡアレイ測定やリアルタイムＰＣＲにしたがったｍＲＮＡのディファレンシャル
解析を行う。差異のあったすべての分子群を同定後、蛋白およびｍＲＮＡ発現レベルのデータを統合マイニングし、分子解析ソフトウェアを用いてシグナルネットワーク解析に供す。すべてのデータをデータベース化する。これらの統合データから分子クラスター解析、ネットワーク解析を含めた種々の機能解析を行い、最も重要である分子群を抽出して、臨床マーカー、治療、予防、創薬開発など、応用へ繋げていく。 Hereinafter, FIG. 13 will be described. FIG. 13 is a diagram for explaining a differential display strategy for obtaining molecular integration information from a disease state sample. After surgical removal of brain tumor tissue, the grade and progress are analyzed from a pathological point of view, “genetic mutation”
Classification is based on the presence or absence of defects, deletions, sensitivity to anticancer agents, and insensitivity. Simultaneous extraction of protein and mRNA from these samples, differential analysis by several types of proteomics (fluorescent reagent labeling method, stable isotope labeling method, two-dimensional electrophoresis method, LC shotgun method, etc.) and DNA array measurement And differential analysis of mRNA according to real-time PCR. After identifying all the molecular groups with differences, protein and mRNA expression level data are integrated and mined, and subjected to signal network analysis using molecular analysis software. Make all data into a database. From these integrated data, various functional analyzes including molecular cluster analysis and network analysis are performed, and the most important molecular groups are extracted and connected to clinical markers, treatment, prevention, drug development, and other applications.

２Ｄ−ＤＩＧＥによるプロテオームディファレンシャル解析においては、主に翻訳後修飾タンパクのプロファイリングが可能である。二つの可溶化タンパク質を蛍光試薬で標識し、混合した後、二次元電気泳動に供与し、解析を行い、合計、約３５００個のスポットのうち、非感受性サンプルでは約９００個の発現上昇蛋白スポットと、約８００の減少タンパク質スポットが認められた。 In proteome differential analysis by 2D-DIGE, profiling of post-translationally modified proteins is mainly possible. Two solubilized proteins are labeled with a fluorescent reagent, mixed, then provided to two-dimensional electrophoresis, analyzed, and a total of about 3,500 spots, about 900 expression-enhancing protein spots in the insensitive sample About 800 reduced protein spots.

さらに、リン酸化タンパク質染色液であるＰＲＯ−Ｑ（登録商標）ダイアモンドを用いて、同一のゲル上でリン酸化タンパク質を解析し、感受性サンプルに対しても同様に解析したところ、上昇した蛋白スポットは３６９スポット、減少したものは６２８個、又、リン酸化で特に非感受性特異的なものとして１２スポットが検出された（図１４参照）。 Furthermore, when the phosphorylated protein was analyzed on the same gel using PRO-Q (registered trademark) diamond, which is a phosphorylated protein staining solution, and the same analysis was performed on a sensitive sample, the increased protein spots were 369 spots, 628 that decreased, and 12 spots that were particularly insensitive to phosphorylation were detected (see FIG. 14).

図１４は、複数の病態サンプルから多重蛍光標識２次元電気泳動と翻訳後修飾部位に対する特殊蛍光染色法によるディファレンシャルディスプレイを示す図である。脳腫瘍サンプルの例を示す脳腫瘍組織（抗がん剤非感受性）（Ｃｙ５でラベル）と、同一患者の正常部（Ｃｙ２でラベル）との、２Ｄ−ＤＩＧＥによるディファレンシャルディスプレイを行い、さらに、リン酸化タンパク質を特異的に染色するＰｒｏＱＤｉａｍｏｎｄで染色した。まるで囲んだスポットはリン酸化が腫瘍で亢進あるいは脱リン酸化されているタンパク質。これら検出されたスポットの数を図中に示した。 FIG. 14 is a diagram showing a differential display by multiple fluorescence labeling two-dimensional electrophoresis from a plurality of disease state samples and a special fluorescence staining method for post-translational modification sites. A brain tumor tissue (anti-cancer drug insensitive) showing an example of a brain tumor sample (labeled with Cy5) and a normal part (labeled with Cy2) of the same patient are subjected to differential display by 2D-DIGE, and further phosphorylated protein Were stained with ProQ Diamond which specifically stains. The enclosed spots are proteins whose phosphorylation is enhanced or dephosphorylated in the tumor. The number of these detected spots is shown in the figure.

検出された分子群の同定に於いて、非感受性サンプルで特異的にリン酸化を受けるものとして、ＨＳＰ９０をはじめ４個のスポットが、又、脱リン酸化をうけるものとして、「Ｎｅｕｒｏｆｉｌａｍｅｎｔ−Ｌ」を代表として５個のスポットが同定された。 In the identification of the detected molecule group, four spots including HSP90 were subjected to specific phosphorylation in the insensitive sample, and “Neurofilament-L” was regarded as subject to dephosphorylation. As a representative, five spots were identified.

また、同時に２次元電気泳動を用いないプロテオミクス解析法である誘導体化法（ｃＩＣＡＴ（登録商標）法）による解析を行った。ｃＩＣＡＴ法は２つの比較したいサンプルそれぞれを、安定同位体を用いて分子量が９ダルトン異なる「ｌｉｇｈｔ」と「ｈｅａｖｙ」の２つの標識試薬によって、タンパク質のシステイン残基をラベルする。異なる標識試薬でラベルしたのち両者を混合してトリプシンにより断片化し、質量分析を用いて解析を行う。常に同じフラクションに分画される２つのペアの標識ペプチドは、「Ｈｅａｖｙ」と「ｌｉｇｈｔ」の分子量の違いで質量分析によって区別され、その差異が定量される。同時にＭＳ／ＭＳ解析により、ペプチドのシークエンスを行い、タンパク質の同定を行う。脳腫瘍サンプルは各１００ｕｇからスタートして１２フラクションに分画し、最終的には約７０００個のペプチドのＭＳ／ＭＳの解析を行った。 At the same time, analysis was performed by a derivatization method (cICAT (registered trademark) method), which is a proteomic analysis method that does not use two-dimensional electrophoresis. In the cICAT method, each of two samples to be compared is labeled with a cysteine residue of a protein by two labeling reagents of “light” and “heavy”, each having a molecular weight of 9 daltons, using stable isotopes. After labeling with different labeling reagents, both are mixed, fragmented with trypsin, and analyzed using mass spectrometry. Two pairs of labeled peptides that are always fractionated into the same fraction are distinguished by mass spectrometry due to the difference in molecular weight between “Heavy” and “light”, and the difference is quantified. At the same time, peptides are sequenced and proteins are identified by MS / MS analysis. Brain tumor samples were fractionated into 12 fractions starting from 100 ug each, and finally MS / MS analysis of about 7000 peptides was performed.

非感受性と正常サンプルの比較により、上昇したものとして、グリオーマで上昇すると報告をされているマーカーとしてのタンパク質群などが検出された。感受性サンプルとの差異を見たところ、「ｃｅｌｌｃｙｃｌｅ」、「ｐｒｏｔｅｏｌｙｓｉｓ」に関連するタンパク質群などが検出された（図１５参照）。 A comparison between the insensitive and normal samples detected a group of proteins as markers that were reported to be elevated in glioma. When the difference from the sensitivity sample was observed, protein groups related to “cell cycle” and “proteolysis” were detected (see FIG. 15).

図１５は、複数の病態サンプルの安定同位体標識試薬を用いたＬＣ−ショットガンによるディファレンシャルディスプレイ（脳腫瘍サンプル）の例を示す図である。抗がん剤非感受性脳腫瘍組織と、同じ病態を示すが抗がん感受性組織のサンプルとの比較。ｃＩＣＡＴやｉＴＲＡＱを用いたＬＣ−ショットガンによるディファレンシャルディスプレイによって、抗がん剤非感受性脳腫瘍組織において特異的に上昇した分子群、減少した分子群の数と分子名の一部を示す。 FIG. 15 is a diagram showing an example of a differential display (brain tumor sample) by an LC-shot gun using a stable isotope labeling reagent for a plurality of disease state samples. Comparison of anticancer drug-insensitive brain tumor tissue with a sample of the same pathology but anticancer-sensitive tissue. The differential display by LC-shotgun using cICAT or iTRAQ shows the number of molecular groups specifically increased in the anticancer drug-insensitive brain tumor tissue, the number of decreased molecular groups, and part of the molecular names.

次に、同一のサンプルを用いてＤＮＡチップによる解析を行った。「ＣｏｄｅＬｉｎｋ」によるヒトｃＤＮＡオリゴ約２００００種類のチップを用い、非感受性サンプルにおいて、正常との比較では上昇２８７個、減少２１７個が、又感受性との比較では上昇４１２個、減少２３８個が検出された。プロテオミクスの手法によって特異的に同定されたタンパク質群と、ｍＲＮＡの発現パターンとを比較した結果、両者の発現パターンは大きく異なり、例えば、非感受性と正常のサンプルの比較において、１０３個の上昇したタンパク質に対し、ｍＲＮＡでは２８７個、その中で共通する分子は４１個、又、減少した分子群においても、同様に同定タンパク質分子のうち、約３０％としかｍＲＮＡ分子とクロスしないことが判明した（図１６参照）。 Next, analysis using a DNA chip was performed using the same sample. About 20,000 kinds of human cDNA oligo chips by “CodeLink” were detected, and 287 increase and 217 decrease compared to normal, and 412 increase and 238 decrease compared to sensitivity were detected in insensitive samples. It was. As a result of comparing the protein group specifically identified by the proteomics method with the expression pattern of mRNA, the expression pattern of both was greatly different. For example, in the comparison of insensitive and normal samples, 103 elevated proteins On the other hand, 287 mRNAs, 41 molecules in common, and about 30% of the identified protein molecules were found to cross the mRNA molecules even in the reduced molecular group ( (See FIG. 16).

そこで、タンパク質およびｍＲＮＡレベルで上昇した分子群のすべてのデータを、既存の分子ネットワーク検索ソフトウェアであるＫｅｙＭｏｎｅｔ（登録商標）に入力し、同定分子群のリンケージ解析を行った（図１７の左上右上：タンパク質レベル：左上、ｍＲＮＡレベル：右上、すべてのネットワークの解析結果の一部を示す）。 Therefore, all the data of the molecular groups that increased at the protein and mRNA levels were input into the existing molecular network search software KeyMonet (registered trademark), and the linkage analysis of the identified molecular groups was performed (upper left and upper right in FIG. 17: Protein level: upper left, mRNA level: upper right, some of the analysis results for all networks are shown).

図１６は、抗がん剤非感受性脳腫瘍組織ｖｓ同一患者の正常組織（ＭＳｖｓＮ）、抗がん剤非感受性脳腫瘍組織ｖｓ同じ病態を示すが抗がん１３感受性組織(ＭＳｖｓＨＬ)のタンパク質及びｍＲＮＡレベルで同定された分子数を示す図である。 FIG. 16 shows anti-cancer agent-insensitive brain tumor tissue vs normal tissue (MS vs N) of the same patient, anti-cancer agent-insensitive brain tumor tissue vs. the same pathology but anti-cancer 13 sensitive tissue (MS vs HL). FIG. 6 is a diagram showing the number of molecules identified at the protein and mRNA level.

プロテオミクスの手法によって特異的に発現が上昇及び減少するものとして同定されたタンパク質群と、同一のサンプルから検出されたｍＲＮＡの発現パターンとを比較した。ｍＲＮＡの発現とタンパク質の発現のパターンは大きく異なっている。タンパク質で同定された分子のうちの約３０％としかｍＲＮＡの発現分子と共通にクロスしない。また、これらの分子は時として、タンパク質レベルでは上昇しているのにもかかわらず、ｍＲＮＡレベルでは逆に減少しているものも多く見受けられた。 A group of proteins identified as those whose expression was specifically increased and decreased by the proteomic technique was compared with the expression pattern of mRNA detected from the same sample. The pattern of mRNA expression and protein expression are very different. Only about 30% of the molecules identified in proteins cross in common with mRNA expression molecules. Many of these molecules were also decreased at the mRNA level although they were elevated at the protein level.

上記違いの理由として、細胞内におけるタンパク質とｍＲＮＡの発現と安定性の時間的ラグが反映されているものと考えられる。ｍＲＮＡの発現の上昇はタンパク質として充分に反映された時終了し、減少する。タンパク質がクリアランスされ不足するとｍＲＮＡの発現は誘導され上昇する。つまり、細胞内のある時間軸だけを切り取ってみると、あるタンパク質は上昇しているが、ｍＲＮＡは減少していて、又あるタンパク質は減少しているが、mRNAは上昇している、ということは常に起こっていておかしくない。病態サンプルは微量でサンプル内細胞の同調を行なうなど、この時間差の謎を解決するような、タイムコースの実験はできない。 The reason for the difference is considered to reflect the time lag between the expression and stability of proteins and mRNAs in cells. The increase in the expression of mRNA is terminated when it is sufficiently reflected as a protein, and decreases. When the protein is cleared and insufficient, mRNA expression is induced and increased. In other words, if you cut only a certain time axis in the cell, some proteins are rising, mRNA is decreasing, some proteins are decreasing, but mRNA is rising. Is not always going on. Time course experiments that solve the mystery of this time difference, such as synchronizing the cells in the sample in minute amounts, are not possible.

時間的キャップを埋めることができるのかは、以下の分子発現の上昇メカニズムで説明できる。すなわち、すべてのｍＲＮＡがタンパク質として発現すると仮定した場合、ある時間軸でｍＲＮＡが過剰に発現していれば、その時点でタンパク質が減少していても近未来的にタンパク質は上昇する。ならば、その時間的ラグを無視して上昇する分子群をすべて一つに統合マイニングし、その病態サンプルおける時間軸の幅をもたせた特異的活性化シグナルネットワークを抽出できるはずである。図８ではその単純な概念の模式図を示した。このようなシグナルネットワークの抽出操作をすることによって、タンパク質やｍＲＮＡそれぞれの単独の解析のみでは見いだすことの出来なかった活性化シグナルの最上流因子（Ａ）、最下流因子（Ｇ）を割り出すことが出来る。そこで、タンパク質およびｍＲＮＡレベルで上昇した分子群のすべてのデータを、既存の分子ネットワーク検索ソフトウェアであるＫｅｙＭｏｌｎｅｔ（登録商標）に入力し、同定分子群のリンケージ解析を行った。 Whether the time cap can be filled can be explained by the following mechanism of increased molecular expression. That is, assuming that all mRNAs are expressed as proteins, if mRNA is excessively expressed on a certain time axis, the protein will increase in the near future even if the protein decreases at that time. Then, it should be possible to extract a specific activation signal network with the width of the time axis in the pathological sample by integrating and mining all of the rising molecular groups ignoring the temporal lag. FIG. 8 shows a schematic diagram of the simple concept. By performing such a signal network extraction operation, it is possible to determine the most upstream factor (A) and the most downstream factor (G) of the activation signal that could not be found only by analyzing each of the protein and mRNA alone. I can do it. Therefore, all the data of the molecular groups that increased at the protein and mRNA levels were input to Key Molnet (registered trademark), which is an existing molecular network search software, and the linkage analysis of the identified molecular groups was performed.

図１７左上で示すタンパク質の上昇シグナルマップの骨格の中に、ｍＲＮＡの発現分子群を挿入すると、共通してｍＲＮＡ発現の上昇を示す分子群などは非常に少なく、逆に減少した分子が散見され、タンパク質とｍＲＮＡの発現レベルには大きな相違があることが判明した。 When an mRNA expression molecule group is inserted into the skeleton of the protein increase signal map shown in the upper left of FIG. 17, there are very few molecule groups that show an increase in mRNA expression in common, and conversely, decreased molecules are occasionally observed. The protein and mRNA expression levels were found to be significantly different.

そこで、主に、抗癌剤非感受性サンプル中でタンパク質、および、ｍＲＮＡの両方で上昇しているシグナルネットワークの抽出の方法論を図のように考えた。例えば、シグナルの上流でタンパク質の発現があがっていて下流のシグナルは下がっているか変化したものとして同定できていない。逆にｍＲＮＡの発現がシグナルの上流では既にさがっているが、その下流は上昇している。これらの発現が一致しなくとも、両者の活性化分子を合流させることで、ある特定のシグナルキャスケードを抽出することができ、又、共通の上流因子、下流因子を決定できる可能性が示唆される（図８参照）
タンパク質、ｍＲＮＡで特異的に上昇した分子のネットワーク検索結果の一例として、補体系、サイクリンとＣＤＫによる細胞周期、レチノイン酸核内レセプターのネットワークが抽出された（図１７）。 Therefore, the methodology for extracting a signal network that is elevated in both protein and mRNA in an anticancer agent insensitive sample was mainly considered as shown in the figure. For example, protein expression is up upstream of the signal and the downstream signal cannot be identified as down or altered. Conversely, mRNA expression has already sunk upstream of the signal, but is rising downstream. Even if these expressions do not match, it is suggested that a specific signal cascade can be extracted by merging the two activation molecules, and that common upstream and downstream factors can be determined. (See Figure 8)
As an example of network search results for molecules specifically elevated by protein and mRNA, the complement system, the cell cycle by cyclin and CDK, and the network of retinoic acid nuclear receptors were extracted (FIG. 17).

抗がん剤非感受性サンプル中で、リン酸化を含めたタンパク質およびｍＲＮＡのレベルで、正常や抗がん剤感受性に比較して特異的に上昇しているシグナルネットワークをリストアップすると、主に、血管新成・血管透過性・細胞周期・接着因子関連分子群の関与するキャスケードの上昇、及び特異的核内レセプター群、転写因子群とそれらの下流分子群の活性化がみとめられた（図１８）
また、このように、プロテオミクスによって翻訳後修飾、タンパク質レベルの変化をｍＲＮＡ発現レベルの変化と比較解析することによってはじめて、病態に関連する特異的なシグナルネットワーク及び治療ターゲットとなる可能性のある分子キャスケードを抽出できることが判明した(図８)。 In the anticancer drug insensitive sample, when listing signal networks that are specifically increased compared to normal and anticancer drug sensitivity at the level of protein and mRNA including phosphorylation, Vascular neoplasia, vascular permeability, cell cycle, adhesion factor-related increase in cascades, and activation of specific nuclear receptor groups, transcription factors and their downstream molecules were observed (FIG. 18). )
In addition, molecular cascades that may become specific signal networks and therapeutic targets related to pathological conditions can be obtained only by comparing post-translational modifications by proteomics and changes in protein levels with changes in mRNA expression levels. It was found that can be extracted (FIG. 8).

生体組織タンパクチップを用いた病態プロテオミクス解析法により、脳炎を伴う自己免疫疾患患者の血清から、病態に関連する特異的な臨床マーカー及び治療ターゲットとなる自己抗体反応性分子を検出することができることが判明した（図１９、図２０）。 It is possible to detect specific clinical markers related to pathological conditions and autoantibody reactive molecules as therapeutic targets from sera of patients with autoimmune diseases with encephalitis by pathological proteomic analysis using biological tissue protein chips It became clear (FIG. 19, FIG. 20).

図１７は、同一組織細胞可溶化サンプルを用いて各種のディファレンシャルディスプレイ解析を行って抽出してきた分子群の、タンパク質レベル、および、ｍＲＮＡの両レベルで同様に上昇しているシグナルネットワークの一例を示す図である。 FIG. 17 shows an example of a signal network in which molecular groups extracted by performing various differential display analyzes using the same tissue cell solubilized sample are similarly increased at both the protein level and the mRNA level. FIG.

図８および図１６を参照して説明したような仮説にもとづき、データを統合マイニングし、主に、非感受性サンプル中でタンパク質レベル、および、ｍＲＮＡの両レベルで同様に上昇しているシグナルネットワークの一例を示す。レチノン酸レセプター（ＲＡＲ）を介した発現制御に関しての解析例を図に示す。ｂカテニン、ＭＡＰＫ、ＲＡＲ、ｔＴＧ, インテグリン（Ｉｎｔ）、コラーゲン、ラミニンなどがタンパク質レベルで上昇するものとして、又ｍＲＮＡで発現が亢進するものとして、ＥｒｂＢ２、e−カドヘリン、ＤＨＴＲ、ＭＡＰＫ、ｔＴＧ、Ｉｎｔ、さらにＲＡＲの下流因子ＬＦ、ＰＥＰＣＫ、ｐ２１ＣＩｐ１、ＨＯＸＤ、ＣＲＢＰ１なども有意に上昇しているのがわかる。タンパク質、ｍＲＮＡ両者で上昇しているものコアにして前後のキャスケードを見てみると、図中の太字の矢印で示すような活性化シグナルが、抽出できる。これらの所見は、これまでの実験的事実にもとづき、データベース化されたシグナルネットワークを用いて、新たな病態関連シグナルキャスケードを見いだす一つの方法論として新規のものである
図１８は、脳腫瘍患者組織の抗癌剤非感受性サンプル中で、特異的にタンパク質、および、ｍＲＮＡの両レベルで同様に上昇しているシグナルネットワークを示す図である。 Based on the hypothesis as described with reference to FIGS. 8 and 16, the data are integrated and mined, mainly for signal networks that are similarly elevated at both protein and mRNA levels in insensitive samples. An example is shown. An analysis example relating to expression control through retinoic acid receptor (RAR) is shown in the figure. bCatenin, MAPK, RAR, tTG, integrin (Int), collagen, laminin and the like are increased at the protein level, and mRNA is upregulated as ErbB2, e-cadherin, DHTR, MAPK, tTG, Int Furthermore, it can be seen that the downstream factors LF, PEPCK, p21CIp1, HOXD, CRBP1 and the like of RAR are also significantly increased. When both the protein and mRNA are elevated and the cascades before and after the core are viewed, an activation signal as indicated by bold arrows in the figure can be extracted. These findings are novel as one methodology for finding a new disease state-related signal cascade using a databased signal network based on experimental facts so far. FIG. 18 shows an anticancer drug for a brain tumor patient tissue. FIG. 5 shows a signal network that is similarly elevated at both protein and mRNA levels in an insensitive sample.

図８および図１６を参照して説明したような仮説に基づき、データを統合マイニングし、主に、非感受性サンプル中でタンパク質レベル、および、ｍＲＮＡの両レベルで同様に上昇しているシグナルネットワークをリストアップした。特に薬剤感受性に関連して特徴的なものとして、アポトーシス関連キャスケード（ＡＫＴ、Ｂａｄシグナルなど）の亢進、血管過剰新成・血管透過性を特徴づける補体凝固系因子・血管新成因子系、サイクリンを中心とする細胞周期Ｇ１／Ｓ、Ｇ２／Ｍ関連分子群によるシグナルの亢進、インテグリン・カドヘリンを介した接着因子群、分解系の亢進、又、レチノイン酸レセプター・アンドロジエンレセプター（ＤＨＴＲ）関連シグナルを代表とする核レセプターなど転写因子系を介したシグナル亢進が認められた。また、このように、プロテオミクスによって翻訳後修飾、タンパク質レベルの変化をｍＲＮＡ発現レベルの変化と比較解析することによってはじめて、病態に関連する特異的なシグナルネットワーク及び治療ターゲットとなる分子キャスケードを抽出できることがわかる。 Based on the hypotheses as described with reference to FIGS. 8 and 16, the data were integrated and mined, mainly to show signal networks that are similarly elevated at both protein and mRNA levels in insensitive samples. Listed up. In particular, in relation to drug sensitivity, enhancement of apoptosis-related cascades (AKT, Bad signal, etc.), vascular hyperplasia and vascular permeability, complement coagulation factor / vascular neoplastic factor system, cyclin Of signal cycle by G1 / S and G2 / M related molecule groups centering on the cell, adhesion factor group via integrin / cadherin, enhancement of degradation system, and retinoic acid receptor and androgen receptor (DHTR) related signal Signal enhancement via a transcription factor system such as a nuclear receptor such as was observed. In addition, specific signal networks related to pathological conditions and molecular cascades that are therapeutic targets can be extracted only by proteomics and post-translational modification, and by analyzing changes in protein levels with changes in mRNA expression levels. Recognize.

本実証実験では、特異的な分子の抽出及び、タンパク質機能解析及び病態プロテオミクス解析において、請求項３に記載された、同時進行で三方式のディファレンシャルディスプレイを行うこと、ならびに、請求項６に記載された、検索統合データマイニングアルゴリズムにより、プロテオミクスデータベースとタンパク相互機能解析ソフトウェアにリンクさせて融合し、全ての細胞内発現タンパク質を、網羅的に時間軸を追って解析し、如何にトータルとしての機能制御が行われて生物活動が成り立っていくのかを解明し、そして如何なる細胞内タンパク質群のどの様な変化が、何時、どこで、どの様にして正常の細胞活動を破綻させ、がんなどの複雑な疾患をｐｒｏｇｒｅｓｓさせて行くのかを各組織・細胞で網羅的かつ系統的に整理し、ゲノム解析の情報と併せて合理的な治療法や癌マーカーや予防法の開発に役立つ網羅的な情報を得ること、即ちこれらを全て含むプロテオーム解析を可能とする方法を見出した。
実施例１：
(方法)
手術により脳腫瘍患者の脳腫瘍部位を脳腫瘍組織サンプル，腫瘍近傍の正常部位を正常組織サンプル(数ミリメートル角)として、術中に液体窒素で凍結し、その後-80oCに保存したものを用いた．これらの組織を病理学的所見にて腫瘍のグレードを確認した後、一番染色体（１P）、１９番染色体（19q）のLOHの有無、抗癌剤感受性と抗癌剤耐性のものとに分類した．
それらの組織サンプルをLysis
buffer (9.8M尿素，4%CHAPS、0.5 ｍM EDTA、1mM DTT, 1mM PMSF, 1mM PEPSTATIN , 1mM
leupeptin, 10 mM NaF, 2 mM Na₃VO₄，1.5mM Okadaic acid )を用いてホモジネートして溶解させた。その溶解液を遠心分離(15,000
rpm，10min，4℃) し、不溶物を除去したものを腫瘍あるいは正常脳ホモジネートサンプルとした。 In this demonstration experiment, in the extraction of specific molecules, the protein function analysis and the pathological proteomics analysis, the simultaneous differential display of three modes described in claim 3 is performed, and the claim 6 is described. In addition, the integrated search data mining algorithm is linked to the proteomics database and protein interaction analysis software, and all the proteins expressed in the cell are comprehensively analyzed along the time axis, and how the total function is controlled. Elucidate what biological activity is carried out, and what changes in what intracellular protein group, when, where, and how normal cellular activity is disrupted, complicated diseases such as cancer Is organized in a systematic and systematic manner in each tissue / cell. That together with information of genomic analysis to obtain an exhaustive information to help in the development of rational therapies, cancer markers and prophylaxis, i.e. found a method that enables proteome analysis including all these.
Example 1:
(Method)
The brain tumor site of the brain tumor patient was treated as a brain tumor tissue sample, and the normal site near the tumor was treated as a normal tissue sample (several millimeters square), which was frozen in liquid nitrogen during the operation and then stored at -80oC. After confirming the tumor grade by pathological findings, these tissues were classified into the presence or absence of LOH on the first chromosome (1P) and chromosome 19 (19q), anticancer drug sensitivity and anticancer drug resistance.
Lysis those tissue samples
buffer (9.8M urea, 4% CHAPS, 0.5 mM EDTA, 1 mM DTT, 1 mM PMSF, 1 mM PEPSTATIN, 1 mM
leupeptin, 10 mM NaF, 2 mM Na ₃ VO ₄ , 1.5 mM Okadaic acid) and homogenated. The lysate was centrifuged (15,000
rpm, 10 min, 4 ° C.), and the insoluble matter removed was used as a tumor or normal brain homogenate sample.

そのサンプル約100 μgを2D
clean up kit (GE Healthcare Bio-Sciences Corp.)にあるいはアセトン沈殿法にて脱塩したのち，CyDye (Cy 5，GE
Healthcare Bio-Sciences Corp.)にてホモジネートサンプル中の蛋白質にミニマムラベルした。Cy5ラベルした蛋白質約10μg分を膨潤バッファーにて希釈して125
μLとし、その全量をストリップホルダーに添加したのち、DryStrip (pH 3-11NL, 7 cm.Invitrogen)をその溶液上にのせ、一次元目等電点電気泳動(1〜３時間)を行った。等電点電気泳動終了後、DryStripを0.1
M DTT含有SDS平衡化buffer (50 mM Tris-HCl, 6M Urea, 30% glycerol, 2% SDS, 0.02%
Bromophenol Blue, pH 8.8 )に10分間浸して還元処理及びSDS化を行った。さらに、そのDrystripの両端をカッターにて5cm長に切断し、SDS-ポリアクリルアミド電気泳動(Phast-gel
system, Pharmacia,および、 5cmx5cmPAGE産業総合研究所開発微小高分解能ゲル、20ｍA/ゲル，約1時間通電)にて二次元目電気泳動を行った。これをNatural
proteinゲルチップとした。 About 100 μg of the sample is 2D
After desalting in a clean up kit (GE Healthcare Bio-Sciences Corp.) or by acetone precipitation, CyDye (Cy 5, GE
Healthcare Bio-Sciences Corp.) labeled the protein in the homogenate sample at a minimum. Dilute about 10 μg of Cy5-labeled protein with swelling buffer to 125
Add μL to the strip holder, add DryStrip (pH 3-11NL, 7 cm. Invitrogen) onto the solution, and perform first-dimension isoelectric focusing (1 〜 3 hours) It was. After isoelectric focusing, DryStrip 0.1
M DTT-containing SDS equilibration buffer (50 mM Tris-HCl, 6M Urea, 30% glycerol, 2% SDS, 0.02%
It was immersed in Bromophenol Blue, pH 8.8) for 10 minutes for reduction treatment and SDS conversion. Furthermore, both ends of the Drystrip were cut into 5 cm length with a cutter, and SDS-polyacrylamide electrophoresis (Phast-gel
System, Pharmacia, and 5cmx5cmPAGE AIST developed micro high resolution gel, 20mA / gel, energized for about 1 hour). This is Natural
A protein gel chip was used.

そのNatural Proteinゲルチップを下記の方法にて電気的に5cmx5cmの低蛍光加工PVDFメンブレン（millipore,
immobilon-FL 0.45μm）へ転写し、Natural Proteinメンブレンチップとした。すなわち、低蛍光PDVFメンブレンをメタノールに約10秒間浸したのち、転写Buffer
(25 mM Tris, 200 mM グリシン, 10%メタノール)にて十分に平衡化した。セミドライトランスファー装置（株式会社バイオクラフト）を用いて、上記のSDS-ポリアクリルアミドゲル中の蛋白質を転写bufferにて平衡化したメンブレンへ電気的（約140
mA, 50分間通電）に転写した。転写後、直ちに蛍光スキャナー（Typhoon 9400, GE Healthcare Bio-Sciences Corp.）を用いてメンブレン上のタンパク質を検出した。 The Natural Protein gel chip was electrically converted into a 5 cm x 5 cm low-fluorescence PVDF membrane (millipore,
immobilon-FL 0.45 μm) and used as a Natural Protein membrane chip. That is, soak the low-fluorescence PDVF membrane in methanol for about 10 seconds, and then transfer the buffer.
(25 mM Tris, 200 mM glycine, 10% methanol) was fully equilibrated. Using a semi-dry transfer device (Biocraft Co., Ltd.), the protein in the SDS-polyacrylamide gel is electrically transferred to the membrane equilibrated with the transfer buffer (approx. 140
mA, 50 minutes energized). Immediately after the transfer, the protein on the membrane was detected using a fluorescence scanner (Typhoon 9400, GE Healthcare Bio-Sciences Corp.).

検出後、低蛍光PVDFメンブレンをブロッキング液(0.5%BSA/0.03%Tween20/PBS)に浸して室温にて10分間振とうした後、洗浄液(0.3%Tween20/PBS)にて低蛍光・低吸着・高感度・高分解能PVDFメンブレンを洗浄した(5分間×3回)。抗体カクテル溶液(anti
GFAP pAb 1/500倍 Santa Cruz, anti CD44 pAb 1/1000倍 Ancell, anti p53 mAb 1/1000倍
Santa Cruz, anti prothrombin mAb 1/1000倍 Abcam, anti α1-antitrypsin mAb 1/1000倍
Abcam, anti ubiqutine activating wnzyme E1 mAb 1/500倍 Upstate, anti Erk pAb
1/1000倍 Santa Cruz, anti p-Tyr mAb 1/1000倍 Upstate, anti zyxin pAb 1/1000倍
abcam)（ここで、pAbはウサギポリクローナル抗体を示し、mAbはマウスモノクローナル抗体を示す）に低蛍光PVDFメンブレンを浸して室温で1時間振とうし、洗浄液にて洗浄(5分間×3回)したのち、二次抗体溶液{anti
rabbit Ig-HRP linked F(ab’)₂
fragment (from donkey) 1/20,000倍あるいはanti mouse Ig-HRP linked F(ab’)₂ fragment (from sheep) 1/20,000倍、GE Healthcare Bio-Sciences Corp.}に低蛍光PVDFメンブレンを浸して室温にて30分間振とうした。洗浄液にて洗浄(5分間×3回)した後、ECL
Plus Western Blotting Detection Reagents (GE Healthcare Bio-Sciences Corp.) 1
mLに浸して室温で1分間反応させた。透明フィルムに挟んだ低蛍光PVDFメンブレンとHyperfilm (GE Healthcare Bio-Sciences
Corp.)を密着(約2分間)させて感光させたのち、Hyperfilmを現像した。また、同時に蛍光スキャナー(Typhoon 9400, GE Healthcare
Bio-Sciences Corp.)にてスキャンし、抗体反応性蛍光スポットを疑似色にて着色し、Cydye検出画像の上にマージさせることによって画像化した。
(結果)
Natural Proteinメンブレンチップによる脳腫瘍組織サンプル及び正常組織サンプルの抗体を用いたプロファイリングの結果を図３４〜図３７に示した。また、実際のサンプルパターンを図３８に示し、各患者のパターンをまとめたものを図３９に示した。 After detection, immerse the low-fluorescence PVDF membrane in a blocking solution (0.5% BSA / 0.03% Tween20 / PBS), shake at room temperature for 10 minutes, and then wash with a washing solution (0.3% Tween20 / PBS). High sensitivity and high resolution PVDF membrane was washed (5 minutes x 3 times). Antibody cocktail solution (anti
GFAP pAb 1 / 500x Santa Cruz, anti CD44 pAb 1 / 1000x Ancell, anti p53 mAb 1 / 1000x
Santa Cruz, anti prothrombin mAb 1/1000 times Abcam, anti α1-antitrypsin mAb 1/1000 times
Abcam, anti ubiqutine activating wnzyme E1 mAb 1 / 500x Upstate, anti Erk pAb
1/1000 times Santa Cruz, anti p-Tyr mAb 1/1000 times Upstate, anti zyxin pAb 1/1000 times
abcam) (where pAb represents a rabbit polyclonal antibody and mAb represents a mouse monoclonal antibody) a low-fluorescence PVDF membrane was soaked, shaken at room temperature for 1 hour, and washed with a washing solution (5 minutes × 3 times) Later, the secondary antibody solution {anti
rabbit Ig-HRP linked F (ab ') ₂
Fragment (from donkey) 1 / 20,000 times or anti mouse Ig-HRP linked F (ab ') ₂ fragment (from sheep) 1 / 20,000 times, GE Healthcare Bio-Sciences Corp. Shake for 30 minutes. After washing with washing liquid (5 minutes x 3 times), ECL
Plus Western Blotting Detection Reagents (GE Healthcare Bio-Sciences Corp.) 1
It was immersed in mL and reacted at room temperature for 1 minute. Low fluorescence PVDF membrane sandwiched between transparent films and Hyperfilm (GE Healthcare Bio-Sciences
Corp.) was allowed to adhere (about 2 minutes) and exposed to light, and then the Hyperfilm was developed. At the same time, the fluorescent scanner (Typhoon 9400, GE Healthcare
Bio-Sciences Corp.) and imaged by staining antibody reactive fluorescent spots with pseudo colors and merging them onto Cydye detection images.
(result)
The results of profiling using antibodies of brain tumor tissue samples and normal tissue samples using Natural Protein membrane chips are shown in FIGS. FIG. 38 shows an actual sample pattern, and FIG. 39 shows a summary of patterns of each patient.

GFAP，p53は腫瘍サンプルにおいてリン酸化を伴って増加し、ユビキチン化酵素E-1、CD44、アンタイトリプシン、プロトロンビンは量的な変動が明らかなパターンとして検出された。タイロシンリン酸化抗体、さらには、セリン/スレオニン酸染色法を用いると、正常と腫瘍に対する明らかな反応パターンの違いが認められた。また、各患者サンプルを用いて5種類（GFAP，CD44，タイロシンリン酸化抗体，Erk，Zyxin）の抗体カクテルにて抗がん剤感受性及び非感受性のプロファイリングにより比較すると、50及び40個の陽性スポットが得られ、それぞれに特徴的な共通の30個からなるパターンが検出された。これをType
１のパターンとして、現在，抗癌剤感受性に関連すると考えられる蛋白質の抗体をリストアップして、これらの抗体カクテルによるパターン化を行うことができる。 GFAP and p53 increased with phosphorylation in tumor samples, and ubiquitinase E-1, CD44, antitrypsin, and prothrombin were detected as patterns with obvious quantitative variations. When using tyrosin-phosphorylated antibody and serine / threonic acid staining method, a clear difference in response pattern between normal and tumor was observed. In addition, 50 and 40 positive spots were compared by anti-cancer drug susceptibility and insensitivity profiling with 5 types of antibody cocktails (GFAP, CD44, tyrosin phosphorylated antibody, Erk, Zyxin) using each patient sample. As a result, a common 30 pattern was detected. Type this
As a pattern, it is possible to list antibodies of proteins that are currently considered to be related to anticancer drug sensitivity, and to perform patterning using these antibody cocktails.

以上の結果より、正常と腫瘍、抗癌剤耐性と抗癌剤感受性との識別が、このNatural Protein (メンブレン)チップで可能であることが示された。また、数種類の抗体が同時に使用できること、さらには，反応の程度(量的変化)が定量化できることから、癌の診断として用いられている組織免疫染色及び癌マーカーのELISAによる定量のみでは得ることができなかった、翻訳後修飾変化などを含む多数の分子情報が数時間で得られ、脳腫瘍に関わらず、全ての疾患関連サンプルに応用できる有用な診断法となり得る。また、本法は通常、２日以上かかっていた方法論を最短で５時間に短縮して最終データを得ることができることから、臨床検査の方法論として有用である。 From the above results, it was shown that this Natural Protein (membrane) chip can be used to distinguish between normal and tumor, anticancer drug resistance and anticancer drug sensitivity. In addition, since several types of antibodies can be used simultaneously, and the degree of reaction (quantitative change) can be quantified, it can be obtained only by quantification of tissue immunostaining and cancer markers used for cancer diagnosis by ELISA. A large number of molecular information including post-translational modification changes that could not be obtained in a few hours can be a useful diagnostic method applicable to all disease-related samples regardless of brain tumors. In addition, this method is useful as a clinical test methodology because the final data can be obtained by shortening the method that took two days or more to 5 hours at the shortest.

図１は、本発明の実施の形態にかかるプロテオミクス解析システムの全体構成を示すブロックダイヤグラムである。FIG. 1 is a block diagram showing the overall configuration of a proteomic analysis system according to an embodiment of the present invention. 図２は、プロテオミクスベースの処理装置群の構成をより詳細に示すブロックダイヤグラムである。FIG. 2 is a block diagram showing the configuration of the proteomics-based processing device group in more detail. 図３は、ゲノミクスベースの処理装置群の構成をより詳細に示すブロックダイヤグラムである。FIG. 3 is a block diagram showing the configuration of the genomics-based processing device group in more detail. 図４は、プロテオミクスベースの処理装置群を用いたデータプロファイリング方法のうち、安定同位体標識法を用いたＬＣ−ショットガン法、および、２次元電気泳動を用いたディファレンシャルディスプレイ法の概略を示す図である。FIG. 4 shows an outline of LC − a shotgun method using a stable isotope labeling method and a differential display method using two-dimensional electrophoresis, among data profiling methods using a group of proteomics-based processors. FIG. 図５は、本実施の形態にかかるタンパク質同定処理を示すフローチャートである。FIG. 5 is a flowchart showing protein identification processing according to the present embodiment. 図６は、ゲノミクスベースの処理装置群によるデータプロファイリングの一例を示す図である。FIG. 6 is a diagram illustrating an example of data profiling performed by a genomics-based processing device group. 図７は、本実施の形態にかかる第１の質量分析計〜第３の質量分析計のそれぞれから得られたタンパク質を分類したベン図である。FIG. 7 is a Venn diagram in which proteins obtained from each of the first mass spectrometer to the third mass spectrometer according to the present embodiment are classified. 図８は、分子シグナルリンケージのタンパク質レベルネットワークの一例を示す図である。FIG. 8 is a diagram showing an example of a protein level network of molecular signal linkage. 図９は、機能分類ＤＢ内の疾患プロテオミクスデータの連携を説明する図である。FIG. 9 is a diagram for explaining cooperation of disease proteomic data in the function classification DB. 図１０は、リターンマッチングマイニングアルゴリズムの例を示す図である。FIG. 10 is a diagram illustrating an example of a return matching mining algorithm. 図１１は、タンパク質インテンシティのノーマライズ処理を説明する図である。FIG. 11 is a diagram for explaining the normalization processing of protein intensity. 図１２は、タンパク質インテンシティのノーマライズ処理を説明する図である。FIG. 12 is a diagram for explaining the normalization processing of protein intensity. 図１３は、病態サンプルから分子統合情報を得るためのディファレンシャルディスプレイのストラテジーを説明する図である。FIG. 13 is a diagram for explaining a differential display strategy for obtaining molecular integration information from a disease state sample. 図１４は、複数の病態サンプルから多重蛍光標識２次元電気泳動と翻訳後修飾部位に対する特殊蛍光染色法によるディファレンシャルディスプレイを示す図である。FIG. 14 is a diagram showing a differential display by multiple fluorescence labeling two-dimensional electrophoresis from a plurality of disease state samples and a special fluorescence staining method for post-translational modification sites. 図１５は、複数の病態サンプルの安定同位体標識試薬を用いたＬＣ−ショットガンによるディファレンシャルディスプレイ（脳腫瘍サンプル）の例を示す図である。FIG. 15 is a diagram showing an example of a differential display (brain tumor sample) by an LC-shot gun using a stable isotope labeling reagent for a plurality of disease state samples. 図１６は、抗がん剤非感受性脳腫瘍組織ｖｓ同一患者の正常組織（ＭＳｖｓＮ）、抗がん剤非感受性脳腫瘍組織ｖｓ同じ病態を示すが抗がん１３感受性組織(ＭＳｖｓＨＬ)のタンパク質及びｍＲＮＡレベルで同定された分子数を示す図である。FIG. 16 shows anti-cancer agent-insensitive brain tumor tissue vs normal tissue (MS vs N) of the same patient, anti-cancer agent-insensitive brain tumor tissue vs. the same pathology but anti-cancer 13 sensitive tissue (MS vs HL). FIG. 6 is a diagram showing the number of molecules identified at the protein and mRNA level. 図１７は、同一組織細胞可溶化サンプルを用いて各種のディファレンシャルディスプレイ解析を行って抽出してきた分子群の、タンパク質レベル、および、ｍＲＮＡの両レベルで同様に上昇しているシグナルネットワークの一例を示す図である。FIG. 17 shows an example of a signal network in which molecular groups extracted by performing various differential display analyzes using the same tissue cell solubilized sample are similarly increased at both the protein level and the mRNA level. FIG. 図１８は、脳腫瘍患者組織の抗癌剤非感受性サンプル中で、特異的にタンパク質、および、ｍＲＮＡの両レベルで同様に上昇しているシグナルネットワークを示す図である。FIG. 18 is a diagram showing a signal network that is similarly elevated at both protein and mRNA levels in an anticancer agent-insensitive sample of brain tumor patient tissue. 図１９は、組織・細胞Ｎａｔｕｒａｌプロテインチップの補足説明する図である。FIG. 19 is a diagram for explaining supplementary explanation of the tissue / cell Natural protein chip. 図２０は、Ｎａｔｕｒａｌプロテインチップによる患者血清中の自己抗体反応性プロテインスポットのプロファイリングを示す図である。FIG. 20 is a diagram showing profiling of autoantibody-reactive protein spots in patient serum using a Natural protein chip. 図２１は、データ保管管理テーブルを説明する図である。FIG. 21 is a diagram for explaining the data storage management table. 図２２は、リターンマッチングアルゴリズムを説明する図である。FIG. 22 is a diagram for explaining the return matching algorithm. 図２３は、リターンマッチングアルゴリズムを説明する図である。FIG. 23 is a diagram for explaining the return matching algorithm. 図２４（ａ）、（ｂ）は、あるタンパク質を、異なる２つの質量分析器で測定した結果を示す図である。FIGS. 24A and 24B are diagrams showing the results of measuring a certain protein with two different mass analyzers. 図２５は、図２４（ａ）、（ｂ）に示すデータを、データ統合処理により統合して得たデータの例を示す図である。FIG. 25 is a diagram illustrating an example of data obtained by integrating the data illustrated in FIGS. 24A and 24B by data integration processing. 図２６は、本実施の形態にかかるデータ統合処理の一例を示すフローチャートである。FIG. 26 is a flowchart showing an example of the data integration process according to the present embodiment. 図２７（ａ）は、図２４（ａ）、（ｂ）に示すタンパク質のデータに、図２６に示す統合化処理を施した結果を示す図、図２７（ｂ）は、図２７（ｂ）は、ローカルＤＢに記憶された、あるＤＮＡアレイのデータを示す図である。FIG. 27 (a) is a diagram showing the result of the integration processing shown in FIG. 26 on the protein data shown in FIGS. 24 (a) and (b), and FIG. 27 (b) is a diagram showing the result of FIG. These are figures which show the data of a certain DNA array memorize | stored in local DB. 図２８は、図２７（ａ）、（ｂ）に示すデータを、データ統合処理により統合して得たデータの例を示す図である。FIG. 28 is a diagram illustrating an example of data obtained by integrating the data illustrated in FIGS. 27A and 27B by data integration processing. 図２９は、図２７（ａ）、（ｂ）に示すようなデータを統合するデータ統合処理の例を示すフローチャートである。FIG. 29 is a flowchart showing an example of data integration processing for integrating data as shown in FIGS. 27 (a) and 27 (b). 図３０は、本実施の形態にかかるタンパク質同定手順の一例を示すフローチャートである。FIG. 30 is a flowchart showing an example of a protein identification procedure according to the present embodiment. 図３１は、本実施の形態にかかるタンパク質とｍＲＮＡのマージの例を説明するフローチャートである。FIG. 31 is a flowchart illustrating an example of merging of protein and mRNA according to the present embodiment. 図３２（ａ）は、２Ｄ−ＤＩＧＥの画像から得られたタンパク質の情報の例を示す図、図３２（ｂ）は、タンパク質間のユークリッド尺度の情報の例を示す図である。FIG. 32A is a diagram illustrating an example of protein information obtained from a 2D-DIGE image, and FIG. 32B is a diagram illustrating an example of Euclidean scale information between proteins. 図３３は、本実施の形態にかかる特異タンパク質の抽出を概略的に示した図である。FIG. 33 is a diagram schematically showing extraction of a specific protein according to the present embodiment. 図３４は、二次元ウエスタンブロッティングによる腫瘍特異的プロテインのプロファイリングを示す。FIG. 34 shows tumor-specific protein profiling by two-dimensional Western blotting. 図３５は、二次元ウエスタンブロッティングによる腫瘍特異的プロテインのプロファイリングを示す。FIG. 35 shows tumor-specific protein profiling by two-dimensional Western blotting. 図３６は、二次元ウエスタンブロッティング及びPro-Qダイアモンド染色による腫瘍特異的プロテインのプロファイリングを示す。FIG. 36 shows tumor-specific protein profiling by two-dimensional Western blotting and Pro-Q diamond staining. 図３７は、抗体カクテルを用いた二次元ウエスタンブロッティングによる腫瘍特異的プロテインのプロファイリングを示す。FIG. 37 shows tumor-specific protein profiling by two-dimensional Western blotting using an antibody cocktail. 図３８は、抗体カクテルによる個々の患者組織のメンブレンチップパターンを示す。FIG. 38 shows the membrane chip pattern of individual patient tissues with antibody cocktail. 図３９は、GCTEZカクテルを用いた非感受性及び感受性のAOG脳組織の患者試料の代表的なチップパターンを示す。FIG. 39 shows a representative chip pattern of patient samples of insensitive and sensitive AOG brain tissue using GCTEZ cocktail.

Explanation of symbols

１２プロテオミクスベースの処理装置群
１４ゲノミクスベースの処理装置群
１６プロテオミクスＤＢ
１８機能分類ＤＢ
２０統合解析システム
２２データマイニングシステム
２４国際ＤＢ
３０、３２、３４質量分析計
３６２Ｄ−ＤＩＧＥ測定装置
４０、４２、４４、４６制御・解析部
５０、５２、５４、５６ローカルＤＢ
６０マイクロダイセクション装置
６２ＤＮＡアレイ測定装置
６４リアルタイムＰＣＲ測定装置
７０、７２、７４制御・解析部
８０、８２、８４、ローカルＤＢ 12 Proteomics-based processor group 14 Genomics-based processor group 16 Proteomics DB
18 Function classification DB
20 Integrated analysis system 22 Data mining system 24 International DB
30, 32, 34 Mass spectrometer 36 2D-DIGE measuring device 40, 42, 44, 46 Control / analysis unit 50, 52, 54, 56 Local DB
60 Microdissection device 62 DNA array measurement device 64 Real-time PCR measurement device 70, 72, 74 Control / analysis unit 80, 82, 84, Local DB

Claims

Including at least electrophoretic images obtained by proteome analysis methods using tissues, cells and cultured cells derived from patients and disease model animals showing various pathological conditions, and mass spectrometry signal patterns by mass analyzers having various different characteristics, In a computer provided with a database storing raw data, a peptide / protein ID identified based on the raw data, and quantitative data, the computer includes:
Converting and merging data stored in the database into the same language;
Searching other publicly accessible databases accessible over the network;
An analysis program characterized by: extracting a specific protein group, predicting these functions, and analyzing a disease state.

In a computer having an MS spectrum database for storing MS spectrum data obtained by applying a plurality of LC-MS shotgun methods, the computer includes:
Identifying a protein;
Storing the analysis result obtained by the identification of the protein in a database;
An analysis program comprising: executing a step of linking and integrating an analysis result stored in a database, a protein interaction analysis system and a biological information integration platform through a network.

In a computer comprising a database storing logical spectra, the computer includes:
The raw data obtained by applying the stable isotope-labeled LC-MS shotgun method is based on the similarity between the peptide peak list created under the existing conditions and the theoretical spectrum stored in the database. Identifying a peptide; and
Determining a relative amount of protein containing the peptide from the difference in intensity of the pair of stable isotope-labeled pairs;
And a step of classifying a protein group that specifically appears in a specific tissue / cell.

MRNA quantitative expression analysis data by analysis of different properties, including DNA arrays and / or real-time PCR, and two-dimensional electrophoresis, LC-ESI-MS / MS shotgun method, and / or LC — In a computer comprising a database storing quantitative data by proteomics using different methods, including the MALDI-MS / MS shotgun method, the computer includes:
An analysis program for causing each of data stored in the database to execute a step of performing analysis by the same algorithm including a differential display.

Further, the computer
The analysis data, which is the result of the analysis performed by linking the database with another different database and stored in the database on the computer, is based on the information stored in the other linked different database. The analysis program according to claim 4, wherein the analysis step is executed.

In a computer comprising image data including 2D-DIGE, including image data including spot information and protein information including post-translational modification structures, the computer includes:
An analysis program for associating a spot in the image data with protein information including a post-translational modification structure from the database and storing the association in the database.

In the computer,
The analysis program according to any one of claims 1 to 5, wherein the step of reverse-inputting the information of the created integrated database to the analysis platform of the analyzer and executing the re-analysis is executed.

mRNA is DNA array and / or real-time PCR, solubilized protein is fluorescence-labeled two-dimensional electrophoresis including 2D-DIGE and / or LC-MS shotgun method of stable isotope labeling including ICAT and iTRAQ 5. The analysis program according to claim 4, wherein the analysis is performed by unifying the data by the quantitative differential display of the method at the same time.

In a computer comprising a database storing a molecular network, the computer includes:
Examining a molecular network in the database based on molecular integration data;
And a step of extracting a pathological tissue / cell-specific signal cascade that is specifically activated or specifically deactivated in a pathological condition.

In a computer comprising an Accession No identified based on raw data including mass spectrometry signal patterns by mass analyzers having different characteristics, and a database storing at least the intensity ratio, the computer includes:
Extracting an intensity ratio for the same Accession No. based on different mass analyzers stored in the database;
An analysis program, comprising: integrating the values of the intensity ratio, and storing the integrated values in the database in association with the AccessionNo.

Further, the database includes an Accession No based on data derived from a DNA array, and an Intensity ratio,
Extracting an integrated intensity ratio for the same Accession No and an intensity ratio for a DNA array stored in the database;
The analysis program according to claim 10, wherein the step of normalizing the value of the intensity ratio and storing the normalized value in association with the AccessionNo in the database is executed.

In a computer comprising image data including 2D-DIGE and storing image data including spot information, the computer includes:
Each spot information and a database connected to the computer, the other information database storing the known protein spot information, the spot information and the known protein information When the spot information matches, giving the spot information of the known protein as the spot information, and storing in the database,
A step of identifying a protein applied to the spot based on the second information of the spot, which is obtained by another method including analysis by a mass spectrometer and stored in the database when there is no match An analysis program characterized by that.

The step of identifying a protein based on the second information is a step of searching a database in which the information of the protein identified by the other method is stored, which is still another database connected to the computer. The analysis program according to claim 12, comprising:

A protein sample obtained by labeling a protein extracted from biological tissue or cells with a fluorescent dye is subjected to first-dimensional isoelectric focusing on the strip, and then the strip after isoelectric focusing is subjected to reduction treatment and SDS. A small-sized protein chip that covers all proteins synthesized in living tissue or cells, obtained by subjecting to second-dimensional SDS-polyacrylamide electrophoresis.

The protein chip according to claim 14, which has a size of 10 cm × 10 cm or less.

The protein chip according to claim 14 or 15, wherein the protein sample is transferred from the electrophoresis gel to a membrane and immobilized.

(1) a step of subjecting a protein sample obtained by labeling a protein extracted from biological tissue or cells with a fluorescent dye to first-dimensional isoelectric focusing on a strip; and (2) isoelectric focusing. The method for producing a protein chip according to claim 14 or 15, comprising a step of subjecting the subsequent strip to reduction treatment and SDS, and then subjecting the strip to a second-dimensional SDS-polyacrylamide electrophoresis.

(1) a step of subjecting a protein sample obtained by labeling a protein extracted from biological tissue or cells with a fluorescent dye to first-dimensional isoelectric focusing on a strip; (2) after isoelectric focusing The step of reducing the strip and converting it to SDS, then subjecting it to the second-dimensional SDS-polyacrylamide electrophoresis, and (3) transferring the protein sample from the electrophoresis gel after SDS-polyacrylamide electrophoresis to the membrane. The manufacturing method of the protein chip of Claim 16 containing.

17. Detecting fluorescence derived from a fluorescent dye labeled on the protein sample on the protein chip according to any one of claims 14 to 16, and imaging the protein sample as protein spot information on the protein chip. Method.

(1) A list of molecules that can be used for monitoring the progress of a disease state or drug sensitivity is listed from molecular information obtained by causing a computer to execute the program according to claim 1, 2, 3, 8, or 9. Preparing an antibody cocktail against these molecules, (2) bringing the antibody cocktail into contact with the protein chip according to any one of claims 14 to 16, and (3) a positive antibody reaction pattern on the protein chip. A method for monitoring the progress of a disease state or drug sensitivity, which comprises the steps of profiling and preparing a database.

(1) Anti-GFAP antibody, anti-CD44 antibody, anti-p53 antibody, anti-prothrombin antibody, anti-α1-antitrypsin antibody, anti-ubiqutine
activating enzyme E1 antibody, anti-Erk antibody, anti-p-Tyr antibody, and a step of preparing an antibody cocktail containing at least two kinds of antibodies selected from anti-zyxin antibodies, (2) extraction from biological tissue or cells of brain tumor patients The protein sample obtained by labeling the obtained protein with a fluorescent dye is subjected to first-dimension isoelectric focusing on the strip, and then the strip after isoelectric focusing is reduced and converted to SDS. Contacting the antibody cocktail with a micro-sized protein chip covering all proteins synthesized in living tissue or cells obtained by subjecting to eye SDS-polyacrylamide electrophoresis, and (3) on the protein chip A method of monitoring the progress of a disease state or drug sensitivity of a brain tumor, comprising the step of profiling the antibody reaction positive pattern of the above and preparing a database.

An antibody cocktail used in the method of claim 21 for monitoring the progress of a disease state or drug sensitivity of a brain tumor, comprising an anti-GFAP antibody, an anti-CD44 antibody, an anti-p53 antibody, an anti-prothrombin antibody, an anti-α1- antitrypsin antibody, anti-ubiqutine
An antibody cocktail comprising at least two kinds of antibodies selected from activating wnzyme E1 antibody, anti-Erk antibody, anti-p-Tyr antibody, and anti-zyxin antibody.