JP2008076406A

JP2008076406A - Method for protein structure analysis, protein structure analyzer, program, and recording medium

Info

Publication number: JP2008076406A
Application number: JP2007273069A
Authority: JP
Inventors: Naoyuki Yamada; 尚之山田; Eiichiro Suzuki; 榮一郎鈴木; Kazuo Hirayama; 和雄平山; Hideaki Umeyama; 秀明梅山
Original assignee: Ajinomoto Co Inc
Current assignee: Ajinomoto Co Inc
Priority date: 2002-04-26
Filing date: 2007-10-19
Publication date: 2008-04-03

Abstract

<P>PROBLEM TO BE SOLVED: To provide a highly sensitive and quick actual measuring technique for stereostructure information, capable of evaluating a stereostructure predicted result, and capable of providing structure information useful for predicting the structure. <P>SOLUTION: The technique of the present invention fragments an objective protein for predicting the stereostructure to measure a fragmentation spectrum, determines assignment information of a fragmentation ion in an amino acid sequence of the objective protein, based on the measured fragmentation spectrum, specifies a portion contacting with a solvent in the amino acid sequence of the objective protein, according to the measured fragmentation spectrum and the assignment information of the fragmentation ion, determines solvent contacting moiety information in the amino acid sequence of the objective protein, according to the portion, predicts the stereostructure as to the objective protein, calculates reference vibration as to the objective protein, and outputs the predicted stereostructure prediction data, the determined solvent contacting moiety information, and a calculation result by the reference vibration calculation, while correlated each other thereamong. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

本発明は、タンパク質構造解析方法、タンパク質構造解析装置、プログラム、および、記録媒体に関し、特に、計算手法により求めたタンパク質の立体構造データを生化学的な実験手法により求めたデータにより補完することができるタンパク質構造解析方法、タンパク質構造解析装置、プログラム、および、記録媒体に関する。 The present invention relates to a protein structure analysis method, a protein structure analysis apparatus, a program, and a recording medium, and in particular, can supplement protein three-dimensional structure data obtained by a calculation method with data obtained by a biochemical experimental method. The present invention relates to a protein structure analysis method, a protein structure analysis apparatus, a program, and a recording medium.

ヒトを始め多くの生物種において全ゲノム配列解析が進められ、その配列情報がデータベース化されている（非特許文献１）。ここで、ゲノム配列より、遺伝子の機能を特定、予測することがある程度可能であるが、配列情報のみでは機能を予測できない遺伝子が非常に多く存在する。遺伝子は、そのＤＮＡ配列から翻訳されたタンパク質が実際に機能を担っている。機能未知の遺伝子の場合においては、まずタンパク質のアミノ酸配列の相同性の高い機能既知のタンパク質を探し出すことにより、その機能を予測することが行われている。しかしながら、機能既知のタンパク質が存在しない場合が多くある。その場合は、立体構造の類似性の高いタンパク質を探し出し、または、新たに（ｄｅｎｏｖｏ）立体構造を予測し、機能予測をすることが可能である。 Whole genome sequence analysis is proceeding in many biological species including humans, and the sequence information is made into a database (Non-patent Document 1). Here, it is possible to identify and predict gene functions to some extent from genome sequences, but there are a large number of genes whose functions cannot be predicted only by sequence information. A gene is actually a function of a protein translated from its DNA sequence. In the case of a gene whose function is unknown, the function is predicted by first searching for a protein whose function is known with high homology in the amino acid sequence of the protein. However, there are many cases where a protein with a known function does not exist. In that case, it is possible to search for a protein having a high three-dimensional structure similarity or to predict a function by predicting a new (de novo) three-dimensional structure.

タンパク質が機能を発揮するためには、それにふさわしい立体構造を有していることが不可欠である。従って、アミノ酸配列の相同性を比較するよりも、立体構造の相同性を比較したほうが、より精密な機能推定をすることができる。 In order for a protein to exert its function, it is essential to have a three-dimensional structure suitable for it. Therefore, it is possible to estimate the function more precisely by comparing the homology of the three-dimensional structure than comparing the homology of the amino acid sequences.

立体構造未知のタンパク質アミノ酸配列から、その立体構造を予測する方法としては、ホモロジーモデリング法が一般的である。これは主に次の４つのステップよりなる計算科学的手法である。 As a method for predicting a three-dimensional structure from a protein amino acid sequence whose three-dimensional structure is unknown, a homology modeling method is generally used. This is a computational scientific method mainly consisting of the following four steps.

（１）任意のアミノ酸配列（目的配列）が与えられたとき、目的配列と類似の配列を有するタンパク質（参照タンパク質）をＰＤＢ（ＰｒｏｔｅｉｎＤａｔａＢａｎｋ）のような立体構造データベースから、単数もしくは複数検索（ホモロジー検索）し、目的配列と類似配列の間のアライメント（配列を並置したもの）を与える。 (1) When an arbitrary amino acid sequence (target sequence) is given, a protein having a sequence similar to the target sequence (reference protein) is searched from a three-dimensional structure database such as PDB (Protein Data Bank). Homology search) to give alignment (sequence alignment) between the target sequence and similar sequences.

このデータベース検索、アライメントを行うためには、ＦＡＳＴＡ、ＰＳＩ−ＢＬＡＳＴ、ＬＩＢＲＡなどのコンピュータソフトがある。ＦＡＳＴＡは２０種類の天然アミノ酸を意味する２０種のアルファベット文字配列のマッチングを行うプログラムであり、高ホモロジー（アミノ酸の一致度約３０％以上、ＦＡＳＴＡのｅ値では約０．０１以下に相当）の参照タンパク質に対して立体構造構築をすると、信頼性の高いモデルが構築出来るとされている。 In order to perform this database search and alignment, there are computer software such as FASTA, PSI-BLAST, and LIBRA. FASTA is a program that matches 20 alphabetic character sequences that mean 20 kinds of natural amino acids, and has high homology (corresponding to about 30% or more of amino acid matching, equivalent to about 0.01 or less in FASTA e value). It is said that a highly reliable model can be constructed by constructing a three-dimensional structure with respect to a reference protein.

また、ＰＳＩ−ＢＬＡＳＴでは、同じように文字配列のマッチングを行うが、文字が一致しているか否かの情報ではなく、プロファイルと呼ばれる文字の一致の度合いを類縁タンパク質の文字配列上部位ごとの置換行列として算出し、更に繰り返し計算を行うことによりアライメントを最適化する性質を持っている。 Also, in PSI-BLAST, matching of character sequences is performed in the same manner. However, instead of information on whether or not characters match, the degree of matching of characters called a profile is replaced for each region on the character sequence of related proteins. It has the property of optimizing the alignment by calculating it as a matrix and then repeatedly calculating it.

また、ＬＩＢＲＡは３Ｄ−１Ｄ法（ｔｈｒｅａｄｉｎｇ法ともいう）に基づくプログラムであり、既知立体構造をプローブにして類似配列を検索するため、ＦＡＳＴＡやＰＳＩ−ＢＬＡＳＴとは検索アルゴリズムが明らかに異なる。そのため、ＦＡＳＴＡやＰＳＩ−ＢＬＡＳＴとは異なる種類の配列間類似性を指摘できる場合がある。 LIBRA is a program based on the 3D-1D method (also called the threading method), and searches for similar sequences using a known three-dimensional structure as a probe. Therefore, the search algorithm is clearly different from FASTA and PSI-BLAST. Therefore, there may be a case where similarities between sequences different from FASTA and PSI-BLAST can be pointed out.

（２）ＦＡＳＴＡ、ＰＳＩ−ＢＬＡＳＴ、ＬＩＢＲＡなどにより算出したアライメントを用いれば、目的配列と類似配列間のアミノ酸ごとの対応関係が決まるので、この関係に基づき、参照タンパク質の３次元座標から目的配列上のアミノ酸ごとの３次元座標を作成する。 (2) If the alignment calculated by FASTA, PSI-BLAST, LIBRA, etc. is used, the corresponding relationship for each amino acid between the target sequence and the similar sequence is determined. Based on this relationship, the three-dimensional coordinates of the reference protein Create three-dimensional coordinates for each amino acid.

（３）目的配列側に対応するアミノ酸が存在しない場合には、参照タンパク質側のその位置のアミノ酸座標は用いず、逆に、参照タンパク質側に対応するアミノ酸が存在しない場合には、その位置の目的配列上のアミノ酸座標は、予め用意しておいたタンパク質断片座標データベースから適切なものを検索して作成する。 (3) When there is no amino acid corresponding to the target sequence side, the amino acid coordinates at that position on the reference protein side are not used. Conversely, when there is no amino acid corresponding to the reference protein side, The amino acid coordinates on the target sequence are created by searching for appropriate ones from a protein fragment coordinate database prepared in advance.

（４）上述の（２）および（３）によるタンパク質座標の構築では、アミノ酸残基間に構造的に不適切な隙間や衝突や歪みが生じることがあるので、エネルギー極小化計算により、これらの構造的な歪みを解消する。モデリングソフトによっては、この構造的な歪みの解消をスムーズに行うため、（２）〜（４）の計算及び検索処理をタンパク質全原子に対して同時に行うのではなく、段階的に行うものもある。即ち、まずタンパク質の骨格を形成するα炭素原子について行い、次いでα炭素原子を含む主鎖原子について行い、最後に側鎖原子を含むタンパク質原子全体について行うものである。 (4) In the construction of protein coordinates according to the above (2) and (3), structurally inappropriate gaps, collisions and distortions may occur between amino acid residues. Eliminate structural distortion. Depending on the modeling software, in order to smoothly eliminate this structural distortion, the calculation and search processing of (2) to (4) is not performed on all protein atoms at the same time, but in steps. . That is, it is performed first on the α carbon atom forming the protein skeleton, then on the main chain atom containing the α carbon atom, and finally on the entire protein atom containing the side chain atom.

以上により、目的配列に対するアライメントが得られれば、その立体構造を予測構築することができる。 As described above, if alignment with the target sequence is obtained, the three-dimensional structure can be predicted and constructed.

特許出願ＷＯ００／３９３２６Patent application WO00 / 39326 ＧｅｒａｒｄｏＪｉｍｅｎｅｚ−Ｓａｎｃｈｅｚ，Ｎａｔｕｒｅ４０９，８５３−８５５（２００１）Gerardo Jimenez-Sanchez, Nature 409, 853-855 (2001) Ｚｈａｎｇ，Ｚ．ａｎｄＳｍｉｔｈ，Ｄ．Ｌ．，（１９９３）ＰｒｏｔｅｉｎＳｃｉ．，２，５２２−５３１．Zhang, Z .; and Smith, D.A. L. (1993) Protein Sci. , 2, 522-531. Ａｋａｓｈｉ，Ｓ．，ａｎｄＴａｋｉｏ，Ｋ．，（２０００）ＰｒｏｔｅｉｎＳｃｉ，９，２４９７−２５０５．、明石知子「第４８回質量分析総合討論会要旨集」ｐ．２０８（２０００）Akashi, S .; , And Takao, K .; , (2000) Protein Sci, 9, 2497-2505. , Tomoko Akashi “Abstracts of the 48th General Meeting of Mass Spectrometry” p. 208 (2000) ＹａｍａｄａＮ．，ＳｕｚｕｋｉＥ．，ＨｉｒａｙａｍａＫ．，（２００２）ＲａｐｉｄＣｏｍｍｕｎ．ＭａｓｓＳｐｅｃｔｒｏｍ．，１６，２９３−２９９．Yamada N .; Suzuki E .; , Hirayama K .; (2002) Rapid Commun. Mass Spectrom. 16, 293-299.

しかしながら、計算科学によりＩｎＳｉｌｉｃｏで構築されたタンパク質の立体構造は実験による検証が全くなされていない。ここで、タンパク質の立体構造情報を実験的に測定する方法としては、Ｘ線結晶構造解析、電子顕微鏡、ＮＭＲなどがあるが、いずれの方法も構造を求めるためには、技術的に困難な上に、時間と労力を要する作業を必要とする。 However, the three-dimensional structure of the protein constructed by In Silico by computational science has not been verified by experiments. Here, methods for experimentally measuring the three-dimensional structure information of proteins include X-ray crystal structure analysis, electron microscope, and NMR, but these methods are technically difficult to obtain the structure. In addition, time-consuming and labor-intensive work is required.

ここで、近年、水素−重水素（Ｈ／Ｄ）交換反応と質量分析（ＭＳ）を用いて、タンパク質の立体構造情報を得る方法が報告されている。例えば、Ｈ／Ｄ交換反応を用いた技術は、特許文献１（Ｍｅｔｈｏｄｓｆｏｒｑｕａｎｔｉｆｙｉｎｇｈｅａｖｙｈｙｄｒｏｇｅｎｌｅｖｅｌｓａｔｓｐｅｃｉｆｉｃｐｅｐｔｉｄｅａｍｉｄｅｌｉｎｋａｇｅｓｉｎｐｒｏｔｅｉｎａｎｄｐｅｐｔｉｄｅ）、および、非特許文献２により開示されている。 Here, in recent years, methods for obtaining three-dimensional structure information of proteins using hydrogen-deuterium (H / D) exchange reaction and mass spectrometry (MS) have been reported. For example, a technique using an H / D exchange reaction is disclosed in Patent Document 1 (Methods for quantifying heavy hydrogen levels at specific peptide linkages in protein and peptides) and Non-Patent Document 2.

これらの方法は、重水から調製した溶液に溶解したタンパク質を経時的にサンプリングし、これを酸性溶液中でペプシン消化を行い、得られたペプチドをＬＣ／ＭＳで測定することにより、ペプチドの質量変化速度より、重水素化速度を算出する。これにより、重水素化速度が速いところは溶媒接触率が高いか揺らぎが大きい部位であり、遅いところは溶媒接触率が低い部位であることがわかる。 In these methods, the protein dissolved in a solution prepared from heavy water is sampled over time, digested with pepsin in an acidic solution, and the resulting peptide is measured by LC / MS. The deuteration rate is calculated from the rate. Thus, it can be seen that the place where the deuteration rate is fast is a part where the solvent contact rate is high or the fluctuation is large, and the part where the deuteration rate is slow is a part where the solvent contact rate is low.

しかしながら、この方法においても、多数のサンプリングの実施と測定時間の長いＬＣ／ＭＳを行わなくてはならないという問題点がある。また、ペプシン消化及びＬＣ／ＭＳ測定時において、タンパク質の立体構造が著しく変化し、これによりペプチド内、ペプチド間のＨ／Ｄスクランブリングが生じる可能性が高いという問題点もある。後者の問題点については、近年、ＦＴＩＣＲ−ＭＳ（フーリエ変換イオンサイクロトロン型質量分析計）による断片化法であるキャピラリースキマーＣＩＤ法を用いて、ペプシンの代わりにタンパク質を断片化し、Ｈ／Ｄ交換と併せてタンパク質−タンパク質相互作用部位情報を得たという報告がある（非特許文献３）が、依然として多数のサンプリングの実施が必要であるという問題点が存在している。 However, even in this method, there is a problem that a large number of samplings and LC / MS with a long measurement time must be performed. In addition, there is a problem that the three-dimensional structure of the protein is remarkably changed during pepsin digestion and LC / MS measurement, and thus there is a high possibility that H / D scrambling occurs within and between peptides. Regarding the latter problem, in recent years, the capillary skimmer CID method, which is a fragmentation method using FTICR-MS (Fourier transform ion cyclotron mass spectrometer), is used to fragment proteins instead of pepsin, and to perform H / D exchange. In addition, there is a report that protein-protein interaction site information has been obtained (Non-Patent Document 3), but there is still a problem that a large number of samplings are still necessary.

また、山田らは、ＦＴＩＣＲ−ＭＳによる断片化法、水素−重水素交換法（Ｈ／Ｄ交換法）とアフィニティークロマトグラフィーを組み合わせた方法により、タンパク質−タンパク質相互作用部位情報を得た報告がなされた（非特許文献４）。この方法では、少ないサンプリングで相互作用部位情報を得ることができる。 In addition, Yamada et al. Reported that protein-protein interaction site information was obtained by a method combining the fragmentation method by FTICR-MS, the hydrogen-deuterium exchange method (H / D exchange method), and affinity chromatography. (Non-Patent Document 4). In this method, interaction site information can be obtained with a small amount of sampling.

前述のように、従来のタンパク質立体構造予測においては、アミノ酸配列から計算機を利用して立体構造を構築していくが、あくまでも仮想構造であり、実験による検証がされていないため、予測構造を実際の立体構造と比べて評価するためには、Ｘ線結晶構造解析やＮＭＲなどを用いて構造解析を行わなくてはならない。そのためには、大量のタンパク質の調製や、結晶化、安定同位体ラベル化等を行わなければならず、非常に手間と時間を要する。 As described above, in conventional protein tertiary structure prediction, a three-dimensional structure is constructed using a computer from an amino acid sequence, but it is a virtual structure and has not been verified by experiments. In order to evaluate in comparison with the three-dimensional structure, the structural analysis must be performed using X-ray crystal structure analysis or NMR. For this purpose, preparation of a large amount of protein, crystallization, stable isotope labeling and the like must be performed, which requires much labor and time.

また、ＭＳとＨ／Ｄ交換反応とを組み合わせる方法は、精度は低いが高感度でタンパク質の立体構造情報を得ることができる手法であるが、依然として実験の手間と時間のかかる方法である。 Further, the method of combining MS and H / D exchange reaction is a method that can obtain protein three-dimensional structure information with high sensitivity but low accuracy, but still requires time and labor for experiments.

従って、本発明は、立体構造予測結果の評価や、構造予測に有用な構造情報の提供が可能であり、高感度かつ迅速な立体構造情報の実測技術を提供することを目的としている。 Accordingly, an object of the present invention is to provide a highly sensitive and quick measurement technique for three-dimensional structure information, which can evaluate a three-dimensional structure prediction result and provide structural information useful for structure prediction.

本発明者は、上記課題を解決するために、タンパク質の立体構造情報を迅速にかつ高感度で取得する方法を開発すべく、質量分析（ＭＳ）による断片化技術とそこから得られる構造情報について詳細に検討した。その結果、Ｈ／Ｄ交換反応による従来法は、多数のサンプリングが必要であること、および、Ｈ／Ｄのスクランブリングが生じること等の課題がまだあった。 In order to solve the above problems, the present inventor has developed a method for rapidly and highly sensitively obtaining three-dimensional structure information of a protein with respect to a fragmentation technique by mass spectrometry (MS) and structure information obtained therefrom. We examined in detail. As a result, the conventional method based on the H / D exchange reaction still has problems such as a large number of samplings and the occurrence of H / D scrambling.

そこで、ＭＳによるフラグメンテーションについて、ＨｅｘａｐｏｌｅａｓｓｉｓｔｅｄｃａｐｉｌｌａｒｙｓｋｉｍｍｅｒＣＩＤ法（以下、「ヘキサポールＣＩＤ法」という。）と、赤外多光子解離（Ｉｎｆｒａｒｅｄｍｕｌｔｉｐｈｏｔｏｎｄｉｓｓｏｃｉａｔｉｏｎ）法（以下、「ＩＲＭＰＤ法」という。）の測定法の検討と、その手法により得られる断片化スペクトルを鋭意検討した結果、タンパク質の２次構造、３次構造、揺らぎ等を反映した断片化が生じていることがわかった。 Therefore, for fragmentation by MS, measurement of Hexapore assisted capillary skimmer CID method (hereinafter referred to as “hexapole CID method”) and infrared multiphoton dissociation method (hereinafter referred to as “IRMPD method”). As a result of careful examination of the method and the fragmentation spectrum obtained by the method, it was found that fragmentation reflecting the secondary structure, tertiary structure, fluctuation, etc. of the protein occurred.

すなわち、タンパク質の構造のうちαへリックスやβストランドなどの２次構造をもつ強固な部分は断片化が起こりにくく、ループなど運動性の高い部位は断片化が生じやすいことが見出された。 That is, it has been found that a strong portion having a secondary structure such as α-helix or β-strand in the protein structure is less likely to be fragmented, and a highly mobile portion such as a loop is likely to be fragmented.

さらに、立体構造を知りたいタンパク質のアミノ酸配列から、３Ｄ−１Ｄ法を用いて順位付けをした立体構造既知の候補タンパク質の２次構造と比較することにより、立体構造予測に利用する候補タンパク質を絞り込むことができた。 Furthermore, the candidate protein used for three-dimensional structure prediction is narrowed down by comparing with the secondary structure of the candidate protein with the known three-dimensional structure ranked using the 3D-1D method from the amino acid sequence of the protein whose three-dimensional structure is desired. I was able to.

また、計算手法による予測構造にＭＳによるタンパク質の断片化データを照会することにより、予測構造を評価することができた。 In addition, the predicted structure could be evaluated by referring the protein fragmentation data by MS to the predicted structure by the calculation method.

これらの各種の知見に基づいて本発明が完成されるに到った。 The present invention has been completed based on these various findings.

上述した目的を達成するため、本発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムは、立体構造を予測する対象タンパク質について断片化して断片化スペクトルを測定する断片化スペクトル測定ステップ（断片化スペクトル測定手段）と、上記断片化スペクトル測定ステップ（断片化スペクトル測定手段）により測定された上記断片化スペクトルに基づいて、上記対象タンパク質のアミノ酸配列に対する上記断片化イオンの帰属情報を決定する帰属情報決定ステップ（帰属情報決定手段）と、上記帰属情報決定ステップ（帰属情報決定手段）により決定された上記断片化イオンの上記帰属情報により、上記対象タンパク質の上記アミノ酸配列において上記断片化イオンに断片化された箇所を特定し、当該箇所に従って上記対象タンパク質の上記アミノ酸配列における切断容易領域情報を決定する切断容易領域情報決定ステップ（切断容易領域情報決定手段）と、上記対象タンパク質について立体構造を予測する立体構造予測ステップ（立体構造予測手段）と、上記立体構造予測ステップ（立体構造予測手段）により予測された立体構造予測データと、上記切断容易領域情報決定ステップ（切断容易領域情報決定手段）により決定された上記切断容易領域情報とを対応させて出力する処理結果出力ステップ（処理結果出力手段）とを含む（備える）ことを特徴とする。 In order to achieve the above-described object, a protein structure analysis method, a protein structure analysis apparatus, and a program according to the present invention include a fragmentation spectrum measurement step of measuring a fragmentation spectrum by fragmenting a target protein that predicts a three-dimensional structure ( The fragmentation ion attribution information for the amino acid sequence of the target protein is determined based on the fragmentation spectrum measurement means) and the fragmentation spectrum measured by the fragmentation spectrum measurement step (fragmentation spectrum measurement means). By the attribution information determination step (attribute information determination means) and the attribute information of the fragmented ions determined by the attribute information determination step (attribute information determination means), the fragment ions in the amino acid sequence of the target protein Identify the fragmented part and An easy-cleavage region information determination step (easy-cleavage region information determination means) for determining easy-cleavage region information in the amino acid sequence of the target protein according to a place, and a three-dimensional structure prediction step (three-dimensional structure prediction) for predicting a three-dimensional structure of the target protein Means), the three-dimensional structure prediction data predicted by the three-dimensional structure prediction step (three-dimensional structure prediction means), and the easy-cleavage area information determined by the easy-cleavage area information determination step (easy-cleavage area information determination means) And a processing result output step (processing result output means) that outputs the data in correspondence with each other.

この方法、装置、および、プログラムによれば、立体構造を予測する対象タンパク質についてエレクトロスプレーイオン化法などによりイオン化したイオンをヘキサポールＣＩＤ法などにより断片化イオンに断片化し、断片化スペクトルを測定し、測定された断片化スペクトルに基づいて、対象タンパク質のアミノ酸配列に対する断片化イオンの帰属情報を決定し、決定された断片化イオンの帰属情報により、対象タンパク質のアミノ酸配列において断片化イオンに断片化された箇所を特定し、当該箇所に従って対象タンパク質のアミノ酸配列における切断容易領域情報を決定し、対象タンパク質について立体構造を予測し、予測された立体構造予測データと決定された切断容易領域情報とを対応させて出力する（例えば、グラフィック表示、テーブルによる一覧表示など）ので、立体構造未知のタンパク質（これをコードする遺伝子を含む）の立体構造予測を行う場合、実測した立体構造情報を加味することにより、精度の高い予測を行うことができるようになる。 According to this method, apparatus, and program, ions ionized by an electrospray ionization method or the like for a target protein for predicting a three-dimensional structure are fragmented into fragmented ions by a hexapole CID method or the like, and a fragmentation spectrum is measured. Based on the measured fragmentation spectrum, the fragmentation ion attribution information for the target protein amino acid sequence is determined, and the fragmentation ion attribution information is fragmented into fragmentation ions in the target protein amino acid sequence. The target region is identified, the easy-cleavage region information in the amino acid sequence of the target protein is determined, the three-dimensional structure is predicted for the target protein, and the predicted three-dimensional structure prediction data corresponds to the determined easy-cleavage region information. Output (for example, graphic display When displaying a three-dimensional structure of a protein with an unknown three-dimensional structure (including a gene encoding it), it is possible to make a high-precision prediction by taking into account the measured three-dimensional structure information. It becomes like this.

つぎの発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムは、上記に記載のタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムにおいて、上記処理結果出力ステップ（処理結果出力手段）は、上記立体構造予測データを針金モデル、リボンモデル、パイプモデル、ボールアンドスティックモデル、または、空間充填モデルのうちいずれかのモデルによりグラフィック表示し、上記切断容易領域情報を対応する上記立体構造予測データの表示部分に関連付けて表示することを特徴とする。 The protein structure analysis method, protein structure analysis apparatus, and program according to the next invention are the above-described processing result output step (processing result output means) in the protein structure analysis method, protein structure analysis apparatus, and program described above. The three-dimensional structure prediction data is graphically displayed by any one of a wire model, a ribbon model, a pipe model, a ball and stick model, or a space filling model, and the three-dimensional structure prediction corresponding to the easy-to-cut region information is displayed. It is characterized by being displayed in association with a data display portion.

これは処理結果出力の一例を一層具体的に示すものである。この方法、装置、および、プログラムによれば、立体構造予測データを針金モデル、リボンモデル、パイプモデル、ボールアンドスティックモデル、または、空間充填モデルのうちいずれかのモデルによりグラフィック表示し、切断容易領域情報を対応する立体構造予測データの表示部分に関連付けて表示する（例えば、リンクを設定、モデル上に表示、モデル上に切断容易領域情報に対応する網掛けや斜線などの特定のパターン模様を表示など）ので、これらの情報を視覚的に分かり易く表示することができるようになり、利用者が立体構造予測データの信頼性を直感的に判断することができるようになる。 This more specifically shows an example of the processing result output. According to this method, apparatus, and program, the three-dimensional structure prediction data is graphically displayed by one of a wire model, a ribbon model, a pipe model, a ball and stick model, or a space filling model, and an easy-to-cut area Information is displayed in association with the display portion of the corresponding three-dimensional structure prediction data (for example, a link is set, displayed on the model, and a specific pattern pattern such as shading or diagonal lines corresponding to easy-to-cut area information is displayed on the model. Therefore, it becomes possible to display such information visually and easily, and the user can intuitively determine the reliability of the three-dimensional structure prediction data.

つぎの発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムは、上記に記載のタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムにおいて、上記処理結果出力ステップ（処理結果出力手段）は、上記切断容易領域情報に従って対応する上記立体構造予測データの表示部分を色分けして表示することを特徴とする。 The protein structure analysis method, protein structure analysis apparatus, and program according to the next invention are the above-described processing result output step (processing result output means) in the protein structure analysis method, protein structure analysis apparatus, and program described above. Is characterized by displaying the corresponding three-dimensional structure prediction data display portion in different colors according to the easy-to-cut region information.

これは処理結果出力の一例を一層具体的に示すものである。この方法、装置、および、プログラムによれば、グラフィック表示やテーブル表示を行う際に、切断容易領域情報に従って対応する上記立体構造予測データの表示部分を色分けして表示するので、これらの情報を視覚的に分かり易く表示することができるようになり、利用者が立体構造予測データの信頼性を直感的に判断することができるようになる。 This more specifically shows an example of the processing result output. According to this method, apparatus, and program, when the graphic display or the table display is performed, the display portion of the corresponding three-dimensional structure prediction data corresponding to the easy-to-cut area information is displayed in different colors. Therefore, the user can intuitively determine the reliability of the three-dimensional structure prediction data.

つぎの発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムは、上記に記載のタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムにおいて、上記立体構造予測ステップ（立体構造予測手段）により予測された立体構造予測データと、上記切断容易領域情報決定ステップ（切断容易領域情報決定手段）により決定された上記切断容易領域情報とを比較して、上記切断容易領域情報により特定された切断容易領域に対応する予測構造部分について評価を行い予測構造評価情報を決定する予測構造評価情報決定ステップ（予測構造評価情報決定手段）と、上記予測構造評価情報決定ステップ（予測構造評価情報決定手段）により決定された予測構造評価情報を出力する予測構造評価情報出力ステップ（予測構造評価情報出力手段）とをさらに含む（備える）ことを特徴とする。 The protein structure analysis method, protein structure analysis apparatus, and program according to the next invention are the protein structure analysis method, protein structure analysis apparatus, and program described above, in the three-dimensional structure prediction step (three-dimensional structure prediction means). Is compared with the easy-to-cut region information determining step (the easy-to-cut region information determining unit), and the cutting specified by the easy-to-cut region information is compared. A prediction structure evaluation information determination step (prediction structure evaluation information determination means) for evaluating the prediction structure portion corresponding to the easy region and determining prediction structure evaluation information, and the prediction structure evaluation information determination step (prediction structure evaluation information determination means) Predictive structure evaluation information output for outputting predicted structure evaluation information determined by -Up further comprises a (predicted structure evaluation information output unit) (comprises) can be characterized.

この方法、装置、および、プログラムによれば、予測された立体構造予測データと、決定された切断容易領域情報とを比較して、切断容易領域情報により特定された切断容易領域に対応する予測構造部分について評価を行い予測構造評価情報を決定し、決定された予測構造評価情報を出力するので、計算機により予測された予測構造を生化学実験データに基づいて評価を行うことができるようになり、予測精度を著しく向上させることができるようになる。 According to this method, apparatus, and program, the predicted three-dimensional structure prediction data is compared with the determined easy-cleavage region information, and the predicted structure corresponding to the easy-cleavage region specified by the easy-cleavage region information Since the part is evaluated and the predicted structure evaluation information is determined and the determined predicted structure evaluation information is output, the predicted structure predicted by the computer can be evaluated based on the biochemical experiment data. The prediction accuracy can be remarkably improved.

また、これにより、ゲノム配列解析、ＤＮＡチップを用いた発現プロファイリング解析、プロテオーム解析などで見つかった興味深い遺伝子・タンパク質の機能解析を行う場合、立体構造を基準とした機能予測の精度が格段に向上するようになり、また、ゲノム配列解析においては、機能未知の遺伝子・タンパク質の立体構造を考慮した機能予測を行うことにより効率的に機能を推定することができるようになる。さらに、これにより、阻害剤などのドラッグデザインや蛋白質工学による活性向上や機能改変体のデザインを効率的に行うことができるようになる。 This also greatly improves the accuracy of function prediction based on the three-dimensional structure when conducting functional analysis of interesting genes and proteins found by genome sequence analysis, expression profiling analysis using DNA chips, proteome analysis, etc. In genome sequence analysis, the function can be efficiently estimated by performing function prediction in consideration of the three-dimensional structure of a gene / protein whose function is unknown. Further, this makes it possible to efficiently design drugs such as inhibitors, improve activity by protein engineering, and design functional modifications.

つぎの発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムは、上記に記載のタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムにおいて、上記対象タンパク質について基準振動計算を行う基準振動計算ステップ（基準振動計算手段）をさらに含み（備え）、上記処理結果出力ステップ（処理結果出力手段）は、上記基準振動計算ステップ（基準振動計算手段）による計算結果を対応する上記立体構造予測データの表示部分に関連付けて表示することを特徴とする。 The protein structure analysis method, the protein structure analysis apparatus, and the program according to the next invention are the reference vibration for performing the reference vibration calculation for the target protein in the protein structure analysis method, the protein structure analysis apparatus, and the program described above. The processing result output step (processing result output means) further includes (comprises) a calculation step (reference vibration calculation means), and the three-dimensional structure prediction data corresponding to the calculation result of the reference vibration calculation step (reference vibration calculation means) It is characterized by being displayed in association with the display part.

この方法、装置、および、プログラムによれば、対象タンパク質について基準振動計算を行い、基準振動計算結果（例えば、揺らぎの情報など）を対応する立体構造予測データの表示部分に関連付けて表示する（例えば、立体構造モデル上にベクトル表示を行う）ので、切断容易領域として揺らぎの情報を考慮することができるようになる。すなわち、基準振動計算結果により揺らぎ情報を求め、ＭＳ実測データと比較することにより予測モデルの評価を行うことができるようになる。 According to this method, apparatus, and program, the reference vibration calculation is performed for the target protein, and the reference vibration calculation result (for example, information on fluctuation) is displayed in association with the display portion of the corresponding three-dimensional structure prediction data (for example, Since the vector is displayed on the three-dimensional structure model), the fluctuation information can be considered as the easy-to-cut region. That is, the prediction model can be evaluated by obtaining fluctuation information from the reference vibration calculation result and comparing it with the MS actual measurement data.

つぎの発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムは、上記に記載のタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムにおいて、上記断片化スペクトル測定ステップ（断片化スペクトル測定手段）は、ヘキサポールＣＩＤ法、ノズルスキマーＣＩＤ法、キャピラリースキマーＣＩＤ法、ＳＯＲＩ−ＣＩＤ法、および、マルチポールストアー補助キャピラリースキマーＣＩＤ法を含む衝突活性化解離（ＣＩＤ）法、ＩＲＭＰＤ法、インソース分解（ＩＳＤ）法、ポストソース分解（ＰＳＤ）法、表面誘起解離（ＳＩＤ）法、ＥＣＤ法、および、ＢＩＲＤ法のうち少なくとも一つの方法により上記対象タンパク質について断片化イオンに断片化し上記断片化スペクトルを測定することを特徴とする。 The protein structure analysis method, the protein structure analysis device, and the program according to the next invention are the above-described fragmentation spectrum measurement step (fragmentation spectrum measurement) in the protein structure analysis method, protein structure analysis device, and program described above. Means) include a hexapole CID method, a nozzle skimmer CID method, a capillary skimmer CID method, a SORI-CID method, and a collision activated dissociation (CID) method including a multipole store auxiliary capillary skimmer CID method, an IRMPD method, an in-source method The target protein is fragmented into fragment ions by at least one of decomposition (ISD) method, post-source decomposition (PSD) method, surface induced dissociation (SID) method, ECD method, and BIRD method. Measure And wherein the door.

これは断片化スペクトル測定の一例を一層具体的に示すものである。この方法、装置、および、プログラムによれば、立体構造を予測する対象タンパク質についてヘキサポールＣＩＤ法、ノズルスキマーＣＩＤ法、キャピラリースキマーＣＩＤ法、ＳＯＲＩ−ＣＩＤ法、および、マルチポールストアー補助キャピラリースキマーＣＩＤ法を含む衝突活性化解離（ＣＩＤ）法、ＩＲＭＰＤ法、インソース分解（ＩＳＤ）法、ポストソース分解（ＰＳＤ）法、表面誘起解離（ＳＩＤ）法、ＥＣＤ法、および、ＢＩＲＤ法のうち少なくとも一つの方法により上記対象タンパク質について断片化イオンに断片化し上記断片化スペクトルを測定するので、対象タンパク質の立体構造を反映した断片化イオンを効率的に作成することができるようになる。 This shows one example of the fragmentation spectrum measurement more specifically. According to this method, apparatus, and program, a hexapole CID method, a nozzle skimmer CID method, a capillary skimmer CID method, a SORI-CID method, and a multipole store-assisted capillary skimmer CID method are used for a target protein that predicts a three-dimensional structure. At least one of collision activated dissociation (CID) method, IRMPD method, in-source decomposition (ISD) method, post-source decomposition (PSD) method, surface-induced dissociation (SID) method, ECD method, and BIRD method Since the target protein is fragmented into fragmented ions by the method and the fragmentation spectrum is measured, fragmented ions reflecting the three-dimensional structure of the target protein can be efficiently created.

つぎの発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムは、上記に記載のタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムにおいて、上記断片化スペクトル測定ステップ（断片化スペクトル測定手段）は、酵素反応により上記対象タンパク質について断片化を行い上記断片化スペクトルを測定することを特徴とする。 The protein structure analysis method, the protein structure analysis device, and the program according to the next invention are the above-described fragmentation spectrum measurement step (fragmentation spectrum measurement) in the protein structure analysis method, protein structure analysis device, and program described above. (Means) is characterized in that the target protein is fragmented by an enzymatic reaction and the fragmentation spectrum is measured.

これは断片化スペクトル測定の一例を一層具体的に示すものである。この方法、装置、および、プログラムによれば、酵素反応により対象タンパク質について断片化を行い断片化スペクトルを測定するので、対象タンパク質の立体構造を反映した断片化を効率的に行うことができるようになる。 This shows one example of the fragmentation spectrum measurement more specifically. According to this method, apparatus, and program, since the target protein is fragmented by an enzymatic reaction and the fragmentation spectrum is measured, the fragmentation reflecting the three-dimensional structure of the target protein can be efficiently performed. Become.

つぎの発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムは、立体構造を予測する対象タンパク質について断片化して断片化スペクトルを測定する断片化スペクトル測定ステップ（断片化スペクトル測定手段）と、上記断片化スペクトル測定ステップ（断片化スペクトル測定手段）により測定された上記断片化スペクトルに基づいて、上記対象タンパク質のアミノ酸配列に対する上記断片化イオンの帰属情報を決定する帰属情報決定ステップ（帰属情報決定手段）と、上記断片化スペクトル測定ステップ（断片化スペクトル測定手段）により測定された上記断片化スペクトル、および、上記帰属情報決定ステップ（帰属情報決定手段）により決定された上記断片化イオンの上記帰属情報に従って、上記対象タンパク質の上記アミノ酸配列において溶媒に接触している箇所を特定し、当該箇所に従って上記対象タンパク質の上記アミノ酸配列における溶媒接触残基情報を決定する溶媒接触残基情報決定ステップ（溶媒接触残基情報決定手段）と、上記対象タンパク質について立体構造を予測する立体構造予測ステップ（立体構造予測手段）と、上記立体構造予測ステップ（立体構造予測手段）により予測された立体構造予測データと、上記溶媒接触残基情報決定ステップ（溶媒接触残基情報決定手段）により決定された上記溶媒接触残基情報とを対応させて出力する処理結果出力ステップ（処理結果出力手段）とを含む（備える）ことを特徴とする。 A protein structure analysis method, a protein structure analysis apparatus, and a program according to the next invention include a fragmentation spectrum measurement step (fragmentation spectrum measurement means) that measures a fragmentation spectrum by fragmenting a target protein for which a three-dimensional structure is predicted. , An attribution information determination step (identification information) for determining the attribution information of the fragmentation ion to the amino acid sequence of the target protein based on the fragmentation spectrum measured by the fragmentation spectrum measurement step (fragmentation spectrum measurement means) Determination means), the fragmentation spectrum measured by the fragmentation spectrum measurement step (fragmentation spectrum measurement means), and the fragmentation ions determined by the attribution information determination step (attribute information determination means) According to attribution information A solvent contact residue information determination step (solvent contact residue information determination step) that identifies a position in contact with a solvent in the amino acid sequence of the protein and determines solvent contact residue information in the amino acid sequence of the target protein according to the position Determination means), a three-dimensional structure prediction step (three-dimensional structure prediction means) for predicting a three-dimensional structure of the target protein, three-dimensional structure prediction data predicted by the three-dimensional structure prediction step (three-dimensional structure prediction means), and the solvent contact A processing result output step (processing result output means) for outputting the corresponding information to the solvent contact residue information determined by the residue information determination step (solvent contact residue information determination means). And

この方法、装置、および、プログラムによれば、立体構造を予測する対象タンパク質について断片化し、断片化スペクトルを測定し、測定された断片化スペクトルに基づいて、対象タンパク質のアミノ酸配列に対する断片化イオンの帰属情報を決定し、測定された断片化スペクトル、および、決定された断片化イオンの帰属情報に従って、対象タンパク質のアミノ酸配列において溶媒に接触している箇所を特定し、当該箇所に従って対象タンパク質のアミノ酸配列における溶媒接触残基情報を決定し、対象タンパク質について立体構造を予測し、予測された立体構造予測データと決定された溶媒接触残基情報とを対応させて出力する（例えば、グラフィック表示、テーブルによる一覧表示など）ので、立体構造未知のタンパク質（これをコードする遺伝子を含む）の立体構造予測を行う場合、実測した立体構造情報を加味することにより、精度の高い予測を行うことができるようになる。 According to this method, apparatus, and program, fragmentation is performed on a target protein for which a three-dimensional structure is predicted, a fragmentation spectrum is measured, and based on the measured fragmentation spectrum, fragmentation ions for the amino acid sequence of the target protein are measured. Determine the attribution information, identify the location in contact with the solvent in the amino acid sequence of the target protein according to the measured fragmentation spectrum and the determined attribution information of the fragmented ion, and the amino acid of the target protein according to the location Solvent contact residue information in the sequence is determined, the three-dimensional structure is predicted for the target protein, and the predicted three-dimensional structure prediction data and the determined solvent contact residue information are output in correspondence with each other (eg, graphic display, table) List of proteins, etc.). When performing three-dimensional structure prediction of containing the gene) to, by adding the actually measured three-dimensional structure information, it is possible to perform a highly accurate prediction.

つぎの発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムは、上記に記載のタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムにおいて、上記処理結果出力ステップ（処理結果出力手段）は、上記立体構造予測データを針金モデル、リボンモデル、パイプモデル、ボールアンドスティックモデル、または、空間充填モデルのうちいずれかのモデルによりグラフィック表示し、上記溶媒接触残基情報を対応する上記立体構造予測データの表示部分に関連付けて表示することを特徴とする。 The protein structure analysis method, protein structure analysis apparatus, and program according to the next invention are the above-described processing result output step (processing result output means) in the protein structure analysis method, protein structure analysis apparatus, and program described above. Displays the three-dimensional structure prediction data graphically by any one of a wire model, a ribbon model, a pipe model, a ball and stick model, or a space-filling model, and the solvent-contacting residue information corresponds to the corresponding three-dimensional structure It is characterized by being displayed in association with the display portion of the prediction data.

これは処理結果出力の一例を一層具体的に示すものである。この方法、装置、および、プログラムによれば、立体構造予測データを針金モデル、リボンモデル、パイプモデル、ボールアンドスティックモデル、または、空間充填モデルのうちいずれかのモデルによりグラフィック表示し、溶媒接触残基情報を対応する立体構造予測データの表示部分に関連付けて表示する（例えば、リンクを設定、モデル上に表示、モデル上に溶媒接触領域情報に対応する網掛けや斜線などの特定のパターン模様を表示など）ので、これらの情報を視覚的に分かり易く表示することができるようになり、利用者が立体構造予測データの信頼性を直感的に判断することができるようになる。 This more specifically shows an example of the processing result output. According to this method, apparatus, and program, the three-dimensional structure prediction data is graphically displayed by one of a wire model, a ribbon model, a pipe model, a ball and stick model, or a space filling model, and a solvent contact residue is displayed. Display the base information in association with the display portion of the corresponding three-dimensional structure prediction data (for example, set a link, display on the model, and display a specific pattern pattern such as shading or diagonal lines corresponding to the solvent contact area information on the model. Therefore, such information can be displayed visually and in an easily understandable manner, and the user can intuitively determine the reliability of the three-dimensional structure prediction data.

つぎの発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムは、上記に記載のタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムにおいて、上記処理結果出力ステップ（処理結果出力手段）は、上記溶媒接触残基情報に従って対応する上記立体構造予測データの表示部分を色分けして表示することを特徴とする。 The protein structure analysis method, protein structure analysis apparatus, and program according to the next invention are the above-described processing result output step (processing result output means) in the protein structure analysis method, protein structure analysis apparatus, and program described above. Is characterized by displaying the corresponding three-dimensional structure prediction data in a color-coded manner in accordance with the solvent contact residue information.

これは処理結果出力の一例を一層具体的に示すものである。この方法、装置、および、プログラムによれば、グラフィック表示やテーブル表示を行う際に、溶媒接触残基情報に従って対応する上記立体構造予測データの表示部分を色分けして表示するので、これらの情報を視覚的に分かり易く表示することができるようになり、利用者が立体構造予測データの信頼性を直感的に判断することができるようになる。 This more specifically shows an example of the processing result output. According to this method, apparatus, and program, when performing graphic display or table display, the display portion of the corresponding three-dimensional structure prediction data corresponding to the solvent contact residue information is displayed in different colors. It becomes possible to display visually and intelligibly, and the user can intuitively determine the reliability of the three-dimensional structure prediction data.

つぎの発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムは、上記に記載のタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムにおいて、上記立体構造予測ステップ（立体構造予測手段）により予測された立体構造予測データと、上記溶媒接触残基情報決定ステップ（溶媒接触残基情報決定手段）により決定された上記溶媒接触残基情報とを比較して、上記溶媒接触残基情報により特定された溶媒接触残基に対応する予測構造部分について評価を行い予測構造評価情報を決定する予測構造評価情報決定ステップ（予測構造評価情報決定手段）と、上記予測構造評価情報決定ステップ（予測構造評価情報決定手段）により決定された予測構造評価情報を出力する予測構造評価情報出力ステップ（予測構造評価情報出力手段）とをさらに含む（備える）ことを特徴とする。 The protein structure analysis method, protein structure analysis apparatus, and program according to the next invention are the protein structure analysis method, protein structure analysis apparatus, and program described above, in the three-dimensional structure prediction step (three-dimensional structure prediction means). Is compared with the solvent contact residue information determined in the solvent contact residue information determination step (solvent contact residue information determination means), and the solvent contact residue information is compared. A predicted structure evaluation information determination step (predicted structure evaluation information determination means) that evaluates the predicted structure portion corresponding to the specified solvent-contacting residue and determines predicted structure evaluation information, and the predicted structure evaluation information determination step (predicted structure) Predicted structure evaluation information output unit for outputting the predicted structure evaluation information determined by the evaluation information determining means) -Up further comprises a (predicted structure evaluation information output unit) (comprises) can be characterized.

この方法、装置、および、プログラムによれば、予測された立体構造予測データと、決定された溶媒接触残基情報とを比較して、溶媒接触残基情報により特定された溶媒接触残基に対応する予測構造部分について評価を行い予測構造評価情報を決定し、決定された予測構造評価情報を出力するので、計算機により予測された予測構造を生化学実験データに基づいて評価を行うことができるようになり、予測精度を著しく向上させることができるようになる。 According to this method, apparatus, and program, the predicted three-dimensional structure prediction data is compared with the determined solvent contact residue information to correspond to the solvent contact residue specified by the solvent contact residue information. The predicted structure evaluation information is determined and the predicted structure evaluation information is determined and the determined predicted structure evaluation information is output, so that the predicted structure predicted by the computer can be evaluated based on the biochemical experiment data. Thus, the prediction accuracy can be remarkably improved.

この方法、装置、および、プログラムによれば、対象タンパク質について基準振動計算を行い、基準振動計算結果（例えば、揺らぎの情報など）を対応する立体構造予測データの表示部分に関連付けて表示する（例えば、立体構造モデル上にベクトル表示を行う）ので、溶媒接触領域として揺らぎの情報を考慮することができるようになる。すなわち、基準振動計算結果により揺らぎ情報を求め、ＭＳ実測データと比較することにより予測モデルの評価を行うことができるようになる。 According to this method, apparatus, and program, the reference vibration calculation is performed for the target protein, and the reference vibration calculation result (for example, information on fluctuation) is displayed in association with the display portion of the corresponding three-dimensional structure prediction data (for example, Since the vector display is performed on the three-dimensional structure model, fluctuation information can be considered as the solvent contact area. That is, the prediction model can be evaluated by obtaining fluctuation information from the reference vibration calculation result and comparing it with the MS actual measurement data.

つぎの発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムは、上記に記載のタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムにおいて、上記断片化スペクトル測定ステップ（断片化スペクトル測定手段）は、上記対象タンパク質について水素−重水素交換反応を行った後、断片化し上記断片化スペクトルを測定し、上記溶媒接触残基情報決定ステップ（溶媒接触残基情報決定手段）は、上記断片化スペクトル測定ステップ（断片化スペクトル測定手段）により測定された上記断片化スペクトル、および、上記帰属情報決定ステップ（帰属情報決定手段）により決定された上記断片化イオンの上記帰属情報に従って、上記対象タンパク質の上記アミノ酸配列中の各アミノ酸残基における重水素化速度を決定し、当該重水素化速度に従って上記対象タンパク質の上記アミノ酸配列において溶媒に接触している箇所を特定し、当該箇所に従って上記対象タンパク質の上記アミノ酸配列における溶媒接触残基情報を決定することを特徴とする。 The protein structure analysis method, the protein structure analysis device, and the program according to the next invention are the above-described fragmentation spectrum measurement step (fragmentation spectrum measurement) in the protein structure analysis method, protein structure analysis device, and program described above. Means) after subjecting the target protein to hydrogen-deuterium exchange reaction, fragmentation and measurement of the fragmentation spectrum, and the solvent contact residue information determination step (solvent contact residue information determination means) The target protein according to the fragmentation spectrum measured by the fragmentation spectrum measurement step (fragmentation spectrum measurement means) and the attribution information of the fragmentation ion determined by the attribution information determination step (attribute information determination means) At each amino acid residue in the above amino acid sequence Determine the deuteration rate, identify the position in contact with the solvent in the amino acid sequence of the target protein according to the deuteration rate, and determine the solvent contact residue information in the amino acid sequence of the target protein according to the position. It is characterized by determining.

これは断片化スペクトル測定の一例を一層具体的に示すものである。この方法、装置、および、プログラムによれば、対象タンパク質について水素−重水素交換反応を行った後、断片化し断片化スペクトルを測定し、測定された断片化スペクトル、および、決定された断片化イオンの帰属情報に従って、対象タンパク質のアミノ酸配列中の各アミノ酸残基における重水素化速度を決定し、当該重水素化速度に従って対象タンパク質のアミノ酸配列において溶媒に接触している箇所を特定し、当該箇所に従って対象タンパク質の上記アミノ酸配列における溶媒接触残基情報を決定するので、効率的に溶媒接触情報を求めることができるようになる。 This shows one example of the fragmentation spectrum measurement more specifically. According to this method, apparatus, and program, after performing a hydrogen-deuterium exchange reaction on the target protein, the fragmentation spectrum is measured and the measured fragmentation spectrum and the determined fragmentation ion are determined. In accordance with the attribution information of the target protein, the deuteration rate at each amino acid residue in the amino acid sequence of the target protein is determined, the location in contact with the solvent in the amino acid sequence of the target protein is identified according to the deuteration rate, and the location Accordingly, the solvent contact residue information in the amino acid sequence of the target protein is determined according to the above, so that the solvent contact information can be efficiently obtained.

つぎの発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムは、上記に記載のタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムにおいて、上記断片化スペクトル測定ステップ（断片化スペクトル測定手段）は、上記対象タンパク質について化学修飾を行った後、断片化し上記断片化スペクトルを測定し、上記溶媒接触残基情報決定ステップ（溶媒接触残基情報決定手段）は、上記断片化スペクトル測定ステップ（断片化スペクトル測定手段）により測定された上記断片化スペクトル、および、上記帰属情報決定ステップ（帰属情報決定手段）により決定された上記断片化イオンの上記帰属情報に従って、上記対象タンパク質の上記アミノ酸配列中の各アミノ酸残基における化学修飾部分を決定し、当該化学修飾部分に従って上記対象タンパク質の上記アミノ酸配列において溶媒に接触している箇所を特定し、当該箇所に従って上記対象タンパク質の上記アミノ酸配列における溶媒接触残基情報を決定することを特徴とする。 The protein structure analysis method, the protein structure analysis device, and the program according to the next invention are the above-described fragmentation spectrum measurement step (fragmentation spectrum measurement) in the protein structure analysis method, protein structure analysis device, and program described above. Means) after chemically modifying the target protein, fragmenting and measuring the fragmentation spectrum, and the solvent contact residue information determination step (solvent contact residue information determination means) includes the fragmentation spectrum measurement step The amino acid sequence of the target protein according to the fragmentation spectrum measured by (fragmentation spectrum measurement means) and the attribution information of the fragmentation ions determined by the attribution information determination step (attribute information determination means) Chemical modification at each amino acid residue Characterized in that, in accordance with the chemical modification portion, a position in contact with the solvent in the amino acid sequence of the target protein is identified, and solvent contact residue information in the amino acid sequence of the target protein is determined in accordance with the position. And

これは断片化スペクトル測定の一例を一層具体的に示すものである。この方法、装置、および、プログラムによれば、対象タンパク質について化学修飾を行った後、断片化し断片化スペクトルを測定し、測定された断片化スペクトル、および、決定された断片化イオンの帰属情報に従って、対象タンパク質のアミノ酸配列中の各アミノ酸残基における化学修飾部分を決定し、当該化学修飾部分に従って対象タンパク質のアミノ酸配列において溶媒に接触している箇所を特定し、当該箇所に従って対象タンパク質のアミノ酸配列における溶媒接触残基情報を決定するので、効率的に溶媒接触情報を求めることができるようになる。 This shows one example of the fragmentation spectrum measurement more specifically. According to this method, apparatus, and program, after subjecting a target protein to chemical modification, a fragmentation fragmentation spectrum is measured, and according to the measured fragmentation spectrum and the determined fragmentation ion assignment information , Determine the chemical modification part of each amino acid residue in the amino acid sequence of the target protein, identify the position in contact with the solvent in the amino acid sequence of the target protein according to the chemical modification part, and according to the position, the amino acid sequence of the target protein Since the solvent contact residue information in is determined, the solvent contact information can be efficiently obtained.

これは断片化スペクトル測定の一例を一層具体的に示すものである。この方法、装置、および、プログラムによれば、酵素反応により対象タンパク質について断片化を行い、Ｈ／Ｄ交換法や化学修飾法を組み合わせることにより、溶媒接触部位情報等を効率的に得ることができるようになる。 This shows one example of the fragmentation spectrum measurement more specifically. According to this method, apparatus, and program, it is possible to efficiently obtain solvent contact site information and the like by fragmenting the target protein by enzymatic reaction and combining the H / D exchange method and the chemical modification method. It becomes like this.

つぎの発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムは、立体構造を予測する対象タンパク質と化合物との複合体について断片化し、断片化スペクトルを測定する断片化スペクトル測定ステップ（断片化スペクトル測定手段）と、上記断片化スペクトル測定ステップ（断片化スペクトル測定手段）により測定された上記断片化スペクトルに基づいて、上記対象タンパク質および／または上記化合物に対する断片化イオンの帰属情報を決定する帰属情報決定ステップ（帰属情報決定手段）と、上記断片化スペクトル測定ステップ（断片化スペクトル測定手段）により測定された上記断片化スペクトル、および、上記帰属情報決定ステップ（帰属情報決定手段）により決定された上記断片化イオンの上記帰属情報に従って、上記対象タンパク質および／または上記化合物に関する相互作用界面情報を決定する相互作用界面情報決定ステップ（相互作用界面情報決定手段）と、上記対象タンパク質および／または上記化合物について立体構造を予測する立体構造予測ステップ（立体構造予測手段）と、上記立体構造予測ステップ（立体構造予測手段）により予測された立体構造予測データと、上記相互作用界面情報決定ステップ（相互作用界面情報決定手段）により決定された上記相互作用界面情報とを対応させて出力する処理結果出力ステップ（処理結果出力手段）とを含む（備える）ことを特徴とする。 A protein structure analysis method, a protein structure analysis apparatus, and a program according to the next invention include a fragmentation spectrum measurement step (fragment) for fragmenting a complex of a target protein and a compound for predicting a three-dimensional structure and measuring a fragmentation spectrum. Fragmentation spectrum measurement means) and the fragmentation ion attribution information for the target protein and / or the compound are determined based on the fragmentation spectrum measured by the fragmentation spectrum measurement step (fragmentation spectrum measurement means). Determined by the attribute information determination step (attribute information determination means), the fragmentation spectrum measured by the fragmentation spectrum measurement step (fragmentation spectrum measurement means), and the attribute information determination step (attribute information determination means). The above fragmentation ion An interaction interface information determination step (interaction interface information determination means) for determining interaction interface information regarding the target protein and / or the compound according to the information, and a three-dimensional structure for predicting a three-dimensional structure of the target protein and / or the compound Determined by the structure prediction step (stereostructure prediction means), the three-dimensional structure prediction data predicted by the three-dimensional structure prediction step (stereostructure prediction means), and the interaction interface information determination step (interaction interface information determination means). And a processing result output step (processing result output means) that outputs the interaction interface information in association with each other.

この方法、装置、および、プログラムによれば、立体構造を予測する対象タンパク質と化合物（例えば、タンパク質や低分子化合物や核酸など）との複合体について断片化し、断片化スペクトルを測定し、測定された断片化スペクトルに基づいて、対象タンパク質および／または化合物に対する断片化イオンの帰属情報を決定し、測定された断片化スペクトル、および、決定された断片化イオンの帰属情報に従って、対象タンパク質および／または化合物に関する相互作用界面情報を決定し、対象タンパク質および／または化合物について立体構造を予測し、予測された立体構造予測データと、決定された相互作用界面情報とを対応させて出力する（例えば、グラフィック表示、テーブルによる一覧表示など）ので、立体構造未知のタンパク質（これをコードする遺伝子を含む）の立体構造予測を行う場合、実測した立体構造情報を加味することにより、精度の高い予測を行うことができるようになる。 According to this method, apparatus, and program, a complex of a target protein and a compound (for example, a protein, a low molecular compound, or a nucleic acid) that predicts a three-dimensional structure is fragmented, and a fragmentation spectrum is measured and measured. The fragmentation ion attribution information for the target protein and / or compound is determined based on the fragmentation spectrum, and the target protein and / or the fragmentation ion identification information is determined according to the measured fragmentation spectrum and the determined fragmentation ion attribution information. The interaction interface information about the compound is determined, the three-dimensional structure is predicted for the target protein and / or compound, and the predicted three-dimensional structure prediction data and the determined interaction interface information are output in correspondence with each other (eg, graphic 3D structure unknown protein When performing a three-dimensional structure estimation of the (containing the gene encoding the same), by adding the actually measured three-dimensional structure information, it is possible to perform a highly accurate prediction.

つぎの発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムは、上記に記載のタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムにおいて、上記処理結果出力ステップ（処理結果出力手段）は、上記立体構造予測データを針金モデル、リボンモデル、パイプモデル、ボールアンドスティックモデル、または、空間充填モデルのうちいずれかのモデルにより上記対象タンパク質および／または上記化合物についてグラフィック表示し、上記相互作用界面情報を対応する上記立体構造予測データの表示部分に関連付けて表示することを特徴とする。 The protein structure analysis method, protein structure analysis apparatus, and program according to the next invention are the above-described processing result output step (processing result output means) in the protein structure analysis method, protein structure analysis apparatus, and program described above. Displays the three-dimensional structure prediction data graphically with respect to the target protein and / or the compound by any one of a wire model, a ribbon model, a pipe model, a ball and stick model, or a space filling model, and the interaction The interface information is displayed in association with the display portion of the corresponding three-dimensional structure prediction data.

これは処理結果出力の一例を一層具体的に示すものである。この方法、装置、および、プログラムによれば、立体構造予測データを針金モデル、リボンモデル、パイプモデル、ボールアンドスティックモデル、または、空間充填モデルのうちいずれかのモデルにより対象タンパク質および／または化合物についてグラフィック表示（例えば、ドッキングシミュレーションなども含む）し、相互作用界面情報を対応する立体構造予測データの表示部分に関連付けて表示する（例えば、リンクを設定、モデル上に表示、モデル上に相互作用界面情報に対応する網掛けや斜線などの特定のパターン模様を表示など）ので、これらの情報を視覚的に分かり易く表示することができるようになり、利用者が立体構造予測データの信頼性を直感的に判断することができるようになる。 This more specifically shows an example of the processing result output. According to this method, apparatus, and program, the three-dimensional structure prediction data is obtained for a target protein and / or compound by any one of a wire model, a ribbon model, a pipe model, a ball and stick model, or a space filling model. Graphic display (for example, including docking simulation) and display the interaction interface information in association with the display part of the corresponding 3D structure prediction data (for example, set a link, display on the model, interaction interface on the model) This makes it possible to display such information visually and in an easy-to-understand manner, so that users can intuitively understand the reliability of 3D structure prediction data. Can be judged automatically.

つぎの発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムは、上記に記載のタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムにおいて、上記処理結果出力ステップ（処理結果出力手段）は、上記相互作用界面情報に従って上記対象タンパク質および／または上記化合物の対応する上記立体構造予測データの表示部分を色分けして表示することを特徴とする。 The protein structure analysis method, protein structure analysis apparatus, and program according to the next invention are the above-described processing result output step (processing result output means) in the protein structure analysis method, protein structure analysis apparatus, and program described above. Is characterized in that the display portion of the corresponding three-dimensional structure prediction data of the target protein and / or the compound is displayed in different colors according to the interaction interface information.

これは処理結果出力の一例を一層具体的に示すものである。この方法、装置、および、プログラムによれば、グラフィック表示やテーブル表示を行う際に、相互作用界面情報に従って対象タンパク質および／または化合物の対応する上記立体構造予測データの表示部分を色分けして表示するので、これらの情報を視覚的に分かり易く表示することができるようになり、利用者が立体構造予測データの信頼性を直感的に判断することができるようになる。 This more specifically shows an example of the processing result output. According to this method, apparatus, and program, when graphic display or table display is performed, the display portion of the above-described three-dimensional structure prediction data corresponding to the target protein and / or compound is displayed in different colors according to the interaction interface information. Therefore, it becomes possible to display such information visually and easily, and the user can intuitively determine the reliability of the three-dimensional structure prediction data.

つぎの発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムは、上記に記載のタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムにおいて、上記立体構造予測ステップ（立体構造予測手段）により予測された立体構造予測データと、上記相互作用界面情報決定ステップ（相互作用界面情報決定手段）により決定された上記相互作用界面情報とを比較して、上記相互作用界面情報により特定された相互作用界面に対応する予測構造部分について評価を行い予測構造評価情報を決定する予測構造評価情報決定ステップ（予測構造評価情報決定手段）と、上記予測構造評価情報決定ステップ（予測構造評価情報決定手段）により決定された予測構造評価情報を出力する予測構造評価情報出力ステップ（予測構造評価情報出力手段）とをさらに含む（備える）ことを特徴とする。 The protein structure analysis method, protein structure analysis apparatus, and program according to the next invention are the protein structure analysis method, protein structure analysis apparatus, and program described above, in the three-dimensional structure prediction step (three-dimensional structure prediction means). Is compared with the interaction interface information determined in the interaction interface information determination step (interaction interface information determination means), and the interaction information specified by the interaction interface information is compared. A predicted structure evaluation information determining step (predicted structure evaluation information determining means) for evaluating the predicted structure portion corresponding to the working interface and determining predicted structure evaluation information, and the predicted structure evaluation information determining step (predicted structure evaluation information determining means). Predictive structure evaluation information output for outputting predicted structure evaluation information determined by -Up further comprises a (predicted structure evaluation information output unit) (comprises) can be characterized.

この方法、装置、および、プログラムによれば、予測された立体構造予測データと、決定された相互作用界面情報とを比較して、相互作用界面情報により特定された相互作用界面に対応する予測構造部分について評価を行い予測構造評価情報を決定し、決定された予測構造評価情報を出力するので、計算機により予測された予測構造を生化学実験データに基づいて評価を行うことができるようになり、予測精度を著しく向上させることができるようになる。 According to the method, the apparatus, and the program, the predicted three-dimensional structure prediction data is compared with the determined interaction interface information, and the predicted structure corresponding to the interaction interface specified by the interaction interface information is compared. Since the part is evaluated and the predicted structure evaluation information is determined and the determined predicted structure evaluation information is output, the predicted structure predicted by the computer can be evaluated based on the biochemical experiment data. The prediction accuracy can be remarkably improved.

また、これにより、ゲノム配列解析、ＤＮＡチップを用いた発現プロファイリング解析、プロテオーム解析などで見つかった興味深い遺伝子・タンパク質の機能解析を行う場合、立体構造を基準とした機能予測の精度が格段に向上するようになり、また、ゲノム配列解析においては、機能未知の遺伝子・タンパク質の立体構造を考慮した機能予測を行うことにより効率的に機能を推定することができるようになる。さらに、モデル構造と活性部位（相互作用部位）情報から、阻害剤などのドラッグデザインや蛋白質工学による活性向上、機能改変体のデザインが可能となる。 This also greatly improves the accuracy of function prediction based on the three-dimensional structure when performing functional analysis of interesting genes and proteins found by genome sequence analysis, expression profiling analysis using DNA chips, proteome analysis, etc. In genome sequence analysis, the function can be efficiently estimated by performing function prediction in consideration of the three-dimensional structure of a gene / protein whose function is unknown. Furthermore, from the model structure and active site (interaction site) information, drug design for inhibitors and the like, improvement of activity by protein engineering, and design of functional modifications are possible.

つぎの発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムは、上記に記載のタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムにおいて、上記対象タンパク質および／または上記化合物について基準振動計算を行う基準振動計算ステップ（基準振動計算手段）をさらに含み（備え）、上記処理結果出力ステップ（処理結果出力手段）は、上記基準振動計算ステップ（基準振動計算手段）による計算結果を対応する上記立体構造予測データの表示部分に関連付けて表示することを特徴とする。 The protein structure analysis method, protein structure analysis apparatus, and program according to the next invention are the above-described protein structure analysis method, protein structure analysis apparatus, and program. A reference vibration calculation step (reference vibration calculation means) for performing calculation is further included (provided), and the processing result output step (processing result output means) corresponds to the calculation result of the reference vibration calculation step (reference vibration calculation means). It is characterized by being displayed in association with the display portion of the three-dimensional structure prediction data.

この方法、装置、および、プログラムによれば、対象タンパク質および／または化合物について基準振動計算を行い、基準振動計算結果（例えば、揺らぎの情報など）を対応する立体構造予測データの表示部分に関連付けて表示する（例えば、立体構造モデル上にベクトル表示を行う）ので、相互作用界面領域として揺らぎの情報を考慮することができるようになる。すなわち、基準振動計算結果により揺らぎ情報を求め、ＭＳ実測データと比較することにより予測モデルの評価を行うことができるようになる。 According to this method, apparatus, and program, the reference vibration calculation is performed on the target protein and / or compound, and the reference vibration calculation result (for example, fluctuation information) is associated with the display portion of the corresponding three-dimensional structure prediction data. Since the information is displayed (for example, a vector is displayed on the three-dimensional structure model), information on fluctuation can be considered as the interaction interface region. That is, the prediction model can be evaluated by obtaining fluctuation information from the reference vibration calculation result and comparing it with the MS actual measurement data.

つぎの発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムは、上記に記載のタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムにおいて、上記断片化スペクトル測定ステップ（断片化スペクトル測定手段）は、上記対象タンパク質と上記化合物との上記複合体について水素−重水素交換反応を行った後、断片化し上記断片化スペクトルを測定し、上記相互作用界面情報決定ステップ（相互作用界面情報決定手段）は、上記断片化スペクトル測定ステップ（断片化スペクトル測定手段）により測定された上記断片化スペクトル、および、上記帰属情報決定ステップ（帰属情報決定手段）により決定された上記断片化イオンの上記帰属情報に従って、上記対象タンパク質および／または上記化合物における重水素化速度を決定し、当該重水素化速度に従って上記対象タンパク質および／または上記化合物において相互作用界面を特定し、当該相互作用界面に従って上記対象タンパク質および／または上記化合物における相互作用界面情報を決定することを特徴とする。 The protein structure analysis method, the protein structure analysis device, and the program according to the next invention are the above-described fragmentation spectrum measurement step (fragmentation spectrum measurement) in the protein structure analysis method, protein structure analysis device, and program described above. Means), after performing a hydrogen-deuterium exchange reaction on the complex of the target protein and the compound, fragmenting, measuring the fragmentation spectrum, and determining the interaction interface information (determination of interaction interface information) Means) the fragmentation spectrum measured by the fragmentation spectrum measurement step (fragmentation spectrum measurement means), and the attribution of the fragmentation ions determined by the attribution information determination step (attribute information determination means). According to the information, the target protein and / or above Determine the deuteration rate in the compound, identify the interaction interface in the target protein and / or the compound according to the deuteration rate, and interact interface information in the target protein and / or the compound according to the interaction interface It is characterized by determining.

これは断片化スペクトル測定の一例を一層具体的に示すものである。この方法、装置、および、プログラムによれば、対象タンパク質と化合物との複合体について水素−重水素交換反応を行った後、断片化し上記断片化スペクトルを測定し、測定された断片化スペクトル、および、決定された断片化イオンの帰属情報に従って、対象タンパク質および／または化合物における重水素化速度を決定し、当該重水素化速度に従って対象タンパク質および／または化合物において相互作用界面を特定し、当該相互作用界面に従って対象タンパク質および／または化合物における相互作用界面情報を決定するので、効率的に相互作用界面情報を求めることができるようになる。 This shows one example of the fragmentation spectrum measurement more specifically. According to this method, apparatus, and program, after a hydrogen-deuterium exchange reaction is performed on a complex of a target protein and a compound, the fragmentation is measured and the fragmentation spectrum is measured, the measured fragmentation spectrum, and Determining the deuteration rate in the target protein and / or compound according to the determined fragmentation ion assignment information, identifying the interaction interface in the target protein and / or compound according to the deuteration rate, and determining the interaction Since the interaction interface information in the target protein and / or compound is determined according to the interface, the interaction interface information can be efficiently obtained.

つぎの発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムは、上記に記載のタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムにおいて、上記断片化スペクトル測定ステップ（断片化スペクトル測定手段）は、上記対象タンパク質と上記化合物との上記複合体について化学修飾を行った後、断片化し上記断片化スペクトルを測定し、上記相互作用界面情報決定ステップ（相互作用界面情報決定手段）は、上記断片化スペクトル測定ステップ（断片化スペクトル測定手段）により測定された上記断片化スペクトル、および、上記帰属情報決定ステップ（帰属情報決定手段）により決定された上記断片化イオンの上記帰属情報に従って、上記対象タンパク質および／または上記化合物における化学修飾部分を決定し、当該化学修飾部分に従って上記対象タンパク質および／または上記化合物において相互作用界面を特定し、当該相互作用界面に従って上記対象タンパク質および／または上記化合物における相互作用界面情報を決定することを特徴とする。 The protein structure analysis method, the protein structure analysis device, and the program according to the next invention are the above-described fragmentation spectrum measurement step (fragmentation spectrum measurement) in the protein structure analysis method, protein structure analysis device, and program described above. Means) after chemically modifying the complex of the target protein and the compound, fragmenting and measuring the fragmentation spectrum, the interaction interface information determination step (interaction interface information determination means), According to the fragmentation spectrum measured by the fragmentation spectrum measurement step (fragmentation spectrum measurement means) and the attribution information of the fragmentation ions determined by the attribution information determination step (attribute information determination means) The target protein and / or the above compound Determining a chemical modification moiety, identifying an interaction interface in the target protein and / or the compound according to the chemical modification moiety, and determining interaction interface information in the target protein and / or the compound according to the interaction interface It is characterized by that.

これは断片化スペクトル測定の一例を一層具体的に示すものである。この方法、装置、および、プログラムによれば、対象タンパク質と化合物との複合体について化学修飾を行った後、断片化し断片化スペクトルを測定し、測定された断片化スペクトル、および、決定された断片化イオンの帰属情報に従って、対象タンパク質および／または化合物における化学修飾部分を決定し、当該化学修飾部分に従って対象タンパク質および／または化合物において相互作用界面を特定し、当該相互作用界面に従って対象タンパク質および／または化合物における相互作用界面情報を決定するので、効率的に相互作用界面情報を求めることができるようになる。 This shows one example of the fragmentation spectrum measurement more specifically. According to this method, apparatus, and program, after chemical modification is performed on a complex of a target protein and a compound, a fragmented fragmentation spectrum is measured, and the measured fragmentation spectrum and the determined fragment are determined. In accordance with the attribute information of the cation ion, a chemical modification moiety in the target protein and / or compound is determined, an interaction interface in the target protein and / or compound is identified according to the chemical modification moiety, and the target protein and / or in accordance with the interaction interface Since the interaction interface information in the compound is determined, the interaction interface information can be efficiently obtained.

つぎの発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムは、上記に記載のタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムにおいて、上記断片化スペクトル測定ステップ（断片化スペクトル測定手段）は、ヘキサポールＣＩＤ法、ノズルスキマーＣＩＤ法、キャピラリースキマーＣＩＤ法、ＳＯＲＩ−ＣＩＤ法、および、マルチポールストアー補助キャピラリースキマーＣＩＤ法を含む衝突活性化解離（ＣＩＤ）法、ＩＲＭＰＤ法、インソース分解（ＩＳＤ）法、ポストソース分解（ＰＳＤ）法、表面誘起解離（ＳＩＤ）法、ＥＣＤ法、および、ＢＩＲＤ法のうち少なくとも一つの方法により上記対象タンパク質と上記化合物との上記複合体について断片化イオンに断片化し上記断片化スペクトルを測定することを特徴とする。 The protein structure analysis method, the protein structure analysis device, and the program according to the next invention are the above-described fragmentation spectrum measurement step (fragmentation spectrum measurement) in the protein structure analysis method, protein structure analysis device, and program described above. Means) include a hexapole CID method, a nozzle skimmer CID method, a capillary skimmer CID method, a SORI-CID method, and a collision activated dissociation (CID) method including a multipole store auxiliary capillary skimmer CID method, an IRMPD method, an in-source method Fragmentation of the complex of the target protein and the compound by at least one of decomposition (ISD) method, post-source decomposition (PSD) method, surface induced dissociation (SID) method, ECD method, and BIRD method Fragmented into ions above And measuring the fragmented spectrum.

これは断片化スペクトル測定の一例を一層具体的に示すものである。この方法、装置、および、プログラムによれば、立体構造を予測する対象タンパク質についてヘキサポールＣＩＤ法、ノズルスキマーＣＩＤ法、キャピラリースキマーＣＩＤ法、ＳＯＲＩ−ＣＩＤ法、および、マルチポールストアー補助キャピラリースキマーＣＩＤ法を含む衝突活性化解離（ＣＩＤ）法、ＩＲＭＰＤ法、インソース分解（ＩＳＤ）法、ポストソース分解（ＰＳＤ）法、表面誘起解離（ＳＩＤ）法、ＥＣＤ法、および、ＢＩＲＤ法のうち少なくとも一つの方法により対象タンパク質と化合物との複合体について断片化イオンに断片化し上記断片化スペクトルを測定するので、対象タンパク質の立体構造を反映した断片化イオンを効率的に作成することができるようになる。 This shows one example of the fragmentation spectrum measurement more specifically. According to this method, apparatus, and program, a hexapole CID method, a nozzle skimmer CID method, a capillary skimmer CID method, a SORI-CID method, and a multipole store-assisted capillary skimmer CID method are used for a target protein that predicts a three-dimensional structure. At least one of collision activated dissociation (CID) method, IRMPD method, in-source decomposition (ISD) method, post-source decomposition (PSD) method, surface-induced dissociation (SID) method, ECD method, and BIRD method Since the complex of the target protein and the compound is fragmented into fragment ions by the method and the fragmentation spectrum is measured, fragment ions reflecting the three-dimensional structure of the target protein can be efficiently created.

つぎの発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムは、上記に記載のタンパク質構造解析方法、タンパク質構造解析装置、および、プログラムにおいて、上記断片化スペクトル測定ステップ（断片化スペクトル測定手段）は、酵素反応により上記対象タンパク質と上記化合物の上記複合体について断片化を行い上記断片化スペクトルを測定することを特徴とする。 The protein structure analysis method, the protein structure analysis device, and the program according to the next invention are the above-described fragmentation spectrum measurement step (fragmentation spectrum measurement) in the protein structure analysis method, protein structure analysis device, and program described above. (Means) is characterized in that the complex of the target protein and the compound is fragmented by an enzymatic reaction and the fragmentation spectrum is measured.

これは断片化スペクトル測定の一例を一層具体的に示すものである。この方法、装置、および、プログラムによれば、酵素反応により対象タンパク質と化合物の複合体について断片化を行い、Ｈ／Ｄ交換法や化学修飾法を組み合わせることにより、相互作用界面情報等を効率的に得ることができるようになる。 This shows one example of the fragmentation spectrum measurement more specifically. According to this method, apparatus, and program, fragmentation is performed on a complex of a target protein and a compound by an enzymatic reaction, and the interaction interface information is efficiently obtained by combining the H / D exchange method and the chemical modification method. To be able to get to.

また本発明は記録媒体に関するものであり、本発明かかる記録媒体は、上記に記載されたプログラムをコンピュータに実行させるための記録媒体であることを特徴とする。 The present invention also relates to a recording medium, and the recording medium according to the present invention is a recording medium for causing a computer to execute the program described above.

この記録媒体によれば、当該記録媒体をコンピュータに読み取らせて実行することによって、上記に記載されたプログラムをコンピュータを利用して実現することができ、これら各プログラムと同様の効果を得ることができる。 According to this recording medium, by causing the computer to read and execute the recording medium, the program described above can be realized using the computer, and the same effect as each of the programs can be obtained. it can.

以下に、本発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、プログラム、および、記録媒体の実施の形態を図面に基づいて詳細に説明する。なお、この実施の形態によりこの発明が限定されるものではない。 Hereinafter, embodiments of a protein structure analysis method, a protein structure analysis apparatus, a program, and a recording medium according to the present invention will be described in detail with reference to the drawings. Note that the present invention is not limited to the embodiments.

特に、以下の実施の形態においては、本発明を、インターロイキン−６等に適用した例について説明するが、この場合に限られず、全てのタンパク質において、同様に適用することができる。 In particular, in the following embodiment, an example in which the present invention is applied to interleukin-6 and the like will be described. However, the present invention is not limited to this case, and can be similarly applied to all proteins.

［本発明の概要］
以下、本発明の概要について説明し、その後、本発明の構成および処理等について詳細に説明する。図１は本発明の基本原理を示す原理構成図である。 [Outline of the present invention]
Hereinafter, the outline of the present invention will be described, and then the configuration and processing of the present invention will be described in detail. FIG. 1 is a principle configuration diagram showing the basic principle of the present invention.

本発明は、概略的に、以下の基本的特徴を有する。まず、立体構造を予測する対象の立体構造予測対象タンパク質（以下、「対象タンパク質」という。）について、ＭＳによる各種の断片化法や酵素消化法などにより断片化し、断片化スペクトルを測定する。 The present invention generally has the following basic features. First, a three-dimensional structure prediction target protein (hereinafter referred to as “target protein”) for which a three-dimensional structure is predicted is fragmented by various fragmentation methods or enzyme digestion methods using MS, and a fragmentation spectrum is measured.

対象タンパク質のＭＳによる測定は、エレクトロスプレーイオン化法、レーザースプレーイオン化法、ソニックスプレーイオン化法、大気圧化学イオン化法等のスプレー方式のイオン化法、ＭＡＬＤＩ法、およびその他これらに相当するイオン化法のうち少なくともひとつの手法によりイオン化した後、当該生成したイオンをヘキサポール（ｈｅｘａｐｏｌｅ）ＣＩＤ法（ノズルスキマーＣＩＤ法、キャピラリースキマーＣＩＤ法、マルチポールストアー補助キャピラリースキマーＣＩＤ法と類似の手法である）、ＳＯＲＩ−ＣＩＤ法を含む衝突活性化解離（ＣＩＤ）法、ＩＲＭＰＤ法、インソース分解（ＩＳＤ）法、ポストソース分解（ＰＳＤ）、表面誘起解離（ＳＩＤ）法、ＥＣＤ（電子補足解離）法、ＢＩＲＤ（黒体赤外放射解離）法、およびその他これらに相当する断片化法のうち少なくとも一つの手法により対象タンパク質について断片化イオンに断片化し、断片化スペクトルを測定してもよく、また、ペプシンやトリプシンなどを用いた酵素消化法により対象タンパク質について断片化を行い断片化スペクトルを測定してもよい。 The target protein is measured by MS using at least one of an ionization method such as an electrospray ionization method, a laser spray ionization method, a sonic spray ionization method, an atmospheric pressure chemical ionization method, a MALDI method, and other equivalent ionization methods. After ionization by one method, the generated ions are subjected to hexapole CID method (similar method to nozzle skimmer CID method, capillary skimmer CID method, multipole store auxiliary capillary skimmer CID method), SORI-CID. Collision-induced dissociation (CID) method, IRMPD method, in-source decomposition (ISD) method, post-source decomposition (PSD), surface-induced dissociation (SID) method, ECD (electron supplemental dissociation) method, BIRD (black body) Infrared radiation dissociation method In addition, the target protein may be fragmented into fragment ions by measuring at least one of the corresponding fragmentation methods, and the fragmentation spectrum may be measured, or the target protein may be obtained by enzymatic digestion using pepsin or trypsin. The protein may be fragmented and the fragmentation spectrum may be measured.

キャピラリースキマーＣＩＤ法は、イオン化とイオン導入部にあるキャピラリーとスキマー間の電圧を利用して断片化を誘起するインソースフラグメンテーション法である（Ｓ．Ａｋａｓｈｉ，Ａｎａｌ．Ｃｈｅｍ．，７１，４９７４−４９８０（１９９９）参照）。 The capillary skimmer CID method is an in-source fragmentation method in which fragmentation is induced by using the voltage between the capillary and the skimmer in the ionization and ion introduction section (S. Akashi, Anal. Chem., 71, 4974-4980). 1999)).

また、ヘキサポールＣＩＤ法はさらにヘキサポールにイオンを一定時間溜め込むことにより、キャピラリースキマーＣＩＤ法の効率を向上させる方法である（Ｋ．Ａ．，Ｓａｎｎｅｓ−Ｌｏｗｅｒｙ，Ｊ．Ａｍ．Ｓｏｃ．ＭａｓｓＳｐｅｃｔｒｏｍ．，１１，１−９（２０００）参照）。 Further, the hexapole CID method is a method for further improving the efficiency of the capillary skimmer CID method by storing ions in the hexapole for a certain period of time (KA, Sannes-Lowery, J. Am. Soc. Mass Spectrom. , 11, 1-9 (2000)).

また、ＩＲＭＰＤ法は、イオンに炭酸ガスレーザーを照射することにより、フラグメンテーションを誘起する方法である（Ｃ．Ｐ．Ｄｕｆｒｅｓｎｅ，Ｊ．Ａｍ．Ｓｏｃ．ＭａｓｓＳｐｅｃｔｒｏｍ．，９，１２２２−１２２５（１９９８）参照）。 Further, the IRMPD method is a method of inducing fragmentation by irradiating a carbon dioxide laser to ions (see CP Dufresne, J. Am. Soc. Mass Spectrom., 9, 1222-1225 (1998)). ).

そして、上記手法を用いて得られた断片化スペクトルにおいて、各フラグメントの帰属を行う。すなわち、測定された断片化スペクトルに基づいて、対象タンパク質のアミノ酸配列に対する断片化イオンの帰属情報を決定する。ここで、対象タンパク質のアミノ酸配列は、ＰＤＢに格納されている場合にはその配列を用いてもよく、また、ＭＡＬＤＩ−ＴＯＦ−ＭＳやＭＳ／ＭＳなどによるアミノ酸配列同定手法を用いることにより対象タンパク質のアミノ酸配列を同定してもよい。 Then, each fragment is assigned in the fragmentation spectrum obtained using the above method. That is, based on the measured fragmentation spectrum, the attribution information of the fragmented ions with respect to the amino acid sequence of the target protein is determined. Here, the amino acid sequence of the target protein may be used when stored in the PDB, and the target protein can be obtained by using an amino acid sequence identification method such as MALDI-TOF-MS or MS / MS. May be identified.

そして、決定された断片化イオンの帰属情報により、対象タンパク質のアミノ酸配列において断片化イオンに断片化された箇所を特定し、当該箇所に従って対象タンパク質のアミノ酸配列における切断容易領域情報を決定する。 And the location fragmented by the fragmentation ion in the amino acid sequence of the target protein is specified by the determined fragmentation ion attribution information, and easy-cleavage region information in the amino acid sequence of the target protein is determined according to the location.

ここで、図６は、ヒトのインターロイキン−６（ＩＬ−６）について断片化イオンの帰属情報を求め切断容易領域情報を決定する際の概念図である。図６において、ＩＲＭＰＤ法により断片化を行った際の断片化イオンのフラグメントを点線で、ヘキサポールＣＩＤ法により断片化を行った際の断片化イオンのフラグメントを実線で表している。また、これらの断片化イオンの切断部位に基づいて求めた切断容易領域情報を斜線で表している。 Here, FIG. 6 is a conceptual diagram when determining fragmentable ion attribution information for human interleukin-6 (IL-6) and determining easy-cleavage region information. In FIG. 6, fragmented ion fragments when fragmented by the IRMPD method are indicated by dotted lines, and fragmented ion fragments when fragmented by the hexapole CID method are indicated by solid lines. In addition, easy-cleavage region information obtained based on the cleavage sites of these fragmented ions is indicated by hatching.

そして、対象タンパク質について立体構造を予測する。ここで、立体構造予測は、ホモロジーモデリング法、分子シミュレーション法、アブイニシオ法、二次構造予測法、３Ｄ−１Ｄ法、スレッディング法などの既存の立体構造予測方法のいずれを用いてもよい。以下に、配列の相同性に基づく立体構造予測手法を一例に説明する。 Then, a three-dimensional structure is predicted for the target protein. Here, the three-dimensional structure prediction may use any of the existing three-dimensional structure prediction methods such as a homology modeling method, a molecular simulation method, an ab initio method, a secondary structure prediction method, a 3D-1D method, and a threading method. Hereinafter, a three-dimensional structure prediction method based on sequence homology will be described as an example.

ここで、立体構造予測対象タンパク質のアミノ酸配列（遺伝子の場合は、ＤＮＡ配列から得られるアミノ酸配列）と、類似のアミノ酸配列を有する既知立体構造の検索等には、ＦＡＳＴＡ（ＰｅａｒｓｏｎＷＲ，ＭｅｔｈｏｄｓＥｎｚｙｍｏｌ，２６６，２２７−２５８，１９９６）、ＰＳＩ−ＢＬＡＳＴ（ＳｃｈａｆｆｅｒＡＡ，ＷｏｌｆＹＩ，ＰｏｎｔｉｎｇＣＰ，ＫｏｏｎｉｎＥＶ，ＡｒａｖｉｎｄＬａｎｄＡｌｔｓｃｈｕｌＳＦ，Ｂｉｏｉｎｆｏｒｍａｔｉｃｓ，１２，１０００−１０１１，１９９９）、ＬＩＢＲＡ（Ｏｔａ，Ｍ．ａｎｄＮｉｓｈｉｋａｗａ，Ｋ．，ＰｒｏｔｅｉｎＥｎｇｉｎｅｅｒｉｎｇ，１０，３３９−３５１，１９９７）、ＲＢＳ−ＢＬＡＳＴ（ＳｃｈａｆｆｅｒＡＡ，ＷｏｌｆＹＩ，ＰｏｎｔｉｎｇＣＰ，ＫｏｏｎｉｎＥＶ，ＡｒａｖｉｎｄＬａｎｄＡｌｔｓｃｈｕｌＳＦ，Ｂｉｏｉｎｆｏｒｍａｔｉｃｓ，１２，１０００−１０１１，１９９９）、ＩＭＰＡＬＡ（Ａ．Ａ．Ｓｃｈａｆｆｅｅｔａｌ．，Ｂｉｏｉｎｆｏｒｍａｔｉｃｓ，１５（１２），１０００−１０１１，１９９９）、ＨＭＭＥＲ（Ｒ．Ｄｕｒｂｉｎ，Ｓ．Ｅｄｄｙ，Ａ．Ｋｒｏｇｈ，ａｎｄＧ．Ｍｉｔｃｈｉｓｏｎ，ＣａｍｂｒｉｄｇｅＵｎｉｖｅｒｓｉｔｙＰｒｅｓｓ，１９９８，Ｓ．Ｒ．Ｅｄｄｙ．Ｂｉｏｉｎｆｏｒｍａｔｉｃｓ，１４，７５５−７６３，１９９８）、または、ＣｌｕｓｔａｌＷ（Ｔｈｏｍｏｐｓｏｎ，Ｊ．Ｄ．，Ｄ．Ｇ．Ｈｉｇｇｉｎｓ，ａｎｄＴ．Ｊ．Ｇｉｂｓｏｎ，ＮｕｃｌｅｉｃＡｃｉｄｓＲｅｓ．２２，４６７３−４６８０，１９９４）等などのコンピュータプログラムを用いてもよい。 Here, FASTA (Pearson WR, Methods Enzymol, 266, 227-258, 1996), PSI-BLAST (Schaffer AA, Wolf YI, Ponting CP, Koonin EV, Aravind L and Altschul SF, Bioinformatics, 12, 1000-101, aI. , K., Protein Engineering, 10, 339-351, 1997), RBS-BLAST (Schaffer AA, olf YI, Ponting CP, Koonin EV, Aravind L and Altschul SF, Bioinformatics, 12, 1000-1011, 1999), IMPALA (A. A. Schaffet et al., Bioinformatics, 1000 (101), 1999 (101), 15 (19), 1999). HMMER (R. Durbin, S. Eddy, A. Krogh, and G. Mitchison, Cambridge University Press, 1998, SR Eddy. Bioinformatics, 14, 755-763, 1998), or ClustalW (Th. D., D. G. Higgins, and T. J. Gibson, Nuclei c Acids Res., 22, 4673-4680, 1994) and the like may be used.

検索対象には、予めＰＤＢ（ＰｒｏｔｅｉｎＤａｔａＢａｓｅ）を加工したデータベースを用いることが好ましい。加工データベースはＰＤＢに登録された全てのタンパク質配列を配列相同性（ホモロジー）９５％以上のクラスに分類し、各クラスの中で最も実験精度の高いものを代表配列として抽出する操作によって作成する。この加工データベースを用いることで、ホモロジー検索の効率が増大するとともに、検索件数を制限した場合でも検索される立体構造の多様性・バラエティを確保できる。 It is preferable to use a database obtained by processing PDB (Protein Data Base) in advance as a search target. The processing database is created by classifying all protein sequences registered in the PDB into classes having a sequence homology (homology) of 95% or more, and extracting the most experimental accuracy of each class as a representative sequence. By using this processing database, the efficiency of the homology search is increased, and the diversity and variety of the three-dimensional structures searched can be ensured even when the number of searches is limited.

検索プログラムによる出力結果から、立体構造予測対象タンパク質と参照タンパク質配列とのペアワイズアライメントを検索件数分だけ個々に作成する。ペアワイズアライメントごとに、参照タンパク質の立体構造における２次構造領域を同定し、参照タンパク質配列上に照会する。なお、立体構造予測対象タンパク質と複数の参照タンパク質配列とに基づいて、マルチプルアライメントを行ってもよい。 From the output result of the search program, pairwise alignments between the three-dimensional structure prediction target protein and the reference protein sequence are individually created for the number of searches. For each pair-wise alignment, a secondary structure region in the three-dimensional structure of the reference protein is identified and queried on the reference protein sequence. Multiple alignment may be performed based on the three-dimensional structure prediction target protein and a plurality of reference protein sequences.

そして、２次構造情報を表示したアライメント結果に、ＭＳの断片化データにより求めた切断容易領域情報を照会する。すなわち、ＭＳで得られた断片化イオンの両端は容易に切断されやすい部位であり、立体構造上においては運動性の高いループ部分に相当する可能性が高いと考えられる部位である。一方、ＭＳで得られた断片化イオンの両端に挟まれた部分は、断片化が起こりにくい部位であることから、αへリックスやβストランドなどの２次構造を有している部位に相当する可能性が高いと考えられる部位である。 Then, the easy-to-cut region information obtained from the MS fragmentation data is referred to the alignment result displaying the secondary structure information. That is, both ends of the fragmented ion obtained by MS are sites that are easily cleaved and are sites that are highly likely to correspond to loop portions with high mobility in terms of the three-dimensional structure. On the other hand, since the part sandwiched between both ends of the fragmented ions obtained by MS is a part where fragmentation hardly occurs, it corresponds to a part having a secondary structure such as α helix or β strand. It is a site that is considered highly likely.

そして、すべての断片化イオンの情報を加味して、候補タンパク質の評価（ランク付け）を行う。 Then, the candidate proteins are evaluated (ranked) in consideration of information on all fragmented ions.

続いて、評価の高い候補タンパク質の立体構造を基盤として、予測対象タンパク質の立体構造予測をホモロジーモデリング法により行う。ここで、立体構造予測プログラムとしては、「ＦＡＭＳ」（Ｏｇａｔａ，Ｋ．ａｎｄＵｍｅｙａｍａ，Ｈ．，Ｊ．Ｍｏｌ．ＧｒａｐｈｉｃｓＭｏｄ．１８，２５８−２７２，２０００）、Ｓｗｉｓｓ−Ｍｏｄｅｌ（ＰｅｉｔｓｃｈＭＣ．，Ｂｉｏｃｈｅｍ．Ｓｏｃ．Ｔｒａｎｓ．２４，２７４−２７９，１９９６）、ＣＰＨｍｏｄｅｌｓ（Ｏ．Ｌｕｎｄ，Ｋ．Ｆｒｉｍａｎｄ，Ｊ．Ｇｏｒｏｄｋｉｎ，Ｈ．Ｂｏｈｒ，Ｊ．Ｂｏｈｒ，Ｊ．Ｈａｎｓｅｎ，ａｎｄＳ．Ｂｒｕｎａｋ．，ＰｒｏｔｅｉｎＥｎｇｉｎｅｅｒｉｎｇ，１０，１２４１−１２４８，１９９７）、「ＳＡＭ−Ｔ９８」（Ｊ．Ｐａｒｋ，Ｋ．Ｋａｒｐｌｕｓ，Ｃ．Ｂａｒｒｅｔｔ，Ｒ．Ｈｕｇｈｅｙ，Ｄ．Ｈａｕｓｓｌｅｒ，Ｔ．Ｈｕｂｂａｒｄ，ａｎｄＣ．Ｃｈｏｔｈｉａ，ＪＭＢ２８４（４），１２０１−１２１０，１９９８）、「ＭＯＤＥＬＬＥＲ」（Ａ．Ｓａｌｉ，Ｌ．Ｐｏｔｔｅｒｔｏｎ，Ｆ．Ｙｕａｎ，Ｈ．ｖａｎＶｌｉｊｍｅｎ，ａｎｄＭ．Ｋａｒｐｌｕｓ，Ｐｒｏｔｅｉｎｓ，２３，３１８−３２６，１９９５）などを採用してもよい。また、ＤａｎｉｅｌＦｉｓｃｈｅｒら「ＣＡＦＡＳＰ２ＴｈｅＳｅｃｏｎｄＣｒｉｔｉｃａｌＡｓｓｅｓｓｍｅｎｔｏｆＦｕｌｌｙＡｕｔｏｍａｔｅｄＳｔｒｕｃｔｕｒｅＰｒｅｄｉｃｔｉｏｎＭｅｔｈｏｄｓ、ＰＲＯＴＥＩＮＳ、５；１７１−１８３（２００１）」において開示または参照された各種の立体構造予測プログラムを用いてもよい。 Subsequently, based on the three-dimensional structure of the highly evaluated candidate protein, the three-dimensional structure of the protein to be predicted is predicted by the homology modeling method. Here, as a three-dimensional structure prediction program, “FAMS” (Ogata, K. and Umeyama, H., J. Mol. Graphics Mod. 18, 258-272, 2000), Swiss-Model (Peitsch MC., Biochem. Soc.Trans.24,274-279, 1996), CPHmodels (O. Lund, K. Friand, J. Gorodkin, H. Bohr, J. Bohr, J. Hansen, and S. Brunak., Protein Engineering, 10, Protein Engineering). 1241-1248, 1997), “SAM-T98” (J. Park, K. Karplus, C. Barrett, R. Hughhey, D. Haussler, T. Hubbard, and C. C. othia, JMB 284 (4), 1201-1210, 1998), “MODELLER” (A. Sali, L. Potterton, F. Yuan, H. van Vlijmen, and M. Karplus, Proteins, 23, 318-326, 1995). ) Etc. may be adopted. In addition, the three-dimensional structure may be disclosed or referred to in Daniel Fischer et al., “CAFASP2 The Second Critical Assessment of Full Automated Structure Prediction Methods, PROTEINS, 5; 171-183 (2001)”.

そして、候補タンパク質について、予測された立体構造予測データと決定された切断容易領域情報とを対応させて出力する（例えば、グラフィック表示、テーブルによる一覧表示など）。例えば、立体構造予測データを針金モデル、リボンモデル、パイプモデル、ボールアンドスティックモデル、または、空間充填モデルのうちいずれかのモデルによりグラフィック表示し、切断容易領域情報を対応する立体構造予測データの表示部分に関連付けて表示（例えば、リンクを設定、モデル上に表示、モデル上に切断容易領域情報に対応する網掛けや斜線などの特定のパターン模様を表示など）してもよく、また、グラフィック表示やテーブル表示を行う際に、切断容易領域情報に従って対応する上記立体構造予測データの表示部分を色分けして表示してもよい。 For the candidate protein, the predicted three-dimensional structure prediction data and the determined easy-cleavage region information are output in correspondence with each other (for example, graphic display, list display by table, etc.). For example, the three-dimensional structure prediction data is displayed graphically as a wire model, ribbon model, pipe model, ball-and-stick model, or space filling model, and the three-dimensional structure prediction data corresponding to the easy-to-cut area information is displayed. It may be displayed in association with the part (for example, a link is set, displayed on the model, a specific pattern such as shading or diagonal lines corresponding to the easy-to-cut area information is displayed on the model, etc.), or a graphic display When the table is displayed, the display portion of the corresponding three-dimensional structure prediction data may be displayed in different colors according to the easy-to-cut area information.

予測された立体構造予測データと、決定された切断容易領域情報とを比較して、切断容易領域情報により特定された切断容易領域に対応する予測構造部分について評価を行い予測構造評価情報を決定し、決定された予測構造評価情報を出力する。すなわち、得られた予測構造とＭＳの断片化データを照会し、予測構造を評価する。 The predicted three-dimensional structure prediction data is compared with the determined easily cleavable region information, and the predicted structure evaluation information corresponding to the easily cleavable region specified by the easily cleavable region information is evaluated to determine the predicted structure evaluation information. The determined predicted structure evaluation information is output. That is, the obtained predicted structure and MS fragmentation data are queried to evaluate the predicted structure.

また、例えばホモロジーが高い候補タンパク質については、断片化スペクトルをＭＳで測定し、立体構造予測対象タンパク質の断片化スペクトルと比較することにより、さらに精度が高い評価を行うことが可能である。 Further, for example, for a candidate protein with high homology, it is possible to perform evaluation with higher accuracy by measuring the fragmentation spectrum by MS and comparing it with the fragmentation spectrum of the three-dimensional structure prediction target protein.

さらに、本発明は、立体構造を予測する対象タンパク質または複合体について水素−重水素交換反応を行い、断片化イオンに断片化し、断片化スペクトルを測定する。 Furthermore, the present invention performs a hydrogen-deuterium exchange reaction on a target protein or complex for which a three-dimensional structure is predicted, fragments into fragmented ions, and measures a fragmentation spectrum.

ここで、図１９は、ヒトのインターロイキン−６（ＩＬ−６）と、抗ＩＬ−６抗体との複合体（ｃｏｍｐｌｅｘ）について、水素−重水素交換反応を行い、エレクトロスプレーイオン化法により断片化イオンに断片化し、断片化スペクトルを測定する際の概念図である。 Here, FIG. 19 shows that a complex of human interleukin-6 (IL-6) and anti-IL-6 antibody is subjected to hydrogen-deuterium exchange reaction and fragmented by electrospray ionization. It is the conceptual diagram at the time of fragmenting into ion and measuring a fragmentation spectrum.

そして、測定された断片化スペクトルに基づいて、対象タンパク質のアミノ酸配列に対する断片化イオンの帰属情報を決定し、測定された断片化スペクトル、および、決定された断片化イオンの帰属情報に従って、対象タンパク質のアミノ酸配列中の各アミノ酸残基における重水素化速度を決定する。 Then, based on the measured fragmentation spectrum, the fragmentation ion attribution information for the amino acid sequence of the target protein is determined, and the target protein is determined according to the measured fragmentation spectrum and the determined fragmentation ion attribution information. The deuteration rate at each amino acid residue in the amino acid sequence is determined.

ここで、図２０は、水素−重水素交換反応を行ったヒトのＩＬ−６の断片化スペクトルから重水素化速度を求める際の概念図である。図２０に示すように、水素−重水素交換反応を行ったヒトのＩＬ−６の断片化スペクトル（上段）と、水素−重水素交換反応を行わない場合のヒトのＩＬ−６の断片化スペクトル（下段）の重心値の変化から重水素化率（Ｄ化率）を決定し、経時的に求めた重水素化率から重水素化速度を計算する。 Here, FIG. 20 is a conceptual diagram when the deuteration rate is obtained from the fragmentation spectrum of human IL-6 subjected to the hydrogen-deuterium exchange reaction. As shown in FIG. 20, the fragmentation spectrum (upper stage) of human IL-6 that had undergone a hydrogen-deuterium exchange reaction and the fragmentation spectrum of human IL-6 that had not undergone a hydrogen-deuterium exchange reaction The deuteration rate (D conversion rate) is determined from the change in the center of gravity value (lower), and the deuteration rate is calculated from the deuteration rate obtained over time.

そして、決定されたアミノ酸配列中の各アミノ酸残基における重水素化速度に従って、対象タンパク質のアミノ酸配列において溶媒に接触している箇所を特定し、当該箇所に従って対象タンパク質のアミノ酸配列における溶媒接触残基情報を決定する。溶媒接触残基情報は、例えば、水素−重水素交換していないコントロール実験と比較して重水素化速度が遅くなっている部位を溶媒接触残基情報として決定してもよい。 Then, in accordance with the deuteration rate at each amino acid residue in the determined amino acid sequence, the location in contact with the solvent in the amino acid sequence of the target protein is specified, and the solvent contact residue in the amino acid sequence of the target protein according to the location Determine information. For the solvent contact residue information, for example, a site where the deuteration rate is slow compared to a control experiment in which hydrogen-deuterium exchange is not performed may be determined as the solvent contact residue information.

また、複合体の場合には、決定されたアミノ酸配列中の各アミノ酸残基における重水素化速度に従って、対象タンパク質のアミノ酸配列における相互作用界面情報を決定する。相互作用界面情報は、例えば、複合体を形成していない単独時に測定した場合と比較して、重水素化速度が遅くなっている部位を相互作用界面情報として決定してもよい。また、水素−重水素交換していないコントロール実験と比較して重水素化速度が遅くなっている部位を相互作用界面情報として決定してもよい。 In the case of a complex, interaction interface information in the amino acid sequence of the target protein is determined according to the deuteration rate at each amino acid residue in the determined amino acid sequence. As for the interaction interface information, for example, a site where the deuteration rate is slow may be determined as the interaction interface information as compared with the case where measurement is performed when the complex is not formed alone. Alternatively, a site where the deuteration rate is slow compared to a control experiment in which hydrogen-deuterium exchange is not performed may be determined as interaction interface information.

このようにして、決定した溶媒接触残基情報および相互作用界面情報を、予測データとともに出力したり、予測構造評価情報を決定したりする際に利用する。すなわち、予測された立体構造予測データと決定された溶媒接触残基情報や相互作用界面情報とを対応させて出力し（例えば、グラフィック表示、テーブルによる一覧表示など）、また、予測された立体構造予測データと、決定された溶媒接触残基情報や相互作用界面情報とを比較して、溶媒接触残基情報により特定された溶媒接触残基や相互作用界面情報により特定された相互作用界面に対応する予測構造部分について評価を行い予測構造評価情報を決定し、決定された予測構造評価情報を出力する。 Thus, the determined solvent contact residue information and interaction interface information are output together with the prediction data, or used when determining the predicted structure evaluation information. That is, the predicted three-dimensional structure prediction data is output in association with the determined solvent contact residue information and interaction interface information (for example, graphic display, list display by table, etc.), and the predicted three-dimensional structure Comparing the predicted data with the determined solvent contact residue information and interaction interface information to support the interaction interface specified by the solvent contact residue and interaction interface information specified by the solvent contact residue information The predicted structure portion to be evaluated is evaluated to determine the predicted structure evaluation information, and the determined predicted structure evaluation information is output.

ここで、図１２は、ヒトのインターロイキン−６（ＩＬ−６）について、立体構造予測データをリボンモデルによりグラフィック表示し、切断容易領域情報に従って対応する上記立体構造予測データの表示部分を色分けして表示する場合の表示画面の一例を示す図である。図１２において、切断容易領域情報を黒色で、また、相互作用界面情報を灰色で示している。 Here, FIG. 12 graphically displays the three-dimensional structure prediction data with a ribbon model for human interleukin-6 (IL-6), and color-codes the corresponding three-dimensional structure prediction data display portion according to the easy-to-cut region information. It is a figure which shows an example of the display screen in the case of displaying. In FIG. 12, the easy-to-cut region information is shown in black, and the interaction interface information is shown in gray.

また、図１７は、処理結果についてテーブルによる一覧表示を行う場合の表示画面の一例を示す図である。この図に示すように処理結果一覧表示画面は、例えば、タンパク質のアミノ酸残基番号を表示するための表示領域ＭＡ−１、本発明により決定された切断容易領域を表示するための表示領域ＭＡ−２、本発明により決定された溶媒接触残基情報を表示するための表示領域ＭＡ−３、本発明により決定された相互作用界面情報を表示するための表示領域ＭＡ−４、構造予測手法による立体構造予測データ（例えば、二次構造など）を表示するための表示領域ＭＡ−５、本発明により決定された立体構造予測データに対する予測構造評価情報を表示するための表示領域ＭＡ−６などを含んで構成されている。本図に示すように、タンパク質のアミノ酸残基毎にこれらの情報をテーブル形式で一覧表示してもよい。また、相互作用界面情報を表示する場合には、１対多数の相互作用を区別して表示することが望ましい（例えば、Ａ，Ｂ，Ｃ，，，，，という表示方法を採用してもよい）。 FIG. 17 is a diagram illustrating an example of a display screen when a list of processing results is displayed using a table. As shown in this figure, the processing result list display screen includes, for example, a display area MA-1 for displaying amino acid residue numbers of proteins, and a display area MA- for displaying easy-to-cut areas determined according to the present invention. 2. Display area MA-3 for displaying solvent contact residue information determined by the present invention, display area MA-4 for displaying interaction interface information determined by the present invention, three-dimensional structure by structure prediction method A display area MA-5 for displaying structure prediction data (for example, secondary structure), a display area MA-6 for displaying prediction structure evaluation information for the three-dimensional structure prediction data determined by the present invention, and the like are included. It consists of As shown in the figure, these pieces of information may be displayed in a table format for each amino acid residue of the protein. Further, when displaying the interaction interface information, it is desirable to distinguish and display one-to-many interactions (for example, a display method of A, B, C,... May be adopted). .

また、切断容易領域や、Ｈ／Ｄ交換しやすい領域が生じる要因としては、揺らぎの情報も含んでいると考えられる。従って、また、本発明は、揺らぎ情報として基準振動計算を実行し、基準振動計算結果とＭＳデータとを比較して、モデルの評価等をすることも可能である。よって、本発明は、対象タンパク質について各種の既知のコンピュータプログラム（例えば、富士通株式会社（商標）の分子軌道計算プログラムＭＯＰＡＣ（製品名）等）などを用いて基準振動計算を行い、基準振動計算結果（例えば、揺らぎの情報）を対応する立体構造予測データの表示部分に関連付けて表示する（例えば、立体構造モデル上にベクトル表示を行う）ので、切断容易領域として揺らぎの情報を考慮することができるようになる。すなわち、基準振動計算結果により揺らぎ情報を求め、ＭＳ実測データと比較することにより予測モデルの評価を行うことができるようになる。 In addition, it is considered that information on fluctuation is included as a factor that causes an easy cutting region and a region where H / D replacement is easy. Therefore, according to the present invention, it is also possible to perform a reference vibration calculation as fluctuation information, compare the reference vibration calculation result with MS data, and evaluate the model. Therefore, the present invention performs the reference vibration calculation for the target protein using various known computer programs (for example, the molecular orbit calculation program MOPAC (product name) of Fujitsu Limited (trademark), etc.), and the reference vibration calculation result (For example, fluctuation information) is displayed in association with the display portion of the corresponding three-dimensional structure prediction data (for example, vector display is performed on the three-dimensional structure model), so that the fluctuation information can be considered as the easy-to-cut region. It becomes like this. That is, the prediction model can be evaluated by obtaining fluctuation information from the reference vibration calculation result and comparing it with the MS actual measurement data.

［システム構成］
まず、本システムの構成について説明する。図２は、本発明が適用される本システムの構成の一例を示すブロック図であり、該構成のうち本発明に関係する部分のみを概念的に示している。本システムは、概略的に、タンパク質構造解析装置１００と、配列情報等に関する外部データベースやホモロジー検索等の外部プログラム等を提供する外部システム２００とを、ネットワーク３００を介して通信可能に接続して構成されている。また、タンパク質構造解析装置１００は、質量分析装置４００の断片化スペクトル情報などが入力装置１１２またはネットワーク３００などを介して取得可能に構成されている。 [System configuration]
First, the configuration of this system will be described. FIG. 2 is a block diagram showing an example of the configuration of the system to which the present invention is applied, and conceptually shows only the portion related to the present invention in the configuration. This system is generally configured by connecting a protein structure analysis apparatus 100 and an external system 200 that provides an external database such as sequence information and an external program such as a homology search to be communicable via a network 300. Has been. Further, the protein structure analysis apparatus 100 is configured such that the fragmentation spectrum information of the mass spectrometer 400 can be acquired via the input device 112 or the network 300.

図２においてネットワーク３００は、タンパク質構造解析装置１００と外部システム２００とを相互に接続する機能を有し、例えば、インターネット等である。 In FIG. 2, a network 300 has a function of mutually connecting the protein structure analysis apparatus 100 and the external system 200, and is, for example, the Internet.

図２において外部システム２００は、ネットワーク３００を介して、タンパク質構造解析装置１００と相互に接続され、利用者に対して配列情報等に関する外部データベースやホモロジー検索やモチーフ検索等の外部プログラムを実行するウェブサイトを提供する機能を有する。 In FIG. 2, an external system 200 is connected to the protein structure analysis apparatus 100 via a network 300, and is a web for executing external programs such as homology search and motif search for an external database related to sequence information and the like for users. Has the function of providing a site.

ここで、外部システム２００は、ＷＥＢサーバやＡＳＰサーバ等として構成してもよく、そのハードウェア構成は、一般に市販されるワークステーション、パーソナルコンピュータ等の情報処理装置およびその付属装置により構成してもよい。また、外部システム２００の各機能は、外部システム２００のハードウェア構成中のＣＰＵ、ディスク装置、メモリ装置、入力装置、出力装置、通信制御装置等およびそれらを制御するプログラム等により実現される。 Here, the external system 200 may be configured as a WEB server, an ASP server, or the like, and the hardware configuration may be configured by an information processing apparatus such as a commercially available workstation or a personal computer and an accessory device thereof. Good. Each function of the external system 200 is realized by a CPU, a disk device, a memory device, an input device, an output device, a communication control device, and the like in the hardware configuration of the external system 200 and a program for controlling them.

図２において質量分析装置４００は、ヘキサポールＣＩＤ法、ノズルスキマーＣＩＤ法、キャピラリースキマーＣＩＤ法、ＳＯＲＩ−ＣＩＤ法、および、マルチポールストアー補助キャピラリースキマーＣＩＤ法を含む衝突活性化解離（ＣＩＤ）法、ＩＲＭＰＤ法、インソース分解（ＩＳＤ）法、ポストソース分解（ＰＳＤ）法、表面誘起解離（ＳＩＤ）法、ＥＣＤ法、または、ＢＩＲＤ法などの方法により断片化イオンに断片化し、断片化スペクトルを測定する。 In FIG. 2, the mass spectrometer 400 includes a collision activated dissociation (CID) method including a hexapole CID method, a nozzle skimmer CID method, a capillary skimmer CID method, a SORI-CID method, and a multipole store auxiliary capillary skimmer CID method, The fragmentation spectrum is measured by fragmenting into fragmented ions by methods such as IRMPD, in-source decomposition (ISD), post-source decomposition (PSD), surface-induced dissociation (SID), ECD, or BIRD. To do.

図２においてタンパク質構造解析装置１００は、概略的に、タンパク質構造解析装置１００の全体を統括的に制御するＣＰＵ等の制御部１０２、通信回線等に接続されるルータ等の通信装置（図示せず）に接続される通信制御インターフェース部１０４、入力装置１１２や出力装置１１４に接続される入出力制御インターフェース部１０８、および、各種のデータベースやテーブルなどを格納する記憶部１０６を備えて構成されており、これら各部は任意の通信路を介して通信可能に接続されている。さらに、このタンパク質構造解析装置１００は、ルータ等の通信装置および専用線等の有線または無線の通信回線を介して、ネットワーク３００に通信可能に接続されている。 In FIG. 2, the protein structure analysis apparatus 100 schematically includes a control unit 102 such as a CPU that comprehensively controls the entire protein structure analysis apparatus 100, a communication device such as a router connected to a communication line (not shown). ) Connected to the communication control interface unit 104, the input / output control interface unit 108 connected to the input device 112 and the output device 114, and the storage unit 106 for storing various databases and tables. These units are communicably connected via an arbitrary communication path. Further, the protein structure analysis apparatus 100 is communicably connected to the network 300 via a communication device such as a router and a wired or wireless communication line such as a dedicated line.

記憶部１０６に格納される各種のデータベースやファイルやテーブル（断片化スペクトルデータファイル１０６ａ〜処理結果データファイル１０６ｆ）は、固定ディスク装置等のストレージ手段であり、各種処理に用いる各種のプログラムやテーブルやファイルやデータベースやウェブページ用ファイル等を格納する。 Various databases, files and tables (fragmented spectrum data file 106a to processing result data file 106f) stored in the storage unit 106 are storage means such as a fixed disk device, and various programs and tables used for various processes, Stores files, databases, web page files, etc.

これら記憶部１０６の各構成要素のうち、断片化スペクトルデータファイル１０６ａは、質量分析装置４００から取得した断片化スペクトルデータを格納する断片化スペクトルデータ格納手段である。 Among the components of the storage unit 106, the fragmented spectrum data file 106a is a fragmented spectrum data storage unit that stores the fragmented spectrum data acquired from the mass spectrometer 400.

また、切断容易領域情報ファイル１０６ｂは、切断容易領域情報等を格納する切断容易領域情報格納手段である。 The easy-to-cut area information file 106b is easy-to-cut area information storage means for storing easy-to-cut area information and the like.

また、溶媒接触残基情報／相互作用界面情報ファイル１０６ｃは、溶媒接触残基情報および相互作用界面情報等を格納する相互作用界面情報格納手段である。 The solvent contact residue information / interaction interface information file 106c is interaction interface information storage means for storing solvent contact residue information, interaction interface information, and the like.

また、立体構造予測データファイル１０６ｄは、立体構造予測処理により予測された立体構造予測データ等を格納する立体構造予測データ格納手段である。 The 3D structure prediction data file 106d is a 3D structure prediction data storage unit that stores 3D structure prediction data predicted by the 3D structure prediction process.

また、予測構造評価情報ファイル１０６ｅは、予測構造評価情報等を格納する予測構造評価情報格納手段である。 The predicted structure evaluation information file 106e is a predicted structure evaluation information storage unit that stores predicted structure evaluation information and the like.

また、処理結果データファイル１０６ｆは、処理結果に関する情報等を格納する処理結果データ格納手段である。 The processing result data file 106f is processing result data storage means for storing information about processing results.

また、その他の情報として、タンパク質構造解析装置１００の記憶部１０６には、タンパク質のアミノ酸配列情報や構造情報を格納したタンパク質情報データベース等が記録されていてもよい。このタンパク質情報データベースは、ＰＤＢ等のインターネットを経由してアクセスする外部のデータベースであってもよく、また、これらのデータベースをコピーしたり、オリジナルの配列情報等を格納したり、さらに独自のクラスタリング情報やアノテーション情報等を付加したりして作成したインハウスデータベースであってもよい。 In addition, as other information, a protein information database storing protein amino acid sequence information and structure information may be recorded in the storage unit 106 of the protein structure analysis apparatus 100. This protein information database may be an external database accessed via the Internet, such as a PDB, or may be copied, stored with original sequence information, or unique clustering information. Or an in-house database created by adding annotation information or the like.

また、図２において、通信制御インターフェース部１０４は、タンパク質構造解析装置１００とネットワーク３００（またはルータ等の通信装置）との間における通信制御を行う。すなわち、通信制御インターフェース部１０４は、他の端末と通信回線を介してデータを通信する機能を有する。 In FIG. 2, the communication control interface unit 104 performs communication control between the protein structure analysis apparatus 100 and the network 300 (or a communication apparatus such as a router). That is, the communication control interface unit 104 has a function of communicating data with other terminals via a communication line.

また、図２において、入出力制御インターフェース部１０８は、入力装置１１２や出力装置１１４の制御を行う。ここで、出力装置１１４としては、モニタ（家庭用テレビを含む）の他、スピーカを用いることができる（なお、以下においては出力装置１１４をモニタとして記載する場合がある）。また、入力装置１１２としては、キーボード、マウス、および、マイク等を用いることができる。また、モニタも、マウスと協働してポインティングデバイス機能を実現する。 In FIG. 2, the input / output control interface unit 108 controls the input device 112 and the output device 114. Here, as the output device 114, in addition to a monitor (including a home TV), a speaker can be used (hereinafter, the output device 114 may be described as a monitor). As the input device 112, a keyboard, a mouse, a microphone, or the like can be used. The monitor also realizes a pointing device function in cooperation with the mouse.

また、図２において、制御部１０２は、ＯＳ（ＯｐｅｒａｔｉｎｇＳｙｓｔｅｍ）等の制御プログラム、各種の処理手順等を規定したプログラム、および所要データを格納するための内部メモリを有し、これらのプログラム等により、種々の処理を実行するための情報処理を行う。制御部１０２は、機能概念的に、断片化スペクトルデータ取得部１０２ａ、帰属情報決定部１０２ｂ、切断容易領域情報決定部１０２ｃ、溶媒接触残基情報決定部１０２ｄ、相互作用界面情報決定部１０２ｅ、立体構造予測処理部１０２ｆ、予測構造評価情報決定部１０２ｇ、処理結果出力部１０２ｈ、予測構造評価情報出力部１０２ｉ、重水素化速度決定部１０２ｊ、および、基準振動計算処理部１０２ｋを備えて構成されている。 In FIG. 2, the control unit 102 has a control program such as an OS (Operating System), a program defining various processing procedures, and an internal memory for storing necessary data. Information processing for executing various processes is performed. The control unit 102 is functionally conceptually divided into a fragmentation spectrum data acquisition unit 102a, an attribution information determination unit 102b, an easily cleavable region information determination unit 102c, a solvent contact residue information determination unit 102d, an interaction interface information determination unit 102e, The structure prediction processing unit 102f, the prediction structure evaluation information determination unit 102g, the processing result output unit 102h, the prediction structure evaluation information output unit 102i, the deuteration rate determination unit 102j, and the reference vibration calculation processing unit 102k are configured. Yes.

このうち、断片化スペクトルデータ取得部１０２ａは、立体構造を予測する対象タンパク質についてＭＳや酵素などを用いて断片化し、その断片化スペクトルを測定した質量分析装置４００から断片化スペクトルデータを取得する断片化スペクトルデータ取得手段である。 Among these, the fragmentation spectrum data acquisition unit 102a fragments the target protein for which a three-dimensional structure is predicted using MS or an enzyme, and acquires fragmentation spectrum data from the mass spectrometer 400 that measures the fragmentation spectrum. This is a generalized spectral data acquisition means.

また、帰属情報決定部１０２ｂは、断片化スペクトルデータ取得手段により取得された断片化スペクトルデータに基づいて、対象タンパク質のアミノ酸配列に対する断片化イオンの帰属情報を決定する帰属情報決定手段である。 Further, the attribution information determination unit 102b is an attribution information determination unit that determines the attribution information of fragmented ions for the amino acid sequence of the target protein based on the fragmentation spectrum data acquired by the fragmentation spectrum data acquisition unit.

また、切断容易領域情報決定部１０２ｃは、帰属情報決定手段により決定された断片化イオンの帰属情報により、対象タンパク質のアミノ酸配列において断片化イオンに断片化された箇所を特定し、当該箇所に従って対象タンパク質のアミノ酸配列における切断容易領域情報を決定する切断容易領域情報決定手段である。 In addition, the easy-cleavage region information determination unit 102c identifies the location fragmented into fragmented ions in the amino acid sequence of the target protein based on the fragmentation ion attribution information determined by the attribution information determination means, This is an easily cleavable region information determining means for determining easily cleavable region information in the amino acid sequence of a protein.

また、溶媒接触残基情報決定部１０２ｄは、重水素化速度決定手段にて決定されたアミノ酸配列中の各アミノ酸残基における重水素化速度に従って、対象タンパク質のアミノ酸配列において溶媒に接触している箇所を特定し、当該箇所に従って対象タンパク質のアミノ酸配列における溶媒接触残基情報を決定する溶媒接触残基情報決定手段である。 The solvent contact residue information determination unit 102d is in contact with the solvent in the amino acid sequence of the target protein according to the deuteration rate at each amino acid residue in the amino acid sequence determined by the deuteration rate determination means. Solvent contact residue information determination means for specifying a location and determining solvent contact residue information in the amino acid sequence of the target protein according to the location.

また、相互作用界面情報決定部１０２ｅは、重水素化速度決定手段にて決定されたアミノ酸配列中の各アミノ酸残基における重水素化速度に従って、対象タンパク質のアミノ酸配列における相互作用界面情報を決定する相互作用界面情報決定手段である。 Further, the interaction interface information determination unit 102e determines the interaction interface information in the amino acid sequence of the target protein according to the deuteration rate at each amino acid residue in the amino acid sequence determined by the deuteration rate determination means. It is an interaction interface information determination means.

また、立体構造予測処理部１０２ｆは、ホモロジーモデリング法、分子シミュレーション法、アブイニシオ法、二次構造予測法、３Ｄ−１Ｄ法、スレッディング法などの既存の立体構造予測方法を用いて、対象タンパク質や候補タンパク質について立体構造を予測する立体構造予測手段である。 Further, the three-dimensional structure prediction processing unit 102f uses the existing three-dimensional structure prediction method such as a homology modeling method, a molecular simulation method, an ab initio method, a secondary structure prediction method, a 3D-1D method, and a threading method. This is a three-dimensional structure prediction means for predicting a three-dimensional structure of a protein.

また、予測構造評価情報決定部１０２ｇは、立体構造予測手段により予測された立体構造予測データと、切断容易領域情報決定手段により決定された切断容易領域情報とを比較して、切断容易領域情報により特定された切断容易領域に対応する予測構造部分について評価を行い予測構造評価情報を決定する予測構造評価情報決定手段、立体構造予測手段により予測された立体構造予測データと、溶媒接触残基情報決定手段により決定された溶媒接触残基情報とを比較して、溶媒接触残基情報により特定された溶媒接触残基に対応する予測構造部分について評価を行い予測構造評価情報を決定する予測構造評価情報決定手段、立体構造予測手段により予測された立体構造予測データと、相互作用界面情報決定手段により決定された相互作用界面情報とを比較して、相互作用界面情報により特定された相互作用界面に対応する予測構造部分について評価を行い予測構造評価情報を決定する予測構造評価情報決定手段である。 Further, the predicted structure evaluation information determination unit 102g compares the three-dimensional structure prediction data predicted by the three-dimensional structure prediction unit with the easy-to-cut region information determined by the easy-to-cut region information determination unit, and uses the easy-to-cut region information. Predicted structure evaluation information determining means for evaluating the predicted structure portion corresponding to the specified easily cleavable region and determining predicted structure evaluation information, three-dimensional structure prediction data predicted by the three-dimensional structure prediction means, and solvent contact residue information determination Predicted structure evaluation information that compares the solvent contact residue information determined by the means, evaluates the predicted structure portion corresponding to the solvent contact residue specified by the solvent contact residue information, and determines the predicted structure evaluation information Determination means, three-dimensional structure prediction data predicted by three-dimensional structure prediction means, and interaction interface determined by interaction interface information determination means By comparing the distribution, it is predicted structure evaluation information determining means for determining a predicted structure evaluation information evaluated for predicted structural portion corresponding to the interaction interface identified by the interaction interface information.

また、処理結果出力部１０２ｈは、立体構造予測手段により予測された立体構造予測データと、切断容易領域情報決定手段により決定された切断容易領域情報とを対応させて出力する処理結果出力手段、立体構造予測手段により予測された立体構造予測データと、溶媒接触残基情報決定手段により決定された溶媒接触残基情報とを対応させて出力する処理結果出力手段、立体構造予測手段により予測された立体構造予測データと、相互作用界面情報決定手段により決定された相互作用界面情報とを対応させて出力する処理結果出力手段である。 The processing result output unit 102h is a processing result output unit that outputs the three-dimensional structure prediction data predicted by the three-dimensional structure prediction unit and the easy-to-cut region information determined by the easy-to-cut region information determination unit in association with each other. Processing result output means for outputting the three-dimensional structure prediction data predicted by the structure prediction means and the solvent contact residue information determined by the solvent contact residue information determination means in correspondence with each other, and the three-dimensional prediction predicted by the three-dimensional structure prediction means It is a processing result output means for outputting the structure prediction data and the interaction interface information determined by the interaction interface information determination means in association with each other.

また、予測構造評価情報出力部１０２ｉは、予測構造評価情報決定手段により決定された予測構造評価情報を出力する予測構造評価情報出力手段である。 The predicted structure evaluation information output unit 102i is a predicted structure evaluation information output unit that outputs the predicted structure evaluation information determined by the predicted structure evaluation information determination unit.

また、重水素化速度決定部１０２ｊは、断片化スペクトルデータ取得手段により測定された断片化スペクトルデータ、および、帰属情報決定手段により決定された断片化イオンの帰属情報に従って、対象タンパク質のアミノ酸配列中の各アミノ酸残基における重水素化速度を決定する重水素化速度決定手段である。 In addition, the deuteration rate determination unit 102j determines whether the fragmentation spectrum data measured by the fragmentation spectrum data acquisition unit and the fragmentation ion attribution information determined by the attribution information determination unit are included in the amino acid sequence of the target protein. Is a deuteration rate determining means for determining a deuteration rate at each amino acid residue.

また、基準振動計算処理部１０２ｋは、対象タンパク質について基準振動計算を行う基準振動計算手段である。なお、これら各部によって行なわれる処理の詳細については、後述する。 The reference vibration calculation processing unit 102k is a reference vibration calculation unit that performs reference vibration calculation for the target protein. Details of processing performed by each of these units will be described later.

［システムの処理］
次に、このように構成された本実施の形態における本システムの処理の一例について、以下に図３などを参照して詳細に説明する。 System processing
Next, an example of processing of the system in the present embodiment configured as described above will be described in detail with reference to FIG.

［メイン処理］
まず、メイン処理の詳細について図３を参照して説明する。図３は、本実施形態における本システムのメイン処理の一例を示すフローチャートである。 [Main processing]
First, details of the main process will be described with reference to FIG. FIG. 3 is a flowchart showing an example of main processing of the system according to the present embodiment.

まず、対象タンパク質について質量分析装置４００により断片化スペクトルを測定する（ステップＳＡ−１）。すなわち、対象タンパク質溶液をシリンジポンプで質量分析装置４００に連続注入し、ヘキサポールＣＩＤ法などにより対象タンパク質を断片化イオンに断片化し、断片化スペクトルの測定を行う。 First, a fragmentation spectrum is measured for the target protein by the mass spectrometer 400 (step SA-1). That is, the target protein solution is continuously injected into the mass spectrometer 400 with a syringe pump, the target protein is fragmented into fragmented ions by the hexapole CID method or the like, and the fragmentation spectrum is measured.

ここで、図４は、本実施形態におけるヘキサポールＣＩＤ法により断片化スペクトルを測定する場合の一例を示すフローチャートである。 Here, FIG. 4 is a flowchart showing an example of measuring a fragmented spectrum by the hexapole CID method in the present embodiment.

まず、キャピラリー出口の電圧を上げる（ステップＳＢ−１）。 First, the capillary outlet voltage is increased (step SB-1).

そして、断片化の効率を向上させるため、ヘキサポール内でのイオンのトラップ時間を長くしてもよい（ステップＳＢ−２）。 In order to improve the fragmentation efficiency, the ion trap time in the hexapole may be increased (step SB-2).

そして、タンパク質が断片化するように、ＭＳイオン化と検出のパラメータ（例えば、キャピラリー電圧、ヘキサポール内トラップ時間など）を調整する（ステップＳＢ−３）。 Then, MS ionization and detection parameters (for example, capillary voltage, trap time in hexapole, etc.) are adjusted so that the protein is fragmented (step SB-3).

そして、ＭＳによる断片化スペクトルの取り込みと保存を行う（ステップＳＢ−４）。 Then, the fragmentation spectrum is captured and stored by the MS (step SB-4).

また、図５は、本実施形態におけるＩＲＭＰＤ法により断片化スペクトルを測定する場合の一例を示すフローチャートである。 FIG. 5 is a flowchart showing an example of measuring a fragmented spectrum by the IRMPD method in the present embodiment.

まず、ＭＳ内に存在する（例えば、ヘキサポール内、イオントラップ内）イオンに対して、ＩＲレーザー（１０．９ｕｍ）を照射する（ステップＳＣ−１）。 First, an IR laser (10.9 μm) is irradiated to ions existing in the MS (for example, in a hexapole or ion trap) (step SC-1).

そして、断片化の効率を向上させるため、ヘキサポール内でのイオンのトラップ時間を長くしてもよい（ステップＳＣ−２）。 In order to improve the fragmentation efficiency, the ion trap time in the hexapole may be increased (step SC-2).

そして、タンパク質が断片化するように、ＭＳイオン化と検出のパラメータ（例えば、キャピラリー電圧、ヘキサポール内トラップ時間、ＩＲレーザーのエネルギー、ＩＲレーザーの照射時間など）を調整する（ステップＳＣ−３）。 Then, MS ionization and detection parameters (for example, capillary voltage, trap time in hexapole, IR laser energy, IR laser irradiation time, etc.) are adjusted so that the protein is fragmented (step SC-3).

そして、ＭＳによる断片化スペクトルの取り込みと保存を行う（ステップＳＣ−４）。 Then, the fragmentation spectrum is captured and stored by the MS (step SC-4).

その他のノズルスキマーＣＩＤ法、キャピラリースキマーＣＩＤ法、マルチポールストアー補助キャピラリースキマーＣＩＤ法、ＥＣＤ法、ＢＩＲＤ法、または、ＳＯＲＩ−ＣＩＤ法などにより断片化スペクトルを取得してもよい。 The fragmentation spectrum may be obtained by other nozzle skimmer CID method, capillary skimmer CID method, multipole store auxiliary capillary skimmer CID method, ECD method, BIRD method, or SORI-CID method.

再び、図３に戻り、タンパク質構造解析装置１００は、断片化スペクトルデータ取得部１０２ａの処理により、質量分析装置４００から入力装置１１２またはネットワーク３００を経由して断片化スペクトルデータを取得し、断片化スペクトルデータファイル１０６ａに格納する（ステップＳＡ−２）。 Returning to FIG. 3 again, the protein structure analysis apparatus 100 acquires the fragmentation spectrum data from the mass spectrometer 400 via the input device 112 or the network 300 by the processing of the fragmentation spectrum data acquisition unit 102a, and fragments. It stores in the spectrum data file 106a (step SA-2).

ついで、タンパク質構造解析装置１００は、帰属情報決定部１０２ｂの処理により、測定された断片化スペクトルに基づいて、対象タンパク質のアミノ酸配列に対する断片化イオンの帰属情報を決定し、また、切断容易領域情報決定部１０２ｃの処理により、決定された断片化イオンの帰属情報により、対象タンパク質のアミノ酸配列において断片化イオンに断片化された箇所を特定し、当該箇所に従って対象タンパク質のアミノ酸配列における切断容易領域情報を決定する（ステップＳＡ−３）。 Next, the protein structure analysis apparatus 100 determines the attribution information of the fragmented ions with respect to the amino acid sequence of the target protein based on the measured fragmentation spectrum by the processing of the attribution information determination unit 102b. The portion fragmented into fragment ions in the amino acid sequence of the target protein is identified from the fragmentation ion attribution information determined by the processing of the determination unit 102c, and the easy-cleavage region information in the amino acid sequence of the target protein according to the portion Is determined (step SA-3).

ついで、タンパク質構造解析装置１００は、切断容易領域情報決定部１０２ｃの処理により、切断容易領域情報を切断容易領域情報ファイル１０６ｂに格納する（ステップＳＡ−４）。 Next, the protein structure analysis apparatus 100 stores the easy-cleavage region information in the easy-cleavage region information file 106b by the processing of the easy-cleavage region information determination unit 102c (step SA-4).

また、対象タンパク質やその複合体について、水素−重水素交換法による断片化スペクトルの測定を行う（ステップＳＡ−５）。 Further, the fragmentation spectrum of the target protein or complex thereof is measured by the hydrogen-deuterium exchange method (step SA-5).

ここで、図７は、本実施形態における水素−重水素交換法により断片化スペクトルを測定する場合の一例を示すフローチャートである。 Here, FIG. 7 is a flowchart showing an example of measuring a fragmented spectrum by the hydrogen-deuterium exchange method in the present embodiment.

まず、対象タンパク質を緩衝液に溶解または重水素置換する（ステップＳＤ−１）。 First, the target protein is dissolved or deuterated in a buffer solution (step SD-1).

そして、経時的にサンプリングを行いながら、緩衝液を酸性にして温度を下げ、Ｈ／Ｄ交換反応速度を下げる（ステップＳＤ−２）。 Then, while sampling over time, the buffer is acidified to lower the temperature and the H / D exchange reaction rate is lowered (step SD-2).

そして、酸性条件下で対象タンパク質をペプシン消化する（ステップＳＤ−３）。 Then, the target protein is digested with pepsin under acidic conditions (step SD-3).

そして、ペプチド消化物を抽出し（ステップＳＤ−４）、ＭＡＬＤＩ−ＴＯＦ−ＭＳ、または、ＬＣ／ＭＳ／ＭＳを行う（ステップＳＤ−５）。 And a peptide digest is extracted (step SD-4) and MALDI-TOF-MS or LC / MS / MS is performed (step SD-5).

また、図８は、本実施形態における水素−重水素交換法によりタンパク質をそのままＭＳで断片化し断片化スペクトルを測定する場合の一例を示すフローチャートである。 FIG. 8 is a flowchart showing an example in the case where a protein is fragmented as it is with MS by the hydrogen-deuterium exchange method in this embodiment and a fragmentation spectrum is measured.

まず、対象タンパク質を緩衝液に溶解または重水素置換する（ステップＳＥ−１）。 First, the target protein is dissolved or substituted with deuterium in a buffer solution (step SE-1).

そして、ＥＳＩ−ＭＳで断片化スペクトルの測定を行う（ステップＳＥ−２）。 Then, the fragmentation spectrum is measured by ESI-MS (step SE-2).

そして、経時的にサンプリングを行いながら、断片化スペクトルを取り込み保存する（ステップＳＥ−３）。 Then, the fragmentation spectrum is captured and stored while sampling over time (step SE-3).

また、図９は、本実施形態における水素−重水素交換法によりタンパク質の複合体を断片化し断片化スペクトルを測定する場合の一例を示すフローチャートである。 FIG. 9 is a flowchart showing an example of fragmenting a protein complex and measuring a fragmentation spectrum by the hydrogen-deuterium exchange method in the present embodiment.

まず、タンパク質の複合体を緩衝液に溶解または重水素置換する（ステップＳＦ−１）。 First, the protein complex is dissolved in a buffer or substituted with deuterium (step SF-1).

そして、経時的にサンプリングを行いながら、緩衝液を酸性にして温度を下げ、Ｈ／Ｄ交換反応速度を下げる（ステップＳＦ−２）。 Then, while sampling over time, the buffer solution is acidified to lower the temperature, and the H / D exchange reaction rate is lowered (step SF-2).

そして、酸性条件下で対象タンパク質をペプシン消化する（ステップＳＦ−３）。 Then, the target protein is digested with pepsin under acidic conditions (step SF-3).

そして、ペプチド消化物を抽出し（ステップＳＦ−４）、ＭＡＬＤＩ−ＴＯＦ−ＭＳ、または、ＬＣ／ＭＳ／ＭＳを行う（ステップＳＦ−５）。 And a peptide digest is extracted (step SF-4) and MALDI-TOF-MS or LC / MS / MS is performed (step SF-5).

また、図１０は、本実施形態における水素−重水素交換法によりタンパク質ＡとＢの複合体を断片化し断片化スペクトルを測定する場合の一例を示すフローチャートである。 FIG. 10 is a flowchart showing an example of fragmenting the complex of proteins A and B and measuring the fragmentation spectrum by the hydrogen-deuterium exchange method in the present embodiment.

まず、タンパク質ＡとＢを別々に緩衝液に溶解または重水素置換し、一定時間（ｔ１）放置する（ステップＳＧ−１）。 First, proteins A and B are separately dissolved or substituted with deuterium in a buffer solution and allowed to stand for a predetermined time (t1) (step SG-1).

そして、タンパク質ＡとＢを混合し複合体を形成して一定時間（ｔ２）放置する（ステップＳＧ−２）。 Then, proteins A and B are mixed to form a complex and left for a certain time (t2) (step SG-2).

そして、軽水緩衝液に置換、または、希釈する（ステップＳＧ−３）。 And it substitutes or dilutes with a light water buffer solution (step SG-3).

そして、一定時間（Ｔ＝ｔ１＋ｔ２）放置し、Ｈ／Ｄの逆交換反応（オフチェンジ）を行う（ステップＳＧ−４）。 Then, it is allowed to stand for a certain time (T = t1 + t2), and an H / D reverse exchange reaction (off change) is performed (step SG-4).

そして、ＥＳＩ−ＭＳで断片化スペクトルの測定を行う（ステップＳＧ−５）。 Then, the fragmentation spectrum is measured by ESI-MS (step SG-5).

再び、図３に戻り、タンパク質構造解析装置１００は、断片化スペクトルデータ取得部１０２ａの処理により、質量分析装置４００から入力装置１１２またはネットワーク３００を経由して断片化スペクトルデータを取得し、断片化スペクトルデータファイル１０６ａに格納する（ステップＳＡ−６）。 Returning to FIG. 3 again, the protein structure analysis apparatus 100 acquires the fragmentation spectrum data from the mass spectrometer 400 via the input device 112 or the network 300 by the processing of the fragmentation spectrum data acquisition unit 102a, and fragments. It stores in the spectrum data file 106a (step SA-6).

ついで、タンパク質構造解析装置１００は、帰属情報決定部１０２ｂの処理により、測定された断片化スペクトルに基づいて、対象タンパク質のアミノ酸配列に対する断片化イオンの帰属情報を決定し、また、重水素化速度決定部１０２ｊの処理により、測定された断片化スペクトル、および、決定された断片化イオンの帰属情報に従って、対象タンパク質のアミノ酸配列中の各アミノ酸残基における重水素化速度を決定する。 Next, the protein structure analysis apparatus 100 determines the attribution information of the fragmented ions for the amino acid sequence of the target protein based on the measured fragmentation spectrum by the processing of the attribution information determination unit 102b, and the deuteration rate By the processing of the determination unit 102j, the deuteration rate at each amino acid residue in the amino acid sequence of the target protein is determined according to the measured fragmentation spectrum and the determined fragmentation ion attribution information.

ここで、重水素化速度は、以下により求める。まず、帰属情報決定部１０２ｂがペプチドの帰属情報を決定すると、重水素化速度決定部１０２ｊは、各アミノ酸残基について、質量増加分を交換性プロトン数で割ることにより重水素化率を求める。そして、重水素化速度決定部１０２ｊは、重水素化率に基づいて、各ペプチド配列のアミノ酸残基の重水化速度を決定する。 Here, the deuteration rate is obtained as follows. First, when the attribution information determination unit 102b determines the attribution information of the peptide, the deuteration rate determination unit 102j obtains the deuteration rate for each amino acid residue by dividing the increase in mass by the number of exchangeable protons. Then, the deuteration rate determining unit 102j determines the deuteration rate of the amino acid residue of each peptide sequence based on the deuteration rate.

そして、溶媒接触残基情報決定部１０２ｄは、決定されたアミノ酸配列中の各アミノ酸残基における重水素化速度に従って、対象タンパク質のアミノ酸配列において溶媒に接触している箇所を特定し、当該箇所に従って対象タンパク質のアミノ酸配列における溶媒接触残基情報を決定し（ステップＳＡ−７）、溶媒接触残基情報／相互作用界面情報ファイル１０６ｃに格納する（ステップＳＡ−８）。溶媒接触残基情報は、例えば、水素−重水素交換していないコントロール実験と比較して重水素化速度が遅くなっている部位を溶媒接触残基情報として決定してもよい。 Then, the solvent contact residue information determination unit 102d identifies a position in contact with the solvent in the amino acid sequence of the target protein according to the deuteration rate at each amino acid residue in the determined amino acid sequence, and according to the position Solvent contact residue information in the amino acid sequence of the target protein is determined (step SA-7) and stored in the solvent contact residue information / interaction interface information file 106c (step SA-8). For the solvent contact residue information, for example, a site where the deuteration rate is slow compared to a control experiment in which hydrogen-deuterium exchange is not performed may be determined as the solvent contact residue information.

また、相互作用界面情報決定部１０２ｅは、決定されたアミノ酸配列中の各アミノ酸残基における重水素化速度に従って、対象タンパク質のアミノ酸配列における相互作用界面情報を決定し（ステップＳＡ−７）、溶媒接触残基情報／相互作用界面情報ファイル１０６ｃに格納する（ステップＳＡ−８）。相互作用界面情報は、例えば、複合体を形成していない単独時に測定した場合と比較して、重水素化速度が遅くなっている部位を相互作用界面情報として決定してもよい。また、水素−重水素交換していないコントロール実験と比較して重水素化速度が遅くなっている部位を相互作用界面情報として決定してもよい。 Further, the interaction interface information determination unit 102e determines interaction interface information in the amino acid sequence of the target protein according to the deuteration rate at each amino acid residue in the determined amino acid sequence (step SA-7), and the solvent The contact residue information / interaction interface information file 106c is stored (step SA-8). As for the interaction interface information, for example, a site where the deuteration rate is slow may be determined as the interaction interface information as compared with the case where measurement is performed when the complex is not formed alone. Alternatively, a site where the deuteration rate is slow compared to a control experiment in which hydrogen-deuterium exchange is not performed may be determined as interaction interface information.

また、タンパク質構造解析装置１００は、立体構造予測処理部１０２ｆの処理により、対象タンパク質について立体構造を予測し（ステップＳＡ−９）、予測した立体構造予測データを立体構造予測データファイル１０６ｄに格納する（ステップＳＡ−１０）。 Further, the protein structure analyzing apparatus 100 predicts the three-dimensional structure of the target protein by the processing of the three-dimensional structure prediction processing unit 102f (step SA-9), and stores the predicted three-dimensional structure prediction data in the three-dimensional structure prediction data file 106d. (Step SA-10).

そして、タンパク質構造解析装置１００は、このようにして決定した切断容易領域情報、溶媒接触残基情報および相互作用界面情報を、予測データとともに出力したり、予測構造評価情報を決定したりする際に利用する。 The protein structure analysis apparatus 100 outputs the easily cleavable region information, the solvent contact residue information, and the interaction interface information determined in this way together with the prediction data or when determining the prediction structure evaluation information. Use.

すなわち、処理結果出力部１０２ｈは、予測された立体構造予測データと決定された切断容易領域情報や溶媒接触残基情報や相互作用界面情報とを対応させて出力する（ステップＳＡ−１１）。 That is, the processing result output unit 102h outputs the predicted three-dimensional structure prediction data in association with the determined easy-cleavage region information, solvent contact residue information, and interaction interface information (step SA-11).

また、予測構造評価情報決定部１０２ｇは、予測された立体構造予測データと、決定された切断容易領域情報や溶媒接触残基情報や相互作用界面情報とを比較して（ステップＳＡ−１２）、切断容易領域情報により特定された切断容易領域や溶媒接触残基情報により特定された溶媒接触残基や相互作用界面情報により特定された相互作用界面に対応する予測構造部分について評価を行い予測構造評価情報を決定する（ステップＳＡ−１３）。 Further, the predicted structure evaluation information determination unit 102g compares the predicted three-dimensional structure prediction data with the determined cleavable region information, solvent contact residue information, and interaction interface information (step SA-12), Predictive structure evaluation is performed by evaluating the predicted structure portion corresponding to the interaction interface specified by the easy contact region specified by the easy access region information, the solvent contact residue specified by the solvent contact residue information or the interaction interface information. Information is determined (step SA-13).

また、予測構造評価情報出力部１０２ｉは、決定された予測構造評価情報を出力する（ステップＳＡ−１４）。これにて、メイン処理が終了する。 Further, the predicted structure evaluation information output unit 102i outputs the determined predicted structure evaluation information (step SA-14). This completes the main process.

［実施例１］
次に、本発明の実施例１の詳細について図１３等を参照して説明する。図１３は、本発明における本システムの実施例１の一例を示すフローチャートである。 [Example 1]
Next, details of the first embodiment of the present invention will be described with reference to FIG. FIG. 13 is a flowchart showing an example of the first embodiment of the system according to the present invention.

まず、対象タンパク質（または複合体）について質量分析装置４００により断片化スペクトルの測定を行う（ステップＳＪ−１）。また、対象タンパク質（または複合体）について水素−重水素交換反応を行いながら、断片化スペクトルの測定を行ってもよい。そして、タンパク質構造解析装置１００は、断片化スペクトルデータ取得部１０２ａの処理により、質量分析装置４００から入力装置１１２またはネットワーク３００を経由して断片化スペクトルデータを取得し、断片化スペクトルデータファイル１０６ａに格納する。 First, the fragmentation spectrum of the target protein (or complex) is measured by the mass spectrometer 400 (step SJ-1). Moreover, you may measure a fragmentation spectrum, performing hydrogen-deuterium exchange reaction about object protein (or complex). Then, the protein structure analysis apparatus 100 acquires fragmentation spectrum data from the mass spectrometry apparatus 400 via the input device 112 or the network 300 by the processing of the fragmentation spectrum data acquisition unit 102a, and stores the fragmentation spectrum data file 106a. Store.

ついで、タンパク質構造解析装置１００は、帰属情報決定部１０２ｂの処理により、測定された断片化スペクトルに基づいて、対象タンパク質のアミノ酸配列に対する断片化イオンの帰属情報を決定し（ステップＳＪ−２）、また、タンパク質構造解析装置１００は、切断容易領域情報決定部１０２ｃの処理により、決定された断片化イオンの帰属情報により、対象タンパク質のアミノ酸配列において断片化イオンに断片化された箇所を特定し、当該箇所に従って対象タンパク質のアミノ酸配列における切断容易領域情報を決定する。 Next, the protein structure analysis apparatus 100 determines fragmentation ion attribution information for the amino acid sequence of the target protein based on the measured fragmentation spectrum by the processing of the attribution information determination unit 102b (step SJ-2). In addition, the protein structure analysis apparatus 100 identifies the location fragmented into fragment ions in the amino acid sequence of the target protein based on the fragmentation ion attribution information determined by the processing of the easy-cleavage region information determination unit 102c. The easy-cleavage region information in the amino acid sequence of the target protein is determined according to the location.

また、水素−重水素交換反応を行っている場合には、タンパク質構造解析装置１００は、重水素化速度決定部１０２ｊの処理により、測定された断片化スペクトル、および、決定された断片化イオンの帰属情報に従って、対象タンパク質のアミノ酸配列中の各アミノ酸残基における重水素化速度を決定する。そして、溶媒接触残基情報決定部１０２ｄは、決定されたアミノ酸配列中の各アミノ酸残基における重水素化速度に従って、対象タンパク質のアミノ酸配列において溶媒に接触している箇所を特定し、当該箇所に従って対象タンパク質のアミノ酸配列における溶媒接触残基情報を決定し、溶媒接触残基情報／相互作用界面情報ファイル１０６ｃに格納する。また、相互作用界面情報決定部１０２ｅは、決定されたアミノ酸配列中の各アミノ酸残基における重水素化速度に従って、対象タンパク質のアミノ酸配列における相互作用界面情報を決定し、溶媒接触残基情報／相互作用界面情報ファイル１０６ｃに格納する。 In addition, when the hydrogen-deuterium exchange reaction is performed, the protein structure analysis apparatus 100 uses the process of the deuteration rate determination unit 102j to measure the measured fragmentation spectrum and the determined fragmented ions. According to the attribution information, the deuteration rate at each amino acid residue in the amino acid sequence of the target protein is determined. Then, the solvent contact residue information determination unit 102d identifies a position in contact with the solvent in the amino acid sequence of the target protein according to the deuteration rate at each amino acid residue in the determined amino acid sequence, and according to the position Solvent contact residue information in the amino acid sequence of the target protein is determined and stored in the solvent contact residue information / interaction interface information file 106c. In addition, the interaction interface information determination unit 102e determines interaction interface information in the amino acid sequence of the target protein according to the deuteration rate at each amino acid residue in the determined amino acid sequence, and determines solvent contact residue information / mutual The data is stored in the working interface information file 106c.

また、タンパク質構造解析装置１００は、立体構造予測処理部１０２ｆの処理により、対象タンパク質についてＰＤＢなどのタンパク質データベースを用いて相同性検索を行い、雛型の立体構造となる構造既知のタンパク質を抽出する（ステップＳＪ−３）。 In addition, the protein structure analysis apparatus 100 performs homology search on the target protein using a protein database such as PDB by the processing of the three-dimensional structure prediction processing unit 102f, and extracts a protein with a known structure that becomes a template three-dimensional structure. (Step SJ-3).

そして、立体構造予測処理部１０２ｆは、３Ｄ−１Ｄ法などの手法により、対象タンパク質と雛型の構造既知タンパク質とのアライメントを実行し（ステップＳＪ−４）、対象タンパク質の立体構造モデルのデータである立体構造予測データを作成し、立体構造予測データファイル１０６ｄに格納する（ステップＳＪ−５）。 Then, the three-dimensional structure prediction processing unit 102f performs alignment between the target protein and the template structure known protein by a method such as the 3D-1D method (step SJ-4), and uses the three-dimensional structure model data of the target protein. Certain 3D structure prediction data is created and stored in the 3D structure prediction data file 106d (step SJ-5).

そして、処理結果出力部１０２ｈは、予測された立体構造予測データと決定された切断容易領域情報や溶媒接触残基情報や相互作用界面情報とを対応させて出力する（ステップＳＪ−６）。 Then, the processing result output unit 102h outputs the predicted three-dimensional structure prediction data in association with the determined easy-cleavage region information, solvent contact residue information, and interaction interface information (step SJ-6).

また、予測構造評価情報決定部１０２ｇは、予測された立体構造予測データと、決定された切断容易領域情報や溶媒接触残基情報や相互作用界面情報とを比較して、切断容易領域情報により特定された切断容易領域や溶媒接触残基情報により特定された溶媒接触残基や相互作用界面情報により特定された相互作用界面に対応する予測構造部分について評価を行い予測構造評価情報を決定する。また、予測構造評価情報出力部１０２ｉは、決定された予測構造評価情報を出力する（ステップＳＪ−７）。これにて、実施例１の処理が終了する。 Further, the predicted structure evaluation information determination unit 102g compares the predicted three-dimensional structure prediction data with the determined easy-cleavage region information, solvent contact residue information, and interaction interface information, and specifies the easy-cleavage region information. The predicted structure evaluation information is determined by evaluating the predicted structure portion corresponding to the interaction interface specified by the solvent contact residue or the interaction interface information specified by the easily cleavable region or the solvent contact residue information. Also, the predicted structure evaluation information output unit 102i outputs the determined predicted structure evaluation information (step SJ-7). This completes the process of the first embodiment.

［実施例２］
次に、本発明の実施例２の詳細について図１４等を参照して説明する。図１４は、本発明における本システムの実施例２の一例を示すフローチャートである。 [Example 2]
Next, details of the second embodiment of the present invention will be described with reference to FIG. FIG. 14 is a flowchart showing an example of Embodiment 2 of the present system according to the present invention.

まず、対象タンパク質（または複合体）について質量分析装置４００により断片化スペクトルの測定を行う（ステップＳＫ−１）。また、対象タンパク質（または複合体）について水素−重水素交換反応を行いながら、断片化スペクトルの測定を行ってもよい。そして、タンパク質構造解析装置１００は、断片化スペクトルデータ取得部１０２ａの処理により、質量分析装置４００から入力装置１１２またはネットワーク３００を経由して断片化スペクトルデータを取得し、断片化スペクトルデータファイル１０６ａに格納する。 First, the fragmentation spectrum of the target protein (or complex) is measured by the mass spectrometer 400 (step SK-1). Moreover, you may measure a fragmentation spectrum, performing hydrogen-deuterium exchange reaction about object protein (or complex). Then, the protein structure analysis apparatus 100 acquires fragmentation spectrum data from the mass spectrometry apparatus 400 via the input device 112 or the network 300 by the processing of the fragmentation spectrum data acquisition unit 102a, and stores the fragmentation spectrum data file 106a. Store.

ついで、タンパク質構造解析装置１００は、帰属情報決定部１０２ｂの処理により、測定された断片化スペクトルに基づいて、対象タンパク質のアミノ酸配列に対する断片化イオンの帰属情報を決定し（ステップＳＫ−２）、また、タンパク質構造解析装置１００は、切断容易領域情報決定部１０２ｃの処理により、決定された断片化イオンの帰属情報により、対象タンパク質のアミノ酸配列において断片化イオンに断片化された箇所を特定し、当該箇所に従って対象タンパク質のアミノ酸配列における切断容易領域情報を決定する。 Next, the protein structure analysis apparatus 100 determines fragmentation ion attribution information for the amino acid sequence of the target protein based on the measured fragmentation spectrum by the process of the attribution information determination unit 102b (step SK-2). In addition, the protein structure analysis apparatus 100 identifies the location fragmented into fragment ions in the amino acid sequence of the target protein based on the fragmentation ion attribution information determined by the processing of the easy-cleavage region information determination unit 102c. The easy-cleavage region information in the amino acid sequence of the target protein is determined according to the location.

また、タンパク質構造解析装置１００は、立体構造予測処理部１０２ｆの処理により、対象タンパク質についてＰＤＢなどのタンパク質データベースを用いて相同性検索を行い、雛型の立体構造となる構造既知のタンパク質を抽出する（ステップＳＫ−３）。 In addition, the protein structure analysis apparatus 100 performs homology search on the target protein using a protein database such as PDB by the processing of the three-dimensional structure prediction processing unit 102f, and extracts a protein with a known structure that becomes a template three-dimensional structure. (Step SK-3).

ここで、雛形のタンパク質を抽出できなかった場合には、立体構造予測処理部１０２ｆは、分子シミュレーション法などの手法により、対象タンパク質の立体構造モデルのデータである立体構造予測データを作成し、立体構造予測データファイル１０６ｄに格納する（ステップＳＫ−４）。 If the template protein cannot be extracted, the three-dimensional structure prediction processing unit 102f creates three-dimensional structure prediction data that is data of the three-dimensional structure model of the target protein by a technique such as a molecular simulation method. It stores in the structure prediction data file 106d (step SK-4).

そして、処理結果出力部１０２ｈは、予測された立体構造予測データと決定された切断容易領域情報や溶媒接触残基情報や相互作用界面情報とを対応させて出力する（ステップＳＫ−５）。 Then, the processing result output unit 102h outputs the predicted three-dimensional structure prediction data and the determined easy-cleavage region information, solvent contact residue information, and interaction interface information in association with each other (step SK-5).

また、予測構造評価情報決定部１０２ｇは、予測された立体構造予測データと、決定された切断容易領域情報や溶媒接触残基情報や相互作用界面情報とを比較して、切断容易領域情報により特定された切断容易領域や溶媒接触残基情報により特定された溶媒接触残基や相互作用界面情報により特定された相互作用界面に対応する予測構造部分について評価を行い予測構造評価情報を決定する。また、予測構造評価情報出力部１０２ｉは、決定された予測構造評価情報を出力する（ステップＳＫ−６）。これにて、実施例２の処理が終了する。 Further, the predicted structure evaluation information determination unit 102g compares the predicted three-dimensional structure prediction data with the determined easy-cleavage region information, solvent contact residue information, and interaction interface information, and specifies the easy-cleavage region information. The predicted structure evaluation information is determined by evaluating the predicted structure portion corresponding to the interaction interface specified by the solvent contact residue or the interaction interface information specified by the easily cleavable region or the solvent contact residue information. Also, the predicted structure evaluation information output unit 102i outputs the determined predicted structure evaluation information (step SK-6). This completes the process of the second embodiment.

［実施例３］
次に、本発明の実施例３の詳細について図１５等を参照して説明する。図１５は、本発明における本システムの実施例３の一例を示すフローチャートである。 [Example 3]
Next, details of the third embodiment of the present invention will be described with reference to FIG. FIG. 15 is a flowchart showing an example of the third embodiment of the system according to the present invention.

まず、対象タンパク質（または複合体）について質量分析装置４００により断片化スペクトルの測定を行う（ステップＳＬ−１）。また、対象タンパク質（または複合体）について水素−重水素交換反応を行いながら、断片化スペクトルの測定を行ってもよい。そして、タンパク質構造解析装置１００は、断片化スペクトルデータ取得部１０２ａの処理により、質量分析装置４００から入力装置１１２またはネットワーク３００を経由して断片化スペクトルデータを取得し、断片化スペクトルデータファイル１０６ａに格納する。 First, the fragmentation spectrum of the target protein (or complex) is measured by the mass spectrometer 400 (step SL-1). Moreover, you may measure a fragmentation spectrum, performing hydrogen-deuterium exchange reaction about object protein (or complex). Then, the protein structure analysis apparatus 100 acquires fragmentation spectrum data from the mass spectrometry apparatus 400 via the input device 112 or the network 300 by the processing of the fragmentation spectrum data acquisition unit 102a, and stores the fragmentation spectrum data file 106a. Store.

ついで、タンパク質構造解析装置１００は、帰属情報決定部１０２ｂの処理により、測定された断片化スペクトルに基づいて、対象タンパク質のアミノ酸配列に対する断片化イオンの帰属情報を決定し（ステップＳＬ−２）、また、タンパク質構造解析装置１００は、切断容易領域情報決定部１０２ｃの処理により、決定された断片化イオンの帰属情報により、対象タンパク質のアミノ酸配列において断片化イオンに断片化された箇所を特定し、当該箇所に従って対象タンパク質のアミノ酸配列における切断容易領域情報を決定する。 Next, the protein structure analysis apparatus 100 determines the attribution information of the fragmented ions with respect to the amino acid sequence of the target protein based on the measured fragmentation spectrum by the processing of the attribution information determination unit 102b (step SL-2). In addition, the protein structure analysis apparatus 100 identifies the location fragmented into fragment ions in the amino acid sequence of the target protein based on the fragmentation ion attribution information determined by the processing of the easy-cleavage region information determination unit 102c. The easy-cleavage region information in the amino acid sequence of the target protein is determined according to the location.

また、タンパク質構造解析装置１００は、立体構造予測処理部１０２ｆの処理により、対象タンパク質についてＰＤＢなどのタンパク質データベースを用いて相同性検索を行い、雛型の立体構造となる構造既知のタンパク質の候補タンパク質を抽出する（ステップＳＬ−３）。 In addition, the protein structure analysis apparatus 100 performs a homology search on the target protein using a protein database such as PDB by the processing of the three-dimensional structure prediction processing unit 102f, and a candidate protein of a protein with a known structure that becomes a template three-dimensional structure Is extracted (step SL-3).

そして、立体構造予測処理部１０２ｆは、３Ｄ−１Ｄ法などの手法により、対象タンパク質と候補タンパク質とのアライメントを実行する（ステップＳＬ−４）。 Then, the three-dimensional structure prediction processing unit 102f performs alignment between the target protein and the candidate protein by a method such as the 3D-1D method (step SL-4).

そして、処理結果出力部１０２ｈは、候補タンパク質のアミノ酸配列や構造データと、決定された切断容易領域情報や溶媒接触残基情報や相互作用界面情報とを対応させて出力する（ステップＳＬ−５）。 Then, the processing result output unit 102h outputs the amino acid sequence and structure data of the candidate protein in association with the determined cleavable region information, solvent contact residue information, and interaction interface information (step SL-5). .

また、予測構造評価情報決定部１０２ｇは、候補タンパク質の構造データと、決定された切断容易領域情報や溶媒接触残基情報や相互作用界面情報とを比較して、切断容易領域情報により特定された切断容易領域や溶媒接触残基情報により特定された溶媒接触残基や相互作用界面情報により特定された相互作用界面に対応する予測構造部分について評価を行い予測構造評価情報を決定する。また、予測構造評価情報出力部１０２ｉは、決定された予測構造評価情報を出力する。これにより、各候補タンパク質の構造データと対象タンパク質の実験データとの一致性を順位付けることができ、さらに最も評価の高い候補タンパク質を絞り込むことができる（ステップＳＬ−６）。 The predicted structure evaluation information determination unit 102g compares the structure data of the candidate protein with the determined cleavable region information, solvent contact residue information, and interaction interface information, and is identified by the cleavable region information. The predicted structure evaluation information is determined by evaluating the predicted structure portion corresponding to the interaction interface specified by the solvent contact residue or the interaction interface information specified by the easy-cleavage region or the solvent contact residue information. The predicted structure evaluation information output unit 102i outputs the determined predicted structure evaluation information. Thereby, the coincidence between the structure data of each candidate protein and the experimental data of the target protein can be ranked, and the candidate protein with the highest evaluation can be narrowed down (step SL-6).

ついで、立体構造予測処理部１０２ｆは、最も評価の高い候補タンパク質を雛型のタンパク質として対象タンパク質の立体構造モデルのデータである立体構造予測データを作成し、立体構造予測データファイル１０６ｄに格納する（ステップＳＬ−７）。 Next, the three-dimensional structure prediction processing unit 102f creates three-dimensional structure prediction data, which is the data of the three-dimensional structure model of the target protein, using the candidate protein with the highest evaluation as a template protein, and stores it in the three-dimensional structure prediction data file 106d ( Step SL-7).

そして、実施例１のステップＳＪ−６およびステップＳＪ−７と同様の方法により、モデル構造の評価などを行う（ステップＳＬ−８）。これにて、実施例３の処理が終了する。 Then, the model structure is evaluated by the same method as in Step SJ-6 and Step SJ-7 of Example 1 (Step SL-8). This completes the process of the third embodiment.

［実施例４］
次に、本発明の実施例４の詳細について図１６等を参照して説明する。図１６は、本発明における本システムの実施例４の一例を示すフローチャートである。 [Example 4]
Next, details of the fourth embodiment of the present invention will be described with reference to FIG. FIG. 16 is a flowchart showing an example of Embodiment 4 of the system according to the present invention.

まず、タンパク質構造解析装置１００は、立体構造予測処理部１０２ｆの処理により、対象タンパク質についてＰＤＢなどのタンパク質データベースを用いて相同性検索を行い、雛型の立体構造となる構造既知のタンパク質を抽出する（ステップＳＭ−１）。 First, the protein structure analysis apparatus 100 performs a homology search on a target protein using a protein database such as PDB by the processing of the three-dimensional structure prediction processing unit 102f, and extracts a protein with a known structure that becomes a template three-dimensional structure. (Step SM-1).

ここで、雛形のタンパク質を抽出できなかった場合には、立体構造予測処理部１０２ｆは、分子シミュレーション法などの手法により、対象タンパク質の立体構造モデルのデータである立体構造予測データを作成し、立体構造予測データファイル１０６ｄに格納する（ステップＳＭ−２〜ステップＳＭ−３）。 If the template protein cannot be extracted, the three-dimensional structure prediction processing unit 102f creates three-dimensional structure prediction data that is data of the three-dimensional structure model of the target protein by a technique such as a molecular simulation method. It stores in the structure prediction data file 106d (step SM-2 to step SM-3).

そして、対象タンパク質（または複合体）および雛型タンパク質について質量分析装置４００により断片化スペクトルの測定を行う（ステップＳＭ−１およびステップＳＭ−４）。また、対象タンパク質（または複合体）および雛型タンパク質について水素−重水素交換反応を行いながら、断片化スペクトルの測定を行ってもよい。そして、タンパク質構造解析装置１００は、断片化スペクトルデータ取得部１０２ａの処理により、質量分析装置４００から入力装置１１２またはネットワーク３００を経由して断片化スペクトルデータを取得し、断片化スペクトルデータファイル１０６ａに格納する。 Then, the fragmentation spectrum of the target protein (or complex) and the template protein is measured by the mass spectrometer 400 (step SM-1 and step SM-4). Further, the fragmentation spectrum may be measured while performing a hydrogen-deuterium exchange reaction on the target protein (or complex) and the template protein. Then, the protein structure analysis apparatus 100 acquires fragmentation spectrum data from the mass spectrometry apparatus 400 via the input device 112 or the network 300 by the processing of the fragmentation spectrum data acquisition unit 102a, and stores the fragmentation spectrum data file 106a. Store.

ついで、タンパク質構造解析装置１００は、帰属情報決定部１０２ｂの処理により、測定された断片化スペクトルに基づいて、対象タンパク質および雛型タンパク質のアミノ酸配列に対する断片化イオンの帰属情報を決定し（ステップＳＭ−２）、また、タンパク質構造解析装置１００は、切断容易領域情報決定部１０２ｃの処理により、決定された断片化イオンの帰属情報により、対象タンパク質および雛型タンパク質のアミノ酸配列において断片化イオンに断片化された箇所を特定し、当該箇所に従って対象タンパク質のアミノ酸配列における切断容易領域情報を決定する。 Next, the protein structure analysis apparatus 100 determines the attribution information of the fragmented ions for the amino acid sequences of the target protein and the template protein based on the measured fragmentation spectrum by the processing of the attribution information determination unit 102b (step SM). -2) In addition, the protein structure analysis apparatus 100 uses the fragmentation ion assignment information determined by the processing of the easy-cleavage region information determination unit 102c to fragment the fragmented ions in the amino acid sequences of the target protein and the template protein. The identified site is identified, and easy-cleavage region information in the amino acid sequence of the target protein is determined according to the location.

また、水素−重水素交換反応を行っている場合には、タンパク質構造解析装置１００は、重水素化速度決定部１０２ｊの処理により、測定された断片化スペクトル、および、決定された断片化イオンの帰属情報に従って、対象タンパク質および雛型タンパク質のアミノ酸配列中の各アミノ酸残基における重水素化速度を決定する。そして、溶媒接触残基情報決定部１０２ｄは、決定されたアミノ酸配列中の各アミノ酸残基における重水素化速度に従って、対象タンパク質および雛型タンパク質のアミノ酸配列において溶媒に接触している箇所を特定し、当該箇所に従って対象タンパク質のアミノ酸配列における溶媒接触残基情報を決定し、溶媒接触残基情報／相互作用界面情報ファイル１０６ｃに格納する。また、相互作用界面情報決定部１０２ｅは、決定されたアミノ酸配列中の各アミノ酸残基における重水素化速度に従って、対象タンパク質および雛型タンパク質のアミノ酸配列における相互作用界面情報を決定し、溶媒接触残基情報／相互作用界面情報ファイル１０６ｃに格納する。 In addition, when the hydrogen-deuterium exchange reaction is performed, the protein structure analysis apparatus 100 uses the process of the deuteration rate determination unit 102j to measure the measured fragmentation spectrum and the determined fragmented ions. According to the attribution information, the deuteration rate at each amino acid residue in the amino acid sequences of the target protein and the template protein is determined. Then, the solvent contact residue information determination unit 102d identifies a portion in contact with the solvent in the amino acid sequences of the target protein and the template protein according to the deuteration rate at each amino acid residue in the determined amino acid sequence. The solvent contact residue information in the amino acid sequence of the target protein is determined according to the location, and stored in the solvent contact residue information / interaction interface information file 106c. Further, the interaction interface information determination unit 102e determines interaction interface information in the amino acid sequences of the target protein and the template protein according to the deuteration rate at each amino acid residue in the determined amino acid sequence, and the solvent contact residue The base information / interaction interface information file 106c is stored.

そして、処理結果出力部１０２ｈは、対象タンパク質および雛型タンパク質の予測された立体構造予測データと決定された切断容易領域情報や溶媒接触残基情報や相互作用界面情報とを対応させて出力する（ステップＳＭ−５）。 Then, the processing result output unit 102h outputs the predicted three-dimensional structure prediction data of the target protein and the template protein in association with the determined cleavable region information, solvent contact residue information, and interaction interface information ( Step SM-5).

また、予測構造評価情報決定部１０２ｇは、予測された立体構造予測データと、決定された切断容易領域情報や溶媒接触残基情報や相互作用界面情報とを比較して、切断容易領域情報により特定された切断容易領域や溶媒接触残基情報により特定された溶媒接触残基や相互作用界面情報により特定された相互作用界面に対応する予測構造部分について評価を行い予測構造評価情報を決定する。また、予測構造評価情報出力部１０２ｉは、決定された予測構造評価情報を出力する（ステップＳＭ−６）。これにて、実施例４の処理が終了する。 Further, the predicted structure evaluation information determination unit 102g compares the predicted three-dimensional structure prediction data with the determined easy-cleavage region information, solvent contact residue information, and interaction interface information, and specifies the easy-cleavage region information. The predicted structure evaluation information is determined by evaluating the predicted structure portion corresponding to the interaction interface specified by the solvent contact residue or the interaction interface information specified by the easily cleavable region or the solvent contact residue information. Also, the predicted structure evaluation information output unit 102i outputs the determined predicted structure evaluation information (step SM-6). This completes the process of the fourth embodiment.

なお、実施例４においては、ステップＳＭ−２における相同性検索において雛型タンパク質が抽出できない場合を一例に説明したが、相同性検索において雛型タンパク質が抽出できた場合には、その雛型タンパク質についてステップＳＭ−４において実験値が収集される。 In Example 4, the case where the template protein cannot be extracted in the homology search in step SM-2 has been described as an example. However, if the template protein can be extracted in the homology search, the template protein is extracted. Experimental values are collected in step SM-4.

［実施例５］
次に、本発明の実施例５の詳細について図１１および図１８等を参照して説明する。 [Example 5]
Next, details of the fifth embodiment of the present invention will be described with reference to FIGS.

（１）リコンビナント・ヒトインターロイキン−６（ＩＬ−６）の調製
ＩＬ−６は文献の方法により、大腸菌を用いて発現、精製を行った（Ｄ．Ｅｊｉｍａ，Ｂｉｏｔｅｃｈｎｏｌ．Ｂｉｏｅｎｇ．，６２，３０１−１０（１９９９）参照）。 (1) Preparation of Recombinant Human Interleukin-6 (IL-6) IL-6 was expressed and purified using Escherichia coli according to literature methods (D. Eima, Biotechnol. Bioeng., 62, 301-). 10 (1999)).

（２）ＩＬ−６のＦＴＩＣＲ−ＭＳ測定
すべての測定は、ｇａｓａｓｓｉｓｔｅｄｄｙｎａｍｉｃｔｒａｐｐｉｎｇ（ＧＡＤＴ）法で行った。ＧＡＤＴ法は、イオンを測定セル内に捕捉する際に、一時的にトラップ電圧を高くしてかつイオンの運動エネルギーを低下させるためにアルゴンガスをトラップガスとしてパルスでセル内に導入する方法である。これにより効率よくイオンをセル内に捕捉することが可能となり、感度が向上する。 (2) FTICR-MS measurement of IL-6 All measurements were performed by the gas assisted dynamic trapping (GADT) method. The GADT method is a method of introducing argon gas into the cell as a trap gas in a pulse to temporarily increase the trapping voltage and lower the kinetic energy of the ions when trapping the ions in the measurement cell. . As a result, ions can be efficiently trapped in the cell, and the sensitivity is improved.

測定機器及び分析条件等は下記の通りである。
・質量分析計：７Ｔセルフシールド超伝導マグネットフーリエ変換型質量分析計ＡｐｅｘＩＩ（ブルカー・ダルトニクス社製）；
・流速：６０μｌ／ｈ；
・乾燥ガス（Ｄｒｉｎｇｇａｓ）：２０ｐｓｉ、１５０℃；
・Ｎｅｂｕｌｉｚｉｎｇｇａｓ：３０ｐｓｉ；
・アルゴンガス圧：７．８Ｔｏｏｒ；
・ヘキサポール内蓄積時間：１秒；
・キャピラリー出口電圧：１０６．５ｋＶ（ＭＳスペクトル）、１５０ｋＶ（ＩＲＭＰＤ）、２２０ｋＶ（ヘキサポールＣＩＤ）；及び
・データ取り込みポイント数：５１２Ｋ。 Measuring instruments and analysis conditions are as follows.
Mass spectrometer: 7T self-shielded superconducting magnet Fourier transform type mass spectrometer Apex II (manufactured by Bruker Daltonics);
Flow rate: 60 μl / h;
• Dry gas: 20 psi, 150 ° C;
Nebulizing gas: 30 psi;
Argon gas pressure: 7.8 Toor;
-Hexapole accumulation time: 1 second;
Capillary outlet voltage: 106.5 kV (MS spectrum), 150 kV (IRMPD), 220 kV (Hexapol CID); and Number of data acquisition points: 512K.

（３）ＩＲＭＰＤ法及びヘキサポールＣＩＤ法によるＩＬ−６の断片化
ＩＲＭＰＤ法は、イオンサイクロトロンセル内に閉じ込めたイオンに対して、３００ミリ秒間炭酸ガスレーザーを照射した。また、ヘキサポールＣＩＤ法は、通常の測定と同じパルスシークエンスを用いて、キャピラリー出口電圧を上げることとヘキサポール内蓄積時間を１−３秒にすることで断片化を誘起することができた。ＦＴＩＣＲ−ＭＳスペクトルは全て、装置に付属のソフトウエアであるＸｍａｓｓを用いて解析を行った。断片化イオンの帰属は、ソフトウエアＦｒａｇ−Ｐｒｏ（ｍｓｍｓｓｏｆｔ社製）を用いた。 (3) Fragmentation of IL-6 by IRMPD method and hexapole CID method In the IRMPD method, ions confined in the ion cyclotron cell were irradiated with a carbon dioxide laser for 300 milliseconds. The hexapole CID method was able to induce fragmentation by increasing the capillary outlet voltage and setting the accumulation time in the hexapole to 1-3 seconds using the same pulse sequence as in the normal measurement. All FTICR-MS spectra were analyzed using Xmass, software that came with the instrument. For the assignment of the fragmented ions, software Frag-Pro (manufactured by msmssoft) was used.

（４）３Ｄ−１Ｄ法及びＰＳＩ−ＢＬＡＳＴによるホモロジー検索
ヒトインターロイキン−６（ＩＬ６）のアミノ酸配列は、ＰＩＲ（タンパク質アミノ酸配列データベース）より取得した。プレ配列を含むアミノ酸残基数は２１２残基であった。ＬＩＢＲＡを用いて３Ｄ−１Ｄ法によるホモロジー検索を行った結果、上位３０件のＰＤＢファイルリストとして、１ａｌｕ，１ｌｋｉ，１ａｕ１＿Ａ，１ｎｆ１＿Ａ，１ｑｋｍ＿Ａ，１ｊｎｋ，１ｐｍｅ，１ｄｇ９＿Ａ，１ｂ５ｌ，１ｂｇｃ，１ｙｒｇ＿Ａ，１ｅｕｖ＿Ａ，２ｔｐｓ＿Ｂ，１ｂｇ０，１ｉａｌ＿Ａ，２ｃｎｄ，１ｃｊｓ＿Ａ，１ｗｅｒ，７ｏｄｃ＿Ａ，１ｅｑｆ＿Ａ，１ｆ６ｆ＿Ａ，１ｄｘｙ，１ｅｄｔ，１ｇｓｅ＿Ａ，１ｑｒ２＿Ａ，１ｌｌｐ，１ｑｉ７＿Ａ，２ｂｉｄ＿Ａ，２ｌｊｒ＿Ａ，１ｃ３ｊ＿Ａを得た。 (4) Homology search by 3D-1D method and PSI-BLAST The amino acid sequence of human interleukin-6 (IL6) was obtained from PIR (protein amino acid sequence database). The number of amino acid residues including the presequence was 212 residues. As a result of performing a homology search by 3D-1D method using LIBRA, the top 30 PDB file lists are 1alu, 1lki, 1au1_A, 1nf1_A, 1qkm_A, 1jnk, 1pme, 1dg9_A, 1b5l, 1bgc, 1yrg_A, 2eups_A, 2eups , 1bg0, 1ial_A, 2cnd, 1cjs_A, 1wer, 7odc_A, 1eqf_A, 1f6f_A, 1dxy, 1edt, 1gse_A, 1qr2_A, 1llp, 1qi7_A, 2bid_A, 2ljr_A, 2ljr_A1

また、ＰＳＩ−ＢＬＡＳＴを用いて、ｅ値が０．０１以下の条件によるホモロジー検索を行った結果、ＰＤＢファイルリストとして、１ＩＬ６，１ＡＬＵ，１ＧＮＣ，１ＢＧＥ＿Ｂ，１ＢＧＣ，１ＲＨＧ＿Ｃ，１Ｉ１Ｒ＿Ｂを得た。この場合、上位６件はＩＬ６自身またはＩＬ６と立体構造の類似性が既に指摘されているタンパク質であるＧＣＳＦ（ＧｒａｎｕｌｏｃｙｔｅＣｏｌｏｎｙＳｔｉｍｕｌａｔｉｎｇＦａｃｔｏｒ）であるため、いずれの参照タンパク質についても立体構造予測の候補になり得ることが確認できた。 In addition, as a result of the homology search under the condition that the e value is 0.01 or less using PSI-BLAST, 1IL6, 1ALU, 1GNC, 1BGE_B, 1BGC, 1RHG_C, 1I1R_B were obtained as a PDB file list. In this case, the top six cases are GCSF (Granulocyte Colony Stimulating Factor), which is a protein that has already been pointed out to be similar to IL6 itself or three-dimensional structure, and therefore any reference protein can be a candidate for three-dimensional structure prediction. I was able to confirm it.

（５）アライメントに対する２次構造情報とＭＳの断片化データの照会
そこで、ＬＩＢＲＡによるホモロジー検索によって得られた３０種類の各アライメントに対して、参照タンパク質の２次構造情報とＭＳの断片化データを同時に照会するプログラムを作成し、結果をアライメントごとに表示した。図１８は、実施例５における処理結果の一例を示す図である（本図においてＡの後にＢが結合する）。この図に示すように処理結果の表示画面は、例えば、対象タンパク質のアミノ酸配列を表示するための表示領域ＭＢ−１、雛型タンパク質のアミノ酸配列を表示するための表示領域ＭＢ−２、立体構造予測データの一例である二次立体構造予測データを表示するための表示領域ＭＢ−３、および、実験データの断片化イオンのフラグメントを表示するための表示領域ＭＢ−４を含んで構成されている。図１８において、ＭＢ−３における２次構造の表示は、「１」はαヘリックス、「２」はβストランド、「４」はループ領域を示している。 (5) Inquiry of secondary structure information and MS fragmentation data for alignment Therefore, for each of the 30 types of alignment obtained by LIBRA homology search, secondary structure information and MS fragmentation data of the reference protein are obtained. A program to inquire at the same time was created and the results were displayed for each alignment. FIG. 18 is a diagram illustrating an example of a processing result in the fifth embodiment (B is combined after A in the drawing). As shown in this figure, the processing result display screen includes, for example, a display area MB-1 for displaying the amino acid sequence of the target protein, a display area MB-2 for displaying the amino acid sequence of the template protein, and a three-dimensional structure. It includes a display area MB-3 for displaying secondary tertiary structure prediction data, which is an example of prediction data, and a display area MB-4 for displaying fragments of fragmented ions of experimental data. . In FIG. 18, the secondary structure in MB-3 indicates that “1” is an α helix, “2” is a β strand, and “4” is a loop region.

（６）候補タンパク質の評価
２次構造情報と全ての断片化データの情報からアライメントごとに、候補タンパク質の評価（ランク付け）を行うプログラムを作成し、これを検証した結果、ＩＬ６の立体構造予測に対して最も有効な候補タンパク質はＩＬ６自身（１ａｌｕ）であること、次いで有効な候補タンパク質は、ＩＬ６と立体構造の類似性が既に指摘されているタンパク質であるＧＣＳＦ（１ｂｇｃ）であることが確認できた。 (6) Evaluation of candidate protein A program for evaluating (ranking) candidate proteins is created for each alignment from secondary structure information and information on all fragmentation data. It is confirmed that IL6 itself (1alu) is the most effective candidate protein for Gd, and that the next effective candidate protein is GCSF (1bgc), a protein that has already been pointed out to be similar in three-dimensional structure to IL6 did it.

ここで、図１１は、本実施例において断片化スペクトルを解析する場合の一例を示すフローチャートである。 Here, FIG. 11 is a flowchart showing an example of analyzing a fragmented spectrum in the present embodiment.

まず、帰属情報決定部１０２ｂは、断片化スペクトルデータファイル１０６ａに格納された断片化スペクトルを参照して、断片化スペクトルにおける各イオンのｍ／ｚや、価数の情報（分解能が低い場合、価数はわからない場合もある）を取得する（ステップＳＨ−１）。 First, the attribution information determination unit 102b refers to the fragmentation spectrum stored in the fragmentation spectrum data file 106a, and the m / z of each ion in the fragmentation spectrum and valence information (if the resolution is low, The number may not be known) (step SH-1).

そして、帰属情報決定部１０２ｂは、各断片化イオンのアミノ酸配列への帰属を求め、各イオンのｍ／ｚ（＋価数）に基づいて、候補ペプチド断片を検索・抽出・表示する（ステップＳＨ−２）。 Then, the attribution information determination unit 102b obtains the attribution of each fragmented ion to the amino acid sequence, and searches, extracts, and displays candidate peptide fragments based on the m / z (+ valence) of each ion (step SH). -2).

そして、切断容易領域情報決定部１０２ｃは、切断が容易な部位を抽出して切断容易領域情報として切断容易領域情報ファイル１０６ｂに格納する（ステップＳＨ−３）。 Then, the easy-to-cut area information determination unit 102c extracts a part that can be easily cut and stores it in the easy-to-cut area information file 106b as easy-to-cut area information (step SH-3).

そして、処理結果出力部１０２ｈは、上述した図１２に示すように、立体構造表示ソフトを用いて切断容易領域情報に基づいて切断部位の色を変えるように制御して表示する（ステップＳＨ−４）。これにて、実施例５の処理が終了する。 Then, as shown in FIG. 12 described above, the processing result output unit 102h controls and displays the three-dimensional structure display software so as to change the color of the cut site based on the easy-to-cut region information (step SH-4). ). This completes the processing of the fifth embodiment.

［実施例６］
次に、本発明の実施例６の詳細について図２６等を参照して説明する。 [Example 6]
Next, details of the sixth embodiment of the present invention will be described with reference to FIG.

（１）ユビキチンのＦＴＩＣＲ−ＭＳによる断片化スペクトルの測定と帰属
ウシ由来ユビキチン（シグマ社製コードＮｏ．Ｕ６２５３）を５ｐｍｏｌ／μＬになるように１％酢酸−５０％メタノール水溶液に溶解したサンプルを使用した。全ての測定は、ＧＡＤＴ法で行った。 (1) Measurement and assignment of fragmentation spectrum of ubiquitin by FTICR-MS Use a sample obtained by dissolving bovine-derived ubiquitin (Sigma Code No. U6253) in 1% acetic acid-50% methanol aqueous solution to 5 pmol / μL. did. All measurements were performed by the GADT method.

測定機器及び分析条件等は下記の通りである。
・質量分析計：ナノエレクトロスプレーイオン源を装着した７Ｔセルフシールド超伝導マグネットフーリエ変換型質量分析計ＡｐｅｘＩＩ（ブルカー・ダルトニクス社製）；
・流速：２００ｎｌ／ｍｉｎ；
・乾燥ガス：５ｐｓｉ、２００℃；
・ヘキサポール内蓄積時間：０．６秒；
・キャピラリー電圧：１８００Ｖ；
・キャピラリー出口電圧：９０Ｖ；及び
・データ取り込みポイント数：５１２Ｋ。 Measuring instruments and analysis conditions are as follows.
Mass spectrometer: 7T self-shielded superconducting magnet Fourier transform mass spectrometer equipped with a nanoelectrospray ion source Apex II (Bruker Daltonics);
-Flow rate: 200 nl / min;
Dry gas: 5 psi, 200 ° C .;
-Hexapole accumulation time: 0.6 seconds;
-Capillary voltage: 1800V;
Capillary outlet voltage: 90V; and Number of data acquisition points: 512K.

本実施例において、高いキャピラリー出口電圧によるノイズスキマーＣＩＤ法を用いて、ユビキチンの断片化を誘起することができた。ＦＴＩＣＲ−ＭＳスペクトルの解析と断片化イオンの帰属は、実施例５と同じ方法で行った。アミノ酸配列はＰＩＲより取得した。 In this example, ubiquitin fragmentation could be induced using the noise skimmer CID method with a high capillary exit voltage. Analysis of FTICR-MS spectrum and assignment of fragmented ions were performed in the same manner as in Example 5. The amino acid sequence was obtained from PIR.

（２）立体構造とＭＳ断片化データの照会
ユビキチンは既に立体構造が明らかとなっている。ＰＤＢ（タンパク質立体構造データバンク）より、ユビキチンの構造ファイル（ＰＤＢファイル；１ＵＢＩ．ｐｄｂ）を取得し、立体構造表示ソフト「ＶｉｅｗｅｒＬｉｔｅ（製品名）」（Ａｃｃｅｌｒｙｓ（会社名）製）を用いて、図２６に示すようにユビキチンの立体構造をリボンモデルで表示した。図２６は、本発明における本システムの実施例６におけるウシ由来ユビキチンの立体構造の一例を示す図である。図２６において、αへリックスを赤、βストランドを水色で表示し、更にＭＳで得られた断片化部位を黄色で表示した。その結果、αへリックスの末端やβストランドの断片化が生じ易く、２次構造を有するαへリックス内部やβストランド内部では生じにくい傾向にあることがわかった。これにて、実施例６の説明が終了する。 (2) Query of 3D structure and MS fragmentation data The 3D structure of ubiquitin has already been clarified. Obtain a ubiquitin structure file (PDB file; 1UBI.pdb) from PDB (Protein 3D Structure Data Bank), and use 3D structure display software “ViewerLite (product name)” (manufactured by Accelrys (company name)). As shown in FIG. 26, the three-dimensional structure of ubiquitin was displayed as a ribbon model. FIG. 26 is a diagram showing an example of the three-dimensional structure of bovine-derived ubiquitin in Example 6 of the present system according to the present invention. In FIG. 26, α-helix is displayed in red, β-strand is displayed in light blue, and a fragmentation site obtained by MS is displayed in yellow. As a result, it was found that the α-helix ends and β-strands are likely to be fragmented, and the α-helix and β-strands having a secondary structure tend not to be generated. This completes the description of the sixth embodiment.

［実施例７］
次に、本発明の実施例７の詳細について図２７等を参照して説明する。 [Example 7]
Next, details of the seventh embodiment of the present invention will be described with reference to FIG.

（１）アルコール脱水素酵素のＦＴＩＣＲ−ＭＳによる断片化スペクトルの測定と帰属
酵母由来アルコール脱水素酵素（シグマ社製コードＮｏ．）を５ｐｍｏｌ／μＬになるように１％酢酸−５０％メタノール水溶液に溶解したサンプルを使用した。全ての測定は、ＧＡＤＴ法で行った。 (1) Measurement and assignment of fragmentation spectrum of alcohol dehydrogenase by FTICR-MS 1% acetic acid-50% methanol aqueous solution of yeast-derived alcohol dehydrogenase (Code No. made by Sigma) to 5 pmol / μL. Dissolved samples were used. All measurements were performed by the GADT method.

測定機器及び分析条件等は下記の通りである。
・質量分析計：ナノエレクトロスプレーイオン源を装着した７Ｔセルフシールド超伝導マグネットフーリエ変換型質量分析計ＡｐｅｘＩＩ（ブルカー・ダルトニクス社製）；
・流速：２００ｎｌ／ｍｉｎ；
・乾燥ガス：５ｐｓｉ、２００℃；
・ヘキサポール内蓄積時間：１秒；
・キャピラリー電圧：１７５０Ｖ；
・キャピラリー出口電圧：１７０Ｖ；及び
・データ取り込みポイント数：５１２Ｋ。 Measuring instruments and analysis conditions are as follows.
Mass spectrometer: 7T self-shielded superconducting magnet Fourier transform mass spectrometer equipped with a nanoelectrospray ion source Apex II (Bruker Daltonics);
-Flow rate: 200 nl / min;
Dry gas: 5 psi, 200 ° C .;
-Hexapole accumulation time: 1 second;
-Capillary voltage: 1750V;
-Capillary outlet voltage: 170V; and-Data acquisition point number: 512K.

本実施例において、キャピラリー出口の電圧を上げることによるヘキサポールＣＩＤ法を用いて、アルコール脱水素酵素の断片化を誘起することができた。ＦＴＩＣＲ−ＭＳスペクトルの解析と断片化イオンの帰属は、実施例５と同じ方法で行った。アミノ酸配列はＰＩＲより取得した。 In this example, the fragmentation of alcohol dehydrogenase could be induced using the hexapole CID method by increasing the voltage at the capillary outlet. Analysis of FTICR-MS spectrum and assignment of fragmented ions were performed in the same manner as in Example 5. The amino acid sequence was obtained from PIR.

（２）アルコール脱水素酵素のモデリングとＭＳ断片化データの照会
全自動タンパク質モデリングソフトＦＡＭＳを用いて、アルコール脱水素酵素の立体構造モデルを作成した。立体構造表示ソフト「ＶｉｅｗｅｒＬｉｔｅ（製品名）」（Ａｃｃｅｌｒｙｓ（会社名）製）を用いて、図２７に示すようにアルコール脱水素酵素の立体構造をリボンモデルで表示した。図２７は、本発明における本システムの実施例７における酵母由来アルコール脱水素酵素の立体構造の一例を示す図である。図２７において、αへリックスを赤、βストランドを水色で表示し、更にＭＳで得られた断片化部位を黄色で表示した。その結果、モデル構造上、αへリックスやβストランドの末端に断片化部位が多く、αへリックス内部やβストランド内部で少ない傾向にあることが視覚的に容易にわかり、モデル構造が実験データとよく一致した。これにて、実施例７の説明が終了する。 (2) Modeling of alcohol dehydrogenase and inquiry of MS fragmentation data A three-dimensional structure model of alcohol dehydrogenase was created using fully automated protein modeling software FAMS. Three-dimensional structure display software “ViewerLite (product name)” (manufactured by Accelrys (company name)) was used to display the three-dimensional structure of alcohol dehydrogenase as a ribbon model as shown in FIG. FIG. 27 is a diagram showing an example of the three-dimensional structure of a yeast-derived alcohol dehydrogenase in Example 7 of the present system in the present invention. In FIG. 27, the α helix is displayed in red, the β strand is displayed in light blue, and the fragmentation site obtained by MS is displayed in yellow. As a result, it can be easily seen visually that the model structure has many fragmentation sites at the ends of the α helix and β strands and tends to be less inside the α helix and β strands. Matched well. This completes the description of the seventh embodiment.

［実施例８；実験結果により興味の対象となったタンパク質の立体構造表示］
次に、本発明の実施例８の詳細について図２１〜図２５等を参照して説明する。
図２１は、本発明における本システムの実施例８の一例を示すフローチャートである。 [Example 8: Three-dimensional structure display of protein of interest based on experimental results]
Next, details of the eighth embodiment of the present invention will be described with reference to FIGS.
FIG. 21 is a flowchart showing an example of the eighth embodiment of the present system according to the present invention.

本実施例は、図２１に示すように、ＤＮＡマイクロアレイ等によるｍＲＮＡ発現プロファイリング等の発現解析結果や、プロテオーム解析で得られたＭＳによる同定結果（例えば、Ｍａｓｃｏｔ、ＳＥＱＵＥＳＴ等）や、比較ゲノム解析の結果や、タンパク質相互作用解析（例えば、酵母２ハイブリッド（ｙｅａｓｔｔｗｏ−ｈｙｂｒｉｄ）法やＴＡＰ法（ＡｎｕｊＫｕｍｅｒ，，Ｎａｔｕｒｅ、４１５，１２３−１２４（２００２）））の結果や、例えばＫＥＧＧ等の代謝パスウエイマップやシグナル伝達マップ上で興味深い遺伝子のアクセッション番号等の識別番号を指定（例えば、画面上でクリック等）することにより、立体構造データベース（例えば、ＰＤＢ等）やモデル構造データベース（例えば、ＦＡＭＳＢａｓｅ等）から立体構造データを検索・抽出し、立体構造表示プログラム（例えば、ＲａｓＭｏｌ等）を用いて表示する。 In this example, as shown in FIG. 21, the results of expression analysis such as mRNA expression profiling by a DNA microarray, the identification result by MS obtained by proteome analysis (for example, Mascot, SEQUEST, etc.), comparative genome analysis Results, protein interaction analysis (eg, yeast two-hybrid method or TAP method (Anuj Kumer, Nature, 415, 123-124 (2002))), metabolic pathways such as KEGG, etc. By specifying an identification number such as an accession number of an interesting gene on a map or signal transmission map (for example, clicking on the screen), a three-dimensional structure database (for example, PDB) or a model structure database (for example, FAMSBase) From) Body structure data is retrieved / extracted and displayed using a three-dimensional structure display program (for example, RasMol).

以下に、Ａｎａｂａｅｎａｓｐ．ＰＣＣ７１２０株のゲノムについて、転写量の増減と立体構造との相関を表す場合を一例に説明する。本実施例は、ＤＮＡマイクロアレイによるｍＲＮＡ転写の量転写量をマッピングしたデータと立体構造データベースとをリンクさせることにより、転写量の増減と立体構造との相関を明確化することを目的とする表示プログラムである。 Below, Anabaena sp. The case where the correlation between the increase / decrease in the transcription amount and the three-dimensional structure of the genome of the PCC7120 strain is described as an example. This embodiment is a display program for clarifying the correlation between increase / decrease in transcription amount and three-dimensional structure by linking the amount of mRNA transcription by DNA microarray and the three-dimensional structure database. It is.

図２２は、ＤＮＡマイクロアレイによる発現量の違いを色別に表示した表示画面の一例を示す図である。本画面中の小さなマスは、遺伝子一つ一つを示し、「赤」、「青」、「緑」は転写量の「増」、「減」、「不変」をそれぞれ表し、色の「濃」、「淡」は立体構造データベース（例えば、ＰＤＢ等）との配列的な相同性の「有」、「無」を表す。 FIG. 22 is a diagram showing an example of a display screen displaying the difference in expression level by DNA microarray for each color. Small squares on this screen indicate genes one by one, and “red”, “blue”, and “green” indicate “increase”, “decrease”, and “invariance” of the transfer amount, respectively. "And" light "indicate" present "or" no "of sequence homology with a three-dimensional structure database (for example, PDB).

ここで、利用者が興味深い遺伝子に対応するマスの一つをクリックすると、図２３に示すように、ＳＣＯＰ（ＳｔｕｒｕｃｔｕｒａｌＣｌａｓｓｉｆｉｃａｔｉｏｎｏｆＰｒｏｔｅｉｎｓ）の分類番号と、鋳型のＰＤＢ等のＩＤ（識別番号）が示され、さらに、ホモロジーの高いＰＤＢデータ（本図の例では上位五位）を示すクリックボタン（図２３においてアンダーライン表示された部分）が表示される。 Here, when the user clicks on one of the cells corresponding to the interesting gene, as shown in FIG. 23, the SCOP (Structural Classification of Proteins) classification number and the ID (identification number) such as the PDB of the template are displayed. In addition, a click button (the underlined portion in FIG. 23) indicating the PDB data with high homology (the top five in the example of this figure) is displayed.

利用者がクリックボタンをクリックすると、図２４に示すようなアライメントの表示画面や図２５に示すようなＲａｓＭｏｌ等のグラフィックソフトウェアによる立体構造の表示画面を表示することができる。 When the user clicks the click button, an alignment display screen as shown in FIG. 24 or a three-dimensional structure display screen by graphic software such as RasMol as shown in FIG. 25 can be displayed.

本実施例によると、ＤＮＡマイクロアレイ等によるトランスクリプトーム解析や、プロテオーム解析（ＭＳによるタンパク同定結果など）や、ゲノム解析（バイオインフォマティクス）等によりディスプレイ上に表示された注目すべき遺伝子・蛋白質の識別番号（アクセッション番号等）をクリックすると、立体構造および構造予測データベースから、その立体構造を引き出して３Ｄ表示をすることができるようになる。これにて、実施例８の処理が終了する。 According to this example, identification of notable genes and proteins displayed on the display by transcriptome analysis using DNA microarray, proteome analysis (protein identification results by MS, etc.), genome analysis (bioinformatics), etc. When a number (accession number or the like) is clicked, the 3D structure can be extracted from the 3D structure and structure prediction database and displayed in 3D. This completes the process according to the eighth embodiment.

［他の実施の形態］
さて、これまで本発明の実施の形態について説明したが、本発明は、上述した実施の形態以外にも、上記特許請求の範囲に記載した技術的思想の範囲内において種々の異なる実施の形態にて実施されてよいものである。 [Other embodiments]
Although the embodiments of the present invention have been described so far, the present invention can be applied to various different embodiments in addition to the above-described embodiments within the scope of the technical idea described in the claims. May be implemented.

例えば、タンパク質構造解析装置１００がスタンドアローンの形態で処理を行う場合を一例に説明したが、タンパク質構造解析装置１００とは別筐体で構成されるクライアント端末からの要求に応じて処理を行い、その処理結果を当該クライアント端末に返却するように構成してもよい。 For example, the case where the protein structure analysis apparatus 100 performs processing in a stand-alone form has been described as an example. However, the protein structure analysis apparatus 100 performs processing in response to a request from a client terminal configured in a separate housing from the protein structure analysis apparatus 100, You may comprise so that the process result may be returned to the said client terminal.

また、上述した実施例においては、溶媒接触残基や相互作用部位界面情報等を水素−重水素交換法を用いて得る場合を一例に説明したが、本発明はこのような場合に限定されるものではなく、例えば、化学修飾法などを用いて得てもよい。例えば、トランスグルタミナーゼを用いてタンパク質のグルタミン残基に安定同位体１５Ｎを選択的に導入することができる（Ｎ．Ｓｈｉｍｂａ，Ｎ．Ｙａｍａｄａ，Ｋ．Ｙｏｋｏｙａｍａ，Ｅ．Ｓｕｚｕｋｉ，Ａｎａｌ．Ｂｉｏｃｈｅｍ．，３０１，１２３−１２７（２００２））。この方法を上述した実施の形態と組み合わせることにより、グルタミン残基ごとの溶媒接触度の違いをラベル化率から求めることができるので、溶媒接触残基や相互作用部位界面情報等を水素−重水素交換法を用いる場合と同様に得ることができるようになる。 In the above-described embodiments, the case where the solvent contact residue, interaction site interface information, and the like are obtained using the hydrogen-deuterium exchange method has been described as an example. However, the present invention is limited to such a case. For example, it may be obtained by a chemical modification method or the like. For example, a stable isotope 15N can be selectively introduced into a glutamine residue of a protein using transglutaminase (N. Shimba, N. Yamada, K. Yokoyama, E. Suzuki, Anal. Biochem., 301, 123-127 (2002)). By combining this method with the above-described embodiment, the difference in the degree of solvent contact for each glutamine residue can be determined from the labeling rate, so that the solvent contact residue, interaction site interface information, etc. can be determined from hydrogen-deuterium. It can be obtained in the same manner as when the exchange method is used.

また、上述した実施例においては、ＭＳにより断片化を行う場合を一例に説明したが、本発明はこのような場合に限定されるものではなく、例えば、酵素消化法などを用いて得てもよい。すなわち、ペプシンやトリプシンなどの消化酵素などを用いた酵素反応によりタンパク質を断片化してもよい。 In the above-described examples, the case where fragmentation is performed by MS has been described as an example. However, the present invention is not limited to such a case, and may be obtained using, for example, an enzyme digestion method. Good. That is, the protein may be fragmented by an enzymatic reaction using digestive enzymes such as pepsin and trypsin.

また、立体構造予測処理部１０２ｆは、タンパク質構造解析装置１００に備える予測プログラムを用いて予測してもよく、また、外部システム２００に備える予測プログラムを用いて立体構造予測データを作成し、その立体構造予測データをネットワーク３００経由で取得してもよい。 The three-dimensional structure prediction processing unit 102f may perform prediction using a prediction program provided in the protein structure analysis apparatus 100. Alternatively, the three-dimensional structure prediction processing unit 102f creates three-dimensional structure prediction data using a prediction program provided in the external system 200. The structure prediction data may be acquired via the network 300.

また、実施形態において説明した各処理のうち、自動的に行なわれるものとして説明した処理の全部または一部を手動的に行うこともでき、あるいは、手動的に行なわれるものとして説明した処理の全部または一部を公知の方法で自動的に行うこともできる。 In addition, among the processes described in the embodiment, all or part of the processes described as being performed automatically can be performed manually, or all of the processes described as being performed manually are performed. Alternatively, a part can be automatically performed by a known method.

この他、上記文書中や図面中で示した処理手順、制御手順、具体的名称、各種の登録データや検索条件等のパラメータを含む情報、画面例、データベース構成については、特記する場合を除いて任意に変更することができる。 In addition, the processing procedures, control procedures, specific names, information including parameters such as various registration data and search conditions, screen examples, and database configurations shown in the above documents and drawings, unless otherwise specified. It can be changed arbitrarily.

また、タンパク質構造解析装置１００に関して、図示の各構成要素は機能概念的なものであり、必ずしも物理的に図示の如く構成されていることを要しない。 Moreover, regarding the protein structure analysis apparatus 100, each illustrated component is functionally conceptual and does not necessarily need to be physically configured as illustrated.

例えば、タンパク質構造解析装置１００の各部または各装置が備える処理機能、特に制御部１０２にて行なわれる各処理機能については、その全部または任意の一部を、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）および当該ＣＰＵにて解釈実行されるプログラムにて実現することができ、あるいは、ワイヤードロジックによるハードウェアとして実現することも可能である。なお、プログラムは、後述する記録媒体に記録されており、必要に応じてタンパク質構造解析装置１００に機械的に読み取られる。 For example, the processing functions of each unit or each device of the protein structure analysis apparatus 100, particularly the processing functions performed by the control unit 102, are all or any part thereof are assigned to a CPU (Central Processing Unit) and the CPU. It can be realized by a program that is interpreted and executed, or can be realized as hardware by wired logic. The program is recorded on a recording medium to be described later, and is mechanically read by the protein structure analysis apparatus 100 as necessary.

すなわち、ＲＯＭまたはＨＤなどの記憶部１０６などには、ＯＳ（ＯｐｅｒａｔｉｎｇＳｙｓｔｅｍ）と協働してＣＰＵに命令を与え、各種処理を行うためのコンピュータプログラムが記録されている。このコンピュータプログラムは、ＲＡＭ等にロードされることによって実行され、ＣＰＵと協働して制御部１０２を構成する。また、このコンピュータプログラムは、タンパク質構造解析装置１００に対して任意のネットワーク３００を介して接続されたアプリケーションプログラムサーバに記録されてもよく、必要に応じてその全部または一部をダウンロードすることも可能である。 That is, in the storage unit 106 such as a ROM or an HD, computer programs for giving instructions to the CPU and performing various processes in cooperation with an OS (Operating System) are recorded. This computer program is executed by being loaded into a RAM or the like, and constitutes the control unit 102 in cooperation with the CPU. The computer program may be recorded in an application program server connected to the protein structure analysis apparatus 100 via an arbitrary network 300, and may be downloaded in whole or in part as necessary. It is.

また、本発明にかかるプログラムを、コンピュータ読み取り可能な記録媒体に格納することもできる。ここで、この「記録媒体」とは、フレキシブルディスク、光磁気ディスク、ＲＯＭ、ＥＰＲＯＭ、ＥＥＰＲＯＭ、ＣＤ−ＲＯＭ、ＭＯ、ＤＶＤ等の任意の「可搬用の物理媒体」や、各種コンピュータシステムに内蔵されるＲＯＭ、ＲＡＭ、ＨＤ等の任意の「固定用の物理媒体」、あるいは、ＬＡＮ、ＷＡＮ、インターネットに代表されるネットワークを介してプログラムを送信する場合の通信回線や搬送波のように、短期にプログラムを保持する「通信媒体」を含むものとする。 The program according to the present invention can also be stored in a computer-readable recording medium. Here, the “recording medium” is an arbitrary “portable physical medium” such as a flexible disk, a magneto-optical disk, a ROM, an EPROM, an EEPROM, a CD-ROM, an MO, and a DVD, and is incorporated in various computer systems. Program in a short time, such as a communication line or carrier wave when transmitting a program via any “fixed physical medium” such as ROM, RAM, HD, or a network such as LAN, WAN, or the Internet The “communication medium” that holds

また、「プログラム」とは、任意の言語や記述方法にて記述されたデータ処理方法であり、ソースコードやバイナリコード等の形式を問わない。なお、「プログラム」は必ずしも単一的に構成されるものに限られず、複数のモジュールやライブラリとして分散構成されるものや、ＯＳ（ＯｐｅｒａｔｉｎｇＳｙｓｔｅｍ）に代表される別個のプログラムと協働してその機能を達成するものをも含む。なお、実施の形態に示した各装置において記録媒体を読み取るための具体的な構成、読み取り手順、あるいは、読み取り後のインストール手順等については、周知の構成や手順を用いることができる。 The “program” is a data processing method described in an arbitrary language or description method, and may be in any format such as source code or binary code. The “program” is not necessarily limited to a single configuration, but is distributed in the form of a plurality of modules and libraries, or in cooperation with a separate program represented by an OS (Operating System). Including those that achieve the function. Note that a well-known configuration and procedure can be used for a specific configuration for reading a recording medium, a reading procedure, an installation procedure after reading, and the like in each device described in the embodiment.

記憶部１０６に格納される各種のデータベース等（断片化スペクトルデータファイル１０６ａ〜処理結果データファイル１０６ｆ）は、ＲＡＭ、ＲＯＭ等のメモリ装置、ハードディスク等の固定ディスク装置、フレキシブルディスク、光ディスク等のストレージ手段であり、各種処理やウェブサイト提供に用いる各種のプログラムやテーブルやファイルやデータベースやウェブページ用ファイル等を格納する。 Various databases and the like (fragmented spectrum data file 106a to processing result data file 106f) stored in the storage unit 106 are memory means such as RAM and ROM, fixed disk devices such as hard disks, storage means such as flexible disks and optical disks. It stores various programs, tables, files, databases, web page files, etc. used for various processes and website provision.

また、タンパク質構造解析装置１００は、既知のパーソナルコンピュータ、ワークステーション等の情報処理端末等の情報処理装置にプリンタやモニタやイメージスキャナ等の周辺装置を接続し、該情報処理装置に本発明の方法を実現させるソフトウェア（プログラム、データ等を含む）を実装することにより実現してもよい。 The protein structure analyzing apparatus 100 connects a peripheral device such as a printer, a monitor or an image scanner to an information processing apparatus such as an information processing terminal such as a known personal computer or workstation, and the method of the present invention is connected to the information processing apparatus. You may implement | achieve by mounting the software (a program, data, etc. are included) which implement | achieve.

さらに、タンパク質構造解析装置１００の分散・統合の具体的形態は図示のものに限られず、その全部または一部を、各種の負荷等に応じた任意の単位で、機能的または物理的に分散・統合して構成することができる。例えば、各データベースを独立したデータベース装置として独立に構成してもよく、また、処理の一部をＣＧＩ（ＣｏｍｍｏｎＧａｔｅｗａｙＩｎｔｅｒｆａｃｅ）を用いて実現してもよい。また、例えば、タンパク質構造解析装置１００は、質量分析装置４００と一体として構成してもよい。 Further, the specific form of dispersion / integration of the protein structure analysis apparatus 100 is not limited to that shown in the figure, and all or part of the protein structure analysis apparatus 100 may be functionally or physically dispersed / arbitrary in arbitrary units according to various loads. Can be integrated and configured. For example, each database may be independently configured as an independent database device, and a part of the processing may be realized by using CGI (Common Gateway Interface). For example, the protein structure analysis apparatus 100 may be configured integrally with the mass spectrometry apparatus 400.

また、ネットワーク３００は、タンパク質構造解析装置１００と外部システム２００とを相互に接続する機能を有し、例えば、インターネットや、イントラネットや、ＬＡＮ（有線／無線の双方を含む）や、ＶＡＮや、パソコン通信網や、公衆電話網（アナログ／デジタルの双方を含む）や、専用回線網（アナログ／デジタルの双方を含む）や、ＣＡＴＶ網や、ＩＭＴ２０００方式、ＧＳＭ方式またはＰＤＣ／ＰＤＣ−Ｐ方式等の携帯回線交換網／携帯パケット交換網や、無線呼出網や、Ｂｌｕｅｔｏｏｔｈ等の局所無線網や、ＰＨＳ網や、ＣＳ、ＢＳまたはＩＳＤＢ等の衛星通信網等のうちいずれかを含んでもよい。すなわち、本システムは、有線・無線を問わず任意のネットワークを介して、各種データを送受信することができる。 The network 300 has a function of mutually connecting the protein structure analysis apparatus 100 and the external system 200. For example, the Internet, an intranet, a LAN (including both wired and wireless), a VAN, a personal computer, and the like. Communication networks, public telephone networks (including both analog / digital), leased line networks (including both analog / digital), CATV networks, IMT2000 system, GSM system, PDC / PDC-P system, etc. It may include any one of a mobile circuit switching network / mobile packet switching network, a wireless paging network, a local wireless network such as Bluetooth, a PHS network, a satellite communication network such as CS, BS, or ISDB. That is, this system can transmit and receive various data via any network regardless of wired or wireless.

以上詳細に説明したように、本発明によれば、立体構造を予測する対象タンパク質についてエレクトロスプレーイオン化法などによりイオン化したイオンをヘキサポールＣＩＤ法などにより断片化イオンに断片化し、断片化スペクトルを測定し、測定された断片化スペクトルに基づいて、対象タンパク質のアミノ酸配列に対する断片化イオンの帰属情報を決定し、決定された断片化イオンの帰属情報により、対象タンパク質のアミノ酸配列において断片化イオンに断片化された箇所を特定し、当該箇所に従って対象タンパク質のアミノ酸配列における切断容易領域情報を決定し、対象タンパク質について立体構造を予測し、予測された立体構造予測データと決定された切断容易領域情報とを対応させて出力する（例えば、グラフィック表示、テーブルによる一覧表示など）ので、立体構造未知のタンパク質（これをコードする遺伝子を含む）の立体構造予測を行う場合、実測した立体構造情報を加味することにより、精度の高い予測を行うことができるタンパク質構造解析方法、タンパク質構造解析装置、プログラム、および、記録媒体を提供することができる。 As described above in detail, according to the present invention, ions ionized by an electrospray ionization method or the like for a target protein for predicting a three-dimensional structure are fragmented into fragmented ions by a hexapole CID method or the like, and a fragmentation spectrum is measured. Then, based on the measured fragmentation spectrum, the assignment information of the fragment ion to the amino acid sequence of the target protein is determined, and the fragment information is fragmented in the amino acid sequence of the target protein by the determined assignment information of the fragment ion. And the cleavable region information in the amino acid sequence of the target protein is determined according to the portion, the three-dimensional structure is predicted for the target protein, the predicted three-dimensional structure prediction data and the determined cleavable region information (For example, graphic display, Therefore, when predicting the three-dimensional structure of proteins with unknown three-dimensional structures (including the genes that encode them), it is possible to make a highly accurate prediction by taking into account the measured three-dimensional structure information. A protein structure analysis method, a protein structure analysis apparatus, a program, and a recording medium can be provided.

また、本発明によれば、立体構造予測データを針金モデル、リボンモデル、パイプモデル、ボールアンドスティックモデル、または、空間充填モデルのうちいずれかのモデルによりグラフィック表示し、切断容易領域情報を対応する立体構造予測データの表示部分に関連付けて表示する（例えば、リンクを設定、モデル上に表示、モデル上に切断容易領域情報に対応する網掛けや斜線などの特定のパターン模様を表示など）ので、これらの情報を視覚的に分かり易く表示することができるようになり、利用者が立体構造予測データの信頼性を直感的に判断することができるタンパク質構造解析方法、タンパク質構造解析装置、プログラム、および、記録媒体を提供することができる。 Further, according to the present invention, the three-dimensional structure prediction data is graphically displayed by any one of a wire model, a ribbon model, a pipe model, a ball and stick model, or a space filling model, and the easy-to-cut area information is supported. Since it is displayed in association with the display part of the 3D structure prediction data (for example, setting a link, displaying on the model, displaying a specific pattern pattern such as shading or diagonal lines corresponding to the easy-to-cut area information on the model, etc.) A protein structure analysis method, a protein structure analysis apparatus, a program, and a program that can display such information visually and in an easily understandable manner, and allow the user to intuitively determine the reliability of the three-dimensional structure prediction data. A recording medium can be provided.

また、本発明によれば、グラフィック表示やテーブル表示を行う際に、切断容易領域情報に従って対応する上記立体構造予測データの表示部分を色分けして表示するので、これらの情報を視覚的に分かり易く表示することができるようになり、利用者が立体構造予測データの信頼性を直感的に判断することができるタンパク質構造解析方法、タンパク質構造解析装置、プログラム、および、記録媒体を提供することができる。 Further, according to the present invention, when the graphic display or the table display is performed, the display portion of the corresponding three-dimensional structure prediction data is displayed in different colors according to the easy-to-cut area information. It is possible to provide a protein structure analysis method, a protein structure analysis apparatus, a program, and a recording medium that can be displayed, and allow the user to intuitively determine the reliability of the three-dimensional structure prediction data. .

また、本発明によれば、予測された立体構造予測データと、決定された切断容易領域情報とを比較して、切断容易領域情報により特定された切断容易領域に対応する予測構造部分について評価を行い予測構造評価情報を決定し、決定された予測構造評価情報を出力するので、計算機により予測された予測構造を生化学実験データに基づいて評価を行うことができるようになり、予測精度を著しく向上させることができるタンパク質構造解析方法、タンパク質構造解析装置、プログラム、および、記録媒体を提供することができる。 Further, according to the present invention, the predicted three-dimensional structure prediction data is compared with the determined easy-cleavage area information to evaluate the predicted structure portion corresponding to the easy-cut area specified by the easy-cleavage area information. The predicted structure evaluation information is determined and the determined predicted structure evaluation information is output, so that the predicted structure predicted by the computer can be evaluated based on the biochemical experiment data, and the prediction accuracy is remarkably improved. A protein structure analysis method, a protein structure analysis apparatus, a program, and a recording medium that can be improved can be provided.

また、これにより、ゲノム配列解析、ＤＮＡチップを用いた発現プロファイリング解析、プロテオーム解析などで見つかった興味深い遺伝子・タンパク質の機能解析を行う場合、立体構造を基準とした機能予測の精度が格段に向上するようになり、また、ゲノム配列解析においては、機能未知の遺伝子・タンパク質の立体構造を考慮した機能予測を行うことにより効率的に機能を推定することができるようになる。さらに、これにより、阻害剤などのドラッグデザインや蛋白質工学による活性向上や機能改変体のデザインを効率的に行うことができるようになる。 This also greatly improves the accuracy of function prediction based on the three-dimensional structure when performing functional analysis of interesting genes and proteins found by genome sequence analysis, expression profiling analysis using DNA chips, proteome analysis, etc. In genome sequence analysis, the function can be efficiently estimated by performing function prediction in consideration of the three-dimensional structure of a gene / protein whose function is unknown. Further, this makes it possible to efficiently design drugs such as inhibitors, improve activity by protein engineering, and design functional modifications.

また、本発明によれば、対象タンパク質について基準振動計算を行い、基準振動計算結果（例えば、揺らぎの情報など）を対応する立体構造予測データの表示部分に関連付けて表示する（例えば、立体構造モデル上にベクトル表示を行う）ので、切断容易領域として揺らぎの情報を考慮することができるようになる。すなわち、基準振動計算結果により揺らぎ情報を求め、ＭＳ実測データと比較することにより予測モデルの評価を行うことができるタンパク質構造解析方法、タンパク質構造解析装置、プログラム、および、記録媒体を提供することができる。 In addition, according to the present invention, the reference vibration calculation is performed on the target protein, and the reference vibration calculation result (for example, fluctuation information) is displayed in association with the display portion of the corresponding three-dimensional structure prediction data (for example, the three-dimensional structure model). Since the vector display is performed above), the fluctuation information can be considered as the easy-to-cut region. That is, it is possible to provide a protein structure analysis method, a protein structure analysis apparatus, a program, and a recording medium capable of obtaining fluctuation information from the reference vibration calculation result and evaluating the prediction model by comparing with the MS actual measurement data. it can.

また、本発明によれば、立体構造を予測する対象タンパク質についてヘキサポールＣＩＤ法、ノズルスキマーＣＩＤ法、キャピラリースキマーＣＩＤ法、ＳＯＲＩ−ＣＩＤ法、および、マルチポールストアー補助キャピラリースキマーＣＩＤ法を含む衝突活性化解離（ＣＩＤ）法、ＩＲＭＰＤ法、インソース分解（ＩＳＤ）法、ポストソース分解（ＰＳＤ）法、表面誘起解離（ＳＩＤ）法、ＥＣＤ法、および、ＢＩＲＤ法のうち少なくとも一つの方法により上記対象タンパク質について断片化イオンに断片化し上記断片化スペクトルを測定するので、対象タンパク質の立体構造を反映した断片化イオンを効率的に作成することができるタンパク質構造解析方法、タンパク質構造解析装置、プログラム、および、記録媒体を提供することができる。 In addition, according to the present invention, the collision activity including the hexapole CID method, the nozzle skimmer CID method, the capillary skimmer CID method, the SORI-CID method, and the multipole store auxiliary capillary skimmer CID method for the target protein for predicting the three-dimensional structure. The above-mentioned object by at least one of chemical dissociation (CID) method, IRMPD method, in-source decomposition (ISD) method, post-source decomposition (PSD) method, surface-induced dissociation (SID) method, ECD method, and BIRD method Since the protein is fragmented into fragment ions and the fragmentation spectrum is measured, a protein structure analysis method, a protein structure analysis apparatus, a program, and a protein structure analysis method capable of efficiently creating fragment ions reflecting the three-dimensional structure of the target protein Can provide recording media .

また、本発明によれば、酵素反応により対象タンパク質について断片化を行い断片化スペクトルを測定するので、対象タンパク質の立体構造を反映した断片化を効率的に行うことができるタンパク質構造解析方法、タンパク質構造解析装置、プログラム、および、記録媒体を提供することができる。 In addition, according to the present invention, since the target protein is fragmented by an enzymatic reaction and the fragmentation spectrum is measured, the protein structure analysis method and protein that can efficiently perform fragmentation reflecting the three-dimensional structure of the target protein A structural analysis device, a program, and a recording medium can be provided.

また、本発明によれば、立体構造を予測する対象タンパク質について断片化し、断片化スペクトルを測定し、測定された断片化スペクトルに基づいて、対象タンパク質のアミノ酸配列に対する断片化イオンの帰属情報を決定し、測定された断片化スペクトル、および、決定された断片化イオンの帰属情報に従って、対象タンパク質のアミノ酸配列において溶媒に接触している箇所を特定し、当該箇所に従って対象タンパク質のアミノ酸配列における溶媒接触残基情報を決定し、対象タンパク質について立体構造を予測し、予測された立体構造予測データと決定された溶媒接触残基情報とを対応させて出力する（例えば、グラフィック表示、テーブルによる一覧表示など）ので、立体構造未知のタンパク質（これをコードする遺伝子を含む）の立体構造予測を行う場合、実測した立体構造情報を加味することにより、精度の高い予測を行うことができるタンパク質構造解析方法、タンパク質構造解析装置、プログラム、および、記録媒体を提供することができる。 Further, according to the present invention, the target protein for which the three-dimensional structure is predicted is fragmented, the fragmentation spectrum is measured, and the attribution information of the fragmented ions for the amino acid sequence of the target protein is determined based on the measured fragmentation spectrum. Then, in accordance with the measured fragmentation spectrum and the determined fragmentation ion attribution information, the portion in contact with the solvent in the amino acid sequence of the target protein is specified, and the solvent contact in the amino acid sequence of the target protein according to the location Residue information is determined, the three-dimensional structure of the target protein is predicted, and the predicted three-dimensional structure prediction data and the determined solvent contact residue information are output in correspondence with each other (for example, graphic display, list display by table, etc.) ) Because of the protein whose structure is unknown (including the gene that encodes it) When performing the body structure prediction, by adding the actually measured three-dimensional structure information, how protein structure analysis that can be performed with high accuracy prediction, protein structure analysis apparatus, a program, and can provide a recording medium.

また、本発明によれば、予測された立体構造予測データと、決定された溶媒接触残基情報とを比較して、溶媒接触残基情報により特定された溶媒接触残基に対応する予測構造部分について評価を行い予測構造評価情報を決定し、決定された予測構造評価情報を出力するので、計算機により予測された予測構造を生化学実験データに基づいて評価を行うことができるようになり、予測精度を著しく向上させることができるタンパク質構造解析方法、タンパク質構造解析装置、プログラム、および、記録媒体を提供することができる。 Further, according to the present invention, the predicted three-dimensional structure prediction data and the determined solvent contact residue information are compared, and the predicted structure portion corresponding to the solvent contact residue specified by the solvent contact residue information Since the predicted structure evaluation information is determined and the determined predicted structure evaluation information is output, the predicted structure predicted by the computer can be evaluated based on the biochemical experiment data. It is possible to provide a protein structure analysis method, a protein structure analysis apparatus, a program, and a recording medium that can significantly improve accuracy.

また、本発明によれば、対象タンパク質について水素−重水素交換反応を行った後、断片化し断片化スペクトルを測定し、測定された断片化スペクトル、および、決定された断片化イオンの帰属情報に従って、対象タンパク質のアミノ酸配列中の各アミノ酸残基における重水素化速度を決定し、当該重水素化速度に従って対象タンパク質のアミノ酸配列において溶媒に接触している箇所を特定し、当該箇所に従って対象タンパク質の上記アミノ酸配列における溶媒接触残基情報を決定するので、効率的に溶媒接触情報を求めることができるタンパク質構造解析方法、タンパク質構造解析装置、プログラム、および、記録媒体を提供することができる。 According to the present invention, after subjecting the target protein to hydrogen-deuterium exchange reaction, fragmentation and fragmentation spectrum are measured, and according to the measured fragmentation spectrum and the determined fragmentation ion attribution information. , Determine the deuteration rate at each amino acid residue in the amino acid sequence of the target protein, identify the location in contact with the solvent in the amino acid sequence of the target protein according to the deuteration rate, and Since the solvent contact residue information in the amino acid sequence is determined, it is possible to provide a protein structure analysis method, a protein structure analysis apparatus, a program, and a recording medium capable of efficiently obtaining solvent contact information.

また、本発明によれば、対象タンパク質について化学修飾を行った後、断片化し断片化スペクトルを測定し、測定された断片化スペクトル、および、決定された断片化イオンの帰属情報に従って、対象タンパク質のアミノ酸配列中の各アミノ酸残基における化学修飾部分を決定し、当該化学修飾部分に従って対象タンパク質のアミノ酸配列において溶媒に接触している箇所を特定し、当該箇所に従って対象タンパク質のアミノ酸配列における溶媒接触残基情報を決定するので、効率的に溶媒接触情報を求めることができるタンパク質構造解析方法、タンパク質構造解析装置、プログラム、および、記録媒体を提供することができる。 Further, according to the present invention, after subjecting the target protein to chemical modification, the fragmentation and fragmentation spectrum is measured, and according to the measured fragmentation spectrum and the determined fragmentation ion attribution information, Determine the chemically modified part at each amino acid residue in the amino acid sequence, identify the part in contact with the solvent in the amino acid sequence of the target protein according to the chemical modified part, Since the base information is determined, it is possible to provide a protein structure analysis method, a protein structure analysis apparatus, a program, and a recording medium that can efficiently obtain solvent contact information.

また、本発明によれば、立体構造を予測する対象タンパク質と化合物（例えば、タンパク質や低分子化合物や核酸など）との複合体について断片化し、断片化スペクトルを測定し、測定された断片化スペクトルに基づいて、対象タンパク質および／または化合物に対する断片化イオンの帰属情報を決定し、測定された断片化スペクトル、および、決定された断片化イオンの帰属情報に従って、対象タンパク質および／または化合物に関する相互作用界面情報を決定し、対象タンパク質および／または化合物について立体構造を予測し、予測された立体構造予測データと、決定された相互作用界面情報とを対応させて出力する（例えば、グラフィック表示、テーブルによる一覧表示など）ので、立体構造未知のタンパク質（これをコードする遺伝子を含む）の立体構造予測を行う場合、実測した立体構造情報を加味することにより、精度の高い予測を行うことができるタンパク質構造解析方法、タンパク質構造解析装置、プログラム、および、記録媒体を提供することができる。 In addition, according to the present invention, a complex of a target protein and a compound (for example, a protein, a low molecular compound, a nucleic acid, etc.) that predicts a three-dimensional structure is fragmented, a fragmentation spectrum is measured, and the measured fragmentation spectrum is measured. To determine the fragmentation ion attribution information for the protein and / or compound of interest, and to interact with the protein and / or compound according to the measured fragmentation spectrum and the determined fragmentation ion attribution information Interface information is determined, the three-dimensional structure is predicted for the target protein and / or compound, and the predicted three-dimensional structure prediction data and the determined interaction interface information are output in correspondence with each other (for example, by graphic display or by table) Proteins with unknown three-dimensional structure (registration that codes this) Provided protein structure analysis method, protein structure analysis apparatus, program, and recording medium capable of performing high-precision prediction by taking into account the actually measured three-dimensional structure information. can do.

また、本発明によれば、立体構造予測データを針金モデル、リボンモデル、パイプモデル、ボールアンドスティックモデル、または、空間充填モデルのうちいずれかのモデルにより対象タンパク質および／または化合物についてグラフィック表示（例えば、ドッキングシミュレーションなども含む）し、相互作用界面情報を対応する立体構造予測データの表示部分に関連付けて表示する（例えば、リンクを設定、モデル上に表示、モデル上に相互作用界面情報に対応する網掛けや斜線などの特定のパターン模様を表示など）ので、これらの情報を視覚的に分かり易く表示することができるようになり、利用者が立体構造予測データの信頼性を直感的に判断することができるタンパク質構造解析方法、タンパク質構造解析装置、プログラム、および、記録媒体を提供することができる。 Further, according to the present invention, the three-dimensional structure prediction data is graphically displayed on the target protein and / or compound by any one of a wire model, a ribbon model, a pipe model, a ball and stick model, or a space filling model (for example, , Including docking simulation) and displaying the interaction interface information in association with the display portion of the corresponding three-dimensional structure prediction data (for example, setting a link, displaying on the model, corresponding to the interaction interface information on the model) This makes it possible to display such information visually and easily, and the user intuitively determines the reliability of the three-dimensional structure prediction data. Protein structure analysis method, protein structure analysis apparatus, program, Beauty, it is possible to provide a recording medium.

また、本発明によれば、グラフィック表示やテーブル表示を行う際に、相互作用界面情報に従って対象タンパク質および／または化合物の対応する上記立体構造予測データの表示部分を色分けして表示するので、これらの情報を視覚的に分かり易く表示することができるようになり、利用者が立体構造予測データの信頼性を直感的に判断することができるタンパク質構造解析方法、タンパク質構造解析装置、プログラム、および、記録媒体を提供することができる。 Further, according to the present invention, when the graphic display or the table display is performed, the display portion of the three-dimensional structure prediction data corresponding to the target protein and / or compound is displayed in different colors according to the interaction interface information. Protein structure analysis method, protein structure analysis apparatus, program, and record that can display information visually and in an easy-to-understand manner, and allow users to intuitively determine the reliability of three-dimensional structure prediction data A medium can be provided.

また、本発明によれば、予測された立体構造予測データと、決定された相互作用界面情報とを比較して、相互作用界面情報により特定された相互作用界面に対応する予測構造部分について評価を行い予測構造評価情報を決定し、決定された予測構造評価情報を出力するので、計算機により予測された予測構造を生化学実験データに基づいて評価を行うことができるようになり、予測精度を著しく向上させることができるタンパク質構造解析方法、タンパク質構造解析装置、プログラム、および、記録媒体を提供することができる。 Further, according to the present invention, the predicted three-dimensional structure prediction data is compared with the determined interaction interface information, and the predicted structure portion corresponding to the interaction interface specified by the interaction interface information is evaluated. The predicted structure evaluation information is determined and the determined predicted structure evaluation information is output, so that the predicted structure predicted by the computer can be evaluated based on the biochemical experiment data, and the prediction accuracy is remarkably improved. A protein structure analysis method, a protein structure analysis apparatus, a program, and a recording medium that can be improved can be provided.

また、本発明によれば、対象タンパク質と化合物との複合体について水素−重水素交換反応を行った後、断片化し上記断片化スペクトルを測定し、測定された断片化スペクトル、および、決定された断片化イオンの帰属情報に従って、対象タンパク質および／または化合物における重水素化速度を決定し、当該重水素化速度に従って対象タンパク質および／または化合物において相互作用界面を特定し、当該相互作用界面に従って対象タンパク質および／または化合物における相互作用界面情報を決定するので、効率的に相互作用界面情報を求めることができるタンパク質構造解析方法、タンパク質構造解析装置、プログラム、および、記録媒体を提供することができる。 Further, according to the present invention, the complex of the target protein and the compound was subjected to a hydrogen-deuterium exchange reaction, and then fragmented to measure the fragmentation spectrum, and the measured fragmentation spectrum was determined. Determining the deuteration rate of the target protein and / or compound according to the fragmentation ion assignment information, identifying the interaction interface in the target protein and / or compound according to the deuteration rate, and determining the target protein according to the interaction interface Since interaction interface information in a compound and / or compound is determined, a protein structure analysis method, a protein structure analysis apparatus, a program, and a recording medium capable of efficiently obtaining interaction interface information can be provided.

また、本発明によれば、対象タンパク質と化合物との複合体について化学修飾を行った後、断片化し断片化スペクトルを測定し、測定された断片化スペクトル、および、決定された断片化イオンの帰属情報に従って、対象タンパク質および／または化合物における化学修飾部分を決定し、当該化学修飾部分に従って対象タンパク質および／または化合物において相互作用界面を特定し、当該相互作用界面に従って対象タンパク質および／または化合物における相互作用界面情報を決定するので、効率的に相互作用界面情報を求めることができるタンパク質構造解析方法、タンパク質構造解析装置、プログラム、および、記録媒体を提供することができる。 Further, according to the present invention, after the chemical modification of the complex of the target protein and the compound, the fragmentation fragmentation spectrum is measured, and the measured fragmentation spectrum and the assignment of the determined fragmentation ion are determined. According to the information, a chemical modification moiety in the target protein and / or compound is determined, an interaction interface in the target protein and / or compound is identified according to the chemical modification moiety, and an interaction in the target protein and / or compound is determined according to the interaction interface Since the interface information is determined, it is possible to provide a protein structure analysis method, a protein structure analysis apparatus, a program, and a recording medium that can efficiently obtain interaction interface information.

さらに、本発明によれば、酵素反応により対象タンパク質と化合物の複合体について断片化を行い断片化スペクトルを測定するので、対象タンパク質の立体構造を反映した断片化を効率的に行うことができるタンパク質構造解析方法、タンパク質構造解析装置、プログラム、および、記録媒体を提供することができる。 Furthermore, according to the present invention, since the complex of the target protein and the compound is fragmented by an enzyme reaction and the fragmentation spectrum is measured, the protein capable of efficiently performing the fragmentation reflecting the three-dimensional structure of the target protein A structure analysis method, a protein structure analysis apparatus, a program, and a recording medium can be provided.

以上のように、本発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、プログラム、および、記録媒体は、計算手法により求めたタンパク質の立体構造データを生化学的な実験手法により求めたデータにより補完することができる。 As described above, the protein structure analysis method, the protein structure analysis apparatus, the program, and the recording medium according to the present invention supplement the protein three-dimensional structure data obtained by the calculation method with the data obtained by the biochemical experimental method. can do.

これにより、本発明にかかるタンパク質構造解析方法、タンパク質構造解析装置、プログラム、および、記録媒体は、タンパク質の質量分析や立体構造などの解析を行うバイオインフォマティクス分野において極めて有用である。 As a result, the protein structure analysis method, protein structure analysis apparatus, program, and recording medium according to the present invention are extremely useful in the bioinformatics field for performing analysis of protein mass analysis and three-dimensional structure.

本発明は、産業上多くの分野、特に医薬品、食品、化粧品、医療、遺伝子発現解析等の分野で広く実施することができ、極めて有用である。 The present invention can be widely implemented in many industrial fields, particularly in fields such as pharmaceuticals, foods, cosmetics, medicine, and gene expression analysis, and is extremely useful.

本発明の基本原理を示す原理構成図である。It is a principle block diagram which shows the basic principle of this invention. 本発明が適用される本システムの構成の一例を示すブロック図である。It is a block diagram which shows an example of a structure of this system with which this invention is applied. 本実施形態における本システムのメイン処理の一例を示すフローチャートである。It is a flowchart which shows an example of the main process of this system in this embodiment. 本実施形態におけるヘキサポールＣＩＤ法により断片化スペクトルを測定する場合の一例を示すフローチャートである。It is a flowchart which shows an example in the case of measuring a fragmentation spectrum by the hexapole CID method in this embodiment. 本実施形態におけるＩＲＭＰＤ法により断片化スペクトルを測定する場合の一例を示すフローチャートである。It is a flowchart which shows an example in the case of measuring a fragmentation spectrum by the IRMPD method in this embodiment. ヒトのインターロイキン−６（ＩＬ−６）について断片化イオンの帰属情報を求め切断容易領域情報を決定する際の概念図である。It is a conceptual diagram at the time of obtaining | requiring the fragmentation ion attribution information about human interleukin-6 (IL-6), and determining easy-cleavage area | region information. 本実施形態における水素−重水素交換法により断片化スペクトルを測定する場合の一例を示すフローチャートである。It is a flowchart which shows an example in the case of measuring a fragmentation spectrum by the hydrogen-deuterium exchange method in this embodiment. 本実施形態における水素−重水素交換法によりタンパク質をそのままＭＳで断片化し断片化スペクトルを測定する場合の一例を示すフローチャートである。It is a flowchart which shows an example in the case of fragmenting protein by MS as it is by the hydrogen-deuterium exchange method in this embodiment, and measuring a fragmentation spectrum. 本実施形態における水素−重水素交換法によりタンパク質の複合体を断片化し断片化スペクトルを測定する場合の一例を示すフローチャートである。It is a flowchart which shows an example in the case of fragmenting a protein complex by the hydrogen-deuterium exchange method in this embodiment, and measuring a fragmentation spectrum. 本実施形態における水素−重水素交換法によりタンパク質ＡとＢの複合体を断片化し断片化スペクトルを測定する場合の一例を示すフローチャートである。It is a flowchart which shows an example in the case of fragmenting the complex of protein A and B by the hydrogen-deuterium exchange method in this embodiment, and measuring a fragmentation spectrum. 本実施例において断片化スペクトルを解析する場合の一例を示すフローチャートである。It is a flowchart which shows an example in the case of analyzing a fragmentation spectrum in a present Example. ヒトのインターロイキン−６（ＩＬ−６）について、立体構造予測データをリボンモデルによりグラフィック表示し、切断容易領域情報に従って対応する上記立体構造予測データの表示部分を色分けして表示する場合の表示画面の一例を示す図である。Display screen for human interleukin-6 (IL-6), in which three-dimensional structure prediction data is displayed graphically by a ribbon model, and the corresponding three-dimensional structure prediction data is displayed in different colors according to easy-to-cut region information It is a figure which shows an example. 本発明における本システムの実施例１の一例を示すフローチャートである。It is a flowchart which shows an example of Example 1 of this system in this invention. 本発明における本システムの実施例２の一例を示すフローチャートである。It is a flowchart which shows an example of Example 2 of this system in this invention. 本発明における本システムの実施例３の一例を示すフローチャートである。It is a flowchart which shows an example of Example 3 of this system in this invention. 本発明における本システムの実施例４の一例を示すフローチャートである。It is a flowchart which shows an example of Example 4 of this system in this invention. 処理結果についてテーブルによる一覧表示を行う場合の表示画面の一例を示す図である。It is a figure which shows an example of the display screen in the case of performing list display by a table about a processing result. 実施例５における処理結果の一例を示す図である。It is a figure which shows an example of the process result in Example 5. FIG. ヒトのインターロイキン−６（ＩＬ−６）と、抗ＩＬ−６抗体との複合体（ｃｏｍｐｌｅｘ）について、水素−重水素交換反応を行い、エレクトロスプレーイオン化法により断片化イオンに断片化し、断片化スペクトルを測定する際の概念図である。A complex (complex) of human interleukin-6 (IL-6) and anti-IL-6 antibody is subjected to hydrogen-deuterium exchange reaction, fragmented into fragmented ions by electrospray ionization, and fragmented It is a conceptual diagram at the time of measuring a spectrum. 水素−重水素交換反応を行ったヒトのＩＬ−６の断片化スペクトルから重水素化速度を求める際の概念図である。It is a conceptual diagram at the time of calculating | requiring a deuteration rate from the fragmentation spectrum of human IL-6 which performed the hydrogen-deuterium exchange reaction. 本発明における本システムの実施例８の一例を示すフローチャートである。It is a flowchart which shows an example of Example 8 of this system in this invention. 本発明における本システムの実施例８のＤＮＡマイクロアレイによる発現量の違いを色別に表示した表示画面の一例を示す図である。It is a figure which shows an example of the display screen which displayed the difference in the expression level by the DNA microarray of Example 8 of this system in this invention according to the color. 本発明における本システムの実施例８の利用者が選択した遺伝子情報を表示する画面である。It is a screen which displays the gene information which the user of Example 8 of this system in this invention selected. 本発明における本システムの実施例８のアライメントの表示画面である。It is a display screen of alignment of Example 8 of this system in the present invention. 本発明における本システムの実施例８のグラフィックソフトウェアによる立体構造の表示画面である。It is a display screen of the three-dimensional structure by the graphic software of Example 8 of this system in the present invention. 本発明における本システムの実施例６におけるウシ由来ユビキチンの立体構造の一例を示す図である。It is a figure which shows an example of the three-dimensional structure of the bovine origin ubiquitin in Example 6 of this system in this invention. 本発明における本システムの実施例７における酵母由来アルコール脱水素酵素の立体構造の一例を示す図である。It is a figure which shows an example of the three-dimensional structure of the yeast origin alcohol dehydrogenase in Example 7 of this system in this invention.

Explanation of symbols

１００タンパク質構造解析装置
１０２制御部
１０２ａ断片化スペクトルデータ取得部
１０２ｂ帰属情報決定部
１０２ｃ切断容易領域情報決定部
１０２ｄ溶媒接触残基情報決定部
１０２ｅ相互作用界面情報決定部
１０２ｆ立体構造予測処理部
１０２ｇ予測構造評価情報決定部
１０２ｈ処理結果出力部
１０２ｉ予測構造評価情報出力部
１０２ｊ重水素化速度決定部
１０２ｋ基準振動計算処理部
１０４通信制御インターフェース部
１０６記憶部
１０６ａ断片化スペクトルデータファイル
１０６ｂ切断容易領域情報ファイル
１０６ｃ溶媒接触残基情報／相互作用界面情報ファイル
１０６ｄ立体構造予測データファイル
１０６ｅ予測構造評価情報ファイル
１０６ｆ処理結果データファイル
１０８入出力制御インターフェース部
１１２入力装置
１１４出力装置
２００外部システム
３００ネットワーク
４００ＭＳ 100 Protein structure analyzer
102 Control unit
102a Fragmentation spectrum data acquisition unit
102b Attribution information determination unit
102c Easy-to-cut area information determination unit
102d Solvent contact residue information determination unit
102e Interaction interface information determination unit
102f Three-dimensional structure prediction processing unit
102g Predictive structure evaluation information determination unit
102h Processing result output part
102i Predictive structure evaluation information output unit
102j Deuteration rate determination unit
102k reference vibration calculation processing unit
104 Communication control interface unit
106 Storage unit
106a Fragmentation spectrum data file
106b Easy cutting area information file
106c Solvent contact residue information / interaction interface information file
106d 3D structure prediction data file
106e Predictive structure evaluation information file
106f Processing result data file
108 Input / output control interface
112 Input device
114 Output device 200 External system 300 Network 400 MS

Claims

Fragmentation spectrum measurement step of measuring a fragmentation spectrum by fragmenting a target protein that predicts a three-dimensional structure;
Based on the fragmentation spectrum measured by the fragmentation spectrum measurement step, an attribution information determination step for determining the attribution information of the fragmented ions for the amino acid sequence of the target protein,
According to the fragmentation spectrum measured by the fragmentation spectrum measurement step and the attribution information of the fragmentation ion determined by the attribution information determination step, the amino acid sequence of the target protein is in contact with a solvent. A solvent contact residue information determination step for identifying a location and determining solvent contact residue information in the amino acid sequence of the target protein according to the location;
A three-dimensional structure prediction step for predicting a three-dimensional structure of the target protein;
A reference vibration calculation step for performing a reference vibration calculation for the target protein;
The three-dimensional structure prediction data predicted in the three-dimensional structure prediction step, the solvent contact residue information determined in the solvent contact residue information determination step, and the calculation result in the reference vibration calculation step are output in association with each other. Processing result output step,
A protein structure analysis method comprising:

Fragmentation spectrum measurement step of fragmenting a complex of a target protein and a compound that predicts a three-dimensional structure and measuring a fragmentation spectrum;
An attribution information determination step for determining attribution information of fragmented ions for the target protein and / or the compound based on the fragmentation spectrum measured by the fragmentation spectrum measurement step;
According to the fragmentation spectrum measured by the fragmentation spectrum measurement step and the attribution information of the fragmentation ion determined by the attribution information determination step, interaction interface information regarding the target protein and / or the compound is obtained. An interaction interface information determination step to determine;
A three-dimensional structure prediction step of predicting a three-dimensional structure of the target protein and / or the compound;
A reference vibration calculation step for performing a reference vibration calculation for the target protein and / or the compound;
Processing for outputting the three-dimensional structure prediction data predicted in the three-dimensional structure prediction step, the interaction interface information determined in the interaction interface information determination step, and the calculation result in the reference vibration calculation step in association with each other A result output step;
A protein structure analysis method comprising:

Fragmentation spectrum data acquisition means for obtaining a fragmentation spectrum data from a mass spectrometer that has fragmented a target protein for predicting a three-dimensional structure and has measured the fragmentation spectrum;
Based on the fragmentation spectrum data acquired by the fragmentation spectrum data acquisition means, attribution information determination means for determining attribution information of fragmented ions for the amino acid sequence of the target protein;
According to the fragmentation spectrum measured by the fragmentation spectrum measurement means and the attribution information of the fragmentation ions determined by the attribution information determination means, the amino acid sequence of the target protein is in contact with a solvent. A solvent contact residue information determining means for identifying a location and determining solvent contact residue information in the amino acid sequence of the target protein according to the location;
A three-dimensional structure prediction means for predicting a three-dimensional structure of the target protein;
A reference vibration calculation means for performing a reference vibration calculation for the target protein;
The three-dimensional structure prediction data predicted by the three-dimensional structure prediction means, the solvent contact residue information determined by the solvent contact residue information determination means, and the calculation result by the reference vibration calculation means are output in association with each other. Processing result output means,
A protein structure analyzing apparatus comprising:

Fragmentation spectrum data acquisition means for fragmenting a complex of a target protein and a compound that predicts a three-dimensional structure and acquiring fragmentation spectrum data from a mass spectrometer that measures the fragmentation spectrum;
Attribution information determination means for determining the attribution information of fragmented ions for the target protein and / or the compound based on the fragmentation spectrum data acquired by the fragmentation spectrum data acquisition means,
In accordance with the fragmentation spectrum measured by the fragmentation spectrum measurement means and the attribution information of the fragmentation ions determined by the attribution information determination means, interaction interface information regarding the target protein and / or the compound is obtained. An interaction interface information determining means for determining;
A three-dimensional structure prediction means for predicting a three-dimensional structure of the target protein and / or the compound;
A reference vibration calculation means for performing a reference vibration calculation for the target protein and / or the compound;
Processing for outputting the three-dimensional structure prediction data predicted by the three-dimensional structure prediction means, the interaction interface information determined by the interaction interface information determination means, and the calculation result by the reference vibration calculation means in association with each other A result output means;
A protein structure analyzing apparatus comprising:

Fragmentation spectrum data acquisition step of obtaining a fragmentation spectrum data from a mass spectrometer that has fragmented a target protein that predicts a three-dimensional structure and has measured the fragmentation spectrum;
Based on the fragmentation spectrum data acquired by the fragmentation spectrum data acquisition step, an attribution information determination step for determining the attribution information of fragmentation ions for the amino acid sequence of the target protein;
According to the fragmentation spectrum measured by the fragmentation spectrum measurement step and the attribution information of the fragmentation ion determined by the attribution information determination step, the amino acid sequence of the target protein is in contact with a solvent. A solvent contact residue information determination step for identifying a location and determining solvent contact residue information in the amino acid sequence of the target protein according to the location;
A three-dimensional structure prediction step for predicting a three-dimensional structure of the target protein;
A reference vibration calculation step for performing a reference vibration calculation for the target protein;
The three-dimensional structure prediction data predicted in the three-dimensional structure prediction step, the solvent contact residue information determined in the solvent contact residue information determination step, and the calculation result in the reference vibration calculation step are output in association with each other. Processing result output step,
A computer program for causing a computer to execute a protein structure analysis method including:

Fragmentation spectrum data acquisition step of obtaining a fragmentation spectrum data from a mass spectrometer that has fragmented a complex of a target protein and a compound that predicts a three-dimensional structure and measured the fragmentation spectrum;
Based on the fragmentation spectrum data acquired by the fragmentation spectrum data acquisition step, an attribution information determination step for determining attribution information of fragmentation ions for the target protein and / or the compound,
According to the fragmentation spectrum measured by the fragmentation spectrum measurement step and the attribution information of the fragmentation ion determined by the attribution information determination step, interaction interface information regarding the target protein and / or the compound is obtained. An interaction interface information determination step to determine;
A three-dimensional structure prediction step of predicting a three-dimensional structure of the target protein and / or the compound;
A reference vibration calculation step for performing a reference vibration calculation for the target protein and / or the compound;
Processing for outputting the three-dimensional structure prediction data predicted in the three-dimensional structure prediction step, the interaction interface information determined in the interaction interface information determination step, and the calculation result in the reference vibration calculation step in association with each other A result output step;
A computer program for causing a computer to execute a protein structure analysis method including: