JP7168979B2

JP7168979B2 - 3D structure determination device, 3D structure determination method, 3D structure discriminator learning device, 3D structure discriminator learning method and program

Info

Publication number: JP7168979B2
Application number: JP2019015086A
Authority: JP
Inventors: 政和関嶋; 洸依田; 信明安尾
Original assignee: Tokyo Institute of Technology NUC
Current assignee: Tokyo Institute of Technology NUC
Priority date: 2019-01-31
Filing date: 2019-01-31
Publication date: 2022-11-10
Anticipated expiration: 2039-01-31
Also published as: WO2020158609A1; JP2020123189A

Description

本発明は、立体構造判定装置、立体構造判定方法、立体構造の判別器学習装置、立体構造の判別器学習方法及びプログラムに関する。 The present invention relates to a three-dimensional structure determination device, a three-dimensional structure determination method, a three-dimensional structure classifier learning device, a three-dimensional structure classifier learning method, and a program.

創薬の初期段階においては、薬剤の標的となる蛋白質に結合する化合物、ペプチド、蛋白質、核酸等のリガンドを発見する必要がある。このようなリガンドを発見するスクリーニング手法の１つとして、蛋白質とリガンドのドッキングシミュレーションがある。そして、ドッキングシミュレーションの精度を向上させる技術の開発も進められている。例えば、特許文献１には、従来のドッキングシミュレーションよりも高精度なスクリーニングを実現するドッキングスコアリング方法が開示されている。 In the initial stage of drug discovery, it is necessary to discover ligands such as compounds, peptides, proteins, nucleic acids, etc. that bind to target proteins of drugs. One of screening methods for discovering such ligands is protein-ligand docking simulation. Also, the development of technology for improving the accuracy of docking simulation is underway. For example, Patent Literature 1 discloses a docking scoring method that realizes more accurate screening than conventional docking simulation.

特開２００５－１８１１０４号公報JP 2005-181104 A

特許文献１には、蛋白質の立体構造を基に結合部分の電子状態の計算を行った後に、化学シフト値の解析を行い、化学シフト値から結合残基の決定および結合強さの比較を行うことにより高精度なスクリーニングを実現できることが記載されている。しかし、スクリーニングの対象となるリガンドの数が膨大であることを考えると、特許文献１や従来のドッキングシミュレーションの精度はまだ不十分である。 In Patent Document 1, after calculating the electronic state of the binding portion based on the three-dimensional structure of the protein, the chemical shift value is analyzed, the binding residue is determined from the chemical shift value, and the binding strength is compared. It is described that high-precision screening can be realized by this. However, considering that the number of ligands to be screened is enormous, the accuracy of Patent Document 1 and conventional docking simulations is still insufficient.

本発明は、上記問題を解決するためになされたものであり、蛋白質・リガンド結合等の立体構造の判定精度を従来よりも格段に向上させることのできる立体構造判定装置、立体構造判定方法、立体構造の判別器学習装置、立体構造の判別器学習方法及びプログラムを提供することを目的とする。 DISCLOSURE OF THE INVENTION The present invention has been made in order to solve the above-mentioned problems, and is capable of significantly improving the accuracy of determination of three-dimensional structures such as protein-ligand binding. It is an object of the present invention to provide a structural classifier learning device, a three-dimensional structure classifier learning method, and a program.

上記目的を達成するため、本発明に係る立体構造判定装置は、
第１の物質と第２の物質の結合の立体構造を前記立体構造の周囲からの複数の視点で２次元平面に投影して得られる複数枚の画像を含む画像セットを生成する画像生成手段と、
前記画像セットに含まれる１枚の画像を入力すると、前記１枚の画像に投影されている前記立体構造に係る前記結合の所定の性質の有無を判別する判別器と、
前記判別器に前記画像セットに含まれる複数枚の画像のそれぞれを入力して得られる値を、前記画像セットに含まれる全ての画像について集計した値に基づいて、前記画像セットに含まれる画像に投影されている前記立体構造に係る前記結合の前記性質の有無を判定する総合判定手段と、
を備える。 In order to achieve the above object, the three-dimensional structure determination device according to the present invention includes:
image generating means for generating an image set including a plurality of images obtained by projecting a three-dimensional structure of a bond of a first substance and a second substance onto a two-dimensional plane from a plurality of viewpoints around the three-dimensional structure; ,
a discriminator that, when inputting one image included in the image set, discriminates whether or not the bond associated with the three-dimensional structure projected on the one image has a predetermined property;
A value obtained by inputting each of a plurality of images included in the image set to the discriminator, based on a total value for all images included in the image set, to an image included in the image set Comprehensive judgment means for judging whether or not the bond associated with the projected three-dimensional structure has the property;
Prepare.

本発明によれば、蛋白質・リガンド結合等の立体構造の判定精度を従来よりも格段に向上させることができる。 According to the present invention, the determination accuracy of three-dimensional structures such as protein-ligand binding can be remarkably improved as compared with conventional methods.

実施形態１に係る判定装置の機能構成を示す図である。2 is a diagram showing a functional configuration of a determination device according to Embodiment 1; FIG. 蛋白質とリガンドとの結合（ドッキング）を説明する図である。FIG. 2 is a diagram explaining binding (docking) between a protein and a ligand. 実施形態１に係る画像生成部が蛋白質とリガンドのドッキング構造の画像を全周網羅的に生成する様子を説明する図である。FIG. 4 is a diagram for explaining how the image generator according to Embodiment 1 comprehensively generates an image of the docking structure of the protein and the ligand. 畳み込みニューラルネットワーク（ＣＮＮ）の処理概要を説明する図である。It is a figure explaining the processing outline of a convolutional neural network (CNN). 実施形態１に係る学習処理のフローチャートである。4 is a flowchart of learning processing according to the first embodiment; 実施形態１に係る画像生成処理のフローチャートである。4 is a flowchart of image generation processing according to the first embodiment; 実施形態１に係る判定処理のフローチャートである。4 is a flowchart of determination processing according to the first embodiment; ３Ｄａｖｅｒａｇｅｐｏｏｌｉｎｇの処理概要を説明する図である。It is a figure explaining the process outline|summary of 3D average pooling. 変形例１に係る画像生成部が蛋白質とリガンドのドッキング構造の画像を生成する様子を説明する図である。FIG. 10 is a diagram for explaining how an image generation unit according to Modification 1 generates an image of a docking structure of a protein and a ligand; 変形例１に係る画像生成処理のフローチャートである。9 is a flowchart of image generation processing according to Modification 1;

以下、本発明の実施形態に係る蛋白質・リガンド結合判定装置、蛋白質・リガンド結合の判別器学習装置等について、図表を参照して説明する。なお、図中同一又は相当部分には同一符号を付す。 Hereinafter, a protein/ligand binding determination device, a protein/ligand binding discriminator learning device, and the like according to embodiments of the present invention will be described with reference to the drawings. The same reference numerals are given to the same or corresponding parts in the drawings.

（実施形態１）
実施形態１に係る判定装置１００は、蛋白質・リガンド結合の判別器学習装置としては、活性が既知の蛋白質・リガンド結合のドッキング構造の画像を大量に用いて、判別器を学習させて学習モデルを取得する。そして、判定装置１００は、蛋白質・リガンド結合判定装置としては、学習済みの判別器（学習モデル）に、活性が未知の蛋白質・リガンド結合のドッキング構造の画像を入力することにより、当該活性が未知の蛋白質・リガンド結合の活性の有無を判定する。このような判定装置１００について、以下に説明する。 (Embodiment 1)
The determination device 100 according to the first embodiment uses a large number of images of docking structures for protein/ligand binding with known activity as a discriminator learning device for protein/ligand binding, and trains a discriminator to create a learning model. get. As a protein/ligand binding determination device, the determination device 100 inputs an image of a docking structure of a protein/ligand binding with an unknown activity to a discriminator (learning model) that has been trained so that the activity is unknown. determine the presence or absence of protein-ligand binding activity. Such a determination device 100 will be described below.

実施形態１に係る判定装置１００は、図１に示すように、制御部１０、記憶部２０、出力部３１、通信部３２、操作入力部３３、を備える。 The determination device 100 according to the first embodiment includes a control unit 10, a storage unit 20, an output unit 31, a communication unit 32, and an operation input unit 33, as shown in FIG.

制御部１０は、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）等で構成され、記憶部２０に記憶されたプログラムを実行することにより、後述する各部（画像生成部１１、活性取得部１２、判別器１３、判別器学習部１４、総合判定部１５）の機能を実現する。 The control unit 10 is composed of a CPU (Central Processing Unit) or the like, and by executing a program stored in the storage unit 20, each unit described later (image generation unit 11, activity acquisition unit 12, discriminator 13, discriminator It implements the functions of the learning unit 14 and the comprehensive determination unit 15).

記憶部２０は、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）等で構成され、制御部１０のＣＰＵが実行するプログラムや必要なデータを記憶する。また、記憶部２０は、蛋白質・リガンド結合の活性の有無が記録された活性ＤＢ（Ｄａｔａｂａｓｅ）を記憶していてもよい。 The storage unit 20 includes a ROM (Read Only Memory), a RAM (Random Access Memory), etc., and stores programs executed by the CPU of the control unit 10 and necessary data. The storage unit 20 may also store an activity DB (database) in which the presence or absence of protein-ligand binding activity is recorded.

出力部３１は、蛋白質・リガンド結合の判定結果等を出力するためのデバイスである。例えば、出力部３１は、液晶ディスプレイや有機ＥＬ（Ｅｌｅｃｔｏｒｏ－Ｌｕｍｉｎｅｓｃｅｎｃｅ）ディスプレイである。ただし、判定装置１００は、出力部３１としてこれらディスプレイを備えてもよいし、外部のディスプレイを接続するためのインタフェースとしての出力部３１を備えてもよい。判定装置１００は、インタフェースとしての出力部３１を備える場合は、出力部３１を介して接続した外部のディスプレイに判定結果等を表示する。 The output unit 31 is a device for outputting determination results of protein/ligand binding. For example, the output unit 31 is a liquid crystal display or an organic EL (Electro-Luminescence) display. However, the determination device 100 may include these displays as the output unit 31, or may include the output unit 31 as an interface for connecting an external display. If the determination device 100 includes the output unit 31 as an interface, it displays determination results and the like on an external display connected via the output unit 31 .

通信部３２は、外部の他の装置（例えば、蛋白質・リガンド結合の活性の有無が記録された活性ＤＢが格納されているサーバ等）とデータの送受信を行うためのデバイス（ネットワークインタフェース等）である。判定装置１００は、通信部３２を介して様々なデータを取得することができる。 The communication unit 32 is a device (network interface, etc.) for transmitting/receiving data to/from other external devices (e.g., a server storing an activity DB that records the presence or absence of protein/ligand binding activity). be. The determination device 100 can acquire various data via the communication unit 32 .

操作入力部３３は、判定装置１００に対するユーザの操作入力を受け付けるデバイスであり、例えば、キーボード、マウス、タッチパネル等である。判定装置１００は、操作入力部３３を介して、ユーザからの指示等を受け付ける。 The operation input unit 33 is a device that receives a user's operation input to the determination device 100, and is, for example, a keyboard, mouse, touch panel, or the like. The determination device 100 receives an instruction or the like from the user via the operation input unit 33 .

次に、制御部１０の機能について説明する。制御部１０は、記憶部２０に記憶されているプログラムを実行することにより、画像生成部１１、活性取得部１２、判別器１３、判別器学習部１４、総合判定部１５の機能を実現する。 Next, functions of the control unit 10 will be described. The control unit 10 implements the functions of the image generation unit 11 , the activity acquisition unit 12 , the discriminator 13 , the discriminator learning unit 14 , and the comprehensive determination unit 15 by executing the programs stored in the storage unit 20 .

画像生成部１１は、蛋白質の種類とリガンドの種類とが与えられると、ドッキングシミュレーションにより、与えられた蛋白質及びリガンドの立体的なドッキング構造を得て、ドッキング構造をその周囲の様々な角度からの視点で２次元平面に投影した画像を生成する。ドッキングシミュレーションでは、図２に示すように、与えられた蛋白質２１１とリガンド２１２とから、これらが結合した立体構造（ドッキング構造２１３）が得られる。そして、画像生成部１１は、図３に示すように、このドッキング構造２１３を周囲のカメラ３１１から様々な角度（θ、φ）で全周（３６０°）網羅的に撮影したかのような画像を生成する。実際には、画像生成部１１は、カメラ３１１で撮影するのではなく、ドッキング構造から３Ｄイメージを作成し、その３Ｄイメージを様々な方向からの視点で２次元平面に投影して得られる画像を生成する。 When the type of protein and the type of ligand are given, the image generator 11 obtains a three-dimensional docked structure of the given protein and ligand by docking simulation, and visualizes the docked structure from various angles around it. Generate an image projected onto a two-dimensional plane at the viewpoint. In the docking simulation, as shown in FIG. 2, given a protein 211 and a ligand 212, a three-dimensional structure (docking structure 213) in which they are bound is obtained. Then, as shown in FIG. 3, the image generation unit 11 generates an image as if the docking structure 213 was comprehensively photographed from the surrounding camera 311 at various angles (θ, φ) all around (360°). to generate Actually, the image generation unit 11 creates a 3D image from the docking structure instead of taking an image with the camera 311, and projects the 3D image onto a two-dimensional plane from viewpoints from various directions. Generate.

具体的には、ドッキングシミュレーションでドッキング構造を得るソフトウェアとしては、例えばＧｌｉｄｅを用いることができる。また、ドッキング構造から３Ｄイメージを作成するソフトウェアとしては、例えばＰｙＭＯＬを用いることができる。ただし、これらのソフトウェアは一例に過ぎず、ドッキング構造の周囲から様々な角度の視点で２次元平面に投影して得られる画像を生成できるのであれば、任意のソフトウェアを用いることができる。なお、ドッキングにおいて水素結合が重要と考えられることから、画像生成部１１は、水素結合を強調表示した画像を生成してもよい。また、画像生成部１１は、水素結合に限らず、判定装置１００が判定する結合の性質に重要な役割を果たすと考えられる結合等を強調表示した画像を生成してもよい。画像生成部１１は、画像生成手段として機能する。 Specifically, Glide, for example, can be used as software for obtaining a docking structure by docking simulation. Also, PyMOL, for example, can be used as software for creating a 3D image from the docking structure. However, these software are only examples, and any software can be used as long as it can generate an image obtained by projecting onto a two-dimensional plane from viewpoints at various angles around the docking structure. Since hydrogen bonds are considered important in docking, the image generator 11 may generate an image in which hydrogen bonds are highlighted. In addition, the image generation unit 11 may generate an image in which not only hydrogen bonds but also bonds that are considered to play an important role in the properties of bonds determined by the determination device 100 are highlighted. The image generator 11 functions as an image generator.

活性取得部１２は、蛋白質の種類とリガンドの種類とが与えられると、蛋白質・リガンド結合の活性の有無が記録された活性ＤＢ（Ｄａｔａｂａｓｅ）を参照して、与えられた蛋白質とリガンドの結合の活性の有無を取得する。このような活性ＤＢとしては、例えば、ＤＵＤ－Ｅを用いることができる。活性取得部１２は、性質取得手段として機能する。 When the type of protein and the type of ligand are given, the activity acquiring unit 12 refers to an activity DB (Database) in which the presence or absence of protein-ligand binding activity is recorded, and determines the binding of the given protein and ligand. Gets the presence or absence of activity. DUD-E, for example, can be used as such an active DB. The activity acquisition unit 12 functions as property acquisition means.

判別器１３は、蛋白質・リガンド結合の画像が与えられるとその活性の有無を出力する、畳み込みニューラルネットワーク（ＣｏｎｖｏｌｕｔｉｏｎａｌＮｅｕｒａｌＮｅｔｗｏｒｋ：ＣＮＮ）による判別器である。ＣＮＮによる判別器を実現するプログラムを制御部１０が実行することにより、制御部１０は判別器１３として機能する。判別器１３は、図４に示すように、入力層１１１に与えられた入力画像に、畳み込み処理（畳み込みフィルタ１２１，１２３の走査）やプーリング処理（プーリングウィンドウ１２２，１２４の走査）を行って徐々にサイズの小さな特徴マップ１１２，１１３，１１４，１１５，１１６を算出していき、特徴マップ１１６から全結合接続１２５を経て、最終的に出力層１１９から、判別結果を示す２次元ベクトルを得る。なお、特徴マップ１１６は、出力層１１９からの出力（判別結果）に直接影響を与える１次元ベクトルなので、ここでは判別ベクトルと呼ぶことにする。 The discriminator 13 is a discriminator based on a convolutional neural network (CNN) that outputs the presence or absence of activity when an image of protein/ligand binding is given. The control unit 10 functions as a discriminator 13 by executing a program that realizes a discriminator by CNN. As shown in FIG. 4, the discriminator 13 performs convolution processing (scanning of the convolution filters 121 and 123) and pooling processing (scanning of the pooling windows 122 and 124) on the input image given to the input layer 111, and gradually Small-sized feature maps 112, 113, 114, 115, and 116 are calculated in increments, and a two-dimensional vector indicating the discrimination result is finally obtained from the output layer 119 via the fully-connected connection 125 from the feature map 116. FIG. Since the feature map 116 is a one-dimensional vector that directly affects the output (discrimination result) from the output layer 119, it will be called a discrimination vector here.

判別器学習部１４は、蛋白質・リガンド結合の画像とその蛋白質・リガンド結合の活性の有無とからなる教師データを大量に用いて、判別器１３を学習させる。ただし、通常、活性ＤＢに記録されている蛋白質・リガンド結合のサンプルの活性の有無は著しく不均衡であり、活性が無いものの方が、活性があるものに比べて格段に多い。このため、普通に学習させると、真の活性の有無にかかわらず、全て「活性無し」と判別する判別器１３になってしまう可能性がある。そこで、判別器学習部１４は、活性有りの蛋白質・リガンド結合を活性無しと判別した場合のペナルティが大きくなるような、重み付きの誤差関数を用いて判別器１３を学習させる。これにより、判別器学習部１４は、活性の有無の不均衡の影響をできるだけ受けずに判別器１３を学習させることができる。判別器学習部１４は、学習手段として機能する。 The discriminator learning unit 14 uses a large amount of teacher data consisting of protein/ligand binding images and the presence or absence of protein/ligand binding activity to train the discriminator 13 . However, the presence or absence of activity in the protein/ligand binding samples recorded in the activity DB is usually remarkably imbalanced. For this reason, if learned normally, there is a possibility that the discriminator 13 will all discriminate "no activity" regardless of the presence or absence of true activity. Therefore, the discriminator learning unit 14 trains the discriminator 13 using a weighted error function that increases the penalty when an active protein/ligand bond is discriminated as non-active. As a result, the discriminator learning unit 14 can learn the discriminator 13 with as little influence as possible from the imbalance between the presence and absence of activity. The classifier learning unit 14 functions as learning means.

具体的には、判別器学習部１４は、以下の式（１）で表されるような、重み係数Ｗ_posを用いた重み付きクロスエントロピー関数で算出される誤差Ｅを誤差逆伝播させることにより、判別器１３を学習させる。

Ｗ_pos＝（活性無しサンプルの個数）／（活性有りサンプルの個数）
ｎ：サンプルの個数
ｘ⁽ⁱ⁾：ｉ番目のサンプルで作成した画像データを入力した時の判別器１３の出力
ｙ⁽ⁱ⁾：ｉ番目のサンプルの真の活性の有無 Specifically, the classifier learning unit 14 backpropagates an error E calculated by a weighted cross-entropy function using a weighting factor W _pos as represented by the following equation (1): , to learn the classifier 13 .

W _pos = (number of non-active samples)/(number of active samples)
n: Number of samples x ⁽ⁱ⁾ : Output of discriminator 13 when inputting image data created with i-th sample y ⁽ⁱ⁾ : Presence or absence of true activity in i-th sample

総合判定部１５は、活性が未知の蛋白質・リガンド結合について、画像生成部１１で生成した複数の画像の１枚１枚を判別器１３に入力して得られる出力を、画像生成部１１で生成した全ての画像について集計した値に基づいて、当該蛋白質・リガンド結合の活性の有無を判定する。総合判定部１５は、総合判定手段として機能する。 Comprehensive determination unit 15 generates an output obtained by inputting each of the plurality of images generated by image generation unit 11 to discriminator 13 for a protein/ligand binding whose activity is unknown, in image generation unit 11 The presence or absence of the protein/ligand binding activity is determined based on the total values for all the images obtained. The comprehensive determination unit 15 functions as comprehensive determination means.

以上、判定装置１００の機能構成について説明した。次に、判定装置１００が行う学習処理について、図５を参照して説明する。学習処理は、操作入力部３３を介して、ユーザにより、判定装置１００に対して学習処理の開始が指示されると開始される。 The functional configuration of the determination device 100 has been described above. Next, the learning process performed by the determination device 100 will be described with reference to FIG. The learning process is started when the user instructs the determination device 100 to start the learning process via the operation input unit 33 .

まず、制御部１０は、活性ＤＢに登録されている蛋白質とリガンドとから、学習データとして用いたいもの（蛋白質とリガンドのペア）を任意の数抽出し、それぞれについてドッキングシミュレーションを行って、抽出した蛋白質とリガンドのペアの数の蛋白質・リガンド結合のドッキング構造を取得する（ステップＳ１０１）。 First, from the proteins and ligands registered in the activity DB, the control unit 10 extracts an arbitrary number of data (pairs of proteins and ligands) to be used as learning data, and performs docking simulations for each to extract. A docking structure of protein/ligand binding is acquired for the number of protein-ligand pairs (step S101).

次に、制御部１０は、ステップＳ１０１で得られた蛋白質・リガンド結合のドッキング構造のうち、学習用データとして用いたいデータを抽出する（ステップＳ１０２）。ここでは、ステップＳ１０１で取得した全てのドッキング構造を学習用データとしてもよいし、一部を学習用データとし、残りを評価用データとしてもよい。 Next, the control unit 10 extracts data desired to be used as learning data from among the docking structures of protein-ligand binding obtained in step S101 (step S102). Here, all the docking structures acquired in step S101 may be used as learning data, or part of them may be used as learning data and the rest may be used as evaluation data.

次に、画像生成部１１は、学習用データとして抽出されたドッキング構造から３Ｄイメージを作成し、これを様々な角度から全周網羅的に撮影（２次元平面に投影）した学習用画像の集合（学習用の画像セット）を生成する（ステップＳ１０３）。ステップＳ１０３は、画像生成ステップと呼ばれる。ステップＳ１０３の処理（画像生成処理）の詳細については、後述する。 Next, the image generation unit 11 creates a 3D image from the docking structure extracted as learning data, and captures the entire circumference comprehensively from various angles (projection onto a two-dimensional plane) to create a set of learning images. (image set for learning) is generated (step S103). Step S103 is called an image generation step. Details of the processing (image generation processing) in step S103 will be described later.

次に、活性取得部１２は、ステップＳ１０３で生成した各画像に対応する（当該画像に写っている）蛋白質及びリガンドについて活性ＤＢを参照して蛋白質・リガンド結合の活性の有無を取得する（ステップＳ１０４）。ステップＳ１０４は性質取得ステップと呼ばれる。 Next, the activity acquisition unit 12 refers to the activity DB for the protein and ligand corresponding to each image generated in step S103 (shown in the image) and acquires the presence or absence of protein/ligand binding activity (step S104). Step S104 is called a property acquisition step.

次に、判別器学習部１４は、当該画像と活性の有無とからなる教師データを生成する（ステップＳ１０５）。この時、ステップＳ１０３で生成した各画像をランダムに０°、９０°、１８０°、２７０°と回転させたものを用いて教師データを生成してもよい。そして、判別器学習部１４は、ステップＳ１０５で生成した教師データを用いて、判別器１３を学習させ（ステップＳ１０６）、学習処理を終了する。ステップＳ１０６は学習ステップと呼ばれる。 Next, the discriminator learning unit 14 generates teacher data consisting of the image and presence/absence of activity (step S105). At this time, each image generated in step S103 may be randomly rotated by 0°, 90°, 180°, and 270° to generate teacher data. Then, the discriminator learning unit 14 causes the discriminator 13 to learn using the teacher data generated in step S105 (step S106), and ends the learning process. Step S106 is called a learning step.

次に、ステップＳ１０３で行われる画像生成処理について図６を参照して説明する。画像生成処理は、Ｎ_θ及びＮ_φを引数にとる。これらは、撮影する角度（２次元平面に投影する際の視点の角度）を、図３に示すθの方向にＮ_θ分割、φの方向にＮ_φ分割することを意味し、画像生成処理では、合計Ｎ_θ×Ｎ_φ枚の画像が生成される。 Next, the image generation processing performed in step S103 will be described with reference to FIG. The image generation process takes arguments N _θ and N _φ . These means dividing the shooting angle (viewpoint angle when projecting onto a two-dimensional plane) into N _θ in the direction of θ and N _φ in the direction of φ shown in FIG. , a total of N _θ ×N _φ images are generated.

まず、画像生成部１１は、θ方向のインデックスを表す変数ｉを０に初期化する（ステップＳ２０１）。そして、角度θに（３６０°×ｉ）／Ｎ_θをセットする（ステップＳ２０２）。 First, the image generator 11 initializes a variable i representing an index in the θ direction to 0 (step S201). Then, the angle θ is set to (360°×i)/N _θ (step S202).

次に、画像生成部１１は、φ方向のインデックスを表す変数ｊを０に初期化する（ステップＳ２０３）。そして、角度φに（３６０°×ｊ）／Ｎ_φをセットする（ステップＳ２０４）。 Next, the image generator 11 initializes a variable j representing an index in the φ direction to 0 (step S203). Then, the angle φ is set to (360°×j)/N _φ (step S204).

そして、画像生成部１１は、図３に示すような（θ，φ）の方向からの視点でドッキング構造の３Ｄイメージを２次元平面に投影した画像を生成する（ステップＳ２０５）。そして、画像生成部１１は、変数ｊをインクリメントし（ステップＳ２０６）、変数ｊがＮ_φ未満であるか否かを判定する（ステップＳ２０７）。 Then, the image generator 11 generates an image by projecting the 3D image of the docking structure onto a two-dimensional plane from the viewpoint from the direction (θ, φ) as shown in FIG. 3 (step S205). The image generator 11 then increments the variable j (step S206) and determines whether the variable j is less than _Nφ (step S207).

変数ｊがＮ_φ未満であるなら（ステップＳ２０７；Ｙｅｓ）、ステップＳ２０４に戻る。変数ｊがＮ_φ以上であるなら（ステップＳ２０７；Ｎｏ）、画像生成部１１は、変数ｉをインクリメントし（ステップＳ２０８）、変数ｉがＮ_θ未満であるか否かを判定する（ステップＳ２０９）。 If the variable j is less than _Nφ (step S207; Yes), return to step S204. If the variable j is greater than or equal to _Nφ (step S207; No), the image generator 11 increments the variable i (step S208) and determines whether the variable i is less than _Nθ (step S209). .

変数ｉがＮ_θ未満であるなら（ステップＳ２０９；Ｙｅｓ）、ステップＳ２０２に戻る。変数ｉがＮ_θ以上であるなら（ステップＳ２０９；Ｎｏ）、画像生成処理を終了する。 If the variable i is less than _Nθ (step S209; Yes), the process returns to step S202. If the variable i is greater than or equal to _Nθ (step S209; No), the image generation process is terminated.

以上説明した学習処理（図５）及び画像生成処理（図６）により、判別器１３は学習され、判別器１３に活性が未知の蛋白質・リガンド結合の画像を入力すると、その蛋白質・リガンド結合の活性の有無を出力するようになる。判定装置１００では、上述したように、蛋白質・リガンド結合のドッキング構造について全周（３６０°）網羅的に様々な視点からの画像を生成し、この全周網羅的画像のそれぞれを用いて判別器１３を学習させる。したがって、学習後の判別器１３のＣＮＮは、ドッキング構造の立体的な特徴が抽出された学習モデルとなる。 Through the learning process (FIG. 5) and the image generation process (FIG. 6) described above, the discriminator 13 is learned. It will output whether it is active or not. As described above, the determination device 100 comprehensively generates images from various viewpoints over the entire circumference (360°) of the docking structure of the protein/ligand binding, and uses each of these comprehensive images to determine the classifier 13 is learned. Therefore, the CNN of the discriminator 13 after learning becomes a learning model from which the three-dimensional features of the docking structure are extracted.

次に、このようにして得られた判別器１３を用いて活性が未知の蛋白質・リガンド結合の判定を行う判定処理について、図７を参照して説明する。判定処理は、操作入力部３３を介して、ユーザにより、判定装置１００に対して判定処理の開始が指示されると開始される。この判定処理の開始の指示の際、ユーザは、判定する蛋白質の種類及びリガンドの種類を判定装置１００に入力する。 Next, determination processing for determining protein-ligand binding whose activity is unknown using the discriminator 13 thus obtained will be described with reference to FIG. The determination process is started when the user instructs the determination apparatus 100 to start the determination process via the operation input unit 33 . When instructing the start of the determination process, the user inputs the type of protein and the type of ligand to be determined to determination device 100 .

まず、制御部１０は、ユーザから入力された蛋白質とリガンドについてドッキングシミュレーションを行って、蛋白質・リガンド結合のドッキング構造を取得する（ステップＳ３０１）。 First, the control unit 10 performs a docking simulation for a protein and a ligand input by a user, and acquires a docked structure of protein/ligand binding (step S301).

次に、画像生成部１１は、ステップＳ３０１で取得されたドッキング構造から３Ｄイメージを作成し、これを様々な角度から全周網羅的に撮影（２次元平面に投影）したＮ_θ×Ｎ_φ枚の判定用画像の集合（判定用の画像セット）を生成する（ステップＳ３０２）。この処理は前述の画像生成処理（図６）と同じであり、ステップＳ３０２も画像生成ステップと呼ばれる。 Next, the image generating unit 11 creates a 3D image from the docking structure acquired in step S301, and captures N _θ ×N _φ images comprehensively from various angles (projecting onto a two-dimensional plane). A set of determination images (image set for determination) is generated (step S302). This process is the same as the above-described image generation process (FIG. 6), and step S302 is also called an image generation step.

次に、総合判定部１５は、ステップＳ３０２で生成された判定用画像のそれぞれを判別器１３に入力し、ＣＮＮの最終のアベレージプーリング層の直前の特徴マップ１１５を判定用画像の枚数分、取得する（ステップＳ３０３）。ステップＳ３０３は判別ステップと呼ばれる。 Next, the comprehensive determination unit 15 inputs each of the determination images generated in step S302 to the discriminator 13, and acquires the feature map 115 immediately before the final average pooling layer of the CNN for the number of determination images. (step S303). Step S303 is called a determination step.

そして、総合判定部１５は、図８に示すように、ステップ３０３で得られた特徴マップ１１５を全て用いて総合特徴マップ１１７を生成し、その総合特徴マップ１１７にアベレージプーリングを行う（ステップＳ３０４）。この処理は、通常の（２次元の）アベレージプーリングを、全周網羅的に撮影（２次元平面に投影）したＮ_θ×Ｎ_φ枚の画像の方向にも行う処理（次元が１つ追加されるアベレージプーリング処理）であるので、３Ｄａｖｅｒａｇｅｐｏｏｌｉｎｇ処理という。 Then, as shown in FIG. 8, the comprehensive determination unit 15 generates a comprehensive feature map 117 using all the feature maps 115 obtained in step 303, and performs average pooling on the comprehensive feature map 117 (step S304). . This process performs normal (two-dimensional) average pooling in the direction of N _θ ×N _φ images captured (projected onto a two-dimensional plane) covering the entire periphery (one dimension is added). 3D average pooling processing).

そして、総合判定部１５は、３Ｄａｖｅｒａｇｅｐｏｏｌｉｎｇ処理後の出力層１１９からの出力に基づいて、蛋白質・リガンド結合の活性の有無を判定し（ステップＳ３０５）、判定処理を終了する。ステップＳ３０５は、総合判定ステップと呼ばれる。 Then, based on the output from the output layer 119 after the 3D average pooling process, the comprehensive determination unit 15 determines the presence or absence of protein-ligand binding activity (step S305), and ends the determination process. Step S305 is called a comprehensive determination step.

３Ｄａｖｅｒａｇｅｐｏｏｌｉｎｇ処理について、図８を参照して補足説明する。まず、ステップＳ３０２で、判定用画像がＮ_θ×Ｎ_φ枚が得られるが、ｎ＝Ｎ_θ×Ｎ_φとすると、ステップＳ３０３で判定用画像のそれぞれ（ｎ枚）を判別器１３の入力層１１１に入力する。すると、判別器１３のＣＮＮの内部で、最終のアベレージプーリング層の直前の特徴マップ１１５がｎ枚得られる。ただし、これは同時並行に行う必要はなく、１つの判別器１３（ＣＮＮ）を順次ｎ回使って、特徴マップ１１５をｎ枚得ればよい。 A supplementary description of the 3D average pooling process will be given with reference to FIG. First, in step S302, N _θ _× N _φ judgment images are obtained _. Enter 111. Then, n feature maps 115 immediately before the final average pooling layer are obtained inside the CNN of the discriminator 13 . However, this does not have to be done in parallel, and one discriminator 13 (CNN) may be sequentially used n times to obtain n feature maps 115 .

特徴マップ１１５のそれぞれは複数（図８では２０４８）のチャネルを持つが、ｎ枚の特徴マップ１１５のそれぞれの第ｍ番目のチャネルを集めて平均したものを、第ｍ番目のチャネルに割り当てて総合特徴マップ１１７を算出する。そして、このように得られた総合特徴マップ１１７をアベレージプーリング処理して特徴マップ１１６を得る。これが３Ｄａｖｅｒａｇｅｐｏｏｌｉｎｇ処理である。そして、特徴マップ１１６から、全結合接続１２５を経て出力層１１９の出力を得る。 Each feature map 115 has a plurality of channels (2048 in FIG. 8). A feature map 117 is calculated. A feature map 116 is obtained by performing an average pooling process on the integrated feature map 117 thus obtained. This is the 3D average pooling process. Then, from the feature map 116 , the output of the output layer 119 is obtained via the fully connected connection 125 .

このように処理することにより、単一の画像を判別器１３に入力して得られる出力（判別器１３による判別結果）よりも、格段に判定精度の良い判定結果を得ることができる。３Ｄａｖｅｒａｇｅｐｏｏｌｉｎｇ処理によって得られる特徴マップ１１６は、総合特徴マップ１１７に基づくものであり、また、出力層１１９からの出力（判別結果）に直接影響を与える１次元ベクトルなので、総合判別ベクトルと呼ぶことにする。 By performing such processing, it is possible to obtain determination results with significantly higher determination accuracy than the output obtained by inputting a single image to the discriminator 13 (discrimination results by the discriminator 13). The feature map 116 obtained by the 3D average pooling process is based on the comprehensive feature map 117 and is a one-dimensional vector that directly affects the output (discrimination result) from the output layer 119, so it is called a comprehensive discriminant vector. to

実際に実験した結果を以下に示す。この実験では、ドッキングシミュレーションにＧｌｉｄｅ、ドッキング構造の画像生成にＰｙＭｏｌ、活性ＤＢにＤＵＤ－Ｅを用い、ＤＵＤ－ＥＤｉｖｅｒｓｅｓｕｂｓｅｔにおける８種類の蛋白質を実験対象にした。そして、学習処理（図５）の学習用データ抽出（ステップＳ１０２）においては、ステップＳ１０１で取得したドッキング構造のうちの７０％を学習用データとして抽出し、残りの３０％を評価用データとした。また、画像の入力サイズは２２４とし、判別器１３のＣＮＮとしてはＲｅｓＮｅｔ－５０を用いた。そして、学習時のバッチ数は１２８とし、バッチごとにランダムに画像を０°、９０°、１８０°、２７０°と右回転させて学習を行った。 The results of actual experiments are shown below. In this experiment, Glide was used for the docking simulation, PyMol was used for image generation of the docking structure, and DUD-E was used for the active DB. Then, in the learning data extraction (step S102) of the learning process (FIG. 5), 70% of the docking structures acquired in step S101 are extracted as learning data, and the remaining 30% are used as evaluation data. . The image input size was 224, and ResNet-50 was used as the CNN of the discriminator 13 . The number of batches during learning was set to 128, and learning was performed by randomly rotating the image to the right by 0°, 90°, 180°, and 270° for each batch.

表１は、画像生成処理において、Ｎ_θ＝７、Ｎ_φ＝７として、４９枚の画像を生成した場合の判定装置１００と、Ｇｌｉｄｅとの判定結果の比較である。また、表２は、Ｎ_θ＝９、Ｎ_φ＝９として、８１枚の画像を生成した場合の判定装置１００と、Ｇｌｉｄｅとの判定結果の比較である。 Table 1 compares the determination results of the determination apparatus 100 and Glide when 49 images are generated with N _θ =7 and N _φ =7 in the image generation process. Table 2 compares the determination results of the determination apparatus 100 and Glide when 81 images are generated with N _θ =9 and N _φ =9.

なお、評価指標の「ＡＵＣ」は、ＲＯＣ（ＲｅｃｅｉｖｅｒＯｐｅｒａｔｉｎｇＣｈａｒａｃｔｅｒｉｓｔｉｃ）曲線下の面積であるＡＵＣ（ＡｒｅａＵｎｄｅｒｔｈｅｒｅｃｅｉｖｅｒｏｐｅｒａｔｏｒＣｕｒｖｅ）の値である。また、「ＥＦ１％」は、以下の式（２）で表される指標ＥＦ（ＥｎｒｉｃｈｍｅｎｔＦａｃｔｏｒ）である。これは、活性のあるリガンドを上位１％にどれだけ濃縮できたかを表しており、実際の創薬において重視されている指標である。
ＥＦ＝ｎａ／（ＮＡ×０．０１） …（２）
ｎａ：上位１％にランキングされた中で活性が有る蛋白質・リガンド結合の数
ＮＡ：実験対象の全ての蛋白質・リガンド結合のうち活性が有るものの数 The evaluation index “AUC” is the value of AUC (Area Under the Receiver Operator Curve), which is the area under the ROC (Receiver Operating Characteristic) curve. Also, "EF1%" is an index EF (Enrichment Factor) represented by the following formula (2). This indicates how much active ligands can be concentrated in the top 1%, and is an index that is emphasized in actual drug discovery.
EF=na/(NA×0.01) (2)
na: number of active protein-ligand bonds ranked in the top 1% NA: number of active protein-ligand bonds out of all experimental subjects

表１や表２を見れば、判定装置１００での判定精度が従来技術（Ｇｌｉｄｅ）よりも格段に優れていることが確認できる。これは、蛋白質・リガンド結合の立体構造を全周網羅した多量の画像で学習することができたこと、偏りのある学習データに対して重み付きクロスエントロピー関数を用いて均等に学習することが可能になったこと、判定時に全周網羅的に撮影（２次元平面に投影）した複数の画像を用いて３ＤＡｖｅｒａｇｅＰｏｏｌｉｎｇを用いることにより各々の画像学習結果を統合し、立体として把握することができたこと等によるものと考えられる。 From Tables 1 and 2, it can be confirmed that the determination accuracy of the determination device 100 is significantly superior to that of the conventional technology (Glide). This is because it was possible to learn with a large amount of images that cover the entire circumference of the three-dimensional structure of protein / ligand binding, and it is possible to learn evenly using a weighted cross-entropy function for biased learning data. By using 3D Average Pooling using multiple images captured comprehensively (projected on a two-dimensional plane) at the time of judgment, each image learning result can be integrated and grasped as a three-dimensional object. This is thought to be due to

（変形例１）
上述の実施形態１では、蛋白質・リガンド結合のドッキング構造から作成した３Ｄイメージを全周網羅的に撮影（２次元平面に投影）する際、３Ｄイメージ生成時の蛋白質のサーフェスを無しにして、どの角度からの視点で撮影（２次元平面に投影）してもリガンドが確認できるようにしていた。しかし、実際には蛋白質にはサーフェスが存在し、リガンドはサーフェスのない部分からしか確認できない。そこで、３Ｄイメージ生成時の蛋白質のサーフェスを有りとして、サーフェスのない側から複数の画像を生成する変形例１について説明する。 (Modification 1)
In the above-described Embodiment 1, when a 3D image created from the docking structure of protein-ligand binding is comprehensively photographed (projected onto a two-dimensional plane), the surface of the protein at the time of 3D image generation is eliminated, and which The ligand can be confirmed even if it is photographed from an angular viewpoint (projected onto a two-dimensional plane). However, in reality, proteins have surfaces, and ligands can only be identified from the parts without surfaces. Therefore, a modified example 1 will be described in which a plurality of images are generated from the side without a surface, assuming that the surface of the protein is present when the 3D image is generated.

変形例１の判定装置１００では、画像生成部１１は、図９に示すように、蛋白質２１１のサーフェスの無い部分のリガンド２１２を正面に見て、ドッキング構造２１３を撮影（２次元平面に投影）し、さらにその周囲からθ、２θのように角度をつけて図９のｘやｏで示すような点からリガンド２１２の方向を視点として撮影（２次元平面に投影）した画像を生成する。変形例１の判定装置１００における学習処理及び判定処理は、実施形態１の判定装置１００における学習処理（図５）及び判定処理（図７）と基本的には同じであるが、これらの処理から呼び出される画像生成処理が異なるため、この処理について、図１０を参照して説明する。 In the determination device 100 of Modification 1, the image generation unit 11 views the ligand 212 in the surface-free portion of the protein 211 from the front, and photographs the docking structure 213 (projected onto a two-dimensional plane), as shown in FIG. Then, an image is generated by photographing (projecting onto a two-dimensional plane) from points such as x and o in FIG. The learning process and the determination process in the determination device 100 of Modification 1 are basically the same as the learning process (FIG. 5) and the determination process (FIG. 7) in the determination device 100 of the first embodiment. Since the image generation process called is different, this process will be described with reference to FIG.

変形例１の画像生成処理は、Ｎを引数にとる。Ｎは、撮影（２次元平面に投影）する角度を、図９に示すφの方向にＮ分割することを意味する。変形例１の画像生成処理では、正面からの画像と、正面の周囲でθの角度でＮ枚の画像と、２θの角度でＮ枚の画像を生成するので、合計（１＋２×Ｎ）枚の画像が生成される。 The image generation process of Modification 1 takes N as an argument. N means that the angle for photographing (projection onto a two-dimensional plane) is divided by N in the direction of φ shown in FIG. 9 . In the image generation process of Modification 1, an image from the front, N images around the front at an angle of θ, and N images at an angle of 2θ are generated. An image is generated.

まず、画像生成部１１は、図９に示すように、リガンド２１２が蛋白質２１１のサーフェスに隠れていない正面からドッキング構造の３Ｄイメージを２次元平面に投影した画像を生成する（ステップＳ２２１）。そして、画像生成部１１は、φ方向のインデックスを表す変数ｉを０に初期化する（ステップＳ２２２）。そして、角度φに（３６０°×ｉ）／Ｎをセットする（ステップＳ２２３）。 First, as shown in FIG. 9, the image generator 11 generates an image by projecting a 3D image of the docking structure onto a two-dimensional plane from the front where the ligand 212 is not hidden by the surface of the protein 211 (step S221). The image generator 11 then initializes the variable i representing the index in the φ direction to 0 (step S222). Then, the angle φ is set to (360°×i)/N (step S223).

次に、画像生成部１１は、図９にｘで示されるように、正面からθずれた円周上のφの方向からの視点でドッキング構造の３Ｄイメージを２次元平面に投影した画像を生成する（ステップＳ２２４）。そして、画像生成部１１は、図９にｏで示されるように、正面から２θずれた円周上のφの方向からの視点でドッキング構造の３Ｄイメージを２次元平面に投影した画像を生成する（ステップＳ２２５）。 Next, as indicated by x in FIG. 9, the image generating unit 11 generates an image by projecting the 3D image of the docking structure onto a two-dimensional plane from the viewpoint from the direction of φ on the circumference that is θ shifted from the front. (step S224). Then, as indicated by o in FIG. 9, the image generator 11 generates an image by projecting the 3D image of the docking structure onto a two-dimensional plane from the viewpoint from the direction of φ on the circumference that is shifted by 2θ from the front. (Step S225).

次に、画像生成部１１は、変数ｉをインクリメントし（ステップＳ２２６）、変数ｉがＮ未満であるか否かを判定する（ステップＳ２２７）。そして、変数ｉがＮ未満であるなら（ステップＳ２２７；Ｙｅｓ）、ステップＳ２２３に戻る。変数ｉがＮ以上であるなら（ステップＳ２２７；Ｎｏ）、画像生成処理を終了する。 Next, the image generator 11 increments the variable i (step S226) and determines whether the variable i is less than N (step S227). Then, if the variable i is less than N (step S227; Yes), the process returns to step S223. If the variable i is greater than or equal to N (step S227; No), the image generation process is terminated.

以上のように、変形例１では、蛋白質２１１のサーフェスの無い部分からリガンド２１２を含むドッキング構造２１３を撮影（２次元平面に投影）した画像を生成するので、サーフェス有りの画像からでも蛋白質・リガンド結合の立体構造を把握できる画像を生成することができる。 As described above, in Modification 1, an image of the docking structure 213 including the ligand 212 is captured (projected onto a two-dimensional plane) from a portion of the protein 211 without a surface. It is possible to generate an image from which the three-dimensional structure of the bond can be grasped.

（変形例２）
上述の実施形態１及び変形例１では、判定装置１００が、学習処理と判定処理の双方を行っているが、判定装置１００はこれに限定されない。例えば、判定装置１００は、判定処理を行わないが、学習処理を行って判別器１３を学習させる判別器学習装置であってもよい。また、判定装置１００は、学習処理を行わないが、他の判定装置１００によって学習された判別器１３を用いて判定処理を行う判定装置であってもよい。学習処理は、大量の学習用画像データを作成してディープラーニングを行う必要がある等、スーパーコンピュータでないと実施が難しい面がある。しかし、判定処理だけであれば、学習済みの判別器１３を用いれば、判定用画像データを作成するだけで判定が可能であるため、通常のＰＣ（ＰｅｒｓｏｎａｌＣｏｍｐｕｔｅｒ）等のコンピュータによっても実施することができる。 (Modification 2)
In Embodiment 1 and Modification 1 described above, the determination device 100 performs both the learning process and the determination process, but the determination device 100 is not limited to this. For example, the determination device 100 may be a discriminator learning device that does not perform determination processing but performs learning processing to train the discriminator 13 . Further, the determination device 100 may be a determination device that does not perform learning processing, but performs determination processing using the discriminator 13 that has been learned by another determination device 100 . The learning process is difficult to implement without a supercomputer, such as the need to create a large amount of image data for learning and perform deep learning. However, if only the determination process is performed, it is possible to perform the determination simply by creating image data for determination using the discriminator 13 that has already been trained. can be done.

（変形例３）
上述の実施形態１及び変形例では、判定処理（図７）において、ステップＳ３０４で３Ｄａｖｅｒａｇｅｐｏｏｌｉｎｇ処理を行っているが、この処理は必須ではない。例えば、ステップＳ３０３では、総合判定部１５は、ステップＳ３０２で生成された判定用画像のそれぞれを判別器１３に入力して判別器１３の出力を判定用画像の枚数分生成し、ステップＳ３０４をスキップしてもよい。この場合、ステップＳ３０５では、総合判定部１５は、ステップＳ３０３で取得した判別器１３の出力（判定用画像の枚数分存在する）を平均したものに基づいて、蛋白質・リガンド結合の活性の有無を判定すればよい。 (Modification 3)
In the first embodiment and the modification described above, the 3D average pooling process is performed in step S304 in the determination process (FIG. 7), but this process is not essential. For example, in step S303, the comprehensive determination unit 15 inputs each of the determination images generated in step S302 to the discriminator 13, generates the output of the discriminator 13 for the number of determination images, and skips step S304. You may In this case, in step S305, the comprehensive determination unit 15 determines the presence or absence of protein/ligand binding activity based on the average output of the discriminator 13 acquired in step S303 (existing for the number of images for determination). Judge.

（その他の変形例）
上述の実施形態１及び変形例では、ドッキング構造の画像を生成する際の視点の角度を一定の角度間隔で変化させていたが、これに限られない。例えば、実施形態１ではφが０°や１８０°（地球で言うと赤道に対応する部分）付近においては、θ方向の分割数を多くし、φが９０°や２７０°（地球で言うと極に対応する部分）付近においては、θ方向の分割数を少なくしてもよい。 (Other modifications)
In the first embodiment and the modification described above, the angle of the viewpoint is changed at regular angular intervals when generating the image of the docking structure, but the present invention is not limited to this. For example, in Embodiment 1, the number of divisions in the θ direction is increased near φ of 0° or 180° (the portion corresponding to the equator on the earth), and ), the number of divisions in the θ direction may be reduced.

また、上述の実施形態及び変形例において、判別器１３のＣＮＮに入力する画像は、入力サイズが２２４×２２４ピクセルで、各ピクセルがＲＧＢ３チャネルのカラー画像として説明したが、これは一例である。入力サイズをもっと大きい値（例えば４４８×４４８ピクセル）にしてもよいし、逆にもっと小さい値（例えば１１２×１１２ピクセル）にしてもよい。また、縦横のピクセル数は同じである必要はない（例えば１９２０×１０８０ピクセル等）。また、カラー画像である必要もなく、白黒画像でもよい。白黒画像の場合は各ピクセルとも１チャネルの情報となるため、図４に示す畳み込みフィルタ１２１は１チャネルのフィルタ（例えば７×７×１ｃｈ）となる。 Also, in the above-described embodiment and modification, the image input to the CNN of the discriminator 13 has an input size of 224×224 pixels, and each pixel is a color image with RGB 3 channels, but this is just an example. The input size may be set to a larger value (eg, 448×448 pixels) or a smaller value (eg, 112×112 pixels). Also, the number of pixels vertically and horizontally need not be the same (for example, 1920×1080 pixels). Also, the image need not be a color image and may be a black and white image. In the case of a black-and-white image, each pixel is one-channel information, so the convolution filter 121 shown in FIG. 4 is a one-channel filter (for example, 7×7×1ch).

また、上述の実施形態１及び変形例は適宜組み合わせることができる。例えば、実施形態１と変形例１を組み合わせた場合、画像生成部１１は、蛋白質２１１とリガンド２１２のドッキング構造２１３をサーフェス無しで全周（３６０°）網羅的な視点で生成した画像と、サーフェス有りでサーフェスの無い側からの視点で生成した画像と、をそれぞれ生成する。そして、判別器学習部１４は、サーフェス無しの画像とサーフェス有りの画像を両方用いて判別器１３を学習させ、総合判定部１５は、サーフェス無しの画像とサーフェス有りの画像を両方用いて判定する。このようにすることで、サーフェス有りの場合の特徴とサーフェス無しの場合の特徴とを両方とも用いた判定が行えるようになる。 Moreover, the first embodiment and the modification described above can be combined as appropriate. For example, when Embodiment 1 and Modification 1 are combined, the image generation unit 11 generates an image of the docking structure 213 of the protein 211 and the ligand 212 from a comprehensive viewpoint without a surface (360°) and a surface An image generated from the viewpoint from the side with and without the surface is generated, respectively. Then, the classifier learning unit 14 learns the classifier 13 using both the image without the surface and the image with the surface, and the comprehensive judgment unit 15 judges using both the image without the surface and the image with the surface . By doing so, it becomes possible to make a determination using both the features in the presence of a surface and the features in the absence of a surface.

また、変形例１においては、θと２θというように２倍の関係の角度からの視点としているが、この２つの角度は全く無関係の角度でもよい。また、この角度は２つに限るわけではなく、正面の周囲の３以上の角度からの視点としてもよい。例えば、θ＝２０°、３０°、５５°、７０°の４つのθについて、正面からそれぞれθずれた円周上のφの方向からの視点でドッキング構造の３Ｄイメージを２次元平面に投影した画像を生成することにしてもよい。また、φ方向の分割数Ｎについても、θ毎に異なる値にしてもよい。 Also, in Modification 1, the viewpoint is from an angle that has a two-fold relationship, such as θ and 2θ, but these two angles may be completely unrelated angles. Also, the number of angles is not limited to two, and viewpoints from three or more angles around the front may be used. For example, for θ=20°, 30°, 55°, and 70°, the 3D image of the docking structure is projected onto a two-dimensional plane from the viewpoint from the direction of φ on the circumference that is shifted by θ from the front. An image may be generated. Also, the division number N in the φ direction may be set to a different value for each θ.

また、上述の実施形態及び変形例では、蛋白質・リガンド結合の活性の有無の判定を行う判定装置１００を例に挙げて説明したが、判定装置１００が判定する対象は蛋白質・リガンド結合の活性の有無に限られるわけではない。判定装置１００は、他の学習データを用いることにより、学習データに応じた他の判定を行うことも可能である。例えば、蛋白質同士の結合の活性の有無についてのデータと、当該蛋白質同士の結合のドッキング構造のデータ（ＰｙＭＯＬ等の分子グラフィックツールにより生成）と、を学習データとして用いることにより、判定装置１００は、蛋白質・蛋白質結合の活性の有無の判定を行うことができる。その他、任意の生体分子と当該生体分子に結合する物質との結合の活性の有無についてのデータと、当該生体分子と当該物質の結合のドッキング構造のデータと、を学習データとして用意できるなら、判定装置１００は、そのような任意の生体分子・物質結合の活性の有無の判定を行うことができる。 Further, in the above embodiments and modifications, the determination device 100 for determining the presence or absence of protein/ligand binding activity has been described as an example. It is not limited to presence or absence. The determination device 100 can also perform other determinations according to the learning data by using other learning data. For example, by using data on the presence or absence of protein-protein binding activity and data on the docking structure of the protein-protein binding (generated by a molecular graphics tool such as PyMOL) as learning data, the determination device 100 can: The presence or absence of protein-protein binding activity can be determined. In addition, if data on the presence or absence of binding activity between an arbitrary biomolecule and a substance that binds to the biomolecule, and data on the docking structure of the binding between the biomolecule and the substance, can be prepared as learning data. The device 100 can determine the presence or absence of such any biomolecule-substance binding activity.

また、判定装置１００が判定する対象は、結合の活性の有無に限られるわけではない。例えば、任意の第１の物質と当該第１の物質に結合する第２の物質との結合に関する何らかの性質（所定の性質）についてのデータと、当該第１の物質と当該第２の物質の結合のドッキング構造のデータ（ＰｙＭＯＬ等の分子グラフィックツールにより生成）と、を学習データとして用意できるなら、判定装置１００は、そのような任意の第１の物質と第２の物質の結合に関する当該性質の有無についての判定を行うことができる。 Further, the object to be judged by the judging device 100 is not limited to the presence or absence of binding activity. For example, data on some properties (predetermined properties) relating to binding between an arbitrary first substance and a second substance that binds to the first substance, and binding between the first substance and the second substance docking structure data (generated by a molecular graphics tool such as PyMOL) and the like can be prepared as learning data, the determination device 100 can determine the property related to the binding of such arbitrary first substance and second substance A determination of presence or absence can be made.

なお、上述の実施形態１及び変形例では、ＣＮＮによる判別器１３を実現するプログラムを制御部１０が実行することにより、制御部１０は判別器１３としても機能することとしていたが、これに限られない。判定装置１００は、制御部１０とは別に（例えば、ＧＰＵ（ＧｒａｐｈｉｃｓＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）や、専用のＩＣ（ＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ）等の）判別器１３の機能を実現するデバイスを備えてもよい。 In the first embodiment and the modification described above, the control unit 10 executes a program that implements the discriminator 13 by CNN, so that the control unit 10 also functions as the discriminator 13. However, this is not the only option. can't The determination device 100 may include a device (for example, a GPU (Graphics Processing Unit), a dedicated IC (Integrated Circuit), etc.) that implements the function of the discriminator 13 , separately from the control unit 10 .

また、判別器１３はＣＮＮ以外（例えば、ＲＮＮ（ＲｅｃｕｒｒｅｎｔＮｅｕｒａｌＮｅｔｗｏｒｋ）等）のニューラルネットワークを用いた判別器であってもよい。また、判定処理（図７）において３Ｄａｖｅｒａｇｅｐｏｏｌｉｎｇ処理を行わないなら、判別器１３は、ＳＶＭ（ＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅ）等、ニューラルネット以外の判別器であってもよい。 Further, the discriminator 13 may be a discriminator using a neural network other than CNN (for example, RNN (Recurrent Neural Network), etc.). If 3D average pooling processing is not performed in the determination processing (FIG. 7), the discriminator 13 may be a discriminator other than a neural network, such as an SVM (Support Vector Machine).

なお、判定装置１００の判定処理は、通常のＰＣ等のコンピュータによっても実施することができる。また、将来的には学習処理もスーパーコンピュータによらず、通常のＰＣ等のコンピュータによっても実施することができるようになると考えられる。具体的には、上記実施形態では、判定装置１００が行う学習処理及び判定処理のプログラムが、記憶部２０のＲＯＭに予め記憶されているものとして説明した。しかし、プログラムを、フレキシブルディスク、ＣＤ－ＲＯＭ（ＣｏｍｐａｃｔＤｉｓｃＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、ＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｃ）、ＭＯ（Ｍａｇｎｅｔｏ－ＯｐｔｉｃａｌＤｉｓｃ）、メモリカード、ＵＳＢ（ＵｎｉｖｅｒｓａｌＳｅｒｉａｌＢｕｓ）メモリ等のコンピュータ読み取り可能な記録媒体に格納して配布し、そのプログラムをコンピュータに読み込んでインストールすることにより、上述の各機能を実現することができるコンピュータを構成してもよい。 Note that the determination processing of the determination device 100 can also be performed by a computer such as a normal PC. Also, in the future, it is conceivable that the learning process can be performed not by a supercomputer but also by a computer such as a normal PC. Specifically, in the above embodiment, the programs for the learning process and the determination process performed by the determination device 100 are pre-stored in the ROM of the storage unit 20 . However, the program may be stored in a computer-readable storage medium such as a flexible disk, CD-ROM (Compact Disc Read Only Memory), DVD (Digital Versatile Disc), MO (Magneto-Optical Disc), memory card, USB (Universal Serial Bus) memory, etc. By storing and distributing the program in a recording medium, and reading and installing the program in the computer, a computer capable of realizing each of the functions described above may be configured.

以上、本発明の好ましい実施形態について説明したが、本発明は係る特定の実施形態に限定されるものではなく、本発明には、特許請求の範囲に記載された発明とその均等の範囲が含まれる。 Although the preferred embodiments of the present invention have been described above, the present invention is not limited to such specific embodiments, and the present invention includes the invention described in the claims and their equivalents. be

１０…制御部、１１…画像生成部、１２…活性取得部、１３…判別器、１４…判別器学習部、１５…総合判定部、２０…記憶部、３１…出力部、３２…通信部、３３…操作入力部、１００…判定装置、１１１…入力層、１１２，１１３，１１４，１１５，１１６…特徴マップ、１１７…総合特徴マップ、１１９…出力層、１２１，１２３…畳み込みフィルタ、１２２，１２４…プーリングウィンドウ、１２５…全結合接続、２１１…蛋白質、２１２…リガンド、２１３ドッキング構造、３１１…カメラ DESCRIPTION OF SYMBOLS 10... Control part, 11... Image generation part, 12... Activity acquisition part, 13... Discriminator, 14... Discriminator learning part, 15... Comprehensive determination part, 20... Storage part, 31... Output part, 32... Communication part, 33 Operation input unit 100 Determination device 111 Input layer 112, 113, 114, 115, 116 Feature map 117 Comprehensive feature map 119 Output layer 121, 123 Convolution filter 122, 124 ... pooling window, 125 ... all binding connections, 211 ... protein, 212 ... ligand, 213 docking structure, 311 ... camera

Claims

image generating means for generating an image set including a plurality of images obtained by projecting a three-dimensional structure of a bond of a first substance and a second substance onto a two-dimensional plane from a plurality of viewpoints around the three-dimensional structure; ,
a discriminator that, when inputting one image included in the image set, discriminates whether or not the bond associated with the three-dimensional structure projected on the one image has a predetermined property;
A value obtained by inputting each of a plurality of images included in the image set to the discriminator, based on a total value for all images included in the image set, to an image included in the image set Comprehensive judgment means for judging whether or not the bond associated with the projected three-dimensional structure has the property;
A three-dimensional structure determination device.

The image generating means generates an image set including a plurality of images obtained by projecting the three-dimensional structure of the binding of the protein and the ligand onto a two-dimensional plane from a plurality of viewpoints around the three-dimensional structure,
The discriminator, when inputting one image included in the image set, discriminates the presence or absence of activity of the binding related to the three-dimensional structure projected on the one image,
The overall determination means determines the image based on the values obtained by inputting each of the plurality of images included in the image set into the discriminator, based on the values obtained by totaling all the images included in the image set. Determining the presence or absence of activity of the bond associated with the three-dimensional structure projected on the image included in the set;
The three-dimensional structure determination device according to claim 1.

The image generating means generates an image set including a plurality of images obtained by projecting the three-dimensional structure of the binding of the protein and the ligand onto a two-dimensional plane from a plurality of viewpoints covering the entire periphery.
The three-dimensional structure determination device according to claim 2.

The image generation means generates a three-dimensional structure of the binding between the protein and the ligand with the surface of the protein present, and projects a plurality of images obtained by projecting onto a two-dimensional plane from a plurality of viewpoints from the side without the surface. generate an image set containing the images,
The three-dimensional structure determination device according to claim 2 or 3.

The comprehensive determination means uses 3D Average Pooling to determine the presence or absence of the property of the bond related to the three-dimensional structure projected on the images included in the image set.
The three-dimensional structure determination device according to any one of claims 1 to 4.

an image generating step of generating an image set including a plurality of images obtained by projecting a three-dimensional structure of a bond of a first substance and a second substance onto a two-dimensional plane from a plurality of viewpoints around the three-dimensional structure; ,
a determination step of determining whether or not the bond associated with the three-dimensional structure projected onto one image included in the image set has a predetermined property;
In the determination step, the results of determining whether or not the property of the bond related to the three-dimensional structure projected on each of the plurality of images included in the image set exists are totaled for all the images included in the image set. a comprehensive determination step of determining the presence or absence of the property of the bond associated with the three-dimensional structure projected onto the images included in the image set based on the values;
A three-dimensional structure determination method comprising:

image generating means for generating an image set including a plurality of images obtained by projecting a three-dimensional structure of a bond of a first substance and a second substance onto a two-dimensional plane from a plurality of viewpoints around the three-dimensional structure; ,
a property acquiring means for acquiring the presence or absence of a predetermined property of said bond;
a discriminator for discriminating whether or not each of the combinations has the properties, using images included in the image set generated by the image generation means and the properties acquired by the property acquisition means as training data for each of the combinations; a learning means for learning;
A three-dimensional structure discriminator learning device comprising:

The image generating means generates an image set including a plurality of images obtained by projecting the three-dimensional structure of the binding of the protein and the ligand onto a two-dimensional plane from a plurality of viewpoints around the three-dimensional structure,
The property acquisition means acquires the presence or absence of activity of the bond,
For each of the bonds, the learning means determines whether or not the bond is active, using images included in the image set generated by the image generating means and the presence or absence of activity obtained by the property obtaining means as teacher data. train the discriminator,
8. The three-dimensional structure discriminator learning device according to claim 7.

The image generating means generates an image set including a plurality of images obtained by projecting the three-dimensional structure of the binding of the protein and the ligand onto a two-dimensional plane from a plurality of viewpoints covering the entire periphery.
9. The three-dimensional structure discriminator learning device according to claim 8.

The image generation means generates a three-dimensional structure of the binding between the protein and the ligand with the surface of the protein present, and projects a plurality of images obtained by projecting onto a two-dimensional plane from a plurality of viewpoints from the side without the surface. generate an image set containing the images,
10. The three-dimensional structure discriminator learning device according to claim 8 or 9.

The learning means learns the discriminator using a weighted cross-entropy function,
11. The three-dimensional structure discriminator learning device according to any one of claims 7 to 10.

an image generating step of generating an image set including a plurality of images obtained by projecting a three-dimensional structure of a bond of a first substance and a second substance onto a two-dimensional plane from a plurality of viewpoints around the three-dimensional structure; ,
a property obtaining step of obtaining the presence or absence of a predetermined property of the bond;
a discriminator for discriminating whether or not the combination has the property, using images included in the image set generated in the image generation step and the presence or absence of the property acquired in the property acquisition step for each of the combinations as training data; a learning step for learning;
A three-dimensional structure classifier training method comprising:

to the computer,
An image generation step of generating an image set including a plurality of images obtained by projecting a three-dimensional structure of a bond of a first substance and a second substance onto a two-dimensional plane from a plurality of viewpoints around the three-dimensional structure;
a determination step of determining whether or not the bond associated with the three-dimensional structure projected onto one image included in the image set has a predetermined property;
In the determination step, the results of determining whether or not the property of the bond related to the three-dimensional structure projected on each of the plurality of images included in the image set exists are totaled for all the images included in the image set. a comprehensive determination step of determining the presence or absence of the property of the bond associated with the three-dimensional structure projected onto the images included in the image set, based on the values;
program to run the

to the computer,
An image generation step of generating an image set including a plurality of images obtained by projecting a three-dimensional structure of a bond of a first substance and a second substance onto a two-dimensional plane from a plurality of viewpoints around the three-dimensional structure;
a property acquisition step of acquiring the presence or absence of a predetermined property of the bond; and
a discriminator for discriminating whether or not the combination has the property, using images included in the image set generated in the image generation step and the presence or absence of the property acquired in the property acquisition step for each of the combinations as training data; learning steps to learn,
program to run the