KR20140100190A - Apparatus and method for prediction of protein binding relationships - Google Patents
Apparatus and method for prediction of protein binding relationships Download PDFInfo
- Publication number
- KR20140100190A KR20140100190A KR1020130013183A KR20130013183A KR20140100190A KR 20140100190 A KR20140100190 A KR 20140100190A KR 1020130013183 A KR1020130013183 A KR 1020130013183A KR 20130013183 A KR20130013183 A KR 20130013183A KR 20140100190 A KR20140100190 A KR 20140100190A
- Authority
- KR
- South Korea
- Prior art keywords
- protein
- binding
- unit
- search
- predicting
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16C—COMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
- G16C20/00—Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
- G16C20/20—Identification of molecular entities, parts thereof or of chemical compositions
Landscapes
- Physics & Mathematics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Chemical & Material Sciences (AREA)
- Crystallography & Structural Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computing Systems (AREA)
- Theoretical Computer Science (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
And more particularly to an apparatus and method for predicting the actual binding and binding pattern of the proteins on a computer without directly binding specific proteins.
Protein-protein binding and protein-ligand binding are very important factors in protein function, drug side effects, drug selection, and so on.
In the past, the results could be obtained through actual binding between proteins in the laboratory, but this is a time-consuming task.
With advances in computer technology, it is now possible to predict the binding relationships between proteins in a PC, but it still does not take much time.
The method of predicting the binding relationship is to use a model of each protein atom to determine the position with the lowest entropy value at each binding site considering all involved forces such as hydrogen bonding, van der Waals force, attraction force, repulsive force, electrostatic force, And how to find the optimal complementary bond shape by shaping proteins into three-dimensional structural shapes.
According to one aspect, a first protein and a second protein used for binding are modeled and arranged in a three-dimensional space divided grid to form a first structure associated with the first protein and a second structure associated with the second protein And a search unit for searching for a binding relationship between the first structure and the second structure.
According to an embodiment, the protein binding relationship predicting apparatus may further include an input unit for receiving at least one item related to the first protein and the second protein.
In this case, the at least one item may be at least one of atomic name, atomic number, atomic coordinates and radius of the first protein and the second protein.
According to one embodiment, the arrangement may generate the first structure and the second structure using the at least one item input from the input unit.
According to an embodiment, the search unit may search for a relationship between the first structure and the second structure by calculating a correlation coefficient between the first structure and the second structure.
According to one embodiment, the search unit may change the position of the remaining one of the first structure and the second structure while fixing the position of one of the first structure and the second structure, And calculate a correlation coefficient between the second structures.
According to one embodiment, the search unit designates a value of a boundary part of the first structure and the second structure to be 1 when searching for a combination of the first structure and the second structure, The inner value of one of the second structures may be designated as -1 and the other inner value may be designated as +1.
According to one embodiment, the searching unit can search for the coupling relation using a plurality of GPUs (Graphic Processing Units).
According to another aspect, there is provided a method for modeling a first protein and a second protein used for binding, a modeling step of modeling each of the first protein and the second protein in a grid divided in a three-dimensional space, Comprising the steps of: generating a first structure associated with the first protein and a second structure associated with the second protein; and searching a combination relationship of the generated first structure and the second structure / RTI >
According to one embodiment, the method further comprises receiving at least one item associated with the first protein and the second protein, wherein the arranging step comprises: using the at least one item to construct the first structure and the second protein, 2 structure.
In this case, the at least one item may be at least one of atomic name, atomic number, atomic coordinates and radius of the first protein and the second protein.
According to an embodiment, the searching step may search for a relation of the first structure and the second structure by calculating a correlation coefficient between the first structure and the second structure.
According to one embodiment, the searching step may be performed while changing the position of the remaining one of the first structure and the second structure while fixing the position of one of the first structure and the second structure, And a correlation coefficient between the first structure and the second structure.
According to another aspect, a first protein and a second protein used for binding are modeled and arranged in a three-dimensional space divided grid to form a first structure associated with the first protein and a second structure associated with the second protein A calculation step of calculating a correlation coefficient between the first structure and the second structure, and a calculation step of calculating a correlation coefficient between the first structure and the second structure based on the calculation result, A method for predicting protein binding relationships is provided.
1 is a block diagram showing an apparatus for predicting a protein binding relationship according to an embodiment.
FIG. 2 is a diagram for explaining a process of modeling based on at least one item of an input protein according to an embodiment.
FIG. 3 is a view for explaining a process of voxelization of a protein structure according to an embodiment.
FIG. 4 is a diagram for explaining a protein binding relationship search process according to an embodiment.
5 is a flowchart illustrating a method of predicting a protein binding relationship according to an embodiment.
6 is a flowchart showing a method of predicting a protein binding relationship according to another embodiment.
In the following, some embodiments will be described in detail with reference to the accompanying drawings. However, it is not limited or limited by these embodiments. Like reference symbols in the drawings denote like elements.
Although the terms used in the following description have selected the general terms that are widely used in the present invention while considering the functions of the present invention, they may vary depending on the intention or custom of the artisan, the emergence of new technology, and the like.
Also, in certain cases, there may be terms chosen arbitrarily by the applicant for the sake of understanding and / or convenience of explanation, and in this case the meaning of the detailed description in the corresponding description section. Therefore, the term used in the following description should be understood based on the meaning of the term, not the name of a simple term, and the contents throughout the specification.
Throughout the specification, the first structure means a three-dimensional structure generated by arranging a first protein used for binding in a three-dimensional space corresponding to the coordinates of the first protein.
In addition, throughout the specification, the second structure represents the three-dimensional structure generated for the second protein to be bound to the first protein.
1 is a block diagram showing an
The protein binding
The protein
The
The
The
In this case, while the position of one of the first structure and the second structure is fixed, the
The
The searching
The protein binding
The
Here, the at least one item may be at least one of an atomic name, an atomic number, an atomic coordinate, and a radius of the first protein and the second protein.
In this case, the
FIG. 2 is a diagram illustrating a process of modeling based on at least one item of an input protein according to an embodiment.
The protein
In predicting the binding relationship between proteins, the protein
In order to generate a model of a protein used in the binding, the protein
Referring to FIG. 2, the at least one
The at least one
When the at least one
The generated protein model can be used for voxelization in which the protein is arranged in a three-dimensional structure, which can be performed as shown in FIG.
FIG. 3 is a view for explaining a process of voxelization of a protein structure according to an embodiment.
The protein binding
In FIG. 3, 310 and 320 are diagrams showing the structure of the protein in a planar structure in order to more easily explain the process of arranging the pattern on the grid.
The
When the protein model is generated in a three-dimensional structure, the protein model of FIG. 2 is mapped to the
The first structure and the second structure generated for the first protein and the second protein used for binding are used for searching the binding relationship between the proteins.
FIG. 4 is a diagram for explaining a protein binding relationship search process according to an embodiment.
The protein
In this process, the parts that are repeatedly performed, such as the FFT operation of the GPU, can be performed more quickly by parallel processing using MIC (Multi Intergrated Core) or a plurality of GPUs.
In the case of CPU (Central Processing Unit) technology, single-core, dual-core, quad-core and hexa-core are evolving day by day. Particularly, in the simple FFT operation processing, it is possible to use a GPU to show a speed improvement of several to several hundred times as compared with a single CPU. Also, according to the addition of GPU, you can multiply the number of times by that number (several hundreds of times) * You can expect speed increase by the number of GPUs.
FIG. 4 shows an embodiment for exploring a protein binding relationship using two GPUs and a dual core.
The protein binding
4, when the voxelization of the protein a of (1) is performed, the DFT (Discrete Fourier Transform) operation and (5) conjugation of the protein a and the protein b of (2) And the multiplication operation, (6) inverse FFT operation portions can be processed in parallel using the GPU. In particular, in the case of (5), multiplication between array elements can be performed at a high speed using a plurality of GPUs.
The searching
For example, when performing the calculation process of (3) to (7), the correlation coefficient between the protein a and the protein b located in various positions is calculated, And the position of the protein b for the case of having.
In this case, the protein b can be rotated by angles to repeat the processes of (3) to (7).
If the total number of revolutions is set, alpha = 0 to 360 degrees, beta = 0 to 180 degrees, and theta = 0 to 360 degrees, for example, if it is divided by 20 degrees, 18 * 9 * 18 = 2916 Rotation is required.
The
For example, the first GPU of the two GPUs calculates half of the total number of rotations (for (i = 0; i <number of rotations / number of GPUs; i ++) , The second GPU can perform the other half (which can be handled by an operation of for (i = number of rotations / number of
When the number of GPUs is extended and n GPUs are mounted, the calculation procedures of (3) to (7) described above can be shared by the n GPUs and processed in parallel.
The searching
However, the initial value designation is only one embodiment, and it may be specified by various types of numbers rather than specific values.
5 is a flowchart illustrating a method of predicting a protein binding relationship according to an embodiment.
In
The
In
In
The
In this case, while the position of one of the first structure and the second structure is fixed, the
The searching
The searching
6 is a flowchart showing a method of predicting a protein binding relationship according to another embodiment.
In
The
Here, the at least one item may be at least one of an atomic name, an atomic number, an atomic coordinate, and a radius of the first protein and the second protein.
In
In this case, while the position of one of the first structure and the second structure is fixed, the
The
The detailed description and various embodiments of each step are as described above with reference to Figs.
The apparatus described above may be implemented as a hardware component, a software component, and / or a combination of hardware components and software components. For example, the apparatus and components described in the embodiments may be implemented within a computer system, such as, for example, a processor, a controller, an arithmetic logic unit (ALU), a digital signal processor, a microcomputer, a field programmable array (FPA) A programmable logic unit (PLU), a microprocessor, or any other device capable of executing and responding to instructions. The processing device may execute an operating system (OS) and one or more software applications running on the operating system. The processing device may also access, store, manipulate, process, and generate data in response to execution of the software. For ease of understanding, the processing apparatus may be described as being used singly, but those skilled in the art will recognize that the processing apparatus may have a plurality of processing elements and / As shown in FIG. For example, the processing unit may comprise a plurality of processors or one processor and one controller. Other processing configurations are also possible, such as a parallel processor.
The software may include a computer program, code, instructions, or a combination of one or more of the foregoing, and may be configured to configure the processing device to operate as desired or to process it collectively or collectively Device can be commanded. The software and / or data may be in the form of any type of machine, component, physical device, virtual equipment, computer storage media, or device , Or may be permanently or temporarily embodied in a transmitted signal wave. The software may be distributed over a networked computer system and stored or executed in a distributed manner. The software and data may be stored on one or more computer readable recording media.
The method according to an embodiment may be implemented in the form of a program command that can be executed through various computer means and recorded in a computer-readable medium. The computer-readable medium may include program instructions, data files, data structures, and the like, alone or in combination. The program instructions to be recorded on the medium may be those specially designed and configured for the embodiments or may be available to those skilled in the art of computer software. Examples of computer-readable media include magnetic media such as hard disks, floppy disks and magnetic tape; optical media such as CD-ROMs and DVDs; magnetic media such as floppy disks; Magneto-optical media, and hardware devices specifically configured to store and execute program instructions such as ROM, RAM, flash memory, and the like. Examples of program instructions include machine language code such as those produced by a compiler, as well as high-level language code that can be executed by a computer using an interpreter or the like. The hardware devices described above may be configured to operate as one or more software modules to perform the operations of the embodiments, and vice versa.
While the present invention has been particularly shown and described with reference to exemplary embodiments thereof, it is to be understood that the invention is not limited to the disclosed exemplary embodiments. For example, it is to be understood that the techniques described may be performed in a different order than the described methods, and / or that components of the described systems, structures, devices, circuits, Lt; / RTI > or equivalents, even if it is replaced or replaced.
Therefore, other implementations, other embodiments, and equivalents to the claims are also within the scope of the following claims.
Claims (1)
A search unit searching for a relationship of the first structure and the second structure;
Wherein the protein binding relationship predictor comprises:
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020130013183A KR20140100190A (en) | 2013-02-06 | 2013-02-06 | Apparatus and method for prediction of protein binding relationships |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020130013183A KR20140100190A (en) | 2013-02-06 | 2013-02-06 | Apparatus and method for prediction of protein binding relationships |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20140100190A true KR20140100190A (en) | 2014-08-14 |
Family
ID=51746154
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020130013183A KR20140100190A (en) | 2013-02-06 | 2013-02-06 | Apparatus and method for prediction of protein binding relationships |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR20140100190A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111816257A (en) * | 2019-04-11 | 2020-10-23 | 富士通株式会社 | Combination structure search device, combination structure search method, and medium |
KR20220010327A (en) * | 2020-07-17 | 2022-01-25 | 주식회사 아론티어 | Protein-ligand binding affinity prediction using ensemble of 3d convolutional neural network and system therefor |
-
2013
- 2013-02-06 KR KR1020130013183A patent/KR20140100190A/en not_active Application Discontinuation
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111816257A (en) * | 2019-04-11 | 2020-10-23 | 富士通株式会社 | Combination structure search device, combination structure search method, and medium |
KR20220010327A (en) * | 2020-07-17 | 2022-01-25 | 주식회사 아론티어 | Protein-ligand binding affinity prediction using ensemble of 3d convolutional neural network and system therefor |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Nurvitadhi et al. | GraphGen: An FPGA framework for vertex-centric graph computation | |
US9916414B2 (en) | Apparatus and method for generating test cases for processor verification, and verification device | |
JP2020522825A (en) | Reconfigurable parallel processing | |
KR101702996B1 (en) | Structured grids and graph traversal for image processing | |
JP6039231B2 (en) | Design of 3D modeling assembly of objects in 3D scene | |
Humphries et al. | 3D FFTs on a Single FPGA | |
JP6372119B2 (en) | Multi-component computer fluid dynamics simulation | |
Niu et al. | Exploiting run-time reconfiguration in stencil computation | |
US20160147571A1 (en) | Method for optimizing the parallel processing of data on a hardware platform | |
KR20140100190A (en) | Apparatus and method for prediction of protein binding relationships | |
US20150149459A1 (en) | Tag Based System For Leveraging Design Data | |
RU2722691C2 (en) | System and method of identifying clones | |
KR20120041581A (en) | Reconfigurable processor and method for processing nested loop | |
US20160232275A1 (en) | Method and apparatus for logical design connectivity-based automatic macro placement | |
EP3134837A1 (en) | Duplicate pattern of assembly components in cad models | |
US11960868B2 (en) | Branch objects for dependent optimization problems | |
Sen et al. | Speeding up cycle based logic simulation using graphics processing units | |
US20190220551A1 (en) | Automated design of a piping system | |
CN116185378A (en) | Optimization method of calculation graph, data processing method and related products | |
KR20140147761A (en) | Designing a folded sheet object | |
US10296315B2 (en) | Multiple-thread processing methods and apparatuses | |
US9436796B2 (en) | Method and apparatus for determining common node logical connectivity | |
Nery et al. | A framework for automatic custom instruction identification on multi-issue ASIPs | |
Woulfe et al. | A hybrid fixed-function and microprocessor solution for high-throughput broad-phase collision detection | |
WO2019103775A1 (en) | Method and apparatus for automated suggestion of additional sensors or inputs from equipment or systems |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WITN | Withdrawal due to no request for examination |