US11594299B2 - Method for searching for modification site of peptide molecule and information processing apparatus - Google Patents

Method for searching for modification site of peptide molecule and information processing apparatus Download PDF

Info

Publication number
US11594299B2
US11594299B2 US17/001,020 US202017001020A US11594299B2 US 11594299 B2 US11594299 B2 US 11594299B2 US 202017001020 A US202017001020 A US 202017001020A US 11594299 B2 US11594299 B2 US 11594299B2
Authority
US
United States
Prior art keywords
steric structure
steric
peptide molecule
data
molecule
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active, expires
Application number
US17/001,020
Other versions
US20210118523A1 (en
Inventor
Yoshiaki Tanida
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujitsu Ltd filed Critical Fujitsu Ltd
Assigned to FUJITSU LIMITED reassignment FUJITSU LIMITED ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: TANIDA, YOSHIAKI
Publication of US20210118523A1 publication Critical patent/US20210118523A1/en
Application granted granted Critical
Publication of US11594299B2 publication Critical patent/US11594299B2/en
Active legal-status Critical Current
Adjusted expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • G16B20/30Detection of binding sites or motifs
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B15/00ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
    • G16B15/20Protein or domain folding
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/20Design optimisation, verification or simulation
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B15/00ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
    • G16B15/30Drug targeting using structural data; Docking or binding prediction
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B45/00ICT specially adapted for bioinformatics-related data visualisation, e.g. displaying of maps or networks
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • G16B50/30Data warehousing; Computing architectures
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2111/00Details relating to CAD techniques
    • G06F2111/10Numerical modelling

Definitions

  • the embodiment discussed herein is related to a method and an apparatus for searching a modification site of a peptide molecule.
  • a binding free energy of a target molecule for example, a protein
  • a drug candidate molecule to a stable binding structure (also referred to as a complex structure) in a solvent is predicted by using a computer experiment.
  • a drug candidate molecule is a low molecular compound
  • various methods for predicting the binding free energy have been studied.
  • a method for searching for a modification site of a peptide molecule includes: calculating, by a computer, a second steric structure of the peptide molecule by using data of a first steric structure of the peptide molecule, the first steric structure being a steric structure of the peptide molecule in a complex structure of a target molecule and the peptide molecule, the second steric structure being a stable steric structure of the peptide molecule in a state where a steric configuration of a main chain of the peptide molecule in the first steric structure is fixe; and comparing data of the second steric structure with the data of the first steric structure in order to search for a side chain having a difference in steric configuration between the two steric structures.
  • FIG. 1 is a diagram illustrating a configuration of an optimization apparatus (calculation unit) used in an annealing method
  • FIG. 2 is a circuit level block diagram of a transition control unit
  • FIG. 3 is a diagram illustrating an operation flow of the transition control unit
  • FIG. 4 is a flowchart of an example of a disclosed technique
  • FIG. 5 A is a schematic view of an example of a complex structure (1WCA);
  • FIG. 5 B is a diagram in which two peptide molecules are overlapped
  • FIG. 6 is a configuration example of a disclosed apparatus for searching for a modification site of a peptide molecule
  • FIG. 7 is another configuration example of the disclosed apparatus for searching for the modification site of the peptide molecule.
  • FIG. 8 is another configuration example of the disclosed apparatus for searching for the modification site of the peptide molecule.
  • the drug candidate molecule is a peptide molecule
  • sufficient sampling is difficult because both the target molecule and the drug candidate molecule have large structural fluctuations. Therefore, there has been no suitable methodology for predicting the binding free energy of a target molecule (for example, a protein) and a drug candidate molecule, which is a peptide molecule, to a binding structure in a solvent.
  • the disclosed method is a method of searching for a modification site of a peptide molecule using a computer (information processing apparatus).
  • the disclosed method includes a calculation process and a comparison process, and further includes other processes when appropriate.
  • a stable steric structure of the peptide molecule in a state where a steric configuration of a main chain of the peptide molecule in the steric structure is fixed, is calculated by using data of the steric structure of the peptide molecule in a complex structure of a target molecule and the peptide molecule.
  • the calculated data of the stable steric structure of the peptide molecule is compared with the data of the steric structure of the peptide molecule in the complex structure to search for a side chain having a difference in steric configuration between two peptide molecules.
  • a drug candidate molecule In the development of new drugs using peptide molecules, it is important to know what the structure a drug candidate molecule has in a complex structure as quickly as possible. In a system having high binding activity, a molecule often takes a natural structure (stable steric structure) in a complex structure composed of the molecule and a target molecule.
  • the binding activity is considered to be large.
  • Amino acid residues configuring the peptide molecule are composed of a structure called a main chain and a structure called a side chain as described below.
  • R 1 , R 2 , R 3 that are bonded to an alpha-carbon represent side chains. A portion other than the side chains is the main chain.
  • the peptide molecule and the target molecule form a complex structure, they are relatively stabilized by having a structure in which they may obtain benefits each other energetically.
  • the steric configuration of the main chain distorted from its natural form due to formation of a complex structure is expected to affect the steric configuration of the side chains. Therefore, in a case where the stable steric configuration of the side chain under the distorted steric configuration of the main chain is different from the steric configuration of the side chain in the complex structure, it is considered that the side chain largely contributes to the structural stabilization of the complex structure.
  • Such side chains are hereinafter referred to as hotspots.
  • a structure of the hotspot is modified so that the structure of the peptide molecule in the complex structure approaches a more natural structure (stable steric structure), whereby the complex structure of the modified peptide molecule and the target molecule is more stabilized.
  • the stable steric structure of the peptide molecule in the state of fixing the steric configuration of the main chain of the peptide molecule in the steric structure is calculated by using the steric structure data of the peptide molecule in the complex structure of the target molecule and the peptide molecule.
  • the calculated data of the stable steric structure of the peptide molecule is compared with the data of the steric structure of the peptide molecule in the complex structure to search for a side chain having a difference in steric configuration between two peptide molecules.
  • the electronic state calculation may not be performed when searching for a site where the contribution of binding is large. Therefore, in the disclosed technique, it is possible to efficiently search for a modification site of a peptide molecule.
  • the stable steric structure of the peptide molecule in a state where the steric configuration of the main chain of the peptide molecule in the steric structure is fixed, is calculated by using the data of the steric structure of the peptide molecule in the complex structure of the target molecule and the peptide molecule.
  • the target molecule is not particularly limited and may be appropriately selected depending on the intended purpose, and examples thereof include proteins, ribonucleic acids (RNA), deoxyribonucleic acids (DNA), and the like.
  • a peptide molecule is a molecule in which amino acids as monomers are linked in a short chain by a peptide bond.
  • Peptide molecules may be cyclic molecules or non-cyclic molecules.
  • the number of amino acids constituting the peptide molecule is not particularly limited and may be appropriately selected depending on the intended purpose, for example, the number may be equal to or more than 5 and equal to or less than 100, equal to or more than 5 and equal to or less than 50, equal to or more than 10 and equal to or less than 50, or equal to or more than 10 and equal to or less than 30.
  • the amino acid may be a naturally occurring amino acid or a non-naturally occurring amino acid as long as it is an organic compound having a functional group of both an amino group and a carboxyl group.
  • amino acid examples include:
  • a molecular weight of the amino acid is not particularly limited and may be appropriately selected depending on the intended purpose, provided that it is, for example, equal to or more than 89, which is a molecular weight of alanine, and, for example, the molecular weight of the amino acid may be equal to or more than 89 and equal to or less than 500, or equal to or more than 89 and equal to or less than 300.
  • the complex structure of the target molecule and the peptide molecule is a stable structure.
  • the complex structure may be a known complex structure in which the stable structure is a known or an unknown complex structure in which the stable structure is not known.
  • Examples of the known complex structures include complex structures recorded in known databases.
  • known databases for example, experimental data of a complex structure of a target molecule (receptor) and a molecule (ligand) obtained from experiments such as X-ray crystallography, nuclear magnetic resonance (NMR) analysis, and analysis using a Cryo-electron microscope is recorded.
  • PDB Protein Data Bank
  • the unknown complex structure may be obtained using, for example, a molecular mechanics method, a molecular dynamics method, or the like. Among them, the molecular dynamics method is preferable.
  • the molecular dynamics (MD) method means a method of simulating the motion of particles (mass points) such as atoms by numerically solving Newton's equation of motion.
  • the molecular dynamics calculation (simulation) by the molecular dynamics method may be performed using, for example, a molecular dynamics calculating program.
  • a molecular dynamics calculating program examples include AMBER, CHARMm, GROMACS, GROMOS, NAMD, myPresto, and the like.
  • a binding structure may be relaxed to a thermal equilibrium state or a state close to the thermal equilibrium state, for example, by performing calculation under a certain temperature and pressure condition (NPT ensemble).
  • NPT ensemble temperature and pressure condition
  • the steric structure data includes, for example, atomic information data, coordinate information data, and binding information data, and constructs a steric structure in a coordinate space.
  • a data format is not particularly limited and may be appropriately selected depending on the intended purpose, and for example, the data format may be text data, a structure data file (SDF) format, or an MOL file format.
  • SDF structure data file
  • MOL file format an MOL file format
  • the atomic information data is data on the type of atom.
  • the coordinate information data is data on coordinates (positions) of atoms.
  • the binding information data is data on a bond between atoms.
  • the stable steric structure of the peptide molecule in a state where the steric configuration of the main chain of the peptide molecule in the steric structure of the peptide molecule in the complex structure is fixed, is calculated.
  • a relative permittivity around the peptide molecule is set in consideration of a relative permittivity around the peptide molecule in the complex structure.
  • the consideration means, for example, matching or approximating the relative permittivity around the peptide molecule to the relative permittivity around the peptide molecule in the complex structure.
  • the relative permittivity around the peptide molecule is set to, for example, four.
  • the method for calculating the stable steric structure of the peptide molecule, in a state in which the steric configuration of the main chain is fixed is not particularly limited and may be appropriately selected depending on the intended purpose, however, it is preferable to calculate a minimum energy of an Ising model by performing a ground state search using an annealing method on the Ising model converted based on restriction conditions of the side chain of the peptide molecule.
  • the search for a stable steric structure of the peptide molecule may be reduced to a combinatorial optimization problem of the steric configuration of the side chain. Therefore, the minimum energy of the Ising model may be calculated.
  • the calculation of the minimum energy of the Ising model is a method that may be performed in a very short time among methods of exhaustively searching for a stable steric structure of a peptide molecule in a state where the steric configuration of the main chain is fixed. Therefore, this greatly contributes to more efficiently performing the disclosed technique.
  • the energy equation of the Ising model may be expressed, for example, by the following equation.
  • H A ⁇ [ - ⁇ i ⁇ ( ⁇ j ⁇ W ij + b i ) ] ⁇ x i + B ⁇ ⁇ res ⁇ ⁇ rot ⁇ ( x k - 1 ) 2
  • W ij represents a side chain-side chain interaction
  • b i represents a main chain-side chain interaction at the amino acid residue
  • x i represents bits of a rotor state of the side chain.
  • “res” represents an amino acid residue.
  • “rot” represents the rotation of the side chain.
  • x k represents bits of the rotor state of the k-th amino acid residue.
  • A represents a positive number.
  • B represents a positive number.
  • the minimum energy of the Ising model may be calculated using an annealing machine.
  • the annealing machine is not particularly limited and may be appropriately selected depending on the intended purpose as long as it is a computer that employs an annealing method of performing a ground state search for an energy function represented by an Ising model, and the annealing machine may be a quantum annealing machine, a semiconductor annealing machine using semiconductor technology, or simulated annealing executed by software using a central processing unit (CPU) or a graphics processing unit (GPU).
  • CPU central processing unit
  • GPU graphics processing unit
  • the annealing method (simulated annealing method, SA method) is a kind of Monte Carlo method, and is a method of probabilistically obtaining a solution by using a random number value.
  • SA method simulated annealing method
  • an object of minimizing a value of an evaluation function to be optimized will be described as an example, and the value of the evaluation function will be referred to as energy.
  • energy In a case of maximization, a sign of the evaluation function may be changed.
  • a state transition from the current state (a combination of values of variables) to a state close to the current state (for example, a state in which only one of the variables has been changed) is examined.
  • a change in energy associated with the state transition is calculated, and it is stochastically determined whether to adopt the state transition and change the current state or to maintain the current state without adopting the state transition, according to the calculated value.
  • an acceptance probability p of the state transition is determined by any of the following functions f( ).
  • T is a parameter called a temperature value and is changed as follows.
  • T T 0 ⁇ log ⁇ ( c ) log ⁇ ( t + c ) ( Formula ⁇ ⁇ 2 )
  • T 0 represents an initial temperature value and it is desirable that a sufficiently large value be set in accordance with the problem.
  • an occupation probability of an individual state is in accordance with a Boltzmann distribution with respect to a thermal equilibrium state in thermodynamics. Since the occupation probability of a lower-energy state increases when the temperature is gradually lowered from a high initial temperature, a low-energy state is supposed to be obtained when the temperature has sufficiently decreased.
  • This method is referred to as an annealing method (or pseudo-annealing method) because this behavior resembles state change when annealing a material.
  • the stochastic occurrence of a state transition that results in a rise in the energy corresponds to thermal excitation in physics.
  • FIG. 1 illustrates a conceptual configuration of an optimization apparatus (calculation unit) that performs the annealing method. Although cases where a plurality of candidates for the state transition is generated will be also described in the following description, the transition candidates are generated one by one in the normal basic annealing method.
  • An optimization apparatus 100 includes a state holding unit 111 configured to hold a current state S (values of a plurality of state variables).
  • the optimization apparatus 100 also includes an energy calculation unit 112 configured to calculate energy change values ⁇ Ei ⁇ of state transitions in a case where the state transition occurs from the current state S as a result of change in any of the values of the plurality of state variables.
  • the optimization apparatus 100 includes a temperature control unit 113 configured to control a temperature value T and a transition control unit 114 configured to control state changes.
  • the transition control unit 114 stochastically determines whether or not any one of a plurality of state transitions is accepted depending on a relative relationship between the energy change values ⁇ Ei ⁇ and thermal excitation energy, based on the temperature value T, the energy change values ⁇ Mi ⁇ , and the random number value.
  • the transition control unit 114 When the transition control unit 114 is subdivided, the transition control unit 114 includes a candidate generation unit 114 a for generating a candidate for a state transition, and an acceptance determination unit 114 b for stochastically determining for each candidate whether or not the state transition is accepted based on the energy change values ⁇ Ei ⁇ of the candidates and the temperature value T.
  • the transition control unit 114 further includes a transition determination unit 114 c for determining a candidate to be adopted from the accepted candidates, and a random number generation unit 114 d for generating a probability variable.
  • the candidate generation unit 114 a generates one or a plurality of candidates (candidate numbers ⁇ Ni ⁇ ) for the state transition from the current state S held by the state holding unit 111 to the next state.
  • the energy calculation unit 112 calculates energy change values ⁇ Ei ⁇ for each of the state transitions for the candidates, by using the current state S and the candidates for the state transition.
  • the acceptance determination unit 114 b uses the temperature value T generated in the temperature control unit 113 and a probability variable (random number value) generated by the random number generation unit 114 d, and accepts the state transition with the acceptance probability expressed by the above Formulas in (1) according to the energy change values ⁇ Ei ⁇ of the respective state transitions.
  • the acceptance determination unit 114 b outputs the acceptances ⁇ fi ⁇ of the respective state transitions. In a case where a plurality of state transitions is accepted, the transition determination unit 114 c randomly selects one thereof by using a random number value. The transition determination unit 114 c then outputs a transition number N of the selected state transition, and a transition acceptance f. In a case where there is an accepted state transition, the values of the state variable stored in the state holding unit 111 is updated according to the adopted state transition.
  • the above-described iteration is repeated while causing the temperature control unit 113 to lower the temperature value, and the operation ends when an end determination condition, for example, a certain number of iterations is reached, or the energy becomes lower than a predetermined value, is satisfied.
  • the solution outputted by the optimization apparatus 110 is the state corresponding to the end of the operation.
  • FIG. 2 is a circuit level block diagram of a configuration example of a transition control unit in a normal annealing method for generating candidates one by one, especially, an arithmetic portion used for an acceptance determination unit.
  • the transition control unit 114 includes a random number generation circuit 114 b 1 , a selector 114 b 2 , a noise table 114 b 3 , a multiplier 114 b 4 , and a comparator 114 b 5 .
  • the selector 114 b 2 selects and outputs an energy change value corresponding to the transition number N, which is a random number value generated by the random number generation circuit 114 b 1 .
  • noise table 114 b 3 Functions of the noise table 114 b 3 will be described later.
  • a memory such as a random-access memory (RAM) or a flash memory may be used.
  • the multiplier 114 b 4 outputs a product obtained by multiplying a value outputted by the noise table 114 b 3 by the temperature value T (corresponding to the thermal excitation energy described above).
  • the comparator 114 b 5 outputs a comparison result in which a multiplication result outputted by the multiplier 114 b 4 is compared with an energy change value ⁇ E selected by the selector 114 b 2 , as the transition acceptance f.
  • the transition control unit 114 illustrated in FIG. 2 basically implements the above-described function as is, but the mechanism for permitting the state transition with the acceptance probability expressed by Formulas in (1) has not been described so far, and therefore, this will be supplemented.
  • a circuit that outputs 1 at the acceptance probability p and outputs 0 at the probability (1-p) may be realized by a comparator that has two inputs A and B, and outputs 1 when A>B, and outputs 0 when A ⁇ B by inputting the acceptance probability p to the input A and a uniform random number having a value in a section [0, 1) to the input B.
  • a comparator that has two inputs A and B, and outputs 1 when A>B, and outputs 0 when A ⁇ B by inputting the acceptance probability p to the input A and a uniform random number having a value in a section [0, 1) to the input B.
  • the situation may be accepted, however, the same function may be realized by the following modification. Even when the same monotonically increasing function is applied to two numbers, the two numbers maintain the same magnitude relationship. Therefore, even when the same monotonically increasing function is applied to the two inputs of the comparator, the same output is obtained.
  • an inverse function f ⁇ 1 off is adopted as this monotonically increasing function, it is seen that a circuit that outputs 1 when ⁇ E/T is greater than f ⁇ 1 (u) may be adopted. Since the temperature value T is positive, it is seen that a circuit that outputs 1 when ⁇ E is greater than Tf ⁇ 1 (u) may be adopted.
  • the noise table 114 b 3 in FIG. 2 is a conversion table for realizing the inverse function f ⁇ 1 (u), and is a table for outputting a value of the following function with respect to the input obtained by discretizing the section [0, 1).
  • transition control unit 114 includes a latch that holds a determination result and the like, a state machine that generates the corresponding timing, and the like, these components are not illustrated in FIG. 2 for simple illustration.
  • FIG. 3 illustrates the flow of operation of the transition control unit 114 .
  • the flow of operation includes a step of selecting one state transition as a candidate (S 0001 ), a step of determining whether a state transition is accepted or not by comparing the energy change value with respect to the state transition with a product of a temperature value and a random number value (S 0002 ), and a step in which the state transition is adopted when the state transition is accepted, and the state transition is not adopted when the state transition is not accepted (S 0003 ).
  • the calculated data of the stable steric structure of the peptide molecule is compared with the data of the steric structure of the peptide molecule in the complex structure to search for a side chain having a difference in steric configuration between two peptide molecules.
  • the comparison is preferably performed by visualizing the stable steric structure of the peptide molecule and the steric structure of the peptide molecule in the complex structure. In doing so, the side chain having a difference in steric configuration between two peptide molecules may be easily found.
  • the method for visualizing the steric structure of the peptide molecule is not particularly limited and may be appropriately selected depending on the intended purpose, and may be performed using known molecular graphic software.
  • the molecular graphic software include, for example, PyMOL, and the like.
  • Visualization may be performed, for example, by incorporating the steric structure data of the peptide molecule into molecular graphic software to construct a steric structure, and displaying the created steric structure on a display device.
  • the comparison is preferably performed by superposing the main chain of the visualized stable steric structure of the peptide molecule with the main chain of the visualized steric structure of the peptide molecule. In this way, the side chain having a difference in steric configuration between the two peptide molecules may be found more easily.
  • the superposition of the main chains may be carried out, for example, by overlapping a C ⁇ atom of each amino acid residue in the peptide molecule and overlapping a C ⁇ atom of the side chain.
  • the superposition of the main chains may be carried out using, for example, molecular graphic software.
  • molecular graphic software include, for example, PyMOL, and the like.
  • a side chain (hotspot) that largely contributes to the structural stabilization of the complex structure is specified from the side chain having a difference in steric configuration between two peptide molecules.
  • the identification of the hotspot may be appropriately determined from the side chain having a difference in steric configuration between two peptide molecules. For example, when the calculated stable steric structure of the peptide molecule is superimposed on the steric structure of the peptide molecule of the complex structure so that the main chains of the two peptide molecules overlap, in a case where the side chain of the calculated peptide molecule overlaps a binding site of the target molecule of the complex structure (the site where the peptide molecule binds to the target molecule), the side chain is specified as a hotspot.
  • the side chain in the peptide molecule of the complex structure corresponding to the side chain overlapping the binding site is likely to interfere with the structural stabilization of the complex structure. Specifically, the side chain is likely to be a hotspot.
  • FIG. 4 is an example of a flow chart of the disclosed technique.
  • the stable steric structure of the peptide molecule in a state where the steric configuration of the main chain of the peptide molecule in the steric structure is fixed, is calculated using data of the steric structure of the peptide molecule in the complex structure of the target molecule and the peptide molecule (S 1 ).
  • the data of the steric structure of the peptide molecule in the complex structure may be acquired from data of the complex structure recorded in a known database.
  • known databases include Protein Data Bank (PDB) and the like.
  • the calculation of the stable steric structure is preferably performed, for example, by performing a ground state search using an annealing method on the Ising model converted based on the restriction conditions of the side chain of the peptide molecule to calculate the minimum energy of the Ising model.
  • the calculated data of the stable steric structure of the peptide molecule is compared with the data of the steric structure of the peptide molecule in the complex structure to search for a side chain having a difference in steric configuration between the two peptide molecules (S 2 ).
  • the comparison is performed by visualizing the stable steric structure of the peptide molecule and the steric structure of the peptide molecule in the complex structure using, for example, molecular graphic software.
  • the side chain of the peptide molecule to be modified for stabilizing the complex structure may be efficiently found.
  • FIG. 5 A is the complex structure of 1WCA in PDB.
  • the protein is CYCLOPHILIN A (CypA) and the peptide molecule is CYCLOSPORIN A (CsA).
  • the stable steric structure of the peptide molecule was calculated in a state where the steric configuration of the main chain of the peptide molecule was fixed.
  • FIG. 5 B the main chains of the two peptide molecules to be compared overlap, and some side chains have a difference in steric configuration.
  • the side chains of 1-Leu, 4-Ver, and 6-Leu that are indicated by circles have a large difference in steric configuration and are likely to be hotspots.
  • the calculated structure of the side chain of the peptide molecule is indicated by a relatively thin line.
  • the disclosed program is a program for causing a computer to execute the disclosed method for searching for a modification site of a peptide molecule
  • the program may be created using various known program languages according to a configuration of a computer system to be used, and a type, a version, and the like of an operating system.
  • the program may be recorded using a recording medium such as an internal hard disk or an external hard disk, or may be recorded using a recording medium such as a compact disc read-only memory (CD-ROM), a digital versatile disk read-only memory (DVD-ROM), a magneto-optical disk (MO disk), or a Universal Serial Bus (USB) memory [USB flash drive].
  • a recording medium such as a CD-ROM, a DVD-ROM, an MO disk, or a USB memory
  • the program may be used directly or by being installed on a hard disk through a recording medium reading device included in the computer system when appropriate.
  • the program may be also recorded in an external storage area (another computer or the like) accessible from the computer system through the information communication network, and the program may be used directly from the external storage area through the information communication network or by being installed on the hard disk when appropriate.
  • the program may be recorded using a plurality of recording media while being divided for each arbitrary process.
  • the disclosed recording medium records the disclosed program.
  • the disclosed recording medium is computer-readable.
  • the disclosed recording medium may be transitory or non-transitory.
  • the disclosed recording medium is, for example, a recording medium having recorded thereon a program for causing a computer to execute the disclosed method for searching for a modification site of a peptide molecule.
  • the recording medium is not particularly limited, and may be appropriately selected according to the purpose, and examples thereof include, for example, an internal hard disk, an external hard disk, a CD-ROM, a DVD-ROM, an MO disk, a USB memory, and the like.
  • the recording medium may be a plurality of recording media in which the program is divided and recorded for each arbitrary process.
  • the disclosed apparatus for searching for a modification site of a peptide molecule includes at least a calculation unit and a comparison unit, and further includes other units when appropriate.
  • the calculation unit calculates the stable steric structure of the peptide molecule in a state in which the steric configuration of the main chain of the peptide molecule in the steric structure is fixed.
  • the comparison unit compares the calculated data of the stable steric structure of the peptide molecule with the data of the steric structure of the peptide molecule in the complex structure to search for a side chain having a difference in steric configuration between two peptide molecules.
  • An aspect of the calculation unit is the same as the aspect of the calculation process in the disclosed method for searching for a modification site of a peptide molecule.
  • An aspect of the comparison unit is the same as the aspect of the comparison process in the disclosed method for searching for a modification site of a peptide molecule.
  • the disclosed apparatus for searching for a modification site of a peptide molecule includes, for example, a memory, a processor, and other units when appropriate.
  • the memory stores, for example, data of a complex structure of a target molecule and a peptide molecule.
  • the memory stores, for example, data of the steric structure of the peptide molecule in the complex structure.
  • the memory stores, for example, data of the calculated stable steric structure of the peptide molecule.
  • the processor is coupled to the memory.
  • the processor is configured to use the data of the steric structure of the peptide molecule in the complex structure of the target molecule and the peptide molecule to calculate a stable steric structure of the peptide molecule in a state where the steric configuration of the main chain of the peptide molecule in the steric structure is fixed.
  • the processor is configured to compare the calculated data of the stable steric structure of the peptide molecule with the data of the steric structure of the peptide molecule in the complex structure to search for a side chain having a difference in steric configuration between two peptide molecules.
  • the processor is, for example, a CPU, a GPU, or a combination thereof.
  • FIG. 6 illustrates a configuration example of a disclosed apparatus for searching for a modification site of a peptide molecule.
  • the apparatus 10 is configured, for example, by a CPU 11 , a memory 12 , a storage unit 13 , a display unit 14 , an input unit 15 , an output unit 16 , an I/O interface unit 17 , and the like that are coupled via a system bus 18 .
  • the CPU 11 performs operations (four arithmetic operations, comparison operations, and the like), operation control of hardware and software, and the like.
  • the memory 12 is a memory such as a RAM, a read-only memory (ROM), or the like.
  • the RAM stores an operating system (OS), an application program, and the like read from the ROM and the storage unit 13 , and functions as a main memory and a work area of the CPU 11 .
  • OS operating system
  • application program application program
  • the storage unit 13 is a device for storing various programs and data, and is a hard disk, for example.
  • the storage unit 13 stores a program to be executed by the CPU 11 , data to be used for execution of the program, the OS, and the like.
  • the program is stored in the storage unit 13 , loaded into the RAM (main memory) of the memory 12 , and executed by the CPU 11 .
  • the display unit 14 is a display device, and is, for example, a display device such as a CRT monitor, a liquid crystal panel, or the like.
  • the input unit 15 is an input device for various data, and is, for example, a keyboard, a pointing device (for example, a mouse, or the like), or the like.
  • the output unit 16 is an output device for various data, and is, for example, a printer, or the like.
  • the I/O interface unit 17 is an interface for coupling various external devices.
  • the I/O interface unit 17 enables input and output of data of a CD-ROM, a DVD-ROM, an MO disk, a USB memory, or the like.
  • FIG. 7 illustrates another example of the configuration of the disclosed apparatus for searching for a modification site of a peptide molecule.
  • the configuration example of FIG. 7 is a cloud-type configuration example, and the CPU 11 is independent of the storage unit 13 and the like.
  • a computer 30 that stores the storage unit 13 and the like, and a computer 40 that stores the CPU 11 are coupled via network interface units 19 and 20 .
  • the network interface units 19 and 20 are hardware configured to perform communication by using the Internet.
  • FIG. 8 illustrates another example of the configuration of the disclosed apparatus for searching for a modification site of a peptide molecule.
  • the configuration example of FIG. 8 is a cloud-type configuration example, and the storage unit 13 is independent of the CPU 11 and the like.
  • the computer 30 that stores the CPU 11 and the like, and the computer 40 that stores the storage unit 13 are coupled via the network interface units 19 and 20 .
  • the above-described problems in the related art may be solved, and a modification site of a peptide molecule may be efficiently searched for.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biophysics (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Chemical & Material Sciences (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Medicinal Chemistry (AREA)
  • Computer Hardware Design (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Geometry (AREA)
  • Evolutionary Computation (AREA)
  • Bioethics (AREA)
  • Analytical Chemistry (AREA)
  • Databases & Information Systems (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Genetics & Genomics (AREA)
  • Molecular Biology (AREA)
  • Peptides Or Proteins (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A method for searching for a modification site of a peptide molecule includes: calculating, by a computer, a second steric structure of the peptide molecule by using data of a first steric structure of the peptide molecule, the first steric structure being a steric structure of the peptide molecule in a complex structure of a target molecule and the peptide molecule, the second steric structure being a stable steric structure of the peptide molecule in a state where a steric configuration of a main chain of the peptide molecule in the first steric structure is fixe; and comparing data of the second steric structure with the data of the first steric structure in order to search for a side chain having a difference in steric configuration between the two steric structures.

Description

CROSS-REFERENCE TO RELATED APPLICATION
This application is based upon and claims the benefit of priority of the prior Japanese Patent Application No. 2019-191785, filed on Oct. 21, 2019, the entire contents of which are incorporated herein by reference.
FIELD
The embodiment discussed herein is related to a method and an apparatus for searching a modification site of a peptide molecule.
BACKGROUND
A binding free energy of a target molecule (for example, a protein) and a drug candidate molecule to a stable binding structure (also referred to as a complex structure) in a solvent is predicted by using a computer experiment. In a case where a drug candidate molecule is a low molecular compound, various methods for predicting the binding free energy have been studied.
Related techniques are disclosed in, for example, Japanese Laid-open Patent Publication No. 2006-209764 and Ming-Hong Hao, Omar Haq, and Ingo Muegge, “Torsion Angle Preference and Energetics of Small-Molecule Ligands Bound to Proteins”, J. Chem. Inf. Model. 2007, 47, 2242-2252.
SUMMARY
According to an aspect of the embodiment, a method for searching for a modification site of a peptide molecule includes: calculating, by a computer, a second steric structure of the peptide molecule by using data of a first steric structure of the peptide molecule, the first steric structure being a steric structure of the peptide molecule in a complex structure of a target molecule and the peptide molecule, the second steric structure being a stable steric structure of the peptide molecule in a state where a steric configuration of a main chain of the peptide molecule in the first steric structure is fixe; and comparing data of the second steric structure with the data of the first steric structure in order to search for a side chain having a difference in steric configuration between the two steric structures.
The object and advantages of the invention will be realized and attained by means of the elements and combinations particularly pointed out in the claims.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are not restrictive of the invention.
BRIEF DESCRIPTION OF DRAWINGS
FIG. 1 is a diagram illustrating a configuration of an optimization apparatus (calculation unit) used in an annealing method;
FIG. 2 is a circuit level block diagram of a transition control unit;
FIG. 3 is a diagram illustrating an operation flow of the transition control unit;
FIG. 4 is a flowchart of an example of a disclosed technique;
FIG. 5A is a schematic view of an example of a complex structure (1WCA);
FIG. 5B is a diagram in which two peptide molecules are overlapped;
FIG. 6 is a configuration example of a disclosed apparatus for searching for a modification site of a peptide molecule;
FIG. 7 is another configuration example of the disclosed apparatus for searching for the modification site of the peptide molecule; and
FIG. 8 is another configuration example of the disclosed apparatus for searching for the modification site of the peptide molecule.
DESCRIPTION OF EMBODIMENT
However, in a case where the drug candidate molecule is a peptide molecule, sufficient sampling is difficult because both the target molecule and the drug candidate molecule have large structural fluctuations. Therefore, there has been no suitable methodology for predicting the binding free energy of a target molecule (for example, a protein) and a drug candidate molecule, which is a peptide molecule, to a binding structure in a solvent.
Nevertheless, development of new drugs using peptide molecules that have very large binding activity due to the flexibility of the structure as drug candidates has been actively conducted. In an actual development process, in a case where a peptide molecule having a certain degree of binding activity is found by an actual experiment, the peptide molecule is modified to enhance the binding activity. However, these operations are often carried out by the experience and intuition of researchers, and development may be delayed.
(Method for Searching for Modification Site of Peptide Molecule)
The disclosed method is a method of searching for a modification site of a peptide molecule using a computer (information processing apparatus).
The disclosed method includes a calculation process and a comparison process, and further includes other processes when appropriate.
In the calculation process, a stable steric structure of the peptide molecule, in a state where a steric configuration of a main chain of the peptide molecule in the steric structure is fixed, is calculated by using data of the steric structure of the peptide molecule in a complex structure of a target molecule and the peptide molecule.
In the comparison process, the calculated data of the stable steric structure of the peptide molecule is compared with the data of the steric structure of the peptide molecule in the complex structure to search for a side chain having a difference in steric configuration between two peptide molecules.
In a case where a modification site of a peptide molecule is intended to be searched for by existing simulation using a computer, electronic state calculation is performed on a system composed of the peptide molecule, a target molecule, and water molecules present around them, and a site to be modified in the peptide molecule is searched for. However, when this method is applied to a general system of equal to or more than one hundred thousand atoms, a calculation time of several years is used, which is not realistic.
In the development of new drugs using peptide molecules, it is important to know what the structure a drug candidate molecule has in a complex structure as quickly as possible. In a system having high binding activity, a molecule often takes a natural structure (stable steric structure) in a complex structure composed of the molecule and a target molecule.
Therefore, when the structure of the peptide molecule is a natural structure in the complex structure, the binding activity is considered to be large.
Amino acid residues configuring the peptide molecule are composed of a structure called a main chain and a structure called a side chain as described below.
Figure US11594299-20230228-C00001
In the above-described formula, R1, R2, R3 that are bonded to an alpha-carbon represent side chains. A portion other than the side chains is the main chain.
In a case where the peptide molecule and the target molecule form a complex structure, they are relatively stabilized by having a structure in which they may obtain benefits each other energetically. For peptide molecules, the steric configuration of the main chain distorted from its natural form due to formation of a complex structure is expected to affect the steric configuration of the side chains. Therefore, in a case where the stable steric configuration of the side chain under the distorted steric configuration of the main chain is different from the steric configuration of the side chain in the complex structure, it is considered that the side chain largely contributes to the structural stabilization of the complex structure. Such side chains are hereinafter referred to as hotspots.
It is considered that a structure of the hotspot is modified so that the structure of the peptide molecule in the complex structure approaches a more natural structure (stable steric structure), whereby the complex structure of the modified peptide molecule and the target molecule is more stabilized.
Therefore, in the disclosed technique, the following is performed.
The stable steric structure of the peptide molecule in the state of fixing the steric configuration of the main chain of the peptide molecule in the steric structure is calculated by using the steric structure data of the peptide molecule in the complex structure of the target molecule and the peptide molecule.
The calculated data of the stable steric structure of the peptide molecule is compared with the data of the steric structure of the peptide molecule in the complex structure to search for a side chain having a difference in steric configuration between two peptide molecules.
In the disclosed technique, the electronic state calculation may not be performed when searching for a site where the contribution of binding is large. Therefore, in the disclosed technique, it is possible to efficiently search for a modification site of a peptide molecule.
<Calculation Process>
In the calculation process, the stable steric structure of the peptide molecule, in a state where the steric configuration of the main chain of the peptide molecule in the steric structure is fixed, is calculated by using the data of the steric structure of the peptide molecule in the complex structure of the target molecule and the peptide molecule.
<<Target Molecule>>
The target molecule is not particularly limited and may be appropriately selected depending on the intended purpose, and examples thereof include proteins, ribonucleic acids (RNA), deoxyribonucleic acids (DNA), and the like.
<<Peptide Molecule>>
A peptide molecule is a molecule in which amino acids as monomers are linked in a short chain by a peptide bond.
Peptide molecules may be cyclic molecules or non-cyclic molecules.
The number of amino acids constituting the peptide molecule is not particularly limited and may be appropriately selected depending on the intended purpose, for example, the number may be equal to or more than 5 and equal to or less than 100, equal to or more than 5 and equal to or less than 50, equal to or more than 10 and equal to or less than 50, or equal to or more than 10 and equal to or less than 30.
The amino acid may be a naturally occurring amino acid or a non-naturally occurring amino acid as long as it is an organic compound having a functional group of both an amino group and a carboxyl group.
Examples of amino acid include:
Glycine (Gly)
Proline (Pro)
Alanine (Ala)
Arginine (Arg)
Asparagine (Asn)
Aspartic acid (Asp)
Cysteine (Cys)
Glutamine (Gin)
Glutamic acid (Glu)
Histidine (His)
Isoleucine (Ile)
Leucine (Leu)
Lysine (Lys)
Methionine (Met)
Phenylalanine (Phe)
Serine (Ser)
Threonine (Thr)
Tryptophan (Trp)
Tyrosine (Tyr)
Valine (Val)
Ornithine (Orn)
Selenocysteine (Sec)
Pyrrolidine (Pyl)
Norvaline
Norleucine
Citrulline
Creatine
Cystine
Thyroxine
Phosphoserine
A molecular weight of the amino acid is not particularly limited and may be appropriately selected depending on the intended purpose, provided that it is, for example, equal to or more than 89, which is a molecular weight of alanine, and, for example, the molecular weight of the amino acid may be equal to or more than 89 and equal to or less than 500, or equal to or more than 89 and equal to or less than 300.
<<Complex Structure>>
The complex structure of the target molecule and the peptide molecule is a stable structure.
The complex structure may be a known complex structure in which the stable structure is a known or an unknown complex structure in which the stable structure is not known.
Examples of the known complex structures include complex structures recorded in known databases. In the known databases, for example, experimental data of a complex structure of a target molecule (receptor) and a molecule (ligand) obtained from experiments such as X-ray crystallography, nuclear magnetic resonance (NMR) analysis, and analysis using a Cryo-electron microscope is recorded.
Examples of the known databases include Protein Data Bank (PDB) and the like.
The unknown complex structure may be obtained using, for example, a molecular mechanics method, a molecular dynamics method, or the like. Among them, the molecular dynamics method is preferable.
The molecular dynamics (MD) method means a method of simulating the motion of particles (mass points) such as atoms by numerically solving Newton's equation of motion.
The molecular dynamics calculation (simulation) by the molecular dynamics method may be performed using, for example, a molecular dynamics calculating program. Examples of the molecular dynamics calculating program include AMBER, CHARMm, GROMACS, GROMOS, NAMD, myPresto, and the like.
In the molecular dynamics calculation, a binding structure may be relaxed to a thermal equilibrium state or a state close to the thermal equilibrium state, for example, by performing calculation under a certain temperature and pressure condition (NPT ensemble).
The steric structure data includes, for example, atomic information data, coordinate information data, and binding information data, and constructs a steric structure in a coordinate space.
A data format is not particularly limited and may be appropriately selected depending on the intended purpose, and for example, the data format may be text data, a structure data file (SDF) format, or an MOL file format.
The atomic information data is data on the type of atom.
The coordinate information data is data on coordinates (positions) of atoms.
The binding information data is data on a bond between atoms.
In the calculation process, the stable steric structure of the peptide molecule, in a state where the steric configuration of the main chain of the peptide molecule in the steric structure of the peptide molecule in the complex structure is fixed, is calculated.
In the calculation of the stable steric structure of the peptide molecule, for example, a relative permittivity around the peptide molecule is set in consideration of a relative permittivity around the peptide molecule in the complex structure. The consideration here means, for example, matching or approximating the relative permittivity around the peptide molecule to the relative permittivity around the peptide molecule in the complex structure. The relative permittivity around the peptide molecule is set to, for example, four.
The method for calculating the stable steric structure of the peptide molecule, in a state in which the steric configuration of the main chain is fixed, is not particularly limited and may be appropriately selected depending on the intended purpose, however, it is preferable to calculate a minimum energy of an Ising model by performing a ground state search using an annealing method on the Ising model converted based on restriction conditions of the side chain of the peptide molecule.
By fixing the steric configuration of the main chain, the search for a stable steric structure of the peptide molecule may be reduced to a combinatorial optimization problem of the steric configuration of the side chain. Therefore, the minimum energy of the Ising model may be calculated. The calculation of the minimum energy of the Ising model is a method that may be performed in a very short time among methods of exhaustively searching for a stable steric structure of a peptide molecule in a state where the steric configuration of the main chain is fixed. Therefore, this greatly contributes to more efficiently performing the disclosed technique.
The energy equation of the Ising model may be expressed, for example, by the following equation.
= A · [ - i ( j W ij + b i ) ] · x i + B · res rot ( x k - 1 ) 2
In the equation, Wij represents a side chain-side chain interaction, bi represents a main chain-side chain interaction at the amino acid residue. xi represents bits of a rotor state of the side chain. “res” represents an amino acid residue. “rot” represents the rotation of the side chain. xk represents bits of the rotor state of the k-th amino acid residue. A represents a positive number. B represents a positive number.
The minimum energy of the Ising model may be calculated using an annealing machine. The annealing machine is not particularly limited and may be appropriately selected depending on the intended purpose as long as it is a computer that employs an annealing method of performing a ground state search for an energy function represented by an Ising model, and the annealing machine may be a quantum annealing machine, a semiconductor annealing machine using semiconductor technology, or simulated annealing executed by software using a central processing unit (CPU) or a graphics processing unit (GPU).
An example of an annealing method and an annealing machine will be described below.
The annealing method (simulated annealing method, SA method) is a kind of Monte Carlo method, and is a method of probabilistically obtaining a solution by using a random number value. Hereinafter, an object of minimizing a value of an evaluation function to be optimized will be described as an example, and the value of the evaluation function will be referred to as energy. In a case of maximization, a sign of the evaluation function may be changed.
First, starting with an initial state in which one discrete value is assigned to an individual variable, a state transition from the current state (a combination of values of variables) to a state close to the current state (for example, a state in which only one of the variables has been changed) is examined. A change in energy associated with the state transition is calculated, and it is stochastically determined whether to adopt the state transition and change the current state or to maintain the current state without adopting the state transition, according to the calculated value. When setting an adoption probability of a state transition that results in a drop in the energy to be greater than that of a state transition that results in a rise in the energy, a state change occurs in a direction in which the energy drops on average, and thus it is possible to expect that the state is transitioned to a more suitable state with the lapse of time. Finally, an optimal solution or an approximate solution that possibly results in energy close to that of the optimal solution may be obtained. When a state transition that results in a drop in the energy in a deterministic way is adopted and a state transition that results in a rise in the energy is not adopted, the change in energy broadly and monotonically decreases over time, however, once a local solution is reached, no further change may occur. Since an extraordinarily large number of local solutions are present in a discrete optimization problem as described above, in many cases the state gets stuck at a local solution that is not very close to an optimal solution. Therefore, it is important to stochastically decide whether it is adopted.
In the annealing method, it has been proven that the state reaches the optimal solution at a limit of infinite time (the number of iterations) when the adoption (acceptance) probability of the state transition is determined as follows.
(1) For an energy change (energy decrease) value (−ΔE) associated with a state transition, an acceptance probability p of the state transition is determined by any of the following functions f( ).
p ( Δ E , T ) = f ( - Δ E / T ) ( Formula 1 - 1 ) f metro ( x ) = min ( 1 , e x ) ( Metropolis method ) ( Formula 1 - 2 ) f Gibbs ( x ) = 1 1 + e - x ( Gibbs method ) ( Formula 1 - 3 )
T is a parameter called a temperature value and is changed as follows.
(2) The temperature value T is logarithmically reduced with respect to the number of iterations t as expressed by the following equation.
T = T 0 log ( c ) log ( t + c ) ( Formula 2 )
T0 represents an initial temperature value and it is desirable that a sufficiently large value be set in accordance with the problem.
In a case of using the acceptance probability expressed by the Formulas in (1), when a steady state is reached after sufficient number of iterations, an occupation probability of an individual state is in accordance with a Boltzmann distribution with respect to a thermal equilibrium state in thermodynamics. Since the occupation probability of a lower-energy state increases when the temperature is gradually lowered from a high initial temperature, a low-energy state is supposed to be obtained when the temperature has sufficiently decreased. This method is referred to as an annealing method (or pseudo-annealing method) because this behavior resembles state change when annealing a material. The stochastic occurrence of a state transition that results in a rise in the energy corresponds to thermal excitation in physics.
FIG. 1 illustrates a conceptual configuration of an optimization apparatus (calculation unit) that performs the annealing method. Although cases where a plurality of candidates for the state transition is generated will be also described in the following description, the transition candidates are generated one by one in the normal basic annealing method.
An optimization apparatus 100 includes a state holding unit 111 configured to hold a current state S (values of a plurality of state variables). The optimization apparatus 100 also includes an energy calculation unit 112 configured to calculate energy change values {−ΔEi} of state transitions in a case where the state transition occurs from the current state S as a result of change in any of the values of the plurality of state variables. The optimization apparatus 100 includes a temperature control unit 113 configured to control a temperature value T and a transition control unit 114 configured to control state changes.
The transition control unit 114 stochastically determines whether or not any one of a plurality of state transitions is accepted depending on a relative relationship between the energy change values {−ΔEi} and thermal excitation energy, based on the temperature value T, the energy change values {−Mi}, and the random number value.
When the transition control unit 114 is subdivided, the transition control unit 114 includes a candidate generation unit 114 a for generating a candidate for a state transition, and an acceptance determination unit 114 b for stochastically determining for each candidate whether or not the state transition is accepted based on the energy change values {−ΔEi} of the candidates and the temperature value T. The transition control unit 114 further includes a transition determination unit 114 c for determining a candidate to be adopted from the accepted candidates, and a random number generation unit 114 d for generating a probability variable.
The operation in one iteration is as follows. First, the candidate generation unit 114 a generates one or a plurality of candidates (candidate numbers {Ni}) for the state transition from the current state S held by the state holding unit 111 to the next state. The energy calculation unit 112 calculates energy change values {−ΔEi} for each of the state transitions for the candidates, by using the current state S and the candidates for the state transition. The acceptance determination unit 114 b uses the temperature value T generated in the temperature control unit 113 and a probability variable (random number value) generated by the random number generation unit 114 d, and accepts the state transition with the acceptance probability expressed by the above Formulas in (1) according to the energy change values {−ΔEi} of the respective state transitions. The acceptance determination unit 114 b outputs the acceptances {fi} of the respective state transitions. In a case where a plurality of state transitions is accepted, the transition determination unit 114 c randomly selects one thereof by using a random number value. The transition determination unit 114 c then outputs a transition number N of the selected state transition, and a transition acceptance f. In a case where there is an accepted state transition, the values of the state variable stored in the state holding unit 111 is updated according to the adopted state transition.
Starting with an initial state, the above-described iteration is repeated while causing the temperature control unit 113 to lower the temperature value, and the operation ends when an end determination condition, for example, a certain number of iterations is reached, or the energy becomes lower than a predetermined value, is satisfied. The solution outputted by the optimization apparatus 110 is the state corresponding to the end of the operation.
FIG. 2 is a circuit level block diagram of a configuration example of a transition control unit in a normal annealing method for generating candidates one by one, especially, an arithmetic portion used for an acceptance determination unit.
The transition control unit 114 includes a random number generation circuit 114 b 1, a selector 114 b 2, a noise table 114 b 3, a multiplier 114 b 4, and a comparator 114 b 5.
Of the energy change values {−ΔEi} calculated for the candidates of the respective state transitions, the selector 114 b 2 selects and outputs an energy change value corresponding to the transition number N, which is a random number value generated by the random number generation circuit 114 b 1.
Functions of the noise table 114 b 3 will be described later. As the noise table 114 b 3, for example, a memory such as a random-access memory (RAM) or a flash memory may be used.
The multiplier 114 b 4 outputs a product obtained by multiplying a value outputted by the noise table 114 b 3 by the temperature value T (corresponding to the thermal excitation energy described above).
The comparator 114 b 5 outputs a comparison result in which a multiplication result outputted by the multiplier 114 b 4 is compared with an energy change value −ΔE selected by the selector 114 b 2, as the transition acceptance f.
The transition control unit 114 illustrated in FIG. 2 basically implements the above-described function as is, but the mechanism for permitting the state transition with the acceptance probability expressed by Formulas in (1) has not been described so far, and therefore, this will be supplemented.
A circuit that outputs 1 at the acceptance probability p and outputs 0 at the probability (1-p) may be realized by a comparator that has two inputs A and B, and outputs 1 when A>B, and outputs 0 when A<B by inputting the acceptance probability p to the input A and a uniform random number having a value in a section [0, 1) to the input B. Thus, with an input of the value of the acceptance probability p calculated by using Formulas in (1) based on the energy change value and the temperature value T to the input A of the comparator, it is possible to realize the above function.
Specifically, assuming that f is the function used in Formulas in (1), and that u is a uniform random number having a value in the section [0, 1), a circuit that outputs 1 when f(ΔE/T) is greater than u realizes the above function.
The situation may be accepted, however, the same function may be realized by the following modification. Even when the same monotonically increasing function is applied to two numbers, the two numbers maintain the same magnitude relationship. Therefore, even when the same monotonically increasing function is applied to the two inputs of the comparator, the same output is obtained. When an inverse function f−1 off is adopted as this monotonically increasing function, it is seen that a circuit that outputs 1 when −ΔE/T is greater than f−1(u) may be adopted. Since the temperature value T is positive, it is seen that a circuit that outputs 1 when −ΔE is greater than Tf−1(u) may be adopted. The noise table 114 b 3 in FIG. 2 is a conversion table for realizing the inverse function f−1(u), and is a table for outputting a value of the following function with respect to the input obtained by discretizing the section [0, 1).
f metro - 1 ( u ) = log ( u ) ( Formula 3 - 1 ) f Gibbs - 1 ( u ) = log ( u 1 - u ) ( Formula 3 - 2 )
Although the transition control unit 114 includes a latch that holds a determination result and the like, a state machine that generates the corresponding timing, and the like, these components are not illustrated in FIG. 2 for simple illustration.
FIG. 3 illustrates the flow of operation of the transition control unit 114. The flow of operation includes a step of selecting one state transition as a candidate (S0001), a step of determining whether a state transition is accepted or not by comparing the energy change value with respect to the state transition with a product of a temperature value and a random number value (S0002), and a step in which the state transition is adopted when the state transition is accepted, and the state transition is not adopted when the state transition is not accepted (S0003).
<Comparison Process>
In the comparison process, the calculated data of the stable steric structure of the peptide molecule is compared with the data of the steric structure of the peptide molecule in the complex structure to search for a side chain having a difference in steric configuration between two peptide molecules.
The comparison is preferably performed by visualizing the stable steric structure of the peptide molecule and the steric structure of the peptide molecule in the complex structure. In doing so, the side chain having a difference in steric configuration between two peptide molecules may be easily found.
The method for visualizing the steric structure of the peptide molecule is not particularly limited and may be appropriately selected depending on the intended purpose, and may be performed using known molecular graphic software. Examples of the molecular graphic software include, for example, PyMOL, and the like.
Visualization may be performed, for example, by incorporating the steric structure data of the peptide molecule into molecular graphic software to construct a steric structure, and displaying the created steric structure on a display device.
The comparison is preferably performed by superposing the main chain of the visualized stable steric structure of the peptide molecule with the main chain of the visualized steric structure of the peptide molecule. In this way, the side chain having a difference in steric configuration between the two peptide molecules may be found more easily.
The superposition of the main chains may be carried out, for example, by overlapping a Cα atom of each amino acid residue in the peptide molecule and overlapping a Cβ atom of the side chain.
The superposition of the main chains may be carried out using, for example, molecular graphic software. Examples of the molecular graphic software include, for example, PyMOL, and the like.
In the comparison process, for example, a side chain (hotspot) that largely contributes to the structural stabilization of the complex structure is specified from the side chain having a difference in steric configuration between two peptide molecules.
The identification of the hotspot may be appropriately determined from the side chain having a difference in steric configuration between two peptide molecules. For example, when the calculated stable steric structure of the peptide molecule is superimposed on the steric structure of the peptide molecule of the complex structure so that the main chains of the two peptide molecules overlap, in a case where the side chain of the calculated peptide molecule overlaps a binding site of the target molecule of the complex structure (the site where the peptide molecule binds to the target molecule), the side chain is specified as a hotspot.
The side chain in the peptide molecule of the complex structure corresponding to the side chain overlapping the binding site is likely to interfere with the structural stabilization of the complex structure. Specifically, the side chain is likely to be a hotspot.
The disclosed technique will be described using a flowchart.
FIG. 4 is an example of a flow chart of the disclosed technique.
<Step S1>
First, the stable steric structure of the peptide molecule, in a state where the steric configuration of the main chain of the peptide molecule in the steric structure is fixed, is calculated using data of the steric structure of the peptide molecule in the complex structure of the target molecule and the peptide molecule (S1).
The data of the steric structure of the peptide molecule in the complex structure may be acquired from data of the complex structure recorded in a known database. Examples of the known databases include Protein Data Bank (PDB) and the like.
The calculation of the stable steric structure is preferably performed, for example, by performing a ground state search using an annealing method on the Ising model converted based on the restriction conditions of the side chain of the peptide molecule to calculate the minimum energy of the Ising model.
<Step S2>
Next, the calculated data of the stable steric structure of the peptide molecule is compared with the data of the steric structure of the peptide molecule in the complex structure to search for a side chain having a difference in steric configuration between the two peptide molecules (S2).
The comparison is performed by visualizing the stable steric structure of the peptide molecule and the steric structure of the peptide molecule in the complex structure using, for example, molecular graphic software.
Thus, the side chain of the peptide molecule to be modified for stabilizing the complex structure may be efficiently found.
An example of experimental results of the method of the disclosed technology is described below.
FIG. 5A is the complex structure of 1WCA in PDB. The protein is CYCLOPHILIN A (CypA) and the peptide molecule is CYCLOSPORIN A (CsA).
Using the peptide molecule having the complex structure illustrated in FIG. 5A, the stable steric structure of the peptide molecule was calculated in a state where the steric configuration of the main chain of the peptide molecule was fixed.
The calculated stable steric structure of the peptide molecule and the steric structure of the peptide molecule in the complex structure were visualized in a state where the main chains were overlapped. The results are illustrated in FIG. 5B.
In FIG. 5B, the main chains of the two peptide molecules to be compared overlap, and some side chains have a difference in steric configuration. The side chains of 1-Leu, 4-Ver, and 6-Leu that are indicated by circles have a large difference in steric configuration and are likely to be hotspots. The calculated structure of the side chain of the peptide molecule is indicated by a relatively thin line.
(Program)
The disclosed program is a program for causing a computer to execute the disclosed method for searching for a modification site of a peptide molecule,
The program may be created using various known program languages according to a configuration of a computer system to be used, and a type, a version, and the like of an operating system.
The program may be recorded using a recording medium such as an internal hard disk or an external hard disk, or may be recorded using a recording medium such as a compact disc read-only memory (CD-ROM), a digital versatile disk read-only memory (DVD-ROM), a magneto-optical disk (MO disk), or a Universal Serial Bus (USB) memory [USB flash drive]. When the program is recorded using a recording medium such as a CD-ROM, a DVD-ROM, an MO disk, or a USB memory, the program may be used directly or by being installed on a hard disk through a recording medium reading device included in the computer system when appropriate. The program may be also recorded in an external storage area (another computer or the like) accessible from the computer system through the information communication network, and the program may be used directly from the external storage area through the information communication network or by being installed on the hard disk when appropriate.
The program may be recorded using a plurality of recording media while being divided for each arbitrary process.
(Recording Medium)
The disclosed recording medium records the disclosed program.
The disclosed recording medium is computer-readable.
The disclosed recording medium may be transitory or non-transitory.
The disclosed recording medium is, for example, a recording medium having recorded thereon a program for causing a computer to execute the disclosed method for searching for a modification site of a peptide molecule.
The recording medium is not particularly limited, and may be appropriately selected according to the purpose, and examples thereof include, for example, an internal hard disk, an external hard disk, a CD-ROM, a DVD-ROM, an MO disk, a USB memory, and the like.
The recording medium may be a plurality of recording media in which the program is divided and recorded for each arbitrary process.
(Apparatus for Searching for Modification Site of Peptide Molecule)
The disclosed apparatus for searching for a modification site of a peptide molecule includes at least a calculation unit and a comparison unit, and further includes other units when appropriate.
The calculation unit, by using data of the steric structure of the peptide molecule in the complex structure of the target molecule and the peptide molecule, calculates the stable steric structure of the peptide molecule in a state in which the steric configuration of the main chain of the peptide molecule in the steric structure is fixed.
The comparison unit compares the calculated data of the stable steric structure of the peptide molecule with the data of the steric structure of the peptide molecule in the complex structure to search for a side chain having a difference in steric configuration between two peptide molecules.
An aspect of the calculation unit is the same as the aspect of the calculation process in the disclosed method for searching for a modification site of a peptide molecule.
An aspect of the comparison unit is the same as the aspect of the comparison process in the disclosed method for searching for a modification site of a peptide molecule.
The disclosed apparatus for searching for a modification site of a peptide molecule includes, for example, a memory, a processor, and other units when appropriate.
The memory stores, for example, data of a complex structure of a target molecule and a peptide molecule.
The memory stores, for example, data of the steric structure of the peptide molecule in the complex structure.
The memory stores, for example, data of the calculated stable steric structure of the peptide molecule.
The processor is coupled to the memory.
The processor is configured to use the data of the steric structure of the peptide molecule in the complex structure of the target molecule and the peptide molecule to calculate a stable steric structure of the peptide molecule in a state where the steric configuration of the main chain of the peptide molecule in the steric structure is fixed.
The processor is configured to compare the calculated data of the stable steric structure of the peptide molecule with the data of the steric structure of the peptide molecule in the complex structure to search for a side chain having a difference in steric configuration between two peptide molecules.
The processor is, for example, a CPU, a GPU, or a combination thereof.
FIG. 6 illustrates a configuration example of a disclosed apparatus for searching for a modification site of a peptide molecule.
The apparatus 10 is configured, for example, by a CPU 11, a memory 12, a storage unit 13, a display unit 14, an input unit 15, an output unit 16, an I/O interface unit 17, and the like that are coupled via a system bus 18.
The CPU 11 performs operations (four arithmetic operations, comparison operations, and the like), operation control of hardware and software, and the like.
The memory 12 is a memory such as a RAM, a read-only memory (ROM), or the like. The RAM stores an operating system (OS), an application program, and the like read from the ROM and the storage unit 13, and functions as a main memory and a work area of the CPU 11.
The storage unit 13 is a device for storing various programs and data, and is a hard disk, for example. The storage unit 13 stores a program to be executed by the CPU 11, data to be used for execution of the program, the OS, and the like.
The program is stored in the storage unit 13, loaded into the RAM (main memory) of the memory 12, and executed by the CPU 11.
The display unit 14 is a display device, and is, for example, a display device such as a CRT monitor, a liquid crystal panel, or the like.
The input unit 15 is an input device for various data, and is, for example, a keyboard, a pointing device (for example, a mouse, or the like), or the like.
The output unit 16 is an output device for various data, and is, for example, a printer, or the like.
The I/O interface unit 17 is an interface for coupling various external devices. For example, the I/O interface unit 17 enables input and output of data of a CD-ROM, a DVD-ROM, an MO disk, a USB memory, or the like.
FIG. 7 illustrates another example of the configuration of the disclosed apparatus for searching for a modification site of a peptide molecule.
The configuration example of FIG. 7 is a cloud-type configuration example, and the CPU 11 is independent of the storage unit 13 and the like. In the configuration example, a computer 30 that stores the storage unit 13 and the like, and a computer 40 that stores the CPU 11 are coupled via network interface units 19 and 20.
The network interface units 19 and 20 are hardware configured to perform communication by using the Internet.
FIG. 8 illustrates another example of the configuration of the disclosed apparatus for searching for a modification site of a peptide molecule.
The configuration example of FIG. 8 is a cloud-type configuration example, and the storage unit 13 is independent of the CPU 11 and the like. In the configuration example, the computer 30 that stores the CPU 11 and the like, and the computer 40 that stores the storage unit 13 are coupled via the network interface units 19 and 20.
According to the disclosed method for searching for a modification site of a peptide molecule, the above-described problems in the related art may be solved, and a modification site of a peptide molecule may be efficiently searched for.
All examples and conditional language provided herein are intended for the pedagogical purposes of aiding the reader in understanding the invention and the concepts contributed by the inventor to further the art, and are not to be construed as limitations to such specifically recited examples and conditions, nor does the organization of such examples in the specification relate to a showing of the superiority and inferiority of the invention. Although one or more embodiments of the present invention have been described in detail, it should be understood that the various changes, substitutions, and alterations could be made hereto without departing from the spirit and scope of the invention.

Claims (9)

What is claimed is:
1. A method for searching for a modification site of a peptide molecule, the method comprising:
calculating, by a computer, a second steric structure of the peptide molecule by using data of a first steric structure of the peptide molecule, the first steric structure being a steric structure of the peptide molecule in a complex structure of a target molecule and the peptide molecule, the second steric structure being a stable steric structure of the peptide molecule in a state where a steric configuration of a main chain of the peptide molecule in the first steric structure is fixed, wherein the calculating the second steric structure further includes performing a ground state search using an annealing method for an Ising model converted based on a restriction condition of a side chain of the peptide molecule to calculate a minimum energy of the Ising model and to reduce computational cost; and
comparing data of the second steric structure with the data of the first steric structure in order to search for a side chain having a difference in steric configuration between the two steric structures.
2. The method according to claim 1, further comprising:
visualizing the second steric structure and the first steric structure to compare the data of the second steric structure with the data of the first steric structure.
3. The method according to claim 2, further comprising: superposing a main chain of the visualized second steric structure and a main chain of the visualized first steric structure to compare the data of the second steric structure with the data of the first steric structure.
4. A non-transitory computer-readable recording medium having stored therein a program that causes a computer to execute a process, the process comprising:
calculating a second steric structure of the peptide molecule by using data of a first steric structure of the peptide molecule, the first steric structure being a steric structure of the peptide molecule in a complex structure of a target molecule and the peptide molecule, the second steric structure being a stable steric structure of the peptide molecule in a state where a steric configuration of a main chain of the peptide molecule in the first steric structure is fixed, wherein the calculating the second steric structure further includes performing a ground state search using an annealing method for an Ising model converted based on a restriction condition of a side chain of the peptide molecule to calculate a minimum energy of the Ising model and to reduce computational cost; and
comparing data of the second steric structure with the data of the first steric structure in order to search for a side chain having a difference in steric configuration between the two steric structures.
5. The non-transitory computer-readable recording medium according to claim 4, the process further comprising:
visualizing the second steric structure and the first steric structure to compare the data of the second steric structure with the data of the first steric structure.
6. The non-transitory computer-readable recording medium according to claim 5, the process further comprising: superposing a main chain of the visualized second steric structure and a main chain of the visualized first steric structure to compare the data of the second steric structure with the data of the first steric structure.
7. An information processing apparatus, comprising:
a memory; and a processor coupled to the memory and the processor configured to:
calculate a second steric structure of the peptide molecule by using data of a first steric structure of the peptide molecule, the first steric structure being a steric structure of the peptide molecule in a complex structure of a target molecule and the peptide molecule, the second steric structure being a stable steric structure of the peptide molecule in a state where a steric configuration of a main chain of the peptide molecule in the first steric structure is fixed, wherein the calculate the second steric structure further includes perform a ground state search using an annealing method for an Ising model converted based on a restriction condition of a side chain of the peptide molecule to calculate a minimum energy of the Ising model and to reduce computational cost; and
compare data of the second steric structure with the data of the first steric structure in order to search for a side chain having a difference in steric configuration between the two steric structures.
8. The information processing apparatus according to claim 7, wherein the processor is further configured to: visualize the second steric structure and the first steric structure to compare the data of the second steric structure with the data of the first steric structure.
9. The information processing apparatus according to claim 8, wherein the processor is further configured to: superpose a main chain of the visualized second steric structure and a main chain of the visualized first steric structure to compare the data of the second steric structure with the data of the first steric structure.
US17/001,020 2019-10-21 2020-08-24 Method for searching for modification site of peptide molecule and information processing apparatus Active 2041-09-18 US11594299B2 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
JP2019191785A JP7347113B2 (en) 2019-10-21 2019-10-21 Method for searching for modified sites in peptide molecules, searching device, and program
JP2019-191785 2019-10-21
JPJP2019-191785 2019-10-21

Publications (2)

Publication Number Publication Date
US20210118523A1 US20210118523A1 (en) 2021-04-22
US11594299B2 true US11594299B2 (en) 2023-02-28

Family

ID=72266163

Family Applications (1)

Application Number Title Priority Date Filing Date
US17/001,020 Active 2041-09-18 US11594299B2 (en) 2019-10-21 2020-08-24 Method for searching for modification site of peptide molecule and information processing apparatus

Country Status (4)

Country Link
US (1) US11594299B2 (en)
EP (1) EP3813069B1 (en)
JP (1) JP7347113B2 (en)
CN (1) CN112768002B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020155501A1 (en) * 1997-07-03 2002-10-24 Akiko Itai. Method of predicting functions of proteins using ligand database
WO2003006154A2 (en) 2001-07-10 2003-01-23 Xencor, Inc. Protein design automation for designing protein libraries with altered immunogenicity
JP2005018447A (en) 2003-06-26 2005-01-20 Ryoka Systems Inc Method for searcing acceptor-ligand stable complex structure
JP2006209764A (en) 2001-01-19 2006-08-10 In-Silico Science Inc Specification method for ligand bonding portion of protein and three dimensional structure construction method for protein-ligand complex

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2003265372A1 (en) * 2002-08-06 2004-02-23 Emory University Novel druggable regions in set domain proteins and methods of using the same
US8005620B2 (en) * 2003-08-01 2011-08-23 Dna Twopointo Inc. Systems and methods for biopolymer engineering
EP1939779A3 (en) * 2003-08-01 2009-04-01 Dna Twopointo Inc. Systems and methods for biopolymer engineering
CN1996238A (en) * 2006-12-15 2007-07-11 江南大学 Analysis program for use in hydrolysis production of bioactive peptide from protein
JP5764860B2 (en) * 2010-04-01 2015-08-19 三井情報株式会社 Protein identification apparatus, identification method, identification program, and computer-readable recording medium recording the same
US20170329892A1 (en) * 2016-05-10 2017-11-16 Accutar Biotechnology Inc. Computational method for classifying and predicting protein side chain conformations
JP7214972B2 (en) 2018-03-30 2023-01-31 富士通株式会社 Method for calculating stable three-dimensional structure, calculation device, and program

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020155501A1 (en) * 1997-07-03 2002-10-24 Akiko Itai. Method of predicting functions of proteins using ligand database
JP2006209764A (en) 2001-01-19 2006-08-10 In-Silico Science Inc Specification method for ligand bonding portion of protein and three dimensional structure construction method for protein-ligand complex
WO2003006154A2 (en) 2001-07-10 2003-01-23 Xencor, Inc. Protein design automation for designing protein libraries with altered immunogenicity
JP2005018447A (en) 2003-06-26 2005-01-20 Ryoka Systems Inc Method for searcing acceptor-ligand stable complex structure

Non-Patent Citations (14)

* Cited by examiner, † Cited by third party
Title
Casiraghi et al., "Grafting Aminocyclopentane Carboxylic Acids onto the RGD Tripeptide Sequence Generates Low Nanomolar αvβ3/αvβ5 Integrin Dual Binders", Journal of Medicinal Chemistry, American Chemical Society, vol. 48, No. 24, Apr. 11, 2005 pp. 7675-7687.
Els et al., "An Aromatic Region To Induce a Switch between Agonism and Inverse Agonism at the Ghrelin Receptor", Journal of Medicinal Chemistry, vol. 55, No. 17, Sep. 13, 2012, pp. 7437-7449.
Extended European Search Report dated Feb. 16, 2021 from European Application No. 20193016.1, 16 pages.
Hao et al., "Torsion Angle Preference and Energetics of Small-Molecule Ligands Bound to Proteins", J. Chem. Inf. Model. 2007, 47, 2242-2252.
Menchise, V., 2003. Insights into peptide nucleic acid (PNA) structural features: The crystal structure of a d-lysine-based chiral PNA-DNA duplex. Proceedings of the National Academy of Science. (Year: 2003). *
Mulligan et al., "Designing Peptides on a Quantum Computer", bioRxiv, Mar. 11, 2020, pp. 1-20.
Ota et al., "Binding Mode Prediction for a Flexible Ligand in a Flexible Pocket using Multi-Conformation Simulated Annealing Pseudo Crystallographic Refinement", Journal of Molecular Biology, vol. 314, No. 3, Nov. 30, 2001, pp. 607-617.
Ota N, Agard DA. Binding mode prediction for a flexible ligand in a flexible pocket using multi-conformation simulated annealing pseudo crystallographic refinement. Journal of Molecular Biology. Nov. 30, 2001;314(3):607-17. (Year: 2001). *
Pokharel et al., "Integrin activation by the lipid molecule 25-hydroxycholesterol induces a proinflammatory response", Nature Communications, vol. 10, No. 1, Apr. 1, 2019, 17 pages.
Reina et al., "Computer-aided design of a PDZ domain to recognize new target sequences", Nature Structural Biology, vol. 9, No. 8, Jun. 24, 2002, 7 pages.
Schueler-Furman et al., "Knowledge-based structure prediction of MHC class I bound peptides: a study of 23 complexes", Folding & Design/Structure, Cell Press, vol. 3, No. 6, Jan. 1, 1998, pp. 549-564.
Tanida Y, Ito M, Fujitani H. Calculation of absolute free energy of binding for theophylline and its analogs to RNA aptamer using nonequilibrium work values. Chemical Physics. Aug. 16, 2007;337(1-3):135-43. (Year: 2007). *
Te et al., "Predicting the effects of amino acid replacements in peptide hormones on their binding affinities for class B GPCRs and application to the design of secretin receptor antagonists", Journal of Computer-Aided Molecular Design, Kluwer Academic Publishers, DO, vol. 26, No. 7, May 11, 2012, pp. 835-845.
Ward, Matthew S., Mohammad Ataai, Richard R. Koepsel, and Rex E. Shepherd. "Comparison of Energy-Minimized Structures of [Pdll (N-methyliminodiacetate)] Complexes of X1-His-X3-His-His Peptides as an Analysis of Steric and Specific Interactions with Synthetic Binding Tags for IMAC Separations." (Year: 2001). *

Also Published As

Publication number Publication date
JP2021068081A (en) 2021-04-30
JP7347113B2 (en) 2023-09-20
US20210118523A1 (en) 2021-04-22
EP3813069B1 (en) 2024-02-07
EP3813069A1 (en) 2021-04-28
CN112768002B (en) 2024-02-23
CN112768002A (en) 2021-05-07

Similar Documents

Publication Publication Date Title
Blaabjerg et al. Rapid protein stability prediction using deep learning representations
Feig et al. MMTSB Tool Set: enhanced sampling and multiscale modeling methods for applications in structural biology
Janowski et al. Peptide crystal simulations reveal hidden dynamics
Hoffmann et al. Accurate methyl group dynamics in protein simulations with AMBER force fields
Tzanov et al. How accurately do current force fields predict experimental peptide conformations? An adiabatic free energy dynamics study
EP2619700B1 (en) System for molecular packing calculations
Ochoa et al. PARCE: protocol for amino acid refinement through computational evolution
Veit-Acosta et al. The impact of crystallographic data for the development of machine learning models to predict protein-ligand binding affinity
JP2021192199A (en) Structure search method, structure search device, program for structure search, and interaction potential specification method
Grillo et al. Quantum chemical descriptors based on semiempirical methods for large biomolecules
Jarmolinska et al. DCA-MOL: a PyMOL plugin to analyze direct evolutionary couplings
Lombard et al. Explaining Conformational Diversity in Protein Families through Molecular Motions
US11594299B2 (en) Method for searching for modification site of peptide molecule and information processing apparatus
US11031093B2 (en) Systems and methods for identifying thermodynamically relevant polymer conformations
Suarez et al. Sampling assessment for molecular simulations using conformational entropy calculations
Jamroz et al. Protocols for efficient simulations of long-time protein dynamics using coarse-grained CABS model
EP2973132B1 (en) Systems and methods for identifying thermodynamic effects of atomic changes to polymers
Hafsa et al. Accessible surface area from NMR chemical shifts
Shen et al. Validation of X-ray Crystal Structure Ensemble Representations of SARS-CoV-2 Main Protease by Solution NMR Residual Dipolar Couplings
Carugo et al. Criteria to extract high-quality protein data bank subsets for structure users
Kasavajhala et al. Exploring the Transferability of Replica Exchange Structure Reservoirs to Accelerate Generation of Ensembles for Alternate Hamiltonians or Protein Mutations
Donovan-Maiye et al. Systematic testing of belief-propagation estimates for absolute free energies in atomistic peptides and proteins
Grahnen et al. Fast side chain replacement in proteins using a coarse-grained approach for evaluating the effects of mutation during evolution
Fonseca et al. Probing RNA native conformational ensembles with structural constraints
Marziali et al. SADIC v2: A modern implementation of the Simple Atom Depth Index Calculator

Legal Events

Date Code Title Description
FEPP Fee payment procedure

Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

AS Assignment

Owner name: FUJITSU LIMITED, JAPAN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:TANIDA, YOSHIAKI;REEL/FRAME:053603/0924

Effective date: 20200731

STPP Information on status: patent application and granting procedure in general

Free format text: APPLICATION DISPATCHED FROM PREEXAM, NOT YET DOCKETED

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

STPP Information on status: patent application and granting procedure in general

Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS

STPP Information on status: patent application and granting procedure in general

Free format text: AWAITING TC RESP., ISSUE FEE NOT PAID

STPP Information on status: patent application and granting procedure in general

Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS

STCF Information on status: patent grant

Free format text: PATENTED CASE