US12340872B2 - Systems and methods for engineering enzymes using identified energy transfer networks - Google Patents
Systems and methods for engineering enzymes using identified energy transfer networks Download PDFInfo
- Publication number
- US12340872B2 US12340872B2 US17/068,371 US202017068371A US12340872B2 US 12340872 B2 US12340872 B2 US 12340872B2 US 202017068371 A US202017068371 A US 202017068371A US 12340872 B2 US12340872 B2 US 12340872B2
- Authority
- US
- United States
- Prior art keywords
- enzyme
- energy
- residue
- conductance
- network
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active, expires
Links
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B15/00—ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0012—Oxidoreductases (1.) acting on nitrogen containing compounds as donors (1.4, 1.5, 1.6, 1.7)
- C12N9/0026—Oxidoreductases (1.) acting on nitrogen containing compounds as donors (1.4, 1.5, 1.6, 1.7) acting on CH-NH groups of donors (1.5)
- C12N9/0028—Oxidoreductases (1.) acting on nitrogen containing compounds as donors (1.4, 1.5, 1.6, 1.7) acting on CH-NH groups of donors (1.5) with NAD or NADP as acceptor (1.5.1)
- C12N9/003—Dihydrofolate reductase [DHFR] (1.5.1.3)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/18—Carboxylic ester hydrolases (3.1.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/90—Isomerases (5.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y105/00—Oxidoreductases acting on the CH-NH group of donors (1.5)
- C12Y105/01—Oxidoreductases acting on the CH-NH group of donors (1.5) with NAD+ or NADP+ as acceptor (1.5.1)
- C12Y105/01003—Dihydrofolate reductase (1.5.1.3)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y301/00—Hydrolases acting on ester bonds (3.1)
- C12Y301/01—Carboxylic ester hydrolases (3.1.1)
- C12Y301/01008—Cholinesterase (3.1.1.8), i.e. butyrylcholine-esterase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y502/00—Cis-trans-isomerases (5.2)
- C12Y502/01—Cis-trans-Isomerases (5.2.1)
- C12Y502/01008—Peptidylprolyl isomerase (5.2.1.8), i.e. cyclophilin
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B15/00—ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
- G16B15/20—Protein or domain folding
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B15/00—ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
- G16B15/30—Drug targeting using structural data; Docking or binding prediction
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B5/00—ICT specially adapted for modelling or simulations in systems biology, e.g. gene-regulatory networks, protein interaction networks or metabolic networks
Definitions
- Enzymes are naturally occurring biological catalysts. They are commonly utilized in industrial applications, food products, animal feed, food preparation, house-hold products, medicinal therapies and in a variety of laboratory scale processes. Naturally occurring enzymes and their modified versions are used to improve the speed of reactions which are too slow to occur on their own or would not occur at all.
- engineered versions of enzymes are highly desired, as the naturally occurring enzymes do not provide the required speed for the reaction and/or they do not withstand the operating conditions.
- enzymes that are able to tolerate high temperatures and other harsh conditions including acidic or alkaline environments. Technologies that provide increased enzyme stability during storage and maintain enzyme viability over extended periods of time are also highly desired. Additionally, more efficient enzymes which enable a desired reaction to proceed at a much faster rate allow for time and cost savings. Therefore, engineering of enzymes has been widely pursued for developing better suited and more efficient versions in various applications. In particular, technologies which improve the stability of enzymes and their catalytic efficiencies are the most sought after.
- This disclosure is directed at engineering enzymes that improve the speed and efficiency of catalyzed reactions and provide optimal enzymes in desired conditions.
- This disclosure further provides methods and systems for manipulating regions of enzymes and also enables designing of related small compounds or large molecules that enable enzymes to achieve better efficiency and more stability.
- the invention is based on the new understanding of protein structure, dynamics and function with the role of surrounding environment in protein activity. Further understanding of conformational fluctuations that enable the activity of the protein target allows identification of functionally critical conformational sub-states. This, in turn, allows for discovery of distal regions of enzyme's structure that are optimized for improving the thermo-dynamical coupling between the enzyme surface and the surrounding environment.
- This disclosure provides methods for identifying functionally important surface regions of enzymes and approaches to optimize the energy flow between these regions and the surrounding solvent and the environment. Using the fundamental knowledge of factors that enable enzymes to achieve their catalytic efficiency, new versions of more efficient and more stable enzymes can be developed.
- a system for engineering enzymes comprises: a processing device; an enzyme database; and a memory device in communication with the processing device.
- the memory device comprises instructions that, when executed by the processing device, cause the system to: receive input at a user interface selecting a target enzyme; access biophysical and dynamics data for the target enzyme from the enzyme database; analyze the biophysical and dynamics data to identify an energy transfer network within the target enzyme, the energy transfer network comprising a series of residues spanning from a surface of the target enzyme to a catalytic site of the target enzyme; identify a surface loop within the energy transfer network, the surface loop comprising a plurality of residues showing dynamical motions coupled to the catalytic activity of the target enzyme; analyze a secondary structure of the target enzyme to identify one or more locations within the surface loop to modify; modify one or more residues in at least one of the one or more locations in the surface loop to produce an engineered enzyme; and output an amino acid sequence of the engineered enzyme on the user interface.
- a method of engineering a target enzyme to enhance its catalytic activity includes: accessing biophysical and dynamics data for the target enzyme; analyzing, using a computing device, the biophysical and dynamics data to identify at least one energy transfer network within the target enzyme, the at least one energy transfer network comprising a series of residues spanning from a surface of the target enzyme to a catalytic site of the target enzyme; identifying at least one surface loop within the energy transfer network, the at least one surface loop comprising a plurality of residues showing dynamical motions coupled to the catalytic activity of the target enzyme; analyzing a secondary structure of the target enzyme to identify one or more locations within the surface loop to modify; modifying one or more of the plurality of residues in the surface loop to produce an enhanced enzyme sequence; modeling the engineered enzyme in complex with a substrate using molecular dynamics simulations; analyzing substrate kinetics of the engineered enzyme; and outputting an amino acid sequence for the enhanced enzyme.
- a method of identifying small compounds to improve the activity of an enzyme having an industrial application includes: accessing biophysical and dynamics data for the enzyme; analyzing the biophysical and dynamics data to identify one or more potential ligand binding sites located remote from a site of designated activity on the target molecule, the binding sites being energetically coupled to an active site of the enzyme through an energy transfer network; quantifying a level of energetic coupling between each of the one or more potential functionally important binding sites and the active site of the target molecule; ranking each of the identified sites based on suitability as a ligand binding site and quantity of energetic coupling; selecting one or more of the identified sites having the best ranking to produce a subset of identified ligand binding sites; computationally screening a plurality of chemical compounds to determine a binding energy between each of the subset of identified sites and each of the plurality of chemical compounds; computationally modeling a derived set of chemical compounds based on strong binding compounds with different functional groups to improve the binding on the identified sites; computationally modeling the effect of the chemical compounds binding to each of the plurality
- a method of engineering a target enzyme to modify its catalytic activity comprises: accessing biophysical and dynamics data for the target enzyme; analyzing, using a computing device, the biophysical and dynamics data to identify at least one energy transfer network within the target enzyme, the at least one energy transfer network comprising a series of residues spanning from a surface of the target enzyme to a catalytic site of the target enzyme; identifying one or more residues within the energy transfer network that are not on the surface of the target enzyme; and modifying the one or more residues to produce an engineered enzyme sequence that has improved energy flow to the catalytic site as compared to the unmodified target enzyme.
- FIG. 1 depicts three schematic representations of an enzyme bound to a substrate, summarizing a current understanding of how enzymes function and their significance for developing more efficient and stable versions of enzymes.
- FIG. 2 depicts energy networks present in three different enzyme folds (cyclophilin A, dihydrofolate reductase and ribonuclease A) and biophysical properties of surface sites, identified through the conserved features of the functionally important surface network regions. These networks are conserved features of the enzyme folds and enable function of enzyme by transferring energy from the solvent to the active site.
- FIG. 3 is a schematic representation of an enzyme engineering system.
- FIG. 4 is a schematic representation of an example computing device usable to implement various aspects of the system of FIG. 3 .
- FIG. 5 is a flow diagram illustrating an example method of engineering an enzyme.
- FIG. 6 depicts the use of quasi-anharmonic analysis (QAA) to identify the conformational landscape of cyclophilin A, characterized for peptidyl-prolyl cis-trans isomerization activity.
- QAA quasi-anharmonic analysis
- FIG. 7 depicts the free energy profile associated with the peptidyl-prolyl cis-trans isomerization activity of the human cyclophilin A protein.
- the reaction coordinate is color coded as the reaction proceeds from Reactant (R) state to Product (P) state.
- the highest point of the free energy profile is defined as the transition state (T or TS).
- FIG. 8 depicts identification of the conformational changes associated with the protein (human cyclophilin A) as the protein transitions from the ground state into the functionally relevant TS′ (Level 1) or TS′′ sub-state (Level 2).
- FIG. 9 depicts identified surface loops and pathways in human cyclophilin A connecting these surface loops to active site.
- FIG. 10 depicts the unique benefit of using the approach quasi-anharmonic analysis (QAA) versus other approaches such as principal component analysis or quasi-harmonic analysis (QHA).
- QAA provides clear identification of homogeneous conformational sub-states. The results shown are from protein human ubiquitin and the coloring is based on scaled conformational (internal) energy. The conformations were obtained from molecular dynamics simulations.
- FIG. 11 is a schematic diagram illustrating energy flow within a protein as a result of a temperature gradient.
- FIG. 12 is a line chart showing hit time plotted against residue numbers for dihydrofolate reductases (DHFRs) from the four different species: Escherichia coli, Mycobacterium tuberculosis, Candida albicans and human.
- DHFRs dihydrofolate reductases
- FIG. 13 depicts the results obtained for the hit times and energy conductance values for the E. coli dihydrofolate reductase (EcDHFR).
- FIG. 14 depicts the inverse correlation between hit time and energy conductance.
- Energy conductance is a measure of a residues ability to relay energy to its neighbors.
- the network residues (with low values of hit times and high energy connectivity) from DHFR are indicated with the dashed rectangle for each of the four species Escherichia coli, Mycobacterium tuberculosis, Candida albicans and human
- FIG. 15 depicts the results of the energy conductance for dihydrofolate reductase from the four species: Escherichia coli, Mycobacterium tuberculosis, Candida albicans and human.
- the arrows indicate the flow of energy from the surface to the active site.
- FIG. 16 depicts the structures of organophosphate (OP) anticholinesterases compounds (paraoxon, PO; diisopropylfluorophosphate, DFP; echothiophate, EthP) used for investigation of engineered butyrylcholinesterase (BChE) activity.
- organophosphate (OP) anticholinesterases compounds paraoxon, PO; diisopropylfluorophosphate, DFP; echothiophate, EthP
- BChE butyrylcholinesterase
- FIG. 17 depicts the schematic of mechanism of hydrolysis of a model OP toxicant (paraoxon) by BChE.
- FIG. 18 depicts the surface loop of butyrylcholinesterase (residues 278-285) which was selected for protein engineering.
- the selected loop is in the highlighted circle.
- the protein sequence of the engineered loop is listed, with the residues inserted at the three positions highlighted.
- the 5 engineered loops (with inserts ENG, ENI, ENR, ENT and ENA) were designed.
- FIG. 19 is a line graph representing BChE backbone flexibility vs. number of protein residues inserted in the engineered loop as well as the location of the inserts.
- FIG. 20 depicts comparative resistance to inhibition of BChE G117H and loop mutants following a 10-min exposure to paraoxon (50 ⁇ M final), DFP (5 ⁇ M final) or EthP (100 ⁇ M final). Data represent the mean ⁇ standard error of the mean (SEM) of three independent replicates. An asterisk indicates a significant difference compared to BChE G117H .
- FIG. 21 depicts the recovery of activity in BChE G117H and loop mutants following exposure to 10 mM OP inhibitor (paraoxon, DFP or EthP).
- Left panel shows a representative plot of raw enzyme activity ( ⁇ A 412 ) following dilution including the reactivation and recovery phases.
- Right panel shows the mean ⁇ SEM of k 3 values.
- FIG. 22 illustrates the results of monitoring EcDHFR activity to determine the catalyzed hydride transfer rate using stopped-flow experiment. The rate is significantly decreased in presence of isopropanol.
- FIG. 23 illustrates the results of quasi-harmonic analysis associated with the EcDHFR catalyzed hydride transfer reaction in increasing isopropanol concentrations.
- the left panels show the conformational sub-states associated with the catalyzed reaction; the three middle panels show the conformational fluctuations in the enzyme structure associated with the conformational transitions into the TS sub-states.
- the right most panel shows the activation of the enzyme conformations near the TS during the catalyzed reaction.
- the present disclosure is directed to methods and systems for identifying different parts of enzyme structures that can be engineered and/or assisted by engineered technologies to improve the speed and efficiency of the catalyzed chemical reactions. More specifically, the present disclosure is directed to identifying and modifying distal surface regions that affect catalytic activity.
- the present technologies for engineering of enzymes to improve efficiency include mutations particularly in the active site and some version of the directed evolution technique.
- the enzyme mechanism is investigated to identify reaction intermediates, and these intermediates are characterized to understand the role of key residues in the active site.
- the identified residues are then mutated to improve their contribution to the catalyzed reaction through stabilization of the intermediates, thereby aiming to improve the catalytic efficiency.
- the technique of directed evolution has led to some limited successes in bio-catalyst engineering.
- directed evolution a large variety of random mutations are created through various molecular biology techniques and the resulting enzymes are screened for beneficial activity.
- Immobilization is a technique in which the natural or artificially prepared enzyme is attached to a substrate or surface such that it can be reused.
- immobilized enzymes can experience decreases in efficiency when compared to their free counterparts. The exact reasons for these decreases of such efficiency are still under debate.
- the invented technology allows preparing of enzyme molecules and selection of immobilization chemistry such that the immobilized enzymes do not go through a drastic decrease in their naturally occurring activity.
- the activity of protein function such as enzyme catalysis involves biochemical and biophysical mechanistic aspects.
- the biochemical mechanism involves interaction between the substrate molecule and residues in the active site of the enzyme.
- Enzymes function by lowering the activation energy barrier for a chemical reaction.
- the active site environment provides a complementary structural and electronic environment to the transition state of the catalyzed reaction.
- biochemical aspects of lowering the activation energy barrier for the catalyzed reaction which include electrostatic effects and providing a hydrophobic environment—which is far different from the solvent surrounded environment.
- These biochemical mechanisms are part of the “conventional view” depicted in the left panel of FIG. 1 . This is the “lock-and-key” model of a substrate fitting into a rigid enzyme.
- the new and emerging view of enzyme function involves a flexible enzyme molecule that is able to collect energy from thermodynamic fluctuations of the bulk solvent and the hydration shell solvent and direct it to the active site.
- the middle panel of FIG. 1 illustrates the energy transfer from the bulk solvent and hydration-shell to the active site.
- a large number enzymes have already been shown to contain conserved networks of residues that connect surface regions of the enzyme to the active site. Three examples are shown in FIG. 2 . These networks provide thermodynamical coupling between the hydration shell, bulk solvent, and the catalyzed reaction. On the surface of the enzyme, the function of these networks is to collect thermodynamical energy from the solvent. Therefore, these regions are highly flexible floppy loops which interact with the solvent. Next to the floppy surface loops are conserved hydrogen bonds and hydrophobic interactions which span the entire enzyme structure and eventually reach into the active site.
- thermo-dynamical coupling with the solvent does not necessarily contain conserved residue, but instead shows conservation of biophysical property.
- the biophysical property is optimization of thermo-dynamical coupling with the surrounding solvent. This is accomplished by presence of a large number of residues with long side chains. The side chains on these residues extend out into the solvent, increasing the surface area for the thermo-dynamical coupling.
- these surface regions show energy connectivity to the active site through conserved pathways of residues. The residues in these pathways or networks are connected by hydrogen bonds and hydrophobic interactions to allow the energy transfer from the surrounding solvent into the active site.
- these networks are identified that are part of these conserved networks, they can be targeted for improving the energetic coupling between the solvent and the protein residues. For example, one or mutations in these surface regions can be used to enhance the solvent-protein energy coupling, in turn improving the catalytic efficiency. In other applications, partial to complete loop engineering can be done to enable hyper-catalytic enzymes. In yet another application, the information regarding networks and the functionally important surface regions is used to design appropriate immobilization chemistry, in order to avoid degrading the function of these networks and surface regions.
- This disclosure specifically describes the identification and characterization of surface sites for enzyme engineering through identification of the enzyme energy networks and identification of conformational sub-states related to the enzyme activity.
- These energy networks can be identified through a wide variety of experimental and computational methodologies.
- the basic steps include identification of a series of hydrogen bond and hydrophobic interactions that range from the surface regions of the protein target and span into the active site, and showing direct role in enzyme activity through inspection of the conformational sub-states. In addition to recognition of these linkages, it is also important to verify the biophysical energy connectivity between the surface regions and the functioning or active site of the target.
- the identification of the surface sites can be used for carefully designing the immobilization chemistry in order to avoid damaging or hindering these sites.
- Immobilization is a process where the enzyme is chemically attached to a surface or another material, in order to facilitate reusage of the enzyme or improving the period of application. Immobilization is known to negatively impact the enzyme efficiency. As depicted in FIG. 1 , the use of incorrect chemistry that modifies and attaches the enzyme near the surface location of enzyme energy networks hinders the energy flow from the solvent to the active site, thereby impacting the enzyme's activity. The knowledge and location of residues of the enzyme energy networks would allow pre-evaluation of the suitability of the immobilization chemistry to affect these sites.
- these networks and surface sites are identified using the described methodology and computational approach, they can be ranked for the efficiency of energy flow from the solvent into the active site. This efficiency ranking reflects the ability of the surface site to modulate the enzyme activity.
- FIG. 3 illustrates a schematic diagram 100 of an enzyme engineering system 104 in communication with a computing device 102 .
- the computing device 102 can communicate with the enzyme engineering system 104 through a wired or wireless connection.
- the computing device 102 could connect to the enzyme engineering system 104 through a network such as the Internet, a local area network, or a Bluetooth connection.
- the enzyme engineering system 104 operates to analyze existing enzymes to identify energy transfer networks and modify residues within the enzyme to modify the enzyme's catalytic activity.
- the enzyme engineering system 104 includes a user interface 106 , an existing enzymes data store 108 , an energy network identification engine 110 , a surface loop modification engine 118 , an enzyme characterization engine 120 , and an engineered enzymes data store 122 .
- the user interface 106 operates to present a display of information in textual and graphical form on the computing device 102 .
- the user interface 106 receives inputs from the user U in the form of selections, text input, and data uploads.
- the input is received through an input device in communication with the computing device 102 such as a mouse, keyboard, microphone, or touchscreen.
- the visual display presents options to the user through the computing device 102 .
- the options include menus to select enzymes of interest.
- the user interface 106 also operates to present data to the user U in the form of text, images, tables, charts, graphs, and 3-D models. Inputs received at the user interface 106 can be used to manipulate the data and the displays of the data.
- the existing enzymes data store 108 operates to store information about enzymes that can be accessed by the user interface 106 and energy network identification engine 110 .
- the existing enzymes data store 108 includes names of enzymes, structural information about the enzymes, and optionally may also include data on the catalytic activity of enzymes. In some embodiments, if structural data is not available for the enzyme target then structural information from similar members, such as members of same superfamily, is included.
- the existing enzymes data store 108 stores biophysical dynamics data that is obtained by computational analysis of internal movements of the enzyme that affect its designated activity. In some embodiments, this computational analysis is performed using quasi-anharmonic analysis (QAA).
- QAM quasi-anharmonic analysis
- this computational analysis is performed using root-mean-square-fluctuations (RMSF) analysis, principal component analysis (PCA), or normal mode analysis (NMA), or time-averaged normal coordinate analysis (TANCA).
- RMSF root-mean-square-fluctuations
- PCA principal component analysis
- NMA normal mode analysis
- TANCA time-averaged normal coordinate analysis
- the biophysical dynamics data is obtained by one or more of nuclear magnetic resonance (NMR), X-ray crystallography, cryogenic electron microscopy (Cryo-EM), neutron scattering, and hydrogen-deuterium exchange.
- the energy network identification engine 110 operates to analyze biophysical dynamics data for a selected target enzyme to identify potential functionally important sites on the enzyme that are remote from the enzyme's active site. Molecular dynamics simulations are used to analyze different enzyme-substrate complex conformations. Regions of high flexibility on the surface of the enzyme are identified as well as a network of residues connecting these regions to the active site of the enzyme. In some examples, quasi-anharmonic analysis (QAA) is employed to analyze the enzyme conformations. In particular, surface loops connected to the energy networks are identified. In some embodiments, the surface loops include at least 3 residues.
- the surface loop modification engine 118 operates to identify surface loops on the surface of the enzyme that are coupled to the active site through an energy network.
- the top surface loops are analyzed by the surface loop modification engine 118 to identify the best locations within the loop to modify. This analysis takes into account effects of residues on secondary structure of the enzyme. Residues are identified within the loops that can be modified without having a detrimental effect on the secondary structure of the enzyme.
- the surface loop modification engine 118 generates new sequences of residues for the enzyme including modified surface loops.
- the resultant engineered enzymes are analyzed by the enzyme characterization engine 120 .
- the enzyme characterization engine 120 operates to examine engineered enzymes to determine if the desired effect is achieved with the modified amino acid sequence. In some embodiments, the enzyme characterization engine 120 models the engineered enzyme in complex with its substrate using molecular dynamics simulations. In some embodiments, the enzyme characterization engine 120 analyzes substrate kinetics of the engineered enzyme and substrate. In instances where multiple versions of an engineered enzyme are being examined, the enzyme characterization engine 120 can score and/or rank the potential engineered enzymes based on the modeling and kinetics.
- the engineered enzymes data store 122 operates to store information about the engineered enzymes. This information can include the results of molecular dynamics simulations and kinetics studies. In some embodiments, the amino acid sequences of the engineered enzymes are stored in the engineered enzymes data store 122 . This data store can be accessed by the user interface 106 to display information about the engineered enzyme on the computing device 102 . In some embodiments, external validation systems can access the engineered enzymes data store 122 to utilize the information. This information can be used, for example, to construct samples of the engineered enzymes for use in wet lab studies.
- FIG. 4 is a block diagram illustrating an example of the physical components of a computing device 400 .
- the computing device 400 could be any computing device utilized in conjunction with the enzyme engineering system 104 such as the computing device 102 of FIG. 3 .
- the computing device 400 includes at least one central processing unit (“CPU”) 402 , a system memory 408 , and a system bus 422 that couples the system memory 408 to the CPU 402 .
- the system memory 408 includes a random-access memory (“RAM”) 410 and a read-only memory (“ROM”) 412 .
- RAM random-access memory
- ROM read-only memory
- the computing system 400 further includes a mass storage device 414 .
- the mass storage device 414 is able to store software instructions and data such as instructions to implement the energy network identification engine 110 or surface loop modification engine 118 .
- the mass storage device 414 is connected to the CPU 402 through a mass storage controller (not shown) connected to the system bus 422 .
- the mass storage device 414 and its associated computer-readable storage media provide non-volatile, non-transitory data storage for the computing device 400 .
- computer-readable storage media can include any available tangible, physical device or article of manufacture from which the CPU 402 can read data and/or instructions.
- the computer-readable storage media comprises entirely non-transitory media.
- Computer-readable storage media include volatile and non-volatile, removable and non-removable media implemented in any method or technology for storage of information such as computer-readable software instructions, data structures, program modules or other data.
- Example types of computer-readable data storage media include, but are not limited to, RAM, ROM, EPROM, EEPROM, flash memory or other solid state memory technology, CD-ROMs, digital versatile discs (“DVDs”), other optical storage media, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store the desired information and which can be accessed by the computing device 400 .
- the computing device 400 can operate in a networked environment using logical connections to remote network devices through a network 421 , such as a wireless network, the Internet, or another type of network.
- the computing device 400 may connect to the network 421 through a network interface unit 404 connected to the system bus 422 . It should be appreciated that the network interface unit 404 may also be utilized to connect to other types of networks and remote computing systems.
- the computing device 400 also includes an input/output controller 406 for receiving and processing input from a number of other devices, including a touch user interface display screen, or another type of input device. Similarly, the input/output controller 406 may provide output to a touch user interface display screen or other type of output device.
- the mass storage device 414 and the RAM 410 of the computing device 400 can store software instructions and data.
- the software instructions include an operating system 418 suitable for controlling the operation of the computing device 400 .
- the mass storage device 414 and/or the RAM 410 also store software instructions, that when executed by the CPU 402 , cause the computing device 400 to provide the functionality discussed in this document.
- the mass storage device 414 and/or the RAM 410 can store software instructions that, when executed by the CPU 402 , cause the computing system 400 to implement the method 500 described in FIG. 5 .
- FIG. 5 is a flow chart of an example method 500 of engineering an enzyme.
- enzymes are modified to enhance their catalytic activity.
- enzymes are modified to inhibit their catalytic activity.
- This method could be performed by the enzyme engineering system 104 of FIG. 3 .
- Other computing systems could be utilized to perform the operations of the method 500 .
- one or more operations are performed without the aid of a computing device.
- a user interface selecting a target enzyme.
- the user interface could be the user interface 106 displayed on the computing device 102 operated by the user U.
- the input could be received by means such as a stylus, a keyboard, a touchscreen, voice input, or a mouse.
- the target enzyme could be selected from a list or input manually with text or chemical structure input.
- the target enzyme has a particular catalytic activity of interest. In some embodiments, the target enzyme catalyzes a reaction that is useful for industrial purposes, a medical application or a bench-scale application.
- biophysical and dynamics data for the target enzyme is accessed.
- the biophysical and dynamics data was previously generated and stored in a data store such as the existing enzymes data store 108 .
- the biophysical and dynamics data is obtained by computational analysis of internal movements of the target enzymes that affect its catalytic activity.
- biophysical and dynamics data can be accessed directly from the source of the data, such as computer simulations, experimental techniques and previously published literature.
- the biophysical and dynamics data is analyzed to identify energy transfer networks within the target enzyme.
- An energy transfer network is a series of residues spanning from a surface of the target enzyme to a catalytic site of the target enzyme.
- Various techniques can be used to analyze protein dynamics including calculating the root-mean-square-fluctuations (RMSF), principal component analysis (PCA), normal mode analysis (NMA), time-averaged normal coordinate analysis (TANCA), quasi-harmonic analysis (QHA), Gaussian network model (GNM), and quasi-anharmonic analysis (QAA). Different sub-states of the molecule's conformations are evaluated to identify regions of high flexibility on the molecule.
- a surface loop within the energy transfer network is identified.
- the surface loop includes residues showing dynamical motions that are coupled to the catalytic activity of the target enzyme.
- the secondary structure of the target enzyme is analyzed to identify one or more locations within the surface loop that can be modified.
- the surface loop comprises at least 3 residues. In other embodiments, the surface loop comprises at least 5, at least 6, at least 7, at least 8, or at least 9 residues.
- the residues within the surface loop are modified to produce an engineered enzyme.
- the residues are modified to inhibit the catalytic activity of the target enzyme.
- the residues are modified to enhance the catalytic activity of the target enzyme.
- at least one residue is inserted into the surface loop.
- at least one residue is deleted from the surface loop.
- One or more residues may also be substituted to change the secondary structure of the loop.
- the engineered enzyme is computationally modeled in complex with its substrate using molecular dynamics simulations.
- the simulations are used to perform one of root-mean-square-fluctuations (RMSF) analysis, normal mode analysis, (NMA), time-averaged normal coordinate analysis (TANCA), quasi-harmonic analysis (QHA), and quasi-anharmonic analysis (QAA).
- RMSF root-mean-square-fluctuations
- NMA normal mode analysis
- TANCA time-averaged normal coordinate analysis
- QHA quasi-harmonic analysis
- QAA quasi-anharmonic analysis
- the impact of the engineered enzyme with its substrate are analyzed.
- the impact on conformational sub-states and energy coupling is analyzed.
- the amino acid sequence of the engineered enzyme is stored in a data store.
- the amino acid sequence of the engineered enzyme is output on a user interface of a computing device.
- this is the user interface 106 of the enzyme engineering system 104 that is presented on a display of the computing device 102 in FIG. 3 .
- the engineered enzyme is constructed using the updated protein sequence of the enzyme. In some embodiments, this is performed by transfecting cells with nucleic acids encoding the engineered enzyme and purifying the engineered enzyme after it is produced by the cells. Once obtained in sufficient quantity, the engineered enzymes can be validated through purification and wet lab experiments. In some embodiments one or more wet lab techniques from spectrophotometric assays, radiometric assays, mass spectrometry, multiple injection method studies, isothermal titration calorimetry studies, stopped-flow kinetics, steady state kinetics, and pre-steady state kinetics measurements are used to evaluate the enzyme activity.
- Protein dynamics or vibrational modes of a protein are modeled by a set of displacement vectors for each atom in the protein.
- Several methods are available to compute the vibrational modes of a molecular system such as a protein. A list of suitable techniques for use with the methods and systems described in this disclosure to compute the protein vibrational modes are discussed below.
- the time-scale of molecular vibration is determined by taking the inverse square root of the eigenvalues ( ⁇ ) obtained after diagonalization.
- the eigenvector ( ⁇ ) corresponding to the eigenvalue represents the vibrational modes, which are a set of displacement vectors for the atoms in the molecular or the protein conformation.
- NMA This method is similar to NMA in the sense that the protein vibrational modes are obtained by diagonalization of the Hessian matrix.
- NMA suffers from some limitation when considering a highly flexible molecular system such as a protein.
- NMA uses a reference structure and the eigenvalues and eigenvectors thus obtained are only relevant to the reference starting structure. The method therefore weights highly toward the high frequency motions and less toward the low frequency motions.
- the low frequency obtained using NMA are not reliable for molecular conformations that differ considerably from the reference structure for NMA.
- protein function such as enzyme catalysis, the low frequency motions are more important as they are required for overcoming the high energy barriers.
- TANCA TANCA
- This method computes protein vibrational modes from a set of protein conformations that are sampled using either the molecular dynamics (or Monte-Carlo) type simulations.
- QHA is a powerful method in obtaining vibrational modes that are representative of longer time-scales or the low frequency vibrational, by utilizing the information from a set of structures which may be separated by a long-time scale—or from different parts of the protein conformational space.
- the vibrational modes are obtained by diagonalization of the atomic fluctuation matrix.
- Quantities in ⁇ > denote an average determined from molecular dynamics simulation.
- eigenmodes (vibrational modes) diagonalization of the atomic fluctuation matrix is performed (see Eq. 2).
- the time-scale of protein vibration is determined by taking the inverse square root of the eigenvalues ( ⁇ ) obtained after diagonalization.
- the eigenvector ( ⁇ ) corresponding to the eigenvalue represents the protein vibrational modes, which are a set of displacement vectors for the atoms in the protein confirmation.
- QHA Gaussian Network Model
- the protein motions at different time-scales correspond to kinetic energy to allow overcoming of a variety of energy barriers encountered by the protein in the functional landscape.
- the fastest motions in the range of 10 ⁇ 15 seconds to 10 ⁇ 12 seconds overcome small energy barriers.
- the intermediate motions in the range of 10 ⁇ 12 seconds to 10 ⁇ 6 seconds are important for overcoming medium height barriers.
- the slowest motions at scale of 10 ⁇ 6 seconds and slower are important for overcoming large energy barriers.
- vibrational modes associated with the slowest motions can find conformations within the conformational energy landscape that are separated by high energy barriers.
- the vibrational modes associated with the intermediate and faster motions are useful for inducing more local conformational changes, like the movement of a loop region, an alpha helix, or the side chains of individual residues.
- Accurate estimates of energy associated with the protein vibrational modes is currently not available, however, the lower estimations would correspond to energy associated with vibrations of individual bonds ( ⁇ 1 kcal/mol) and on the higher end correspond to movement of entire protein domains (5-10 kcal/mol or higher).
- This relatively new methodology allows automated discovery of a hierarchy of sub-states associated with the conformational ensemble of proteins. Utilizing atomistic level MD simulations of proteins or protein in association with other molecules (such as binding partners or enzyme-substrate complex) as input, this methodology pays close attention to the anharmonic nature of internal protein motions and pursues the higher-order statistics of the internal motions.
- One of the most important advantages of this approach is that it allows clean separation between the conformational sub-states, by projecting the conformations sampled during the molecular dynamics simulations in a lower dimensional space represented by QAA vectors. Characterization of the populations in these sub-states for any relevant properties (such as internal energy, distance order parameter, or reaction coordinate) allows the detailed characterization.
- the motions associated within the sub-states and inter-conversion between the sub-states provide new insights into the inter-relationship between protein structure, motions and function.
- QAA allows characterization of conformational sub-states along a reaction pathway.
- a hierarchical description of the sub-states along the reaction pathway identifies sub-states with structural and dynamical features critical for attainment of the transition state.
- QAA was successfully used to organize the complex conformational landscape of the peptidyl-prolyl cis/trans isomerization (PPIase) activity catalyzed by human enzyme cyclophilin A (CypA), into a multi-scale hierarchy of conformational sub-states.
- PPIase peptidyl-prolyl cis/trans isomerization
- CypA cyclophilin A
- the conformational landscape associated with the PPIase activity was sampled using molecular dynamics in combination with umbrella sampling with the isomerized amide bond as reaction coordinate. Over 100,000 conformations collected from the MD simulations were analyzed using QAA.
- FIG. 6 shows the results of QAA analysis.
- the first round of QAA shows decomposition of the landscape into Reactant (R), Product (P) and Transition State (TS′) sub-states.
- the TS′ here is close to but does not correspond to the transition state (TS), which is defined as the highest point on the free energy profile (see FIG. 7 ). Further analysis of the remaining (mixed) conformations were required to identify the values closer to the true transition state (TS′′). Further analysis for Level 2 decomposition is performed by taking the mixed conformation sub-state and repeating QAA only on this sub-state.
- FIG. 7 depicts the free energy profile associated with the peptidyl-prolyl cis-trans isomerization activity of the human cyclophilin A protein.
- the amide bond is rotated as the reaction proceeds.
- the reaction coordinate in this analysis was defined as a value equal to 180—the value of amide bond dihedral angle.
- the reaction coordinate is color coded as the reaction proceeds from Reactant (R) state to Product (P) state.
- the highest point of the free energy profile is defined as the transition state (T or TS).
- FIG. 8 depicts identification of the conformational changes associated with the protein (human cyclophilin A) as the protein transitions from the ground state into the functionally relevant TS′ or TS′′ sub-state.
- the conformational change is depicted in a movie like fashion with only the protein backbone shown.
- the regions colored are the regions showing largest changes in the conformations. This conformational change corresponds to the black arrows in FIG. 6 , as the protein transitions from the central sub-states into the TS′ (for Level 1) and TS′′ (for Level 2) respectively.
- QAA allowed identification of independent component analysis (ICA) vectors or modes.
- ICA independent component analysis
- the projection of enzyme-substrate complex conformations on the first three dominant QAA modes allows identification of clusters or sub-states (labeled I, II, III and IV in FIG. 6 ). These sub-states can be characterized for the region of reaction pathway they correspond to by inspecting the reaction coordinate ( FIG. 7 ): reactant, product or near the TS (TS′/TS′′).
- the QAA analysis can be repeated in a multi-level fashion ( FIG. 6 ).
- CypA two levels provide qualitative insights as sub-states near TS (the green/yellow colored sub-states TS′ and TS′′) were identified.
- Protein fluctuations within the sub-states or between the sub-states can be obtained by comparing the changes in protein conformations.
- the dark black arrows in the figure (both at level 1 and 2) indicate conformational fluctuations associated with enzyme-substrate complex visiting the sub-states TS′ and TS′′ ( FIG. 8 ).
- FIG. 9 depicts identified surface loops and pathways in human cyclophilin A connecting these surface loops to active site.
- FIG. 10 depicts the unique benefit of using the approach quasi-anharmonic analysis versus other approaches such as principal component analysis or quasi-harmonic analysis.
- the same conformational landscape when subjected to the analysis based on these different method gives different results.
- the QAA approach identifies the correct conformational sub-states of high and low energy while the other approaches do not provide clear separation into homogenous sub-states.
- the results shown are from protein human ubiquitin and the coloring is based on scaled conformational (internal) energy. The conformations were obtained from molecular dynamics simulations.
- the enzyme fluctuations that allow visiting the TS′ and TS′′ correspond to the motions of previously identified network promoting catalysis ( FIG. 9 ).
- Detailed characterization of the motions that lead to the sub-state (TS′/TS′′) identified regions of high flexibility on the surface and the network of residues connecting these regions to the active site ( FIG. 9 ).
- the list of network residues is similar to the results obtained from different techniques including NMR spectroscopy and other computational results.
- QAA allows the identification of the conformational sub-states associated with enzyme activity of a protein.
- Detailed analysis of the functionally relevant conformational sub-states allows the identification of rate limiting conformational transitions that allow access into these sub-states.
- Further structural analysis of these sub-states and conformational transitions allows the identification of surface regions of the enzyme with large dynamical motions on the surface, and the network of residues that connect these surface regions to the active site.
- the surface sites can be assigned a score for their ability to control the target activity. Based on the assigned score, the ranking of these surface sites could serve as an important metric in enzyme engineering efforts. Engineering a surface site with little or no impact on protein activity would lead to loss of time and effort. Therefore, it is important to assign a score or ranking of each discovered surface site for their impact on enzyme activity. In other efforts, already discovered target engineering sites can also be assigned an energy transfer or coupling score as a metric for its ability to control target activity.
- an information theoretic approach can be used in combination with molecular dynamics simulations.
- the ability of protein residues (or atoms) to participate in energy flow is investigated by observing its ability to receive and pass a “signal” to its neighbors.
- the localized neighbors can include solvent (if the atom is exposed to the solvent), ions, and other protein atoms.
- the expected time for a “signal” to propagate from one residue (for example located on the surface of the protein) to another residue (for example located in the active site) can be computed.
- this expected time of signal propagation within a network is investigated using hit times, which can also be used to estimate how a protein may communicate with its neighbors. The methodology to compute hit times has been previously described elsewhere.
- conductance of energy flow along various atom-atom contacts is calculated using data from molecular dynamics simulations.
- Molecular dynamics simulations provide atomic coordinates and velocities associated with each atom as the systems evolves. Conductance of energy is calculated using the coordinates and velocity information.
- a constant system temperature is maintained throughout the system.
- a gradient of energy is created between two different parts of the system as depicted in FIG. 11 .
- the simulations can be performed at temperature T A while the active site is kept at a lower temperature T B .
- T A a temperature of T A is used for the simulation of enzyme assembling including the solvent
- T B the active site.
- the lowering of temperature is performed by reassigning the atomic velocities at a predefined number of steps. This results in creating an energy gradient and inducing a result energy flow into the active site.
- the heat transmission frequency w ij between atoms i and j is derived from the corresponding element of the covariance matrix of the mass-weighted atomic displacements from the average conformation
- the energy conductance of residue R is defined as the sum of all heat fluxes entering and exiting the residue
- Residues with a high energy conductance are thus important in channeling the energy flowing from the solvent towards the reaction center where the energy-sink atoms are located.
- the obtained information of energy conductance is used two ways for surface site of energy channels to be scored and ranked.
- the values of C R associated with each network residue are tabulated.
- each network is assigned an overall score and ranking; with ranks assigned based on scores ordered from largest to smallest.
- the network pathway score in its simplest form is an average of C R values of the network residues; therefore a network consisting of residues with high C R values will show higher score of energy transfer efficiency.
- a multiplicative score is used to assign the overall score for the network (C R values of all network residues are multiplied).
- residues showing high C R values are connected and inspected further for the existence of the network and surface sites.
- NADP + binding proteins play important roles in cellular redox reactions.
- a number of these proteins have been investigated widely for presence of protein residue networks associated with their activity. These include investigations of enzyme dihydrofolate reductase (DHFR) from Escherichia coli (EcDHFR), Mycobacterium tuberculosis (MtDHFR), Candida albicans (CaDHFR) and humans (hDHFR). These enzymes catalyze the transfer of hydride between the NADP + cofactor and a substrate. conserveed networks have been identified in each of these enzymes. To investigate the energy flow and conductance along the networks, a combination of MD simulations was used with the developed methodology.
- MD simulations were carried out near equilibrium in a quasi-steady state. Every 1 picosecond (ps), velocities from a subset of atoms in the immediacy of the hydride transfer reaction site were rescaled such that the total kinetic energy is decreased by 9.552 kcal/mol. These atoms are termed the energy-sink atoms.
- the selected amount of energy extracted is such that would result in an immediate cooling of the energy-sink atoms to a final temperature of 100 K, provided the atoms were at an initial temperature of 300K. This energy decrease is high enough to provide a signal above thermal noise.
- the extracted energy is immediately transferred to the solvent by rescaling solvent atom velocities.
- a temperature gradient within the protein is formed such that the energy-sink atoms have an average temperature of about 100 K, while the protein atoms far from the energy-sink atoms and in contact with the solvent have an average temperature of about 300 K.
- Results indicate that the residues which are most suitable for long range energy connectivity are conserved across evolution. These 4 species share only 30% sequence similarity. It is important to note that the simulations used for computing energy connectivity were performed in non-equilibrium conditions as an artificial energy gradient was introduced by cooling the active site. The simulations used in the informatics approach were performed under equilibrium conditions, with no temperature gradient or any other constraints on the systems. Therefore, both of these approaches for identifying the same residues provides confidence in the ability to characterize the residues for long range connectivity in proteins.
- the results from the informatics and the biophysical approach can be summarized as follows.
- the enzyme structure shows presence of conserved residues, all the way from surface to the active site, that provides channels of long-range connectivity.
- the flow of energy from the surrounding solvent on the enzyme surface through these channels into the active site is the mechanism of the long-range connectivity. It is hypothesized that this energy is required to overcome the activation energy barrier in the active site.
- the residues that form these conserved channels have been indicated to be part of conserved networks.
- organophosphorus (OPs) anticholinesterases have been sought for decades.
- OPs are commonly used as insecticides.
- the OP nerve agents such as sarin (2-[fluoro(methyl)phosphoryl]oxypropane) also act as potent anticholinesterases and are some of the most toxic synthetic chemicals ever created.
- Bioscavengers for OP anticholinesterases are proteins that bind to and inactivate the toxicants in the circulation before they can reach systemic target organs to elicit cholinergic toxicity.
- bioscavengers There are two types of bioscavengers: stoichiometric and catalytic. While stoichiometric scavengers bind OP molecules in a near 1:1 ratio, catalytic scavengers can reactivate following phosphylation by the OP, making them potentially much more efficient.
- Enzymes which catalyze OPs naturally e.g. organophosphorus acid anhydrolase, paraoxonase, phosphotriesterase
- OPs interact with BChE in a manner similar to choline esters.
- a Michaelis complex is initially formed (described as the ratio of association vs dissociation, k 1 /k ⁇ 1 ) which rapidly progresses to acylation (in the case of choline ester) or phosphylation (in the case of an OP, k 2 ).
- the subsequent kinetic step (k 3 ) is markedly different between choline and OP substrates.
- deacylation a histidine-bound water molecule rapidly displaces the acyl group from the catalytic serine residue in the active site.
- the histidine-bound water molecule is sterically hindered from displacing the phosphorus atom, thus it remains covalently attached to the active site serine. While rapid deacylation with a choline substrate allows return of the enzyme to an effective catalyst, the slow dephosphylation step leaves the catalytic reaction blocked, leading to accumulation of the endogenous signal acetylcholine leading to cholinergic toxicity.
- catalytic rate is affected by thermodynamical coupling of the hydration-shell, the bulk solvent, and the catalyzed reaction.
- networks of conserved residues have been discovered that span from the surface of the protein to the active site region, effectively coupling with the catalytic reaction mechanism.
- enhancing energy flow through these networks may be a general approach for increasing enzyme-mediated catalysis.
- FIG. 16 shows the chemical structures of the three OPs studied.
- Paraoxon O,O′-diethyl-p-nitrophenyl phosphate
- 98.6% purity was purchased from ChemService (West Chester, Pa.).
- a 10 mM stock solution of paraoxon was prepared in 100% dry ethanol and kept desiccated under nitrogen at ⁇ 80° C. until use.
- Molecular dynamics (MD) simulations were performed to model native and engineered BChE G117H in complex with OP (PO, EthP and DFP) and choline (butyrylcholine, BCh) substrates in explicit water solvent.
- OP PO, EthP and DFP
- choline butyrylcholine, BCh
- Model preparation and simulations were performed using the AMBER v14 suite of programs for biomolecular simulations (ambermd.org).
- AMBER's ff14SB force-fields were used for all simulations.
- the parameters for the substrates were obtained using the protocol described in the AMBER manual.
- MD simulations were performed using NVIDIA graphical processing units (GPUs) and AMBER's pmemd.cuda simulation engine.
- a total of twenty-four separate simulations were performed, based on the combination of native and engineered versions of BChE G117H in complex with PO, DFP and EthP, as well as with BCh.
- the enzyme was modeled based on the coordinates available in the protein data bank (PDB ID: 4BDS).
- the substrates were modeled based on the diester substrate coordinates for acetylcholine coordinates (PDB ID: 2ACE), the template for the diester bond was used and the remaining crystal structure and the atoms were added by AMBER's leap program based on the substrate template developed using AMBER's protocol.
- the equilibrated systems were then used to run 200 nanoseconds (ns) of production MD under constant energy conditions (NVE ensemble).
- the production simulations were performed at a temperature of 300 K and a time-step of 2 femtoseconds (with SHAKE algorithm applied to bonds and angles involving hydrogens).
- NVE ensemble was used for production runs, these values correspond to initial temperature at start of simulations.
- Temperature adjusting thermostat was not used in simulations; over the course of 200 ns simulations the temperature fluctuated around 300 K with RMS fluctuations between 2-4 K, which is typical for well-equilibrated systems.
- a total of 10,000 conformational snapshots (stored every 20 ps) collected for each system was used for analysis.
- Standard MD simulations (described above) can only model fixed electronic states, and thus chemical reactions cannot be modeled.
- a simple 3-state model was used to characterize the protein dynamics associated with the hydrolysis step. The reaction was modeled using a product, transition state and product state. The transition state was modeled based on the previous acetylcholinesterase studies. Three separate MD simulations (based on the protocol described above) were performed for each of these states. However, the production run for these simulations was 100 nanoseconds. These simulations were only performed for G117H with paraoxon as a representative OP substrate.
- RMSF 10 root-mean-square-fluctuations for top 10 quasi-harmonic modes
- E el is the electrostatic contribution
- E vdw is the van der Waals term and the summation runs over all atom pairs for the protein-substrate complex.
- the E el and E vdw terms were computed as follows,
- ⁇ ⁇ ( r ij ) A + B 1 + k ⁇ exp ⁇ ( - ⁇ ⁇ B ⁇ r ij ) ( Eq . ⁇ 10 )
- VNK was incorporated in the primer to encode a wild card residue (A, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, or V).
- FIG. 18 shows the surface loop in BChE (residues 276-283) in the white circle. Thin blue lines indicate rigid areas while green to red areas with thicker tubes correspond to regions displaying large conformational fluctuations. Red residues are residues inserted into the loop sequence. In addition to E and N as the first two insertions, five different residues were considered at the third insertions site.
- FIG. 19 shows results of computational modeling to characterize the effect of increasing the length of the engineered loop on dynamics.
- one (E), two (E, N) and three (E, N, I) residues were inserted in the positions described above.
- Computational modeling based on MD simulations indicated that the dynamical flexibility increased markedly with insertion of three residues, while inserting only one led to a slight increase and inserting two showed a slight decrease.
- Table 1 shows the interaction energies from conformational snapshots sampled during the MD simulations, and estimated using a sum of electrostatics and van der Waals interactions between the substrate and enzyme residues.
- the MD trajectories were also analyzed for enzyme-substrate interaction behavior.
- Enzyme activity with the choline substrate was evaluated using a modified Ellman method using butyrylthiocholine (BTCh) as the substrate.
- the final concentration of BTCh was varied between 10 ⁇ M and 1 mM and change in absorbance was monitored at 412 nm for 5 min at 37° C. using a Spectramax M2 microplate reader (Molecular Devices; Sunnyvale, Calif.).
- BChE G117H and the loop mutants to inhibition by the OPs were evaluated.
- Working enzyme solutions were prepared prior to assay (2 ⁇ g/ml in 100 mM potassium phosphate buffer, pH 7.0).
- Twenty ng of BChE G117H or one of the loop mutants was added to either vehicle or paraoxon (50 ⁇ M), diisopropylfluorophosphate (5 ⁇ M) or echothiophate (100 ⁇ M) and allowed to pre-incubate for 10 min in a 96-well plate containing buffer (100 mM potassium phosphate, pH 7.0) and DTNB (0.5 mM).
- 20 ⁇ l of BTCh (1 mM final concentration) was added to begin the reaction and enzyme activity was assayed as described above and plotted as percent of the respective vehicle control over time.
- the relative rate of dephosphylation (k 3 ), a measure of an enzyme's ability to reactivate after inhibition, can be used as an indicator of the catalytic rate.
- an equal volume of enzyme (20 ⁇ g/ml in 100 mM potassium phosphate buffer, pH 7.0) and OP (10 mM paraoxon or echothiophate) were pre-incubated for 1 min to achieve saturation of all enzyme active sites.
- OP mM paraoxon or echothiophate
- a 5 ⁇ l sample was removed, rapidly diluted 400-fold into a cuvette containing the reaction components (0.5 mM DTNB and 1 mM BTCh in 100 mM potassium phosphate buffer, pH 7.0) and gently mixed. This large dilution reduces the final OP concentration to below its K M , allowing dissociation of any OP molecules from the reversible Michaelis complexes.
- Hydrolysis of the OP substrate paraoxon was determined using a direct photometric assay.
- BChE G117H and the loop mutants were diluted to 0.06 mg/ml using PBS.
- a stock solution of paraoxon (in dry ethanol) was diluted on the day of assay and added to the reaction in final concentrations between 10-500 ⁇ M (with 5% ethanol included in all reactions).
- the final reaction volume of 100 ⁇ l contained 7.5 ⁇ l diluted enzyme and 72.5 ⁇ l 100 mM and 50 mM potassium phosphate buffer (pH 7.0), with the reaction being initiated by adding a 20 ⁇ l aliquot of paraoxon.
- the p-nitrophenol produced by hydrolysis of paraoxon was monitored at 405 nm for 1 h at 37° C.
- Kinetic parameters were determined using non-linear regression with the Michaelis-Menten equation.
- the discovered connection between the enzyme surface residues/loop regions and solvent through the thermo-dynamical coupling can be used to modulate the enzyme activity.
- using the knowledge of network residues and location of the functionally important dynamical surface sites and their functional groups that interact with the solvent can be used to select solvent conditions to alter the energy coupling.
- FIG. 22 shows the results of monitoring DHFR with stopped-flow kinetics to determine the hydride transfer rate.
- the pH independent rate for hydride transfer is 1170 ⁇ 10 s ⁇ 1 (black curve) for 0% isopropanol (buffer only), while it is 530 ⁇ 10 s ⁇ 1 with 20% isopropanol.
- Use of binary mixture of isopropanol and water indicated that the hydride transfer by E. coli DHFR under rate limiting conditions (measure by steady state kinetics) shows 2.2-fold decrease, at 20% isopropanol concentration.
- thermo-dynamical coupling between the surface regions (in functionally important areas) and solvent.
- Such changes in the thermo-dynamical coupling will also reflect in the changes in the conformational sub-states populations related to the functional states, such as the transition state.
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Genetics & Genomics (AREA)
- Wood Science & Technology (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Biotechnology (AREA)
- Physics & Mathematics (AREA)
- Molecular Biology (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Medicinal Chemistry (AREA)
- Microbiology (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Evolutionary Biology (AREA)
- Theoretical Computer Science (AREA)
- Medical Informatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Crystallography & Structural Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Analytical Chemistry (AREA)
- Physiology (AREA)
- Pharmacology & Pharmacy (AREA)
- Immunology (AREA)
- Enzymes And Modification Thereof (AREA)
- Apparatus Associated With Microorganisms And Enzymes (AREA)
Abstract
Description
εi F=ε iωi (Eq. 1)
F αβ = m α 1/2(x α − a α )m β 1/2(x β − x β ) (Eq. 2)
where α,β run through the 3N degrees of freedom in Cartesian space and mα is the mass of atom corresponding to the αth degree of freedom and xα are the Cartesian coordinates of the atom corresponding to the αth degree of freedom. Quantities in <> denote an average determined from molecular dynamics simulation. To obtain the eigenmodes (vibrational modes) diagonalization of the atomic fluctuation matrix is performed (see Eq. 2). The time-scale of protein vibration is determined by taking the inverse square root of the eigenvalues (ε) obtained after diagonalization. The eigenvector (ω) corresponding to the eigenvalue represents the protein vibrational modes, which are a set of displacement vectors for the atoms in the protein confirmation. Note one of the benefits of QHA is that multiple MD trajectories can be combined to construct the atomic fluctuation matrix—thus allowing vibrational modes to be computed that represent conformational changes between different areas of the protein conformational space.
Gaussian Network Model (GNM):
r ij NAT<σi+σi+1/2(δr i +δr i) (Eq. 3),
where σi is the van der Waals radius of atom i and δri is the standard deviation for the distribution of displacements of atom i from the average conformation. The heat flux flowing from atom i to atom j can then be estimated with an expression analog to Fourier's law,
J i→j =w ij k B(T i −T j) (Eq. 4).
3/2k B T i=1/2m i v i 2 (Eq. 5).
Σpro-subs=Σ(E el +E vdw) (Eq. 9)
where qi are partial charges, and Aij, Bij are Lennard-Jones parameters. These parameters were obtained from the AMBER force field. A distance-dependent dielectric function was used:
B=εo−A; εo=78.4 for water; A=−8.5525; λ=0.003627 and k=7.7839.
| TABLE 1 |
| Enzyme-substrate interaction energy obtained from computations. Averaged |
| interaction energies (in kcal/mol) between substrate and full enzyme (all |
| residues) and only the catalytic triad (residues H117, S198, and H438). |
| BCh | paraoxon | DFP | echothiophate | |
| (174.262a) | (275.195a) | (184.146a) | (383.228a) |
| enzyme | triad | enzyme | triad | enzyme | triad | enzyme | triad | ||
| G117H | −34.84 | −8.45 | −35.36 | −2.41 | −10.29 | −0.43 | −33.96 | −4.98 |
| ENI | −32.34 | −8.40 | −38.86 | −9.17b | −23.05 | −3.15 | −39.61 | −9.62b |
| ENG | −31.50 | −8.42 | −37.88 | −7.91b | −10.73 | −1.49 | −35.59 | −7.29b |
| ENA | −31.26 | −8.14 | −33.62 | −1.36 | −24.34 | −4.87 | −39.99 | −10.17b |
| ENT | −31.48 | −8.21 | −35.89 | −5.03 | −21.83 | −3.29 | −39.45 | −6.74b |
| ENR | −33.13 | −9.22 | −34.08 | −1.78 | −24.77 | −4.52 | −36.15 | −8.46b |
| amolecular weight, | ||||||||
| bindicates interaction ≥ in strength to native substrate. | ||||||||
| TABLE 2 |
| Comparison of substrate kinetics parameters of BChEG117H |
| and loop-mutants using butyrylthiocholine as the substrate |
| and analysis using the Michaelis-Menten equation. Values |
| are reported as mean ± SEM. An asterisk indicates a |
| significant difference compared to BChEG117H. |
| kcat a | KM | kcat/KM | ||
| (min−1) | (mM) | (min−1 M−1) | ||
| G117H | 13884 ± 374 | 1.21 ± 0.06 | 11.5 × 106 | ||
| +ENI | 2207 ± 76* | 1.7 ± 0.10 | 1.3 × 106 | ||
| +ENA | 4024 ± 204* | 1.68 ± 0.14 | 2.4 × 106 | ||
| +ENG | 8363 ± 1280* | 2.66 ± 0.60* | 3.1 × 106 | ||
| +ENR | 2568 ± 359* | 3.68 ± 0.71* | 0.7 × 106 | ||
| +ENT | 5976 ± 251* | 1.84 ± 0.13 | 3.2 × 106 | ||
Resistance to Inhibition Assay
| TABLE 3 |
| Comparison of the substrate kinetics parameters of BChEG117H and |
| loop mutants using paraoxon as the substrate. Kinetic analysis |
| was determined using non-linear regression with the classic Michaelis- |
| Menten equation. Values are reported as mean ± SEM. An asterisk |
| indicates a significant difference compared to BChEG117H. |
| kcat a, b | KM b | kcat/KM b | ||
| (min−1) | (μM) | (min−1 M−1) | ||
| G117H | 0.244 ± 0.03 | 107.6 ± 28.1 | 2.2 × 103 | ||
| ENI | 0.071 ± 0.003* | 75.82 ± 9.3 | 0.9 × 103 | ||
| ENA | 0.078 ± 0.004* | 19.72 ± 5.3* | 3.9 × 103 | ||
| ENG | 0.089 ± 0.005* | 24.23 ± 6.0* | 3.6 × 103 | ||
| ENR | 0.092 ± 0.005* | 77.62 ± 12.2 | 1.2 × 103 | ||
| ENT | 0.076 ± 0.005* | 23.59 ± 6.1* | 3.2 × 103 | ||
| akcat = Vmax/[enzyme active site] | |||||
Statistical Analyses
Claims (27)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US17/068,371 US12340872B2 (en) | 2019-10-11 | 2020-10-12 | Systems and methods for engineering enzymes using identified energy transfer networks |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201962914151P | 2019-10-11 | 2019-10-11 | |
| US17/068,371 US12340872B2 (en) | 2019-10-11 | 2020-10-12 | Systems and methods for engineering enzymes using identified energy transfer networks |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| US20210110884A1 US20210110884A1 (en) | 2021-04-15 |
| US12340872B2 true US12340872B2 (en) | 2025-06-24 |
Family
ID=75383793
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US17/068,371 Active 2043-03-13 US12340872B2 (en) | 2019-10-11 | 2020-10-12 | Systems and methods for engineering enzymes using identified energy transfer networks |
Country Status (1)
| Country | Link |
|---|---|
| US (1) | US12340872B2 (en) |
Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6553317B1 (en) * | 1997-03-05 | 2003-04-22 | Incyte Pharmaceuticals, Inc. | Relational database and system for storing information relating to biomolecular sequences and reagents |
| US20030087306A1 (en) | 1998-06-08 | 2003-05-08 | Christensen Burton G. | Methods for identifying novel multimeric agents that modulate receptors |
| WO2009098596A2 (en) * | 2008-02-05 | 2009-08-13 | Zymeworks Inc. | Methods for determining correlated residues in a protein or other biopolymer using molecular dynamics |
| WO2013003358A2 (en) | 2011-06-29 | 2013-01-03 | Janssen Pharmaceutica Nv | Methods for designing, selecting and/or optimizing allosteric processing inhibitors for matrix metalloproteinases |
| US8417461B2 (en) | 2007-10-05 | 2013-04-09 | Ut-Battelle, Llc | Identification and modification of dynamical regions in proteins for alteration of enzyme catalytic effect |
| US20170356024A1 (en) | 2014-11-04 | 2017-12-14 | Brandeis University | Biophysical platform for drug development based on energy landscape |
| US11621053B2 (en) | 2019-10-11 | 2023-04-04 | Arium BioLabs LLC | Computational systems and methods for discovering allosteric sites and allosteric modulators of proteins |
-
2020
- 2020-10-12 US US17/068,371 patent/US12340872B2/en active Active
Patent Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6553317B1 (en) * | 1997-03-05 | 2003-04-22 | Incyte Pharmaceuticals, Inc. | Relational database and system for storing information relating to biomolecular sequences and reagents |
| US20030087306A1 (en) | 1998-06-08 | 2003-05-08 | Christensen Burton G. | Methods for identifying novel multimeric agents that modulate receptors |
| US8417461B2 (en) | 2007-10-05 | 2013-04-09 | Ut-Battelle, Llc | Identification and modification of dynamical regions in proteins for alteration of enzyme catalytic effect |
| US9195795B2 (en) | 2007-10-05 | 2015-11-24 | Ut-Battelle, Llc | Identification and modification of dynamical regions in proteins for alteration of enzyme catalytic effect |
| WO2009098596A2 (en) * | 2008-02-05 | 2009-08-13 | Zymeworks Inc. | Methods for determining correlated residues in a protein or other biopolymer using molecular dynamics |
| WO2013003358A2 (en) | 2011-06-29 | 2013-01-03 | Janssen Pharmaceutica Nv | Methods for designing, selecting and/or optimizing allosteric processing inhibitors for matrix metalloproteinases |
| US20170356024A1 (en) | 2014-11-04 | 2017-12-14 | Brandeis University | Biophysical platform for drug development based on energy landscape |
| US11621053B2 (en) | 2019-10-11 | 2023-04-04 | Arium BioLabs LLC | Computational systems and methods for discovering allosteric sites and allosteric modulators of proteins |
Non-Patent Citations (28)
| Title |
|---|
| Agarwal, A Biophysical Perspective on enzyme Catalysis, Biochemistry 2019, vol. 58, pp. 438-449. |
| Agarwal, Role of Protein Dynamics in Reaction Rate Enhancement by Enzymes, Journal of American Chemical Society, vol. 127, pp. 15248-15256 (2005). |
| Bartlett GJ. Analysis of catalytic residues in enzyme active sites. Journal of Molecular Biology 324: 105-121. (Year: 2002). * |
| Bernal C. Integrating enzyme immobilization and protein engineering: an alternative path for the development of novel and improved industrial biocatalysts. Biotechnology Advances 36(5): 1470-1480. (Year: 2018). * |
| Boehr et al., The Dynamic Energy Landscape of Dihydrofolate Reductase Catalysis, Science, vol. 313, pp. 1638-1642 (Sep. 15, 2006). |
| Chennubhotla, et al., Signal Propagation in Proteins and Relation to Equilibrium Fluctuations, PLOS Computational Biology, vol. 3, Issue 9, pp. 1716-1727 (Sep. 2007). |
| Christopoulos, Allosteric Binding Sites on Cell-Surface Receptors: Novel Targets for Drug Discovery, Nature Reviews Drug Discovery, vol. 1, pp. 198-210 (Mar. 2002). |
| Dettori et al., Regulation of the Interaction Between Protein Kinase C-Related Protein Kinase 2 (PRK2) and Its Upstream Kinase, 3-Phosphoinositide-Dependent Protein Kinase 1 (PDK1), Journal of Biological Chemistry, vol. 284, No. 44, pp. 30318-30327 (Oct. 30, 2009). |
| Eathiraj et al., Discovery of a Novel Mode of Protein Kinase Inhibition Characterized by the Mechanism of Inhibition of Human Mesenchymal-Epithelial Transition Factor (c-Met) Protein Autophosphorylation by ARQ 197, Journal of Biological Chemistry, vol. 286, No. 23, pp. 20666-20676 (Jun. 10, 2011). |
| Hindie et al., Structure and Allosteric Effects of Low-Molecular-Weight Activators on the Protein Kinase PDK1, Nature Chemical Biology, vol. 5, No. 10, pp. 758-764 (Oct. 2009). |
| Huang, Wenkang et al., Computational tools for Allosteric Drug Discovery: Site Identification and Focus Library Design, HHS Public Access Author manuscript, Methods Mol Biol. 2017; 1529: 439-446. doi: 10.1007/978-1-4939-6637-0_23. |
| Ichiye et al., Anisotropy and Anharmonicity of Atomic Fluctuations in Proteins: Implications for X-ray Analysis, Biochemistry, vol. 27, No. 9, pp. 3487-3497 (1988). |
| Kar et al., Allostery and Population Shift in Drug Discovery, Current Opinion in Pharmacology, vol. 10, pp. 715-722 (2010). |
| Kenakin, Allosteric Modulators: The New Generation of Receptor Antagonist, Molecular Interventions, vol. 4, Issue 4, pp. 222-229 (Aug. 2004). |
| Lopez-Garcia et al., Allosteric Regulation of Protein Kinase PKC by the N-Terminal C1 Domain and Small Compounds to the PIF-Pocket, Chemistry & Biology, vol. 18, pp. 1463-1473 (Nov. 23, 2011). |
| Lu, Shaoyong et al. Allosteric modulator Discovery: From Serendipity to Structure-Based Design; Journal of Medicinal Chemistry, J. Med. Chem. 2019, 62, 6405-6421. |
| McClendon, et al., Quantifying Correlations Between Allosteric Sites in Thermodynamic Ensembles, Journal of Chemical Theory and Computation, vol. 5, No. 9, pp. 2486-2502 (2009). |
| Narayanan, Chitra et al., Applications of NMR and computational methodologies to study protein dynamics, Review article in Archives of Biochemistry and Biophysics 628 (2017) pp. 71-80. |
| Perryman, Alexander L. et al., Virtual screening with Autodock Vina and the common pharmacophore engine of a low diversity library of fragment and hit against the three allosteric sites of HIV integrase: participation in the SAMPL4 protein-ligand binding challenge; J Comput Aided Mol Des (2014) 28:429-441; DOI 10.1007/s10822-014-9709-3. |
| Petrounia IP and Arnold FH. Designed evolution of enzymatic properties. Current Opinion in Biotechnology 11(4): 325-330. (Year: 2000). * |
| Ramanathan et al., Discovering Conformational Sub-States Relevant to Protein Function, PLOS One, vol. 6, Issue 1, pp. 1-16 (Jan. 2011). |
| Ramanathan et al., Protein Conformational Populations and Functionally Relevant Substates, Accounts of Chemical Research, vol. 47, No. 1, pp. 149-156 (2013). |
| Ramanathan, et al. Computational Identification of Slow Conformational Fluctuations in Proteins, J. Phys. Chem. B., vol. 113, pp. 16669-16680 (2009). |
| Ramanathan, et al. Evolutionarily Conserved Linkage Between Enzyme Fold, Flexibility, and Catalysis, PLOS Biology, vol. 9, Issue 11, pp. 1-18 (Nov. 2011). |
| Xie et al., Excited-State Lifetimes of Far-Infrared Collective Modes in Proteins, The American Physical Society, Physical Review Letters, vol. 88, pp. 018102-1-018102-4 (Jan. 7, 2002). |
| Yu et al., Heat Flow in Proteins: Computation of Thermal Transport Coefficients, The Journal of Chemical Physics, vol. 122, pp. 54902-1-054902-11 (2005). |
| Zhang et al., Targeting Bcr-Abl by Combining Allosteric with ATP-Binding-Site Inhibitors, Nature, vol. 463, pp. 501-507 (Jan. 28, 2010). |
| Zhang X-F. A general and efficient strategy for generating the stable enzymes. Scientific Reports 6: 33797, 10 pages. (Year: 2016). * |
Also Published As
| Publication number | Publication date |
|---|---|
| US20210110884A1 (en) | 2021-04-15 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Markin et al. | Revealing enzyme functional architecture via high-throughput microfluidic enzyme kinetics | |
| Hahn et al. | Best practices for constructing, preparing, and evaluating protein-ligand binding affinity benchmarks [article v1. 0] | |
| Welborn et al. | Fluctuations of electric fields in the active site of the enzyme ketosteroid isomerase | |
| US20220238179A1 (en) | Methods and systems for engineering biomolecules | |
| Han | A suite of mathematical solutions to describe ternary complex formation and their application to targeted protein degradation by heterobifunctional ligands | |
| Bashir et al. | Visualization of the encounter ensemble of the transient electron transfer complex of cytochrome c and cytochrome c peroxidase | |
| Nisius et al. | Structure-based computational analysis of protein binding sites for function and druggability prediction | |
| Gaines et al. | Ribozyme catalysis with a twist: active state of the twister ribozyme in solution predicted from molecular simulation | |
| US9404928B2 (en) | Methods for determining correlated residues in a protein or other biopolymer using molecular dynamics | |
| Huang et al. | In silico discovery of β-secretase inhibitors | |
| Furnham et al. | Large-scale analysis exploring evolution of catalytic machineries and mechanisms in enzyme superfamilies | |
| US11621053B2 (en) | Computational systems and methods for discovering allosteric sites and allosteric modulators of proteins | |
| Mlýnský et al. | Molecular dynamics simulations reveal an interplay between SHAPE reagent binding and RNA flexibility | |
| Li et al. | Computational enzyme design approaches with significant biological outcomes: progress and challenges | |
| Magnus et al. | RNA 3D structure prediction guided by independent folding of homologous sequences | |
| Raza et al. | Visualizing protein–ligand binding with chemical energy-wise decomposition (CHEWD): Application to ligand binding in the kallikrein-8 S1 Site | |
| Arcon et al. | Biased docking for protein–ligand pose prediction | |
| Wang et al. | Quantum chemical prediction of electron ionization mass spectra of trimethylsilylated metabolites | |
| Michael et al. | How much can physics do for protein design? | |
| US12340872B2 (en) | Systems and methods for engineering enzymes using identified energy transfer networks | |
| Vukovic et al. | Molecular mechanism of processive 3′ to 5′ RNA translocation in the active subunit of the RNA exosome complex | |
| Yang et al. | Exploring biomolecular conformational dynamics with polarizable force field AMOEBA and enhanced sampling method milestoning | |
| US12087410B2 (en) | Systems and methods of designing chemical libraries for allosteric modulator discovery | |
| Luo et al. | Evidence that the C-terminal domain of a type B PutA protein contributes to aldehyde dehydrogenase activity and substrate channeling | |
| Hammerschmidt et al. | Advanced Isothermal Titration Calorimetry for Medicinal Chemists with ITCcalc |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| FEPP | Fee payment procedure |
Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY |
|
| AS | Assignment |
Owner name: ARIUM BIOLABS LLC, TENNESSEE Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:AGARWAL, PRATUL K.;REEL/FRAME:054039/0955 Effective date: 20191020 |
|
| FEPP | Fee payment procedure |
Free format text: ENTITY STATUS SET TO SMALL (ORIGINAL EVENT CODE: SMAL); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: APPLICATION DISPATCHED FROM PREEXAM, NOT YET DOCKETED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
| FEPP | Fee payment procedure |
Free format text: PETITION RELATED TO MAINTENANCE FEES GRANTED (ORIGINAL EVENT CODE: PTGR); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: FINAL REJECTION MAILED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE AFTER FINAL ACTION FORWARDED TO EXAMINER |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: ADVISORY ACTION MAILED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: AWAITING RESPONSE FOR INFORMALITY, FEE DEFICIENCY OR CRF ACTION |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
| AS | Assignment |
Owner name: ARIUM BIOLABS LLC, OKLAHOMA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:ARIUM BIOLABS LLC;REEL/FRAME:069673/0861 Effective date: 20241224 |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS |
|
| STCF | Information on status: patent grant |
Free format text: PATENTED CASE |