WO2020251615A1 - Methods of designing novel antibody mimetics for use in detecting antigens and as therapeutic agents - Google Patents
Methods of designing novel antibody mimetics for use in detecting antigens and as therapeutic agents Download PDFInfo
- Publication number
- WO2020251615A1 WO2020251615A1 PCT/US2019/061862 US2019061862W WO2020251615A1 WO 2020251615 A1 WO2020251615 A1 WO 2020251615A1 US 2019061862 W US2019061862 W US 2019061862W WO 2020251615 A1 WO2020251615 A1 WO 2020251615A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- peptide
- binding
- peptides
- amino acids
- target protein
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 89
- 239000003814 drug Substances 0.000 title description 7
- 239000000427 antigen Substances 0.000 title description 5
- 108091007433 antigens Proteins 0.000 title description 5
- 102000036639 antigens Human genes 0.000 title description 5
- 229940124597 therapeutic agent Drugs 0.000 title description 4
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 307
- 230000027455 binding Effects 0.000 claims abstract description 182
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 160
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 63
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 63
- 238000004519 manufacturing process Methods 0.000 claims abstract description 8
- 238000003860 storage Methods 0.000 claims abstract description 5
- 150000001413 amino acids Chemical class 0.000 claims description 127
- 108010067902 Peptide Library Proteins 0.000 claims description 87
- 238000003032 molecular docking Methods 0.000 claims description 31
- 238000005304 joining Methods 0.000 claims description 3
- 235000001014 amino acid Nutrition 0.000 description 105
- 229940024606 amino acid Drugs 0.000 description 105
- 241000907316 Zika virus Species 0.000 description 74
- 102100021696 Syncytin-1 Human genes 0.000 description 25
- 241000710831 Flavivirus Species 0.000 description 24
- 101710091045 Envelope protein Proteins 0.000 description 23
- 101710188315 Protein X Proteins 0.000 description 23
- 239000002245 particle Substances 0.000 description 23
- 241000725619 Dengue virus Species 0.000 description 22
- 238000003556 assay Methods 0.000 description 20
- 238000002965 ELISA Methods 0.000 description 17
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 17
- 239000002953 phosphate buffered saline Substances 0.000 description 17
- 241000700605 Viruses Species 0.000 description 16
- 230000013595 glycosylation Effects 0.000 description 15
- 210000002966 serum Anatomy 0.000 description 15
- 210000002700 urine Anatomy 0.000 description 15
- 239000000872 buffer Substances 0.000 description 13
- 238000006206 glycosylation reaction Methods 0.000 description 13
- 238000013459 approach Methods 0.000 description 12
- 238000001514 detection method Methods 0.000 description 12
- 230000000903 blocking effect Effects 0.000 description 10
- 230000006870 function Effects 0.000 description 9
- 230000009260 cross reactivity Effects 0.000 description 8
- 239000000126 substance Substances 0.000 description 8
- UDPCXVIKHFVPSZ-HOIFWPIMSA-N 5-[(3as,4s,6ar)-2-oxo-1,3,3a,4,6,6a-hexahydrothieno[3,4-d]imidazol-4-yl]-n-[2-[2-[2-[3-(2,5-dioxopyrrol-1-yl)propanoylamino]ethoxy]ethoxy]ethyl]pentanamide Chemical compound C([C@H]1[C@H]2NC(=O)N[C@H]2CS1)CCCC(=O)NCCOCCOCCNC(=O)CCN1C(=O)C=CC1=O UDPCXVIKHFVPSZ-HOIFWPIMSA-N 0.000 description 7
- 239000011248 coating agent Substances 0.000 description 7
- 238000000576 coating method Methods 0.000 description 7
- 230000035945 sensitivity Effects 0.000 description 7
- 201000009182 Chikungunya Diseases 0.000 description 6
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 6
- 239000011230 binding agent Substances 0.000 description 6
- 235000018417 cysteine Nutrition 0.000 description 6
- 238000011534 incubation Methods 0.000 description 6
- 238000000302 molecular modelling Methods 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 238000012216 screening Methods 0.000 description 6
- 239000000243 solution Substances 0.000 description 6
- 238000000126 in silico method Methods 0.000 description 5
- 230000003993 interaction Effects 0.000 description 5
- 239000003446 ligand Substances 0.000 description 5
- 235000018102 proteins Nutrition 0.000 description 5
- 238000012552 review Methods 0.000 description 5
- 238000004088 simulation Methods 0.000 description 5
- 238000012360 testing method Methods 0.000 description 5
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 4
- ALYNCZNDIQEVRV-UHFFFAOYSA-N 4-aminobenzoic acid Chemical compound NC1=CC=C(C(O)=O)C=C1 ALYNCZNDIQEVRV-UHFFFAOYSA-N 0.000 description 4
- SLXKOJJOQWFEFD-UHFFFAOYSA-N 6-aminohexanoic acid Chemical compound NCCCCCC(O)=O SLXKOJJOQWFEFD-UHFFFAOYSA-N 0.000 description 4
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 4
- 238000002835 absorbance Methods 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 238000010276 construction Methods 0.000 description 4
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 4
- 238000013461 design Methods 0.000 description 4
- 238000011161 development Methods 0.000 description 4
- 230000018109 developmental process Effects 0.000 description 4
- 238000009826 distribution Methods 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 230000004044 response Effects 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- UKAUYVFTDYCKQA-UHFFFAOYSA-N -2-Amino-4-hydroxybutanoic acid Natural products OC(=O)C(N)CCO UKAUYVFTDYCKQA-UHFFFAOYSA-N 0.000 description 3
- FUOOLUPWFVMBKG-UHFFFAOYSA-N 2-Aminoisobutyric acid Chemical compound CC(C)(N)C(O)=O FUOOLUPWFVMBKG-UHFFFAOYSA-N 0.000 description 3
- 108091023037 Aptamer Proteins 0.000 description 3
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 3
- 206010028980 Neoplasm Diseases 0.000 description 3
- 229920001213 Polysorbate 20 Polymers 0.000 description 3
- 208000001455 Zika Virus Infection Diseases 0.000 description 3
- 208000020329 Zika virus infectious disease Diseases 0.000 description 3
- 238000007792 addition Methods 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 3
- 230000003247 decreasing effect Effects 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 3
- -1 e.g. Proteins 0.000 description 3
- 238000011156 evaluation Methods 0.000 description 3
- BTCSSZJGUNDROE-UHFFFAOYSA-N gamma-aminobutyric acid Chemical compound NCCCC(O)=O BTCSSZJGUNDROE-UHFFFAOYSA-N 0.000 description 3
- 230000002209 hydrophobic effect Effects 0.000 description 3
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 3
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 3
- 150000003839 salts Chemical class 0.000 description 3
- 238000000926 separation method Methods 0.000 description 3
- UQWNNUPJBDWRHC-UWVGGRQHSA-N (3s,4s)-4-amino-5-cyclohexyl-3-hydroxypentanoic acid Chemical compound OC(=O)C[C@H](O)[C@@H](N)CC1CCCCC1 UQWNNUPJBDWRHC-UWVGGRQHSA-N 0.000 description 2
- SNPQRYOQWLOTFA-UHFFFAOYSA-N 2,2-dimethyl-1,3-thiazolidine Chemical compound CC1(C)NCCS1 SNPQRYOQWLOTFA-UHFFFAOYSA-N 0.000 description 2
- JAJQQUQHMLWDFB-UHFFFAOYSA-N 4-azaniumyl-3-hydroxy-5-phenylpentanoate Chemical compound OC(=O)CC(O)C(N)CC1=CC=CC=C1 JAJQQUQHMLWDFB-UHFFFAOYSA-N 0.000 description 2
- DFVFTMTWCUHJBL-UHFFFAOYSA-N 4-azaniumyl-3-hydroxy-6-methylheptanoate Chemical compound CC(C)CC(N)C(O)CC(O)=O DFVFTMTWCUHJBL-UHFFFAOYSA-N 0.000 description 2
- 108010069514 Cyclic Peptides Proteins 0.000 description 2
- 102000001189 Cyclic Peptides Human genes 0.000 description 2
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 2
- 238000012286 ELISA Assay Methods 0.000 description 2
- 101710121417 Envelope glycoprotein Proteins 0.000 description 2
- 108090000288 Glycoproteins Proteins 0.000 description 2
- 102000003886 Glycoproteins Human genes 0.000 description 2
- PMMYEEVYMWASQN-DMTCNVIQSA-N Hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 description 2
- WTDRDQBEARUVNC-LURJTMIESA-N L-DOPA Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C(O)=C1 WTDRDQBEARUVNC-LURJTMIESA-N 0.000 description 2
- SNDPXSYFESPGGJ-UHFFFAOYSA-N L-norVal-OH Natural products CCCC(N)C(O)=O SNDPXSYFESPGGJ-UHFFFAOYSA-N 0.000 description 2
- LRQKBLKVPFOOQJ-YFKPBYRVSA-N L-norleucine Chemical compound CCCC[C@H]([NH3+])C([O-])=O LRQKBLKVPFOOQJ-YFKPBYRVSA-N 0.000 description 2
- AKCRVYNORCOYQT-YFKPBYRVSA-N N-methyl-L-valine Chemical compound CN[C@@H](C(C)C)C(O)=O AKCRVYNORCOYQT-YFKPBYRVSA-N 0.000 description 2
- 241001263478 Norovirus Species 0.000 description 2
- 101800004192 Peptide P1 Proteins 0.000 description 2
- 239000004793 Polystyrene Substances 0.000 description 2
- PZBFGYYEXUXCOF-UHFFFAOYSA-N TCEP Chemical compound OC(=O)CCP(CCC(O)=O)CCC(O)=O PZBFGYYEXUXCOF-UHFFFAOYSA-N 0.000 description 2
- 230000006399 behavior Effects 0.000 description 2
- UCMIRNVEIXFBKS-UHFFFAOYSA-N beta-alanine Chemical compound NCCC(O)=O UCMIRNVEIXFBKS-UHFFFAOYSA-N 0.000 description 2
- 239000013060 biological fluid Substances 0.000 description 2
- 229960002685 biotin Drugs 0.000 description 2
- 235000020958 biotin Nutrition 0.000 description 2
- 239000011616 biotin Substances 0.000 description 2
- 239000002981 blocking agent Substances 0.000 description 2
- 238000011088 calibration curve Methods 0.000 description 2
- 210000004027 cell Anatomy 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 150000001945 cysteines Chemical class 0.000 description 2
- 231100000673 dose–response relationship Toxicity 0.000 description 2
- 238000009510 drug design Methods 0.000 description 2
- 238000007876 drug discovery Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000007306 functionalization reaction Methods 0.000 description 2
- 239000007789 gas Substances 0.000 description 2
- 239000003547 immunosorbent Substances 0.000 description 2
- 238000011031 large-scale manufacturing process Methods 0.000 description 2
- 229960004502 levodopa Drugs 0.000 description 2
- 238000000670 ligand binding assay Methods 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 239000002184 metal Substances 0.000 description 2
- 229910052751 metal Inorganic materials 0.000 description 2
- 238000000329 molecular dynamics simulation Methods 0.000 description 2
- SCIFESDRCALIIM-UHFFFAOYSA-N n-methylphenylalanine Chemical compound CNC(C(O)=O)CC1=CC=CC=C1 SCIFESDRCALIIM-UHFFFAOYSA-N 0.000 description 2
- 230000009871 nonspecific binding Effects 0.000 description 2
- 230000035515 penetration Effects 0.000 description 2
- 238000002823 phage display Methods 0.000 description 2
- 229920002223 polystyrene Polymers 0.000 description 2
- 238000003127 radioimmunoassay Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- FSYKKLYZXJSNPZ-UHFFFAOYSA-N sarcosine Chemical compound C[NH2+]CC([O-])=O FSYKKLYZXJSNPZ-UHFFFAOYSA-N 0.000 description 2
- 238000013207 serial dilution Methods 0.000 description 2
- 238000002943 spectrophotometric absorbance Methods 0.000 description 2
- 238000012916 structural analysis Methods 0.000 description 2
- 230000002195 synergetic effect Effects 0.000 description 2
- 230000008685 targeting Effects 0.000 description 2
- 230000001225 therapeutic effect Effects 0.000 description 2
- 238000003041 virtual screening Methods 0.000 description 2
- 239000011534 wash buffer Substances 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- OGNSCSPNOLGXSM-UHFFFAOYSA-N (+/-)-DABA Natural products NCCC(N)C(O)=O OGNSCSPNOLGXSM-UHFFFAOYSA-N 0.000 description 1
- DNDVHMGBCGSRIY-ZCFIWIBFSA-N (2r)-2-amino-3-(1,3-benzothiazol-2-yl)propanoic acid Chemical compound C1=CC=C2SC(C[C@@H](N)C(O)=O)=NC2=C1 DNDVHMGBCGSRIY-ZCFIWIBFSA-N 0.000 description 1
- FDKWRPBBCBCIGA-REOHCLBHSA-N (2r)-2-azaniumyl-3-$l^{1}-selanylpropanoate Chemical compound [Se]C[C@H](N)C(O)=O FDKWRPBBCBCIGA-REOHCLBHSA-N 0.000 description 1
- QFQYGJMNIDGZSG-YFKPBYRVSA-N (2r)-3-(acetamidomethylsulfanyl)-2-azaniumylpropanoate Chemical compound CC(=O)NCSC[C@H]([NH3+])C([O-])=O QFQYGJMNIDGZSG-YFKPBYRVSA-N 0.000 description 1
- WBNIBXIBXCPMOW-JTQLQIEISA-N (2s)-2-(ethoxyamino)-3-(4-hydroxyphenyl)propanoic acid Chemical compound CCON[C@H](C(O)=O)CC1=CC=C(O)C=C1 WBNIBXIBXCPMOW-JTQLQIEISA-N 0.000 description 1
- FPDYKABXINADKS-LURJTMIESA-N (2s)-2-(methylazaniumyl)hexanoate Chemical compound CCCC[C@H](NC)C(O)=O FPDYKABXINADKS-LURJTMIESA-N 0.000 description 1
- XQBUAEBDRUXSPI-ZETCQYMHSA-N (2s)-2-(piperidin-1-ylamino)propanoic acid Chemical compound OC(=O)[C@H](C)NN1CCCCC1 XQBUAEBDRUXSPI-ZETCQYMHSA-N 0.000 description 1
- WTYDPHLGAWMABS-LURJTMIESA-N (2s)-2-(pyrrolidin-1-ylamino)propanoic acid Chemical compound OC(=O)[C@H](C)NN1CCCC1 WTYDPHLGAWMABS-LURJTMIESA-N 0.000 description 1
- KWIPUXXIFQQMKN-VIFPVBQESA-N (2s)-2-amino-3-(4-cyanophenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(C#N)C=C1 KWIPUXXIFQQMKN-VIFPVBQESA-N 0.000 description 1
- CBNRCFKJVUHIRM-LBPRGKRZSA-N (2s)-2-amino-3-[4-[bis(dimethylaminooxy)phosphoryloxy]phenyl]propanoic acid Chemical compound CN(C)OP(=O)(ON(C)C)OC1=CC=C(C[C@H](N)C(O)=O)C=C1 CBNRCFKJVUHIRM-LBPRGKRZSA-N 0.000 description 1
- SJTPYAOZAPSOLO-REOHCLBHSA-N (2s)-2-hydrazinyl-3-hydroxypropanoic acid Chemical compound NN[C@@H](CO)C(O)=O SJTPYAOZAPSOLO-REOHCLBHSA-N 0.000 description 1
- UTEYHKAJNNCEDX-VIFPVBQESA-N (2s)-3-(4-hydroxyphenyl)-2-(methoxyamino)propanoic acid Chemical compound CON[C@H](C(O)=O)CC1=CC=C(O)C=C1 UTEYHKAJNNCEDX-VIFPVBQESA-N 0.000 description 1
- MCOLRSXARZXGCL-HNNXBMFYSA-N (2s)-3-(4-phenylmethoxyphenyl)-2-(phosphonoamino)propanoic acid Chemical compound C1=CC(C[C@@H](C(=O)O)NP(O)(O)=O)=CC=C1OCC1=CC=CC=C1 MCOLRSXARZXGCL-HNNXBMFYSA-N 0.000 description 1
- ARMHACFUZZJJSF-VIFPVBQESA-N (2s)-3-phenylmethoxy-2-(phosphonoamino)propanoic acid Chemical compound OP(=O)(O)N[C@H](C(=O)O)COCC1=CC=CC=C1 ARMHACFUZZJJSF-VIFPVBQESA-N 0.000 description 1
- XJODGRWDFZVTKW-LURJTMIESA-N (2s)-4-methyl-2-(methylamino)pentanoic acid Chemical compound CN[C@H](C(O)=O)CC(C)C XJODGRWDFZVTKW-LURJTMIESA-N 0.000 description 1
- WCYLQLNJSDTVPN-BQBZGAKWSA-N (2s,3s)-2-amino-n-methoxy-n,3-dimethylpentanamide Chemical compound CC[C@H](C)[C@H](N)C(=O)N(C)OC WCYLQLNJSDTVPN-BQBZGAKWSA-N 0.000 description 1
- LJRDOKAZOAKLDU-UDXJMMFXSA-N (2s,3s,4r,5r,6r)-5-amino-2-(aminomethyl)-6-[(2r,3s,4r,5s)-5-[(1r,2r,3s,5r,6s)-3,5-diamino-2-[(2s,3r,4r,5s,6r)-3-amino-4,5-dihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy-6-hydroxycyclohexyl]oxy-4-hydroxy-2-(hydroxymethyl)oxolan-3-yl]oxyoxane-3,4-diol;sulfuric ac Chemical compound OS(O)(=O)=O.N[C@@H]1[C@@H](O)[C@H](O)[C@H](CN)O[C@@H]1O[C@H]1[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](N)C[C@@H](N)[C@@H]2O)O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)N)O[C@@H]1CO LJRDOKAZOAKLDU-UDXJMMFXSA-N 0.000 description 1
- LDXFNKACRSEQND-ULUSZKPHSA-N (5R)-1-azabicyclo[3.1.0]hexane-5-carboxylic acid Chemical compound C1CC[C@@]2(C(=O)O)N1C2 LDXFNKACRSEQND-ULUSZKPHSA-N 0.000 description 1
- IADUEWIQBXOCDZ-VKHMYHEASA-N (S)-azetidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCN1 IADUEWIQBXOCDZ-VKHMYHEASA-N 0.000 description 1
- BLCJBICVQSYOIF-UHFFFAOYSA-N 2,2-diaminobutanoic acid Chemical compound CCC(N)(N)C(O)=O BLCJBICVQSYOIF-UHFFFAOYSA-N 0.000 description 1
- OMGHIGVFLOPEHJ-UHFFFAOYSA-N 2,5-dihydro-1h-pyrrol-1-ium-2-carboxylate Chemical compound OC(=O)C1NCC=C1 OMGHIGVFLOPEHJ-UHFFFAOYSA-N 0.000 description 1
- PRSWALMQXKFZFA-UHFFFAOYSA-N 2-(piperidin-1-ylamino)acetic acid Chemical compound OC(=O)CNN1CCCCC1 PRSWALMQXKFZFA-UHFFFAOYSA-N 0.000 description 1
- ICBKJQDJWHBQAX-UHFFFAOYSA-N 2-aminooxane-4-carboxylic acid Chemical compound NC1CC(C(O)=O)CCO1 ICBKJQDJWHBQAX-UHFFFAOYSA-N 0.000 description 1
- VRFJLUHAQPBTLE-UHFFFAOYSA-N 2-aminopiperidine-1-carboxylic acid Chemical compound NC1CCCCN1C(O)=O VRFJLUHAQPBTLE-UHFFFAOYSA-N 0.000 description 1
- WTOFYLAWDLQMBZ-UHFFFAOYSA-N 2-azaniumyl-3-thiophen-2-ylpropanoate Chemical compound OC(=O)C(N)CC1=CC=CS1 WTOFYLAWDLQMBZ-UHFFFAOYSA-N 0.000 description 1
- YHFKJGZDKILHTL-UHFFFAOYSA-N 3-(aminomethyl)-1h-pyrrole-2-carboxylic acid Chemical compound NCC=1C=CNC=1C(O)=O YHFKJGZDKILHTL-UHFFFAOYSA-N 0.000 description 1
- PECYZEOJVXMISF-UHFFFAOYSA-N 3-aminoalanine Chemical compound [NH3+]CC(N)C([O-])=O PECYZEOJVXMISF-UHFFFAOYSA-N 0.000 description 1
- DKIDEFUBRARXTE-UHFFFAOYSA-N 3-mercaptopropanoic acid Chemical class OC(=O)CCS DKIDEFUBRARXTE-UHFFFAOYSA-N 0.000 description 1
- XWHHYOYVRVGJJY-QMMMGPOBSA-N 4-fluoro-L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(F)C=C1 XWHHYOYVRVGJJY-QMMMGPOBSA-N 0.000 description 1
- GTVVZTAFGPQSPC-UHFFFAOYSA-N 4-nitrophenylalanine Chemical compound OC(=O)C(N)CC1=CC=C([N+]([O-])=O)C=C1 GTVVZTAFGPQSPC-UHFFFAOYSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- OBMZMSLWNNWEJA-XNCRXQDQSA-N C1=CC=2C(C[C@@H]3NC(=O)[C@@H](NC(=O)[C@H](NC(=O)N(CC#CCN(CCCC[C@H](NC(=O)[C@@H](CC4=CC=CC=C4)NC3=O)C(=O)N)CC=C)NC(=O)[C@@H](N)C)CC3=CNC4=C3C=CC=C4)C)=CNC=2C=C1 Chemical compound C1=CC=2C(C[C@@H]3NC(=O)[C@@H](NC(=O)[C@H](NC(=O)N(CC#CCN(CCCC[C@H](NC(=O)[C@@H](CC4=CC=CC=C4)NC3=O)C(=O)N)CC=C)NC(=O)[C@@H](N)C)CC3=CNC4=C3C=CC=C4)C)=CNC=2C=C1 OBMZMSLWNNWEJA-XNCRXQDQSA-N 0.000 description 1
- 101100493820 Caenorhabditis elegans best-1 gene Proteins 0.000 description 1
- 208000024172 Cardiovascular disease Diseases 0.000 description 1
- 208000035473 Communicable disease Diseases 0.000 description 1
- 206010010356 Congenital anomaly Diseases 0.000 description 1
- 208000027205 Congenital disease Diseases 0.000 description 1
- CKLJMWTZIZZHCS-UHFFFAOYSA-N D-OH-Asp Natural products OC(=O)C(N)CC(O)=O CKLJMWTZIZZHCS-UHFFFAOYSA-N 0.000 description 1
- FDKWRPBBCBCIGA-UWTATZPHSA-N D-Selenocysteine Natural products [Se]C[C@@H](N)C(O)=O FDKWRPBBCBCIGA-UWTATZPHSA-N 0.000 description 1
- NIGWMJHCCYYCSF-UHFFFAOYSA-N Fenclonine Chemical compound OC(=O)C(N)CC1=CC=C(Cl)C=C1 NIGWMJHCCYYCSF-UHFFFAOYSA-N 0.000 description 1
- 241000537377 Fraxinus berlandieriana Species 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- SNDPXSYFESPGGJ-BYPYZUCNSA-N L-2-aminopentanoic acid Chemical compound CCC[C@H](N)C(O)=O SNDPXSYFESPGGJ-BYPYZUCNSA-N 0.000 description 1
- AHLPHDHHMVZTML-BYPYZUCNSA-N L-Ornithine Chemical compound NCCC[C@H](N)C(O)=O AHLPHDHHMVZTML-BYPYZUCNSA-N 0.000 description 1
- ZGUNAGUHMKGQNY-ZETCQYMHSA-N L-alpha-phenylglycine zwitterion Chemical compound OC(=O)[C@@H](N)C1=CC=CC=C1 ZGUNAGUHMKGQNY-ZETCQYMHSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- RHGKLRLOHDJJDR-BYPYZUCNSA-N L-citrulline Chemical compound NC(=O)NCCC[C@H]([NH3+])C([O-])=O RHGKLRLOHDJJDR-BYPYZUCNSA-N 0.000 description 1
- UKAUYVFTDYCKQA-VKHMYHEASA-N L-homoserine Chemical compound OC(=O)[C@@H](N)CCO UKAUYVFTDYCKQA-VKHMYHEASA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- QEFRNWWLZKMPFJ-ZXPFJRLXSA-N L-methionine (R)-S-oxide Chemical compound C[S@@](=O)CC[C@H]([NH3+])C([O-])=O QEFRNWWLZKMPFJ-ZXPFJRLXSA-N 0.000 description 1
- QEFRNWWLZKMPFJ-YGVKFDHGSA-N L-methionine S-oxide Chemical compound CS(=O)CC[C@H](N)C(O)=O QEFRNWWLZKMPFJ-YGVKFDHGSA-N 0.000 description 1
- UCUNFLYVYCGDHP-BYPYZUCNSA-N L-methionine sulfone Chemical compound CS(=O)(=O)CC[C@H](N)C(O)=O UCUNFLYVYCGDHP-BYPYZUCNSA-N 0.000 description 1
- QEFRNWWLZKMPFJ-UHFFFAOYSA-N L-methionine sulphoxide Natural products CS(=O)CCC(N)C(O)=O QEFRNWWLZKMPFJ-UHFFFAOYSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- OLYPWXRMOFUVGH-LURJTMIESA-N N(2)-methyl-L-lysine Chemical compound CN[C@H](C(O)=O)CCCCN OLYPWXRMOFUVGH-LURJTMIESA-N 0.000 description 1
- MXNRLFUSFKVQSK-QMMMGPOBSA-O N(6),N(6),N(6)-trimethyl-L-lysine Chemical compound C[N+](C)(C)CCCC[C@H]([NH3+])C([O-])=O MXNRLFUSFKVQSK-QMMMGPOBSA-O 0.000 description 1
- RYFOQDQDVYIEHN-ZETCQYMHSA-N N,N-Dimethyllysine Chemical compound CN(C)[C@H](C(O)=O)CCCCN RYFOQDQDVYIEHN-ZETCQYMHSA-N 0.000 description 1
- KSPIYJQBLVDRRI-WDSKDSINSA-N N-methyl-L-isoleucine Chemical compound CC[C@H](C)[C@H](NC)C(O)=O KSPIYJQBLVDRRI-WDSKDSINSA-N 0.000 description 1
- RHGKLRLOHDJJDR-UHFFFAOYSA-N Ndelta-carbamoyl-DL-ornithine Natural products OC(=O)C(N)CCCNC(N)=O RHGKLRLOHDJJDR-UHFFFAOYSA-N 0.000 description 1
- CIQHWLTYGMYQQR-QMMMGPOBSA-N O(4')-sulfo-L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(OS(O)(=O)=O)C=C1 CIQHWLTYGMYQQR-QMMMGPOBSA-N 0.000 description 1
- AHLPHDHHMVZTML-UHFFFAOYSA-N Orn-delta-NH2 Natural products NCCCC(N)C(O)=O AHLPHDHHMVZTML-UHFFFAOYSA-N 0.000 description 1
- UTJLXEIPEHZYQJ-UHFFFAOYSA-N Ornithine Natural products OC(=O)C(C)CCCN UTJLXEIPEHZYQJ-UHFFFAOYSA-N 0.000 description 1
- 101710176384 Peptide 1 Proteins 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 108010077895 Sarcosine Proteins 0.000 description 1
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 1
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 1
- 241000710886 West Nile virus Species 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000001154 acute effect Effects 0.000 description 1
- QWCKQJZIFLGMSD-UHFFFAOYSA-N alpha-aminobutyric acid Chemical compound CCC(N)C(O)=O QWCKQJZIFLGMSD-UHFFFAOYSA-N 0.000 description 1
- 229940124277 aminobutyric acid Drugs 0.000 description 1
- 230000003466 anti-cipated effect Effects 0.000 description 1
- 230000000840 anti-viral effect Effects 0.000 description 1
- 230000009830 antibody antigen interaction Effects 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 230000001174 ascending effect Effects 0.000 description 1
- 206010003549 asthenia Diseases 0.000 description 1
- 238000005452 bending Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 229940000635 beta-alanine Drugs 0.000 description 1
- 238000011953 bioanalysis Methods 0.000 description 1
- 238000004166 bioassay Methods 0.000 description 1
- 239000012472 biological sample Substances 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 229960000074 biopharmaceutical Drugs 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 239000002738 chelating agent Substances 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 229960002173 citrulline Drugs 0.000 description 1
- 235000013477 citrulline Nutrition 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 238000002405 diagnostic procedure Methods 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 150000002013 dioxins Chemical class 0.000 description 1
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 description 1
- 238000011143 downstream manufacturing Methods 0.000 description 1
- 238000012377 drug delivery Methods 0.000 description 1
- 230000002526 effect on cardiovascular system Effects 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 238000004186 food analysis Methods 0.000 description 1
- 229960003692 gamma aminobutyric acid Drugs 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 230000003862 health status Effects 0.000 description 1
- 244000052637 human pathogen Species 0.000 description 1
- 210000004408 hybridoma Anatomy 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 125000004435 hydrogen atom Chemical group [H]* 0.000 description 1
- 229960002591 hydroxyproline Drugs 0.000 description 1
- 238000003018 immunoassay Methods 0.000 description 1
- 230000002055 immunohistochemical effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000011503 in vivo imaging Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 208000027866 inflammatory disease Diseases 0.000 description 1
- 230000002757 inflammatory effect Effects 0.000 description 1
- SRJOCJYGOFTFLH-UHFFFAOYSA-N isonipecotic acid Chemical compound OC(=O)C1CCNCC1 SRJOCJYGOFTFLH-UHFFFAOYSA-N 0.000 description 1
- 238000007477 logistic regression Methods 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 230000036244 malformation Effects 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 208000004141 microcephaly Diseases 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000000324 molecular mechanic Methods 0.000 description 1
- 239000012514 monoclonal antibody product Substances 0.000 description 1
- KRKPYFLIYNGWTE-UHFFFAOYSA-N n,o-dimethylhydroxylamine Chemical compound CNOC KRKPYFLIYNGWTE-UHFFFAOYSA-N 0.000 description 1
- 208000037971 neglected tropical disease Diseases 0.000 description 1
- 230000000926 neurological effect Effects 0.000 description 1
- 230000003472 neutralizing effect Effects 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 238000011275 oncology therapy Methods 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 229960003104 ornithine Drugs 0.000 description 1
- 229940029358 orthoclone okt3 Drugs 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- 229960001639 penicillamine Drugs 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- DCWXELXMIBXGTH-QMMMGPOBSA-N phosphonotyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(OP(O)(O)=O)C=C1 DCWXELXMIBXGTH-QMMMGPOBSA-N 0.000 description 1
- 238000012805 post-processing Methods 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 238000000159 protein binding assay Methods 0.000 description 1
- 230000005180 public health Effects 0.000 description 1
- 230000000241 respiratory effect Effects 0.000 description 1
- 208000023504 respiratory system disease Diseases 0.000 description 1
- 238000003757 reverse transcription PCR Methods 0.000 description 1
- 229940043230 sarcosine Drugs 0.000 description 1
- 229940055619 selenocysteine Drugs 0.000 description 1
- ZKZBPNGNEQAJSX-UHFFFAOYSA-N selenocysteine Natural products [SeH]CC(N)C(O)=O ZKZBPNGNEQAJSX-UHFFFAOYSA-N 0.000 description 1
- 235000016491 selenocysteine Nutrition 0.000 description 1
- 210000000582 semen Anatomy 0.000 description 1
- 229910000030 sodium bicarbonate Inorganic materials 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 238000011895 specific detection Methods 0.000 description 1
- 230000003335 steric effect Effects 0.000 description 1
- 239000011550 stock solution Substances 0.000 description 1
- 239000012089 stop solution Substances 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 229910052717 sulfur Inorganic materials 0.000 description 1
- 239000011593 sulfur Substances 0.000 description 1
- 239000004094 surface-active agent Substances 0.000 description 1
- 229940126622 therapeutic monoclonal antibody Drugs 0.000 description 1
- 125000003396 thiol group Chemical group [H]S* 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- FGMPLJWBKKVCDB-UHFFFAOYSA-N trans-L-hydroxy-proline Natural products ON1CCCC1C(O)=O FGMPLJWBKKVCDB-UHFFFAOYSA-N 0.000 description 1
- 210000000605 viral structure Anatomy 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
- 238000004457 water analysis Methods 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- ORQXBVXKBGUSBA-QMMMGPOBSA-N β-cyclohexyl-alanine Chemical compound OC(=O)[C@@H](N)CC1CCCCC1 ORQXBVXKBGUSBA-QMMMGPOBSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/08—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from viruses
- C07K16/10—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from viruses from RNA viruses
- C07K16/1081—Togaviridae, e.g. flavivirus, rubella virus, hog cholera virus
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/005—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies constructed by phage libraries
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2318/00—Antibody mimetics or scaffolds
- C07K2318/20—Antigen-binding scaffold molecules wherein the scaffold is not an immunoglobulin variable region or antibody mimetics
Definitions
- Antibodies constitute powerful therapeutic agents characterized by limited side effects due to their ability to specifically target a distinct antigen on a cell, bacteria, virus, or toxin.
- Orthoclone OKT3 the first therapeutic monoclonal antibody
- this class of biopharmaceutical products has significantly grown and over 45 monoclonal antibody products had received approval in the U.S. or Europe for the treatment of a variety of diseases, including cancer and inflammatory, cardiovascular, respiratory, and infectious diseases.
- the projected antibody market in the United States is anticipated to surpass the $10 billion mark, the production of such therapeutics is not without limitations.
- One disadvantage of therapeutic antibodies is the cost of downstream processing to achieve the required high purity levels.
- Another limiting factor of therapeutic antibodies is the sensitivity of antibody structure to chemical and physical denaturation encountered during post-manufacture filling, shipping and storage. Flarsh chemical environments can affect the antibodies’ binding properties.
- Antibody mimetics have been successfully used in the development of binding assays for the detection of analytes in biological samples, as well as in separation methods, cancer therapy, targeted drug delivery, and in vivo imaging.
- the recent advances in the field of antibody mimetics and their applications in bioanalytical chemistry, specifically in diagnostics and other analytical methods have been described (Yu et al., Annual Review of Analytical Chemistry 10, 293-320 (2017). Because synthetic peptides are more resistant to
- Linear peptides were selected by phage display to detect norovirus using an ELISA protocol or by means of an impedance biosensor (Hwang et al., 2017; Palzkill et al., 2018).
- clamp peptides designed to bind to two different binding sites on the ZIKV envelope protein were designed and subsequently manufactured and tested. As shown through direct ELISA, the performances of the clamp peptides demonstrated beneficial and desired binding activities. As shown by the data presented herein, the arms of the clamp peptides were able to wrap around the glycosylation site such as to clamp the peptide in place.
- the present disclosure provides a method implemented by a processor in a computer for designing a clamp peptide comprising the structure A B-A 2 wherein is a first peptide arm that binds to a first binding site of a target protein, A 2 is a second peptide arm that binds to a second binding site of the target protein, and B is a bridge peptide which links to A 2 .
- the method comprises the steps of: (a) determining a binding score of each tetrapeptide of a first peptide library for the first binding site of the target protein, wherein the first peptide library comprises a set of tetrapeptides having a unique combination of four amino acids of the twenty naturally-occurring amino acids, optionally, wherein the first peptide library comprises the full combinatorial set of 160,000 tetrapeptides having a unique
- the second peptide library comprises a set of pentapeptides having a unique combination of five amino acids comprising the amino acids of the sequence of a tetrapeptide identified in step (b) and one of the twenty naturally-occurring amino acids added as the first amino acid or the fifth amino acid of the pentapeptide or between two amino acids of the sequence of the tetrapeptide, and (d) identifying the sequences of the pentapeptides having a binding score which meets a second threshold.
- the method further comprises the steps of: (e) determining a binding score of each hexapeptide of a third peptide library for the first binding site of the target protein, wherein the third peptide library comprises a set of hexapeptides having a unique combination of six amino acids comprising the amino acids of the sequence of a pentapeptide identified in step (d) and one of the twenty naturally-occurring amino acids added as the first amino acid or the sixth amino acid of the hexapeptide or between two amino acids of the sequence of the pentapeptide and (f) identifying the sequences of the hexapeptides having a binding score which meets a third threshold.
- the method further comprises the steps of: repeating at least steps (a) to (d) wherein, for each determining step, a binding score for the second binding site of the target protein is determined for each tetrapeptide of the first peptide library and for each pentapeptide of the second peptide library.
- the method further comprises repeating these steps, wherein for each determining step, a binding score for the second binding site of the target protein is determined for each hexapeptide of the third peptide library.
- the present disclosure also provides a computer-readable storage medium having stored thereon machine-readable instructions executable by a processor, comprising
- the present disclosure also provides a method of manufacturing a clamp peptide comprising the structure A B-A 2 wherein is a first peptide arm that binds to a first binding site of a target protein, A 2 is a second peptide arm that binds to a second binding site of the target protein, and B is a bridge peptide which links to A 2 .
- the method comprises (i) designing the first peptide arm and the second peptide arm of the clamp peptide according to any one of the presently disclosed methods for designing a clamp peptide and (ii) joining the first peptide arm to the second peptide arm with a bridge peptide, B.
- a clamp peptide comprising the structure A B-A 2 wherein A ⁇ is a first peptide arm that binds to a first binding site of a target protein, A 2 is a second peptide arm that binds to a second binding site of the target protein, and B is a bridge peptide which links A ⁇ to A 2 , wherein the sequences of A ⁇ and A 2 are designed by any one of the presently disclosed methods for designing a clamp peptide.
- Figure 1 A is a computer-generated illustration of the target protein (ZIVA envelope protein) and the two active sites chosen for docking the peptide libraries. The larger blue shapes represent the electrostatic surfaces of the active sites.
- Figure 1 B is a schematic representation of the clamp peptide structure made by two peptide arms of five or six amino acids in length linked by a bridge peptide.
- Figure 1 C is a computer-generated illustration of the target protein of Figure 1 A with two peptides (outlined in orange) docked in the two active sites. The larger blue shaded regions represent the electrostatic surfaces of the active sites. The length (in angstrom) between the two docked peptides is shown (19.31 A or 1 .931 nm) in yellow.
- Figure 1 D is a computer-generated illustration of a bridge peptide designed and minimized in zwitterionic mode.
- Figure 2A is a graph of the binding score (kcal/mol) of the three peptides libraries (first library of tetrapeptides (blue); second peptide library of pentapeptides (orange); third peptide library of hexapeptides (gray)) docked in the active site 1 (Asn154), showing the typical distribution of scores obtained in the simulations.
- the binding score data were sorted in ascending order of binding score, thus not necessarily a correspondence must exist between the positions of the peptides in each curve.
- Figure 2B is a schematic representation of the semi-combinatorial approach for obtaining penta- and hexapeptide libraries. The semi- combinatorial approach is based on generating different cycles of peptide libraries by maximizing the recognition properties of amino acid motif between the ZIKV binding site and the other flaviviruses binding sites.
- Figure 3 is a series of graphs depicting the sigmoidal ZIKV particles concentration response trend.
- Y axis D absorbance (450nm);
- X-axis log [ZIKV], copies/mL.
- Figure 4 is a graph of the D spectrophotometric absorbance signals obtained in a cross-reactivity study using the ELISA direct assay for the best three clamp peptides (C3, C6 and C7) and two mono-arm peptides (P2 and X1 ) binding the ZIKV target protein (yellow) and three serotypes of DENV virus (DENV-1 (blue), DENV-2 (orange) and DENV-3 (grey)) at the concentration of 10 5 [ZIKV] copies/mL.
- Figure 5 is a graph of the D spectrophotometric absorbance signals obtained in a study using the ELISA direct assay for the best three clamp peptides (C3, C6 and C7) and two mono-arm peptides (P2 and X1 ) binding the ZIKV target protein at a concentration of 10 5 copies/ml (blue) or 10 6 copies/mL (orange) in the presence of buffer ( * ), serum (s), or urine (u).
- Figure 6A is a schematic of an exemplary embodiment 101 of a system 100 for designing a clamp peptide.
- Figure 6B is a schematic of base routines that may be stored on non-volatile memory of the system 100 of Figure 6A.
- Figure 6C is a schematic of exemplary sub-routines for the base routines of Figure 6B.
- the present disclosure provides a method implemented by a processor in a computer for designing a clamp peptide.
- Clamp peptides as further described herein, represent a new class of antibody mimetics.
- the clamp peptide comprises the structure A1-B-A2 wherein is a first peptide arm that binds to a first binding site of a target protein, A 2 is a second peptide arm that binds to a second binding site of the target protein, and B is a bridge peptide which links to A 2 .
- each of A ⁇ and A 2 is a peptide of about 5 to about 7 amino acids, e.g., about 5 amino acids, about 6 amino acids, or about 7 amino acids.
- the bridge peptide (B) is about 5 to about 10 amino acids in length (e.g., about 5 amino acids, about 6 amino acids, about 7 amino acids, about 8 amino acids, about 9 amino acids, or about 10 amino acids in length.
- the amino acids of the clamp peptide, or the first peptide arm, the second peptide arm, or the bridge peptide thereof are naturally-occurring or coded or non-naturally occurring or non-coded.
- Non-naturally occurring amino acids refer to amino acids that do not naturally occur in vivo but which, nevertheless, can be incorporated into the peptide structures described herein.
- Non-coded refers to an amino acid that is not an L-isomer of any of the following 20 amino acids: Ala, Cys, Asp, Glu, Phe, Gly, His, lie, Lys, Leu, Met, Asn, Pro, Gin, Arg, Ser, Thr, Val, Trp, Tyr.
- Naturally-occurring or coded amino acids include the L-isomers of the amino acids: Ala, Cys, Asp, Glu, Phe, Gly, His, lie, Lys, Leu, Met, Asn, Pro, Gin, Arg, Ser, Thr, Val, Trp, Tyr.
- non-naturally occurring or non-coded amino acids include but are not limited to, any D or L isomers of the non-naturally occurring or non-coded amino acids selected from the group consisting of: beta-alanine, N-alpha-methyl-alanine (Me-Ala), aminobutyric acid (Abu), gamma- aminobutyric acid, aminohexanoic acid (epsilon-Ahx), aminoisobutyric acid (Aib),
- aminomethylpyrrole carboxylic acid aminopiperidinecarboxylic acid, aminoserine (Ams), aminotetrahydropyran-4-carboxylic acid, arginine N-methoxy-N-methyl amide, beta-aspartic acid (beta-Asp), azetidine carboxylic acid, 3-(2-benzothiazolyl)alanine, alpha-ferf-butylglycine, 2- amino-5-ureido-n-valeric acid (citrulline, Cit), beta-Cyclohexylalanine (Cha), acetamidomethyl- cysteine, diaminobutanoic acid (Dab), diaminopropionic acid (Dpr), dihydroxyphenylalanine (DOPA), dimethylthiazolidine (DMTA), gamma-Glutamic acid (gamma-Glu), homoserine (Hse), hydroxyproline (Hyp), isoleucine N-meth
- the clamp peptide, or the first peptide arm, the second peptide arm, or the bridge peptide thereof, described herein are glycosylated, amidated, carboxylated, phosphorylated, esterified, N-acylated, cyclized via, e.g., a disulfide bridge, or converted into a salt (e.g., an acid addition salt, a basic addition salt), and/or optionally dimerized, multimerized, or polymerized, or conjugated.
- a salt e.g., an acid addition salt, a basic addition salt
- the method implemented by a processor in a computer for designing a clamp peptide comprises the steps of: (a) determining a binding score of each tetrapeptide of a first peptide library for the first binding site of the target protein, wherein the first peptide library comprises a set of tetrapeptides having a unique combination of four amino acids of the twenty naturally-occurring amino acids, optionally, wherein the first peptide library comprises the full combinatorial set of 160,000 (160K) tetrapeptides having a unique
- the second peptide library comprises a set of pentapeptides having a unique combination of five amino acids comprising the amino acids of the sequence of a tetrapeptide identified in step (b) and one of the twenty naturally-occurring amino acids added as the first amino acid or the fifth amino acid of the pentapeptide or between two amino acids of the sequence of the tetrapeptide, and (d) identifying the sequences of the pentapeptides having a binding score which meets a second threshold.
- the method further comprises the steps of: (e) determining a binding score of each hexapeptide of a third peptide library for the first binding site of the target protein, wherein the third peptide library comprises a set of hexapeptides having a unique combination of six amino acids comprising the amino acids of the sequence of a pentapeptide identified in step (d) and one of the twenty naturally-occurring amino acids added as the first amino acid or the sixth amino acid of the hexapeptide or between two amino acids of the sequence of the pentapeptide and (f) identifying the sequences of the hexapeptides having a binding score which meets a third threshold.
- the method further comprises the steps of: repeating at least steps (a) to (d) wherein, for each determining step, a binding score for the second binding site of the target protein is determined for each tetrapeptide of the first peptide library and for each pentapeptide of the second peptide library.
- the method further comprises repeating these steps, wherein for each determining step, a binding score for the second binding site of the target protein is determined for each hexapeptide of the third peptide library.
- the method further comprises (I) determining a binding score of each heptapeptide of a fourth peptide library for the first binding site of the target protein, wherein the fourth peptide library comprises a set of heptapeptides having a unique combination of seven amino acids comprising the amino acids of the sequence of a hexapeptide identified in step (f) and one of the twenty naturally-occurring amino acids added as the first amino acid or the seventh amino acid of the hexapeptide or between two amino acids of the sequence of the hexapeptide and (II) identifying the sequences of the heptapeptides having a binding score which meets a fourth threshold.
- the fourth peptide library comprises a set of heptapeptides having a unique combination of seven amino acids comprising the amino acids of the sequence of a hexapeptide identified in step (f) and one of the twenty naturally-occurring amino acids added as the first amino acid or the seventh amino acid of the hexapeptide or between
- the method further comprises the steps of: repeating at least steps of the method, wherein, for each determining step, a binding score for the second binding site of the target protein is determined for each tetrapeptide of the first peptide library and for each pentapeptide of the second peptide library.
- the method further comprises repeating these steps, wherein for each determining step, a binding score for the second binding site of the target protein is determined for each hexapeptide of the third peptide library and for each heptapeptide of the fourth peptide library.
- the method does not further comprise determining a binding score of each heptapeptide of a fourth peptide library.
- the third peptide library is the last peptide library for which binding scores are determined.
- the method in some aspects comprises generating the first peptide library.
- the first peptide library comprises 160,000 unique tetrapeptides, optionally, the full combinatorial set of 160,000 tetrapeptides having a unique combination of four amino acids of the twenty naturally-occurring amino acids.
- the method in some aspects comprises generating the second peptide library.
- the second peptide library comprises pentapeptides having a sequence based on the sequences of the tetrapeptides (post-application of the first threshold) and another amino acid.
- the second peptide library comprises a set of pentapeptides having a unique combination of five amino acids comprising the amino acids of the sequence of a tetrapeptide identified in step (b) and one of the twenty naturally-occurring amino acids added as the first amino acid or the fifth amino acid of the pentapeptide or between two amino acids of the sequence of the tetrapeptide.
- the method in some aspects comprises generating the third peptide library.
- the third peptide library comprises hexapeptides having a sequence based on the sequences of the pentapeptides (post-application of the second threshold) and another amino acid.
- the third peptide library comprises a set of hexapeptides having a unique combination of six amino acids comprising the amino acids of the sequence of a pentapeptide identified in step (d) and one of the twenty naturally-occurring amino acids added as the first amino acid or the sixth amino acid of the hexapeptide or between two amino acids of the sequence of the pentapeptide.
- the first threshold is a binding score within the top 5% of binding scores (e.g., top binding scores mean the strongest binding between peptide and active site of target protein). In various aspects, approximately 8000 unique sequences of tetrapeptides are identified upon application of the first threshold. In certain aspects, the first threshold is a binding score within the top 5% of binding scores and a binding score outside the top 5% of binding scores for a different target protein. In some instances, about 1000 unique sequences of tetrapeptides are upon application of the first threshold. In exemplary instances, the second threshold is the same as the first threshold. In alternative instances, the first threshold is different from the second threshold. In exemplary instances, the third threshold is the same as the first threshold or the second threshold.
- the third threshold is different from the second threshold and/or first threshold.
- the third threshold is a binding score within the top 5% of binding scores.
- the third threshold further comprises a binding score outside the top 5% of binding scores for a different target protein.
- the binding scores are determined using a molecular docking program, optionally, a molecular docking program that is based on multi-conformer rigid body docking, which evaluates several conformers per peptide.
- the binding scores in some aspects are calculated using a docking scoring function.
- one or more of LUDI, Chemscore, chemgauss4, DOCK, FlexX, Gold, Pmf, Score, Fresno, AutoDock Vina, Dynadock, LigScore, Rosetta FlexPepDock are used for determining and/or calculating the binding score.
- Such docking scoring functions are known in the art. See, e.g., Logean et al., Bioorganic and Medicinal Chem Lett 1 1 (5): 675-679 (2001 ), Trott et al., J Computational Chem
- the binding scores for more than 350,000 peptides are determined for each of the first binding site and the second binding site.
- the method in certain instances further comprises determining the length of B of the clamp peptide, optionally, by measuring the distance between a peptide bound to the first binding site of the target protein and a peptide bound to the second binding site of the target protein.
- the bridge peptide of the clamp peptide in various aspects is designed based on the length determined.
- the present disclosure also provides a computer-readable storage medium having stored thereon machine-readable instructions executable by a processor, comprising instructions for carrying out the steps of any one of the presently disclosed methods for designing a clamp peptide.
- Fig. 6A illustrates an exemplary embodiment 101 of a system 100 for designing a clamp peptide.
- the system 100 may include one or more client devices 102, a network 104, and/or a network-accessible database 108.
- Each client device 102 may be communicatively coupled to the network 104 by one or more wired or wireless network connections 1 12, which may be, for example, a connection complying with a standard such as one of the IEEE 802.1 1 standards (“Wi-Fi”), the Ethernet standard, or any other appropriate network connection.
- Wi-Fi IEEE 802.1 1 standards
- the database 108 may be communicatively coupled to the network 104 via one or more connections 1 14. (Of course, the database could alternatively be internal to one or more of the client devices 102.)
- the database 108 may store the sequences of each peptide of a peptide library, optionally, the sequences of each tetrapeptide of the first peptide library, the sequences of each pentapeptide of the second peptide library, the sequences of each hexapeptide of the third peptide library.
- the database 108 additionally or alternatively may store the binding score of each peptide for each peptide library, and/or the identity of the peptide(s) which meet the threshold, and/or the thresholds themselves.
- the network 104 may be a local area network (LAN) or a wide- area network (WAN). That is, network 104 may include only local (e.g., intra-organization) connections or, alternatively, the network 104 may include connections extending beyond the organization and onto one or more public networks (e.g., the Internet).
- the client device 102 and the database 108 may be within the network operated by a single company (Company A).
- the client device(s) 102 may be on a network operated by Company A, while the database 108 may be on a network operated by a second company (Company B), and the networks of Company A and Company B may be coupled by a third network such as, for example, the Internet.
- Company A a network operated by Company A
- Company B a second company
- Company A and Company B may be coupled by a third network such as, for example, the Internet.
- the client device 102 includes a processor 128 (CPU), a RAM 130, and a non-volatile memory 132.
- the non-volatile memory 132 may be any appropriate memory device including, by way of example and not limitation, a magnetic disk (e.g., a hard disk drive), a solid state drive (e.g., a flash memory), etc.
- the database 108 need not be separate from the client device 102. Instead, in some embodiments, the database 108 is part of the non-volatile memory 132 and the data 122, 124, 126 may be stored as data within the memory 132.
- the database may comprise, for instance, the sequences of each tetrapeptide of the first peptide library.
- the database may further comprise claim data 122 which comprises, for example, the binding scores for each of the tetrapeptides of the first peptide library.
- the database may comprise the sequences of each pentapeptide of the second peptide library and the binding scores for each of the pentapeptides of the second library.
- the database in some aspects comprises the sequences of each hexapeptide of the third peptide library and the binding scores for each of the hexapeptides of the third library.
- the benchmark data 124 may comprise binding scores for commercial antibodies to the target protein, which may serve as a benchmark against which the binding scores of the peptides of the peptide libraries may be compared.
- the data 122 may be included as data in a spreadsheet file stored in the memory 132, instead of as data in the database 108.
- the memory 132 stores program data and other data necessary to analyze data (e.g., binding scores) of one or more peptide libraries, determine binding scores, identify the sequences of the peptides which meet a threshold, the thresholds themselves.
- the memory 132 stores a first routine 134, a second routine 136, and a third routine 138.
- the first routine 134 may determine binding scores of each tetrapeptide of a first peptide library for a binding site of the target protein.
- the second routine 136 may compute one or more statistical parameters of the binding scores collected by the first routine 134, and/or apply a threshold to the binding scores to identify the tetrapeptides (e.g., identify the sequences of the tetrapeptides) meeting that threshold.
- the third routine 138 may, for example, generate the sequences of pentapeptides of the second peptide library based on the sequences of the tetrapeptides that met the threshold, as identified by the second routine.
- the memory 132 stores a first routine 134, a second routine 136, and a third routine 138, as outlined above, and additionally stores a fourth routine, a fifth routine, and a sixth routine.
- the fourth routine may determine binding scores of each pentapeptide of the second peptide library generated by the third routine for a binding site of the target protein.
- the fifth routine may compute one or more statistical parameters of the binding scores collected by the fourth routine 134, and/or apply a threshold to the binding scores to identify the pentapeptides (e.g., identify the sequences of the pentapeptides) meeting that threshold.
- the sixth routine may generate the sequences of hexapeptides of the third peptide library based on the sequences of the pentapeptides that met the threshold, as identified by the fifth routine.
- the memory 132 may store additional routines for subsequent cycles of determining binding scores of peptides of a peptide library, applying a threshold to identify peptides that meet the threshold, and generating the sequences of the peptides of the next peptide library using the sequences of the peptides meeting the threshold (as identified by the immediately prior routine).
- the memory 132 may store a seventh routine, eighth routine, and ninth routine, wherein the seventh routine may determine binding scores of each hexapeptide of a third peptide library for a binding site of the target protein, the eighth routine may compute one or more statistical parameters of the binding scores collected by the seventh routine, and/or apply a threshold to the binding scores to identify the hexapeptides (e.g., identify the sequences of the hexapeptides) meeting that threshold.
- the memory 132 may store the routines described in Figures 6B and 6C.
- each of the routines is executable by the processor 128 and comprises a series of compiled or compilable machine-readable instructions stored in the memory 132. Additionally, the memory 132 may store generated reports or records of data output by one of the routines 134 or 136. Alternatively, the reports or records may be output to the database 108.
- One or more display/output devices 140 e.g., printer, display, etc.
- one or more input devices 142 e.g., mouse, keyboard, tablet, touch-sensitive interface, etc.
- client device 102 may also be coupled to the client device 102, as is generally known.
- the network 104 may include but is not limited to any combination of a LAN, a MAN, a WAN, a mobile, a wired or wireless network, a private network, or a virtual private network.
- a LAN local area network
- MAN metropolitan area network
- WAN wide area network
- mobile wide area network
- wired or wireless network a local area network
- private network a wide area network
- virtual private network a virtual private network.
- clients 102 are illustrated in Fig. 6A to simplify and clarify the description, it is understood that any number of client computers are supported and can be in communication with one or more servers (not shown).
- Routines may constitute either software routines (e.g., code embodied on a machine-readable medium or in a transmission signal) or hardware routines.
- a hardware routine is tangible unit capable of performing certain operations and may be configured or arranged in a certain manner.
- one or more computer systems e.g., a standalone, client or server computer system
- one or more hardware routines of a computer system e.g., a processor or a group of processors
- software e.g., an application or application portion
- the methods or routines described herein may be at least partially processor-implemented. For example, at least some of the operations of a method may be performed by one or processors or processor-implemented hardware modules.
- processors may be distributed among the one or more processors, not only residing within a single machine, but deployed across a number of machines.
- the processor or processors may be located in a single location (e.g., within a home environment, an office environment or as a server farm), while in other embodiments the processors may be distributed across a number of locations.
- the performance of certain of the operations may be distributed among the one or more processors, not only residing within a single machine, but deployed across a number of machines.
- the one or more processors or processor- implemented modules may be located in a single geographic location (e.g., within a home environment, an office environment, or a server farm). In other example embodiments, the one or more processors or processor-implemented modules may be distributed across a number of geographic locations.
- Coupled along with their derivatives.
- some embodiments may be described using the term“coupled” to indicate that two or more elements are in direct physical or electrical contact.
- the term“coupled,” however, may also mean that two or more elements are not in direct contact with each other, but yet still co-operate or interact with each other.
- the embodiments are not limited in this context.
- the terms“comprises,”“comprising,”“includes,”“including,”“has,” “having” or any other variation thereof are intended to cover a non-exclusive inclusion.
- a process, method, article, or apparatus that comprises a list of elements is not necessarily limited to only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus.
- “or” refers to an inclusive or and not to an exclusive or. For example, a condition A or B is satisfied by any one of the following: A is true (or present) and B is false (or not present), A is false (or not present) and B is true (or present), and both A and B are true (or present).
- the clamp peptide comprises the structure A B-A 2 wherein is a first peptide arm that binds to a first binding site of a target protein, A 2 is a second peptide arm that binds to a second binding site of the target protein, and B is a bridge peptide which links to A 2 .
- the method comprises (I) designing the first peptide arm and the second peptide arm of the clamp peptide according to any one of the presently disclosed methods implemented by a processor in a computer for designing a clamp peptide and (II) joining the first peptide arm to the second peptide arm with a bridge peptide, B.
- the method further comprises assaying the binding of the clamp peptide to the target protein.
- Suitable methods of assaying peptide-protein binding include for instance assays based on Western blotting, an enzyme-linked immunosorbent assay (ELISA), a radioimmunoassay (RIA), and immunohistochemical assay, and ligand binding assays (LBAs) which are described in O’Hara et al., AAPS J 14(2): 316-328 (2012).
- clamp peptides manufactured by any one of the presently disclosed methods of manufacturing a clamp peptide are further provided herein.
- the following examples are given merely to illustrate the present invention and not in any way to limit its scope.
- This example describes an exemplary method of designing clamp peptides, a new generation of antibody mimetics.
- a new class of antibody mimetics is proposed. This is a generic universal approach that can be used against any type of antigen.
- the antibody mimetics were designed connecting three short peptides mimicking a clamp with two arms and one bridge.
- the peptide arms were computationally designed using a virtual approach based on generating different docking cycles of tetra, penta, hexapeptide libraries by maximizing the recognition properties of amino acid motifs between the ZIKV envelope protein active sites and other flaviviruses binding sites (DENV, Chikungunya and West Nile).
- the peptide bridge connecting the two arms, was made with glycine for spacing, proline for bending and two cysteines for biotin functionalization.
- Three clamp peptides assay showed a detection limit one or two order of magnitude lower (around 3.16 * 10 3 [ZIKV] copies/mL) then the antibody or mono-arm peptides with a dynamic range from 10 4 to 10 7 copies/mL of intact ZIKV particles.
- Synthetic clamp peptides showed low coefficient of variation ( ⁇ 5%) and a good inter-day and batch to batch
- clamp peptides The sensitivity of the clamp peptides was tested in the presence of two biological matrices, urine and serum diluted 1 :1 and 1 :10, respectively.
- the detection limits of clamp peptides decreased about one order of magnitude for ZIKV detection in urine or serum, with a distinct analytical signal starting from 10 5 copies/mL of ZIKV.
- ZIKV infection has emerged as a global public health problem because of its ability to cause severe congenital disease and affect a large population (loos et al., 2014; Weaver et al., 2016).
- ZIKV infection is known to cause neurological problems to pregnant women and potentially cause microcephaly and other congenital malformations and diseases to the unborn child.
- ZIKV affects, both male and females and it has been reported that the virus can be transmitted sexually through semen and vaginal fluids.
- the ZIKV virus is a mosquito- borne flavivirus, and due to the lack of specific antibodies/binders that can be used for diagnosis of the disease, the current bioassays present cross-reactivity with other flaviviruses and arboviruses. It is well established that ZIKV has many common genetic sequences and protein structures with other flaviviruses, like DENV, West Nile virus or Chikungunya (Barba-Spaeth et al., 2016; Heffron et al., 2018). This limits the use of immunoassays for the detection of human pathogens within the flavivirus genus (Priyamvada et al., 2016; Stettler et al., 2016). Thus, there is a need for highly selective binders for ZIKV that can be employed in diagnostics and health status assessment of patients suffering from ZIKV.
- the flavivirus envelope protein is responsible for virus entry and represents a major target for neutralizing antibodies.
- the ZIKV virus structure is similar to other known flaviviruses structures except for the ⁇ 10 amino acids that surround the Asn-154 glycosylation site found in each of the 180 envelope glycoproteins that make up the icosahedral shell (Zhao et al., 2016).
- the in-silico screening technique was based on a semi-combinatorial approach by designing peptides that could wrap around the glycosylation site such as to clamp the peptide in place. Different docking cycles of peptide libraries were generated by maximizing the recognition properties of the amino acid motif between the ZKV glycosylation site and the other flaviviruses glycosylation binding sites (DENV, Chikungunya and West Nile). A total library of around three million peptides was tested in-silico. [0066] Peptides as antibody mimetic elements in diagnostic methods were recently reviewed highlighting the features desired to outperform antibodies with regard to binding affinities, cellular and tumor penetration, large-scale production, temperature, and pH stability (Yu et al.,
- Linear peptides were selected by phage display to detect norovirus using an ELISA protocol or by means of an impedance biosensor (Hwang et al., 2017; Palzkill et al., 2018).
- Synthetic peptides are more resistant to physicochemical stress, more reproducible and less expensive when compared with antibodies so even if they show less specificity can be used as an array giving synergetic contribution to the detection.
- Direct ELISA was chosen as the experimental protocol to check the performances of the clamp peptides. ELISA was preferred to other analytical techniques because it provides automated steps to speed-up the screening of a large number of experimental trials.
- Peptides were designed in zwitterionic mode, using only the 20 natural amino acids, adding hydrogens, using molecular mechanics method amber, with the algorithm“Steepest Descents” converging at 0.08 Kj mol-1 in 32767 as maximum of cycles. Minimization, conformers generation and docking were carried out using Open Eye Scientific Software package under academic license. Each peptide library was compacted in a single file and fast minimized in gas phase to reduce computing time. In this context, solvent condition did not change significantly the results. The energy minimization process was carried out using SZYBKI 1 .5.7 in its default parameterization (SZYBKI, version 1 .5.7).
- the envelope proteins taken as the receptors, were downloaded from the protein data bank web site.
- the envelope proteins were from the flavivirus species ZIKV DENV, Chikungunya and West Nile having respectively the following codes in the Protein Data Bank web site: 5IRE, 4UTC, 3N40 and 3I50. All residues and water molecules were removed from the envelope proteins pdb files. For each envelope protein two dedicated boxes were generated, one enfolding the glycosylation site the other in the closet cavity from the glycosylation site.
- the active site box along with the Multi-conformer rigid body docking were carried out using OEDocking 3.0.0 (Kelley et al., 2015; OEDocking, version 3.0.0).
- Multi-conformer rigid body docking was run using Chemgauss4 as scoring function.
- the Chemgauss4, a modification of the Chemgauss3, was the latest scoring function from OpenEye software with improved hydrogen bonding and metal chelator functions.
- the total score obtained was the sum of steric, acceptor/metal, donor and aromatic contributions.
- the time required for docking a peptide library was about 24 hours.
- the eight clamp peptides and the four mono-arm peptides were purchased from Biomatik (http://www.biomatik.com). Cysteines within the peptide structures were used to bind maleimide-PEG2-biotin. All peptides were provided with a purity >85%.
- Lyophilized peptides were diluted at 1 mM concentration in 10mM phosphate buffered saline (PBS) pH 7.4, divided into 100 mI_ aliquots and stored at -30° C for further use.
- PBS phosphate buffered saline
- TCEP trialkylphosphine
- Intact particles both ZIKV and DENV were provided by Dr. Watkins group (University of Miami, Dep. of Pathology). The samples were controlled and counted by focus forming assay and RT-PCR, the details were reported in a previous work (Magnani et al., 2017). The Intact particles of ZIKV were inactivated using gamma irradiation. Assay biohazardous steps were carried out according to standard safety procedures.
- the intact virus particles were removed by washing five times with the washing buffer (PBST) 10 mM PBS pH 7.4, 0.1% Tween-20, using an automated plate washer (MultiWash+, Molecular Devices, Sunnyvale, CA). Then, the plates were blocked with 200 pL of blocking buffers while shaking at 300 rpm at room temperature.
- PBST washing buffer
- Tween-20 0.1% Tween-20
- the blocking buffers used were: PierceTM Protein-Free (PBS) Blocking Buffer (PF), BlockerTM BLOTTO in TBS, SuperBlockTM Blocking Buffer, BlockerTM BSA (1X) in PBS. All blocking buffer were from ThermoFisher Scientific.
- the envelope protein responsible for virus entry, has very similar structure in all flavivirus.
- ZIKV differs from other known flavivirus by only ⁇ 10 amino acids that surround the Asn154 glycosylation site found in each of the 180 envelope glycoproteins (Sirohi et al., 2016).
- the first active site box defining the general space of the protein where peptides are expected to bind, was designed around the amino acid Asn 154 in the 5IRE and the amino acids Asn 153, Asn 140 and Asn 134 respectively in the 4UTC, 3N40 and 3I50.
- the Second active site box was built inside the closest cavity from the first active site, around the amino acid His 323 in the 5IRE and the amino acids Val 354, His 331 and His 285 respectively in the 4UTC, 3N40 and 3I50.
- Figure 1 B showed the schematic representation of the clamp peptide structure made by two peptide arms of five or six amino acids in length linked by a bridge peptide having as sequence GPGCCGPG (SEQ ID NO: 32).
- the length in angstrom (1.931 nm) between two peptides docked in the two active sites chosen within the envelope protein was also calculated (Figure 1 C) in order to have an idea of the length needed to link the two arm peptides with a bridge peptide.
- the bridge peptide chosen (GPGCCGPG; SEQ ID NO: 32) had a length after minimization of 1.956 nm ( Figure 1 D), approximately the distance between the two arm peptides.
- the two arms of the clamp peptide should have enough flexibility to bind each of the two active site in synergic combination.
- the docking process was run in 3 steps. In each step a peptide library was generated by using an incremental construction approach. In every subsequent iteration, a focused library of peptides of increasing complexity, was built on previous iteration results. The first peptide library docked was made by the entire 160k possible tetrapeptide combinations of the 20 natural amino acids.
- the criterion of the selection was to choose the peptides inside the top 5% peptides binding the ZIKV active site and concurrently outside the top 5% peptides binding the other flaviviruses binding sites.
- the meaning of the selection was to maximize the recognition properties of the amino acid motif between the ZIKV binding site and the other flaviviruses binding sites (DENV, West Nile and Chikungunya). The 5 % was selected as cutoff because in all simulations, this value delimited the zone of the curve in which the steeper slope change was observed (Figure 2A).
- the second step was the generation of the pentapeptide library by inserting each of the 20 natural amino acids in every position of the 1 k tetrapeptides selected in the previous step.
- the hexapeptide library was built by using the same semi- combinatorial approach carried out in the first step but selecting the best 1 k pentapeptides. A total of 380k peptides were docked in each active site box.
- Table 1 reports the statistical summary of the binding scores calculated for the three libraries of peptides towards the ZIKV envelope protein.
- the score values were calculated using chemgauss4 scoring function and, thus, lower values represented higher protein-peptide affinity.
- the peptide size played a critical role for the active site 1 interaction, with lower values decreasing drastically from pentapeptides to hexapeptides, but not for active site 2.
- the docking results were used to select the arms of the clamp peptides for the experimental part. This selection was based on the peptide primary structure structural analysis results along with the position in the top ranked peptides and peptide length. [00112] The primary structural analysis of the docking results was carried out to study the occurrence of the amino acids in the top 0.1% ranked peptides that maximized the recognition properties between the ZIKV active site and the other flaviviruses binding sites (DENV, West Nile and Chikungunya).
- One peptide with high and one with low occurrence amino acids in primary structure were then chosen within the 0.1 % top ranked peptides binding each of the two envelope protein active sites.
- the four peptides were selected from both penta and hexapeptide libraries, resulting in a total of 8 peptides, four pentapeptides and four hexapeptides.
- Table 2 reports the results of the amino acid occurrence (%) in the primary structure of the eight peptides selected to build the clamp peptides. The occurrence was calculated counting the recurrence of each amino acid in the relative position (five positions for pentapeptides and six positions for hexapeptides) in the top 0.1% ranked peptides binding the active site 1 and 2 of the ZIKV envelope protein. The best occurrence amino acids were also reported, but peptides having all best occurrence amino acids were not present or were in the bottom of the 0.1% top rank peptides.
- the other pentapeptide LRGHA (SEQ ID NO: 25) had amino acids with about half percentage of occurrence when compared to the top one in almost all the five positions.
- the hexapeptide KTDAYS (SEQ ID NO: 27) showed and alternate low and high occurrence percentage.
- the penta and hexapeptides selected from the active site 2 docking had similar high and low average occurrence amino acids in primary structure.
- the hexapeptides WPWIGT (SEQ ID NO: 18) and MDSPIK (SEQ ID NO: 22) had respectively the highest and lowest amino acids occurrence (40.8% and 1.7%) of the peptides selected to build the clamp peptides.
- Table 3 shows the relative docking score position in the corresponding libraries of the eight peptides chosen to build the arms of the clamp peptides tested in the experimental evaluation.
- the ranking scores between the ZIKV sites and the other flaviviruses binding sites were for all peptides different enough to expect a ZIKV selective binding. Nevertheless, only three peptides ranked in the first 10 best peptides in binding ZIKV, highlighting strong similarities between the flaviviruses glycosylation sites. It should be noted that in all peptide libraries the simulated binding energy decreased exponentially in the top 1% best ranked peptides, in fact, a decrease of at least 20% in the binding score is observed for the 100 th peptide position.
- Table 4 reports the physicochemical properties of the peptides selected for experimental evaluation.
- the peptides were the eight clamp peptides built using the combination of the penta or hexapeptides, and the mono-arm peta and hexapeptides binding the ZIKV envelope protein glycosylation binding site. Cysteine was inserted in the bridge of clamp peptides and at the N-terminus of mono-arm peptides to bind the maleimide-PEG2-biotin used to label each of the peptides with the signal amplifier streptavidin-HRP.
- the peptides were biotinylated by using the maleimide-PEG2-biotin reagent that reacts with the sulfhydryl group of the cysteine efficiently and specifically by forming a stable thioether bond.
- the antibody 4G2 hybridoma mouse lgG2a was used as the reference and employed in combination with an anti-mouse IgG conjugated to HRP.
- Clamp peptides showed higher sensitivity than mono-arm peptides.
- Clamp peptides C5, C7 and C8 showed the best performances when used at 5 mM.
- the clamp C1 , C2 and C4 at concentration of 2 mM did the best signal to noise ratio.
- clamp peptides C3 and C6 had be used at concentration of 0.5 mM to have the best results, highlighting the remarkable high sensitivity of those peptides in this kind of assay.
- the results had a sigmoidal ZIKV particles concentration response and the calibration curves were obtained by plotting the delta absorbance (after blank signal subtraction) against the log of ZIKV particles concentration and fitting the experimental data with a four- parameter logistic function (FPLR).
- FPLR four- parameter logistic function
- the peptide based assay using C3, C6, and C7 showed three-order of magnitude dynamic range and lower detection limits with dynamic range starting from 10 copies/mL one or two order magnitude lower than the others peptides or antibody based assay.
- the better performance in binding ZIKV intact particles by those three clamp peptides was also highlighted by the FPLR C50 parameter.
- the dose-response performance of the assay was reproducible over a month (RSD lower than 15%), demonstrating that the peptides had high stability and reproducibility.
- FIG. 4 depicts the ELISA data using solutions of peptides with or without the urine and serum obtained coating clear 96-well plates with 10 5 and 10 ® copies/mL of intact ZIKV particles.
- Urine and serum were 1 :1 and 1 :10 diluted, respectively, with a concentrated peptide PBS solution (10 mM PBS, pH 7.4) to obtain a peptide final concentration of 0.5 mM for C3 and C6, 5 mM for C7 and 20 mM for mono-arm peptides (P2 and X1 ).
- Clamp peptide C3 showed a better performance in urine than serum, particularly for 10 5 copies/mL.
- Clamp peptide C6 had a strong decrease in the signal generated at both I C or 10 ® copies/mL in both urine and serum.
- Clamp peptide C7 exhibited the best performance among the three clamp peptides, having higher signals in serum than in urine. All clamp peptides lost at the least one order of magnitude signal detecting ZIKV in urine or serum, except for C7 in serum, having a distinct analytical signal also at 10 5 copies/mL of ZIKV.
- the ELISA assay platform developed for testing the newly designed clamp peptides offered the possibility to optimize in short time the experimental conditions for evaluation of the eight clamp peptides and four mono-arm peptides chosen from the most promising ones yield by the in-silico studies.
- the three clamp peptides with better performances to detect ZIKV had shown to have also semi-selective properties when tested against DENV.
- the matrix-effect was also investigated, by testing the response of the peptides in physiological matrices, i.e., urine and serum. We observed that the matrix affected the assay performance by decreasing the detection limits by one order of magnitude, albeit still having a distinct analytical signal starting from 10 ® copies/mL, the concentration of ZIKV in acute infection.
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Virology (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Immunology (AREA)
- Peptides Or Proteins (AREA)
Abstract
Provided herein are methods implemented by a processor in a computer for designing a clamp peptide comprising the structure A1-B-A2 wherein A1 is a first peptide arm that binds to a first binding site of a target protein, A2 is a second peptide arm that binds to a second binding site of the target protein, and B is a bridge peptide which links A1 to A2. Also provided herein are computer-readable storage media having stored thereon machine-readable instructions executable by a processor and systems. Related methods of manufacturing a clamp peptide and the clamp peptides manufactures by the methods are provided.
Description
METHODS OF DESIGNING NOVEL ANTIBODY MIMETICS FOR USE IN DETECTING
ANTIGENS AND AS THERAPEUTIC AGENTS
CROSS-REFERENCE TO RELATED APPLICATION(S)
[0001] This application claims priority to U.S. Application No. 16/441 ,646, filed on June 14, 2019, the contents of which are incorporated herein by reference.
INCORPORATION BY REFERENCE OF MATERIAL SUBMITTED ELECTRONICALLY
[0002] Incorporated by reference in its entirety is a computer-readable nucleotide/amino acid sequence listing submitted concurrently herewith and identified as follows: 68,786 byte ASCII (Text) file named "53192A_Seqlisting.txt"; created on November 15, 2019.
BACKGROUND
[0003] Antibodies constitute powerful therapeutic agents characterized by limited side effects due to their ability to specifically target a distinct antigen on a cell, bacteria, virus, or toxin. In 1986, the first therapeutic monoclonal antibody, Orthoclone OKT3, was introduced into the market. Since then, this class of biopharmaceutical products has significantly grown and over 45 monoclonal antibody products had received approval in the U.S. or Europe for the treatment of a variety of diseases, including cancer and inflammatory, cardiovascular, respiratory, and infectious diseases. Though the projected antibody market in the United States is anticipated to surpass the $10 billion mark, the production of such therapeutics is not without limitations. One disadvantage of therapeutic antibodies is the cost of downstream processing to achieve the required high purity levels. Another limiting factor of therapeutic antibodies is the sensitivity of antibody structure to chemical and physical denaturation encountered during post-manufacture filling, shipping and storage. Flarsh chemical environments can affect the antibodies’ binding properties.
[0004] Antibody mimetics have been successfully used in the development of binding assays for the detection of analytes in biological samples, as well as in separation methods, cancer therapy, targeted drug delivery, and in vivo imaging. The recent advances in the field of antibody mimetics and their applications in bioanalytical chemistry, specifically in diagnostics and other analytical methods have been described (Yu et al., Annual Review of Analytical Chemistry 10, 293-320 (2017). Because synthetic peptides are more resistant to
physicochemical stress, characteristically more reproducible, and, ultimately, less expensive to manufacture and commercialize, when compared to antibodies, their use as elements of antibody mimetics capable of binding to ligand analytes in a manner analogous to that of the
antigen-antibody interaction has spurred increased interest in the biotechnology and bioanalytical communities. To produce antibody mimetics that outperform antibodies with regard to binding affinities, cellular and tumor penetration, large-scale production, and temperature and pH stability is a high priority goal of the industry (Yu et al., 2017, supra).
[0005] In recent works, short peptides were used as molecular binders for virus detection. Linear peptides were selected by phage display to detect norovirus using an ELISA protocol or by means of an impedance biosensor (Hwang et al., 2017; Palzkill et al., 2018).
Computationally-designed peptides were used to possibly detect flavivirus. The binding affinity and stability of disulfide cyclic peptide ligands with target Dengue virus (DENV) E glycoprotein were calculated by molecular docking and molecular dynamics simulation, but no experimental evidence was provided (Tambunan et al., 2016). A recent report predicted by molecular docking the structure of short peptides targeting the Zika virus (ZIKV) envelope protein and the interactions between the selected peptides and virus were assessed via a fluorescence-linked sandwich immunosorbent assay (FLISA), and the performance of the peptide-linked sandwich FLISA was evaluated in virus-spiked human serum and urine (Do Thi Hoang Kim et al., 2018).
[0006] While in silico methods of molecular modeling has been used to minimize
experimental problems and facilitate the rational design of experimental protocols (Acebes et al., 2016; Bunker et al., 2016; Singh et al., 2016; Michaeli et al., 2018; Xu et al., 2018), and even though virtual docking is currently an important tool in drug discovery, and a subject of important developments over the last decade (Macalino et al., 2015; Yuan et al., 2017), a number of obstacles still limits the widespread use of molecular modeling for biotechnological applications. One of the most important drawbacks for mainstream use of molecular modeling is the challenge to simulate a huge number of candidates to be designed or/and docked using a full combinatorial approach.
[0007] Thus, improved methods of designing peptides for use in antibody mimetics is needed.
SUMMARY
[0008] Presented herein for the first time is a new methodology, based on an incremental construction approach, for the design and selection of short peptides that function as binding agents capable of selectively detecting target molecules. Herein, an in silico semi-combinatorial peptide screening method for designing the target-binding portions, the arms, of a new class of antibody mimetics, called clamp peptides, is demonstrated. In this virtual peptide screening
method, different docking cycles of peptide libraries were generated and evaluated for binding to a binding site of a target protein. Using this method, the recognition properties of the amino acid motif between target binding and non-specific binding were maximized and allowed for ~ 3 million peptides to be tested in a short period of time. Using this virtual screening method, clamp peptides designed to bind to two different binding sites on the ZIKV envelope protein were designed and subsequently manufactured and tested. As shown through direct ELISA, the performances of the clamp peptides demonstrated beneficial and desired binding activities. As shown by the data presented herein, the arms of the clamp peptides were able to wrap around the glycosylation site such as to clamp the peptide in place.
[0009] The present disclosure provides a method implemented by a processor in a computer for designing a clamp peptide comprising the structure A B-A2 wherein
is a first peptide arm that binds to a first binding site of a target protein, A2 is a second peptide arm that binds to a second binding site of the target protein, and B is a bridge peptide which links
to A2. In exemplary embodiments, the method comprises the steps of: (a) determining a binding score of each tetrapeptide of a first peptide library for the first binding site of the target protein, wherein the first peptide library comprises a set of tetrapeptides having a unique combination of four amino acids of the twenty naturally-occurring amino acids, optionally, wherein the first peptide library comprises the full combinatorial set of 160,000 tetrapeptides having a unique
combination of four amino acids of the twenty naturally-occurring amino acids, (b) identifying the sequences of the tetrapeptides having a binding score which meets a first threshold, (c) determining a binding score of each pentapeptide of a second peptide library for the first binding site of the target protein, wherein the second peptide library comprises a set of pentapeptides having a unique combination of five amino acids comprising the amino acids of the sequence of a tetrapeptide identified in step (b) and one of the twenty naturally-occurring amino acids added as the first amino acid or the fifth amino acid of the pentapeptide or between two amino acids of the sequence of the tetrapeptide, and (d) identifying the sequences of the pentapeptides having a binding score which meets a second threshold. In exemplary embodiments, the method further comprises the steps of: (e) determining a binding score of each hexapeptide of a third peptide library for the first binding site of the target protein, wherein the third peptide library comprises a set of hexapeptides having a unique combination of six amino acids comprising the amino acids of the sequence of a pentapeptide identified in step (d) and one of the twenty naturally-occurring amino acids added as the first amino acid or the sixth amino acid of the hexapeptide or between two amino acids of the sequence of the pentapeptide and (f) identifying the sequences of the hexapeptides having a binding score which meets a third threshold. In
exemplary embodiments, the method further comprises the steps of: repeating at least steps (a) to (d) wherein, for each determining step, a binding score for the second binding site of the target protein is determined for each tetrapeptide of the first peptide library and for each pentapeptide of the second peptide library. Optionally, wherein, when the method further comprises determining a binding score of each hexapeptide of a third peptide library for the first binding site of the target protein, the method further comprises repeating these steps, wherein for each determining step, a binding score for the second binding site of the target protein is determined for each hexapeptide of the third peptide library.
[0010] The present disclosure also provides a computer-readable storage medium having stored thereon machine-readable instructions executable by a processor, comprising
instructions for carrying out the steps of any one of the presently disclosed methods for designing a clamp peptide.
[0011] Further provided is a system comprising machine readable instructions that, when executed by the processor, cause the processor to carry out the steps of any one of the presently disclosed methods for designing a clamp peptide.
[0012] The present disclosure also provides a method of manufacturing a clamp peptide comprising the structure A B-A2 wherein
is a first peptide arm that binds to a first binding site of a target protein, A2 is a second peptide arm that binds to a second binding site of the target protein, and B is a bridge peptide which links
to A2. In exemplary embodiments, the method comprises (i) designing the first peptide arm and the second peptide arm of the clamp peptide according to any one of the presently disclosed methods for designing a clamp peptide and (ii) joining the first peptide arm to the second peptide arm with a bridge peptide, B.
[0013] A clamp peptide comprising the structure A B-A2 wherein A^ is a first peptide arm that binds to a first binding site of a target protein, A2 is a second peptide arm that binds to a second binding site of the target protein, and B is a bridge peptide which links A^ to A2, wherein the sequences of A^ and A2 are designed by any one of the presently disclosed methods for designing a clamp peptide.
[0014] Use of the presently disclosed clamp peptides for detecting target proteins, e.g., antigens, are furthermore provided herein. Also, use of the presently disclosed clamp peptides for treating a subject in need thereof are provided herein.
[0015] Additional descriptions and guidance, as well as exemplification, of the presently disclosed methods are provided herein.
BRIEF DESCRIPTION OF THE DRAWINGS
[0016] Figure 1 A is a computer-generated illustration of the target protein (ZIVA envelope protein) and the two active sites chosen for docking the peptide libraries. The larger blue shapes represent the electrostatic surfaces of the active sites. Figure 1 B is a schematic representation of the clamp peptide structure made by two peptide arms of five or six amino acids in length linked by a bridge peptide. Figure 1 C is a computer-generated illustration of the target protein of Figure 1 A with two peptides (outlined in orange) docked in the two active sites. The larger blue shaded regions represent the electrostatic surfaces of the active sites. The length (in angstrom) between the two docked peptides is shown (19.31 A or 1 .931 nm) in yellow. Figure 1 D is a computer-generated illustration of a bridge peptide designed and minimized in zwitterionic mode. The bridge peptide comprises eight amino acids long and has the amino acid sequence GPGCCGPG (SEQ ID NO: 32). The length in angstrom is shown (19.56 A= 1 .956 nm).
[0017] Figure 2A is a graph of the binding score (kcal/mol) of the three peptides libraries (first library of tetrapeptides (blue); second peptide library of pentapeptides (orange); third peptide library of hexapeptides (gray)) docked in the active site 1 (Asn154), showing the typical distribution of scores obtained in the simulations. The binding score data were sorted in ascending order of binding score, thus not necessarily a correspondence must exist between the positions of the peptides in each curve. Figure 2B is a schematic representation of the semi-combinatorial approach for obtaining penta- and hexapeptide libraries. The semi- combinatorial approach is based on generating different cycles of peptide libraries by maximizing the recognition properties of amino acid motif between the ZIKV binding site and the other flaviviruses binding sites.
[0018] Figure 3 is a series of graphs depicting the sigmoidal ZIKV particles concentration response trend. Y axis= D absorbance (450nm); X-axis= log [ZIKV], copies/mL.
[0019] Figure 4 is a graph of the D spectrophotometric absorbance signals obtained in a cross-reactivity study using the ELISA direct assay for the best three clamp peptides (C3, C6 and C7) and two mono-arm peptides (P2 and X1 ) binding the ZIKV target protein (yellow) and three serotypes of DENV virus (DENV-1 (blue), DENV-2 (orange) and DENV-3 (grey)) at the concentration of 105 [ZIKV] copies/mL.
[0020] Figure 5 is a graph of the D spectrophotometric absorbance signals obtained in a study using the ELISA direct assay for the best three clamp peptides (C3, C6 and C7) and two
mono-arm peptides (P2 and X1 ) binding the ZIKV target protein at a concentration of 105 copies/ml (blue) or 106 copies/mL (orange) in the presence of buffer (*), serum (s), or urine (u).
[0021] Figure 6A is a schematic of an exemplary embodiment 101 of a system 100 for designing a clamp peptide. Figure 6B is a schematic of base routines that may be stored on non-volatile memory of the system 100 of Figure 6A. Figure 6C is a schematic of exemplary sub-routines for the base routines of Figure 6B.
DETAILED DESCRIPTION
[0022] The present disclosure provides a method implemented by a processor in a computer for designing a clamp peptide. Clamp peptides, as further described herein, represent a new class of antibody mimetics. In exemplary aspects, the clamp peptide comprises the structure A1-B-A2 wherein
is a first peptide arm that binds to a first binding site of a target protein, A2 is a second peptide arm that binds to a second binding site of the target protein, and B is a bridge peptide which links
to A2. In exemplary aspects, each of A^ and A2 is a peptide of about 5 to about 7 amino acids, e.g., about 5 amino acids, about 6 amino acids, or about 7 amino acids. In exemplary aspects, the bridge peptide (B) is about 5 to about 10 amino acids in length (e.g., about 5 amino acids, about 6 amino acids, about 7 amino acids, about 8 amino acids, about 9 amino acids, or about 10 amino acids in length. In various aspects, the amino acids of the clamp peptide, or the first peptide arm, the second peptide arm, or the bridge peptide thereof, are naturally-occurring or coded or non-naturally occurring or non-coded. Non-naturally occurring amino acids refer to amino acids that do not naturally occur in vivo but which, nevertheless, can be incorporated into the peptide structures described herein. “Non-coded” as used herein refer to an amino acid that is not an L-isomer of any of the following 20 amino acids: Ala, Cys, Asp, Glu, Phe, Gly, His, lie, Lys, Leu, Met, Asn, Pro, Gin, Arg, Ser, Thr, Val, Trp, Tyr. Naturally-occurring or coded amino acids include the L-isomers of the amino acids: Ala, Cys, Asp, Glu, Phe, Gly, His, lie, Lys, Leu, Met, Asn, Pro, Gin, Arg, Ser, Thr, Val, Trp, Tyr. Exemplary non-naturally occurring or non-coded amino acids include but are not limited to, any D or L isomers of the non-naturally occurring or non-coded amino acids selected from the group consisting of: beta-alanine, N-alpha-methyl-alanine (Me-Ala), aminobutyric acid (Abu), gamma- aminobutyric acid, aminohexanoic acid (epsilon-Ahx), aminoisobutyric acid (Aib),
aminomethylpyrrole carboxylic acid, aminopiperidinecarboxylic acid, aminoserine (Ams), aminotetrahydropyran-4-carboxylic acid, arginine N-methoxy-N-methyl amide, beta-aspartic acid (beta-Asp), azetidine carboxylic acid, 3-(2-benzothiazolyl)alanine, alpha-ferf-butylglycine, 2- amino-5-ureido-n-valeric acid (citrulline, Cit), beta-Cyclohexylalanine (Cha), acetamidomethyl-
cysteine, diaminobutanoic acid (Dab), diaminopropionic acid (Dpr), dihydroxyphenylalanine (DOPA), dimethylthiazolidine (DMTA), gamma-Glutamic acid (gamma-Glu), homoserine (Hse), hydroxyproline (Hyp), isoleucine N-methoxy-N-methyl amide, methyl-isoleucine (Melle), isonipecotic acid (Isn), methyl-leucine (MeLeu), methyl-lysine, dimethyl-lysine, trimethyl-lysine, methanoproline, methionine-sulfoxide (Met(O)), methionine-sulfone (Met(02)), norleucine (Nle), methyl-norleucine (Me-Nle), norvaline (Nva), ornithine (Orn), para-aminobenzoic acid (PABA), penicillamine (Pen), methylphenylalanine (MePhe), 4-Chlorophenylalanine (Phe(4-CI)), 4- fluorophenylalanine (Phe(4-F)), 4-nitrophenylalanine (Phe(4-N0 )), 4-cyanophenylalanine ((Phe(4-CN)), phenylglycine (Phg), piperidinylalanine, piperidinylglycine, 3,4-dehydroproline, pyrrolidinylalanine, sarcosine (Sar), selenocysteine (Sec), O-Benzyl-phosphoserine, 4-amino-3- hydroxy-6-methylheptanoic acid (Sta), 4-amino-5-cyclohexyl-3-hydroxypentanoic acid (ACHPA), 4-amino-3-hydroxy-5-phenylpentanoic acid (AHPPA), 1 ,2,3,4, -tetrahydro-isoquinoline-3- carboxylic acid (Tic), tetrahydropyranglycine, thienylalanine (Thi) , O-benzyl-phosphotyrosine, O-Phosphotyrosine, methoxytyrosine, ethoxytyrosine, O-(bis-dimethylamino-phosphono)- tyrosine, tyrosine sulfate tetrabutylamine, methyl-valine (MeVal), and alkylated 3- mercaptopropionic acid. In some embodiments, the clamp peptide, or the first peptide arm, the second peptide arm, or the bridge peptide thereof, described herein are glycosylated, amidated, carboxylated, phosphorylated, esterified, N-acylated, cyclized via, e.g., a disulfide bridge, or converted into a salt (e.g., an acid addition salt, a basic addition salt), and/or optionally dimerized, multimerized, or polymerized, or conjugated.
[0023] In exemplary embodiments, the method implemented by a processor in a computer for designing a clamp peptide comprises the steps of: (a) determining a binding score of each tetrapeptide of a first peptide library for the first binding site of the target protein, wherein the first peptide library comprises a set of tetrapeptides having a unique combination of four amino acids of the twenty naturally-occurring amino acids, optionally, wherein the first peptide library comprises the full combinatorial set of 160,000 (160K) tetrapeptides having a unique
combination of four amino acids of the twenty naturally-occurring amino acids, (b) identifying the sequences of the tetrapeptides having a binding score which meets a first threshold, (c) determining a binding score of each pentapeptide of a second peptide library for the first binding site of the target protein, wherein the second peptide library comprises a set of pentapeptides having a unique combination of five amino acids comprising the amino acids of the sequence of a tetrapeptide identified in step (b) and one of the twenty naturally-occurring amino acids added as the first amino acid or the fifth amino acid of the pentapeptide or between two amino acids of
the sequence of the tetrapeptide, and (d) identifying the sequences of the pentapeptides having a binding score which meets a second threshold.
[0024] In exemplary embodiments, the method further comprises the steps of: (e) determining a binding score of each hexapeptide of a third peptide library for the first binding site of the target protein, wherein the third peptide library comprises a set of hexapeptides having a unique combination of six amino acids comprising the amino acids of the sequence of a pentapeptide identified in step (d) and one of the twenty naturally-occurring amino acids added as the first amino acid or the sixth amino acid of the hexapeptide or between two amino acids of the sequence of the pentapeptide and (f) identifying the sequences of the hexapeptides having a binding score which meets a third threshold. In exemplary embodiments, the method further comprises the steps of: repeating at least steps (a) to (d) wherein, for each determining step, a binding score for the second binding site of the target protein is determined for each tetrapeptide of the first peptide library and for each pentapeptide of the second peptide library. Optionally, wherein, when the method further comprises determining a binding score of each hexapeptide of a third peptide library for the first binding site of the target protein, the method further comprises repeating these steps, wherein for each determining step, a binding score for the second binding site of the target protein is determined for each hexapeptide of the third peptide library.
[0025] In exemplary embodiments, the method further comprises (I) determining a binding score of each heptapeptide of a fourth peptide library for the first binding site of the target protein, wherein the fourth peptide library comprises a set of heptapeptides having a unique combination of seven amino acids comprising the amino acids of the sequence of a hexapeptide identified in step (f) and one of the twenty naturally-occurring amino acids added as the first amino acid or the seventh amino acid of the hexapeptide or between two amino acids of the sequence of the hexapeptide and (II) identifying the sequences of the heptapeptides having a binding score which meets a fourth threshold. In exemplary embodiments, the method further comprises the steps of: repeating at least steps of the method, wherein, for each determining step, a binding score for the second binding site of the target protein is determined for each tetrapeptide of the first peptide library and for each pentapeptide of the second peptide library. Optionally, wherein, when the method further comprises determining a binding score of each hexapeptide of a third peptide library and a binding score of each heptapeptide for the first binding site of the target protein, the method further comprises repeating these steps, wherein for each determining step, a binding score for the second binding site of the target protein is
determined for each hexapeptide of the third peptide library and for each heptapeptide of the fourth peptide library.
[0026] In exemplary embodiments, the method does not further comprise determining a binding score of each heptapeptide of a fourth peptide library. In exemplary instances, the third peptide library is the last peptide library for which binding scores are determined.
[0027] The method in some aspects comprises generating the first peptide library.
Optionally, the first peptide library comprises 160,000 unique tetrapeptides, optionally, the full combinatorial set of 160,000 tetrapeptides having a unique combination of four amino acids of the twenty naturally-occurring amino acids.
[0028] The method in some aspects comprises generating the second peptide library. In various instances, the second peptide library comprises pentapeptides having a sequence based on the sequences of the tetrapeptides (post-application of the first threshold) and another amino acid. In various aspects, the second peptide library comprises a set of pentapeptides having a unique combination of five amino acids comprising the amino acids of the sequence of a tetrapeptide identified in step (b) and one of the twenty naturally-occurring amino acids added as the first amino acid or the fifth amino acid of the pentapeptide or between two amino acids of the sequence of the tetrapeptide.
[0029] The method in some aspects comprises generating the third peptide library. In various instances, the third peptide library comprises hexapeptides having a sequence based on the sequences of the pentapeptides (post-application of the second threshold) and another amino acid. In various aspects, the third peptide library comprises a set of hexapeptides having a unique combination of six amino acids comprising the amino acids of the sequence of a pentapeptide identified in step (d) and one of the twenty naturally-occurring amino acids added as the first amino acid or the sixth amino acid of the hexapeptide or between two amino acids of the sequence of the pentapeptide.
[0030] In various instances, the first threshold is a binding score within the top 5% of binding scores (e.g., top binding scores mean the strongest binding between peptide and active site of target protein). In various aspects, approximately 8000 unique sequences of tetrapeptides are identified upon application of the first threshold. In certain aspects, the first threshold is a binding score within the top 5% of binding scores and a binding score outside the top 5% of binding scores for a different target protein. In some instances, about 1000 unique sequences of tetrapeptides are upon application of the first threshold. In exemplary instances, the second
threshold is the same as the first threshold. In alternative instances, the first threshold is different from the second threshold. In exemplary instances, the third threshold is the same as the first threshold or the second threshold. In alternative instances, the third threshold is different from the second threshold and/or first threshold. In various aspects, the third threshold is a binding score within the top 5% of binding scores. In certain instances, the third threshold further comprises a binding score outside the top 5% of binding scores for a different target protein.
[0031] The binding scores are determined using a molecular docking program, optionally, a molecular docking program that is based on multi-conformer rigid body docking, which evaluates several conformers per peptide. The binding scores in some aspects are calculated using a docking scoring function. In some aspects, one or more of LUDI, Chemscore, chemgauss4, DOCK, FlexX, Gold, Pmf, Score, Fresno, AutoDock Vina, Dynadock, LigScore, Rosetta FlexPepDock are used for determining and/or calculating the binding score. Such docking scoring functions are known in the art. See, e.g., Logean et al., Bioorganic and Medicinal Chem Lett 1 1 (5): 675-679 (2001 ), Trott et al., J Computational Chem
https://doi.org/10.1002/jcc.21334; Bohm et al., J Computer-Aided Molecular Design 13(1 ): 51 -56 (1999); Raveh et al., PLoS ONE 6(4): e18934. https://doi.Org/10.1371/journal.pone.0018934; Krammer et al., J Molecular Graphics and Modelling 23(5): 395-407 (2005). In various instances, the binding scores for more than 350,000 peptides are determined for each of the first binding site and the second binding site.
[0032] The method in certain instances further comprises determining the length of B of the clamp peptide, optionally, by measuring the distance between a peptide bound to the first binding site of the target protein and a peptide bound to the second binding site of the target protein. The bridge peptide of the clamp peptide in various aspects is designed based on the length determined.
[0033] The present disclosure also provides a computer-readable storage medium having stored thereon machine-readable instructions executable by a processor, comprising instructions for carrying out the steps of any one of the presently disclosed methods for designing a clamp peptide.
[0034] Further provided is a system comprising machine readable instructions that, when executed by the processor, cause the processor to carry out the steps of any one of the presently disclosed methods for designing a clamp peptide.
[0035] Fig. 6A illustrates an exemplary embodiment 101 of a system 100 for designing a clamp peptide. Generally, the system 100 may include one or more client devices 102, a network 104, and/or a network-accessible database 108. Each client device 102 may be communicatively coupled to the network 104 by one or more wired or wireless network connections 1 12, which may be, for example, a connection complying with a standard such as one of the IEEE 802.1 1 standards (“Wi-Fi”), the Ethernet standard, or any other appropriate network connection. Similarly, the database 108 may be communicatively coupled to the network 104 via one or more connections 1 14. (Of course, the database could alternatively be internal to one or more of the client devices 102.) The database 108 may store the sequences of each peptide of a peptide library, optionally, the sequences of each tetrapeptide of the first peptide library, the sequences of each pentapeptide of the second peptide library, the sequences of each hexapeptide of the third peptide library. The database 108 additionally or alternatively may store the binding score of each peptide for each peptide library, and/or the identity of the peptide(s) which meet the threshold, and/or the thresholds themselves.
[0036] As will be understood, the network 104 may be a local area network (LAN) or a wide- area network (WAN). That is, network 104 may include only local (e.g., intra-organization) connections or, alternatively, the network 104 may include connections extending beyond the organization and onto one or more public networks (e.g., the Internet). In some embodiments, for example, the client device 102 and the database 108 may be within the network operated by a single company (Company A). In other embodiments, for example, the client device(s) 102 may be on a network operated by Company A, while the database 108 may be on a network operated by a second company (Company B), and the networks of Company A and Company B may be coupled by a third network such as, for example, the Internet.
[0037] Referring still to Fig. 6A, the client device 102 includes a processor 128 (CPU), a RAM 130, and a non-volatile memory 132. The non-volatile memory 132 may be any appropriate memory device including, by way of example and not limitation, a magnetic disk (e.g., a hard disk drive), a solid state drive (e.g., a flash memory), etc. Additionally, it will be understood that, at least with regard to Fig. 6A, the database 108 need not be separate from the client device 102. Instead, in some embodiments, the database 108 is part of the non-volatile memory 132 and the data 122, 124, 126 may be stored as data within the memory 132. The database may comprise, for instance, the sequences of each tetrapeptide of the first peptide library. The database may further comprise claim data 122 which comprises, for example, the binding scores for each of the tetrapeptides of the first peptide library. Likewise, the database may
comprise the sequences of each pentapeptide of the second peptide library and the binding scores for each of the pentapeptides of the second library. Also, the database in some aspects comprises the sequences of each hexapeptide of the third peptide library and the binding scores for each of the hexapeptides of the third library. The benchmark data 124 may comprise binding scores for commercial antibodies to the target protein, which may serve as a benchmark against which the binding scores of the peptides of the peptide libraries may be compared.
[0038] For example, the data 122 may be included as data in a spreadsheet file stored in the memory 132, instead of as data in the database 108. In addition to storing the records of the database 108 (in some embodiments), the memory 132 stores program data and other data necessary to analyze data (e.g., binding scores) of one or more peptide libraries, determine binding scores, identify the sequences of the peptides which meet a threshold, the thresholds themselves. For example, in an embodiment, the memory 132 stores a first routine 134, a second routine 136, and a third routine 138. The first routine 134 may determine binding scores of each tetrapeptide of a first peptide library for a binding site of the target protein. The second routine 136 may compute one or more statistical parameters of the binding scores collected by the first routine 134, and/or apply a threshold to the binding scores to identify the tetrapeptides (e.g., identify the sequences of the tetrapeptides) meeting that threshold. The third routine 138 may, for example, generate the sequences of pentapeptides of the second peptide library based on the sequences of the tetrapeptides that met the threshold, as identified by the second routine. In exemplary embodiments, the memory 132 stores a first routine 134, a second routine 136, and a third routine 138, as outlined above, and additionally stores a fourth routine, a fifth routine, and a sixth routine. The fourth routine may determine binding scores of each pentapeptide of the second peptide library generated by the third routine for a binding site of the target protein. The fifth routine may compute one or more statistical parameters of the binding scores collected by the fourth routine 134, and/or apply a threshold to the binding scores to identify the pentapeptides (e.g., identify the sequences of the pentapeptides) meeting that threshold. The sixth routine may generate the sequences of hexapeptides of the third peptide library based on the sequences of the pentapeptides that met the threshold, as identified by the fifth routine. The memory 132 may store additional routines for subsequent cycles of determining binding scores of peptides of a peptide library, applying a threshold to identify peptides that meet the threshold, and generating the sequences of the peptides of the next peptide library using the sequences of the peptides meeting the threshold (as identified by the immediately prior routine). For instance, the memory 132 may store a seventh routine, eighth routine, and ninth routine, wherein the seventh routine may determine binding scores of each
hexapeptide of a third peptide library for a binding site of the target protein, the eighth routine may compute one or more statistical parameters of the binding scores collected by the seventh routine, and/or apply a threshold to the binding scores to identify the hexapeptides (e.g., identify the sequences of the hexapeptides) meeting that threshold. In exemplary aspects, the memory 132 may store the routines described in Figures 6B and 6C.
[0039] Regardless, each of the routines is executable by the processor 128 and comprises a series of compiled or compilable machine-readable instructions stored in the memory 132. Additionally, the memory 132 may store generated reports or records of data output by one of the routines 134 or 136. Alternatively, the reports or records may be output to the database 108. One or more display/output devices 140 (e.g., printer, display, etc.) and one or more input devices 142 (e.g., mouse, keyboard, tablet, touch-sensitive interface, etc.) may also be coupled to the client device 102, as is generally known.
[0040] As will be understood, although individual operations of one or more methods are illustrated and described as separate operations, one or more of the individual operations may be performed concurrently, and nothing requires that the operations be performed in the order illustrated. Structures and functionality presented as separate components in example configurations may be implemented as a combined structure or component. Similarly, structures and functionality presented as a single component may be implemented as separate components. These and other variations, modifications, additions, and improvements fall within the scope of the subject matter herein.
[0041] For example, the network 104 may include but is not limited to any combination of a LAN, a MAN, a WAN, a mobile, a wired or wireless network, a private network, or a virtual private network. Moreover, while only two clients 102 are illustrated in Fig. 6A to simplify and clarify the description, it is understood that any number of client computers are supported and can be in communication with one or more servers (not shown).
[0042] Additionally, certain embodiments are described herein as including logic or a number of routines. Routines may constitute either software routines (e.g., code embodied on a machine-readable medium or in a transmission signal) or hardware routines. A hardware routine is tangible unit capable of performing certain operations and may be configured or arranged in a certain manner. In example embodiments, one or more computer systems (e.g., a standalone, client or server computer system) or one or more hardware routines of a computer system (e.g., a processor or a group of processors) may be configured by software
(e.g., an application or application portion) as a hardware routine that operates to perform certain operations as described herein.
[0043] Similarly, the methods or routines described herein may be at least partially processor-implemented. For example, at least some of the operations of a method may be performed by one or processors or processor-implemented hardware modules. The
performance of certain of the operations may be distributed among the one or more processors, not only residing within a single machine, but deployed across a number of machines. In some example embodiments, the processor or processors may be located in a single location (e.g., within a home environment, an office environment or as a server farm), while in other embodiments the processors may be distributed across a number of locations.
[0044] The performance of certain of the operations may be distributed among the one or more processors, not only residing within a single machine, but deployed across a number of machines. In some example embodiments, the one or more processors or processor- implemented modules may be located in a single geographic location (e.g., within a home environment, an office environment, or a server farm). In other example embodiments, the one or more processors or processor-implemented modules may be distributed across a number of geographic locations.
[0045] Some embodiments may be described using the expression“coupled” and
“connected” along with their derivatives. For example, some embodiments may be described using the term“coupled” to indicate that two or more elements are in direct physical or electrical contact. The term“coupled,” however, may also mean that two or more elements are not in direct contact with each other, but yet still co-operate or interact with each other. The embodiments are not limited in this context.
[0046] As used herein, the terms“comprises,”“comprising,”“includes,”“including,”“has,” “having” or any other variation thereof, are intended to cover a non-exclusive inclusion. For example, a process, method, article, or apparatus that comprises a list of elements is not necessarily limited to only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Further, unless expressly stated to the contrary,“or” refers to an inclusive or and not to an exclusive or. For example, a condition A or B is satisfied by any one of the following: A is true (or present) and B is false (or not present), A is false (or not present) and B is true (or present), and both A and B are true (or present).
[0047] In addition, use of the“a” or“an” are employed to describe elements and components of the embodiments herein. This is done merely for convenience and to give a general sense of the description. This description should be read to include one or at least one and the singular also includes the plural unless it is obvious that it is meant otherwise.
[0048] Still further, the figures depict preferred embodiments of a map editor system for purposes of illustration only. One skilled in the art will readily recognize from the following discussion that alternative embodiments of the structures and methods illustrated herein may be employed without departing from the principles described herein
[0049] Upon reading this disclosure, those of skill in the art will appreciate still additional alternative structural and functional designs for a system and a process for identifying terminal road segments through the disclosed principles herein. Thus, while particular embodiments and applications have been illustrated and described, it is to be understood that the disclosed embodiments are not limited to the precise construction and components disclosed herein. Various modifications, changes and variations, which will be apparent to those skilled in the art, may be made in the arrangement, operation and details of the method and apparatus disclosed herein without departing from the spirit and scope defined in the appended claims.
[0050] Methods of manufacturing a clamp peptide are further provided herein. In various aspects, the clamp peptide comprises the structure A B-A2 wherein
is a first peptide arm that binds to a first binding site of a target protein, A2 is a second peptide arm that binds to a second binding site of the target protein, and B is a bridge peptide which links
to A2. In exemplary embodiments, the method comprises (I) designing the first peptide arm and the second peptide arm of the clamp peptide according to any one of the presently disclosed methods implemented by a processor in a computer for designing a clamp peptide and (II) joining the first peptide arm to the second peptide arm with a bridge peptide, B. In exemplary aspects, the method further comprises assaying the binding of the clamp peptide to the target protein. Suitable methods of assaying peptide-protein binding are known in the art and include for instance assays based on Western blotting, an enzyme-linked immunosorbent assay (ELISA), a radioimmunoassay (RIA), and immunohistochemical assay, and ligand binding assays (LBAs) which are described in O’Hara et al., AAPS J 14(2): 316-328 (2012).
[0051] The clamp peptides manufactured by any one of the presently disclosed methods of manufacturing a clamp peptide are further provided herein.
[0052] The following examples are given merely to illustrate the present invention and not in any way to limit its scope.
EXAMPLES EXAMPLE 1
[0053] This example describes an exemplary method of designing clamp peptides, a new generation of antibody mimetics.
[0054] Abstract
[0055] A new class of antibody mimetics is proposed. This is a generic universal approach that can be used against any type of antigen. The antibody mimetics were designed connecting three short peptides mimicking a clamp with two arms and one bridge.
[0056] The peptide arms were computationally designed using a virtual approach based on generating different docking cycles of tetra, penta, hexapeptide libraries by maximizing the recognition properties of amino acid motifs between the ZIKV envelope protein active sites and other flaviviruses binding sites (DENV, Chikungunya and West Nile).
[0057] The peptide bridge, connecting the two arms, was made with glycine for spacing, proline for bending and two cysteines for biotin functionalization.
[0058] Eight clamp peptides and four mono arm peptides were then synthesized and tested vs intact ZIKV particles by using a direct enzyme linked immunosorbent assay (ELISA). As a reference, we employed a well-established anti- ZIKV virus antibody, the antibody 4G2.
[0059] Three clamp peptides assay showed a detection limit one or two order of magnitude lower (around 3.16 * 103 [ZIKV] copies/mL) then the antibody or mono-arm peptides with a dynamic range from 104 to 107 copies/mL of intact ZIKV particles. Synthetic clamp peptides showed low coefficient of variation (<5%) and a good inter-day and batch to batch
reproducibility (<15%). These three clamp peptides showed slight cross-reactivity against three serotypes of DENV (DENV-1 , -2 and -3) at the concentration of 10 5 copies/mL of intact virus particles, but the discrimination between the DENV and ZIKV was lost increasing the coating concentration to 10 ® copies/mL of the viruses.
[0060] The sensitivity of the clamp peptides was tested in the presence of two biological matrices, urine and serum diluted 1 :1 and 1 :10, respectively. The detection limits of clamp
peptides decreased about one order of magnitude for ZIKV detection in urine or serum, with a distinct analytical signal starting from 10 5 copies/mL of ZIKV.
[0061] Introduction
[0062] The threat of ZIKV infection has emerged as a global public health problem because of its ability to cause severe congenital disease and affect a large population (loos et al., 2014; Weaver et al., 2016). ZIKV infection is known to cause neurological problems to pregnant women and potentially cause microcephaly and other congenital malformations and diseases to the unborn child. ZIKV affects, both male and females and it has been reported that the virus can be transmitted sexually through semen and vaginal fluids. The ZIKV virus is a mosquito- borne flavivirus, and due to the lack of specific antibodies/binders that can be used for diagnosis of the disease, the current bioassays present cross-reactivity with other flaviviruses and arboviruses. It is well established that ZIKV has many common genetic sequences and protein structures with other flaviviruses, like DENV, West Nile virus or Chikungunya (Barba-Spaeth et al., 2016; Heffron et al., 2018). This limits the use of immunoassays for the detection of human pathogens within the flavivirus genus (Priyamvada et al., 2016; Stettler et al., 2016). Thus, there is a need for highly selective binders for ZIKV that can be employed in diagnostics and health status assessment of patients suffering from ZIKV.
[0063] The flavivirus envelope protein is responsible for virus entry and represents a major target for neutralizing antibodies. The ZIKV virus structure is similar to other known flaviviruses structures except for the ~10 amino acids that surround the Asn-154 glycosylation site found in each of the 180 envelope glycoproteins that make up the icosahedral shell (Zhao et al., 2016).
[0064] In this work, the clamp peptide arms were designed using as guide the
crystallographic coordinates of the ZIKV envelope protein glycosylation binding site. The entire molecular surface of the glycosylation envelope protein biding site was defined by two cubic boxes where arm peptides were expected to bind.
[0065] The in-silico screening technique was based on a semi-combinatorial approach by designing peptides that could wrap around the glycosylation site such as to clamp the peptide in place. Different docking cycles of peptide libraries were generated by maximizing the recognition properties of the amino acid motif between the ZKV glycosylation site and the other flaviviruses glycosylation binding sites (DENV, Chikungunya and West Nile). A total library of around three million peptides was tested in-silico.
[0066] Peptides as antibody mimetic elements in diagnostic methods were recently reviewed highlighting the features desired to outperform antibodies with regard to binding affinities, cellular and tumor penetration, large-scale production, temperature, and pH stability (Yu et al.,
2017). It is well documented that a hard chemical environment can affect antibodies binding properties and DNA and peptides aptamers are the most promising candidates to replace them in bioanalysis as reported by recent reviews (Gong et al., 2010; Pichon et al., 2015; Li et al.,
2018). Aptamers have become increasingly important molecular tools for diagnostics and as therapeutic agents, and are used in many analytical applications, such as chromatography, electrophoresis, mass spectrometry, molecular beacons, gas sensors, and biosensors
(Stobiecka and Chatupa, 2015; Lin et al., 2016; Tang et al., 2016; Mascini et al., 2017; Mascini et al., 2019).
[0067] In recent works, short peptides were used as molecular binders for virus detection. Linear peptides were selected by phage display to detect norovirus using an ELISA protocol or by means of an impedance biosensor (Hwang et al., 2017; Palzkill et al., 2018).
[0068] Computationally designed peptides were used to possibly detect flavivirus. Binding affinity and stability of disulfide cyclic peptide ligands with target DENV E glycoprotein were calculated by molecular docking and molecular dynamics simulation, but no experimental evidence was provided (Tambunan et al., 2016).
[0069] A recent report predicted by molecular docking the structure of short peptides targeting the ZIKV envelope protein and the interactions between the selected peptides and virus were assessed via a fluorescence-linked sandwich immunosorbent assay (FLISA), and the performance of the peptide-linked sandwich FLISA was evaluated in virus-spiked human serum and urine (Do Thi Hoang Kim et al., 2018).
[0070] Molecular modelling is more and more used to overcome the trial and error approach and to minimize experimental problems by providing an understanding of atomic interactions and facilitating the rational design of experimental protocols (Acebes et al., 2016; Bunker et al., 2016; Singh et al., 2016; Michaeli et al., 2018; Xu et al., 2018). Virtual docking is currently an important tool in drug discovery, and a subject of important developments over the last decade (Macalino et al., 2015; Yuan et al., 2017).
[0071] However, a number of obstacles still limits the widespread use of molecular modelling for biotechnological applications. One of the most important drawbacks for mainstream use of
molecular modeling is the challenge to simulate a huge number of candidates to be designed or/and docked using a full combinatorial approach.
[0072] To address this issue, herein we present a new methodology, based on an
incremental construction approach to choose short peptides as binding agents for the selective detection of the intact ZIKV particles. Synthetic peptides are more resistant to physicochemical stress, more reproducible and less expensive when compared with antibodies so even if they show less specificity can be used as an array giving synergetic contribution to the detection.
[0073] Direct ELISA was chosen as the experimental protocol to check the performances of the clamp peptides. ELISA was preferred to other analytical techniques because it provides automated steps to speed-up the screening of a large number of experimental trials.
[0074] Materials and Methods
[0075] All calculations of molecular docking were done using a desktop computer with 19 processors Intel Xeon X5690 at 3.47 GHz each, with 94.5 GiB RAM, running Kernel Linux 2.6.32-642.1.1 el6.x86_64, GNOME 2.28.2.
[0076] The three peptide libraries were designed and cleaned up with Hyperchem 8.0.5.
[0077] Peptides were designed in zwitterionic mode, using only the 20 natural amino acids, adding hydrogens, using molecular mechanics method amber, with the algorithm“Steepest Descents” converging at 0.08 Kj mol-1 in 32767 as maximum of cycles. Minimization, conformers generation and docking were carried out using Open Eye Scientific Software package under academic license. Each peptide library was compacted in a single file and fast minimized in gas phase to reduce computing time. In this context, solvent condition did not change significantly the results. The energy minimization process was carried out using SZYBKI 1 .5.7 in its default parameterization (SZYBKI, version 1 .5.7). To take into account the flexibility of the peptides, ten conformers were generated for each peptide by means of the OMEGA 2.4.6 used with MMFF as the force field (Hawkins et al., 2010; Hawkins and Nicholls, 2012; OMEGA, version 2.4.6). Therefore, the ligands were represented by the peptide conformers, around 5 millions units.
[0078] Then the envelope proteins, taken as the receptors, were downloaded from the protein data bank web site. The envelope proteins were from the flavivirus species ZIKV DENV, Chikungunya and West Nile having respectively the following codes in the Protein Data Bank web site: 5IRE, 4UTC, 3N40 and 3I50. All residues and water molecules were removed from the
envelope proteins pdb files. For each envelope protein two dedicated boxes were generated, one enfolding the glycosylation site the other in the closet cavity from the glycosylation site.
[0079] In order to reduce the calculation time, tetra and pentapeptide libraries were docked using active site boxes with a volume of around 13 nm3 and hexapeptide libraries using boxes having a volume of around 18 nm3.
[0080] Using these sizes, the entire molecular surface of all peptide conformers was inside the active site box.
[0081] The active site box along with the Multi-conformer rigid body docking were carried out using OEDocking 3.0.0 (Kelley et al., 2015; OEDocking, version 3.0.0). Multi-conformer rigid body docking was run using Chemgauss4 as scoring function. The Chemgauss4, a modification of the Chemgauss3, was the latest scoring function from OpenEye software with improved hydrogen bonding and metal chelator functions. The total score obtained was the sum of steric, acceptor/metal, donor and aromatic contributions. The time required for docking a peptide library was about 24 hours.
[0082] Structures visualization and generation of molecular surfaces were performed using VIDA 4.1.1 (VIDA, version 4.1.1 ).
[0083] The entire process was automated using a bash script and using a freeware BASIC- like scripting language (AutolT V3) for post processing data analysis.
[0084] Experimental Setup
[0085] All chemicals used for buffers were of analytical grade and purchased from Sigma- Aldrich (http://www.sigmaaldrich.com).
[0086] The eight clamp peptides and the four mono-arm peptides were purchased from Biomatik (http://www.biomatik.com). Cysteines within the peptide structures were used to bind maleimide-PEG2-biotin. All peptides were provided with a purity >85%.
[0087] Lyophilized peptides were diluted at 1 mM concentration in 10mM phosphate buffered saline (PBS) pH 7.4, divided into 100 mI_ aliquots and stored at -30° C for further use.
[0088] Before biotin functionalization, peptides stock solution was reduced using
trialkylphosphine (TCEP) from ThermoFisher Scientific (www.thermofisher.com) and after 1 h the gel was removed using TCEP gel spin separation columns (ThermoFisher Scientific). Then, 2-fold molar excess of EZ-Link™ Maleimide-PEG2-Biotin (ThermoFisher Scientific) was added to purified peptide solution and incubated for 1 h. At this concentration EZ-Link™ Maleimide-
PEG2-Biotin did not contribute to background signal as shown by a pilot test using only EZ- Link™ Maleimide-PEG2-Biotin without peptide. Therefore, no further separation was carried out.
[0089] To optimize all parameters of the direct ELISA protocol, Pierce 96-Well Polystyrene Plates, (ThermoFisher Scientific) were coated overnight at 4° C with different concentrations of intact virus particles (ZIKV or DENV). The intact virus particles were diluted using 100 mM NaHC03, pH 9.6, and aliquots of 100 pL were dispensed into each well of the plate using a multichannel pipette. This buffer pH assured a strong hydrophobic binding interaction between polystyrene and virus particles.
[0090] Intact particles both ZIKV and DENV were provided by Dr. Watkins group (University of Miami, Dep. of Pathology). The samples were controlled and counted by focus forming assay and RT-PCR, the details were reported in a previous work (Magnani et al., 2017). The Intact particles of ZIKV were inactivated using gamma irradiation. Assay biohazardous steps were carried out according to standard safety procedures.
[0091] After coating the plates overnight, the intact virus particles were removed by washing five times with the washing buffer (PBST) 10 mM PBS pH 7.4, 0.1% Tween-20, using an automated plate washer (MultiWash+, Molecular Devices, Sunnyvale, CA). Then, the plates were blocked with 200 pL of blocking buffers while shaking at 300 rpm at room temperature.
The blocking buffers used were: Pierce™ Protein-Free (PBS) Blocking Buffer (PF), Blocker™ BLOTTO in TBS, SuperBlockTM Blocking Buffer, Blocker™ BSA (1X) in PBS. All blocking buffer were from ThermoFisher Scientific.
[0092] After 2h the blocking buffers were removed using the same washing procedure mentioned above. 100 pL-aliquots of several dilutions of peptides biotinylated in 10mM PBS pH 7.4 were placed in each well and incubated for 2 h while shaking at 300 rpm at room
temperature. After the incubation, the unreacted peptides were removed by using the plate washer with the same settings. Then, 100 pL-aliquots of streptavidin-HRP (ThermoFisher Scientific) at a concentration of 20ng/mL were added into each well and incubated for 30 min at room temperature without shaking. After the incubation, excess streptavidin-HRP was removed and the wells were washed with the plate washer five times using the washing buffer. Finally,
100 pL-aliquots of the Ultra TMB-ELISA Substrate Solution (ThermoFisher Scientific) were added and after 10 min the reaction was stopped by adding 100 pL aliquots of the TMB stop solution (SeraCare). The emission (450 nm) was read using a microplate reader (Clariostar Optima; BMG Labtech, Ortenberg, Germany).
[0093] The corresponding blank signals in triplicates were obtained by using all reagents without peptides. The blank signal was then subtracted to the average absorbance values for triplicate wells of each test.
[0094] All Data were processed and fitted using the software XLSTAT Version
2016.02.28451.
[0095] Results and Discussion
[0096] Docking simulation
[0097] The envelope protein, responsible for virus entry, has very similar structure in all flavivirus. ZIKV differs from other known flavivirus by only ~10 amino acids that surround the Asn154 glycosylation site found in each of the 180 envelope glycoproteins (Sirohi et al., 2016).
[0098] As depicted by Figure 1 A the two arms of the clamp peptide were docked in two different envelope protein binding sites. The first active site box, defining the general space of the protein where peptides are expected to bind, was designed around the amino acid Asn 154 in the 5IRE and the amino acids Asn 153, Asn 140 and Asn 134 respectively in the 4UTC, 3N40 and 3I50. The Second active site box was built inside the closest cavity from the first active site, around the amino acid His 323 in the 5IRE and the amino acids Val 354, His 331 and His 285 respectively in the 4UTC, 3N40 and 3I50.
[0099] Figure 1 B showed the schematic representation of the clamp peptide structure made by two peptide arms of five or six amino acids in length linked by a bridge peptide having as sequence GPGCCGPG (SEQ ID NO: 32). The length in angstrom (1.931 nm) between two peptides docked in the two active sites chosen within the envelope protein was also calculated (Figure 1 C) in order to have an idea of the length needed to link the two arm peptides with a bridge peptide. The bridge peptide chosen (GPGCCGPG; SEQ ID NO: 32) had a length after minimization of 1.956 nm (Figure 1 D), approximately the distance between the two arm peptides. Using this bridge peptide size, the two arms of the clamp peptide should have enough flexibility to bind each of the two active site in synergic combination.
[00100] In order to avoid any disulfide bond between the two arms of the clamp peptide, the peptides having sulfur-containing amino acids (cysteine and methionine) were discarded during the semi-combinatorial peptide libraries screening.
[00101] The docking process was run in 3 steps. In each step a peptide library was generated by using an incremental construction approach. In every subsequent iteration, a
focused library of peptides of increasing complexity, was built on previous iteration results. The first peptide library docked was made by the entire 160k possible tetrapeptide combinations of the 20 natural amino acids.
[00102] The docking program used in this work was based on multi-conformer rigid body docking, therefore ten conformers per peptide were generated to ensure a good compromise between calculation time and accuracy of the output data for this type of ligands (Perez et al., 2013).
[00103] From the 5% peptides (8k tetrapeptides) having the best biding score for each of the two ZIKV binding sites, only 1 k tetrapeptides were selected for the next step.
[00104] The criterion of the selection was to choose the peptides inside the top 5% peptides binding the ZIKV active site and concurrently outside the top 5% peptides binding the other flaviviruses binding sites. The meaning of the selection was to maximize the recognition properties of the amino acid motif between the ZIKV binding site and the other flaviviruses binding sites (DENV, West Nile and Chikungunya). The 5 % was selected as cutoff because in all simulations, this value delimited the zone of the curve in which the steeper slope change was observed (Figure 2A).
[00105] This criterion was applied to the other steps to select penta, hexa and heptapeptides. Figure 2A depicted the typical distributions of scores obtained in the simulations. The curves obtained had similar gaussian distributions. Score values comprised within the range from 20 to -10 kcal/mol in all simulations. All docking runs had approximately 5 % of the complexes with higher scores and 5 % with worse scores, both well separated from the rest of the population.
[00106] The score values were calculated using chemgauss4 scoring function and, thus, lower values represented higher protein-peptide affinity.
[00107] The second step was the generation of the pentapeptide library by inserting each of the 20 natural amino acids in every position of the 1 k tetrapeptides selected in the previous step. As reported in Figure 2B, the hexapeptide library was built by using the same semi- combinatorial approach carried out in the first step but selecting the best 1 k pentapeptides. A total of 380k peptides were docked in each active site box.
[00108] Table 1 reports the statistical summary of the binding scores calculated for the three libraries of peptides towards the ZIKV envelope protein. The score values were calculated using chemgauss4 scoring function and, thus, lower values represented higher protein-peptide
affinity. The peptide size played a critical role for the active site 1 interaction, with lower values decreasing drastically from pentapeptides to hexapeptides, but not for active site 2.
TABLE 1 : active site 1 Active site 2
Tetrapeptides
min -7.5 -7.0
max 16.5 6.3
A v 0.1 -2.2
median -0.2 -2.2
Pentapeptides
min -9.0 -7.0
max 9.6 8.1
A v -1 .0 -0.8
median -1 .1 -0.9
Hexapeptides
min -5.8 -7.9
max 21 .8 26.8
A v 1 .9 2.5
median 1 .6 2.2
Statistical parameters of the scores behavior
(Kcal/mol), obtained using the three peptide
libraries docked in the active site 1 (glycosylation
binding site) and the active site 2 of the 5IRE ZIKV
envelope protein.
[00109] These results could be explained considering the steric effects of the peptides within the glycosylation binding pocket. Also, the minimum-maximum dynamic range among the peptide libraries reflected that this behavior becomes relevant for hexapeptides. On the other hand, all peptide libraries showed average and median very close to each other, demonstrating a good symmetry in normal distribution.
[00110] It should be noted that the purpose of this work was to use the virtual screening step to select peptides for the specific detection of ZIKV virus among different flaviviruses.
[00111 ] The docking results were used to select the arms of the clamp peptides for the experimental part. This selection was based on the peptide primary structure structural analysis results along with the position in the top ranked peptides and peptide length.
[00112] The primary structural analysis of the docking results was carried out to study the occurrence of the amino acids in the top 0.1% ranked peptides that maximized the recognition properties between the ZIKV active site and the other flaviviruses binding sites (DENV, West Nile and Chikungunya).
[00113] One peptide with high and one with low occurrence amino acids in primary structure were then chosen within the 0.1 % top ranked peptides binding each of the two envelope protein active sites. The four peptides were selected from both penta and hexapeptide libraries, resulting in a total of 8 peptides, four pentapeptides and four hexapeptides.
[00114] Table 2 reports the results of the amino acid occurrence (%) in the primary structure of the eight peptides selected to build the clamp peptides. The occurrence was calculated counting the recurrence of each amino acid in the relative position (five positions for pentapeptides and six positions for hexapeptides) in the top 0.1% ranked peptides binding the active site 1 and 2 of the ZIKV envelope protein. The best occurrence amino acids were also reported, but peptides having all best occurrence amino acids were not present or were in the bottom of the 0.1% top rank peptides.
TABLE 2
Amino acid occurrence in the top 160 ranked peptides
(%)
1.3 0.6 1 .3 2.5 1 .9 2.5 1 .7%
Active
site 2
75.0 80.0 32.5 27.5 35.6 18.8 44.9%
Analysis of the amino acid occurrence (%) in the primary structure of the eight peptides selected to build the eight clamp peptides. The occurrence was calculated counting the recurrence of each amino acid in the relative position (five positions for pentapeptides and six positions for hexapeptides) in the top 0.1 % ranked peptides binding the active site 1 and 2 of the ZIKV envelope protein. The best occurrence amino acids were also reported (in italic) along with the average (Av) percentage of the occurrence for each peptide. SEQ ID NO: noted in ().
[00115] In the active site 1 of ZIKV envelop protein (glycosylation active site), the
pentapeptide SWPGQ (SEQ ID NO: 24) and hexapeptide KRNATP (SEQ ID NO: 26) had almost all amino acids with high occurrence with some exception, the glutamine in fifth position for SWPGQ (SEQ ID NO: 24) and lysine in second position for KRNATP (SEQ ID NO: 26). The average in percentage of the occurrence (19.3% and 29.7% ) was the highest of the top 0.1% ranked peptides in their respective libraries.
[00116] The other pentapeptide LRGHA (SEQ ID NO: 25) had amino acids with about half percentage of occurrence when compared to the top one in almost all the five positions. On the other hand, the hexapeptide KTDAYS (SEQ ID NO: 27) showed and alternate low and high occurrence percentage.
[00117] The penta and hexapeptides selected from the active site 2 docking, had similar high and low average occurrence amino acids in primary structure. Remarkably, the hexapeptides WPWIGT (SEQ ID NO: 18) and MDSPIK (SEQ ID NO: 22) had respectively the highest and lowest amino acids occurrence (40.8% and 1.7%) of the peptides selected to build the clamp peptides.
[00118] Table 3 shows the relative docking score position in the corresponding libraries of the eight peptides chosen to build the arms of the clamp peptides tested in the experimental evaluation. The ranking scores between the ZIKV sites and the other flaviviruses binding sites were for all peptides different enough to expect a ZIKV selective binding. Nevertheless, only three peptides ranked in the first 10 best peptides in binding ZIKV, highlighting strong similarities between the flaviviruses glycosylation sites. It should be noted that in all peptide libraries the simulated binding energy decreased exponentially in the top 1% best ranked
peptides, in fact, a decrease of at least 20% in the binding score is observed for the 100th peptide position.
TABLE 3
Active site 1
LRGHA (25) 53 74900 69645 71784 SWPGQ (24) 4 55575 27840 471 1 1 KRNATP (26) 16 85123 100716 74427 KTDAYS (27) 125 95558 1 10693 86882
Active site 2
Docking score rank
WPHTQ (16 9 47135 43238 45072
AGRRP (20) 13 29265 51031 18894
WPWIGT (18) 1 21334 9821 1831 1
MDSPIK (22) _ 46 _ 10080 40000 _ 7831
Relative docking score position of the four pentapeptides and four
hexapeptides selected for building the eight clamp peptides. The ranking obtained from the two active sites of the ZIKV envelope protein was
compared to the one obtained using the other three flaviviruses envelope proteins.
[00119] The four pentapeptides and four hexapeptides were then combined to build eight clamp peptides. At this stage a mix between penta and hexapeptides was avoided in order to understand the contribution of the peptide length in the experimental responses.
[00120] Table 4 reports the physicochemical properties of the peptides selected for experimental evaluation. The peptides were the eight clamp peptides built using the combination of the penta or hexapeptides, and the mono-arm peta and hexapeptides binding the ZIKV envelope protein glycosylation binding site. Cysteine was inserted in the bridge of clamp peptides and at the N-terminus of mono-arm peptides to bind the maleimide-PEG2-biotin used to label each of the peptides with the signal amplifier streptavidin-HRP.
TABLE 4
Active Active net
Bridge IsoPoint Water
site site Label charge MW
Peptide* pH Sol
2 1 pH7
WPHTQ GPGCCGPG SWPGQ
C1 6.9 0.0 poor 1852 (16) (32) (24)
WPHTQ GPGCCGPG LRGHA C2 8.1 1 .1 poor 1831
(16) (32) (25)
WPWIGT GPGCCGPG KRNATP
C3 8.9 1.9 poor 2055
(18) (32) (26)
WPWIGT GPGCCGPG KTDAYS
C4 5.8 -0.1 poor 2053
(18) (32) (27)
AGRRP GPGCCGPG SWPGQ
C5 12.1 2.0 good 1739
(20) (32) (24)
AGRRP GPGCCGPG LRGHA
C6 10.6 3.0 good 1718
(20) (32) (25)
MDSPIK GPGCCGPG KRNATP
C7 8.8 1.9 good 1986
(22) (32) (26)
MDSPIK GPGCCGPG KTDAYS
C8 5.9 -0.1 good 1984
(22) (32) (27)
C-SWPGQ
P1 3.0 -0.1 poor 676
(33)
C-LRGHA
P2 9.2 1.0 good 655
(34)
C-KRNATP
X1 10.5 1.9 good 788
(35)
C-KTDAYS
X2 5.9 -0.1 good 786
(36)
Physicochemical properties of the eight peptides selected for experimental part. A cysteine was added to the N terminus of each mono-arm peptide to link the maleimide-PEG2-biotin. * SEQ ID NOs: are noted in ()
[00121] The experimental analysis was performed in PBS at pH 7.4. Therefore, the physicochemical properties were focused on water solubility and net charge at pH 7. The first four clamp peptides and one of the mono-arm pentapeptide had poor water solubility due to the ratio of the hydrophobic amino acids, but when they were used at micromolar concentration were able to be dissolved in PBS. Five clamp peptides and two mono-arm peptides had a significant amount of positively charged amino acids resulting in a positive net charge at pH 7. Due to the presence of the polar amino acids the other peptides had a slightly negative net charge at pH 7.
[00122] Moreover, to highlight the positive or negative charges inside the peptide, the pH of the isoelectric point of each peptide was also reported. Interestingly, the majority of peptides selected had positively charged amino acids improving the possibility to interact with negative charges in the three-dimensional structure of both ZIKV binding sites selected.
[00123] Experimental Results
[00124] The eight clamp peptides and the four mono-arm peptides selected, were tested vs intact ZIKV particles by using a direct ELISA. The peptides were biotinylated by using the maleimide-PEG2-biotin reagent that reacts with the sulfhydryl group of the cysteine efficiently
and specifically by forming a stable thioether bond. The antibody 4G2 hybridoma mouse lgG2a was used as the reference and employed in combination with an anti-mouse IgG conjugated to HRP.
[00125] All analytical parameters involved in the development of ELISA were optimized by using 96-well plates coated with triplicate 10-fold serial dilutions of intact ZIKV particles. The results were reported in Table 5.
TABLE 5
AB
C1 C2 C3 C4 C5 C6 C7 C8 P1 P2 X1 X2
4G2
Blocking BT BT BT BT PF PF PF PF PF PF PF PF PF
Incubation
PBST PBST PBST PBST PBST PBST PBST PBST PBS PBS PBS PBS PBST
buffer
FPLR
(log [ZIKV],
Dynamic 5-7 5-7 4-7 6-7 6-7 4-7 4-7 5-7 7-8 5-7 5-7 6-8 6-8
copies/mL)
Range
(log [ZIKV],
LOD 4.8 4.5 3.5 5.7 5.8 3.5 3.7 4.7 6.8 4.5 4.8 5.8 5.8
copies/mL)
(log [ZIKV],
FPLR C50 6.3 6.2 5.3 6.3 6.4 5.2 5.8 6.2 nd 6.2 6.1 6.1 6.2
copies/mL)
DA /
FPLR slope 33.2 25.4 8.7 32.8 25.6 10.1 5.3 33.9 nd 19.9 20.8 27.7 22.3
(log [ZIKV])
FPLR DA
w 0.43 0.55 0.67 0.40 0.17 0.53 0.95 0.82 nd 0.94 0.98 0.91 0.81
maximum
o
FPLR
DA 0.03 0.05 0.05 0.02 0.01 0.05 0.01 0.03 nd 0.04 0.03 0.01 0.04 minimum
FPLR R"2 0.98 1.00 0.99 1.00 0.97 0.99 0.99 0.99 nd 0.99 1.00 1.00 0.99
Peptide 1
5 20 20 20 20
Concentration (mM) 2 2 0.5 2 5 0.5 5
□g/ml Intra-day
CV(%) <5 <5 <7 <5 <4 <7 <4 <4 <5 <5 <5 <5 <10 reproducibility
Inter-day and
batch-to-batch CV(%) <12 <12 <15 <12 <10 <15 <10 <10 <10 <10 <10 <10 nd reproducibility
Long-term
(Month) >1 >1 >1 >1 >1 >1 >1 >1 >1 >1 >1 >1 nd stability
Assay time
after Plate (h) 5 5 5 5 5 5 5 5 5 5 5 5 8 Coating
Optimized experimental parameters of the direct ELISA assay for the eight clamp peptides, the four mono-arm peptides and a commercial antibody (4G2). BT=BLOTTO blocking buffer; PF=protein free blocking buffer; PBS= 10mM phosphate buffer saline pH 7.4; PBST= 10 mM PBS pH 7.4, 0.1 % Tween-20. FLRP= Four Parameter Logistic Regression
[00126] To minimize nonspecific binding, four blocking agents (PF, BLOTTO, SuperBlock™ Blocking Buffer, Blocker™ BSA) were tested. All blocking agents had very low background signal. For hydrophilic peptides the lowest background signal was achieved using PF that gave the best performances also using the antibody 4G2. For hydrophobic peptides, the blocker BLOTTO showed better performance, except for mono-arm peptide P1 .
[00127] For clamp peptides and antibody, the surfactant agent tween 20 at 0.1% was necessary in the incubation step. No longer than one hour was necessary for peptide
incubation, a longer time increased both the overall signal generated by the binding event and the background signal. Shacking during incubation improved the signal to noise ratio.
[00128] The optimal concentration of peptide was determined by coating clear 96-well plates with a solution of 10 ? copies/mL of intact ZIKV particles. Concentrations of peptide, from 0.1 to 50 mM, diluted in 10 mM PBS pH 7.4 were added to wells of the microplates coated with intact ZIKV particles. For mono-arm peptides, larger concentrations than 20 DM did not increase the assay sensitivity.
[00129] Clamp peptides showed higher sensitivity than mono-arm peptides. Clamp peptides C5, C7 and C8 showed the best performances when used at 5 mM. The clamp C1 , C2 and C4 at concentration of 2 mM did the best signal to noise ratio. Impressively, clamp peptides C3 and C6 had be used at concentration of 0.5 mM to have the best results, highlighting the remarkable high sensitivity of those peptides in this kind of assay.
[00130] Thus, the peptide concentrations reported in Table 5 were used to estimate the dynamic range and the LOD of the assay by using 10-fold serial dilutions of intact ZIKV particles from 10 1 to 10 8 copies/mL.
[00131] The results had a sigmoidal ZIKV particles concentration response and the calibration curves were obtained by plotting the delta absorbance (after blank signal subtraction) against the log of ZIKV particles concentration and fitting the experimental data with a four- parameter logistic function (FPLR).
[00132] The regression parameters of the assay were reported in Table 5 and the sigmoidal trend in Figure 3. The LOD was interpolated from the calibration curves using LOD = SB + 3 c SDB where SB and SDBwere the average and the standard deviation of the blank
measurements, respectively.
[00133] Dose-response curves generated with all peptides and the antibody had at least two- order of magnitude dynamic range except for peptide P1 , which had just a one order of magnitude dynamic range.
[00134] The peptide based assay using C3, C6, and C7 showed three-order of magnitude dynamic range and lower detection limits with dynamic range starting from 10 copies/mL one or two order magnitude lower than the others peptides or antibody based assay. The better performance in binding ZIKV intact particles by those three clamp peptides was also highlighted by the FPLR C50 parameter. The dose-response performance of the assay was reproducible over a month (RSD lower than 15%), demonstrating that the peptides had high stability and reproducibility.
[00135] The cross-reactivity among Flaviviruses is a key parameter to be tested for this assay. Using the same ELISA protocol, the three clamp peptides (C3, C6 and C7) with higher sensitivity versus the intact ZIKV particles where employed to test the ability to discriminate ZIKV from the three serotypes of DENV (DENV-1 , -2 and -3). The cross reactivity performances were compared with that obtained using the mono-arm peptides P2 and X1 , that showed the best analytical parameters within the mono-arm peptides.
[00136] The results shown in Figure 4 were obtained by coating clear 96-well plates with a solution of 10 5 copies/mL of intact virus particles. At this concentration, all three peptides showed slight cross-reactivity against the DENV. Clamp peptides C3 and C7 had the higher DENV/ZIKV signal ratio with around 70% signal decrease for all DENV serotypes. Clamp Peptide C6 showed slight cross reactivity with DENV-2 and DENV-3 with only 45% and 55% of signal decrease respectively. The three clamp peptides clearly discriminated between the two flavivirus species. At this concentration, mono-arm peptide assays had the ZIKV delta absorbance signals statistically comparable to the signals obtained using DENV.
[00137] Nevertheless, it should be highlight that increasing the coating concentration of the virus to 10 ® copies/mL the ZIKV and DENV analytical signals were statistically equivalent, losing, for the clamp peptides, the discrimination between ZIKV and DENV.
[00138] Usually, the presence of ZIKV in affected bodies is detected in biological fluids. Therefore, the analytical sensitivity of the selected peptides was tested in two biological matrices, namely, urine and serum. The matrix effect was investigated to understand how real biological fluids could modify the binding efficiency of the peptides.
[00139] Figure 4 depicts the ELISA data using solutions of peptides with or without the urine and serum obtained coating clear 96-well plates with 10 5 and 10 ® copies/mL of intact ZIKV particles. Urine and serum were 1 :1 and 1 :10 diluted, respectively, with a concentrated peptide PBS solution (10 mM PBS, pH 7.4) to obtain a peptide final concentration of 0.5 mM for C3 and C6, 5 mM for C7 and 20 mM for mono-arm peptides (P2 and X1 ).
[00140] Clamp peptide C3 showed a better performance in urine than serum, particularly for 10 5 copies/mL. Clamp peptide C6 had a strong decrease in the signal generated at both I C or 10 ® copies/mL in both urine and serum. Clamp peptide C7 exhibited the best performance among the three clamp peptides, having higher signals in serum than in urine. All clamp peptides lost at the least one order of magnitude signal detecting ZIKV in urine or serum, except for C7 in serum, having a distinct analytical signal also at 10 5 copies/mL of ZIKV.
[00141] Mono-arm peptides lost completely the signal at 10 5 copies/mL of ZIKV, starting to detect ZIKV particles in both urine or serum, from 10 ® copies/mL using P1 and from 10 ? copies/mL using X1 (data not showed).
[00142] It should be highlighted that when a body is under a ZIKV infection the level of flavivirus concentration in urine can be up to 10 (3 9 copies/mL (Campos et al., 2016; Pawley et al., 2019).
[00143] Conclusions
[00144] The semi combinatorial virtual strategy to design clamp peptides using the two flavivirus active sites as a binding target has shown to have the potential for designing antibody mimetics for the selective detection of ZIKV.
[00145] The ELISA assay platform developed for testing the newly designed clamp peptides offered the possibility to optimize in short time the experimental conditions for evaluation of the eight clamp peptides and four mono-arm peptides chosen from the most promising ones yield by the in-silico studies. The three clamp peptides with better performances to detect ZIKV had shown to have also semi-selective properties when tested against DENV. The matrix-effect was also investigated, by testing the response of the peptides in physiological matrices, i.e., urine and serum. We observed that the matrix affected the assay performance by decreasing the detection limits by one order of magnitude, albeit still having a distinct analytical signal starting from 10 ® copies/mL, the concentration of ZIKV in acute infection.
[00146] This work represents a new methodology for the selection of tailor-made clamp peptides, rationalizing the way to choose receptors with high binding ability among thousands of
potential compounds that can be employed in biotechnology, medical, and a variety of analytical applications. Taking advantage of the fast progress in computing, we envision that it will be possible to simulate in short time the clamp peptides having even more complex shapes with better selectivity and less cross-reactivity.
REFERENCES
Acebes et al., ACS Catalysis 6(3), 1624-1629 (2016).
Barba-Spaeth et al., Nature 536(7614), 48. (2016).
Bunker et al., Biochimica et Biophysica Acta (BBA)-Biomembranes 1858(10), 2334-2352 (2016). Campos et al., Journal of Clinical Virology 77, 69-70 (2016).
Do Thi Hoang Kim et al., Theranostics 8(13), 3629 (2018).
Gong et al.,. Yi Chuan 32(6), 548-554 (2010).
Hawkins et al., Journal of Chemical Information and Modeling 52(1 1 ), 2919-2936. doi:
10.1021 /OΪ300314k.
Hawkins et al., Journal of Chemical Information and Modeling 50(4), 572-584. doi:
10.1021 /ci100031 x. (2010)
Heffron et al., PLoS neglected tropical diseases 12(1 1 ), e0006903 (2018).
Hwang et al., Biosensors and Bioelectronics 87, 164-170 (2017).
loos et al., Medecine et maladies infectieuses 44(7), 302-307 (2014).
Kelley et al., POSIT: flexible shape-guided docking for pose prediction. Journal of chemical information and modeling 55(8), 1771 -1780 (2015).
Li et al., Electrochemical aptamer-based sensors for food and water analysis: A review. Analytica chimica acta. (2018)
Lin et al., Journal of Chromatography A 1446, 34-40 (2016).
Macalino et al., Archives of pharmacal research 38(9), 1686-1701 (2015).
Magnani et al., Science translational medicine 9(410), eaan8184 (2017).
Mascini et al., Biosensors and Bioelectronics 123, 124-130 (2019).
Mascini et al., Talanta 167, 126-133 (2017).
Michaeli et al., The Journal of Immunology 201 (1 1 ), 3383-3391 (2018).
OEDocking. version 3.0.0. OpenEye Scientific Software, Santa Fe, NM.
http://www.evesopen .com .
OMEGA version 2.4.6. OpenEye Scientific Software, Santa Fe, NM. http://www.eyesopen.com· Palzkill et al., US Patent App. 14/777,714), (2018)
Pawley et al., ACS Omega 4(4), 6808-6818 (2019).
Perez et al., Peptides trapping dioxins: a docking-based inverse screening approach. Journal of Chemistry 2013.
Pichon et al., Anal Bioanal Chem 407(3), 681 -698. doi: 10.1007/s00216-014-8129-5 (2015). Priyamvada et al., Proceedings of the National Academy of Sciences 1 13(28), 7852-7857 (2016). Singh et al., Journal of medicinal chemistry 59(8), 3920-3934 (2016).
Sirohi et al., Science 352(6284), 467-470 (2016).
Stettler et al., Science 353(6301 ), 823-826 (2016).
Stobiecka et al., Chemical Papers 69(1 ), 62-76 (2015).
SZYBKI. version 1 .5.7. OpenEye Scientific Software, Santa Fe, NM. http://www.eyesopen .com . Tambunan et al., Jurnal Teknologi 78(4-3) (2016).
Tang et al., Talanta 146, 55-61 (2016).
VIDA version 4.1 .1 . OpenEye Scientific Software, Santa Fe, NM. http:/7www.eyesopen .com . Weaver et al., Antiviral research 130, 69-80 (2016).
Xu et al., Nature medicine 24(6), 857 (2018)
Yu et al., Annual Review of Analytical Chemistry 10, 293-320 (2017).
Yuan et al., Wiley Interdisciplinary Reviews: Computational Molecular Science 7(2), e1298 (2017).
Zhao et al., Structural basis of Zika virus-specific antibody protection. Cell 166(4), 1016-1027 (2016).
[00147] All references, including publications, patent applications, and patents, cited herein are hereby incorporated by reference to the same extent as if each reference were individually and specifically indicated to be incorporated by reference and were set forth in its entirety herein.
[00148] The use of the terms“a” and“an” and“the” and similar referents in the context of describing the disclosure (especially in the context of the following claims) are to be construed to cover both the singular and the plural, unless otherwise indicated herein or clearly
contradicted by context. The terms“comprising,”“having,”“including,” and“containing” are to be construed as open-ended terms (i.e., meaning“including, but not limited to,”) unless otherwise noted.
[00149] Recitation of ranges of values herein are merely intended to serve as a shorthand method of referring individually to each separate value falling within the range and each endpoint, unless otherwise indicated herein, and each separate value and endpoint is incorporated into the specification as if it were individually recited herein.
[00150] All methods described herein can be performed in any suitable order unless otherwise indicated herein or otherwise clearly contradicted by context. The use of any and all examples, or exemplary language (e.g.,“such as”) provided herein, is intended merely to better illuminate the disclosure and does not pose a limitation on the scope of the disclosure unless otherwise claimed. No language in the specification should be construed as indicating any non- claimed element as essential to the practice of the disclosure.
[00151] Preferred embodiments of this disclosure are described herein, including the best mode known to the inventors for carrying out the disclosure. Variations of those preferred embodiments may become apparent to those of ordinary skill in the art upon reading the foregoing description. The inventors expect skilled artisans to employ such variations as appropriate, and the inventors intend for the disclosure to be practiced otherwise than as specifically described herein. Accordingly, this disclosure includes all modifications and equivalents of the subject matter recited in the claims appended hereto as permitted by applicable law. Moreover, any combination of the above-described elements in all possible
variations thereof is encompassed by the disclosure unless otherwise indicated herein or otherwise clearly contradicted by context.
Claims
1 . A method implemented by a processor in a computer for designing a clamp peptide
comprising the structure A B-A2 wherein
is a first peptide arm that binds to a first binding site of a target protein, A2 is a second peptide arm that binds to a second binding site of the target protein, and B is a bridge peptide which links
to A2, the method comprising the steps of:
a. determining a binding score of each tetrapeptide of a first peptide library for the first binding site of the target protein, wherein the first peptide library comprises the full combinatorial set of 160K tetrapeptides having a unique combination of four amino acids of the twenty naturally-occurring amino acids,
b. identifying the sequences of the tetrapeptides having a binding score which meets a first threshold,
c. determining a binding score of each pentapeptide of a second peptide library for the first binding site of the target protein, wherein the second peptide library comprises a set of pentapeptides having a unique combination of five amino acids comprising the amino acids of the sequence of a tetrapeptide identified in step (b) and one of the twenty naturally-occurring amino acids added as the first amino acid or the fifth amino acid of the pentapeptide or between two amino acids of the sequence of the tetrapeptide,
d. identifying the sequences of the pentapeptides having a binding score which meets a second threshold,
e. optionally, determining a binding score of each hexapeptide of a third peptide library for the first binding site of the target protein, wherein the third peptide library comprises a set of hexapeptides having a unique combination of six amino acids comprising the amino acids of the sequence of a pentapeptide identified in step (d) and one of the twenty naturally-occurring amino acids added as the first amino acid or the sixth amino acid of the hexapeptide or between two amino acids of the sequence of the pentapeptide,
f. optionally, identifying the sequences of the hexapeptides having a binding score which meets a third threshold, and
g. repeating steps (a) to (d) or (a) to (f) wherein, for each determining step, a binding score for the second binding site of the target protein is determined for each tetrapeptide of the first peptide library, for each pentapeptide of the second peptide library, and optionally for each hexapeptide of the third peptide library.
2. The method of claim 1 , comprising generating the first peptide library, the second peptide library, the third peptide library, and/or the fourth peptide library.
4. The method of any one of the preceding claims, wherein the first peptide library comprises 160,000 unique tetrapeptides.
5. The method of any one of the preceding claims, wherein the first threshold is the top 5% of binding scores, optionally, wherein 8000 unique sequences of tetrapeptides are identified upon step (b).
6. The method of any one of the preceding claims, wherein the first threshold is the top 5% of binding scores and a binding score outside the top 5% of binding scores for a different target protein, optionally, wherein 1000 unique sequences of tetrapeptides are identified upon step (b).
7. The method of any one of the preceding claims, wherein the second threshold is the top 5% of binding scores, optionally, wherein the second threshold further comprises a binding score outside the top 5% of binding scores for a different target protein.
8. The method of any one of the preceding claims, wherein the third threshold is the top 5% of binding scores, optionally, wherein the third threshold further comprises a binding score outside the top 5% of binding scores for a different target protein.
9. The method of any one of the preceding claims, wherein the binding scores are calculated using a docking scoring function, optionally, chemgauss4.
10. The method of any one of the preceding claims, wherein the binding scores for more than 350,000 peptides are determined for each of the first binding site and the second binding site.
1 1 . The method of any one of the preceding claims, further comprising determining the length of B of the clamp peptide, optionally, by measuring the distance between a peptide bound to the first binding site of the target protein and a peptide bound to the second binding site of the target protein.
12. The method of any one of the preceding claims, wherein the binding scores are determined using a molecular docking program.
13. The method of claim 12, wherein the molecular docking program is based on multi- conformer rigid body docking, which evaluates 10 conformers per peptide.
14. A computer-readable storage medium having stored thereon machine-readable instructions executable by a processor, comprising instructions for carrying out the steps of the method of any one of claims 1 -13.
15. A system comprising machine readable instructions that, when executed by the processor, cause the processor to carry out the steps of the method of any one of claims 1 -13.
16. A method of manufacturing a clamp peptide comprising the structure A B-A2 wherein
is a first peptide arm that binds to a first binding site of a target protein, A2 is a second peptide arm that binds to a second binding site of the target protein, and B is a bridge peptide which links Ai to A2, said method comprising designing the first peptide arm and the second peptide arm of the clamp peptide according to the method of any one of claims 1 -13 and joining the first peptide arm to the second peptide arm with a bridge peptide, B.
17. A clamp peptide comprising the structure A B-A2 wherein
is a first peptide arm that binds to a first binding site of a target protein, A2 is a second peptide arm that binds to a second binding site of the target protein, and B is a bridge peptide which links A^ to A2, wherein the sequences of A^ and A2 are designed by the method of any one of claims 1 -13.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US16/441,646 | 2019-06-14 | ||
US16/441,646 US11830582B2 (en) | 2018-06-14 | 2019-06-14 | Methods of designing novel antibody mimetics for use in detecting antigens and as therapeutic agents |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2020251615A1 true WO2020251615A1 (en) | 2020-12-17 |
Family
ID=73781567
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2019/061862 WO2020251615A1 (en) | 2019-06-14 | 2019-11-15 | Methods of designing novel antibody mimetics for use in detecting antigens and as therapeutic agents |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2020251615A1 (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5565325A (en) * | 1992-10-30 | 1996-10-15 | Bristol-Myers Squibb Company | Iterative methods for screening peptide libraries |
US9885050B2 (en) * | 2007-11-08 | 2018-02-06 | The University Of Chicago | Molecular affinity clamp technology and uses thereof |
US20180179519A1 (en) * | 2015-07-03 | 2018-06-28 | Hunan Zonsen Peplib Biotech Co., Ltd | Peptide library constructing method and related vectors |
-
2019
- 2019-11-15 WO PCT/US2019/061862 patent/WO2020251615A1/en active Application Filing
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5565325A (en) * | 1992-10-30 | 1996-10-15 | Bristol-Myers Squibb Company | Iterative methods for screening peptide libraries |
US9885050B2 (en) * | 2007-11-08 | 2018-02-06 | The University Of Chicago | Molecular affinity clamp technology and uses thereof |
US20180179519A1 (en) * | 2015-07-03 | 2018-06-28 | Hunan Zonsen Peplib Biotech Co., Ltd | Peptide library constructing method and related vectors |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Casadio et al. | A 3D model of the voltage-dependent anion channel (VDAC) | |
Wang et al. | Structure-based design for binding peptides in anti-cancer therapy | |
Cho et al. | An affinity peptide-incorporated electrochemical biosensor for the detection of neutrophil gelatinase-associated lipocalin | |
CN101407541B (en) | Modified allosteric type cyclic citrulline polypeptide, and fusion protein, antibody and reagent kit thereof | |
EP2910949A1 (en) | Combination hepatitis C virus antigen and antibody detection method | |
Gupta et al. | Probing interactions between CLIP-170, EB1, and microtubules | |
Sun et al. | Research advances in hydrogen–deuterium exchange mass spectrometry for protein epitope mapping | |
Lin et al. | Characterization of ELISA antibody-antigen interaction using footprinting-mass spectrometry and negative staining transmission electron microscopy | |
Kim et al. | Biomimetic isolation of affinity peptides for electrochemical detection of influenza virus antigen | |
Yang et al. | Phage display-derived peptide for the specific binding of SARS-CoV-2 | |
US11830582B2 (en) | Methods of designing novel antibody mimetics for use in detecting antigens and as therapeutic agents | |
Leo et al. | Characterization of Self-Assembled Monolayers of Peptide Mimotopes of CD20 Antigen and Their Binding with Rituximab | |
Tans et al. | Affimers as an alternative to antibodies for protein biomarker enrichment | |
CN101687908A (en) | Antigen determinant of rheumatoid arthritis-specific autoantibody and use thereof | |
Štambuk et al. | Genetic coding algorithm for sense and antisense peptide interactions | |
Lee et al. | Linker-free antibody conjugation for sensitive hydrogel microparticle-based multiplex immunoassay | |
WO2018148489A1 (en) | Analyte detection immunoassay | |
Liu et al. | Advances in mass spectrometry-based epitope mapping of protein therapeutics | |
CN110178031A (en) | The determination of serology of zika virus infection | |
Mascini et al. | A new class of sensing elements for sensors: Clamp peptides for Zika virus | |
CN112375754B (en) | Severe acute respiratory syndrome coronavirus 2 affinity polypeptide based on human angiotensin converting enzyme 2 | |
Hug et al. | HLA antibody affinity determination: From HLA‐specific monoclonal antibodies to donor HLA specific antibodies (DSA) in patient serum | |
Gearhart et al. | Selection of a potential diagnostic biomarker for HIV infection from a random library of non-biological synthetic peptoid oligomers | |
WO2020251615A1 (en) | Methods of designing novel antibody mimetics for use in detecting antigens and as therapeutic agents | |
Olaleye et al. | Determination of binding sites on trastuzumab and pertuzumab to selective affimers using hydrogen–deuterium exchange mass spectrometry |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19932843 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19932843 Country of ref document: EP Kind code of ref document: A1 |