CN114409563A - Linker for protein labeling and application thereof in biological medicine
- Publication number
- CN114409563A (application number CN202011173629.8A)
- Authority
- CN
- China
- Prior art keywords
- aryl
- alkyl
- antibody
- compound
- val
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 58
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 58
- 238000002372 labelling Methods 0.000 title claims abstract description 41
- 239000003814 drug Substances 0.000 title claims abstract description 24
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 claims abstract description 20
- 229940079593 drug Drugs 0.000 claims abstract description 19
- 239000000126 substance Substances 0.000 claims abstract description 17
- 239000004472 Lysine Substances 0.000 claims abstract description 16
- 150000001875 compounds Chemical class 0.000 claims description 91
- -1 hydroxy, amino Chemical group 0.000 claims description 49
- 229920001184 polypeptide Polymers 0.000 claims description 24
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 24
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 24
- 229940049595 antibody-drug conjugate Drugs 0.000 claims description 23
- 125000003118 aryl group Chemical group 0.000 claims description 22
- 125000000217 alkyl group Chemical group 0.000 claims description 19
- 229910052736 halogen Inorganic materials 0.000 claims description 19
- 150000002367 halogens Chemical class 0.000 claims description 19
- 125000000449 nitro group Chemical group [O-][N+](*)=O 0.000 claims description 18
- 239000000611 antibody drug conjugate Substances 0.000 claims description 15
- 125000001072 heteroaryl group Chemical group 0.000 claims description 14
- 125000003172 aldehyde group Chemical group 0.000 claims description 13
- 125000000623 heterocyclic group Chemical group 0.000 claims description 13
- 229910052739 hydrogen Inorganic materials 0.000 claims description 13
- 239000001257 hydrogen Substances 0.000 claims description 13
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 claims description 12
- 150000003839 salts Chemical class 0.000 claims description 12
- 125000004093 cyano group Chemical group *C#N 0.000 claims description 11
- YZCKVEUIGOORGS-OUBTZVSYSA-N Deuterium Chemical compound [2H] YZCKVEUIGOORGS-OUBTZVSYSA-N 0.000 claims description 10
- 125000000753 cycloalkyl group Chemical group 0.000 claims description 10
- 229910052805 deuterium Inorganic materials 0.000 claims description 10
- 150000002431 hydrogen Chemical class 0.000 claims description 10
- 238000006467 substitution reaction Methods 0.000 claims description 10
- 125000003277 amino group Chemical group 0.000 claims description 9
- 230000000155 isotopic effect Effects 0.000 claims description 9
- 125000003545 alkoxy group Chemical group 0.000 claims description 8
- 125000003282 alkyl amino group Chemical group 0.000 claims description 8
- 125000004448 alkyl carbonyl group Chemical group 0.000 claims description 8
- 125000002915 carbonyl group Chemical group [*:2]C([*:1])=O 0.000 claims description 8
- 239000000562 conjugate Substances 0.000 claims description 8
- 125000000304 alkynyl group Chemical group 0.000 claims description 7
- 150000004677 hydrates Chemical class 0.000 claims description 7
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 claims description 7
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 claims description 6
- BWRBVBFLFQKBPT-UHFFFAOYSA-N (2-nitrophenyl)methanol Chemical compound OCC1=CC=CC=C1[N+]([O-])=O BWRBVBFLFQKBPT-UHFFFAOYSA-N 0.000 claims description 6
- IAKHMKGGTNLKSZ-INIZCTEOSA-N (S)-colchicine Chemical compound C1([C@@H](NC(C)=O)CC2)=CC(=O)C(OC)=CC=C1C1=C2C=C(OC)C(OC)=C1OC IAKHMKGGTNLKSZ-INIZCTEOSA-N 0.000 claims description 6
- 239000007801 affinity label Substances 0.000 claims description 5
- 125000005015 aryl alkynyl group Chemical group 0.000 claims description 5
- 125000001769 aryl amino group Chemical group 0.000 claims description 5
- 150000001540 azides Chemical class 0.000 claims description 5
- 238000002360 preparation method Methods 0.000 claims description 5
- 125000006763 (C3-C9) cycloalkyl group Chemical group 0.000 claims description 4
- HBAQYPYDRFILMT-UHFFFAOYSA-N 8-[3-(1-cyclopropylpyrazol-4-yl)-1H-pyrazolo[4,3-d]pyrimidin-5-yl]-3-methyl-3,8-diazabicyclo[3.2.1]octan-2-one Chemical class C1(CC1)N1N=CC(=C1)C1=NNC2=C1N=C(N=C2)N1C2C(N(CC1CC2)C)=O HBAQYPYDRFILMT-UHFFFAOYSA-N 0.000 claims description 4
- DFBIRQPKNDILPW-CIVMWXNOSA-N Triptolide Chemical compound O=C1OCC([C@@H]2C3)=C1CC[C@]2(C)[C@]12O[C@H]1[C@@H]1O[C@]1(C(C)C)[C@@H](O)[C@]21[C@H]3O1 DFBIRQPKNDILPW-CIVMWXNOSA-N 0.000 claims description 4
- 239000000427 antigen Substances 0.000 claims description 4
- 108091007433 antigens Proteins 0.000 claims description 4
- 102000036639 antigens Human genes 0.000 claims description 4
- 125000005129 aryl carbonyl group Chemical group 0.000 claims description 4
- 125000002887 hydroxy group Chemical group [H]O* 0.000 claims description 4
- 239000002243 precursor Substances 0.000 claims description 4
- YKUJZZHGTWVWHA-UHFFFAOYSA-N triptolide Natural products COC12CC3OC3(C(C)C)C(O)C14OC4CC5C6=C(CCC25C)C(=O)OC6 YKUJZZHGTWVWHA-UHFFFAOYSA-N 0.000 claims description 4
- 125000004890 (C1-C6) alkylamino group Chemical group 0.000 claims description 3
- 125000004916 (C1-C6) alkylcarbonyl group Chemical group 0.000 claims description 3
- LGZKGOGODCLQHG-CYBMUJFWSA-N 5-[(2r)-2-hydroxy-2-(3,4,5-trimethoxyphenyl)ethyl]-2-methoxyphenol Chemical compound C1=C(O)C(OC)=CC=C1C[C@@H](O)C1=CC(OC)=C(OC)C(OC)=C1 LGZKGOGODCLQHG-CYBMUJFWSA-N 0.000 claims description 3
- 102000008394 Immunoglobulin Fragments Human genes 0.000 claims description 3
- 108010021625 Immunoglobulin Fragments Proteins 0.000 claims description 3
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 claims description 3
- 229930012538 Paclitaxel Natural products 0.000 claims description 3
- 125000004453 alkoxycarbonyl group Chemical group 0.000 claims description 3
- 125000004457 alkyl amino carbonyl group Chemical group 0.000 claims description 3
- 125000004103 aminoalkyl group Chemical group 0.000 claims description 3
- 229930195731 calicheamicin Natural products 0.000 claims description 3
- 229960001338 colchicine Drugs 0.000 claims description 3
- LGZKGOGODCLQHG-UHFFFAOYSA-N combretastatin Natural products C1=C(O)C(OC)=CC=C1CC(O)C1=CC(OC)=C(OC)C(OC)=C1 LGZKGOGODCLQHG-UHFFFAOYSA-N 0.000 claims description 3
- OVBPIULPVIDEAO-LBPRGKRZSA-N folic acid Chemical compound C=1N=C2NC(N)=NC(=O)C2=NC=1CNC1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 OVBPIULPVIDEAO-LBPRGKRZSA-N 0.000 claims description 3
- 229960001592 paclitaxel Drugs 0.000 claims description 3
- 150000003254 radicals Chemical class 0.000 claims description 3
- 125000001424 substituent group Chemical group 0.000 claims description 3
- RCINICONZNJXQF-MZXODVADSA-N taxol Chemical compound O([C@@H]1[C@@]2(C[C@@H](C(C)=C(C2(C)C)[C@H](C([C@]2(C)[C@@H](O)C[C@H]3OC[C@]3([C@H]21)OC(C)=O)=O)OC(=O)C)OC(=O)[C@H](O)[C@@H](NC(=O)C=1C=CC=CC=1)C=1C=CC=CC=1)O)C(=O)C1=CC=CC=C1 RCINICONZNJXQF-MZXODVADSA-N 0.000 claims description 3
- VEGGTWZUZGZKHY-GJZGRUSLSA-N (2s)-2-[[(2s)-2-amino-3-methylbutanoyl]amino]-5-(carbamoylamino)-n-[4-(hydroxymethyl)phenyl]pentanamide Chemical compound NC(=O)NCCC[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)NC1=CC=C(CO)C=C1 VEGGTWZUZGZKHY-GJZGRUSLSA-N 0.000 claims description 2
- ALBODLTZUXKBGZ-JUUVMNCLSA-N (2s)-2-amino-3-phenylpropanoic acid;(2s)-2,6-diaminohexanoic acid Chemical compound NCCCC[C@H](N)C(O)=O.OC(=O)[C@@H](N)CC1=CC=CC=C1 ALBODLTZUXKBGZ-JUUVMNCLSA-N 0.000 claims description 2
- NNWYWNRCBPYLML-GWCFXTLKSA-N (2s)-2-amino-n-[(2s)-1-[4-(hydroxymethyl)anilino]-1-oxopropan-2-yl]-3-methylbutanamide Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NC1=CC=C(CO)C=C1 NNWYWNRCBPYLML-GWCFXTLKSA-N 0.000 claims description 2
- 125000004169 (C1-C6) alkyl group Chemical group 0.000 claims description 2
- 125000000882 C2-C6 alkenyl group Chemical group 0.000 claims description 2
- 125000003601 C2-C6 alkynyl group Chemical group 0.000 claims description 2
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 claims description 2
- JKHXYJKMNSSFFL-IUCAKERBSA-N Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN JKHXYJKMNSSFFL-IUCAKERBSA-N 0.000 claims description 2
- 125000003342 alkenyl group Chemical group 0.000 claims description 2
- 235000020958 biotin Nutrition 0.000 claims description 2
- 239000011616 biotin Substances 0.000 claims description 2
- 229960002685 biotin Drugs 0.000 claims description 2
- 230000000536 complexating effect Effects 0.000 claims description 2
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 claims description 2
- 235000019152 folic acid Nutrition 0.000 claims description 2
- 239000011724 folic acid Substances 0.000 claims description 2
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 claims description 2
- 239000000049 pigment Substances 0.000 claims description 2
- YUOCYTRGANSSRY-UHFFFAOYSA-N pyrrolo[2,3-i][1,2]benzodiazepine Chemical class C1=CN=NC2=C3C=CN=C3C=CC2=C1 YUOCYTRGANSSRY-UHFFFAOYSA-N 0.000 claims description 2
- PYWVYCXTNDRMGF-UHFFFAOYSA-N rhodamine B Chemical compound [Cl-].C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=CC=C1C(O)=O PYWVYCXTNDRMGF-UHFFFAOYSA-N 0.000 claims description 2
- 108010073969 valyllysine Proteins 0.000 claims description 2
- AOJJSUZBOXZQNB-TZSSRYMLSA-N Doxorubicin Chemical compound O([C@H]1C[C@@](O)(CC=2C(O)=C3C(=O)C=4C=CC=C(C=4C(=O)C3=C(O)C=21)OC)C(=O)CO)[C@H]1C[C@H](N)[C@H](O)[C@H](C)O1 AOJJSUZBOXZQNB-TZSSRYMLSA-N 0.000 claims 4
- KLWPJMFMVPTNCC-UHFFFAOYSA-N Camptothecin Natural products CCC1(O)C(=O)OCC2=C1C=C3C4Nc5ccccc5C=C4CN3C2=O KLWPJMFMVPTNCC-UHFFFAOYSA-N 0.000 claims 2
- VSJKWCGYPAHWDS-FQEVSTJZSA-N camptothecin Chemical compound C1=CC=C2C=C(CN3C4=CC5=C(C3=O)COC(=O)[C@]5(O)CC)C4=NC2=C1 VSJKWCGYPAHWDS-FQEVSTJZSA-N 0.000 claims 2
- 229940127093 camptothecin Drugs 0.000 claims 2
- 150000001735 carboxylic acids Chemical class 0.000 claims 2
- VSJKWCGYPAHWDS-UHFFFAOYSA-N dl-camptothecin Natural products C1=CC=C2C=C(CN3C4=CC5=C(C3=O)COC(=O)C5(O)CC)C4=NC2=C1 VSJKWCGYPAHWDS-UHFFFAOYSA-N 0.000 claims 2
- HYFHYPWGAURHIV-UHFFFAOYSA-N homoharringtonine Natural products C1=C2CCN3CCCC43C=C(OC)C(OC(=O)C(O)(CCCC(C)(C)O)CC(=O)OC)C4C2=CC2=C1OCO2 HYFHYPWGAURHIV-UHFFFAOYSA-N 0.000 claims 2
- HYFHYPWGAURHIV-JFIAXGOJSA-N omacetaxine mepesuccinate Chemical compound C1=C2CCN3CCC[C@]43C=C(OC)[C@@H](OC(=O)[C@@](O)(CCCC(C)(C)O)CC(=O)OC)[C@H]4C2=CC2=C1OCO2 HYFHYPWGAURHIV-JFIAXGOJSA-N 0.000 claims 2
- 229960002230 omacetaxine mepesuccinate Drugs 0.000 claims 2
- GOLORTLGFDVFDW-UHFFFAOYSA-N 3-(1h-benzimidazol-2-yl)-7-(diethylamino)chromen-2-one Chemical compound C1=CC=C2NC(C3=CC4=CC=C(C=C4OC3=O)N(CC)CC)=NC2=C1 GOLORTLGFDVFDW-UHFFFAOYSA-N 0.000 claims 1
- OVBPIULPVIDEAO-UHFFFAOYSA-N N-Pteroyl-L-glutaminsaeure Natural products C=1N=C2NC(N)=NC(=O)C2=NC=1CNC1=CC=C(C(=O)NC(CCC(O)=O)C(O)=O)C=C1 OVBPIULPVIDEAO-UHFFFAOYSA-N 0.000 claims 1
- 229940009456 adriamycin Drugs 0.000 claims 1
- 229960004679 doxorubicin Drugs 0.000 claims 1
- 229960000304 folic acid Drugs 0.000 claims 1
- 229920002857 polybutadiene Polymers 0.000 claims 1
- 239000003153 chemical reaction reagent Substances 0.000 abstract description 7
- 238000011161 development Methods 0.000 abstract description 3
- 238000003384 imaging method Methods 0.000 abstract description 2
- 229920002521 macromolecule Polymers 0.000 abstract description 2
- 239000000523 sample Substances 0.000 abstract description 2
- 229940125644 antibody drug Drugs 0.000 abstract 1
- 235000018102 proteins Nutrition 0.000 description 51
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 45
- 239000000243 solution Substances 0.000 description 29
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 21
- 238000001228 spectrum Methods 0.000 description 20
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 18
- 238000006243 chemical reaction Methods 0.000 description 17
- 239000000203 mixture Substances 0.000 description 14
- 238000000034 method Methods 0.000 description 13
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 12
- 102000016943 Muramidase Human genes 0.000 description 11
- 108010014251 Muramidase Proteins 0.000 description 11
- 108010062374 Myoglobin Proteins 0.000 description 11
- 102000036675 Myoglobin Human genes 0.000 description 11
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 11
- 238000002156 mixing Methods 0.000 description 11
- 230000008878 coupling Effects 0.000 description 10
- 238000010168 coupling process Methods 0.000 description 10
- 238000005859 coupling reaction Methods 0.000 description 10
- 125000000524 functional group Chemical group 0.000 description 10
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 10
- OKKJLVBELUTLKV-VMNATFBRSA-N methanol-d1 Chemical compound [2H]OC OKKJLVBELUTLKV-VMNATFBRSA-N 0.000 description 9
- 230000004048 modification Effects 0.000 description 9
- 238000012986 modification Methods 0.000 description 9
- 238000005160 1H NMR spectroscopy Methods 0.000 description 8
- VAKXPQHQQNOUEZ-UHFFFAOYSA-N 3-[4-[[bis[[1-(3-hydroxypropyl)triazol-4-yl]methyl]amino]methyl]triazol-1-yl]propan-1-ol Chemical compound N1=NN(CCCO)C=C1CN(CC=1N=NN(CCCO)C=1)CC1=CN(CCCO)N=N1 VAKXPQHQQNOUEZ-UHFFFAOYSA-N 0.000 description 8
- 108090000848 Ubiquitin Proteins 0.000 description 8
- 102000044159 Ubiquitin Human genes 0.000 description 8
- ARUVKPQLZAKDPS-UHFFFAOYSA-L copper(II) sulfate Chemical compound [Cu+2].[O-][S+2]([O-])([O-])[O-] ARUVKPQLZAKDPS-UHFFFAOYSA-L 0.000 description 8
- 229910000366 copper(II) sulfate Inorganic materials 0.000 description 8
- 238000002101 electrospray ionisation tandem mass spectrometry Methods 0.000 description 8
- 239000000499 gel Substances 0.000 description 8
- 235000010335 lysozyme Nutrition 0.000 description 8
- 239000004325 lysozyme Substances 0.000 description 8
- 229960000274 lysozyme Drugs 0.000 description 8
- 239000003550 marker Substances 0.000 description 8
- 235000010378 sodium ascorbate Nutrition 0.000 description 8
- PPASLZSBLFJQEF-RKJRWTFHSA-M sodium ascorbate Substances [Na+].OC[C@@H](O)[C@H]1OC(=O)C(O)=C1[O-] PPASLZSBLFJQEF-RKJRWTFHSA-M 0.000 description 8
- 229960005055 sodium ascorbate Drugs 0.000 description 8
- PPASLZSBLFJQEF-RXSVEWSESA-M sodium-L-ascorbate Chemical compound [Na+].OC[C@H](O)[C@H]1OC(=O)C(O)=C1[O-] PPASLZSBLFJQEF-RXSVEWSESA-M 0.000 description 8
- 108010038061 Chymotrypsinogen Proteins 0.000 description 7
- 238000000799 fluorescence microscopy Methods 0.000 description 7
- 125000005647 linker group Chemical group 0.000 description 7
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 7
- 125000006413 ring segment Chemical group 0.000 description 7
- YMWUJEATGCHHMB-UHFFFAOYSA-N Dichloromethane Chemical compound ClCCl YMWUJEATGCHHMB-UHFFFAOYSA-N 0.000 description 6
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 6
- JGFZNNIVVJXRND-UHFFFAOYSA-N N,N-Diisopropylethylamine (DIPEA) Chemical compound CCN(C(C)C)C(C)C JGFZNNIVVJXRND-UHFFFAOYSA-N 0.000 description 6
- 125000003275 alpha amino acid group Chemical group 0.000 description 6
- 238000009826 distribution Methods 0.000 description 6
- 230000006698 induction Effects 0.000 description 6
- 238000001644 13C nuclear magnetic resonance spectroscopy Methods 0.000 description 5
- 238000001437 electrospray ionisation time-of-flight quadrupole detection Methods 0.000 description 5
- 229940116978 human epidermal growth factor Drugs 0.000 description 5
- 229920006395 saturated elastomer Polymers 0.000 description 5
- 238000001946 ultra-performance liquid chromatography-mass spectrometry Methods 0.000 description 5
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 4
- 102000007079 Peptide Fragments Human genes 0.000 description 4
- 108010033276 Peptide Fragments Proteins 0.000 description 4
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 4
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 4
- 102000004142 Trypsin Human genes 0.000 description 4
- 108090000631 Trypsin Proteins 0.000 description 4
- 239000012267 brine Substances 0.000 description 4
- 238000013461 design Methods 0.000 description 4
- 230000029087 digestion Effects 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 239000012634 fragment Substances 0.000 description 4
- 125000005842 heteroatom Chemical group 0.000 description 4
- 125000004435 hydrogen atom Chemical group [H]* 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- HPALAKNZSZLMCH-UHFFFAOYSA-M sodium;chloride;hydrate Chemical compound O.[Na+].[Cl-] HPALAKNZSZLMCH-UHFFFAOYSA-M 0.000 description 4
- 239000011550 stock solution Substances 0.000 description 4
- WGTODYJZXSJIAG-UHFFFAOYSA-N tetramethylrhodamine chloride Chemical compound [Cl-].C=12C=CC(N(C)C)=CC2=[O+]C2=CC(N(C)C)=CC=C2C=1C1=CC=CC=C1C(O)=O WGTODYJZXSJIAG-UHFFFAOYSA-N 0.000 description 4
- 239000012588 trypsin Substances 0.000 description 4
- IGXNPQWXIRIGBF-KEOOTSPTSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IGXNPQWXIRIGBF-KEOOTSPTSA-N 0.000 description 3
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 3
- 102000001301 EGF receptor Human genes 0.000 description 3
- 108060006698 EGF receptor Proteins 0.000 description 3
- 239000007821 HATU Substances 0.000 description 3
- 241000880493 Leptailurus serval Species 0.000 description 3
- 239000007832 Na2SO4 Substances 0.000 description 3
- YXFVVABEGXRONW-UHFFFAOYSA-N Toluene Chemical compound CC1=CC=CC=C1 YXFVVABEGXRONW-UHFFFAOYSA-N 0.000 description 3
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 3
- 108010047495 alanylglycine Proteins 0.000 description 3
- 108010092854 aspartyllysine Proteins 0.000 description 3
- 125000004432 carbon atom Chemical group C* 0.000 description 3
- 230000021615 conjugation Effects 0.000 description 3
- 125000000113 cyclohexyl group Chemical group [H]C1([H])C([H])([H])C([H])([H])C([H])(*)C([H])([H])C1([H])[H] 0.000 description 3
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 3
- 238000010790 dilution Methods 0.000 description 3
- 239000012895 dilution Substances 0.000 description 3
- 238000001035 drying Methods 0.000 description 3
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 3
- 108010034529 leucyl-lysine Proteins 0.000 description 3
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 3
- 238000013365 molecular weight analysis method Methods 0.000 description 3
- 239000012044 organic layer Substances 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 239000000700 radioactive tracer Substances 0.000 description 3
- 239000011541 reaction mixture Substances 0.000 description 3
- 239000011734 sodium Substances 0.000 description 3
- 229910052938 sodium sulfate Inorganic materials 0.000 description 3
- 239000002904 solvent Substances 0.000 description 3
- 125000004042 4-aminobutyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])N([H])[H] 0.000 description 2
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 2
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 2
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 2
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 2
- KXDAEFPNCMNJSK-UHFFFAOYSA-N Benzamide Chemical compound NC(=O)C1=CC=CC=C1 KXDAEFPNCMNJSK-UHFFFAOYSA-N 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- UXIYYUMGFNSGBK-XPUUQOCRSA-N Cys-Gly-Val Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O UXIYYUMGFNSGBK-XPUUQOCRSA-N 0.000 description 2
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 2
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 2
- OYTPNWYZORARHL-XHNCKOQMSA-N Gln-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N OYTPNWYZORARHL-XHNCKOQMSA-N 0.000 description 2
- AXZGZMGRBDQTEY-SRVKXCTJSA-N Leu-Gln-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O AXZGZMGRBDQTEY-SRVKXCTJSA-N 0.000 description 2
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 2
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 description 2
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 2
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 2
- WYURNTSHIVDZCO-UHFFFAOYSA-N Tetrahydrofuran Chemical compound C1CCOC1 WYURNTSHIVDZCO-UHFFFAOYSA-N 0.000 description 2
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 2
- 229910052799 carbon Inorganic materials 0.000 description 2
- 150000001732 carboxylic acid derivatives Chemical class 0.000 description 2
- 238000007385 chemical modification Methods 0.000 description 2
- 239000000460 chlorine Substances 0.000 description 2
- ZYGHJZDHTFUPRJ-UHFFFAOYSA-N coumarin Chemical compound C1=CC=C2OC(=O)C=CC2=C1 ZYGHJZDHTFUPRJ-UHFFFAOYSA-N 0.000 description 2
- 125000001995 cyclobutyl group Chemical group [H]C1([H])C([H])([H])C([H])(*)C1([H])[H] 0.000 description 2
- 125000001511 cyclopentyl group Chemical group [H]C1([H])C([H])([H])C([H])([H])C([H])(*)C1([H])[H] 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 2
- LOCAIGRSOJUCTB-UHFFFAOYSA-N indazol-3-one Chemical compound C1=CC=C2C(=O)N=NC2=C1 LOCAIGRSOJUCTB-UHFFFAOYSA-N 0.000 description 2
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 2
- 238000001819 mass spectrum Methods 0.000 description 2
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 2
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 230000000269 nucleophilic effect Effects 0.000 description 2
- 229910052760 oxygen Inorganic materials 0.000 description 2
- 239000001301 oxygen Substances 0.000 description 2
- 239000012071 phase Substances 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 125000001436 propyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])[H] 0.000 description 2
- WGYKZJWCGVVSQN-UHFFFAOYSA-N propylamine Chemical compound CCCN WGYKZJWCGVVSQN-UHFFFAOYSA-N 0.000 description 2
- 108020003175 receptors Proteins 0.000 description 2
- 102000005962 receptors Human genes 0.000 description 2
- 238000010992 reflux Methods 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 239000000741 silica gel Substances 0.000 description 2
- 229910002027 silica gel Inorganic materials 0.000 description 2
- 229910052717 sulfur Inorganic materials 0.000 description 2
- 239000011593 sulfur Substances 0.000 description 2
- 108010061238 threonyl-glycine Proteins 0.000 description 2
- 239000003643 water by type Substances 0.000 description 2
- ZXJZGWOMAFPSJH-DCAQKATOSA-N (2S)-1-[2-[[2-[[(2S)-2-[[(2S)-2-[(2-aminoacetyl)amino]-3-carboxypropanoyl]amino]-3-hydroxypropanoyl]amino]acetyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O ZXJZGWOMAFPSJH-DCAQKATOSA-N 0.000 description 1
- 125000006570 (C5-C6) heteroaryl group Chemical group 0.000 description 1
- LMDZBCPBFSXMTL-UHFFFAOYSA-N 1-Ethyl-3-(3-dimethylaminopropyl)carbodiimide Substances CCN=C=NCCCN(C)C LMDZBCPBFSXMTL-UHFFFAOYSA-N 0.000 description 1
- FPQQSJJWHUJYPU-UHFFFAOYSA-N 3-(dimethylamino)propyliminomethylidene-ethylazanium;chloride Chemical compound Cl.CCN=C=NCCCN(C)C FPQQSJJWHUJYPU-UHFFFAOYSA-N 0.000 description 1
- OYBOVXXFJYJYPC-UHFFFAOYSA-N 3-azidopropan-1-amine Chemical compound NCCCN=[N+]=[N-] OYBOVXXFJYJYPC-UHFFFAOYSA-N 0.000 description 1
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 1
- QMAHVAFURJBOFV-UHFFFAOYSA-N 4-(bromomethyl)-3-nitrobenzoic acid Chemical compound OC(=O)C1=CC=C(CBr)C([N+]([O-])=O)=C1 QMAHVAFURJBOFV-UHFFFAOYSA-N 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- ZCYVEMRRCGMTRW-UHFFFAOYSA-N 7553-56-2 Chemical compound [I] ZCYVEMRRCGMTRW-UHFFFAOYSA-N 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 1
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 1
- WZGZDOXCDLLTHE-SYWGBEHUSA-N Ala-Trp-Ile Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 WZGZDOXCDLLTHE-SYWGBEHUSA-N 0.000 description 1
- BOKLLPVAQDSLHC-FXQIFTODSA-N Ala-Val-Cys Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N BOKLLPVAQDSLHC-FXQIFTODSA-N 0.000 description 1
- 108091023037 Aptamer Proteins 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 1
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 1
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 1
- PPPXVIBMLFWNSK-BQBZGAKWSA-N Arg-Gly-Cys Chemical compound C(C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N PPPXVIBMLFWNSK-BQBZGAKWSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 1
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- WAEWODAAWLGLMK-OYDLWJJNSA-N Arg-Trp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WAEWODAAWLGLMK-OYDLWJJNSA-N 0.000 description 1
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 description 1
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- NTXNUXPCNRDMAF-WFBYXXMGSA-N Asn-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC(N)=O)C)C(O)=O)=CNC2=C1 NTXNUXPCNRDMAF-WFBYXXMGSA-N 0.000 description 1
- YJRORCOAFUZVKA-FXQIFTODSA-N Asn-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N YJRORCOAFUZVKA-FXQIFTODSA-N 0.000 description 1
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 1
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 1
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 1
- ZDOQDYFZNGASEY-BIIVOSGPSA-N Asn-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZDOQDYFZNGASEY-BIIVOSGPSA-N 0.000 description 1
- XXAOXVBAWLMTDR-ZLUOBGJFSA-N Asn-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N XXAOXVBAWLMTDR-ZLUOBGJFSA-N 0.000 description 1
- YQNBILXAUIAUCF-CIUDSAMLSA-N Asn-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N YQNBILXAUIAUCF-CIUDSAMLSA-N 0.000 description 1
- OWUCNXMFJRFOFI-BQBZGAKWSA-N Asn-Gly-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OWUCNXMFJRFOFI-BQBZGAKWSA-N 0.000 description 1
- LTZIRYMWOJHRCH-GUDRVLHUSA-N Asn-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N LTZIRYMWOJHRCH-GUDRVLHUSA-N 0.000 description 1
- UHGUKCOQUNPSKK-CIUDSAMLSA-N Asn-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N UHGUKCOQUNPSKK-CIUDSAMLSA-N 0.000 description 1
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 1
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 1
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 1
- LJRPYAZQQWHEEV-FXQIFTODSA-N Asp-Gln-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O LJRPYAZQQWHEEV-FXQIFTODSA-N 0.000 description 1
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 1
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 1
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 1
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 description 1
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 1
- WKBOTKDWSSQWDR-UHFFFAOYSA-N Bromine atom Chemical compound [Br] WKBOTKDWSSQWDR-UHFFFAOYSA-N 0.000 description 1
- 125000000041 C6-C10 aryl group Chemical group 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- ZAMOUSCENKQFHK-UHFFFAOYSA-N Chlorine atom Chemical compound [Cl] ZAMOUSCENKQFHK-UHFFFAOYSA-N 0.000 description 1
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 1
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 1
- ZQHQTSONVIANQR-BQBZGAKWSA-N Cys-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N ZQHQTSONVIANQR-BQBZGAKWSA-N 0.000 description 1
- HBHMVBGGHDMPBF-GARJFASQSA-N Cys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N HBHMVBGGHDMPBF-GARJFASQSA-N 0.000 description 1
- 101710112752 Cytotoxin Proteins 0.000 description 1
- 235000000638 D-biotin Nutrition 0.000 description 1
- 239000011665 D-biotin Substances 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- PIICEJLVQHRZGT-UHFFFAOYSA-N Ethylenediamine Chemical compound NCCN PIICEJLVQHRZGT-UHFFFAOYSA-N 0.000 description 1
- PXGOKWXKJXAPGV-UHFFFAOYSA-N Fluorine Chemical compound FF PXGOKWXKJXAPGV-UHFFFAOYSA-N 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 1
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 1
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 1
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 1
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 1
- BUVMZWZNWMKASN-QEJZJMRPSA-N Glu-Asn-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 BUVMZWZNWMKASN-QEJZJMRPSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- LHIPZASLKPYDPI-AVGNSLFASA-N Glu-Phe-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LHIPZASLKPYDPI-AVGNSLFASA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- VHPVBPCCWVDGJL-IRIUXVKKSA-N Glu-Thr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VHPVBPCCWVDGJL-IRIUXVKKSA-N 0.000 description 1
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 1
- LERGJIVJIIODPZ-ZANVPECISA-N Gly-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)C)C(O)=O)=CNC2=C1 LERGJIVJIIODPZ-ZANVPECISA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 1
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 1
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- IROABALAWGJQGM-OALUTQOASA-N Gly-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)CN IROABALAWGJQGM-OALUTQOASA-N 0.000 description 1
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 1
- CTGZVVQVIBSOBB-AVGNSLFASA-N His-His-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O CTGZVVQVIBSOBB-AVGNSLFASA-N 0.000 description 1
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 1
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 1
- LDFWDDVELNOGII-MXAVVETBSA-N His-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N LDFWDDVELNOGII-MXAVVETBSA-N 0.000 description 1
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 1
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 1
- UOYGZBIPZYKGSH-SRVKXCTJSA-N His-Ser-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N UOYGZBIPZYKGSH-SRVKXCTJSA-N 0.000 description 1
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 1
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 1
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 1
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 1
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 1
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 1
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 1
- NXRNRBOKDBIVKQ-CXTHYWKRSA-N Ile-Tyr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N NXRNRBOKDBIVKQ-CXTHYWKRSA-N 0.000 description 1
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- QUAAUWNLWMLERT-IHRRRGAJSA-N Leu-Arg-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O QUAAUWNLWMLERT-IHRRRGAJSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- LINKCQUOMUDLKN-KATARQTJSA-N Leu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N)O LINKCQUOMUDLKN-KATARQTJSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 1
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 1
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 1
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 1
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 1
- ZJSZPXISKMDJKQ-JYJNAYRXSA-N Lys-Phe-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=CC=C1 ZJSZPXISKMDJKQ-JYJNAYRXSA-N 0.000 description 1
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- PELXPRPDQRFBGQ-KKUMJFAQSA-N Lys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O PELXPRPDQRFBGQ-KKUMJFAQSA-N 0.000 description 1
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 1
- USPJSTBDIGJPFK-PMVMPFDFSA-N Lys-Tyr-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O USPJSTBDIGJPFK-PMVMPFDFSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- AWOMRHGUWFBDNU-ZPFDUUQYSA-N Met-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N AWOMRHGUWFBDNU-ZPFDUUQYSA-N 0.000 description 1
- PHWSCIFNNLLUFJ-NHCYSSNCSA-N Met-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N PHWSCIFNNLLUFJ-NHCYSSNCSA-N 0.000 description 1
- ZEVPMOHYCQFWSE-NAKRPEOUSA-N Met-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCSC)N ZEVPMOHYCQFWSE-NAKRPEOUSA-N 0.000 description 1
- UNPGTBHYKJOCCZ-DCAQKATOSA-N Met-Lys-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O UNPGTBHYKJOCCZ-DCAQKATOSA-N 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 238000005481 NMR spectroscopy Methods 0.000 description 1
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 1
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 1
- FINLZXKJWTYYLC-ACRUOGEOSA-N Phe-His-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FINLZXKJWTYYLC-ACRUOGEOSA-N 0.000 description 1
- BWTKUQPNOMMKMA-FIRPJDEBSA-N Phe-Ile-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BWTKUQPNOMMKMA-FIRPJDEBSA-N 0.000 description 1
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 1
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 1
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 1
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 1
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 1
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- GOOHAUXETOMSMM-UHFFFAOYSA-N Propylene oxide Chemical compound CC1CO1 GOOHAUXETOMSMM-UHFFFAOYSA-N 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- IDCKUIWEIZYVSO-WFBYXXMGSA-N Ser-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C)C(O)=O)=CNC2=C1 IDCKUIWEIZYVSO-WFBYXXMGSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 1
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 1
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 1
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 1
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- GVIGVIOEYBOTCB-XIRDDKMYSA-N Ser-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC(C)C)C(O)=O)=CNC2=C1 GVIGVIOEYBOTCB-XIRDDKMYSA-N 0.000 description 1
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 1
- VFWQQZMRKFOGLE-ZLUOBGJFSA-N Ser-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O VFWQQZMRKFOGLE-ZLUOBGJFSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- SOACHCFYJMCMHC-BWBBJGPYSA-N Ser-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)O SOACHCFYJMCMHC-BWBBJGPYSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- WMZVVNLPHFSUPA-BPUTZDHNSA-N Ser-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 WMZVVNLPHFSUPA-BPUTZDHNSA-N 0.000 description 1
- YXEYTHXDRDAIOJ-CWRNSKLLSA-N Ser-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N)C(=O)O YXEYTHXDRDAIOJ-CWRNSKLLSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- KDYFGRWQOYBRFD-UHFFFAOYSA-N Succinic acid Natural products OC(=O)CCC(O)=O KDYFGRWQOYBRFD-UHFFFAOYSA-N 0.000 description 1
- PZBFGYYEXUXCOF-UHFFFAOYSA-N TCEP Chemical compound OC(=O)CCP(CCC(O)=O)CCC(O)=O PZBFGYYEXUXCOF-UHFFFAOYSA-N 0.000 description 1
- 229940123237 Taxane Drugs 0.000 description 1
- DSLHSTIUAPKERR-XGEHTFHBSA-N Thr-Cys-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O DSLHSTIUAPKERR-XGEHTFHBSA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 1
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- VEENWOSZGWWKHW-SZZJOZGLSA-N Thr-Trp-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N)O VEENWOSZGWWKHW-SZZJOZGLSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- DQDXHYIEITXNJY-BPUTZDHNSA-N Trp-Gln-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N DQDXHYIEITXNJY-BPUTZDHNSA-N 0.000 description 1
- JLTQXEOXIJMCLZ-ZVZYQTTQSA-N Trp-Gln-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 JLTQXEOXIJMCLZ-ZVZYQTTQSA-N 0.000 description 1
- SVGAWGVHFIYAEE-JSGCOSHPSA-N Trp-Gly-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 SVGAWGVHFIYAEE-JSGCOSHPSA-N 0.000 description 1
- RPVDDQYNBOVWLR-HOCLYGCPSA-N Trp-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RPVDDQYNBOVWLR-HOCLYGCPSA-N 0.000 description 1
- NXQAOORHSYJRGH-AAEUAGOBSA-N Trp-Gly-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 NXQAOORHSYJRGH-AAEUAGOBSA-N 0.000 description 1
- SLOYNOMYOAOUCX-BVSLBCMMSA-N Trp-Phe-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SLOYNOMYOAOUCX-BVSLBCMMSA-N 0.000 description 1
- CYLQUSBOSWCHTO-BPUTZDHNSA-N Trp-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N CYLQUSBOSWCHTO-BPUTZDHNSA-N 0.000 description 1
- UUZYQOUJTORBQO-ZVZYQTTQSA-N Trp-Val-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 UUZYQOUJTORBQO-ZVZYQTTQSA-N 0.000 description 1
- NIHNMOSRSAYZIT-BPNCWPANSA-N Tyr-Ala-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NIHNMOSRSAYZIT-BPNCWPANSA-N 0.000 description 1
- AKXBNSZMYAOGLS-STQMWFEESA-N Tyr-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AKXBNSZMYAOGLS-STQMWFEESA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 1
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 1
- NUQZCPSZHGIYTA-HKUYNNGSSA-N Tyr-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N NUQZCPSZHGIYTA-HKUYNNGSSA-N 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- VJOWWOGRNXRQMF-UVBJJODRSA-N Val-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 VJOWWOGRNXRQMF-UVBJJODRSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 1
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- QHSSPPHOHJSTML-HOCLYGCPSA-N Val-Trp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N QHSSPPHOHJSTML-HOCLYGCPSA-N 0.000 description 1
- PGBMPFKFKXYROZ-UFYCRDLUSA-N Val-Tyr-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N PGBMPFKFKXYROZ-UFYCRDLUSA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 239000013543 active substance Substances 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 1
- 125000004414 alkyl thio group Chemical group 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 235000001014 amino acid Nutrition 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 150000001413 amino acids Chemical class 0.000 description 1
- 239000008346 aqueous phase Substances 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010089975 arginyl-glycyl-aspartyl-serine Proteins 0.000 description 1
- RWCCWEUUXYIKHB-UHFFFAOYSA-N benzophenone Chemical compound C=1C=CC=CC=1C(=O)C1=CC=CC=C1 RWCCWEUUXYIKHB-UHFFFAOYSA-N 0.000 description 1
- 239000012965 benzophenone Substances 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 238000001815 biotherapy Methods 0.000 description 1
- 229960000455 brentuximab vedotin Drugs 0.000 description 1
- GDTBXPJZTBHREO-UHFFFAOYSA-N bromine Substances BrBr GDTBXPJZTBHREO-UHFFFAOYSA-N 0.000 description 1
- 229910052794 bromium Inorganic materials 0.000 description 1
- 125000000484 butyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 239000013043 chemical agent Substances 0.000 description 1
- 239000003638 chemical reducing agent Substances 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 229910052801 chlorine Inorganic materials 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000004440 column chromatography Methods 0.000 description 1
- 150000004814 combretastatins Chemical class 0.000 description 1
- 229960000956 coumarin Drugs 0.000 description 1
- 235000001671 coumarin Nutrition 0.000 description 1
- 238000004132 cross linking Methods 0.000 description 1
- 125000006165 cyclic alkyl group Chemical group 0.000 description 1
- 125000001559 cyclopropyl group Chemical group [H]C1([H])C([H])([H])C1([H])* 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- 108010069495 cysteinyltyrosine Proteins 0.000 description 1
- 231100000433 cytotoxic Toxicity 0.000 description 1
- 231100000599 cytotoxic agent Toxicity 0.000 description 1
- 230000001472 cytotoxic effect Effects 0.000 description 1
- 239000002619 cytotoxin Substances 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 108010009297 diglycyl-histidine Proteins 0.000 description 1
- 108010054812 diprotin A Proteins 0.000 description 1
- 239000000890 drug combination Substances 0.000 description 1
- 238000009510 drug design Methods 0.000 description 1
- 229940088598 enzyme Drugs 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000003818 flash chromatography Methods 0.000 description 1
- 239000011737 fluorine Substances 0.000 description 1
- 229910052731 fluorine Inorganic materials 0.000 description 1
- 229940014144 folate Drugs 0.000 description 1
- 235000019253 formic acid Nutrition 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 229960003297 gemtuzumab ozogamicin Drugs 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010050848 glycylleucine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 125000001183 hydrocarbyl group Chemical group 0.000 description 1
- 125000004029 hydroxymethyl group Chemical group [H]OC([H])([H])* 0.000 description 1
- 125000002140 imidazol-4-yl group Chemical group [H]N1C([H])=NC([*])=C1[H] 0.000 description 1
- 238000011065 in-situ storage Methods 0.000 description 1
- 150000007529 inorganic bases Chemical class 0.000 description 1
- 239000011630 iodine Substances 0.000 description 1
- 229910052740 iodine Inorganic materials 0.000 description 1
- 230000002427 irreversible effect Effects 0.000 description 1
- 125000000959 isobutyl group Chemical group [H]C([H])([H])C([H])(C([H])([H])[H])C([H])([H])* 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000010534 mechanism of action Effects 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 150000007522 mineralic acids Chemical class 0.000 description 1
- 239000011259 mixed solution Substances 0.000 description 1
- 125000002757 morpholinyl group Chemical group 0.000 description 1
- 125000001624 naphthyl group Chemical group 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 150000007524 organic acids Chemical class 0.000 description 1
- 150000007530 organic bases Chemical class 0.000 description 1
- 125000001147 pentyl group Chemical group C(CCCC)* 0.000 description 1
- 239000003208 petroleum Substances 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- NMHMNPHRMNGLLB-UHFFFAOYSA-N phloretic acid Chemical compound OC(=O)CCC1=CC=C(O)C=C1 NMHMNPHRMNGLLB-UHFFFAOYSA-N 0.000 description 1
- 125000004193 piperazinyl group Chemical group 0.000 description 1
- 125000003386 piperidinyl group Chemical group 0.000 description 1
- 125000005592 polycycloalkyl group Polymers 0.000 description 1
- 230000008092 positive effect Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- JKANAVGODYYCQF-UHFFFAOYSA-N prop-2-yn-1-amine Chemical compound NCC#C JKANAVGODYYCQF-UHFFFAOYSA-N 0.000 description 1
- 230000009145 protein modification Effects 0.000 description 1
- 125000000719 pyrrolidinyl group Chemical group 0.000 description 1
- 230000036632 reaction speed Effects 0.000 description 1
- 239000012048 reactive intermediate Substances 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 125000002914 sec-butyl group Chemical group [H]C([H])([H])C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 238000010898 silica gel chromatography Methods 0.000 description 1
- 239000000377 silicon dioxide Substances 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 125000000999 tert-butyl group Chemical group [H]C([H])([H])C(*)(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- YLQBMQCUIZJEEH-UHFFFAOYSA-N tetrahydrofuran Natural products C=1C=COC=1 YLQBMQCUIZJEEH-UHFFFAOYSA-N 0.000 description 1
- 125000003718 tetrahydrofuranyl group Chemical group 0.000 description 1
- 125000001412 tetrahydropyranyl group Chemical group 0.000 description 1
- 229940126585 therapeutic drug Drugs 0.000 description 1
- 238000004809 thin layer chromatography Methods 0.000 description 1
- 125000004568 thiomorpholinyl group Chemical group 0.000 description 1
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 229960001612 trastuzumab emtansine Drugs 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 238000004704 ultra performance liquid chromatography Methods 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07C—ACYCLIC OR CARBOCYCLIC COMPOUNDS
- C07C235/00—Carboxylic acid amides, the carbon skeleton of the acid part being further substituted by oxygen atoms
- C07C235/42—Carboxylic acid amides, the carbon skeleton of the acid part being further substituted by oxygen atoms having carbon atoms of carboxamide groups bound to carbon atoms of six-membered aromatic rings and singly-bound oxygen atoms bound to the same carbon skeleton
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K47/00—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient
- A61K47/50—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates
- A61K47/51—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent
- A61K47/68—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an antibody, an immunoglobulin or a fragment thereof, e.g. an Fc-fragment
- A61K47/6801—Drug-antibody or immunoglobulin conjugates defined by the pharmacologically or therapeutically active agent
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K49/00—Preparations for testing in vivo
- A61K49/001—Preparation for luminescence or biological staining
- A61K49/0013—Luminescence
- A61K49/0017—Fluorescence in vivo
- A61K49/005—Fluorescence in vivo characterised by the carrier molecule carrying the fluorescent agent
- A61K49/0058—Antibodies
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07C—ACYCLIC OR CARBOCYCLIC COMPOUNDS
- C07C245/00—Compounds containing chains of at least two nitrogen atoms with at least one nitrogen-to-nitrogen multiple bond
- C07C245/20—Diazonium compounds
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07D—HETEROCYCLIC COMPOUNDS
- C07D207/00—Heterocyclic compounds containing five-membered rings not condensed with other rings, with one nitrogen atom as the only ring hetero atom
- C07D207/02—Heterocyclic compounds containing five-membered rings not condensed with other rings, with one nitrogen atom as the only ring hetero atom with only hydrogen or carbon atoms directly attached to the ring nitrogen atom
- C07D207/44—Heterocyclic compounds containing five-membered rings not condensed with other rings, with one nitrogen atom as the only ring hetero atom with only hydrogen or carbon atoms directly attached to the ring nitrogen atom having three double bonds between ring members or between ring members and non-ring members
- C07D207/444—Heterocyclic compounds containing five-membered rings not condensed with other rings, with one nitrogen atom as the only ring hetero atom with only hydrogen or carbon atoms directly attached to the ring nitrogen atom having three double bonds between ring members or between ring members and non-ring members having two doubly-bound oxygen atoms directly attached in positions 2 and 5
- C07D207/448—Heterocyclic compounds containing five-membered rings not condensed with other rings, with one nitrogen atom as the only ring hetero atom with only hydrogen or carbon atoms directly attached to the ring nitrogen atom having three double bonds between ring members or between ring members and non-ring members having two doubly-bound oxygen atoms directly attached in positions 2 and 5 with only hydrogen atoms or radicals containing only hydrogen and carbon atoms directly attached to other ring carbon atoms, e.g. maleimide
- C07D207/452—Heterocyclic compounds containing five-membered rings not condensed with other rings, with one nitrogen atom as the only ring hetero atom with only hydrogen or carbon atoms directly attached to the ring nitrogen atom having three double bonds between ring members or between ring members and non-ring members having two doubly-bound oxygen atoms directly attached in positions 2 and 5 with only hydrogen atoms or radicals containing only hydrogen and carbon atoms directly attached to other ring carbon atoms, e.g. maleimide with hydrocarbon radicals, substituted by hetero atoms, directly attached to the ring nitrogen atom
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07D—HETEROCYCLIC COMPOUNDS
- C07D495/00—Heterocyclic compounds containing in the condensed system at least one hetero ring having sulfur atoms as the only ring hetero atoms
- C07D495/02—Heterocyclic compounds containing in the condensed system at least one hetero ring having sulfur atoms as the only ring hetero atoms in which the condensed system contains two hetero rings
- C07D495/04—Ortho-condensed systems
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K5/00—Peptides containing up to four amino acids in a fully defined sequence; Derivatives thereof
- C07K5/02—Peptides containing up to four amino acids in a fully defined sequence; Derivatives thereof containing at least one abnormal peptide link
- C07K5/0207—Peptides containing up to four amino acids in a fully defined sequence; Derivatives thereof containing at least one abnormal peptide link containing the structure -NH-(X)4-C(=0), e.g. 'isosters', replacing two amino acids
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K5/00—Peptides containing up to four amino acids in a fully defined sequence; Derivatives thereof
- C07K5/04—Peptides containing up to four amino acids in a fully defined sequence; Derivatives thereof containing only normal peptide links
- C07K5/06—Dipeptides
-
- C—CHEMISTRY; METALLURGY
- C09—DYES; PAINTS; POLISHES; NATURAL RESINS; ADHESIVES; COMPOSITIONS NOT OTHERWISE PROVIDED FOR; APPLICATIONS OF MATERIALS NOT OTHERWISE PROVIDED FOR
- C09K—MATERIALS FOR MISCELLANEOUS APPLICATIONS, NOT PROVIDED FOR ELSEWHERE
- C09K11/00—Luminescent, e.g. electroluminescent, chemiluminescent materials
- C09K11/02—Use of particular materials as binders, particle coatings or suspension media therefor
- C09K11/025—Use of particular materials as binders, particle coatings or suspension media therefor non-luminescent particle coatings or suspension media
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Medicinal Chemistry (AREA)
- Veterinary Medicine (AREA)
- Public Health (AREA)
- Animal Behavior & Ethology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Molecular Biology (AREA)
- Epidemiology (AREA)
- Immunology (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Genetics & Genomics (AREA)
- Pharmacology & Pharmacy (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Crystallography & Structural Chemistry (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Biomedical Technology (AREA)
- Materials Engineering (AREA)
- Peptides Or Proteins (AREA)
Abstract
The present invention provides a linker for protein labeling and its application in biological medicine, and in particular relates to a photoinduced o-nitrobenzyl alcohol linker shown in general formula (I) and its application. The photoinduced o-nitrobenzyl alcohol linker developed by the invention can specifically label lysine side chains of proteins and thereby specifically couple affinity labels, fluorescent substances, active drugs and the like to biological macromolecules such as proteins and antibodies, providing a direct route for the development of novel protein drugs, antibody-drug conjugate (ADC) drugs, biological diagnostic reagents, imaging probes and the like.
Description
Technical Field
The invention belongs to the technical field of biopharmaceuticals and biotechnology, and mainly relates to a novel photoinduced o-nitrobenzyl alcohol compound that selectively labels the free amino group of lysine in polypeptides and proteins. The compound is used to specifically couple affinity labels, fluorescent substances, active drugs and the like to polypeptides and proteins in order to develop targeted tracing diagnostic reagents, tumor-targeted therapeutic drugs and the like.
Background
Proteins are an important class of biological macromolecules and are the material basis of life. The specific spatial structure of a protein determines its function. Chemoselective labeling and modification of a protein through reactions of its functional groups, while maintaining its integrity and function, yields novel bioconjugates and is particularly important in chemical biology and biomedical research. A growing number of reports describe not only methods that modify translated proteins with a series of enzymes but also labeling methods based on conjugation of small molecules; among these, selective labeling of natural amino acid residues in proteins is a very cost-effective strategy. Chemical modification directly converts the chemical structure of a particular residue in a protein. Because proteins contain various reactive groups, conjugation can occur at specific sites, especially at the natural amino acids cysteine and lysine, both of which contain nucleophilic functional groups. The free amino group of lysine is one of the highly nucleophilic groups in protein molecules, and its chemical modification is of great importance in protein sequence analysis and has received wide attention. When the free amino group of lysine is used as the binding site, activated carboxyl functional groups, in particular N-hydroxysuccinimide esters, are generally employed. However, these methods suffer from insufficient selectivity, low coupling efficiency and other problems, so new methods for selectively labeling and modifying protein lysine residues are still needed for biological applications.
Antibody-drug conjugates (ADCs) are an emerging class of biological therapy that combines tissue-specific antibodies with a range of linker designs to transport cytotoxic drugs and release them selectively in the vicinity of tumors. An ADC is a chemically complex conjugate consisting of three parts: an antibody, a linker and a cytotoxin. ADC drug design integrates antibody selection, conjugation strategy, linker design and payload potency. The choice of antibody is governed by the target antigen, and the linker is chosen according to the type of conjugation employed and the desired mechanism of action of the payload. Currently, four ADC drugs are approved by the FDA for marketing: Adcetris (2011), Kadcyla (2013), Mylotarg (2017) and Besponsa (2017), and more than 60 ADC drugs are in clinical studies. The antibody modification sites commonly used for ADC drugs in clinical studies are of two main types: lysine residues and cysteine residues. When cysteine residues are selected as coupling sites, the antibody has only 4 pairs of interchain disulfide bonds, so the DAR (drug-to-antibody ratio) values of ADCs formed at the cysteine residues generated by reducing these bonds are theoretically uniform. In practice, however, the existing reducing agents (DTT and TCEP) have poor selectivity for the interchain disulfide bonds, so the final products are poorly uniform, and breaking the disulfide bonds of the antibody affects its stability. When lysine residues are selected as coupling sites, the chemical selectivity is very poor because an antibody contains more than 40 lysine residues; the conjugation sites and the number of drugs attached are complex, and a very wide drug loading distribution is obtained. Nevertheless, three of the four ADCs on the market today have successfully adopted a lysine coupling strategy.
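The breadth of the drug loading distribution in non-selective lysine coupling can be illustrated with a rough numerical sketch. The model below is an assumption made only for illustration and is not part of the invention: it treats each of 40 lysine residues as equally and independently reactive (real lysines differ in accessibility), uses a hypothetical average loading of 3.5 drugs per antibody, and computes the resulting binomial distribution together with the number of distinct site combinations for each loading.

```python
from math import comb


def loading_distribution(n_sites: int, mean_dar: float) -> list[float]:
    """Return P(k drugs attached) for k = 0..n_sites under a binomial model.

    Assumes every site reacts independently with the same probability,
    chosen so that the expected number of attached drugs equals mean_dar.
    """
    p = mean_dar / n_sites
    return [
        comb(n_sites, k) * p**k * (1 - p) ** (n_sites - k)
        for k in range(n_sites + 1)
    ]


def site_combinations(n_sites: int, k: int) -> int:
    """Number of distinct ways to place k drugs on n_sites residues."""
    return comb(n_sites, k)


N_LYSINES = 40  # the text says "more than 40"; 40 is used as a lower bound
MEAN_DAR = 3.5  # hypothetical average loading, chosen only for illustration

dist = loading_distribution(N_LYSINES, MEAN_DAR)
for k in range(9):
    print(
        f"DAR {k}: probability {dist[k]:.3f}, "
        f"{site_combinations(N_LYSINES, k)} possible site combinations"
    )
```

Under these assumptions no single loading dominates, and even a fixed loading of four drugs corresponds to tens of thousands of positional isomers, which is the heterogeneity that a site-selective labeling method aims to reduce.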
Therefore, the development of a novel method for selectively marking and modifying the lysine residues of the protein not only has a very positive effect on the research of the protein per se, but also has unique advantages when the method is used for the site-specific synthesis of ADC drugs. So far, no chemical reaction functional group with photoinduced activity is introduced into the selective labeling modification of protein lysine residues. The light-induced chemically reactive functional groups are inert reactive functional groups in the absence of light, and upon irradiation with light of a specific wavelength, such reactive functional groups produce highly reactive intermediates that undergo a chemical reaction in which they form irreversible covalent bonds with the site of action on the protein to which they are targeted. The photo-crosslinking reaction has the advantages of high speed, simple conditions, suitability for in-situ reaction and the like. Therefore, it is necessary to develop a new method for selectively labeling protein lysine residues with high efficiency, reliability and good selectivity.
Disclosure of Invention
The inventors have designed a photoinducible reactive functional group that is structurally simple and easy to synthesize and that is based mainly on the o-nitrobenzyl alcohol structure. Under light induction, this functional group reacts mainly with amino groups, and in proteins mainly with the side-chain amino group of lysine, to form an indazolone and thereby a structurally stable covalent linkage. The reaction with the polypeptide or protein is as follows:
The o-nitrobenzyl alcohol linker reacts with lysine on proteins under very mild conditions, with a very high reaction rate and high reaction efficiency (Chem, 2019, 5, 2955-2968; RSC Adv., 2019, 9, 13249-13253). Therefore, the photoinduced o-nitrobenzyl alcohol linker developed by the invention can specifically label lysine residues of the corresponding protein and then specifically couple affinity labels, fluorescent substances, active drugs and the like to polypeptides and proteins, thereby providing a direct route for the design and development of novel ADC drugs, biological diagnostic reagents, imaging probes and the like.
An object of the present invention is to provide a compound represented by the general formula (I), a tautomer, an enantiomer, a diastereomer, a racemate, an isotopic compound thereof, and various forms of salts or hydrates thereof.
Another object of the present invention is to provide methods for selectively labeling lysine in polypeptides and proteins using the compounds.
The invention also aims to provide application of the compounds in preparing antibody-fluorescent tracer substance conjugates and antibody-drug conjugates.
Another object of the present invention is to provide a novel method for developing antibody-drug conjugates from such compounds.
The invention provides a compound shown as the following general formula (I), or a tautomer, enantiomer, diastereomer, racemate, isotopic compound, various forms of salts or hydrates thereof.
Wherein Y is selected from: -CO-, -NH-CH2-, -O-CO-CH2-, -NH-COO-CH2-, -NH-CO-NH-CH2-, -COOCH2-, -CO-NH-, -O-CH2-, -CH2-, -COO-, -OCO-, -O-, -S-, -SO2-, -C≡C-, -C=C-, -SO2NH-, -NHCONH-, -NHCSNH-, -NH-, -CONH-CH2-, or is absent, wherein one end of Y may be attached to the 3, 4, 5 or 6 position of the phenyl ring in the o-nitrobenzyl alcohol structure;
wherein R1 is one or more substituents at any of the 3, 4, 5 or 6 positions of the o-nitrobenzyl alcohol structure other than the position substituted by Y, and when there are multiple R1 substituents they may be the same or different; each R1 is independently selected from hydrogen, deuterium, amino, halogen, nitro, cyano, C1-6 alkyl, C3-10 cycloalkyl, C1-5 alkoxy, C1-6 alkylamino or aminoalkyl, C1-C6 alkylcarbonyl, C2-C6 alkoxycarbonyl, C2-C6 alkylaminocarbonyl, C5-8 heterocyclyl, C6-10 aryl, C5-6 heteroaryl, wherein n0 and n1 are 1, 2, 3, 4 or 5, and wherein said alkyl, cycloalkyl, alkoxy, alkylamino or aminoalkyl, alkylcarbonyl, alkoxycarbonyl, alkylaminocarbonyl, heterocyclyl, aryl and heteroaryl are optionally further substituted by one or more groups selected from halogen, hydroxy, amino, C1-C6 alkoxy, cyano and nitro; in particular, each R1 is independently hydrogen;
R is selected from hydrogen, deuterium, halogen, nitro, cyano, hydroxyl, alkyl hydroxyl, aryl hydroxyl, alkyl amino, aryl amino, mercapto, alkyl mercapto, aryl mercapto, carboxylic acid, alkyl carboxylic acid, aryl carboxylic acid, alkynyl, alkyl alkynyl, aryl alkynyl, azide, alkyl azide, aryl azide, carbonyl, alkyl carbonyl, aryl carbonyl, aldehyde group, alkyl aldehyde group, aryl aldehyde group, alkyl, cycloalkyl, alkoxy, heterocyclyl, aryl, heteroaryl or any combination thereof, wherein the above groups are optionally further substituted by one or more groups selected from halogen, hydroxyl, amino, C1-C6 alkoxy, cyano and nitro.
Preferably, in the compound represented by the general formula (I),
Y is selected from: -CO-, -NH-CH2-, -O-CO-CH2-, -NH-COO-CH2-, -NH-CO-NH-CH2-, -COOCH2-, -CO-NH-, -O-CH2-, -CH2-, -COO-, -OCO-, -O-, -S-, -SO2-, -C≡C-, -C=C-, -SO2NH-, -NHCONH-, -NHCSNH-, -NH-, -CONH-CH2-, or is absent, wherein one end of Y may be attached to the 3, 4, 5 or 6 position of the phenyl ring in the o-nitrobenzyl alcohol structure;
wherein R1 is one or more substituents at any of the 3, 4, 5 or 6 positions of the o-nitrobenzyl alcohol structure other than the position substituted by Y, and when there are multiple R1 substituents they may be the same or different; each R1 is independently selected from hydrogen, deuterium, amino, halogen, nitro, cyano, C1-6 alkyl, C3-10 cycloalkyl, C1-5 alkoxy, C1-6 alkylamino or aminoalkyl, C1-C6 alkylcarbonyl, C2-C6 alkoxycarbonyl, C2-C6 alkylaminocarbonyl, C5-8 heterocyclyl, C6-10 aryl, C5-6 heteroaryl, wherein n0 and n1 are 1, 2, 3, 4 or 5, and wherein said alkyl, cycloalkyl, alkoxy, alkylamino or aminoalkyl, alkylcarbonyl, alkoxycarbonyl, alkylaminocarbonyl, heterocyclyl, aryl and heteroaryl are optionally further substituted by one or more groups selected from halogen, hydroxy, amino, C1-C6 alkoxy, cyano and nitro; in particular, each R1 is independently hydrogen;
R is selected from hydrogen, deuterium, halogen, nitro, cyano, hydroxyl, alkyl hydroxyl, aryl hydroxyl, alkyl amino, aryl amino, mercapto, alkyl mercapto, aryl mercapto, carboxylic acid, alkyl carboxylic acid, aryl carboxylic acid, alkynyl, alkyl alkynyl, aryl alkynyl, azide, alkyl azide, aryl azide, carbonyl, alkyl carbonyl, aryl carbonyl, aldehyde group, alkyl aldehyde group, aryl aldehyde group, alkyl, cycloalkyl, alkoxy, heterocyclyl, aryl, heteroaryl or any combination thereof, wherein the above groups are optionally further substituted by one or more groups selected from halogen, hydroxyl, amino, C1-C6 alkoxy, cyano and nitro.
More preferably, in the compound represented by the general formula (I),
Y is selected from: -CO-, -NH-CH2-, -O-CO-CH2-, -NH-COO-CH2-, -NH-CO-NH-CH2-, -COOCH2-, -CO-NH-, -O-CH2-, -CH2-, -COO-, -OCO-, -O-, -S-, -SO2-, -C≡C-, -C=C-, -SO2NH-, -NHCONH-, -NHCSNH-, -NH-, -CONH-CH2-, or is absent, wherein one end of Y may be attached to the 3, 4, 5 or 6 position of the phenyl ring in the o-nitrobenzyl alcohol structure;
wherein R is selected from hydrogen, deuterium, halogen, nitro, cyano, hydroxyl, alkyl hydroxyl, aryl hydroxyl, alkyl amino, aryl amino, mercapto, alkyl mercapto, aryl mercapto, carboxylic acid, alkyl carboxylic acid, aryl carboxylic acid, alkynyl, alkyl alkynyl, aryl alkynyl, azide, alkyl azide, aryl azide, carbonyl, alkyl carbonyl, aryl carbonyl, aldehyde group, alkyl aldehyde group, aryl aldehyde group, alkyl, cycloalkyl, alkoxy, heterocyclyl, aryl, heteroaryl or any combination thereof, wherein the above groups are optionally further substituted by one or more groups selected from halogen, hydroxy, amino, C1-C6 alkoxy, cyano and nitro;
wherein R1 is one or more substituents at any of the 3, 4, 5 or 6 positions of the o-nitrobenzyl alcohol structure other than the position substituted by Y, and when there are multiple R1 substituents they may be the same or different; each R1 is independently selected from hydrogen, deuterium, amino, halogen, C1-3 alkoxy, nitro, wherein n0 and n1 are 1, 2, 3, 4 or 5; in particular, each R1 is independently hydrogen.
Further preferably, the compound of formula (I) is selected from the following formulae:
wherein R and R1 are as defined above.
The term "halogen" refers to fluorine, chlorine, bromine or iodine.
The term "hydrocarbyl" refers to a substituent containing only carbon and hydrogen atoms, including, without limitation, methyl, ethyl, isopropyl, propyl, cyclohexyl, phenyl, and the like.
The term "C1-C6 alkyl" refers to a straight or branched chain saturated alkyl group having 1 to 6 carbon atoms in the chain, including without limitation methyl, ethyl, propyl, isopropyl, butyl, isobutyl, sec-butyl, tert-butyl, and the like.
The term "cycloalkyl" refers to a saturated cyclic alkyl group consisting of carbon atoms, including without limitation cyclobutyl, cyclopentyl, cyclohexyl, and the like.
The term "C3-C10 cycloalkyl" refers to a saturated mono-or poly-cycloalkyl group containing 3 to 10 carbon atoms, including, without limitation, cyclopropyl, cyclobutyl, cyclopentyl, or cyclohexyl.
The term "C6-C10 aryl" refers to aromatic ring groups containing 6-10 ring atoms, but no heteroatoms in the ring atoms, such as phenyl, naphthyl.
The term "5-8 membered heterocyclyl" means a ring containing one or more saturated and/or partially saturated rings, including 5 to 8 ring atoms, wherein one or more ring atoms are selected from heteroatoms of nitrogen, oxygen or sulfur, the remaining ring atoms being carbon; for example, propylene oxide, tetrahydrofuranyl, pyrrolidinyl, tetrahydropyranyl, piperidinyl, piperazinyl, morpholinyl, thiomorpholinyl.
The term "5-6 membered heteroaryl" refers to a monovalent aromatic ring group containing 5-6 ring atoms and 1-4 heteroatoms in the ring atoms as ring members. The heteroatoms may be selected from nitrogen, oxygen or sulfur.
The term "tautomer" refers to structural isomers that readily interconvert through a chemical reaction, which generally involves the formal migration of a hydrogen atom or proton accompanied by the exchange of a single bond and an adjacent double bond.
The term "enantiomer" refers to stereoisomers that are mirror images of each other and are not superimposable.
"Diastereomer" refers to a stereoisomer that has two or more chiral centers and is not a mirror image of another stereoisomer.
"Racemate" refers to a mixture of two stereoisomers that are mirror images of each other and whose opposite optical rotations cancel each other out.
The salt is a salt formed by the molecule with a corresponding organic acid, inorganic acid, organic base or inorganic base, for example the hydrochloride, formate, trifluoroacetate, succinate or methanesulfonate of the compound.
"hydrate" refers to a compound containing water.
The invention further provides the use of the compound shown in the general formula (I) according to the invention, the tautomer, the enantiomer, the diastereomer, the racemate, the isotopic compound, and various forms of salts or hydrates thereof for preparing the marker for selectively marking the free amino group of the lysine on the side chain of the polypeptide or protein. The o-nitrobenzyl alcohol compound shown in the general formula (I) can selectively mark the free amino of the lysine side chain in polypeptide and protein mainly under the light induction condition.
The compound shown in the general formula (I) or tautomer, enantiomer, diastereomer, racemate, precursor compound, isotope compound, salt or hydrate thereof in various forms can form a connecting structure through amino of lysine side chains of polypeptide and protein, and can be used for preparing antibody-drug conjugates, antibody-affinity labels and antibody-fluorescent substances.
Accordingly, in a further aspect, the present invention provides the use of the compounds represented by the general formula (I) of the present invention, tautomers, enantiomers, diastereomers, racemates, isotopic compounds, and salts in various forms or hydrates thereof for the preparation of antibody-drug conjugates, antibody-affinity labels and antibody-fluorescent substances.
The conjugate has a characteristic structural formula (II):
wherein Y and R1 are as defined in the general formula (I) above;
A is a polypeptide or protein;
Z is L-X, wherein X comprises one or none of an affinity label, a tracer fluorescent substance and an active drug, or derivatives thereof; in particular, the affinity labels are, for example, biotin and folate, and the tracer fluorescent substances include, but are not limited to, rhodamine, fluorescein, pigment and coumarin. The active drugs include, but are not limited to, maytansinoids, auristatins, calicheamicins, adriamycins (doxorubicins), pyrrolobenzodiazepine dimers (PBDs), triptolide, colchicine, combretastatin, homoharringtonine, camptothecins, taxanes (paclitaxel), and all agents useful for antibody-drug conjugates;
L can be: C1-C9 alkyl, C2-C9 alkenyl, C2-C9 alkynyl, aryl, heteroaryl, C3-C9 cycloalkyl, C3-C9 heterocyclyl, -NR1-, -O-, -S-, -CO-, -OCO-, -COO-, -NHCO-, -CONR1-, -C=NR1-, -C=S-O-, -C=S-NR1-, -CS2-, -NR1CO-, -NR1CSNR2-, -OCONR1-, -OSO-, Val-Val-PAB, Val-Cit-PAB, Val-Ala-PAB, Val-Lys(Ac)-PAB, Phe-Lys(Ac)-PAB, D-Val-Leu-Lys, Gly-Gly-Arg, Ala-Ala-Asn-PAB, Ala-PAB, PAB and any combination thereof, wherein R1 and R2 are independently selected from H, C1-C6 alkyl, C2-C6 alkenyl, C2-C6 alkynyl, aryl, heteroaryl, C3-C9 cycloalkyl and C3-C9 heterocyclyl, and n3 is 0-23; or L is absent.
Preferably, in the antibody-drug conjugate represented by the general formula (II), L is selected from the following structures and any combination thereof:
wherein n4 is an integer of 0 to 23.
In particular, A above refers to a unit that can bind, reactively associate with, or complex a receptor or antigen, including but not limited to a chimeric antibody, a humanized antibody, a human antibody, or an antibody fragment.
In particular, the antibody-drug conjugate is selected from the following structures:
wherein Y, L and R1 are as defined above,
refers to an Antibody (Antibody) which refers to a unit capable of binding, reactively associating or complexing a receptor or antigen, such as a chimeric Antibody, a humanized Antibody, a human Antibody or an Antibody fragment.
The compound represented by the general formula (I) according to the present invention or the above-mentioned antibody-drug conjugate, antibody-affinity label and antibody-fluorescent substance can be prepared by or by referring to the methods of the following examples or methods similar thereto.
Drawings
FIG. 1 shows the selective reaction of compound 1G with -NH2 on the polypeptide.
FIG. 2 shows the molecular weight spectrum of the polypeptide product labeled with compound 1G.
FIG. 3 shows the selective reaction of compound 1F with -NH2 on the polypeptide.
FIG. 4 shows the molecular weight spectrum of the polypeptide product labeled with compound 1F.
FIG. 5 shows ESI-TOF spectra of unlabeled and 1F-labeled polypeptide aptamers.
Fig. 6 shows ESI-TOF spectra of unlabeled nanobody-anti-human epidermal growth factor antibody and 1F-labeled nanobody-anti-human epidermal growth factor antibody.
FIG. 7 shows ESI-TOF spectra of unlabeled ubiquitin and 1F-labeled modified ubiquitin.
FIG. 8a shows ESI-TOF spectra of unlabeled chymotrypsinogen and chymotrypsinogen labeled with compound 1F.
FIG. 8b shows the ESI-MS/MS spectrum of the digested peptide fragment after chymotrypsinogen was labeled with compound 1F.
FIG. 8c shows a fluorescence image of the SDS-PAGE gel after coupling of the labeled protein with TAMRA-N3.
FIG. 9a shows ESI-TOF spectra of unlabeled lysozyme and lysozyme labeled with compound 1F.
FIG. 9b shows the ESI-MS/MS spectrum of the digested peptide fragment after lysozyme was labeled with compound 1F.
FIG. 9c shows a fluorescence image of the SDS-PAGE gel after coupling of the labeled protein with TAMRA-N3.
FIG. 10a shows ESI-TOF spectra of unlabeled myoglobin and myoglobin labeled with compound 1F.
FIG. 10b shows the ESI-MS/MS spectrum of the digested peptide fragment after myoglobin was labeled with compound 1F.
FIG. 10c shows a fluorescence image of the SDS-PAGE gel after coupling of the labeled protein with TAMRA-N3.
FIG. 11 shows the selective labeling of amino groups in the nanobody-anti-human epidermal growth factor receptor antibody by compound 1F, and the preparation of antibody-fluorescent conjugates.
FIG. 12a shows ESI-TOF spectra of the unlabeled nanobody-anti-human epidermal growth factor receptor antibody and the antibody labeled with compound 1F.
FIG. 12b shows the ESI-MS/MS spectrum of the digested peptide fragment after the nanobody-anti-human epidermal growth factor receptor antibody was labeled with compound 1F.
FIG. 12c shows a fluorescence image of the SDS-PAGE gel after coupling of the labeled antibody with TAMRA-N3.
Detailed Description
In all examples, 1H NMR spectra were recorded on a Bruker Avance III-300 or Avance III-400 nuclear magnetic resonance spectrometer, with chemical shifts expressed in δ (ppm); mass spectra were determined by UPLC-MS (ESI), wherein the UPLC model was Waters HPLC H-CLASS and the MS (ESI) model was Waters SQ Detector 2; anhydrous tetrahydrofuran was dried and deoxygenated by refluxing over benzophenone/sodium metal, and anhydrous toluene and anhydrous dichloromethane were dried by refluxing over calcium chloride; solvents for the column chromatography mobile phase, such as petroleum ether, ethyl acetate and dichloromethane, were purchased from Sinopharm Chemical Reagent Co., Ltd.; the thin layer chromatography silica gel plates (HSGF254) used for reaction monitoring were from Sinopharm Chemical Reagent Co., Ltd.; compounds were separated on 200-300 mesh silica gel from Sinopharm Chemical Reagent Co., Ltd. The starting materials of the present invention can be obtained commercially (the main reagents were purchased from Sinopharm Chemical Reagent Co., Ltd.), prepared by methods known in the art, or prepared according to the methods described in the present invention.
Example 1: N-(3-azidopropyl)-4-(hydroxymethyl)-3-nitrobenzamide
Step 1-1: Compound S1 (4-(bromomethyl)-3-nitrobenzoic acid) (5.00 g, 19.2 mmol) was dissolved in a mixed solution of acetone/H2O (1:1, 150 mL), Na2CO3 (7.13 g, 67.3 mmol) was added, and the mixture was refluxed for 3 hours. Acetone was removed in vacuo and the aqueous phase was extracted twice with Et2O. Concentrated HCl was added to the aqueous phase until the pH was 2-3, and the mixture was extracted 3 times with ethyl acetate. The organic layer was washed with water and brine, dried over magnesium sulfate, and concentrated in vacuo to give compound 1A as a brown oil (3.78 g, 98%), which was used without further purification. 1H NMR (500 MHz, MeOD) δ 8.60 (s, 1H), 8.29 (d, J = 8.0 Hz, 1H), 7.97 (d, J = 8.1 Hz, 1H), 5.00 (s, 2H).
Step 1-2: 1A (1.0 eq), HATU (1.2 eq) and propylamine (1B, 3.0 eq), propargylamine (1C, 3.0 eq) or 3-azidopropan-1-amine (1D, 1.0 eq) were dissolved in anhydrous DMF, DIPEA (3.0 eq) was added dropwise at 0 °C, and the mixture was stirred at room temperature overnight. H2O was added to the mixture, which was extracted 3 times with ethyl acetate. The organic layer was washed with saturated NaHCO3, 0.1 M HCl, H2O and brine, dried over Na2SO4 and concentrated in vacuo, and the residue was purified by silica gel chromatography to give the products 1E to 1G.
Compound 1E, yield: 64%. 1H NMR (400 MHz, MeOD) δ 8.50 (d, J = 1.7 Hz, 1H), 8.14 (dd, J = 8.1, 1.7 Hz, 1H), 7.96 (d, J = 8.2 Hz, 1H), 4.99 (s, 2H), 3.39–3.33 (m, 2H), 1.66 (dq, J = 14.7, 7.4 Hz, 2H), 1.02–0.94 (m, 3H). 13C NMR (101 MHz, MeOD) δ 167.5, 148.4, 142.5, 135.6, 132.9, 129.7, 124.5, 61.8, 42.9, 23.6, 11.8. HRMS (ESI-Q-TOF): m/z [M+H]+ calcd for C11H15N2O4+: 239.1026; found: 239.1040.
Compound 1F, yield: 58%. 1H NMR (400 MHz, MeOD) δ 8.51 (d, J = 1.7 Hz, 1H), 8.15 (dd, J = 8.2, 1.8 Hz, 1H), 7.97 (d, J = 8.2 Hz, 1H), 4.99 (s, 2H), 4.18 (d, J = 2.5 Hz, 2H), 2.64–2.62 (m, 1H). 13C NMR (101 MHz, MeOD) δ 167.1, 148.3, 142.8, 134.9, 133.0, 129.8, 124.6, 80.4, 72.3, 61.8, 30.1. HRMS (ESI-Q-TOF): m/z [M-H]- calcd for C11H9N2O4-: 233.0568; found: 233.0566.
Compound 1G, yield: 51%. 1H NMR (500 MHz, MeOD) δ 8.52 (d, J = 1.8 Hz, 1H), 8.15 (dd, J = 8.1, 1.8 Hz, 1H), 7.98 (d, J = 8.1 Hz, 1H), 4.99 (s, 2H), 3.49 (t, J = 6.9 Hz, 2H), 3.43 (t, J = 6.7 Hz, 2H), 1.90 (p, J = 6.8 Hz, 2H). 13C NMR (126 MHz, MeOD) δ 167.7, 148.5, 142.6, 135.5, 133.0, 129.8, 124.5, 61.8, 50.2, 38.6, 29.7. HRMS (ESI-Q-TOF): m/z [M+Na]+ calcd for C11H13N5NaO4+: 302.0860; found: 302.0853.
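The calculated HRMS values reported for compounds 1E to 1G can be cross-checked from the ion formulae. The following is a minimal sketch, not part of the original procedure: the helper names `ion_mz` and `ppm_error` are hypothetical, the monoisotopic masses are standard reference values, and the ion formula is assumed to already include the adduct atom (H or Na) or the lost proton.

```python
import re

# Monoisotopic masses (u) of the most abundant isotopes (standard reference values).
MONOISOTOPIC = {
    "C": 12.0, "H": 1.00782503, "N": 14.0030740,
    "O": 15.9949146, "Na": 22.9897693, "S": 31.9720707,
}
ELECTRON_MASS = 0.00054858

def ion_mz(ion_formula: str, charge: int) -> float:
    """Return m/z for an ion formula such as 'C11H15N2O4' with charge +1 or -1."""
    mass = 0.0
    for element, count in re.findall(r"([A-Z][a-z]?)(\d*)", ion_formula):
        mass += MONOISOTOPIC[element] * (int(count) if count else 1)
    # A cation has lost electrons, an anion has gained them.
    mass -= charge * ELECTRON_MASS
    return mass / abs(charge)

def ppm_error(calcd: float, found: float) -> float:
    """Relative deviation of the found m/z from the calculated m/z, in ppm."""
    return (found - calcd) / calcd * 1e6

# Ion formulae, charges and found values as reported for compounds 1E, 1F and 1G.
reported = {
    "1E": ("C11H15N2O4", +1, 239.1040),
    "1F": ("C11H9N2O4", -1, 233.0566),
    "1G": ("C11H13N5NaO4", +1, 302.0853),
}
for name, (formula, charge, found) in reported.items():
    calcd = ion_mz(formula, charge)
    print(f"{name}: calcd {calcd:.4f}, found {found:.4f}, "
          f"error {ppm_error(calcd, found):+.1f} ppm")
```

Under these assumptions, the computed values agree with the calculated m/z values given above to four decimal places, and the found values deviate by only a few ppm.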
Example 2: 4-(hydroxymethyl)-3-nitro-N-(2-(5-((3aS,4S,6aR)-2-oxohexahydro-1H-thieno[3,4-d]imidazol-4-yl)pentanamido)ethyl)benzamide
Step 2-1: A solution of D-biotin (3.00 g, 12.3 mmol), EDCI·HCl (2.82 g, 14.7 mmol) and NHS (1.70 g, 14.7 mmol) in DMF (100 mL) was stirred at room temperature overnight. The reaction mixture was concentrated in vacuo and the residue was filtered, washed with EtOH/AcOH/H2O (95:1:4) and dried in vacuo. The product, compound 2A, was obtained as a yellow oil without further purification (3.74 g, 89%). 1H NMR (500 MHz, DMSO) δ 6.45 (s, 1H), 6.38 (s, 1H), 4.30 (dd, J = 7.5, 5.2 Hz, 1H), 4.16–4.12 (m, 1H), 3.13–3.07 (m, 1H), 2.84 (d, J = 5.1 Hz, 1H), 2.81 (s, 4H), 2.67 (t, J = 7.4 Hz, 2H), 2.58 (d, J = 13.3 Hz, 1H), 1.74–1.57 (m, 6H).
Step 2-2: To a stirred solution of ethylenediamine (3.91 mL, 58.6 mmol) in anhydrous DMF (20 mL) was added dropwise a solution of compound 2A (1.00 g, 2.93 mmol) in anhydrous DMF (10 mL), and the mixture was stirred overnight. Et2O was added to the mixture, and the precipitated product was filtered, washed with ethyl acetate and dried in vacuo, and was used without further purification (753 mg, 90%). The product from the previous step (400 mg, 1.40 mmol), compound 1A (276 mg, 1.40 mmol) and HATU (637 mg, 1.68 mmol) were dissolved in anhydrous DMF (15 mL), DIPEA (0.69 mL, 4.19 mmol) was added dropwise at 0 °C, and the mixture was stirred at room temperature overnight. H2O was added to the mixture, which was extracted 3 times with ethyl acetate. The organic layer was washed with saturated NaHCO3, 1 M HCl, H2O and brine, dried over Na2SO4 and concentrated in vacuo, and the residue was purified by silica gel chromatography to give compound 2B as a yellow oil (331 mg, 51%). 1H NMR (500 MHz, MeOD) δ 8.52 (d, J = 1.7 Hz, 1H), 8.15 (dd, J = 8.1, 1.7 Hz, 1H), 7.99 (d, J = 8.2 Hz, 1H), 5.00 (s, 2H), 4.46 (dd, J = 7.9, 4.9 Hz, 1H), 4.24 (dd, J = 7.9, 4.5 Hz, 1H), 3.53 (t, J = 5.9 Hz, 2H), 3.44 (t, J = 5.8 Hz, 2H), 3.14–3.09 (m, 1H), 2.89 (dd, J = 12.8, 5.0 Hz, 1H), 2.68 (d, J = 12.7 Hz, 1H), 2.21 (t, J = 7.4 Hz, 2H), 1.66 (m, 4H), 1.59–1.49 (m, 2H). 13C NMR (126 MHz, MeOD) δ 176.7, 167.8, 166.1, 148.5, 142.6, 135.5, 133.0, 129.9, 124.6, 63.3, 61.8, 61.6, 56.9, 41.1, 41.0, 39.9, 36.8, 29.7, 29.4, 26.8. HRMS (ESI-Q-TOF): m/z [M+H]+ calcd for C20H28N5O6S+: 466.1755; found: 466.1744.
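The yield reported for compound 2B can be reproduced from the quantities given in Step 2-2. The sketch below is illustrative only: the helper names `molar_mass` and `percent_yield` are hypothetical, standard average atomic weights are used, and the neutral formula C20H27N5O6S is an assumption inferred from the reported [M+H]+ ion C20H28N5O6S+ by removing one hydrogen.

```python
import re

# Standard average atomic weights (g/mol).
ATOMIC_WEIGHT = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999, "S": 32.06}

def molar_mass(formula: str) -> float:
    """Average molar mass (g/mol) of a simple formula such as 'C20H27N5O6S'."""
    return sum(
        ATOMIC_WEIGHT[element] * (int(count) if count else 1)
        for element, count in re.findall(r"([A-Z][a-z]?)(\d*)", formula)
    )

def percent_yield(product_mg: float, limiting_mmol: float, formula: str) -> float:
    """Isolated yield in percent; mmol x g/mol gives the theoretical mass in mg."""
    theoretical_mg = limiting_mmol * molar_mass(formula)
    return 100.0 * product_mg / theoretical_mg

# Compound 2B: 331 mg isolated from 1.40 mmol of the limiting reagents.
yield_2b = percent_yield(331, 1.40, "C20H27N5O6S")
print(f"M(2B) = {molar_mass('C20H27N5O6S'):.1f} g/mol, yield = {yield_2b:.0f}%")
```

With these inputs the calculation gives a yield of about 51%, matching the value stated for compound 2B.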
Example 3: N-(6-(2,5-dioxo-2,5-dihydro-1H-pyrrol-1-yl)hexyl)-4-(hydroxymethyl)-3-nitrobenzamide
Step 3-1: Compound S2 (74 mg, 0.25 mmol) was added to a 2:1 mixture of CH2Cl2 and trifluoroacetic acid (1.5 mL) and left to react at room temperature for 1 h; the solvent was then removed in vacuo and the residue was used directly in the next step.
Step 3-2: Compound 1A (54 mg, 0.275 mmol), HATU (104 mg, 0.275 mmol) and the resulting oil 3A were dissolved in anhydrous DMF, and DIPEA (227 μL, 1.38 mmol) was added to the solution. The reaction was stirred at room temperature for 1 hour. The mixture was diluted with ethyl acetate, washed with water and brine, and dried over Na2SO4. After removal of the solvent in vacuo, the resulting residue was purified by flash chromatography to give compound 3B as a yellow oil (30 mg, 32%). 1H NMR (600 MHz, DMSO) δ 8.74 (t, J = 5.5 Hz, 1H), 8.49 (d, J = 1.7 Hz, 1H), 8.20 (dd, J = 8.1, 1.7 Hz, 1H), 7.92 (d, J = 8.1 Hz, 1H), 6.99 (s, 2H), 5.65 (t, J = 5.5 Hz, 1H), 4.87 (d, J = 5.5 Hz, 2H), 3.39 (t, J = 7.1 Hz, 2H), 3.25 (dd, J = 12.8, 6.9 Hz, 2H), 1.53–1.46 (m, 4H), 1.34–1.29 (m, 2H), 1.25–1.22 (m, 2H). 13C NMR (126 MHz, DMSO) δ 171.1, 163.8, 146.6, 141.1, 134.4, 134.0, 132.0, 128.4, 123.0, 59.9, 37.0, 28.8, 27.9, 25.9, 25.8. HRMS (ESI-Q-TOF): m/z [M+H]+ calcd for C18H22N3O6+: 376.1503; found: 376.1496.
Example 4
Selective labeling experiment of compound 1G on the -NH2 group of a polypeptide. FIG. 1 shows the selective reaction of compound 1G with -NH2 on the polypeptide. Compound 1G (1.25 mM) and the polypeptide (AcSRKYDH in FIG. 1) (0.5 mM) in 100 mM PBS/MeOH (9:1, pH 7.4) were irradiated with 365 nm UV light and shaken for 30 min at 25 °C; samples were collected, diluted with MeOH/H2O and analyzed by UPLC-MS. FIG. 2 shows the molecular weight spectrum of the polypeptide product labeled with compound 1G, indicating that the light-induced labeling of the polypeptide is very rapid and that the desired product is detected by UPLC-MS analysis of the reaction mixture.
Example 5
Selective labeling experiment of compound 1F on the -NH2 group of a polypeptide. FIG. 3 shows the selective reaction of compound 1F with -NH2 on the polypeptide. Compound 1F (1.25 mM) and the polypeptide (AcRCYMNK in FIG. 3) (0.5 mM) in 100 mM PBS/MeOH (9:1, pH 7.4) were irradiated with 365 nm UV light and shaken for 30 min at 25 °C; samples were collected, diluted with MeOH/H2O and analyzed by UPLC-MS. FIG. 4 shows the molecular weight spectrum of the polypeptide product labeled with compound 1F, indicating that the light-induced labeling of the polypeptide is very rapid and that the desired product is detected by UPLC-MS analysis of the reaction mixture.
Example 6
Labeling of Affinibody protein with compound 1F under light-induced conditions.
Compound 1F (2.5 mM) in methanol was exposed to 365 nm UV light for 7 min, and Affinibody (55 μM) was then added so that the final concentration of o-nitrobenzyl alcohol compound 1F in PBS was 125 μM. After mixing, the mixture was shaken for 1 h at 25 °C. Samples were collected, diluted with water and analyzed by ESI-TOF for protein molecular weight. FIG. 5 shows ESI-TOF spectra of unlabeled Affinibody and Affinibody labeled with compound 1F, indicating that the covalent modification was almost quantitative. The molecular weights of the unlabeled protein are 7594.61 and 7725.50; those of the singly labeled protein are 7792.74 and 7923.52; and those of the doubly labeled protein are 7990.46 and 8121.74.
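The number of labels on each species can be estimated from the mass differences above. The following is a rough sketch under a stated assumption that is not spelled out in the text: forming the indazolone from the o-nitrobenzyl alcohol and a lysine amine is taken to add the mass of compound 1F (C11H10N2O4, the neutral form of the reported [M-H]- ion) minus two molecules of water, about 198.2 Da per label. The function name `label_count` and the 1 Da tolerance are hypothetical choices.

```python
# Standard average atomic weights (g/mol).
C, H, N, O = 12.011, 1.008, 14.007, 15.999

MASS_1F = 11 * C + 10 * H + 2 * N + 4 * O   # compound 1F, C11H10N2O4
MASS_WATER = 2 * H + O
# Assumption: each labeling event adds compound 1F and releases two waters.
LABEL_SHIFT = MASS_1F - 2 * MASS_WATER       # about 198.2 Da

def label_count(unlabeled: float, observed: float, tolerance: float = 1.0) -> int:
    """Number of labels explaining the mass difference, or ValueError if none fits."""
    n = round((observed - unlabeled) / LABEL_SHIFT)
    residual = (observed - unlabeled) - n * LABEL_SHIFT
    if abs(residual) > tolerance:
        raise ValueError(f"mass difference not explained by {n} labels")
    return n

# Affinibody masses reported in Example 6 (two unlabeled species).
for base, peaks in {7594.61: [7792.74, 7990.46], 7725.50: [7923.52, 8121.74]}.items():
    for peak in peaks:
        print(f"{base} -> {peak}: {label_count(base, peak)} label(s)")
```

Under this assumption, the observed shifts of roughly 198 Da and 396 Da correspond to one and two labels, respectively, for both unlabeled species.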
Affinibody amino acid sequence (SEQ ID NO: 1):
MTSVDNKFNKELSVAGREIVTLPNLNDPQKKAFIFSLWDDPSQSANLLAEAKKLNDAQAPKGSHHHHHH
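Because the labeling targets lysine side chains, it can be useful to list the lysine positions in a sequence such as SEQ ID NO: 1. The snippet below is a simple illustration; the function name `lysine_positions` is hypothetical, and the count is only an upper bound on side-chain sites, since it says nothing about which residues are accessible or actually react.

```python
AFFINIBODY = (
    "MTSVDNKFNKELSVAGREIVTLPNLNDPQKKAFIFSLWDDPSQSANLLAEAKKLNDAQAPKGSHHHHHH"
)

def lysine_positions(sequence: str) -> list[int]:
    """Return the 1-based positions of lysine (K) residues in a protein sequence."""
    return [i for i, residue in enumerate(sequence, start=1) if residue == "K"]

positions = lysine_positions(AFFINIBODY)
print(f"{len(positions)} lysine residues at positions {positions}")
```

The same function can be applied to the other sequences listed in the examples to compare the number of candidate sites with the number of labels observed by ESI-TOF.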
Example 7
Labeling of Nanobody-EGFR with compound 1F under light-induced conditions.
Compound 1F (2.5 mM) in methanol was exposed to 365 nm UV light for 7 min, and Nanobody-EGFR (55 μM) was then added so that the final concentration of o-nitrobenzyl alcohol compound 1F in PBS was 250 μM. After mixing, the mixture was shaken for 1 h at 25 °C. Samples were collected, diluted with water and analyzed by ESI-TOF for protein molecular weight. FIG. 6 shows ESI-TOF spectra of unlabeled Nanobody-EGFR and Nanobody-EGFR labeled with compound 1F, indicating that the covalent modification was almost quantitative. The molecular weights of the protein before labeling are 14538.30 and 1455.12, and the molecular weights after labeling are 14736.39, 14753.30, 14934.22, 14951.20, 15149.95, 15132.70 and 15347.14, respectively.
Nanobody-EGFR amino acid sequence (SEQ ID NO: 2):
QVKLEESGGGSVQTGGSLRLTCAASGRTSRSYGMGWFRQAPGKEREFVSGISWRGDSTGYADSVKGRFTISRDNAKNTVDLQMNSLKPEDTAIYYCAAAAGSAWYGTLYEYDYWGQGTQVTVSSALEHHHHHH
Example 8
Labeling of ubiquitin protein with compound 1F under light-induced conditions.
Compound 1F (2.5 mM) in methanol was exposed to 365 nm UV light for 7 min, and ubiquitin (55 μM) was then added so that the final concentration of o-nitrobenzyl alcohol compound 1F in PBS was 250 μM. After mixing, the mixture was shaken for 1 h at 25 °C. Samples were collected, diluted with water and analyzed by ESI-TOF for protein molecular weight. FIG. 7 shows ESI-TOF spectra of unlabeled ubiquitin and ubiquitin labeled with compound 1F, indicating that the covalent modification was almost quantitative. The molecular weights before labeling are 10035.19 and 10166.39; the molecular weights after labeling are 10233.29, 10364.50, 10431.40, 10562.58, 10629.27, 10760.60, 10827.36 and 10958.34, respectively.
Ubiquitin amino acid sequence (SEQ ID NO: 3):
MTSMQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGLEHHHHHHHH
Example 9
Labeling of chymotrypsinogen A protein with compound 1F under light-induced conditions.
Compound 1F was prepared as a 2.5 mM solution in MeOH, and chymotrypsinogen A was prepared as a 27.5 μM solution in PBS. The solution of compound 1F (2.5 mM) in MeOH was irradiated with 365 nm UV light for 7 min, then added to the solution of chymotrypsinogen A (27.5 μM) in PBS to a final concentration of 125 μM and mixed. The mixture was shaken at 25 °C for 1 hour to effect labeling of the chymotrypsinogen A protein; samples were collected, diluted with water and analyzed by ESI-TOF. TAMRA-N3 was prepared as a 1.25 mM solution in DMSO; CuSO4 and THPTA were each prepared as 50 mM solutions in H2O and premixed at a volume ratio of 1:5. Sodium ascorbate was prepared as a 100 mM stock solution in H2O. 100 μL of protein labeled with o-nitrobenzyl alcohol compound 1F (25 μM or 50 μM), TAMRA-N3 (100 μM), the premixed CuSO4 (100 μM) and THPTA (500 μM), and sodium ascorbate (5 mM) were combined, and the mixture was reacted at 25 °C for 1 h and analyzed by SDS-PAGE. FIG. 8a shows ESI-TOF spectra of unlabeled chymotrypsinogen A and chymotrypsinogen A labeled with compound 1F, indicating that the covalent modification was almost quantitative. The molecular weight before labeling is 26655.97, and the molecular weights after labeling are 25854.10, 26052.25, 26250.34, 26488.34 and 26646.27. Trypsin digestion and subsequent ESI-MS/MS analysis of the fragments also confirmed lysine (K)-specific labeling (FIG. 8b), and the labeled protein could subsequently be conjugated with tetramethylrhodamine with good biocompatibility (FIG. 8c).
Chymotrypsinogen A amino acid sequence (SEQ ID NO: 4):
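The stock volumes for the TAMRA-N3 coupling step follow from C1·V1 = C2·V2. The sketch below is illustrative only: the total reaction volume is not stated in the text, so `TOTAL_UL` is a hypothetical value, and the listed concentrations are assumed to be final concentrations in the reaction mixture.

```python
def stock_volume_ul(final_um: float, stock_um: float, total_ul: float) -> float:
    """Volume of stock (uL) needed to reach final_um in total_ul (C1*V1 = C2*V2)."""
    return final_um * total_ul / stock_um

TOTAL_UL = 200.0  # hypothetical total reaction volume; not given in the text

# Stock and final concentrations in uM, taken from the procedure above.
stocks_um = {"TAMRA-N3": 1_250, "CuSO4": 50_000, "THPTA": 50_000,
             "sodium ascorbate": 100_000}
finals_um = {"TAMRA-N3": 100, "CuSO4": 100, "THPTA": 500,
             "sodium ascorbate": 5_000}

volumes = {
    name: stock_volume_ul(finals_um[name], stocks_um[name], TOTAL_UL)
    for name in stocks_um
}
for name, volume in volumes.items():
    print(f"{name}: {volume:.2f} uL")

# Equal stock concentrations mean the CuSO4:THPTA volume ratio equals the
# ratio of their final concentrations, which is the 1:5 premix described above.
print("THPTA/CuSO4 volume ratio:", volumes["THPTA"] / volumes["CuSO4"])
```

Whatever total volume is chosen, the CuSO4 and THPTA volumes stay in a 1:5 ratio, which is consistent with premixing the two 50 mM stocks at 1:5 by volume.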
CGVPAIQPVLSGLSRIVNGEEAVPGSWPWQVSLQDKTGFHFCGGSLINENWVVTAAHCGVTTSDVVVAGEFDQGSSSEKIQKLKIAKVFKNSKYNSLTINNDITLLKLSTAASFSQTVSAVCLPSASDDFAAGTTCVTTGWGLTRYTNANTPDRLQQASLPLLSNTNCKKYWGTKIKDAMICAGASGVSSCMGDSGGPLVCKKNGAWTLVGIVSWGSSTCSTSTPGVYARVTALVNWVQQTLAAN
Example 10
Labeling of lysozyme protein with compound 1F under light-induced conditions.
Compound 1F was prepared as a 2.5 mM solution in MeOH, and lysozyme was prepared as a 27.5 μM solution in PBS. The solution of compound 1F (2.5 mM) in MeOH was irradiated with 365 nm UV light for 7 min, then added to the solution of lysozyme (27.5 μM) in PBS to a final concentration of 125 μM and mixed. The mixture was shaken at 25 °C for 1 hour to effect labeling of the lysozyme protein; samples were collected, diluted with water and analyzed by ESI-TOF. TAMRA-N3 was prepared as a 1.25 mM solution in DMSO; CuSO4 and THPTA were each prepared as 50 mM solutions in H2O and premixed at a volume ratio of 1:5. Sodium ascorbate was prepared as a 100 mM stock solution in H2O. 100 μL of protein labeled with o-nitrobenzyl alcohol compound 1F (25 μM or 50 μM), TAMRA-N3 (100 μM), the premixed CuSO4 (100 μM) and THPTA (500 μM), and sodium ascorbate (5 mM) were combined, and the mixture was reacted at 25 °C for 1 h and analyzed by SDS-PAGE. FIG. 9a shows ESI-TOF spectra of unlabeled lysozyme and lysozyme labeled with compound 1F, indicating that the covalent modification was almost quantitative. The molecular weight before labeling is 14304.07, and the molecular weights after labeling are 14502.21, 14700.33 and 14898.26. Trypsin digestion and subsequent ESI-MS/MS analysis of the fragments also confirmed lysine-specific labeling (FIG. 9b). FIG. 9c is a fluorescence image of the SDS-PAGE gel after coupling of the labeled protein with TAMRA-N3, showing that the labeled protein can also be conjugated with tetramethylrhodamine with good biocompatibility.
Lysozyme amino acid sequence (SEQ ID NO: 7):
KVFGRCELAAAMKRHGLDNYRGYSLGNWVCAAKFESNFNTQATNRNTDGSTDYGILQINSRWWCNDGRTPGSRNLCNIPCSALLSSDITASVNCAKKIVSDGNGMNAWVAWRNRCKGTDVQAWIRGCRL
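The ESI-TOF masses reported for lysozyme can be read as a ladder of species carrying one, two or three labels. The following sketch illustrates that reading under two assumptions that are not stated in the text: the lightest labeled peak carries exactly one label, and every label adds the same mass. The function name `label_counts` is illustrative only.

```python
def label_counts(unlabeled_mass, labeled_masses):
    """Return mass shifts and estimated label counts for a list of labeled peaks.

    Assumption: labeled_masses is sorted in ascending order and its first
    entry corresponds to a single label.
    """
    shifts = [round(mass - unlabeled_mass, 2) for mass in labeled_masses]
    per_label = shifts[0]  # mass added by one label, taken from the first peak
    counts = [round(shift / per_label) for shift in shifts]
    return shifts, counts


# Lysozyme masses as reported in this example.
lysozyme_shifts, lysozyme_counts = label_counts(
    14304.07, [14502.21, 14700.33, 14898.26]
)

for shift, count in zip(lysozyme_shifts, lysozyme_counts):
    print(f"shift {shift:.2f} Da -> about {count} label(s)")
```

With the reported values, the shifts are 198.14, 396.26 and 594.19, that is, close to integer multiples of roughly 198 per label.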
Example 11
Compound 1F was tested for Myoglobin protein labeling under light induction conditions.
Compound 1F was prepared as a 2.5 mM solution in MeOH, and myoglobin was prepared as a 27.5 μM solution in PBS. The MeOH solution of compound 1F (2.5 mM) was irradiated with 365 nm UV light for 7 min, then added to the myoglobin solution (27.5 μM in PBS) to a final concentration of 125 μM and mixed. The mixture was shaken at 25 °C for 1 hour; samples were then collected, diluted with water and analyzed by ESI-TOF, giving the labeled myoglobin protein. TAMRA-N3 was prepared as a 1.25 mM solution in DMSO; CuSO4 and THPTA were each prepared as 50 mM solutions in H2O and premixed at a volume ratio of 1:5; sodium ascorbate was prepared as a 100 mM stock solution in H2O. To 100 μL of the protein labeled with o-nitrobenzyl alcohol compound 1F (25 μM or 50 μM) were added TAMRA-N3 (100 μM), the premixed CuSO4 (100 μM) and THPTA (500 μM), and sodium ascorbate (5 mM); the mixture was reacted at 25 °C for 1 h and analyzed by SDS-PAGE. FIG. 10a shows the ESI-TOF spectra of unlabeled myoglobin and of myoglobin labeled with compound 1F, indicating that the covalent modification is almost quantitative. The molecular weight before labeling is 16951.38, and the molecular weight distribution after labeling is 17149.40, 17347.63, 17545.73 and 17743.63. Trypsin digestion and subsequent ESI-MS/MS analysis of the fragments also confirmed lysine-specific labeling (FIG. 10b). FIG. 10c is a fluorescence image of the SDS-PAGE gel after coupling of the labeled protein with TAMRA-N3, showing that the labeled protein can also be conjugated with tetramethylrhodamine, with good biocompatibility.
Myoglobin amino acid sequence (SEQ ID NO: 5):
GLSDGEWQQVLNVWGKVEADIAGHGQEVLIRLFTGHPETLEKFDKFKHLKTEAEMKASEDLKKHGTVVLTALGGILKKKGHHEAELKPLAQSHATKHKIPIKYLEFISDAIIHVLHSKHPGDFGADAQGAMTKALELFRNDIAAKYKELGFQG
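Because the labeling targets lysine side-chain amino groups, the lysine residues in a sequence give the candidate labeling sites. The following sketch lists them for the myoglobin sequence above (SEQ ID NO: 5). It is an illustration only: which lysines are actually modified is determined by the ESI-MS/MS data, not by this count, and the function name `lysine_positions` is an assumption introduced here.

```python
# Myoglobin sequence (SEQ ID NO: 5), split into 16-residue rows as in the sequence listing.
MYOGLOBIN = (
    "GLSDGEWQQVLNVWGK"
    "VEADIAGHGQEVLIRL"
    "FTGHPETLEKFDKFKH"
    "LKTEAEMKASEDLKKH"
    "GTVVLTALGGILKKKG"
    "HHEAELKPLAQSHATK"
    "HKIPIKYLEFISDAII"
    "HVLHSKHPGDFGADAQ"
    "GAMTKALELFRNDIAA"
    "KYKELGFQG"
)


def lysine_positions(sequence):
    """Return the 1-based positions of lysine (K) residues in a one-letter sequence."""
    return [index + 1 for index, residue in enumerate(sequence) if residue == "K"]


positions = lysine_positions(MYOGLOBIN)
print(f"length: {len(MYOGLOBIN)}, lysines: {len(positions)}")
print("lysine positions:", positions)
```

The number of lysines is an upper bound on the lysine labels per protein molecule; the mass ladder reported above shows fewer labels than candidate sites.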
Example 12
Compound 1F was tested for Nanobody-HER2 protein labeling under light-induced conditions. FIG. 11 shows the selective labeling of the amino group in the Nanobody-HER2 antibody by Compound 1F, and the preparation of antibody-fluorescent conjugates.
Compound 1F was prepared as a 2.5 mM solution in MeOH. The MeOH solution of compound 1F (2.5 mM) was irradiated with 365 nm UV light for 7 min, then added to a solution of Nanobody-HER2 (55 μM, 0.77 mg/mL) in PBS to a final concentration of 125 μM and mixed. The mixture was incubated at 25 °C for 1 h. Samples were collected, diluted with H2O and analyzed by ESI-TOF, giving the compound 1F-labeled Nanobody-HER2 protein. TAMRA-N3 was prepared as a 1.25 mM solution in DMSO; CuSO4 and THPTA were each prepared as 50 mM solutions in H2O and premixed at a volume ratio of 1:5; sodium ascorbate was prepared as a 100 mM stock solution in H2O. To 100 μL of the protein labeled with o-nitrobenzyl alcohol compound 1F (25 μM or 50 μM) were added TAMRA-N3 (100 μM), the premixed CuSO4 (100 μM) and THPTA (500 μM), and sodium ascorbate (5 mM); the mixture was reacted at 25 °C for 1 h and analyzed by SDS-PAGE. FIG. 12a shows the ESI-TOF spectra of the unlabeled Nanobody-HER2 antibody and of the antibody labeled with compound 1F, indicating that the covalent modification is almost quantitative. The molecular weight before labeling is 13694.56, and the molecular weight distribution after labeling is 13892.63, 14090.77, 14288.56 and 14485.93. Trypsin digestion and subsequent ESI-MS/MS analysis of the fragments also confirmed lysine-specific labeling (FIG. 12b). FIG. 12c is a fluorescence image of the SDS-PAGE gel after coupling of the labeled antibody with TAMRA-N3, showing that the labeled antibody can also be conjugated with tetramethylrhodamine to prepare an antibody-fluorescent conjugate.
Nanobody-HER2-wt-His6 amino acid sequence (SEQ ID NO: 6):
MQVQLQESGGGSVQAGGSLKLTCAASGYIFNSCGMGWYRQSPGRERELVSRISGDGDTWHKESVKGRFTISQDNVKKTLYLQMNSLKPEDTAVYFCAVCYNLETYWGQGTQVTVSSGGHHHHHH
SEQUENCE LISTING
<110> Shanghai Institute of Materia Medica, Chinese Academy of Sciences
<120> Linker for protein labeling and application thereof in biological medicine
<130> DI20-1875-XC91
<160> 7
<170> PatentIn version 3.5
<210> 1
<211> 69
<212> PRT
<213> Artificial Sequence
<220>
<223> Affibody
<400> 1
Met Thr Ser Val Asp Asn Lys Phe Asn Lys Glu Leu Ser Val Ala Gly
1 5 10 15
Arg Glu Ile Val Thr Leu Pro Asn Leu Asn Asp Pro Gln Lys Lys Ala
20 25 30
Phe Ile Phe Ser Leu Trp Asp Asp Pro Ser Gln Ser Ala Asn Leu Leu
35 40 45
Ala Glu Ala Lys Lys Leu Asn Asp Ala Gln Ala Pro Lys Gly Ser His
50 55 60
His His His His His
65
<210> 2
<211> 133
<212> PRT
<213> Artificial Sequence
<220>
<223> Nanobody-EGFR
<400> 2
Gln Val Lys Leu Glu Glu Ser Gly Gly Gly Ser Val Gln Thr Gly Gly
1 5 10 15
Ser Leu Arg Leu Thr Cys Ala Ala Ser Gly Arg Thr Ser Arg Ser Tyr
20 25 30
Gly Met Gly Trp Phe Arg Gln Ala Pro Gly Lys Glu Arg Glu Phe Val
35 40 45
Ser Gly Ile Ser Trp Arg Gly Asp Ser Thr Gly Tyr Ala Asp Ser Val
50 55 60
Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Asn Thr Val Asp
65 70 75 80
Leu Gln Met Asn Ser Leu Lys Pro Glu Asp Thr Ala Ile Tyr Tyr Cys
85 90 95
Ala Ala Ala Ala Gly Ser Ala Trp Tyr Gly Thr Leu Tyr Glu Tyr Asp
100 105 110
Tyr Trp Gly Gln Gly Thr Gln Val Thr Val Ser Ser Ala Leu Glu His
115 120 125
His His His His His
130
<210> 3
<211> 88
<212> PRT
<213> Artificial Sequence
<220>
<223> Ubiquitin
<400> 3
Met Thr Ser Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile
1 5 10 15
Thr Leu Glu Val Glu Pro Ser Asp Thr Ile Glu Asn Val Lys Ala Lys
20 25 30
Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe
35 40 45
Ala Gly Lys Gln Leu Glu Asp Gly Arg Thr Leu Ser Asp Tyr Asn Ile
50 55 60
Gln Lys Glu Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Leu Glu
65 70 75 80
His His His His His His His His
85
<210> 4
<211> 245
<212> PRT
<213> Artificial Sequence
<220>
<223> Chymotrypsinogen A
<400> 4
Cys Gly Val Pro Ala Ile Gln Pro Val Leu Ser Gly Leu Ser Arg Ile
1 5 10 15
Val Asn Gly Glu Glu Ala Val Pro Gly Ser Trp Pro Trp Gln Val Ser
20 25 30
Leu Gln Asp Lys Thr Gly Phe His Phe Cys Gly Gly Ser Leu Ile Asn
35 40 45
Glu Asn Trp Val Val Thr Ala Ala His Cys Gly Val Thr Thr Ser Asp
50 55 60
Val Val Val Ala Gly Glu Phe Asp Gln Gly Ser Ser Ser Glu Lys Ile
65 70 75 80
Gln Lys Leu Lys Ile Ala Lys Val Phe Lys Asn Ser Lys Tyr Asn Ser
85 90 95
Leu Thr Ile Asn Asn Asp Ile Thr Leu Leu Lys Leu Ser Thr Ala Ala
100 105 110
Ser Phe Ser Gln Thr Val Ser Ala Val Cys Leu Pro Ser Ala Ser Asp
115 120 125
Asp Phe Ala Ala Gly Thr Thr Cys Val Thr Thr Gly Trp Gly Leu Thr
130 135 140
Arg Tyr Thr Asn Ala Asn Thr Pro Asp Arg Leu Gln Gln Ala Ser Leu
145 150 155 160
Pro Leu Leu Ser Asn Thr Asn Cys Lys Lys Tyr Trp Gly Thr Lys Ile
165 170 175
Lys Asp Ala Met Ile Cys Ala Gly Ala Ser Gly Val Ser Ser Cys Met
180 185 190
Gly Asp Ser Gly Gly Pro Leu Val Cys Lys Lys Asn Gly Ala Trp Thr
195 200 205
Leu Val Gly Ile Val Ser Trp Gly Ser Ser Thr Cys Ser Thr Ser Thr
210 215 220
Pro Gly Val Tyr Ala Arg Val Thr Ala Leu Val Asn Trp Val Gln Gln
225 230 235 240
Thr Leu Ala Ala Asn
245
<210> 5
<211> 153
<212> PRT
<213> Artificial Sequence
<220>
<223> Myoglobin
<400> 5
Gly Leu Ser Asp Gly Glu Trp Gln Gln Val Leu Asn Val Trp Gly Lys
1 5 10 15
Val Glu Ala Asp Ile Ala Gly His Gly Gln Glu Val Leu Ile Arg Leu
20 25 30
Phe Thr Gly His Pro Glu Thr Leu Glu Lys Phe Asp Lys Phe Lys His
35 40 45
Leu Lys Thr Glu Ala Glu Met Lys Ala Ser Glu Asp Leu Lys Lys His
50 55 60
Gly Thr Val Val Leu Thr Ala Leu Gly Gly Ile Leu Lys Lys Lys Gly
65 70 75 80
His His Glu Ala Glu Leu Lys Pro Leu Ala Gln Ser His Ala Thr Lys
85 90 95
His Lys Ile Pro Ile Lys Tyr Leu Glu Phe Ile Ser Asp Ala Ile Ile
100 105 110
His Val Leu His Ser Lys His Pro Gly Asp Phe Gly Ala Asp Ala Gln
115 120 125
Gly Ala Met Thr Lys Ala Leu Glu Leu Phe Arg Asn Asp Ile Ala Ala
130 135 140
Lys Tyr Lys Glu Leu Gly Phe Gln Gly
145 150
<210> 6
<211> 124
<212> PRT
<213> Artificial Sequence
<220>
<223> Nanobody-HER2-wt-His6
<400> 6
Met Gln Val Gln Leu Gln Glu Ser Gly Gly Gly Ser Val Gln Ala Gly
1 5 10 15
Gly Ser Leu Lys Leu Thr Cys Ala Ala Ser Gly Tyr Ile Phe Asn Ser
20 25 30
Cys Gly Met Gly Trp Tyr Arg Gln Ser Pro Gly Arg Glu Arg Glu Leu
35 40 45
Val Ser Arg Ile Ser Gly Asp Gly Asp Thr Trp His Lys Glu Ser Val
50 55 60
Lys Gly Arg Phe Thr Ile Ser Gln Asp Asn Val Lys Lys Thr Leu Tyr
65 70 75 80
Leu Gln Met Asn Ser Leu Lys Pro Glu Asp Thr Ala Val Tyr Phe Cys
85 90 95
Ala Val Cys Tyr Asn Leu Glu Thr Tyr Trp Gly Gln Gly Thr Gln Val
100 105 110
Thr Val Ser Ser Gly Gly His His His His His His
115 120
<210> 7
<211> 129
<212> PRT
<213> Artificial Sequence
<220>
<223> Lysozyme
<400> 7
Lys Val Phe Gly Arg Cys Glu Leu Ala Ala Ala Met Lys Arg His Gly
1 5 10 15
Leu Asp Asn Tyr Arg Gly Tyr Ser Leu Gly Asn Trp Val Cys Ala Ala
20 25 30
Lys Phe Glu Ser Asn Phe Asn Thr Gln Ala Thr Asn Arg Asn Thr Asp
35 40 45
Gly Ser Thr Asp Tyr Gly Ile Leu Gln Ile Asn Ser Arg Trp Trp Cys
50 55 60
Asn Asp Gly Arg Thr Pro Gly Ser Arg Asn Leu Cys Asn Ile Pro Cys
65 70 75 80
Ser Ala Leu Leu Ser Ser Asp Ile Thr Ala Ser Val Asn Cys Ala Lys
85 90 95
Lys Ile Val Ser Asp Gly Asn Gly Met Asn Ala Trp Val Ala Trp Arg
100 105 110
Asn Arg Cys Lys Gly Thr Asp Val Gln Ala Trp Ile Arg Gly Cys Arg
115 120 125
Leu
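The sequence listing uses three-letter residue codes with position numbers on alternating lines, while the examples give the same sequences in one-letter code. The following sketch converts one form to the other so the two can be compared. It is an illustration only; the names `THREE_TO_ONE` and `to_one_letter` are introduced here and do not come from the listing.

```python
# Standard three-letter to one-letter codes for the 20 common amino acids.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(listing_lines):
    """Convert sequence-listing rows to a one-letter string, skipping position numbers."""
    residues = []
    for line in listing_lines:
        for token in line.split():
            if token.isdigit():
                continue  # position-number rows such as "1 5 10 15"
            residues.append(THREE_TO_ONE[token])
    return "".join(residues)


# First two residue rows of SEQ ID NO: 7 (lysozyme), with their numbering rows.
seq7_start = to_one_letter([
    "Lys Val Phe Gly Arg Cys Glu Leu Ala Ala Ala Met Lys Arg His Gly",
    "1 5 10 15",
    "Leu Asp Asn Tyr Arg Gly Tyr Ser Leu Gly Asn Trp Val Cys Ala Ala",
    "20 25 30",
])
print(seq7_start)
```

The result can be compared against the beginning of the one-letter lysozyme sequence given in Example 10.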
Claims (8)
1. A compound represented by the general formula (I), tautomers, enantiomers, diastereomers, racemates, isotopic compounds thereof, and salts in various forms or hydrates thereof:
wherein Y is selected from: -CO-, -NH-CH2-, -O-CO-CH2-, -NH-COO-CH2-, -NH-CO-NH-CH2-, -COOCH2-, -CO-NH-, -O-CH2-, -CH2-, -COO-, -OCO-, -O-, -S-, -SO2-, -C≡C-, -C=C-, -SO2NH-, -NHCONH-, -NHCSNH-, -NH-, -CONH-CH2-, or absent, wherein one end of Y may be attached to the 3, 4, 5 or 6 position of the phenyl ring in the o-nitrobenzyl alcohol structure;
wherein R1 represents one or more substituents at any of the 3, 4, 5 or 6 positions of the o-nitrobenzyl alcohol structure other than the position substituted by Y, and when there is more than one R1, they are identical or different; each R1 is independently selected from hydrogen, deuterium, amino, halogen, nitro, cyano, C1-6 alkyl, C3-10 cycloalkyl, C1-5 alkoxy, C1-6 alkylamino or aminoalkyl, C1-C6 alkylcarbonyl, C2-C6 alkoxycarbonyl, C2-C6 alkylaminocarbonyl, C5-8 heterocyclyl, C6-10 aryl, C5-6 heteroaryl, and [structural formula not reproduced], wherein n0 and n1 are 1, 2, 3, 4 or 5, and wherein said alkyl, cycloalkyl, alkoxy, alkylamino or aminoalkyl, alkylcarbonyl, alkoxycarbonyl, alkylaminocarbonyl, heterocyclyl, aryl and heteroaryl are optionally further substituted with one or more substituents selected from halogen, hydroxy, amino, C1-C6 alkoxy, cyano and nitro;
R is selected from hydrogen, deuterium, halogen, nitro, cyano, hydroxyl, alkyl hydroxyl, aryl hydroxyl, alkyl amino, aryl amino, sulfhydryl, alkyl sulfhydryl, aryl sulfhydryl, carboxylic acid, alkyl carboxylic acid, aryl carboxylic acid, alkynyl, alkyl alkynyl, aryl alkynyl, azide, alkyl azide, aryl azide, carbonyl, alkyl carbonyl, aryl carbonyl, aldehyde group, alkyl aldehyde group, aryl aldehyde group, alkyl, cycloalkyl, alkoxy, heterocyclic group, aryl, heteroaryl or any combination thereof, wherein the above groups are optionally further substituted with one or more groups selected from halogen, hydroxyl, amino, C1-C6 alkoxy, cyano and nitro.
2. A compound according to claim 1, its tautomers, enantiomers, diastereomers, racemates, isotopic compounds and salts in various forms or hydrates thereof, wherein
Y is selected from: -CO-, -NH-CH2-, -O-CO-CH2-, -NH-COO-CH2-, -NH-CO-NH-CH2-, -COOCH2-, -CO-NH-, -O-CH2-, -CH2-, -COO-, -OCO-, -O-, -S-, -SO2-, -C≡C-, -C=C-, -SO2NH-, -NHCONH-, -NHCSNH-, -NH-, -CONH-CH2-, or absent, wherein one end of Y may be attached to the 3, 4, 5 or 6 position of the phenyl ring in the o-nitrobenzyl alcohol structure;
R is selected from hydrogen, deuterium, halogen, nitro, cyano, hydroxyl, alkyl hydroxyl, aryl hydroxyl, alkyl amino, aryl amino, sulfhydryl, alkyl sulfhydryl, aryl sulfhydryl, carboxylic acid, alkyl carboxylic acid, aryl carboxylic acid, alkynyl, alkyl alkynyl, aryl alkynyl, azide, alkyl azide, aryl azide, carbonyl, alkyl carbonyl, aryl carbonyl, aldehyde group, alkyl aldehyde group, aryl aldehyde group, alkyl, cycloalkyl, alkoxy, heterocyclic group, aryl, heteroaryl or any combination thereof, wherein the above groups are optionally further substituted with one or more groups selected from halogen, hydroxyl, amino, C1-C6 alkoxy, cyano and nitro;
wherein R1 represents one or more substituents at any of the 3, 4, 5 or 6 positions of the o-nitrobenzyl alcohol structure other than the position substituted by Y, and when there is more than one R1, they are identical or different; each R1 is independently selected from hydrogen, deuterium, amino, halogen, C1-3 alkoxy, nitro or [structural formula not reproduced], wherein n0 and n1 are 1, 2, 3, 4 or 5.
3. A compound according to claim 1 or 2, its tautomers, enantiomers, diastereomers, racemates, precursor compounds, isotopic compounds and salts in various forms or hydrates thereof, wherein the compound of formula (I) is selected from the group consisting of the following formulae:
wherein R and R1 are as defined in the corresponding claims.
4. Use of a compound according to any one of claims 1 to 3, or a tautomer, enantiomer, diastereomer, racemate, precursor compound, isotopic compound, salt in various forms, or hydrate thereof, for the preparation of a label for selectively labeling a polypeptide or a lysine free amino group in a protein side chain.
5. Use of the compound of any one of claims 1 to 3, or a tautomer, enantiomer, diastereomer, racemate, precursor compound, isotopic compound, salt or hydrate thereof in various forms, for producing an antibody-drug conjugate, an antibody-affinity label, and an antibody-fluorescent substance.
6. The use according to claim 5, wherein the conjugate has the characteristic structural formula (II):
wherein Y and R1 are as defined in the corresponding claims,
A is a polypeptide or protein;
Z is L-X,
X is an affinity tag, a fluorescent label, an active drug or a derivative thereof, or is absent; the fluorescent labels include rhodamine, fluorescein, pigment and coumarin; the affinity tags include, in particular, biotin and folic acid; and the active drugs include maytansinoids, auristatins, calicheamicins, doxorubicin (adriamycin), pyrrolobenzodiazepine dimers (PBDs), triptolide, colchicine, combretastatin, homoharringtonine, camptothecin and paclitaxel, as well as any other drug that can be used in antibody-drug conjugates,
L is: C1-C9 alkyl, C2-C9 alkenyl, C2-C9 alkynyl, aryl, heteroaryl, C3-C9 cycloalkyl, C3-C9 heterocyclyl, -NR1-, -O-, -S-, -CO-, -OCO-, -COO-, -NHCO-, -CONR1-, -C=NR1-, -C=S-O-, -C=S-NR1-, -CS2-, -NR1CO-, -NR1CSNR2-, -OCONR1-, -OSO-, Val-Val-PAB, Val-Cit-PAB, Val-Ala-PAB, Val-Lys(Ac)-PAB, Phe-Lys(Ac)-PAB, D-Val-Leu-Lys, Gly-Gly-Arg, Ala-Ala-Asn-PAB, Ala-PAB, PAB, or any combination thereof, wherein R1 and R2 are independently selected from H, C1-C6 alkyl, C2-C6 alkenyl, C2-C6 alkynyl, aryl, heteroaryl, C3-C9 cycloalkyl and C3-C9 heterocyclyl, and n3 is 0-23; or L is absent.
8. The use of claim 7, wherein the antibody-drug conjugate is selected from the following structures;
wherein Y, L and R1 are as defined in the corresponding claims.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011173629.8A CN114409563B (en) | 2020-10-28 | 2020-10-28 | Linker for protein labeling and application thereof in biological medicine |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114409563A true CN114409563A (en) | 2022-04-29 |
CN114409563B CN114409563B (en) | 2023-11-10 |
Family
ID=81260597
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011173629.8A Active CN114409563B (en) | 2020-10-28 | 2020-10-28 | Linker for protein labeling and application thereof in biological medicine |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114409563B (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2024094131A1 (en) * | 2022-11-02 | 2024-05-10 | 大江基因医学股份有限公司 | Targeting molecule-cell complex and preparation method therefor |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2016059622A2 (en) * | 2016-02-04 | 2016-04-21 | Suzhou M-Conj Biotech Co., Ltd. | Specific conjugation linkers, specific immunoconjugates thereof, methods of making and uses such conjugates thereof |
WO2018177369A1 (en) * | 2017-03-30 | 2018-10-04 | 江苏恒瑞医药股份有限公司 | Method for preparing antibody-drug conjugate |
CN109734666A (en) * | 2018-05-03 | 2019-05-10 | 湖南大学 | A kind of carbon dioxide promotes and synthesizes indazole quinoline ketone compounds preparation method without photochemical catalyst photoinduction |
Also Published As
Publication number | Publication date |
---|---|
CN114409563B (en) | 2023-11-10 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||