AU675269B2 - Materials and methods for immunocontraception - Google Patents
Materials and methods for immunocontraceptionInfo
- Publication number
- AU675269B2 AU675269B2 AU56800/94A AU5680094A AU675269B2 AU 675269 B2 AU675269 B2 AU 675269B2 AU 56800/94 A AU56800/94 A AU 56800/94A AU 5680094 A AU5680094 A AU 5680094A AU 675269 B2 AU675269 B2 AU 675269B2
- Authority
- AU
- Australia
- Prior art keywords
- leu
- ser
- val
- thr
- pro
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
- 238000000034 method Methods 0.000 title claims abstract description 77
- 239000000463 material Substances 0.000 title description 6
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 241
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 211
- 210000004340 zona pellucida Anatomy 0.000 claims abstract description 152
- 108091028043 Nucleic acid sequence Proteins 0.000 claims abstract description 77
- 239000012634 fragment Substances 0.000 claims abstract description 72
- 208000000509 infertility Diseases 0.000 claims abstract description 46
- 230000036512 infertility Effects 0.000 claims abstract description 46
- 231100000535 infertility Toxicity 0.000 claims abstract description 33
- 230000001052 transient effect Effects 0.000 claims abstract description 18
- 230000001939 inductive effect Effects 0.000 claims abstract description 16
- 101000818870 Homo sapiens Zona pellucida sperm-binding protein 2 Proteins 0.000 claims description 162
- 101000976428 Homo sapiens Zona pellucida sperm-binding protein 4 Proteins 0.000 claims description 145
- 102100023598 Zona pellucida sperm-binding protein 4 Human genes 0.000 claims description 132
- 108020004414 DNA Proteins 0.000 claims description 68
- 241000282465 Canis Species 0.000 claims description 55
- 241000894007 species Species 0.000 claims description 45
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 41
- 241000124008 Mammalia Species 0.000 claims description 39
- 210000004027 cell Anatomy 0.000 claims description 39
- 241000283973 Oryctolagus cuniculus Species 0.000 claims description 37
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 34
- 241000282567 Macaca fascicularis Species 0.000 claims description 32
- 239000013598 vector Substances 0.000 claims description 28
- 241000282414 Homo sapiens Species 0.000 claims description 27
- 241000283690 Bos taurus Species 0.000 claims description 24
- 241000282324 Felis Species 0.000 claims description 15
- 239000002671 adjuvant Substances 0.000 claims description 15
- 238000004519 manufacturing process Methods 0.000 claims description 13
- 102000055963 human ZP2 Human genes 0.000 claims description 10
- 102000057586 human ZP4 Human genes 0.000 claims description 8
- 239000008194 pharmaceutical composition Substances 0.000 claims description 8
- 229920001184 polypeptide Polymers 0.000 claims description 6
- 241000701959 Escherichia virus Lambda Species 0.000 claims description 5
- 239000003433 contraceptive agent Substances 0.000 claims description 5
- 230000008569 process Effects 0.000 claims description 5
- 101100545373 Bos taurus ZP3 gene Proteins 0.000 claims description 4
- 230000002254 contraceptive effect Effects 0.000 claims description 4
- 101100268276 Bos taurus ZP2 gene Proteins 0.000 claims description 3
- 101100545385 Bos taurus ZP4 gene Proteins 0.000 claims description 3
- 239000003085 diluting agent Substances 0.000 claims description 2
- 239000003937 drug carrier Substances 0.000 claims description 2
- 235000015097 nutrients Nutrition 0.000 claims 1
- 241001465754 Metazoa Species 0.000 abstract description 59
- 238000002255 vaccination Methods 0.000 abstract description 10
- 235000018102 proteins Nutrition 0.000 description 195
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 152
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 122
- 102100021422 Zona pellucida sperm-binding protein 2 Human genes 0.000 description 122
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 120
- 150000001413 amino acids Chemical group 0.000 description 110
- 239000002299 complementary DNA Substances 0.000 description 76
- 235000001014 amino acid Nutrition 0.000 description 74
- 241000282472 Canis lupus familiaris Species 0.000 description 53
- 241000880493 Leptailurus serval Species 0.000 description 39
- 210000001672 ovary Anatomy 0.000 description 37
- 239000013612 plasmid Substances 0.000 description 37
- 150000007523 nucleic acids Chemical class 0.000 description 34
- 108020004707 nucleic acids Proteins 0.000 description 31
- 102000039446 nucleic acids Human genes 0.000 description 31
- 108010026333 seryl-proline Proteins 0.000 description 31
- 108010050848 glycylleucine Proteins 0.000 description 27
- 101000976442 Homo sapiens Zona pellucida sperm-binding protein 3 Proteins 0.000 description 25
- 102100023634 Zona pellucida sperm-binding protein 3 Human genes 0.000 description 25
- 238000012360 testing method Methods 0.000 description 25
- 108010057821 leucylproline Proteins 0.000 description 24
- 210000000287 oocyte Anatomy 0.000 description 24
- 239000000523 sample Substances 0.000 description 24
- 208000021267 infertility disease Diseases 0.000 description 22
- 108010048818 seryl-histidine Proteins 0.000 description 22
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 21
- 108010092114 histidylphenylalanine Proteins 0.000 description 21
- 238000002955 isolation Methods 0.000 description 21
- 241000282326 Felis catus Species 0.000 description 18
- 239000000243 solution Substances 0.000 description 18
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 17
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 17
- 108010049041 glutamylalanine Proteins 0.000 description 17
- 108010064235 lysylglycine Proteins 0.000 description 17
- 230000002611 ovarian Effects 0.000 description 17
- 108010053725 prolylvaline Proteins 0.000 description 17
- 108010068380 arginylarginine Proteins 0.000 description 16
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 16
- 239000000203 mixture Substances 0.000 description 16
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 15
- ZMXDDKWLCZADIW-UHFFFAOYSA-N N,N-Dimethylformamide Chemical compound CN(C)C=O ZMXDDKWLCZADIW-UHFFFAOYSA-N 0.000 description 15
- 108010093581 aspartyl-proline Proteins 0.000 description 15
- 230000003053 immunization Effects 0.000 description 15
- 108010054576 Deoxyribonuclease EcoRI Proteins 0.000 description 14
- 102000008937 Zona Pellucida Glycoproteins Human genes 0.000 description 14
- 108010074006 Zona Pellucida Glycoproteins Proteins 0.000 description 14
- 238000004458 analytical method Methods 0.000 description 14
- 210000004899 c-terminal region Anatomy 0.000 description 14
- 230000012173 estrus Effects 0.000 description 14
- 238000009396 hybridization Methods 0.000 description 14
- 238000002649 immunization Methods 0.000 description 14
- 108010051242 phenylalanylserine Proteins 0.000 description 14
- 101000818877 Homo sapiens Zona pellucida sperm-binding protein 1 Proteins 0.000 description 13
- 241000282898 Sus scrofa Species 0.000 description 13
- ZMANZCXQSJIPKH-UHFFFAOYSA-N Triethylamine Chemical compound CCN(CC)CC ZMANZCXQSJIPKH-UHFFFAOYSA-N 0.000 description 13
- 108020001507 fusion proteins Proteins 0.000 description 13
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 13
- 108010077515 glycylproline Proteins 0.000 description 13
- 238000012216 screening Methods 0.000 description 13
- 108010061238 threonyl-glycine Proteins 0.000 description 13
- 238000002965 ELISA Methods 0.000 description 12
- 241000588724 Escherichia coli Species 0.000 description 12
- 102000004961 Furin Human genes 0.000 description 12
- 108090001126 Furin Proteins 0.000 description 12
- 101000976425 Mus musculus Zona pellucida sperm-binding protein 3 Proteins 0.000 description 12
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 12
- 102100021401 Zona pellucida sperm-binding protein 1 Human genes 0.000 description 12
- 230000000692 anti-sense effect Effects 0.000 description 12
- 102000054766 genetic haplotypes Human genes 0.000 description 12
- 230000000366 juvenile effect Effects 0.000 description 12
- 238000002360 preparation method Methods 0.000 description 12
- 108010090894 prolylleucine Proteins 0.000 description 12
- 210000001519 tissue Anatomy 0.000 description 12
- 108010073969 valyllysine Proteins 0.000 description 12
- 239000003298 DNA probe Substances 0.000 description 11
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 11
- 238000012300 Sequence Analysis Methods 0.000 description 11
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 11
- 108010077245 asparaginyl-proline Proteins 0.000 description 11
- 239000000872 buffer Substances 0.000 description 11
- 108010004073 cysteinylcysteine Proteins 0.000 description 11
- 230000000694 effects Effects 0.000 description 11
- 102000037865 fusion proteins Human genes 0.000 description 11
- 108010034529 leucyl-lysine Proteins 0.000 description 11
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 11
- 108020004999 messenger RNA Proteins 0.000 description 11
- 238000003752 polymerase chain reaction Methods 0.000 description 11
- 108700024394 Exon Proteins 0.000 description 10
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 10
- UCCNDUPVIFOOQX-CUJWVEQBSA-N Thr-Cys-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 UCCNDUPVIFOOQX-CUJWVEQBSA-N 0.000 description 10
- 108010041407 alanylaspartic acid Proteins 0.000 description 10
- 108010047495 alanylglycine Proteins 0.000 description 10
- 239000000427 antigen Substances 0.000 description 10
- 102000036639 antigens Human genes 0.000 description 10
- 108091007433 antigens Proteins 0.000 description 10
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 10
- 108010025306 histidylleucine Proteins 0.000 description 10
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 10
- 239000002773 nucleotide Substances 0.000 description 10
- 125000003729 nucleotide group Chemical group 0.000 description 10
- 108010018625 phenylalanylarginine Proteins 0.000 description 10
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 10
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 9
- 241000282693 Cercopithecidae Species 0.000 description 9
- 241000699800 Cricetinae Species 0.000 description 9
- YFKWIIRWHGKSQQ-WFBYXXMGSA-N Cys-Trp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CS)N YFKWIIRWHGKSQQ-WFBYXXMGSA-N 0.000 description 9
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 9
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 9
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 9
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 9
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 9
- PIJXCSUPSNFXNE-QRZOAFCBSA-N N-acetyl-4-(N-acetylglucosaminyl)muramoyl-L-alanyl-D-isoglutamine Chemical compound OC(=O)CC[C@H](C(N)=O)NC(=O)[C@H](C)NC(=O)[C@@H](C)O[C@@H]1[C@@H](NC(C)=O)[C@H](O)O[C@H](CO)[C@H]1O[C@H]1[C@H](NC(C)=O)[C@@H](O)[C@H](O)[C@@H](CO)O1 PIJXCSUPSNFXNE-QRZOAFCBSA-N 0.000 description 9
- AWJGUZSYVIVZGP-YUMQZZPRSA-N Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 AWJGUZSYVIVZGP-YUMQZZPRSA-N 0.000 description 9
- 108010079005 RDV peptide Proteins 0.000 description 9
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 9
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 9
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 9
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 9
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 9
- 108010005233 alanylglutamic acid Proteins 0.000 description 9
- 108010087924 alanylproline Proteins 0.000 description 9
- 108010016616 cysteinylglycine Proteins 0.000 description 9
- 108010060199 cysteinylproline Proteins 0.000 description 9
- 230000029087 digestion Effects 0.000 description 9
- 230000035558 fertility Effects 0.000 description 9
- 108010036413 histidylglycine Proteins 0.000 description 9
- 239000008188 pellet Substances 0.000 description 9
- 108010070643 prolylglutamic acid Proteins 0.000 description 9
- 108010071635 tyrosyl-prolyl-arginine Proteins 0.000 description 9
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 8
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 8
- YBIAYFFIVAZXPK-AVGNSLFASA-N Arg-His-Arg Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YBIAYFFIVAZXPK-AVGNSLFASA-N 0.000 description 8
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 8
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 8
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 8
- XBELMDARIGXDKY-GUBZILKMSA-N Cys-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CS)N XBELMDARIGXDKY-GUBZILKMSA-N 0.000 description 8
- FCXJJTRGVAZDER-FXQIFTODSA-N Cys-Val-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O FCXJJTRGVAZDER-FXQIFTODSA-N 0.000 description 8
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 8
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 8
- LBSARGIQACMGDF-WBAXXEDZSA-N Phe-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 LBSARGIQACMGDF-WBAXXEDZSA-N 0.000 description 8
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 8
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 8
- 238000002105 Southern blotting Methods 0.000 description 8
- DIHPMRTXPYMDJZ-KAOXEZKKSA-N Thr-Tyr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N)O DIHPMRTXPYMDJZ-KAOXEZKKSA-N 0.000 description 8
- AUZADXNWQMBZOO-JYJNAYRXSA-N Tyr-Pro-Arg Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 AUZADXNWQMBZOO-JYJNAYRXSA-N 0.000 description 8
- PQPWEALFTLKSEB-DZKIICNBSA-N Tyr-Val-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PQPWEALFTLKSEB-DZKIICNBSA-N 0.000 description 8
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 8
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 8
- 108010060035 arginylproline Proteins 0.000 description 8
- 239000004202 carbamide Substances 0.000 description 8
- 238000010367 cloning Methods 0.000 description 8
- 238000002474 experimental method Methods 0.000 description 8
- 108010089804 glycyl-threonine Proteins 0.000 description 8
- 230000002796 immunocontraceptive effect Effects 0.000 description 8
- 108010003700 lysyl aspartic acid Proteins 0.000 description 8
- 108010012581 phenylalanylglutamate Proteins 0.000 description 8
- 239000000047 product Substances 0.000 description 8
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 7
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 7
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 7
- 108020004705 Codon Proteins 0.000 description 7
- RESAHOSBQHMOKH-KKUMJFAQSA-N Cys-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N RESAHOSBQHMOKH-KKUMJFAQSA-N 0.000 description 7
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 7
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 7
- UEGIPZAXNBYCCP-NKWVEPMBSA-N Gly-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)CN)C(=O)O UEGIPZAXNBYCCP-NKWVEPMBSA-N 0.000 description 7
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 7
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 7
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 7
- CZXKZMQKXQZDEX-YUMQZZPRSA-N His-Gly-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N CZXKZMQKXQZDEX-YUMQZZPRSA-N 0.000 description 7
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 7
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 7
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 7
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 7
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 7
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 7
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 7
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 7
- 108091034117 Oligonucleotide Proteins 0.000 description 7
- 108700026244 Open Reading Frames Proteins 0.000 description 7
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 7
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 7
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 7
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 7
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 7
- NFDYGNFETJVMSE-BQBZGAKWSA-N Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CO NFDYGNFETJVMSE-BQBZGAKWSA-N 0.000 description 7
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 7
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 7
- KWQBJOUOSNJDRR-XAVMHZPKSA-N Thr-Cys-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N)O KWQBJOUOSNJDRR-XAVMHZPKSA-N 0.000 description 7
- GRQCSEWEPIHLBI-JQWIXIFHSA-N Trp-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 GRQCSEWEPIHLBI-JQWIXIFHSA-N 0.000 description 7
- RGYCVIZZTUBSSG-JYJNAYRXSA-N Tyr-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O RGYCVIZZTUBSSG-JYJNAYRXSA-N 0.000 description 7
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 7
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 7
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 7
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 7
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 7
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 7
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 7
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 7
- 108010069495 cysteinyltyrosine Proteins 0.000 description 7
- 108010054813 diprotin B Proteins 0.000 description 7
- 230000006870 function Effects 0.000 description 7
- 108010018006 histidylserine Proteins 0.000 description 7
- 238000011534 incubation Methods 0.000 description 7
- 108010000761 leucylarginine Proteins 0.000 description 7
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 7
- 108010017391 lysylvaline Proteins 0.000 description 7
- 238000012545 processing Methods 0.000 description 7
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 6
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 6
- OILNWMNBLIHXQK-ZLUOBGJFSA-N Ala-Cys-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O OILNWMNBLIHXQK-ZLUOBGJFSA-N 0.000 description 6
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 6
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 6
- IIAXFBUTKIDDIP-ULQDDVLXSA-N Arg-Leu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IIAXFBUTKIDDIP-ULQDDVLXSA-N 0.000 description 6
- KSUALAGYYLQSHJ-RCWTZXSCSA-N Arg-Met-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSUALAGYYLQSHJ-RCWTZXSCSA-N 0.000 description 6
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 6
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 6
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 6
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 6
- QRHYAUYXBVVDSB-LKXGYXEUSA-N Asn-Cys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QRHYAUYXBVVDSB-LKXGYXEUSA-N 0.000 description 6
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 6
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 6
- 229920001661 Chitosan Polymers 0.000 description 6
- OCEHKDFAWQIBHH-FXQIFTODSA-N Cys-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N OCEHKDFAWQIBHH-FXQIFTODSA-N 0.000 description 6
- AOZBJZBKFHOYHL-AVGNSLFASA-N Cys-Glu-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O AOZBJZBKFHOYHL-AVGNSLFASA-N 0.000 description 6
- BCFXQBXXDSEHRS-FXQIFTODSA-N Cys-Ser-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BCFXQBXXDSEHRS-FXQIFTODSA-N 0.000 description 6
- NDNZRWUDUMTITL-FXQIFTODSA-N Cys-Ser-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NDNZRWUDUMTITL-FXQIFTODSA-N 0.000 description 6
- IRKLTAKLAFUTLA-KATARQTJSA-N Cys-Thr-Lys Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CCCCN)C(O)=O IRKLTAKLAFUTLA-KATARQTJSA-N 0.000 description 6
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 6
- PNAOVYHADQRJQU-GUBZILKMSA-N Glu-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N PNAOVYHADQRJQU-GUBZILKMSA-N 0.000 description 6
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 6
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 6
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 6
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 6
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 6
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 6
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 6
- BTNXKBVLWJBTNR-SRVKXCTJSA-N Leu-His-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O BTNXKBVLWJBTNR-SRVKXCTJSA-N 0.000 description 6
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 6
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 6
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 6
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 6
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 6
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 6
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 6
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 6
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 6
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 6
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 6
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 6
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 6
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 6
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 6
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 6
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 6
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 6
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 6
- VFWQQZMRKFOGLE-ZLUOBGJFSA-N Ser-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O VFWQQZMRKFOGLE-ZLUOBGJFSA-N 0.000 description 6
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 6
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 6
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 6
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 6
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 description 6
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 6
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 6
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 6
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 6
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 6
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 6
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 6
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 6
- 230000008878 coupling Effects 0.000 description 6
- 238000010168 coupling process Methods 0.000 description 6
- 238000005859 coupling reaction Methods 0.000 description 6
- 239000013604 expression vector Substances 0.000 description 6
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 6
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 6
- 108010037850 glycylvaline Proteins 0.000 description 6
- -1 i.e. Proteins 0.000 description 6
- 239000012528 membrane Substances 0.000 description 6
- 230000004048 modification Effects 0.000 description 6
- 238000012986 modification Methods 0.000 description 6
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 6
- 239000013600 plasmid vector Substances 0.000 description 6
- 108010029020 prolylglycine Proteins 0.000 description 6
- 108091008146 restriction endonucleases Proteins 0.000 description 6
- 238000012163 sequencing technique Methods 0.000 description 6
- 108010038745 tryptophylglycine Proteins 0.000 description 6
- 108010003137 tyrosyltyrosine Proteins 0.000 description 6
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 6
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 5
- 108010042708 Acetylmuramyl-Alanyl-Isoglutamine Proteins 0.000 description 5
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 5
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 5
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 5
- XPSGESXVBSQZPL-SRVKXCTJSA-N Arg-Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XPSGESXVBSQZPL-SRVKXCTJSA-N 0.000 description 5
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 5
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 5
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 5
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 5
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 5
- ZKDGORKGHPCZOV-DCAQKATOSA-N Asn-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZKDGORKGHPCZOV-DCAQKATOSA-N 0.000 description 5
- RYEWQKQXRJCHIO-SRVKXCTJSA-N Asp-Asn-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RYEWQKQXRJCHIO-SRVKXCTJSA-N 0.000 description 5
- AITKTFCQOBRJTG-CIUDSAMLSA-N Asp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N AITKTFCQOBRJTG-CIUDSAMLSA-N 0.000 description 5
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 5
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 5
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 5
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 5
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 5
- 108091026890 Coding region Proteins 0.000 description 5
- 108091035707 Consensus sequence Proteins 0.000 description 5
- BSFFNUBDVYTDMV-WHFBIAKZSA-N Cys-Gly-Asn Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BSFFNUBDVYTDMV-WHFBIAKZSA-N 0.000 description 5
- OXOQBEVULIBOSH-ZDLURKLDSA-N Cys-Gly-Thr Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O OXOQBEVULIBOSH-ZDLURKLDSA-N 0.000 description 5
- JZDHUJAFXGNDSB-WHFBIAKZSA-N Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O JZDHUJAFXGNDSB-WHFBIAKZSA-N 0.000 description 5
- ZGEJRLJEAMPEDV-SRVKXCTJSA-N Glu-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N ZGEJRLJEAMPEDV-SRVKXCTJSA-N 0.000 description 5
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 5
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 5
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 5
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 5
- OCPPBNKYGYSLOE-IUCAKERBSA-N Gly-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN OCPPBNKYGYSLOE-IUCAKERBSA-N 0.000 description 5
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 5
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 5
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 5
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 5
- KHUFDBQXGLEIHC-BZSNNMDCSA-N His-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 KHUFDBQXGLEIHC-BZSNNMDCSA-N 0.000 description 5
- FLXCRBXJRJSDHX-AVGNSLFASA-N His-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O FLXCRBXJRJSDHX-AVGNSLFASA-N 0.000 description 5
- GBMSSORHVHAYLU-QTKMDUPCSA-N His-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N)O GBMSSORHVHAYLU-QTKMDUPCSA-N 0.000 description 5
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 5
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 5
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 5
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 5
- KVOFSTUWVSQMDK-KKUMJFAQSA-N Leu-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KVOFSTUWVSQMDK-KKUMJFAQSA-N 0.000 description 5
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 5
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 5
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 5
- FVKRQMQQFGBXHV-QXEWZRGKSA-N Met-Asp-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FVKRQMQQFGBXHV-QXEWZRGKSA-N 0.000 description 5
- OOXVBECOTYHTCK-WDSOQIARSA-N Met-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCSC)N OOXVBECOTYHTCK-WDSOQIARSA-N 0.000 description 5
- 101000818886 Mus musculus Zona pellucida sperm-binding protein 2 Proteins 0.000 description 5
- 101100545381 Mus musculus Zp3 gene Proteins 0.000 description 5
- 239000004677 Nylon Substances 0.000 description 5
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 5
- CKJACGQPCPMWIT-UFYCRDLUSA-N Phe-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CKJACGQPCPMWIT-UFYCRDLUSA-N 0.000 description 5
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 5
- GLJZDMZJHFXJQG-BZSNNMDCSA-N Phe-Ser-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLJZDMZJHFXJQG-BZSNNMDCSA-N 0.000 description 5
- ZYNBEWGJFXTBDU-ACRUOGEOSA-N Phe-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N ZYNBEWGJFXTBDU-ACRUOGEOSA-N 0.000 description 5
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 5
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 5
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 5
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 5
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 5
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 5
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 5
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 5
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 5
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 5
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 5
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 5
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 5
- SGZVZUCRAVSPKQ-FXQIFTODSA-N Ser-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N SGZVZUCRAVSPKQ-FXQIFTODSA-N 0.000 description 5
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 5
- 108020005038 Terminator Codon Proteins 0.000 description 5
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 5
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 5
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 5
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 5
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 5
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 5
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 5
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 5
- PALLCTDPFINNMM-JQHSSLGASA-N Trp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N PALLCTDPFINNMM-JQHSSLGASA-N 0.000 description 5
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 5
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 5
- ZQGPWORGSNRQLN-NHCYSSNCSA-N Val-Asp-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZQGPWORGSNRQLN-NHCYSSNCSA-N 0.000 description 5
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 5
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 5
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 5
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 5
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 5
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 5
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 5
- 239000000499 gel Substances 0.000 description 5
- 102000055956 human ZP3 Human genes 0.000 description 5
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 5
- BSOQXXWZTUDTEL-ZUYCGGNHSA-N muramyl dipeptide Chemical compound OC(=O)CC[C@H](C(N)=O)NC(=O)[C@H](C)NC(=O)[C@@H](C)O[C@H]1[C@H](O)[C@@H](CO)O[C@@H](O)[C@@H]1NC(C)=O BSOQXXWZTUDTEL-ZUYCGGNHSA-N 0.000 description 5
- 229920001778 nylon Polymers 0.000 description 5
- 108010084572 phenylalanyl-valine Proteins 0.000 description 5
- 108010007513 prolyl-glycyl-prolyl-leucine Proteins 0.000 description 5
- 238000000527 sonication Methods 0.000 description 5
- 241000701447 unidentified baculovirus Species 0.000 description 5
- 229960005486 vaccine Drugs 0.000 description 5
- 108010009962 valyltyrosine Proteins 0.000 description 5
- 238000001262 western blot Methods 0.000 description 5
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 4
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 4
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 4
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 4
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 4
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 4
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 4
- DGFXIWKPTDKBLF-AVGNSLFASA-N Arg-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N DGFXIWKPTDKBLF-AVGNSLFASA-N 0.000 description 4
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 4
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 4
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 4
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 4
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 4
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 4
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 4
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 4
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 4
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 4
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 4
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 4
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 4
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 4
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 4
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 4
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 4
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 4
- NBKLEMWHDLAUEM-CIUDSAMLSA-N Asp-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N NBKLEMWHDLAUEM-CIUDSAMLSA-N 0.000 description 4
- OFYVKOXTTDCUIL-FXQIFTODSA-N Asp-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OFYVKOXTTDCUIL-FXQIFTODSA-N 0.000 description 4
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 4
- 101100545375 Canis lupus familiaris ZP3 gene Proteins 0.000 description 4
- 108020004635 Complementary DNA Proteins 0.000 description 4
- NRVQLLDIJJEIIZ-VZFHVOOUSA-N Cys-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N)O NRVQLLDIJJEIIZ-VZFHVOOUSA-N 0.000 description 4
- YWEHYKGJWHPGPY-XGEHTFHBSA-N Cys-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N)O YWEHYKGJWHPGPY-XGEHTFHBSA-N 0.000 description 4
- ZKAUCGZIIXXWJQ-BZSNNMDCSA-N Cys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CS)N)O ZKAUCGZIIXXWJQ-BZSNNMDCSA-N 0.000 description 4
- 108020003215 DNA Probes Proteins 0.000 description 4
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 4
- NKSGKPWXSWBRRX-ACZMJKKPSA-N Glu-Asn-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N NKSGKPWXSWBRRX-ACZMJKKPSA-N 0.000 description 4
- PBFGQTGPSKWHJA-QEJZJMRPSA-N Glu-Asp-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O PBFGQTGPSKWHJA-QEJZJMRPSA-N 0.000 description 4
- ZZIFPJZQHRJERU-WDSKDSINSA-N Glu-Cys-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZZIFPJZQHRJERU-WDSKDSINSA-N 0.000 description 4
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 4
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 4
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 4
- LIXWIUAORXJNBH-QWRGUYRKSA-N Gly-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN LIXWIUAORXJNBH-QWRGUYRKSA-N 0.000 description 4
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 4
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 4
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 4
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 4
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 4
- OHOXVDFVRDGFND-YUMQZZPRSA-N His-Cys-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CS)C(=O)NCC(O)=O OHOXVDFVRDGFND-YUMQZZPRSA-N 0.000 description 4
- LBCAQRFTWMMWRR-CIUDSAMLSA-N His-Cys-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O LBCAQRFTWMMWRR-CIUDSAMLSA-N 0.000 description 4
- VYUXYMRNGALHEA-DLOVCJGASA-N His-Leu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O VYUXYMRNGALHEA-DLOVCJGASA-N 0.000 description 4
- XMAUFHMAAVTODF-STQMWFEESA-N His-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 XMAUFHMAAVTODF-STQMWFEESA-N 0.000 description 4
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 4
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 4
- 108010065920 Insulin Lispro Proteins 0.000 description 4
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 4
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 4
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 4
- PNUCWVAGVNLUMW-CIUDSAMLSA-N Leu-Cys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O PNUCWVAGVNLUMW-CIUDSAMLSA-N 0.000 description 4
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 4
- XBCWOTOCBXXJDG-BZSNNMDCSA-N Leu-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 XBCWOTOCBXXJDG-BZSNNMDCSA-N 0.000 description 4
- FOBUGKUBUJOWAD-IHPCNDPISA-N Leu-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FOBUGKUBUJOWAD-IHPCNDPISA-N 0.000 description 4
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 4
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 4
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 4
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 4
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 4
- ATIPDCIQTUXABX-UWVGGRQHSA-N Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ATIPDCIQTUXABX-UWVGGRQHSA-N 0.000 description 4
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 4
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 4
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 4
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 4
- 108010079364 N-glycylalanine Proteins 0.000 description 4
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 4
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 4
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 4
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 4
- FELJDCNGZFDUNR-WDSKDSINSA-N Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FELJDCNGZFDUNR-WDSKDSINSA-N 0.000 description 4
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 4
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 4
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 4
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 4
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 4
- RNFKSBPHLTZHLU-WHFBIAKZSA-N Ser-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)O RNFKSBPHLTZHLU-WHFBIAKZSA-N 0.000 description 4
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 4
- WEQAYODCJHZSJZ-KKUMJFAQSA-N Ser-His-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 WEQAYODCJHZSJZ-KKUMJFAQSA-N 0.000 description 4
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 4
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 4
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 4
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 4
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 4
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 4
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 4
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 4
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 4
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 4
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 4
- CDBYLPFSWZWCQE-UHFFFAOYSA-L Sodium Carbonate Chemical compound [Na+].[Na+].[O-]C([O-])=O CDBYLPFSWZWCQE-UHFFFAOYSA-L 0.000 description 4
- 101000976429 Sus scrofa Zona pellucida sperm-binding protein 4 Proteins 0.000 description 4
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 4
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 4
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 4
- WFAUDCSNCWJJAA-KXNHARMFSA-N Thr-Lys-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(O)=O WFAUDCSNCWJJAA-KXNHARMFSA-N 0.000 description 4
- GYUUYCIXELGTJS-MEYUZBJRSA-N Thr-Phe-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O GYUUYCIXELGTJS-MEYUZBJRSA-N 0.000 description 4
- JNKAYADBODLPMQ-HSHDSVGOSA-N Thr-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)=CNC2=C1 JNKAYADBODLPMQ-HSHDSVGOSA-N 0.000 description 4
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 4
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 4
- OETOOJXFNSEYHQ-WFBYXXMGSA-N Trp-Ala-Asp Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 OETOOJXFNSEYHQ-WFBYXXMGSA-N 0.000 description 4
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 4
- YKCXQOBTISTQJD-BZSNNMDCSA-N Tyr-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YKCXQOBTISTQJD-BZSNNMDCSA-N 0.000 description 4
- ZMKDQRJLMRZHRI-ACRUOGEOSA-N Tyr-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N ZMKDQRJLMRZHRI-ACRUOGEOSA-N 0.000 description 4
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 4
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 4
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 4
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 4
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 4
- 108010045514 alpha-lactorphin Proteins 0.000 description 4
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 4
- 229960000723 ampicillin Drugs 0.000 description 4
- 108010013835 arginine glutamate Proteins 0.000 description 4
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 4
- 108010068265 aspartyltyrosine Proteins 0.000 description 4
- 230000006472 autoimmune response Effects 0.000 description 4
- 108010005774 beta-Galactosidase Proteins 0.000 description 4
- 238000009395 breeding Methods 0.000 description 4
- 230000001488 breeding effect Effects 0.000 description 4
- 239000003795 chemical substances by application Substances 0.000 description 4
- 230000001351 cycling effect Effects 0.000 description 4
- 239000008367 deionised water Substances 0.000 description 4
- 229910021641 deionized water Inorganic materials 0.000 description 4
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 4
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 4
- 108010028295 histidylhistidine Proteins 0.000 description 4
- 238000010562 histological examination Methods 0.000 description 4
- 230000001900 immune effect Effects 0.000 description 4
- 230000006698 induction Effects 0.000 description 4
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 4
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 4
- 239000011159 matrix material Substances 0.000 description 4
- 108010031719 prolyl-serine Proteins 0.000 description 4
- 238000000746 purification Methods 0.000 description 4
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 4
- 239000000758 substrate Substances 0.000 description 4
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- 108010029384 tryptophyl-histidine Proteins 0.000 description 4
- 108010020532 tyrosyl-proline Proteins 0.000 description 4
- 238000005406 washing Methods 0.000 description 4
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 3
- WFDIJRYMOXRFFG-UHFFFAOYSA-N Acetic anhydride Chemical compound CC(=O)OC(C)=O WFDIJRYMOXRFFG-UHFFFAOYSA-N 0.000 description 3
- 229920001817 Agar Polymers 0.000 description 3
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 3
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 3
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 3
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 3
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 3
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 3
- AAWLEICNDUHIJM-MBLNEYKQSA-N Ala-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C)N)O AAWLEICNDUHIJM-MBLNEYKQSA-N 0.000 description 3
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 3
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 3
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 3
- DPNHSNLIULPOBH-GUBZILKMSA-N Arg-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DPNHSNLIULPOBH-GUBZILKMSA-N 0.000 description 3
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 3
- SLNCSSWAIDUUGF-LSJOCFKGSA-N Arg-His-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O SLNCSSWAIDUUGF-LSJOCFKGSA-N 0.000 description 3
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 3
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 3
- SYFHFLGAROUHNT-VEVYYDQMSA-N Arg-Thr-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SYFHFLGAROUHNT-VEVYYDQMSA-N 0.000 description 3
- QQEWINYJRFBLNN-DLOVCJGASA-N Asn-Ala-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QQEWINYJRFBLNN-DLOVCJGASA-N 0.000 description 3
- CQMQJWRCRQSBAF-BPUTZDHNSA-N Asn-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N CQMQJWRCRQSBAF-BPUTZDHNSA-N 0.000 description 3
- FFMIYIMKQIMDPK-BQBZGAKWSA-N Asn-His Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 FFMIYIMKQIMDPK-BQBZGAKWSA-N 0.000 description 3
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 3
- KWBQPGIYEZKDEG-FSPLSTOPSA-N Asn-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(N)=O KWBQPGIYEZKDEG-FSPLSTOPSA-N 0.000 description 3
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 3
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 3
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 3
- ILQCHXURSRRIRY-YUMQZZPRSA-N Asp-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N ILQCHXURSRRIRY-YUMQZZPRSA-N 0.000 description 3
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 3
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 3
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 3
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 3
- 241000972773 Aulopiformes Species 0.000 description 3
- 101100268277 Canis lupus familiaris ZP2 gene Proteins 0.000 description 3
- PKNIZMPLMSKROD-BIIVOSGPSA-N Cys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N PKNIZMPLMSKROD-BIIVOSGPSA-N 0.000 description 3
- KPENUVBHAKRDQR-GUBZILKMSA-N Cys-His-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPENUVBHAKRDQR-GUBZILKMSA-N 0.000 description 3
- XMVZMBGFIOQONW-GARJFASQSA-N Cys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)C(=O)O XMVZMBGFIOQONW-GARJFASQSA-N 0.000 description 3
- CMYVIUWVYHOLRD-ZLUOBGJFSA-N Cys-Ser-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CMYVIUWVYHOLRD-ZLUOBGJFSA-N 0.000 description 3
- IXPSSIBVVKSOIE-SRVKXCTJSA-N Cys-Ser-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O IXPSSIBVVKSOIE-SRVKXCTJSA-N 0.000 description 3
- FNXOZWPPOJRBRE-XGEHTFHBSA-N Cys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CS)N)O FNXOZWPPOJRBRE-XGEHTFHBSA-N 0.000 description 3
- FLQAKQOBSPFGKG-CIUDSAMLSA-N Glu-Cys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLQAKQOBSPFGKG-CIUDSAMLSA-N 0.000 description 3
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 3
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 3
- LZMQSTPFYJLVJB-GUBZILKMSA-N Glu-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N LZMQSTPFYJLVJB-GUBZILKMSA-N 0.000 description 3
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 3
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 3
- JSIQVRIXMINMTA-ZDLURKLDSA-N Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O JSIQVRIXMINMTA-ZDLURKLDSA-N 0.000 description 3
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 3
- SUDUYJOBLHQAMI-WHFBIAKZSA-N Gly-Asp-Cys Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O SUDUYJOBLHQAMI-WHFBIAKZSA-N 0.000 description 3
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 3
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 3
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 3
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 3
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 3
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 3
- DZMVESFTHXSSPZ-XVYDVKMFSA-N His-Ala-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DZMVESFTHXSSPZ-XVYDVKMFSA-N 0.000 description 3
- PDSUIXMZYNURGI-AVGNSLFASA-N His-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 PDSUIXMZYNURGI-AVGNSLFASA-N 0.000 description 3
- RNMNYMDTESKEAJ-KKUMJFAQSA-N His-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 RNMNYMDTESKEAJ-KKUMJFAQSA-N 0.000 description 3
- SAPLASXFNUYUFE-CQDKDKBSSA-N His-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N SAPLASXFNUYUFE-CQDKDKBSSA-N 0.000 description 3
- ALPXXNRQBMRCPZ-MEYUZBJRSA-N His-Thr-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ALPXXNRQBMRCPZ-MEYUZBJRSA-N 0.000 description 3
- 241000282412 Homo Species 0.000 description 3
- 101150017040 I gene Proteins 0.000 description 3
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 3
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 3
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 3
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 3
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 3
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 3
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 3
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 3
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 3
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 3
- KTOIECMYZZGVSI-BZSNNMDCSA-N Leu-Phe-His Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 KTOIECMYZZGVSI-BZSNNMDCSA-N 0.000 description 3
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 3
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 3
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 3
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 3
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 3
- HOMFINRJHIIZNJ-HOCLYGCPSA-N Leu-Trp-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O HOMFINRJHIIZNJ-HOCLYGCPSA-N 0.000 description 3
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 3
- VHFFQUSNFFIZBT-CIUDSAMLSA-N Lys-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N VHFFQUSNFFIZBT-CIUDSAMLSA-N 0.000 description 3
- SSYOBDBNBQBSQE-SRVKXCTJSA-N Lys-Cys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O SSYOBDBNBQBSQE-SRVKXCTJSA-N 0.000 description 3
- BEGQVWUZFXLNHZ-IHPCNDPISA-N Lys-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 BEGQVWUZFXLNHZ-IHPCNDPISA-N 0.000 description 3
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 3
- YQAIUOWPSUOINN-IUCAKERBSA-N Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN YQAIUOWPSUOINN-IUCAKERBSA-N 0.000 description 3
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 3
- QRHWTCJBCLGYRB-FXQIFTODSA-N Met-Ala-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O QRHWTCJBCLGYRB-FXQIFTODSA-N 0.000 description 3
- FJVJLMZUIGMFFU-BQBZGAKWSA-N Met-Asp-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FJVJLMZUIGMFFU-BQBZGAKWSA-N 0.000 description 3
- DNDVVILEHVMWIS-LPEHRKFASA-N Met-Asp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DNDVVILEHVMWIS-LPEHRKFASA-N 0.000 description 3
- MHQXIBRPDKXDGZ-ZFWWWQNUSA-N Met-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 MHQXIBRPDKXDGZ-ZFWWWQNUSA-N 0.000 description 3
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 3
- KAKJTZWHIUWTTD-VQVTYTSYSA-N Met-Thr Chemical compound CSCC[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)O)C([O-])=O KAKJTZWHIUWTTD-VQVTYTSYSA-N 0.000 description 3
- FSTWDRPCQQUJIT-NHCYSSNCSA-N Met-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N FSTWDRPCQQUJIT-NHCYSSNCSA-N 0.000 description 3
- VYDLZDRMOFYOGV-TUAOUCFPSA-N Met-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N VYDLZDRMOFYOGV-TUAOUCFPSA-N 0.000 description 3
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 3
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 3
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 3
- 108010065395 Neuropep-1 Proteins 0.000 description 3
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 3
- WSXKXSBOJXEZDV-DLOVCJGASA-N Phe-Ala-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@H](C)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 WSXKXSBOJXEZDV-DLOVCJGASA-N 0.000 description 3
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 3
- FMMIYCMOVGXZIP-AVGNSLFASA-N Phe-Glu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O FMMIYCMOVGXZIP-AVGNSLFASA-N 0.000 description 3
- FINLZXKJWTYYLC-ACRUOGEOSA-N Phe-His-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FINLZXKJWTYYLC-ACRUOGEOSA-N 0.000 description 3
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 3
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 3
- 241000288906 Primates Species 0.000 description 3
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 3
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 3
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 3
- ZKQOUHVVXABNDG-IUCAKERBSA-N Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 ZKQOUHVVXABNDG-IUCAKERBSA-N 0.000 description 3
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 3
- VGVCNKSUVSZEIE-IHRRRGAJSA-N Pro-Phe-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O VGVCNKSUVSZEIE-IHRRRGAJSA-N 0.000 description 3
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 3
- CNUIHOAISPKQPY-HSHDSVGOSA-N Pro-Thr-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O CNUIHOAISPKQPY-HSHDSVGOSA-N 0.000 description 3
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 3
- XRGIDCGRSSWCKE-SRVKXCTJSA-N Pro-Val-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O XRGIDCGRSSWCKE-SRVKXCTJSA-N 0.000 description 3
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 3
- MTMJNKFZDQEVSY-BZSNNMDCSA-N Pro-Val-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MTMJNKFZDQEVSY-BZSNNMDCSA-N 0.000 description 3
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 3
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 3
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 3
- RFBKULCUBJAQFT-BIIVOSGPSA-N Ser-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CO)N)C(=O)O RFBKULCUBJAQFT-BIIVOSGPSA-N 0.000 description 3
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 3
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 3
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 3
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 3
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 3
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 3
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 3
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 3
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 3
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 3
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 3
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 3
- 108091081024 Start codon Proteins 0.000 description 3
- PKXHGEXFMIZSER-QTKMDUPCSA-N Thr-Arg-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O PKXHGEXFMIZSER-QTKMDUPCSA-N 0.000 description 3
- ODSAPYVQSLDRSR-LKXGYXEUSA-N Thr-Cys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O ODSAPYVQSLDRSR-LKXGYXEUSA-N 0.000 description 3
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 3
- GUHLYMZJVXUIPO-RCWTZXSCSA-N Thr-Met-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GUHLYMZJVXUIPO-RCWTZXSCSA-N 0.000 description 3
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 3
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 3
- 108091036066 Three prime untranslated region Proteins 0.000 description 3
- PEYSVKMXSLPQRU-FJHTZYQYSA-N Trp-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O PEYSVKMXSLPQRU-FJHTZYQYSA-N 0.000 description 3
- ADMHZNPMMVKGJW-BPUTZDHNSA-N Trp-Ser-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N ADMHZNPMMVKGJW-BPUTZDHNSA-N 0.000 description 3
- GSCPHMSPGQSZJT-JYBASQMISA-N Trp-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O GSCPHMSPGQSZJT-JYBASQMISA-N 0.000 description 3
- DDHFMBDACJYSKW-AQZXSJQPSA-N Trp-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DDHFMBDACJYSKW-AQZXSJQPSA-N 0.000 description 3
- IIJWXEUNETVJPV-IHRRRGAJSA-N Tyr-Arg-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N)O IIJWXEUNETVJPV-IHRRRGAJSA-N 0.000 description 3
- QKXAEWMHAAVVGS-KKUMJFAQSA-N Tyr-Pro-Glu Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O QKXAEWMHAAVVGS-KKUMJFAQSA-N 0.000 description 3
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 3
- IVXJODPZRWHCCR-JYJNAYRXSA-N Val-Arg-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IVXJODPZRWHCCR-JYJNAYRXSA-N 0.000 description 3
- AHHJARQXFFGOKF-NRPADANISA-N Val-Glu-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N AHHJARQXFFGOKF-NRPADANISA-N 0.000 description 3
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 3
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 3
- ZIGZPYJXIWLQFC-QTKMDUPCSA-N Val-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N)O ZIGZPYJXIWLQFC-QTKMDUPCSA-N 0.000 description 3
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 3
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 3
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 3
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 3
- UXODSMTVPWXHBT-ULQDDVLXSA-N Val-Phe-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N UXODSMTVPWXHBT-ULQDDVLXSA-N 0.000 description 3
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 3
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 3
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 3
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 3
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 3
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 3
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 3
- GVRKWABULJAONN-VQVTYTSYSA-N Val-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVRKWABULJAONN-VQVTYTSYSA-N 0.000 description 3
- STTYIMSDIYISRG-UHFFFAOYSA-N Valyl-Serine Chemical compound CC(C)C(N)C(=O)NC(CO)C(O)=O STTYIMSDIYISRG-UHFFFAOYSA-N 0.000 description 3
- 239000008272 agar Substances 0.000 description 3
- 238000013019 agitation Methods 0.000 description 3
- 108010070944 alanylhistidine Proteins 0.000 description 3
- 150000003862 amino acid derivatives Chemical class 0.000 description 3
- 230000000890 antigenic effect Effects 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- 108010038633 aspartylglutamate Proteins 0.000 description 3
- 108010092854 aspartyllysine Proteins 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 238000010511 deprotection reaction Methods 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 108010037389 glutamyl-cysteinyl-lysine Proteins 0.000 description 3
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 3
- 108010015792 glycyllysine Proteins 0.000 description 3
- 229960000789 guanidine hydrochloride Drugs 0.000 description 3
- PJJJBBJSCAKJQF-UHFFFAOYSA-N guanidinium chloride Chemical compound [Cl-].NC(N)=[NH2+] PJJJBBJSCAKJQF-UHFFFAOYSA-N 0.000 description 3
- 238000002347 injection Methods 0.000 description 3
- 239000007924 injection Substances 0.000 description 3
- 238000001155 isoelectric focusing Methods 0.000 description 3
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 3
- 238000010841 mRNA extraction Methods 0.000 description 3
- 238000013507 mapping Methods 0.000 description 3
- 239000002480 mineral oil Substances 0.000 description 3
- 235000010446 mineral oil Nutrition 0.000 description 3
- 239000002751 oligonucleotide probe Substances 0.000 description 3
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 3
- 108010024607 phenylalanylalanine Proteins 0.000 description 3
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 3
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 3
- 108091033319 polynucleotide Proteins 0.000 description 3
- 102000040430 polynucleotide Human genes 0.000 description 3
- 239000002157 polynucleotide Substances 0.000 description 3
- 230000037452 priming Effects 0.000 description 3
- 108010077112 prolyl-proline Proteins 0.000 description 3
- 238000010188 recombinant method Methods 0.000 description 3
- 230000000717 retained effect Effects 0.000 description 3
- 235000019515 salmon Nutrition 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- AJPJDKMHJJGVTQ-UHFFFAOYSA-M sodium dihydrogen phosphate Chemical compound [Na+].OP(O)([O-])=O AJPJDKMHJJGVTQ-UHFFFAOYSA-M 0.000 description 3
- 229910000162 sodium phosphate Inorganic materials 0.000 description 3
- 239000007929 subcutaneous injection Substances 0.000 description 3
- 238000010254 subcutaneous injection Methods 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 3
- 108010051110 tyrosyl-lysine Proteins 0.000 description 3
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 3
- 239000002569 water oil cream Substances 0.000 description 3
- NTUPOKHATNSWCY-PMPSAXMXSA-N (2s)-2-[[(2s)-1-[(2r)-2-amino-3-phenylpropanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C([C@@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=CC=C1 NTUPOKHATNSWCY-PMPSAXMXSA-N 0.000 description 2
- NTWUFSCNXWKSGG-BOLZHIRLSA-N (2s)-2-[[2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]amino]-n-[(2s)-1-amino-1-oxo-3-phenylpropan-2-yl]-3-methylpentanamide Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](C(C)CC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(N)=O)C1=CC=C(O)C=C1 NTWUFSCNXWKSGG-BOLZHIRLSA-N 0.000 description 2
- NGEZPLCPKXKLQQ-VOTSOKGWSA-N (e)-4-(3-methoxyphenyl)but-3-en-2-one Chemical compound COC1=CC=CC(\C=C\C(C)=O)=C1 NGEZPLCPKXKLQQ-VOTSOKGWSA-N 0.000 description 2
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 2
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 2
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 2
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 2
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 2
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 2
- HFBFSOAKPUZCCO-ZLUOBGJFSA-N Ala-Cys-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HFBFSOAKPUZCCO-ZLUOBGJFSA-N 0.000 description 2
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 2
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 2
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 2
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 2
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 2
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 2
- UWIQWPWWZUHBAO-ZLIFDBKOSA-N Ala-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)CC(C)C)C(O)=O)=CNC2=C1 UWIQWPWWZUHBAO-ZLIFDBKOSA-N 0.000 description 2
- OMFMCIVBKCEMAK-CYDGBPFRSA-N Ala-Leu-Val-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O OMFMCIVBKCEMAK-CYDGBPFRSA-N 0.000 description 2
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 2
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 2
- DRARURMRLANNLS-GUBZILKMSA-N Ala-Met-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O DRARURMRLANNLS-GUBZILKMSA-N 0.000 description 2
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 2
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 2
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 2
- CQJHFKKGZXKZBC-BPNCWPANSA-N Ala-Pro-Tyr Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CQJHFKKGZXKZBC-BPNCWPANSA-N 0.000 description 2
- NHWYNIZWLJYZAG-XVYDVKMFSA-N Ala-Ser-His Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N NHWYNIZWLJYZAG-XVYDVKMFSA-N 0.000 description 2
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 2
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 2
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 2
- ISCYZXFOCXWUJU-KZVJFYERSA-N Ala-Thr-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O ISCYZXFOCXWUJU-KZVJFYERSA-N 0.000 description 2
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 2
- FSXDWQGEWZQBPJ-HERUPUMHSA-N Ala-Trp-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FSXDWQGEWZQBPJ-HERUPUMHSA-N 0.000 description 2
- NLXLAEXVIDQMFP-UHFFFAOYSA-N Ammonia chloride Chemical compound [NH4+].[Cl-] NLXLAEXVIDQMFP-UHFFFAOYSA-N 0.000 description 2
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 2
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 2
- JGDGLDNAQJJGJI-AVGNSLFASA-N Arg-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N JGDGLDNAQJJGJI-AVGNSLFASA-N 0.000 description 2
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 2
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 2
- DGFGDPVSDQPANQ-XGEHTFHBSA-N Arg-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N)O DGFGDPVSDQPANQ-XGEHTFHBSA-N 0.000 description 2
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 2
- PPPXVIBMLFWNSK-BQBZGAKWSA-N Arg-Gly-Cys Chemical compound C(C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N PPPXVIBMLFWNSK-BQBZGAKWSA-N 0.000 description 2
- BMNVSPMWMICFRV-DCAQKATOSA-N Arg-His-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CN=CN1 BMNVSPMWMICFRV-DCAQKATOSA-N 0.000 description 2
- RKQRHMKFNBYOTN-IHRRRGAJSA-N Arg-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N RKQRHMKFNBYOTN-IHRRRGAJSA-N 0.000 description 2
- CRCCTGPNZUCAHE-DCAQKATOSA-N Arg-His-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 CRCCTGPNZUCAHE-DCAQKATOSA-N 0.000 description 2
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 2
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 2
- NIUDXSFNLBIWOB-DCAQKATOSA-N Arg-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NIUDXSFNLBIWOB-DCAQKATOSA-N 0.000 description 2
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 2
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 2
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 2
- SSZGOKWBHLOCHK-DCAQKATOSA-N Arg-Lys-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N SSZGOKWBHLOCHK-DCAQKATOSA-N 0.000 description 2
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 2
- XKDYWGLNSCNRGW-WDSOQIARSA-N Arg-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCN=C(N)N)CCCCN)C(O)=O)=CNC2=C1 XKDYWGLNSCNRGW-WDSOQIARSA-N 0.000 description 2
- VVJTWSRNMJNDPN-IUCAKERBSA-N Arg-Met-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O VVJTWSRNMJNDPN-IUCAKERBSA-N 0.000 description 2
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 description 2
- 108010051330 Arg-Pro-Gly-Pro Proteins 0.000 description 2
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 2
- JJIBHAOBNIFUEL-SRVKXCTJSA-N Arg-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCN=C(N)N)N JJIBHAOBNIFUEL-SRVKXCTJSA-N 0.000 description 2
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 2
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 2
- WCZXPVPHUMYLMS-VEVYYDQMSA-N Arg-Thr-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O WCZXPVPHUMYLMS-VEVYYDQMSA-N 0.000 description 2
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 2
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 2
- QHUOOCKNNURZSL-IHRRRGAJSA-N Arg-Tyr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O QHUOOCKNNURZSL-IHRRRGAJSA-N 0.000 description 2
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 2
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 2
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 2
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 2
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 2
- NKTLGLBAGUJEGA-BIIVOSGPSA-N Asn-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N)C(=O)O NKTLGLBAGUJEGA-BIIVOSGPSA-N 0.000 description 2
- FGYUMGXLCZYNQG-UBHSHLNASA-N Asn-Cys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CS)NC(=O)[C@H](CC(N)=O)N)C(O)=O)=CNC2=C1 FGYUMGXLCZYNQG-UBHSHLNASA-N 0.000 description 2
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 2
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 2
- RAKKBBHMTJSXOY-XVYDVKMFSA-N Asn-His-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O RAKKBBHMTJSXOY-XVYDVKMFSA-N 0.000 description 2
- ZTRJUKDEALVRMW-SRVKXCTJSA-N Asn-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZTRJUKDEALVRMW-SRVKXCTJSA-N 0.000 description 2
- QUAWOKPCAKCHQL-SRVKXCTJSA-N Asn-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QUAWOKPCAKCHQL-SRVKXCTJSA-N 0.000 description 2
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 2
- LWXJVHTUEDHDLG-XUXIUFHCSA-N Asn-Leu-Leu-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LWXJVHTUEDHDLG-XUXIUFHCSA-N 0.000 description 2
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 2
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 2
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 2
- VOGCFWDZYYTEOY-DCAQKATOSA-N Asn-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N VOGCFWDZYYTEOY-DCAQKATOSA-N 0.000 description 2
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 2
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 description 2
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 2
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 2
- KYQJHBWHRASMKG-ZLUOBGJFSA-N Asn-Ser-Cys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O KYQJHBWHRASMKG-ZLUOBGJFSA-N 0.000 description 2
- JXMREEPBRANWBY-VEVYYDQMSA-N Asn-Thr-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JXMREEPBRANWBY-VEVYYDQMSA-N 0.000 description 2
- PUUPMDXIHCOPJU-HJGDQZAQSA-N Asn-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O PUUPMDXIHCOPJU-HJGDQZAQSA-N 0.000 description 2
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 2
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 2
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 2
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 2
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 2
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 2
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 2
- QOVWVLLHMMCFFY-ZLUOBGJFSA-N Asp-Asp-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QOVWVLLHMMCFFY-ZLUOBGJFSA-N 0.000 description 2
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 2
- UWOPETAWXDZUJR-ACZMJKKPSA-N Asp-Cys-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O UWOPETAWXDZUJR-ACZMJKKPSA-N 0.000 description 2
- QQXOYLWJQUPXJU-WHFBIAKZSA-N Asp-Cys-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O QQXOYLWJQUPXJU-WHFBIAKZSA-N 0.000 description 2
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 2
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 2
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- OEDJQRXNDRUGEU-SRVKXCTJSA-N Asp-Leu-His Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O OEDJQRXNDRUGEU-SRVKXCTJSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 2
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 2
- LIQNMKIBMPEOOP-IHRRRGAJSA-N Asp-Phe-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)O)N LIQNMKIBMPEOOP-IHRRRGAJSA-N 0.000 description 2
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 2
- BKOIIURTQAJHAT-GUBZILKMSA-N Asp-Pro-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 BKOIIURTQAJHAT-GUBZILKMSA-N 0.000 description 2
- DWBZEJHQQIURML-IMJSIDKUSA-N Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O DWBZEJHQQIURML-IMJSIDKUSA-N 0.000 description 2
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 2
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 2
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 2
- UEFODXNXUAVPTC-VEVYYDQMSA-N Asp-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UEFODXNXUAVPTC-VEVYYDQMSA-N 0.000 description 2
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 2
- RMFITHMDQGFSDC-UBHSHLNASA-N Asp-Trp-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N RMFITHMDQGFSDC-UBHSHLNASA-N 0.000 description 2
- BOXNGMVEVOGXOJ-UBHSHLNASA-N Asp-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N BOXNGMVEVOGXOJ-UBHSHLNASA-N 0.000 description 2
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 2
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 2
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 2
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 2
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- 241001466804 Carnivora Species 0.000 description 2
- AYKQJQVWUYEZNU-IMJSIDKUSA-N Cys-Asn Chemical compound SC[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O AYKQJQVWUYEZNU-IMJSIDKUSA-N 0.000 description 2
- QDFBJJABJKOLTD-FXQIFTODSA-N Cys-Asn-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QDFBJJABJKOLTD-FXQIFTODSA-N 0.000 description 2
- JIVJXVJMOBVCJF-ZLUOBGJFSA-N Cys-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)C(=O)N JIVJXVJMOBVCJF-ZLUOBGJFSA-N 0.000 description 2
- WVJHEDOLHPZLRV-CIUDSAMLSA-N Cys-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N WVJHEDOLHPZLRV-CIUDSAMLSA-N 0.000 description 2
- TULNGKSILXCZQT-IMJSIDKUSA-N Cys-Asp Chemical compound SC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O TULNGKSILXCZQT-IMJSIDKUSA-N 0.000 description 2
- BCSYBBMFGLHCOA-ACZMJKKPSA-N Cys-Glu-Cys Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BCSYBBMFGLHCOA-ACZMJKKPSA-N 0.000 description 2
- SBORMUFGKSCGEN-XHNCKOQMSA-N Cys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N)C(=O)O SBORMUFGKSCGEN-XHNCKOQMSA-N 0.000 description 2
- ZEXHDOQQYZKOIB-ACZMJKKPSA-N Cys-Glu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZEXHDOQQYZKOIB-ACZMJKKPSA-N 0.000 description 2
- HAYVLBZZBDCKRA-SRVKXCTJSA-N Cys-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N HAYVLBZZBDCKRA-SRVKXCTJSA-N 0.000 description 2
- KXUKWRVYDYIPSQ-CIUDSAMLSA-N Cys-Leu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUKWRVYDYIPSQ-CIUDSAMLSA-N 0.000 description 2
- DYBIDOHFRRUMLW-CIUDSAMLSA-N Cys-Leu-Cys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O DYBIDOHFRRUMLW-CIUDSAMLSA-N 0.000 description 2
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 2
- ORYFTECKJZTNQP-DCAQKATOSA-N Cys-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N ORYFTECKJZTNQP-DCAQKATOSA-N 0.000 description 2
- SMEYEQDCCBHTEF-FXQIFTODSA-N Cys-Pro-Ala Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O SMEYEQDCCBHTEF-FXQIFTODSA-N 0.000 description 2
- ZOKPRHVIFAUJPV-GUBZILKMSA-N Cys-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O ZOKPRHVIFAUJPV-GUBZILKMSA-N 0.000 description 2
- WZJLBUPPZRZNTO-CIUDSAMLSA-N Cys-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N WZJLBUPPZRZNTO-CIUDSAMLSA-N 0.000 description 2
- ABLQPNMKLMFDQU-BIIVOSGPSA-N Cys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CS)N)C(=O)O ABLQPNMKLMFDQU-BIIVOSGPSA-N 0.000 description 2
- IWVNIQXKTIQXCT-SRVKXCTJSA-N Cys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)O IWVNIQXKTIQXCT-SRVKXCTJSA-N 0.000 description 2
- VRJZMZGGAKVSIQ-SRVKXCTJSA-N Cys-Tyr-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VRJZMZGGAKVSIQ-SRVKXCTJSA-N 0.000 description 2
- ALTQTAKGRFLRLR-GUBZILKMSA-N Cys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N ALTQTAKGRFLRLR-GUBZILKMSA-N 0.000 description 2
- 108010090461 DFG peptide Proteins 0.000 description 2
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 2
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 2
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 2
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 2
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 2
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 2
- BUVMZWZNWMKASN-QEJZJMRPSA-N Glu-Asn-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 BUVMZWZNWMKASN-QEJZJMRPSA-N 0.000 description 2
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 2
- KLJMRPIBBLTDGE-ACZMJKKPSA-N Glu-Cys-Asn Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O KLJMRPIBBLTDGE-ACZMJKKPSA-N 0.000 description 2
- FKGNJUCQKXQNRA-NRPADANISA-N Glu-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O FKGNJUCQKXQNRA-NRPADANISA-N 0.000 description 2
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 2
- CAVMESABQIKFKT-IUCAKERBSA-N Glu-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N CAVMESABQIKFKT-IUCAKERBSA-N 0.000 description 2
- VXQOONWNIWFOCS-HGNGGELXSA-N Glu-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N VXQOONWNIWFOCS-HGNGGELXSA-N 0.000 description 2
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 2
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 2
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 2
- OHWJUIXZHVIXJJ-GUBZILKMSA-N Glu-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N OHWJUIXZHVIXJJ-GUBZILKMSA-N 0.000 description 2
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 2
- MCGNJCNXIMQCMN-DCAQKATOSA-N Glu-Met-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCC(O)=O MCGNJCNXIMQCMN-DCAQKATOSA-N 0.000 description 2
- DDXZHOHEABQXSE-NKIYYHGXSA-N Glu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O DDXZHOHEABQXSE-NKIYYHGXSA-N 0.000 description 2
- VHPVBPCCWVDGJL-IRIUXVKKSA-N Glu-Thr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VHPVBPCCWVDGJL-IRIUXVKKSA-N 0.000 description 2
- MIWJDJAMMKHUAR-ZVZYQTTQSA-N Glu-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N MIWJDJAMMKHUAR-ZVZYQTTQSA-N 0.000 description 2
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 2
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 2
- PYUCNHJQQVSPGN-BQBZGAKWSA-N Gly-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)CN=C(N)N PYUCNHJQQVSPGN-BQBZGAKWSA-N 0.000 description 2
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 2
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 2
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 2
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 2
- NMROINAYXCACKF-WHFBIAKZSA-N Gly-Cys-Cys Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O NMROINAYXCACKF-WHFBIAKZSA-N 0.000 description 2
- XXGQRGQPGFYECI-WDSKDSINSA-N Gly-Cys-Glu Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(O)=O XXGQRGQPGFYECI-WDSKDSINSA-N 0.000 description 2
- BIRKKBCSAIHDDF-WDSKDSINSA-N Gly-Glu-Cys Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BIRKKBCSAIHDDF-WDSKDSINSA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- CQIIXEHDSZUSAG-QWRGUYRKSA-N Gly-His-His Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 CQIIXEHDSZUSAG-QWRGUYRKSA-N 0.000 description 2
- DKEXFJVMVGETOO-LURJTMIESA-N Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CN DKEXFJVMVGETOO-LURJTMIESA-N 0.000 description 2
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 2
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 2
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 2
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 2
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 2
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 2
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 2
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 2
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 2
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 2
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 2
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 2
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 2
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 2
- HXKZJLWGSWQKEA-LSJOCFKGSA-N His-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CN=CN1 HXKZJLWGSWQKEA-LSJOCFKGSA-N 0.000 description 2
- UCDWNBFOZCZSNV-AVGNSLFASA-N His-Arg-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O UCDWNBFOZCZSNV-AVGNSLFASA-N 0.000 description 2
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 2
- KNNSUUOHFVVJOP-GUBZILKMSA-N His-Glu-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N KNNSUUOHFVVJOP-GUBZILKMSA-N 0.000 description 2
- ZUPVLBAXUUGKKN-VHSXEESVSA-N His-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CN=CN2)N)C(=O)O ZUPVLBAXUUGKKN-VHSXEESVSA-N 0.000 description 2
- FYTCLUIYTYFGPT-YUMQZZPRSA-N His-Gly-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FYTCLUIYTYFGPT-YUMQZZPRSA-N 0.000 description 2
- BZKDJRSZWLPJNI-SRVKXCTJSA-N His-His-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O BZKDJRSZWLPJNI-SRVKXCTJSA-N 0.000 description 2
- BPOHQCZZSFBSON-KKUMJFAQSA-N His-Leu-His Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BPOHQCZZSFBSON-KKUMJFAQSA-N 0.000 description 2
- TTYKEFZRLKQTHH-MELADBBJSA-N His-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O TTYKEFZRLKQTHH-MELADBBJSA-N 0.000 description 2
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 2
- WSEITRHJRVDTRX-QTKMDUPCSA-N His-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CN=CN1)N)O WSEITRHJRVDTRX-QTKMDUPCSA-N 0.000 description 2
- GIRSNERMXCMDBO-GARJFASQSA-N His-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O GIRSNERMXCMDBO-GARJFASQSA-N 0.000 description 2
- DAKSMIWQZPHRIB-BZSNNMDCSA-N His-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DAKSMIWQZPHRIB-BZSNNMDCSA-N 0.000 description 2
- VTMSUKSRIKCCAD-ULQDDVLXSA-N His-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N VTMSUKSRIKCCAD-ULQDDVLXSA-N 0.000 description 2
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 2
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 2
- MHAJPDPJQMAIIY-UHFFFAOYSA-N Hydrogen peroxide Chemical compound OO MHAJPDPJQMAIIY-UHFFFAOYSA-N 0.000 description 2
- 108091092195 Intron Proteins 0.000 description 2
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 2
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 2
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 2
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 2
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 2
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 2
- CNNQBZRGQATKNY-DCAQKATOSA-N Leu-Arg-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N CNNQBZRGQATKNY-DCAQKATOSA-N 0.000 description 2
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 2
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 2
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 2
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- GZAUZBUKDXYPEH-CIUDSAMLSA-N Leu-Cys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N GZAUZBUKDXYPEH-CIUDSAMLSA-N 0.000 description 2
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 2
- LESXFEZIFXFIQR-LURJTMIESA-N Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(O)=O LESXFEZIFXFIQR-LURJTMIESA-N 0.000 description 2
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 2
- YWYQSLOTVIRCFE-SRVKXCTJSA-N Leu-His-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O YWYQSLOTVIRCFE-SRVKXCTJSA-N 0.000 description 2
- DDEMUMVXNFPDKC-SRVKXCTJSA-N Leu-His-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N DDEMUMVXNFPDKC-SRVKXCTJSA-N 0.000 description 2
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 2
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 2
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 2
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 2
- DCGXHWINSHEPIR-SRVKXCTJSA-N Leu-Lys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N DCGXHWINSHEPIR-SRVKXCTJSA-N 0.000 description 2
- LQUIENKUVKPNIC-ULQDDVLXSA-N Leu-Met-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LQUIENKUVKPNIC-ULQDDVLXSA-N 0.000 description 2
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 2
- WXDRGWBQZIMJDE-ULQDDVLXSA-N Leu-Phe-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O WXDRGWBQZIMJDE-ULQDDVLXSA-N 0.000 description 2
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 2
- MUCIDQMDOYQYBR-IHRRRGAJSA-N Leu-Pro-His Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N MUCIDQMDOYQYBR-IHRRRGAJSA-N 0.000 description 2
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 2
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 2
- CNWDWAMPKVYJJB-NUTKFTJISA-N Leu-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CNWDWAMPKVYJJB-NUTKFTJISA-N 0.000 description 2
- UIIMIKFNIYPDJF-WDSOQIARSA-N Leu-Trp-Met Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCSC)C(O)=O)NC(=O)[C@@H](N)CC(C)C)=CNC2=C1 UIIMIKFNIYPDJF-WDSOQIARSA-N 0.000 description 2
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 2
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 2
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 2
- MSFITIBEMPWCBD-ULQDDVLXSA-N Leu-Val-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MSFITIBEMPWCBD-ULQDDVLXSA-N 0.000 description 2
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 2
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 2
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 2
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 2
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 2
- NLOZZWJNIKKYSC-WDSOQIARSA-N Lys-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 NLOZZWJNIKKYSC-WDSOQIARSA-N 0.000 description 2
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 2
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 2
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 2
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 2
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 2
- OIYWBDBHEGAVST-BZSNNMDCSA-N Lys-His-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OIYWBDBHEGAVST-BZSNNMDCSA-N 0.000 description 2
- SPCHLZUWJTYZFC-IHRRRGAJSA-N Lys-His-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O SPCHLZUWJTYZFC-IHRRRGAJSA-N 0.000 description 2
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 2
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 2
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 2
- GOVDTWNJCBRRBJ-DCAQKATOSA-N Lys-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N GOVDTWNJCBRRBJ-DCAQKATOSA-N 0.000 description 2
- AIXUQKMMBQJZCU-IUCAKERBSA-N Lys-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O AIXUQKMMBQJZCU-IUCAKERBSA-N 0.000 description 2
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 2
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 2
- CTJUSALVKAWFFU-CIUDSAMLSA-N Lys-Ser-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N CTJUSALVKAWFFU-CIUDSAMLSA-N 0.000 description 2
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 2
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 2
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 2
- IEVXCWPVBYCJRZ-IXOXFDKPSA-N Lys-Thr-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IEVXCWPVBYCJRZ-IXOXFDKPSA-N 0.000 description 2
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 2
- CNXOBMMOYZPPGS-NUTKFTJISA-N Lys-Trp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O CNXOBMMOYZPPGS-NUTKFTJISA-N 0.000 description 2
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 2
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 2
- BWECSLVQIWEMSC-IHRRRGAJSA-N Lys-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BWECSLVQIWEMSC-IHRRRGAJSA-N 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 2
- NDYNTQWSJLPEMK-WDSKDSINSA-N Met-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(O)=O NDYNTQWSJLPEMK-WDSKDSINSA-N 0.000 description 2
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 2
- MVBZBRKNZVJEKK-DTWKUNHWSA-N Met-Gly-Pro Chemical compound CSCC[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N MVBZBRKNZVJEKK-DTWKUNHWSA-N 0.000 description 2
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 2
- WUYLWZRHRLLEGB-AVGNSLFASA-N Met-Met-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O WUYLWZRHRLLEGB-AVGNSLFASA-N 0.000 description 2
- BJPQKNHZHUCQNQ-SRVKXCTJSA-N Met-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCSC)N BJPQKNHZHUCQNQ-SRVKXCTJSA-N 0.000 description 2
- ZBLSZPYQQRIHQU-RCWTZXSCSA-N Met-Thr-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ZBLSZPYQQRIHQU-RCWTZXSCSA-N 0.000 description 2
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 2
- 101100268281 Mus musculus Zp2 gene Proteins 0.000 description 2
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 2
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- 238000000636 Northern blotting Methods 0.000 description 2
- 229910019142 PO4 Inorganic materials 0.000 description 2
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 2
- ZWJKVFAYPLPCQB-UNQGMJICSA-N Phe-Arg-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O ZWJKVFAYPLPCQB-UNQGMJICSA-N 0.000 description 2
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 2
- JIYJYFIXQTYDNF-YDHLFZDLSA-N Phe-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N JIYJYFIXQTYDNF-YDHLFZDLSA-N 0.000 description 2
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 2
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 2
- HQCSLJFGZYOXHW-KKUMJFAQSA-N Phe-His-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CS)C(=O)O)N HQCSLJFGZYOXHW-KKUMJFAQSA-N 0.000 description 2
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 2
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 2
- OKQQWSNUSQURLI-JYJNAYRXSA-N Phe-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N OKQQWSNUSQURLI-JYJNAYRXSA-N 0.000 description 2
- GRVMHFCZUIYNKQ-UFYCRDLUSA-N Phe-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GRVMHFCZUIYNKQ-UFYCRDLUSA-N 0.000 description 2
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 2
- ILGCZYGFYQLSDZ-KKUMJFAQSA-N Phe-Ser-His Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ILGCZYGFYQLSDZ-KKUMJFAQSA-N 0.000 description 2
- NQRYJNQNLNOLGT-UHFFFAOYSA-N Piperidine Chemical compound C1CCNCC1 NQRYJNQNLNOLGT-UHFFFAOYSA-N 0.000 description 2
- 229920001213 Polysorbate 20 Polymers 0.000 description 2
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 2
- QBFONMUYNSNKIX-AVGNSLFASA-N Pro-Arg-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QBFONMUYNSNKIX-AVGNSLFASA-N 0.000 description 2
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 2
- GLEOIKLQBZNKJZ-WDSKDSINSA-N Pro-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 GLEOIKLQBZNKJZ-WDSKDSINSA-N 0.000 description 2
- JARJPEMLQAWNBR-GUBZILKMSA-N Pro-Asp-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JARJPEMLQAWNBR-GUBZILKMSA-N 0.000 description 2
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 2
- ZYBUKTMPPFQSHL-JYJNAYRXSA-N Pro-Asp-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZYBUKTMPPFQSHL-JYJNAYRXSA-N 0.000 description 2
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 2
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 2
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 2
- PEYNRYREGPAOAK-LSJOCFKGSA-N Pro-His-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 PEYNRYREGPAOAK-LSJOCFKGSA-N 0.000 description 2
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 2
- NFLNBHLMLYALOO-DCAQKATOSA-N Pro-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 NFLNBHLMLYALOO-DCAQKATOSA-N 0.000 description 2
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 2
- SRBFGSGDNNQABI-FHWLQOOXSA-N Pro-Leu-Trp Chemical compound N([C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C(=O)[C@@H]1CCCN1 SRBFGSGDNNQABI-FHWLQOOXSA-N 0.000 description 2
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 2
- IQAGKQWXVHTPOT-FHWLQOOXSA-N Pro-Lys-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O IQAGKQWXVHTPOT-FHWLQOOXSA-N 0.000 description 2
- BLJMJZOMZRCESA-GUBZILKMSA-N Pro-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BLJMJZOMZRCESA-GUBZILKMSA-N 0.000 description 2
- NTXFLJULRHQMDC-GUBZILKMSA-N Pro-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 NTXFLJULRHQMDC-GUBZILKMSA-N 0.000 description 2
- LGMBKOAPPTYKLC-JYJNAYRXSA-N Pro-Phe-Arg Chemical compound C([C@@H](C(=O)N[C@@H](CCCNC(=N)N)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 LGMBKOAPPTYKLC-JYJNAYRXSA-N 0.000 description 2
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 2
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 2
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 2
- RNEFESSBTOQSAC-DCAQKATOSA-N Pro-Ser-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O RNEFESSBTOQSAC-DCAQKATOSA-N 0.000 description 2
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 2
- VPBQDHMASPJHGY-JYJNAYRXSA-N Pro-Trp-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CO)C(=O)O VPBQDHMASPJHGY-JYJNAYRXSA-N 0.000 description 2
- SHTKRJHDMNSKRM-ULQDDVLXSA-N Pro-Tyr-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O SHTKRJHDMNSKRM-ULQDDVLXSA-N 0.000 description 2
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 2
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 2
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 2
- PGSWNLRYYONGPE-JYJNAYRXSA-N Pro-Val-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PGSWNLRYYONGPE-JYJNAYRXSA-N 0.000 description 2
- 241000700159 Rattus Species 0.000 description 2
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 2
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 2
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 2
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 2
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 2
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 2
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 2
- COAHUSQNSVFYBW-FXQIFTODSA-N Ser-Asn-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O COAHUSQNSVFYBW-FXQIFTODSA-N 0.000 description 2
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 2
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 2
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 2
- FFOKMZOAVHEWET-IMJSIDKUSA-N Ser-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(O)=O FFOKMZOAVHEWET-IMJSIDKUSA-N 0.000 description 2
- WKLJLEXEENIYQE-SRVKXCTJSA-N Ser-Cys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WKLJLEXEENIYQE-SRVKXCTJSA-N 0.000 description 2
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 2
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 2
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 2
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 2
- QGAHMVHBORDHDC-YUMQZZPRSA-N Ser-His-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 QGAHMVHBORDHDC-YUMQZZPRSA-N 0.000 description 2
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 2
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 2
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 2
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 2
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 2
- RQXDSYQXBCRXBT-GUBZILKMSA-N Ser-Met-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RQXDSYQXBCRXBT-GUBZILKMSA-N 0.000 description 2
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 2
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 2
- WBAXJMCUFIXCNI-WDSKDSINSA-N Ser-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WBAXJMCUFIXCNI-WDSKDSINSA-N 0.000 description 2
- QPPYAWVLAVXISR-DCAQKATOSA-N Ser-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QPPYAWVLAVXISR-DCAQKATOSA-N 0.000 description 2
- XZKQVQKUZMAADP-IMJSIDKUSA-N Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(O)=O XZKQVQKUZMAADP-IMJSIDKUSA-N 0.000 description 2
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 2
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 2
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 2
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 2
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 2
- WMZVVNLPHFSUPA-BPUTZDHNSA-N Ser-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 WMZVVNLPHFSUPA-BPUTZDHNSA-N 0.000 description 2
- FVFUOQIYDPAIJR-XIRDDKMYSA-N Ser-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FVFUOQIYDPAIJR-XIRDDKMYSA-N 0.000 description 2
- OSFZCEQJLWCIBG-BZSNNMDCSA-N Ser-Tyr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSFZCEQJLWCIBG-BZSNNMDCSA-N 0.000 description 2
- IAOHCSQDQDWRQU-GUBZILKMSA-N Ser-Val-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IAOHCSQDQDWRQU-GUBZILKMSA-N 0.000 description 2
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 101100545382 Sus scrofa ZP3 gene Proteins 0.000 description 2
- 101150006914 TRP1 gene Proteins 0.000 description 2
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 2
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 2
- IRKWVRSEQFTGGV-VEVYYDQMSA-N Thr-Asn-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IRKWVRSEQFTGGV-VEVYYDQMSA-N 0.000 description 2
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 2
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 2
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 2
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 2
- VULNJDORNLBPNG-SWRJLBSHSA-N Thr-Glu-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VULNJDORNLBPNG-SWRJLBSHSA-N 0.000 description 2
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 2
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 2
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 2
- WPSDXXQRIVKBAY-NKIYYHGXSA-N Thr-His-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O WPSDXXQRIVKBAY-NKIYYHGXSA-N 0.000 description 2
- YUOCMLNTUZAGNF-KLHWPWHYSA-N Thr-His-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N)O YUOCMLNTUZAGNF-KLHWPWHYSA-N 0.000 description 2
- YDWLCDQXLCILCZ-BWAGICSOSA-N Thr-His-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YDWLCDQXLCILCZ-BWAGICSOSA-N 0.000 description 2
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 2
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 2
- XIULAFZYEKSGAJ-IXOXFDKPSA-N Thr-Leu-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XIULAFZYEKSGAJ-IXOXFDKPSA-N 0.000 description 2
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 2
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 2
- YJVJPJPHHFOVMG-VEVYYDQMSA-N Thr-Met-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YJVJPJPHHFOVMG-VEVYYDQMSA-N 0.000 description 2
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 2
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 2
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 2
- NBIIPOKZPUGATB-BWBBJGPYSA-N Thr-Ser-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O NBIIPOKZPUGATB-BWBBJGPYSA-N 0.000 description 2
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 2
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 2
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 2
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 2
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 2
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 2
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 2
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 2
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- TWJDQTTXXZDJKV-BPUTZDHNSA-N Trp-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O TWJDQTTXXZDJKV-BPUTZDHNSA-N 0.000 description 2
- XZSJDSBPEJBEFZ-QRTARXTBSA-N Trp-Asn-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O XZSJDSBPEJBEFZ-QRTARXTBSA-N 0.000 description 2
- LTLBNCDNXQCOLB-UBHSHLNASA-N Trp-Asp-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 LTLBNCDNXQCOLB-UBHSHLNASA-N 0.000 description 2
- CMXACOZDEJYZSK-XIRDDKMYSA-N Trp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N CMXACOZDEJYZSK-XIRDDKMYSA-N 0.000 description 2
- DXYQIGZZWYBXSD-JSGCOSHPSA-N Trp-Pro Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)N1CCC[C@H]1C(O)=O DXYQIGZZWYBXSD-JSGCOSHPSA-N 0.000 description 2
- WMIUTJPFHMMUGY-ZFWWWQNUSA-N Trp-Pro-Gly Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)NCC(=O)O WMIUTJPFHMMUGY-ZFWWWQNUSA-N 0.000 description 2
- XOLLWQIBBLBAHQ-WDSOQIARSA-N Trp-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O XOLLWQIBBLBAHQ-WDSOQIARSA-N 0.000 description 2
- XOSGQKFEIOCPIJ-SZMVWBNQSA-N Trp-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CNC3=CC=CC=C32)N XOSGQKFEIOCPIJ-SZMVWBNQSA-N 0.000 description 2
- IKUMWSDCGQVGHC-UMPQAUOISA-N Trp-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O IKUMWSDCGQVGHC-UMPQAUOISA-N 0.000 description 2
- SUEGAFMNTXXNLR-WFBYXXMGSA-N Trp-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O SUEGAFMNTXXNLR-WFBYXXMGSA-N 0.000 description 2
- VDCGPCSLAJAKBB-XIRDDKMYSA-N Trp-Ser-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N VDCGPCSLAJAKBB-XIRDDKMYSA-N 0.000 description 2
- BOBZBMOTRORUPT-XIRDDKMYSA-N Trp-Ser-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 BOBZBMOTRORUPT-XIRDDKMYSA-N 0.000 description 2
- GEGYPBOPIGNZIF-CWRNSKLLSA-N Trp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O GEGYPBOPIGNZIF-CWRNSKLLSA-N 0.000 description 2
- RWTFCAMQLFNPTK-UMPQAUOISA-N Trp-Val-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)=CNC2=C1 RWTFCAMQLFNPTK-UMPQAUOISA-N 0.000 description 2
- SGFIXFAHVWJKTD-KJEVXHAQSA-N Tyr-Arg-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SGFIXFAHVWJKTD-KJEVXHAQSA-N 0.000 description 2
- FQNUWOHNGJWNLM-QWRGUYRKSA-N Tyr-Cys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FQNUWOHNGJWNLM-QWRGUYRKSA-N 0.000 description 2
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 2
- KOVXHANYYYMBRF-IRIUXVKKSA-N Tyr-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KOVXHANYYYMBRF-IRIUXVKKSA-N 0.000 description 2
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 2
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 2
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 2
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 2
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 2
- WOAQYWUEUYMVGK-ULQDDVLXSA-N Tyr-Lys-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOAQYWUEUYMVGK-ULQDDVLXSA-N 0.000 description 2
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 2
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 2
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 2
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 2
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 2
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 2
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 2
- SMUWZUSWMWVOSL-JYJNAYRXSA-N Tyr-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SMUWZUSWMWVOSL-JYJNAYRXSA-N 0.000 description 2
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 2
- JOQSQZFKFYJKKJ-GUBZILKMSA-N Val-Arg-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N JOQSQZFKFYJKKJ-GUBZILKMSA-N 0.000 description 2
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 2
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 2
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 2
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 2
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 2
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 2
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 2
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 2
- LHADRQBREKTRLR-DCAQKATOSA-N Val-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N LHADRQBREKTRLR-DCAQKATOSA-N 0.000 description 2
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 2
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 2
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 2
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 2
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 2
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 2
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 2
- RFKJNTRMXGCKFE-FHWLQOOXSA-N Val-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC(C)C)C(O)=O)=CNC2=C1 RFKJNTRMXGCKFE-FHWLQOOXSA-N 0.000 description 2
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 2
- OJOMXGVLFKYDKP-QXEWZRGKSA-N Val-Met-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OJOMXGVLFKYDKP-QXEWZRGKSA-N 0.000 description 2
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 2
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 2
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 2
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 2
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 2
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 2
- JQTYTBPCSOAZHI-FXQIFTODSA-N Val-Ser-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N JQTYTBPCSOAZHI-FXQIFTODSA-N 0.000 description 2
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 2
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 2
- SUGRIIAOLCDLBD-ZOBUZTSGSA-N Val-Trp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SUGRIIAOLCDLBD-ZOBUZTSGSA-N 0.000 description 2
- PGBMPFKFKXYROZ-UFYCRDLUSA-N Val-Tyr-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N PGBMPFKFKXYROZ-UFYCRDLUSA-N 0.000 description 2
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 2
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 2
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 2
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 2
- WHNSHJJNWNSTSU-BZSNNMDCSA-N Val-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 WHNSHJJNWNSTSU-BZSNNMDCSA-N 0.000 description 2
- 101710151236 Zona pellucida sperm-binding protein 3 Proteins 0.000 description 2
- 101150088854 Zp3 gene Proteins 0.000 description 2
- LUXUAZKGQZPOBZ-SAXJAHGMSA-N [(3S,4S,5S,6R)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl] (Z)-octadec-9-enoate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OC1O[C@H](CO)[C@@H](O)[C@H](O)[C@@H]1O LUXUAZKGQZPOBZ-SAXJAHGMSA-N 0.000 description 2
- 238000001042 affinity chromatography Methods 0.000 description 2
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- 108010003196 alanyl-prolyl-arginyl-prolyl-glycine Proteins 0.000 description 2
- 108010070783 alanyltyrosine Proteins 0.000 description 2
- RDOXTESZEPMUJZ-UHFFFAOYSA-N anisole Chemical compound COC1=CC=CC=C1 RDOXTESZEPMUJZ-UHFFFAOYSA-N 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- 230000002513 anti-ovulatory effect Effects 0.000 description 2
- 108010006195 arginyl-glycyl-aspartyl-cysteine Proteins 0.000 description 2
- 108010089975 arginyl-glycyl-aspartyl-serine Proteins 0.000 description 2
- WQZGKKKJIJFFOK-FPRJBGLDSA-N beta-D-galactose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-FPRJBGLDSA-N 0.000 description 2
- 102000005936 beta-Galactosidase Human genes 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 229960002685 biotin Drugs 0.000 description 2
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N biotin Natural products N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- 235000020958 biotin Nutrition 0.000 description 2
- 239000011616 biotin Substances 0.000 description 2
- 229940098773 bovine serum albumin Drugs 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 229940124558 contraceptive agent Drugs 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 230000003292 diminished effect Effects 0.000 description 2
- 108010043837 egg surface sperm receptor Proteins 0.000 description 2
- 230000002708 enhancing effect Effects 0.000 description 2
- 230000001158 estrous effect Effects 0.000 description 2
- 230000004720 fertilization Effects 0.000 description 2
- 230000004545 gene duplication Effects 0.000 description 2
- 230000013595 glycosylation Effects 0.000 description 2
- 238000006206 glycosylation reaction Methods 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 2
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 238000007654 immersion Methods 0.000 description 2
- 230000016784 immunoglobulin production Effects 0.000 description 2
- 230000009027 insemination Effects 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 108010045069 keyhole-limpet hemocyanin Proteins 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010091871 leucylmethionine Proteins 0.000 description 2
- 238000012423 maintenance Methods 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 2
- 108700023046 methionyl-leucyl-phenylalanine Proteins 0.000 description 2
- 208000015124 ovarian disease Diseases 0.000 description 2
- 201000004535 ovarian dysfunction Diseases 0.000 description 2
- 231100000543 ovarian dysfunction Toxicity 0.000 description 2
- 230000000624 ovulatory effect Effects 0.000 description 2
- 230000001766 physiological effect Effects 0.000 description 2
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 2
- 229920000642 polymer Polymers 0.000 description 2
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 2
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 2
- 230000004481 post-translational protein modification Effects 0.000 description 2
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 230000002797 proteolythic effect Effects 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 230000001850 reproductive effect Effects 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 229910000029 sodium carbonate Inorganic materials 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 230000001954 sterilising effect Effects 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- 210000001550 testis Anatomy 0.000 description 2
- 108010084932 tryptophyl-proline Proteins 0.000 description 2
- 108010044292 tryptophyltyrosine Proteins 0.000 description 2
- 238000000539 two dimensional gel electrophoresis Methods 0.000 description 2
- 238000001419 two-dimensional polyacrylamide gel electrophoresis Methods 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 230000000007 visual effect Effects 0.000 description 2
- 229910001868 water Inorganic materials 0.000 description 2
- 101150069452 z gene Proteins 0.000 description 2
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 2
- CEHZCZCQHUNAJF-AVGNSLFASA-N (2s)-1-[2-[[(2s)-1-[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N1[C@H](C(O)=O)CCC1 CEHZCZCQHUNAJF-AVGNSLFASA-N 0.000 description 1
- AJDRTCHWJPWAGM-QAETUUGQSA-N (2s)-1-[2-[[(2s)-1-[(2s)-5-(diaminomethylideneamino)-2-[[(2s)-pyrrolidine-2-carbonyl]amino]pentanoyl]pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 AJDRTCHWJPWAGM-QAETUUGQSA-N 0.000 description 1
- VWWKKDNCCLAGRM-GVXVVHGQSA-N (2s)-2-[[2-[[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]propanoyl]amino]acetyl]amino]-3-methylbutanoic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VWWKKDNCCLAGRM-GVXVVHGQSA-N 0.000 description 1
- QYUKEUAGMBNKFN-ACUXCFJPSA-N (2s,3r)-2-amino-3-hydroxybutanoic acid;(2s)-2-amino-3-(4-hydroxyphenyl)propanoic acid Chemical compound C[C@@H](O)[C@H](N)C(O)=O.OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QYUKEUAGMBNKFN-ACUXCFJPSA-N 0.000 description 1
- JDRAOGVAQOVDEB-KTKRTIGZSA-N (3-hydroxy-2,3,3a,5,6,6a-hexahydrofuro[3,2-b]furan-6-yl) (z)-octadec-9-enoate Chemical compound OC1COC2C(OC(=O)CCCCCCC\C=C/CCCCCCCC)COC21 JDRAOGVAQOVDEB-KTKRTIGZSA-N 0.000 description 1
- UGXDVELKRYZPDM-XLXQKPBQSA-N (4r)-4-[[(2s,3r)-2-[[(2r)-2-[(2r,3r,4r,5r)-2-acetamido-4,5,6-trihydroxy-1-oxohexan-3-yl]oxypropanoyl]amino]-3-hydroxybutanoyl]amino]-5-amino-5-oxopentanoic acid Chemical compound OC(=O)CC[C@H](C(N)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](C)O[C@@H]([C@H](O)[C@H](O)CO)[C@@H](NC(C)=O)C=O UGXDVELKRYZPDM-XLXQKPBQSA-N 0.000 description 1
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- DHBXNPKRAUYBTH-UHFFFAOYSA-N 1,1-ethanedithiol Chemical compound CC(S)S DHBXNPKRAUYBTH-UHFFFAOYSA-N 0.000 description 1
- 101150090724 3 gene Proteins 0.000 description 1
- IMIZPWSVYADSCN-UHFFFAOYSA-N 4-methyl-2-[[4-methyl-2-[[4-methyl-2-(pyrrolidine-2-carbonylamino)pentanoyl]amino]pentanoyl]amino]pentanoic acid Chemical compound CC(C)CC(C(O)=O)NC(=O)C(CC(C)C)NC(=O)C(CC(C)C)NC(=O)C1CCCN1 IMIZPWSVYADSCN-UHFFFAOYSA-N 0.000 description 1
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 1
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 1
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 1
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 1
- YIQAOPNCIJVKDN-XKNYDFJKSA-N Ala-Asn-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YIQAOPNCIJVKDN-XKNYDFJKSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- NFDVJAKFMXHJEQ-HERUPUMHSA-N Ala-Asp-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N NFDVJAKFMXHJEQ-HERUPUMHSA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- OPZJWMJPCNNZNT-DCAQKATOSA-N Ala-Leu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N OPZJWMJPCNNZNT-DCAQKATOSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- IAUSCRHURCZUJP-CIUDSAMLSA-N Ala-Lys-Cys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CS)C(O)=O IAUSCRHURCZUJP-CIUDSAMLSA-N 0.000 description 1
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 1
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 1
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- RNHKOQHGYMTHFR-UBHSHLNASA-N Ala-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 RNHKOQHGYMTHFR-UBHSHLNASA-N 0.000 description 1
- IPWKGIFRRBGCJO-IMJSIDKUSA-N Ala-Ser Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](CO)C([O-])=O IPWKGIFRRBGCJO-IMJSIDKUSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- AUFACLFHBAGZEN-ZLUOBGJFSA-N Ala-Ser-Cys Chemical compound N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O AUFACLFHBAGZEN-ZLUOBGJFSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- BUQICHWNXBIBOG-LMVFSUKVSA-N Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)N BUQICHWNXBIBOG-LMVFSUKVSA-N 0.000 description 1
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 1
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 1
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- VRZDJJWOFXMFRO-ZFWWWQNUSA-N Arg-Gly-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O VRZDJJWOFXMFRO-ZFWWWQNUSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- MMGCRPZQZWTZTA-IHRRRGAJSA-N Arg-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N MMGCRPZQZWTZTA-IHRRRGAJSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- ZEBDYGZVMMKZNB-SRVKXCTJSA-N Arg-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N ZEBDYGZVMMKZNB-SRVKXCTJSA-N 0.000 description 1
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 1
- QHVRVUNEAIFTEK-SZMVWBNQSA-N Arg-Pro-Trp Chemical compound N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O QHVRVUNEAIFTEK-SZMVWBNQSA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- FBXMCPLCVYUWBO-BPUTZDHNSA-N Arg-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N FBXMCPLCVYUWBO-BPUTZDHNSA-N 0.000 description 1
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 description 1
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 1
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 1
- JKRPBTQDPJSQIT-RCWTZXSCSA-N Arg-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O JKRPBTQDPJSQIT-RCWTZXSCSA-N 0.000 description 1
- XMGVWQWEWWULNS-BPUTZDHNSA-N Arg-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XMGVWQWEWWULNS-BPUTZDHNSA-N 0.000 description 1
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 1
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 1
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 description 1
- 240000003291 Armoracia rusticana Species 0.000 description 1
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 1
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 1
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 1
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 1
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 1
- PTNFNTOBUDWHNZ-GUBZILKMSA-N Asn-Arg-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O PTNFNTOBUDWHNZ-GUBZILKMSA-N 0.000 description 1
- LUVODTFFSXVOAG-ACZMJKKPSA-N Asn-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N LUVODTFFSXVOAG-ACZMJKKPSA-N 0.000 description 1
- PLVAAIPKSGUXDV-WHFBIAKZSA-N Asn-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)N PLVAAIPKSGUXDV-WHFBIAKZSA-N 0.000 description 1
- VXLBDJWTONZHJN-YUMQZZPRSA-N Asn-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N VXLBDJWTONZHJN-YUMQZZPRSA-N 0.000 description 1
- WQLJRNRLHWJIRW-KKUMJFAQSA-N Asn-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)O WQLJRNRLHWJIRW-KKUMJFAQSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- CDGHMJJJHYKMPA-DLOVCJGASA-N Asn-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)N)N CDGHMJJJHYKMPA-DLOVCJGASA-N 0.000 description 1
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 1
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 1
- DOURAOODTFJRIC-CIUDSAMLSA-N Asn-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N DOURAOODTFJRIC-CIUDSAMLSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- VBKIFHUVGLOJKT-FKZODXBYSA-N Asn-Thr Chemical compound C[C@@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)N)O VBKIFHUVGLOJKT-FKZODXBYSA-N 0.000 description 1
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 1
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 1
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- ICAYWNTWHRRAQP-FXQIFTODSA-N Asp-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N ICAYWNTWHRRAQP-FXQIFTODSA-N 0.000 description 1
- MUWDILPCTSMUHI-ZLUOBGJFSA-N Asp-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)O MUWDILPCTSMUHI-ZLUOBGJFSA-N 0.000 description 1
- VBVKSAFJPVXMFJ-CIUDSAMLSA-N Asp-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N VBVKSAFJPVXMFJ-CIUDSAMLSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- AKPLMZMNJGNUKT-ZLUOBGJFSA-N Asp-Asp-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O AKPLMZMNJGNUKT-ZLUOBGJFSA-N 0.000 description 1
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 1
- NZJDBCYBYCUEDC-UBHSHLNASA-N Asp-Cys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N NZJDBCYBYCUEDC-UBHSHLNASA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 1
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- BIVYLQMZPHDUIH-WHFBIAKZSA-N Asp-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)O BIVYLQMZPHDUIH-WHFBIAKZSA-N 0.000 description 1
- JOCQXVJCTCEFAZ-CIUDSAMLSA-N Asp-His-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O JOCQXVJCTCEFAZ-CIUDSAMLSA-N 0.000 description 1
- CMCIMCAQIULNDJ-CIUDSAMLSA-N Asp-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N CMCIMCAQIULNDJ-CIUDSAMLSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 1
- RRUWMFBLFLUZSI-LPEHRKFASA-N Asp-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N RRUWMFBLFLUZSI-LPEHRKFASA-N 0.000 description 1
- KRQFMDNIUOVRIF-KKUMJFAQSA-N Asp-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N KRQFMDNIUOVRIF-KKUMJFAQSA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 1
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 1
- XUVTWGPERWIERB-IHRRRGAJSA-N Asp-Pro-Phe Chemical compound N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O XUVTWGPERWIERB-IHRRRGAJSA-N 0.000 description 1
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 1
- UXIPUCUHQBIQOS-SRVKXCTJSA-N Asp-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UXIPUCUHQBIQOS-SRVKXCTJSA-N 0.000 description 1
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 1
- 241001203868 Autographa californica Species 0.000 description 1
- 239000004135 Bone phosphate Substances 0.000 description 1
- BIZULEWPNGZPNN-WBGPXRNDSA-N C(CCCC[C@@H]1SC[C@@H]2NC(=O)N[C@H]12)(=O)NC(C(=O)O)CCCC Chemical compound C(CCCC[C@@H]1SC[C@@H]2NC(=O)N[C@H]12)(=O)NC(C(=O)O)CCCC BIZULEWPNGZPNN-WBGPXRNDSA-N 0.000 description 1
- 101100328086 Caenorhabditis elegans cla-1 gene Proteins 0.000 description 1
- BVKZGUZCCUSVTD-UHFFFAOYSA-L Carbonate Chemical compound [O-]C([O-])=O BVKZGUZCCUSVTD-UHFFFAOYSA-L 0.000 description 1
- 241001227713 Chiron Species 0.000 description 1
- KKZHXOOZHFABQQ-UWJYBYFXSA-N Cys-Ala-Tyr Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKZHXOOZHFABQQ-UWJYBYFXSA-N 0.000 description 1
- XXDLUZLKHOVPNW-IHRRRGAJSA-N Cys-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)O XXDLUZLKHOVPNW-IHRRRGAJSA-N 0.000 description 1
- HRJLVSQKBLZHSR-ZLUOBGJFSA-N Cys-Asn-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O HRJLVSQKBLZHSR-ZLUOBGJFSA-N 0.000 description 1
- GSNRZJNHMVMOFV-ACZMJKKPSA-N Cys-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N GSNRZJNHMVMOFV-ACZMJKKPSA-N 0.000 description 1
- XABFFGOGKOORCG-CIUDSAMLSA-N Cys-Asp-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XABFFGOGKOORCG-CIUDSAMLSA-N 0.000 description 1
- BMHBJCVEXUBGFI-BIIVOSGPSA-N Cys-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CS)N)C(=O)O BMHBJCVEXUBGFI-BIIVOSGPSA-N 0.000 description 1
- RRJOQIBQVZDVCW-SRVKXCTJSA-N Cys-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N RRJOQIBQVZDVCW-SRVKXCTJSA-N 0.000 description 1
- DIUBVGXMXONJCF-KKUMJFAQSA-N Cys-His-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DIUBVGXMXONJCF-KKUMJFAQSA-N 0.000 description 1
- XLLSMEFANRROJE-GUBZILKMSA-N Cys-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XLLSMEFANRROJE-GUBZILKMSA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- HJXSYJVCMUOUNY-SRVKXCTJSA-N Cys-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N HJXSYJVCMUOUNY-SRVKXCTJSA-N 0.000 description 1
- WYVKPHCYMTWUCW-YUPRTTJUSA-N Cys-Thr Chemical compound C[C@@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)N)O WYVKPHCYMTWUCW-YUPRTTJUSA-N 0.000 description 1
- OEDPLIBVQGRKGZ-AVGNSLFASA-N Cys-Tyr-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O OEDPLIBVQGRKGZ-AVGNSLFASA-N 0.000 description 1
- CLEFUAZULXANBU-MELADBBJSA-N Cys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CS)N)C(=O)O CLEFUAZULXANBU-MELADBBJSA-N 0.000 description 1
- MQQLYEHXSBJTRK-FXQIFTODSA-N Cys-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N MQQLYEHXSBJTRK-FXQIFTODSA-N 0.000 description 1
- QQAYIVHVRFJICE-AEJSXWLSSA-N Cys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N QQAYIVHVRFJICE-AEJSXWLSSA-N 0.000 description 1
- 230000003682 DNA packaging effect Effects 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 229920001917 Ficoll Polymers 0.000 description 1
- 108010001496 Galectin 2 Proteins 0.000 description 1
- 102100021735 Galectin-2 Human genes 0.000 description 1
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 1
- GCYFUZJHAXJKKE-KKUMJFAQSA-N Glu-Arg-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GCYFUZJHAXJKKE-KKUMJFAQSA-N 0.000 description 1
- TUTIHHSZKFBMHM-WHFBIAKZSA-N Glu-Asn Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O TUTIHHSZKFBMHM-WHFBIAKZSA-N 0.000 description 1
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 1
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 1
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 1
- RQNYYRHRKSVKAB-GUBZILKMSA-N Glu-Cys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O RQNYYRHRKSVKAB-GUBZILKMSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- JHSRJMUJOGLIHK-GUBZILKMSA-N Glu-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N JHSRJMUJOGLIHK-GUBZILKMSA-N 0.000 description 1
- KJBGAZSLZAQDPV-KKUMJFAQSA-N Glu-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KJBGAZSLZAQDPV-KKUMJFAQSA-N 0.000 description 1
- CBWKURKPYSLMJV-SOUVJXGZSA-N Glu-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CBWKURKPYSLMJV-SOUVJXGZSA-N 0.000 description 1
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 1
- HMJULNMJWOZNFI-XHNCKOQMSA-N Glu-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N)C(=O)O HMJULNMJWOZNFI-XHNCKOQMSA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 1
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 description 1
- VJVAQZYGLMJPTK-QEJZJMRPSA-N Glu-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VJVAQZYGLMJPTK-QEJZJMRPSA-N 0.000 description 1
- UCZXXMREFIETQW-AVGNSLFASA-N Glu-Tyr-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O UCZXXMREFIETQW-AVGNSLFASA-N 0.000 description 1
- KXRORHJIRAOQPG-SOUVJXGZSA-N Glu-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KXRORHJIRAOQPG-SOUVJXGZSA-N 0.000 description 1
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 1
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 1
- JPXNYFOHTHSREU-UWVGGRQHSA-N Gly-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN JPXNYFOHTHSREU-UWVGGRQHSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 1
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 1
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 1
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- YIWFXZNIBQBFHR-LURJTMIESA-N Gly-His Chemical compound [NH3+]CC(=O)N[C@H](C([O-])=O)CC1=CN=CN1 YIWFXZNIBQBFHR-LURJTMIESA-N 0.000 description 1
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- YKJUITHASJAGHO-HOTGVXAUSA-N Gly-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN YKJUITHASJAGHO-HOTGVXAUSA-N 0.000 description 1
- SJLKKOZFHSJJAW-YUMQZZPRSA-N Gly-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN SJLKKOZFHSJJAW-YUMQZZPRSA-N 0.000 description 1
- ZWRDOVYMQAAISL-UWVGGRQHSA-N Gly-Met-Lys Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCCN ZWRDOVYMQAAISL-UWVGGRQHSA-N 0.000 description 1
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 1
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- FXTUGWXZTFMTIV-GJZGRUSLSA-N Gly-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN FXTUGWXZTFMTIV-GJZGRUSLSA-N 0.000 description 1
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 1
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- YPLYIXGKCRQZGW-SRVKXCTJSA-N His-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YPLYIXGKCRQZGW-SRVKXCTJSA-N 0.000 description 1
- SOFSRBYHDINIRG-QTKMDUPCSA-N His-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N)O SOFSRBYHDINIRG-QTKMDUPCSA-N 0.000 description 1
- OBTMRGFRLJBSFI-GARJFASQSA-N His-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O OBTMRGFRLJBSFI-GARJFASQSA-N 0.000 description 1
- HRGGKHFHRSFSDE-CIUDSAMLSA-N His-Asn-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N HRGGKHFHRSFSDE-CIUDSAMLSA-N 0.000 description 1
- TVRMJKNELJKNRS-GUBZILKMSA-N His-Glu-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N TVRMJKNELJKNRS-GUBZILKMSA-N 0.000 description 1
- YTKOTXRIWQHSAZ-GUBZILKMSA-N His-Glu-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N YTKOTXRIWQHSAZ-GUBZILKMSA-N 0.000 description 1
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 1
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 1
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 1
- AKAPKBNIVNPIPO-KKUMJFAQSA-N His-His-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 AKAPKBNIVNPIPO-KKUMJFAQSA-N 0.000 description 1
- CNHSMSFYVARZLI-YJRXYDGGSA-N His-His-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CNHSMSFYVARZLI-YJRXYDGGSA-N 0.000 description 1
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 1
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 1
- PGRPSOUCWRBWKZ-DLOVCJGASA-N His-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 PGRPSOUCWRBWKZ-DLOVCJGASA-N 0.000 description 1
- DPQIPEAHIYMUEJ-IHRRRGAJSA-N His-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N DPQIPEAHIYMUEJ-IHRRRGAJSA-N 0.000 description 1
- BCZFOHDMCDXPDA-BZSNNMDCSA-N His-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CN=CN2)N)O BCZFOHDMCDXPDA-BZSNNMDCSA-N 0.000 description 1
- YIGCZZKZFMNSIU-RWMBFGLXSA-N His-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N YIGCZZKZFMNSIU-RWMBFGLXSA-N 0.000 description 1
- HYWZHNUGAYVEEW-KKUMJFAQSA-N His-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HYWZHNUGAYVEEW-KKUMJFAQSA-N 0.000 description 1
- SOYCWSKCUVDLMC-AVGNSLFASA-N His-Pro-Arg Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CCCNC(=N)N)C(=O)O SOYCWSKCUVDLMC-AVGNSLFASA-N 0.000 description 1
- BZAQOPHNBFOOJS-DCAQKATOSA-N His-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O BZAQOPHNBFOOJS-DCAQKATOSA-N 0.000 description 1
- KRBMQYPTDYSENE-BQBZGAKWSA-N His-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 KRBMQYPTDYSENE-BQBZGAKWSA-N 0.000 description 1
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 1
- WRPDZHJNLYNFFT-GEVIPFJHSA-N His-Thr Chemical compound C[C@@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O WRPDZHJNLYNFFT-GEVIPFJHSA-N 0.000 description 1
- BRQKGRLDDDQWQJ-MBLNEYKQSA-N His-Thr-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O BRQKGRLDDDQWQJ-MBLNEYKQSA-N 0.000 description 1
- VXZZUXWAOMWWJH-QTKMDUPCSA-N His-Thr-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VXZZUXWAOMWWJH-QTKMDUPCSA-N 0.000 description 1
- HTOOKGDPMXSJSY-STQMWFEESA-N His-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 HTOOKGDPMXSJSY-STQMWFEESA-N 0.000 description 1
- FRDFAWHTPDKRHG-ULQDDVLXSA-N His-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CN=CN1 FRDFAWHTPDKRHG-ULQDDVLXSA-N 0.000 description 1
- JATYGDHMDRAISQ-KKUMJFAQSA-N His-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O JATYGDHMDRAISQ-KKUMJFAQSA-N 0.000 description 1
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 1
- 101100545378 Homo sapiens ZP3 gene Proteins 0.000 description 1
- QLROSWPKSBORFJ-BQBZGAKWSA-N L-Prolyl-L-glutamic acid Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 QLROSWPKSBORFJ-BQBZGAKWSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- QOOWRKBDDXQRHC-BQBZGAKWSA-N L-lysyl-L-alanine Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN QOOWRKBDDXQRHC-BQBZGAKWSA-N 0.000 description 1
- RNKSNIBMTUYWSH-YFKPBYRVSA-N L-prolylglycine Chemical compound [O-]C(=O)CNC(=O)[C@@H]1CCC[NH2+]1 RNKSNIBMTUYWSH-YFKPBYRVSA-N 0.000 description 1
- 108010054278 Lac Repressors Proteins 0.000 description 1
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 1
- GPXFZVUVPCFTMG-AVGNSLFASA-N Leu-Arg-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C GPXFZVUVPCFTMG-AVGNSLFASA-N 0.000 description 1
- MLTRLIITQPXHBJ-BQBZGAKWSA-N Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O MLTRLIITQPXHBJ-BQBZGAKWSA-N 0.000 description 1
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 1
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 1
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 1
- VFQOCUQGMUXTJR-DCAQKATOSA-N Leu-Cys-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(=O)O)N VFQOCUQGMUXTJR-DCAQKATOSA-N 0.000 description 1
- HUEBCHPSXSQUGN-GARJFASQSA-N Leu-Cys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N HUEBCHPSXSQUGN-GARJFASQSA-N 0.000 description 1
- PIHFVNPEAHFNLN-KKUMJFAQSA-N Leu-Cys-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N PIHFVNPEAHFNLN-KKUMJFAQSA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 1
- XWOBNBRUDDUEEY-UWVGGRQHSA-N Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XWOBNBRUDDUEEY-UWVGGRQHSA-N 0.000 description 1
- AOFYPTOHESIBFZ-KKUMJFAQSA-N Leu-His-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O AOFYPTOHESIBFZ-KKUMJFAQSA-N 0.000 description 1
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 1
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 1
- UCNNZELZXFXXJQ-BZSNNMDCSA-N Leu-Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCNNZELZXFXXJQ-BZSNNMDCSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 1
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 1
- VTJUNIYRYIAIHF-IUCAKERBSA-N Leu-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O VTJUNIYRYIAIHF-IUCAKERBSA-N 0.000 description 1
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 1
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- UCXQIIIFOOGYEM-ULQDDVLXSA-N Leu-Pro-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCXQIIIFOOGYEM-ULQDDVLXSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- UFPLDOKWDNTTRP-ULQDDVLXSA-N Leu-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=C(O)C=C1 UFPLDOKWDNTTRP-ULQDDVLXSA-N 0.000 description 1
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 1
- QQXJROOJCMIHIV-AVGNSLFASA-N Leu-Val-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O QQXJROOJCMIHIV-AVGNSLFASA-N 0.000 description 1
- 239000006137 Luria-Bertani broth Substances 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 1
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 1
- IBQMEXQYZMVIFU-SRVKXCTJSA-N Lys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N IBQMEXQYZMVIFU-SRVKXCTJSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 1
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 1
- RLZDUFRBMQNYIJ-YUMQZZPRSA-N Lys-Cys-Gly Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N RLZDUFRBMQNYIJ-YUMQZZPRSA-N 0.000 description 1
- QQYRCUXKLDGCQN-SRVKXCTJSA-N Lys-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N QQYRCUXKLDGCQN-SRVKXCTJSA-N 0.000 description 1
- KSFQPRLZAUXXPT-GARJFASQSA-N Lys-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)C(=O)O KSFQPRLZAUXXPT-GARJFASQSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- UETQMSASAVBGJY-QWRGUYRKSA-N Lys-Gly-His Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 UETQMSASAVBGJY-QWRGUYRKSA-N 0.000 description 1
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- URGPVYGVWLIRGT-DCAQKATOSA-N Lys-Met-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O URGPVYGVWLIRGT-DCAQKATOSA-N 0.000 description 1
- ZCWWVXAXWUAEPZ-SRVKXCTJSA-N Lys-Met-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZCWWVXAXWUAEPZ-SRVKXCTJSA-N 0.000 description 1
- IPSDPDAOSAEWCN-RHYQMDGZSA-N Lys-Met-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IPSDPDAOSAEWCN-RHYQMDGZSA-N 0.000 description 1
- ZJSZPXISKMDJKQ-JYJNAYRXSA-N Lys-Phe-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=CC=C1 ZJSZPXISKMDJKQ-JYJNAYRXSA-N 0.000 description 1
- UQJOKDAYFULYIX-AVGNSLFASA-N Lys-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 UQJOKDAYFULYIX-AVGNSLFASA-N 0.000 description 1
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 1
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 1
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- WAAZECNCPVGPIV-RHYQMDGZSA-N Lys-Thr-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O WAAZECNCPVGPIV-RHYQMDGZSA-N 0.000 description 1
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 1
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 1
- DTICLBJHRYSJLH-GUBZILKMSA-N Met-Ala-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O DTICLBJHRYSJLH-GUBZILKMSA-N 0.000 description 1
- IVCPHARVJUYDPA-FXQIFTODSA-N Met-Asn-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IVCPHARVJUYDPA-FXQIFTODSA-N 0.000 description 1
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 1
- QTZXSYBVOSXBEJ-WDSKDSINSA-N Met-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O QTZXSYBVOSXBEJ-WDSKDSINSA-N 0.000 description 1
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 1
- RNAGAJXCSPDPRK-KKUMJFAQSA-N Met-Glu-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 RNAGAJXCSPDPRK-KKUMJFAQSA-N 0.000 description 1
- UDOYVQQKQHZYMB-DCAQKATOSA-N Met-Met-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDOYVQQKQHZYMB-DCAQKATOSA-N 0.000 description 1
- WXXNVZMWHOLNRJ-AVGNSLFASA-N Met-Pro-Lys Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O WXXNVZMWHOLNRJ-AVGNSLFASA-N 0.000 description 1
- PHURAEXVWLDIGT-LPEHRKFASA-N Met-Ser-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N PHURAEXVWLDIGT-LPEHRKFASA-N 0.000 description 1
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 1
- NDJSSFWDYDUQID-YTWAJWBKSA-N Met-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N)O NDJSSFWDYDUQID-YTWAJWBKSA-N 0.000 description 1
- OVTOTTGZBWXLFU-QXEWZRGKSA-N Met-Val-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O OVTOTTGZBWXLFU-QXEWZRGKSA-N 0.000 description 1
- CQRGINSEMFBACV-WPRPVWTQSA-N Met-Val-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O CQRGINSEMFBACV-WPRPVWTQSA-N 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- 101000818878 Mus musculus Zona pellucida sperm-binding protein 1 Proteins 0.000 description 1
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 1
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- 108091005461 Nucleic proteins Chemical group 0.000 description 1
- 108700020497 Nucleopolyhedrovirus polyhedrin Proteins 0.000 description 1
- 108010058846 Ovalbumin Proteins 0.000 description 1
- 239000005662 Paraffin oil Substances 0.000 description 1
- 108010033276 Peptide Fragments Proteins 0.000 description 1
- 102000007079 Peptide Fragments Human genes 0.000 description 1
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 1
- OZILORBBPKKGRI-RYUDHWBXSA-N Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 OZILORBBPKKGRI-RYUDHWBXSA-N 0.000 description 1
- MQWISMJKHOUEMW-ULQDDVLXSA-N Phe-Arg-His Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 MQWISMJKHOUEMW-ULQDDVLXSA-N 0.000 description 1
- GNUCSNWOCQFMMC-UFYCRDLUSA-N Phe-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 GNUCSNWOCQFMMC-UFYCRDLUSA-N 0.000 description 1
- BRDYYVQTEJVRQT-HRCADAONSA-N Phe-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BRDYYVQTEJVRQT-HRCADAONSA-N 0.000 description 1
- IQXOZIDWLZYYAW-IHRRRGAJSA-N Phe-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IQXOZIDWLZYYAW-IHRRRGAJSA-N 0.000 description 1
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 1
- OHUXOEXBXPZKPT-STQMWFEESA-N Phe-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CC=CC=C1 OHUXOEXBXPZKPT-STQMWFEESA-N 0.000 description 1
- SFKOEHXABNPLRT-KBPBESRZSA-N Phe-His-Gly Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)NCC(O)=O SFKOEHXABNPLRT-KBPBESRZSA-N 0.000 description 1
- VADLTGVIOIOKGM-BZSNNMDCSA-N Phe-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 VADLTGVIOIOKGM-BZSNNMDCSA-N 0.000 description 1
- BEEVXUYVEHXWRQ-YESZJQIVSA-N Phe-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O BEEVXUYVEHXWRQ-YESZJQIVSA-N 0.000 description 1
- SPXWRYVHOZVYBU-ULQDDVLXSA-N Phe-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N SPXWRYVHOZVYBU-ULQDDVLXSA-N 0.000 description 1
- XMQSOOJRRVEHRO-ULQDDVLXSA-N Phe-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMQSOOJRRVEHRO-ULQDDVLXSA-N 0.000 description 1
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 1
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 1
- YVIVIQWMNCWUFS-UFYCRDLUSA-N Phe-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N YVIVIQWMNCWUFS-UFYCRDLUSA-N 0.000 description 1
- PBWNICYZGJQKJV-BZSNNMDCSA-N Phe-Phe-Cys Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(O)=O PBWNICYZGJQKJV-BZSNNMDCSA-N 0.000 description 1
- ROHDXJUFQVRDAV-UWVGGRQHSA-N Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ROHDXJUFQVRDAV-UWVGGRQHSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 1
- JLDZQPPLTJTJLE-IHPCNDPISA-N Phe-Trp-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JLDZQPPLTJTJLE-IHPCNDPISA-N 0.000 description 1
- LKRUQZQZMXMKEQ-SFJXLCSZSA-N Phe-Trp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LKRUQZQZMXMKEQ-SFJXLCSZSA-N 0.000 description 1
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 239000004743 Polypropylene Substances 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- DRVIASBABBMZTF-GUBZILKMSA-N Pro-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1 DRVIASBABBMZTF-GUBZILKMSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 1
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 1
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 1
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 1
- QCARZLHECSFOGG-CIUDSAMLSA-N Pro-Glu-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O QCARZLHECSFOGG-CIUDSAMLSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- JUJGNDZIKKQMDJ-IHRRRGAJSA-N Pro-His-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O JUJGNDZIKKQMDJ-IHRRRGAJSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 1
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 1
- YYARMJSFDLIDFS-FKBYEOEOSA-N Pro-Phe-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O YYARMJSFDLIDFS-FKBYEOEOSA-N 0.000 description 1
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 1
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- PKHDJFHFMGQMPS-RCWTZXSCSA-N Pro-Thr-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKHDJFHFMGQMPS-RCWTZXSCSA-N 0.000 description 1
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 1
- GBUNEGKQPSAMNK-QTKMDUPCSA-N Pro-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2)O GBUNEGKQPSAMNK-QTKMDUPCSA-N 0.000 description 1
- GXWRTSIVLSQACD-RCWTZXSCSA-N Pro-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1)O GXWRTSIVLSQACD-RCWTZXSCSA-N 0.000 description 1
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 1
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- RJTUIDFUUHPJMP-FHWLQOOXSA-N Pro-Trp-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC4=CN=CN4)C(=O)O RJTUIDFUUHPJMP-FHWLQOOXSA-N 0.000 description 1
- ZYJMLBCDFPIGNL-JYJNAYRXSA-N Pro-Tyr-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O ZYJMLBCDFPIGNL-JYJNAYRXSA-N 0.000 description 1
- QKWYXRPICJEQAJ-KJEVXHAQSA-N Pro-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@@H]2CCCN2)O QKWYXRPICJEQAJ-KJEVXHAQSA-N 0.000 description 1
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 1
- OOZJHTXCLJUODH-QXEWZRGKSA-N Pro-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 OOZJHTXCLJUODH-QXEWZRGKSA-N 0.000 description 1
- STGVYUTZKGPRCI-GUBZILKMSA-N Pro-Val-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 STGVYUTZKGPRCI-GUBZILKMSA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- 108020005091 Replication Origin Proteins 0.000 description 1
- 241000282695 Saimiri Species 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- VBKBDLMWICBSCY-IMJSIDKUSA-N Ser-Asp Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O VBKBDLMWICBSCY-IMJSIDKUSA-N 0.000 description 1
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 1
- DSSOYPJWSWFOLK-CIUDSAMLSA-N Ser-Cys-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O DSSOYPJWSWFOLK-CIUDSAMLSA-N 0.000 description 1
- SWIQQMYVHIXPEK-FXQIFTODSA-N Ser-Cys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O SWIQQMYVHIXPEK-FXQIFTODSA-N 0.000 description 1
- WOUIMBGNEUWXQG-VKHMYHEASA-N Ser-Gly Chemical compound OC[C@H](N)C(=O)NCC(O)=O WOUIMBGNEUWXQG-VKHMYHEASA-N 0.000 description 1
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 1
- HZNFKPJCGZXKIC-DCAQKATOSA-N Ser-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N HZNFKPJCGZXKIC-DCAQKATOSA-N 0.000 description 1
- MLSQXWSRHURDMF-GARJFASQSA-N Ser-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N)C(=O)O MLSQXWSRHURDMF-GARJFASQSA-N 0.000 description 1
- JEHPKECJCALLRW-CUJWVEQBSA-N Ser-His-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEHPKECJCALLRW-CUJWVEQBSA-N 0.000 description 1
- ZUDXUJSYCCNZQJ-DCAQKATOSA-N Ser-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N ZUDXUJSYCCNZQJ-DCAQKATOSA-N 0.000 description 1
- SBMNPABNWKXNBJ-BQBZGAKWSA-N Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CO SBMNPABNWKXNBJ-BQBZGAKWSA-N 0.000 description 1
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- KJKQUQXDEKMPDK-FXQIFTODSA-N Ser-Met-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O KJKQUQXDEKMPDK-FXQIFTODSA-N 0.000 description 1
- ZSLFCBHEINFXRS-LPEHRKFASA-N Ser-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ZSLFCBHEINFXRS-LPEHRKFASA-N 0.000 description 1
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- AABIBDJHSKIMJK-FXQIFTODSA-N Ser-Ser-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O AABIBDJHSKIMJK-FXQIFTODSA-N 0.000 description 1
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 1
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 1
- UYLKOSODXYSWMQ-XGEHTFHBSA-N Ser-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N)O UYLKOSODXYSWMQ-XGEHTFHBSA-N 0.000 description 1
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 1
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 1
- KIEIJCFVGZCUAS-MELADBBJSA-N Ser-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N)C(=O)O KIEIJCFVGZCUAS-MELADBBJSA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 1
- 101100268282 Sus scrofa ZP2 gene Proteins 0.000 description 1
- 101100545388 Sus scrofa ZP4 gene Proteins 0.000 description 1
- STGXWWBXWXZOER-MBLNEYKQSA-N Thr-Ala-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 STGXWWBXWXZOER-MBLNEYKQSA-N 0.000 description 1
- HYLXOQURIOCKIH-VQVTYTSYSA-N Thr-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N HYLXOQURIOCKIH-VQVTYTSYSA-N 0.000 description 1
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 1
- XVNZSJIKGJLQLH-RCWTZXSCSA-N Thr-Arg-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N)O XVNZSJIKGJLQLH-RCWTZXSCSA-N 0.000 description 1
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 1
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 1
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 1
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 1
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 1
- CUTPSEKWUPZFLV-WISUUJSJSA-N Thr-Cys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(O)=O CUTPSEKWUPZFLV-WISUUJSJSA-N 0.000 description 1
- UZJDBCHMIQXLOQ-HEIBUPTGSA-N Thr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O UZJDBCHMIQXLOQ-HEIBUPTGSA-N 0.000 description 1
- BECPPKYKPSRKCP-ZDLURKLDSA-N Thr-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O BECPPKYKPSRKCP-ZDLURKLDSA-N 0.000 description 1
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 1
- YUPVPKZBKCLFLT-QTKMDUPCSA-N Thr-His-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N)O YUPVPKZBKCLFLT-QTKMDUPCSA-N 0.000 description 1
- ODXKUIGEPAGKKV-KATARQTJSA-N Thr-Leu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O ODXKUIGEPAGKKV-KATARQTJSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- YKRQRPFODDJQTC-CSMHCCOUSA-N Thr-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN YKRQRPFODDJQTC-CSMHCCOUSA-N 0.000 description 1
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 1
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 1
- IQHUITKNHOKGFC-MIMYLULJSA-N Thr-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IQHUITKNHOKGFC-MIMYLULJSA-N 0.000 description 1
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 1
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 1
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- DSGIVWSDDRDJIO-ZXXMMSQZSA-N Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DSGIVWSDDRDJIO-ZXXMMSQZSA-N 0.000 description 1
- TZQWJCGVCIJDMU-HEIBUPTGSA-N Thr-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N)O TZQWJCGVCIJDMU-HEIBUPTGSA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 1
- VZBWRZGNEPBRDE-HZUKXOBISA-N Trp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N VZBWRZGNEPBRDE-HZUKXOBISA-N 0.000 description 1
- LCPVBXOHXMBLFW-JSGCOSHPSA-N Trp-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)=CNC2=C1 LCPVBXOHXMBLFW-JSGCOSHPSA-N 0.000 description 1
- PXYJUECTGMGIDT-WDSOQIARSA-N Trp-Arg-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 PXYJUECTGMGIDT-WDSOQIARSA-N 0.000 description 1
- BXKWZPXTTSCOMX-AQZXSJQPSA-N Trp-Asn-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXKWZPXTTSCOMX-AQZXSJQPSA-N 0.000 description 1
- NXJZCPKZIKTYLX-XEGUGMAKSA-N Trp-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NXJZCPKZIKTYLX-XEGUGMAKSA-N 0.000 description 1
- NOBINHCGDUHOBV-NAZCDGGXSA-N Trp-His-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NOBINHCGDUHOBV-NAZCDGGXSA-N 0.000 description 1
- WMBFONUKQXGLMU-WDSOQIARSA-N Trp-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WMBFONUKQXGLMU-WDSOQIARSA-N 0.000 description 1
- ABEVJDLMFPTGPS-SZMVWBNQSA-N Trp-Met-Met Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O ABEVJDLMFPTGPS-SZMVWBNQSA-N 0.000 description 1
- LVTKHGUGBGNBPL-UHFFFAOYSA-N Trp-P-1 Chemical compound N1C2=CC=CC=C2C2=C1C(C)=C(N)N=C2C LVTKHGUGBGNBPL-UHFFFAOYSA-N 0.000 description 1
- LFMLXCJYCFZBKE-IHPCNDPISA-N Trp-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N LFMLXCJYCFZBKE-IHPCNDPISA-N 0.000 description 1
- UGFOSENEZHEQKX-PJODQICGSA-N Trp-Val-Ala Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(O)=O UGFOSENEZHEQKX-PJODQICGSA-N 0.000 description 1
- VNRTXOUAOUZCFW-WDSOQIARSA-N Trp-Val-His Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O VNRTXOUAOUZCFW-WDSOQIARSA-N 0.000 description 1
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 1
- WTXQBCCKXIKKHB-JYJNAYRXSA-N Tyr-Arg-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WTXQBCCKXIKKHB-JYJNAYRXSA-N 0.000 description 1
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 1
- DANHCMVVXDXOHN-SRVKXCTJSA-N Tyr-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DANHCMVVXDXOHN-SRVKXCTJSA-N 0.000 description 1
- CVXURBLRELTJKO-BWAGICSOSA-N Tyr-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O CVXURBLRELTJKO-BWAGICSOSA-N 0.000 description 1
- XDGPTBVOSHKDFT-KKUMJFAQSA-N Tyr-Met-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O XDGPTBVOSHKDFT-KKUMJFAQSA-N 0.000 description 1
- VNYDHJARLHNEGA-RYUDHWBXSA-N Tyr-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 VNYDHJARLHNEGA-RYUDHWBXSA-N 0.000 description 1
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 1
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 1
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 1
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 description 1
- ABSXSJZNRAQDDI-KJEVXHAQSA-N Tyr-Val-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ABSXSJZNRAQDDI-KJEVXHAQSA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- WKWJJQZZZBBWKV-JYJNAYRXSA-N Val-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WKWJJQZZZBBWKV-JYJNAYRXSA-N 0.000 description 1
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 1
- KXUKIBHIVRYOIP-ZKWXMUAHSA-N Val-Asp-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KXUKIBHIVRYOIP-ZKWXMUAHSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- COSLEEOIYRPTHD-YDHLFZDLSA-N Val-Asp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 COSLEEOIYRPTHD-YDHLFZDLSA-N 0.000 description 1
- KOPBYUSPXBQIHD-NRPADANISA-N Val-Cys-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KOPBYUSPXBQIHD-NRPADANISA-N 0.000 description 1
- SRWWRLKBEJZFPW-IHRRRGAJSA-N Val-Cys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N SRWWRLKBEJZFPW-IHRRRGAJSA-N 0.000 description 1
- UPJONISHZRADBH-XPUUQOCRSA-N Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UPJONISHZRADBH-XPUUQOCRSA-N 0.000 description 1
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 1
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- SJLVYVZBFDTRCG-DCAQKATOSA-N Val-Lys-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N SJLVYVZBFDTRCG-DCAQKATOSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 1
- SVFRYKBZHUGKLP-QXEWZRGKSA-N Val-Met-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVFRYKBZHUGKLP-QXEWZRGKSA-N 0.000 description 1
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 1
- QPPZEDOTPZOSEC-RCWTZXSCSA-N Val-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N)O QPPZEDOTPZOSEC-RCWTZXSCSA-N 0.000 description 1
- ILMVQSHENUZYIZ-JYJNAYRXSA-N Val-Met-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N ILMVQSHENUZYIZ-JYJNAYRXSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 1
- TVGWMCTYUFBXAP-QTKMDUPCSA-N Val-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N)O TVGWMCTYUFBXAP-QTKMDUPCSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 1
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 1
- UFCHCOKFAGOQSF-BQFCYCMXSA-N Val-Trp-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N UFCHCOKFAGOQSF-BQFCYCMXSA-N 0.000 description 1
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 1
- 229920004482 WACKER® Polymers 0.000 description 1
- 101150089724 ZP2 gene Proteins 0.000 description 1
- 101710151239 Zona pellucida sperm-binding protein 2 Proteins 0.000 description 1
- 101710151254 Zona pellucida sperm-binding protein 4 Proteins 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- FKNHDDTXBWMZIR-GEMLJDPKSA-N acetic acid;(2s)-1-[(2r)-2-amino-3-sulfanylpropanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(O)=O.SC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O FKNHDDTXBWMZIR-GEMLJDPKSA-N 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 230000030120 acrosome reaction Effects 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 235000019270 ammonium chloride Nutrition 0.000 description 1
- 210000000612 antigen-presenting cell Anatomy 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 230000001363 autoimmune Effects 0.000 description 1
- 230000005784 autoimmunity Effects 0.000 description 1
- OHDRQQURAXLVGJ-UHFFFAOYSA-N azane;3-ethyl-2-[(3-ethyl-6-sulfo-1,3-benzothiazol-2-ylidene)hydrazinylidene]-1,3-benzothiazole-6-sulfonic acid Chemical compound [NH4+].[NH4+].S1C2=CC(S([O-])(=O)=O)=CC=C2N(CC)C1=NN=C1SC2=CC(S([O-])(=O)=O)=CC=C2N1CC OHDRQQURAXLVGJ-UHFFFAOYSA-N 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- RIIWUGSYXOBDMC-UHFFFAOYSA-N benzene-1,2-diamine;hydron;dichloride Chemical compound Cl.Cl.NC1=CC=CC=C1N RIIWUGSYXOBDMC-UHFFFAOYSA-N 0.000 description 1
- 229920001222 biopolymer Polymers 0.000 description 1
- 238000009835 boiling Methods 0.000 description 1
- 210000004900 c-terminal fragment Anatomy 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 238000011026 diafiltration Methods 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- BNIILDVGGAEEIG-UHFFFAOYSA-L disodium hydrogen phosphate Chemical compound [Na+].[Na+].OP([O-])([O-])=O BNIILDVGGAEEIG-UHFFFAOYSA-L 0.000 description 1
- 229910000397 disodium phosphate Inorganic materials 0.000 description 1
- 235000019800 disodium phosphate Nutrition 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 229940088598 enzyme Drugs 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 231100000502 fertility decrease Toxicity 0.000 description 1
- 230000003325 follicular Effects 0.000 description 1
- 231100000221 frame shift mutation induction Toxicity 0.000 description 1
- 230000037433 frameshift Effects 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 210000004602 germ cell Anatomy 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- KZNQNBZMBZJQJO-YFKPBYRVSA-N glyclproline Chemical compound NCC(=O)N1CCC[C@H]1C(O)=O KZNQNBZMBZJQJO-YFKPBYRVSA-N 0.000 description 1
- 102000035122 glycosylated proteins Human genes 0.000 description 1
- 108091005608 glycosylated proteins Proteins 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 102000055962 human ZP1 Human genes 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 231100000503 infertility induction Toxicity 0.000 description 1
- BWHLPLXXIDYSNW-UHFFFAOYSA-N ketorolac tromethamine Chemical compound OCC(N)(CO)CO.OC(=O)C1CCN2C1=CC=C2C(=O)C1=CC=CC=C1 BWHLPLXXIDYSNW-UHFFFAOYSA-N 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 101150011498 lad gene Proteins 0.000 description 1
- 238000011031 large-scale manufacturing process Methods 0.000 description 1
- 230000002045 lasting effect Effects 0.000 description 1
- 108010009932 leucyl-alanyl-glycyl-valine Proteins 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 230000005923 long-lasting effect Effects 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 235000013372 meat Nutrition 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 229910021645 metal ion Inorganic materials 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- UZKWTJUDCOPSNM-UHFFFAOYSA-N methoxybenzene Substances CCCCOC=C UZKWTJUDCOPSNM-UHFFFAOYSA-N 0.000 description 1
- 102000035118 modified proteins Human genes 0.000 description 1
- 108091005573 modified proteins Proteins 0.000 description 1
- 239000003921 oil Substances 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 229940092253 ovalbumin Drugs 0.000 description 1
- KHIWWQKSHDUIBK-UHFFFAOYSA-N periodic acid Chemical compound OI(=O)(=O)=O KHIWWQKSHDUIBK-UHFFFAOYSA-N 0.000 description 1
- 102000013415 peroxidase activity proteins Human genes 0.000 description 1
- 108040007629 peroxidase activity proteins Proteins 0.000 description 1
- 150000002978 peroxides Chemical class 0.000 description 1
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- NMHMNPHRMNGLLB-UHFFFAOYSA-N phloretic acid Chemical compound OC(=O)CCC1=CC=C(O)C=C1 NMHMNPHRMNGLLB-UHFFFAOYSA-N 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 210000005059 placental tissue Anatomy 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 229920001983 poloxamer Polymers 0.000 description 1
- 229920005862 polyol Polymers 0.000 description 1
- 150000003077 polyols Chemical class 0.000 description 1
- 235000010482 polyoxyethylene sorbitan monooleate Nutrition 0.000 description 1
- 229920001155 polypropylene Polymers 0.000 description 1
- 229920000053 polysorbate 80 Polymers 0.000 description 1
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 1
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 1
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 1
- 239000011148 porous material Substances 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 1
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 1
- 230000020978 protein processing Effects 0.000 description 1
- 230000006337 proteolytic cleavage Effects 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000010583 slow cooling Methods 0.000 description 1
- 239000001632 sodium acetate Substances 0.000 description 1
- 235000017281 sodium acetate Nutrition 0.000 description 1
- 239000001488 sodium phosphate Substances 0.000 description 1
- 235000011008 sodium phosphates Nutrition 0.000 description 1
- 238000010972 statistical evaluation Methods 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 230000004936 stimulating effect Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000002459 sustained effect Effects 0.000 description 1
- 238000012956 testing procedure Methods 0.000 description 1
- 238000004448 titration Methods 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical compound [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 241000701366 unidentified nuclear polyhedrosis viruses Species 0.000 description 1
- 108010072695 valyl-valyl-tyrosyl-proline Proteins 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/705—Receptors; Cell surface antigens; Cell surface determinants
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P15/00—Drugs for genital or sexual disorders; Contraceptives
- A61P15/18—Feminine contraceptives
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/40—Fusion polypeptide containing a tag for immunodetection, or an epitope for immunisation
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/61—Fusion polypeptide containing an enzyme fusion for detection (lacZ, luciferase)
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S424/00—Drug, bio-affecting and body treating compositions
- Y10S424/811—Drug, bio-affecting and body treating compositions involving sex selection or contraception
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Medicinal Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Toxicology (AREA)
- Gastroenterology & Hepatology (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Zoology (AREA)
- Genetics & Genomics (AREA)
- Immunology (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Cell Biology (AREA)
- Gynecology & Obstetrics (AREA)
- Animal Behavior & Ethology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Endocrinology (AREA)
- Reproductive Health (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Pharmacology & Pharmacy (AREA)
- Engineering & Computer Science (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Peptides Or Proteins (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Saccharide Compounds (AREA)
- Steroid Compounds (AREA)
Abstract
A method for specifically inducing transient infertility or permanent sterility in a host animal by selective vaccination with specific zona pellucida proteins or immunocontraceptively active fragments thereof. Novel zona pellucida DNA sequences encoding specific zona pellucida proteins are disclosed.
Description
TITLE: MATERIALS AND METHODS FOR
IMMUNOCONTRACEPTION
CROSS REFERENCE TO RELATED APPLICATION
This application is a continuation-in-part of U.S. Application Serial No. 08/012,990, filed January 29, 1993, which is a continuation-in-part of U.S. Application Serial No. 07/973,341 , filed on November 9, 1992.
FIELD OF THE INVENTION This invention relates generally to the production and use of zona pellucida proteins, and more particularly to novel DNA sequences encoding zona pellucida proteins, to recombinant materials and methods for producing such proteins and to materials and methods for selectively effecting either transient infertility or permanent sterility in mammals through use of naturally occurring and recombinant zona pellucida proteins.
BACKGROUND OF THE INVENTION
The present invention relates to a method for inducing reproducible transient infertility or sterility in a mammal by inducing in that mammal antibodies directed to proteins found in the zona pellucida of that mammal's oocytes. The invention also relates to purified, isolated DNA sequences encoding the zona pellucida proteins herein designated "ZPA" and "ZPB" and "ZPC" from various mammalian species. The invention is further directed to pharmaceutical compositions capable of inducing antibody production in a subject mammal.
The zona pellucida (ZP) is a complex matrix surrounding the mammalian oocyte, formed of glycoproteins secreted by ovarian cells. Zona pellucida glycoproteins perform a variety of functions. For example, the mouse ZP proteins previously designated ZP2 and ZP3 are complexed into long filaments which are cross-linked by the protein designated ZP1 in the ZP matrix providing structural integrity to the matrix. Wassarman, P.M., Annu. Rev. Biochem. 57:415-442 (1988). In addition to its structural role, mouse ZP3 has been shown to be a sperm receptor in the ZP matrix. Bleil, J.P. and Wassarman, P.M., Cell 20: 873-882 (1980). Following binding of sperm to ZP3 and the subsequent induction of the sperm acrosome reaction on the surface of the ZP, ZP2 acts as a secondary sperm receptor that is necessary for the maintenance of sperm binding to the egg. Bleil et al. , Dev. Biol. 128: 376-385 (1988). Because of its role in the maintenance of the oocyte and in sperm-oocyte interactions, the ZP represents a logical target for design of contraceptive agents which interfere with the fertilization process.
Various groups have undertaken an immunological approach in attempts to interfere with ZP functions and thus to decrease fertility in immunized animals. See, Dunbar et al. In: International Congress on Reproductive Immunology. T. Wegman and T. Gills (eds.). London: Oxford Press, pp. 505-528 (1983); and Dunbar et al. In: Mechanisms and Control of Animal Fertilization. J. Hartman (ed.) Academic Press, New York, pp. 139-166 (1983). These studies showed that active immunization of mammals with ovarian homogenates decreased fertility. However, the large number of components in such homogenates made the identification of antigens responsible for the decrease in fertility nearly impossible. In addition, the use of such a complex mixture creates a potential for unwanted and potentially harmful side-effects.
Research by various investigators using chromatographic methods including SDS polyacrylamide gel electrophoresis (PAGE) and high pressure liquid chromatography (HPLC) have resulted in the identification of
numerous zona pellucida proteins from a variety of mammalian species. Data compiled by Timmons and Dunbar in "Perspectives in Immunoreproduction: Conception and Contraception "; pp. 242-260, Mathur, S. and Fredericks, CM. eds. ; New York, Hemisphere Publishing Co (1988), as described below, illustrate examples of zona pellucida proteins that have been characterized. Zona pellucida proteins isolated from pig include: PZI, a 40- 110 kD protein isolated by Dunbar et al. , Biol. Reprod. 24: 1111 (1981); PZII, a 70-110 kD protein, PZIII, a 95-118 kD protein, and PZIV, an 18-25 kD protein, all isolated by Dunbar et al , Biol. Reprod. 32:619 (1985); 90K, a 89-119 kD protein, 65K, a 61-83 kD protein, 55K, a 47-66 kD protein, and 25K, an 18-26 kD protein, all isolated by Hedrick, J.L. and Wardrip, N.J. Biochem. 157: 63 (1986); ZP1, an 82-118 kD protein, ZP2, a 58-96 kD protein, ZP3 (PPZA), a 40-74 kD protein, and ZP4, a 21 kD protein, all isolated by Subramanian et al. , Biol. Reprod. 24:933 (1981); 87K (ZP1/ZP2), a 77-97 kD protein, 58K, a 40-70 kD protein both isolated by Yurewicz et al , Biol. Reprod. 29: 511 (1983); deglycosylated PZI, a 35 kD protein; PZII, a 55 kD protein; and PZIII, an 80 kD protein all isolated by Skinner and Dunbar as described in Immunological Approaches to Contraception and the Promotion of Fertility, G. P. Talwar (ed.) New York: Plenum pp. 251- 268 (1986); and deglycosylated ZP3 having a molecular weight of 45 kD isolated by Sacco et al , J. Reprod. Fertil. 76:575 (1986).
Isolated rabbit zona pellucida proteins include: RZI, RZII, and RZIII, having molecular weights of 68-125 kD, 80-100.5 kD, and 100-132 kD respectively, all isolated by Dunbar et al , Biol. Reprod. 24: 1111 (1986); ZP1, ZP2, and ZP3 having molecular weights of 100-118 kD, 83-110 kD, and 80-92 kD respectively, all isolated by Sacco et al , Proc. Soc. .Exp. Biol. Med. 167:318 (1981); deglycosylated RZI. and RZII having molecular weights of 65 kD, and 80kD respectively, both isolated by Skinner and Dunbar and described in Immunological Approaches to Contraception and Promotion of Fertility. G.P. Talwar (ed.). New York: Plenum, pp. 251-268 (1986); and
deglycosylated RZIII, a 90 kD protein isolated by Timmons and Dunbar, Biol. Reprod. 36: 1275 (1987).
A number of mouse zona pellucida proteins have been isolated including: ZP1, ZP2, and ZP3 having molecular weights of 200 kD, 120 kD, and 83 kD respectively, all isolated by Bleil and Wassarman Dev. Biol. 76:185 (1980); and ZP1 and ZP2 having molecular weights of 166-122 kD and 90-92 kD respectively, isolated by Sacco et al. , Proc. Soc. Exp. Biol. Med. 167: 318 (1981). The differences in the molecular weights of mouse ZP1 and ZP2 as reported by Bleil et al. and Sacco et al. may be due to the fact that Bleil used 2D-PAGE under non-reducing conditions while Sacco used 2D-PAGE under reducing conditions.
The cat zona pellucida proteins CZI and CZII were isolated by Maresh and Dunbar J. Exp. Zool 244:299 (1987) and have molecular weights of 50-110 kD and 90-110 kD respectively. Maresh and Dunbar J. Exp. Zool. 244:299 (1987), have also isolated the dog zona pellucida proteins DZI, DZII, and DZIII which have molecular weights of 50-110 kD, 70-95 kD, and 90-100 kD respectively.
Sacco et al , Proc. Soc. Exp. Biol. Med. 167:318 (1981) described squirrel monkey ZP1, ZP2, ZP3, and ZP4 having molecular weights of 63-78 kD, 63-70 kD, 47-51 kD, and 43-47 kD respectively. In the same publication
Sacco et al. described human ZP1 , ZP2, and ZP3 having molecular weights of 80-120 kD, 73 kD, and 59-65 kD respectively.
To date, few mammalian zona pellucida genes or proteins have been isolated and sequenced. None has been successfully used to produce an effective immunocontraceptive. A lack of consensus among those of skill in the art regarding the number and characteristics (e.g. molecular weight) of proteins present in the zona pellucida of various mammalian species, and difficulties in purifying these heavily glycosylated proteins have hampered
attempts to utilize zona pellucida proteins to produce an effective immunocontraceptive with predictable function.
A number of groups have had success in cloning cDNAs or genes encoding various mammalian zona pellucida proteins. Ringuette et al. , Dev. Biol, 127:287-295 (1988) and Liang et al , Mol Cell. Biol , 10:1507-1515 (1990), reported cloning of mouse DNA encoding zona pellucida proteins ZP3 and ZP2, respectively. The clones were obtained by screening mouse cDNA libraries with anti-ZP3 and anti-ZP2 antibodies. No sequence homology was found between mouse ZP3 and ZP2. Ringuette et al , Proc. Natl. Acad. Sci. USA, 83:4341-4345
(1986), reported isolation of a partial cDNA clone for mouse ZP3, which clone hybridized with total genomic DNA of mouse, rat, dog, cow, and human, but not with pig or rabbit genomic DNA unless the hybridization was performed at very low stringency. The full length ZP3 cDNA characterized by Ringuette Dev. Biol. 127:287-295(1988) represents a germ-line specific mRNA having relatively short 5' and 3' untranslated regions and an open reading frame of about 1317 nucleotides with an additional 200-300 nucleotide poly-A tail. Ringuette also found that rat, rabbit, dog, and cow ovary transcribes mRNA which hybridized to the mouse ZP3 cDNA and that the ZP3 transcripts had similar molecular weights. Liang et al Mol. Cell. Biol. , 10:1507-1515 (1990), showed that the nucleic acid and deduced amino acid sequence of ZP2 is distinctly different from that of ZP3 although it had the same short motif of 5' and 3' untranslated regions. The ZP2 mRNA is reported to have single open reading frame of 2, 139 nucleotides which codes for a polypeptide of 80,217 Daltons representing 713 amino acids.
Chamberlin and Dean, Dev. Biol. 131:207-214 (1989) and Kinloch, R.A. et al , Proc. Nat. Acad. Sci. USA, 85:6409-6413 (1988) have reported the cloning of the mouse ZP3 gene. The mouse ZP3 gene is reported to have 8 exons and 7 introns in a transcription unit of 8.6 kbp.
Kinloch etal, Dev. Biol. 142:414-421 (1990), reported cloning of hamster genomic ZP3 DNA from a hamster genomic DNA library screened with mouse ZP3 DNA as a probe. The hamster ZP3 gene has a transcription unit of 7900 nucleotides and was found to contain 7 introns and 8 exons. The hamster ZP3 protein is approximately 81 % homologous to mouse ZP3 protein. The hamster transcript contained 1266 nucleotides, six less than mouse ZP3 mRNA.
Chamberlain and Dean, Proc. Natl. Acad. Sci. USA 87:6014-6018 (1990), reported the cloning of human ZP3 from a human genomic DNA library using mouse ZP3 cDNA as a probe. The human ZP3 gene is composed of 8 exons in a transcription unit of 18.3 kbp. The exons are almost identical in size to the eight exons of mouse ZP3 and the nucleotide sequence of the coding region is 74 % homologous. The human ZP3 transcript is very similar to mouse ZP3 mRNA. Both have short 5 ' and 3 ' untranslated regions, and both have a single open reading frame of 1272 nucleotides that encodes a 424-amino acid protein.
U.S. Patent No. 4,996,297, to Dunbar, reported the isolation of three rabbit zona pellucida clones encoding rabbit ZP1 and ZP2 proteins, using anti-ZPl and anti-ZP2 antibodies as screening probes. The sequences designated as P2 and P3 in Figure 4 of the Dunbar patent represent rabbit ZP cDNAs of 812 and 1705 nucleotides respectively.
Schwoebel et al , J. Biol Chan. 266:7214-7219 (1991), isolated and characterized a full length cDNA (designated re 55) encoding the 55-kD rabbit zona pellucida protein using cross-species affinity purified antisera. The protein encoded by this cDNA has some similarity to the mouse ZP2 protein described by Liang. However, comparisons of re 55 with the mouse ZP3 protein revealed no homology.
The functional activities of the cloned ZP DNAs and their encoded proteins have not been fully characterized and neither has their potential use as immunocontraceptives been demonstrated.
In order to develop a useful zona pellucida product for use in fertility control, particularly in the form of a vaccine, it is highly desirable to purify, isolate, and characterize zona pellucida proteins from a species of an animal of interest. Because of factors such as the purity of such proteins needed for vaccine production, and the high cost and numerous problems associated with purification of these proteins, it would be highly desirable to ascertain the DNA and amino acid sequences of zona pellucida proteins of a specific species of interest. Having such known, isolated and characterized zona pellucida proteins, the function of each zona pellucida protein may be understood and a fertility control product may be designed based upon the specific functional characteristics of a particular zona pellucida protein and for a particular mammalian species.
It would be thus highly useful and desirable to provide isolated, purified, sequenced, and characterized recombinant zona pellucida proteins which would permit the development of fertility control products possessing specific reproducible effects in eliciting transient and/or permanent infertility. Such products, where used to elicit transient infertility, would desirably have long lasting effects so as to minimize the number of times the immunocontraceptive agent must be administered to maintain infertility.
SUMMARY OF THE INVENTION
The present invention provides novel methods and materials for inducing either reproducible transient or permanent infertility effects in female mammals, including humans, by selective administration of homologous and/or heterologous mammalian species ZP proteins or immunocontraceptively active fragments thereof hereinafter designated as ZPA, ZPB and ZPC. By "reproducible" is meant that, unlike prior art attempts to induce transient infertility by administration of ZP proteins (in the form of mixtures of such proteins), this invention achieves its transient infertility effects by the administration of ZPA and/or ZPB in a form such that the duration of
transient infertility is controllable and can be maintained in an on or off condition in a controllable and/or predictable fashion. This is achieved primarily through administration of the highly pure ZPA and ZPB proteins or immunocontraceptively active fragments thereof of this invention, e.g., in recombinant form and thus essentially devoid of ZPC. By immunocontraceptively active fragments is meant a ZP protein fragment capable of inducing infertility.
In one of its aspects, the present invention provides methods for inducing reproducible transient infertility in a mammal by administering to a subject female mammal a zona pellucida protein (or fragment thereof) selected from the group consisting of mammalian ZPA, and ZPB, and combinations thereof in doses effective to stimulate production in said mammal of antibodies which recognize ZPA or ZPB proteins of said mammal. It is presently preferred that mammalian ZPA and ZPB for use in such methods be derived from the same mammalian species as the subject mammal although the use of heterologous species proteins is also contemplated. Use of purified isolates of mammalian ZPA or ZPB protein such as obtained by chromatographic separatory procedures is contemplated. Use of proteins produced by recombinant methods is expected to be most preferred. According to another aspect of the invention, methods are provided for inducing permanent sterility in a female mammal by administering to a subject female mammal a recombinant mammalian ZPC protein (or fragment thereof) in a form essentially devoid of ZPA and/or ZPB, in a dose effective to stimulate production in said female mammal of antibodies which recognize the ZPC protein of said mammal. As is the case with induction of transient infertility, use of homologous species ZPC is preferred, but not required, and the protein may be derived from natural sources or produced by recombinant methods. Modified ZPC proteins including but not limited to palmitylated and chitosan modified proteins are also contemplated by the present invention.
Presently preferred ZPA, ZPB, and ZPC proteins for veterinary application of the transient infertility and sterility inducing methods include porcine, rabbit, canine, feline, bovine, and cynomolgus monkey ZP proteins. In another of its aspects, the present invention provides pharmaceutical compositions for use in inducing reproducible transient infertility in a female mammal (including humans) comprising an effective dose of a zona pellucida protein (or fragment thereof) selected from the group consisting of mammalian ZPA, and ZPB (substantially free of ZPC), in combination with one or more pharmaceutically acceptable carriers, diluents and adjuvants. Modified ZPA and ZPB proteins (for example, palmitylated or chitosan modified) are also contemplated by the present invention.
According to another aspect of the present invention, novel purified and isolated DNA sequences are provided which encode porcine ZPA, ZPB, and ZPC, as illustrated by the DNA sequences set out in SEQ ID NOS. 1, 3, and 5. Also, provided are purified and isolated DNA sequences encoding: rabbit ZPC, as illustrated by the DNA sequence set out in SEQ ID NO. 7; canine ZPA and ZPC, as illustrated by the DNA sequences set out in SEQ ID NOS. 9 and 11; feline ZPA, ZPB, and ZPC, as illustrated by the DNA sequences set out in SEQ ID NOS. 13, 15, and 17; bovine ZPA, ZPB, and ZPC, as illustrated by the DNA sequences set out in SEQ ID NOS. 19, 21, and 23; human ZPA and ZPB as illustrated by sequences set out in SEQ ID NO. 42 and 40, respectively, and as contained as human DNA inserts in lambda phage clones Al and A4, (ZPA) and as contained in human DNA inserts in lambda phage clones 1-1 and 4-9 (ZPB). Polynucleotide sequences of the invention are useful for the production of ZPA, ZPB and ZPC proteins by recombinant methods and as probes for the isolation of heterologous species polynucleotides encoding corresponding zona pellucida proteins by hybridization methods.
Also provided by the present invention are novel host cells, especially unicellular eucaryotic and procaryotic cells, stably transformed or
transfected with polynucleotides of the invention in a manner allowing expression of the ZP proteins (or immunologically significant fragments thereof) in the host cells. Host cells expressing such ZP products, when grown in a suitable culture medium, and particularly useful for large scale production processes wherein the desired polypeptide products, in glycosylated or non-glycosylated form are isolated from the cells or the medium in which the cells are grown.
Recombinant polypeptides provided by the invention thus comprise ZPA, ZPB and ZPC, and full equivalents of such zona pellucida proteins including both glycosylated and non-glycosylated forms, variants and immunologically active fragments thereof which retain substantial biological activity, i.e., at least one of the biological activities of the zona pellucida protein discussed herein, e.g., the ability to stimulate the production of antibodies as discussed herein upon administration to a mammal. Such immunologically active fragments may be defined as containing at least one epitope effective to stimulate the production of antibodies upon administration to a mammal in accordance with this invention.
In another aspect of the invention, a method is provided for the isolation of nucleic acid sequences encoding other mammalian ZPA, ZPB, and ZPC proteins by hybridization under stringent conditions of heterologous species ZPA, ZPB, and/or ZPC probes to cDNA or genomic DNA libraries, derived from the mammalian species of interest.
More particularly, it is an aspect of the invention to provide a method for the isolation of nucleic acid sequences encoding human ZPA and ZPB by hybridization under stringent conditions of sequences encoding ZPA and/or ZPB from heterologous species.
Other aspects and advantages of the present invention will be readily understood upon consideration of the following detailed description of presently preferred embodiments thereof, reference being made to the figures wherein:
DESCRIPTION OF THE FIGURES
Fig. 1 is a diagrammatic representation of the plasmid vector pZ90;
Fig. 2 is a diagrammatic representation of the plasmid vector pZ98; and
Fig. 3 is a diagrammatic representation of the plasmid vector pZ156.
Fig. 4 is a diagrammatic representation of the alignment of the Eco Rl fragments encoding human ZPB. Fig. 5 is a diagrammatic representation of the plasmid vector pZ169.
Fig. 6 is a diagrammatic representation of the plasmid vector pZ145.
DETAILED DESCRIPTION OF THE INVENTION The present invention is directed to mammalian zona pellucida proteins characterized in three major classes: ZPA, ZPB, and ZPC. This classification scheme has resulted from repetitive screening of various mammalian ovarian cDNA libraries and retrieval of clones which encode proteins showing significant homology in three distinct groups, designated herein as ZPA, ZPB and ZPC. Although similarity is seen between DNA sequences encoding ZPA, ZPB, or ZPC between animal species, very little homology is found between the individual species' ZPA, ZPB, and ZPC proteins.
DNA sequences encoding zona pellucida proteins A, B, and C and their deduced amino acid sequences for various mammalian species ZPs are presented in SEQ ID NOS. 1-24. It is understood that the DNA sequence of a particular animal may vary slightly due to the phenomenon of allelic variation. Small differences in the precise DNA sequence between animals or slight errors due to the inefficiency of sequencing procedures are to be
expected. Such variants are included within the scope of the present invention.
The zona pellucida DNA sequences described above were obtained from ovarian cDNA libraries screened with specific zona pellucida antibodies or known zona pellucida DNA probes. Comparison of isolated sequences to published protein or DNA sequences and with other clones as they were isolated was used to classify and identify the clones as described above.
The term "zona pellucida protein" is meant to include full length proteins ZPA, ZPB, and ZPC, as well as expected variants, immunologically active fragments or peptides contained within these proteins.
The term "zona pellucida DNA" is meant to include those nucleic acid sequences encoding zona pellucida protein or fragments thereof.
The three major classes of mammalian zona pellucida proteins have been determined on the basis of homology within the DNAs encoding ZP proteins of a variety of mammalian species. ZPA includes those peptides previously, variously described in the literature as ZP1 , ZP2, and ZP4; ZPB includes those peptides previously, variously described as ZP3α and re 55; and ZPC includes those peptides previously variously described as ZP33 and ZP3.
The homology of various species of zona pellucida proteins within a specific class as compared with a consensus sequence for each class is shown in Table 1. The consensus sequence was derived using the Microgenie® Sequence Analysis Program (Beckman Instruments, Inc. Spinco Division, Palo Alto, CA). The minimum percent of aligned sequences which must have the same residue at a given position for that residue to be included in the consensus sequence was 50%. The DNA sequences corresponding to the amino acid consensus sequences for ZPA, ZPB, and ZPC proteins are set out in SEQ ID NOS 25. 26, and 27, respectively.
TABLE 1 HOMOLOGY OF DEDUCED ZP PROTEINS AMINO ACIDS
DOG CAT COW PIG
RABBIT MOUSE HUMAN HAMSTER
The deduced amino acid sequences of the various species of zona pellucida proteins suggest approximate unglycosylated molecular weights of 75 kD, 55 kD, and 45 kD for ZPA, ZPB, and ZPC, respectively. A more detailed analysis of both DNA sequence homology and deduced amino acid sequence homology is set out as Examples 13, 14, and 15.
It has surprisingly been found that administration of a specific class of zona pellucida protein to a host animal results in a specific immunocontraceptive effect and that selection of the appropriate ZP protein for administration allows induction of desired contraceptive results, in terms of permanent sterility or transient infertility. For example, vaccination of an animal with zona pellucida protein C induces antibody titers in that animal which recognize endogenous ZPC resulting in loss of oocytes from the animal's ovary, thereby causing permanent sterility. In contrast, vaccination of an animal with zona pellucida protein A, B or combinations thereof induces antibody titers which do not recognize ZPC, but recognize ZPA and/or ZPB. This results in cycling, infertile animals for the time period during which
anti-ZPA and/or anti-ZPB antibody titers remain high. When such antibody titers fall, the infertility effect is diminished, and the animal regains fertility.
Vaccination with the purified, isolated, and characterized ZPA,
ZPB, or ZPC proteins is seen to exert a specific effect on the immunized animal if an autoimmune response is triggered wherein the autoantibodies generated specifically recognize the immunized animals' own specific zona pellucida protein. This self-recognition for antibodies induced according to the present invention may be defined and characterized by the ability of serum antibodies to recognize at least one epitope present on a homologous species zona pellucida protein.
In the preferred method of the invention, an animal is immunized with a recombinant ZPA, ZPB, or ZPC or fragments thereof. The recombinant protein or peptide may be of homologous species or derived from a heterologous species zona pellucida which shares common epitopic determinants, with the proviso that such common epitopic determinants function to induce the desired autoimmune response.
The recombinant protein or peptide fragment may be chemically conjugated to immune enhancing agents such as Keyhole Limpet Hemocyanin (KLH), and Muramyl dipeptide (MDP), and the like, or alternatively may be provided in the form of a fusion protein, e.g., with foreign protein amino acids at the amino and/or carboxy terminus. Fully conventional methods for stimulating the production of antibodies upon administration of the proteins or fragments of this invention are well known; similarly, passive immunization techniques involving administration of antibodies per se, e.g., anti-ZPA antibodies, anti-ZPB antibodies, or anti-ZPC antibodies, to the zona pellucida proteins or fragments of this invention is also within the scope of the invention. For details, see Dean, PCT Application WO90/15624 whose disclosure is entirely incorporated by reference herein.
Thus, to induce permanent sterility in a dog, recombinant anine ZPC may be employed which is expressed as a bacterial fusion protein
(or conjugated to immune enhancing agents) wherein active canine ZPC protein is conserved and available for interaction with antigen presenting cells. The expressed protein is then administered to a host dog and induces an autoimmune response in which generated antibodies recognize canine zona pellucida protein C. This autoimmune effect, which specifically recognizes dog ZPC protein or its aggregates, induces permanent sterility in the vaccinated dog, which sterility is associated with a loss of oocytes from the dog's ovary.
Alternately, a non-homologous species ZPC, such as recombinant porcine ZPC or peptides thereof which are cross-reactive with canine ZPC, can be administered to a dog to achieve similar sterilizing effects. The sterilizing effect, however, is only realized when antibodies capable of recognizing the host's own native zona pellucida are induced (or administered in the context of passive immunization). In an alternative embodiment of the present invention, the administration of a host species' own A and/or B class zona pellucida protein, or a related A and/or B protein from another species which induce antibodies against the host's ZPA and/or ZPB proteins results in an infertility effect which is distinct from that produced by ZPC class antigens. The physiological effect of vaccination with the ZPA and ZPB proteins is a transient one. "Transient infertility" is herein defined as infertility which is maintained when antibodies against self-zona pellucida proteins are sustained in the host animal's circulation at a contraceptively effective concentration (e.g., at titers of approximately 1:250 in the dog) and which infertility is diminished when antibodies against self fall below a contraceptively effective lower limit. The reduction in antibodies against self-zona pellucida results in restoration of fertility without evidence of major physiological changes in the ovary. Typically, the reduction in antibody titers occur by natural processes in the mammalian host, but other methods of reducing antibody titers are within the scope of the invention.
Contraceptively effective antibody titers against self zona pellucida proteins A and B required to maintain infertility will vary with the species of vaccinated animal as well as with the species of recombinant ZPA or ZPB peptide administered, but may readily be determined, for example, by testing a panel of the desired animal species with varying doses of the specific antigen, measuring the induced titer of anti-self antibodies by known ELISA techniques, and correlating the titers with reproductive indicators, e.g., cycling, hormone levels, and the like. In general, antibody titers greater than 1:250 are contraceptively effective. Based on amino acid sequence homologies, it is expected that all zona pellucida proteins of a particular class contain functional epitopes which are cross-reactive between mammalian species. However, absent characterization and identification of such functional cross-reactive epitopes, a preferred, selective contraceptive agent is a homologous species zona pellucida protein or antibody thereto.
The present invention will be more completely understood upon consideration of the following illustrative examples of the practice thereof wherein: Example 1 addresses the isolation of DNAs encoding porcine species ZPA, ZPB and ZPC: Example 2 relates to isolation of rabbit ZPC DNA; Example 3 relates to isolation of DNAs encoding canine ZPA and ZPC; Example 4 addresses isolation of feline DNAs encoding ZPA, ZPB and ZPC; Example 5 relates to cloning and isolation of DNAs encoding bovine species ZPA, ZPB and ZPC; Examples 6 and 7 describe immunocontraceptive treatment of dogs with naturally-derived porcine zona pellucida proteins; Example 8 relates to serochemical studies on animals treated in Examples 6 and 7; and Examples 9 and 10 address recombinant production of a canine ZPC fusion protein and its immunocontraceptive use in dogs. Example 11 relates to the isolation of DNAs encoding human ZPA and ZPB by methods described herein. Example 12 relates to the isolation and sequencing of DNAs encoding cynomolgus monkey ZPA, ZPB and ZPC. Examples 13-15 relate
to the comparison of the DNA sequence and the deduced amino acid sequence of mammalian ZPA, ZPB, and ZPC, respectively. Example 16 relates to the immunization of cynomolgus monkey using HSPZ and fractionated HZPC. Example 17 relates to the mapping of mammalian zona pellucida protein epitopes. Example 18 describes the immunization of dogs using recombinant ZPC proteins. Example 19 relates to the vaccination of cows and cats with recombinant ZP proteins.
Example 1 Isolation of DNA Sequences Encoding Porcine Zona Pellucida Proteins ZPA, ZPB, and ZPC.
A cDNA library in λgtl 1 was commercially prepared by Clone
Tech, Palo Alto, CA, from an ovary isolated from a 14 week old pig and was screened using an anti-ZP33 antibody obtained from E.C. Yurewicz and described in Keenan et al , Biol. Reprod. , 44:150-156 (1991). Eight candidate clones were identified.
A degenerate DNA oligonucleotide probe (19bps) was constructed to represent all possible sequences of a short portion of the
N-terminus porcine ZP3 5 as described in Yurewicz et al , J. Biol. Chem. ,
262:564-571 , (1987). The degenerate probe sequence is set out in SEQ ID NO. 28.
Southern analysis of the eight candidate clones isolated by expression screening with the degenerate DNA oligonucleotide probe resulted in hybridization with two of the eight candidates. The two clones recognized by the degenerate probe were then subcloned into the pBS KS plasmid (STRATAGENE Cloning Systems, La Jolla, CA) for sequence analysis using the sequence enzyme and the protocol described in the SEQUENASE® Manual (U.S. Biochemical, Cleveland, OH). One of the clones, B-8, having an insert size of approximately 1200 base pairs, included a sequence homologous to the
N-terminal sequence of mouse ZP3, previously identified by Ringuette et al. , Dev. Biol. , 127:287-295, (1988). The remaining clone, B-6, had an insert size of approximately 1000 base pairs. Neither hybridizing clone contained the C-terminal portion of the gene, as suggested by the lack of homology to the mouse ZP3 gene in this region.
The 14-week porcine ovarian library was then rescreened by DNA hybridization. Approximately 150,000 PFUs were plated on agar plates with E. coli Y1090. After overnight incubation at 37°C, nylon membrane lifts of plaques were prepared and screened using the B6 and B8 clones derived above isolated by screening with the degenerate oligonucleotide probe set out in SEQ ID NO. 28.
Filters were prehybridized in a solution containing 5X saline, sodium phosphate, EDTA buffer (SSPE), 5X Denhardt's Reagent, lOOμg/ml salmon sperm DNA, 30% formamide and 0.5% SDS for three hours at 42°C. Approximately 50 ml of the prehybridization solution was used for 12 filters (132 mm). After prehybridization, 10 ng of freshly radiolabeled DNA probe in 30% formamide, 5X SSPE was added. The probes were heat denatured at 95°C for 3-5 minutes and hybridization with the DNA probes continued overnight at 42°C. The hybridized filters were then washed twice with 100 ml of 5X SSPE at 55°C, for approximately one hour each wash. The filters were then rinsed with 250 ml of 5X SSPE at room temperature and allowed to air dry. The dried filters were exposed to x-ray film at -70°C using intensifier screens for at least eight hours and the films were developed for visual analysis. Among the additional clones isolated were two clones including the C-terminal portion of the porcine ZP3/3 gene. One clone, X5-1 , was subcloned into plasmid pBS KS and sequenced. This plasmid, termed pZ57, contained a ZP DNA insert having 1266 base pairs and appeared to encode the full length amino acid sequence of porcine ZP33 as compared with known mouse ZP3. Alignment of the deduced amino acid sequence of the clone with
the known N-terminal amino acid sequence of ZP33 reported by Yurewicz et al, J. Biol. Chem., 262:564-571 (1987), and an internal peptide sequence of ZP3/S corresponding to amino acids 255-27 A as provided by E.C. Yurewicz confirmed the identity of this clone as encoding porcine ZP35. The DNA sequence of this clone, termed porcine ZPC, is set out in SEQ ID NO. 5 and its deduced amino acid sequence is set out in SEQ ID NO. 6.
The 14-week porcine ovarian cDNA library was further screened using rabbit zona pellucida re 55 cDNA as a probe [described in Schwoebel et al , J. Biol. Chem, 266:7214-7219, (1991)].
One candidate clone of approximately 1700 base pairs, λ2-l, was isolated and was transferred into the sequencing plasmid pBS KS. The
DNA sequence and deduced amino acid sequence of the porcine DNA insert was determined using the method described in the SEQUENASE® manual (US Biochemical Corporation, Cleveland, Ohio). The sequenced clone contained
1620 base pairs and included a full length copy of the porcine ZP3α gene as confirmed by alignment of the deduced amino acid sequence with portions of the known protein sequence of porcine ZP3α* provided by E.C. Yurewicz between amino acids 206-222, 271-279, and 328- 344. The DNA sequence of this clone, termed porcine ZPB, is set out in SEQ ID NO. 3. Its deduced amino acid set out in SEQ ID NO. 4.
The 14-week porcine ovarian library was further screened using the procedure described above and using a DNA probe encoding canine ZPA protein (as obtained in Example 3 below, SEQ ID NO. 9). A single clone, λ3-5 having approximately 1300 base pairs, was obtained representing the N-terminal 60% of the theoretical porcine ZPA gene as estimated by the size of the clone in relation to the ZP2 gene isolated from mouse by Liang et al. , Mol. Cell. Biol. 10: 1507-1515 (1990), and rabbit by Dunbar, U.S. Patent No. 4,996,297, and dog (see Example 3 below).
This clone was then used to rescreen the porcine ovarian library. Three additional clones were obtained, two small clones and one clone large enough to contain the full length sequence. The large candidate clone, λB, having approximately 2200 base pairs, was sequenced, and the data showed this ZPA clone to lack only approximately seven base pairs of the full length sequence including the ATG start codon when aligned with the mouse ZP2 gene and the canine ZPA gene described in Example 3. The DNA sequence of this clone, termed porcine ZPA, is set out in SEQ ID NO. 1. Its deduced amino acid sequence is set out in SEQ ID NO. 2. This isolated porcine clone included sequences corresponding to published sequences of three identified porcine zona pellucida proteins, ZP1 (80kD), ZP2 (62kD) as disclosed in U.S. Patent No. 4,996,297 to Dunbar and ZP4 (21kD) as disclosed by Hasegawa et al , Abst. No. 382, Meeting Soc. Study Reprod. July, 1991. These results suggest that a singular clone encodes one zona pellucida protein which previously had been thought to exist as three separate proteins, i.e., ZP1 , ZP2, and ZP4. This further suggests that only three major porcine zona pellucida genes encode three major zona pellucida proteins which here are termed ZPA, ZPB, and ZPC. ZPA includes those proteins previously identified as ZP1, ZP2, and ZP4. ZPB corresponds to ZP3α; and ZPC corresponds to previously identified ZP30. Yurewicz et al. J. Biol. Chem. , 262:564-571, (1987).
Example 2
Isolation and Purification of DNA Sequences
Encoding Rabbit ZPC Protein
Ovaries were removed from five week old rabbits and mRNA was prepared using the Fast Track™ mRNA isolation kit in accordance with the procedure described in the Fast Track™ instruction manual, version 3.1, catalog No. K1593-02 (Invitrogen, San Diego, CA). A Lambda Librarian™
kit (Invitrogen, San Diego, CA) was used to prepare cDNA and to clone cDNAs into λgtlO according to the manufacturer's instructions. Approximately 150,000 PFUs were plated on agar plates with E. coli Y1090. After overnight incubation at 37° C, nylon membrane lifts of colonies were prepared and screened with a porcine ZPC DNA probe using the screening procedures described for Example 1. The probe used was the porcine ZPC sequence as set out in SEQ ID NO. 5.
Two positive clones, λR4 and λR5, hybridized with the porcine ZPC DNA. The size of each of these clones as estimated in agarose gels was approximately 1300 base pairs. Both λR4 and λR5 were sequenced as described for Example 1. The sequences were identical except that λR5 contained four additional nucleotides at the 5 ' end. The determined DNA sequence was approximately 75 % homologous to the DNA sequence encoding porcine ZPC. The DNA sequence encoding rabbit ZPC protein is set out in
SEQ ID NO. 7. Its deduced amino acid sequence is set out in SEQ ID NO. 8.
Rabbit ZPA and ZPB proteins have been previously identified by Dunbar in U.S. Patent No. 4,996,297 as P2 and P3, respectively.
Example 3
Isolation of DNA Sequences Encoding Canine Zona Pellucida Proteins ZPA and ZPC
A 16 week canine ovarian cDNA expression library was commercially prepared by Clone Tech, Palo Alto, CA, in λgtl l generally following the methods described in Example 1. The canine ovarian cDNA library was screened using antibodies raised against heat solubilized canine zona pellucida. Heat solubilized canine zona pellucida (HSDZ) was prepared generally following the procedures described in Dunbar et al. Biochemistry,
19:356-365, (1980) except ganged razor blades were used to mince the ovaries.
Rabbits were immunized with 250 μg HSDZ and 250 μg MDP. Two additional boosts followed at approximately three week intervals. The resultant rabbit serum was used to screen the canine ovarian cDNA expression library. Seven candidate clones were obtained. Cross-hybridization experiments were performed by Southern blot analysis as follows. The largest clone, X26-1 , having approximately 1300 base pairs, was first used as a probe against all of the other clones in Southern blots. Three other clones were identified. The largest of the remaining clones, X20-1 and X7-1 , having approximately 800 and 1000 base pairs respectively, were then used as probes in Southern blots. These probes identified no additional clones. This cross hybridization analysis of the seven candidate clones to each other indicated that four of these clones were related, e.g. four clones hybridized to X26-1 while the remaining three X20-1, X7-1, and X19-3 were independent.
The largest of the four related clones, X26-1 , was subcloned into pBS KS plasmid for sequence analysis according to the procedure described in Example 1. The analyzed sequence demonstrated the presence of a long open reading frame of 1278 base pairs encoding a protein of approximately 426 amino acids. Comparison of the deduced amino acid sequence of this clone with the sequences of known zona pellucida proteins, indicated this clone encoded a protein related to mouse ZP3 (ZPC) as reported by Ringuette et al , Dev. Biol. 127:287-295 (1988), hamster ZP3 as reported by Kinloch et al , Dev. Biol , 142:414-421 (1990), human ZP3 as reported by Chamberlin et al , Proc. Natl Acad. Sci. USA 87:6014- 6018 (1990) and porcine ZPC protein (see Example 1). The DNA sequence of this clone, termed canine ZPC, is set out in SEQ ID NO. 11. Its deduced amino acid sequence is set out in SEQ ID NO. 12.
The remaining three independent candidate clones were subcloned into the pBS KS plasmid for sequence analysis as described above.
The determined sequence of the 800 base pair clone, X20-1, was compared with known ZP sequences by computer analysis as described above and was found to be related to the mouse ZP2 (ZPA) [Liang et al. , Mol. Cell. Biol 10:1507-1515 (1990)] and porcine ZPA (see Example 1). The 800 base pair fragment from X20-1, was then used as a hybridization probe to rescreen the canine cDNA library. Two additional candidate clones were identified, the larger of which, X7A, having approximately 2800 base pairs, was subcloned into pBS KS plasmid for sequence analysis. Comparison of this sequence with known sequences encoding zona pellucida proteins suggested the candidate clone X7A contained a full length ZPA sequence, but an incorrect N-terminal sequence, e.g., the clone contained an additional 600 base pairs as determined by alignment with known mouse ZP2 and rabbit ZPA sequences referenced in Example 1. The second candidate clone, X9-2, having approximately 1000 base pairs, was then subcloned into the plasmid pBS KS and sequenced. The sequence of the second clone indicated the presence of a correct N-terminal sequence, but included only approximately the N-terminal 40% of the full length clone as determined by alignment with the mouse ZP2 and rabbit ZPA genes. Overlap of the two cDNA clones, however, provided the full length sequence. The appropriate pieces of each clone were subcloned as follows to generate the correct full length zona pellucida clone containing a 2028 base pair open reading frame encoding a protein of approximately 676 amino acids. The X7A DNA was digested with Eco RI to yield two insert fragments (2000 bps and 800 bps). These two fragments were each subcloned into pBS KS yielding pZ36 and pZ37, respectively. Plasmid pZ37 carried the C-terminal portion of this sequence. The X9-2 DNA insert was removed from the X vector and subcloned into pBS KS to yield pZ38. Plasmid pZ36 was digested with Hind III to remove approximately 1350 bps of the N-terminal portion of the X7A gene fragment (about 850 bps of nonsense DNA and 500 bps of coding sequence). This digestion also removed one of the Eco RI insert ends
and left a single Eco RI site. The pZ37 Eco RI insert was then moved into the single remaining Eco RI site in the modified pZ36 (pZ36 Δl) to reestablish the relative DNA structure orientation that existed in the X7A insert (1450/2800 bps). This combined plasmid was then opened with Hind III and the Hind III fragment from pZ38 carrying the N-terminal ZP DNA sequence was inserted to create plasmid pZ39 which is a pBS KS carrying the full length canine ZPA sequence. The DNA sequence of this canine ZPA gene is set out in SEQ ID NO. 9. Its deduced amino acid sequence set out in SEQ ID NO. 10.
Example 4
Isolation of DNA Sequences Encoding Feline Zona Pellucida Proteins ZPA, ZPB, and ZPC
Ovaries were isolated from five cats approximately three to four months in age. Messenger RNA was isolated from six ovaries using the Fast Track™ mRNA Isolation Kit (Invitrogen, San Diego, CA, Catalog No.
K1593-02) using the protocol provided with the kit. cDNA was prepared using the protocol and cloned into XgtlO as described in Example 2.
Approximately 150,000 plaque forming units (PFUs) were plated on agar plates with E. coli Y1090. After overnight incubation at 37°C, nylon transfer membranes were used to prepare and screen plaque lifts.
Plaques were screened using a mixture of DNA probes in equal proportions encoding porcine ZPA, ZPB, and ZPC proteins and using the hybridization procedure as described for Example 2. A total of 81 positive clones were identified. Twelve of these clones were plaque-purified. Southern analysis of these clones using porcine ZPA, ZPB, and ZPC DNAs individually as probes indicated that seven of these clones encoded ZPC proteins and one clone encoded a ZPA protein. Four of the clones contained inserts which could not be separated by Eco RI digestion
Five of the ZPC clones were between 1200-1350 base pairs in length. One clone, XC-112, having approximately 1350 base pairs was subjected to sequence analysis as described above and its deduced amino acid sequence was found to be approximately 70% homologous to the canine ZPC protein obtained in Example 3. The DNA sequence of this feline ZPC clone is set out in SEQ ID NO. 17. Its deduced amino acid sequence is set out in SEQ ID NO. 18.
The single feline ZPA clone, XC-116, was sequenced and found to be approximately 2215 base pairs in length. The deduced amino acid sequence was approximately 75% homologous to the canine ZPA protein characterized in Example 5. The DNA sequence of this feline ZPA clone is set out in SEQ ID NO. 13. Its deduced amino acid sequence is set out in SEQ ID NO. 14.
The remaining 69 positive clones were rescreened using porcine ZPB DNA as a probe (SEQ ID NO. 3). Ten positive clones were obtained. The largest clone, XC-1, contained approximately 1.7 kilobases as determined by agarose gel electrophoresis. This clone was sequenced, and its deduced amino acid sequence was found to be approximately 80% homologous to the porcine ZPB protein described in Example 1. The DNA sequence of this feline ZPB clone is set out in SEQ ID NO. 15. Its deduced amino acid sequence is set out in SEQ ID NO. 16.
Example 5 Isolation of DNA Sequences Encoding Bovine Zona PelJucida-Proteins ZPA, ZPB, and ZPC
A cDNA library was constructed from a five month bovine ovary by the method described in Example 2. The bovine ovarian library was screened with DNA hybridization probes representing each of the classes of zona pellucida proteins using a mixture of equal proportions of porcine
DNA probes encoding ZPA (SEQ ID NO. 1), ZPB (SEQ ID NO. 3), and ZPC (SEQ ID NO. 5) proteins, as described for Example 2 and using the procedures described for Example 1. Initial screening yielded three candidate clones. Southern analysis of these clones with individual porcine ZPA, ZPB, and ZPC DNA probes used in the initial screening indicated that one of the clones, XB2, having approximately 650 base pairs, encoded ZPA. A second clone, XB-1 having approximately 1000 base pairs encoded ZPB. A third clone, XB14, having approximately 1200 base pairs, encoded ZPC.
The bovine ovarian library was then rescreened with the mixed porcine ZP DNA probes. Two additional clones were obtained and identified by Southern analysis as encoding ZPC.
The Eco RI inserts of the ZPA, ZPB, and largest ZPC clone were subcloned and their DNA sequences analyzed. The sequences encoding these bovine ZPA, ZPB and ZPC fragments were set out in SEQ ID NOS. 19, 21, and 23, respectively. Their deduced amino acid sequences are set out in
SEQ ID NOS. 20, 22, and 24, respectively.
Example 6
Immunization of Dogs with Heat-Solubilized Fractionated
Porcine Zona Pellucida
Heat-solubilized, porcine zona pellucida (HSPZ) was prepared generally following the procedures described by Dunbar et al. Biochemistry, 19:356-365, (1980) but using a hand powered meat grinder instead of the Zonamatic described. Following isolation, the zona pellucida protein was solubilized in 0.1 M sodium carbonate buffer, pH 9.6, and was dialyzed extensively against 6M urea. The resultant solution, a volume of 2-3ml containing approximately 12μg of HSPZ, was subjected to isoelectric-focusing in a BIORAD Rotofor isoelectric-focusing chamber as follows. An isoelectric gradient was established using 1 % ampholytes having a pi range of 3-10. The
zona pellucida protein was introduced into the mid-range chamber (pi 7.0) and allowed to focus for approximately four hours at 4°C or until the voltage stabilized.
Twenty isoelectrically focused fractions were collected and analyzed by SDS PAGE and Western blot analysis for pig zona pellucida proteins. Acidic fractions having a pi range of approximately 3.5-5.5 and which contained the porcine zona pellucida proteins were combined. The fractions were dialyzed into 0.1M carbonate buffer, pH 9.6 and concentrated to approximately 3mg/ml. This antigenic preparation was used to vaccinate animals as described below. Analysis of this antigenic preparation by two- dimensional gel electrophoresis indicated the presence of ZPA and ZPB protein. However, ZPC was not revealed to be present in this preparation. The HSPZ antigenic preparation was added to a 50/50 water oil emulsion with incomplete Freund's adjuvant (Sigma, St. Louis, MO) containing 250μg of MDP per dose. One ml of the 50/50 water oil emulsion contained 0.425 ml paraffin oil, 0.075 ml mannide monooleate, and 0.5 ml PBS containing 250 μg threonyl-MDP (SYNTEX Corporation) and the amount of HSPZ described in Table 3 below.
Four random breed dogs aged 10-12 weeks were immunized with HSPZ using the regimen described in Table 2.
The antisera produced by these animals was monitored via ELISA methodology. By week 17 antibody titers against self, e.g. against canine zona pellucida proteins, had reached a maximum (8-16K by ELISA) and thereafter began to drop. At week 36, one animal was unilaterally ovariectomized and the removed ovary was sectioned and stained with periodic acid schiff stain (PAS) for histological examination. The ovary appeared normal, as evidenced by the presence of follicles in all stages of development. At week 52, two of the four test animals were observed to exhibit estrus behavior. The remaining two test animals exhibited estrus behavior at approximately one and a half years when the first two test animals experienced their second heat. All test animals were bred repeatedly with competent males and by artificial insemination, however, none became pregnant. During this same period, animals in various test regimens in which no self titers were obtained, as described in Example 10, became pregnant when presented with the same males or artificial insemination techniques.
Two weeks following the breeding sessions, e.g. at 54 weeks, the two early cycling animals were unilaterally ovariectomized and the removed ovaries were sectioned for histological examination. The ovaries appeared normal for this stage of follicular activity despite the functional infertility demonstrated.
Example 7 Vaccination With Porcine ZPC Protein
A purified porcine ZPC protein (ZP3/3) was obtained from E. Yurewicz, prepared as described in J. Biol Chem. , 262:564-571, (1987).
Vaccines were prepared by adding 167μg purified porcine ZPC protein (ZP30) to a 50/50 water-oil emulsion with complete Freund's adjuvant (Sigma No. F5881, St. Louis MO), for the priming dose or with Incomplete
Freund's Adjuvant (Sigma No. F5506, St. Louis, MO) containing MDP as described in Example 6 for the booster doses.
Five random breed dogs of approximately 10-12 weeks of age were injected with the ZPC vaccine preparation described above using the regimen described in Table 3.
Each animal's antibody titer versus self- zona proteins, e.g., versus canine zona pellucida proteins, was monitored by ELISA, using the method described in Dunbar, Two Dimensional Gel Electrophoresis and Immunological Techniques, 1987. ELISA microtiter plates were coated with HSDZ in antigen-coating buffer (0.1M sodium carbonate, pH 9.6). Biotinylated rabbit-antidog IgG was used as the second antibody. ABC reagent (Avidin-biotinylated peroxidase complex) and O-phenylene diamine dihydrochloride with a peroxide substrate was used for visualization. Only two animals produced antibodies versus self achieving peak self-antibody titers of 16K by week 4. The other three animals produced no self-antibody titers but achieved peak antibody titers of 4K against porcine zona pellucida protein. During the period of time between week 20 and week 36, all dogs were observed to exhibit estrous behavior. The animals were bred repeatedly with proven males. Only the two animals having antibody titers versus self zona pellucida proteins remained infertile. All other animals in the study became pregnant.
Two weeks after estrous and breeding the two infertile dogs exhibiting self-antibody titers were unilaterally ovariectomized and the removed ovaries were sectioned and stained with PAS for histological examination. The histological examination revealed abnormal morphology in the ovaries of the infertile dogs. No evidence of ongoing folliculogenesis was seen and the ovaries were depleted of oocyte-containing follicles. In addition, no primordial oocytes were seen.
Example 8 Western Analysis of Antisera Produced by Vaccinated Animals
In an attempt to better understand the immune response and different physiological effects obtained in the two studies described in Examples 6 and 7, antisera produced in each test group was analyzed by Western Analysis against a variety of antigens including natural porcine ZPC, heat-solubilized dog zona pellucida (HSDZ), recombinant dog ZPA and ZPC, and recombinant pig ZPC. Western blots were probed with antiserum obtained from the test animals of Example 6, e.g., animals immunized with isoelectric focused, heat-solubilized porcine zona pellucida, and with antiserum obtained from the two test animals of Example 7 which contained antibodies against self-zona proteins.
The data demonstrate no recognition of recombinant porcine or canine ZPC by antisera from infertile, but cycling dogs immunized with heat solubilized porcine zona pellucida which contained no demonstrable ZPC by PAGE analysis, however, natural ZPC, HSDZ and recombinant canine ZPA were recognized. In contrast, antisera obtained from infertile dogs whose ovaries were depleted of oocytes recognized recombinant ZPC protein, i.e. , the polypeptide backbone.
A key difference in the antibody recognition of antigen was that only the antisera obtained from dogs having ovaries devoid of oocytes appeared to recognize the recombinant dog ZPC antigen. Infertile dogs whose antisera strongly recognized natural ZPC, HSDZ, and recombinant dog ZPA demonstrated no recognition of recombinant dog ZPC.
Given that autoimmunity is essential for a contraceptive effect, these data suggest that infertility without histologically evident ovarian dysfunction can be obtained in dogs via an autoimmune response against dog ZPA antigens. In contrast, histologically confirmed ovarian dysfunction, i.e., loss of oocytes, which would result in permanent sterility, requires the generation of antibodies which specifically recognize homologous species ZPC protein.
Example 9 Expression of Recombinant ZP Proteins
I. Construction of Expression Vectors
The plasmid vector pZ90 shown in Fig. 1 was constructed from fragments of the plasmids pUC9 (Vierra & Messing, Gene 19:259-268 (1982)) and p ?gal2 (Queen, J. Mol. App. Gen. 2: 1-10 (1983)). The single Pvu II restriction site present in p/3gal2 was converted to a Sal I site using a Sal I polylinker adaptor purchased from New England Biolabs. The DNA sequences between the new Sal I site and a pre-existing Sal I site were excised by digestion with Sal I, religated and screened for the reduced size plasmid. A Cla 1 - Nde I fragment of the modified p/?ga!2 plasmid which carried the XCI repressor gene, the XpR promoter and the Lac Z gene δ-galactosidase) was inserted into pUC9 between its Ace I and Nde I restriction sites. The pUC9 plasmid carries the ampicillin resistance (AmpR) gene and col El replication origin (ori) needed to maintain the plasmid in E. coli cells. The combination plasmid was further modified to convert the Bam
HI site 3' of the ATG initiation codon (ATG GAT CCN) to a Bgl II site 5' of the ATG initiation codon (AGATCTATG). This was accomplished by partially digesting the plasmid with Rsa I. One of the several digestion points was about 20 bps 5' of the Bam HI restriction site. When the partially digested plasmid was digested with Bam HI, some of the plasmids produced were nearly full length. A synthetic oligomer (GTACTAAGGAAGATCTATGGATCC) (SEQ ID NO.29) was produced to replace the sequence that had been removed (GTACTA AGG AGGTTGTATGG ATCC) (SEQ ID NO.30). The net effect of this replacement was the substitution of 3 bps to create the Bgl II restriction site. A DNA fragment containing approximately 3000 base pairs of the Lac Z gene was then excised by restriction digestion with Bgl I and Ban II and was followed by insertion of a synthetic oligomer containing a Bam HI site. The plasmid was cut with Bgl I and Ban II, and then treated with nuclease SI to create blunt ends. A Bam HI linker (New England Biolabs) was inserted at the blunt ends of the digested plasmid. Next a Pvu II restriction site between the XCI repressor gene and the ori sequence was converted to a Hind III site using a synthetic linker. The Pvu II restriction site was cut with Pvu II, and a Hind III linker (New England Biolabs) was ligated to the blunted ends. Because the remaining lac Z sequence was missing the first 8 codons of the natural sequence, these 8 codons were replaced by synthesizing a synthetic oligomer that began with a Bgl II site and encoded the lac Z wild type gene product (βgal) N-terminal sequence.
The synthetic oligomer was prepared by synthesizing four oligomers having the sequences set out in SEQ ID NO.31 (oligomer 1), SEQ ID NO.32 (oligomer 2), SEQ ID NO.33 (oligomer 3), and SEQ ID NO.34 (Oligomer 4). Oligomers 2 and 3 were phosphorylated by treating with kinase and ATP to add phosphate to the 5' end. Oligomers 1 and 2 were then hybridized to oligomers 3 and 4, respectively, by incubation at 100°C followed by a slow cooling in 200μM NaCI. The resultant oligomer had the sequence
set out in SEQ ID NO. 35. The synthetic oligomer as set out in SEQ ID NO. 35 had Bgl II-Pvu II ends and was substituted for the Bgl II-Pvu II sequence of the plasmid by restriction digestion of the plasmid and ligation with the oligomer. The resultant plasmid was termed pZ90 and is shown in Figure
1. The plasmid pZ90 can be used to express recombinant proteins by heat induction, using the heat labile λCI repressor. The heat-inducible repressor and promoter of pZ90 was next replaced with the chemically inducible promoter ptac (A ann et al , Gene 25: 167-178 (1983)). The ptac promoter is controlled by the lac repressor, a product of the lac I gene (Farabaugh, Nature 279:765-769 (1978)). The Lac I gene was obtained from pMC9 (Miller et al , The EMBO Journal 3:3117-3121 (1984)) by use of PCR methodology as described by Innis and Gelfand, In: PCR Protocols: A Guide to Methods and Applications, Innis, M.A., Gelfand, D.H. , Sninsky, J.J. and White, T.J. (eds)., pgs 1-12, Academic Press, Inc., San Diego, CA. The primers used were complimentary to the Lac I promoter at one end and the Lac I gene termination codon at the opposite end. The N-terminal primer carried a Hind III site and the C-terminal primer carried a tac promoter sequence followed by a Bgl II site. The N-terminal primer had the sequence set out in SEQ ID NO. 36. The C-terminal primer had the sequence as set out in SEQ ID NO. 37 which includes a Dra 3 site having the sequence 5'- CACAATGTG-3'. The resulting lac I - ptac DNA fragment having Hind III and Bgl II restriction sites at its respective ends was then used to replace the Hind III - Bgl II fragment of pZ90 which carried the λCI repressor and λpR promotor. This replacement yielded the plasmid pZ98 shown in Fig. 2.
II. Insertion of Recombinant ZP DNA DNA sequences encoding porcine ZPC were prepared by the PCR procedures described above (Innis & Gelfand) from the plasmid pZ57 prepared in Example 1 , which contains the full length porcine ZPC sequence
obtained from λgtl l clone 5-1 described for Example 1. During the PCR procedure the porcine ZPC gene was modified by using primers that did not include the leader sequence and the hydrophobic tail. The N-terminal primer used had the sequence set out in SEQ ID NO. 38 which included an internal Bam HI restriction site having the sequence 5 '-GGATCC-3 '. The C-terminal primer used had the sequence as set in SEQ ID NO. 39 includes a Sal I restriction site having the sequence 5 '-CTCGAG-3 ' and an internal Xho I restriction site having the sequence 5'-CTCGAG-3'. The modified ZPC gene contained base pairs 105 to 1154 encoding ZPC amino acids 1-350. To the 5 ' end of the modified porcine ZPC gene was added a
Bam HI restriction site, and to the 3 ' end was added an Xho I site, a Hexa-CAT-codon sequence (CAT)6, a termination codon, and a Sal I restriction site. This modified porcine ZPC gene was inserted into the Bam HI - Sal I restriction site of pZ98 to yield the porcine ZPC expression vector, plasmid pZ156 shown in Fig. 3. The (CAT)6 sequence produces a C-terminal hexahistidine (His6) amino acid sequence in the recombinant fusion protein which permits purification of the fusion protein by immobilized metal in affinity chromatography.
In a similar manner as described above, the plasmid pZ156 when digested with Bam HI and Xho I, may be used to receive any other recombinant ZP gene or gene fragment for expression as a βgal fusion protein which can be purified by metal ion affinity chromatography.
III. Expression of Porcine ZPC Fusion Protein in E. coli
The expression vector pZ156 (Fig. 3) was transformed into E. coli strain Top 10F' (Invitrogen. San Diego, CA) by the procedure of Chung et αl , Proc. Nαtl Acαd. Sci. USA 86: 2172-2175 (1989). The transformed
E. coli cell line was termed Strain ZI 156, and was used to express recombinant porcine ZPC-βgal fusion protein.
Bacterial cultures of ZI 156 were grown in Luria Broth (LB) containing 100 μg/ml ampicillin at 30°C until the cell density reached an OD600 of approximately 1.5. Isopropyl beta-D-thiogalactopyranoside (IPTG) (3ml of lOOmM solution/ 1 media) was added to induce expression from the tac promoter, and the cells were further incubated at 30°C for 2-3 hours. The cells were harvested by centrifugation, and the resulting cell pellet was frozen at -70°C
The frozen cell pellets were suspended in 10 mM EDTA
(lg/2-2.5 ml) and twice sonicated at 50% power for 3 minutes, cooling in an ice bath between each sonication. The cell lysate was then centrifuged at
3300 x g for one hour and the hard pellet was retained. This lysis procedure was repeated using the hard pellets.
In order to remove residual EDTA, the final hard cellular pellet was dispersed in a small volume of water by a brief burst of sonication, the suspension was centrifuged, and the supernatant discarded. The washed pellet was thoroughly resuspended in Buffer A, (6M guanidine hydrochloride (GuHCl), 100 mM Na H2PO4, 10 mM TRIS pH 8, at approximately 0.5 ml per original gram of cell pellet). The suspension was centrifuged at 10,000 x g for 45 seconds and the supernatant was retained while the pellet was discarded.
The retained supernatant was loaded onto a Ni column (in Buffer A) and the column was washed with 10 column volumes of Buffer A. The column was next washed with 5 volumes each Buffers B-D, each containing 8M urea, lOOmM NaH2PO4, and 10 mM TRIS, and having successively reduced pH values of 8, 6.3, 5.9 for Buffers B, C, and D, respectively. The recombinant pZPC-/Jgal fusion protein eluted with Buffer E, at pH 4.5 as shown by screening by Western Blot analysis using rabbit anti-HSDZ and anti-HSPZ as probes. Further elution may be accomplished using Buffer F (pH 2.5) (8M GuHCl, 200 mM Acetic Acid).
The fusion protein obtained by this protocol was prepared in its final dose for injection into a host animal by adjusting the final volume to 0.5 ml in 8M urea, and adding it to 0.5 ml adjuvant as described above. Each dose was injected subcutaneously into a test animal.
Example 10
Vaccination of Dogs with Recombinant ZPC-β gal Fusion Protein
Eleven mixed breed dogs approximately 5-6 months of age were randomly selected from test animals previously treated at approximately 2 months of age with heat solubilized porcine zona pellucida or chromatographically purified porcine ZP3/3 in combination with various biopolymers as adjuvants and drug releasing vehicles. Six weeks post first injection, i.e., three and a half months of age, all test animals had achieved antibody titers versus HSPZ in the range of 2-16K as determined by ELISA. However, none of the test animals achieved antibody titers against self- antigen, e. g., HSDZ.
At 5-6 months of age, five of the test animals were then injected with a loading dose of the porcine ZPC-β gal fusion protein prepared as described for Example 9. The recombinant ZPC-β gal fusion protein produced in Example 9 was adjusted to the desired dose in a final volume of 0.5ml 8M urea and combined with 0.5 ml adjuvant. The adjuvant, N-acetyl-D-glucosaminyl-/3(l ,4)-N-acetyl muramyl-L-alanyl-D-isoglutamine (GMDP), 250μg, was dispersed in 0.42 ml mineral oil, 0.157 ml L-121 block polymers, and 0.02 ml Tween 80. Each dose was injected subcutaneously into the five test animals. The remaining 6 animals were maintained as controls.
Following a total of four injections given at 2-3 week intervals, antibody titers versus self antigen, e.g., HSDZ, were obtained in all test animals, with peaks in the range of 2-8 K as measured by ELISA.
Some of the control animals began to cycle beginning at approximately 9 months of age, and by 11 months of age, 4 of 6 control animals had experienced their first estrus. In contrast, none of the 5 test animals which had received recombinant ZPC-β gal fusion protein had cycled during this same time period. However, although the first estrus was delayed for several months in the test animals, they eventually began to cycle. Two of the five vaccinated dogs became pregnant during their second estrus after immunization while a third dog became pregnant during its third estrus after immunization; however, the two remaining test animals remain infertile through three estrus cycles and nearly two years after vaccination.
Example 11 Isolation of Human DNA Sequences Encoding Human
Zona Pellucida Proteins ZPA and ZPB
A human genomic DNA library purchased from Stratagene (catalog no. 946203) was used for the isolation of DNA sequences encoding human ZP proteins. The library consisted of 9-23 kb inserts of human DNA (from placenta tissue of a male Caucasian) cloned into the Lambda Fix™II vector (Stratagene). Approximately 40,000 pfus were plated on E. coli strain LE 392 (Stratagene, catalog no. 200266), as described in the Stratagene protocol, but replacing MgSQ, with MgCl2. After overnight incubation, nylon membrane lifts of the plaques were prepared and screened with 3 P-labelled porcine ZPA cDNA (SEQ ID NO. 1) and with 32P-labelled porcine ZPB cDNA (SEQ ID NO. 3) as described in Example 2.
Three clones 1-1, 2-2, and 4-9 were shown to hybridize to the porcine ZPB cDNA (SEQ ID NO. 3). Clones 1-1 and 4-9 were deposited
with the American Type Culture Collection, (ATCC) 12301 Parklawn Drive, Rockville, Maryland, on January 27, 1993 under ATCC Accession Nos. 75406 and 75405, respectively. Human DNA inserts were isolated from these clones and analyzed by restriction endonuclease digestion with Eco RI and Southern blot analysis as described in Example 1. Table 4 shows the results of Eco RI digestion of these clones.
Table 4 HUMAN GENOMIC ZPB EcoRI INSERTS
Southern blot analysis revealed four Eco RI fragments which were judged to carry ZPB coding sequences based on hybridization to the porcine ZPB cDNA (SEQ ID NO. 3). Clone 1-1 DNA included a 2.2 kb, 2.0 kb, and 1.5 kb Eco RI fragments which so hybridized. Clone 2-2 DNA included a 2.8 kb Eco RI hybridizing fragment. Clone 4-9 DNA included a 2.8 kb and a 1.5 kb Eco RI fragment which hybridized to the porcine ZPB cDNA probe. All inserts additionally included a 3.2 kb non-hybridizing Eco RI fragment; inserts from clones 1-1 and 4-9 both provided 0.2 kb non- hybridizing fragments; and clone 1-1 additionally provided a 0.7 kb non- hybridizing fragment.
Further restriction analysis revealed the fragment alignment shown in Figure 4. Six of the fragments (A-F) were subcloned into pBSKS for sequence analysis, as described in Example 1. Preliminary sequence analysis confirmed the fragment alignment shown in Figure 4, and suggested that the complete coding sequence of the human ZPB gene may be from clones 1-1 and 4-9. This was confirmed by nucleotide sequence analysis of the inserts, and comparison of the sequences with the feline ZPB sequence (SEQ ID NO. 15) and porcine ZPB sequence (SEQ ID NO. 3). The DNA sequence and deduced amino acid sequences for human ZPB are set out as SEQ ID NO. 40 and 41 , respectively.
Clones hybridizing to the porcine ZPA cDNA (SEQ ID NO. 1) under the conditions described in Example 1 were also isolated. Two positive clones, Al and A4 were identified. The clones were deposited with the American Type Culture Collection, 12301 Parklawn Drive, Rockville, Maryland 20852, on January 27, 1993 under ATCC Accession Nos. 75404 and 75403 respectively. Southern blot analysis revealed that these clones contain all or part of the human ZPA gene. DNA was isolated from these clones and was analyzed by Bgl II, Hind III, and Not I restriction endonuclease digestion and Southern blot analysis as described in Example 1. The size of the Al clone DNA insert is approximately 11.6 kb, and that of the A4 clone is approximately 13.2 kb. Two of the Bgl II fragments which hybridized with the porcine ZPA cDNA (SEQ ID NO 1) were subcloned into pBSKS for sequence analysis, as described in Example 1. Sequence analysis revealed that Al and A4 collectively contain the human ZPA gene as supported by comparison to sequences with the porcine ZPA cDNA (SEQ ID NO. 1) and the canine ZPA cDNA (SEQ ID NO. 11). The complete DNA sequence and the deduced amino acid sequence are set out as SEQ ID NOS. 42 and 43, respectively.
Example 12
Isolation and Sequencing of DNA Encoding
Cynomolgus Monkey ZPA, ZPB, and ZPC
Cynomolgus monkey cDNA libraries were constructed in λgtlO as described below. Briefly, a set of ovaries were collected from two female cynomolgus monkeys aged 1.5 years and 2 years, and a second set from three females aged 3 years, 4 years, and 14 years of age. Messenger RNA was isolated using the Fast Track™ mRNA isolation kit following the manufacturer's instructions. The cDNA was prepared using the Lambda Librarian™ (Invitrogen, as described in Example 2) kit following the protocol provided with the kit. The cDNA was packaged into lambda phage heads using the Protoclone® (Promega, Madison, Wl) λgtlO EcoRI arms plus the Packagene® (Promega) lambda DNA packaging system following the manufacturer's instructions. This procedure generally produced libraries with a titer of greater than 1 x 106 pfu/ml. The monkey cDNA library was then screened using porcine ZPA, ZPB, and ZPC probes isolated from the porcine cDNA as described in Example 1. Screening was accomplished by preparing duplicate plaque lifts using Nytran® nylon filters (0.2μM pore size). The filters were prehybridized in a solution of 5x SSPE (43.83 g/1 of NaCI, 6.9 g/1 of NaH2PO4, H20, 1.85 g/1 of EDTA, pH 7.4), 5x Denhardts Reagent (1 g/1 of Ficoll [type 400], 1 g/1 of polyvinylpyrrolidone and 1 g/1 bovine serum albumin), lOOμg/ml sonicated, denatured salmon sperm testes DNA, 30% formamide, and 0.5% SDS, for 3 hrs. at 42°C. Radio-labelled probes were prepared using [ - 32P] -dATP and the Prime-a-Gene® (Promega) labelling system. After prehybridization, 10 ng of freshly radio-labelled probe was heat denatured at 95°C for 5 minutes in 50% formamide and 100 μg/ml sonicated, denatured salmon testes DNA, and was added to the filters. The hybridization was carried out at 42"C for 15-24 hours. The hybridized filters were then washed twice with 100 ml of 5X SSPE at 55°C, for approximately one hour
each wash. The filters were then rinsed in 250 ml of 5X SSPE at 55°C and allowed to air dry. The dried filters were exposed to x-ray film (Kodak XAR5, Eastman Kodak, Rochester NY ) at -70°C using two intensifying screens (Kodak X-OMATIC™) for at least eight hours. The film was then developed for visual analysis.
Exhaustive screening of the two cynomolgus monkey ovarian cDNA libraries using all of the porcine probes yielded a total of 12 candidate clones. Southern hybridization revealed that only one of these clones (λ CM 4-2) hybridized to the porcine ZPA probe. This clone contained an insert of 560 bp. Sequencing of the insert was performed using the Sequenase® Version 2 kit (U.S. Biochemicals, Cleveland, Ohio) according to the manufacturer's instructions. Sequencing revealed that the 560 bp insert was homologous to the 3' end of other mammalian ZPA genes. The 560 bp fragment represents just under 25% bp of the full-length sequence and contains an open reading frame of 492 bp which would encode a protein of 164 amino acids. The DNA sequence and the deduced amino acid sequence of the cynomolgus monkey ZPA cDNA is set out as SEQ ID NOS. 44 and 45, respectively.
Exhaustive screening of the cynomolgus monkey ovarian cDNA libraries with the porcine ZPB probe yielded a single ZPB candidate clone having an insert of 866 bp. Sequence analysis suggests that the insert includes the C-terminal 50% of the expected full-length sequence. The DNA sequence and deduced amino acid sequence of the monkey ZPB insert are set out as SEQ ID NOS. 46 and 47, respectively. Screening of monkey ovarian cDNA libraries with the porcine ZPC DNA probe yielded only partial ZPC clones, the largest (λ CMl-1) having an insert of approximately 1300 bp which contains just over 50% of the C-terminal portion of the full-length sequence based on comparison to known ZPC clones, (particularly the human ZPC clone). The clone contains an open reading frame of 672 bp which would encode a protein of 224 amino acids. The clone also contains stop codons
immediately 5 ' to the coding sequence in all three reading frames. The DNA sequence and the deduced amino acid sequence of the cynomolgus monkey ZPC clones are set out as sequence ID NOS 48 and 49 respectively.
Example 13 Comparison of ZPA DNA and Deduced Amino Acid Sequences
Table 5 shows a comparison of the DNA and deduced amino acid sequence of mammalian ZPAs.
TABLE 5 ZPA HOMOLOGY
PROTEIN HOMOLOG
DNA HOMOLOGY
Data is presented as a cross-wise comparison of the ZPA protein and DNA sequences. The comparison of the protein sequences are shown in the upper right hand side of the table, above the diagonal dashed lines. The comparison of the DNA sequences are shown in the lower left hand side of the table, below the diagonal dashed lines. The ZPA DNA and deduced amino acid sequences are highly homologous between species. The homology is highest between members of the same order within the class mammalia. For example, the human and cynomolgus monkey (primata), the pig and cow (ungulatά), and the cat and dog (carnivora) sequences have the most similarity. The high degree of homology between the ZPA genes, as well as between the ZPB (see Example 14) and ZPC (Example 15) genes from a variety of mammalian species, implies a great deal of structural similarity in the ZP layers of these species. However, post-translational modification differences such as glycosylation and others, could represent a potential source of variation.
One protein processing site that all of these ZPA proteins have in common is a furin cleavage site (R-X-R/K-R; Hosaka et al. J. Biol. Chem, 266:12127 (1991)) near the C-terminal end of the protein. In fact, with only a few exceptions, all ZP proteins contain a furin processing site near the C- terminus This furin site could serve to cleave off a putative membrane anchor sequence which would allow the processed proteins to move toward the outer edge of the growing ZP layer.
The human ZPA gene contains an exon near the 3' end that is present in the cynomolgus monkey ZPA sequence, but not present in the ZPA genes from other species. This extra exon codes for an amino acid sequence that occurs after the furin processing site, which suggests that the C-terminal fragment generated by furin cleavage might still be important to the function of the ZP layer or to the oocyte in some way.
There are 20 conserved cysteine residues and one or two non- conserved cysteine residues in each of the full-length ZPA sequences. The
non-conserved cysteine residues occur either in the N-terminal leader sequence region, or in the extreme C-terminal region of the sequence, where a large amount of the variation between the ZPA sequences occurs. The high degree of homology and the large number of conserved cysteine residues suggests that the tertiary structures of the ZPA proteins are similar.
It has been noted previously that there are regions of homology between the ZPA and ZPB class proteins (Schwoebel et al. J. Biol. Chem. , 266:7214 (1991); Lee et al. J. Biol Chem, 268: 12412 (1993); Yurewicz et al Biochem. Biophys. Acta 1174:211 (1993)). Comparison of the human ZPA genomic structure with the human ZPB genomic structure shows these regions to be confined to exons 12, 13, and 14 of the human ZPA gene and exons 5, 6, and 7 of the human ZPB gene. This suggests that this homology might be due to a partial ancestral gene duplication. The ZPB proteins contain 21 conserved cysteine residues. The first 11 of these do not align with those in the ZPA proteins, but the last 10 match well. This extends the homology to approximately 270 amino acids, covering exons 11-16 of the ZPA gene and exons 4-9 of the ZPB gene, although the overall homology of the expanded region is slightly lower (approximately 43%). The remainder of the ZPA and ZPB genes show very little homology with each other, and the ZPC genes also show no extensive homology to the ZPA genes. In addition, the ZPA gene has no extensive sequence similarity to non-ZP nucleic acid and protein sequences in Genbank and the SwissProt data banks.
Example 14 Comparison of ZPB DNA and of Deduced Amino Acid Sequences
Table 6 shows the comparison of the six known ZPB DNA and protein sequences (the bovine and cynomolgus cDNA fragments are only compared to the corresponding regions of the other full-length ZPB sequences).
TABLE 6
ZPB HOMOLOGY
PROTEIN HOMOLOGY
DNA HOMOLOGY
The data are presented as cross-wise comparison of the ZPB protein and DNA sequences. The comparison of the protein sequences are shown in the upper right hand side of the table, above the diagonal dashed lines. The comparison of the DNA sequences are shown in the lower left hand side of the table, below the diagonal dashed lines.
The data shows considerable ZPB homology among members of different mammalian species. As was the case with ZPA, this homology is most pronounced between members of the same order within the class mammalia. For example, the human and cynomolgus monkey sequences (primata) and the pig and cow sequences (ungulata) have the most homology to each other. With only a few exceptions (noted below), the ZPB sequences show no homology to other DNA or protein sequences in the GenBank or SwissProt databases. Hybridization experiments suggest that the ZPB transcripts are ovary specific. Comparisons of the deduced amino acid sequences of the ZPB clones show more divergence within this genetic group than within the ZPA and ZPC groups. Comparison of the rabbit ZPB and porcine ZPB shows the sequences to be predominantly collinear (74% homologous) except that the rabbit has an additional upstream ATG codon which adds six codons to the rabbit sequence.
The feline ZPB sequence has two additional amino acid inserts, which total 38 additional codons, in the first quarter of the gene, compared to the porcine and rabbit sequences. Both inserts occur just after cysteine residues, which suggests that if the cysteines are involved in disulfide bridges, these regions might form unique epitopes. However, the feline gene is still 73% homologous to porcine gene and 70% homologous to the rabbit gene.
The human gene has a sequence homologous to the first of the inserts in the cat sequence, but not the second. However, there are consensus splice site donor and acceptor sequences adjacent to this extra region in the human sequence, which if used would leave the coding sequence in frame.
Therefore, the sequence representing exon 2 could actually be two small exons (122 and 103 bp) separated by a small intron (84 bp). This would make the human sequence in this region identical to the pig sequence. The first extra region in the cat sequence is also flanked by in frame splice site donor and acceptor signals. If the extra region was removed from the cat sequence, it would differ from the pig sequence by only a single amino acid. However, the cat sequence was obtained from a cDNA clone made from an mRNA that appears to be fully processed. The second extra region in the cat sequence does not contain in frame splice site donor or acceptor signals, and therefore is probably not due to the presence of an unprocessed intron.
The cynomolgus monkey and human sequences have an additional seven codons at the C-terminus when compared to the other ZPB sequences. In the cynomolgus monkey, this is due to a two-base pair deletion, which causes a frameshift mutation which puts the termination codon used by the other species out of frame. The human sequence also contains this deletion, but in addition, there is also a base change that eliminates this termination codon.
There are 21 conserved cysteine residues in the ZPB proteins, the final 10 of which occur in a region that has homology to the ZPA proteins. This homology was noted previously (Schwoebel et al. , supra; Lee et al. supra, 1993; Yurewicz et al. supra, 1993), but examination of the genomic structure of the human ZPA and ZPB genes allowed the homology to be extended to approximately 270 amino acids. This homology could be due to a partial ancestral gene duplication. In addition to the conserved cysteine residues, the pig ZPB protein contains one additional cysteine residue in the putative leader sequence, and the human sequence contains four additional cysteine residues. The first of these is in the putative leader sequence (in a different location than pig), the second is in the region containing the additional insert, and the last two are in the C-terminal
extension caused by the mutated termination codon. These last two extra cysteine residues are conserved in the cynomolgus monkey sequence.
All of the ZP proteins contain a putative transmembrane domain near the C-terminus. However, the canonical furin proteolytic processing signal (R-X-R/K-R, Hosaka et al. supra, 1991), which occurs just prior to the transmembrane domain in all of the ZPA and ZPC proteins, is altered in the human (S-R-R-R), cynomolgus monkey (S-R-R-N) and rabbit (S-R-R-R) ZPB sequences. The significance of this is unknown, but it may indicate that these proteins are processed by a related system with specificity for di- or tribasic sequences, since the release of the putative transmembrane domain would be necessary for the ZPB protein to move as the ZP layer grows. There appears to be a great deal of proteolytic processing of the pig ZPA and ZPB (Yurewicz et al. supra,) proteins. There is no data concerning the post- translational modification of the ZPB proteins of cat, cow, cynomolgus monkey or human. The physiologic significance of this processing is unknown, but differential processing would present an avenue of variation among species of the highly conserved ZP proteins.
There is a question of whether humans actually transcribe the ZPB gene. Since the amount of human ovarian mRNA recovered was so small, there was not enough RNA to both construct a cDNA library and perform a Northern analysis. However, since cynomolgus monkey transcribes the ZPB gene, it is probable that the highly homologous human ZPB gene is also transcribed.
The apparent lack of a ZPB cDNA in the dog cDNA library is another puzzle. All of the libraries screened which contained any zona pellucida gene contained all three genes, except the dog. However, mRNA isolated from the ovary of a six-month old dog (the library was made from the ovary of a four-month old dog), includes a ZPB mRNA that comigrates with the porcine and cynomolgus monkey ZPB mRNA on a Northern blot. One possibility to explain the lack of a canine ZPB cDNA is that the transcriptional
timing of the three ZP genes is spread out, and since the ovary used to make the library was young, the transcription of the ZPB gene occurs later than the ZPA and ZPC genes (Andersen and Simpson, 1973).
Example 15 Comparison of ZPC DNA and Deduced Amino Acid Sequences
Table 7 shows the comparison of the DNA and deduced amino acid sequences from all of the ZPC cDNAs and genes.
TABLE 7
ZPC HOMOLOGY
PROTEIN HOMOLOGY
DNA HOMOLOGY
The data are presented as a cross-wise comparison of the ZPC protein and DNA sequences. The comparison of the protein sequences are shown in the upper right hand side of the table, above the diagonal dashed lines. The comparison of the DNA sequences are shown in the lower left hand side of the table, below the diagonal dashed lines.
ZPC proteins and DNA sequences show a higher degree of homology than the ZPA and ZPB DNAs and proteins. As was the case with ZPA and ZPB, the homology is most pronounced in members of the same order within the class mammalia; the human and cynomolgus monkey sequences (primata), the cat and dog sequences (carnivora), the pig and cow sequences (ungulata), and the mouse and hamster sequences (rodenta). The ZPC transcripts are ovary specific, based on Northern blot analysis and comparison to the sequences in the GenBank and SwissProt databases detects no significant non-ZP homology. Comparison of the deduced amino acid sequences of the known ZPC genes detects three regions that contain large numbers of non-consensus sequences. These regions are: the putative leader sequences and the first 20-25 amino acids of the mature protein; the region containing the peptide that was identified as a sperm-binding region in the mouse (Millar et al. Science 216:935-938 (1989)); and the C-terminal region of the proteins that might be removed from the mature protein at the furin processing site (see below).
The epitope identified as a putative sperm-binding site (Millar et al. supra, 1989) occurs immediately before a furin proteolytic cleavage site (Hosaka et al. , 1991). The furin site (R-X-R/K-R) is highly conserved in all of the ZPC sequences. However, it should be noted that the canine ZPC sequence contains a second furin site, 19 amino acids upstream from the first furin site. Also as is the case with ZPA and ZPB, cleavage by furin of the ZPC proteins would remove a putative membrane anchor sequence (Klein et al, 1985), which would allow the processed ZPC protein to move toward the outer layer of the expanding oocyte. Therefore, this sperm-binding site
probably represents the C-terminus of the mature proteins. However, there is very little homology (even between hamster and mouse) in the regions of the ZPC proteins corresponding to this epitope. This might indicate that this region contributes to the species specificity of sperm-egg binding. The variation that is seen at the C-terminus of the ZPC proteins occurs in the putative transmembrane region. This variation could indicate that this amino acid sequence is less important than the overall hydrophobicity of the amino acids in this region, similar to the lack of homology seen in leader sequences. However, it is also possible that this variation signifies a species-specific function for this region.
Each ZPC sequence contains 14 conserved cysteine residues, but each sequence also has one or two extra cysteine residues that are shared only with one or a few other sequences. These extra cysteine residues are near the N- or C-terminus of the proteins, where the greatest sequence variation exists. However, the large number of conserved cysteine residues probably indicates that the overall structure of the central core of all of these proteins is quite conserved.
Example 16 Immunization of Cynomolgus Monkeys With HSPZ
A sexually mature cynomolgus monkey was immunized with
HSPZ to test the ability of HSPZ to induce infertility. HSPZ was prepared as described in Example 6. HSPZ was mixed with the following GMDP/oil adjuvant. 50μgGMDP(N-acetyl-D-glucosaminyl-(/31-4)-N-acetylmuramyl-D- isoglutamine) (CC. Biotech, Poway, CA); 42.1 of mineral oil, 15.8% pluronic VC-121 (block polymer polyols, BASF-Wyandotte, Parsippany, NJ). The animal received a series of 4 subcutaneous injections of 1 mg of HSPZ in the GMDP/oil adjuvant beginning with a priming dose followed four weeks later by a booster dose, which was followed by two booster doses five weeks apart
which were followed six weeks later by a final dose. This dosage regimen resulted in an anovulatory monkey having antibody titers against its cynomolgus monkey heat-solubilized zona pellucida prepared as described for HSPZ. The peak antibody titers to cynomolgus monkey HSPZ were 1:8000- 1:16,000.
A fractionated preparation of HSPZ which is essentially native porcine ZPA and ZPB was prepared by isoelectric focusing, as described in Example 6 and was used to vaccinate cynomolgus monkeys using 1 mg of fractionated HSPZ in GMDP/oil injected subcutaneously according to the following schedule: a priming dose was given followed approximately 6 weeks later by a booster dose followed by a final booster dose 11 weeks after the previous booster dose. The immunized monkeys achieved peak antibody titers of 1:4,000-1:8,000 against monkey heat-solubilized zona pellucida while maintaining a regular ovulatory cycle. However, despite maintaining a regular ovulatory cycle, the monkeys remained infertile until their antibody titers to monkey heat-solubilized zona pellucida fell below 1:500 after which the animals became pregnant upon breeding.
Immunization of cynomolgus monkeys with recombinant baculovirus produced canine ZPC and porcine ZPC (prepared as described in Example 18) failed to induce infertility despite inducing antibody production against monkey heat-solubilized zona pellucida. One possible explanation for this is that the glycosylation pattern of ZP proteins produced in the baculovirus system may prevent recognition of the epitopes responsible for induction of infertility. Bacterially produced porcine ZPA, ZPB, and ZPC described above administered to cynomolgus monkeys failed to induce detectable antibody titers against cynomolgus monkey heat-solubilized zona pellucida even though antibody titers against the presented antigens were produced.
Example 17 Mapping of Mammalian Zona Pellucida Protein Epitopes
A Pin Technology™ Epitope Scanning Kit purchased from
Chiron Mimotopes U.S., Emeryville, CA (Catalog No. PT-02-20000A) was used for mapping epitopes in Zona Pellucida proteins. The procedures described in the kit manual were followed, with the exception of modifications in the ELISA testing procedure (described below).
Briefly, Pin Technology software was installed in a United Business Machines 486/33 computer according to the manufacturer's instructions. The protein sequence was entered into the computer program, the desired peptide length, and degree of overlap between peptides were selected, and a protocol containing the daily requirements of activated protected amino acid derivatives and their location in the coupling tray wells was printed. Prior to use, the pins were first washed once with dimethylformamide (DMF), and then with methanol three times, each wash lasting for two minutes. The pin block was air dried and the pins were deprotected by agitation in a 20% mixture of piperidine in DMF at room temperature for 30 minutes. The pins were washed again as described above, except that the washes were for 5 minutes each, and the pin block was then air dried. The required amino acid derivative solutions were prepared and dispensed into the wells of the synthesis tray according to the protocol for the current cycle. The dried mimotope pins were washed once more in a DMF bath for 5 minutes and then positioned appropriately in the wells of the synthesis tray. The assembly was then sealed in a plastic bag and incubated at 30°C for approximately 22 hours. On the following day, the pin block was removed from the coupling tray and subjected to the same cycle of washing, deprotection, and coupling steps as outlined above; however, using the amino acid derivatives and their tray location appropriate to the next cycle. The
foregoing cycle of washing, deprotection, washing, and coupling was repeated until the peptide sequences were completed.
After coupling the terminal amino acids of the peptides, the pin block was washed, air dried, deprotected, washed and air dried as before. The terminal amino groups of the peptides were then acetylated by immersion of the pins in a mixture containing 5 parts DMF, 2 parts acetic anhydride, and 1 part triethylamine, by volume, dispensed in the wells of a polypropylene coupling tray, and incubating at 30°C for 90 minutes. The pin block was removed, subjected to another washing sequence as before, and air dried. Side chain deprotection of the peptides was performed by agitating the pin block in a mixture containing 95 parts trifluoroacetic acid, 2.5 parts anisole, and 2.5 parts ethanedithiol, by volume, at room temperature for 4 hours. The pin block was then air dried for approximately 10 minutes, sonicated in a bath containing 0.1 % hydrochloric acid in a mixture containing equal parts of methanol and deionized water, by volume, for 15 minutes, and finally air dried.
Prior to ELISA testing, the pins were subjected to a disruption procedure involving sonication in a bath consisting of a mixture containing 39 parts sodium dihydrogen orthophosphate, 25 parts sodium dodecyl sulfate, 0.1 part 2-mercaptoethanol, and 2500 parts deionized water, by weight, adjusted to pH 7.2 with 50% sodium hydroxide solution. The sonication was performed at 55 to 60°C for approximately 45 minutes. The pin block was then washed by immersion with gentle agitation in three sequential baths of deionized water at 60 degrees for three minutes each. Finally, the pin block was immersed in gently boiling methanol for approximately 4 minutes and then air dried.
Preparation of Antisera
Antisera directed against zona pellucida proteins was prepared by immunizing the appropriate animals with the appropriate zona pellucida
protein using procedures well known in the art and described in E. Harlow and D. Lane in Antibodies, A Laboratory Manual, Chapter 5, Cold Spring Harbor Laboratory, 1988 which is incorporated herein by reference. Biotinylated antisera was prepared by a modification of the procedure described in Harlow supra (page 314). Briefly, to a solution containing between 1 and 3 mg per ml of the selected antibody IgG fraction in phosphate buffer with saline (PBS) at pH 7.2 was added a solution containing 25 to 250 micrograms biotinamidocaproate, N-hydroxysuccinimide ester (Sigma, Cat No. B2643) in dimethyl sulfoxide at a concentration of 10 mg/ml. The mixture was mixed well and then incubated at room temperature for 4 hours. One molar ammonium chloride solution in the amount corresponding to 20 microliters per 250 micrograms biotin ester was added, and the resulting mixture was incubated at room temperature for 10 minutes. Unreacted biotin ester was then removed by extensive diafiltration with PBS using a Centricon- 30 (TM) microconcentrator devices (Amicon Division, W.R. Grace & Co., Inc., Beverly MA). The dilution factor for the resulting conjugate was determined by ELISA titration against the appropriate native protein.
ELISA Testing
A modification of the procedure described in the Epitope Scanning Kit manual was employed.
After disruption, the mimotopepins were blocked by incubation with "supercocktail" (10 g ovalbumin, 10 g bovine serum albumin, and 1 ml Tween 20 detergent per liter of PBS) at room temperature for 1 hour. This was followed by incubation at room temperature for 2 hours with appropriately diluted biotinylated antisera. The pins were washed 4 times with PBS containing 0.5% Tween 20 (PBST) at room temperature for 10 minutes each time, with agitation.
The pins were then incubated at room temperature for 1 hour with the secondary antibody, horseradish peroxidase-streptavidin conjugate
(Zymed Laboratories, Inc., South San Francisco, CA) diluted 1:2500 with PBST. They were washed again as described above.
Substrate buffer was prepared by combining 200 ml 1.0 M. disodium hydrogen orthophosphate solution with 160 ml 1.0 M. citric acid solution, diluting the mixture with 1640 ml deionized water, and adjusting to pH 4.0 using either citric acid or sodium hydroxide solutions. Substrate solution was prepared by dissolving 10 mg 2,2'-azino-bis(3- ethylbenzthiazoline-6-sulfonic acid) diammonium salt in 20 ml substrate buffer and adding 6 microliters 30% hydrogen peroxide. The mimotope pins were incubated at room temperature with this solution, using microtiter plates containing 150 microliters per well. When color development appeared to be appropriate for measurement by an ELISA plate reader, the pin block was removed and the plate was read at a wavelength of 450 nm. The pin block was then disrupted by the procedure described above. The data were entered into the Pin Technology™ computer program, which performed statistical analysis and evaluation and furnished a print-out of the results identifying the strongest binding epitopes. Briefly, the 25% of the wells having the lowest optical density readings were assumed to represent background in each experiment. The mean value and the standard deviation of these readings were calculated. Significant recognition of peptides by antisera was attributed to the pins corresponding to those wells showing absorbance readings greater than the sum of the background mean and three standard deviations from the mean.
Human ZPA epitopes were examined for reactivity with mouse anti-human ZP antiserum prepared as described above. Peptides of 15 amino acids in length were synthesized beginning with amino acid number 1 as illustrated in SEQ ID NO. 43. Successive peptides having a 7-amino acid overlap with the preceding peptide of the series were synthesized. The following peptides were shown to bind mouse anti-human ZP antiserum: 1- 15, 9-23, 25-39, 33-47, 65-79, 81-95, 89-103, 97-111, 105-119, 113-127,
121-135, 129-143, 145-159, 153-167, 161-175, 193-207, 209-223, 217-231, 225-239, 241-255, 249-263, 273-287, 281-295, 289-303, 305-319, 313-327, 321-335, 329-343, 337-351, 345-359, 385-399, 393-407, 401-415, 409-423, 417-431, 425-439, 441-455, 449-463, 457-471, 481-495, 489-503, 497-511, 505-519, 513-527, 521-535, 537-551, 545-559, 561-575, 569-583, 577-591, 585-599, 601-615, 609-623, 617-631, 625-639, 633-647, 641-655, 665-679, 697-711, 705-719, 713-727, 721-735, and 729-743.
Similarly, human ZPB epitopes were mapped using mouse anti- human ZP antiserum. In these experiments, 15 amino acid peptides were synthesized beginning with amino acid number 1 as set out in SEQ ID NO. 41. The overlap between successive peptides in this case was 9 amino acids. The following peptides were shown to bind mouse anti-human ZP antiserum: 7-21, 25-39, 31-45, 49-63, 67-81, 73-87, 79-93, 91-105, 103-117, 121-135, 193-207, 205-219, 211-225, 217-231, 223-237, 229-243, 253-267, 259-273, 265-279, 283-297, 289-303, 295-309, 301-315, 307-321, 313-327, 319-333, 343-357, 349-363, 355-369, 367-381, 373-387, 379-393, 385-399, 403-417, 409-423, 415-429, 421-435, 433-447, 439-453, 445-459, 451-465, 481-495, 487-501, 499-513, 505-519, 511-525, 523-537, 529-543, and 547-561.
Human ZPC epitopes were mapped using mouse anti-human ZP antiserum. In these experiments, the 15 amino acid peptides were synthesized beginning with amino acid number 1 as set out in Chamberlin et al. , Proc. Nat'l Acad. Sci. USA 87:6014-6018 (1990) which is incorporated herein by reference. The overlap between successive peptides was 10 amino acids. The following peptides were shown to bind mouse anti-human ZP antiserum: 21- 35, 51-65, 116-130, 146-160, 151-165, 181-195, 241-255, 251-265, 271-285, 296-310, 321-335, 401-415, and 411-425.
Canine ZPC epitopes were mapped using rabbit anti-canine ZP antiserum. In these experiments, the 15 amino acid peptides were synthesized beginning at amino acid number 1 set out in SEQ ID NO. 10. The overlap between successive peptides was 5 amino acids. The following peptides were
shown to bind rabbit anti-canine ZP antiserum: 51-65, 61-75, 81-95, DI¬ MS, 181-195, and 301-315.
Feline ZPC epitopes were mapped using rabbit anti-feline ZP antiserum. In these experiments, the 15 amino acid peptides were synthesized beginning at amino acid number 1 as set out in SEQ ID NO. 18. The overlap between successive peptides was 5 amino acids. The following peptides were shown to bind rabbit anti-feline ZP: 36-50, 46-60, 56-70, 76-90, 96-110, 106-120, 116-130, 126-140, 136-150, 146-160, 156-170, 186-200, 196-210, 246-260, 266-280, 276-290, 286-300, 296-310, 316-330, 326-340, 336-350, 346-360, 376-390, 396-410, and 406-420.
Bovine ZPC epitopes were mapped using rabbit anti-bovine ZP antiserum. In these experiments, the overlapping 15 amino acid peptides were synthesized beginning at amino acid number 1 as set out in SEQ ID NO. 24. The overlap between peptides was 10 amino acids. The following peptides were shown to be reactive with rabbit anti-bovine ZP antiserum: 1-15, 31-45, 51-65, 56-70, 61-75, 76-90, 106-120, 111-125, 116-130, 121-135, 131-145, 136-150, 141-155, 146-160, 151-165, 161-175, 181-195, 186-200, 191-205, 196-210, 201-215, 206-220, 216-230, 226-240, 241-255, 246-260, 261-275, 266-280, 271-285, 276-290, 291-305, 296-310, 301-315, 316-330, 321-335, 326-340, 331-345, 336-350, 341-355, 356-370, 361-375, 376-390, 381-395, 386-400, 396-410, 401-415, and 406-420.
Example 18 Immunization of Dogs with Recombinant ZPC Proteins
Dogs were immunized with various preparations of recombinant canine ZPC. The plasmid pZ169 bacterial expression vector (Figure 5) was constructed as follows. The parent vector pZ98 (described in Example 9) was digested with the restriction enzymes Pvul and Bam HI, and the large
fragment was gel purified. Into this vector was ligated a fragment created by annealing the following oligonucleotides:
5' CGCCCTTCCCAGCAACTGCACCATCACCACCATGGG 3' (SEQ ID NO.50); and 5' GATCCCCATGGTGGTGGTGATGGTGCAGTTGCTGGGAAGGGCGAT 3'
(SEQ ID NO.51).
These oligonucleotides create a fragment with Pvul and BamHI ends, and codes for the hexapeptide sequence His6. This intermediate vector was digested with the restriction enzymes BamHI and EcoRI, and the large fragment was gel purified. Into this vector was ligated a fragment created by annealing the following oligonucleotides:
5' GATCCCTCGAGCCACCATCACCACCATCATG 3' (SEQ ID NO.52); and
5' AATTCATGATGGTGGTGATGGTGGCTCGAGG 3' (SEQ ID NO.53).
These oligonucleotides create a fragment with BamHI and EcoRI ends and an Xhol site just downstream of the BamHI site, and which codes for the hexapeptide sequence His6. This new vector was named pZ88, and contains unique BamHI and Xhol cloning sites between two His6 sequences. To create pZ169, the pZ88 vector was digested with the restriction enzymes BamHI and Xhol, and the large fragment was gel purified. Into this vector was ligated a fragment generated by performing a PCR (polymerase chain reaction) of the canine ZPC cDNA using the following oligonucleotides:
5' CCCGGATCCGCAGACCATCTGGCCAACTGAG 3 (SEQ ID NO.54); and
5' GCGCTCGAGGGCATATGGCTGCCAGTGTG 3' (SEQ ID NO.55).
This PCR creates a fragment containing amino acids 23-207 of the canine ZPC sequence, with BamHI and Xhol ends. This new vector is named pZ169, (Figure 5) and produces a protein containing amino acids 1-56 of the E. coli jS-galactosidase sequence, His6, amino acids 23-207 of the canine ZPC sequence, His6, and amino acids 1006-1023 of the E. coli 0-galactosidase sequence. This protein is referred to as N-terminal canine ZPC. In Figure 5, pTAC refers to the tac promoter described above; AmpR refers to an ampicillin resistance marker, ori is an E. coli origin of replication sequences and pLacI is the lad promoter which drives expression of the lad gene.
Recombinant canine ZPC was produced and purified as described in Example 9. A baculovirus expression vector pZ145 was constructed as follows. The parent vector pBlueBac2 (purchased from Invitrogen Corporation, San Diego, CA) was digested with the restriction enzymes Nhel and BamHI, and the large fragment was gel purified. Into this vector was ligated a fragment generated by a PCR of the porcine ZPC cDNA using the following oligonucleotide:
5 ' CGCGCTAGCAGATCTATGGCGCCGAGCTGGAGGTTC 3 ' (SEQ ID NO. 56); and
5 ' CGCGGATCCTATTAATGGTGGTGATGGTGGTGACTAGTGGACCCTTCCA 3 " (SEQ ID NO. 57).
This PCR creates a fragment with Nhel and BamHI ends, and contains amino acids 27-350 of the porcine ZPC sequence followed by an Spel site and the hexapeptide His6. This new vector is named pZ147. To create the pZ145 vector, pZ147 is digested with Nhel and Spel and the large fragment is gel purified (this removes the pig ZPC sequence). Into this vector was ligated a
fragment generated by a PCR of the canine ZPC cDNA using the following oligonucleotides:
5' CCCGCTAGCAGATCTATGGGGCTGAGCTATGGAATTTTC 3 ' (SEQ ID NO. 58); and
5 ' CGCACTAGTTGACCCCTCTATACCATGATCACTA 3 ' (SEQ ID NO. 59).
This PCR creates a fragment with Nhel and Spel ends, and contains amino acids 1-379 of the canine sequence. Transformants of this ligation were screened for the presence of the inserted NhellSpel fragment in the correct orientation (since the Nhel and Spel sticky ends are identical). This new vector is named pZ145, (Figure 6) and produces a protein containing amino acids 1-379 of the DZPC sequence followed by His6. This protein is referred to as baculo-canine ZPC. In Figure 6, pP represents the baculovirus polyhedrin promoter, AmpR represents an ampicillin resistance marker, LacZ represents the gene for /3-galactosidase, pE is a constituitive promoter which drives the expression of LacZ and ori is the E. coli origin of replication.
Recombinant baculovirus derived canine ZPC was produced by co-transfecting insect SF9 cells with pZ145 and Autographica californica multiply enveloped nuclear polyhedrosis virus (AcMNPV) using methods well known in the art as described in the MAXBAC™ kit purchased from Invitrogen, San Diego, CA. Recombinant canine ZPC produced in SF9 cells was prepared from cotransfected SF9 cells as follows. Cotransfected cells were harvested and pelleted by centrifugation and recombinant canine ZPC was purified as was described in Example 9 for purification from a cell pellet. Recombinant canine ZPC may also be isolated from the culture medium and purified on a Ni-column as described in Example 9.
Other expression vectors which are capable of expressing zona pellucida encoding nucleotide sequences under the control of a variety of
regulatory sequences are within the scope of the present invention and are readily constructed using methods well known in the art.
Recombinant zona pellucida proteins may also be modified to increase their potential antigenicity by a variety of methods well known in the art. For example, a recombinant dog ZPC was modified by palmitylation was prepared as follows. Approximately 1 mg of recombinant ZPC produced using the plasmid pZ169 as described above was brought to a final concentration of 8M urea (total volume 0.2-0.3 mis.). A palmitylation solution (Pl2O/TEA) was then prepared by adding palmitic anhydride to triethylamine to give a final concentration of palmitic anhydride of 20 mg/ml of triethylamine.
Approximately 10 μl of Pl2O/TEA solution was added to 1 mg of recombinant canine ZPC in 8M urea (described above). The mixture was allowed to stand at room temperature for a least two hours after which the preparation was ready for mixture with GMDP/oil adjuvant.
Chitosan modification is another useful modification of canine ZPC for the practice of the present invention. Briefly, 1.5 ml of sterile mineral oil was added to 1.5 ml of recombinant canine ZPC solution prepared as described above using the plasmid pZ169 (2 mg/ml ZPC, 3 mg total is 8M urea) was mixed with 5 drops of Arlacel A (mannide monooleate, Sigma, St, Louis, MO). Subsequently, 0.75 ml of Chitosan (2% wt/vol. is 0.5M sodium acetate, pH 5.0) was added, and the mixture was sonicated for 10-20 seconds, followed by the addition of 0.045 ml of 50% NaOH and another round of sonication for 10-20 seconds. Finally, lOμl of 10 mg/ml GMDP/8M urea was added.
A group of three dogs was immunized five times each at one-month intervals with subcutaneous injections of 1 mg doses of the N- terminal canine ZPC modified by the addition of chitosan prepared as described above. Immunized dogs developed antibody titers of 1:8000- 1: 16000 against heat solubilized dog zona pellucida (self-titers) using methods
described above. The estrus cycle of the dogs showing self-titers was anovulatory and prolonged (4-6 weeks instead of the normal 10-day to 14-day cycle for normal dogs). Of the three immunized dogs, two have experienced their first estrus; one of the two dogs exhibited estrus six months after the first immunization and was bred and found to be infertile. The second of the two dogs experienced estrus and remained infertile nine months after the first immunization. The third dog has yet to experience estrus more than nine months after immunization.
Another group of four dogs were immunized three times at one- month intervals using 1 mg doses of palmitylated canine ZPC (prepared as described above) in GMDP/oil adjuvant administered subcutaneously. These animals achieved self-titers (against heat solubilized dog zona pellucida) of 1:4000-1:8000. Nearly seven months after immunization, two of the four dogs experienced estrus and remain infertile. The remaining two dogs have yet to experience estrus.
Another set of dogs was immunized 3 times at one-month intervals, using subcutaneous injections of 1 mg of recombinant canine ZPC produced using pZ166, (a plasmid similar to pZ169 but containing a DNA sequence encoding amino acids 23-379 of the canine ZPC protein) in GMDP/oil adjuvant. These animals failed to develop self-titers and became pregnant after breeding. Similarly, dogs immunized with canine ZPC fragments produced using the baculovirus system failed to induce infertility.
Example 19 Vaccination of Cows and Cats with Recombinant Zona Pellucida Proteins
Preliminary studies were undertaken to assess the ability of recombinant zona pellucida proteins to induce infertility in cows and cats.
Cows were injected with 3 or more doses (in GMDP (250 μg) oil adjuvant) of 1 mg of a variety of recombinantly derived ZPC proteins from canine and porcine sources including canine ZPC produced using the plasmid pZ169 as shown in Figure 5. Recombinant proteins were administered in an unmodified form and in palmitylated and chitosan modified forms. None of the ZP protein preparations induced self-titers or infertility in the vaccinated cows. Further studies are underway using different recombinant preparations of zona pellucida proteins and differing dosage regimens in attempts to induce self-titers and infertility in cows. Similarly, cats were vaccinated with the following recombinant zona pellucida proteins: a mixture of recombinant feline ZPA, ZPB, and ZPC; porcine ZPC produced using pZ156 as described above and shown in Figure 3; and canine ZPC produced using the plasmid pZ169 described above and shown in Figure 5. Cats vaccinated using these ZP protein preparations produced antibody to the vaccine proteins, but produced no self-titers and were consequently fertile. Studies are ongoing to determine the effects of modifying the recombinant zona pellucida proteins in attempts to stimulate the production of self-titers and to induce infertility.
Studies are also ongoing to select other recombinantly derived zona pellucida protein fragments for testing as possible immunocontraceptives.
Numerous modifications in variations in the practice of the invention as illustrated in the above examples are expected to occur to those of ordinary skill in the art. Consequently, the illustrative examples are not intended to limit the scope of the invention as set out in the appended claims.
SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT:
(A) ADDRESSEE: ZONAGEN, Inc.
(B) STREET: 2408 Timberloch Place, B-4
(C) CITY: The Woodlands
(D) STATE: Texas
(E) COUNTRY: United States of America
(F) POSTAL CODE: 77380
(A) ADDRESSEE: Harris Ph.D., Jeffrey D.
(B) STREET: 15 Flatstone
(C) CITY: The Woodlands
(D) STATE: Texas
(E) COUNTRY: United States of America
(F) POSTAL CODE: 77381
(A) ADDRESSEE: Hsu, Kuang T.
(B) STREET: 71 N. Misty Morning Trace
(C) CITY: The Woodlands
(D) STATE: Texas
(E) COUNTRY: United States of America
(F) POSTAL CODE: 77381
(A) ADDRESSEE: Podolski, Joseph S.
(B) STREET: 3 Pebble Hollow Court
(C) CITY: The Woodlands
(D) STATE: Texas
(E) COUNTRY: United States of America
(F) POSTAL CODE: 77381
(ii) TITLE OF INVENTION: Materials and Methods for Immunocontraception (iii) NUMBER OF SEQUENCES: 59
(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Marshall, O'Toole, Gerstein, Murray S Borun
(B) STREET: 6300 Sears Tower, 233 South Wacker Drive
(C) CITY: Chicago
(D) STATE: Illinois
(E) COUNTRY: United States of America
(F) POSTAL CODE: 60606-6402
(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: Patentln Release #1.0, Version #1.25
(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE: 09-NOV-1993
(C) CLASSIFICATION:
(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: 08/012,990
(B) FILING DATE: 29-JAN-1993
(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: 07/973,341
(B) FILING DATE: 09-NOV-1992
(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Clough, David W.
(B) REGISTRATION NUMBER: 36,107
(C) REFERENCE/DOCKET NUMBER: 31745
(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 312/474-6653
(B) TELEFAX: 312/474-0448
(C) TELEX: 25-3856
(2) INFORMATION FOR SEQ ID Nθ:l:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2214 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Sus scrofa
(D) DEVELOPMENTAL STAGE: Juvenile
(E) HAPLOTYPE: Diploidy
(F) TISSUE TYPE: Ovary
(G) CELL TYPE: Oocyte
(ix) FEATURE:
(A) NAME/KEY: sig_peptide
(B) LOCATION: 12..119 (ix) FEATURE:
(A) NAME/KEY: mat_peptide
(B) LOCATION: 120..2153
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 12..2153
(xi) SEQUENCE DESCRIPTION: SEQ ID Nθ:l:
GAATTCCGGG C AGG CAC AGA GGA GAC AGT GGG AGA CCC TTA AGC TGG CTC 50 Arg His Arg Gly Asp Ser Gly Arg Pro Leu Ser Trp Leu -36 -35 -30 -25
AGT GCA AGC TGG AGG TCA CTT CTT CTA TTT TTC CCC CTT GTG ACT TCA 98 Ser Ala Ser Trp Arg Ser Leu Leu Leu Phe Phe Pro Leu Val Thr Ser -20 -15 -10
GTG AAC TCC ATA GGT GTC AAT CAG TTG GTG AAT ACT GCC TTC CCA GGT 146 Val Asn Ser lie Gly Val Asn Gin Leu Val Asn Thr Ala Phe Pro Gly -5 1 5
ATT GTC ACT TGC CAT GAA AAT AGA ATG GTA GTG GAA TTT CCA AGA ATT 194 lie Val Thr Cys His Glu Asn Arg Met Val Val Glu Phe Pro Arg lie 10 15 20 25
CTT GGC ACT AAG ATA CAG TAC ACC TCT GTG GTG GAC CCT CTT GGT CTT 242 Leu Gly Thr Lys lie Gin Tyr Thr Ser Val Val Asp Pro Leu Gly Leu 30 35 40
GAA ATG ATG AAC TGT ACT TAT GTT CTG GAC CCA GAA AAC CTC ACC CTG 290
Glu Met Met Asn Cys Thr Tyr Val Leu Asp Pro Glu Asn Leu Thr Leu 45 50 55
AAG GCC CCA TAT GAA GCC TGT ACC AAA AGA GTG CGT GGC CAT CAC CAA 338 Lys Ala Pro Tyr Glu Ala Cys Thr Lys Arg Val Arg Gly His His Gin 60 65 70
ATG ACC ATC AGA CTC ATA GAT GAC AAT GCT GCT TTA AGA CAA GAG GCT 386 Met Thr lie Arg Leu lie Asp Asp Asn Ala Ala Leu Arg Gin Glu Ala 75 80 85
CTC ATG TAT CAC ATC AGC TGT CCT GTT ATG GGA GCA GAA GGC CCT GAT 434 Leu Met Tyr His lie Ser Cys Pro Val Met Gly Ala Glu Gly Pro Asp 90 95 100 105
CAG CAT TCG GGA TCC ACA ATC TGC ATG AAA GAT TTC ATG TCT TTT ACC 482 Gin His Ser Gly Ser Thr lie Cys Met Lys Asp Phe Met Ser Phe Thr 110 115 120
TTT AAC TTT TTT CCC GGG ATG GCT GAC GAA AAT GTG AAA CGT GAG GAT 530 Phe Asn Phe Phe Pro Gly Met Ala Asp Glu Asn Val Lys Arg Glu Asp 125 130 135
TCG AAG CAG CGC ATG GGA TGG AGC CTT GTA GTT GGT GAC GGT GAA AGA 578 Ser Lys Gin Arg Met Gly Trp Ser Leu Val Val Gly Asp Gly Glu Arg 140 145 150
GCC CGA ACT CTG ACC TTT CAG GAG GCC ATG ACC CAA GGA TAT AAT TTC 626 Ala Arg Thr Leu Thr Phe Gin Glu Ala Met Thr Gin Gly Tyr Asn Phe 155 160 165
CTG ATA GAG AAC CAG AAG ATG AAC ATC CAA GTG TCA TTC CAT GCC ACT 674 Leu lie Glu Asn Gin Lys Met Asn He Gin Val Ser Phe His Ala Thr 170 175 180 185
GGA GTG ACT CGC TAC TCG CAA GGT AAC AGT CAT CTC TAC ATG GTA CCT 722 Gly Val Thr Arg Tyr Ser Gin Gly Asn Ser His Leu Tyr Met Val Pro 190 195 200
CTG AAG CTT AAA CAT GTA TCT CAT GGG CAG TCT CTC ATC TTA GCA TCA 770 Leu Lys Leu Lys His Val Ser His Gly Gin Ser Leu He Leu Ala Ser 205 210 215
CAA CTC ATC TGT GTG GCA GAT CCT GTG ACC TGT AAT GCC ACA CAC GTG 818 Gin Leu He Cys Val Ala Asp Pro Val Thr Cys Asn Ala Thr His Val 220 225 230
ACT CTT GCC ATA CCA GAG TTT CCT GGG AAG CTA AAA TCC GTG AAC TTG 866 Thr Leu Ala He Pro Glu Phe Pro Gly Lys Leu Lys Ser Val Asn Leu 235 240 245
GGA AGT GGG AAT ATT GCT GTG AGC CAG CTG CAC AAA CAC GGG ATT GAA 914 Gly Ser Gly Asn He Ala Val Ser Gin Leu His Lys His Gly He Glu 250 255 260 265
ATG GAA ACA ACA AAC GGC CTG AGG TTG CAT TTC AAC CAA ACT CTT CTC 962 Met Glu Thr Thr Asn Gly Leu Arg Leu His Phe Asn Gin Thr Leu Leu 270 275 280
AAA ACA AAT GTC TCT GAA AAA TGC CTA CCA CAT CAG TTG TAC TTA TCT 1010 Lys Thr Asn Val Ser Glu Lys Cys Leu Pro His Gin Leu Tyr Leu Ser 285 290 295
TCA CTC AAG CTG ACT TTT CAC AGT CAA CTA GAG GCA GTA TCC ATG GTG 1058 Ser Leu Lys Leu Thr Phe His Ser Gin Leu Glu Ala Val Ser Met Val 300 305 310
ATT TAT CCT GAG TGT CTC TGT GAG TCA ACA GTC TCT TTA GTT TCA GAG 1106 He Tyr Pro Glu Cys Leu Cys Glu Ser Thr Val Ser Leu Val Ser Glu 315 320 325
GAG CTA TGC ACT CAG GAT GGG TTT ATG GAC GTC AAG GTC CAC AGC CAC 1154 Glu Leu Cys Thr Gin Asp Gly Phe Met Asp Val Lys Val His Ser His 330 335 340 345
CAA ACA AAA CCA GCT CTC AAC TTG GAT ACC CTC AGG GTG GGA GAC TCA 1202 Gin Thr Lys Pro Ala Leu Asn Leu Asp Thr Leu Arg Val Gly Asp Ser 350 355 360
TCC TGC CAG CCA ACC TTT AAA GCT CCA GCT CAG GGG CTG GTA CAG TTT 1250 Ser Cys Gin Pro Thr Phe Lys Ala Pro Ala Gin Gly Leu Val Gin Phe 365 370 375
CGC ATA CCC CTG AAT GGA TGT GGA ACA AGA CAT AAG TTC AAG AAT GAC 1298 Arg He Pro Leu Asn Gly Cys Gly Thr Arg His Lys Phe Lys Asn Asp 380 385 390
AAA GTC ATC TAT GAA AAT GAA ATA CAT GCT CTC TGG GCA GAT CCT CCA 1346 Lys Val He Tyr Glu Asn Glu He His Ala Leu Trp Ala Asp Pro Pro 395 400 405
AGC GCC GTT TCC AGA GAT AGT GAG TTC AGA ATG ACA GTG AGG TGC TCT 1394 Ser Ala Val Ser Arg Asp Ser Glu Phe Arg Met Thr Val Arg Cys Ser 410 415 420 425
TAC AGC AGC AGC AAC ATG CTA ATA AAT ACC AAT GTT GAA AGT CTT CCT 1442 Tyr Ser Ser Ser Asn Met Leu He Asn Thr Asn Val Glu Ser Leu Pro 430 435 440
TCT CCA GAG GCC TCA GTG AAG CCA GGT CCA CTT ACC CTG ACT CTG CAA 1490 Ser Pro Glu Ala Ser Val Lys Pro Gly Pro Leu Thr Leu Thr Leu Gin 445 450 455
ACC TAC CCA GAT AAC GCC TAC CTG CAG CCT TAT GGG GAC AAG GAG TAC 1538 Thr Tyr Pro Asp Asn Ala Tyr Leu Gin Pro Tyr Gly Asp Lys Glu Tyr 460 465 470
CCT GTG GTG AAA TAT CTC CGC CAA CCA ATT TAC CTA GAA GTG AGA ATC 1586 Pro Val Val Lys Tyr Leu Arg Gin Pro He Tyr Leu Glu Val Arg He 475 480 485
CTC AAC AGG ACT GAC CCC AAC ATC AAG CTG GTC TTG GAT GAC TGC TGG 1634 Leu Asn Arg Thr Asp Pro Asn He Lys Leu Val Leu Asp Asp Cys Trp 490 495 500 505
GCA ACA TCC ACA GAG GAC CCA GCC TCT CTC CCC CAG TGG AAT GTT GTC 1682 Ala Thr Ser Thr Glu Asp Pro Ala Ser Leu Pro Gin Trp Asn Val Val 510 515 520
ATG GAT GGC TGT GAA TAC AAC CTG GAC AAC CAC AGA ACC ACC TTC CAT 1730 Met Asp Gly Cys Glu Tyr Asn Leu Asp Asn His Arg Thr Thr Phe His 525 530 535
CCG GTG GGC TCC TCC GTG ACC TAT CCT AAC CAC CAT CAG AGG TTT GAT 1778 Pro Val Gly Ser Ser Val Thr Tyr Pro Asn His His Gin Arg Phe Asp 540 545 550
GTG AAG ACC TTT GCC TTT GTG TCA GGG GCC CAA GGG GTC TCT CAA CTG 1826 Val Lys Thr Phe Ala Phe Val Ser Gly Ala Gin Gly Val Ser Gin Leu 555 560 565
GTC TAC TTC CAC TGC AGT GTC TTC ATC TGC AAT CAA CTC TCT CCC ACC 1874 Val Tyr Phe His Cys Ser Val Phe He Cys Asn Gin Leu Ser Pro Thr 570 575 580 585
TTC TCT CTG TGT TCT GTG ACT TGC CAT GGG CCA TCT AGG AGC CGG CGA 1922 Phe Ser Leu Cys Ser Val Thr Cys His Gly Pro Ser Arg Ser Arg Arg 590 595 600
GCT ACA GGG ACC ACT GAG GAA GAG AAA ATG ATA GTG AGT CTC CCG GGC 1970 Ala Thr Gly Thr Thr Glu Glu Glu Lys Met He Val Ser Leu Pro Gly 605 610 615
CCC ATC CTG CTG TTG TCA GAT GGC TCT TCA CTC AGA GAT GCT GTG AAC 2018 Pro He Leu Leu Leu Ser Asp Gly Ser Ser Leu Arg Asp Ala Val Asn 620 625 630
TCT AAA GGA TCC AGA ACC AAC GGA TAT GTT GCT TTT AAA ACT ATG GTT 2066 Ser Lys Gly Ser Arg Thr Asn Gly Tyr Val Ala Phe Lys Thr Met Val 635 640 645
GCT ATG GTT GCT TCA GCA GGC ATC GTG GCA ACT CTA GGC CTC ATC AGC 2114 Ala Met Val Ala Ser Ala Gly He Val Ala Thr Leu Gly Leu He Ser 650 655 660 665
TAC CTG CAC AAA AAA AGA ATC ATG ATG TTA AAT CAC TAATTTGGAT 2160
Tyr Leu His Lys Lys Arg He Met Met Leu Asn His 670 675
TTTCAAATAA AAGTGGAAGT AAGCCTCTTC TAAAAAAAAA AAAAACCGGA ATTC 2214
(2) INFORMATION FOR SEQ ID NO:2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 713 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:
Arg His Arg Gly Asp Ser Gly Arg Pro Leu Ser Trp Leu Ser Ala Ser -36 -35 -30 -25
Trp Arg Ser Leu Leu Leu Phe Phe Pro Leu Val Thr Ser Val Asn Ser -20 -15 -10 -5
He Gly Val Asn Gin Leu Val Asn Thr Ala Phe Pro Gly He Val Thr 1 5 10
Cys His Glu Asn Arg Met Val Val Glu Phe Pro Arg He Leu Gly Thr 15 20 25
Lys He Gin Tyr Thr Ser Val Val Asp Pro Leu Gly Leu Glu Met Met 30 35 40
Asn Cys Thr Tyr Val Leu Asp Pro Glu Asn Leu Thr Leu Lys Ala Pro 45 50 55 60
Tyr Glu Ala Cys Thr Lys Arg Val Arg Gly His His Gin Met Thr He 65 70 75
Arg Leu He Asp Asp Asn Ala Ala Leu Arg Gin Glu Ala Leu Met Tyr 80 85 90
His He Ser Cys Pro Val Met Gly Ala Glu Gly Pro Asp Gin His Ser 95 100 105
Gly Ser Thr He Cys Met Lys Asp Phe Met Ser Phe Thr Phe Asn Phe 110 115 120
Phe Pro Gly Met Ala Asp Glu Asn Val Lys Arg Glu Asp Ser Lys Gin 125 130 135 140
Arg Met Gly Trp Ser Leu Val Val Gly Asp Gly Glu Arg Ala Arg Thr 145 150 155
Leu Thr Phe Gin Glu Ala Met Thr Gin Gly Tyr Asn Phe Leu He Glu 160 165 170
Asn Gin Lys Met Asn He Gin Val Ser Phe His Ala Thr Gly Val Thr 175 180 185
Arg Tyr Ser Gin Gly Asn Ser His Leu Tyr Met Val Pro Leu Lys Leu 190 195 200
Lys His Val Ser His Gly Gin Ser Leu He Leu Ala Ser Gin Leu He 205 210 215 220
Cys Val Ala Asp Pro Val Thr Cys Asn Ala Thr His Val Thr Leu Ala 225 230 235
He Pro Glu Phe Pro Gly Lys Leu Lys Ser Val Asn Leu Gly Ser Gly 240 245 250
Asn He Ala Val Ser Gin Leu His Lys His Gly He Glu Met Glu Thr 255 260 265
Thr Asn Gly Leu Arg Leu His Phe Asn Gin Thr Leu Leu Lys Thr Asn 270 275 280
Val Ser Glu Lys Cys Leu Pro His Gin Leu Tyr Leu Ser Ser Leu Lys 285 290 295 300
Leu Thr Phe His Ser Gin Leu Glu Ala Val Ser Met Val He Tyr Pro 305 310 315
Glu Cys Leu Cys Glu Ser Thr Val Ser Leu Val Ser Glu Glu Leu Cys 320 325 330
Thr Gin Asp Gly Phe Met Asp Val Lys Val His Ser His Gin Thr Lys 335 340 345
Pro Ala Leu Asn Leu Asp Thr Leu Arg Val Gly Asp Ser Ser Cys Gin 350 355 360
Pro Thr Phe Lys Ala Pro Ala Gin Gly Leu Val Gin Phe Arg He Pro 365 370 375 380
Leu Asn Gly Cys Gly Thr Arg His Lys Phe Lys Asn Asp Lys Val He 385 390 395
Tyr Glu Asn Glu He His Ala Leu Trp Ala Asp Pro Pro Ser Ala Val 400 405 410
Ser Arg Asp Ser Glu Phe Arg Met Thr Val Arg Cys Ser Tyr Ser Ser 415 420 425
Ser Asn Met Leu He Asn Thr Asn Val Glu Ser Leu Pro Ser Pro Glu 430 435 440
Ala Ser Val Lys Pro Gly Pro Leu Thr Leu Thr Leu Gin Thr Tyr Pro 445 450 455 460
Asp Asn Ala Tyr Leu Gin Pro Tyr Gly Asp Lys Glu Tyr Pro Val Val 465 470 475
Lys Tyr Leu Arg Gin Pro He Tyr Leu Glu Val Arg He Leu Asn Arg
480 485 490
Thr Asp Pro Asn He Lys Leu Val Leu Asp Asp Cys Trp Ala Thr Ser 495 500 505
Thr Glu Asp Pro Ala Ser Leu Pro Gin Trp Asn Val Val Met Asp Gly 510 515 520
Cys Glu Tyr Asn Leu Asp Asn His Arg Thr Thr Phe His Pro Val Gly 525 530 535 540
Ser Ser Val Thr Tyr Pro Asn His His Gin Arg Phe Asp Val Lys Thr 545 550 555
Phe Ala Phe Val Ser Gly Ala Gin Gly Val Ser Gin Leu Val Tyr Phe 560 565 570
His Cys Ser Val Phe He Cys Asn Gin Leu Ser Pro Thr Phe Ser Leu 575 580 585
Cys Ser Val Thr Cys His Gly Pro Ser Arg Ser Arg Arg Ala Thr Gly 590 595 600
Thr Thr Glu Glu Glu Lys Met He Val Ser Leu Pro Gly Pro He Leu 605 610 615 620
Leu Leu Ser Asp Gly Ser Ser Leu Arg Asp Ala Val Asn Ser Lys Gly 625 630 635
Ser Arg Thr Asn Gly Tyr Val Ala Phe Lys Thr Met Val Ala Met Val 640 645 650
Ala Ser Ala Gly He Val Ala Thr Leu Gly Leu He Ser Tyr Leu His 655 660 665
Lys Lys Arg He Met Met Leu Asn His 670 675
(2) INFORMATION FOR SEQ ID NO:3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1699 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Sus scrofa
(D) DEVELOPMENTAL STAGE: Juvenile
(E) HAPLOTYPE: Diploidy
(F) TISSUE TYPE: Ovary
(G) CELL TYPE: Oocyte
(ix) FEATURE:
(A) NAME/KEY: sig_peptide
(B) LOCATION: 38.-445 (ix) FEATURE:
(A) NAME/KEY: mat_peptide
(B) LOCATION: 446..1648 (ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 38..1648
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:
GAATTCCGGG TGGAAGTACC TGTTCTCCGC AGGCGCT ATG TGG TTG CGG CCG TCC 55
Met Trp Leu Arg Pro Ser -136-135
ATC TGG CTC TGC TTT CCG CTG TGT CTT GCT CTG CCA GGC CAG TCT CAG 103 He Trp Leu Cys Phe Pro Leu Cys Leu Ala Leu Pro Gly Gin Ser Gin -130 -125 -120 -115
CCC AAA GCA GCA GAT GAC CTT GGT GGC CTC TAC TGT GGG CCA AGC AGC 151 Pro Lys Ala Ala Asp Asp Leu Gly Gly Leu Tyr Cys Gly Pro Ser Ser -110 -105 -100
TTT CAT TTC TCC ATA AAT CTT CTC AGC CAG GAC ACA GCA ACT CCT CCT 199 Phe His Phe Ser He Asn Leu Leu Ser Gin Asp Thr Ala Thr Pro Pro -95 -90 -85
GCA CTG GTG GTT TGG GAC AGG CGC GGG CGG CTG CAC AAG CTG CAG AAT 247 Ala Leu Val Val Trp Asp Arg Arg Gly Arg Leu His Lys Leu Gin Asn -80 -75 -70
GAC TCT GGC TGT GGC ACG TGG GTC CAC AAG GGC CCA GGC AGC TCC ATG 295 Asp Ser Gly Cys Gly Thr Trp Val His Lys Gly Pro Gly Ser Ser Met -65 -60 -55
GGA GTG GAA GCA TCC TAC AGA GGC TGC TAT GTG ACT GAG TGG GAC TCT 343 Gly Val Glu Ala Ser Tyr Arg Gly Cys Tyr Val Thr Glu Trp Asp Ser -50 -45 -40 -35
CAC TAC CTC ATG CCC ATT GGA CTT GAA GAA GCA GAT GCA GGT GGA CAC 391 His Tyr Leu Met Pro He Gly Leu Glu Glu Ala Asp Ala Gly Gly His -30 -25 -20
AGA ACA GTC ACA GAG ACG AAA CTG TTT AAG TGC CCT GTG GAT TTC CTA 439 Arg Thr Val Thr Glu Thr Lys Leu Phe Lys Cys Pro Val Asp Phe Leu -15 -10 -5
GCT CTT GAT GTT CCA ACC ATT GGC CTT TGT GAT GCT GTC CCA GTG TGG 487 Ala Leu Asp Val Pro Thr He Gly Leu Cys Asp Ala Val Pro Val Trp 1 5 10
GAC CGA TTG CCA TGT GCT CCT CCA CCC ATC ACT CAA GGA GAA TGC AAG 535 Asp Arg Leu Pro Cys Ala Pro Pro Pro He Thr Gin Gly Glu Cys Lys 15 20 25 30
CAG CTT GGC TGC TGC TAC AAC TCG GAA GAG GTC CCT TCT TGT TAC TAT 583 Gin Leu Gly Cys Cys Tyr Asn Ser Glu Glu Val Pro Ser Cys Tyr Tyr 35 40 45
GGA AAC ACA GTG ACC TCA CGC TGT ACC CAA GAT GGC CAC TTC TCC ATC 631 Gly Asn Thr Val Thr Ser Arg Cys Thr Gin Asp Gly His Phe Ser He 50 55 60
GCT GTG TCT CGC AAT GTG ACC TCA CCT CCA CTG CTC TGG GAT TCT GTG 679 Ala Val Ser Arg Asn Val Thr Ser Pro Pro Leu Leu Trp Asp Ser Val 65 70 75
CAC CTG GCC TTC AGA AAT GAC AGT GAA TGT AAA CCT GTG ATG GAA ACA 727 His Leu Ala Phe Arg Asn Asp Ser Glu Cys Lys Pro Val Met Glu Thr 80 85 90
CAC ACT TTT GTC CTC TTC CGG TTT CCA TTT AGT TCC TGT GGG ACT GCA 775
His Thr Phe Val Leu Phe Arg Phe Pro Phe Ser Ser Cys Gly Thr Ala 95 100 105 110
AAA CGG GTA ACT GGG AAC CAG GCG GTA TAT GAA AAT GAG CTG GTA GCA 823 Lys Arg Val Thr Gly Asn Gin Ala Val Tyr Glu Asn Glu Leu Val Ala 115 120 125
GCT CGG GAT GTG AGG ACT TGG AGC CAT GGT TCT ATT ACC CGA GAC AGC 871 Ala Arg Asp Val Arg Thr Trp Ser His Gly Ser He Thr Arg Asp Ser 130 135 140
ATC TTC AGG CTT CGA GTC AGT TGT ATC TAC TCT GTA AGT AGC AGT GCT 919 He Phe Arg Leu Arg Val Ser Cys He Tyr Ser Val Ser Ser Ser Ala 145 150 155
CTC CCA GTT AAC ATC CAG GTT TTC ACT CTC CCA CCA CCG CTT CCG GAG 967 Leu Pro Val Asn He Gin Val Phe Thr Leu Pro Pro Pro Leu Pro Glu 160 165 170
ACC CAC CCT GGA CCT CTT ACT CTG GAG CTT CAG ATT GCC AAA GAT GAA 1015 Thr His Pro Gly Pro Leu Thr Leu Glu Leu Gin He Ala Lys Asp Glu 175 180 185 190
CGC TAT GGC TCC TAC TAC AAT GCT AGT GAC TAC CCG GTG GTG AAA TTG 1063 Arg Tyr Gly Ser Tyr Tyr Asn Ala Ser Asp Tyr Pro Val Val Lys Leu 195 200 205
CTT CGG GAG CCC ATC TAT GTG GAG GTC TCT ATC CGT CAC CGA ACA GAC 1111 Leu Arg Glu Pro He Tyr Val Glu Val Ser He Arg His Arg Thr Asp 210 215 220
CCC AGT CTC GGG CTG CAC CTG CAC CAG TGC TGG GCC ACA CCC GGC ATG 1159 Pro Ser Leu Gly Leu His Leu His Gin Cys Trp Ala Thr Pro Gly Met 225 230 235
AGC CCC CTG CTC CAG CCA CAG TGG CCC ATG CTA GTC AAT GGA TGC CCC 1207 Ser Pro Leu Leu Gin Pro Gin Trp Pro Met Leu Val Asn Gly Cys Pro 240 245 250
TAC ACT GGA GAC AAC TAC CAG ACC AAA CTG ATC CCT GTC CAG AAA GCC 1255 Tyr Thr Gly Asp Asn Tyr Gin Thr Lys Leu He Pro Val Gin Lys Ala 255 260 265 270
TCA AAC CTG CTA TTT CCT TCT CAC TAC CAG CGT TTC AGT GTT TCC ACC 1303 Ser Asn Leu Leu Phe Pro Ser His Tyr Gin Arg Phe Ser Val Ser Thr 275 280 285
TTC AGT TTT GTG GAC TCT GTG GCA AAG CAG GCA CTC AAG GGA CCG GTG 1351 Phe Ser Phe Val Asp Ser Val Ala Lys Gin Ala Leu Lys Gly Pro Val 290 295 300
TAT CTG CAT TGT ACT GCA TCG GTC TGC AAG CCT GCA GGG GCA CCG ATC 1399 Tyr Leu His Cys Thr Ala Ser Val Cys Lys Pro Ala Gly Ala Pro He 305 310 315
TGT GTG ACA ACC TGT CCT GCT GCC AGA CGA AGA AGA AGT TCT GAC ATC 1447 Cys Val Thr Thr Cys Pro Ala Ala Arg Arg Arg Arg Ser Ser Asp He 320 325 330
CAT TTT CAG AAT GGC ACT GCT AGC ATT TCT AGC AAG GGT CCC ATG ATT 1495 His Phe Gin Asn Gly Thr Ala Ser He Ser Ser Lys Gly Pro Met He 335 340 345 350
CTA CTC CAA GCC ACT CGG GAC TCT TCA GAA AGG CTC CAT AAA TAC TCA 1543 Leu Leu Gin Ala Thr Arg Asp Ser Ser Glu Arg Leu His Lys Tyr Ser 355 360 365
AGG CCT CCT GTA GAC TCC CAT GCT CTG TGG GTG GCT GGC CTC TTG GGA 1591 Arg Pro Pro Val Asp Ser His Ala Leu Trp Val Ala Gly Leu Leu Gly 370 375 380
AGC TTA ATT ATT GGA CCC TTG TTA GTG TCC TAC CTG GTC TTC AGG AAA 1639 Ser Leu He He Gly Ala Leu Leu Val Ser Tyr Leu Val Phe Arg Lys 385 390 395
TGG AGA TGAGTTACTC AGACCAAATG TGTCAATAAA ACCAATAAAA CAAAACCGGA 1695 Trp Arg 400
ATTC 1699
(2) INFORMATION FOR SEQ ID NO:4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 536 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:
Met Trp Leu Arg Pro Ser He Trp Leu Cys Phe Pro Leu Cys Leu Ala -136 -135 -130 -125
Leu Pro Gly Gin Ser Gin Pro Lys Ala Ala Asp Asp Leu Gly Gly Leu -120 -115 -110 -105
Tyr Cys Gly Pro Ser Ser Phe His Phe Ser He Asn Leu Leu Ser Gin -100 -95 -90
Asp Thr Ala Thr Pro Pro Ala Leu Val Val Trp Asp Arg Arg Gly Arg -85 -80 -75
Leu His Lys Leu Gin Asn Asp Ser Gly Cys Gly Thr Trp Val His Lys -70 -65 -60
Gly Pro Gly Ser Ser Met Gly Val Glu Ala Ser Tyr Arg Gly Cys Tyr -55 -50 -45
Val Thr Glu Trp Asp Ser His Tyr Leu Met Pro He Gly Leu Glu Glu -40 -35 -30 -25
Ala Asp Ala Gly Gly His Arg Thr Val Thr Glu Thr Lys Leu Phe Lys -20 -15 -10
Cys Pro Val Asp Phe Leu Ala Leu Asp Val Pro Thr He Gly Leu Cys
-5 1 5
Asp Ala Val Pro Val Trp Asp Arg Leu Pro Cys Ala Pro Pro Pro He 10 15 20
Thr Gin Gly Glu Cys Lys Gin Leu Gly Cys Cys Tyr Asn Ser Glu Glu 25 30 35 40
Val Pro Ser Cys Tyr Tyr Gly Asn Thr Val Thr Ser Arg Cys Thr Gin 45 50 55
Asp Gly His Phe Ser He Ala Val Ser Arg Asn Val Thr Ser Pro Pro 60 65 70
Leu Leu Trp Asp Ser Val His Leu Ala Phe Arg Asn Asp Ser Glu Cys
75 80 85
Lys Pro Val Met Glu Thr His Thr Phe Val Leu Phe Arg Phe Pro Phe 90 95 100
Ser Ser Cys Gly Thr Ala Lys Arg Val Thr Gly Asn Gin Ala Val Tyr 105 110 115 120
Glu Asn Glu Leu Val Ala Ala Arg Asp Val Arg Thr Trp Ser His Gly 125 130 135
Ser He Thr Arg Asp Ser He Phe Arg Leu Arg Val Ser Cys He Tyr 140 145 150
Ser Val Ser Ser Ser Ala Leu Pro Val Asn He Gin Val Phe Thr Leu 155 160 165
Pro Pro Pro Leu Pro Glu Thr His Pro Gly Pro Leu Thr Leu Glu Leu 170 175 180
Gin He Ala Lys Asp Glu Arg Tyr Gly Ser Tyr Tyr Asn Ala Ser Asp 185 190 195 200
Tyr Pro Val Val Lys Leu Leu Arg Glu Pro He Tyr Val Glu Val Ser 205 210 215
He Arg His Arg Thr Asp Pro Ser Leu Gly Leu His Leu His Gin Cys 220 225 230
Trp Ala Thr Pro Gly Met Ser Pro Leu Leu Gin Pro Gin Trp Pro Met 235 240 245
Leu Val Asn Gly Cys Pro Tyr Thr Gly Asp Asn Tyr Gin Thr Lys Leu 250 255 260
He Pro Val Gin Lys Ala Ser Asn Leu Leu Phe Pro Ser His Tyr Gin 265 270 275 280
Arg Phe Ser Val Ser Thr Phe Ser Phe Val Asp Ser Val Ala Lys Gin 285 290 295
Ala Leu Lys Gly Pro Val Tyr Leu His Cys Thr Ala Ser Val Cys Lys 300 305 310
Pro Ala Gly Ala Pro He Cys Val Thr Thr Cys Pro Ala Ala Arg Arg 315 320 325
Arg Arg Ser Ser Asp He His Phe Gin Asn Gly Thr Ala Ser He Ser 330 335 340
Ser Lys Gly Pro Met He Leu Leu Gin Ala Thr Arg Asp Ser Ser Glu 345 350 355 360
Arg Leu His Lys Tyr Ser Arg Pro Pro Val Asp Ser His Ala Leu Trp 365 370 375
Val Ala Gly Leu Leu Gly Ser Leu He He Gly Ala Leu Leu Val Ser 380 385 390
Tyr Leu Val Phe Arg Lys Trp Arg 395 400
(2) INFORMATION FOR SEQ ID NO:5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1326 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Sus scrofa
(D) DEVELOPMENTAL STAGE: Juvenile
(E) HAPLOTYPE: Diploidy
(F) TISSUE TYPE: Ovary
(G) CELL TYPE: Oocyte
(ix) FEATURE:
(A) NAME/KEY: sig_peptide
(B) LOCATION: 25..105
(ix) FEATURE:
(A) NAME/KEY: mat_peptide
(B) LOCATION: 106..1290
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 25..1290
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:
GAATTCCGGG GCCTTGTGAG TGCC ATG GCG CCG AGC TGG AGG TTC TTC GTC 51
Met Ala Pro Ser Trp Arg Phe Phe Val -27 -25 -20
TGC TTT CTG CTC TGG GGA GGT ACA GAG CTA TGC AGC CCG CAG CCC GTC 99 Cys Phe Leu Leu Trp Gly Gly Thr Glu Leu Cys Ser Pro Gin Pro Val -15 -10 -5
TGG CAG GAC GAA GGC CAG CGC TTG AGG CCC TCA AAG CCA CCC ACC GTA 147 Trp Gin Asp Glu Gly Gin Arg Leu Arg Pro Ser Lys Pro Pro Thr Val 1 5 10
ATG GTG GAG TGT CAG GAG GCC CAG CTG GTG GTC ATT GTC AGC AAA GAC 195 Met Val Glu Cys Gin Glu Ala Gin Leu Val Val He Val Ser Lys Asp 15 20 25 30
CTT TTC GGT ACC GGG AAG CTC ATC AGG CCT GCA GAT CTC AGC CTG GGC 243 Leu Phe Gly Thr Gly Lys Leu He Arg Pro Ala Asp Leu Ser Leu Gly 35 40 45
CCT GCA AAG TGT GAG CCG CTG GTC TCT CAG GAC ACG GAC GCA GTG GTC 291 Pro Ala Lys Cys Glu Pro Leu Val Ser Gin Asp Thr Asp Ala Val Val
50 55 60
AGG TTT GAG GTT GGG CTG CAC GAG TGT GGC AGC AGC TTG CAG GTG ACT 339 Arg Phe Glu Val Gly Leu His Glu Cys Gly Ser Ser Leu Gin Val Thr 65 70 75
GAT GAT GCT CTG GTG TAC AGC ACC TTC CTG CGC CAT GAC CCC CGC CCT 387 Asp Asp Ala Leu Val Tyr Ser Thr Phe Leu Arg His Asp Pro Arg Pro 80 85 90
GCA GGA AAC CTG TCC ATC CTG AGG ACG AAC CGT GCG GAG GTC CCC ATC 435 Ala Gly Asn Leu Ser He Leu Arg Thr Asn Arg Ala Glu Val Pro He 95 100 105 110
GAG TGT CAC TAC CCC AGG CAG GGC AAC GTG AGC AGC TGG GCC ATC CTG 483 Glu Cys His Tyr Pro Arg Gin Gly Asn Val Ser Ser Trp Ala He Leu 115 120 125
CCC ACC TGG GTG CCC TTC AGG ACC ACG GTG TTC TCC GAG GAG AAG CTG 531 Pro Thr Trp Val Pro Phe Arg Thr Thr Val Phe Ser Glu Glu Lys Leu 130 135 140
GTG TTC TCT CTG CGC CTG ATG GAG GAA AAC TGG AGT GCC GAG AAG ATG 579 Val Phe Ser Leu Arg Leu Met Glu Glu Asn Trp Ser Ala Glu Lys Met 145 150 155
ACG CCC ACC TTC CAG CTG GGG GAC AGA GCC CAC CTC CAG GCC CAA GTC 627 Thr Pro Thr Phe Gin Leu Gly Asp Arg Ala His Leu Gin Ala Gin Val 160 165 170
CAC ACC GGC AGC CAC GTG CCA CTG AGG CTG TTT GTG GAC CAC TGT GTG 675 His Thr Gly Ser His Val Pro Leu Arg Leu Phe Val Asp His Cys Val 175 180 185 190
GCC ACG CTG ACG CCG GAC TGG AAC ACC TCC CCC TCT CAC ACC ATC GTG 723 Ala Thr Leu Thr Pro Asp Trp Asn Thr Ser Pro Ser His Thr He Val 195 200 205
GAC TTC CAC GGC TGT CTC GTG GAC GGT CTC ACT GAG GCC TCA TCT GCT 771 Asp Phe His Gly Cys Leu Val Asp Gly Leu Thr Glu Ala Ser Ser Ala 210 215 220
TTC AAA GCA CCT AGA CCT GGA CCA GAG ACG CTC CAG TTC ACC GTG GAT 819 Phe Lys Ala Pro Arg Pro Gly Pro Glu Thr Leu Gin Phe Thr Val Asp 225 230 235
GTG TTC CAT TTT GCT AAT GAT TCC AGA AAC ACG ATC TAC ATC ACC TGC 867 Val Phe His Phe Ala Asn Asp Ser Arg Asn Thr He Tyr He Thr Cys 240 245 250
CAT CTG AAG GTC ACT CCG GCT GAC CGA GTC CCG GAC CAA CTC AAC AAA 915 His Leu Lys Val Thr Pro Ala Asp Arg Val Pro Asp Gin Leu Asn Lys 255 260 265 270
GCC TGT TCC TTC AGC AAG TCC TCC AAC AGG TGG TCC CCG GTG GAA GGG 963 Ala Cys Ser Phe Ser Lys Ser Ser Asn Arg Trp Ser Pro Val Glu Gly 275 280 285
CCT GCT GTT ATC TGT CGT TGC TGT CAC AAG GGG CAG TGT GGT ACC CCA 1011 Pro Ala Val He Cys Arg Cys Cys His Lys Gly Gin Cys Gly Thr Pro 290 295 300
AGC CTT TCC AGG AAG CTG TCT ATG CCG AAG AGA CAG TCT GCT CCC CGC 1059 Ser Leu Ser Arg Lys Leu Ser Met Pro Lys Arg Gin Ser Ala Pro Arg 305 310 315
AGT CGC AGG CAC GTG ACA GAT GAA GCA GAT GTC ACA GTG GGG CCT CTG 1107 Ser Arg Arg His Val Thr Asp Glu Ala Asp Val Thr Val Gly Pro Leu 320 325 330
ATC TTC CTG GGC AAG ACG AGT GAC CAC GGT GTG GAA GGG TCC ACC TCC 1155 He Phe Leu Gly Lys Thr Ser Asp His Gly Val Glu Gly Ser Thr Ser 335 340 345 350
TCC CCC ACC TCG GTG ATG GTG GGC TTG GGC CTG GCC ACC GTG GTG ACC 1203 Ser Pro Thr Ser Val Met Val Gly Leu Gly Leu Ala Thr Val Val Thr 355 360 365
TTG ACT CTG GCT ACC ATT GTC CTG GGT GTG CCC AGG AGG CGT CGG GCT 1251 Leu Thr Leu Ala Thr He Val Leu Gly Val Pro Arg Arg Arg Arg Ala
370 375 380
GCT GCC CAC CTT GTG TGC CCC GTG TCT GCT TCC CAA TAAAAGGAGA 1297
Ala Ala His Leu Val Cys Pro Val Ser Ala Ser Gin 385 390
AACATGAAAA AAAAAAAAAA CCGGAATTC 1326
(2) INFORMATION FOR SEQ ID NO:6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 421 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID Nθ:6:
Met Ala Pro Ser Trp Arg Phe Phe Val Cys Phe Leu Leu Trp Gly Gly -27 -25 -20 -15
Thr Glu Leu Cys Ser Pro Gin Pro Val Trp Gin Asp Glu Gly Gin Arg -10 -5 1 5
Leu Arg Pro Ser Lys Pro Pro Thr Val Met Val Glu Cys Gin Glu Ala 10 15 20
Gin Leu Val Val He Val Ser Lys Asp Leu Phe Gly Thr Gly Lys Leu 25 30 35
He Arg Pro Ala Asp Leu Ser Leu Gly Pro Ala Lys Cys Glu Pro Leu 40 45 50
Val Ser Gin Asp Thr Asp Ala Val Val Arg Phe Glu Val Gly Leu His 55 60 65
Glu Cys Gly Ser Ser Leu Gin Val Thr Asp Asp Ala Leu Val Tyr Ser 70 75 80 85
Thr Phe Leu Arg His Asp Pro Arg Pro Ala Gly Asn Leu Ser He Leu 90 95 100
Arg Thr Asn Arg Ala Glu Val Pro He Glu Cys His Tyr Pro Arg Gin 105 110 115
Gly Asn Val Ser Ser Trp Ala He Leu Pro Thr Trp Val Pro Phe Arg 120 125 130
Thr Thr Val Phe Ser Glu Glu Lys Leu Val Phe Ser Leu Arg Leu Met 135 140 145
Glu Glu Asn Trp Ser Ala Glu Lys Met Thr Pro Thr Phe Gin Leu Gly 150 155 160 165
Asp Arg Ala His Leu Gin Ala Gin Val His Thr Gly Ser His Val Pro 170 175 180
Leu Arg Leu Phe Val Asp His Cys Val Ala Thr Leu Thr Pro Asp Trp 185 190 195
Asn Thr Ser Pro Ser His Thr He Val Asp Phe His Gly Cys Leu Val 200 205 210 sp Gly Leu Thr Glu Ala Ser Ser Ala Phe Lys Ala Pro Arg Pro Gly 215 220 225
Pro Glu Thr Leu Gin Phe Thr Val Asp Val Phe His Phe Ala Asn Asp 230 235 240 245
Ser Arg Asn Thr He Tyr He Thr Cys His Leu Lys Val Thr Pro Ala 250 255 260
Asp Arg Val Pro Asp Gin Leu Asn Lys Ala Cys Ser Phe Ser Lys Ser 265 270 275
Ser Asn Arg Trp Ser Pro Val Glu Gly Pro Ala Val He Cys Arg Cys 280 285 290
Cys His Lys Gly Gin Cys Gly Thr Pro Ser Leu Ser Arg Lys Leu Ser 295 300 305
Met Pro Lys Arg Gin Ser Ala Pro Arg Ser Arg Arg His Val Thr Asp 310 315 320 325
Glu Ala Asp Val Thr Val Gly Pro Leu He Phe Leu Gly Lys Thr Ser 330 335 340
Asp His Gly Val Glu Gly Ser Thr Ser Ser Pro Thr Ser Val Met Val 345 350 355
Gly Leu Gly Leu Ala Thr Val Val Thr Leu Thr Leu Ala Thr He Val 360 365 370
Leu Gly Val Pro Arg Arg Arg Arg Ala Ala Ala His Leu Val Cys Pro 375 380 385
Val Ser Ala Ser Gin 390
(2) INFORMATION FOR SEQ ID NO:7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1338 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Oryctolagus cuniculus
(D) DEVELOPMENTAL STAGE: Juvenile
(E) HAPLOTYPE: Diploidy
(F) TISSUE TYPE: Ovary
(G) CELL TYPE: Oocyte
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 17..1261
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:
GAATTCGCGG CCGGCC TAC GGG CTC TTC GTT TGC CTA CTG CTC TGG GGA 49
Tyr Gly Leu Phe Val Cys Leu Leu Leu Trp Gly 1 5 10
GGC TCG GAG CTG TGC TGC CCC CAG CCG CTC TGG TTC TGG CAG GGC GGG 97 Gly Ser Glu Leu Cys Cys Pro Gin Pro Leu Trp Phe Trp Gin Gly Gly 15 20 25
ACC CGC CAG CCC GCG CCC TCC GTG ACG CCC GTG GTG GTG GAG TGT CTG 145 Thr Arg Gin Pro Ala Pro Ser Val Thr Pro Val Val Val Glu Cys Leu 30 35 40
GAG GCC CGG CTC GTG GTC ACG GTC AGC AGG GAC CTT TTT GGC ACC GGG 193 Glu Ala Arg Leu Val Val Thr Val Ser Arg Asp Leu Phe Gly Thr Gly 45 50 55
AAG CTC ATC CAG GAG GCC GAC CTC AGC CTG GGC CCC GAG GGC TGC GAG 241 Lys Leu He Gin Glu Ala Asp Leu Ser Leu Gly Pro Glu Gly Cys Glu 60 65 70 75
CCC CAG GCC TCC ACG GAC GCC GTG GTC AGG TTC GAG GTC GGG CTG CAT 289 Pro Gin Ala Ser Thr Asp Ala Val Val Arg Phe Glu Val Gly Leu His 80 85 90
GAA TGT GGT AAC AGC GTG CAG GTG ACT GAC GAC TCC CTG GTG TAC AGC 337 Glu Cys Gly Asn Ser Val Gin Val Thr Asp Asp Ser Leu Val Tyr Ser 95 100 105
TCC TTC CTG CTC CAC GAC CCC CGC CCC GCG GGA AAC CTG TCC ATC CTC 385 Ser Phe Leu Leu His Asp Pro Arg Pro Ala Gly Asn Leu Ser He Leu 110 115 120
AGG ACC AAC CGC GCC GAG GTC CCC ATC GAG TGC CGC TAC CCC AGG CAG 433 Arg Thr Asn Arg Ala Glu Val Pro He Glu Cys Arg Tyr Pro Arg Gin 125 130 135
GGC AAC GTG AGC AGC CGG GCG ATC CTG CCG ACC TGG GTG CCC TTC TGG 481 Gly Asn Val Ser Ser Arg Ala He Leu Pro Thr Trp Val Pro Phe Trp 140 145 150 155
ACC ACG GTA CTG TCA GAG GAG AGG CTG GTG TTC TCC CTG CGC CTC ATG 529 Thr Thr Val Leu Ser Glu Glu Arg Leu Val Phe Ser Leu Arg Leu Met 160 165 170
GAG GAG AAC TGG AGC CGA GAA AAG ATG TCC CCC ACC TTC CAC CTG GGC 577 Glu Glu Asn Trp Ser Arg Glu Lys Met Ser Pro Thr Phe His Leu Gly 175 180 185
GAC ACG GCC CAC CTG CAG GCA GAG GTC CGC ACG GGC AGC CAC CCG CCC 625 Asp Thr Ala His Leu Gin Ala Glu Val Arg Thr Gly Ser His Pro Pro 190 195 200
CTG CTG CTG TTC GTG GAT CGC TGC GTG GCC ACC CCG ACA CGG GAC CAG 673 Leu Leu Leu Phe Val Asp Arg Cys Val Ala Thr Pro Thr Arg Asp Gin 205 210 215
AGC GGC TCC CCC TAT CAC ACC ATC GTG GAC TTG CAC GGC TGT CTT GTG 721 Ser Gly Ser Pro Tyr His Thr He Val Asp Leu His Gly Cys Leu Val 220 225 230 235
GAT GGC CTC TCC GAT GGG GCT TCC AAG TTC AAA GCC CCC AGG CCG AAG 769 Asp Gly Leu Ser Asp Gly Ala Ser Lys Phe Lys Ala Pro Arg Pro Lys 240 245 250
CCG GAC GTG CTC CAG TTC ATG GTG GCC GTG TTC CAC TTC GCT AAT GAC 817 Pro Asp Val Leu Gin Phe Met Val Ala Val Phe His Phe Ala Asn Asp 255 260 265
TCC AGG CAC ACG GTC TAC ATC ACG TGT CAC CTG AGG GTC ATT CCT GCC 865 Ser Arg His Thr Val Tyr He Thr Cys His Leu Arg Val He Pro Ala 270 275 280
CAG CAA GCC CCG GAC CGG CTC AAC AAG GCT TGT TCT TTC AAC CAG TCC 913 Gin Gin Ala Pro Asp Arg Leu Asn Lys Ala Cys Ser Phe Asn Gin Ser 285 290 295
TCC AGC AGC TGG GCC CCG GTG GAA GGC AGT GCA GAC ATC TGT GAG TGT 961 Ser Ser Ser Trp Ala Pro Val Glu Gly Ser Ala Asp He Cys Glu Cys 300 305 310 315
TGC GGC AAC GGT GAC TGT GAC CTC ATC GCA GGC TCC CCC ATG AAC CAG 1009 Cys Gly Asn Gly Asp Cys Asp Leu He Ala Gly Ser Pro Met Asn Gin 320 325 330
AAC CAT GCT GCC CGG TCC TCT CTG CGA AGC CGC AGG CAC GTG ACG GAA 1057 Asn His Ala Ala Arg Ser Ser Leu Arg Ser Arg Arg His Val Thr Glu 335 340 345
GAA GCA GAC GTC ACC GTG GGC CCG CTG ATC TTC CTG GGG AAG GCT GGT 1105 Glu Ala Asp Val Thr Val Gly Pro Leu He Phe Leu Gly Lys Ala Gly 350 355 360
GAC CCT GCC GGC ACA GAG GGG CTG GCC TCT GCT GCG CAG GCG ACC CTG 1153 Asp Pro Ala Gly Thr Glu Gly Leu Ala Ser Ala Ala Gin Ala Thr Leu 365 370 375
GTG CTG GGC CTT CGC ATG GCC ACC ATT GTG TTC CTG GCT GTG GCT GCT 1201 Val Leu Gly Leu Arg Met Ala Thr He Val Phe Leu Ala Val Ala Ala 380 385 390 395
GTG GTC CTG GGC CTC ACC AGG GGG CGC CAC GCT GCT TCC CAC CCC AGG 1249 Val Val Leu Gly Leu Thr Arg Gly Arg His Ala Ala Ser His Pro Arg 400 405 410
TCT GCT TCC CAA TAAAAAATCA TGACTTCAAA AAAAAAAAAA AAAAAAAAAA 1301 Ser Ala Ser Gin 415
AAAAAAAAAA AAAAAAAAAA AAAGCGGCCG CGAATTC 1338
(2) INFORMATION FOR SEQ ID NO:8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 415 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
( i) SEQUENCE DESCRIPTION: SEQ ID NO:8:
Tyr Gly Leu Phe Val Cys Leu Leu Leu Trp Gly Gly Ser Glu Leu Cys
1 5 10 15
Cys Pro Gin Pro Leu Trp Phe Trp Gin Gly Gly Thr Arg Gin Pro Ala 20 25 30
Pro Ser Val Thr Pro Val Val Val Glu Cys Leu Glu Ala Arg Leu Val 35 40 45
Val Thr Val Ser Arg Asp Leu Phe Gly Thr Gly Lys Leu He Gin Glu 50 55 60
Ala Asp Leu Ser Leu Gly Pro Glu Gly Cys Glu Pro Gin Ala Ser Thr 65 70 75 80
Asp Ala Val Val Arg Phe Glu Val Gly Leu His Glu Cys Gly Asn Ser 85 90 95
Val Gin Val Thr Asp Asp Ser Leu Val Tyr Ser Ser Phe Leu Leu His 100 105 110
Asp Pro Arg Pro Ala Gly Asn Leu Ser He Leu Arg Thr Asn Arg Ala 115 120 125
Glu Val Pro He Glu Cys Arg Tyr Pro Arg Gin Gly Asn Val Ser Ser 130 135 140
Arg Ala He Leu Pro Thr Trp Val Pro Phe Trp Thr Thr Val Leu Ser 145 150 155 160
Glu Glu Arg Leu Val Phe Ser Leu Arg Leu Met Glu Glu Asn Trp Ser 165 170 175
Arg Glu Lys Met Ser Pro Thr Phe His Leu Gly Asp Thr Ala His Leu 180 185 190
Gin Ala Glu Val Arg Thr Gly Ser His Pro Pro Leu Leu Leu Phe Val 195 200 205
Asp Arg Cys Val Ala Thr Pro Thr Arg Asp Gin Ser Gly Ser Pro Tyr 210 215 220
His Thr He Val Asp Leu His Gly Cys Leu Val Asp Gly Leu Ser Asp 225 230 235 240
Gly Ala Ser Lys Phe Lys Ala Pro Arg Pro Lys Pro Asp Val Leu Gin 245 250 255
Phe Met Val Ala Val Phe His Phe Ala Asn Asp Ser Arg His Thr Val 260 265 270
Tyr He Thr Cys His Leu Arg Val He Pro Ala Gin Gin Ala Pro Asp 275 280 285
Arg Leu Asn Lys Ala Cys Ser Phe Asn Gin Ser Ser Ser Ser Trp Ala 290 295 300
Pro Val Glu Gly Ser Ala Asp He Cys Glu Cys Cys Gly Asn Gly Asp 305 310 315 320
Cys Asp Leu He Ala Gly Ser Pro Met Asn Gin Asn His Ala Ala Arg 325 330 335
Ser Ser Leu Arg Ser Arg Arg His Val Thr Glu Glu Ala Asp Val Thr 340 345 350
Val Gly Pro Leu He Phe Leu Gly Lys Ala Gly Asp Pro Ala Gly Thr 355 360 365
Glu Gly Leu Ala Ser Ala Ala Gin Ala Thr Leu Val Leu Gly Leu Arg 370 375 380
Met Ala Thr He Val Phe Leu Ala Val Ala Ala Val Val Leu Gly Leu 385 390 395 400
Thr Arg Gly Arg His Ala Ala Ser His Pro Arg Ser Ala Ser Gin 405 410 415
(2) INFORMATION FOR SEQ ID NO:9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2381 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Canis familiaris
(D) DEVELOPMENTAL STAGE: Juvenile
(E) HAPLOTYPE: Diploidy
(F) TISSUE TYPE: Ovary
(G) CELL TYPE: Oocyte
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 206..2353
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:
GAATTCCGGG AGCCCTGAAG GAAGCCGCAA GAACCCTGCC CGCACCTCCG CGACCTCAAG 60
ATGTCCACTC CACTGGAAGA CGGAGAATAC TGGATTGACC CCAACCAAGG ATGCAACCTG 120
ATGCCATCAA GGTTTTCTGC AACATGGAGA CAGGTGAGAC CTGCGTATAC CCACCTACCT 180
GGCTGATTTG GTGGTACGTT TGGCC ATG GCA TGC AAA CAG AAA GGA GAC AGT 232
Met Ala Cys Lys Gin Lys Gly Asp Ser
1 5
GGG AGT CCC TCA AGC AGG TTT AGT GCA GAT TGG AGC ACC TAC AGG TCA 280 Gly Ser Pro Ser Ser Arg Phe Ser Ala Asp Trp Ser Thr Tyr Arg Ser 10 15 20 25
CTT TCT TTA TTC TTC ATC CTT GTG ACT TCA GTG AAC TCA GTA GGT GTT 328 Leu Ser Leu Phe Phe He Leu Val Thr Ser Val Asn Ser Val Gly Val 30 35 40
ATG CAG TTG GTG AAT CCC ATC TTC CCA GGT ACT GTC ATT TGC CAT GAA 376 Met Gin Leu Val Asn Pro He Phe Pro Gly Thr Val He Cys His Glu 45 50 55
AAT AAA ATG ACA GTG GAA TTT CCA AGG GAT CTT GGC ACC AAA AAA TGG 424 Asn Lys Met Thr Val Glu Phe Pro Arg Asp Leu Gly Thr Lys Lys Trp 60 65 70
CAT GCA TCT GTG GTG GAT CCA TTT AGT TTT GAA TTG TTG AAC TGT ACT 472 His Ala Ser Val Val Asp Pro Phe Ser Phe Glu Leu Leu Asn Cys Thr 75 80 85
TCT ATC CTG GAC CCA GAA AAG CTC ACC CTG AAG GCC CCA TAT GAG ACC 520 Ser He Leu Asp Pro Glu Lys Leu Thr Leu Lys Ala Pro Tyr Glu Thr 90 95 100 105
TGT AGC AGG AGA GTG CTT GGC CAG CAT CAG ATG GCC ATC AGA CTC ACG 568 Cys Ser Arg Arg Val Leu Gly Gin His Gin Met Ala He Arg Leu Thr 110 115 120
GAC AAC AAT GCT GCT TCA AGA CAT AAG GCT TTC ATG TAT CAG ATC AGC 616 Asp Asn Asn Ala Ala Ser Arg His Lys Ala Phe Met Tyr Gin He Ser 125 130 135
TGT CCA GTT ATG CAA ACA GAA GAA ACC CAT GAG CAT GCA GGA TCC ACA 664 Cys Pro Val Met Gin Thr Glu Glu Thr His Glu His Ala Gly Ser Thr 140 145 150
ATC TGC ACA AAA GAT TCC ATG TCT TTT ACC TTT AAC ATT ATT CCT GGC 712
He Cys Thr Lys Asp Ser Met Ser Phe Thr Phe Asn He He Pro Gly 155 160 165
ATG GCT GAT GAA AAT ACG AAT CCC AGT GGT GGG AAA TGG ATG ATG GAG 760 Met Ala Asp Glu Asn Thr Asn Pro Ser Gly Gly Lys Trp Met Met Glu 170 175 180 185
GTT GAT GAT GCA AAA GCT CAA AAT CTG ACT CTT CGG GAG GCC TTG ATG 808 Val Asp Asp Ala Lys Ala Gin Asn Leu Thr Leu Arg Glu Ala Leu Met 190 195 200
CAA GGA TAT AAT TTC CTG TTT GAT AGC CAC AGG CTC AGT GTC CAA GTG 856 Gin Gly Tyr Asn Phe Leu Phe Asp Ser His Arg Leu Ser Val Gin Val 205 210 215
TCA TTC AAT GCC ACT GGA GTC ACT CAC TAC ATG CAA GGT AAC AGT CAC 904 Ser Phe Asn Ala Thr Gly Val Thr His Tyr Met Gin Gly Asn Ser His 220 225 230
CTC TAC ACA GTG CCT CTG AAG CTT ATA CAC ACA TCT CCT GGG CAG AAG 952 Leu Tyr Thr Val Pro Leu Lys Leu He His Thr Ser Pro Gly Gin Lys 235 240 245
ATC ATC TTA ACA ACA CGA GTA CTT TGT ATG TCA GAT CCC GTG ACC TGT 1000 He He Leu Thr Thr Arg Val Leu Cys Met Ser Asp Pro Val Thr Cys 250 255 260 265
AAC GCC ACA CAC ATG ACC CTC ACC ATA CCA GAG TTT CCT GGG AAA CTA 1048 Asn Ala Thr His Met Thr Leu Thr He Pro Glu Phe Pro Gly Lys Leu 270 275 280
CAG TCT GTG AGA TTT GAA AAC ACG AAC TTT CGT GTA AGC CAG CTG CAC 1096 Gin Ser Val Arg Phe Glu Asn Thr Asn Phe Arg Val Ser Gin Leu His 285 290 295
AAC CAT GGG ATT GAT AAA GAA GAA TTA AAC GGC TTG AGG TTA CAC TTC 1144 Asn His Gly He Asp Lys Glu Glu Leu Asn Gly Leu Arg Leu His Phe 300 305 310
AGC AAA TCT CTT CTC AAA ATG AAC TCC TCT GAA AAA TGC CTA CTC TAT 1192 Ser Lys Ser Leu Leu Lys Met Asn Ser Ser Glu Lys Cys Leu Leu Tyr 315 320 325
CAG TTC TAC TTA GCA TCT CTC AAG CTG ACC TTT GCC TTT GAA CGG GAC 1240 Gin Phe Tyr Leu Ala Ser Leu Lys Leu Thr Phe Ala Phe Glu Arg Asp 330 335 340 345
ACG GTT TCC ACA GTG GTT TAT CCT GAG TGT GTT TGT GAG CCA CCA GTT 1288 Thr Val Ser Thr Val Val Tyr Pro Glu Cys Val Cys Glu Pro Pro Val 350 355 360
ACT ATA GTT ACA GGT GAC CTG TGT ACC CAG GAT GGG TTT ATG GAT GTC 1336 Thr He Val Thr Gly Asp Leu Cys Thr Gin Asp Gly Phe Met Asp Val 365 370 375
AAG GTC TAC AGC CAC CAA ACA AAA CCA GCT CTA AAC TTG GAT ACC CTC 1384 Lys Val Tyr Ser His Gin Thr Lys Pro Ala Leu Asn Leu Asp Thr Leu 380 385 390
AGA GTG GGA GAC TCC TCC TGC CAA CCT ACT TTC AAG GCT CCA TCA CAA 1432 Arg Val Gly Asp Ser Ser Cys Gin Pro Thr Phe Lys Ala Pro Ser Gin 395 400 405
GGG TTG ACA CTG TTT CAC ATC CCC CTA AAT GGA TGT GGA ACA AGA CTT 1480 Gly Leu Thr Leu Phe His He Pro Leu Asn Gly Cys Gly Thr Arg Leu 410 415 420 425
AAG TTC AAA GGT GAC ACA GTC ATC TAT GAA AAT GAA ATA CAT GCT CTC 1528 Lys Phe Lys Gly Asp Thr Val He Tyr Glu Asn Glu He His Ala Leu 430 435 440
TGG ACA GAT CTC CCT CCA AGC ACA ATT TCC AGA GAT AGT GAA TTC AGA 1576 Trp Thr Asp Leu Pro Pro Ser Thr He Ser Arg Asp Ser Glu Phe Arg 445 450 455
ATG ACT GTG AAG TGC CAT TAC AGC AGA GAT GAC CTG CTG ATA AAT ACC 1624 Met Thr Val Lys Cys His Tyr Ser Arg Asp Asp Leu Leu He Asn Thr 460 465 470
AAT GTC CAA AGT CTT CCT CCT CCC GTG GCC TCA GTG AGG CCT GGT CCA 1672 Asn Val Gin Ser Leu Pro Pro Pro Val Ala Ser Val Arg Pro Gly Pro 475 480 485
CTT GCC TTA ATC CTG CAA ACC TAC CCA GAT AAA TCC TAT TTG CGA CCC 1720 Leu Ala Leu He Leu Gin Thr Tyr Pro Asp Lys Ser Tyr Leu Arg Pro 490 495 500 505
TAT GGG GAT AAG GAG TAT CCT GTG GTG AGA TAC CTC CGC CAA CCA ATT 1768 Tyr Gly Asp Lys Glu Tyr Pro Val Val Arg Tyr Leu Arg Gin Pro He 510 515 520
TAC CTG GAA GTG AAA GTC CTA AAT AGG GCT GAC CCC AAC ATC AAG CTG 1816 Tyr Leu Glu Val Lys Val Leu Asn Arg Ala Asp Pro Asn He Lys Leu 525 530 535
GTC TTA GAT GAT TGC TGG GCA ACA CCC ACC ATG GAC CCA GCC TCA CTC 1864 Val Leu Asp Asp Cys Trp Ala Thr Pro Thr Met Asp Pro Ala Ser Leu 540 545 550
CCC CAG TGG AAT ATT GTC ATG GAT GGC TGT GAA TAC AAT CTG GAC AAC 1912 Pro Gin Trp Asn He Val Met Asp Gly Cys Glu Tyr Asn Leu Asp Asn 555 560 565
TAC AGA ACG ACC TTC CAT CCA GTT GGC TCC TCT GTG ACC TAC CCT ACT 1960 Tyr Arg Thr Thr Phe His Pro Val Gly Ser Ser Val Thr Tyr Pro Thr 570 575 580 585
CAC TAT CAG AGG TTT GAT GTG AAG ACC TTT GCC TTT ATA TCA GAG GCC 2008 His Tyr Gin Arg Phe Asp Val Lys Thr Phe Ala Phe He Ser Glu Ala 590 595 600
CAA GTG CTT TCT AGC CTG GTC TAC TTC CAC TGC ACC GCA TTA ATC TGC 2056 Gin Val Leu Ser Ser Leu Val Tyr Phe His Cys Thr Ala Leu He Cys 605 610 615
AAT CGA CTG TCT CCT GAC TCC CCT CTG TGT TCT GTG ACT TGC CCT GTA 2104 Asn Arg Leu Ser Pro Asp Ser Pro Leu Cys Ser Val Thr Cys Pro Val 620 625 630
TCA TCC AGG CAC AGG CGA GCC ACA GGC AGT ACT GAA GAA GAG AAG ATG 2152 Ser Ser Arg His Arg Arg Ala Thr Gly Ser Thr Glu Glu Glu Lys Met 635 640 645
ATA GTA AGT CTC CCG GGA CCC ATC CTC CTG TTG GCA GAC AGC TCT TCA 2200 He Val Ser Leu Pro Gly Pro He Leu Leu Leu Ala Asp Ser Ser Ser 650 655 660 665
CTC AGA GAT GGT GTG GAC TCA AAA GGG CAC AGG GCT GCT GGA TAT GTT 2248 Leu Arg Asp Gly Val Asp Ser Lys Gly His Arg Ala Ala Gly Tyr Val 670 675 680
GCT TTT AAA ACT GTA GTG GCT GTG GCT GCC TTA GCA GGC CTT GTG GCT 2296 Ala Phe Lys Thr Val Val Ala Val Ala Ala Leu Ala Gly Leu Val Ala 685 690 695
GCT CTA GGT CTC ATC ATC TAC CTG CGT AAG AAA AGA ACC ATG GTG TTA 2344 Ala Leu Gly Leu He He Tyr Leu Arg Lys Lys Arg Thr Met Val Leu 700 705 710
AAT CAC TAAGGATTTT CAAATAAAGT GTCCGGAATT C 2381
Asn His 715
(2) INFORMATION FOR SEQ ID NO:10:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 715 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:
Met Ala Cys Lys Gin Lys Gly Asp Ser Gly Ser Pro Ser Ser Arg Phe
1 5 10 15
Ser Ala Asp Trp Ser Thr Tyr Arg Ser Leu Ser Leu Phe Phe He Leu 20 25 30
Val Thr Ser Val Asn Ser Val Gly Val Met Gin Leu Val Asn Pro He 35 40 45
Phe Pro Gly Thr Val He Cys His Glu Asn Lys Met Thr Val Glu Phe 50 55 60
Pro Arg Asp Leu Gly Thr Lys Lys Trp His Ala Ser Val Val Asp Pro 65 70 75 80
Phe Ser Phe Glu Leu Leu Asn Cys Thr Ser He Leu Asp Pro Glu Lys 85 90 95
Leu Thr Leu Lys Ala Pro Tyr Glu Thr Cys Ser Arg Arg Val Leu Gly 100 105 110
Gin His Gin Met Ala He Arg Leu Thr Asp Asn Asn Ala Ala Ser Arg 115 120 125
His Lys Ala Phe Met Tyr Gin He Ser Cys Pro Val Met Gin Thr Glu 130 135 140
Glu Thr His Glu His Ala Gly Ser Thr He Cys Thr Lys Asp Ser Met 145 150 155 160
Ser Phe Thr Phe Asn He He Pro Gly Met Ala Asp Glu Asn Thr Asn 165 170 175
Pro Ser Gly Gly Lys Trp Met Met Glu Val Asp Asp Ala Lys Ala Gin 180 185 190
Asn Leu Thr Leu Arg Glu Ala Leu Met Gin Gly Tyr Asn Phe Leu Phe 195 200 205
Asp Ser His Arg Leu Ser Val Gin Val Ser Phe Asn Ala Thr Gly Val 210 215 220
Thr His Tyr Met Gin Gly Asn Ser His Leu Tyr Thr Val Pro Leu Lys 225 230 235 240
Leu He His Thr Ser Pro Gly Gin Lys He He Leu Thr Thr Arg Val 245 250 255
Leu Cys Met Ser Asp Pro Val Thr Cys Asn Ala Thr His Met Thr Leu 260 265 270
Thr He Pro Glu Phe Pro Gly Lys Leu Gin Ser Val Arg Phe Glu Asn 275 280 285
Thr Asn Phe Arg Val Ser Gin Leu His Asn His Gly He Asp Lys Glu 290 295 300
Glu Leu Asn Gly Leu Arg Leu His Phe Ser Lys Ser Leu Leu Lys Met 305 310 315 320
Asn Ser Ser Glu Lys Cys Leu Leu Tyr Gin Phe Tyr Leu Ala Ser Leu 325 330 335
Lys Leu Thr Phe Ala Phe Glu Arg Asp Thr Val Ser Thr Val Val Tyr 340 345 350
Pro Glu Cys Val Cys Glu Pro Pro Val Thr He Val Thr Gly Asp Leu 355 360 365
Cys Thr Gin Asp Gly Phe Met Asp Val Lys Val Tyr Ser His Gin Thr 370 375 380
Lys Pro Ala Leu Asn Leu Asp Thr Leu Arg Val Gly Asp Ser Ser Cys 385 390 395 400
Gin Pro Thr Phe Lys Ala Pro Ser Gin Gly Leu Thr Leu Phe His He 405 410 415
Pro Leu Asn Gly Cys Gly Thr Arg Leu Lys Phe Lys Gly Asp Thr Val 420 425 430
He Tyr Glu Asn Glu He His Ala Leu Trp Thr Asp Leu Pro Pro Ser 435 440 445
Thr He Ser Arg Asp Ser Glu Phe Arg Met Thr Val Lys Cys His Tyr 450 455 460
Ser Arg Asp Asp Leu Leu He Asn Thr Asn Val Gin Ser Leu Pro Pro 465 470 475 480
Pro Val Ala Ser Val Arg Pro Gly Pro Leu Ala Leu He Leu Gin Thr 485 490 495
Tyr Pro Asp Lys Ser Tyr Leu Arg Pro Tyr Gly Asp Lys Glu Tyr Pro 500 505 510
Val Val Arg Tyr Leu Arg Gin Pro He Tyr Leu Glu Val Lys Val Leu 515 520 525
Asn Arg Ala Asp Pro Asn He Lys Leu Val Leu Asp Asp Cys Trp Ala 530 535 540
Thr Pro Thr Met Asp Pro Ala Ser Leu Pro Gin Trp Asn He Val Met 545 550 555 560
Asp Gly Cys Glu Tyr Asn Leu Asp Asn Tyr Arg Thr Thr Phe His Pro 565 570 575
Val Gly Ser Ser Val Thr Tyr Pro Thr His Tyr Gin Arg Phe Asp Val 580 585 590
Lys Thr Phe Ala Phe He Ser Glu Ala Gin Val Leu Ser Ser Leu Val 595 600 605
Tyr Phe His Cys Thr Ala Leu He Cys Asn Arg Leu Ser Pro Asp Ser 610 615 620
Pro Leu Cys Ser Val Thr Cys Pro Val Ser Ser Arg His Arg Arg Ala 625 630 635 640
Thr Gly Ser Thr Glu Glu Glu Lys Met He Val Ser Leu Pro Gly Pro 645 650 655
He Leu Leu Leu Ala Asp Ser Ser Ser Leu Arg Asp Gly Val Asp Ser 660 665 670
Lys Gly His Arg Ala Ala Gly Tyr Val Ala Phe Lys Thr Val Val Ala 675 680 685
Val Ala Ala Leu Ala Gly Leu Val Ala Ala Leu Gly Leu He He Tyr 690 695 700
Leu Arg Lys Lys Arg Thr Met Val Leu Asn His 705 710 715
(2) INFORMATION FOR SEQ ID NO:11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1325 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(Al ORGANISM: Canis familiaris
(D) DEVELOPMENTAL STAGE: Juvenile
(E) HAPLOTYPE: Diploidy
(F) TISSUE TYPE: Ovary
(G) CELL TYPE: Oocyte
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 13..1293
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:
GAATTCCGGG CT ATG GGG CTG AGC TAT GGA ATT TTC ATC TGT TTT CTG 48
Met Gly Leu Ser Tyr Gly He Phe He Cys Phe Leu 1 5 10
CTC CTG GGA GGC ATG GAG CTG TGC TGC CCC CAG ACC ATC TGG CCA ACT 96 Leu Leu Gly Gly Met Glu Leu Cys Cys Pro Gin Thr He Trp Pro Thr 15 20 25
GAG ACC TAC TAC CCA TTG ACA TCT AGG CCC CCA GTA ATG GTG GAC TGT 144 Glu Thr Tyr Tyr Pro Leu Thr Ser Arg Pro Pro Val Met Val Asp Cys 30 35 40
CTG GAG TCC CAG CTG GTG GTC ACT GTC AGC AAA GAC CTT TTT GGT ACT 192 Leu Glu Ser Gin Leu Val Val Thr Val Ser Lys Asp Leu Phe Gly Thr 45 50 55 60
GGG AAG CTC ATC AGG CCA GCA GAC CTC ACC CTG GGT CCA GAG AAC TGT 240
Gly Lys Leu He Arg Pro Ala Asp Leu Thr Leu Gly Pro Glu Asn Cys 65 70 75
GAG CCC CTG GTC TCC ATG GAC ACG GAT GAT GTG GTC AGG TTT GAG GTT 288 Glu Pro Leu Val Ser Met Asp Thr Asp Asp Val Val Arg Phe Glu Val 80 85 90
GGG CTG CAC GAG TGT GGC AGC AGG GTG CAG GTG ACT GAC AAT GCT CTG 336 Gly Leu His Glu Cys Gly Ser Arg Val Gin Val Thr Asp Asn Ala Leu 95 100 105
GTG TAC AGC ACC TTC CTG ATC CAC AGC CCC CGC CCT GCG GGC AAC CTG 384 Val Tyr Ser Thr Phe Leu He His Ser Pro Arg Pro Ala Gly Asn Leu 110 115 120
TCC ATC CTG AGA ACT AAT CGT GCC GAG GTT CCC ATC GAG TGC CAC TAC 432 Ser He Leu Arg Thr Asn Arg Ala Glu Val Pro He Glu Cys His Tyr 125 130 135 140
CCC AGG CAC AGC AAT GTG AGC AGC CAG GCC ATC CTG CCC ACT TGG GTG 480 Pro Arg His Ser Asn Val Ser Ser Gin Ala He Leu Pro Thr Trp Val 145 150 155
CCC TTC AGG ACC ACA ATG CTC TTC GAG GAG AAG CTA GTT TTC TCT CTC 528 Pro Phe Arg Thr Thr Met Leu Phe Glu Glu Lys Leu Val Phe Ser Leu 160 165 170
CGC CTA ATG GAG GAG GAC TGG GGC TCC GAG AAG CAA TCC CCC ACA TTC 576 Arg Leu Met Glu Glu Asp Trp Gly Ser Glu Lys Gin Ser Pro Thr Phe 175 180 185
CAG CTG GGA GAC ATA GCC CAC CTC CAG GCT GAA GTC CAC ACT GGC AGC 624 Gin Leu Gly Asp He Ala His Leu Gin Ala Glu Val His Thr Gly Ser 190 195 200
CAT ATG CCA CTG CGA CTT TTT GTG GAC CAC TGT GTG GCC ACG CTG ACA 672 His Met Pro Leu Arg Leu Phe Val Asp His Cys Val Ala Thr Leu Thr 205 210 215 220
CCA GAT CGG AAT GCC TTC CTT CAT CAC AAA ATT GTG GAC TTC CAT GGC 720 Pro Asp Arg Asn Ala Phe Leu His His Lys He Val Asp Phe His Gly 225 230 235
TGT CTT GTG GAT GGT CTC TAC AAT TCC TCT TCA GCC TTC AAA GCC CCC 768 Cys Leu Val Asp Gly Leu Tyr Asn Ser Ser Ser Ala Phe Lys Ala Pro 240 245 250
AGA CCC AGG CCA GAG ACT CTT CAG TTC ACA GTG GAT GTT TTC CAC TTT 816 Arg Pro Arg Pro Glu Thr Leu Gin Phe Thr Val Asp Val Phe His Phe 255 260 265
GCT AAG GAC TCA AGA AAC ACG ATC TAT ATC ACC TGC CAT CTG AAG GTC 864 Ala Lys Asp Ser Arg Asn Thr He Tyr He Thr Cys His Leu Lys Val 270 275 280
ACT CCG GCT GAC CGA GTC CCA GAC CAG CTA AAC AAA GCT TGT TCC TTC 912 Thr Pro Ala Asp Arg Val Pro Asp Gin Leu Asn Lys Ala Cys Ser Phe 285 290 295 300
ATC AAG TCT ACC AAG AGG TGG TAC CCT GTA GAA GGC TCG GCT GAT ATT 960 He Lys Ser Thr Lys Arg Trp Tyr Pro Val Glu Gly Ser Ala Asp He 305 310 315
TGT CGC TGT TGT AAC AAA GGC AGC TGT GGC CTT CCA GGC CGG TCC AGG 1008 Cys Arg Cys Cys Asn Lys Gly Ser Cys Gly Leu Pro Gly Arg Ser Arg 320 325 330
AGG CTG TCC CAC CTA GAG AGA GGG TGG CGC AAG TCT GTT TCC CAC ACT 1056 Arg Leu Ser His Leu Glu Arg Gly Trp Arg Lys Ser Val Ser His Thr 335 340 345
AGA AAT CGC AGG CAC GTG ACT GAA GAA GCA GAG ATC ACC GTG GGG CCT 1104 Arg Asn Arg Arg His Val Thr Glu Glu Ala Glu He Thr Val Gly Pro 350 355 360
CTG ATC TTC CTG GGA AAG GCT AGT GAT CAT GGT ATA GAG GGG TCA ACC 1152 Leu He Phe Leu Gly Lys Ala Ser Asp His Gly He Glu Gly Ser Thr 365 370 375 380
TCT CCT CAC ACC TCT GTG ATG TTG GGC TTA GGC CTG GCC ACG GTG GTA 1200 Ser Pro His Thr Ser Val Met Leu Gly Leu Gly Leu Ala Thr Val Val 385 390 395
TCC CTG ACT CTA GCT ACC ATT GTC CTG GTC CTT GCC AAG AGG CAT CGT 1248 Ser Leu Thr Leu Ala Thr He Val Leu Val Leu Ala Lys Arg His Arg 400 405 410
ACT GCT TCC CAC CCT GTG ATA TGC CCT GCA TCT GTC TCC CAA TAAAAGAATA 1300 Thr Ala Ser His Pro Val He Cys Pro Ala Ser Val Ser Gin 415 420 425
AGCAAAAAAA AAAAAACCGG AATTC 1325
(2) INFORMATION FOR SEQ ID NO:12:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 426 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:
Met Gly Leu Ser Tyr Gly He Phe He Cys Phe Leu Leu Leu Gly Gly
1 5 10 15
Met Glu Leu Cys Cys Pro Gin Thr He Trp Pro Thr Glu Thr Tyr Tyr 20 25 30
Pro Leu Thr Ser Arg Pro Pro Val Met Val Asp Cys Leu Glu Ser Gin 35 40 45
Leu ''al Val Thr Val Ser Lys Asp Leu Phe Gly Thr Gly Lys Leu He 50 55 60
Arg Pro Ala Asp Leu Thr Leu Gly Pro Glu Asn Cys Glu Pro Leu Val 65 70 75 80
Ser Met Asp Thr Asp Asp Val Val Arg Phe Glu Val Gly Leu His Glu 85 90 95
Cys Gly Ser Arg Val Gin Val Thr Asp Asn Ala Leu Val Tyr Ser Thr 100 105 110
Phe Leu He His Ser Pro Arg Pro Ala Gly Asn Leu Ser He Leu Arg 115 120 125
Thr Asn Arg Ala Glu Val Pro He Glu Cys His Tyr Pro Arg His Ser 130 135 140 sn Val Ser Ser Gin Ala He Leu Pro Thr Trp Val Pro Phe Arg Thr 145 150 155 160
Thr Met Leu Phe Glu Glu Lys Leu Val Phe Ser Leu Arg Leu Met Glu 165 170 175
Glu Asp Trp Gly Ser Glu Lys Gin Ser Pro Thr Phe Gin Leu Gly Asp 180 185 190
He Ala His Leu Gin Ala Glu Val His Thr Gly Ser His Met Pro Leu 195 200 205
Arg Leu Phe Val Asp His Cys Val Ala Thr Leu Thr Pro Asp Arg Asn 210 215 220
Ala Phe Leu His His Lys He Val Asp Phe His Gly Cys Leu Val Asp 225 230 235 240
Gly Leu Tyr Asn Ser Ser Ser Ala Phe Lys Ala Pro Arg Pro Arg Pro 245 250 255
Glu Thr Leu Gin Phe Thr Val Asp Val Phe His Phe Ala Lys Asp Ser 260 265 270
Arg Asn Thr He Tyr He Thr Cys His Leu Lys Val Thr Pro Ala Asp 275 280 285
Arg Val Pro Asp Gin Leu Asn Lys Ala Cys Ser Phe He Lys Ser Thr 290 295 300
Lys Arg Trp Tyr Pro Val Glu Gly Ser Ala Asp He Cys Arg Cys Cys 305 310 315 320
Asn Lys Gly Ser Cys Gly Leu Pro Gly Arg Ser Arg Arg Leu Ser His 325 330 335
Leu Glu Arg Gly Trp Arg Lys Ser Val Ser His Thr Arg Asn Arg Arg 340 345 350
His Val Thr Glu Glu Ala Glu He Thr Val Gly Pro Leu He Phe Leu 355 360 365
Gly Lys Ala Ser Asp His Gly He Glu Gly Ser Thr Ser Pro His Thr 370 375 380
Ser Val Met Leu Gly Leu Gly Leu Ala Thr Val Val Ser Leu Thr Leu 385 390 395 400
Ala Thr He Val Leu Val Leu Ala Lys Arg His Arg Thr Ala Ser His 405 410 415
Pro Val He Cys Pro Ala Ser Val Ser Gin 420 425
(2) INFORMATION FOR SEQ ID NO:13:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2236 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Felis domesticus
(D) DEVELOPMENTAL STAGE: Juvenile
(E) HAPLOTYPE: Diploidy
(F) TISSUE TYPE: Ovary
(G) CELL TYPE: Oocyte
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 28..2175
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:
GAATTCGCGG CCGCGATACT TTTGGCT ATG GCC TCC AGA CAG AAA GGA GAT 51
Met Ala Ser Arg Gin Lys Gly Asp
1 5
AGT GGG AGT CCT TCA AGC TGG TTT AAT GCA GAT TGG AGC ACC TAC AGG 99 Ser Gly Ser Pro Ser Ser Trp Phe Asn Ala Asp Trp Ser Thr Tyr Arg 10 15 20
TCA CTT TTT CTA CTC TTT ATC CTC GTG ACT TCA GTG AAT TCC ATA GGT 147 Ser Leu Phe Leu Leu Phe He Leu Val Thr Ser Val Asn Ser He Gly 25 30 35 40
GTT TTG CAG TTG GTG AAT CCT GTC TTC CCA GGT ACT GTC ACT TGC TAT 195 Val Leu Gin Leu Val Asn Pro Val Phe Pro Gly Thr Val Thr Cys Tyr 45 50 55
GAA ACT AGA ATG GCA GTG GAA TTT CCA AGT GAT TTT GGC ACC AAA AAA 243 Glu Thr Arg Met Ala Val Glu Phe Pro Ser Asp Phe Gly Thr Lys Lys 60 65 70
TGG CAT ACA TCT GTG GTG GAT CCC TTT AGT TTT GAA TTG TTG AAC TGC 291 Trp His Thr Ser Val Val Asp Pro Phe Ser Phe Glu Leu Leu Asn Cys 75 80 85
ACT TAC ATC TTG GAT CCA GAA AAT CTC ACC TTA AAG GCC CCA TAT GAG 339 Thr Tyr He Leu Asp Pro Glu Asn Leu Thr Leu Lys Ala Pro Tyr Glu 90 95 100
ACC TGT ACC AGA AGA ACG CTT GGC CAG CAC CGG ATG ATC ATC AGA CTC 387 Thr Cys Thr Arg Arg Thr Leu Gly Gin His Arg Met He He Arg Leu 105 110 115 120
AAG GAC CAC AAT GCT GCT TCA AGA CAT AAC AGT TTG ATG TAT CAG ATC 435 Lys Asp His Asn Ala Ala Ser Arg His Asn Ser Leu Met Tyr Gin He 125 130 135
AAC TGT CCA GTT ATG CAA GCA GAA GAA ACC CAT GAG CAT GCA GGA TCC 483 Asn Cys Pro Val Met Gin Ala Glu Glu Thr His Glu His Ala Gly Ser 140 145 150
ACT ATC TGC ACA AAG GAT TCC ATG TCT TTT ACC TTT AAT GTC ATT CCT 531 Thr He Cys Thr Lys Asp Ser Met Ser Phe Thr Phe Asn Val He Pro 155 160 165
GGC CTG GCT GAT GAA AAT ACG GAT ATC AAG AAT CCG ATG GGA TGG AGC 579 Gly Leu Ala Asp Glu Asn Thr Asp He Lys Asn Pro Met Gly Trp Ser 170 175 180
ATT GAG GTT GGT GAT GGT ACA AAA GCC AAA ACT CTG ACT CTT CAG GAT 627 He Glu Val Gly Asp Gly Thr Lys Ala Lys Thr Leu Thr Leu Gin Asp 185 190 195 200
GTC TTG AGA CAA GGA TAC AAT ATC CTG TTT GAT AAC CAC AAG ATC ACC 675 Val Leu Arg Gin Gly Tyr Asn He Leu Phe Asp Asn His Lys He Thr
205 210 215
TTC CAG GTG TCA TTC AAT GCC ACT GGA GTG ACT CAC TAC ATG CAA GGT 723 Phe Gin Val Ser Phe Asn Ala Thr Gly Val Thr His Tyr Met Gin Gly 220 225 230
AAC AGT CAC CTC TAC ATG GTG CCT CTG AAG TTG ATA CAT GAA TCT CTT 771 Asn Ser His Leu Tyr Met Val Pro Leu Lys Leu He His Glu Ser Leu 235 240 245
GGG CAG AAG ATC ATC TTA ACA ACA CGA GTG CTT TGT ATG TCA GAT GCT 819 Gly Gin Lys He He Leu Thr Thr Arg Val Leu Cys Met Ser Asp Ala 250 255 260
GTG ACC TGT AAT GCC ACA CAT GTG ACT CTG ACC ATA CCA GAG TTT CCT 867 Val Thr Cys Asn Ala Thr His Val Thr Leu Thr He Pro Glu Phe Pro 265 270 275 280
GGG AAG TTA AAA TCT GTG AGC TCT GAA AAT AGG AAC TTT GCT GTA AGC 915 Gly Lys Leu Lys Ser Val Ser Ser Glu Asn Arg Asn Phe Ala Val Ser 285 290 295
CAG CTG CAC AAC AAT GGG ATT GAT AAA GAA GAA TCA AGT GGC TTG ACA 963 Gin Leu His Asn Asn Gly He Asp Lys Glu Glu Ser Ser Gly Leu Thr 300 305 310
TTG CAC TTC AGC AAA ACT CTT CTC AAA ATG GAA TTC TCT GAA AAA TGC 1011 Leu His Phe Ser Lys Thr Leu Leu Lys Met Glu Phe Ser Glu Lys Cys 315 320 325
CTA CCC TAT CAG TTC TAC TTA GCT TCA CTC AAG CTG ACC TTT GCC TTT 1059 Leu Pro Tyr Gin Phe Tyr Leu Ala Ser Leu Lys Leu Thr Phe Ala Phe 330 335 340
AAT CAA GAG ACT ATA TCC ACG GTG CTT TAT CCT GAG TGT GTC TGT GAG 1107 Asn Gin Glu Thr He Ser Thr Val Leu Tyr Pro Glu Cys Val Cys Glu 345 350 355 360
TCA CCA GTT TCT ATA GTT ACA GGT GAC CTG TGT ACT CAG GAT GGG TTT 1155 Ser Pro Val Ser He Val Thr Gly Asp Leu Cys Thr Gin Asp Gly Phe 365 370 375
ATG GAC ATA AAG GTC TAC AGT CAC CAG ACA AAA CCA GCT CTC AAC TTA 1203 Met Asp He Lys Val Tyr Ser His Gin Thr Lys Pro Ala Leu Asn Leu 380 385 390
GAA ACC CTA AGG GTG GGA GAC TCA TCC TGC CAA CCT ACC TTC CAG GCT 1251 Glu Thr Leu Arg Val Gly Asp Ser Ser Cys Gin Pro Thr Phe Gin Ala 395 400 405
GCA TCT CAA GGG CTG ATA CTG TTT CAC ATA CCC CTG AAT GGA TGC GGG 1299 Ala Ser Gin Gly Leu He Leu Phe His He Pro Leu Asn Gly Cys Gly 410 415 420
ACA AGA CAT AAG TTC AAG GAA GGC AAA GTC ATC TAT GAA AAT GAA ATA 1347 Thr Arg His Lys Phe Lys Glu Gly Lys Val He Tyr Glu Asn Glu He 425 430 435 440
CAT GCT GTC TGG GCG GAT CTT CCT CCA AGC ACA ATT TCT AGA GAT AGT 1395 His Ala Val Trp Ala Asp Leu Pro Pro Ser Thr He Ser Arg Asp Ser 445 450 455
GAA TTC AGA ATG ACA GTG CAG TGC CAT TAC AGC AAA GGT GAC CTG CTA 1443 Glu Phe Arg Met Thr Val Gin Cys His Tyr Ser Lys Gly Asp Leu Leu 460 465 470
ATA AAT ACC AGA GTC CAA AGT CTT CCT CCT CTA GAG GCC TCA GTG AGG 1491
He Asn Thr Arg Val Gin Ser Leu Pro Pro Leu Glu Ala Ser Val Arg 475 480 485
CCA GGT CCA CTT GCC TTA ATC CTG CAA ACC TAC CCA GAT AAA TCC TAC 1539 Pro Gly Pro Leu Ala Leu He Leu Gin Thr Tyr Pro Asp Lys Ser Tyr 490 495 500
CTC CAA CCT TAC GGG GAG AAG GAG TAC CCT GTG GTG AGA TAC CTC CGC 1587 Leu Gin Pro Tyr Gly Glu Lys Glu Tyr Pro Val Val Arg Tyr Leu Arg 505 510 515 520
CAA CCA ATT TAT CTG GAA GTG AGA GTC CTA AAT AGG TCT GAC CCC AAC 1635 Gin Pro He Tyr Leu Glu Val Arg Val Leu Asn Arg Ser Asp Pro Asn 525 530 535
ATC AAG CTG GTC TTA GAT GAC TGC TGG GCA ACA CCC ACG ATG GAC CCA 1683 He Lys Leu Val Leu Asp Asp Cys Trp Ala Thr Pro Thr Met Asp Pro 540 545 550
GCC TCC GTC CCC CAG TGG AAT ATT ATC ATG GAT GGC TGT GAA TAC AAC 1731 Ala Ser Val Pro Gin Trp Asn He He Met Asp Gly Cys Glu Tyr Asn 555 560 565
CTG GAC AAC CAC AGA ACC ACC TTC CAT CCA GTT GGC TCC TCT GTG ACC 1779 Leu Asp Asn His Arg Thr Thr Phe His Pro Val Gly Ser Ser Val Thr 570 575 580
TAT CCT ACT CAC TAT CGG AGG TTT GAT GTG AAG ACC TTT GCC TTT GTA 1827 Tyr Pro Thr His Tyr Arg Arg Phe Asp Val Lys Thr Phe Ala Phe Val 585 590 595 600
TCA GAG GCC CAA GTG CTT TCT AGT CTG GTC TAC TTC CAC TGC AGT GTC 1875 Ser Glu Ala Gin Val Leu Ser Ser Leu Val Tyr Phe His Cys Ser Val 605 610 615
TTA ATC TGC AGT CGA CTG TCT GCT GAC TCC CCT CTG TGT TCC GTG ACT 1923 Leu He Cys Ser Arg Leu Ser Ala Asp Ser Pro Leu Cys Ser Val Thr 620 625 630
TGC CCT GTG TCA TTC AGA CAC AGG AGA GCC ACA GGC ACC ACT GAA GAA 1971 Cys Pro Val Ser Phe Arg His Arg Arg Ala Thr Gly Thr Thr Glu Glu 635 640 645
GAG AAA ATG ATA GTG AGT CTT CCA GGA CCC ATC CTC CTG CTG TCA GAT 2019 Glu Lys Met He Val Ser Leu Pro Gly Pro He Leu Leu Leu Ser Asp 650 655 660
AGC TCT TCA CTC AGA GAT GTG GTG GAC TCA AAA GGG TAT GGG GCT GCC 2067 Ser Ser Ser Leu Arg Asp Val Val Asp Ser Lys Gly Tyr Gly Ala Ala 665 670 675 680
GGA TAT GTT GCT TTT AAG ACT GTG GTA GCT GTG GCT GCC TTA GCA GGC 2115 Gly Tyr Val Ala Phe Lys Thr Val Val Ala Val Ala Ala Leu Ala Gly 685 690 695
CTC GTG GCA ACG CTA GGC TTC ATC ACC TAC CTG CGC AAG AAC AGA ACC 2163 Leu Val Ala Thr Leu Gly Phe He Thr Tyr Leu Arg Lys Asn Arg Thr 700 705 710
ATG ATA AAT CAC TAAGGATTTT CAAATAAAAT GGTTGAAGTA AAAAAAAAAA 2215 Met He Asn His 715
AAAAAAAGCG GCCGCGAATT C 2236
(2) INFORMATION FOR SEQ ID NO:14:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 716 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:
Met Ala Ser Arg Gin Lys Gly Asp Ser Gly Ser Pro Ser Ser Trp Phe
1 5 10 15
Asn Ala Asp Trp Ser Thr Tyr Arg Ser Leu Phe Leu Leu Phe He Leu 20 25 30
Val Thr Ser Val Asn Ser He Gly Val Leu Gin Leu Val Asn Pro Val 35 40 45
Phe Pro Gly Thr Val Thr Cys Tyr Glu Thr Arg Met Ala Val Glu Phe 50 55 60
Pro Ser Asp Phe Gly Thr Lys Lys Trp His Thr Ser Val Val Asp Pro 65 70 75 80
Phe Ser Phe Glu Leu Leu Asn Cys Thr Tyr He Leu Asp Pro Glu Asn 85 90 95
Leu Thr Leu Lys Ala Pro Tyr Glu Thr Cys Thr Arg Arg Thr Leu Gly 100 105 110
Gin His Arg Met He He Arg Leu Lys Asp His Asn Ala Ala Ser Arg 115 120 125
His Asn Ser Leu Met Tyr Gin He Asn Cys Pro Val Met Gin Ala Glu 130 135 140
Glu Thr His Glu His Ala Gly Ser Thr He Cys Thr Lys Asp Ser Met 145 150 155 160
Ser Phe Thr Phe Asn Val He Pro Gly Leu Ala Asp Glu Asn Thr Asp 165 170 175
He Lys Asn Pro Met Gly Trp Ser He Glu Val Gly Asp Gly Thr Lys 180 185 190
Ala Lys Thr Leu Thr Leu Gin Asp Val Leu Arg Gin Gly Tyr Asn He 195 200 205
Leu Phe Asp Asn His Lys He Thr Phe Gin Val Ser Phe Asn Ala Thr 210 215 220
Gly Val Thr His Tyr Met Gin Gly Asn Ser His Leu Tyr Met Val Pro 225 230 235 240
Leu Lys Leu He His Glu Ser Leu Gly Gin Lys He He Leu Thr Thr 245 250 255
Arg Val Leu Cys Met Ser Asp Ala Val Thr Cys Asn Ala Thr His Val 260 265 270
Thr Leu Thr He Pro Glu Phe Pro Gly Lys Leu Lys Ser Val Ser Ser 275 280 285
Glu Asn Arg Asn Phe Ala Val Ser Gin Leu His Asn Asn Gly He Asp 290 295 300
Lys Glu Glu Ser Ser Gly Leu Thr Leu His Phe Ser Lys Thr Leu Leu
305 310 315 320
Lys Met Glu Phe Ser Glu Lys Cys Leu Pro Tyr Gin Phe Tyr Leu Ala 325 330 335
Ser Leu Lys Leu Thr Phe Ala Phe Asn Gin Glu Thr He Ser Thr Val 340 345 350
Leu Tyr Pro Glu Cys Val Cys Glu Ser Pro Val Ser He Val Thr Gly 355 360 365
Asp Leu Cys Thr Gin Asp Gly Phe Met Asp He Lys Val Tyr Ser His 370 375 380
Gin Thr Lys Pro Ala Leu Asn Leu Glu Thr Leu Arg Val Gly Asp Ser 385 390 395 400
Ser Cys Gin Pro Thr Phe Gin Ala Ala Ser Gin Gly Leu He Leu Phe 405 410 415
His He Pro Leu Asn Gly Cys Gly Thr Arg His Lys Phe Lys Glu Gly 420 425 430
Lys Val He Tyr Glu Asn Glu He His Ala Val Trp Ala Asp Leu Pro 435 440 445
Pro Ser Thr He Ser Arg Asp Ser Glu Phe Arg Met Thr Val Gin Cys 450 455 460
His Tyr Ser Lys Gly Asp Leu Leu He Asn Thr Arg Val Gin Ser Leu 465 470 475 480
Pro Pro Leu Glu Ala Ser Val Arg Pro Gly Pro Leu Ala Leu He Leu 485 490 495
Gin Thr Tyr Pro Asp Lys Ser Tyr Leu Gin Pro Tyr Gly Glu Lys Glu 500 505 510
Tyr Pro Val Val Arg Tyr Leu Arg Gin Pro He Tyr Leu Glu Val Arg 515 520 525
Val Leu Asn Arg Ser Asp Pro Asn He Lys Leu Val Leu Asp Asp Cys 530 535 540
Trp Ala Thr Pro Thr Met Asp Pro Ala Ser Val Pro Gin Trp Asn He 545 550 555 560
He Met Asp Gly Cys Glu Tyr Asn Leu Asp Asn His Arg Thr Thr Phe 565 570 575
His Pro Val Gly Ser Ser Val Thr Tyr Pro Thr His Tyr Arg Arg Phe 580 585 590
Asp Val Lys Thr Phe Ala Phe Val Ser Glu Ala Gin Val Leu Ser Ser 595 600 605
Leu Val Tyr Phe His Cys Ser Val Leu He Cys Ser Arg Leu Ser Ala 610 615 620
Asp Ser Pro Leu Cys Ser Val Thr Cys Pro Val Ser Phe Arg His Arg 625 630 635 640
Arg Ala Thr Gly Thr Thr Glu Glu Glu Lys Met He Val Ser Leu Pro 645 650 655
Gly Pro He Leu Leu Leu Ser Asp Ser Ser Ser Leu Arg Asp Val Val 660 665 670
Asp Ser Lys Gly Tyr Gly Ala Ala Gly Tyr Val Ala Phe Lys Thr Val 675 680 685
Val Ala Val Ala Ala Leu Ala Gly Leu Val Ala Thr Leu Gly Phe He 690 695 700
Thr Tyr Leu Arg Lys Asn Arg Thr Met He Asn His 705 710 715
(2) INFORMATION FOR SEQ ID NO:15:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1840 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Felis domesticus
(D) DEVELOPMENTAL STAGE: Juvenile
(E) HAPLOTYPE: Diploidy
(F) TISSUE TYPE: Ovary
(G) CELL TYPE: Oocyte
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 57..1766
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:
GAATTCCGCG GCCGCAAGTA CAGGTCTTGC AGCCAGTGGG GGCTCCCGAT GGCATC 56
ATG TGG CTG CTG CAG CCC CTC TTG CTC TGT GTT CCC TTG TCT CTC GCT 104 Met Trp Leu Leu Gin Pro Leu Leu Leu Cys Val Pro Leu Ser Leu Ala 1 5 10 15
GTG CAT GGC CAG CAG AAG CCC CAG GTA CCA GAT TAT CCC GGT GAA CTC 152 Val His Gly Gin Gin Lys Pro Gin Val Pro Asp Tyr Pro Gly Glu Leu 20 25 30
CAT TGT GGG CTC CAG AGC CTT CAG TTT GCC ATA AAC CCG AGC CCC GGG 200 His Cys Gly Leu Gin Ser Leu Gin Phe Ala He Asn Pro Ser Pro Gly 35 40 45
AAA GCG ACT CCT GCA CTC ATA GTC TGG GAC AAT CGC GGG CTG CCA CAC 248 Lys Ala Thr Pro Ala Leu He Val Trp Asp Asn Arg Gly Leu Pro His 50 55 60
AAG CTG CAG AAC AAC TCT GGC TGC GGT ACC TGG GTA AGG GAG AGC CCG 296 Lys Leu Gin Asn Asn Ser Gly Cys Gly Thr Trp Val Arg Glu Ser Pro 65 70 75 80
GGG GGC TCC GTG CTG TTA GAC GCC TCT TAC AGC AGC TGC TAT GTC AAC 344 Gly Gly Ser Val Leu Leu Asp Ala Ser Tyr Ser Ser Cys Tyr Val Asn 85 90 95
GAG TGG GTG AGC ACG ACC CAA TCC CCA GGA ACG TCG AGG CCC CCC ACC 392 Glu Trp Val Ser Thr Thr Gin Ser Pro Gly Thr Ser Arg Pro Pro Thr 100 105 110
CCA GCA TCC AGG GTG ACT CCC CAG GAC TCC CAC TAC GTC ATG ATA GTC 440 Pro Ala Ser Arg Val Thr Pro Gin Asp Ser His Tyr Val Met He Val 115 120 125
GGA GTT GAA GGC ACA GAT GCG GCT GGG CGC AGG GTT ACC AAC ACC AAG 488 Gly Val Glu Gly Thr Asp Ala Ala Gly Arg Arg Val Thr Asn Thr Lys 130 135 140
GTG CTC AGG TGT CCT AGG AAT CCC CCA GAC CAA GCT TTG GTG TCG AGC 536 Val Leu Arg Cys Pro Arg Asn Pro Pro Asp Gin Ala Leu Val Ser Ser 145 150 155 160
TTA AGT CCC TCT CCT CTT CAA AAC GTA GCA CTA GAA GCT CCA AAC GCT 584 Leu Ser Pro Ser Pro Leu Gin Asn Val Ala Leu Glu Ala Pro Asn Ala 165 170 175
GAC TTG TGT GAC TCT GTC CCA AAG TGG GAC AGG CTT CCG TGT GCT TCT 632 Asp Leu Cys Asp Ser Val Pro Lys Trp Asp Arg Leu Pro Cys Ala Ser 180 185 190
TCA CCC ATC ACT CAG GGA GAC TGC AAT AAG CTT GGT TGC TGC TAC AAA 680 Ser Pro He Thr Gin Gly Asp Cys Asn Lys Leu Gly Cys Cys Tyr Lys 195 200 205
TCA GAG GCA AAT TCC TGT TAC TAT GGA AAC ACA GTG ACC TCA CGC TGT 728 Ser Glu Ala Asn Ser Cys Tyr Tyr Gly Asn Thr Val Thr Ser Arg Cys 210 215 220
ACC CAA GAC GGC CAC TTC TCC ATC GCC GTG TCT CGG AAC GTG ACC TCA 776 Thr Gin Asp Gly His Phe Ser He Ala Val Ser Arg Asn Val Thr Ser 225 230 235 240
CCC CCA CTG CTC TTA AAT TCT CTG CGC TTG GCC TTC GGG AAG GAC CGC 824 Pro Pro Leu Leu Leu Asn Ser Leu Arg Leu Ala Phe Gly Lys Asp Arg 245 250 255
GAA TGT AAC CCT GTG AAA GCA ACA CGT GCC TTT GCC CTG TTC TTT TTT 872 Glu Cys Asn Pro Val Lys Ala Thr Arg Ala Phe Ala Leu Phe Phe Phe 260 265 270
CCA TTT AAT TCC TGT GGC ACC ACG AGA TGG GTC ACT GGA GAC CAG GCA 920 Pro Phe Asn Ser Cys Gly Thr Thr Arg Trp Val Thr Gly Asp Gin Ala 275 280 285
GTA TAT GAA AAT GAG CTG GTG GCA GCT AGA GAT GTG AGA ACT TGG AGC 968 Val Tyr Glu Asn Glu Leu Val Ala Ala Arg Asp Val Arg Thr Trp Ser 290 295 300
CAT GGT TCT ATT ACC CGT GAC AGT ATC TTC AGG CTT CGA GTT AGC TGC 1016 His Gly Ser He Thr Arg Asp Ser He Phe Arg Leu Arg Val Ser Cys 305 310 315 320
AGC TAC TCT GTA AGG AGT AAT GCC TTC CCG CTT AGC GTT CAG GTG TTT 1064 Ser Tyr Ser Val Arg Ser Asn Ala Phe Pro Leu Ser Val Gin Val Phe 325 330 335
ACC ATC CCA CCA CCC CAT CTG AAA ACC CAG CAT GGA CCC CTC ACT CTG 1112 Thr He Pro Pro Pro His Leu Lys Thr Gin His Gly Pro Leu Thr Leu 340 345 350
GAA CTC AAG ATT GCC AAA GAT AAG CAC TAT GGC TCC TAC TAC ACT ATT 1160 Glu Leu Lys He Ala Lys Asp Lys His Tyr Gly Ser Tyr Tyr Thr He 355 360 365
GGT GAC TAC CCA GTG GTA AAG TTG CTT CGG GAT CCC ATT TAT GTG GAG 1208 Gly Asp Tyr Pro Val Val Lys Leu Leu Arg Asp Pro He Tyr Val Glu 370 375 380
GTC TCT ATC CGC CAC AGA ACG GAC CCC TCC CTG GGG CTG CTC CTC CAT 1256 Val Ser He Arg His Arg Thr Asp Pro Ser Leu Gly Leu Leu Leu His 385 390 395 400
AAC TGT TGG GCC ACA CCC GGC AAG AAC TCC CAG AGT CTG TCC CAG TGG 1304 Asn Cys Trp Ala Thr Pro Gly Lys Asn Ser Gin Ser Leu Ser Gin Trp 405 410 415
CCC ATT CTG GTG AAA GGA TGC CCC TAC GTT GGA GAC AAC TAT CAA ACC 1352 Pro He Leu Val Lys Gly Cys Pro Tyr Val Gly Asp Asn Tyr Gin Thr 420 425 430
CAG CTG ATC CCT GTC CAG AAG GCT CTG GAT ACA CCA TTT CCA TCT TAC 1400 Gin Leu He Pro Val Gin Lys Ala Leu Asp Thr Pro Phe Pro Ser Tyr 435 440 445
TAC AAG CGC TTC AGT ATT TTC ACC TTC AGC TTT GTG GAC ACC ATG GCA 1448 Tyr Lys Arg Phe Ser He Phe Thr Phe Ser Phe Val Asp Thr Met Ala 450 455 460
AAG TGG GCA CTC AGG GGA CCG GTG TAT CTG CAC TGT AAT GTA TCC ATC 1496 Lys Trp Ala Leu Arg Gly Pro Val Tyr Leu His Cys Asn Val Ser He 465 470 475 480
TGC CAG CCT GCT GGG ACC TCC TCC TGT AGG ATA ACC TGT CCT GTT GCC 1544 Cys Gin Pro Ala Gly Thr Ser Ser Cys Arg He Thr Cys Pro Val Ala 485 490 495
AGG CGA AGA AGA CAC TCT GAC CTC CAT CAT CAC AGC AGT ACT GCG AGC 1592 Arg Arg Arg Arg His Ser Asp Leu His His His Ser Ser Thr Ala Ser 500 505 510
ATC TCT AGC AAG GGT CCC ATG ATT CTA CTC CAA GCC ACT ATG GAC TCT 1640 He Ser Ser Lys Gly Pro Met He Leu Leu Gin Ala Thr Met Asp Ser 515 520 525
GCA GAG AAG CTC CAC AAA AAC TCA AGT TCT CCT ATA GAC TCC CAA GCT 1688 Ala Glu Lys Leu His Lys Asn Ser Ser Ser Pro He Asp Ser Gin Ala 530 535 540
CTG TGG ATG GCA GGC CTT TCC GGG ACC CTA ATC TTT GGA TTC TTG TTA 1736 Leu Trp Met Ala Gly Leu Ser Gly Thr Leu He Phe Gly Phe Leu Leu 545 550 555 560
GTG TCC TAC TTG GCT ATC AGG AAA CGG AGG TGAATTATTC CAGTTGTGTT 1786 Val Ser Tyr Leu Ala He Arg Lys Arg Arg 565 570
AATAAAACCA GATTGCATTA CCAAAAAAAA AAAAAAAAAA GCGGCCGCGA ATTC 1840
(2) INFORMATION FOR SEQ ID NO:16:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 570 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:
Met Trp Leu Leu Gin Pro Leu Leu Leu Cys Val Pro Leu Ser Leu Ala
1 5 10 15
Val His Gly Gin Gin Lys Pro Gin Val Pro Asp Tyr Pro Gly Glu Leu 20 25 30
His Cys Gly Leu Gin Ser Leu Gin Phe Ala He Asn Pro Ser Pro Gly 35 40 45
Lys Ala Thr Pro Ala Leu He Val Trp Asp Asn Arg Gly Leu Pro His
50 55 60
Lys Leu Gin Asn Asn Ser Gly Cys Gly Thr Trp Val Arg Glu Ser Pro 65 70 75 80
Gly Gly Ser Val Leu Leu Asp Ala Ser Tyr Ser Ser Cys Tyr Val Asn 85 90 95
Glu Trp Val Ser Thr Thr Gin Ser Pro Gly Thr Ser Arg Pro Pro Thr 100 105 110
Pro Ala Ser Arg Val Thr Pro Gin Asp Ser His Tyr Val Met He Val 115 120 125
Gly Val Glu Gly Thr Asp Ala Ala Gly Arg Arg Val Thr Asn Thr Lys 130 135 140
Val Leu Arg Cys Pro Arg Asn Pro Pro Asp Gin Ala Leu Val Ser Ser 145 150 155 160
Leu Ser Pro Ser Pro Leu Gin Asn Val Ala Leu Glu Ala Pro Asn Ala 165 170 175
Asp Leu Cys Asp Ser Val Pro Lys Trp Asp Arg Leu Pro Cys Ala Ser 180 185 190
Ser Pro He Thr Gin Gly Asp Cys Asn Lys Leu Gly Cys Cys Tyr Lys 195 200 205
Ser Glu Ala Asn Ser Cys Tyr Tyr Gly Asn Thr Val Thr Ser Arg Cys 210 215 220
Thr Gin Asp Gly His Phe Ser He Ala Val Ser Arg Asn Val Thr Ser 225 230 235 240
Pro Pro Leu Leu Leu Asn Ser Leu Arg Leu Ala Phe Gly Lys Asp Arg 245 250 255
Glu Cys Asn Pro Val Lys Ala Thr Arg Ala Phe Ala Leu Phe Phe Phe 260 265 270
Pro Phe Asn Ser Cys Gly Thr Thr Arg Trp Val Thr Gly Asp Gin Ala 275 280 285
Val Tyr Glu Asn Glu Leu Val Ala Ala Arg Asp Val Arg Thr Trp Ser 290 295 300
His Gly Ser He Thr Arg Asp Ser He Phe Arg Leu Arg Val Ser Cys 305 310 315 320
Ser Tyr Ser Val Arg Ser Asn Ala Phe Pro Leu Ser Val Gin Val Phe 325 330 335
Thr He Pro Pro Pro His Leu Lys Thr Gin His Gly Pro Leu Thr Leu 340 345 350
Glu Leu Lys He Ala Lys Asp Lys His Tyr Gly Ser Tyr Tyr Thr He 355 360 365
Gly Asp Tyr Pro Val Val Lys Leu Leu Arg Asp Pro He Tyr Val Glu 370 375 380
Val Ser He Arg His Arg Thr Asp Pro Ser Leu Gly Leu Leu Leu His
385 390 395 400
Asn Cys Trp Ala Thr Pro Gly Lys Asn Ser Gin Ser Leu Ser Gin Trp 405 410 415
Pro He Leu Val Lys Gly Cys Pro Tyr Val Gly Asp Asn Tyr Gin Thr 420 425 430
Gin Leu He Pro Val Gin Lys Ala Leu Asp Thr Pro Phe Pro Ser Tyr 435 440 445
Tyr Lys Arg Phe Ser He Phe Thr Phe Ser Phe Val Asp Thr Met Ala 450 455 460
Lys Trp Ala Leu Arg Gly Pro Val Tyr Leu His Cys Asn Val Ser He 465 470 475 480
Cys Gin Pro Ala Gly Thr Ser Ser Cys Arg He Thr Cys Pro Val Ala 485 490 495
Arg Arg Arg Arg His Ser Asp Leu His His His Ser Ser Thr Ala Ser 500 505 510
He Ser Ser Lys Gly Pro Met He Leu Leu Gin Ala Thr Met Asp Ser 515 520 525
Ala Glu Lys Leu His Lys Asn Ser Ser Ser Pro He Asp Ser Gin Ala 530 535 540
Leu Trp Met Ala Gly Leu Ser Gly Thr Leu He Phe Gly Phe Leu Leu 545 550 555 560
Val Ser Tyr Leu Ala He Arg Lys Arg Arg 565 570
(2) INFORMATION FOR SEQ ID NO:17:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1319 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Felis domesticus
(D) DEVELOPMENTAL STAGE: Juvenile
(E) HAPLOTYPE: Diploidy
(F) TISSUE TYPE: Ovary
(G) CELL TYPE: Oocyte
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 26..1297
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:
GAATTCGCGG CCGCGCGTAG GCCGC ATG GGG CTG AGC TAC GGG CTT TTC ATC 52
Met Gly Leu Ser Tyr Gly Leu Phe He
1 5
TGT TTT CTG CTT TGG GCA GGC ACG GGG CTG TGC TAT CCC CCA ACC ACC 100 Cys Phe Leu Leu Trp Ala Gly Thr Gly Leu Cys Tyr Pro Pro Thr Thr 10 15 20 25
ACC GAG GAT AAG ACC CAC CCC TCG TTG CCA TCA AGC CCC TCT GTG GTG 148 Thr Glu Asp Lys Thr His Pro Ser Leu Pro Ser Ser Pro Ser Val Val 30 35 40
GTA GAG TGT CGG CAT GCC TGG CTG GTG GTC AAC GTC AGC AAA AAC CTT 196 Val Glu Cys Arg His Ala Trp Leu Val Val Asn Val Ser Lys Asn Leu 45 50 55
TTT GGT ACT GGG AGG CTT GTG AGG CCT GCA GAC CTC ACC CTG GGT CCG 244 Phe Gly Thr Gly Arg Leu Val Arg Pro Ala Asp Leu Thr Leu Gly Pro 60 65 70
GAG AAC TGT GAG CCC CTG ATC TCT GGG GAC TCA GAT GAT ACG GTC AGG 292 Glu Asn Cys Glu Pro Leu He Ser Gly Asp Ser Asp Asp Thr Val Arg 75 80 85
TTT GAA GTC GAG CTC CAC AAG TGT GGC AAC AGC GTG CAG GTG ACC GAA 340 Phe Glu Val Glu Leu His Lys Cys Gly Asn Ser Val Gin Val Thr Glu 90 95 100 105
GAT GCC CTG GTG TAT AGC ACC TTC CTG CTC CAC AAC CCC CGC CCC ATG 388 Asp Ala Leu Val Tyr Ser Thr Phe Leu Leu His Asn Pro Arg Pro Met 110 115 120
GGA AAC CTG TCC ATC CTG AGG ACC AAC CGC GCG GAA GTT CCC ATT GAG 436 Gly Asn Leu Ser He Leu Arg Thr Asn Arg Ala Glu Val Pro He Glu 125 130 135
TGC CGT TAC CCC AGG CAT AGC AAC GTG AGC AGC GAG GCC ATC CTG CCC 484 Cys Arg Tyr Pro Arg His Ser Asn Val Ser Ser Glu Ala He Leu Pro 140 145 150
ACC TGG GTG CCC TTC AGG ACC ACA ATG CTC TCA GAG GAG AAG CTG GCT 532 Thr Trp Val Pro Phe Arg Thr Thr Met Leu Ser Glu Glu Lys Leu Ala 155 160 165
TTC TCT CTG CGC CTG ATG GAG GAG GAC TGG GGC TCC GAG AAG CAG TCC 580 Phe Ser Leu Arg Leu Met Glu Glu Asp Trp Gly Ser Glu Lys Gin Ser 170 175 180 185
CCC ACT TTC CAG TTG GGA GAC CTA GCC CAC CTC CAG GCC GAA GTC CAC 628 Pro Thr Phe Gin Leu Gly Asp Leu Ala His Leu Gin Ala Glu Val His 190 195 200
ACC GGC CGC CAC ATA CCA CTG CGA CTG TTT GTG GAC TAC TGT GTG GCC 676 Thr Gly Arg His He Pro Leu Arg Leu Phe Val Asp Tyr Cys Val Ala 205 210 215
ACG CTG ACA CCA GAC CAG AAC GCC TCC CCT CAT CAC ACC ATC GTG GAC 724 Thr Leu Thr Pro Asp Gin Asn Ala Ser Pro His His Thr He Val Asp 220 225 230
TTC CAC GGC TGT CTC GTG GAT GGT CTC TCT GAT GCC TCT TCT GCC TTC 772 Phe His Gly Cys Leu Val Asp Gly Leu Ser Asp Ala Ser Ser Ala Phe 235 240 245
AAA GCC CCC AGA CCC AGG CCG GAG ACT CTC CAG TTT ACA GTA GAC ACG 820 Lys Ala Pro Arg Pro Arg Pro Glu Thr Leu Gin Phe Thr Val Asp Thr 250 255 260 265
TTC CAC TTT GCT AAT GAC CCC AGA AAC ATG ATC TAT ATC ACC TGC CAT 868 Phe His Phe Ala Asn Asp Pro Arg Asn Met He Tyr He Thr Cys His 270 275 280
CTG AAA GTC ACT CCA GCT AGC CGA GTC CCA GAC CAG CTA AAC AAA GCC 916 Leu Lys Val Thr Pro Ala Ser Arg Val Pro Asp Gin Leu Asn Lys Ala 285 290 295
TGT TCC TTC ATC AAG TCT TCT AAC AGG TGG TTC CCA GTA GAA GGC CCT 964 Cys Ser Phe He Lys Ser Ser Asn Arg Trp Phe Pro Val Glu Gly Pro 300 305 310
GCT GAC ATC TGT AAC TGT TGT AAC AAA GGT AGC TGT GGC CTT CAG GGC 1012 Ala Asp He Cys Asn Cys Cys Asn Lys Gly Ser Cys Gly Leu Gin Gly 315 320 325
CGT TCC TGG AGG CTG TCC CAC CTA GAC AGA CCG TGG CAC AAG ATG GCT 1060 Arg Ser Trp Arg Leu Ser His Leu Asp Arg Pro Trp His Lys Met Ala 330 335 340 345
TCC CGA AAT CGC AGG CAT GTG ACC GAA GAA GCG GAT ATC ACC GTG GGG 1108 Ser Arg Asn Arg Arg His Val Thr Glu Glu Ala Asp He Thr Val Gly 350 355 360
CCT CTG ATC TTC CTG GGA AAG GCT GCC GAT CGT GGT GTG GAG GGG TCG 1156 Pro Leu He Phe Leu Gly Lys Ala Ala Asp Arg Gly Val Glu Gly Ser 365 370 375
ACC TCG CCT CAC ACC TCT GTG ATG GTG GGC ATA GGC CTG GCC ACG GTG 1204 Thr Ser Pro His Thr Ser Val Met Val Gly He Gly Leu Ala Thr Val 380 385 390
TTG TCC CTG ACT CTG GCT ACC ATT GTC CTG GGT CTC GCC AGG AGG CAT 1252 Leu Ser Leu Thr Leu Ala Thr He Val Leu Gly Leu Ala Arg Arg His 395 400 405
CAC ACT GCT TCC CGT CCT ATG ATC TGC CCT GTG TCT GCT TCC CAA 1297 His Thr Ala Ser Arg Pro Met He Cys Pro Val Ser Ala Ser Gin 410 415 420
TAAAAGAAGC GGCCGCGAAT TC 1319
(2) INFORMATION FOR SEQ ID NO:18:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 424 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:
Met Gly Leu Ser Tyr Gly Leu Phe He Cys Phe Leu Leu Trp Ala Gly
1 5 10 15
Thr Gly Leu Cys Tyr Pro Pro Thr Thr Thr Glu Asp Lys Thr His Pro 20 25 30
Ser Leu Pro Ser Ser Pro Ser Val Val Val Glu Cys Arg His Ala Trp 35 40 45
Leu Val Val Asn Val Ser Lys Asn Leu Phe Gly Thr Gly Arg Leu Val 50 55 60
Arg Pro Ala Asp Leu Thr Leu Gly Pro Glu Asn Cys Glu Pro Leu He 65 70 75 80
Ser Gly Asp Ser Asp Asp Thr Val Arg Phe Glu Val Glu Leu His Lys 85 90 95
Cys Gly Asn Ser Val Gin Val Thr Glu Asp Ala Leu Val Tyr Ser Thr 100 105 110
Phe Leu Leu His Asn Pro Arg Pro Met Gly Asn Leu Ser He Leu Arg 115 120 125
Thr Asn Arg Ala Glu Val Pro He Glu Cys Arg Tyr Pro Arg His Ser 130 135 140
Asn Val Ser Ser Glu Ala He Leu Pro Thr Trp Val Pro Phe Arg Thr 145 150 155 160
Thr Met Leu Ser Glu Glu Lys Leu Ala Phe Ser Leu Arg Leu Met Glu 165 170 175
Glu Asp Trp Gly Ser Glu Lys Gin Ser Pro Thr Phe Gin Leu Gly Asp 180 185 190
Leu Ala His Leu Gin Ala Glu Val His Thr Gly Arg His He Pro Leu 195 200 205
Arg Leu Phe Val Asp Tyr Cys Val Ala Thr Leu Thr Pro Asp Gin Asn 210 215 220
Ala Ser Pro His His Thr He Val Asp Phe His Gly Cys Leu Val Asp 225 230 235 240
Gly Leu Ser Asp Ala Ser Ser Ala Phe Lys Ala Pro Arg Pro Arg Pro 245 250 255
Glu Thr Leu Gin Phe Thr Val Asp Thr Phe His Phe Ala Asn Asp Pro 260 265 270
Arg Asn Met He Tyr He Thr Cys His Leu Lys Val Thr Pro Ala Ser 275 280 285
Arg Val Pro Asp Gin Leu Asn Lys Ala Cys Ser Phe He Lys Ser Ser 290 295 300
Asn Arg Trp Phe Pro Val Glu Gly Pro Ala Asp He Cys Asn Cys Cys 305 310 315 320
Asn Lys Gly Ser Cys Gly Leu Gin Gly Arg Ser Trp Arg Leu Ser His 325 330 335
Leu Asp Arg Pro Trp His Lys Met Ala Ser Arg Asn Arg Arg His Val 340 345 350
Thr Glu Glu Ala Asp He Thr Val Gly Pro Leu He Phe Leu Gly Lys 355 360 365
Ala Ala Asp Arg Gly Val Glu Gly Ser Thr Ser Pro His Thr Ser Val 370 375 380
Met Val Gly He Gly Leu Ala Thr Val Leu Ser Leu Thr Leu Ala Thr 385 390 395 400
He Val Leu Gly Leu Ala Arg Arg His His Thr Ala Ser Arg Pro Met 405 410 415
He Cys Pro Val Ser Ala Ser Gin 420
(2) INFORMATION FOR SEQ ID NO:19:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 643 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Bos taurus
(D) DEVELOPMENTAL STAGE: Juvenile
(E) HAPLOTYPE: Diploidy
(F) TISSUE TYPE: Ovary
(G) CELL TYPE: Oocyte
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 16..582
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:
GAATTCGCGG CCGCC CTA AAC AGG ACT GAC CCC AAC ATC AAG TTG GTC TTA 51 Leu Asn Arg Thr Asp Pro Asn He Lys Leu Val Leu
1 5 10
GAT GAT TGC TGG GCA ACA TCC ACC ATG GAC CCA GCC TCT CTC CCT CAG 99 Asp Asp Cys Trp Ala Thr Ser Thr Met Asp Pro Ala Ser Leu Pro Gin 15 20 25
TGG AAT ATT ATC GTG GAT GGC TGT GAA TAC AAC TTG GAC AAC CAC AGA 147 Trp Asn He He Val Asp Gly Cys Glu Tyr Asn Leu Asp Asn His Arg 30 35 40
ACC ACC TTC CAT CCG GTT GGC TCC TCG GTG GCC TAT CCT AAT CAC TAC 195 Thr Thr Phe His Pro Val Gly Ser Ser Val Ala Tyr Pro Asn His Tyr 45 50 55 60
CAG AGG TTT GCT GTG AAG ACC TTT GCC TTT GTG TCA GAG GAC CCG GCG 243 Gin Arg Phe Ala Val Lys Thr Phe Ala Phe Val Ser Glu Asp Pro Ala 65 70 75
TTC TCT CAC TTG GTC TAC TTC CAC TGC AGC GCC TTA ATC TGC GAT CAA 291 Phe Ser His Leu Val Tyr Phe His Cys Ser Ala Leu He Cys Asp Gin 80 85 90
CTT TCT TCT AAC TTC CCC CTG TGT TCT GCG TCT TGC CTT GTG TCA TCC 339 Leu Ser Ser Asn Phe Pro Leu Cys Ser Ala Ser Cys Leu Val Ser Ser 95 100 105
AGA AGC AGG CGA GCC ACA GGG GCC ACT GAG GAA GAG AAG ATG ATA GTG 387 Arg Ser Arg Arg Ala Thr Gly Ala Thr Glu Glu Glu Lys Met He Val 110 115 120
AGT CTC CCG GGC CCC ATC CTC CTG TTG TCA GAT GGC TCT TCA TTC AGA 435 Ser Leu Pro Gly Pro He Leu Leu Leu Ser Asp Gly Ser Ser Phe Arg 125 130 135 140
GAT GCT GTG GAT TCT AAA GGG CAT GGG ACT TCT GGA TAT GCT GCT TTT 483 Asp Ala Val Asp Ser Lys Gly His Gly Thr Ser Gly Tyr Ala Ala Phe 145 150 155
AAA ACT ATG GTT GCT GTA GTT GCC TTA GCA GGT GTT GTG GCA ACT CTA 531 Lys Thr Met Val Ala Val Val Ala Leu Ala Gly Val Val Ala Thr Leu 160 165 170
AGC CTA ATC AGC TAC CTG CGC AAG AAA AGA ATC ACA GTG CTA AAC CAC 579 Ser Leu He Ser Tyr Leu Arg Lys Lys Arg He Thr Val Leu Asn His 175 180 185
TAATTGGATT TTCAATAAAA TGTGGAAGTA AAAAAAAAAA AAAAAAAAAA GCGGCCGCGA 639
ATTC 643
(2) INFORMATION FOR SEQ ID NO:20:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 188 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:
Leu Asn Arg Thr Asp Pro Asn He Lys Leu Val Leu Asp Asp Cys Trp 1 5 10 15
Ala Thr Ser Thr Met Asp Pro Ala Ser Leu Pro Gin Trp Asn He He 20 25 30
Val Asp Gly Cys Glu Tyr Asn Leu Asp Asn His Arg Thr Thr Phe His 35 40 45
Pro Val Gly Ser Ser Val Ala Tyr Pro Asn His Tyr Gin Arg Phe Ala 50 55 60
Val Lys Thr Phe Ala Phe Val Ser Glu Asp Pro Ala Phe Ser His Leu 65 70 75 80
Val Tyr Phe His Cys Ser Ala Leu He Cys Asp Gin Leu Ser Ser Asn 85 90 95
Phe Pro Leu Cys Ser Ala Ser Cys Leu Val Ser Ser Arg Ser Arg Arg 100 105 110
Ala Thr Gly Ala Thr Glu Glu Glu Lys Met He Val Ser Leu Pro Gly 115 120 125
Pro He Leu Leu Leu Ser Asp Gly Ser Ser Phe Arg Asp Ala Val Asp 130 135 140
Ser Lys Gly His Gly Thr Ser Gly Tyr Ala Ala Phe Lys Thr Met Val 145 150 155 160
Ala Val Val Ala Leu Ala Gly Val Val Ala Thr Leu Ser Leu He Ser 165 170 175
Tyr Leu Arg Lys Lys Arg He Thr Val Leu Asn His 180 185
(2) INFORMATION FOR SEQ ID NO:21:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1029 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Bos taurus
(D) DEVELOPMENTAL STAGE: Juvenile
(E) HAPLOTYPE: Diploidy
(F) TISSUE TYPE: Ovary
(G) CELL TYPE: Oocyte
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 2..976
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:
G AAT TCT GTA CAC TTG GCC TTC AGG AAT GAC AGC GAA TGT AAA CCT 46
Asn Ser Val His Leu Ala Phe Arg Asn Asp Ser Glu Cys Lys Pro
1 5 10 15
GTG ATG GCA ACA CAC ACT TTT GTT CTG TTC CGG TTT CCA TTT ACT ACT 94 Val Met Ala Thr His Thr Phe Val Leu Phe Arg Phe Pro Phe Thr Thr 20 25 30
TGT GGT ACT ACA AAA CAG ATC ACT GGA AAG CAA GCG GTA TAT GAA AAT 142 Cys Gly Thr Thr Lys Gin He Thr Gly Lys Gin Ala Val Tyr Glu Asn 35 40 45
GAG CTG GTT GCA GCT CGG GAT GTG AGA ACT TGG AGC CGT GGT TCT ATT 190 Glu Leu Val Ala Ala Arg Asp Val Arg Thr Trp Ser Arg Gly Ser He 50 55 60
ACC CGA GAC AGT ACC TTC AGG CTC CAA GTC AGT TGT AGC TAC TCT GCA 238 Thr Arg Asp Ser Thr Phe Arg Leu Gin Val Ser Cys Ser Tyr Ser Ala 65 70 75
AGT AGC AGT GCT CTC CCA GTT AAT GTC CAA GTT CTT ACT CTC CCA CCA 286 Ser Ser Ser Ala Leu Pro Val Asn Val Gin Val Leu Thr Leu Pro Pro 80 85 90 95
CCC CTT CCT GAG ACC CTG CCT GGA AAC CTC ACT CTG GAA CTT AAG ATT 334 Pro Leu Pro Glu Thr Leu Pro Gly Asn Leu Thr Leu Glu Leu Lys He 100 105 110
GCC AAA GAT AAA CCG TAT CGC TCC TAC TAC ACG GCT AGT GAC TAC CCA 382 Ala Lys Asp Lys Pro Tyr Arg Ser Tyr Tyr Thr Ala Ser Asp Tyr Pro 115 120 125
GTG GTG AAG TTA CTT CGG GAT CCC ATC TAC GTG GAA GTC TCC ATC CAT 430 Val Val Lys Leu Leu Arg Asp Pro He Tyr Val Glu Val Ser He His 130 135 140
CAG AGA ACA GAC CCC AGT CTC GAG CTG CGC CTG GAC CAG TGT TGG GCG 478 Gin Arg Thr Asp Pro Ser Leu Glu Leu Arg Leu Asp Gin Cys Trp Ala 145 150 155
ACA CCT GGT GCA GAT GCC CTG CTC CAG CCC CAG TGG CCC TTG CTT GTG 526 Thr Pro Gly Ala Asp Ala Leu Leu Gin Pro Gin Trp Pro Leu Leu Val 160 165 170 175
AAT GGG TGC CCC TAC ACA GGA GAC AAC TAT CAG ACA AAA CTG ATC CCT 574 Asn Gly Cys Pro Tyr Thr Gly Asp Asn Tyr Gin Thr Lys Leu He Pro 180 185 190
GTC TGG GAA GCC TCA GAC CTG CCG TTT CCT TCT CAC TAC CAG CGC TTC 622 Val Trp Glu Ala Ser Asp Leu Pro Phe Pro Ser His Tyr Gin Arg Phe 195 200 205
AGC ATT TCC ACC TTC AGC TTT GTG GAC TCA GTG GCA AAG CGG GCC CTC 670 Ser He Ser Thr Phe Ser Phe Val Asp Ser Val Ala Lys Arg Ala Leu 210 215 220
AAG GGA CCG GTG TAT CTG CAC TGC AGT GCA TCG GTC TGC CAG CCT GCC 718 Lys Gly Pro Val Tyr Leu His Cys Ser Ala Ser Val Cys Gin Pro Ala 225 230 235
GGG ACA CCA TCC TGT GTG ACA CTC TGT CCT GCC AGA CGA AGA AGA AGC 766 Gly Thr Pro Ser Cys Val Thr Leu Cys Pro Ala Arg Arg Arg Arg Ser 240 245 250 255
TCT GAC ATC CAT TTT CAG AAC AAA ACG GCT AGC ATT TCT AGC AAG GGT 814 Ser Asp He His Phe Gin Asn Lys Thr Ala Ser He Ser Ser Lys Gly 260 265 270
CCC TTG ATT CTA CTC CAA GCC ATT CAA GAC TCT TCA GAA AAG CTC CAC 862 Pro Leu He Leu Leu Gin Ala He Gin Asp Ser Ser Glu Lys Leu His 275 280 285
AAA TAC TCA AGG TCT CCT GTA GAC TCT CAA GCT TTG TGG GTG GCT GGC 910 Lys Tyr Ser Arg Ser Pro Val Asp Ser Gin Ala Leu Trp Val Ala Gly 290 295 300
CTA TCT GGA ATC TTA ATC GTT GGA GCC TTG TTC ATG TCC TAC CTG GCC 958 Leu Ser Gly He Leu He Val Gly Ala Leu Phe Met Ser Tyr Leu Ala 305 310 315
ATT AGG AAA TGG AGA TGAGTTGCTC AGCCCAAATG TGTTAATAAA ACCAGATTGC 1013
He Arg Lys Trp Arg
320
AGCCGGCCGC GAATTC 1029
(2) INFORMATION FOR SEQ ID NO:22:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 324 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:
Asn Ser Val His Leu Ala Phe Arg Asn Asp Ser Glu Cys Lys Pro Val
1 5 10 15
Met Ala Thr His Thr Phe Val Leu Phe Arg Phe Pro Phe Thr Thr Cys 20 25 30
Gly Thr Thr Lys Gin He Thr Gly Lys Gin Ala Val Tyr Glu Asn Glu 35 40 45
Leu Val Ala Ala Arg Asp Val Arg Thr Trp Ser Arg Gly Ser He Thr 50 55 60
Arg Asp Ser Thr Phe Arg Leu Gin Val Ser Cys Ser Tyr Ser Ala Ser 65 70 75 80
Ser Ser Ala Leu Pro Val Asn Val Gin Val Leu Thr Leu Pro Pro Pro 85 90 95
- Ill -
Leu Pro Glu Thr Leu Pro Gly Asn Leu Thr Leu Glu Leu Lys He Ala 100 105 110
Lys Asp Lys Pro Tyr Arg Ser Tyr Tyr Thr Ala Ser Asp Tyr Pro Val 115 120 125
Val Lys Leu Leu Arg Asp Pro He Tyr Val Glu Val Ser He His Gin 130 135 140
Arg Thr Asp Pro Ser Leu Glu Leu Arg Leu Asp Gin Cys Trp Ala Thr 145 150 155 160
Pro Gly Ala Asp Ala Leu Leu Gin Pro Gin Trp Pro Leu Leu Val Asn 165 170 175
Gly Cys Pro Tyr Thr Gly Asp Asn Tyr Gin Thr Lys Leu He Pro Val 180 185 190
Trp Glu Ala Ser Asp Leu Pro Phe Pro Ser His Tyr Gin Arg Phe Ser 195 200 205
He Ser Thr Phe Ser Phe Val Asp Ser Val Ala Lys Arg Ala Leu Lys 210 215 220
Gly Pro Val Tyr Leu His Cys Ser Ala Ser Val Cys Gin Pro Ala Gly 225 230 235 240
Thr Pro Ser Cys Val Thr Leu Cys Pro Ala Arg Arg Arg Arg Ser Ser 245 250 255
Asp He His Phe Gin Asn Lys Thr Ala Ser He Ser Ser Lys Gly Pro 260 265 270
Leu He Leu Leu Gin Ala He Gin Asp Ser Ser Glu Lys Leu His Lys 275 280 285
Tyr Ser Arg Ser Pro Val Asp Ser Gin Ala Leu Trp Val Ala Gly Leu 290 295 300
Ser Gly He Leu He Val Gly Ala Leu Phe Met Ser Tyr Leu Ala He 305 310 315 320
Arg Lys Trp Arg
(2) INFORMATION FOR SEQ ID NO:23:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1457 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Bos taurus
(D) DEVELOPMENTAL STAGE: Juvenile
(E) HAPLOTYPE: Diploidy
(F) TISSUE TYPE: Ovary
(G) CELL TYPE: Oocyte
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 149..1411
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:
CCCGGGCCTC CCTACTCTCA GGAAGGCACC CGCTCACCTC CTCAAGTTCT CGATCTCGGC 60
CGGGATGCTC TGAAGCTGGT TGCCGCCGAG GCTGAGGGTC TGCAGCGGCG CAGTCCAGCA 120
GCGAGGTGGG AGTGGCTTCG TGGGCACC ATG GGG CCG TGC TCT AGG CTG TTC 172
Met Gly Pro Cys Ser Arg Leu Phe 1 5
GTC TGC TTT CTG CTC TGG GGA AGC ACA GAG CTC TGC AGC CCC CAG CCC 220 Val Cys Phe Leu Leu Trp Gly Ser Thr Glu Leu Cys Ser Pro Gin Pro 10 15 20
TTC TGG GAT GAT GAA ACC GAG CGC TTC AGG CCA TCA AAG CCG CCC GCC 268 Phe Trp Asp Asp Glu Thr Glu Arg Phe Arg Pro Ser Lys Pro Pro Ala 25 30 35 40
GTG ATG GTG GAG TGT CAG GAG GCC CAG CTG GTG GTC ACA GTC GAC AAA 316 Val Met Val Glu Cys Gin Glu Ala Gin Leu Val Val Thr Val Asp Lys 45 50 55
GAC CTT TTC GGC ACA GGG AAG CTC ATC CGG CCT GCG GAC CTC ACC CTG 364 Asp Leu Phe Gly Thr Gly Lys Leu He Arg Pro Ala Asp Leu Thr Leu 60 65 70
GGC CCC GAC AAC TGT GAG CCG CTG GCC TCC GCG GAC ACG GAT GGC GTG 412 Gly Pro Asp Asn Cys Glu Pro Leu Ala Ser Ala Asp Thr Asp Gly Val 75 80 85
GTT AGG TTT GCG GTC GGG CTG CAC GAG TGT GGC AAC ATC TTG CAG GTG 460 Val Arg Phe Ala Val Gly Leu His Glu Cys Gly Asn He Leu Gin Val 90 95 100
ACC GAC AAT GCC CTG GTG TAC AGC ACC TTC CTG CTC CAC AAC CCC CGC 508 Thr Asp Asn Ala Leu Val Tyr Ser Thr Phe Leu Leu His Asn Pro Arg 105 110 115 120
CCT GCA GGA AAC CTG TCC ATC CTG AGG ACT AAC CGC GCA GAG GTC CCC 556 Pro Ala Gly Asn Leu Ser He Leu Arg Thr Asn Arg Ala Glu Val Pro 125 130 135
ATC GAG TGC CAC TAC CCC AGG CAG GGC AAT GTG AGT AGC TGG GCC ATC 604 He Glu Cys His Tyr Pro Arg Gin Gly Asn Val Ser Ser Trp Ala He 140 145 150
CAG CCC ACC TGG GTG CCA TTC AGG ACC ACA GTG TTC TCG GAG GAG AAG 652 Gin Pro Thr Trp Val Pro Phe Arg Thr Thr Val Phe Ser Glu Glu Lys 155 160 165
CTG GTT TTC TCT CTG CGC CTG ATG GAG GAG AAC TGG AGC GCC GAG AAG 700 Leu Val Phe Ser Leu Arg Leu Met Glu Glu Asn Trp Ser Ala Glu Lys 170 175 180
ATG ACG CCC ACC TTC CAG CTG GGA GAC AGA GCC CAC CTC CAG GCC CAA 748 Met Thr Pro Thr Phe Gin Leu Gly Asp Arg Ala His Leu Gin Ala Gin 185 190 195 200
GTG CAC ACT GGC AGC CAC GTG CCC CTG CGG CTG TTC GTG GAC CAC TGC 796 Val His Thr Gly Ser His Val Pro Leu Arg Leu Phe Val Asp His Cys 205 210 215
GTG GCC AGC CTG ACG CCA GAC TGG AGC ACC TCC CCT TAC CAC ACC ATC 844 Val Ala Ser Leu Thr Pro Asp Trp Ser Thr Ser Pro Tyr His Thr He 220 225 230
GTG GAC TTC CAT GGT TGT CTC GTC GAT GGT CTC ACC GAT GCC TCC TCT 892 Val Asp Phe His Gly Cys Leu Val Asp Gly Leu Thr Asp Ala Ser Ser 235 240 245
GCT TTC AAA GCA CCC AGA CCC AGA CCG GAG ATC CTC CAG TTC ACA GTG 940 Ala Phe Lys Ala Pro Arg Pro Arg Pro Glu He Leu Gin Phe Thr Val 250 255 260
GAT GTG TTC CGT TTT GCT AAT GAC TCC AGA AAC ATG ATA TAT ATC ACC 988 Asp Val Phe Arg Phe Ala Asn Asp Ser Arg Asn Met He Tyr He Thr 265 270 275 280
TGC CAC CTG AAG GTC ACT CCG GTT GAC CGA GTC CCG GAC CAA CTA AAC 1036 Cys His Leu Lys Val Thr Pro Val Asp Arg Val Pro Asp Gin Leu Asn 285 290 295
AAA GCC TGT TCC TTC AGC AAG TCC TCC AAC AGG TGG TCC CCG GTT GAA 1084 Lys Ala Cys Ser Phe Ser Lys Ser Ser Asn Arg Trp Ser Pro Val Glu 300 305 310
GGC CCC ACT GAC ATC TGT CGA TGC TGT AGC AAG GGG CGC TGT GGC ATT 1132 Gly Pro Thr Asp He Cys Arg Cys Cys Ser Lys Gly Arg Cys Gly He 315 320 325
TCA GGC CGT TCC ATG AGG CTG TCC CAC CGG GAG GGC AGG CCT GTT CCC 1180 Ser Gly Arg Ser Met Arg Leu Ser His Arg Glu Gly Arg Pro Val Pro 330 335 340
CGA AGT CGC AGG CAC GTG ACG GAG GAA GCA GAT GTC ACC GTG GGG CCG 1228 Arg Ser Arg Arg His Val Thr Glu Glu Ala Asp Val Thr Val Gly Pro 345 350 355 360
TTG ATC TTC CTG AGG AAG ATG AAT GAC CGT GGC GTG GAA GGG CCC ACC 1276 Leu He Phe Leu Arg Lys Met Asn Asp Arg Gly Val Glu Gly Pro Thr 365 370 375
TCC TCT CCC CCT CTG GTG ATG CTG GGC TTA GGC CTG GCT ACT GTG ATG 1324 Ser Ser Pro Pro Leu Val Met Leu Gly Leu Gly Leu Ala Thr Val Met 380 385 390
ACC TTG ACT CTG GCT GCC ATT GTC CTG GGT CTC ACT GGG AGG CTT CGG 1372 Thr Leu Thr Leu Ala Ala He Val Leu Gly Leu Thr Gly Arg Leu Arg 395 400 405
GCT GCT TCT CAC CCC GTG TGC CCT GTG TCT GCT TCC CAA TAAAAGAAGA 1421 Ala Ala Ser His Pro Val Cys Pro Val Ser Ala Ser Gin 410 415 420
AAGTGAAAAA AAAAAAAAAA AAGCGGCCGC GAATTC 1457
(2) INFORMATION FOR SEQ ID NO:24:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 421 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:
Met Gly Pro Cys Ser Arg Leu Phe Val Cys Phe Leu Leu Trp Gly Ser
1 5 10 15
Thr Glu Leu Cys Ser Pro Gin Pro Phe Trp Asp Asp Glu Thr Glu Arg 20 25 30
Phe Arg Pro Ser Lys Pro Pro Ala Val Met Val Glu Cys Gin Glu Ala 35 40 45
Gin Leu Val Val Thr Val Asp Lys Asp Leu Phe Gly Thr Gly Lys Leu 50 55 60
He Arg Pro Ala Asp Leu Thr Leu Gly Pro Asp Asn Cys Glu Pro Leu 65 70 75 80
Ala Ser Ala Asp Thr Asp Gly Val Val Arg Phe Ala Val Gly Leu His 85 90 95
Glu Cys Gly Asn He Leu Gin Val Thr Asp Asn Ala Leu Val Tyr Ser 100 105 110
Thr Phe Leu Leu His Asn Pro Arg Pro Ala Gly Asn Leu Ser He Leu 115 120 125
Arg Thr Asn Arg Ala Glu Val Pro He Glu Cys His Tyr Pro Arg Gin 130 135 140
Gly Asn Val Ser Ser Trp Ala He Gin Pro Thr Trp Val Pro Phe Arg 145 150 155 160
Thr Thr Val Phe Ser Glu Glu Lys Leu Val Phe Ser Leu Arg Leu Met 165 170 175
Glu Glu Asn Trp Ser Ala Glu Lys Met Thr Pro Thr Phe Gin Leu Gly 180 185 190
Asp Arg Ala His Leu Gin Ala Gin Val His Thr Gly Ser His Val Pro 195 200 205
Leu Arg Leu Phe Val Asp His Cys Val Ala Ser Leu Thr Pro Asp Trp 210 215 220
Ser Thr Ser Pro Tyr His Thr He Val Asp Phe His Gly Cys Leu Val 225 230 235 240
Asp Gly Leu Thr Asp Ala Ser Ser Ala Phe Lys Ala Pro Arg Pro Arg 245 250 255
Pro Glu He Leu Gin Phe Thr Val Asp Val Phe Arg Phe Ala Asn Asp 260 265 270
Ser Arg Asn Met He Tyr He Thr Cys His Leu Lys Val Thr Pro Val 275 280 285
Asp Arg Val Pro Asp Gin Leu Asn Lys Ala Cys Ser Phe Ser Lys Ser 290 295 300
Ser Asn Arg Trp Ser Pro Val Glu Gly Pro Thr Asp He Cys Arg Cys 305 310 315 320
Cys Ser Lys Gly Arg Cys Gly He Ser Gly Arg Ser Met Arg Leu Ser 325 330 335
His Arg Glu Gly Arg Pro Val Pro Arg Ser Arg Arg His Val Thr Glu 340 345 350
Glu Ala Asp Val Thr Val Gly Pro Leu He Phe Leu Arg Lys Met Asn 355 360 365
Asp Arg Gly Val Glu Gly Pro Thr Ser Ser Pro Pro Leu Val Met Leu 370 375 380
Gly Leu Gly Leu Ala Thr Val Met Thr Leu Thr Leu Ala Ala He Val 385 390 395 400
Leu Gly Leu Thr Gly Arg Leu Arg Ala Ala Ser His Pro Val Cys Pro 405 410 415
Val Ser Ala Ser Gin 420
(2) INFORMATION FOR SEQ ID NO:25:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 125 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: AGTTCGTGCT TATCTGAACA TGTCTTGAGG GATTAGTATG TGTGCTCATT TGGGTTCTTT 60 CCGCTGTATG CTAGGCGTAT CTAGATGCAT TAGCTTGTTA ACACCTCATG TGGAGTAAAA 120 GATGT 125
(2) INFORMATION FOR SEQ ID NO:26:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 111 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: CAGGCGTAGG CGTGGACTGA AGTTCAAAGC CATGCGCCCG TTCTGATAGC ATACGTTTGA 60 AATGTCATTG TAGTTGCATG GCTGTATAAG CCAGTCTCAT AGATAAGGGA A 111
(2) INFORMATION FOR SEQ ID NO:27:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 96 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: CDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: GCGGTCGGTC ATGTGATGCT GCGTATAGTA CGATTTTGAA TGCATTATGC GAAATTATTC 60 TAACGACCCG CGATATGGAG GTTGGATTAA GTTACA 96
(2) INFORMATION FOR SEQ ID NO:28:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: ATGGARAGRT GYCAMGARG 19
(2) INFORMATION FOR SEQ ID NO:29:
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(li) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: GATCTAAGGA AGATCTATGG ATCC 24
(2) INFORMATION FOR SEQ ID NO:30:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(il) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: GATCTAAGGA GGTTGTATGG ATCC 24
(2) INFORMATION FOR SEQ ID NO:31:
(1) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 55 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: GATCTATGAC CATGATTACG GATTCGCGTA GCCGTCGTCC TGCAGCGTCG CGACT 55
(2) INFORMATION FOR SEQ ID NO:32: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 57 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: GGGAAAACCC GGGCGTTACC CAACTTAATC GATTAGCAGC ACATCCCCCT TCGCCAG 57
(2) INFORMATION FOR SEQ ID NO:33:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 54 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: TTTTCCCAGT CGCGCTGCAG AACGACGGCT AGCGAATCCG TAATCATGGT CATA 54
(2) INFORMATION FOR SEQ ID NO:34:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 52 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
( i) SEQUENCE DESCRIPTION: SEQ ID NO:34: CTGGCCAAAG GGGGATGTGG CTGCTAATCG ATTAAGTTGG GTAACGCCCG GG 52
(2) INFORMATION FOR SEQ ID NO: 5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 120 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: GATCTATGAC CATGATTACG GATTCGCTAG CCGTCGTTCT GCAGCGTCGC GACTGGGAAA 60
ATACTGGTAC TAATGCCTAA GCGATCGGCA GCAAGACGTC GGAGCGCTGAC CCTTTACCC 120 GGGCGTTACC CAACTTAATC GATTAGCAGC ACATCCCCCT TTCGCCAGTGG GCCCGCAAT 180 CCCTTGAATT AGCAAATCGT CGTGTAGGGG GAAAGCGGTC 120
(2) INFORMATION FOR SEQ ID NO:36:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: GCGAAGCTTC CGACACCATC GAACGGCGC 29
(2) INFORMATION FOR SEQ ID NO:37:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: GCGCACAATG TGCCTAATGA GTGAGCTAAC 30
(2) INFORMATION FOR SEQ ID NO:38:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 28 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: CGCGGATCCG GACGAAGGCC AGCGCTTG 28
(2) INFORMATION FOR SEQ ID NO:39:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 58 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: GCGGTCGACT CATTAATGAT GATGATGATG ATGCGGGCTC GAGGTGGACC CTTCCACC 58 (2) INFORMATION FOR SEQ ID NO:40: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1701 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..1698
(xi) SEQUENCE DESCRIPTION: SEQ ID NO.-40:
ATG TGG CTG CTG CGG TGC GTT TTG CTG TGT GTT TCA TTA TCT CTT GCT 48 Met Trp Leu Leu Arg Cys Val Leu Leu Cys Val Ser Leu Ser Leu Ala 1 5 10 15
GTG AGT GGC CAG CAT AAG CCT GAG GCA CCA GAT TAT TCC AGT GTG CTC 96 Val Ser Gly Gin His Lys Pro Glu Ala Pro Asp Tyr Ser Ser Val Leu 20 25 30
CAC TGT GGG CCG TGG AGC TTC CAG TTT GCT GTA AAC CTC AAC CAG GAG 144 His Cys Gly Pro Trp Ser Phe Gin Phe Ala Val Asn Leu Asn Gin Glu 35 40 45
GCA ACG TCT CCT CCT GTA CTA ATA GCT TGG GAC AAC CAA GGG CTG CTG 192 Ala Thr Ser Pro Pro Val Leu He Ala Trp Asp Asn Gin Gly Leu Leu 50 55 60
CAC GAG CTG CAG AAT GAC TCC GAC TGT GGC ACC TGG ATA AGA AAA GGT 240 His Glu Leu Gin Asn Asp Ser Asp Cys Gly Thr Trp He Arg Lys Gly 65 70 75 80
CCA GGC AGC TCC GTG GTG TTG GAG GCA ACC TAT AGC AGC TGC TAT GTC 288 Pro Gly Ser Ser Val Val Leu Glu Ala Thr Tyr Ser Ser Cys Tyr Val 85 90 95
ACT GAG TGG GTG AGT ATG ACC CAA TGG CCA GGG AGA CTG TGT GAA GCG 336 Thr Glu Trp Val Ser Met Thr Gin Trp Pro Gly Arg Leu Cys Glu Ala 100 105 110
CCT CAT GCT ACC ATC CAG GCT GAC CCC CAA GGC CTG TCT CTC CAG GAC 384 Pro His Ala Thr He Gin Ala Asp Pro Gin Gly Leu ser Leu Gin Asp 115 120 125
TCC CAC TAC ATC ATG CCA GTT GGA GTT GAA GGA GCA GGC GCG GCT GAA 432 Ser His Tyr He Met Pro Val Gly Val Glu Gly Ala Gly Ala Ala Glu 130 135 140
CAC AAG GTG GTT ACA GAG AGG AAG CTG CTC AAG TGT CCT ATG GAT CTT 480 His Lys Val Val Thr Glu Arg Lys Leu Leu Lys Cys Pro Met Asp Leu 145 150 155 160
CTA GAT GCT CCA GAT ACT GAC TGG TGT GAC TCC ATC CCA GCA CGG GAC 528 Leu Asp Ala Pro Asp Thr Asp Trp Cys Asp Ser He Pro Ala Arg Asp 165 170 175
AGA CTG CCA TGT GCA CCT TCA CCC ATC TCT CGA GGA GAC TGT GAA GGG 576 Arg Leu Pro Cys Ala Pro Ser Pro He Ser Arg Gly Asp Cys Glu Gly 180 185 190
CTA GGC TGT TGT TAT AGC TCT GAA GAG GTG AAT TCC TGC TAC TAT GGA 624 Leu Gly Cys Cys Tyr Ser Ser Glu Glu Val Asn Ser Cys Tyr Tyr Gly 195 200 205
AAC ACT GTG ACC TTG CAT TGT ACC CGA GAG GGC CAT TTC TCT ATT GCT 672 Asn Thr Val Thr Leu His Cys Thr Arg Glu Gly His Phe Ser He Ala 210 215 220
GTG TCT CGG AAC GTG ACC TCG CCA CCA CTG CTC TTG GAT TCT GTG CGC 720 Val Ser Arg Asn Val Thr Ser Pro Pro Leu Leu Leu Asp Ser Val Arg 225 230 235 240
TTG GCC CTT AGG AAT GAC AGT GCG TGT AAC CCT GTG ATG GCA ACA CAA 768 Leu Ala Leu Arg Asn Asp Ser Ala Cys Asn Pro Val Met Ala Thr Gin 245 250 255
GCT TTT GTT CTG TTC CAG TTT CCA TTT ACT TCC TGT GGC ACC ACA AGA 816 Ala Phe Val Leu Phe Gin Phe Pro Phe Thr Ser Cys Gly Thr Thr Arg 260 265 270
CAG ATC ACT GGA GAC CGA GCA GTA TAT GAA AAT GAA CTG GTG GCA ACT 864 Gin He Thr Gly Asp Arg Ala Val Tyr Glu Asn Glu Leu Val Ala Thr 275 280 285
AGG GAT GTG AAA AAT GGG AGC CGT GGC TCT GTC ACT CGT GAC AGC ATC 912 Arg Asp Val Lys Asn Gly Ser Arg Gly Ser Val Thr Arg Asp Ser He 290 295 300
TTC AGG CTC CAT GTC AGC TGC AGC TAC TCA GTA AGT AGC AAC TCT CTC 960 Phe Arg Leu His Val Ser Cys Ser Tyr Ser Val Ser Ser Asn Ser Leu 305 310 315 320
CCA ATC AAT GTC CAG GTT TTC ACT CTC CCA CCA CCC TTT CCT GAG ACC 1008 Pro He Asn Val Gin Val Phe Thr Leu Pro Pro Pro Phe Pro Glu Thr 325 330 335
CAG CCT GGA CCC CTC ACT CTG GAA CTT CAG ATT GCC AAA GAT AAA AAC 1056 Gin Pro Gly Pro Leu Thr Leu Glu Leu Gin He Ala Lys Asp Lys Asn 340 345 350
TAT GGC TCT TAC TAC GGT GTT GGT GAC TAC CCA GTG GTG AAG TTG CTT 1104 Tyr Gly Ser Tyr Tyr Gly Val Gly Asp Tyr Pro Val Val Lys Leu Leu 355 360 365
CGG GAT CCC ATT TAC GTG GAG GTC TCC ATC CTT CAC AGA ACA GAC CCC 1152 Arg Asp Pro He Tyr Val Glu Val Ser He Leu His Arg Thr Asp Pro 370 375 380
TAC CTG GGG CTG CTC CTA CAA CAG TGT TGG GCA ACA CCC AGC ACT GAC 1200 Tyr Leu Gly Leu Leu Leu Gin Gin Cys Trp Ala Thr Pro Ser Thr Asp 385 390 395 400
CCC CTG AGT CAG CCA CAG TGG CCC ATC CTG GTA AAG GGC TGC CCC TAC 1248 Pro Leu Ser Gin Pro Gin Trp Pro He Leu Val Lys Gly Cys Pro Tyr 405 410 415
ATT GGA GAC AAC TAT CAG ACC CAG CTG ATC CCT GTC CAG AAA GCC TTG 1296 He Gly Asp Asn Tyr Gin Thr Gin Leu He Pro Val Gin Lys Ala Leu 420 425 430
GAT CTT CCA TTT CCC TCT CAC CAC CAG CGC TTC AGC ATC TTC ACC TTC 1344 Asp Leu Pro Phe Pro Ser His His Gin Arg Phe Ser He Phe Thr Phe 435 440 445
AGC TTT GTG AAC CCT ACA GTG GAG AAA CAG GCC CTC AGG GGA CCG GTG 1392 Ser Phe Val Asn Pro Thr Val Glu Lys Gin Ala Leu Arg Gly Pro Val 450 455 460
CAT CTG CAC TGC AGC GTG TCA GTC TGC CAG CCT GCT GAG ACA CCA TCC 1440 His Leu His Cys Ser Val Ser Val Cys Gin Pro Ala Glu Thr Pro Ser 465 470 475 480
TGT GTG GTG ACC TGT CCT GAT CTC AGT CGA AGA AGA AAT TTT GAC AAC 1488 Cys Val Val Thr Cys Pro Asp Leu Ser Arg Arg Arg Asn Phe Asp Asn 485 490 495
AGT TCT CAG AAC ACT ACT GCT AGT GTT TCT AGC AAA GGC CCC ATG ATT 1536 Ser Ser Gin Asn Thr Thr Ala Ser Val Ser Ser Lys Gly Pro Met He 500 505 510
CTA CTC CAA GCC ACT AAG GAC CCT CCA GAA AAG CTC CGT GTT CCT GTA 1584 Leu Leu Gin Ala Thr Lys Asp Pro Pro Glu Lys Leu Arg Val Pro Val 515 520 525
GAC TCG AAA GTT CTG TGG GTG GCA GGC CTT TCT GGG ACC TTA ATC CTT 1632 Asp Ser Lys Val Leu Trp Val Ala Gly Leu Ser Gly Thr Leu He Leu 530 535 540
GGA GCC TTG TTA GTA TCC TAC TTG GCT GTC AAG AAA CAG AAG AGT TGC 1680 Gly Ala Leu Leu Val Ser Tyr Leu Ala Val Lys Lys Gin Lys Ser Cys 545 550 555 560
CCA GAC CAA ATG TGT CAA TAA 1701
Pro Asp Gin Met Cys Gin 565
(2) INFORMATION FOR SEQ ID NO:41:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 566 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: 1inear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41:
Met Trp Leu Leu Arg Cys Val Leu Leu Cys Val Ser Leu Ser Leu Ala
1 5 10 15
Val Ser Gly Gin His Lys Pro Glu Ala Pro Asp Tyr Ser Ser Val Leu 20 25 30
His Cys Gly Pro Trp Ser Phe Gin Phe Ala Val Asn Leu Asn Gin Glu 35 40 45
Ala Thr Ser Pro Pro Val Leu He Ala Trp Asp Asn Gin Gly Leu Leu 50 55 60
His Glu Leu Gin Asn Asp Ser Asp Cys Gly Thr Trp He Arg Lys Gly 65 70 75 80
Pro Gly Ser Ser Val Val Leu Glu Ala Thr Tyr Ser Ser Cys Tyr Val 85 90 95
Thr Glu Trp Val Ser Met Thr Gin Trp Pro Gly Arg Leu Cys Glu Ala 100 105 110
Pro His Ala Thr He Gin Ala Asp Pro Gin Gly Leu Ser Leu Gin Asp 115 120 125
Ser His Tyr He Met Pro Val Gly Val Glu Gly Ala Gly Ala Ala Glu 130 135 140
His Lys Val Val Thr Glu Arg Lys Leu Leu Lys Cys Pro Met Asp Leu 145 150 155 160
Leu Asp Ala Pro Asp Thr Asp Trp Cys Asp Ser He Pro Ala Arg Asp
165 170 175
Arg Leu Pro Cys Ala Pro Ser Pro He Ser Arg Gly Asp Cys Glu Gly 180 185 190
Leu Gly Cys Cys Tyr Ser Ser Glu Glu Val Asn Ser Cys Tyr Tyr Gly 195 200 205
Asn Thr Val Thr Leu His Cys Thr Arg Glu Gly His Phe Ser He Ala 210 215 220
Val Ser Arg Asn Val Thr Ser Pro Pro Leu Leu Leu Asp Ser Val Arg 225 230 235 240
Leu Ala Leu Arg Asn Asp Ser Ala Cys Asn Pro Val Met Ala Thr Gin 245 250 255
Ala Phe Val Leu Phe Gin Phe Pro Phe Thr Ser Cys Gly Thr Thr Arg 260 265 270
Gin He Thr Gly Asp Arg Ala Val Tyr Glu Asn Glu Leu Val Ala Thr 275 280 285
Arg Asp Val Lys Asn Gly Ser Arg Gly Ser Val Thr Arg Asp Ser He 290 295 300
Phe Arg Leu His Val Ser Cys Ser Tyr Ser Val Ser Ser Asn Ser Leu 305 310 315 320
Pro He Asn Val Gin Val Phe Thr Leu Pro Pro Pro Phe Pro Glu Thr 325 330 335
Gin Pro Gly Pro Leu Thr Leu Glu Leu Gin He Ala Lys Asp Lys Asn 340 345 350
Tyr Gly Ser Tyr Tyr Gly Val Gly Asp Tyr Pro Val Val Lys Leu Leu 355 360 365
Arg Asp Pro He Tyr Val Glu Val Ser He Leu His Arg Thr Asp Pro 370 375 380
Tyr Leu Gly Leu Leu Leu Gin Gin Cys Trp Ala Thr Pro Ser Thr Asp 385 390 395 400
Pro Leu Ser Gin Pro Gin Trp Pro He Leu Val Lys Gly Cys Pro Tyr 405 410 415
He Gly Asp Asn Tyr Gin Thr Gin Leu He Pro Val Gin Lys Ala Leu 420 425 430
Asp Leu Pro Phe Pro Ser His His Gin Arg Phe Ser He Phe Thr Phe 435 440 445
Ser Phe Val Asn Pro Thr Val Glu Lys Gin Ala Leu Arg Gly Pro Val 450 455 460
His Leu His Cys Ser Val Ser Val Cys Gin Pro Ala Glu Thr Pro Ser 465 470 475 480
Cys Val Val Thr Cys Pro Asp Leu Ser Arg Arg Arg Asn Phe Asp Asn 485 490 495
Ser Ser Gin Asn Thr Thr Ala Ser Val Ser Ser Lys Gly Pro Met He 500 505 510
Leu Leu Gin Ala Thr Lys Asp Pro Pro Glu Lys Leu Arg Val Pro Val 515 520 525
Asp Ser Lys Val Leu Trp Val Ala Gly Leu Ser Gly Thr Leu He Leu 530 535 540
Gly Ala Leu Leu Val Ser Tyr Leu Ala Val Lys Lys Gin Lys Ser Cys 545 550 555 560
Pro Asp Gin Met Cys Gin
565
(2) INFORMATION FOR SEQ ID NO:42:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2266 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..2235
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42:
ATG GCG TGC AGG CAG AGA GGA GGC TCT TGG AGT CCC TCA GGC TGG TTC 48 Met Ala Cys Arg Gin Arg Gly Gly Ser Trp Ser Pro Ser Gly Trp Phe 1 5 10 15
AAT GCA GGC TGG AGC ACC TAC AGG TCG ATT TCT CTC TTC TTC GCC CTT 96 Asn Ala Gly Trp Ser Thr Tyr Arg Ser He Ser Leu Phe Phe Ala Leu 20 25 30
GTG ACT TCA GGG AAC TCC ATA GAT GTT TCT CAG TTG GTA AAT CCT GCC 144 Val Thr Ser Gly Asn Ser He Asp Val Ser Gin Leu Val Asn Pro Ala 35 40 45
TTT CCA GGC ACT GTC ACT TGC GAT GAA AGG GAA ATA ACA GTG GAG TTC 192 Phe Pro Gly Thr Val Thr Cys Asp Glu Arg Glu He Thr Val Glu Phe 50 55 60
CCA AGC AGT CCT GGC ACC AAG AAA TGG CAT GCA TCT GTG GTG GAT CCT 240 Pro Ser Ser Pro Gly Thr Lys Lys Trp His Ala Ser Val Val Asp Pro 65 70 75 80
CTT GGT CTC GAC ATG CCG AAC TGC ACT TAC ATC CTG GAC CCA GAA AAG 288 Leu Gly Leu Asp Met Pro Asn Cys Thr Tyr He Leu Asp Pro Glu Lys 85 90 95
CTC ACC CTG AGG GCT ACC TAT GAT AAC TGT ACC AGG AGA GTG CAT GGT 336 Leu Thr Leu Arg Ala Thr Tyr Asp Asn Cys Thr Arg Arg Val His Gly 100 105 110
GGA CAC CAG ATG ACC ATC AGA GTC ATG AAC AAC AGT GCT GCC TTA AGA 384 Gly His Gin Met Thr He Arg Val Met Asn Asn Ser Ala Ala Leu Arg 115 120 125
CAC GGA GCT GTC ATG TAT CAG TTC TTC TGT CCA GCT ATG CAA GTA GAA 432 His Gly Ala Val Met Tyr Gin Phe Phe Cys Pro Ala Met Gin Val Glu 130 135 140
GAG ACC CAG GGG CTT TCA GCA TCT ACA ATC TGC CAG AAG GAT TTC ATG 480 Glu Thr Gin Gly Leu Ser Ala Ser Thr He Cys Gin Lys Asp Phe Met 145 150 155 160
TCT TTT TCC TTG CCA CGG GTC TTC TCT GGC TTG GCT GAC GAC AGT AAG 528 Ser Phe Ser Leu Pro Arg Val Phe Ser Gly Leu Ala Asp Asp Ser Lys 165 170 175
GGG ACC AAA GTT CAG ATG GGA TGG AGC ATT GAG GTT GGT GAT GGT GCA 576 Gly Thr Lys Val Gin Met Gly Trp Ser He Glu Val Gly Asp Gly Ala 180 185 190
AGA GCC AAA ACT CTG ACC CTG CCA GAG GCC ATG AAG GAA GGC TTC AGC 624 Arg Ala Lys Thr Leu Thr Leu Pro Glu Ala Met Lys Glu Gly Phe Ser 195 200 205
CTC TTG ATT GAC AAC CAC AGG ATG ACC TTC CAT GTG CCA TTC AAT GCC 672 Leu Leu He Asp Asn His Arg Met Thr Phe His Val Pro Phe Asn Ala 210 215 220
ACT GGA GTG ACT CAC TAT GTG CAA GGT AAC AGT CAT CTC TAC ATG GTG 720 Thr Gly Val Thr His Tyr Val Gin Gly Asn Ser His Leu Tyr Met Val 225 230 235 240
TCT CTG AAG CTT ACA TTT ATA TCT CCT GGA CAG AAG GTG ATC TTC TCT 768 Ser Leu Lys Leu Thr Phe He Ser Pro Gly Gin Lys Val He Phe Ser 245 250 255
TCA CAA GCT ATT TGT GCA CCA GAT CCT GTG ACC TGC AAT GCC ACA CAC 816 Ser Gin Ala He Cys Ala Pro Asp Pro Val Thr Cys Asn Ala Thr His 260 265 270
ATG ACT CTC ACC ATA CCA GAG TTT CCT GGG AAG CTT AAG TCT GTG AGC 864 Met Thr Leu Thr He Pro Glu Phe Pro Gly Lys Leu Lys Ser Val Ser 275 280 285
TTT GAA AAC CAG AAC ATT GAT GTG AGC CAG CTG CAT GAC AAT GGA ATT 912 Phe Glu Asn Gin Asn He Asp Val Ser Gin Leu His Asp Asn Gly He 290 295 300
GAT CTA GAA GCA ACA AAT GGC ATG AAA TTG CAT TTC AGC AAA ACT CTG 960 Asp Leu Glu Ala Thr Asn Gly Met Lys Leu His Phe Ser Lys Thr Leu 305 310 315 320
CTC AAA ACG AAA TTA TCT GAA AAA TGC CTA CTC CAT CAG TTC TAC TTA 1008 Leu Lys Thr Lys Leu Ser Glu Lys Cys Leu Leu His Gin Phe Tyr Leu 325 330 335
GCT TCA CTC AAG CTG ACC TTT CTC CTT CGG CCA GAG ACA GTA TCC ATG 1056 Ala Ser Leu Lys Leu Thr Phe Leu Leu Arg Pro Glu Thr Val Ser Met 340 345 350
GTG ATC TAT CCT GAG TGT CTC TGT GAG TCA CCC GTT TCT ATA GTT ACA 1104 Val He Tyr Pro Glu Cys Leu Cys Glu Ser Pro Val Ser He Val Thr 355 360 365
GGG GAG CTG TGC ACC CAG GAT GGG TTT ATG GAC GTC GAG GTC TAC AGC 1152 Gly Glu Leu Cys Thr Gin Asp Gly Phe Met Asp Val Glu Val Tyr Ser 370 375 380
TAC CAA ACA CAA CCA GCT CTT GAC CTG GGT ACT CTG AGG GTG GGA AAC 1200 Tyr Gin Thr Gin Pro Ala Leu Asp Leu Gly Thr Leu Arg Val Gly Asn 385 390 395 400
TCA TCC TGC CAG CCT GTC TTT GAG GCT CAG TCT CAG GGG CTG GTA CGG 1248 Ser Ser Cys Gin Pro Val Phe Glu Ala Gin Ser Gin Gly Leu Val Arg 405 410 415
TTC CAC ATA CCC CTG AAT GGA TGT GGA ACG AGA TAT AAG TTC GAA GAT 1296 Phe His He Pro Leu Asn Gly Cys Gly Thr Arg Tyr Lys Phe Glu Asp 420 425 430
GAT AAA GTC GTC TAT GAA AAC GAA ATA CAT GCT CTC TGG ACG GAT TTT 1344 Asp Lys Val Val Tyr Glu Asn Glu He His Ala Leu Trp Thr Asp Phe 435 440 445
CCT CCA AGC AAA ATA TCT AGA GAC AGT GAG TTC AGA ATG ACA GTG AAG 1392 Pro Pro Ser Lys He Ser Arg Asp Ser Glu Phe Arg Met Thr Val Lys 450 455 460
TGT TCT TAT AGC AGG AAT GAC ATG CTA CTA AAC ATC AAC GTT GAA AGC 1440 Cys Ser Tyr Ser Arg Asn Asp Met Leu Leu Asn He Asn Val Glu Ser 465 470 475 480
CTT ACT CCT CCA GTG GCC TCA GTG AAG TTG GGT CCA TTT ACC TTG ATC 1488 Leu Thr Pro Pro Val Ala Ser Val Lys Leu Gly Pro Phe Thr Leu He 485 490 495
CTG CAA AGC TAC CCA GAT AAT TCC TAC CAA CAA CCT TAT GGG GAA AAC 1536 Leu Gin Ser Tyr Pro Asp Asn Ser Tyr Gin Gin Pro Tyr Gly Glu Asn 500 505 510
GAG TAC CCT CTA GTG AGA TTC CTC CGC CAA CCA ATT TAC ATG GAA GTG 1584 Glu Tyr Pro Leu Val Arg Phe Leu Arg Gin Pro He Tyr Met Glu Val 515 520 525
AGA GTC CTA AAC AGG GAT GAC CCC AAC ATC AAG CTG GTC TTA GAT GAC 1632 Arg Val Leu Asn Arg Asp Asp Pro Asn He Lys Leu Val Leu Asp Asp 530 535 540
TGC TGG GCG ACG TCC ACC ATG GAT CCA GAC TCT TTC CCC CAG TGG AAC 1680 Cys Trp Ala Thr Ser Thr Met Asp Pro Asp Ser Phe Pro Gin Trp Asn 545 550 555 560
GTT GTC GTG GAT GGC TGT GCA TAT GAC CTG GAC AAC TAC CAG ACC ACC 1728 Val Val Val Asp Gly Cys Ala Tyr Asp Leu Asp Asn Tyr Gin Thr Thr 565 570 575
TTC CAT CCA GTC GGC TCC TCT GTG ACC CAT CCT GAT CAC TAT CAG AGG 1776 Phe His Pro Val Gly Ser Ser Val Thr His Pro Asp His Tyr Gin Arg 580 585 590
TTT GAC ATG AAG GCT TTT GCC TTT GTA TCA GAA GCC CAC GTG CTC TCT 1824 Phe Asp Met Lys Ala Phe Ala Phe Val Ser Glu Ala His Val Leu Ser 595 600 605
AGC CTG GTC TAC TTC CAC TGC AGT GCC TTA ATC TGT AAT CGA CTC TCC 1872 Ser Leu Val Tyr Phe His Cys Ser Ala Leu He Cys Asn Arg Leu Ser 610 615 620
CCT GAC TCC CCA CTG TGT TCT GTG ACC TGC CCT GTG TCC TCT AGG CAC 1920 Pro Asp Ser Pro Leu Cys Ser Val Thr Cys Pro Val Ser Ser Arg His 625 630 635 640
AGG CGA GCC ACA GGG GCC ACT GAA GCA GAG AAA ATG ACA GTC AGC CTC 1968 Arg Arg Ala Thr Gly Ala Thr Glu Ala Glu Lys Met Thr Val Ser Leu 645 650 655
CCA GGA CCC ATT CTC CTG TTG TCA GAT GAC TCC TCA TTC AGA GGT GTC 2016 Pro Gly Pro He Leu Leu Leu Ser Asp Asp Ser Ser Phe Arg Gly Val 660 665 670
GGC TCA TCT GAT CTA AAA GCA AGT GGG AGC AGT GGG GAG AAG AGT AGG 2064 Gly Ser Ser Asp Leu Lys Ala Ser Gly Ser Ser Gly Glu Lys Ser Arg 675 680 685
AGT GAA ACA GGG GAG GAG GTT GGC TCA CGA GGT GCT ATG GAC ACC AAA 2112 Ser Glu Thr Gly Glu Glu Val Gly Ser Arg Gly Ala Met Asp Thr Lys 690 695 700
GGG CAC AAG ACT GCT GGA GAT GTT GGT TCC AAA GCT GTG GCT GCT GTG 2160 Gly His Lys Thr Ala Gly Asp Val Gly Ser Lys Ala Val Ala Ala Val 705 710 715 720
GCT GCC TTT GCA GGT GTG GTG GCA ACT CTA GGC TTC ATC TAC TAC CTG 2208 Ala Ala Phe Ala Gly Val Val Ala Thr Leu Gly Phe He Tyr Tyr Leu 725 730 735
TAC GAG AAA AGG ACT GTG TCA AAT CAC TAAATGGGCT TCTAAATAAA 2255
Tyr Glu Lys Arg Thr Val Ser Asn His 740 745
GCAGTCAAAA T 2266
(2) INFORMATION FOR SEQ ID NO:43:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 745 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43:
Met Ala Cys Arg Gin Arg Gly Gly Ser Trp Ser Pro Ser Gly Trp Phe 1 5 10 15
Asn Ala Gly Trp Ser Thr Tyr Arg Ser He Ser Leu Phe Phe Ala Leu 20 25 30
Val Thr Ser Gly Asn Ser He Asp Val Ser Gin Leu Val Asn Pro Ala 35 40 45
Phe Pro Gly Thr Val Thr Cys Asp Glu Arg Glu He Thr Val Glu Phe 50 55 60
Pro Ser Ser Pro Gly Thr Lys Lys Trp His Ala Ser Val Val Asp Pro 65 70 75 80
Leu Gly Leu Asp Met Pro Asn Cys Thr Tyr He Leu Asp Pro Glu Lys 85 90 95
Leu Thr Leu Arg Ala Thr Tyr Asp Asn Cys Thr Arg Arg Val His Gly 100 105 110
Gly His Gin Met Thr He Arg Val Met Asn Asn Ser Ala Ala Leu Arg 115 120 125
His Gly Ala Val Met Tyr Gin Phe Phe Cys Pro Ala Met Gin Val Glu 130 135 140
Glu Thr Gin Gly Leu Ser Ala Ser Thr He Cys Gin Lys Asp Phe Met 145 150 155 160
Ser Phe Ser Leu Pro Arg Val Phe Ser Gly Leu Ala Asp Asp Ser Lys 165 170 175
Gly Thr Lys Val Gin Met Gly Trp Ser He Glu Val Gly Asp Gly Ala 180 185 190
Arg Ala Lys Thr Leu Thr Leu Pro Glu Ala Met Lys Glu Gly Phe Ser 195 200 205
Leu Leu He Asp Asn His Arg Met Thr Phe His Val Pro Phe Asn Ala 210 215 220
Thr Gly Val Thr His Tyr Val Gin Gly Asn Ser His Leu Tyr Met Val 225 230 235 240
Ser Leu Lys Leu Thr Phe He Ser Pro Gly Gin Lys Val He Phe Ser 245 250 255
Ser Gin Ala He Cys Ala Pro Asp Pro Val Thr Cys Asn Ala Thr His 260 265 270
Met Thr Leu Thr He Pro Glu Phe Pro Gly Lys Leu Lys Ser Val Ser 275 280 285
Phe Glu Asn Gin Asn He Asp Val Ser Gin Leu His Asp Asn Gly He 290 295 300
Asp Leu Glu Ala Thr Asn Gly Met Lys Leu His Phe Ser Lys Thr Leu 305 310 315 320
Leu Lys Thr Lys Leu Ser Glu Lys Cys Leu Leu His Gin Phe Tyr Leu 325 330 335
Ala Ser Leu Lys Leu Thr Phe Leu Leu Arg Pro Glu Thr Val Ser Met 340 345 350
Val He Tyr Pro Glu Cys Leu Cys Glu Ser Pro Val Ser He Val Thr 355 360 365
Gly Glu Leu Cys Thr Gin Asp Gly Phe Met Asp Val Glu Val Tyr Ser 370 375 380
Tyr Gin Thr Gin Pro Ala Leu Asp Leu Gly Thr Leu Arg Val Gly Asn 385 390 395 400
Ser Ser Cys Gin Pro Val Phe Glu Ala Gin Ser Gin Gly Leu Val Arg 405 410 415
Phe His He Pro Leu Asn Gly Cys Gly Thr Arg Tyr Lys Phe Glu Asp 420 425 430
Asp Lys Val Val Tyr Glu Asn Glu He His Ala Leu Trp Thr Asp Phe 435 440 445
Pro Pro Ser Lys He Ser Arg Asp Ser Glu Phe Arg Met Thr Val Lys 450 455 460
Cys Ser Tyr Ser Arg Asn Asp Met Leu Leu Asn He Asn Val Glu Ser 465 470 475 480
Leu Thr Pro Pro Val Ala Ser Val Lys Leu Gly Pro Phe Thr Leu He 485 490 495
Leu Gin Ser Tyr Pro Asp Asn Ser Tyr Gin Gin Pro Tyr Gly Glu Asn 500 505 510
Glu Tyr Pro Leu Val Arg Phe Leu Arg Gin Pro He Tyr Met Glu Val 515 520 525
Arg Val Leu Asn Arg Asp Asp Pro Asn He Lys Leu Val Leu Asp Asp 530 535 540
Cys Trp Ala Thr Ser Thr Met Asp Pro Asp Ser Phe Pro Gin Trp Asn 545 550 555 560
Val Val Val Asp Gly Cys Ala Tyr Asp Leu Asp Asn Tyr Gin Thr Thr 565 570 575
Phe His Pro Val Gly Ser Ser Val Thr His Pro Asp His Tyr Gin Arg
580 585 590
Phe Asp Met Lys Ala Phe Ala Phe Val Ser Glu Ala His Val Leu Ser 595 600 605
Ser Leu Val Tyr Phe His Cys Ser Ala Leu He Cys Asn Arg Leu Ser 610 615 620
Pro Asp Ser Pro Leu Cys Ser Val Thr Cys Pro Val Ser Ser Arg His 625 630 635 640
Arg Arg Ala Thr Gly Ala Thr Glu Ala Glu Lys Met Thr Val Ser Leu 645 650 655
Pro Gly Pro He Leu Leu Leu Ser Asp Asp Ser Ser Phe Arg Gly Val 660 665 670
Gly Ser Ser Asp Leu Lys Ala Ser Gly Ser Ser Gly Glu Lys Ser Arg 675 680 685
Ser Glu Thr Gly Glu Glu Val Gly Ser Arg Gly Ala Met Asp Thr Lys 690 695 700
Gly His Lys Thr Ala Gly Asp Val Gly Ser Lys Ala Val Ala Ala Val 705 710 715 720
Ala Ala Phe Ala Gly Val Val Ala Thr Leu Gly Phe He Tyr Tyr Leu 725 730 735
Tyr Glu Lys Arg Thr Val Ser Asn His 740 745
(2) INFORMATION FOR SEQ ID NO:44:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 560 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(i ) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 15..506
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44:
GAATTCGCGG CCGC TCC TCT GTG ACC CAT CCT GAT CAC TAT CAG AGG TTT 50 Ser Ser Val Thr His Pro Asp His Tyr Gin Arg Phe
1 5 10
GAC ATG AAG GCT TTT GCC TTT GTA TCA GAG GCC CAT GTG CTC TCT AGC 98 Asp Met Lys Ala Phe Ala Phe Val Ser Glu Ala His Val Leu Ser Ser 15 20 25
CTG GTC TAC TTC CAC TGC AGT GCC TTA ATC TGC AAT CGA CTC TCT CCA 146 Leu Val Tyr Phe His Cys Ser Ala Leu He Cys Asn Arg Leu Ser Pro 30 35 40
GAC TCC CCT CTG TGT TCT GTG ACC TGC CCT GTG TCA TCT AGG CAC AGG 194 Asp Ser Pro Leu Cys Ser Val Thr Cys Pro Val Ser Ser Arg His Arg 45 50 55 60
CGA GCC ACA GGG GCC ACT GAA GCA GAG AAA ATG ACA GTC AGC CTC CCA 242
Arg Ala Thr Gly Ala Thr Glu Ala Glu Lys Met Thr Val Ser Leu Pro 65 70 75
GGA CCC ATT CTC CTG TTG TCA GAC GAC TCC TCA TTC AGA GGT GTT GGC 290 Gly Pro He Leu Leu Leu Ser Asp Asp Ser Ser Phe Arg Gly Val Gly 80 85 90
TCA TCT GAT CTA AAA GCA AGT GGG AGC AGT GGG GAG AAC AGT AGG AGC 338 Ser Ser Asp Leu Lys Ala Ser Gly Ser Ser Gly Glu Asn Ser Arg Ser 95 100 105
GAA ACA GGG GAG GAG GTT GGC TCA CGA GAT GTT ATG GAC ACC AAA GGG 386 Glu Thr Gly Glu Glu Val Gly Ser Arg Asp Val Met Asp Thr Lys Gly 110 115 120
CAC AGG ACT GCT GGA GAT GTT GGT TCC AAA GCT GTG GCT GCT GTG GCT 434 His Arg Thr Ala Gly Asp Val Gly Ser Lys Ala Val Ala Ala Val Ala 125 130 135 140
GCC TTG GCA GGT GTG GTG GCA ACT CTA GGC TTC ATC TGT TAC CTG TAT 482 Ala Leu Ala Gly Val Val Ala Thr Leu Gly Phe He Cys Tyr Leu Tyr 145 150 155
AAG AAA AGG ACT GTG TCA AAT CAC TAAATGGGCT TCTAAATAAA GCAGTCAAAA 536 Lys Lys Arg Thr Val Ser Asn His 160
TAAAAAAAAA GCGGCCGCGA ATTC 560
(2) INFORMATION FOR SEQ ID NO:45:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 164 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45:
Ser Ser Val Thr His Pro Asp His Tyr Gin Arg Phe Asp Met Lys Ala 1 5 10 15
Phe Ala Phe Val Ser Glu Ala His Val Leu Ser Ser Leu Val Tyr Phe 20 25 30
His Cys Ser Ala Leu He Cys Asn Arg Leu Ser Pro Asp Ser Pro Leu 35 40 45
Cys Ser Val Thr Cys Pro Val Ser Ser Arg His Arg Arg Ala Thr Gly 50 55 60
Ala Thr Glu Ala Glu Lys Met Thr Val Ser Leu Pro Gly Pro He Leu 65 70 75 80
Leu Leu Ser Asp Asp Ser Ser Phe Arg Gly Val Gly Ser Ser Asp Leu 85 90 95
Lys Ala Ser Gly Ser Ser Gly Glu Asn Ser Arg Ser Glu Thr Gly Glu 100 105 110
Glu Val Gly Ser Arg Asp Val Met Asp Thr Lys Gly His Arg Thr Ala 115 120 125
Gly Asp Val Gly Ser Lys Ala Val Ala Ala Val Ala Ala Leu Ala Gly 130 135 140
Val Val Ala Thr Leu Gly Phe He Cys Tyr Leu Tyr Lys Lys Arg Thr 145 150 155 160
Val Ser Asn His
(2) INFORMATION FOR SEQ ID NO:46:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 866 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: CDNA
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 12..821
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46:
GAATTCGCGG C CGC CGT GGC TCT GTC ACT CGT GAC AGC ATC TTC AGG CTC 50 Arg Arg Gly Ser Val Thr Arg Asp Ser He Phe Arg Leu 1 5 10
CAT GTC AGC TGC AGC TAC TCA GTA AGT AGC AAC TCT CTC CCA ATC AAG 98 His Val Ser Cys Ser Tyr Ser Val Ser Ser Asn Ser Leu Pro He Lys 15 20 25
GTC CAG GTT TTT ACT CTC CCA CCA CCC TTT CCT GAG ACC CAG CCT GGA 146 Val Gin Val Phe Thr Leu Pro Pro Pro Phe Pro Glu Thr Gin Pro Gly 30 35 40 45
CCC CTC ACT CTG GAA CTT CAG ATT GCC AAA GAT AAA AAC TAT GGC TCC 194 Pro Leu Thr Leu Glu Leu Gin He Ala Lys Asp Lys Asn Tyr Gly Ser 50 55 60
TAC TAT GGT GTT GGT GAC TAC CCC GTG GTG AAG TTG CTT CGG GAT CCC 242 Tyr Tyr Gly Val Gly Asp Tyr Pro Val Val Lys Leu Leu Arg Asp Pro 65 70 75
ATC TAT GTG GAG GTC TCC ATC CTT CAC AGA ACA GAC CCC TCC CTG GGG 290 He Tyr Val Glu Val Ser He Leu His Arg Thr Asp Pro Ser Leu Gly 80 85 90
CTG CTC CTA CAT CAG TGT TGG GCA ACA CCC AGC ACA GAC CCA CTG AGT 338 Leu Leu Leu His Gin Cys Trp Ala Thr Pro Ser Thr Asp Pro Leu Ser 95 100 105
CAG CCA CAG TGG CCC ATC CTG GTA AAG GGC TGC CCC TAC ATT GGA GAC 386 Gin Pro Gin Trp Pro He Leu Val Lys Gly Cys Pro Tyr He Gly Asp 110 115 120 125
AAC TAT CAG ACC CAG CTG ATC CCT GTC CAG AAA GCC TTG GAT CTT CCA 434 Asn Tyr Gin Thr Gin Leu He Pro Val Gin Lys Ala Leu Asp Leu Pro 130 135 140
TTT CCC TCT CAC TAC CAG CGC TTC AGC ATC TTC ACC TTC AGC TTT GTG 482 Phe Pro Ser His Tyr Gin Arg Phe Ser He Phe Thr Phe Ser Phe Val 145 150 155
GAC CCT ACA GCG GAG AAA CAG GCC CTC AGG GGA CCG GTG CAT CTG CAC 530 Asp Pro Thr Ala Glu Lys Gin Ala Leu Arg Gly Pro Val His Leu His 160 165 170
TGC AGT GTG TCA GTC TGC CAG CCT GCT GAG ACA CCA TCC TGT GCG GTA 578 Cys Ser Val Ser Val Cys Gin Pro Ala Glu Thr Pro Ser Cys Ala Val 175 180 185
ACC TGT CCT GAT CTC AGT CGA AGA AAT TCA GGC ACC ATT TTT CAG AAC 626 Thr Cys Pro Asp Leu Ser Arg Arg Asn Ser Gly Thr He Phe Gin Asn 190 195 200 205
ACT ACT GCT AGT GTT TCT AGC AAA GGC CCC ATG ATT CTA CTC CAA GCC 674 Thr Thr Ala Ser Val Ser Ser Lys Gly Pro Met He Leu Leu Gin Ala 210 215 220
ACT AAG GAC CCT CCA GAA AAG CTC CGT GCT CCT GTA GAC TCA AAA GTT 722 Thr Lys Asp Pro Pro Glu Lys Leu Arg Ala Pro Val Asp Ser Lys Val 225 230 235
CTG TGG GTG GCA GGC CTT TCT GGG ACC TTA ATC CTT GGA GGC TTA GTA 770 Leu Trp Val Ala Gly Leu Ser Gly Thr Leu He Leu Gly Gly Leu Val 240 245 250
GTA TCC TAC TTG GCT ATC AAA CAG CTG AAT TGT CCA GAC CAA ACA TGT 818 Val Ser Tyr Leu Ala He Lys Gin Leu Asn Cys Pro Asp Gin Thr Cys 255 260 265
CAA TAAAACCAGA CTGTACTCCC AAAAAAAAAA AGCGGCCGCG AATTC 866
Gin
270
(2) INFORMATION FOR SEQ ID NO:47:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 270 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47:
Arg Arg Gly Ser Val Thr Arg Asp Ser He Phe Arg Leu His Val Ser
1 5 10 15
Cys Ser Tyr Ser Val Ser Ser Asn Ser Leu Pro He Lys Val Gin Val 20 25 30
Phe Thr Leu Pro Pro Pro Phe Pro Glu Thr Gin Pro Gly Pro Leu Thr 35 40 45
Leu Glu Leu Gin He Ala Lys Asp Lys Asn Tyr Gly Ser Tyr Tyr Gly 50 55 60
Val Gly Asp Tyr Pro Val Val Lys Leu Leu Arg Asp Pro He Tyr Val 65 70 75 80
Glu Val Ser He Leu His Arg Thr Asp Pro Ser Leu Gly Leu Leu Leu 85 90 95
His Gin Cys Trp Ala Thr Pro Ser Thr Asp Pro Leu Ser Gin Pro Gin 100 105 110
Trp Pro He Leu Val Lys Gly Cys Pro Tyr He Gly Asp Asn Tyr Gin 115 120 125
Thr Gin Leu He Pro Val Gin Lys Ala Leu Asp Leu Pro Phe Pro Ser 130 135 140
His Tyr Gin Arg Phe Ser He Phe Thr Phe Ser Phe Val Asp Pro Thr 145 150 155 160
Ala Glu Lys Gin Ala Leu Arg Gly Pro Val His Leu His Cys Ser Val 165 170 175
Ser Val Cys Gin Pro Ala Glu Thr Pro Ser Cys Ala Val Thr Cys Pro 180 185 190
Asp Leu Ser Arg Arg Asn Ser Gly Thr He Phe Gin Asn Thr Thr Ala 195 200 205
Ser Val Ser Ser Lys Gly Pro Met He Leu Leu Gin Ala Thr Lys Asp 210 215 220
Pro Pro Glu Lys Leu Arg Ala Pro Val Asp Ser Lys Val Leu Trp Val 225 230 235 240
Ala Gly Leu Ser Gly Thr Leu He Leu Gly Gly Leu Val Val Ser Tyr 245 250 255
Leu Ala He Lys Gin Leu Asn Cys Pro Asp Gin Thr Cys Gin 260 265 270
(2) INFORMATION FOR SEQ ID NO:48:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 722 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 15..683
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48:
GAATTCGCGG CCGC ATC CAC ACT GGC AGC CAC GTG CCA CTG CGG TTG TTT 50 He His Thr Gly Ser His Val Pro Leu Arg Leu Phe 1 5 10
GTG GAC CAC TGC GTG GCC ACA CCA ACA CCA GAC CAG AAT GCC TCC CCT 98 Val Asp His Cys Val Ala Thr Pro Thr Pro Asp Gin Asn Ala Ser Pro 15 20 25
TAT CAC ACC ATC GTG GAC TTC CAT GGC TGT CTT GTC GAT GGT CTC ACT 146 Tyr His Thr He Val Asp Phe His Gly Cys Leu Val Asp Gly Leu Thr 30 35 40
GAT GCC TCT TCT GCG TTC AAA GTT CCT CGA CCC GGG CCA GAT ACA CTC 194 Asp Ala Ser Ser Ala Phe Lys Val Pro Arg Pro Gly Pro Asp Thr Leu 45 50 55 60
CAG TTC ACA GTG GAT GTC TTC CAC TTT GCT AAT GAC TCC AGA AAC ATG 242 Gin Phe Thr Val Asp Val Phe His Phe Ala Asn Asp Ser Arg Asn Met 65 70 75
ATA TAC ATC ACC TGC CAC CTG AAG GCC ATC CCA GCT GAG CAG GAA CCA 290 He Tyr He Thr Cys His Leu Lys Ala He Pro Ala Glu Gin Glu Pro 80 85 90
GAC GAA CTC AAC AAA GCC TGT TCC TTC AGC AAG TCT TCC AAC AGC TGG 338
Asp Glu Leu Asn Lys Ala Cys Ser Phe Ser Lys Ser Ser Asn Ser Trp 95 100 105
TTC CCA GTG GAA GGC CCA GCT GAC ATC TGT CAA TGC TGT AGC AAG GGT 386 Phe Pro Val Glu Gly Pro Ala Asp He Cys Gin Cys Cys Ser Lys Gly 110 115 120
GAC TGT GGC ACT CCA AGC CAT TCC AGG AGG CAG CCC CAT GTC GTG AGC 434 Asp Cys Gly Thr Pro Ser His Ser Arg Arg Gin Pro His Val Val Ser 125 130 135 140
CAG TGG TCC AGG TCT GCT TCT CGT AAC CGC AGG CAT GTG ACA GAA GAA 482 Gin Trp Ser Arg Ser Ala Ser Arg Asn Arg Arg His Val Thr Glu Glu 145 150 155
GCA GAT ATC ACC GTG GGG CCA CTG ATC TTC CTG GAC AGG AGT GCT GAC 530 Ala Asp He Thr Val Gly Pro Leu He Phe Leu Asp Arg Ser Ala Asp 160 165 170
TAT GAA GTA GAA CAG TGG GCC TTG CCG ACT GAC ACC TCC GTG CTG CTG 578 Tyr Glu Val Glu Gin Trp Ala Leu Pro Thr Asp Thr Ser Val Leu Leu 175 180 185
CTG GGC ATA GGC CTG GCC GTG GTG GCA TCT CTG ACT CTG ACC GCT GTT 626 Leu Gly He Gly Leu Ala Val Val Ala Ser Leu Thr Leu Thr Ala Val 190 195 200
ATC CTG ATT TTC ACC AGG AGG TGG CGC ACT GCC TCC CGC CCT GTG TCT 674 He Leu He Phe Thr Arg Arg Trp Arg Thr Ala Ser Arg Pro Val Ser 205 210 215 220
GTT TCC CAA TAAAAGAAGA AAGCAGTAAA AAAAAGCGGC CGCGAATTC 722
Val Ser Gin
(2) INFORMATION FOR SEQ ID NO:49:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 223 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49:
He His Thr Gly Ser His Val Pro Leu Arg Leu Phe Val Asp His Cys 1 5 10 15
Val Ala Thr Pro Thr Pro Asp Gin Asn Ala Ser Pro Tyr His Thr He 20 25 30
Val Asp Phe His Gly Cys Leu Val Asp Gly Leu Thr Asp Ala Ser Ser 35 40 45
Ala Phe Lys Val Pro Arg Pro Gly Pro Asp Thr Leu Gin Phe Thr Val 50 55 60
Asp Val Phe His Phe Ala Asn Asp Ser Arg Asn Met He Tyr He Thr 65 70 75 80
Cys His Leu Lys Ala He Pro Ala Glu Gin Glu Pro Asp Glu Leu Asn 85 90 95
Lys Ala Cys Ser Phe Ser Lys Ser Ser Asn Ser Trp Phe Pro Val Glu 100 105 110
Gly Pro Ala Asp He Cys Gin Cys Cys Ser Lys Gly Asp Cys Gly Thr 115 120 125
Pro Ser His Ser Arg Arg Gin Pro His Val Val Ser Gin Trp Ser Arg 130 135 140
Ser Ala Ser Arg Asn Arg Arg His Val Thr Glu Glu Ala Asp He Thr 145 150 155 160
Val Gly Pro Leu He Phe Leu Asp Arg Ser Ala Asp Tyr Glu Val Glu 165 170 175
Gin Trp Ala Leu Pro Thr Asp Thr Ser Val Leu Leu Leu Gly He Gly 180 185 190
Leu Ala Val Val Ala Ser Leu Thr Leu Thr Ala Val He Leu He Phe 195 200 205
Thr Arg Arg Trp Arg Thr Ala Ser Arg Pro Val Ser Val Ser Gin 210 215 220
(2) INFORMATION FOR SEQ ID NO:50:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 28 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: CGCCCTTCCC AGCAACTGCA CCATCACCAC CATGGG 36
(2) INFORMATION FOR SEQ ID NO:51:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 45 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51: GATCCCCATG GTGGTGGTGA TGGTGCAGTT GCTGGGAAGG GCGAT 45
(2) INFORMATION FOR SEQ ID NO:52:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
( i) SEQUENCE DESCRIPTION: SEQ ID NO:52: GATCCCTCGA GCCACCATCA CCACCATCAT G 31
(2) INFORMATION FOR SEQ ID NO:53:
(i.) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: AATTCATGAT GGTGGTGATG GTGGCTCGAG G 31
(2) INFORMATION FOR SEQ ID NO:54:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: CCCGGATCCG CAGACCATCT GGCCAACTGA G 31
(2) INFORMATION FOR SEQ ID NO:55:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:55: GCGCTCGAGG GCATATGGCT GCCAGTGTG 29
(2) INFORMATION FOR SEQ ID NO:56:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 36 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: CGCGCTAGCA GATCTATGGC GCCGAGCTGG AGGTTC 36
(2) INFORMATION FOR SEQ ID NO:57:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 49 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57: CGCGGATCCT ATTAATGGTG GTGATGGTGG TGACTAGTGG ACCCTTCCA 49
(2) INFORMATION FOR SEQ ID NO:58:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: CCCGCTAGCA GATCTATGGG GCTGAGCTAT GGAATTTTC 39
(2) INFORMATION FOR SEQ ID NO:59:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59: CGCACTAGTT GACCCCTCTA TACCATGATC ACTA 34
INDICATIONS RELATING TO A DEPOSITED MICROORGANISM
(PCT Rule I3bis)
A. The indications made below relate to the microorganism referred to in the descπptior on page 37 line 28 and page 38 , lιn s 1-3
B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet | x |
Name of depositary institution
American Type Culture Collection
Address of depositary institution (including postal code and country)
12301 Parklawn Drive Rockville, Maryland 20852 United States of America
Date of deposit Accession Numbers
January 27, 1993 75406 and 75405
C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet | |
"In respect of those designations in which a European patent is sought , a sample of the deposited microorganism will be made available until the publication of the mention of the grant of the European patent or until the date on which the application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person requesting the sample (Rule 23 (4) EPC) . "
D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indicauons are not for all designated States)
E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)
The indications listed below will be submitted to the International Bureau later (specify the general nature ofthe indications e.g., "Accession Number of Deposit")
For International Bureau use only
I j This sheet was received by the International Bureau on:
Authorized officer
INDICATIONS RELATING TO A DEPOSITED MICROORGANISM
(PCT Rule 13bιs)
A. The indications made below relate to the microorganism referred to in the description on page 39 , lines 13-16
B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet [ |
Name of depositary institution
American Type Culture Collection
Address of depositary institution (including postal code and country)
12301 Parklawn Drive Rockville, Maryland 20852 United States of America
Date of deposit Accession Numbers
January 27, 1993 75404 and 75403
C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet | |
"In respect of those designations in which a European patent is sought, a sample of the deposited microorganism will be made available until the publication of the mention of the grant of the European patent or until the date on which the application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person requesting the sample (Rule 23(4) EPC)."
D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (ifthe indications are not for all designated States)
E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)
The indications listed belowwill be submitted to the International Bureau later (specify the general nature of the indications e.g., "Accession Number of Deposit")
For International Bureau use only
I I This sheet was received by the International Bureau on:
Authorized officer
Claims (43)
1. A method for inducing reproducible transient infertility in a mammal which comprises administering to a subject mammal a dose of a zona pellucida protein or fragment thereof, said proteins being selected from the group consisting of mammalian ZPA, mammalian ZPB, and combinations thereof, effective to stimulate production in said mammal of antibodies which recognize ZPA or ZPB protein of said mammal.
2. The method of claim 1 , wherein said mammalian ZPA and ZPB are derived from the same mammalian species as the subject mammal.
3. The method of claim 1 wherein said mammalian ZPA and ZPB are derived from a mammalian species other than the subject mammal.
4. The method of claim 1, wherein said mammalian ZPA or ZPB protein is selected from the group consisting of porcine, canine, feline, bovine, cynomolgus monkey, and human ZPA and ZPB.
5. The method of claim 1 wherein said mammalian ZPA and mammalian ZPB are essentially devoid of ZPC.
6. The method of claim 1 wherein said zona pellucida protein is substantially only ZPA.
7. The method of claim 1 wherein said zona pellucida protein is substantially only ZPB.
8. The method of claim 1 wherein said mammalian ZPA and ZPB is recombinant ZPA and ZPB.
9. The method of claim 1 wherein said antibodies have a titer of at least 1:250.
10. A method for inducing permanent sterility in a mammal which comprises administering to a subject mammal a dose of a recombinant mammalian ZPC protein or fragment thereof, effective to stimulate production in said mammal of antibodies which recognize the ZPC protein of said mammal.
11. The method of claim 10, wherein said mammalian ZPC protein is derived from the same species as the subject mammal.
12. The method of claim 10 wherein said ZPC is derived from a mammalian species other than the subject mammal.
13. The method of claim 10, wherein said mammalian ZPC protein is selected from the group consisting of porcine, rabbit, canine, feline, cynomolgus monkey, and bovine ZPC.
14. The method of claim 10 wherein said ZPC protein is essentially devoid of ZPA and ZPB.
15. A pharmaceutical composition comprising, an effective contraceptive dose of a recombinant ZPC protein or an immunocontraceptively active fragment thereof. U 1
16. A pharmaceutical composition comprising an effective contraceptive dose of a zona pellucida protein selected from the group consisting of mammalian ZPA and ZPB, and fragments thereof, and pharmaceutically acceptable carriers, diluents and adjuvants.
17. The pharmaceutical composition of claim 16 wherein said mammalian ZPA and ZPB are derived from the same mammalian species as the subject mammal.
18. The pharmaceutical composition of claim 16, wherein said mammalian ZPA and ZPB are selected from the group consisting of porcine, feline, canine, bovine, cynomolgus monkey, and human ZPA and ZPB.
19. The pharmaceutical composition of claim 16 wherein said mammalian ZPA and ZPB are essentially devoid of ZPC.
20. The pharmaceutical composition of claim 16, wherein said mammalian ZPA and ZPB is recombinant ZPA and ZPB.
21. A purified and isolated DNA sequence encoding porcine ZPA, ZPB, ZPC, or immunocontraceptively active fragments thereof, said DNA sequences being essentially as set out in SEQ ID NOS. 1, 3, and 5.
22. A purified and isolated DNA sequence encoding rabbit ZPC or an immunocontraceptively active fragment thereof, said DNA sequences being essentially as set out in SEQ ID NO. 7.
23. A purified and isolated DNA sequence encoding canine ZPA or ZPC, or immunocontraceptively active fragments thereof, said DNA sequences being essentially as set out in SEQ ID NOS. 9 and 11.
24. A purified and isolated DNA sequence encoding feline ZPA, ZPB, or ZPC, or immunocontraceptively active fragments thereof, said
DNA sequences being essentially as set out in SEQ ID NOS. 13, 15, and 17.
25. A purified and isolated DNA sequence encoding bovine ZPA, ZPB, or ZPC, or immunocontraceptively active fragments thereof, said DNA sequences being essentially as set out in SEQ ID NOS. 19, 21, and 23.
26. A purified and isolated DNA encoding human ZPA or immunocontraceptively active fragments thereof, comprising DNA present in the human DNA inserts in lambda phage clones Al (ATCC No. 75404) and A4 (ATCC No. 75403).
27. A purified and isolated DNA encoding human ZPA or an immunocontraceptively active fragment thereof, said sequence being essentially as set out as SEQ ID NO. 42.
28. A purified isolated DNA encoding human ZPB or immunocontraceptively active fragments thereof, comprising human DNA present in the DNA inserts in lambda phage clones 1-1 (ATCC No. 75406) and 4-9 (ATCC No. 75405).
29. A purified and isolated DNA encoding human ZPB or an immunocontraceptively active fragments thereof, said sequence being essentially as set out in SEQ ID NO. 40.
30. A vector containing the DNA sequence of claim 21.
31. A vector containing the DNA sequence of claim 22.
32. A vector containing the DNA sequence of claim 23.
33. A vector containing the DNA sequence of claim 24.
34. A vector containing the DNA sequence of claim 25.
35. A vector containing the DNA sequence claim 26.
36. A vector containing the DNA sequence of claim 27.
37. A vector containing the DNA sequence of claim 28.
38. A vector containing the DNA sequence of claim 29.
39. A procaryotic or eucaryotic host cell stably transformed or transfected with a vector according to claims 30, 31 , 32, 33, 34, 35, 36, 37, or 38.
40. A polypeptide product of the expression in a procaryotic or eucaryotic host cell of a DNA sequence according to claims 21, 22, 23, 24, 25, 26, 27, 28 or 29.
41. A process for the production of a recombinant mammalian zona pellucida protein or fragment thereof, said process comprising: growing, under suitable nutrient conditions, procaryotic or eucaryotic host cells transformed or transfected with a DNA vector according to claims 30, 31, 32, 33, 34, 35, 36, or 37 and isolating desired polypeptide products of the expression of DNA sequences in said vector.
42. A method for inducing reproducible transient infertility in a mammal, the method comprising, administering to a subject mammal a contraceptively effective dose of an antibody directed to a zona pellucida protein, said antibody selected from the group consisting of anti-ZPA antibodies and anti-ZPB antibodies.
43. A method for inducing permanent sterility in a mammal, the method comprising administering to a subject mammal a contraceptively effective dose of an antibody directed to ZPC.
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US97334192A | 1992-11-09 | 1992-11-09 | |
US973341 | 1992-11-09 | ||
US1299093A | 1993-01-29 | 1993-01-29 | |
US012990 | 1993-01-29 | ||
PCT/US1993/010851 WO1994011019A1 (en) | 1992-11-09 | 1993-11-06 | Materials and methods for immunocontraception |
Publications (2)
Publication Number | Publication Date |
---|---|
AU5680094A AU5680094A (en) | 1994-06-08 |
AU675269B2 true AU675269B2 (en) | 1997-01-30 |
Family
ID=32965094
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU56800/94A Ceased AU675269B2 (en) | 1992-11-09 | 1993-11-06 | Materials and methods for immunocontraception |
Country Status (9)
Country | Link |
---|---|
US (5) | US6027727A (en) |
EP (1) | EP0634936B1 (en) |
JP (1) | JPH07503142A (en) |
CN (1) | CN1110177A (en) |
AT (1) | ATE275191T1 (en) |
AU (1) | AU675269B2 (en) |
CA (1) | CA2127531A1 (en) |
DE (1) | DE69333610D1 (en) |
WO (1) | WO1994011019A1 (en) |
Families Citing this family (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
USRE37224E1 (en) * | 1992-06-05 | 2001-06-12 | Dalhousie University | Method to prevent fertilization in mammals by administering a single dose of zona pellucida derived antigens, liposome and adjuvant |
FR2772047B1 (en) * | 1997-12-05 | 2004-04-09 | Ct Nat D Etudes Veterinaires E | GENOMIC SEQUENCE AND POLYPEPTIDES OF CIRCOVIRUS ASSOCIATED WITH PIGLET LOSS DISEASE (MAP), APPLICATIONS TO DIAGNOSIS AND TO PREVENTION AND / OR TREATMENT OF INFECTION |
WO1999034825A1 (en) * | 1998-01-02 | 1999-07-15 | The University Of Georgia Research Foundation, Inc. | Fertility impairing vaccine and method of use |
US7037663B2 (en) * | 1998-02-19 | 2006-05-02 | Eastern Virginia Medical School | Human zona pellucida protein 3 and uses thereof |
EP1056858B1 (en) * | 1998-02-19 | 2004-12-08 | Eastern Virginia Medical School | RECOMBINANT ACTIVE HUMAN ZONA PELLUCIDA PROTEIN 3 (hZP3) |
US6455041B1 (en) * | 1998-11-17 | 2002-09-24 | Bonita S. Dunbar | Immunogenic epitopes of the human zona pellucida protein (ZP1) |
AU3962700A (en) | 1999-03-17 | 2000-10-04 | Biosyn Arzneimittel Gmbh | Nucleic acid molecule comprising a nucleic acid sequence coding for a hemocyanin |
DE19939578A1 (en) | 1999-08-20 | 2001-02-22 | Biosyn Arzneimittel Gmbh | New nucleic acid encoding hemocyanin, useful for gene therapy of tumors and for recombinant production of fusion proteins for vaccination |
WO2001090185A2 (en) * | 2000-05-25 | 2001-11-29 | Queen's University At Kingston | Pt32 sperm protein, sperm c-yes, oocyte cytoplasmic c-yes, and uses thereof |
JP4744062B2 (en) * | 2000-10-18 | 2011-08-10 | アメリカ合衆国 | Human genes essential for fertility |
EP1421104A4 (en) * | 2001-08-02 | 2005-08-24 | Trinity Biomedical Technology | Human zona pellucida proteins and methods of their use in diagnosing male infertility |
US20030181409A1 (en) * | 2001-12-14 | 2003-09-25 | The Regents Of The University Of California | Methods of inhibiting fertility |
US7056515B2 (en) | 2002-02-08 | 2006-06-06 | Immunovaccine Technologies Inc. | Antigens for immunocontraception |
EP1474447A2 (en) * | 2002-02-08 | 2004-11-10 | Immunovaccine Technologies Inc. | Antigens for immunocontraception |
EP1606391B1 (en) * | 2003-03-14 | 2011-01-19 | Regents of the University of California | Virulent systemic feline calicivirus |
US7309495B2 (en) * | 2003-03-14 | 2007-12-18 | The Regents Of The University Of California | Hemorrhagic feline calicivirus |
US7639629B2 (en) * | 2006-07-28 | 2009-12-29 | Microsoft Corporation | Security model for application and trading partner integration |
US20080187549A1 (en) * | 2007-02-07 | 2008-08-07 | Morey William A | Polyantigenic-based (multiple antigens) modi or set of methods for developing infertility vaccines |
US20090090367A1 (en) * | 2007-10-03 | 2009-04-09 | Hurley Scott B | Method of contraception |
CN102159240B (en) * | 2008-09-17 | 2014-12-24 | 美国政府健康及人类服务部,疾病控制和预防中心 | Rabies virus-based recombinant immunocontraceptive compositions and methods of use |
CN107158373B (en) * | 2017-05-25 | 2021-06-11 | 新疆大学 | Application of pleurotus ferulae polysaccharide in preparation of dog zona pellucida 3DNA vaccine adjuvant |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1990015624A1 (en) * | 1989-06-12 | 1990-12-27 | The United States Of America, As Represented By The Secretary, U.S. Department Of Commerce | Contraceptive vaccine based on cloned zona pellucida gene |
WO1992003548A1 (en) * | 1990-08-27 | 1992-03-05 | Akzo N.V. | Human zona pellucida protein zp3 |
WO1993014786A1 (en) * | 1992-02-04 | 1993-08-05 | Colorado State University Research Foundation | Composition and method to prevent conception or to cause sterility in animals |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4996297A (en) * | 1987-10-07 | 1991-02-26 | Zonagen, Inc. | Recombinantly expressed rabbit zona pellucida polypeptides |
US5641487A (en) * | 1989-06-12 | 1997-06-24 | The Government Of The United States Of America As Represented By The Secretary Department Of Health And Human Services | Contraceptive vaccine based on alloimmunization with zona pellucida polypeptides |
JPH05317050A (en) * | 1992-01-10 | 1993-12-03 | Shinzou Isojima | Pzp-4 gene and contraceptive vaccine |
-
1993
- 1993-11-06 WO PCT/US1993/010851 patent/WO1994011019A1/en active IP Right Grant
- 1993-11-06 EP EP94902225A patent/EP0634936B1/en not_active Expired - Lifetime
- 1993-11-06 AT AT94902225T patent/ATE275191T1/en not_active IP Right Cessation
- 1993-11-06 JP JP6512318A patent/JPH07503142A/en active Pending
- 1993-11-06 AU AU56800/94A patent/AU675269B2/en not_active Ceased
- 1993-11-06 CA CA002127531A patent/CA2127531A1/en not_active Abandoned
- 1993-11-06 DE DE69333610T patent/DE69333610D1/en not_active Expired - Lifetime
- 1993-11-09 US US08/149,223 patent/US6027727A/en not_active Expired - Fee Related
-
1994
- 1994-03-14 CN CN94103010A patent/CN1110177A/en active Pending
-
1995
- 1995-06-07 US US08/484,596 patent/US5981228A/en not_active Expired - Fee Related
- 1995-06-07 US US08/480,150 patent/US5989550A/en not_active Expired - Fee Related
- 1995-06-07 US US08/484,993 patent/US5837497A/en not_active Expired - Fee Related
- 1995-06-07 US US08/484,158 patent/US5976545A/en not_active Expired - Fee Related
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1990015624A1 (en) * | 1989-06-12 | 1990-12-27 | The United States Of America, As Represented By The Secretary, U.S. Department Of Commerce | Contraceptive vaccine based on cloned zona pellucida gene |
WO1992003548A1 (en) * | 1990-08-27 | 1992-03-05 | Akzo N.V. | Human zona pellucida protein zp3 |
WO1993014786A1 (en) * | 1992-02-04 | 1993-08-05 | Colorado State University Research Foundation | Composition and method to prevent conception or to cause sterility in animals |
Also Published As
Publication number | Publication date |
---|---|
US5981228A (en) | 1999-11-09 |
DE69333610D1 (en) | 2004-10-07 |
AU5680094A (en) | 1994-06-08 |
US5989550A (en) | 1999-11-23 |
JPH07503142A (en) | 1995-04-06 |
US6027727A (en) | 2000-02-22 |
CN1110177A (en) | 1995-10-18 |
US5976545A (en) | 1999-11-02 |
ATE275191T1 (en) | 2004-09-15 |
CA2127531A1 (en) | 1994-05-26 |
EP0634936A1 (en) | 1995-01-25 |
WO1994011019A1 (en) | 1994-05-26 |
US5837497A (en) | 1998-11-17 |
EP0634936B1 (en) | 2004-09-01 |
EP0634936A4 (en) | 1998-07-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU675269B2 (en) | Materials and methods for immunocontraception | |
Xie et al. | Multiple pregnancy-associated glycoproteins are secreted by day 100 ovine placental tissue | |
Hasegawa et al. | Amino acid sequence of a porcine zona pellucida glycoprotein ZP4 determined by peptide mapping and cDNA cloning | |
US6613891B1 (en) | Polynucleotides that encode bovine inhibin | |
Kaul et al. | Expression of bonnet monkey (Macaca radiata) zona pellucida‐3 (ZP3) in a prokaryotic system and its immunogenicity | |
WO1991016922A1 (en) | Analogs of glycoprotein hormones having altered immunological characteristics, efficacy and/or receptor specificity | |
JPS62285792A (en) | Vaccine for zoster virus | |
US5403586A (en) | LHRH-TraTp fusion proteins | |
US6001599A (en) | DNAs encoding mammalian ZPBs | |
Gupta et al. | Sequence of complementary deoxyribonucleic acid encoding bonnet monkey (Macaca radiata) zona pellucida glycoprotein-ZP1 and its high-level expression in Escherichia coli | |
CA2058999A1 (en) | Contraceptive vaccine based on cloned zona pellucida gene | |
CA2090486A1 (en) | Human zona pellucida protein zp3 | |
JP2000236887A (en) | Fusion protein containing carrier capable of inducing double immune reaction | |
US6455041B1 (en) | Immunogenic epitopes of the human zona pellucida protein (ZP1) | |
Yurewicz et al. | Generation and characterization of site-directed antisera against an amino-terminal segment of a 55 kDa sperm adhesive glycoprotein from zona pellucida of pig oocytes | |
DE69322960T2 (en) | CONCEPTIBLE VACCINE | |
AU3594397A (en) | Immunocontraceptive compositions containing sperm antigens, and methods of use | |
EP0478609B1 (en) | Srif-related peptides and uses thereof | |
AU5903986A (en) | Recombinant inhibin | |
JP2787926B2 (en) | Fusion protein | |
by a Fertilization-Blocking et al. | Reproductive Immunology SK Gupta (Ed) Copyright© 1999 Narosa Publishing House, New Delhi, India | |
Koyama et al. | Antibody responses to a synthetic pZP4 peptide recognized by a fertilization-blocking monoclonal antibody | |
Hasegawa et al. | Evaluation of the contraceptive potential of recombinant proteins and synthetic peptides of zona pellucida (ZP) | |
AU2004203523A1 (en) | Immunocontraceptive compositions containing sperm antigens, and methods of use | |
WO1997039020A2 (en) | Antigenic sequences of a sperm protein and immunocontraceptive methods |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
MK14 | Patent ceased section 143(a) (annual fees not paid) or expired |