CN115247187A - 表达三种人源基因的SARS-CoV-2易感模型猪的构建方法及其应用 - Google Patents
表达三种人源基因的SARS-CoV-2易感模型猪的构建方法及其应用 Download PDFInfo
- Publication number
- CN115247187A CN115247187A CN202210121045.9A CN202210121045A CN115247187A CN 115247187 A CN115247187 A CN 115247187A CN 202210121045 A CN202210121045 A CN 202210121045A CN 115247187 A CN115247187 A CN 115247187A
- Authority
- CN
- China
- Prior art keywords
- pig
- cov
- sars
- plasmid
- susceptible
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 122
- 241001678559 COVID-19 virus Species 0.000 title claims abstract description 69
- 238000000034 method Methods 0.000 title claims description 27
- 241000282414 Homo sapiens Species 0.000 title abstract description 39
- 108091033409 CRISPR Proteins 0.000 claims abstract description 36
- 101000638154 Homo sapiens Transmembrane protease serine 2 Proteins 0.000 claims abstract description 18
- 238000012216 screening Methods 0.000 claims abstract description 18
- 229960005486 vaccine Drugs 0.000 claims abstract description 12
- 239000003814 drug Substances 0.000 claims abstract description 11
- 229940079593 drug Drugs 0.000 claims abstract description 8
- 230000000694 effects Effects 0.000 claims abstract description 7
- 230000008506 pathogenesis Effects 0.000 claims abstract description 6
- 239000013612 plasmid Substances 0.000 claims description 156
- 108020004414 DNA Proteins 0.000 claims description 134
- 239000002773 nucleotide Substances 0.000 claims description 102
- 125000003729 nucleotide group Chemical group 0.000 claims description 102
- 102000053602 DNA Human genes 0.000 claims description 83
- 108091027544 Subgenomic mRNA Proteins 0.000 claims description 57
- 108010023337 axl receptor tyrosine kinase Proteins 0.000 claims description 20
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 20
- 238000011144 upstream manufacturing Methods 0.000 claims description 16
- 210000000056 organ Anatomy 0.000 claims description 15
- 238000002360 preparation method Methods 0.000 claims description 12
- 101100433975 Homo sapiens ACE2 gene Proteins 0.000 claims description 11
- 238000013334 tissue model Methods 0.000 claims description 8
- 101150008656 COL1A1 gene Proteins 0.000 claims description 7
- 206010035664 Pneumonia Diseases 0.000 claims description 6
- 101000929928 Homo sapiens Angiotensin-converting enzyme 2 Proteins 0.000 claims description 5
- 102000048657 human ACE2 Human genes 0.000 claims description 4
- 102000049800 human TMPRSS2 Human genes 0.000 claims description 4
- 238000004519 manufacturing process Methods 0.000 claims description 3
- 210000004027 cell Anatomy 0.000 abstract description 140
- 101150028074 2 gene Proteins 0.000 abstract description 20
- 238000005516 engineering process Methods 0.000 abstract description 12
- 238000010354 CRISPR gene editing Methods 0.000 abstract description 6
- 238000010276 construction Methods 0.000 abstract description 6
- 230000006801 homologous recombination Effects 0.000 abstract description 6
- 238000002744 homologous recombination Methods 0.000 abstract description 6
- 210000001082 somatic cell Anatomy 0.000 abstract description 6
- 238000011160 research Methods 0.000 abstract description 5
- 238000011156 evaluation Methods 0.000 abstract description 4
- 101150018445 Axl gene Proteins 0.000 abstract description 3
- 238000010449 nuclear transplantation Methods 0.000 abstract description 3
- 241000282898 Sus scrofa Species 0.000 description 115
- 102000004169 proteins and genes Human genes 0.000 description 47
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N puromycin Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 description 46
- 235000018102 proteins Nutrition 0.000 description 42
- 238000003780 insertion Methods 0.000 description 36
- 230000037431 insertion Effects 0.000 description 36
- 230000014509 gene expression Effects 0.000 description 35
- 239000002609 medium Substances 0.000 description 31
- 102100033601 Collagen alpha-1(I) chain Human genes 0.000 description 28
- 108010029483 alpha 1 Chain Collagen Type I Proteins 0.000 description 28
- 229950010131 puromycin Drugs 0.000 description 23
- 210000002950 fibroblast Anatomy 0.000 description 22
- 101001000998 Homo sapiens Protein phosphatase 1 regulatory subunit 12C Proteins 0.000 description 18
- 241000282887 Suidae Species 0.000 description 18
- 210000001132 alveolar macrophage Anatomy 0.000 description 18
- 108010077850 Nuclear Localization Signals Proteins 0.000 description 16
- 102100035620 Protein phosphatase 1 regulatory subunit 12C Human genes 0.000 description 15
- 108091034057 RNA (poly(A)) Proteins 0.000 description 15
- 239000013598 vector Substances 0.000 description 15
- 241001465754 Metazoa Species 0.000 description 14
- 210000000287 oocyte Anatomy 0.000 description 14
- 101800001494 Protease 2A Proteins 0.000 description 13
- 101800001066 Protein 2A Proteins 0.000 description 13
- 241001112090 Pseudovirus Species 0.000 description 12
- 238000001514 detection method Methods 0.000 description 12
- 239000001963 growth medium Substances 0.000 description 12
- 239000000243 solution Substances 0.000 description 12
- 238000012258 culturing Methods 0.000 description 11
- 208000015181 infectious disease Diseases 0.000 description 11
- 239000012212 insulator Substances 0.000 description 11
- 102220289632 rs33941849 Human genes 0.000 description 11
- 101150066002 GFP gene Proteins 0.000 description 10
- 210000002257 embryonic structure Anatomy 0.000 description 10
- 230000004927 fusion Effects 0.000 description 10
- 210000001161 mammalian embryo Anatomy 0.000 description 9
- 238000012360 testing method Methods 0.000 description 9
- 238000013518 transcription Methods 0.000 description 9
- 230000035897 transcription Effects 0.000 description 9
- 150000001413 amino acids Chemical group 0.000 description 8
- 238000003776 cleavage reaction Methods 0.000 description 8
- 230000007017 scission Effects 0.000 description 8
- 108020004705 Codon Proteins 0.000 description 7
- 238000010367 cloning Methods 0.000 description 7
- 108020001507 fusion proteins Proteins 0.000 description 7
- 102000037865 fusion proteins Human genes 0.000 description 7
- 238000010362 genome editing Methods 0.000 description 7
- 102000005962 receptors Human genes 0.000 description 7
- 108020003175 receptors Proteins 0.000 description 7
- 239000006228 supernatant Substances 0.000 description 7
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 6
- 101100118093 Drosophila melanogaster eEF1alpha2 gene Proteins 0.000 description 6
- 102100021579 Enhancer of filamentation 1 Human genes 0.000 description 6
- 241000282412 Homo Species 0.000 description 6
- 101000898310 Homo sapiens Enhancer of filamentation 1 Proteins 0.000 description 6
- 208000009869 Neu-Laxova syndrome Diseases 0.000 description 6
- 201000003176 Severe Acute Respiratory Syndrome Diseases 0.000 description 6
- 241000700605 Viruses Species 0.000 description 6
- 125000000539 amino acid group Chemical group 0.000 description 6
- 238000005520 cutting process Methods 0.000 description 6
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 6
- 210000003917 human chromosome Anatomy 0.000 description 6
- 238000000338 in vitro Methods 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 6
- 210000001519 tissue Anatomy 0.000 description 6
- 238000012546 transfer Methods 0.000 description 6
- 238000005406 washing Methods 0.000 description 6
- 102000053723 Angiotensin-converting enzyme 2 Human genes 0.000 description 5
- 108090000975 Angiotensin-converting enzyme 2 Proteins 0.000 description 5
- 241000711573 Coronaviridae Species 0.000 description 5
- 108020005004 Guide RNA Proteins 0.000 description 5
- 241000288906 Primates Species 0.000 description 5
- 235000011449 Rosa Nutrition 0.000 description 5
- 230000004913 activation Effects 0.000 description 5
- 239000000872 buffer Substances 0.000 description 5
- 238000011534 incubation Methods 0.000 description 5
- 230000010354 integration Effects 0.000 description 5
- 239000007788 liquid Substances 0.000 description 5
- 210000004072 lung Anatomy 0.000 description 5
- 239000002244 precipitate Substances 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- 238000010374 somatic cell nuclear transfer Methods 0.000 description 5
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 4
- 239000007995 HEPES buffer Substances 0.000 description 4
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 4
- 108010079364 N-glycylalanine Proteins 0.000 description 4
- 108091028043 Nucleic acid sequence Proteins 0.000 description 4
- 229940096437 Protein S Drugs 0.000 description 4
- 241000315672 SARS coronavirus Species 0.000 description 4
- 101000629318 Severe acute respiratory syndrome coronavirus 2 Spike glycoprotein Proteins 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- 101710198474 Spike protein Proteins 0.000 description 4
- 101710081844 Transmembrane protease serine 2 Proteins 0.000 description 4
- 102100031989 Transmembrane protease serine 2 Human genes 0.000 description 4
- 102000004142 Trypsin Human genes 0.000 description 4
- 108090000631 Trypsin Proteins 0.000 description 4
- 102100037236 Tyrosine-protein kinase receptor UFO Human genes 0.000 description 4
- GBOGMAARMMDZGR-UHFFFAOYSA-N UNPD149280 Natural products N1C(=O)C23OC(=O)C=CC(O)CCCC(C)CC=CC3C(O)C(=C)C(C)C2C1CC1=CC=CC=C1 GBOGMAARMMDZGR-UHFFFAOYSA-N 0.000 description 4
- 230000003213 activating effect Effects 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 108010077245 asparaginyl-proline Proteins 0.000 description 4
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 4
- 108010093581 aspartyl-proline Proteins 0.000 description 4
- 239000007853 buffer solution Substances 0.000 description 4
- 238000004113 cell culture Methods 0.000 description 4
- 210000001771 cumulus cell Anatomy 0.000 description 4
- GBOGMAARMMDZGR-JREHFAHYSA-N cytochalasin B Natural products C[C@H]1CCC[C@@H](O)C=CC(=O)O[C@@]23[C@H](C=CC1)[C@H](O)C(=C)[C@@H](C)[C@@H]2[C@H](Cc4ccccc4)NC3=O GBOGMAARMMDZGR-JREHFAHYSA-N 0.000 description 4
- GBOGMAARMMDZGR-TYHYBEHESA-N cytochalasin B Chemical compound C([C@H]1[C@@H]2[C@@H](C([C@@H](O)[C@@H]3/C=C/C[C@H](C)CCC[C@@H](O)/C=C/C(=O)O[C@@]23C(=O)N1)=C)C)C1=CC=CC=C1 GBOGMAARMMDZGR-TYHYBEHESA-N 0.000 description 4
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 4
- 108010048367 enhanced green fluorescent protein Proteins 0.000 description 4
- 239000003623 enhancer Substances 0.000 description 4
- 239000012634 fragment Substances 0.000 description 4
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 4
- 230000035800 maturation Effects 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 229910052754 neon Inorganic materials 0.000 description 4
- GKAOGPIIYCISHV-UHFFFAOYSA-N neon atom Chemical compound [Ne] GKAOGPIIYCISHV-UHFFFAOYSA-N 0.000 description 4
- 150000007523 nucleic acids Chemical group 0.000 description 4
- 210000001672 ovary Anatomy 0.000 description 4
- 230000002093 peripheral effect Effects 0.000 description 4
- 210000004508 polar body Anatomy 0.000 description 4
- 108010053725 prolylvaline Proteins 0.000 description 4
- 238000003753 real-time PCR Methods 0.000 description 4
- 229920006395 saturated elastomer Polymers 0.000 description 4
- 229960005322 streptomycin Drugs 0.000 description 4
- 210000003437 trachea Anatomy 0.000 description 4
- 238000001890 transfection Methods 0.000 description 4
- 239000012588 trypsin Substances 0.000 description 4
- 102000007469 Actins Human genes 0.000 description 3
- 108010085238 Actins Proteins 0.000 description 3
- 102100030988 Angiotensin-converting enzyme Human genes 0.000 description 3
- 101710185050 Angiotensin-converting enzyme Proteins 0.000 description 3
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 3
- 108700039887 Essential Genes Proteins 0.000 description 3
- 102100021519 Hemoglobin subunit beta Human genes 0.000 description 3
- 108091005904 Hemoglobin subunit beta Proteins 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 241000699670 Mus sp. Species 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- 229930182555 Penicillin Natural products 0.000 description 3
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 3
- 108010076504 Protein Sorting Signals Proteins 0.000 description 3
- 108091093126 WHP Posttrascriptional Response Element Proteins 0.000 description 3
- 108010044940 alanylglutamine Proteins 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 238000010171 animal model Methods 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 230000018109 developmental process Effects 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 239000012091 fetal bovine serum Substances 0.000 description 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 3
- 108010050848 glycylleucine Proteins 0.000 description 3
- 108010037850 glycylvaline Proteins 0.000 description 3
- 108010057821 leucylproline Proteins 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 229940049954 penicillin Drugs 0.000 description 3
- 108010012581 phenylalanylglutamate Proteins 0.000 description 3
- 230000035479 physiological effects, processes and functions Effects 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 108010026333 seryl-proline Proteins 0.000 description 3
- 238000002054 transplantation Methods 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 2
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 2
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 description 2
- 208000025721 COVID-19 Diseases 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- 102100031673 Corneodesmosin Human genes 0.000 description 2
- 101710139375 Corneodesmosin Proteins 0.000 description 2
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 2
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 2
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 2
- 101150112014 Gapdh gene Proteins 0.000 description 2
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 2
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 2
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- ZVKDCQVQTGYBQT-LSJOCFKGSA-N His-Pro-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O ZVKDCQVQTGYBQT-LSJOCFKGSA-N 0.000 description 2
- 108010003272 Hyaluronate lyase Proteins 0.000 description 2
- 102000001974 Hyaluronidases Human genes 0.000 description 2
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 2
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 2
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 2
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 2
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 2
- 241000699666 Mus <mouse, genus> Species 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 2
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 2
- LCTONWCANYUPML-UHFFFAOYSA-N Pyruvic acid Chemical compound CC(=O)C(O)=O LCTONWCANYUPML-UHFFFAOYSA-N 0.000 description 2
- 241000700159 Rattus Species 0.000 description 2
- 108700008625 Reporter Genes Proteins 0.000 description 2
- 208000037847 SARS-CoV-2-infection Diseases 0.000 description 2
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 2
- SVWQEIRZHHNBIO-WHFBIAKZSA-N Ser-Gly-Cys Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CS)C(O)=O SVWQEIRZHHNBIO-WHFBIAKZSA-N 0.000 description 2
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 2
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 2
- 229930006000 Sucrose Natural products 0.000 description 2
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 2
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 2
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 2
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 230000000903 blocking effect Effects 0.000 description 2
- 229940098773 bovine serum albumin Drugs 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 238000010370 cell cloning Methods 0.000 description 2
- 230000007910 cell fusion Effects 0.000 description 2
- 210000003855 cell nucleus Anatomy 0.000 description 2
- 239000006285 cell suspension Substances 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 238000012761 co-transfection Methods 0.000 description 2
- 238000007405 data analysis Methods 0.000 description 2
- 230000000857 drug effect Effects 0.000 description 2
- 238000007877 drug screening Methods 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 239000003797 essential amino acid Substances 0.000 description 2
- 235000020776 essential amino acid Nutrition 0.000 description 2
- 230000012173 estrus Effects 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 239000000706 filtrate Substances 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 108010049041 glutamylalanine Proteins 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 229960002773 hyaluronidase Drugs 0.000 description 2
- 230000003834 intracellular effect Effects 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 230000013011 mating Effects 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 238000000520 microinjection Methods 0.000 description 2
- 239000002480 mineral oil Substances 0.000 description 2
- 235000010446 mineral oil Nutrition 0.000 description 2
- 239000012452 mother liquor Substances 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 210000003101 oviduct Anatomy 0.000 description 2
- 238000004806 packaging method and process Methods 0.000 description 2
- 230000001575 pathological effect Effects 0.000 description 2
- 230000007170 pathology Effects 0.000 description 2
- 108010073101 phenylalanylleucine Proteins 0.000 description 2
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 2
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 2
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 2
- 230000035935 pregnancy Effects 0.000 description 2
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 238000005215 recombination Methods 0.000 description 2
- 108010054624 red fluorescent protein Proteins 0.000 description 2
- 230000008672 reprogramming Effects 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 238000010839 reverse transcription Methods 0.000 description 2
- 230000001568 sexual effect Effects 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 229960002920 sorbitol Drugs 0.000 description 2
- 238000007619 statistical method Methods 0.000 description 2
- 239000005720 sucrose Substances 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- 230000009385 viral infection Effects 0.000 description 2
- 230000003612 virological effect Effects 0.000 description 2
- QIJRTFXNRTXDIP-UHFFFAOYSA-N (1-carboxy-2-sulfanylethyl)azanium;chloride;hydrate Chemical compound O.Cl.SCC(N)C(O)=O QIJRTFXNRTXDIP-UHFFFAOYSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 1
- HGRBNYQIMKTUNT-XVYDVKMFSA-N Ala-Asn-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HGRBNYQIMKTUNT-XVYDVKMFSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 1
- WJRXVTCKASUIFF-FXQIFTODSA-N Ala-Cys-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WJRXVTCKASUIFF-FXQIFTODSA-N 0.000 description 1
- UQJUGHFKNKGHFQ-VZFHVOOUSA-N Ala-Cys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UQJUGHFKNKGHFQ-VZFHVOOUSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- 108010076441 Ala-His-His Proteins 0.000 description 1
- ATAKEVCGTRZKLI-UWJYBYFXSA-N Ala-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 ATAKEVCGTRZKLI-UWJYBYFXSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 1
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- PXAFZDXYEIIUTF-LKTVYLICSA-N Ala-Trp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXAFZDXYEIIUTF-LKTVYLICSA-N 0.000 description 1
- BHFOJPDOQPWJRN-XDTLVQLUSA-N Ala-Tyr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(N)=O)C(O)=O BHFOJPDOQPWJRN-XDTLVQLUSA-N 0.000 description 1
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 description 1
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 1
- 101710199744 Anionic trypsin-2 Proteins 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 1
- YHSNASXGBPAHRL-BPUTZDHNSA-N Arg-Cys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N YHSNASXGBPAHRL-BPUTZDHNSA-N 0.000 description 1
- JUWQNWXEGDYCIE-YUMQZZPRSA-N Arg-Gln-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O JUWQNWXEGDYCIE-YUMQZZPRSA-N 0.000 description 1
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 1
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 1
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 1
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 1
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- OGSQONVYSTZIJB-WDSOQIARSA-N Arg-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OGSQONVYSTZIJB-WDSOQIARSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- XFXZKCRBBOVJKS-BVSLBCMMSA-N Arg-Phe-Trp Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 XFXZKCRBBOVJKS-BVSLBCMMSA-N 0.000 description 1
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 1
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 1
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 1
- YJRORCOAFUZVKA-FXQIFTODSA-N Asn-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N YJRORCOAFUZVKA-FXQIFTODSA-N 0.000 description 1
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 1
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 1
- UEONJSPBTSWKOI-CIUDSAMLSA-N Asn-Gln-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O UEONJSPBTSWKOI-CIUDSAMLSA-N 0.000 description 1
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- QEQVUHQQYDZUEN-GUBZILKMSA-N Asn-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N QEQVUHQQYDZUEN-GUBZILKMSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 1
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 1
- KNENKKKUYGEZIO-FXQIFTODSA-N Asn-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N KNENKKKUYGEZIO-FXQIFTODSA-N 0.000 description 1
- FTNRWCPWDWRPAV-BZSNNMDCSA-N Asn-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTNRWCPWDWRPAV-BZSNNMDCSA-N 0.000 description 1
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 description 1
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- ZNYKKCADEQAZKA-FXQIFTODSA-N Asn-Ser-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O ZNYKKCADEQAZKA-FXQIFTODSA-N 0.000 description 1
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 1
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 1
- RTFXPCYMDYBZNQ-SRVKXCTJSA-N Asn-Tyr-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O RTFXPCYMDYBZNQ-SRVKXCTJSA-N 0.000 description 1
- QUCCLIXMVPIVOB-BZSNNMDCSA-N Asn-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N QUCCLIXMVPIVOB-BZSNNMDCSA-N 0.000 description 1
- XLDMSQYOYXINSZ-QXEWZRGKSA-N Asn-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLDMSQYOYXINSZ-QXEWZRGKSA-N 0.000 description 1
- LTDGPJKGJDIBQD-LAEOZQHASA-N Asn-Val-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LTDGPJKGJDIBQD-LAEOZQHASA-N 0.000 description 1
- XZFONYMRYTVLPL-NHCYSSNCSA-N Asn-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N XZFONYMRYTVLPL-NHCYSSNCSA-N 0.000 description 1
- SYZWMVSXBZCOBZ-QXEWZRGKSA-N Asn-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N SYZWMVSXBZCOBZ-QXEWZRGKSA-N 0.000 description 1
- GBAWQWASNGUNQF-ZLUOBGJFSA-N Asp-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N GBAWQWASNGUNQF-ZLUOBGJFSA-N 0.000 description 1
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 1
- DXQOQMCLWWADMU-ACZMJKKPSA-N Asp-Gln-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DXQOQMCLWWADMU-ACZMJKKPSA-N 0.000 description 1
- JRBVWZLHBGYZNY-QEJZJMRPSA-N Asp-Gln-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRBVWZLHBGYZNY-QEJZJMRPSA-N 0.000 description 1
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- OEDJQRXNDRUGEU-SRVKXCTJSA-N Asp-Leu-His Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O OEDJQRXNDRUGEU-SRVKXCTJSA-N 0.000 description 1
- RNAQPBOOJRDICC-BPUTZDHNSA-N Asp-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N RNAQPBOOJRDICC-BPUTZDHNSA-N 0.000 description 1
- BKOIIURTQAJHAT-GUBZILKMSA-N Asp-Pro-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 BKOIIURTQAJHAT-GUBZILKMSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 1
- NVXLFIPTHPKSKL-UBHSHLNASA-N Asp-Trp-Asn Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 NVXLFIPTHPKSKL-UBHSHLNASA-N 0.000 description 1
- KACWACLNYLSVCA-VHWLVUOQSA-N Asp-Trp-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KACWACLNYLSVCA-VHWLVUOQSA-N 0.000 description 1
- BOXNGMVEVOGXOJ-UBHSHLNASA-N Asp-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N BOXNGMVEVOGXOJ-UBHSHLNASA-N 0.000 description 1
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 1
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 1
- ALMIMUZAWTUNIO-BZSNNMDCSA-N Asp-Tyr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ALMIMUZAWTUNIO-BZSNNMDCSA-N 0.000 description 1
- GFYOIYJJMSHLSN-QXEWZRGKSA-N Asp-Val-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GFYOIYJJMSHLSN-QXEWZRGKSA-N 0.000 description 1
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 1
- 229940022962 COVID-19 vaccine Drugs 0.000 description 1
- 102000029816 Collagenase Human genes 0.000 description 1
- 108060005980 Collagenase Proteins 0.000 description 1
- 101800004637 Communis Proteins 0.000 description 1
- 208000001528 Coronaviridae Infections Diseases 0.000 description 1
- VZKXOWRNJDEGLZ-WHFBIAKZSA-N Cys-Asp-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O VZKXOWRNJDEGLZ-WHFBIAKZSA-N 0.000 description 1
- YZFCGHIBLBDZDA-ZLUOBGJFSA-N Cys-Asp-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YZFCGHIBLBDZDA-ZLUOBGJFSA-N 0.000 description 1
- MGAWEOHYNIMOQJ-ACZMJKKPSA-N Cys-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MGAWEOHYNIMOQJ-ACZMJKKPSA-N 0.000 description 1
- KEBJBKIASQVRJS-WDSKDSINSA-N Cys-Gln-Gly Chemical compound C(CC(=O)N)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N KEBJBKIASQVRJS-WDSKDSINSA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- XZKJEOMFLDVXJG-KATARQTJSA-N Cys-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)N)O XZKJEOMFLDVXJG-KATARQTJSA-N 0.000 description 1
- HSAWNMMTZCLTPY-DCAQKATOSA-N Cys-Met-Leu Chemical compound SC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HSAWNMMTZCLTPY-DCAQKATOSA-N 0.000 description 1
- MWVDDZUTWXFYHL-XKBZYTNZSA-N Cys-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)O MWVDDZUTWXFYHL-XKBZYTNZSA-N 0.000 description 1
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 1
- 102400001368 Epidermal growth factor Human genes 0.000 description 1
- 101800003838 Epidermal growth factor Proteins 0.000 description 1
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 1
- 108010010803 Gelatin Proteins 0.000 description 1
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- MWLYSLMKFXWZPW-ZPFDUUQYSA-N Gln-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCC(N)=O MWLYSLMKFXWZPW-ZPFDUUQYSA-N 0.000 description 1
- ULXXDWZMMSQBDC-ACZMJKKPSA-N Gln-Asp-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ULXXDWZMMSQBDC-ACZMJKKPSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 1
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 1
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 1
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 1
- GQTNWYFWSUFFRA-KKUMJFAQSA-N Gln-Met-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GQTNWYFWSUFFRA-KKUMJFAQSA-N 0.000 description 1
- HHRAEXBUNGTOGZ-IHRRRGAJSA-N Gln-Phe-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O HHRAEXBUNGTOGZ-IHRRRGAJSA-N 0.000 description 1
- DOQUICBEISTQHE-CIUDSAMLSA-N Gln-Pro-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O DOQUICBEISTQHE-CIUDSAMLSA-N 0.000 description 1
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 1
- MQJDLNRXBOELJW-KKUMJFAQSA-N Gln-Pro-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O MQJDLNRXBOELJW-KKUMJFAQSA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- CSMHMEATMDCQNY-DZKIICNBSA-N Gln-Val-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CSMHMEATMDCQNY-DZKIICNBSA-N 0.000 description 1
- RSUVOPBMWMTVDI-XEGUGMAKSA-N Glu-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(O)=O)C)C(O)=O)=CNC2=C1 RSUVOPBMWMTVDI-XEGUGMAKSA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 1
- ZJICFHQSPWFBKP-AVGNSLFASA-N Glu-Asn-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZJICFHQSPWFBKP-AVGNSLFASA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 1
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 1
- TWYFJOHWGCCRIR-DCAQKATOSA-N Glu-Pro-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYFJOHWGCCRIR-DCAQKATOSA-N 0.000 description 1
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 1
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 1
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 1
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 1
- JPUNZXVHHRZMNL-XIRDDKMYSA-N Glu-Pro-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JPUNZXVHHRZMNL-XIRDDKMYSA-N 0.000 description 1
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- RZMXBFUSQNLEQF-QEJZJMRPSA-N Glu-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N RZMXBFUSQNLEQF-QEJZJMRPSA-N 0.000 description 1
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 1
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 1
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 1
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 1
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- JMQFHZWESBGPFC-WDSKDSINSA-N Gly-Gln-Asp Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JMQFHZWESBGPFC-WDSKDSINSA-N 0.000 description 1
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 1
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 1
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 1
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 1
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 1
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- RUDRIZRGOLQSMX-IUCAKERBSA-N Gly-Met-Met Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O RUDRIZRGOLQSMX-IUCAKERBSA-N 0.000 description 1
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 1
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 1
- ISSDODCYBOWWIP-GJZGRUSLSA-N Gly-Pro-Trp Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISSDODCYBOWWIP-GJZGRUSLSA-N 0.000 description 1
- JNGHLWWFPGIJER-STQMWFEESA-N Gly-Pro-Tyr Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JNGHLWWFPGIJER-STQMWFEESA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- ZKJZBRHRWKLVSJ-ZDLURKLDSA-N Gly-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O ZKJZBRHRWKLVSJ-ZDLURKLDSA-N 0.000 description 1
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 description 1
- ONSARSFSJHTMFJ-STQMWFEESA-N Gly-Trp-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ONSARSFSJHTMFJ-STQMWFEESA-N 0.000 description 1
- NWOSHVVPKDQKKT-RYUDHWBXSA-N Gly-Tyr-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O NWOSHVVPKDQKKT-RYUDHWBXSA-N 0.000 description 1
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 1
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 1
- VPZXBVLAVMBEQI-VKHMYHEASA-N Glycyl-alanine Chemical compound OC(=O)[C@H](C)NC(=O)CN VPZXBVLAVMBEQI-VKHMYHEASA-N 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- -1 H11 Proteins 0.000 description 1
- 229920000209 Hexadimethrine bromide Polymers 0.000 description 1
- ZNPRMNDAFQKATM-LKTVYLICSA-N His-Ala-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZNPRMNDAFQKATM-LKTVYLICSA-N 0.000 description 1
- RXVOMIADLXPJGW-GUBZILKMSA-N His-Asp-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RXVOMIADLXPJGW-GUBZILKMSA-N 0.000 description 1
- MWXBCJKQRQFVOO-DCAQKATOSA-N His-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CN=CN1)N MWXBCJKQRQFVOO-DCAQKATOSA-N 0.000 description 1
- VHHYJBSXXMPQGZ-AVGNSLFASA-N His-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N VHHYJBSXXMPQGZ-AVGNSLFASA-N 0.000 description 1
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 1
- AKEDPWJFQULLPE-IUCAKERBSA-N His-Glu-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O AKEDPWJFQULLPE-IUCAKERBSA-N 0.000 description 1
- FZKFYOXDVWDELO-KBPBESRZSA-N His-Gly-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FZKFYOXDVWDELO-KBPBESRZSA-N 0.000 description 1
- WJGSTIMGSIWHJX-HVTMNAMFSA-N His-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WJGSTIMGSIWHJX-HVTMNAMFSA-N 0.000 description 1
- RNMNYMDTESKEAJ-KKUMJFAQSA-N His-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 RNMNYMDTESKEAJ-KKUMJFAQSA-N 0.000 description 1
- ALPXXNRQBMRCPZ-MEYUZBJRSA-N His-Thr-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ALPXXNRQBMRCPZ-MEYUZBJRSA-N 0.000 description 1
- UXZMINKIEWBEQU-SZMVWBNQSA-N His-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N UXZMINKIEWBEQU-SZMVWBNQSA-N 0.000 description 1
- WSXNWASHQNSMRX-GVXVVHGQSA-N His-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WSXNWASHQNSMRX-GVXVVHGQSA-N 0.000 description 1
- 101000773743 Homo sapiens Angiotensin-converting enzyme Proteins 0.000 description 1
- CISBRYJZMFWOHJ-JBDRJPRFSA-N Ile-Ala-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N CISBRYJZMFWOHJ-JBDRJPRFSA-N 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- CWJQMCPYXNVMBS-STECZYCISA-N Ile-Arg-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N CWJQMCPYXNVMBS-STECZYCISA-N 0.000 description 1
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 1
- ZZHGKECPZXPXJF-PCBIJLKTSA-N Ile-Asn-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZZHGKECPZXPXJF-PCBIJLKTSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- VQUCKIAECLVLAD-SVSWQMSJSA-N Ile-Cys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VQUCKIAECLVLAD-SVSWQMSJSA-N 0.000 description 1
- OVPYIUNCVSOVNF-KQXIARHKSA-N Ile-Gln-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N OVPYIUNCVSOVNF-KQXIARHKSA-N 0.000 description 1
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 1
- AREBLHSMLMRICD-PYJNHQTQSA-N Ile-His-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AREBLHSMLMRICD-PYJNHQTQSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 1
- PWUMCBLVWPCKNO-MGHWNKPDSA-N Ile-Leu-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PWUMCBLVWPCKNO-MGHWNKPDSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 1
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 1
- NPAYJTAXWXJKLO-NAKRPEOUSA-N Ile-Met-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N NPAYJTAXWXJKLO-NAKRPEOUSA-N 0.000 description 1
- SVZFKLBRCYCIIY-CYDGBPFRSA-N Ile-Pro-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SVZFKLBRCYCIIY-CYDGBPFRSA-N 0.000 description 1
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 1
- ZDNNDIJTUHQCAM-MXAVVETBSA-N Ile-Ser-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ZDNNDIJTUHQCAM-MXAVVETBSA-N 0.000 description 1
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 1
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- 229930182816 L-glutamine Natural products 0.000 description 1
- JVTAAEKCZFNVCJ-UHFFFAOYSA-M Lactate Chemical compound CC(O)C([O-])=O JVTAAEKCZFNVCJ-UHFFFAOYSA-M 0.000 description 1
- 241000713666 Lentivirus Species 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- CNNQBZRGQATKNY-DCAQKATOSA-N Leu-Arg-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N CNNQBZRGQATKNY-DCAQKATOSA-N 0.000 description 1
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 1
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 1
- QKIBIXAQKAFZGL-GUBZILKMSA-N Leu-Cys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QKIBIXAQKAFZGL-GUBZILKMSA-N 0.000 description 1
- PPTAQBNUFKTJKA-BJDJZHNGSA-N Leu-Cys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PPTAQBNUFKTJKA-BJDJZHNGSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 1
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- ARRIJPQRBWRNLT-DCAQKATOSA-N Leu-Met-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ARRIJPQRBWRNLT-DCAQKATOSA-N 0.000 description 1
- NHRINZSPIUXYQZ-DCAQKATOSA-N Leu-Met-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N NHRINZSPIUXYQZ-DCAQKATOSA-N 0.000 description 1
- IBSGMIPRBMPMHE-IHRRRGAJSA-N Leu-Met-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O IBSGMIPRBMPMHE-IHRRRGAJSA-N 0.000 description 1
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- KTOIECMYZZGVSI-BZSNNMDCSA-N Leu-Phe-His Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 KTOIECMYZZGVSI-BZSNNMDCSA-N 0.000 description 1
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 1
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 1
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 1
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 1
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 1
- HGZHSNBZDOLMLH-DCAQKATOSA-N Lys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N HGZHSNBZDOLMLH-DCAQKATOSA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- HIIZIQUUHIXUJY-GUBZILKMSA-N Lys-Asp-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HIIZIQUUHIXUJY-GUBZILKMSA-N 0.000 description 1
- RDIILCRAWOSDOQ-CIUDSAMLSA-N Lys-Cys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RDIILCRAWOSDOQ-CIUDSAMLSA-N 0.000 description 1
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 1
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 1
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 1
- VLMNBMFYRMGEMB-QWRGUYRKSA-N Lys-His-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 VLMNBMFYRMGEMB-QWRGUYRKSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- KVNLHIXLLZBAFQ-RWMBFGLXSA-N Lys-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N KVNLHIXLLZBAFQ-RWMBFGLXSA-N 0.000 description 1
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 1
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 1
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 1
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- RYOLKFYZBHMYFW-WDSOQIARSA-N Lys-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 RYOLKFYZBHMYFW-WDSOQIARSA-N 0.000 description 1
- GVKINWYYLOLEFQ-XIRDDKMYSA-N Lys-Trp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O GVKINWYYLOLEFQ-XIRDDKMYSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- 102000018697 Membrane Proteins Human genes 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- DLAFCQWUMFMZSN-GUBZILKMSA-N Met-Arg-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N DLAFCQWUMFMZSN-GUBZILKMSA-N 0.000 description 1
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 1
- XMMWDTUFTZMQFD-GMOBBJLQSA-N Met-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCSC XMMWDTUFTZMQFD-GMOBBJLQSA-N 0.000 description 1
- SLQDSYZHHOKQSR-QXEWZRGKSA-N Met-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCSC SLQDSYZHHOKQSR-QXEWZRGKSA-N 0.000 description 1
- BCRQJDMZQUHQSV-STQMWFEESA-N Met-Gly-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BCRQJDMZQUHQSV-STQMWFEESA-N 0.000 description 1
- ZEVPMOHYCQFWSE-NAKRPEOUSA-N Met-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCSC)N ZEVPMOHYCQFWSE-NAKRPEOUSA-N 0.000 description 1
- PZUUMQPMHBJJKE-AVGNSLFASA-N Met-Leu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N PZUUMQPMHBJJKE-AVGNSLFASA-N 0.000 description 1
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 1
- WPTHAGXMYDRPFD-SRVKXCTJSA-N Met-Lys-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O WPTHAGXMYDRPFD-SRVKXCTJSA-N 0.000 description 1
- HOZNVKDCKZPRER-XUXIUFHCSA-N Met-Lys-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HOZNVKDCKZPRER-XUXIUFHCSA-N 0.000 description 1
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 1
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 1
- XIGAHPDZLAYQOS-SRVKXCTJSA-N Met-Pro-Pro Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 XIGAHPDZLAYQOS-SRVKXCTJSA-N 0.000 description 1
- JZXKNNOWPBVZEV-XIRDDKMYSA-N Met-Trp-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N JZXKNNOWPBVZEV-XIRDDKMYSA-N 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 description 1
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 1
- VUYCNYVLKACHPA-KKUMJFAQSA-N Phe-Asp-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VUYCNYVLKACHPA-KKUMJFAQSA-N 0.000 description 1
- UNLYPPYNDXHGDG-IHRRRGAJSA-N Phe-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UNLYPPYNDXHGDG-IHRRRGAJSA-N 0.000 description 1
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 1
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 1
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 1
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 1
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- UXQFHEKRGHYJRA-STQMWFEESA-N Phe-Met-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O UXQFHEKRGHYJRA-STQMWFEESA-N 0.000 description 1
- OHIYMVFLQXTZAW-UFYCRDLUSA-N Phe-Met-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O OHIYMVFLQXTZAW-UFYCRDLUSA-N 0.000 description 1
- GRVMHFCZUIYNKQ-UFYCRDLUSA-N Phe-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GRVMHFCZUIYNKQ-UFYCRDLUSA-N 0.000 description 1
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 1
- YTGGLKWSVIRECD-JBACZVJFSA-N Phe-Trp-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 YTGGLKWSVIRECD-JBACZVJFSA-N 0.000 description 1
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 1
- BELBBZDIHDAJOR-UHFFFAOYSA-N Phenolsulfonephthalein Chemical compound C1=CC(O)=CC=C1C1(C=2C=CC(O)=CC=2)C2=CC=CC=C2S(=O)(=O)O1 BELBBZDIHDAJOR-UHFFFAOYSA-N 0.000 description 1
- FCCBQBZXIAZNIG-LSJOCFKGSA-N Pro-Ala-His Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O FCCBQBZXIAZNIG-LSJOCFKGSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- KQCCDMFIALWGTL-GUBZILKMSA-N Pro-Asn-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 KQCCDMFIALWGTL-GUBZILKMSA-N 0.000 description 1
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 1
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 1
- AHXPYZRZRMQOAU-QXEWZRGKSA-N Pro-Asn-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1)C(O)=O AHXPYZRZRMQOAU-QXEWZRGKSA-N 0.000 description 1
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 1
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 1
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 1
- PZSCUPVOJGKHEP-CIUDSAMLSA-N Pro-Gln-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PZSCUPVOJGKHEP-CIUDSAMLSA-N 0.000 description 1
- UPJGUQPLYWTISV-GUBZILKMSA-N Pro-Gln-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UPJGUQPLYWTISV-GUBZILKMSA-N 0.000 description 1
- KTFZQPLSPLWLKN-KKUMJFAQSA-N Pro-Gln-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KTFZQPLSPLWLKN-KKUMJFAQSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- SSWJYJHXQOYTSP-SRVKXCTJSA-N Pro-His-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O SSWJYJHXQOYTSP-SRVKXCTJSA-N 0.000 description 1
- BODDREDDDRZUCF-QTKMDUPCSA-N Pro-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2)O BODDREDDDRZUCF-QTKMDUPCSA-N 0.000 description 1
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 1
- DSGSTPRKNYHGCL-JYJNAYRXSA-N Pro-Phe-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DSGSTPRKNYHGCL-JYJNAYRXSA-N 0.000 description 1
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 1
- GFHOSBYCLACKEK-GUBZILKMSA-N Pro-Pro-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GFHOSBYCLACKEK-GUBZILKMSA-N 0.000 description 1
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- GNFHQWNCSSPOBT-ULQDDVLXSA-N Pro-Trp-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCC(=O)N)C(=O)O GNFHQWNCSSPOBT-ULQDDVLXSA-N 0.000 description 1
- DLZBBDSPTJBOOD-BPNCWPANSA-N Pro-Tyr-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O DLZBBDSPTJBOOD-BPNCWPANSA-N 0.000 description 1
- SHTKRJHDMNSKRM-ULQDDVLXSA-N Pro-Tyr-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O SHTKRJHDMNSKRM-ULQDDVLXSA-N 0.000 description 1
- IALSFJSONJZBKB-HRCADAONSA-N Pro-Tyr-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N3CCC[C@@H]3C(=O)O IALSFJSONJZBKB-HRCADAONSA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 1
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 1
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- CTLVSHXLRVEILB-UBHSHLNASA-N Ser-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N CTLVSHXLRVEILB-UBHSHLNASA-N 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 1
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- PVDTYLHUWAEYGY-CIUDSAMLSA-N Ser-Glu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PVDTYLHUWAEYGY-CIUDSAMLSA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- FKZSXTKZLPPHQU-GQGQLFGLSA-N Ser-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CO)N FKZSXTKZLPPHQU-GQGQLFGLSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 1
- JLPMFVAIQHCBDC-CIUDSAMLSA-N Ser-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N JLPMFVAIQHCBDC-CIUDSAMLSA-N 0.000 description 1
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 1
- OCWWJBZQXGYQCA-DCAQKATOSA-N Ser-Lys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O OCWWJBZQXGYQCA-DCAQKATOSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- WMZVVNLPHFSUPA-BPUTZDHNSA-N Ser-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 WMZVVNLPHFSUPA-BPUTZDHNSA-N 0.000 description 1
- BCAVNDNYOGTQMQ-AAEUAGOBSA-N Ser-Trp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O BCAVNDNYOGTQMQ-AAEUAGOBSA-N 0.000 description 1
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 1
- SGZVZUCRAVSPKQ-FXQIFTODSA-N Ser-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N SGZVZUCRAVSPKQ-FXQIFTODSA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- 101710151381 Serine protease 2 Proteins 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 238000010459 TALEN Methods 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 1
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 1
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 1
- MQUZMZBFKCHVOB-HJGDQZAQSA-N Thr-Gln-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O MQUZMZBFKCHVOB-HJGDQZAQSA-N 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 1
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 1
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 1
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 1
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 1
- YJVJPJPHHFOVMG-VEVYYDQMSA-N Thr-Met-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YJVJPJPHHFOVMG-VEVYYDQMSA-N 0.000 description 1
- WRUWXBBEFUTJOU-XGEHTFHBSA-N Thr-Met-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N)O WRUWXBBEFUTJOU-XGEHTFHBSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- NLWDSYKZUPRMBJ-IEGACIPQSA-N Thr-Trp-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O NLWDSYKZUPRMBJ-IEGACIPQSA-N 0.000 description 1
- AXEJRUGTOJPZKG-XGEHTFHBSA-N Thr-Val-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O AXEJRUGTOJPZKG-XGEHTFHBSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 1
- GTNCSPKYWCJZAC-XIRDDKMYSA-N Trp-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GTNCSPKYWCJZAC-XIRDDKMYSA-N 0.000 description 1
- IMYTYAWRKBYTSX-YTQUADARSA-N Trp-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N)C(=O)O IMYTYAWRKBYTSX-YTQUADARSA-N 0.000 description 1
- OJCSQAWRJKPKFM-TUSQITKMSA-N Trp-His-Trp Chemical compound N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OJCSQAWRJKPKFM-TUSQITKMSA-N 0.000 description 1
- SAKLWFSRZTZQAJ-GQGQLFGLSA-N Trp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N SAKLWFSRZTZQAJ-GQGQLFGLSA-N 0.000 description 1
- CSRCUZAVBSEDMB-FDARSICLSA-N Trp-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N CSRCUZAVBSEDMB-FDARSICLSA-N 0.000 description 1
- GWBWCGITOYODER-YTQUADARSA-N Trp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GWBWCGITOYODER-YTQUADARSA-N 0.000 description 1
- NFVQCNMGJILYMI-SZMVWBNQSA-N Trp-Met-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NFVQCNMGJILYMI-SZMVWBNQSA-N 0.000 description 1
- GBEAUNVBIMLWIB-IHPCNDPISA-N Trp-Ser-Phe Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 GBEAUNVBIMLWIB-IHPCNDPISA-N 0.000 description 1
- MPYZGXUYLNPSNF-NAZCDGGXSA-N Trp-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O MPYZGXUYLNPSNF-NAZCDGGXSA-N 0.000 description 1
- IYHRKILQAQWODS-VJBMBRPKSA-N Trp-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IYHRKILQAQWODS-VJBMBRPKSA-N 0.000 description 1
- FBHHJGOJWXHGDO-TUSQITKMSA-N Trp-Trp-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC=3C4=CC=CC=C4NC=3)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 FBHHJGOJWXHGDO-TUSQITKMSA-N 0.000 description 1
- 102100034392 Trypsin-2 Human genes 0.000 description 1
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 1
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 1
- HKIUVWMZYFBIHG-KKUMJFAQSA-N Tyr-Arg-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O HKIUVWMZYFBIHG-KKUMJFAQSA-N 0.000 description 1
- CRWOSTCODDFEKZ-HRCADAONSA-N Tyr-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CRWOSTCODDFEKZ-HRCADAONSA-N 0.000 description 1
- YRBHLWWGSSQICE-IHRRRGAJSA-N Tyr-Asp-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O YRBHLWWGSSQICE-IHRRRGAJSA-N 0.000 description 1
- TZXFLDNBYYGLKA-BZSNNMDCSA-N Tyr-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 TZXFLDNBYYGLKA-BZSNNMDCSA-N 0.000 description 1
- XKDOQXAXKFQWQJ-SRVKXCTJSA-N Tyr-Cys-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O XKDOQXAXKFQWQJ-SRVKXCTJSA-N 0.000 description 1
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 description 1
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 1
- LMLBOGIOLHZXOT-JYJNAYRXSA-N Tyr-Glu-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O LMLBOGIOLHZXOT-JYJNAYRXSA-N 0.000 description 1
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 1
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 1
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 1
- JHORGUYURUBVOM-KKUMJFAQSA-N Tyr-His-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O JHORGUYURUBVOM-KKUMJFAQSA-N 0.000 description 1
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 1
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 1
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 1
- AKRHKDCELJLTMD-BVSLBCMMSA-N Tyr-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N AKRHKDCELJLTMD-BVSLBCMMSA-N 0.000 description 1
- KHPLUFDSWGDRHD-SLFFLAALSA-N Tyr-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O KHPLUFDSWGDRHD-SLFFLAALSA-N 0.000 description 1
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 1
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 1
- WKWJJQZZZBBWKV-JYJNAYRXSA-N Val-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WKWJJQZZZBBWKV-JYJNAYRXSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- NWDOPHYLSORNEX-QXEWZRGKSA-N Val-Asn-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N NWDOPHYLSORNEX-QXEWZRGKSA-N 0.000 description 1
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- PFMAFMPJJSHNDW-ZKWXMUAHSA-N Val-Cys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N PFMAFMPJJSHNDW-ZKWXMUAHSA-N 0.000 description 1
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 1
- YLHLNFUXDBOAGX-DCAQKATOSA-N Val-Cys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YLHLNFUXDBOAGX-DCAQKATOSA-N 0.000 description 1
- LHADRQBREKTRLR-DCAQKATOSA-N Val-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N LHADRQBREKTRLR-DCAQKATOSA-N 0.000 description 1
- IRLYZKKNBFPQBW-XGEHTFHBSA-N Val-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N)O IRLYZKKNBFPQBW-XGEHTFHBSA-N 0.000 description 1
- NYTKXWLZSNRILS-IFFSRLJSSA-N Val-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N)O NYTKXWLZSNRILS-IFFSRLJSSA-N 0.000 description 1
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 1
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 1
- PHZGFLFMGLXCFG-FHWLQOOXSA-N Val-Lys-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N PHZGFLFMGLXCFG-FHWLQOOXSA-N 0.000 description 1
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 1
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 1
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 1
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 1
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 1
- 101710204001 Zinc metalloprotease Proteins 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 101150063416 add gene Proteins 0.000 description 1
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 1
- 108010031014 alanyl-histidyl-leucyl-leucine Proteins 0.000 description 1
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 229940024606 amino acid Drugs 0.000 description 1
- 235000001014 amino acid Nutrition 0.000 description 1
- 210000003484 anatomy Anatomy 0.000 description 1
- 239000003443 antiviral agent Substances 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 229960000074 biopharmaceutical Drugs 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 238000007664 blowing Methods 0.000 description 1
- 230000037396 body weight Effects 0.000 description 1
- 210000000988 bone and bone Anatomy 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 229960002424 collagenase Drugs 0.000 description 1
- 238000005138 cryopreservation Methods 0.000 description 1
- 229960001305 cysteine hydrochloride Drugs 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 230000005782 double-strand break Effects 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 210000002308 embryonic cell Anatomy 0.000 description 1
- 229940116977 epidermal growth factor Drugs 0.000 description 1
- 210000003743 erythrocyte Anatomy 0.000 description 1
- 230000003203 everyday effect Effects 0.000 description 1
- 238000010195 expression analysis Methods 0.000 description 1
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 1
- 210000001733 follicular fluid Anatomy 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 229920000159 gelatin Polymers 0.000 description 1
- 239000008273 gelatin Substances 0.000 description 1
- 235000019322 gelatine Nutrition 0.000 description 1
- 235000011852 gelatine desserts Nutrition 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 102000056252 human ACE Human genes 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- VVIUBCNYACGLLV-UHFFFAOYSA-N hypotaurine Chemical compound [NH3+]CCS([O-])=O VVIUBCNYACGLLV-UHFFFAOYSA-N 0.000 description 1
- 238000010832 independent-sample T-test Methods 0.000 description 1
- CDAISMWEOUEBRE-GPIVLXJGSA-N inositol Chemical compound O[C@H]1[C@H](O)[C@@H](O)[C@H](O)[C@H](O)[C@@H]1O CDAISMWEOUEBRE-GPIVLXJGSA-N 0.000 description 1
- 229960000367 inositol Drugs 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 235000013372 meat Nutrition 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000010172 mouse model Methods 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- 235000016709 nutrition Nutrition 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 229960003531 phenolsulfonphthalein Drugs 0.000 description 1
- 108010073832 phenylalanyl-leucyl-leucyl-arginyl-asparagine Proteins 0.000 description 1
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 229940076788 pyruvate Drugs 0.000 description 1
- 229940107700 pyruvic acid Drugs 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000001850 reproductive effect Effects 0.000 description 1
- 230000000241 respiratory effect Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- CDAISMWEOUEBRE-UHFFFAOYSA-N scyllo-inosotol Natural products OC1C(O)C(O)C(O)C(O)C1O CDAISMWEOUEBRE-UHFFFAOYSA-N 0.000 description 1
- 238000010008 shearing Methods 0.000 description 1
- 238000004904 shortening Methods 0.000 description 1
- 238000002791 soaking Methods 0.000 description 1
- 238000001179 sorption measurement Methods 0.000 description 1
- 230000007480 spreading Effects 0.000 description 1
- 238000003892 spreading Methods 0.000 description 1
- 239000011550 stock solution Substances 0.000 description 1
- 239000012096 transfection reagent Substances 0.000 description 1
- 102000035160 transmembrane proteins Human genes 0.000 description 1
- 108091005703 transmembrane proteins Proteins 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 238000007492 two-way ANOVA Methods 0.000 description 1
- 108010029599 tyrosyl-glutamyl-tryptophan Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 229910021642 ultra pure water Inorganic materials 0.000 description 1
- 239000012498 ultrapure water Substances 0.000 description 1
- VBEQCZHXXJYVRD-GACYYNSASA-N uroanthelone Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C(C)C)[C@@H](C)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)CNC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CS)NC(=O)CNC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O)C(C)C)[C@@H](C)CC)C1=CC=C(O)C=C1 VBEQCZHXXJYVRD-GACYYNSASA-N 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 238000012070 whole genome sequencing analysis Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/8509—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells for producing genetically modified animals, e.g. transgenic
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K67/00—Rearing or breeding animals, not otherwise provided for; New or modified breeds of animals
- A01K67/027—New or modified breeds of vertebrates
- A01K67/0273—Cloned vertebrates
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K49/00—Preparations for testing in vivo
- A61K49/0004—Screening or testing of compounds for diagnosis of disorders, assessment of conditions, e.g. renal clearance, gastric emptying, testing for diabetes, allergy, rheuma, pancreas functions
- A61K49/0008—Screening agents using (non-human) animal models or transgenic animal models or chimeric hosts, e.g. Alzheimer disease animal model, transgenic model for heart failure
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
- C12N15/902—Stable introduction of foreign DNA into chromosome using homologous recombination
- C12N15/907—Stable introduction of foreign DNA into chromosome using homologous recombination in mammalian cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N5/00—Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
- C12N5/06—Animal cells or tissues; Human cells or tissues
- C12N5/0602—Vertebrate cells
- C12N5/0652—Cells of skeletal and connective tissues; Mesenchyme
- C12N5/0656—Adult fibroblasts
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/12—Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/48—Hydrolases (3) acting on peptide bonds (3.4)
- C12N9/485—Exopeptidases (3.4.11-3.4.19)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/48—Hydrolases (3) acting on peptide bonds (3.4)
- C12N9/50—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25)
- C12N9/64—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from animal tissue
- C12N9/6421—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from animal tissue from mammals
- C12N9/6424—Serine endopeptidases (3.4.21)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y207/00—Transferases transferring phosphorus-containing groups (2.7)
- C12Y207/10—Protein-tyrosine kinases (2.7.10)
- C12Y207/10001—Receptor protein-tyrosine kinase (2.7.10.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y304/00—Hydrolases acting on peptide bonds, i.e. peptidases (3.4)
- C12Y304/17—Metallocarboxypeptidases (3.4.17)
- C12Y304/17023—Angiotensin-converting enzyme 2 (3.4.17.23)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y304/00—Hydrolases acting on peptide bonds, i.e. peptidases (3.4)
- C12Y304/21—Serine endopeptidases (3.4.21)
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/5005—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells
- G01N33/5008—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells for testing or evaluating the effect of chemical or biological compounds, e.g. drugs, cosmetics
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2207/00—Modified animals
- A01K2207/15—Humanized animals
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2217/00—Genetically modified animals
- A01K2217/07—Animals genetically altered by homologous recombination
- A01K2217/072—Animals genetically altered by homologous recombination maintaining or altering function, i.e. knock in
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2227/00—Animals characterised by species
- A01K2227/10—Mammal
- A01K2227/108—Swine
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2267/00—Animals characterised by purpose
- A01K2267/03—Animal model, e.g. for test or diseases
- A01K2267/0393—Animal model comprising a reporter system for screening tests
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2503/00—Use of cells in diagnostics
- C12N2503/02—Drug screening
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2510/00—Genetically modified cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/10—Plasmid DNA
- C12N2800/106—Plasmid DNA for vertebrates
- C12N2800/107—Plasmid DNA for vertebrates for mammalian
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2500/00—Screening for compounds of potential therapeutic value
- G01N2500/10—Screening for compounds of potential therapeutic value involving cells
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Wood Science & Technology (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Urology & Nephrology (AREA)
- Veterinary Medicine (AREA)
- Cell Biology (AREA)
- Plant Pathology (AREA)
- Immunology (AREA)
- Biophysics (AREA)
- Rheumatology (AREA)
- Hematology (AREA)
- Environmental Sciences (AREA)
- Pathology (AREA)
- Toxicology (AREA)
- Animal Behavior & Ethology (AREA)
- Food Science & Technology (AREA)
- Gastroenterology & Hepatology (AREA)
- Epidemiology (AREA)
- Endocrinology (AREA)
- Public Health (AREA)
- Diabetes (AREA)
- Tropical Medicine & Parasitology (AREA)
- Analytical Chemistry (AREA)
- Animal Husbandry (AREA)
- Vascular Medicine (AREA)
Abstract
本发明公开了表达三种人源基因的SARS‑CoV‑2易感模型猪的构建方法及其应用。本发明采用CRISPR/Cas9系统及同源重组技术构建了在基因组的特定位置整合并表达三个人源基因(ACE2基因、AXL基因和TMPRSS2基因)的猪重组细胞,进一步,利用该重组细胞通过体细胞核移植技术得到了SARS‑CoV‑2易感模型猪,该模型猪可用于下一步的药物、疫苗和抗体筛选及药物、疫苗和抗体效果评价以及发病机制的研究。
Description
技术领域
本发明属于生物技术领域,涉及基因编辑技术,具体涉及采用CRISPR/Cas9系统及同源重组技术构建的在基因组的特定位置整合并表达三个人源基因(ACE2基因、AXL基因和TMPRSS2基因)的猪重组细胞,利用该重组细胞可以制备SARS-CoV-2易感模型猪,从而将其用于生物医药领域。
背景技术
人类血管紧张素I转化酶2(hACE2)为人类血管紧张素转换酶同源物,是一种锌金属蛋白酶,属于1型跨膜蛋白,其作为SARS-CoV-2和SARS-CoV等冠状病毒在人体内的受体蛋白,广泛存在于人体各个组织中,是目前已知导致SARS-CoV-2进入宿主细胞并造成宿主发病的关键蛋白。
SARS-CoV-2进入宿主细胞需要依赖于病毒的刺突蛋白(S)与细胞受体的结合,并通过宿主细胞蛋白酶激活S蛋白,而人丝氨酸蛋白酶2(TMPRSS2)在此过程中行使着激活S蛋白的关键功能。
此外,人AXL受体酪氨酸激酶已被发现是一种不依赖于ACE2的SARS-CoV-2新受体,其通过特异性结合SARS-CoV-2刺突蛋白N端的NTD区域,介导病毒在呼吸系统细胞上的吸附与内化。
目前,尚未有针对SARS-CoV-2的特效药物,世界各国都在争分夺秒地开展SARS-CoV-2疫苗的研发。而动物模型对于阐明SARS-CoV-2的感染与发病机制、传播途径以及抗病毒药物和疫苗及抗体的开发和评价至关重要。
现有技术中,常用的模式动物为小鼠。小鼠不论从体型、器官大小、生理、病理等方面都与人相差巨大,不能真实地模拟人类正常的生理、病理状态。而猪作为大动物,是人类长期以来主要的肉食供应动物,其体型大小和生理功能与人类近似,易于大规模繁殖饲养,而且在伦理道德及动物保护等方面要求较低,是理想的人类疾病模型动物。
基因编辑是近年来不断取得重大发展的一种生物技术,其包括从基于同源重组的基因编辑到基于核酸酶的ZFN、TALEN、CRISPR/Cas9等编辑技术,其中CRISPR/Cas9技术是当前最先进的基因编辑技术。目前,基因编辑技术被越来越多地应用到动物模型的制作上。
发明内容
本发明解决的技术问题为提供了采用CRISPR/Cas9系统及同源重组技术构建的在基因组的特定位置整合并表达三个人源基因(ACE2基因、AXL基因和TMPRSS2基因)的猪重组细胞,进一步,利用该重组细胞通过体细胞核移植技术得到了SARS-CoV-2易感模型猪,该模型猪可用于下一步的药物、疫苗及抗体筛选和药物、疫苗及抗体效果评价以及发病机制的研究等。
本发明提供了一种制备重组细胞的方法,包括如下步骤:将命名为DNA分子甲的DNA分子整合至猪细胞的基因组DNA,得到重组细胞;所述DNA分子甲表达人源ACE2蛋白、人源AXL蛋白和人源TMPRSS2蛋白。
具体的,人源ACE2蛋白(hACE2蛋白)如SEQ ID NO:15所示。
具体的,人源AXL蛋白(hAXL蛋白),如SEQ ID NO:16所示。
具体的,人源TMPRSS2蛋白(hTMPRSS2蛋白),如SEQ ID NO:17所示。
人源ACE2蛋白由人ACE2基因编码。
人源AXL蛋白由人AXL基因编码。
人源TMPRSS2蛋白由人TMPRSS2基因编码。
具体的,人ACE2基因(hACE2基因)如SEQ ID NO:14中第2290-4704位核苷酸所示。
具体的,人AXL基因(hAXL基因)如SEQ ID NO:14中第4771-6648位核苷酸所示。
具体的,人TMPRSS2基因(hTMPRSS2基因)如SEQ ID NO:14中第6712-8298位核苷酸所示。
所述DNA分子甲中,人ACE2基因、人AXL基因和人TMPRSS2基因存在于同一表达盒。
所述DNA分子甲中,人ACE2基因、人AXL基因和人TMPRSS2基因由同一启动子启动表达。
具体的,所述启动子为人hEF1α启动子。
具体的,人hEF1α启动子如SEQ ID NO:14中第1094-2271位核苷酸所示。
所述DNA分子甲中,人ACE2基因、人AXL基因和人TMPRSS2基因由同一终止密码子终止表达。
所述DNA分子甲中,人ACE2基因、人AXL基因和人TMPRSS2基因共用一个Poly(A)信号。
具体的,所述Poly(A)信号为EF1αPoly(A)。
具体的,EF1αPoly(A)如SEQ ID NO:14中第8302-8874位核苷酸所示。
所述DNA分子甲中,人ACE2基因、人AXL基因和人TMPRSS2基因存在于同一编码框,基因之间由2A肽的编码基因间隔。2A肽又称为自剪接肽或自裂解肽或2A自剪切肽或2A自裂解肽。
所述2A肽具体可为P2A肽。
所述2A肽具体可为T2A肽。
具体的,所述P2A肽的编码基因如SEQ ID NO:14中第4705-4770位核苷酸所示。
具体的,所述T2A肽的编码基因如SEQ ID NO:14中第6649-6711位核苷酸所示。
具体的,所述DNA分子甲中,人ACE2基因、人AXL基因和人TMPRSS2基因所在的编码框如SEQ ID NO:14中第1094-8874位核苷酸所示。
所述DNA分子甲中还包括抗性筛选基因表达盒。所述抗性筛选基因可为编码Puromycin抗性蛋白的基因。所述抗性筛选基因表达盒如SEQ ID NO:14中第8987-10262位核苷酸所示。
所述DNA分子甲中还包括LoxP序列。
所述DNA分子甲中具体包括2个LoxP序列,分别如SEQ ID NO:14中第8911-8944位核苷酸、第10307-10340位核苷酸所示。
所述DNA分子甲中还包括绝缘子。
所述DNA分子甲中具体包括2个绝缘子,分别如SEQ ID NO:14中第887-1087位核苷酸、第10349-10549位核苷酸所示。
所述DNA分子甲自上游至下游依次包括如下区段:人hEF1α启动子、hACE2基因、编码自P2A肽的核苷酸、hAXL基因、编码T2A肽的核苷酸、hTMPRSS2基因、终止密码子、EF1αPoly(A)、LoxP序列、SV40启动子、编码Puromycin抗性蛋白的核苷酸、终止密码子、SV40Poly(A)、LoxP序列。
所述DNA分子甲自上游至下游依次包括如下区段:绝缘子1、人hEF1α启动子、hACE2基因、编码P2A肽的核苷酸、hAXL基因、编码T2A肽的核苷酸、hTMPRSS2基因、终止密码子、EF1αPoly(A)、LoxP序列、SV40启动子、编码Puromycin抗性蛋白的核苷酸、终止密码子、SV40Poly(A)、LoxP序列、绝缘子3。
具体的,所述DNA分子甲如SEQ ID NO:14中第887-10549位核苷酸所示。
具体的,所述DNA分子甲如SEQ ID NO:14中第881-10549位核苷酸所示。
所述“将命名为DNA分子甲的DNA分子整合至猪细胞的基因组DNA”的实现方式为:将命名为DNA分子乙的DNA分子导入猪细胞或者将具有所述DNA分子乙的重组质粒导入猪细胞;所述DNA分子乙中,具有所述DNA分子甲且在所述DNA分子甲的上游具有上游同源臂且在所述DNA分子甲的下游具有下游同源臂,所述上游同源臂和所述下游同源臂用于将所述DNA分子甲整合至猪细胞的基因组DNA。
所述同源臂为针对ROSA26基因的同源臂,上游同源臂为SH1左臂,下游同源臂为SH1右臂。SH1左臂如SEQ ID NO:3中第9-339位核苷酸所示,SH1右臂如SEQ ID NO:3中第9184-10195位核苷酸所示。
所述同源臂为针对AAVS1基因的同源臂,上游同源臂为SH2左臂,下游同源臂为SH2右臂。SH2左臂如SEQ ID NO:4所示,SH2右臂如SEQ ID NO:5所示。
所述同源臂为针对H11位点的同源臂,上游同源臂为SH3左臂,下游同源臂为SH3右臂。SH3左臂如SEQ ID NO:6所示,SH3右臂如SEQ ID NO:7所示。
所述同源臂为针对COL1A1基因的同源臂,上游同源臂为SH4左臂,下游同源臂为SH4右臂。SH4左臂如SEQ ID NO:8所示,SH4右臂如SEQ ID NO:9所示。
所述同源臂为针对COL1A1基因的同源臂,上游同源臂为SH4左臂,下游同源臂为SH4右臂。SH4左臂如SEQ ID NO:14中第9-880位核苷酸所示,SH4右臂如SEQ ID NO:14中第10550-11276位核苷酸所示。
所述DNA分子乙自上游至下游依次包括如下区段:上游同源臂、绝缘子1、人hEF1α启动子、hACE2基因、编码P2A肽的核苷酸、hAXL基因、编码T2A肽的核苷酸、hTMPRSS2基因、终止密码子、EF1αPoly(A)、LoxP序列、SV40启动子、编码Puromycin抗性蛋白的核苷酸、终止密码子、SV40 Poly(A)、LoxP序列、绝缘子3、下游同源臂。
具体的,所述DNA分子乙如SEQ ID NO:14中第9-11276位核苷酸所示。
具体的,具有所述DNA分子乙的重组质粒如SEQ ID NO:14所示。
DNA分子甲整合至猪细胞的基因组DNA的ROSA26基因或者AAVS1基因或者H11位点或者COL1A1基因中。优选为COL1A1基因。
DNA分子甲整合至猪细胞的基因组DNA的ROSA26安全港插入位点或者AAVS1安全港插入位点或者H11安全港插入位点或者COL1A1安全港插入位点。优选为COL1A1安全港插入位点。
DNA分子甲整合至猪细胞的基因组DNA的ROSA26基因中指的是将DNA分子甲插入至基因组DNA中SH1左臂和SH1右臂之间;SH1左臂如SEQ ID NO:3中第9-339位核苷酸所示,SH1右臂如SEQ ID NO:3中第9184-10195位核苷酸所示。
DNA分子甲整合至猪细胞的基因组DNA的AAVS1基因中指的是将DNA分子甲插入至基因组DNA中SH2左臂和SH2右臂之间;SH2左臂如SEQ ID NO:4所示,SH2右臂如SEQ ID NO:5所示。
DNA分子甲整合至猪细胞的基因组DNA的H11位点中指的是将DNA分子甲插入至基因组DNA中SH3左臂和SH3右臂之间;SH3左臂如SEQ ID NO:6所示,SH3右臂如SEQ ID NO:7所示。
DNA分子甲整合至猪细胞的基因组DNA的COL1A1基因中指的是将DNA分子甲插入至基因组DNA中SH4左臂和SH4右臂之间;SH4左臂如SEQ ID NO:8所示,SH4右臂如SEQ ID NO:9所示。
DNA分子甲整合至猪细胞的基因组DNA的COL1A1基因中指的是将DNA分子甲插入至基因组DNA中SH4左臂和SH4右臂之间;SH4左臂如SEQ ID NO:14中第9-880位核苷酸所示,SH4右臂如SEQ ID NO:14中第10550-11276位核苷酸所示。
所述方法中,具有所述DNA分子乙的重组质粒与两个辅助质粒共同导入猪细胞;所述两个辅助质粒为sgRNA质粒和Cas9质粒;
sgRNA质粒转录得到特异sgRNA;所述特异sgRNA为sgRNAROSA26-g3、sgRNAAAVS1-g4、sgRNAH11-g1或sgRNACOL1A1-g3;sgRNAROSA26-g3的靶序列结合区如SEQ ID NO:10中第1-20位核苷酸所示;sgRNAAAVS1-g4的靶序列结合区如SEQ ID NO:11中第1-20位核苷酸所示;sgRNAH11-g1的靶序列结合区如SEQ ID NO:12中第1-20位核苷酸所示;sgRNACOL1A1-g3的靶序列结合区如SEQID NO:13中第1-20位核苷酸所示;
Cas9质粒表达Cas9蛋白。
所述Cas9质粒具体可为质粒pKG-GE3。
具体来说,所述sgRNA质粒是借助限制性内切酶BbsI将特异sgRNA的靶序列结合区的编码序列插入pKG-U6gRNA载体得到的。
具有所述DNA分子乙的重组质粒、sgRNA质粒和Cas9质粒的摩尔配比具体可为1:3:1。
猪细胞、具有所述DNA分子乙的重组质粒、sgRNA质粒和Cas9质粒的配比具体可为:约20万个猪细胞:1.3μg具有所述DNA分子乙的重组质粒:0.8μg sgRNA质粒:0.9μg Cas9质粒。
具体的,猪基因组中所述ROSA26安全港插入位点及其周边区域如SEQ ID NO:18所示。
具体的,猪基因组中所述AAVS1安全港插入位点及其周边区域如SEQ ID NO:19所示。
具体的,猪基因组中所述H11安全港插入位点及其周边区域如SEQ ID NO:20所示。
具体的,猪基因组中所述COL1A1安全港插入位点及其周边区域如SEQ ID NO:21所示。
本发明还保护一种试剂盒,包括以上任一所述DNA分子乙。
本发明还保护一种试剂盒,包括具有以上任一所述DNA分子乙的重组质粒。
所述试剂盒还包括所述sgRNA质粒和所述的Cas9质粒。
具有所述DNA分子乙的重组质粒、sgRNA质粒和Cas9质粒的摩尔配比具体可为1:3:1。
本发明还保护以上任一所述DNA分子乙在制备试剂盒中的应用。
本发明还保护具有以上任一所述DNA分子乙的重组质粒在制备试剂盒中的应用。
本发明还保护具有以上任一所述DNA分子乙的重组质粒、sgRNA质粒和Cas9质粒在制备试剂盒中的应用。
以上任一所述试剂盒的用途为如下(a)或(b)或(c):(a)制备重组细胞;(b)制备SARS-CoV-2易感模型猪;(c)制备SARS-CoV-2易感猪细胞模型或SARS-CoV-2易感猪组织模型或SARS-CoV-2易感猪器官模型。
本发明还保护以上任一所述DNA分子乙、具有以上任一所述DNA分子乙的重组质粒或以上任一所述试剂盒的应用,为如下(a)或(b)或(c):(a)制备重组细胞;(b)制备SARS-CoV-2易感模型猪;(c)制备SARS-CoV-2易感猪细胞模型或SARS-CoV-2易感猪组织模型或SARS-CoV-2易感猪器官模型。
以上任一所述猪细胞为猪原代成纤维细胞。
所述猪可为任何品种的猪,优选的,所述猪可为从江香猪。
所述猪具体可为初生猪。
质粒pKG-GE3中,具有特异融合基因;所述特异融合基因编码特异融合蛋白;
所述特异融合蛋白自N端至C端依次包括如下元件:两个核定位信号、Cas9蛋白、两个核定位信号、P2A肽、荧光报告蛋白、T2A肽、抗性筛选标记蛋白;
质粒pKG-GE3中,由EF1a启动子启动所述特异融合基因的表达;
质粒pKG-GE3中,所述特异融合基因下游具有WPRE序列元件、3’LTR序列元件和bGHpoly(A)signal序列元件。
质粒pKG-GE3中,依次具有如下元件:CMV增强子、EF1a启动子、所述特异融合基因、WPRE序列元件、3’LTR序列元件、bGH poly(A)signal序列元件。
所述特异融合蛋白中,Cas9蛋白上游的两个核定位信号为SV40核定位信号,Cas9蛋白下游的两个核定位信号为nucleoplasmin核定位信号。
所述特异融合蛋白中,荧光报告蛋白具体可为EGFP蛋白。
所述特异融合蛋白中,抗性筛选标记蛋白具体可为Puromycin抗性蛋白。
P2A肽的氨基酸序列为“ATNFSLLKQAGDVEENPGP”(断裂位置为C端开始第一个氨基酸残基和第二个氨基酸残基之间)。
T2A肽的氨基酸序列为“EGRGSLLTCGDVEENPGP”(断裂位置为C端开始第一个氨基酸残基和第二个氨基酸残基之间)。
特异融合基因具体如SEQ ID NO:1中第911-6706位核苷酸所示。
CMV增强子如SEQ ID NO:1中第395-680位核苷酸所示。
EF1a启动子如SEQ ID NO:1中第682-890位核苷酸所示。
WPRE序列元件如SEQ ID NO:1中第6722-7310位核苷酸所示。
3’LTR序列元件如SEQ ID NO:1中第7382-7615位核苷酸所示。
bGH poly(A)signal序列元件如SEQ ID NO:1中第7647-7871位核苷酸所示。
质粒pKG-GE3具体如SEQ ID NO:1所示。
质粒pKG-U6gRNA中,具有SEQ ID NO:2中第2280-2637位核苷酸所示的DNA分子。
质粒pKG-U6gRNA具体如SEQ ID NO:2所示。
具体的,sgRNAROSA26-g3如SEQ ID NO:10所示。
具体的,sgRNAAAVS1-g4如SEQ ID NO:11所示。
具体的,sgRNAH11-g1如SEQ ID NO:12所示。
具体的,sgRNACOL1A1-g3如SEQ ID NO:13所示。
sgRNAROSA26-g3(SEQ ID NO:10):
GAAGGAGCAAACUGACAUGGguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu。
sgRNAAAVS1-g4(SEQ ID NO:11):
UGCAGUGGGUCUUUGGGGACguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu。
sgRNAH11-g1(SEQ ID NO:12):
UUCCAGGAACAUAAGAAAGUguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu。
sgRNACOL1A1-g3(SEQ ID NO:13):
GCAGUCUCAGCAACCACUGAguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu。
本发明还保护以上任一所述方法制备得到的重组细胞。
具体的,所述重组细胞可为如下重组细胞:与猪原代成纤维细胞相比,重组细胞的基因组DNA的差异仅在于:在猪基因组DNA中SH4左臂和SH4右臂之间插入了SEQ ID NO:14中第881-10549位核苷酸所示的DNA分子。
本发明还保护所述重组细胞在制备SARS-CoV-2易感模型猪中的应用。
将所述重组细胞作为核移植供体细胞进行体细胞克隆,可以得到克隆猪,即为SARS-CoV-2易感模型猪。
将所述重组细胞作为核移植供体细胞进行体细胞克隆从而制备克隆猪具体可采用如下方法:
(1)取离体卵巢,从直径3~6mm的卵泡中抽取卵丘卵母细胞复合体(Cumulus-oocyte complexes,COCs),选择至少具有三层致密卵丘细胞的COCs,接种至4孔板中,每孔装有400μL猪卵母细胞体外成熟培养基,每孔接种50个,每孔覆盖400μL的矿物油,将含COCs的培养板在38.5℃、5%CO2和饱和湿度的培养箱中培养42小时;
(2)完成步骤(1)后,用0.1%(w/v)透明质酸酶反复吹打去除COCs的扩张卵丘细胞,将具有完整膜且含排出的第一极体的卵母细胞在含有0.1mg/mL地美可辛、0.05M蔗糖和4mg/mL牛血清白蛋白的NCSU23培养基中培养0.5-1h,促使卵母细胞核突起,然后使用尖部倾斜的显微注射针在含有10μM HEPES、0.3%(w/v)聚乙烯吡咯烷酮、10%FBS、0.1mg/mL地美可辛和5mg/mL细胞松弛素B的Tyrode乳酸培养基中去除突起的细胞核和极体;然后,将单个核供体细胞注入去核卵母细胞的卵周隙,使用胚胎细胞融合仪在融合培养基中用200V/mm的直流脉冲将核供体细胞与受体卵母细胞融合20μs;然后,将重构胚在PZM-3培养基中培养2h以允许细胞核重编程,然后在激活培养基中用150V/mm的单脉冲激活100μs;然后,将激活的重构胚置于含5μg/mL细胞松弛素B的PZM-3培养基中,培养2小时,以进一步激活胚胎;然后将重构胚置于PZM-3培养基中培养;
(3)将激活后培养6h的重构胚移植到代孕母猪的输卵管中,正常饲养,得到子代,即为克隆猪。
代孕母猪具体可为二花脸母猪。
代孕母猪具体可为9月龄的二花脸母猪。
本发明还保护用所述重组细胞制备的SARS-CoV-2易感模型猪的猪细胞、猪组织或猪器官。
本发明还保护以上任一所述重组细胞的应用。
本发明还保护用所述重组细胞制备的SARS-CoV-2易感模型猪的猪细胞、猪组织或猪器官的应用。
本发明还保护用所述重组细胞制备的SARS-CoV-2易感模型猪的应用。
所述应用为如下(d1)或(d2)或(d3):
(d1)筛选治疗新冠肺炎的药物和/或疫苗和/或抗体;
(d2)进行新冠肺炎的药物和/或疫苗和/或抗体的效果评价;
(d3)研究新冠肺炎的发病机制。
人ACE2基因(hACE2基因)信息:编码angiotensin-converting enzyme 2[Homosapiens];位于人X号染色体;GeneID为59272。angiotensin-converting enzyme 2[Homosapiens],又称为hACE2蛋白。hACE2蛋白是目前SARS-CoV-2及SARS-CoV等冠状病毒公认的受体蛋白,如SEQ ID NO:15所示(NP_001358344.1)。
人AXL基因(hAXL基因)信息:编码AXL receptor tyrosine kinase[Homosapiens];位于人19号染色体;GeneID为558。AXL receptor tyrosine kinase[Homosapiens],又称为hAXL蛋白。hAXL蛋白是近期发现的不依赖于hACE2蛋白的独立介导SARS-CoV-2感染的蛋白,如SEQ ID NO:16所示(NP_001265528.1)。
人TMPRSS2基因(hTMPRSS2基因)信息:编码transmembrane protease serine 2[Homo sapiens];位于人21号染色体;GeneID为7113。transmembrane protease serine 2[Homo sapiens],又称为hTMPRSS2蛋白。hTMPRSS2蛋白能够通过激活SARS-CoV-2的刺突蛋白(S)从而介导SARS-CoV-2与宿主细胞受体结合,进而促进SARS-CoV-2进入宿主细胞,如SEQ ID NO:17所示(NP_001128571.1)。
与现有技术相比,本发明至少具有如下有益效果:
(1)本发明研究对象(猪)比其他动物(大小鼠、灵长类)具有更好的应用性。
大小鼠等啮齿类动物不论从体型、器官大小、生理、病理等方面都与人相差巨大,无法真实地模拟人类正常的生理、病理状态。研究表明,95%以上在大小鼠中验证有效的药物在人类临床试验中是无效的。就大动物而言,灵长类是与人亲缘关系最近的动物,但其体型小、性成熟晚(6-7岁开始交配),且为单胎动物,群体扩繁速度极慢,饲养成本很高。另外,灵长类动物克隆效率低、难度大、成本高。
而猪作为模型动物就没有上述缺点,猪是除灵长类外与人亲缘关系最近的动物,其体型、体重、器官大小等与人相近,在解剖学、生理学、免疫学、营养代谢、疾病发病机制等方面与人类极为相似。同时,猪的性成熟早(4-6个月),繁殖力高,一胎多仔,在2-3年内即可形成一个较大群体。另外,猪的克隆技术非常成熟,克隆及饲养成本也较灵长类低得多。
(2)本发明针对猪基因组进行了4个安全港位点基因敲入后表达情况的摸索,从中筛选出了最佳的供外源基因插入的猪基因组安全港位点,可有效改善基因敲入后目的基因的表达情况。
(3)利用本发明所得到的hACE2-hAXL-hTMPRSS2纯合敲入单细胞克隆株进行体细胞核移植动物克隆可直接得到hACE2-hAXL-hTMPRSS2纯合敲入的克隆猪,并且该纯合插入基因可稳定遗传。
在小鼠模型制作中,通常采用受精卵显微注射基因编辑材料后再进行胚胎移植,因其直接获得纯合突变后代的概率非常低(低于5%),需要进行后代的杂交选育,这不太适用于妊娠期较长的大动物(如猪)模型制作。因此,本发明采用技术难度大、挑战性高的原代细胞体外编辑并筛选阳性编辑单细胞克隆的方法,后期再通过体细胞核移植动物克隆技术直接获得相应模型猪,可大大缩短模型猪制作周期,并节省人力、物力、财力。
(4)本发明所制备的hACE2-hAXL-hTMPRSS2人源化猪可用于冠状病毒动物感染模型进行临床前试验研究。
本发明通过基因编辑手段获得了hACE2、hAXL和hTMPRSS2三基因人源化模型猪,将有助于研究并揭示由hACE2、hAXL和hTMPRSS2联合介导引发的SARS-CoV-2等冠状病毒的感染机制,并可用于进行药物筛选、药效评价、疫苗及抗体的应用效果测试等研究,能够为进一步的临床应用提供有效的实验数据,进而为预防和治疗人类SARS-CoV-2等冠状病毒感染疾病提供有力的实验手段。本发明对于针对SARS-CoV-2的药物、疫苗及抗体的研发及揭示该病毒的感染机制具有重大应用价值。
附图说明
图1为质粒PB-1G 2R 3-puro-ROSA26的结构示意图。
图2为不同安全港位点调控GFP绿色荧光表达图。
图3为不同安全港位点调控GFP基因转录水平荧光定量PCR结果。
图4为不同安全港位点调控GFP蛋白表达的FACS检测结果。
图5为质粒pKG-hACE2-hAXL-hTMPRSS2的结构示意图。
图6为单细胞克隆hACE2、hAXL和hTMPRSS2基因转录水平检测的结果。
图7为单细胞克隆hACE2、hAXL和hTMPRSS2的蛋白表达水平FACS检测的结果。
图8为体细胞克隆得到的模型猪的照片。
图9为人源化猪的hACE2、hAXL和hTMPRSS2基因转录水平检测的结果。
图10为人源化猪的hACE2、hAXL和hTMPRSS2蛋白表达水平FACS检测的结果。
图11为质粒pMD2.G-SARS-C19的结构示意图。
图12为质粒Lenti-mCherry的结构示意图。
图13为假病毒感染后的模型猪肺泡巨噬细胞的荧光信号照片。
图14为假病毒感染后的野生型对照猪肺泡巨噬细胞的荧光信号照片。
具体实施方式
下面结合具体实施方式对本发明进行进一步的详细描述,给出的实施例仅为了阐明本发明,而不是为了限制本发明的范围。以下提供的实施例可作为本技术领域普通技术人员进行进一步改进的指南,并不以任何方式构成对本发明的限制。
下述实施例中的实验方法,如无特殊说明,均为常规方法,按照本领域内的文献所描述的技术或条件或者按照产品说明书进行。下述实施例中所用的材料、试剂等,如无特殊说明,均可从商业途径得到。实施例中构建的重组质粒,均已进行测序验证。完全培养液(%为体积比):15%胎牛血清(Gibco)+83%DMEM培养基(Gibco)+1%Penicillin-Streptomycin(Gibco)+1%HEPES(Solarbio)。细胞培养条件:37℃,5%CO2、5%O2的恒温培养箱。
实施例中采用的猪原代成纤维细胞均是用初生从江香猪(雌性,血型AO)的耳组织制备得到的。制备猪原代成纤维细胞的方法:①取猪耳组织0.5g,去除毛发及骨组织,然后用75%酒精浸泡30-40s,然后用含5%(体积比)Penicillin-Streptomycin(Gibco)的PBS缓冲液洗涤5次,然后用PBS缓冲液洗涤一次;②用剪刀将组织剪碎,采用5mL 0.1%胶原酶溶液(Sigma),37℃消化1h,然后500g离心5min,弃上清;③将沉淀用1mL完全培养液重悬,然后铺入含10mL完全培养液并已用0.2%明胶(VWR)封盘的直径为10cm的细胞培养皿中,培养至细胞长满皿底60%左右;④完成步骤③后,采用胰蛋白酶消化并收集细胞,然后重悬于完全培养液。用于进行后续电转实验。
eEF1a-mNLS-hSpCas9-EGFP-PURO简称质粒pKG-GE3(环形质粒),如SEQ ID NO:1所示。SEQ ID NO:1中,第395-680位核苷酸组成CMV增强子,第682-890位核苷酸组成EF1a启动子,第986-1006位核苷酸编码核定位信号(NLS),第1016-1036位核苷酸编码核定位信号(NLS),第1037-5161位核苷酸编码Cas9蛋白,第5162-5209位核苷酸编码核定位信号(NLS),第5219-5266位核苷酸编码核定位信号(NLS),第5276-5332位核苷酸编码P2A肽(P2A肽的氨基酸序列为“ATNFSLLKQAGDVEENPGP”,断裂位置为C端开始第一个氨基酸残基和第二个氨基酸残基之间),第5333-6046位核苷酸编码EGFP蛋白,第6056-6109位核苷酸编码T2A肽(T2A肽的氨基酸序列为“EGRGSLLTCGDVEENPGP”,断裂位置为C端开始第一个氨基酸残基和第二个氨基酸残基之间),第6110-6703位核苷酸编码Puromycin蛋白(简称Puro蛋白),第6722-7310位核苷酸组成WPRE序列元件,第7382-7615位核苷酸组成3’LTR序列元件,第7647-7871位核苷酸组成bGH poly(A)signal序列元件。SEQ ID NO:1中,第911-6706形成融合基因,表达融合蛋白。由于P2A肽和T2A肽的存在,融合蛋白自发形成如下三个蛋白:具有Cas9蛋白的蛋白、具有EGFP蛋白的蛋白和具有Puro蛋白的蛋白。记载于专利申请202011170395.1(申请公布号为CN 112522261A;申请公布日为2021.03.19)。
pKG-U6gRNA载体,又称为质粒pKG-U6gRNA(环形质粒),如SEQ ID NO:2所示。SEQID NO:2中,第2280-2539位核苷酸组成hU6启动子,第2558-2637位核苷酸用于转录形成gRNA骨架。使用时,将20bp左右的DNA分子(用于转录形成gRNA的靶序列结合区)插入质粒pKG-U6gRNA,形成重组质粒,在细胞中重组质粒转录得到gRNA。记载于专利申请202011170395.1(申请公布号为CN 112522261 A;申请公布日为2021.03.19)。
实施例1、筛选供外源基因定点插入的猪基因组最佳安全港位点
一、构建含GFP基因的不同安全港位点Donor载体
构建质粒PB-1G 2R 3-puro-ROSA26、质粒PB-1G 2R 3-puro-AAVS1、质粒PB-1G2R3-puro-H11和质粒PB-1G 2R 3-puro-COL1A1。四个质粒均为环形质粒。
质粒PB-1G 2R 3-puro-ROSA26如SEQ ID NO:3所示,结构示意图见图1。SEQ IDNO:3中,第9-339位核苷酸组成ROSA26安全港插入位点5’端猪基因组区域(SH1左臂),第9184-10195位核苷酸组成ROSA26安全港插入位点3’端猪基因组区域(SH1右臂)。SEQ IDNO:3中,第346-546、3132-3531、6506-6706、8975-9175位核苷酸分别组成4个不同的绝缘子区域。SEQ ID NO:3中,第637-1209位核苷酸组成EF-1αpoly(A)信号,第1216-1935位核苷酸编码EGFP蛋白,第1954-3131位核苷酸组成EF-1α启动子,第3543-4042位核苷酸组成PGK启动子,第4059-4769位核苷酸编码mCherry蛋白,第4791-5015位核苷酸组成bGH poly(A)信号,第5054-6504位核苷酸为loxP-puro-loxP表达框区域,第6969-7233位核苷酸组成β-globin poly(A)信号,第7259-8974位核苷酸组成pCAG启动子。
质粒PB-1G 2R 3-puro-AAVS1与质粒PB-1G 2R 3-puro-ROSA26的差异仅在于:将SH1左臂替换为AAVS1安全港插入位点5’端猪基因组区域(SH2左臂,SH2左臂如SEQ ID NO:4所示)并且将SH1右臂替换为AAVS1安全港插入位点3’端猪基因组区域(SH2右臂,SH2右臂如SEQ ID NO:5所示)。
质粒PB-1G 2R 3-puro-H11与质粒PB-1G 2R 3-puro-ROSA26的差异仅在于:将SH1左臂替换为H11安全港插入位点5’端猪基因组区域(SH3左臂,SH3左臂如SEQ ID NO:6所示)并且将SH1右臂替换为H11安全港插入位点3’端猪基因组区域(SH3右臂,SH3右臂如SEQ IDNO:7所示)。
质粒PB-1G 2R 3-puro-COL1A1与质粒PB-1G 2R 3-puro-ROSA26的差异仅在于:将SH1左臂替换为COL1A1安全港插入位点5’端猪基因组区域(SH4左臂,SH4左臂如SEQ ID NO:8所示)并且将SH1右臂替换为COL1A1安全港插入位点3’端猪基因组区域(SH4右臂,SH4右臂如SEQ ID NO:9所示)。
二、猪ROSA26、AAVS1、H11、COL1A1基因组安全港位点的高效切割靶点筛选
通过前期筛选,ROSA26安全港位点的高效切割靶点为sgRNAROSA26-g3(切割效率38%),AAVS1安全港位点的高效切割靶点为sgRNAAAVS1-g4(切割效率30%)、H11安全港位点的高效切割靶点为sgRNAH11-g1(切割效率60%),COL1A1安全港位点的高效切割靶点为sgRNACOL1A1-g3(切割效率56%)。
靶点序列如下:
sgRNAROSA26-g3靶点:5’-GAAGGAGCAAACTGACATGG-3’;
sgRNAAAVS1-g4靶点:5’-TGCAGTGGGTCTTTGGGGAC-3’;
sgRNAH11-g1靶点:5’-TTCCAGGAACATAAGAAAGT-3’;
sgRNACOL1A1-g3靶点:5’-GCAGTCTCAGCAACCACTGA-3’。
三、制备安全港位点gRNA重组载体
取质粒pKG-U6gRNA,用限制性内切酶BbsI进行酶切,回收载体骨架(约3kb的线性大片段)。
分别合成ROSA26-g3-S和ROSA26-g3-A,然后混合并进行退火,得到具有粘性末端的双链DNA分子。将具有粘性末端的双链DNA分子和载体骨架连接,得到质粒pKG-U6gRNA(ROSA26-g3)。质粒pKG-U6gRNA(ROSA26-g3)表达SEQ ID NO:10所示的sgRNAROSA26-g3。
分别合成AAVS1-g4-S和AAVS1-g4-A,然后混合并进行退火,得到具有粘性末端的双链DNA分子。将具有粘性末端的双链DNA分子和载体骨架连接,得到质粒pKG-U6gRNA(AAVS1-g4)。质粒pKG-U6gRNA(AAVS1-g4)表达SEQ ID NO:11所示的sgRNAAAVS1-g4。
分别合成H11-g1-S和H11-g1-A,然后混合并进行退火,得到具有粘性末端的双链DNA分子。将具有粘性末端的双链DNA分子和载体骨架连接,得到质粒pKG-U6gRNA(H11-g1)。质粒pKG-U6gRNA(H11-g1)表达SEQ ID NO:12所示的sgRNAH11-g1。
分别合成COL1A1-g3-S和COL1A1-g3-A,然后混合并进行退火,得到具有粘性末端的双链DNA分子。将具有粘性末端的双链DNA分子和载体骨架连接,得到质粒pKG-U6gRNA(COL1A1-g3)。质粒pKG-U6gRNA(COL1A1-g3)表达SEQ ID NO:13所示的sgRNACOL1A1-g3。
ROSA26-g3-S、ROSA26-g3-A、AAVS1-g4-S、AAVS1-g4-A、H11-g1-S、H11-g1-A、COL1A1-g3-S和COL1A1-g3-A均为单链DNA分子。
ROSA26-g3-S:caccGAAGGAGCAAACTGACATGG;
ROSA26-g3-A:aaacCCATGTCAGTTTGCTCCTTC。
AAVS1-g4-S:caccgTGCAGTGGGTCTTTGGGGAC;
AAVS1-g4-A:aaacGTCCCCAAAGACCCACTGCAc。
H11-g1-S:caccgTTCCAGGAACATAAGAAAGT;
H11-g1-A:aaacACTTTCTTATGTTCCTGGAAc。
COL1A1-g3-S:caccGCAGTCTCAGCAACCACTGA;
COL1A1-g3-A:aaacTCAGTGGTTGCTGAGACTGC。
sgRNAROSA26-g3(SEQ ID NO:10):
GAAGGAGCAAACUGACAUGGguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu。
sgRNAAAVS1-g4(SEQ ID NO:11):
UGCAGUGGGUCUUUGGGGACguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu。
sgRNAH11-g1(SEQ ID NO:12):
UUCCAGGAACAUAAGAAAGUguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu。
sgRNACOL1A1-g3(SEQ ID NO:13):
GCAGUCUCAGCAACCACUGAguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu。
四、含不同安全港插入位点两侧同源臂的荧光Donor载体(即包含外源基因GFP的不同安全港位点载体)、sgRNA载体和Cas9载体(即质粒pKG-GE3)混合电转猪原代成纤维细胞及细胞GFP荧光强度检测
1、共转染
第一组(ROSA26组):将质粒PB-1G 2R 3-puro-ROSA26、质粒pKG-U6gRNA(ROSA26-g3)和质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:1.26μg质粒PB-1G 2R 3-puro-ROSA26:0.82μg质粒pKG-U6gRNA(ROSA26-g3):0.92μg质粒pKG-GE3;即3种质粒的摩尔配比依次为:1:3:1。
第二组(AAVS1组):将质粒PB-1G 2R 3-puro-AAVS1、质粒pKG-U6gRNA(AAVS1-g4)和质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:1.26μg质粒PB-1G 2R 3-puro-AAVS1:0.82μg质粒pKG-U6gRNA(AAVS1-g4):0.92μg质粒pKG-GE3;即3种质粒的摩尔配比依次为:1:3:1。
第三组(H11组):将质粒PB-1G 2R 3-puro-H11、质粒pKG-U6gRNA(H11-g1)和质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:1.26μg质粒PB-1G2R3-puro-H11:0.82μg质粒pKG-U6gRNA(H11-g1):0.92μg质粒pKG-GE3;即3种质粒的摩尔配比依次为:1:3:1。
第四组(COL1A1组):将质粒PB-1G 2R 3-puro-COL1A1、质粒pKG-U6gRNA(COL1A1-g3)和质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:1.26μg质粒PB-1G 2R 3-puro-COL1A1:0.82μg质粒pKG-U6gRNA(COL1A1-g3):0.92μg质粒pKG-GE3;即3种质粒的摩尔配比依次为:1:3:1。
第五组:猪原代成纤维细胞,同等电转参数不加任何质粒进行电转操作。
共转染采用电击转染的方式,采用哺乳动物核转染试剂盒(Neon kit,Thermofisher)与Neon TM transfection system电转仪(参数设置为:1450V、10ms、3pulse)进行转染。
2、完成步骤1后,采用完全培养液培养12-24小时,然后更换新的完全培养液进行培养。培养总时间为48小时。
3、完成步骤2后,更换为含1.5μg/mL嘌呤霉素的完全培养液培养3周(每2天更换新的含1.5μg/mL嘌呤霉素的完全培养液),持续观察并对GFP绿色荧光进行拍照,通过GFP荧光表达的强弱判断安全港位点表达外源基因效率的高低。
嘌呤霉素筛选一周后,ROSA26、COL1A1安全港位点试验组荧光强度明显强于AAVS1、H11试验组。嘌呤霉素筛选两周后,荧光强度由强到弱依次为:COL1A1>ROSA26>H11>AAVS1,其中H11组荧光强度不太均一,ROSA26组整体荧光强度较均一且荧光强度较高,AAVS1组细胞荧光表达最弱,COL1A1组荧光细胞数最多且荧光最强。嘌呤霉素继续筛选三周后,荧光强度由强到弱依次为:COL1A1>ROSA26>H11>AAVS1,照片见如图2。
五、GFP基因转录水平检测
为了比较GFP基因整合入四个不同安全港位点后mRNA转录水平的差异性,能否参与GFP的表达调控及对表达量的影响。在GFP基因外显子处设计一对引物,取步骤四中嘌呤霉素筛选三周后的细胞,提取总RNA,反转录成cDNA,检测原代细胞在四个不同安全港位点整合GFP基因后的转录水平,同时用第五组的细胞(无质粒的对照电转组)所得到的定量结果作为对照。以GAPDH基因为内参基因按照2-ΔCt法进行计算。
用于检测GFP基因的引物:F:AGATCCGCCACAACATCGAG;R:GTCCATGCCGAGAGTGATCC。
用于检测GAPDH基因的引物:F:GGTCGGAGTGAACGGATTTG;R:CCATTTGATGTTGGCGGGAT。
用SPSS统计学软件进行数据分析,以(平均数±标准差)表示,采用双因素方差分析进行统计学分析。2-ΔCt值结果显示嘌呤霉素筛选三周后AAVS1、H11组GFP表达量较低,ROSA26、COL1A1组GFP表达量较高,且COL1A1组和ROSA26组相对于AAVS1和H11组GFP转录水平差异极显著(P<0.01)。2-ΔCt值见表1,差异显著性分析结果见图3。
表1 2-ΔCt值信息
综上,根据培养细胞三周后的荧光信号强度与GFP基因实时荧光定量PCR的结果,可以得出如下结论,在ROSA26、AAVS1、H11、COL1A1这四个基因组安全港位点中,COL1A1位点插入外源基因后表达效果最好。
六、GFP基因的蛋白表达水平FACS检测
为了比较GFP基因整合入四个不同安全港位点后GFP蛋白的表达情况。分别用胰蛋白酶消化步骤四中嘌呤霉素筛选三周后的电转细胞,400g离心4min,弃上清。以1mL完全培养液重悬细胞,并将细胞悬液分别转移至流式管内。在BD FACSMelody流式细胞仪的FITC通道内检测GFP信号,收集5×104个细胞进行分析,结果见图4。
结果显示,GFP荧光信号强度COL1A1>ROSA26>H11>AAVS1。
因此,综合上述结果,COL1A1位点是ROSA26、AAVS1、H11、COL1A1四个安全港位点中最高效表达外源基因的猪原代细胞安全港位点。
实施例2、制备hACE2-hAXL-hTMPRSS2定点插入猪COL1A1安全港位点的单细胞克隆
人ACE2基因(hACE2基因)信息:编码angiotensin-converting enzyme 2[Homosapiens];位于人X号染色体;GeneID为59272。angiotensin-converting enzyme 2[Homosapiens],又称为hACE2蛋白。hACE2蛋白是目前SARS-CoV-2及SARS-CoV等冠状病毒公认的受体蛋白,如SEQ ID NO:15所示(NP_001358344.1)。
人AXL基因(hAXL基因)信息:编码AXL receptor tyrosine kinase[Homosapiens];位于人19号染色体;GeneID为558。AXL receptor tyrosine kinase[Homosapiens],又称为hAXL蛋白。hAXL蛋白是近期发现的不依赖于hACE2蛋白的独立介导SARS-CoV-2感染的蛋白,如SEQ ID NO:16所示(NP_001265528.1)。
人TMPRSS2基因(hTMPRSS2基因)信息:编码transmembrane protease serine 2[Homo sapiens];位于人21号染色体;GeneID为7113。transmembrane protease serine 2[Homo sapiens],又称为hTMPRSS2蛋白。hTMPRSS2蛋白能够通过激活SARS-CoV-2的刺突蛋白(S)从而介导SARS-CoV-2与宿主细胞受体结合,进而促进SARS-CoV-2进入宿主细胞,如SEQ ID NO:17所示(NP_001128571.1)。
一、构建pKG-hACE2-hAXL-hTMPRSS2 Donor载体
pKG-hACE2-hAXL-hTMPRSS2 Donor载体即质粒pKG-hACE2-hAXL-hTMPRSS2。
质粒pKG-hACE2-hAXL-hTMPRSS2如SEQ ID NO:14所示,为环形质粒,结构示意图见图5。SEQ ID NO:14中,第9-880位核苷酸为COL1A1安全港插入位点5’端猪基因组区域(SH4左臂),第887-1087位核苷酸为绝缘子1(Insulator 1),第1094-2271位核苷酸为人hEF1α启动子,第2290-4704位核苷酸为hACE2基因,第4705-4770位核苷酸编码P2A肽,第4771-6648位核苷酸为hAXL基因,第6649-6711位核苷酸编码T2A肽,第6712-8298位核苷酸为hTMPRSS2基因,第8302-8874位核苷酸为EF1αPoly(A),第8911-8944位核苷酸为LoxP序列,第8987-9316位核苷酸为SV40启动子,第9365-9961位核苷酸编码Puromycin蛋白(简称PuroR蛋白),第10141-10262位核苷酸为SV40 Poly(A),第10307-10340位核苷酸为LoxP序列,第10349-10549位核苷酸为绝缘子3(Insulator 3),第10550-11276位核苷酸为COL1A1安全港插入位点3’端猪基因组区域(SH4右臂)。
二、共转染
将质粒pKG-hACE2-hAXL-hTMPRSS2、质粒pKG-U6gRNA(COL1A1-g3)和质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:1.3μg质粒pKG-hACE2-hAXL-hTMPRSS2:0.8μg质粒pKG-U6gRNA(COL1A1-g3):0.9μg质粒pKG-GE3;即3种质粒的摩尔配比依次为:1:3:1。
共转染采用电击转染的方式,采用哺乳动物核转染试剂盒(Neon kit,Thermofisher)与Neon TM transfection system电转仪(参数设置为:1450V、10ms、3pulse)。
质粒pKG-GE3和质粒pKG-U6gRNA(COL1A1-g3)发挥的功能为在猪基因组DNA中制造DNA双链断裂,以提高同源重组率。质粒pKG-hACE2-hAXL-hTMPRSS2与猪基因组DNA发生同源重组,在猪基因组DNA中SH4左臂和SH4右臂之间插入外源靶基因片段(外源靶基因片段即SEQ ID NO:14中第881-10549位核苷酸所示的DNA分子)。
三、嘌呤霉素加压筛选
1、嘌呤霉素筛选hACE2-hAXL-hTMPRSS2基因插入的阳性细胞
(1)完成步骤二后,采用完全培养液培养电转后的细胞16-18小时,然后更换新的完全培养液进行培养。培养总时间为48小时。
(2)完成步骤(1)后,更换为含1.5μg/mL嘌呤霉素的完全培养液进行筛选培养(每天更换新的含1.5μg/mL嘌呤霉素的完全培养液),筛选培养的时间为3周。
筛选培养1周时,细胞出现大量死亡。
筛选培养2周时,细胞只有零星死亡,部分阳性克隆开始分裂增殖,细胞数不断增多。
筛选培养第3周的目的是使细胞内质粒降解完全以排除假阳性细胞克隆。
(3)完成步骤(2)后,收集细胞,采用不含嘌呤霉素的完全培养液恢复培养2代(每2天1代),让细胞恢复至良好状态以用于下一步的单细胞分选。
2、单细胞分选,放大培养
(1)完成步骤1后,收集细胞,使用胰蛋白酶进行消化,然后采用完全培养液中和,然后500g离心5min,弃除上清,将沉淀用1mL完全培养液重悬并适当稀释,用口吸管挑取单细胞转移到96孔板中(每孔预先加入100μl完全培养液)(每组细胞一个96孔板,每孔一个细胞),进行培养,培养2天后更换为含1.5μg/mL嘌呤霉素的完全培养液,之后每2~3天更换新的含1.5μg/mL嘌呤霉素的完全培养液,期间用显微镜观察每孔细胞生长情况,排除无细胞及非单细胞克隆的孔。
(2)待步骤(1)中的96孔板的孔中细胞长满孔底(大约2周左右),使用胰蛋白酶消化并收集细胞,其中2/3细胞接种到含有完全培养液的6孔板中,剩余的1/3细胞收集在1.5mL离心管中。
(3)待步骤(2)中的6孔板的孔中细胞长至50%丰满度时使用0.25%(Gibco)的胰蛋白酶消化并收集细胞,使用细胞冻存液(90%完全培养液+10%DMSO,体积比)将细胞冻存。
四、猪COL1A1安全港位点定点插入hACE2-hAXL-hTMPRSS2的基因组水平鉴定
为了检测猪COL1A1安全港位点是否成功定点插入了hACE2-hAXL-hTMPRSS2。取步骤三的2的(2)中的离心管,提取细胞基因组DNA,采用特异引物对(特异引物对分别为:sh4-Lr-JDF1414和sh4-Lr-JDR5965组成的引物对、sh4-Rr-JDF282和sh4-Rr-JDR4723组成的引物对、sh4-wt-JDF1085和sh4-wt-JDR1560组成的引物对)进行PCR扩增,然后进行电泳。将猪原代成纤维细胞作为野生型对照(WT)。
sh4-Lr-JDF1414和sh4-Lr-JDR5965组成的引物对用来鉴定猪COL1A1安全港插入位点5’端hACE2-hAXL-hTMPRSS2是否重组成功(靶序列为4552bp,获得约4552bp的扩增产物表示重组成功);sh4-Rr-JDF282和sh4-Rr-JDR4723组成的引物对用来鉴定猪COL1A1安全港插入位点3’端hACE2-hAXL-hTMPRSS2是否重组成功(靶序列为4442bp,获得约4442bp的扩增产物表示重组成功);sh4-wt-JDF1085和sh4-wt-JDR1560组成的引物对用来鉴定猪COL1A1安全港位点定点插入的hACE2-hAXL-hTMPRSS2表达框为纯合型还是杂合型(野生型对照的基因组DNA可扩增出476bp片段,重组细胞由于外源插入片段太大无法实现扩增;因此,如果不显示扩增产物,说明细胞为插入hACE2-hAXL-hTMPRSS2的纯合型;如果显示476bp扩增产物,说明细胞为插入hACE2-hAXL-hTMPRSS2的杂合型或者是野生型)。
sh4-Lr-JDF1414:CCTGCTGTAAGTGCCGTAGT;
sh4-Lr-JDR5965:CTAGGGGCACAGCACGTC;
sh4-Rr-JDF282:AAGTTATTAGGTCTGAAGAGGAGTTT;
sh4-Rr-JDR4723:CCCATCATTCCGTCCCAGAG;
sh4-wt-JDF1085:TGCTGAGTTCTGGCTTCCTG;
sh4-wt-JDR1560:TCTACCAAGAGAGTGACCAGCAG。
根据鉴定结果,编号为1-20号的单细胞克隆均为成功在猪COL1A1安全港位点定点插入hACE2-hAXL-hTMPRSS2的克隆,其中10、15号单细胞克隆为纯合定点插入,其他单细胞克隆为杂合定点插入。见表2。
表2 hACE2-hAXL-hTMPRSS2定点插入猪COL1A1安全港位点单细胞克隆的基因型
表2中编号为hACE2-hAXL-hTMPRSS2-1的重组细胞(杂合定点插入型)命名为1#重组细胞。
表2中编号为hACE2-hAXL-hTMPRSS2-10的重组细胞(纯合定点插入型)命名为10#重组细胞。经全基因组测序,与同一来源的猪原代成纤维细胞相比,10#重组细胞的基因组DNA的差异仅在于:在猪基因组DNA中SH4左臂和SH4右臂之间插入了SEQ ID NO:14中第881-10549位核苷酸所示的DNA分子。
五、单细胞克隆的hACE2、hAXL和hTMPRSS2基因转录水平检测
供试细胞:1#重组细胞和10#重组细胞。
取供试细胞,提取总RNA,反转录得到cDNA。将cDNA作为模板,通过荧光定量PCR检测hACE2基因、hAXL基因和hTMPRSS2基因的相对表达水平(以β-actin为内参基因按照2-ΔCt法进行计算)。用猪原代成纤维细胞作为对照(WT)。
荧光定量PCR的引物如下:
hACE2-F:CTAACCAGCCCCCTGTTTCC;
hACE2-R:GGAGGCATAAGGATTTTCTCCAC;
hAXL-F:TCAACTCCTGCCTTCTCGTG;
hAXL-R:ACCTCTTTCCACTGTTGGTTCA;
hTMPRSS2-F:TGGAAGTTCATGGGCAGCAA;
hTMPRSS2-R:GTAGAGGCGAACACACCGAT;
β-actin-F:CACGCCATCCTGCGTCTGGA;
β-actin-R:AGCACCGTGTTGGCGTAGAG。
用SPSS统计学软件进行数据分析,以(平均数±标准差)表示,采用独立样本T检验统计分析。结果见图6。2-ΔCt值结果显示,供试细胞的hACE2基因、hAXL基因和hTMPRSS2基因的表达量均显著高于对照,且纯合定点插入的单细胞克隆比杂合定点插入的单细胞克隆表达量高。结果表明,hACE2基因、hAXL基因和hTMPRSS2基因在供试细胞中均有较高程度的表达。
六、单细胞克隆hACE2、hAXL和hTMPRSS2的蛋白表达水平FACS检测
取10#重组细胞(用猪原代成纤维细胞作为对照,WT),用PBS缓冲液充分洗涤,然后用-20℃预冷的90%甲醇重悬并固定20min。固定结束后,离心,弃去固定液。加入3%BSA封闭1h。封闭结束后,离心,弃去封闭液,并用完全培养基洗涤。洗涤结束后,分别以人特异性的ACE2抗体(Novus Biologicals,NBP2-80038)(1:100稀释)、hAXL抗体(Santa CruzBiotechnology,sc-166269)(1:50稀释)和hTMPRSS2抗体(Santa Cruz Biotechnology,sc-515727)(1:50稀释)重悬细胞,室温孵育1h。抗体孵育结束后,用完全培养基充分洗涤,加入500μL完全培养基重悬细胞,并将细胞悬液转移至流式管内。在BD FACSMelody流式细胞仪的相关通道内分别检测hACE2、hAXL和hTMPRSS2抗体荧光信号,收集5×104个细胞进行分析。
结果见图7。结果显示,10#重组细胞中检测到hACE2、hAXL和hTMPRSS2的抗体荧光信号,而对照WT细胞中没有检测到hACE2、hAXL和hTMPRSS2的抗体荧光信号,说明所插入的hACE2基因、hAXL基因和hTMPRSS2基因在猪成纤维细胞中有显著的表达。
实施例3、模型猪的克隆生产及所插入的目标基因在模型猪中的表达分析
一、利用体细胞核移植技术克隆生产模型猪
1、卵母细胞体外成熟
从屠宰场采集新鲜的离体猪卵巢(卵巢来源于杜长大三元杂交母猪),卵巢在含75mg/mL青霉素和50mg/mL链霉素的0.9%氯化钠溶液中保存,于25-30℃温度下运输至实验室。
取卵巢,从直径3~6mm的卵泡中抽取卵丘卵母细胞复合体(Cumulus-oocytecomplexes,COCs),选择至少具有三层致密卵丘细胞的COCs,接种至4孔板中,每孔装有400μL猪卵母细胞体外成熟培养基,每孔接种50个,每孔覆盖400μL的矿物油(Sigma,M8410)。每次移植需培养300-400个COCs。将含COCs的培养板在38.5℃、5%CO2和饱和湿度的培养箱中培养42-44小时。
猪卵母细胞体外成熟培养基(IVM培养基):含0.1mg/mL丙酮酸、0.1mg/mL盐酸半胱氨酸、10ng/mL表皮生长因子、10%(v/v)猪卵泡液、75mg/mL青霉素、50mg/mL链霉素、10IU/mL eCG和10IU/mL hCG,余量为TCM-199培养基。
2、体细胞核移植与胚胎移植
(1)体细胞核移植(SCNT)
核供体细胞为实施例2获得的10#重组细胞。
完成步骤1后,用0.1%(w/v)透明质酸酶反复吹打去除COCs的扩张卵丘细胞。将具有完整膜且含排出的第一极体的卵母细胞在含有0.1mg/mL地美可辛、0.05M蔗糖和4mg/mL牛血清白蛋白的NCSU23培养基中培养0.5-1h,促使卵母细胞核突起,然后使用尖部倾斜的显微注射针(直径约20μm)在含有10μm HEPES、0.3%(w/v)聚乙烯吡咯烷酮、10%FBS、0.1mg/mL地美可辛和5mg/mL细胞松弛素B的Tyrode乳酸培养基中去除突起的细胞核和极体。将单个核供体细胞注入去核卵母细胞的卵周隙,使用胚胎细胞融合仪(ET3,Fuj ihiraIndustry)在融合培养基中用200V/mm的直流脉冲将核供体细胞与受体卵母细胞融合20μs。然后,将重构胚在PZM-3培养基(配方见表3)中培养2h以允许细胞核重编程,然后在激活培养基中用150V/mm的单脉冲激活100μs。然后,将激活的重构胚置于含5μg/mL细胞松弛素B的PZM-3培养基中,于38.5℃、5%CO2、5%O2、90%N2和饱和湿度的培养箱中培养2小时,以进一步激活;然后将重构胚置于PZM-3培养基中,于38.5℃、5%CO2、5%O2、90%N2和饱和湿度的培养箱中培养。大部分重构胚在激活后培养6h即可用于后续的胚胎移植。
融合培养基:含0.25M D-山梨醇、0.05mM Mg(C2H3O2)2,20mg/mL BSA和0.5mMHEPES[acid-free],余量为水。
激活培养基:含0.25M D-山梨醇,0.01mM Ca(C2H3O2)2,0.05mM Mg(C2H3O2)2和0.1mg/mL BSA,余量为水。
表3 PZM-3培养基的配方
成分 | g/50mL | |
1 | NaCl | 0.3156g |
2 | KCl | 0.0373g |
3 | KH<sub>2</sub>PO<sub>4</sub> | 0.0024g |
4 | MgSO<sub>4</sub>·7H<sub>2</sub>O | 0.0024g |
5 | NaHCO<sub>3</sub> | 0.1055g |
6 | Na-pyruvate | 0.0011g |
7 | Ca-(lactate)<sub>2</sub>·5H<sub>2</sub>O | 0.0308g |
8 | Myo-Inositol | 0.0250 |
9 | Phenol Red(母液10mg/mL) | 0.5mL母液 |
10 | L-glutamine* | 0.0073g |
11 | hypotaurine* | 0.0273g |
12 | BME essential amino acid(50×)* | 1mL |
13 | MEM non-essential amino acid(100×)* | 0.5mL |
14 | 超纯水 | 补足至50mL |
*使用前添加
(2)胚胎移植
选择4头处于发情期的9月龄二花脸母猪二花脸母猪作为重构胚的代孕母猪,将激活后培养6h的重构胚移植到代孕母猪的输卵管中,每头母猪移植300-350个重构胚。在胚胎移植后约23天,使用超声波扫描仪(HS-101V,日本本田电子)检查妊娠情况,确认受体母猪是否怀孕。4头代孕母猪中,2头怀孕。
克隆猪在胚胎移植后第116-117天左右出生。2头成功怀孕的母猪共生产获得5头克隆猪。克隆猪即为模型猪(照片见图8)。
3、野生型对照猪的制备
将同一来源的猪原代成纤维细胞代替重组细胞作为核供体细胞进行步骤2,得到4头克隆猪,即为野生型对照猪。野生型对照猪的遗传背景除不具有外源插入DNA外,其余与模型猪完全一致。
二、人源化模型猪的外源基因表达分析
1、从猪肺脏中分离肺泡巨噬细胞
(1)分离模型猪(或野生型对照猪)的完整气管和肺脏,并用含0.3%链霉素/青霉素的PBS缓冲液充分清洗,然后用PBS缓冲液清洗;然后向气管中灌入50mL无菌PBS缓冲液,轻轻拍打肺后倒出液体;然后向气管中灌入50mL无菌PBS缓冲液,轻轻拍打肺后倒出液体;然后向气管中灌入50mL无菌PBS缓冲液,轻轻拍打肺后倒出液体;合并三次获得的液体,1500g离心4分钟,弃上清,收集沉淀。
(2)加入10mL红细胞裂解液重悬步骤(1)获得的细胞沉淀,静置4分钟,然后加入20mL PBS缓冲液,1500g离心10分钟,弃上清,沉淀即为肺泡巨噬细胞。
2、人源化猪的hACE2、hAXL和hTMPRSS2基因转录水平检测
引物和方法均同实施例2的步骤五。
结果见图9。模型猪(hACE2-hAXL-hTMPRSS2-pig)的肺泡巨噬细胞中,hACE2基因的表达量为管家基因β-actin表达量的0.11倍,hAXL基因的表达量为管家基因β-actin表达量的0.06倍,hTMPRSS2基因的表达量为管家基因β-actin表达量的0.04倍,均显著高于野生型对照猪(WT-pig)肺泡巨噬细胞的相应基因的表达量。
综上,hACE2基因、hAXL基因和hTMRSS2基因在模型猪的肺泡巨噬细胞中有较强的表达。
3、人源化猪的hACE2、hAXL和hTMPRSS2蛋白表达水平FACS检测
方法同实施例2的步骤六。
结果见图10。结果显示,模型猪的肺泡巨噬细胞中分别检测到hACE2、hAXL和hTMPRSS2的抗体荧光信号,而在野生型对照猪的肺泡巨噬细胞中没有检测到相应基因的抗体荧光信号。说明,所插入的hACE2基因、hAXL基因和hTMPRSS2基因在模型猪的在肺泡巨噬细胞中被成功表达。
实施例4、模型猪的肺泡巨噬细胞对SARS-CoV-2假病毒的感染性研究
假病毒不具备复制能力,可以最大程度降低SARS病毒研究过程中的各种风险。另外,由于假病毒的感染过程与真病毒相同,因此可以模拟病毒感染的早期过程,且假病毒内携带有报告基因,可以方便地进行各种检测分析。
一、SARS-CoV-2-Spike假病毒的制备
1、构建重组质粒
质粒pMD2.G-SARS-C19的结构示意图见图11。初始质粒为pMD2.G商品质粒;改造过程:将pMD2.G质粒的VSV-G区域删除,并将SARS-CoV-2病毒的刺突蛋白(Spike,为SARS-CoV-2的膜蛋白)进行胞内C端19个氨基酸缺失突变,然后插入已删除VSV-G区域的pMD2.G载体中。质粒pMD2.G-SARS-C19如SEQ ID NO:22所示,为环形质粒。SEQ ID NO:22中,第161-540位核苷酸为CMV增强子,第541-744位核苷酸为CMV启动子,第878-1353位核苷酸为β-globin内含子,第1415-5188位核苷酸为SARS-CoV-2刺突蛋白(Spike)的编码序列,第5264-5648位核苷酸为β-globinpoly(A)signal。
质粒Lenti-mCherry的结构示意图见图12。初始质粒为商品质粒Lenti-CRISPRV2;改造过程:去除该质粒中的gRNA骨架与编码Cas9蛋白的区域,并将报告基因(mCherry基因)插入相应区域,同时保留原质粒所携带的嘌呤霉素抗性基因。这将使利用该质粒联合配套质粒所构建的假病毒的基因组带有mCherry荧光标签及嘌呤霉素抗性标签。质粒Lenti-mCherry如SEQ ID NO:23所示,为环形质粒。SEQ ID NO:23中,第2602-2813位核苷酸为EF1a核心启动子元件,第2844-3551位核苷酸编码mCherry荧光蛋白,第3567-3623位核苷酸编码P2A肽(P2A肽的氨基酸序列为“ATNFSLLKQAGDVEENPGP”,断裂位置为C端开始第一个氨基酸残基和第二个氨基酸残基之间),第3624-4220位核苷酸编码Puromycin抗性蛋白(简称PuroR蛋白),第5161-5385位核苷酸为bGHpoly(A)signal。
2、SARS-CoV-2-Spike假病毒的制备
psPAX2慢病毒包装质粒:addgene,#12260。
(1)将质粒pMD2.G-SARS-C19、质粒Lenti-mCherry与psPAX2慢病毒包装质粒按6μg:4μg:5μg的比例混合,用真空浓缩仪除去水分。
(3)完成步骤(2)后,加入24μL Lipo8000TM(碧云天,ST483)转染试剂,轻柔混合。
(4)HEK293T细胞接种至直径为10cm的细胞培养盘中,培养至细胞密度为70%-80%。
(5)将步骤(3)得到的混合物滴加至完成步骤(4)的细胞培养盘中培养6小时,然后更换新的完全培养液进行培养。培养总时间为48小时。
(6)完成步骤(5)后,收集上清液,用0.45μm滤膜过滤,收集滤液。
(7)取10mL步骤(6)得到的滤液,加入3.3ml Lenti-X Concentrator(Clontech,631231),轻柔混合,4℃放置过夜。
(8)完成步骤(7)后,4℃,1500g离心45分钟,弃上清,加入100μL DMEM培养基溶解沉淀,即为浓缩后的SARS-CoV-2假病毒病毒液,简称浓缩病毒原液。
二、假病毒感染猪肺泡巨噬细胞
供试细胞:模型猪的肺泡巨噬细胞(实施例3中制备得到)或野生型对照猪的肺泡巨噬细胞(实施例3中制备得到)。
(1)于感染实验24小时前,取供试细胞接种于96孔板中(3×104个细胞/孔)。
(2)向96孔板细胞中加入500μl浓缩病毒原液,同时加入0.8μL polybrene试剂。
(3)感染6小时后换液为新的完全培养液进行培养。
(4)感染48小时后,在倒置荧光显微镜下观察细胞的mCherry荧光信号。
结果显示,在模型猪的肺泡巨噬细胞中观察到了整合至细胞基因组中的病毒mCherry荧光信号(图13),表明SARS-CoV-2假病毒能够感染模型猪的肺泡巨噬细胞,而野生型对照猪的肺泡巨噬细胞中则未见有荧光信号(图14)。结果表明,模型猪的肺泡巨噬细胞具有感染SARS-Cov-2假病毒的能力,模型猪可作为SARS-Cov-2病毒感染动物模型。
进一步的,本发明所制备的模型猪(hACE2-hAXL-hTMPRSS2人源化猪)可用于下一步的药物筛选、药效评价、疫苗和抗体效果测试、病毒感染机制等生物医药领域的相关研究。
以上对本发明进行了详述。对于本领域技术人员来说,在不脱离本发明的宗旨和范围,以及无需进行不必要的实验情况下,可在等同参数、浓度和条件下,在较宽范围内实施本发明。虽然本发明给出了特殊的实施例,应该理解为,可以对本发明作进一步的改进。总之,按本发明的原理,本申请欲包括任何变更、用途或对本发明的改进,包括脱离了本申请中已公开范围,而用本领域已知的常规技术进行的改变。按以下附带的权利要求的范围,可以进行一些基本特征的应用。
序列表
<110> 南京启真基因工程有限公司
<120> 表达三种人源基因的SARS-CoV-2易感模型猪的构建方法及其应用
<130> GNCYX220181
<160> 23
<170> SIPOSequenceListing 1.0
<210> 1
<211> 10476
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 1
gagggcctat ttcccatgat tccttcatat ttgcatatac gatacaaggc tgttagagag 60
ataattggaa ttaatttgac tgtaaacaca aagatattag tacaaaatac gtgacgtaga 120
aagtaataat ttcttgggta gtttgcagtt ttaaaattat gttttaaaat ggactatcat 180
atgcttaccg taacttgaaa gtatttcgat ttcttggctt tatatatctt gtggaaagga 240
cgaaacaccg ggtcttcgag aagacctgtt ttagagctag aaatagcaag ttaaaataag 300
gctagtccgt tatcaacttg aaaaagtggc accgagtcgg tgcttttttc tagcgcgtgc 360
gccaattctg cagacaaatg gctctagagg tacccgttac ataacttacg gtaaatggcc 420
cgcctggctg accgcccaac gacccccgcc cattgacgtc aatagtaacg ccaataggga 480
ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 540
aagtgtatca tatgccaagt acgcccccta ttgacgtcaa tgacggtaaa tggcccgcct 600
ggcattgtgc ccagtacatg accttatggg actttcctac ttggcagtac atctacgtat 660
tagtcatcgc tattaccatg ggggcagagc gcacatcgcc cacagtcccc gagaagttgg 720
ggggaggggt cggcaattga tccggtgcct agagaaggtg gcgcggggta aactgggaaa 780
gtgatgtcgt gtactggctc cgcctttttc ccgagggtgg gggagaaccg tatataagtg 840
cagtagtcgc cgtgaacgtt ctttttcgca acgggtttgc cgccagaaca caggttggac 900
cggtgccacc atggactata aggaccacga cggagactac aaggatcatg atattgatta 960
caaagacgat gacgataaga tggcccccaa aaagaaacga aaggtgggtg ggtccccaaa 1020
gaagaagcgg aaggtcggta tccacggagt cccagcagcc gacaagaagt acagcatcgg 1080
cctggacatc ggcaccaact ctgtgggctg ggccgtgatc accgacgagt acaaggtgcc 1140
cagcaagaaa ttcaaggtgc tgggcaacac cgaccggcac agcatcaaga agaacctgat 1200
cggagccctg ctgttcgaca gcggcgaaac agccgaggcc acccggctga agagaaccgc 1260
cagaagaaga tacaccagac ggaagaaccg gatctgctat ctgcaagaga tcttcagcaa 1320
cgagatggcc aaggtggacg acagcttctt ccacagactg gaagagtcct tcctggtgga 1380
agaggataag aagcacgagc ggcaccccat cttcggcaac atcgtggacg aggtggccta 1440
ccacgagaag taccccacca tctaccacct gagaaagaaa ctggtggaca gcaccgacaa 1500
ggccgacctg cggctgatct atctggccct ggcccacatg atcaagttcc ggggccactt 1560
cctgatcgag ggcgacctga accccgacaa cagcgacgtg gacaagctgt tcatccagct 1620
ggtgcagacc tacaaccagc tgttcgagga aaaccccatc aacgccagcg gcgtggacgc 1680
caaggccatc ctgtctgcca gactgagcaa gagcagacgg ctggaaaatc tgatcgccca 1740
gctgcccggc gagaagaaga atggcctgtt cggaaacctg attgccctga gcctgggcct 1800
gacccccaac ttcaagagca acttcgacct ggccgaggat gccaaactgc agctgagcaa 1860
ggacacctac gacgacgacc tggacaacct gctggcccag atcggcgacc agtacgccga 1920
cctgtttctg gccgccaaga acctgtccga cgccatcctg ctgagcgaca tcctgagagt 1980
gaacaccgag atcaccaagg cccccctgag cgcctctatg atcaagagat acgacgagca 2040
ccaccaggac ctgaccctgc tgaaagctct cgtgcggcag cagctgcctg agaagtacaa 2100
agagattttc ttcgaccaga gcaagaacgg ctacgccggc tacattgacg gcggagccag 2160
ccaggaagag ttctacaagt tcatcaagcc catcctggaa aagatggacg gcaccgagga 2220
actgctcgtg aagctgaaca gagaggacct gctgcggaag cagcggacct tcgacaacgg 2280
cagcatcccc caccagatcc acctgggaga gctgcacgcc attctgcggc ggcaggaaga 2340
tttttaccca ttcctgaagg acaaccggga aaagatcgag aagatcctga ccttccgcat 2400
cccctactac gtgggccctc tggccagggg aaacagcaga ttcgcctgga tgaccagaaa 2460
gagcgaggaa accatcaccc cctggaactt cgaggaagtg gtggacaagg gcgcttccgc 2520
ccagagcttc atcgagcgga tgaccaactt cgataagaac ctgcccaacg agaaggtgct 2580
gcccaagcac agcctgctgt acgagtactt caccgtgtat aacgagctga ccaaagtgaa 2640
atacgtgacc gagggaatga gaaagcccgc cttcctgagc ggcgagcaga aaaaggccat 2700
cgtggacctg ctgttcaaga ccaaccggaa agtgaccgtg aagcagctga aagaggacta 2760
cttcaagaaa atcgagtgct tcgactccgt ggaaatctcc ggcgtggaag atcggttcaa 2820
cgcctccctg ggcacatacc acgatctgct gaaaattatc aaggacaagg acttcctgga 2880
caatgaggaa aacgaggaca ttctggaaga tatcgtgctg accctgacac tgtttgagga 2940
cagagagatg atcgaggaac ggctgaaaac ctatgcccac ctgttcgacg acaaagtgat 3000
gaagcagctg aagcggcgga gatacaccgg ctggggcagg ctgagccgga agctgatcaa 3060
cggcatccgg gacaagcagt ccggcaagac aatcctggat ttcctgaagt ccgacggctt 3120
cgccaacaga aacttcatgc agctgatcca cgacgacagc ctgaccttta aagaggacat 3180
ccagaaagcc caggtgtccg gccagggcga tagcctgcac gagcacattg ccaatctggc 3240
cggcagcccc gccattaaga agggcatcct gcagacagtg aaggtggtgg acgagctcgt 3300
gaaagtgatg ggccggcaca agcccgagaa catcgtgatc gaaatggcca gagagaacca 3360
gaccacccag aagggacaga agaacagccg cgagagaatg aagcggatcg aagagggcat 3420
caaagagctg ggcagccaga tcctgaaaga acaccccgtg gaaaacaccc agctgcagaa 3480
cgagaagctg tacctgtact acctgcagaa tgggcgggat atgtacgtgg accaggaact 3540
ggacatcaac cggctgtccg actacgatgt ggaccatatc gtgcctcaga gctttctgaa 3600
ggacgactcc atcgacaaca aggtgctgac cagaagcgac aagaaccggg gcaagagcga 3660
caacgtgccc tccgaagagg tcgtgaagaa gatgaagaac tactggcggc agctgctgaa 3720
cgccaagctg attacccaga gaaagttcga caatctgacc aaggccgaga gaggcggcct 3780
gagcgaactg gataaggccg gcttcatcaa gagacagctg gtggaaaccc ggcagatcac 3840
aaagcacgtg gcacagatcc tggactcccg gatgaacact aagtacgacg agaatgacaa 3900
gctgatccgg gaagtgaaag tgatcaccct gaagtccaag ctggtgtccg atttccggaa 3960
ggatttccag ttttacaaag tgcgcgagat caacaactac caccacgccc acgacgccta 4020
cctgaacgcc gtcgtgggaa ccgccctgat caaaaagtac cctaagctgg aaagcgagtt 4080
cgtgtacggc gactacaagg tgtacgacgt gcggaagatg atcgccaaga gcgagcagga 4140
aatcggcaag gctaccgcca agtacttctt ctacagcaac atcatgaact ttttcaagac 4200
cgagattacc ctggccaacg gcgagatccg gaagcggcct ctgatcgaga caaacggcga 4260
aaccggggag atcgtgtggg ataagggccg ggattttgcc accgtgcgga aagtgctgag 4320
catgccccaa gtgaatatcg tgaaaaagac cgaggtgcag acaggcggct tcagcaaaga 4380
gtctatcctg cccaagagga acagcgataa gctgatcgcc agaaagaagg actgggaccc 4440
taagaagtac ggcggcttcg acagccccac cgtggcctat tctgtgctgg tggtggccaa 4500
agtggaaaag ggcaagtcca agaaactgaa gagtgtgaaa gagctgctgg ggatcaccat 4560
catggaaaga agcagcttcg agaagaatcc catcgacttt ctggaagcca agggctacaa 4620
agaagtgaaa aaggacctga tcatcaagct gcctaagtac tccctgttcg agctggaaaa 4680
cggccggaag agaatgctgg cctctgccgg cgaactgcag aagggaaacg aactggccct 4740
gccctccaaa tatgtgaact tcctgtacct ggccagccac tatgagaagc tgaagggctc 4800
ccccgaggat aatgagcaga aacagctgtt tgtggaacag cacaagcact acctggacga 4860
gatcatcgag cagatcagcg agttctccaa gagagtgatc ctggccgacg ctaatctgga 4920
caaagtgctg tccgcctaca acaagcaccg ggataagccc atcagagagc aggccgagaa 4980
tatcatccac ctgtttaccc tgaccaatct gggagcccct gccgccttca agtactttga 5040
caccaccatc gaccggaaga ggtacaccag caccaaagag gtgctggacg ccaccctgat 5100
ccaccagagc atcaccggcc tgtacgagac acggatcgac ctgtctcagc tgggaggcga 5160
caaaaggccg gcggccacga aaaaggccgg ccaggcaaaa aagaaaaagg gcggctccaa 5220
gcggcctgcc gcgacgaaga aagcgggaca ggccaagaaa aagaaaggat ccggcgcaac 5280
aaacttctct ctgctgaaac aagccggaga tgtcgaagag aatcctggac cggtgagcaa 5340
gggcgaggag ctgttcaccg gggtggtgcc catcctggtc gagctggacg gcgacgtaaa 5400
cggccacaag ttcagcgtgt ccggcgaggg cgagggcgat gccacctacg gcaagctgac 5460
cctgaagttc atctgcacca ccggcaagct gcccgtgccc tggcccaccc tcgtgaccac 5520
cctgacctac ggcgtgcagt gcttcagccg ctaccccgac cacatgaagc agcacgactt 5580
cttcaagtcc gccatgcccg aaggctacgt ccaggagcgc accatcttct tcaaggacga 5640
cggcaactac aagacccgcg ccgaggtgaa gttcgagggc gacaccctgg tgaaccgcat 5700
cgagctgaag ggcatcgact tcaaggagga cggcaacatc ctggggcaca agctggagta 5760
caactacaac agccacaacg tctatatcat ggccgacaag cagaagaacg gcatcaaggt 5820
gaacttcaag atccgccaca acatcgagga cggcagcgtg cagctcgccg accactacca 5880
gcagaacacc cccatcggcg acggccccgt gctgctgccc gacaaccact acctgagcac 5940
ccagtccgcc ctgagcaaag accccaacga gaagcgcgat cacatggtcc tgctggagtt 6000
cgtgaccgcc gccgggatca ctctcggcat ggacgagctg tacaagggct ccggcgaggg 6060
caggggaagt cttctaacat gcggggacgt ggaggaaaat cccggcccaa ccgagtacaa 6120
gcccacggtg cgcctcgcca cccgcgacga cgtccccagg gccgtacgca ccctcgccgc 6180
cgcgttcgcc gactaccccg ccacgcgcca caccgtcgat ccggaccgcc acatcgagcg 6240
ggtcaccgag ctgcaagaac tcttcctcac gcgcgtcggg ctcgacatcg gcaaggtgtg 6300
ggtcgcggac gacggcgccg cggtggcggt ctggaccacg ccggagagcg tcgaagcggg 6360
ggcggtgttc gccgagatcg gcccgcgcat ggccgagttg agcggttccc ggctggccgc 6420
gcagcaacag atggaaggcc tcctggcgcc gcaccggccc aaggagcccg cgtggttcct 6480
ggccaccgtc ggagtctcgc ccgaccacca gggcaagggt ctgggcagcg ccgtcgtgct 6540
ccccggagtg gaggcggccg agcgcgccgg ggtgcccgcc ttcctggaga cctccgcgcc 6600
ccgcaacctc cccttctacg agcggctcgg cttcaccgtc accgccgacg tcgaggtgcc 6660
cgaaggaccg cgcacctggt gcatgacccg caagcccggt gcctgaacgc gttaagtcga 6720
caatcaacct ctggattaca aaatttgtga aagattgact ggtattctta actatgttgc 6780
tccttttacg ctatgtggat acgctgcttt aatgcctttg tatcatgcta ttgcttcccg 6840
tatggctttc attttctcct ccttgtataa atcctggttg ctgtctcttt atgaggagtt 6900
gtggcccgtt gtcaggcaac gtggcgtggt gtgcactgtg tttgctgacg caacccccac 6960
tggttggggc attgccacca cctgtcagct cctttccggg actttcgctt tccccctccc 7020
tattgccacg gcggaactca tcgccgcctg ccttgcccgc tgctggacag gggctcggct 7080
gttgggcact gacaattccg tggtgttgtc ggggaaatca tcgtcctttc cttggctgct 7140
cgcctgtgtt gccacctgga ttctgcgcgg gacgtccttc tgctacgtcc cttcggccct 7200
caatccagcg gaccttcctt cccgcggcct gctgccggct ctgcggcctc ttccgcgtct 7260
tcgccttcgc cctcagacga gtcggatctc cctttgggcc gcctccccgc gtcgacttta 7320
agaccaatga cttacaaggc agctgtagat cttagccact ttttaaaaga aaagggggga 7380
ctggaagggc taattcactc ccaacgaaga caagatctgc tttttgcttg tactgggtct 7440
ctctggttag accagatctg agcctgggag ctctctggct aactagggaa cccactgctt 7500
aagcctcaat aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac 7560
tctggtaact agagatccct cagacccttt tagtcagtgt ggaaaatctc tagcagggcc 7620
cgtttaaacc cgctgatcag cctcgactgt gccttctagt tgccagccat ctgttgtttg 7680
cccctccccc gtgccttcct tgaccctgga aggtgccact cccactgtcc tttcctaata 7740
aaatgaggaa attgcatcgc attgtctgag taggtgtcat tctattctgg ggggtggggt 7800
ggggcaggac agcaaggggg aggattggga agacaatagc aggcatgctg gggatgcggt 7860
gggctctatg gcctgcaggg gcgcctgatg cggtattttc tccttacgca tctgtgcggt 7920
atttcacacc gcatacgtca aagcaaccat agtacgcgcc ctgtagcggc gcattaagcg 7980
cggcgggtgt ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ttagcgcccg 8040
ctcctttcgc tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc 8100
taaatcgggg gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa 8160
aacttgattt gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc 8220
ctttgacgtt ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac 8280
tcaactctat ctcgggctat tcttttgatt tataagggat tttgccgatt tcggtctatt 8340
ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgt 8400
ttacaatttt atggtgcact ctcagtacaa tctgctctga tgccgcatag ttaagccagc 8460
cccgacaccc gccaacaccc gctgacgcgc cctgacgggc ttgtctgctc ccggcatccg 8520
cttacagaca agctgtgacc gtctccggga gctgcatgtg tcagaggttt tcaccgtcat 8580
caccgaaacg cgcgagacga aagggcctcg tgatacgcct atttttatag gttaatgtca 8640
tgataataat ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg cgcggaaccc 8700
ctatttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct 8760
gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat ttccgtgtcg 8820
cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg 8880
tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc gaactggatc 8940
tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca atgatgagca 9000
cttttaaagt tctgctatgt ggcgcggtat tatcccgtat tgacgccggg caagagcaac 9060
tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca gtcacagaaa 9120
agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata accatgagtg 9180
ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag ctaaccgctt 9240
ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg gagctgaatg 9300
aagccatacc aaacgacgag cgtgacacca cgatgcctgt agcaatggca acaacgttgc 9360
gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta atagactgga 9420
tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct ggctggttta 9480
ttgctgataa atctggagcc ggtgagcgtg gaagccgcgg tatcattgca gcactggggc 9540
cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag gcaactatgg 9600
atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat tggtaactgt 9660
cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt taatttaaaa 9720
ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt 9780
cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt 9840
ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt 9900
tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga 9960
taccaaatac tgttcttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag 10020
caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata 10080
agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg 10140
gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga 10200
gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca 10260
ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa 10320
acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt 10380
tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac 10440
ggttcctggc cttttgctgg ccttttgctc acatgt 10476
<210> 2
<211> 3120
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 2
gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt 60
cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt 120
tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat 180
aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt 240
ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg 300
ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga 360
tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc 420
tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac 480
actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg 540
gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca 600
acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg 660
gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg 720
acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg 780
gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag 840
ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg 900
gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct 960
cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac 1020
agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact 1080
catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga 1140
tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 1200
cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct 1260
gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc 1320
taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgttc 1380
ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 1440
tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 1500
ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 1560
cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 1620
agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 1680
gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 1740
atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 1800
gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1860
gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1920
ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1980
cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc 2040
cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca 2100
acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc 2160
cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg 2220
accatgatta cgccaagctt gcatgcaggc ctctgcagtc gacgggcccg ggatccgatg 2280
ataaacatgt gagggcctat ttcccatgat tccttcatat ttgcatatac gatacaaggc 2340
tgttagagag ataattggaa ttaatttgac tgtaaacaca aagatattag tacaaaatac 2400
gtgacgtaga aagtaataat ttcttgggta gtttgcagtt ttaaaattat gttttaaaat 2460
ggactatcat atgcttaccg taacttgaaa gtatttcgat ttcttggctt tatatatctt 2520
gtggaaagga cgaaacaccg ggtcttcgag aagacctgtt ttagagctag aaatagcaag 2580
ttaaaataag gctagtccgt tatcaacttg aaaaagtggc accgagtcgg tgcttttttc 2640
tagcgcgtgc gccaattctg cagacaaatg gctctagagg tacccataga tctagatgca 2700
ttcgcgaggt accgagctcg aattcactgg ccgtcgtttt acaacgtcgt gactgggaaa 2760
accctggcgt tacccaactt aatcgccttg cagcacatcc ccctttcgcc agctggcgta 2820
atagcgaaga ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg aatggcgaat 2880
ggcgcctgat gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcatatggt 2940
gcactctcag tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa 3000
cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg 3060
tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga 3120
<210> 3
<211> 14138
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 3
ggcgcgccct ctacctgctc tcggacccgt gggggtgggg ggtggaggaa ggagtggggg 60
gtcggtcctg ctggcttgtg ggtgggaggc gcatgttctc caaaaacccg cgcgagctgc 120
aatcctgagg gagctgcagt ggaggaggcg gagagaaggc cgcacccttc tccgcagggg 180
gaggggagtg ccgcaatacc tttatgggag ttctctgctg cctccttttc ctaaggaccg 240
ccctgggcct agaaaaatcc ctccctcccc cgcgatctcg tcatcgcctc catgtcagtt 300
tgctccttct cgattatggg cgggattctt ttgccctggc gcgccccaga cccgggcctg 360
gggggcaagt cggggggcgg ggggaggtcg ggcagggtcc cctgggagga tggggacgtg 420
ctgtgcccct agcggccacc agagggcacc aggacaccac tgcggtcggc tcagcggctc 480
ctgccctggt cagggggcgc caggtcctgc ccctcctggg gagggcgggg ggcgagaagg 540
gcgattttaa ttaacccacg tttcaacatg cacatcccag taatttggaa acattttgtt 600
tccaaagatt cacttaacat tggtttagca acatgaagct ttctatgcaa cccaaggact 660
cagtttttgg cctgttttag tgacaggcaa tcagcaacat gctgcatttc tctccagtgt 720
tgtaatcaaa gaaaccctcc catagcttta aatgatattc cttccccttc caattatgtg 780
gggggaaaac aaccctattc tccacccaga agtgttaact caagaattac attttcaaga 840
agtttccaga ttcgtaaaac cagaattaga tgtctttcac ctaaatgtct cggtgttgac 900
caaaggaaca cacaggtttc tcatttaact tttttaatgg gtctcaaaat tctgtgacaa 960
atttttggtc aagttgtttc cattaaaaag tactgatttt aaaaactaat aacttaaaac 1020
tgccacacgc aaaaaagaaa accaaagtgg tccacaaaac attctccttt ccttctgaag 1080
gttttacgat gcattgttat cattaaccag tcttttacta ctaaacttaa atggccaatt 1140
gaaacaaaca gttctgagac cgttcttcca ccactgatta agagtggggt ggcaggtatt 1200
agggataatg ctagcttact tgtacagctc gtccatgccg agagtgatcc cggcggcggt 1260
cacgaactcc agcaggacca tgtgatcgcg cttctcgttg gggtctttgc tcagggcgga 1320
ctgggtgctc aggtagtggt tgtcgggcag cagcacgggg ccgtcgccga tgggggtgtt 1380
ctgctggtag tggtcggcga gctgcacgct gccgtcctcg atgttgtggc ggatcttgaa 1440
gttcaccttg atgccgttct tctgcttgtc ggccatgata tagacgttgt ggctgttgta 1500
gttgtactcc agcttgtgcc ccaggatgtt gccgtcctcc ttgaagtcga tgcccttcag 1560
ctcgatgcgg ttcaccaggg tgtcgccctc gaacttcacc tcggcgcggg tcttgtagtt 1620
gccgtcgtcc ttgaagaaga tggtgcgctc ctggacgtag ccttcgggca tggcggactt 1680
gaagaagtcg tgctgcttca tgtggtcggg gtagcggctg aagcactgca cgccgtaggt 1740
cagggtggtc acgagggtgg gccagggcac gggcagcttg ccggtggtgc agatgaactt 1800
cagggtcagc ttgccgtagg tggcatcgcc ctcgccctcg ccggacacgc tgaacttgtg 1860
gccgtttacg tcgccgtcca gctcgaccag gatgggcacc accccggtga acagctcctc 1920
gcccttgctc accatggtgg cgtcgaccgt acgtcacgac acctgaaatg gaagaaaaaa 1980
actttgaacc actgtctgag gcttgagaat gaaccaagat ccaaactcaa aaagggcaaa 2040
ttccaaggag aattacatca agtgccaagc tggcctaact tcagtctcca cccactcagt 2100
gtggggaaac tccatcgcat aaaacccctc cccccaacct aaagacgacg tactccaaaa 2160
gctcgagaac taatcgaggt gcctggacgg cgcccggtac tccgtggagt cacatgaagc 2220
gacggctgag gacggaaagg cccttttcct ttgtgtgggt gactcacccg cccgctctcc 2280
cgagcgccgc gtcctccatt ttgagctccc tgcagcaggg ccgggaagcg gccatctttc 2340
cgctcacgca actggtgccg accgggccag ccttgccgcc cagggcgggg cgatacacgg 2400
cggcgcgagg ccaggcacca gagcaggccg gccagcttga gactaccccc gtccgattct 2460
cggtggccgc gctcgcaggc cccgcctcgc cgaacatgtg cgctgggacg cacgggcccc 2520
gtcgccgccc gcggccccaa aaaccgaaat accagtgtgc agatcttggc ccgcatttac 2580
aagactatct tgccagaaaa aaagcgtcgc agcaggtcat caaaaatttt aaatggctag 2640
agacttatcg aaagcagcga gacaggcgcg aaggtgccac cagattcgca cgcggcggcc 2700
ccagcgccca ggccaggcct caactcaagc acgaggcgaa ggggctcctt aagcgcaagg 2760
cctcgaactc tcccacccac ttccaacccg aagctcggga tcaagaatca cgtactgcag 2820
ccagtggaag taattcaagg cacgcaaggg ccataacccg taaagaggcc aggcccgcgg 2880
gaaccacaca cggcacttac ctgtgttctg gcggcaaacc cgttgcgaaa aagaacgttc 2940
acggcgacta ctgcacttat atacggttct cccccaccct cgggaaaaag gcggagccag 3000
tacacgacat cactttccca gtttaccccg cgccaccttc tctaggcacc ggttcaattg 3060
ccgacccctc cccccaactt ctcggggact gtgggcgatg tgcgctctgc ccactgacgg 3120
gcaccggagc cctagattcg attccctttg gggcaaaact caccgcctaa tcccctataa 3180
ctctaccggg gagcccggtg gagagcagac gggctgacgc tgccacctgc cggccatccc 3240
aggataggac cgccgtattc aagtcgccct caggaaggac cctcggggca ccagaggcct 3300
tcgaagcccc aatgagtgag gcaactgagg gtcgcgggtg ccattacaag gcccagccaa 3360
ggcctagagc caaggcttga accgtggggg acccccaagc cccacctgcc caggaacagc 3420
agacactggg acactttgtt tcaggtcctg cccaggcccc tcccactgtg aggctgggat 3480
ttgtcgccca gggtgcagat gagaagagtg gggaaagcag tcctgagcca ggaaattcta 3540
ccgggtaggg gaggcgcttt tcccaaggca gtctggagca tgcgctttag cagccccgct 3600
gggcacttgg cgctacacaa gtggcctctg gcctcgcaca cattccacat ccaccggtag 3660
gcgccaaccg gctccgttct ttggtggccc cttcgcgcca ccttctactc ctcccctagt 3720
caggaagttc ccccccgccc cgcagctcgc gtcgtgcagg acgtgacaaa tggaagtagc 3780
acgtctcact agtctcgtgc agatggacag caccgctgag caatggaagc gggtaggcct 3840
ttggggcagc ggccaatagc agctttgctc cttcgctttc tgggctcaga ggctgggaag 3900
gggtgggtcc gggggcgggc tcaggggcgg gctcaggggc ggggcgggcg cccgaaggtc 3960
ctccggaggc ccggcattct gcacgcttca aaagcgcacg tctgccgcgc tgttctcctc 4020
ttcctcatct ccgggccttt cgacctccta gggccaccat ggtgagcaag ggcgaggacg 4080
acaacatggc catcatcaag gagttcatgc gcttcaaggt gcacatggag ggctccgtga 4140
acggccacga gttcgagatc gagggcgagg gcgagggccg cccctacgag ggcacccaga 4200
ccgccaagct gaaggtgacc aagggcggcc ccctgccctt cgcctgggac atcctgtccc 4260
ctcagttcat gtacggctcc aaggcctacg tgaagcaccc cgccgacatc cccgactact 4320
tgaagctgtc cttccccgag ggcttcaagt gggagcgcgt gatgaacttc gaggacggcg 4380
gcgtggtgac cgtgacccag gactcctccc tgcaggacgg cgagttcatc tacaaggtga 4440
agctgcgcgg caccaacttc ccctccgacg gccccgtaat gcagaagaag accatgggct 4500
gggaggcctc ctccgagcgg atgtaccccg aggacggcgc cctgaagggc gagatcaagc 4560
agaggctgaa gctgaaggac ggcggccact acgacgccga ggtcaagacc acctacaagg 4620
ccaagaagcc cgtgcagctg cccggcgcct acaacgtcaa catcaagctg gacatcacct 4680
cccacaacga ggactacacc atcgtggaac agtacgagcg cgccgagggc cgccactcca 4740
ccggcggcat ggacgagctg tacaagtgag gatccgctga tcagcctcga ctgtgccttc 4800
tagttgccag ccatctgttg tttgcccctc ccccgtgcct tccttgaccc tggaaggtgc 4860
cactcccact gtcctttcct aataaaatga ggaaattgca tcgcattgtc tgagtaggtg 4920
tcattctatt ctggggggtg gggtggggca ggacagcaag ggggaggatt gggaagacaa 4980
tagcaggcat gctggggatg cggtgggctc tatggcttct gaggcggaaa gaacccttct 5040
gaggcggaaa gaaccagctg ccttaatata acttcgtata atgtatgcta tacgaagtta 5100
ttaggtctga agaggagttt acgtccagcc aattctgtgg aatgtgtgtc agttagggtg 5160
tggaaagtcc ccaggctccc cagcaggcag aagtatgcaa agcatgcatc tcaattagtc 5220
agcaaccagg tgtggaaagt ccccaggctc cccagcaggc agaagtatgc aaagcatgca 5280
tctcaattag tcagcaacca tagtcccgcc cctaactccg cccatcccgc ccctaactcc 5340
gcccagttcc gcccattctc cgccccatgg ctgactaatt ttttttattt atgcagaggc 5400
cgaggccgcc tctgcctctg agctattcca gaagtagtga ggaggctttt ttggaggcct 5460
aggcttttgc aaaaagctcc cgggagcttg tatatccatt ttcggcggcc gcgccaccat 5520
gaccgagtac aagcccacgg tgcgcctcgc cacccgcgac gacgtcccca gggccgtacg 5580
caccctcgcc gccgcgttcg ccgactaccc cgccacgcgc cacaccgtcg atccggaccg 5640
ccacatcgag cgggtcaccg agctgcaaga actcttcctc acgcgcgtcg ggctcgacat 5700
cggcaaggtg tgggtcgcgg acgacggcgc cgcggtggcg gtctggacca cgccggagag 5760
cgtcgaagcg ggggcggtgt tcgccgagat cggcccgcgc atggccgagt tgagcggttc 5820
ccggctggcc gcgcagcaac agatggaagg cctcctggcg ccgcaccggc ccaaggagcc 5880
cgcgtggttc ctggccaccg tcggagtctc gcccgaccac cagggcaagg gtctgggcag 5940
cgccgtcgtg ctccccggag tggaggcggc cgagcgcgcc ggggtgcccg ccttcctgga 6000
gacctccgcg ccccgcaacc tccccttcta cgagcggctc ggcttcaccg tcaccgccga 6060
cgtcgaggtg cccgaaggac cgcgcacctg gtgcatgacc cgcaagcccg gtgcctgaga 6120
attcgcggga ctctggggtt cgaaatgacc gaccaagcga cgcccaacct gccatcacga 6180
gatttcgatt ccaccgccgc cttctatgaa aggttgggct tcggaatcgt tttccgggac 6240
gccggctgga tgatcctcca gcgcggggat ctcatgctgg agttcttcgc ccaccccaac 6300
ttgtttattg cagcttataa tggttacaaa taaagcaata gcatcacaaa tttcacaaat 6360
aaagcatttt tttcactgca ttctagttgt ggtttgtcca aactcatcaa tgtatcttat 6420
catgtctgta taccgctcga ctagagcttg cggaaccctt aatataactt cgtataatgt 6480
atgctatacg aagttattag gtccgctggc catctacgag ccaaagactt tcaaatcttt 6540
ggctgccttg gccagtagga ggcgacacga aggatttgct gctgccttgg gggatgggaa 6600
ggaacctgaa ggcatttttt ccagagtggt gcagtaccac tgaggactgt tgctgtattg 6660
attaggaaaa gagacagagt aatttgcagt ttgtttgatt tatactgggc tgcaggtcga 6720
gggatcttca taagagaaga gggacagcta tgactgggag tagtcaggag aggaggaaaa 6780
atctggctag taaaacatgt aaggaaaatt ttagggatgt taaagaaaaa aataacacaa 6840
aacaaaatat aaaaaaaatc taacctcaag tcaaggcttt tctatggaat aaggaatgga 6900
cagcaggggg ctgtttcata tactgatgac ctctttatag ccacctttgt tcatggcagc 6960
cagcatatgg catatgttgc caaactctaa accaaatact cattctgatg ttttaaatga 7020
tttgccctcc catatgtcct tccgagtgag agacacaaaa aattccaaca cactattgca 7080
atgaaaataa atttccttta ttagccagaa gtcagatgct caaggggctt catgatgtcc 7140
ccataatttt tggcagaggg aaaaagatct cagtggtatt tgtgagccag ggcattggcc 7200
acaccagcca ccaccttctg ataggcagcc tgcggtacct tacatggtgg cgaattcgtt 7260
tgccaaaatg atgagacagc acaataacca gcacgttgcc caggagctgt aggaaaaaga 7320
agaaggcatg aacatggtta gcagaggctc tagagccgcc ggtcacacgc cagaagccga 7380
accccgccct gccccgtccc ccccgaaggc agccgtcccc ctgcggcagc cccgaggctg 7440
gagatggaga aggggacggc ggcgcggcga cgcacgaagg ccctccccgc ccatttcctt 7500
cctgccggcg ccgcaccgct tcgcccgcgc ccgctagagg gggtgcggcg gcgcctccca 7560
gatttcggct ccgccagatt tgggacaaag gaagtccctg cgccctctcg cacgattacc 7620
ataaaaggca atggctgcgg ctcgccgcgc ctcgacagcc gccggcgctc cggggccgcc 7680
gcgcccctcc cccgagccct ccccggcccg aggcggcccc gccccgcccg gcacccccac 7740
ctgccgccac cccccgcccg gcacggcgag ccccgcgcca cgccccgcac ggagccccgc 7800
acccgaagcc gggccgtgct cagcaactcg gggagggggg tgcagggggg ggttacagcc 7860
cgaccgccgc gcccacaccc cctgctcacc cccccacgca cacaccccgc acgcagcctt 7920
tgttcccctc gcagcccccc cgcaccgcgg ggcaccgccc ccggccgcgc tcccctcgcg 7980
cacacgcgga gcgcacaaag ccccgcgccg cgcccgcagc gctcacagcc gccgggcagc 8040
gcgggccgca cgcggcgctc cccacgcaca cacacacgca cgcacccccc gagccgctcc 8100
cccccgcaca aagggccctc ccggagccct ttaaggcttt cacgcagcca cagaaaagaa 8160
acgagccgtc attaaaccaa gcgctaatta cagcccggag gagaagggcc gtcccgcccg 8220
ctcacctgtg ggagtaacgc ggtcagtcag agccggggcg ggcggcgcga ggcggcgcgg 8280
agcggggcac ggggcgaagg caacgcagcg actcccgccc gccgcgcgct tcgcttttta 8340
tagggccgcc gccgccgccg cctcgccata aaaggaaact ttcggagcgc gccgctctga 8400
ttggctgccg ccgcacctct ccgcctcgcc ccgccccgcc cctcgccccg ccccgccccg 8460
cctggcgcgc gccccccccc cccccgcccc catcgctgca caaaataatt aaaaaataaa 8520
taaatacaaa attgggggtg gggagggggg ggagatgggg agagtgaagc agaacgtggg 8580
gctcacctcg acccatggta atagcgatga ctaatacgta gatgtactgc caagtaggaa 8640
agtcccataa ggtcatgtac tgggcataat gccaggcggg ccatttaccg tcattgacgt 8700
caataggggg cgtacttggc atatgataca cttgatgtac tgccaagtgg gcagtttacc 8760
gtaaatagtc cacccattga cgtcaatgga aagtccctat tggcgttact atgggaacat 8820
acgtcattat tgacgtcaat gggcgggggt cgttgggcgg tcagccaggc gggccattta 8880
ccgtaagtta tgtaacgcgg aactccatat atgggctatg aactaatgac cccgtaattg 8940
attactatta ataactagtc aataatcaat gtcgtaaatg tcgtaaatgt ctcagctagt 9000
caggtagtaa aaggtgtcaa ctaggcagtg gcagagcagg attcaaattc agggctgttg 9060
tgatgcctcc gcagactctg agcgccacct ggtggtaatt tgtctgtgcc tcttctgacg 9120
tggaagaaca gcaactaaca cactaacacg gcatttacta tgggccagcc attgtacgcg 9180
ttgcttaacc tgattcttgg gcgttgtcct gcaggggatt gagcaggtgt acgaggacga 9240
gcccaatttc tctatattcc cacagtcttg agtttgtgtc acaaaataat tatagtgggg 9300
tggagatggg aaatgagtcc aggcaacacc taagcctgat tttatgcatt gagactgcgt 9360
gttattacta aagatctttg tgtcgcaatt tcctgatgaa gggagatagg ttaaaaagca 9420
cggatctact gagttttaca gtcatcccat ttgtagactt ttgctacacc accaaagtat 9480
agcatctgag attaaatatt aatctccaaa ccttaggccc cctcacttgc atccttacgg 9540
tcagataact ctcactcata ctttaagccc attttgtttg ttgtacttgc tcatccagtc 9600
ccagacatag cattggcttt ctcctcacct gttttaggta gccagcaagt catgaaatca 9660
gataagttcc accaccaatt aacactaccc atcttgagca taggcccaac agtgcattta 9720
ttcctcattt actgatgttc gtgaatattt accttgattt tcattttttt ctttttctta 9780
agctgggatt ttactcctga ccctattcac agtcagatga tcttgactac cactgcgatt 9840
ggacctgagg ttcagcaata ctccccttta tgtcttttga atacttttca ataaatctgt 9900
ttgtattttc attagttagt aactgagctc agttgccgta atgctaatag cttccaaact 9960
agtgtctctg tctccagtat ctgataaatc ttaggtgttg ctgggacagt tgtcctaaaa 10020
ttaagataaa gcatgaaaat aactgacaca actccattac tggctcctaa ctacttaaac 10080
aatgcattct atcatcacaa atgtgaaaaa ggagttccct cagtggacta accttatctt 10140
ttctcaacac ctttttcttt gcacaatttt ccacacatgc ctacaaaaag tacttatgcg 10200
gccgccataa aagttttgtt actttataga agaaattttg agtttttgtt ttttttaata 10260
aataaataaa cataaataaa ttgtttgttg aatttattat tagtatgtaa gtgtaaatat 10320
aataaaactt aatatctatt caaattaata aataaacctc gatatacaga ccgataaaac 10380
acatgcgtca attttacaca tgattatctt taacgtacgt cacaatatga ttatctttct 10440
agggttaatc tagctgcgtg ttctgcagcg tgtcgagcat cttcatctgc tccatcacgc 10500
tgtaaaacac atttgcaccg cgagtctgcc cgtcctccac gggttcaaaa acgtgaatga 10560
acgaggcgcg ctcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta 10620
cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat agcgaagagg 10680
cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg gacgcgccct 10740
gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg 10800
ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg 10860
gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac 10920
ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct 10980
gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt 11040
tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta taagggattt 11100
tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt 11160
ttaacaaaat attaacgctt acaatttagg tggcactttt cggggaaatg tgcgcggaac 11220
ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc 11280
ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt 11340
cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc cagaaacgct 11400
ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcgaactgga 11460
tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc caatgatgag 11520
cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg ggcaagagca 11580
actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga 11640
aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag 11700
tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg agctaaccgc 11760
ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac cggagctgaa 11820
tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg caacaacgtt 11880
gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat taatagactg 11940
gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg ctggctggtt 12000
tattgctgat aaatctggag ccggtgagcg tggttcacgc ggtatcattg cagcactggg 12060
gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc aggcaactat 12120
ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact 12180
gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt tttaatttaa 12240
aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt aacgtgagtt 12300
ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt 12360
ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg 12420
tttgccggat caagagctac caactctttt tccgaaggta actggcttca gcagagcgca 12480
gataccaaat actgtccttc tagtgtagcc gtagttaggc caccacttca agaactctgt 12540
agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga 12600
taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc 12660
gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact 12720
gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga gaaaggcgga 12780
caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg 12840
aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt 12900
tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt 12960
acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga 13020
ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc gcagccgaac 13080
gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcccaatac gcaaaccgcc 13140
tctccccgcg cgttggccga ttcattaatg cagctggcac gacaggtttc ccgactggaa 13200
agcgggcagt gagcgcaacg caattaatgt gagttagctc actcattagg caccccaggc 13260
tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca 13320
cacaggaaac agctatgacc atgattacgc caagcgcgcc cgccgggtaa ctcacggggt 13380
atccatgtcc atttctgcgg catccagcca ggatacccgt cctcgctgac gtaatatccc 13440
agcgccgcac cgctgtcatt aatctgcaca ccggcacggc agttccggct gtcgccggta 13500
ttgttcgggt tgctgatgcg cttcgggctg accatccgga actgtgtccg gaaaagccgc 13560
gacgaactgg tatcccaggt ggcctgaacg aacagttcac cgttaaaggc gtgcatggcc 13620
acaccttccc gaatcatcat ggtaaacgtg cgttttcgct caacgtcaat gcagcagcag 13680
tcatcctcgg caaactcttt ccatgccgct tcaacctcgc gggaaaaggc acgggcttct 13740
tcctccccga tgcccagata gcgccagctt gggcgatgac tgagccggaa aaaagacccg 13800
acgatatgat cctgatgcag ctagattaac cctagaaaga tagtctgcgt aaaattgacg 13860
catgcattct tgaaatattg ctctctcttt ctaaatagcg cgaatccgtc gctgtgcatt 13920
taggacatct cagtcgccgc ttggagctcc cgtgaggcgt gcttgtcaat gcggtaagtg 13980
tcactgattt tgaactataa cgaccgcgtg agtcaaaatg acgcatgatt atcttttacg 14040
tgacttttaa gatttaactc atacgataat tatattgtta tttcatgttc tacttacgtg 14100
ataacttatt atatatatat tttcttgtta tagatatc 14138
<210> 4
<211> 1069
<212> DNA
<213> Sus scrofa
<400> 4
gtgctgagtc cttttcccat cccacccacc tggagctccc ctcttccagt cctgagccac 60
ttgaactggc ctggtttttg ccatcctgcg ctgccctctc tccggactcg agccactgct 120
gagggcctca ggccagtcca tcctcgtctt gtctctttcg ccctgctctt tccccacctt 180
gagcgctctt aaccagcctg gcccgtgcca cctctactct gccatcgaat gctgccccac 240
tttctcgagt ccgccacttc tcccagcttc accggtaccc actgtttccc ctagtccagg 300
caggtaccac tttccctgag cgtcctcctc ctctctcctg ggcctgtgct gcttcttttc 360
ccgctctctg gcctgggccg tttcttcggc cagcccccga gccttccatg ccctttcctt 420
caggtttctg ctcttcatcc ttggtctctg ccatctgttg ccatgtaagg gtgctctttc 480
ctgagccatc gccctcaagg cgctctgctc ctcaagtgga tgcttccctc gcctggctca 540
cctcctgctc tctctcctgc ccccttcacc tgcgtgccct cctcattctc cctctgtgcc 600
acctctggcc ttgcactgta ggctctctct tggggatgtt tctccttctc cacacacttc 660
tctttcactc tgtcctcttg ctttgtgtgg gcctgcagcg ttaccctttt ttctgggcac 720
actcagagca ccctcctctt tctggttctg ggccacctgt ctgtcctcgg gtcatcttgc 780
tctctctgcc tggatgccct cctgtggctt tgggcagctt ctccctcctt cagagtgcac 840
cgccagttct cctaggcccg gtcacttccc cttcccaggg gacctagagc cctgctaggt 900
cctctctctc cacaacctgg gcccccaaac ctttccaaaa caccttgctt tctgcctcca 960
ttggtcttgt gttccagagc cagagtcact atatgtccca gaaccaggat tccctctggt 1020
tctgagggct tttatcgcat cccctgcctg gctgcagtgg gtctttggg 1069
<210> 5
<211> 260
<212> DNA
<213> Sus scrofa
<400> 5
gacaggccac agaagagcct ctactcctcc ctctgtcccc gaggctgtct ccctcccagt 60
cttcccagct caggccagtc cccaggcctc tcttccctgc cagagcccgt caggttcggt 120
tactttgggg cccagagagg accctgtgaa ggaagcgtgg gtaggggcac gggaatgggg 180
aggatgcctg aagaggcccc cttagccaga agaggagcag aagaggagca ggtacccaga 240
agaggagcag ttcagggaaa 260
<210> 6
<211> 540
<212> DNA
<213> Sus scrofa
<400> 6
aaatacccac gtttattggg acaaaagttg ttagggaaaa tggggcctca gagttatgat 60
tcaagtcata attctttcca tttataattt cactcgagac tctgttaact gattccttgt 120
gtgttgtatc ttactcctca gctcacaatt acttttagtt attcacctta actgtatgaa 180
taacagtgga gaaaaggatt ctaccagaat actctaatta tggttttgag tcccctttcc 240
agactgaaga tttttcagtc tttttgatct gaggtgattt ttcagtcttt tcgatctgag 300
gtgacagtct caagctcctc aattcaccca gtctcttgat acttgtccat ttagggccac 360
caaagctact ttgacttcat actagagagt caattaatga ggccattctc tgatggacag 420
gtgaagcagg caaggtgact atattttgac taaacggtag aaaacagcct gagtgttaac 480
agtgtagcct ataaaaccca gagctgccca ccctgatcta aacttccagg aacataagaa 540
<210> 7
<211> 1009
<212> DNA
<213> Sus scrofa
<400> 7
agtaggtcac atttcagtaa aacctggctt tgtggattga gcatggtctg tctcttcctg 60
gtacttcatt agtcccctaa gtgggatttg ctgagcaaga ctcctcaatt acagaaatac 120
tccagtttag aattctcgca aaggcttttt gtttccacaa gtagaatcta gaaagcaatc 180
tcaagtaaca acagcagaga cctgaatccc aatccatctt tcctgtgtgt cctcttttac 240
ctccttccct ttcatgttga accaacagtc ctttttcagt ctagaagcta gtacgaaaga 300
aatgtacaga tgtaggtacc aagcaaagcc attagccaat aactggtgag atggagctaa 360
gaggaaataa aagtgttcct aagaatagca cagcagaagc tagatccaca gatcttaaaa 420
caattttggt tgagtaagag tagaggcaaa agaggaagct aataatgcag tttttaggag 480
ctaagagcca gataaagggt aagggcagga ggaagtgcta tctcagctaa cgagatacat 540
gaaacaacgg tggaagtcca gcaggcacaa gatgagttga gaagcaatca gggccagaag 600
gatgtgcaag gcctcaaaat aaaaaagcac agggccacag ggaaccttat ggaaattaaa 660
aggaagagga tgcagtcagg agaggaaaaa atagtgctcc ctcccccatg cccaaggaag 720
cagctgagca gccagtactt gggaagttag tagtaataag ttggtaagag ggagttctgt 780
tcgtggctca atggttaaca aatcagacta gaaaccgtga ggttgcgggt ttgatccctg 840
gccttgctca gtgggttaag gatccggcat tgccgtgacc tgtggtgtag gtcacagacg 900
tggctcagtt cccgcattcc tgtggctctg gtgtaggctg gtggctacag ctctgattag 960
acccctaggc tgggaacctc catatgccct ggaagtggcc gtagaaaag 1009
<210> 8
<211> 872
<212> DNA
<213> Sus scrofa
<400> 8
ggatggggac tcatgtgaat tttctaaagg tgctatttaa acggggggca cgagtgccgg 60
ctttggacag ggccgctcgc tctccaccct ttcttcttcc ccctcggccg cctctcaccc 120
cctgaggcct ctctcccccc acgacctcct ctctctcctc tgaaaccctc tcctcctcag 180
ctgcatccca ccctcgtggc ctctctctct ctctgtctgt cctgtgtcct ctctcactgg 240
gtttcagagc acagatgccc aaagcacaaa agcagttttc ccctggggtg ggaggaagca 300
agagactttg tacctatttt gtatgtgtat aataatttga gatgttttta attattttga 360
ttgctggaat aaagcatgtg gaaatgaccc aaaccaatct tgcactggcc tcctgatttc 420
cttccttgga gacggaggga gggggagacc tgggggaggg cgcttggggg ggggtgggct 480
ctcttctttc tgcgctcccc ccccccacct ccaacacctt gacgacccct cctgcttccg 540
cttgcctttc tcaggcttta acactttctc ctcgccctct cagcatgcgc atgcgcgtgc 600
ctctacctcc cccgcacatc ctggcctgcc caccctgaat ggcctggccc agcgatgcca 660
ccaactctct cgctccgtcc acggctgggg aggggggcac tctgcagggt tggggggcac 720
tgggaggctg ggttgggtga gggaggggtg cctgggcccc caccccccag caagttctct 780
ccctaggcga actggagggt cgtctggcct cttgagcctt gttgctggct ctgagctcta 840
ccaagagagt gaccagcagg accgcaccat ca 872
<210> 9
<211> 727
<212> DNA
<213> Sus scrofa
<400> 9
gtggttgctg agactgcgtg ggggcccaag gagacctgga gaaaggaatg cttcctgctc 60
cttcttctgg ggccccagga gagccttccc agggccttgg agaggtgctg tccagggact 120
aaccctgtgc tctaggaagg ctgcaggccc tgaccagctg ggcaggtcct gggtccctcc 180
tggccttcta agttccccaa acatgagacc tctgggtgtg gggtggcctg gggaggtcat 240
tttgcccagg ccctacctcc tgcccattcc taaccctttt taaaaatctg tgcgtcctct 300
tcttccttct tctccctccc ttcccttttc gctcaccctc tgctgctggc ctgagagccg 360
gaggccccca gggggaaggc gactggtctc ctccccagtc tcagggaagg gagacagaga 420
atccaggaag ccagaactca gcagacgaag cacccaggga cctagagatg ggttgaaaag 480
ttgacagctg tcccacctgc ctcccaaggt ctcagggcct aaacctccaa ggcaggaaag 540
gcccctgtcc ctccctgggg tccatagaaa gagggacaag tctgcacgga ccatttgctg 600
taatattaac accttggctg tcattaggta gtcttggctg ttaattatgt cctgtgataa 660
tgtattatta gcacgccgac cacatagggt agggaactgc agctagtaaa caaaagtttg 720
ttcctat 727
<210> 10
<211> 100
<212> RNA
<213> 人工序列(Artificial Sequence)
<400> 10
gaaggagcaa acugacaugg guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 11
<211> 100
<212> RNA
<213> 人工序列(Artificial Sequence)
<400> 11
ugcagugggu cuuuggggac guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 12
<211> 100
<212> RNA
<213> 人工序列(Artificial Sequence)
<400> 12
uuccaggaac auaagaaagu guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 13
<211> 100
<212> RNA
<213> 人工序列(Artificial Sequence)
<400> 13
gcagucucag caaccacuga guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 14
<211> 15219
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 14
ggcgcgccgg atggggactc atgtgaattt tctaaaggtg ctatttaaac ggggggcacg 60
agtgccggct ttggacaggg ccgctcgctc tccacccttt cttcttcccc ctcggccgcc 120
tctcaccccc tgaggcctct ctccccccac gacctcctct ctctcctctg aaaccctctc 180
ctcctcagct gcatcccacc ctcgtggcct ctctctctct ctgtctgtcc tgtgtcctct 240
ctcactgggt ttcagagcac agatgcccaa agcacaaaag cagttttccc ctggggtggg 300
aggaagcaag agactttgta cctattttgt atgtgtataa taatttgaga tgtttttaat 360
tattttgatt gctggaataa agcatgtgga aatgacccaa accaatcttg cactggcctc 420
ctgatttcct tccttggaga cggagggagg gggagacctg ggggagggcg cttggggggg 480
ggtgggctct cttctttctg cgctcccccc ccccacctcc aacaccttga cgacccctcc 540
tgcttccgct tgcctttctc aggctttaac actttctcct cgccctctca gcatgcgcat 600
gcgcgtgcct ctacctcccc cgcacatcct ggcctgccca ccctgaatgg cctggcccag 660
cgatgccacc aactctctcg ctccgtccac ggctggggag gggggcactc tgcagggttg 720
gggggcactg ggaggctggg ttgggtgagg gaggggtgcc tgggccccca ccccccagca 780
agttctctcc ctaggcgaac tggagggtcg tctggcctct tgagccttgt tgctggctct 840
gagctctacc aagagagtga ccagcaggac cgcaccatca cgcgccccag acccgggcct 900
ggggggcaag tcggggggcg gggggaggtc gggcagggtc ccctgggagg atggggacgt 960
gctgtgcccc tagcggccac cagagggcac caggacacca ctgcggtcgg ctcagcggct 1020
cctgccctgg tcagggggcg ccaggtcctg cccctcctgg ggagggcggg gggcgagaag 1080
ggcgattacg cgtggctccg gtgcccgtca gtgggcagag cgcacatcgc ccacagtccc 1140
cgagaagttg gggggagggg tcggcaattg aaccggtgcc tagagaaggt ggcgcggggt 1200
aaactgggaa agtgatgtcg tgtactggct ccgccttttt cccgagggtg ggggagaacc 1260
gtatataagt gcagtagtcg ccgtgaacgt tctttttcgc aacgggtttg ccgccagaac 1320
acaggtaagt gccgtgtgtg gttcccgcgg gcctggcctc tttacgggtt atggcccttg 1380
cgtgccttga attacttcca ctggctgcag tacgtgattc ttgatcccga gcttcgggtt 1440
ggaagtgggt gggagagttc gaggccttgc gcttaaggag ccccttcgcc tcgtgcttga 1500
gttgaggcct ggcctgggcg ctggggccgc cgcgtgcgaa tctggtggca ccttcgcgcc 1560
tgtctcgctg ctttcgataa gtctctagcc atttaaaatt tttgatgacc tgctgcgacg 1620
ctttttttct ggcaagatag tcttgtaaat gcgggccaag atctgcacac tggtatttcg 1680
gtttttgggg ccgcgggcgg cgacggggcc cgtgcgtccc agcgcacatg ttcggcgagg 1740
cggggcctgc gagcgcggcc accgagaatc ggacgggggt agtctcaagc tggccggcct 1800
gctctggtgc ctggcctcgc gccgccgtgt atcgccccgc cctgggcggc aaggctggcc 1860
cggtcggcac cagttgcgtg agcggaaaga tggccgcttc ccggccctgc tgcagggagc 1920
tcaaaatgga ggacgcggcg ctcgggagag cgggcgggtg agtcacccac acaaaggaaa 1980
agggcctttc cgtcctcagc cgtcgcttca tgtgactcca cggagtaccg ggcgccgtcc 2040
aggcacctcg attagttctc gagcttttgg agtacgtcgt ctttaggttg gggggagggg 2100
ttttatgcga tggagtttcc ccacactgag tgggtggaga ctgaagttag gccagcttgg 2160
cacttgatgt aattctcctt ggaatttgcc ctttttgagt ttggatcttg gttcattctc 2220
aagcctcaga cagtggttca aagttttttt cttccatttc aggtgtcgtg acgtacggtc 2280
gacgccacca tgtcaagctc ttcctggctc cttctcagcc ttgttgctgt aactgctgct 2340
cagtccacca ttgaggaaca ggccaagaca tttttggaca agtttaacca cgaagccgaa 2400
gacctgttct atcaaagttc acttgcttct tggaattata acaccaatat tactgaagag 2460
aatgtccaaa acatgaataa tgctggggac aaatggtctg cctttttaaa ggaacagtcc 2520
acacttgccc aaatgtatcc actacaagaa attcagaatc tcacagtcaa gcttcagctg 2580
caggctcttc agcaaaatgg gtcttcagtg ctctcagaag acaagagcaa acggttgaac 2640
acaattctaa atacaatgag caccatctac agtactggaa aagtttgtaa cccagataat 2700
ccacaagaat gcttattact tgaaccaggt ttgaatgaaa taatggcaaa cagtttagac 2760
tacaatgaga ggctctgggc ttgggaaagc tggagatctg aggtcggcaa gcagctgagg 2820
ccattatatg aagagtatgt ggtcttgaaa aatgagatgg caagagcaaa tcattatgag 2880
gactatgggg attattggag aggagactat gaagtaaatg gggtagatgg ctatgactac 2940
agccgcggcc agttgattga agatgtggaa catacctttg aagagattaa accattatat 3000
gaacatcttc atgcctatgt gagggcaaag ttgatgaatg cctatccttc ctatatcagt 3060
ccaattggat gcctccctgc tcatttgctt ggtgatatgt ggggtagatt ttggacaaat 3120
ctgtactctt tgacagttcc ctttggacag aaaccaaaca tagatgttac tgatgcaatg 3180
gtggaccagg cctgggatgc acagagaata ttcaaggagg ccgagaagtt ctttgtatct 3240
gttggtcttc ctaatatgac tcaaggattc tgggaaaatt ccatgctaac ggacccagga 3300
aatgttcaga aagcagtctg ccatcccaca gcttgggacc tggggaaggg cgacttcagg 3360
atccttatgt gcacaaaggt gacaatggac gacttcctga cagctcatca tgagatgggg 3420
catatccagt atgatatggc atatgctgca caaccttttc tgctaagaaa tggagctaat 3480
gaaggattcc atgaagctgt tggggaaatc atgtcacttt ctgcagccac acctaagcat 3540
ttaaaatcca ttggtcttct gtcacccgat tttcaagaag acaatgaaac agaaataaac 3600
ttcctgctca aacaagcact cacgattgtt gggactctgc catttactta catgttagag 3660
aagtggaggt ggatggtctt taaaggggaa attcccaaag accagtggat gaaaaagtgg 3720
tgggagatga agcgagagat agttggggtg gtggaacctg tgccccatga tgaaacatac 3780
tgtgaccccg catctctgtt ccatgtttct aatgattact cattcattcg atattacaca 3840
aggacccttt accaattcca gtttcaagaa gcactttgtc aagcagctaa acatgaaggc 3900
cctctgcaca aatgtgacat ctcaaactct acagaagctg gacagaaact gttcaatatg 3960
ctgaggcttg gaaaatcaga accctggacc ctagcattgg aaaatgttgt aggagcaaag 4020
aacatgaatg taaggccact gctcaactac tttgagccct tatttacctg gctgaaagac 4080
cagaacaaga attcttttgt gggatggagt accgactgga gtccatatgc agaccaaagc 4140
atcaaagtga ggataagcct aaaatcagct cttggagata aagcatatga atggaacgac 4200
aatgaaatgt acctgttccg atcatctgtt gcatatgcta tgaggcagta ctttttaaaa 4260
gtaaaaaatc agatgattct ttttggggag gaggatgtgc gagtggctaa tttgaaacca 4320
agaatctcct ttaatttctt tgtcactgca cctaaaaatg tgtctgatat cattcctaga 4380
actgaagttg aaaaggccat caggatgtcc cggagccgta tcaatgatgc tttccgtctg 4440
aatgacaaca gcctagagtt tctggggata cagccaacac ttggacctcc taaccagccc 4500
cctgtttcca tatggctgat tgtttttgga gttgtgatgg gagtgatagt ggttggcatt 4560
gtcatcctga tcttcactgg gatcagagat cggaagaaga aaaataaagc aagaagtgga 4620
gaaaatcctt atgcctccat cgatattagc aaaggagaaa ataatccagg attccaaaac 4680
actgatgatg ttcagacctc ctttggcagc ggcgccacaa acttctctct gctaaagcaa 4740
gcaggtgatg ttgaagaaaa ccccgggcct atgggcatcc aggcgggaga accagacccc 4800
ccagaggagc ccctcacctc gcaagcatcc gtgccccccc atcagcttcg gctaggcagc 4860
ctccatcctc acacccctta tcacatccgc gtggcatgca ccagcagcca gggcccctca 4920
tcctggaccc actggcttcc tgtggagacg ccggagggag tgcccctggg cccccctgag 4980
aacattagtg ctacgcggaa tgggagccag gccttcgtgc attggcaaga gccccgggcg 5040
cccctgcagg gtaccctgtt agggtaccgg ctggcgtatc aaggccagga caccccagag 5100
gtgctaatgg acatagggct aaggcaagag gtgaccctgg agctgcaggg ggacgggtct 5160
gtgtccaatc tgacagtgtg tgtggcagcc tacactgctg ctggggatgg accctggagc 5220
ctcccagtac ccctggaggc ctggcgccca gggcaagcac agccagtcca ccagctggtg 5280
aaggaacctt caactcctgc cttctcgtgg ccctggtggt atgtactgct aggagcagtc 5340
gtggccgctg cctgtgtcct catcttggct ctcttccttg tccaccggcg aaagaaggag 5400
acccgttatg gagaagtgtt tgaaccaaca gtggaaagag gtgaactggt agtcaggtac 5460
cgcgtgcgca agtcctacag tcgtcggacc actgaagcta ccttgaacag cctgggcatc 5520
agtgaagagc tgaaggagaa gctgcgggat gtgatggtgg accggcacaa ggtggccctg 5580
gggaagactc tgggagaggg agagtttgga gctgtgatgg aaggccagct caaccaggac 5640
gactccatcc tcaaggtggc tgtgaagacg atgaagattg ccatctgcac gaggtcagag 5700
ctggaggatt tcctgagtga agcggtctgc atgaaggaat ttgaccatcc caacgtcatg 5760
aggctcatcg gtgtctgttt ccagggttct gaacgagaga gcttcccagc acctgtggtc 5820
atcttacctt tcatgaaaca tggagaccta cacagcttcc tcctctattc ccggctcggg 5880
gaccagccag tgtacctgcc cactcagatg ctagtgaagt tcatggcaga catcgccagt 5940
ggcatggagt atctgagtac caagagattc atacaccggg acctggcggc caggaactgc 6000
atgctgaatg agaacatgtc cgtgtgtgtg gcggacttcg ggctctccaa gaagatctac 6060
aatggggact actaccgcca gggacgtatc gccaagatgc cagtcaagtg gattgccatt 6120
gagagtctag ctgaccgtgt ctacaccagc aagagcgatg tgtggtcctt cggggtgaca 6180
atgtgggaga ttgccacaag aggccaaacc ccatatccgg gcgtggagaa cagcgagatt 6240
tatgactatc tgcgccaggg aaatcgcctg aagcagcctg cggactgtct ggatggactg 6300
tatgccttga tgtcgcggtg ctgggagcta aatccccagg accggccaag ttttacagag 6360
ctgcgggaag atttggagaa cacactgaag gccttgcctc ctgcccagga gcctgacgaa 6420
atcctctatg tcaacatgga tgagggtgga ggttatcctg aaccccctgg agctgcagga 6480
ggagctgacc ccccaaccca gccagaccct aaggattcct gtagctgcct cactgcggct 6540
gaggtccatc ctgctggacg ctatgtcctc tgcccttcca caacccctag ccccgctcag 6600
cctgctgata ggggctcccc agcagcccca gggcaggagg atggtgccgg ctccggcgag 6660
ggcaggggaa gtcttctaac atgcggggac gtggaggaaa atcccggccc aatgccccct 6720
gccccgcccg gaggtgaaag cgggtgtgag gagcgcggcg cggcaggtca tattgaacat 6780
tccagatacc tatcattact cgatgctgtt gataacagca agatggcttt gaactcaggg 6840
tcaccaccag ctattggacc ttactatgaa aaccatggat accaaccgga aaacccctat 6900
cccgcacagc ccactgtggt ccccactgtc tacgaggtgc atccggctca gtactacccg 6960
tcccccgtgc cccagtacgc cccgagggtc ctgacgcagg cttccaaccc cgtcgtctgc 7020
acgcagccca aatccccatc cgggacagtg tgcacctcaa agactaagaa agcactgtgc 7080
atcaccttga ccctggggac cttcctcgtg ggagctgcgc tggccgctgg cctactctgg 7140
aagttcatgg gcagcaagtg ctccaactct gggatagagt gcgactcctc aggtacctgc 7200
atcaacccct ctaactggtg tgatggcgtg tcacactgcc ccggcgggga ggacgagaat 7260
cggtgtgttc gcctctacgg accaaacttc atccttcagg tgtactcatc tcagaggaag 7320
tcctggcacc ctgtgtgcca agacgactgg aacgagaact acgggcgggc ggcctgcagg 7380
gacatgggct ataagaataa tttttactct agccaaggaa tagtggatga cagcggatcc 7440
accagcttta tgaaactgaa cacaagtgcc ggcaatgtcg atatctataa aaaactgtac 7500
cacagtgatg cctgttcttc aaaagcagtg gtttctttac gctgtatagc ctgcggggtc 7560
aacttgaact caagccgcca gagcaggatt gtgggcggcg agagcgcgct cccgggggcc 7620
tggccctggc aggtcagcct gcacgtccag aacgtccacg tgtgcggagg ctccatcatc 7680
acccccgagt ggatcgtgac agccgcccac tgcgtggaaa aacctcttaa caatccatgg 7740
cattggacgg catttgcggg gattttgaga caatctttca tgttctatgg agccggatac 7800
caagtagaaa aagtgatttc tcatccaaat tatgactcca agaccaagaa caatgacatt 7860
gcgctgatga agctgcagaa gcctctgact ttcaacgacc tagtgaaacc agtgtgtctg 7920
cccaacccag gcatgatgct gcagccagaa cagctctgct ggatttccgg gtggggggcc 7980
accgaggaga aagggaagac ctcagaagtg ctgaacgctg ccaaggtgct tctcattgag 8040
acacagagat gcaacagcag atatgtctat gacaacctga tcacaccagc catgatctgt 8100
gccggcttcc tgcaggggaa cgtcgattct tgccagggtg acagtggagg gcctctggtc 8160
acttcgaaga acaatatctg gtggctgata ggggatacaa gctggggttc tggctgtgcc 8220
aaagcttaca gaccaggagt gtacgggaat gtgatggtat tcacggactg gatttatcga 8280
caaatgaggg cagacggcta aattatccct aatacctgcc accccactct taatcagtgg 8340
tggaagaacg gtctcagaac tgtttgtttc aattggccat ttaagtttag tagtaaaaga 8400
ctggttaatg ataacaatgc atcgtaaaac cttcagaagg aaaggagaat gttttgtgga 8460
ccactttggt tttctttttt gcgtgtggca gttttaagtt attagttttt aaaatcagta 8520
ctttttaatg gaaacaactt gaccaaaaat ttgtcacaga attttgagac ccattaaaaa 8580
agttaaatga gaaacctgtg tgttcctttg gtcaacaccg agacatttag gtgaaagaca 8640
tctaattctg gttttacgaa tctggaaact tcttgaaaat gtaattcttg agttaacact 8700
tctgggtgga gaatagggtt gttttccccc cacataattg gaaggggaag gaatatcatt 8760
taaagctatg ggagggtttc tttgattaca acactggaga gaaatgcagc atgttgctga 8820
ttgcctgtca ctaaaacagg ccaaaaactg agtccttggg ttgcatagaa agctacgcgt 8880
tctgaggcgg aaagaaccag ctgccttaat ataacttcgt ataatgtatg ctatacgaag 8940
ttattaggtc tgaagaggag tttacgtcca gccaattctg tggaatgtgt gtcagttagg 9000
gtgtggaaag tccccaggct ccccagcagg cagaagtatg caaagcatgc atctcaatta 9060
gtcagcaacc aggtgtggaa agtccccagg ctccccagca ggcagaagta tgcaaagcat 9120
gcatctcaat tagtcagcaa ccatagtccc gcccctaact ccgcccatcc cgcccctaac 9180
tccgcccagt tccgcccatt ctccgcccca tggctgacta atttttttta tttatgcaga 9240
ggccgaggcc gcctctgcct ctgagctatt ccagaagtag tgaggaggct tttttggagg 9300
cctaggcttt tgcaaaaagc tcccgggagc ttgtatatcc attttcggcg gccgcgccac 9360
catgaccgag tacaagccca cggtgcgcct cgccacccgc gacgacgtcc ccagggccgt 9420
acgcaccctc gccgccgcgt tcgccgacta ccccgccacg cgccacaccg tcgatccgga 9480
ccgccacatc gagcgggtca ccgagctgca agaactcttc ctcacgcgcg tcgggctcga 9540
catcggcaag gtgtgggtcg cggacgacgg cgccgcggtg gcggtctgga ccacgccgga 9600
gagcgtcgaa gcgggggcgg tgttcgccga gatcggcccg cgcatggccg agttgagcgg 9660
ttcccggctg gccgcgcagc aacagatgga aggcctcctg gcgccgcacc ggcccaagga 9720
gcccgcgtgg ttcctggcca ccgtcggagt ctcgcccgac caccagggca agggtctggg 9780
cagcgccgtc gtgctccccg gagtggaggc ggccgagcgc gccggggtgc ccgccttcct 9840
ggagacctcc gcgccccgca acctcccctt ctacgagcgg ctcggcttca ccgtcaccgc 9900
cgacgtcgag gtgcccgaag gaccgcgcac ctggtgcatg acccgcaagc ccggtgcctg 9960
agaattcgcg ggactctggg gttcgaaatg accgaccaag cgacgcccaa cctgccatca 10020
cgagatttcg attccaccgc cgccttctat gaaaggttgg gcttcggaat cgttttccgg 10080
gacgccggct ggatgatcct ccagcgcggg gatctcatgc tggagttctt cgcccacccc 10140
aacttgttta ttgcagctta taatggttac aaataaagca atagcatcac aaatttcaca 10200
aataaagcat ttttttcact gcattctagt tgtggtttgt ccaaactcat caatgtatct 10260
tatcatgtct gtataccgct cgactagagc ttgcggaacc cttaatataa cttcgtataa 10320
tgtatgctat acgaagttat taggtccgct ggccatctac gagccaaaga ctttcaaatc 10380
tttggctgcc ttggccagta ggaggcgaca cgaaggattt gctgctgcct tgggggatgg 10440
gaaggaacct gaaggcattt tttccagagt ggtgcagtac cactgaggac tgttgctgta 10500
ttgattagga aaagagacag agtaatttgc agtttgtttg atttatactg tggttgctga 10560
gactgcgtgg gggcccaagg agacctggag aaaggaatgc ttcctgctcc ttcttctggg 10620
gccccaggag agccttccca gggccttgga gaggtgctgt ccagggacta accctgtgct 10680
ctaggaaggc tgcaggccct gaccagctgg gcaggtcctg ggtccctcct ggccttctaa 10740
gttccccaaa catgagacct ctgggtgtgg ggtggcctgg ggaggtcatt ttgcccaggc 10800
cctacctcct gcccattcct aacccttttt aaaaatctgt gcgtcctctt cttccttctt 10860
ctccctccct tcccttttcg ctcaccctct gctgctggcc tgagagccgg aggcccccag 10920
ggggaaggcg actggtctcc tccccagtct cagggaaggg agacagagaa tccaggaagc 10980
cagaactcag cagacgaagc acccagggac ctagagatgg gttgaaaagt tgacagctgt 11040
cccacctgcc tcccaaggtc tcagggccta aacctccaag gcaggaaagg cccctgtccc 11100
tccctggggt ccatagaaag agggacaagt ctgcacggac catttgctgt aatattaaca 11160
ccttggctgt cattaggtag tcttggctgt taattatgtc ctgtgataat gtattattag 11220
cacgccgacc acatagggta gggaactgca gctagtaaac aaaagtttgt tcctatatgc 11280
ggccgccata aaagttttgt tactttatag aagaaatttt gagtttttgt tttttttaat 11340
aaataaataa acataaataa attgtttgtt gaatttatta ttagtatgta agtgtaaata 11400
taataaaact taatatctat tcaaattaat aaataaacct cgatatacag accgataaaa 11460
cacatgcgtc aattttacac atgattatct ttaacgtacg tcacaatatg attatctttc 11520
tagggttaat ctagctgcgt gttctgcagc gtgtcgagca tcttcatctg ctccatcacg 11580
ctgtaaaaca catttgcacc gcgagtctgc ccgtcctcca cgggttcaaa aacgtgaatg 11640
aacgaggcgc gctcactggc cgtcgtttta caacgtcgtg actgggaaaa ccctggcgtt 11700
acccaactta atcgccttgc agcacatccc cctttcgcca gctggcgtaa tagcgaagag 11760
gcccgcaccg atcgcccttc ccaacagttg cgcagcctga atggcgaatg ggacgcgccc 11820
tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt 11880
gccagcgccc tagcgcccgc tcctttcgct ttcttccctt cctttctcgc cacgttcgcc 11940
ggctttcccc gtcaagctct aaatcggggg ctccctttag ggttccgatt tagtgcttta 12000
cggcacctcg accccaaaaa acttgattag ggtgatggtt cacgtagtgg gccatcgccc 12060
tgatagacgg tttttcgccc tttgacgttg gagtccacgt tctttaatag tggactcttg 12120
ttccaaactg gaacaacact caaccctatc tcggtctatt cttttgattt ataagggatt 12180
ttgccgattt cggcctattg gttaaaaaat gagctgattt aacaaaaatt taacgcgaat 12240
tttaacaaaa tattaacgct tacaatttag gtggcacttt tcggggaaat gtgcgcggaa 12300
cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac 12360
cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg 12420
tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc 12480
tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg 12540
atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga 12600
gcacttttaa agttctgcta tgtggcgcgg tattatcccg tattgacgcc gggcaagagc 12660
aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag 12720
aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga 12780
gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg 12840
cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga 12900
atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgtagcaatg gcaacaacgt 12960
tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact 13020
ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt 13080
ttattgctga taaatctgga gccggtgagc gtggttcacg cggtatcatt gcagcactgg 13140
ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta 13200
tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac 13260
tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta 13320
aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt 13380
tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt 13440
tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt 13500
gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc 13560
agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg 13620
tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg 13680
ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt 13740
cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac 13800
tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg 13860
acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg 13920
gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat 13980
ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt 14040
tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg 14100
attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa 14160
cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcccaata cgcaaaccgc 14220
ctctccccgc gcgttggccg attcattaat gcagctggca cgacaggttt cccgactgga 14280
aagcgggcag tgagcgcaac gcaattaatg tgagttagct cactcattag gcaccccagg 14340
ctttacactt tatgcttccg gctcgtatgt tgtgtggaat tgtgagcgga taacaatttc 14400
acacaggaaa cagctatgac catgattacg ccaagcgcgc ccgccgggta actcacgggg 14460
tatccatgtc catttctgcg gcatccagcc aggatacccg tcctcgctga cgtaatatcc 14520
cagcgccgca ccgctgtcat taatctgcac accggcacgg cagttccggc tgtcgccggt 14580
attgttcggg ttgctgatgc gcttcgggct gaccatccgg aactgtgtcc ggaaaagccg 14640
cgacgaactg gtatcccagg tggcctgaac gaacagttca ccgttaaagg cgtgcatggc 14700
cacaccttcc cgaatcatca tggtaaacgt gcgttttcgc tcaacgtcaa tgcagcagca 14760
gtcatcctcg gcaaactctt tccatgccgc ttcaacctcg cgggaaaagg cacgggcttc 14820
ttcctccccg atgcccagat agcgccagct tgggcgatga ctgagccgga aaaaagaccc 14880
gacgatatga tcctgatgca gctagattaa ccctagaaag atagtctgcg taaaattgac 14940
gcatgcattc ttgaaatatt gctctctctt tctaaatagc gcgaatccgt cgctgtgcat 15000
ttaggacatc tcagtcgccg cttggagctc ccgtgaggcg tgcttgtcaa tgcggtaagt 15060
gtcactgatt ttgaactata acgaccgcgt gagtcaaaat gacgcatgat tatcttttac 15120
gtgactttta agatttaact catacgataa ttatattgtt atttcatgtt ctacttacgt 15180
gataacttat tatatatata ttttcttgtt atagatatc 15219
<210> 15
<211> 805
<212> PRT
<213> Homo sapiens
<400> 15
Met Ser Ser Ser Ser Trp Leu Leu Leu Ser Leu Val Ala Val Thr Ala
1 5 10 15
Ala Gln Ser Thr Ile Glu Glu Gln Ala Lys Thr Phe Leu Asp Lys Phe
20 25 30
Asn His Glu Ala Glu Asp Leu Phe Tyr Gln Ser Ser Leu Ala Ser Trp
35 40 45
Asn Tyr Asn Thr Asn Ile Thr Glu Glu Asn Val Gln Asn Met Asn Asn
50 55 60
Ala Gly Asp Lys Trp Ser Ala Phe Leu Lys Glu Gln Ser Thr Leu Ala
65 70 75 80
Gln Met Tyr Pro Leu Gln Glu Ile Gln Asn Leu Thr Val Lys Leu Gln
85 90 95
Leu Gln Ala Leu Gln Gln Asn Gly Ser Ser Val Leu Ser Glu Asp Lys
100 105 110
Ser Lys Arg Leu Asn Thr Ile Leu Asn Thr Met Ser Thr Ile Tyr Ser
115 120 125
Thr Gly Lys Val Cys Asn Pro Asp Asn Pro Gln Glu Cys Leu Leu Leu
130 135 140
Glu Pro Gly Leu Asn Glu Ile Met Ala Asn Ser Leu Asp Tyr Asn Glu
145 150 155 160
Arg Leu Trp Ala Trp Glu Ser Trp Arg Ser Glu Val Gly Lys Gln Leu
165 170 175
Arg Pro Leu Tyr Glu Glu Tyr Val Val Leu Lys Asn Glu Met Ala Arg
180 185 190
Ala Asn His Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr Glu
195 200 205
Val Asn Gly Val Asp Gly Tyr Asp Tyr Ser Arg Gly Gln Leu Ile Glu
210 215 220
Asp Val Glu His Thr Phe Glu Glu Ile Lys Pro Leu Tyr Glu His Leu
225 230 235 240
His Ala Tyr Val Arg Ala Lys Leu Met Asn Ala Tyr Pro Ser Tyr Ile
245 250 255
Ser Pro Ile Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp Gly
260 265 270
Arg Phe Trp Thr Asn Leu Tyr Ser Leu Thr Val Pro Phe Gly Gln Lys
275 280 285
Pro Asn Ile Asp Val Thr Asp Ala Met Val Asp Gln Ala Trp Asp Ala
290 295 300
Gln Arg Ile Phe Lys Glu Ala Glu Lys Phe Phe Val Ser Val Gly Leu
305 310 315 320
Pro Asn Met Thr Gln Gly Phe Trp Glu Asn Ser Met Leu Thr Asp Pro
325 330 335
Gly Asn Val Gln Lys Ala Val Cys His Pro Thr Ala Trp Asp Leu Gly
340 345 350
Lys Gly Asp Phe Arg Ile Leu Met Cys Thr Lys Val Thr Met Asp Asp
355 360 365
Phe Leu Thr Ala His His Glu Met Gly His Ile Gln Tyr Asp Met Ala
370 375 380
Tyr Ala Ala Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly Phe
385 390 395 400
His Glu Ala Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro Lys
405 410 415
His Leu Lys Ser Ile Gly Leu Leu Ser Pro Asp Phe Gln Glu Asp Asn
420 425 430
Glu Thr Glu Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val Gly
435 440 445
Thr Leu Pro Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val Phe
450 455 460
Lys Gly Glu Ile Pro Lys Asp Gln Trp Met Lys Lys Trp Trp Glu Met
465 470 475 480
Lys Arg Glu Ile Val Gly Val Val Glu Pro Val Pro His Asp Glu Thr
485 490 495
Tyr Cys Asp Pro Ala Ser Leu Phe His Val Ser Asn Asp Tyr Ser Phe
500 505 510
Ile Arg Tyr Tyr Thr Arg Thr Leu Tyr Gln Phe Gln Phe Gln Glu Ala
515 520 525
Leu Cys Gln Ala Ala Lys His Glu Gly Pro Leu His Lys Cys Asp Ile
530 535 540
Ser Asn Ser Thr Glu Ala Gly Gln Lys Leu Phe Asn Met Leu Arg Leu
545 550 555 560
Gly Lys Ser Glu Pro Trp Thr Leu Ala Leu Glu Asn Val Val Gly Ala
565 570 575
Lys Asn Met Asn Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu Phe
580 585 590
Thr Trp Leu Lys Asp Gln Asn Lys Asn Ser Phe Val Gly Trp Ser Thr
595 600 605
Asp Trp Ser Pro Tyr Ala Asp Gln Ser Ile Lys Val Arg Ile Ser Leu
610 615 620
Lys Ser Ala Leu Gly Asp Lys Ala Tyr Glu Trp Asn Asp Asn Glu Met
625 630 635 640
Tyr Leu Phe Arg Ser Ser Val Ala Tyr Ala Met Arg Gln Tyr Phe Leu
645 650 655
Lys Val Lys Asn Gln Met Ile Leu Phe Gly Glu Glu Asp Val Arg Val
660 665 670
Ala Asn Leu Lys Pro Arg Ile Ser Phe Asn Phe Phe Val Thr Ala Pro
675 680 685
Lys Asn Val Ser Asp Ile Ile Pro Arg Thr Glu Val Glu Lys Ala Ile
690 695 700
Arg Met Ser Arg Ser Arg Ile Asn Asp Ala Phe Arg Leu Asn Asp Asn
705 710 715 720
Ser Leu Glu Phe Leu Gly Ile Gln Pro Thr Leu Gly Pro Pro Asn Gln
725 730 735
Pro Pro Val Ser Ile Trp Leu Ile Val Phe Gly Val Val Met Gly Val
740 745 750
Ile Val Val Gly Ile Val Ile Leu Ile Phe Thr Gly Ile Arg Asp Arg
755 760 765
Lys Lys Lys Asn Lys Ala Arg Ser Gly Glu Asn Pro Tyr Ala Ser Ile
770 775 780
Asp Ile Ser Lys Gly Glu Asn Asn Pro Gly Phe Gln Asn Thr Asp Asp
785 790 795 800
Val Gln Thr Ser Phe
805
<210> 16
<211> 626
<212> PRT
<213> Homo sapiens
<400> 16
Met Gly Ile Gln Ala Gly Glu Pro Asp Pro Pro Glu Glu Pro Leu Thr
1 5 10 15
Ser Gln Ala Ser Val Pro Pro His Gln Leu Arg Leu Gly Ser Leu His
20 25 30
Pro His Thr Pro Tyr His Ile Arg Val Ala Cys Thr Ser Ser Gln Gly
35 40 45
Pro Ser Ser Trp Thr His Trp Leu Pro Val Glu Thr Pro Glu Gly Val
50 55 60
Pro Leu Gly Pro Pro Glu Asn Ile Ser Ala Thr Arg Asn Gly Ser Gln
65 70 75 80
Ala Phe Val His Trp Gln Glu Pro Arg Ala Pro Leu Gln Gly Thr Leu
85 90 95
Leu Gly Tyr Arg Leu Ala Tyr Gln Gly Gln Asp Thr Pro Glu Val Leu
100 105 110
Met Asp Ile Gly Leu Arg Gln Glu Val Thr Leu Glu Leu Gln Gly Asp
115 120 125
Gly Ser Val Ser Asn Leu Thr Val Cys Val Ala Ala Tyr Thr Ala Ala
130 135 140
Gly Asp Gly Pro Trp Ser Leu Pro Val Pro Leu Glu Ala Trp Arg Pro
145 150 155 160
Gly Gln Ala Gln Pro Val His Gln Leu Val Lys Glu Pro Ser Thr Pro
165 170 175
Ala Phe Ser Trp Pro Trp Trp Tyr Val Leu Leu Gly Ala Val Val Ala
180 185 190
Ala Ala Cys Val Leu Ile Leu Ala Leu Phe Leu Val His Arg Arg Lys
195 200 205
Lys Glu Thr Arg Tyr Gly Glu Val Phe Glu Pro Thr Val Glu Arg Gly
210 215 220
Glu Leu Val Val Arg Tyr Arg Val Arg Lys Ser Tyr Ser Arg Arg Thr
225 230 235 240
Thr Glu Ala Thr Leu Asn Ser Leu Gly Ile Ser Glu Glu Leu Lys Glu
245 250 255
Lys Leu Arg Asp Val Met Val Asp Arg His Lys Val Ala Leu Gly Lys
260 265 270
Thr Leu Gly Glu Gly Glu Phe Gly Ala Val Met Glu Gly Gln Leu Asn
275 280 285
Gln Asp Asp Ser Ile Leu Lys Val Ala Val Lys Thr Met Lys Ile Ala
290 295 300
Ile Cys Thr Arg Ser Glu Leu Glu Asp Phe Leu Ser Glu Ala Val Cys
305 310 315 320
Met Lys Glu Phe Asp His Pro Asn Val Met Arg Leu Ile Gly Val Cys
325 330 335
Phe Gln Gly Ser Glu Arg Glu Ser Phe Pro Ala Pro Val Val Ile Leu
340 345 350
Pro Phe Met Lys His Gly Asp Leu His Ser Phe Leu Leu Tyr Ser Arg
355 360 365
Leu Gly Asp Gln Pro Val Tyr Leu Pro Thr Gln Met Leu Val Lys Phe
370 375 380
Met Ala Asp Ile Ala Ser Gly Met Glu Tyr Leu Ser Thr Lys Arg Phe
385 390 395 400
Ile His Arg Asp Leu Ala Ala Arg Asn Cys Met Leu Asn Glu Asn Met
405 410 415
Ser Val Cys Val Ala Asp Phe Gly Leu Ser Lys Lys Ile Tyr Asn Gly
420 425 430
Asp Tyr Tyr Arg Gln Gly Arg Ile Ala Lys Met Pro Val Lys Trp Ile
435 440 445
Ala Ile Glu Ser Leu Ala Asp Arg Val Tyr Thr Ser Lys Ser Asp Val
450 455 460
Trp Ser Phe Gly Val Thr Met Trp Glu Ile Ala Thr Arg Gly Gln Thr
465 470 475 480
Pro Tyr Pro Gly Val Glu Asn Ser Glu Ile Tyr Asp Tyr Leu Arg Gln
485 490 495
Gly Asn Arg Leu Lys Gln Pro Ala Asp Cys Leu Asp Gly Leu Tyr Ala
500 505 510
Leu Met Ser Arg Cys Trp Glu Leu Asn Pro Gln Asp Arg Pro Ser Phe
515 520 525
Thr Glu Leu Arg Glu Asp Leu Glu Asn Thr Leu Lys Ala Leu Pro Pro
530 535 540
Ala Gln Glu Pro Asp Glu Ile Leu Tyr Val Asn Met Asp Glu Gly Gly
545 550 555 560
Gly Tyr Pro Glu Pro Pro Gly Ala Ala Gly Gly Ala Asp Pro Pro Thr
565 570 575
Gln Pro Asp Pro Lys Asp Ser Cys Ser Cys Leu Thr Ala Ala Glu Val
580 585 590
His Pro Ala Gly Arg Tyr Val Leu Cys Pro Ser Thr Thr Pro Ser Pro
595 600 605
Ala Gln Pro Ala Asp Arg Gly Ser Pro Ala Ala Pro Gly Gln Glu Asp
610 615 620
Gly Ala
625
<210> 17
<211> 529
<212> PRT
<213> Homo sapiens
<400> 17
Met Pro Pro Ala Pro Pro Gly Gly Glu Ser Gly Cys Glu Glu Arg Gly
1 5 10 15
Ala Ala Gly His Ile Glu His Ser Arg Tyr Leu Ser Leu Leu Asp Ala
20 25 30
Val Asp Asn Ser Lys Met Ala Leu Asn Ser Gly Ser Pro Pro Ala Ile
35 40 45
Gly Pro Tyr Tyr Glu Asn His Gly Tyr Gln Pro Glu Asn Pro Tyr Pro
50 55 60
Ala Gln Pro Thr Val Val Pro Thr Val Tyr Glu Val His Pro Ala Gln
65 70 75 80
Tyr Tyr Pro Ser Pro Val Pro Gln Tyr Ala Pro Arg Val Leu Thr Gln
85 90 95
Ala Ser Asn Pro Val Val Cys Thr Gln Pro Lys Ser Pro Ser Gly Thr
100 105 110
Val Cys Thr Ser Lys Thr Lys Lys Ala Leu Cys Ile Thr Leu Thr Leu
115 120 125
Gly Thr Phe Leu Val Gly Ala Ala Leu Ala Ala Gly Leu Leu Trp Lys
130 135 140
Phe Met Gly Ser Lys Cys Ser Asn Ser Gly Ile Glu Cys Asp Ser Ser
145 150 155 160
Gly Thr Cys Ile Asn Pro Ser Asn Trp Cys Asp Gly Val Ser His Cys
165 170 175
Pro Gly Gly Glu Asp Glu Asn Arg Cys Val Arg Leu Tyr Gly Pro Asn
180 185 190
Phe Ile Leu Gln Val Tyr Ser Ser Gln Arg Lys Ser Trp His Pro Val
195 200 205
Cys Gln Asp Asp Trp Asn Glu Asn Tyr Gly Arg Ala Ala Cys Arg Asp
210 215 220
Met Gly Tyr Lys Asn Asn Phe Tyr Ser Ser Gln Gly Ile Val Asp Asp
225 230 235 240
Ser Gly Ser Thr Ser Phe Met Lys Leu Asn Thr Ser Ala Gly Asn Val
245 250 255
Asp Ile Tyr Lys Lys Leu Tyr His Ser Asp Ala Cys Ser Ser Lys Ala
260 265 270
Val Val Ser Leu Arg Cys Ile Ala Cys Gly Val Asn Leu Asn Ser Ser
275 280 285
Arg Gln Ser Arg Ile Val Gly Gly Glu Ser Ala Leu Pro Gly Ala Trp
290 295 300
Pro Trp Gln Val Ser Leu His Val Gln Asn Val His Val Cys Gly Gly
305 310 315 320
Ser Ile Ile Thr Pro Glu Trp Ile Val Thr Ala Ala His Cys Val Glu
325 330 335
Lys Pro Leu Asn Asn Pro Trp His Trp Thr Ala Phe Ala Gly Ile Leu
340 345 350
Arg Gln Ser Phe Met Phe Tyr Gly Ala Gly Tyr Gln Val Glu Lys Val
355 360 365
Ile Ser His Pro Asn Tyr Asp Ser Lys Thr Lys Asn Asn Asp Ile Ala
370 375 380
Leu Met Lys Leu Gln Lys Pro Leu Thr Phe Asn Asp Leu Val Lys Pro
385 390 395 400
Val Cys Leu Pro Asn Pro Gly Met Met Leu Gln Pro Glu Gln Leu Cys
405 410 415
Trp Ile Ser Gly Trp Gly Ala Thr Glu Glu Lys Gly Lys Thr Ser Glu
420 425 430
Val Leu Asn Ala Ala Lys Val Leu Leu Ile Glu Thr Gln Arg Cys Asn
435 440 445
Ser Arg Tyr Val Tyr Asp Asn Leu Ile Thr Pro Ala Met Ile Cys Ala
450 455 460
Gly Phe Leu Gln Gly Asn Val Asp Ser Cys Gln Gly Asp Ser Gly Gly
465 470 475 480
Pro Leu Val Thr Ser Lys Asn Asn Ile Trp Trp Leu Ile Gly Asp Thr
485 490 495
Ser Trp Gly Ser Gly Cys Ala Lys Ala Tyr Arg Pro Gly Val Tyr Gly
500 505 510
Asn Val Met Val Phe Thr Asp Trp Ile Tyr Arg Gln Met Arg Ala Asp
515 520 525
Gly
<210> 18
<211> 1101
<212> DNA
<213> Sus scrofa
<400> 18
aataaatgca ctgttgggcc tatgctcaag atgggtagtg ttaattggtg gtggaactta 60
tctgatttca tgacttgctg gctacctaaa acaggtgagg agaaagccaa tgggactggg 120
actggatgag caagtacaac aaacaaaatg ggcttaaagt atgagtgaga gttatctgac 180
cgtaaggatg caagtgaggg ggcctaaggt ttggagatta atatttaatc tcagatgcta 240
tactttggtg gtgtagcaaa agtctacaaa tgggatgact gtaaaactca gtagatccgt 300
gctttttaac ctatctccct tcatcaggaa attgcgacac aaagatcttt agtaataaca 360
cgcagtctca atgcataaaa tcaggcttag gtgttgcctg gactcatttc ccatctccac 420
cccactataa ttattttgtg acacaaactc aagactgtgg gaatatagag aaattgggct 480
cgtcctcgta cacctgctca atcccctgca ggacaacgcc caagaatcag gttaagccag 540
ggcaaaagaa tcccgcccat aatcgagaag gagcaaactg acatggaggc gatgacgaga 600
tcgcggggga gggagggatt tttctaggcc cagggcggtc cttaggaaaa ggaggcagca 660
gagaactccc ataaaggtat tgcggcactc ccctccccct gcggagaagg gtgcggcctt 720
ctctccgcct cctccactgc agctccctca ggattgcagc tcgcgcgggt ttttggagaa 780
catgcgcctc ccacccacaa gccagcagga ccgacccccc actccttcct ccacccccca 840
cccccacggg tccgagagca ggtagagagc tagtctcgtc cttcaggcgg cggacgccca 900
gggcggagcc gcagtcacca ccacccagaa gcctcggccc ggcagcccgc ccccgcctcc 960
tgcgcgcgct tcctgccacg ttgcgcaggg gcgaggggcc agacactgcg gcgctggcct 1020
cggggagggc cgtaccaaag accgcctccc tgccgactcg cgtagtggtt tcgctcattt 1080
gggacccaag ccaataacaa g 1101
<210> 19
<211> 1056
<212> DNA
<213> Sus scrofa
<400> 19
tgctctctct cctgccccct tcacctgcgt gccctcctca ttctccctct gtgccacctc 60
tggccttgca ctgtaggctc tctcttgggg atgtttctct ttctccacac acttctcttt 120
cactctgtcc tcttgctttg tgtgggcctg cagcgttacc cttttttctg ggcacactca 180
gagcaccctc ctctttctgg ttctgggcca cctgtctgtc ctcgggtcat cttgctctct 240
ctgcctggat gccctcctgt ggctttgggc agcttctccc tccttcagag tgcaccgcca 300
gttctcctag gcccggtcac ttccccttcc caggggacct agagccctgc taggtcctct 360
ctctccacaa cctgggcccc caaacctttc caaaacacct tgctttctgc ctccattggt 420
cttgtgttcc agagccagag tcactatatg tcccagaacc aggattccct ctggttctga 480
gggcttttat cgcatcccct gcctggctgc agtgggtctt tggggacagg ccacagaaga 540
gcctctactc ctccctctgt ccccgaggct gtctccctcc cagtcttccc agctcaggcc 600
agtccccagg cctctcttcc ctgccagagc ccgtcaggtt cggttacttt ggggcccaga 660
gaggaccctg tgaaggaagc gtgggtaggg gcacgggaat ggggaggatg cctgaagagg 720
cccccttagc cagaagagga gcagaagagg agcaggtacc cagaagagga gcagttcagg 780
gaaatagaag agtcccgagc tctttttttt tttttttttt atttcttttc ttttcttttc 840
tttttatggc agcatccgtg gtatatggag gttcccagcc taggggtcag atcatacctg 900
caactgccag cctacaccac agccacagca ctcaggatcc gagctgcatc tgcggcttac 960
gccacaggtc acagcaacgc tggatcctta acccactgaa tgaggccagg gattgaacct 1020
gcaacctcat gcacactatg ctggggtctt aatcgg 1056
<210> 20
<211> 1108
<212> DNA
<213> Sus scrofa
<400> 20
acttcctcct gcccttaccc tttatctggc tcttagctcc taaaaactgc attattagct 60
tcctcttttg cctctactct tactcaacca aaattgtttt aagatctgtg gatctagctt 120
ctgctgtgct attcttagga acacttttat ttcctcttag ctccatctca ccagttattg 180
gctaatggct ttgcttggta cctacatctg tacatttctt tcgtactagc ttctagactg 240
aaaaaggact gttggttcaa catgaaaggg aaggaggtaa aagaggacac acaggaaaga 300
tggattggga ttcaggtctc tgctgttgtt acttgagatt gctttctaga ttctacttgt 360
ggaaacaaaa agcctttgcg agaattctaa actggagtat ttctgtaatt gaggagtctt 420
gctcagcaaa tcccacttag gggactaatg aagtaccagg aagagacaga ccatgctcaa 480
tccacaaagc caggttttac tgaaatgtga cctactttct tatgttcctg gaagtttaga 540
tcagggtggg cagctctggg ttttataggc tacactgtta acactcaggc tgttttctac 600
cgtttagtca aaatatagtc accttgcctg cttcacctgt ccatcagaga atggcctcat 660
taattgactc tctagtatga agtcaaagta gctttggtgg ccctaaatgg acaagtatca 720
agagactggg tgaattgagg agcttgagac tgtcacctca gatcgaaaag actgaaaaat 780
cacctcagat caaaaagact gaaaaatctt cagtctggaa aggggactca aaaccataat 840
tagagtattc tggtagaatc cttttctcca ctgttattca tacagttaag gtgaataact 900
aaaagtaatt gtgagctgag gagtaagata caacacacaa ggaatcagtt aacagagtct 960
cgagtgaaat tataaatgga aagaattatg acttgaatca taactctgag gccccatttt 1020
ccctaacaac ttttgtccca ataaacgtgg gtatttgttt gggagaaact atcatataca 1080
tgattaccca gtaaacagac tgtttact 1108
<210> 21
<211> 1089
<212> DNA
<213> Sus scrofa
<400> 21
actttgtacc tattttgtat gtgtataata atttgagatg tttttaatta ttttgattgc 60
tggaataaag catgtggaaa tgacccaaac caatcttgca ctggcctcct gatttccttc 120
cttggagacg gagggagggg gagacctggg ggagggcgct tggggggggg tgggctctct 180
tctttctgcg ctcccccccc ccacctccaa caccttgacg acccctcctg cttccgcttg 240
cctttctcag gctttaacac tttctcctcg ccctctcagc atgcgcatgc gcgtgcctct 300
acctcccccg cacatcctgg cctgcccacc ctgaatgtcc tggcccagcg atgccaccaa 360
ctctctcgct ccgtccacgg ctggggaggg gggcactctg cagggttggg gggcactggg 420
aggctgggtt gggtgaggga ggggtgcctg ggcccccacc ccccagcaag ttctctccct 480
aggcgaactg gagggtcgtc tggcctcttg agccttgttg ctggctctga gctctaccaa 540
gagagtgacc agcaggaccg caccatcagt ggttgctgag actgcgtggg ggcccaagga 600
gacctggaga aaggaatgct tcctgctcct tcttctgggg ccccaggaga gccttcccag 660
ggccttggag agttgctgtc cagggactaa ccctgtgctc taggaaggct gcaggccctg 720
accagctggg caggtcctgg gtccctcctg gccttctaag ttccccaaac atgagacctc 780
tgggtgtggg gtggcctggg gaggtcattt tgcccaggcc ctacctcctg cccattccta 840
acccttttta aaaatctgtg cgtcctcttc ttccttcttc tccctccctt cccttttcgc 900
tcaccctctg ctgctggcct gagagccgga ggcccccagg gggaaggcga ctggtctcct 960
ccccagtctc agggaaggga gacagagaat ccaggaagcc agaactcagc agacgaagca 1020
cccagggacc tagagatggg ttgaaaagtt gacagctgtc ccacctgcct cccaaggtct 1080
cagggccta 1089
<210> 22
<211> 7922
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 22
ggatcccctg agggggcccc catgggctag aggatccggc ctcggcctct gcataaataa 60
aaaaaattag tcagccatga gcttggccca ttgcatacgt tgtatccata tcataatatg 120
tacatttata ttggctcatg tccaacatta ccgccatgtt gacattgatt attgactagt 180
tattaatagt aatcaattac ggggtcatta gttcatagcc catatatgga gttccgcgtt 240
acataactta cggtaaatgg cccgcctggc tgaccgccca acgacccccg cccattgacg 300
tcaataatga cgtatgttcc catagtaacg ccaataggga ctttccattg acgtcaatgg 360
gtggagtatt tacggtaaac tgcccacttg gcagtacatc aagtgtatca tatgccaagt 420
acgcccccta ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc ccagtacatg 480
accttatggg actttcctac ttggcagtac atctacgtat tagtcatcgc tattaccatg 540
gtgatgcggt tttggcagta catcaatggg cgtggatagc ggtttgactc acggggattt 600
ccaagtctcc accccattga cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac 660
tttccaaaat gtcgtaacaa ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg 720
tgggaggtct atataagcag agctcgttta gtgaaccgtc agatcgcctg gagacgccat 780
ccacgctgtt ttgacctcca tagaagacac cgggaccgat ccagcctccc ctcgaagctt 840
acatgtggta ccgagctcgg atcctgagaa cttcagggtg agtctatggg acccttgatg 900
ttttctttcc ccttcttttc tatggttaag ttcatgtcat aggaagggga gaagtaacag 960
ggtacacata ttgaccaaat cagggtaatt ttgcatttgt aattttaaaa aatgctttct 1020
tcttttaata tacttttttg tttatcttat ttctaatact ttccctaatc tctttctttc 1080
agggcaataa tgatacaatg tatcatgcct ctttgcacca ttctaaagaa taacagtgat 1140
aatttctggg ttaaggcaat agcaatattt ctgcatataa atatttctgc atataaattg 1200
taactgatgt aagaggtttc atattgctaa tagcagctac aatccagcta ccattctgct 1260
tttattttat ggttgggata aggctggatt attctgagtc caagctaggc ccttttgcta 1320
atcatgttca tacctcttat cttcctccca cagctcctgg gcaacgtgct ggtctgtgtg 1380
ctggcccatc actttggcaa agcacgtgag atctatgttt gtttttcttg ttttattgcc 1440
actagtctct agtcagtgtg ttaatcttac aaccagaact caattacccc ctgcatacac 1500
taattctttc acacgtggtg tttattaccc tgacaaagtt ttcagatcct cagttttaca 1560
ttcaactcag gacttgttct tacctttctt ttccaatgtt acttggttcc atgctataca 1620
tgtctctggg accaatggta ctaagaggtt tgataaccct gtcctaccat ttaatgatgg 1680
tgtttatttt gcttccactg agaagtctaa cataataaga ggctggattt ttggtactac 1740
tttagattcg aagacccagt ccctacttat tgttaataac gctactaatg ttgttattaa 1800
agtctgtgaa tttcaatttt gtaatgatcc atttttgggt gtttattacc acaaaaacaa 1860
caaaagttgg atggaaagtg agttcagagt ttattctagt gcgaataatt gcacttttga 1920
atatgtctct cagccttttc ttatggacct tgaaggaaaa cagggtaatt tcaaaaatct 1980
tagggaattt gtgtttaaga atattgatgg ttattttaaa atatattcta agcacacgcc 2040
tattaattta gtgcgtgatc tccctcaggg tttttcggct ttagaaccat tggtagattt 2100
gccaataggt attaacatca ctaggtttca aactttactt gctttacata gaagttattt 2160
gactcctggt gattcttctt caggttggac agctggtgct gcagcttatt atgtgggtta 2220
tcttcaacct aggacttttc tattaaaata taatgaaaat ggaaccatta cagatgctgt 2280
agactgtgca cttgaccctc tctcagaaac aaagtgtacg ttgaaatcct tcactgtaga 2340
aaaaggaatc tatcaaactt ctaactttag agtccaacca acagaatcta ttgttagatt 2400
tcctaatatt acaaacttgt gcccttttgg tgaagttttt aacgccacca gatttgcatc 2460
tgtttatgct tggaacagga agagaatcag caactgtgtt gctgattatt ctgtcctata 2520
taattccgca tcattttcca cttttaagtg ttatggagtg tctcctacta aattaaatga 2580
tctctgcttt actaatgtct atgcagattc atttgtaatt agaggtgatg aagtcagaca 2640
aatcgctcca gggcaaactg gaaagattgc tgattataat tataaattac cagatgattt 2700
tacaggctgc gttatagctt ggaattctaa caatcttgat tctaaggttg gtggtaatta 2760
taattacctg tatagattgt ttaggaagtc taatctcaaa ccttttgaga gagatatttc 2820
aactgaaatc tatcaggccg gtagcacacc ttgtaatggt gttgaaggtt ttaattgtta 2880
ctttccttta caatcatatg gtttccaacc cactaatggt gttggttacc aaccatacag 2940
agtagtagta ctttcttttg aacttctaca tgcaccagca actgtttgtg gacctaaaaa 3000
gtctactaat ttggttaaaa acaaatgtgt caatttcaac ttcaatggtt taacaggcac 3060
aggtgttctt actgagtcta acaaaaagtt tctgcctttc caacaatttg gcagagacat 3120
tgctgacact actgatgctg tccgtgatcc acagacactt gagattcttg acattacacc 3180
atgttctttt ggtggtgtca gtgttataac accaggaaca aatacttcta accaggttgc 3240
tgttctttat caggatgtta actgcacaga agtccctgtt gctattcatg cagatcaact 3300
tactcctact tggcgtgttt attctacagg ttctaatgtt tttcaaacac gtgcaggctg 3360
tttaataggg gctgaacatg tcaacaactc atatgagtgt gacataccca ttggtgcagg 3420
tatatgcgct agttatcaga ctcagactaa ttctcctcgg cgggcacgta gtgtagctag 3480
tcaatccatc attgcctaca ctatgtcact tggtgcagaa aattcagttg cttactctaa 3540
taactctatt gccataccca caaattttac tattagtgtt accacagaaa ttctaccagt 3600
gtctatgacc aagacatcag tagattgtac aatgtacatt tgtggtgatt caactgaatg 3660
cagcaatctt ttgttgcaat atggcagttt ttgtacacaa ttaaaccgtg ctttaactgg 3720
aatagctgtt gaacaagaca aaaacaccca agaagttttt gcacaagtca aacaaattta 3780
caaaacacca ccaattaaag attttggtgg ttttaatttt tcacaaatat taccagatcc 3840
atcaaaacca agcaagaggt catttattga agatctactt ttcaacaaag tgacacttgc 3900
agatgctggc ttcatcaaac aatatggtga ttgccttggt gatattgctg ctagagacct 3960
catttgtgca caaaagttta acggccttac tgttttgcca cctttgctca cagatgaaat 4020
gattgctcaa tacacttctg cactgttagc gggtacaatc acttctggtt ggacctttgg 4080
tgcaggtgct gcattacaaa taccatttgc tatgcaaatg gcttataggt ttaatggtat 4140
tggagttaca cagaatgttc tctatgagaa ccaaaaattg attgccaacc aatttaatag 4200
tgctattggc aaaattcaag actcactttc ttccacagca agtgcacttg gaaaacttca 4260
agatgtggtc aaccaaaatg cacaagcttt aaacacgctt gttaaacaac ttagctccaa 4320
ttttggtgca atttcaagtg ttttaaatga tatcctttca cgtcttgaca aagttgaggc 4380
tgaagtgcaa attgataggt tgatcacagg cagacttcaa agtttgcaga catatgtgac 4440
tcaacaatta attagagctg cagaaatcag agcttctgct aatcttgctg ctactaaaat 4500
gtcagagtgt gtacttggac aatcaaaaag agttgatttt tgtggaaagg gctatcatct 4560
tatgtccttc cctcagtcag cacctcatgg tgtagtcttc ttgcatgtga cttatgtccc 4620
tgcacaagaa aagaacttca caactgctcc tgccatttgt catgatggaa aagcacactt 4680
tcctcgtgaa ggtgtctttg tttcaaatgg cacacactgg tttgtaacac aaaggaattt 4740
ttatgaacca caaatcatta ctacagacaa cacatttgtg tctggtaact gtgatgttgt 4800
aataggaatt gtcaacaaca cagtttatga tcctttgcaa cctgaattag actcattcaa 4860
ggaggagtta gataaatatt ttaagaatca tacatcacca gatgttgatt taggtgacat 4920
ctctggcatt aatgcttcag ttgtaaacat tcaaaaagaa attgaccgcc tcaatgaggt 4980
tgccaagaat ttaaatgaat ctctcatcga tctccaagaa cttggaaagt atgagcagta 5040
tataaaatgg ccatggtaca tttggctagg ttttatagct ggcttgattg ccatagtaat 5100
ggtgacaatt atgctttgct gtatgaccag ttgctgtagt tgtctcaagg gctgttgttc 5160
ttgtggatcc tgctgcaaat ttgattaaac cccaccagtg caggctgcct atcagaaagt 5220
ggtggctggt gtggctaatg ccctggccca caagtatcac taagctcgct ttcttgctgt 5280
ccaatttcta ttaaaggttc ctttgttccc taagtccaac tactaaactg ggggatatta 5340
tgaagggcct tgagcatctg gattctgcct aataaaaaac atttattttc attgcaatga 5400
tgtatttaaa ttatttctga atattttact aaaaagggaa tgtgggaggt cagtgcattt 5460
aaaacataaa gaaatgaaga gctagttcaa accttgggaa aatacactat atcttaaact 5520
ccatgaaaga aggtgaggct gcaaacagct aatgcacatt ggcaacagcc cctgatgcct 5580
atgccttatt catccctcag aaaaggattc aagtagaggc ttgatttgga ggttaaagtt 5640
ttgctatgct gtattttaca ttacttattg ttttagctgt cctcatgaat gtcttttcac 5700
tacccatttg cttatcctgc atctctcagc cttgactcca ctcagttctc ttgcttagag 5760
ataccacctt tcccctgaag tgttccttcc atgttttacg gcgagatggt ttctcctcgc 5820
ctggccactc agccttagtt gtctctgttg tcttatagag gtctacttga agaaggaaaa 5880
acagggggca tggtttgact gtcctgtgag cccttcttcc ctgcctcccc cactcacagt 5940
gacccggaat ccctcgacat ggcagtctag cactagtgcg gccgcagatc tgcttcctcg 6000
ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag 6060
gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa 6120
ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc 6180
cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca 6240
ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg 6300
accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct 6360
catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt 6420
gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag 6480
tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc 6540
agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac 6600
actagaagaa cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga 6660
gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc 6720
aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg 6780
gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gagattatca 6840
aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc aatctaaagt 6900
atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc acctatctca 6960
gcgatctgtc tatttcgttc atccatagtt gcctgactcc ccgtcgtgta gataactacg 7020
atacgggagg gcttaccatc tggccccagt gctgcaatga taccgcgaga cccacgctca 7080
ccggctccag atttatcagc aataaaccag ccagccggaa gggccgagcg cagaagtggt 7140
cctgcaactt tatccgcctc catccagtct attaattgtt gccgggaagc tagagtaagt 7200
agttcgccag ttaatagttt gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca 7260
cgctcgtcgt ttggtatggc ttcattcagc tccggttccc aacgatcaag gcgagttaca 7320
tgatccccca tgttgtgcaa aaaagcggtt agctccttcg gtcctccgat cgttgtcaga 7380
agtaagttgg ccgcagtgtt atcactcatg gttatggcag cactgcataa ttctcttact 7440
gtcatgccat ccgtaagatg cttttctgtg actggtgagt actcaaccaa gtcattctga 7500
gaatagtgta tgcggcgacc gagttgctct tgcccggcgt caatacggga taataccgcg 7560
ccacatagca gaactttaaa agtgctcatc attggaaaac gttcttcggg gcgaaaactc 7620
tcaaggatct taccgctgtt gagatccagt tcgatgtaac ccactcgtgc acccaactga 7680
tcttcagcat cttttacttt caccagcgtt tctgggtgag caaaaacagg aaggcaaaat 7740
gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa tactcatact cttccttttt 7800
caatattatt gaagcattta tcagggttat tgtctcatga gcggatacat atttgaatgt 7860
atttagaaaa ataaacaaat aggggttccg cgcacatttc cccgaaaagt gccacctgac 7920
gt 7922
<210> 23
<211> 9144
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 23
gtcgacggat cgggagatct cccgatcccc tatggtgcac tctcagtaca atctgctctg 60
atgccgcata gttaagccag tatctgctcc ctgcttgtgt gttggaggtc gctgagtagt 120
gcgcgagcaa aatttaagct acaacaaggc aaggcttgac cgacaattgc atgaagaatc 180
tgcttagggt taggcgtttt gcgctgcttc gcgatgtacg ggccagatat acgcgttgac 240
attgattatt gactagttat taatagtaat caattacggg gtcattagtt catagcccat 300
atatggagtt ccgcgttaca taacttacgg taaatggccc gcctggctga ccgcccaacg 360
acccccgccc attgacgtca ataatgacgt atgttcccat agtaacgcca atagggactt 420
tccattgacg tcaatgggtg gagtatttac ggtaaactgc ccacttggca gtacatcaag 480
tgtatcatat gccaagtacg ccccctattg acgtcaatga cggtaaatgg cccgcctggc 540
attatgccca gtacatgacc ttatgggact ttcctacttg gcagtacatc tacgtattag 600
tcatcgctat taccatggtg atgcggtttt ggcagtacat caatgggcgt ggatagcggt 660
ttgactcacg gggatttcca agtctccacc ccattgacgt caatgggagt ttgttttggc 720
accaaaatca acgggacttt ccaaaatgtc gtaacaactc cgccccattg acgcaaatgg 780
gcggtaggcg tgtacggtgg gaggtctata taagcagcgc gttttgcctg tactgggtct 840
ctctggttag accagatctg agcctgggag ctctctggct aactagggaa cccactgctt 900
aagcctcaat aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac 960
tctggtaact agagatccct cagacccttt tagtcagtgt ggaaaatctc tagcagtggc 1020
gcccgaacag ggacttgaaa gcgaaaggga aaccagagga gctctctcga cgcaggactc 1080
ggcttgctga agcgcgcacg gcaagaggcg aggggcggcg actggtgagt acgccaaaaa 1140
ttttgactag cggaggctag aaggagagag atgggtgcga gagcgtcagt attaagcggg 1200
ggagaattag atcgcgatgg gaaaaaattc ggttaaggcc agggggaaag aaaaaatata 1260
aattaaaaca tatagtatgg gcaagcaggg agctagaacg attcgcagtt aatcctggcc 1320
tgttagaaac atcagaaggc tgtagacaaa tactgggaca gctacaacca tcccttcaga 1380
caggatcaga agaacttaga tcattatata atacagtagc aaccctctat tgtgtgcatc 1440
aaaggataga gataaaagac accaaggaag ctttagacaa gatagaggaa gagcaaaaca 1500
aaagtaagac caccgcacag caagcggccg ctgatcttca gacctggagg aggagatatg 1560
agggacaatt ggagaagtga attatataaa tataaagtag taaaaattga accattagga 1620
gtagcaccca ccaaggcaaa gagaagagtg gtgcagagag aaaaaagagc agtgggaata 1680
ggagctttgt tccttgggtt cttgggagca gcaggaagca ctatgggcgc agcgtcaatg 1740
acgctgacgg tacaggccag acaattattg tctggtatag tgcagcagca gaacaatttg 1800
ctgagggcta ttgaggcgca acagcatctg ttgcaactca cagtctgggg catcaagcag 1860
ctccaggcaa gaatcctggc tgtggaaaga tacctaaagg atcaacagct cctggggatt 1920
tggggttgct ctggaaaact catttgcacc actgctgtgc cttggaatgc tagttggagt 1980
aataaatctc tggaacagat ttggaatcac acgacctgga tggagtggga cagagaaatt 2040
aacaattaca caagcttaat acactcctta attgaagaat cgcaaaacca gcaagaaaag 2100
aatgaacaag aattattgga attagataaa tgggcaagtt tgtggaattg gtttaacata 2160
acaaattggc tgtggtatat aaaattattc ataatgatag taggaggctt ggtaggttta 2220
agaatagttt ttgctgtact ttctatagtg aatagagtta ggcagggata ttcaccatta 2280
tcgtttcaga cccacctccc aaccccgagg ggacccgaca ggcccgaagg aatagaagaa 2340
gaaggtggag agagagacag agacagatcc attcgattag tgaacggatc ggcactgcgt 2400
gcgccaattc tgcagacaaa tggcagtatt catccacaat tttaaaagaa aaggggggat 2460
tggggggtac agtgcagggg aaagaatagt agacataata gcaacagaca tacaaactaa 2520
agaattacaa aaacaaatta caaaaattca aaattttcgg gtttattaca gggacagcag 2580
agatccagtt tggttaatta agggcagagc gcacatcgcc cacagtcccc gagaagttgg 2640
ggggaggggt cggcaattga tccggtgcct agagaaggtg gcgcggggta aactgggaaa 2700
gtgatgtcgt gtactggctc cgcctttttc ccgagggtgg gggagaaccg tatataagtg 2760
cagtagtcgc cgtgaacgtt ctttttcgca acgggtttgc cgccagaaca caggaccggt 2820
tctagagcgc tctcgaggcc accatggtga gcaagggcga ggaggataac atggccatca 2880
tcaaggagtt catgcgcttc aaggtgcaca tggagggctc cgtgaacggc cacgagttcg 2940
agatcgaggg cgagggcgag ggccgcccct acgagggcac ccagaccgcc aagctgaagg 3000
tgaccaaggg tggccccctg cccttcgcct gggacatcct gtcccctcag ttcatgtacg 3060
gctccaaggc ctacgtgaag caccccgccg acatccccga ctacttgaag ctgtccttcc 3120
ccgagggctt caagtgggag cgcgtgatga acttcgagga cggcggcgtg gtgaccgtga 3180
cccaggactc ctccctgcag gacggcgagt tcatctacaa ggtgaagctg cgcggcacca 3240
acttcccctc cgacggcccc gtaatgcaga agaagaccat gggctgggag gcctcctccg 3300
agcggatgta ccccgaggac ggcgccctga agggcgagat caagcagagg ctgaagctga 3360
aggacggcgg ccactacgac gctgaggtca agaccaccta caaggccaag aagcccgtgc 3420
agctgcccgg cgcctacaac gtcaacatca agttggacat cacctcccac aacgaggact 3480
acaccatcgt ggaacagtac gaacgcgccg agggccgcca ctccaccggc ggcatggacg 3540
agctgtacaa gggtaccgga tccggcgcaa caaacttctc tctgctgaaa caagccggag 3600
atgtcgaaga gaatcctgga ccgaccgagt acaagcccac ggtgcgcctc gccacccgcg 3660
acgacgtccc cagggccgta cgcaccctcg ccgccgcgtt cgccgactac cccgccacgc 3720
gccacaccgt cgatccggac cgccacatcg agcgggtcac cgagctgcaa gaactcttcc 3780
tcacgcgcgt cgggctcgac atcggcaagg tgtgggtcgc ggacgacggc gccgcggtgg 3840
cggtctggac cacgccggag agcgtcgaag cgggggcggt gttcgccgag atcggcccgc 3900
gcatggccga gttgagcggt tcccggctgg ccgcgcagca acagatggaa ggcctcctgg 3960
cgccgcaccg gcccaaggag cccgcgtggt tcctggccac cgtcggagtc tcgcccgacc 4020
accagggcaa gggtctgggc agcgccgtcg tgctccccgg agtggaggcg gccgagcgcg 4080
ccggggtgcc cgccttcctg gagacctccg cgccccgcaa cctccccttc tacgagcggc 4140
tcggcttcac cgtcaccgcc gacgtcgagg tgcccgaagg accgcgcacc tggtgcatga 4200
cccgcaagcc cggtgcctga acgcgttaag tcgacaatca acctctggat tacaaaattt 4260
gtgaaagatt gactggtatt cttaactatg ttgctccttt tacgctatgt ggatacgctg 4320
ctttaatgcc tttgtatcat gctattgctt cccgtatggc tttcattttc tcctccttgt 4380
ataaatcctg gttgctgtct ctttatgagg agttgtggcc cgttgtcagg caacgtggcg 4440
tggtgtgcac tgtgtttgct gacgcaaccc ccactggttg gggcattgcc accacctgtc 4500
agctcctttc cgggactttc gctttccccc tccctattgc cacggcggaa ctcatcgccg 4560
cctgccttgc ccgctgctgg acaggggctc ggctgttggg cactgacaat tccgtggtgt 4620
tgtcggggaa atcatcgtcc tttccttggc tgctcgcctg tgttgccacc tggattctgc 4680
gcgggacgtc cttctgctac gtcccttcgg ccctcaatcc agcggacctt ccttcccgcg 4740
gcctgctgcc ggctctgcgg cctcttccgc gtcttcgcct tcgccctcag acgagtcgga 4800
tctccctttg ggccgcctcc ccgcgtcgac tttaagacca atgacttaca aggcagctgt 4860
agatcttagc cactttttaa aagaaaaggg gggactggaa gggctaattc actcccaacg 4920
aagacaagat ctgctttttg cttgtactgg gtctctctgg ttagaccaga tctgagcctg 4980
ggagctctct ggctaactag ggaacccact gcttaagcct caataaagct tgccttgagt 5040
gcttcaagta gtgtgtgccc gtctgttgtg tgactctggt aactagagat ccctcagacc 5100
cttttagtca gtgtggaaaa tctctagcag ggcccgttta aacccgctga tcagcctcga 5160
ctgtgccttc tagttgccag ccatctgttg tttgcccctc ccccgtgcct tccttgaccc 5220
tggaaggtgc cactcccact gtcctttcct aataaaatga ggaaattgca tcgcattgtc 5280
tgagtaggtg tcattctatt ctggggggtg gggtggggca ggacagcaag ggggaggatt 5340
gggaagacaa tagcaggcat gctggggatg cggtgggctc tatggcttct gaggcggaaa 5400
gaaccagctg gggctctagg gggtatcccc acgcgccctg tagcggcgca ttaagcgcgg 5460
cgggtgtggt ggttacgcgc agcgtgaccg ctacacttgc cagcgcccta gcgcccgctc 5520
ctttcgcttt cttcccttcc tttctcgcca cgttcgccgg ctttccccgt caagctctaa 5580
atcgggggct ccctttaggg ttccgattta gtgctttacg gcacctcgac cccaaaaaac 5640
ttgattaggg tgatggttca cgtagtgggc catcgccctg atagacggtt tttcgccctt 5700
tgacgttgga gtccacgttc tttaatagtg gactcttgtt ccaaactgga acaacactca 5760
accctatctc ggtctattct tttgatttat aagggatttt gccgatttcg gcctattggt 5820
taaaaaatga gctgatttaa caaaaattta acgcgaatta attctgtgga atgtgtgtca 5880
gttagggtgt ggaaagtccc caggctcccc agcaggcaga agtatgcaaa gcatgcatct 5940
caattagtca gcaaccaggt gtggaaagtc cccaggctcc ccagcaggca gaagtatgca 6000
aagcatgcat ctcaattagt cagcaaccat agtcccgccc ctaactccgc ccatcccgcc 6060
cctaactccg cccagttccg cccattctcc gccccatggc tgactaattt tttttattta 6120
tgcagaggcc gaggccgcct ctgcctctga gctattccag aagtagtgag gaggcttttt 6180
tggaggccta ggcttttgca aaaagctccc gggagcttgt atatccattt tcggatctga 6240
tcagcacgtg ttgacaatta atcatcggca tagtatatcg gcatagtata atacgacaag 6300
gtgaggaact aaaccatggc caagttgacc agtgccgttc cggtgctcac cgcgcgcgac 6360
gtcgccggag cggtcgagtt ctggaccgac cggctcgggt tctcccggga cttcgtggag 6420
gacgacttcg ccggtgtggt ccgggacgac gtgaccctgt tcatcagcgc ggtccaggac 6480
caggtggtgc cggacaacac cctggcctgg gtgtgggtgc gcggcctgga cgagctgtac 6540
gccgagtggt cggaggtcgt gtccacgaac ttccgggacg cctccgggcc ggccatgacc 6600
gagatcggcg agcagccgtg ggggcgggag ttcgccctgc gcgacccggc cggcaactgc 6660
gtgcacttcg tggccgagga gcaggactga cacgtgctac gagatttcga ttccaccgcc 6720
gccttctatg aaaggttggg cttcggaatc gttttccggg acgccggctg gatgatcctc 6780
cagcgcgggg atctcatgct ggagttcttc gcccacccca acttgtttat tgcagcttat 6840
aatggttaca aataaagcaa tagcatcaca aatttcacaa ataaagcatt tttttcactg 6900
cattctagtt gtggtttgtc caaactcatc aatgtatctt atcatgtctg tataccgtcg 6960
acctctagct agagcttggc gtaatcatgg tcatagctgt ttcctgtgtg aaattgttat 7020
ccgctcacaa ttccacacaa catacgagcc ggaagcataa agtgtaaagc ctggggtgcc 7080
taatgagtga gctaactcac attaattgcg ttgcgctcac tgcccgcttt ccagtcggga 7140
aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg cggggagagg cggtttgcgt 7200
attgggcgct cttccgcttc ctcgctcact gactcgctgc gctcggtcgt tcggctgcgg 7260
cgagcggtat cagctcactc aaaggcggta atacggttat ccacagaatc aggggataac 7320
gcaggaaaga acatgtgagc aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg 7380
ttgctggcgt ttttccatag gctccgcccc cctgacgagc atcacaaaaa tcgacgctca 7440
agtcagaggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc ccctggaagc 7500
tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg gatacctgtc cgcctttctc 7560
ccttcgggaa gcgtggcgct ttctcatagc tcacgctgta ggtatctcag ttcggtgtag 7620
gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc 7680
ttatccggta actatcgtct tgagtccaac ccggtaagac acgacttatc gccactggca 7740
gcagccactg gtaacaggat tagcagagcg aggtatgtag gcggtgctac agagttcttg 7800
aagtggtggc ctaactacgg ctacactaga agaacagtat ttggtatctg cgctctgctg 7860
aagccagtta ccttcggaaa aagagttggt agctcttgat ccggcaaaca aaccaccgct 7920
ggtagcggtg gtttttttgt ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa 7980
gaagatcctt tgatcttttc tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa 8040
gggattttgg tcatgagatt atcaaaaagg atcttcacct agatcctttt aaattaaaaa 8100
tgaagtttta aatcaatcta aagtatatat gagtaaactt ggtctgacag ttaccaatgc 8160
ttaatcagtg aggcacctat ctcagcgatc tgtctatttc gttcatccat agttgcctga 8220
ctccccgtcg tgtagataac tacgatacgg gagggcttac catctggccc cagtgctgca 8280
atgataccgc gagacccacg ctcaccggct ccagatttat cagcaataaa ccagccagcc 8340
ggaagggccg agcgcagaag tggtcctgca actttatccg cctccatcca gtctattaat 8400
tgttgccggg aagctagagt aagtagttcg ccagttaata gtttgcgcaa cgttgttgcc 8460
attgctacag gcatcgtggt gtcacgctcg tcgtttggta tggcttcatt cagctccggt 8520
tcccaacgat caaggcgagt tacatgatcc cccatgttgt gcaaaaaagc ggttagctcc 8580
ttcggtcctc cgatcgttgt cagaagtaag ttggccgcag tgttatcact catggttatg 8640
gcagcactgc ataattctct tactgtcatg ccatccgtaa gatgcttttc tgtgactggt 8700
gagtactcaa ccaagtcatt ctgagaatag tgtatgcggc gaccgagttg ctcttgcccg 8760
gcgtcaatac gggataatac cgcgccacat agcagaactt taaaagtgct catcattgga 8820
aaacgttctt cggggcgaaa actctcaagg atcttaccgc tgttgagatc cagttcgatg 8880
taacccactc gtgcacccaa ctgatcttca gcatctttta ctttcaccag cgtttctggg 8940
tgagcaaaaa caggaaggca aaatgccgca aaaaagggaa taagggcgac acggaaatgt 9000
tgaatactca tactcttcct ttttcaatat tattgaagca tttatcaggg ttattgtctc 9060
atgagcggat acatatttga atgtatttag aaaaataaac aaataggggt tccgcgcaca 9120
tttccccgaa aagtgccacc tgac 9144
Claims (16)
1.一种制备重组细胞的方法,包括如下步骤:将命名为DNA分子甲的DNA分子整合至猪细胞的基因组DNA,得到重组细胞;所述DNA分子甲表达人源ACE2蛋白、人源AXL蛋白和人源TMPRSS2蛋白。
2.如权利要求1所述的方法,其特征在于:所述DNA分子甲中,人ACE2基因、人AXL基因和人TMPRSS2基因存在于同一编码框,它们之间由2A肽的编码基因间隔。
3.如权利要求1或2所述的方法,其特征在于:所述DNA分子甲整合至猪细胞的基因组DNA的COL1A1基因位点中。
4.如权利要求1至3中任一所述的方法,其特征在于:所述“将命名为DNA分子甲的DNA分子整合至猪细胞的基因组DNA”的实现方式为:将命名为DNA分子乙的DNA分子导入猪细胞或者将具有所述DNA分子乙的重组质粒导入猪细胞;所述DNA分子乙中,具有所述DNA分子甲且在所述DNA分子甲的上游具有上游同源臂且在所述DNA分子甲的下游具有下游同源臂,所述上游同源臂和所述下游同源臂用于将所述DNA分子甲整合至猪细胞的基因组DNA。
5.如权利要求4所述的方法,其特征在于:所述方法中,具有所述DNA分子乙的重组质粒与两个辅助质粒共同导入猪细胞;所述两个辅助质粒为sgRNA质粒和Cas9质粒;
sgRNA质粒转录得到特异sgRNA;所述特异sgRNA为sgRNACOL1A1-g3;sgRNACOL1A1-g3的靶序列结合区如SEQ ID NO:13中第1-20位核苷酸所示;
Cas9质粒表达Cas9蛋白。
6.一种试剂盒,包括权利要求4中所述的DNA分子乙;所述试剂盒的用途为如下(a)或(b)或(c):(a)制备重组细胞;(b)制备SARS-CoV-2易感模型猪;(c)制备SARS-CoV-2易感猪细胞模型或SARS-CoV-2易感猪组织模型或SARS-CoV-2易感猪器官模型。
7.一种试剂盒,包括具有权利要求4中所述的DNA分子乙的重组质粒;所述试剂盒的用途为如下(a)或(b)或(c):(a)制备重组细胞;(b)制备SARS-CoV-2易感模型猪;(c)制备SARS-CoV-2易感猪细胞模型或SARS-CoV-2易感猪组织模型或SARS-CoV-2易感猪器官模型。
8.如权利要求6或7所述的试剂盒,其特征在于:所述试剂盒还包括sgRNA质粒和Cas9质粒;sgRNA质粒为权利要求5中所述的sgRNA质粒;Cas9质粒为权利要求5中所述的Cas9质粒。
9.权利要求4中所述的DNA分子乙在制备试剂盒中的应用;所述试剂盒的用途为如下(a)或(b)或(c):(a)制备重组细胞;(b)制备SARS-CoV-2易感模型猪;(c)制备SARS-CoV-2易感猪细胞模型或SARS-CoV-2易感猪组织模型或SARS-CoV-2易感猪器官模型。
10.具有权利要求4中所述的DNA分子乙的重组质粒在制备试剂盒中的应用;所述试剂盒的用途为如下(a)或(b)或(c):(a)制备重组细胞;(b)制备SARS-CoV-2易感模型猪;(c)制备SARS-CoV-2易感猪细胞模型或SARS-CoV-2易感猪组织模型或SARS-CoV-2易感猪器官模型。
11.具有权利要求4中所述的DNA分子乙的重组质粒、sgRNA质粒和Cas9质粒在制备试剂盒中的应用;sgRNA质粒为权利要求5中所述的sgRNA质粒;Cas9质粒为权利要求5中所述的Cas9质粒;所述试剂盒的用途为如下(a)或(b)或(c):(a)制备重组细胞;(b)制备SARS-CoV-2易感模型猪;(c)制备SARS-CoV-2易感猪细胞模型或SARS-CoV-2易感猪组织模型或SARS-CoV-2易感猪器官模型。
12.权利要求4中所述的DNA分子乙、具有权利要求4中所述的DNA分子乙的重组质粒、权利要求6所述的试剂盒、权利要求7所述的试剂盒或权利要求8所述的试剂盒的应用,为如下(a)或(b)或(c):(a)制备重组细胞;(b)制备SARS-CoV-2易感模型猪;(c)制备SARS-CoV-2易感猪细胞模型或SARS-CoV-2易感猪组织模型或SARS-CoV-2易感猪器官模型。
13.权利要求1至5中任一所述方法制备得到的重组细胞。
14.权利要求13所述重组细胞在制备SARS-CoV-2易感模型猪中的应用。
15.用权利要求13所述重组细胞制备的SARS-CoV-2易感模型猪的猪细胞或猪组织或猪器官。
16.权利要求13所述重组细胞、权利要求15所述的猪细胞、权利要求15所述的猪组织、权利要求15所述的猪器官或者用权利要求13所述重组细胞制备的SARS-CoV-2易感模型猪的应用,为如下(d1)或(d2)或(d3):
(d1)筛选治疗新冠肺炎的药物和/或疫苗和/或抗体;
(d2)进行新冠肺炎的药物和/或疫苗和/或抗体的效果评价;
(d3)研究新冠肺炎的发病机制。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210121045.9A CN115247187A (zh) | 2022-02-09 | 2022-02-09 | 表达三种人源基因的SARS-CoV-2易感模型猪的构建方法及其应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210121045.9A CN115247187A (zh) | 2022-02-09 | 2022-02-09 | 表达三种人源基因的SARS-CoV-2易感模型猪的构建方法及其应用 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115247187A true CN115247187A (zh) | 2022-10-28 |
Family
ID=83697931
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210121045.9A Pending CN115247187A (zh) | 2022-02-09 | 2022-02-09 | 表达三种人源基因的SARS-CoV-2易感模型猪的构建方法及其应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115247187A (zh) |
-
2022
- 2022-02-09 CN CN202210121045.9A patent/CN115247187A/zh active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR102451510B1 (ko) | Pd-1 호밍 엔도뉴클레아제 변이체, 조성물 및 사용 방법 | |
AU2020289750B2 (en) | Engineered meganucleases with recognition sequences found in the human T cell receptor alpha constant region gene | |
AU2018229561B2 (en) | Recombinant adenoviruses and use thereof | |
KR101471445B1 (ko) | 조류에서 이식유전자 발현 | |
AU2021200863A1 (en) | Genetically-modified cells comprising a modified human t cell receptor alpha constant region gene | |
KR101953237B1 (ko) | 신규 dna 결합 단백질 및 이의 용도 | |
KR20210149060A (ko) | Tn7-유사 트랜스포존을 사용한 rna-유도된 dna 통합 | |
CN108396027A (zh) | CRISPR-Cas9靶向敲除人肠癌细胞DEAF1基因及其特异性的sgRNA | |
CA2473187C (en) | Mismatch endonucleases and methods of use | |
CN110467679B (zh) | 一种融合蛋白、碱基编辑工具和方法及其应用 | |
US20040003420A1 (en) | Modified recombinase | |
CN101939434A (zh) | 用于在大豆中提高种子贮藏油脂的生成和改变脂肪酸谱的来自解脂耶氏酵母的dgat基因 | |
BRPI0806354A2 (pt) | plantas oleaginosas transgências, sementes, óleos, produtos alimentìcios ou análogos a alimento, produtos alimentìcios medicinais ou análogos alimentìcios medicinais, produtos farmacêuticos, bebidas fórmulas para bebês, suplementos nutricionais, rações para animais domésticos, alimentos para aquacultura, rações animais, produtos de sementes inteiras, produtos de óleos misturados, produtos, subprodutos e subprodutos parcialmente processados | |
KR20210151916A (ko) | 뒤시엔느 근육 이영양증의 치료를 위한 aav 벡터-매개된 큰 돌연변이 핫스팟의 결실 | |
CN116083398B (zh) | 分离的Cas13蛋白及其应用 | |
US20040142433A1 (en) | Polynucleotide sequence variants | |
CN101652475A (zh) | 在禽类中进行转基因表达 | |
CN114222763A (zh) | 在嵌合抗原受体设计中实现的超模块化IgG3间隔区结构域和多功能位点 | |
KR20140043890A (ko) | 조절된 유전자 발현 시스템 및 그의 작제물 | |
KR102341583B1 (ko) | 스플릿 인테인을 접목한 가용성 향상 이중 기능성 융합 태그를 이용한 재조합 섬유아세포 성장인자 수용체의 제조방법, 정제방법, 및 이의 용도 | |
CN112852884B (zh) | 一种环形rna敲低载体快速构建试剂盒及其应用 | |
CN115247187A (zh) | 表达三种人源基因的SARS-CoV-2易感模型猪的构建方法及其应用 | |
CN114644581B (zh) | 含芳基硫酚或芳基硒酚经修饰的氨基酸、重组蛋白及其生物合成方法及应用 | |
US11814412B2 (en) | Artificial proteins and compositions and methods thereof | |
EP1395612A2 (en) | Modified recombinase |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |