CN112680444B - 用于oca2基因突变的crispr系统及其在构建白化病克隆猪核供体细胞中的应用 - Google Patents
用于oca2基因突变的crispr系统及其在构建白化病克隆猪核供体细胞中的应用 Download PDFInfo
- Publication number
- CN112680444B CN112680444B CN202011489465.XA CN202011489465A CN112680444B CN 112680444 B CN112680444 B CN 112680444B CN 202011489465 A CN202011489465 A CN 202011489465A CN 112680444 B CN112680444 B CN 112680444B
- Authority
- CN
- China
- Prior art keywords
- oca2
- grna
- pkg
- u6grna
- seq
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 108091033409 CRISPR Proteins 0.000 title claims abstract description 56
- 238000010354 CRISPR gene editing Methods 0.000 title claims abstract description 15
- 101150068614 OCA2 gene Proteins 0.000 title abstract description 36
- 238000010276 construction Methods 0.000 title abstract description 21
- 206010064571 Gene mutation Diseases 0.000 title abstract description 7
- 108020005004 Guide RNA Proteins 0.000 claims abstract description 58
- 239000013598 vector Substances 0.000 claims abstract description 56
- 239000013604 expression vector Substances 0.000 claims abstract description 25
- 238000010362 genome editing Methods 0.000 claims abstract description 21
- 101100411304 Sus scrofa Oca2 gene Proteins 0.000 claims abstract description 13
- 239000013612 plasmid Substances 0.000 claims description 140
- 108020004414 DNA Proteins 0.000 claims description 86
- 102000053602 DNA Human genes 0.000 claims description 31
- 108020004682 Single-Stranded DNA Proteins 0.000 claims description 5
- 108091008146 restriction endonucleases Proteins 0.000 claims description 5
- 238000000137 annealing Methods 0.000 claims description 3
- 210000004027 cell Anatomy 0.000 description 51
- 210000002950 fibroblast Anatomy 0.000 description 47
- 238000012163 sequencing technique Methods 0.000 description 33
- 108090000623 proteins and genes Proteins 0.000 description 25
- 239000002773 nucleotide Substances 0.000 description 23
- 125000003729 nucleotide group Chemical group 0.000 description 23
- 241000282898 Sus scrofa Species 0.000 description 21
- 108010077850 Nuclear Localization Signals Proteins 0.000 description 19
- 239000000047 product Substances 0.000 description 17
- 238000013461 design Methods 0.000 description 16
- 239000012634 fragment Substances 0.000 description 16
- 238000000034 method Methods 0.000 description 16
- 230000035772 mutation Effects 0.000 description 14
- 101001131829 Homo sapiens P protein Proteins 0.000 description 13
- 102100034574 P protein Human genes 0.000 description 13
- 238000010586 diagram Methods 0.000 description 12
- 206010001557 Albinism Diseases 0.000 description 11
- 239000002609 medium Substances 0.000 description 11
- 238000012216 screening Methods 0.000 description 10
- 238000012408 PCR amplification Methods 0.000 description 9
- 238000004458 analytical method Methods 0.000 description 9
- 238000006243 chemical reaction Methods 0.000 description 9
- 238000003209 gene knockout Methods 0.000 description 9
- 102000004169 proteins and genes Human genes 0.000 description 9
- 241001465754 Metazoa Species 0.000 description 8
- 241000282887 Suidae Species 0.000 description 8
- 238000001962 electrophoresis Methods 0.000 description 8
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 7
- 238000010367 cloning Methods 0.000 description 7
- 102000004142 Trypsin Human genes 0.000 description 6
- 108090000631 Trypsin Proteins 0.000 description 6
- 238000003776 cleavage reaction Methods 0.000 description 6
- 238000003780 insertion Methods 0.000 description 6
- 230000037431 insertion Effects 0.000 description 6
- 239000006166 lysate Substances 0.000 description 6
- 230000006798 recombination Effects 0.000 description 6
- 238000005215 recombination Methods 0.000 description 6
- 210000003491 skin Anatomy 0.000 description 6
- 239000000243 solution Substances 0.000 description 6
- 210000001519 tissue Anatomy 0.000 description 6
- 238000001890 transfection Methods 0.000 description 6
- 239000012588 trypsin Substances 0.000 description 6
- 102000004190 Enzymes Human genes 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- 241000282414 Homo sapiens Species 0.000 description 5
- 238000012761 co-transfection Methods 0.000 description 5
- 239000003623 enhancer Substances 0.000 description 5
- 229940088598 enzyme Drugs 0.000 description 5
- 210000002780 melanosome Anatomy 0.000 description 5
- 238000002360 preparation method Methods 0.000 description 5
- 108020004635 Complementary DNA Proteins 0.000 description 4
- 238000000246 agarose gel electrophoresis Methods 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 230000003321 amplification Effects 0.000 description 4
- 239000000872 buffer Substances 0.000 description 4
- 238000010804 cDNA synthesis Methods 0.000 description 4
- 239000006285 cell suspension Substances 0.000 description 4
- 239000002299 complementary DNA Substances 0.000 description 4
- 201000010099 disease Diseases 0.000 description 4
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 230000001965 increasing effect Effects 0.000 description 4
- 230000008099 melanin synthesis Effects 0.000 description 4
- 238000002156 mixing Methods 0.000 description 4
- 238000003199 nucleic acid amplification method Methods 0.000 description 4
- 230000007017 scission Effects 0.000 description 4
- 229960005322 streptomycin Drugs 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- 108700028369 Alleles Proteins 0.000 description 3
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 3
- 101100118093 Drosophila melanogaster eEF1alpha2 gene Proteins 0.000 description 3
- 101000834253 Gallus gallus Actin, cytoplasmic 1 Proteins 0.000 description 3
- 241000288906 Primates Species 0.000 description 3
- 239000011543 agarose gel Substances 0.000 description 3
- 108010047495 alanylglycine Proteins 0.000 description 3
- 150000001413 amino acids Chemical group 0.000 description 3
- 239000003242 anti bacterial agent Substances 0.000 description 3
- 229940088710 antibiotic agent Drugs 0.000 description 3
- 239000007795 chemical reaction product Substances 0.000 description 3
- 238000012258 culturing Methods 0.000 description 3
- 239000012154 double-distilled water Substances 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 108010048367 enhanced green fluorescent protein Proteins 0.000 description 3
- 238000001976 enzyme digestion Methods 0.000 description 3
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 3
- 239000011259 mixed solution Substances 0.000 description 3
- 229910052754 neon Inorganic materials 0.000 description 3
- GKAOGPIIYCISHV-UHFFFAOYSA-N neon atom Chemical compound [Ne] GKAOGPIIYCISHV-UHFFFAOYSA-N 0.000 description 3
- 201000007909 oculocutaneous albinism Diseases 0.000 description 3
- 239000000049 pigment Substances 0.000 description 3
- 229920001184 polypeptide Polymers 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 108090000765 processed proteins & peptides Proteins 0.000 description 3
- 102000004196 processed proteins & peptides Human genes 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 2
- 239000007995 HEPES buffer Substances 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- 208000026350 Inborn Genetic disease Diseases 0.000 description 2
- XUMBMVFBXHLACL-UHFFFAOYSA-N Melanin Chemical compound O=C1C(=O)C(C2=CNC3=C(C(C(=O)C4=C32)=O)C)=C2C4=CNC2=C1C XUMBMVFBXHLACL-UHFFFAOYSA-N 0.000 description 2
- LLKWSEXLNFBKIF-CYDGBPFRSA-N Met-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCSC LLKWSEXLNFBKIF-CYDGBPFRSA-N 0.000 description 2
- 241000699670 Mus sp. Species 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 208000009869 Neu-Laxova syndrome Diseases 0.000 description 2
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 2
- 241000700159 Rattus Species 0.000 description 2
- 108091093126 WHP Posttrascriptional Response Element Proteins 0.000 description 2
- 230000005856 abnormality Effects 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 238000010171 animal model Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 230000008436 biogenesis Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 101150038500 cas9 gene Proteins 0.000 description 2
- 230000007547 defect Effects 0.000 description 2
- 230000007812 deficiency Effects 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 239000012091 fetal bovine serum Substances 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 208000016361 genetic disease Diseases 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000030648 nucleus localization Effects 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 230000007170 pathology Effects 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 230000035479 physiological effects, processes and functions Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N puromycin Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 description 2
- 101150013400 rag1 gene Proteins 0.000 description 2
- 238000010374 somatic cell nuclear transfer Methods 0.000 description 2
- 208000024891 symptom Diseases 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 230000009885 systemic effect Effects 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 230000014616 translation Effects 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- YEJQWBFDKKTPNO-UHFFFAOYSA-N 2-[[2-[[1-(2-amino-3-methylbutanoyl)pyrrolidine-2-carbonyl]amino]acetyl]amino]-3-methylbutanoic acid Chemical compound CC(C)C(N)C(=O)N1CCCC1C(=O)NCC(=O)NC(C(C)C)C(O)=O YEJQWBFDKKTPNO-UHFFFAOYSA-N 0.000 description 1
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 1
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 1
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- ZKEHTYWGPMMGBC-XUXIUFHCSA-N Ala-Leu-Leu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O ZKEHTYWGPMMGBC-XUXIUFHCSA-N 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- ISCYZXFOCXWUJU-KZVJFYERSA-N Ala-Thr-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O ISCYZXFOCXWUJU-KZVJFYERSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- XPSGESXVBSQZPL-SRVKXCTJSA-N Arg-Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XPSGESXVBSQZPL-SRVKXCTJSA-N 0.000 description 1
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 1
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 1
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 1
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 1
- JEXPNDORFYHJTM-IHRRRGAJSA-N Arg-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCN=C(N)N JEXPNDORFYHJTM-IHRRRGAJSA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- MTYLORHAQXVQOW-AVGNSLFASA-N Arg-Lys-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O MTYLORHAQXVQOW-AVGNSLFASA-N 0.000 description 1
- DNBMCNQKNOKOSD-DCAQKATOSA-N Arg-Pro-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O DNBMCNQKNOKOSD-DCAQKATOSA-N 0.000 description 1
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- ZKDGORKGHPCZOV-DCAQKATOSA-N Asn-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZKDGORKGHPCZOV-DCAQKATOSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 1
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- MFTVXYMXSAQZNL-DJFWLOJKSA-N Asp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)O)N MFTVXYMXSAQZNL-DJFWLOJKSA-N 0.000 description 1
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- BKOIIURTQAJHAT-GUBZILKMSA-N Asp-Pro-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 BKOIIURTQAJHAT-GUBZILKMSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- OFYVKOXTTDCUIL-FXQIFTODSA-N Asp-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OFYVKOXTTDCUIL-FXQIFTODSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 102000029816 Collagenase Human genes 0.000 description 1
- 108060005980 Collagenase Proteins 0.000 description 1
- HNNGTYHNYDOSKV-FXQIFTODSA-N Cys-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N HNNGTYHNYDOSKV-FXQIFTODSA-N 0.000 description 1
- RRJOQIBQVZDVCW-SRVKXCTJSA-N Cys-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N RRJOQIBQVZDVCW-SRVKXCTJSA-N 0.000 description 1
- MKVKKORBPTUSNX-LPEHRKFASA-N Cys-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N MKVKKORBPTUSNX-LPEHRKFASA-N 0.000 description 1
- QQOWCDCBFFBRQH-IXOXFDKPSA-N Cys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N)O QQOWCDCBFFBRQH-IXOXFDKPSA-N 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 238000007399 DNA isolation Methods 0.000 description 1
- 206010058314 Dysplasia Diseases 0.000 description 1
- 102000002322 Egg Proteins Human genes 0.000 description 1
- 108010000912 Egg Proteins Proteins 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 108010010803 Gelatin Proteins 0.000 description 1
- 208000014233 Generalized hypopigmentation Diseases 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 1
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- MFHVAWMMKZBSRQ-ACZMJKKPSA-N Gln-Ser-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N MFHVAWMMKZBSRQ-ACZMJKKPSA-N 0.000 description 1
- YRHZWVKUFWCEPW-GLLZPBPUSA-N Gln-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O YRHZWVKUFWCEPW-GLLZPBPUSA-N 0.000 description 1
- IIMZHVKZBGSEKZ-SZMVWBNQSA-N Gln-Trp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O IIMZHVKZBGSEKZ-SZMVWBNQSA-N 0.000 description 1
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 1
- WOMUDRVDJMHTCV-DCAQKATOSA-N Glu-Arg-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOMUDRVDJMHTCV-DCAQKATOSA-N 0.000 description 1
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- BKRQSECBKKCCKW-HVTMNAMFSA-N Glu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N BKRQSECBKKCCKW-HVTMNAMFSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- JPUNZXVHHRZMNL-XIRDDKMYSA-N Glu-Pro-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JPUNZXVHHRZMNL-XIRDDKMYSA-N 0.000 description 1
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 1
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 1
- ZQNCUVODKOBSSO-XEGUGMAKSA-N Glu-Trp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O ZQNCUVODKOBSSO-XEGUGMAKSA-N 0.000 description 1
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 1
- LUJVWKKYHSLULQ-ZKWXMUAHSA-N Gly-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN LUJVWKKYHSLULQ-ZKWXMUAHSA-N 0.000 description 1
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 1
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 1
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- TTZAWSKKNCEINZ-AVGNSLFASA-N His-Arg-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O TTZAWSKKNCEINZ-AVGNSLFASA-N 0.000 description 1
- CHZKBLABUKSXDM-XIRDDKMYSA-N His-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC3=CN=CN3)N CHZKBLABUKSXDM-XIRDDKMYSA-N 0.000 description 1
- FMRKUXFLLPKVPG-JYJNAYRXSA-N His-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)O FMRKUXFLLPKVPG-JYJNAYRXSA-N 0.000 description 1
- TXLQHACKRLWYCM-DCAQKATOSA-N His-Glu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O TXLQHACKRLWYCM-DCAQKATOSA-N 0.000 description 1
- FZKFYOXDVWDELO-KBPBESRZSA-N His-Gly-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FZKFYOXDVWDELO-KBPBESRZSA-N 0.000 description 1
- JJHWJUYYTWYXPL-PYJNHQTQSA-N His-Ile-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CN=CN1 JJHWJUYYTWYXPL-PYJNHQTQSA-N 0.000 description 1
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 1
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 1
- VXZZUXWAOMWWJH-QTKMDUPCSA-N His-Thr-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VXZZUXWAOMWWJH-QTKMDUPCSA-N 0.000 description 1
- 208000003367 Hypopigmentation Diseases 0.000 description 1
- 208000026519 Hypopigmentation of hair Diseases 0.000 description 1
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 1
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 1
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 1
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 1
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 1
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 1
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 1
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 1
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 1
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- RENBRDSDKPSRIH-HJWJTTGWSA-N Ile-Phe-Met Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O RENBRDSDKPSRIH-HJWJTTGWSA-N 0.000 description 1
- CIJLNXXMDUOFPH-HJWJTTGWSA-N Ile-Pro-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CIJLNXXMDUOFPH-HJWJTTGWSA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 1
- JCGMFFQQHJQASB-PYJNHQTQSA-N Ile-Val-His Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O JCGMFFQQHJQASB-PYJNHQTQSA-N 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 1
- PNUCWVAGVNLUMW-CIUDSAMLSA-N Leu-Cys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O PNUCWVAGVNLUMW-CIUDSAMLSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 1
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 1
- LQUIENKUVKPNIC-ULQDDVLXSA-N Leu-Met-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LQUIENKUVKPNIC-ULQDDVLXSA-N 0.000 description 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- JBRWKVANRYPCAF-XIRDDKMYSA-N Lys-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N JBRWKVANRYPCAF-XIRDDKMYSA-N 0.000 description 1
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- KNKJPYAZQUFLQK-IHRRRGAJSA-N Lys-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N KNKJPYAZQUFLQK-IHRRRGAJSA-N 0.000 description 1
- ZMMDPRTXLAEMOD-BZSNNMDCSA-N Lys-His-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZMMDPRTXLAEMOD-BZSNNMDCSA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- NQSFIPWBPXNJII-PMVMPFDFSA-N Lys-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 NQSFIPWBPXNJII-PMVMPFDFSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 1
- JQECLVNLAZGHRQ-CIUDSAMLSA-N Met-Asp-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O JQECLVNLAZGHRQ-CIUDSAMLSA-N 0.000 description 1
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 1
- MIAZEQZXAFTCCG-UBHSHLNASA-N Met-Phe-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 MIAZEQZXAFTCCG-UBHSHLNASA-N 0.000 description 1
- XPVCDCMPKCERFT-GUBZILKMSA-N Met-Ser-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XPVCDCMPKCERFT-GUBZILKMSA-N 0.000 description 1
- CQRGINSEMFBACV-WPRPVWTQSA-N Met-Val-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O CQRGINSEMFBACV-WPRPVWTQSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 108010065395 Neuropep-1 Proteins 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- 239000012124 Opti-MEM Substances 0.000 description 1
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 1
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 1
- RTUWVJVJSMOGPL-KKUMJFAQSA-N Phe-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RTUWVJVJSMOGPL-KKUMJFAQSA-N 0.000 description 1
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- GLJZDMZJHFXJQG-BZSNNMDCSA-N Phe-Ser-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLJZDMZJHFXJQG-BZSNNMDCSA-N 0.000 description 1
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 1
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 1
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 1
- 206010034960 Photophobia Diseases 0.000 description 1
- 208000012641 Pigmentation disease Diseases 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- WPQKSRHDTMRSJM-CIUDSAMLSA-N Pro-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 WPQKSRHDTMRSJM-CIUDSAMLSA-N 0.000 description 1
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 1
- FISHYTLIMUYTQY-GUBZILKMSA-N Pro-Gln-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 FISHYTLIMUYTQY-GUBZILKMSA-N 0.000 description 1
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 1
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 1
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 102000001183 RAG-1 Human genes 0.000 description 1
- 108060006897 RAG1 Proteins 0.000 description 1
- 108091034057 RNA (poly(A)) Proteins 0.000 description 1
- 208000029091 Refraction disease Diseases 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 1
- LOKXAXAESFYFAX-CIUDSAMLSA-N Ser-His-Cys Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CS)C(O)=O)CC1=CN=CN1 LOKXAXAESFYFAX-CIUDSAMLSA-N 0.000 description 1
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 1
- WOJYIMBIKTWKJO-KKUMJFAQSA-N Ser-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CO)N WOJYIMBIKTWKJO-KKUMJFAQSA-N 0.000 description 1
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- 238000010459 TALEN Methods 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 1
- STGXWWBXWXZOER-MBLNEYKQSA-N Thr-Ala-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 STGXWWBXWXZOER-MBLNEYKQSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 1
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 1
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 1
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 1
- PXYJUECTGMGIDT-WDSOQIARSA-N Trp-Arg-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 PXYJUECTGMGIDT-WDSOQIARSA-N 0.000 description 1
- VMBBTANKMSRJSS-JSGCOSHPSA-N Trp-Glu-Gly Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VMBBTANKMSRJSS-JSGCOSHPSA-N 0.000 description 1
- PALLCTDPFINNMM-JQHSSLGASA-N Trp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N PALLCTDPFINNMM-JQHSSLGASA-N 0.000 description 1
- BABINGWMZBWXIX-BPUTZDHNSA-N Trp-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BABINGWMZBWXIX-BPUTZDHNSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- YMZYSCDRTXEOKD-IHPCNDPISA-N Tyr-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N YMZYSCDRTXEOKD-IHPCNDPISA-N 0.000 description 1
- 102000003425 Tyrosinase Human genes 0.000 description 1
- 108060008724 Tyrosinase Proteins 0.000 description 1
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 1
- JQTYTBPCSOAZHI-FXQIFTODSA-N Val-Ser-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N JQTYTBPCSOAZHI-FXQIFTODSA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 1
- JPBGMZDTPVGGMQ-ULQDDVLXSA-N Val-Tyr-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JPBGMZDTPVGGMQ-ULQDDVLXSA-N 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 230000004430 ametropia Effects 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010080488 arginyl-arginyl-leucine Proteins 0.000 description 1
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 239000003637 basic solution Substances 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 239000006143 cell culture medium Substances 0.000 description 1
- 239000012295 chemical reaction liquid Substances 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 229960002424 collagenase Drugs 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 239000008273 gelatin Substances 0.000 description 1
- 229920000159 gelatin Polymers 0.000 description 1
- 235000019322 gelatine Nutrition 0.000 description 1
- 235000011852 gelatine desserts Nutrition 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010050848 glycylleucine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000003425 hypopigmentation Effects 0.000 description 1
- 208000031040 hypopigmentation of the skin Diseases 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 210000001161 mammalian embryo Anatomy 0.000 description 1
- 235000013372 meat Nutrition 0.000 description 1
- 210000002752 melanocyte Anatomy 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 206010029864 nystagmus Diseases 0.000 description 1
- 210000004681 ovum Anatomy 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 239000008055 phosphate buffer solution Substances 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 230000019612 pigmentation Effects 0.000 description 1
- 230000001323 posttranslational effect Effects 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 229950010131 puromycin Drugs 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 208000014733 refractive error Diseases 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 238000010008 shearing Methods 0.000 description 1
- 238000004904 shortening Methods 0.000 description 1
- 238000002791 soaking Methods 0.000 description 1
- 230000001502 supplementing effect Effects 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 208000011580 syndromic disease Diseases 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 108091005703 transmembrane proteins Proteins 0.000 description 1
- 102000035160 transmembrane proteins Human genes 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Classifications
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A50/00—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
- Y02A50/30—Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change
Landscapes
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
本发明公开了用于OCA2基因突变的CRISPR系统及其在构建白化病克隆猪核供体细胞中的应用。一种用于OCA2基因编辑的gRNA,序列如SEQ ID NO:39所示。一种针对猪OCA2基因的gRNA表达载体,以全序列如SEQ ID NO.3所示的pKG‑U6gRNA为载体骨架,表达所述的gRNA。一种用于猪OCA2基因编辑的CRISPR/Cas9系统,包含Cas9表达载体和所述的针对猪OCA2基因的gRNA表达载体。采用本发明筛选的gRNA联合改造的Cas9高效表达载体进行基因编辑,编辑效率比原载体有了显著的提高。
Description
技术领域
本发明属于基因编辑技术领域,具体涉及用于OCA2基因突变的CRISPR/Cas9系统及其在构建白化病克隆猪核供体细胞中的应用。
背景技术
白化病(albinism)是由于黑色素合成减少或缺乏而导致的多种遗传性疾病的总称,主要表现为皮肤、毛发以及眼部的色素减退。多种基因的突变均可以导致白化病的症状。根据色素缺失的部位以及有无其他系统异常,可将白化病分为非综合征性白化病和综合征性白化病两大类。非综合征白化病又可再分为只有眼部色素缺乏的眼白化病(OA)和皮肤、毛发、眼睛均有色素缺失(即具有全身症状)的眼及皮肤白化病(OCA)。
眼及皮肤白化病(oculocutaneous albinism,OCA)是一组以眼、皮肤、毛发黑色素沉着减少或缺乏为主要临床表现的常染色体隐性遗传病。超过90%的白化病患者为OCA,一般是由黑色素合成或转运相关基因突变所引起的,主要临床表现为普遍色素沉着不足,眼部改变包括黄斑中心凹发育不良、屈光不正、视力低下、畏光、虹膜半透明、眼球震颤、眼底着色不足及视觉纤维通路异常等。OCAII型约占OCA的30-50%,是OCA2基因突变致使其编码的OCA2蛋白功能丧失引起的。已有研究结果表明,OCA2蛋白对于黑素小体的正常生物发生很重要,可能参与黑色素细胞内酪氨酸(黑色素合成的前体)的运输、调节黑素体的pH值和黑素体的成熟度、调节酪氨酸酶的翻译后加工(黑色素合成中的限速反应)等黑素小体蛋白正常加工和转运的过程。因此,基于OCA2基因突变构建眼及皮肤白化病动物模型将为研究及治疗人类相关疾病提供有力的实验工具。
基因编辑是近年来不断取得重大发展的一种生物技术,其包括从基于同源重组的基因敲入到基于核酸酶的ZFN、TALEN、CRISPR/Cas9等编辑技术,其中CRISPR/Cas9技术是当前最先进的基因编辑技术。目前,基因编辑技术被越来越多地应用到动物模型的制作上。猪作为大动物,是人类长期以来主要的肉食供应动物,其体型大小和生理功能与人类近似,易于大规模繁殖饲养,而且在伦理道德及动物保护等方面要求较低,是理想的人类疾病模型动物。
发明内容
本发明的目的是针对现有技术的上述不足,提供一种用于OCA2基因编辑的靶点gRNA及其表达载体。
本发明的另一目的是提供一种用于猪OCA2基因编辑的CRISPR/Cas9系统。
本发明的又一目的是提供所述的gRNA表达载体、所述的CRISPR/Cas9系统在构建眼及皮肤白化病克隆猪核供体细胞中的应用。
一种用于OCA2基因编辑的靶点gRNA,序列如SEQ ID NO.21所示。
一种用于OCA2基因编辑的gRNA,序列如SEQ ID NO.39所示。
一种针对猪OCA2基因的gRNA表达载体,以全序列如SEQ ID NO.3所示的pKG-U6gRNA为载体骨架,表达权利要求2所述的gRNA。
作为本发明的一种优选,所述的表达载体是将SEQ ID NO.27和SEQ ID NO.28所示的单链DNA退火得到的具有粘性末端的双链DNA分子插入载体骨架pKG-U6gRNA的BbsI限制性内切酶位点所得。
一种用于猪OCA2基因编辑的CRISPR/Cas9系统,包含Cas9表达载体和权利要求3或4所述的针对猪OCA2基因的gRNA表达载体。
作为本发明的一种优选,所述的Cas9表达载体为质粒全序列如SEQ ID NO.2所示的pU6gRNA-eEF1a-mNLS-hSpCas9-EGFP-PURO载体。
作为本发明的进一步优选,所述的gRNA表达载体和Cas9表达载体的摩尔比为1~3:1,进一步优选3:1。
本发明所述的gRNA表达载体、所述的CRISPR/Cas9系统在构建眼及皮肤白化病克隆猪核供体细胞中的应用。
一种重组猪成纤维细胞,由所述的CRISPR/Cas9系统共转染猪原代成纤维细胞经验证后所得。
本发明所述的重组细胞在构建OCA2基因敲除的克隆猪中的应用;优选在构建OCA2基因敲除的眼及皮肤白化病克隆猪中的应用。
与现有技术相比,本发明至少具有如下有益效果:
(1)本发明研究对象(猪)比其他动物(大小鼠、灵长类)具有更好的应用性。
大小鼠等啮齿类动物不论从生理、病理及体型上,均与人相差巨大,无法真实地模拟人类正常的生理、病理状态。灵长类动物扩繁速度慢、数量少、成本高,动物保护及伦理等方面的要求也较高。而猪就没有上述缺点,同时猪的克隆技术非常成熟,饲养及克隆成本较灵长类低得多。因此猪是非常适合作为人类疾病模型的动物。
(2)本发明中经过实验验证改造的pU6gRNA-eEF1a-mNLS-hSpCas9-EGFP-PURO载体相对改造前的pX330载体,更换了更强的启动子及添加了增强蛋白翻译的元件,提高了Cas9的表达,并且增加了核定位信号个数,提高了Cas9蛋白的核定位能力,具有更高的基因编辑效率。本发明还在载体中加入了荧光标记及抗性标记,使其更方便运用于载体阳性转化细胞的筛选及富集。采用本发明筛选的gRNA联合改造的Cas9高效表达载体进行基因编辑,编辑效率较之原载体有极显著提高。
(3)本发明针对OCA2基因的不同靶点gRNA设计了相应的表达载体,通过筛选获得具有较高编辑效率的gRNA及其表达载体。配合本发明改造的Cas9高效表达载体进行基因编辑,通过靶标基因PCR产物测序结果可分析出所获单细胞克隆的基因型(双等位基因相同变异的纯合突变型、双等位基因不同变异的纯合突变型、杂合突变型或野生型),获得纯合突变的概率为30%~50%,优于使用胚胎注射技术的模型制备方法(即受精卵注射基因编辑材料)中获得纯合突变的概率(低于5%)。
(4)利用本发明所得到的纯合突变单细胞克隆株进行体细胞核移植动物克隆可直接得到含靶标基因纯合突变的克隆猪,并且该纯合突变可稳定遗传。
本发明采用技术难度大、挑战性高的原代细胞体外编辑并筛选阳性编辑单细胞克隆的方法,后期再通过体细胞核移植动物克隆技术直接获得相应疾病模型猪,可大大缩短模型猪制作周期并节省人力、物力、财力。
附图说明
图1为质粒pX330的结构示意图。
图2为质粒pU6gRNACas9的结构示意图。
图3为pU6gRNA-eEF1a Cas9载体的结构图谱。
图4为pU6gRNA-eEF1a Cas9+nNLS载体图谱。
图5为质粒pKG-GE3的结构示意图。
图6为质粒pKG-U6gRNA的结构示意图。
图7为将20bp左右的DNA分子(用于转录形成gRNA的靶序列结合区)插入质粒pKG-U6gRNA的示意图。
图8为实施例2中步骤2.3.3的测序峰图。
图9为实施例2中步骤2.4.3的测序峰图。
图10为实施例3中步骤3.2.3以猪的基因组DNA为模板分别采用OCA2-E3g-F1/OCA2-E3g-R409(组1)、OCA2-E3g-F1/OCA2-E3g-R473(组2)、OCA2-E3g-F2/OCA2-E3g-R409(组3)、OCA2-E3g-F2/OCA2-E3g-R473(组4)组成的引物对进行PCR扩增后的电泳图。
图11为实施例3中步骤3.2.4以18只猪的基因组DNA为模板采用OCA2-E3g-F1/OCA2-E3g-R473组成的引物对进行PCR扩增后的电泳图。
图12为实施例3中步骤3.6.4以基因组DNA为模板采用OCA2-E3g-F1/OCA2-E3g-R473组成的引物对进行PCR扩增后的电泳图。
图13为实施例4中步骤4.4.4以基因组DNA为模板采用OCA2-E3g-F1/OCA2-E3g-R473组成的引物对进行PCR扩增后的电泳图。
图14为实施例5中步骤5.4.4以基因组DNA为模板采用OCA2-E3g-F1/OCA2-E3g-R473组成的引物对进行PCR扩增后的电泳图。
图15为实施例5中步骤5.4.5判定靶基因为野生型的示例性测序峰图。
图16为实施例5中步骤5.4.5判定靶基因为双等位相同变异的纯合突变型的示例性测序峰图。
图17为实施例5中步骤5.4.5判定靶基因为杂合突变型的示例性测序峰图。
图18为实施例5中步骤5.4.5判定靶基因为双等位不同变异的纯合突变型的示例性测序峰图。
具体实施方式
实施例1、质粒的构建
1.1构建质粒pU6gRNA-eEF1a-mNLS-hSpCas9-EGFP-PURO(简称质粒pKG-GE3)
原始质粒pX330-U6-Chimeric_BB-CBh-hSpCas9(简称质粒pX330),序列如SEQ IDNO.1所示。质粒pX330的结构示意图见图1。SEQ ID NO.1中,第440-725位核苷酸组成CMV增强子,第727-1208位核苷酸组成chickenβ-actin启动子,第1304-1324位核苷酸编码SV40核定位信号(NLS),第1325-5449位核苷酸编码Cas9蛋白,第5450-5497位核苷酸编码nucleoplasmin核定位信号(NLS)。
质粒pU6gRNAeEF1a-mNLS-hSpCas9-EGFP-PURO,简称质粒pKG-GE3,核苷酸如SEQID NO.2所示。与质粒pX330相比,质粒pKG-GE3主要进行了如下改造:①去除残留的gRNA骨架序列(GTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTTTT),降低干扰;②将原有chickenβ-actin启动子改造为具更高表达活性的EF1a启动子,增加Cas9基因的蛋白表达能力;③在Cas9基因的上游和下游均增加核定位信号编码基因(NLS),增加Cas9蛋白的核定位能力;④原质粒无任何真核细胞筛选标记,不利于阳性转化细胞的筛选和富集,依次在Cas9基因的下游插入P2A-EGFP-T2A-PURO编码基因,赋予载体荧光和真核细胞抗性筛选能力;⑤插入WPRE元件和3’LTR序列元件,增强Cas9基因的蛋白翻译能力。
pKG-GE3质粒构建方法如下:
(1)去除gRNA骨架中多余无效的序列
质粒pX330用BbsI、XbaI酶切,回收载体片段(约8313bp左右),利用多片段重组法合成插入片段175bp(SEQ ID NO.4),与回收后载体片段重组得到pU6gRNACas9载体(图2)。
(2)改造启动子及增强子
对构建好的pU6gRNACas9载体,用XbaI和AgeI内切酶去除启动子(chickenβ-actin启动子)及增强子序列(CMV增强子),回收线性载体序列约7650bp,并利用多片段重组法合成554bp的包含CMV增强子及EF1a启动子的序列(SEQ ID NO.5),与酶切后载体pU6gRNACas9重组得到pU6gRNA-eEF1a Cas9载体(图3)。
(3)Cas9基因N端增加NLS序列
将构建好的载体pU6gRNA-eEF1a Cas9使用AgeI、BglII酶切,回收7786bp载体序列,并将增加了NLS的序列补充到酶切位点,即利用多片段重组法合成447bp的包括2个核定位信号及部分切除的Cas9编码序列(SEQ ID NO.6),重组得到pU6gRNA-eEF1a Cas9+nNLS载体(图4)。
(4)Cas9基因C端加入NLS、P2A-EGFP-T2A-PURO、WPRE-3’LTR-bGH polyAsignals
以上构建好的载体命名为pU6gRNA-eEF1a Cas9+nNLS,使用FseI、SbfI酶切,回收载体序列7781bp,利用多片段重组法合成2727bp的包括NLS-P2A-EGFP-T2A-PURO-WPRE-3’LTR-bGH polyAsignals的片段(SEQ ID NO.7),与载体片段重组得到载体pU6gRNA-eEF1a-mNLS-hSpCas9-EGFP-PURO,简称pKG-GE3,质粒图谱如图5,核苷酸序列(SEQ ID NO.2)。
SEQ ID NO.2中,第395-680位核苷酸组成CMV增强子,第682-890位核苷酸组成EF1a启动子,第986-1006位核苷酸编码核定位信号(NLS),第1016-1036位核苷酸编码核定位信号(NLS),第1037-5161位核苷酸编码Cas9蛋白,第5162-5209位核苷酸编码核定位信号(NLS),第5219-5266位核苷酸编码核定位信号(NLS),第5276-5332位核苷酸编码自剪切多肽P2A(自剪切多肽P2A的氨基酸序列为“ATNFSLLKQAGDVEENPGP”,发生自剪切的断裂位置为C端开始第一个氨基酸残基和第二个氨基酸残基之间),第5333-6046位核苷酸编码EGFP蛋白,第6056-6109位核苷酸编码自裂解多肽T2A(自裂解多肽T2A的氨基酸序列为“EGRGSLLTCGDVEENPGP”,发生自裂解的断裂位置为C端开始第一个氨基酸残基和第二个氨基酸残基之间),第6110-6703位核苷酸编码Puromycin蛋白(简称Puro蛋白),第6722-7310位核苷酸组成WPRE序列元件,第7382-7615位核苷酸组成3’LTR序列元件,第7647-7871位核苷酸组成bGH poly(A)signal序列元件。SEQ ID NO.2中,第911-6706形成融合基因,表达融合蛋白。由于自剪切多肽P2A和自裂解多肽T2A的存在,融合蛋白自发形成如下三个蛋白:具有Cas9蛋白的蛋白、具有EGFP蛋白的蛋白和具有Puro蛋白的蛋白。
1.2构建pKG-U6gRNA载体
来源pUC57载体,通过EcoRV酶切位点,连接pKG-U6gRNA插入序列(含U6启动子、BbsI酶切位点和gRNA骨架序列的DNA片段,序列如SEQ ID NO.8所示),反向插入到pUC57载体,得到pKG-U6gRNA载体全序列(SEQ ID NO.3),SEQ ID NO.3中,第2280-2539位核苷酸组成hU6启动子,第2558-2637位核苷酸用于转录形成gRNA骨架。使用时,将20bp左右的DNA分子(用于转录形成gRNA的靶序列结合区)插入质粒pKG-U6gRNA,形成重组质粒,在细胞中重组质粒转录得到gRNA。所构建的pKG-U6gRNA载体图谱如图6。
实施例2质粒配比优化以及质粒pX330和质粒pKG-GE3的效果比较
2.1靶点gRNA设计及构建
2.1.1使用Benchling对RAG1基因进行靶点gRNA设计
RAG1-g4:AGTTATGGCAGAACTCAGTG(SEQ ID NO.9)
合成针对上述RAG1基因靶点的插入序列互补DNA Oligo如下:
RAG1-gRNA4S:caccgAGTTATGGCAGAACTCAGTG(SEQ ID NO.10)
RAG1-gRNA4A:aaacCACTGAGTTCTGCCATAACTc(SEQ ID NO.11)
RAG1-gRNA4S、RAG1-gRNA4A均为单链DNA分子。
2.1.2设计用于扩增和检测包含RAG1 gRNA靶点片段的引物
RAG1-nF126:CCCCATCCAAAGTTTTTAAAGGA(SEQ ID NO.12)
RAG1-nR525:TGTGGCAGATGTCACAGTTTAGG(SEQ ID NO.13)
2.1.3将gRNA序列克隆到pKG-U6gRNA骨架载体上的方法
1)用限制性内切酶BbsI消化1ug pKG-U6gRNA质粒;
2)酶切的pKG-U6gRNA质粒跑琼脂糖凝胶(琼脂糖凝胶浓度1%,即1g琼脂糖凝胶加入到100mL电泳缓冲液中)分离,用胶回收试剂盒(Vazyme)纯化回收酶切产物;
3)将2.1.1所述靶点合成的2条互补DNA Oligo,通过以下退火程序形成与pKG-U6gRNA载体BbsI酶切后粘性末端互补的DNA双链,示意如图7:
95℃,5min然后以5℃/min的速率降至25℃;
4)按照以下体系启动连接反应:室温反应10min
37℃反应60min;
5)转化
按照感受态细胞(Vazyme)说明书进行操作。
2.1.4 gRNA载体构建
1)将合成的RAG1-gRNA4S和RAG1-gRNA4A混合并进行退火,得到具有粘性末端的双链DNA分子。将具有粘性末端的双链DNA分子和载体骨架连接,得到质粒pKG-U6gRNA(RAG1-gRNA4)。质粒pKG-U6gRNA(RAG1-gRNA4)将会表达SEQ ID NO.14所示的RAG1-gRNA4。
2.1.5 gRNA载体鉴定
从LB平板上挑取单克隆置入加有相应抗生素的LB培养液内,37℃恒温摇床内培养12-16h后小提质粒后送通用公司测序,经序列比对确认RAG1-gRNA4载体构建成功。
2.2猪原代成纤维细胞制备
2.2.1取初生从江香猪耳组织0.5g,去除外部组织,75﹪酒精浸泡30-40s;
2.2.2用含5%P/S(Gibco Penicillin-Streptomycin)的PBS洗涤5次,不含P/S的PBS洗一次;
其中5%P/S的PBS配方为:5%P/S(Gibco Penicillin-Streptomycin)+95%PBS,5%、95%为体积百分比。
2.2.3用剪刀将组织剪碎,加入5mL 0.1%的胶原酶(Sigma)溶液,37℃摇床消化1h;
2.2.4 500g离心5min,去上清,将沉淀用1mL完全培养基重悬,铺入含10mL完全培养基并已用0.2%明胶(VWR)封盘的10cm细胞培养皿中。
其中,细胞完全培养基的配方为:15%胎牛血清(Gibco)+83%DMEM培养基
(Gibco)+1%P/S(Gibco Penicillin-Streptomycin)+1%HEPES(Solarbio),15%、83%、1%、1%为体积百分比。
2.2.5置于37℃,5%CO2(体积百分比)、5%O2(体积百分比)的恒温培养箱中进行培养;
2.2.6将细胞培养至长满皿底60%左右时使用0.25%(Gibco)的胰蛋白酶将细胞消化下来,然后加入完全培养基终止消化,将细胞悬液转入15mL离心管中,400g离心4min,弃去上清,使用1mL完全培养基重悬细胞以备下一步核转染实验。
2.3质粒配比优化
2.3.1共转染分组情况
第一组:将质粒pKG-U6gRNA(RAG1-gRNA4)和质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.44μg质粒pKG-U6gRNA(RAG1-gRNA4):1.56μg质粒pKG-GE3。即质粒pKG-U6gRNA(RAG1-gRNA4)和质粒pKG-GE3的摩尔配比为1:1。
第二组:将质粒pKG-U6gRNA(RAG1-gRNA4)和质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.72μg质粒pKG-U6gRNA(RAG1-gRNA4):1.28μg质粒pKG-GE3。即质粒pKG-U6gRNA(RAG1-gRNA4)和质粒pKG-GE3的摩尔配比为2:1。
第三组:将质粒pKG-U6gRNA(RAG1-gRNA4)和质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.92μg质粒pKG-U6gRNA(RAG1-gRNA4):1.08μg质粒pKG-GE3。即质粒pKG-U6gRNA(RAG1-gRNA4)和质粒pKG-GE3的摩尔配比为3:1。
第四组:将质粒pKG-U6gRNA(RAG1-gRNA4)转染致猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:1μg质粒pKG-U6gRNA(RAG1-gRNA4)。
2.3.2共转染操作方法
使用哺乳动物成纤维细胞核转染试剂盒(Neon)与Neon TM transfection system电转仪进行转染实验。
1)按照上述分组配制电转反应液,混匀过程中注意切勿产生气泡;
2)将步骤一制备得到的细胞悬液使用PBS磷酸缓冲液(Solarbio)洗一遍,600g离心6min,弃去上清,使用11μL电转基本溶液Opti-MEM重悬细胞,重悬过程中要避免气泡的产生;
3)吸取10μL细胞悬液,加入步骤1)中的电转反应液中混匀,混匀过程中注意切勿产生气泡;
4)将试剂盒带有的电转杯放置于Neon TM transfection system电转仪杯槽内,加入3mL E Buffer;
5)用电转枪吸取10μL步骤3)得到的混合液,插入点击杯内,选择电转程序(1450V10ms 3pulse),电击转染后立即在超净台内将电转枪中混合液转入到6孔板中,每孔含3mL15%胎牛血清(Gibco)+83%DMEM培养基(Gibco)+1%P/S(Gibco Penicillin-Streptomycin)+1%HEPES(Solarbio)的完全培养液;
6)混匀后放置于37℃,5%CO2、5%O2的恒温培养箱中进行培养;
7)电转后6-12h换液,36-48h用0.25%(Gibco)的胰蛋白酶消化并收集细胞于1.5mL离心管中。
2.3.3基因编辑效率分析
提取2.3.2中收集的细胞基因组DNA,采用RAG1-nF126和RAG1-nR525组成的引物对进行PCR扩增,将产物进行测序。测序结果利用网页版Synthego ICE工具分析测序峰图得出第一组、第二组、第三组的编辑效率依次为9%、53%、66%,测序结果示例性峰图见图8。分析认定第三组的基因编辑效率最高,即确定gRNA质粒与Cas9质粒最适用量为摩尔比3:1,质粒实际用量为0.92μg:1.08μg。
2.4质粒pX330和质粒pKG-GE3的效果比较
2.4.1共转染分组情况
RAG1-330组:将质粒pKG-U6gRNA(RAG1-gRNA4)和质粒pX330共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.92μg质粒pKG-U6gRNA(RAG1-gRNA4):1.08μg质粒pX330。
RAG1-KG组:将质粒pKG-U6gRNA(RAG1-gRNA4)和质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.92μg质粒pKG-U6gRNA(RAG1-gRNA4):1.08μg质粒pKG-GE3。
RAG1-B组:将质粒pKG-U6gRNA(RAG1-gRNA4)转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.92μg质粒pKG-U6gRNA(RAG1-gRNA4)。
2.4.2共转染操作方法
同本实施例中2.3.2。
2.4.3基因编辑效率分析
提取2.4.2中收集的细胞基因组DNA,采用RAG1-nF126和RAG1-nR525组成的引物对进行PCR扩增,将产物进行测序。测序结果利用网页版Synthego ICE工具分析测序峰图得出RAG1-330组、RAG1-KG组的编辑效率分别为28%、68%,测序结果示例性峰图见图9,结果表明,与采用质粒pX330相比,采用质粒pKG-GE3使得基因编辑效率显著提高。
实施例3针对OCA2基因筛选高效的靶点gRNA(第一次测试)
3.1基因组DNA的提取
使用Vazyme公司FastPure Cell/Tissue DNA Isolation Mini Kit(VazymeCat.DC102-01)分别进行18只猪(雄性A、B、C、D、E、F、G、H雌性1、2、3、4、5、6、7、8、9、10)耳组织的基因组DNA的柱式提取,使用NanoDrop进行定量,-20℃保存备用。
3.2 OCA2基因敲除预设靶点及邻近基因组序列保守性分析
3.2.1猪OCA2基因信息
编码一种黑素体跨膜蛋白;位于15号染色体;GeneID为397171,Sus scrofa。猪OCA2基因编码的氨基酸序列如SEQ ID NO.15所示。已有研究结果表明OCA2在黑素小体的正常生物发生过程中起重要作用,在猪基因组DNA中,OCA2基因具有24个外显子,其中第3外显子在所有转录本中均占据重要位置(猪OCA2基因第3外显子序列,含部分第2和部分第3内含子序列如SEQ ID NO.16所示)。
3.2.2 OCA2基因敲除预设靶点外显子及邻近基因组序列PCR扩增引物设计
根据查到的猪OCA2基因组序列
(https://www.ncbi.nlm.nih.gov/nuccore/NC_010457.5?report=genbank& from=56657648&to=56869920),设计引物扩增前述18只猪基因组样品OCA2基因外显子3的位点。
使用Oligo7软件进行引物设计,设计结果如下:
OCA2-E3g-F1:AGGGGTCTGTGATGTCAAGAAAG(SEQ ID NO.17)
OCA2-E3g-R473:TACACGCTGGTTTGCAGGAT(SEQ ID NO.18)
OCA2-E3g-F2:ACTACGGTGTCTGTGGTCCT(SEQ ID NO.19)
OCA2-E3g-R409:TTGGTAAGGATGTCATGAGGGTA(SEQ ID NO.20)
3.2.3 OCA2基因组PCR扩增引物筛选
使用猪(雌性1)耳朵组织提取的基因组为模板,使用设计的两条上游引物和两条下游引物分别组合,Max酶(Vazyme公司,货号:P505)进行PCR,产物进行1%琼脂糖凝胶电泳以筛选好的扩增引物,结果如图10,组1为:引物OCA2-E3g-F1/OCA2-E3g-R409;组2为:引物OCA2-E3g-F1/OCA2-E3g-R473;组3为:引物OCA2-E3g-F2/OCA2-E3g-R409;组4为:引物OCA2-E3g-F2/OCA2-E3g-R473,优选引物对OCA2-E3g-F1/OCA2-E3g-R473对目的片段进行扩增。
3.2.4 18只猪OCA2基因片段PCR扩增
分别以18个基因组模板(雄性A、B、C、D、E、F、G、H雌性1、2、3、4、5、6、7、8、9、10),引物OCA2-E3g-F1/OCA2-E3g-R473,Max酶进行OCA2基因组片段的扩增,产物(519bp)进行1%琼脂糖凝胶电泳,结果如图11。
3.2.5 OCA2基因序列保守性分析
以上PCR扩增产物,使用扩增引物进行测序(通用生物公司测序),将测序结果与公共数据库中的OCA2基因序列进行比对分析。根据比对结果,扩增片段序列相对保守,所设计的引物本身无可能的突变位点。
3.3靶点gRNA设计及构建
3.3.1使用Benchling进行靶点gRNA设计
设计靶点已避开可能的突变位点,使用Benchling进行靶点gRNA设计:
https://benchling.com/
OCA2基因敲除靶点设计如下:
OCA2-E3-gRNA1:GGGAGAACAGCTCAGCTGAG
OCA2-E3-gRNA2:CAGCAGTGATGAACTCTGGA
OCA2-E3-gRNA3:TGCCAACTCTGTGCTCAGCA
OCA2-E3-gRNA4:GATCTGCCTCCACAGAGAAG
合成的OCA2基因共4个靶点的插入序列互补DNA Oligo如下:
OCA2-E3-gRNA1-S:caccGGGAGAACAGCTCAGCTGAG
OCA2-E3-gRNA1-A:aaacCTCAGCTGAGCTGTTCTCCC
OCA2-E3-gRNA2-S:caccgCAGCAGTGATGAACTCTGGA
OCA2-E3-gRNA2-A:aaacTCCAGAGTTCATCACTGCTGc
OCA2-E3-gRNA3-S:caccgTGCCAACTCTGTGCTCAGCA
OCA2-E3-gRNA3-A:aaacTGCTGAGCACAGAGTTGGCAc
OCA2-E3-gRNA4-S:caccGATCTGCCTCCACAGAGAAG
OCA2-E3-gRNA4-A:aaacCTTCTCTGTGGAGGCAGATC
OCA2-E3-gRNA1-S、OCA2-E3-gRNA1-A、OCA2-E3-gRNA2-S、OCA2-E3-gRNA2-A、OCA2-E3-gRNA3-S、OCA2-E3-gRNA3-A、OCA2-E3-gRNA4-S、OCA2-E3-gRNA4-A均为单链DNA分子。
3.3.2将gRNA序列克隆到pKG-U6gRNA骨架载体上的方法
同实施例2中2.1.3。
3.3.3 gRNA载体构建
1)将合成的OCA2-E3-gRNA1-S和OCA2-E3-gRNA1-A混合并进行退火,得到具有粘性末端的双链DNA分子。将具有粘性末端的双链DNA分子和载体骨架连接,得到质粒pKG-U6gRNA(OCA2-E3-gRNA1)。质粒pKG-U6gRNA(OCA2-E3-gRNA1)将会表达OCA2-E3-gRNA1。OCA2-E3-gRNA1序列为:GGGAGAACAGCUCAGCUGAGguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu
2)将合成的OCA2-E3-gRNA2-S和OCA2-E3-gRNA2-A混合并进行退火,得到具有粘性末端的双链DNA分子。将具有粘性末端的双链DNA分子和载体骨架连接,得到质粒pKG-U6gRNA(OCA2-E3-gRNA2)。质粒pKG-U6gRNA(OCA2-E3-gRNA2)将会表达OCA2-E3-gRNA2。OCA2-E3-gRNA2序列为:CAGCAGUGAUGAACUCUGGAguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu
3)将合成的OCA2-E3-gRNA3-S和OCA2-E3-gRNA3-A混合并进行退火,得到具有粘性末端的双链DNA分子。将具有粘性末端的双链DNA分子和载体骨架连接,得到质粒pKG-U6gRNA(OCA2-E3-gRNA3)。质粒pKG-U6gRNA(OCA2-E3-gRNA3)将会表达OCA2-E3-gRNA3。OCA2-E3-gRNA3序列为:UGCCAACUCUGUGCUCAGCAguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu
4)将合成的OCA2-E3-gRNA4-S和OCA2-E3-gRNA4-A混合并进行退火,得到具有粘性末端的双链DNA分子。将具有粘性末端的双链DNA分子和载体骨架连接,得到质粒pKG-U6gRNA(OCA2-E3-gRNA4)。质粒pKG-U6gRNA(OCA2-E3-gRNA4)将会表达OCA2-E3-gRNA4。OCA2-E3-gRNA4序列为:GAUCUGCCUCCACAGAGAAGguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu
3.3.3 gRNA载体鉴定
从LB平板上挑取单克隆置入加有相应抗生素的LB培养液内,37℃恒温摇床内培养12-16h后小提质粒后送通用公司测序,经序列比对确认pKG-U6gRNA(OCA2-E3-gRNA1)、pKG-U6gRNA(OCA2-E3-gRNA2)、pKG-U6gRNA(OCA2-E3-gRNA3)和pKG-U6gRNA(OCA2-E3-gRNA4)载体均构建成功。
3.4猪原代成纤维细胞制备
同实施例2中2.2。
3.5使用构建好的gRNA质粒、Cas9质粒(pKG-GE3)共转染猪原代成纤维细胞。
3.5.1共转染分组情况
第一组:将质粒pKG-U6gRNA(OCA2-E3-gRNA1)、质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.92μg质粒pKG-U6gRNA(OCA2-E3-gRNA1):1.08μg质粒pKG-GE3,其中pKG-U6gRNA(OCA2-E3-gRNA1)与pKG-GE3摩尔比为3:1。
第二组:将质粒pKG-U6gRNA(OCA2-E3-gRNA2)、质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.92μg质粒pKG-U6gRNA(OCA2-E3-gRNA2):1.08μg质粒pKG-GE3,其中pKG-U6gRNA(OCA2-E3-gRNA2)与pKG-GE3摩尔比为3:1。
第三组:将质粒pKG-U6gRNA(OCA2-E3-gRNA3)、质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.92μg质粒pKG-U6gRNA(OCA2-E3-gRNA3):1.08μg质粒pKG-GE3,其中pKG-U6gRNA(OCA2-E3-gRNA3)与pKG-GE3摩尔比为3:1。
第四组:将质粒pKG-U6gRNA(OCA2-E3-gRNA4)、质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.92μg质粒pKG-U6gRNA(OCA2-E3-gRNA4):1.08μg质粒pKG-GE3,其中pKG-U6gRNA(OCA2-E3-gRNA4)与pKG-GE3摩尔比为3:1。
第五组:猪原代成纤维细胞,同等电转参数不加质粒进行电转染操作。
3.5.2共转染操作方法
同实施例2中2.3.2。
3.6 OCA2基因不同靶点gRNA的编辑效率分析
3.6.1分别向步骤3.5.2中收集在1.5mL离心管中的5组细胞加入10μL KAPA2G裂解液裂解细胞提取细胞基因组DNA。
KAPA2G裂解液配制体系如下:
10X extract Buffer 1μL
Enzyme 0.2μL
ddH2O 8.8μL
75℃15min—95℃5min—4℃,反应结束后基因组DNA于-20℃保存;
3.6.2采用前述针对OCA2基因E3引物OCA2-E3g-F1/OCA2-E3g-R473进行检测突变,PCR目的产物长度为519bp;
3.6.3使用常规PCR反应扩增OCA2靶点基因;
3.6.4将PCR反应产物进行1%琼脂糖凝胶电泳,如图12,将目的产物及其附近产物切胶回收后送测序公司进行测序,然后将测序结果利用网页版Synthego ICE工具分析测序峰图得出OCA2-E3-gRNA1、OCA2-E3-gRNA2、OCA2-E3-gRNA3、OCA2-E3-gRNA4不同靶点的编辑效率依次为1%、1%、0、1%。结果表明,4个gRNA的编辑效率均远低于20%,本次未筛选到较优的高效靶点gRNA。
实施例4针对OCA2基因筛选高效的靶点gRNA(第二次测试)
4.1靶点gRNA设计及构建
4.1.1使用Benchling进行靶点gRNA设计
设计靶点已避开可能的突变位点,使用Benchling进行靶点gRNA设计:
https://benchling.com/
OCA2基因敲除靶点设计如下:
OCA2-E3-g5:CAGAAGCTCCCCCTTCTCTG(SEQ ID NO.21)
OCA2-E3-g6:TCAGCTGAGCTGTTCTCCCA(SEQ ID NO.22)
OCA2-E3-g7:AGCTCAGCTGAGTGGGAGGG(SEQ ID NO.23)
OCA2-E3-g8:GGAGAACAGCTCAGCTGAGT(SEQ ID NO.24)
OCA2-E3-g9:TCATCACTGCTGATGAACCT(SEQ ID NO.25)
OCA2-E3-g10:AGCCCTGCTGAGCACAGAGT(SEQ ID NO.26)
合成的OCA2基因共4个靶点的插入序列互补DNA Oligo如下:
OCA2-E3-gRNA5-S:caccgCAGAAGCTCCCCCTTCTCTG(SEQ ID NO.27)
OCA2-E3-gRNA5-A:aaacCAGAGAAGGGGGAGCTTCTGc(SEQ ID NO.28)
OCA2-E3-gRNA6-S:caccgTCAGCTGAGCTGTTCTCCCA(SEQ ID NO.29)
OCA2-E3-gRNA6-A:aaacTGGGAGAACAGCTCAGCTGAc(SEQ ID NO.30)
OCA2-E3-gRNA7-S:caccgAGCTCAGCTGAGTGGGAGGG(SEQ ID NO.31)
OCA2-E3-gRNA7-A:aaacCCCTCCCACTCAGCTGAGCTc(SEQ ID NO.32)
OCA2-E3-gRNA8-S:caccGGAGAACAGCTCAGCTGAGT(SEQ ID NO.33)
OCA2-E3-gRNA8-A:aaacACTCAGCTGAGCTGTTCTCC(SEQ ID NO.34)
OCA2-E3-gRNA9-S:caccgTCATCACTGCTGATGAACCT(SEQ ID NO.35)
OCA2-E3-gRNA9-A:aaacAGGTTCATCAGCAGTGATGAc(SEQ ID NO.36)
OCA2-E3-gRNA10-S:caccgAGCCCTGCTGAGCACAGAGT(SEQ ID NO.37)
OCA2-E3-gRNA10-A:aaacACTCTGTGCTCAGCAGGGCTc(SEQ ID NO.38)
OCA2-E3-gRNA5-S、OCA2-E3-gRNA5-A、OCA2-E3-gRNA6-S、OCA2-E3-gRNA6-A、OCA2-E3-gRNA7-S、OCA2-E3-gRNA7-A、OCA2-E3-gRNA8-S、OCA2-E3-gRNA8-A、OCA2-E3-gRNA9-S、OCA2-E3-gRNA9-A、OCA2-E3-gRNA10-S、OCA2-E3-gRNA10-A均为单链DNA分子。
4.1.2将gRNA序列克隆到pKG-U6gRNA骨架载体上的方法
同实施例2中2.1.3。
4.1.3 gRNA载体构建
1)将合成的OCA2-E3-gRNA5-S和OCA2-E3-gRNA5-A混合并进行退火,得到具有粘性末端的双链DNA分子。将具有粘性末端的双链DNA分子和载体骨架连接,得到质粒pKG-U6gRNA(OCA2-E3-gRNA5)。质粒pKG-U6gRNA(OCA2-E3-gRNA5)将会表达SEQ ID NO.39所示的OCA2-E3-gRNA5。
SEQ ID NO.39:
CAGAAGCUCCCCCUUCUCUGguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu
2)将合成的OCA2-E3-gRNA6-S和OCA2-E3-gRNA6-A混合并进行退火,得到具有粘性末端的双链DNA分子。将具有粘性末端的双链DNA分子和载体骨架连接,得到质粒pKG-U6gRNA(OCA2-E3-gRNA6)。质粒pKG-U6gRNA(OCA2-E3-gRNA6)将会表达SEQ ID NO.40所示的OCA2-E3-gRNA6。
SEQ ID NO.40:
UCAGCUGAGCUGUUCUCCCAguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu
3)将合成的OCA2-E3-gRNA7-S和OCA2-E3-gRNA7-A混合并进行退火,得到具有粘性末端的双链DNA分子。将具有粘性末端的双链DNA分子和载体骨架连接,得到质粒pKG-U6gRNA(OCA2-E3-gRNA7)。质粒pKG-U6gRNA(OCA2-E3-gRNA7)将会表达SEQ ID NO.41所示的OCA2-E3-gRNA7。
SEQ ID NO.41:
AGCUCAGCUGAGUGGGAGGGguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu
4)将合成的OCA2-E3-gRNA8-S和OCA2-E3-gRNA8-A混合并进行退火,得到具有粘性末端的双链DNA分子。将具有粘性末端的双链DNA分子和载体骨架连接,得到质粒pKG-U6gRNA(OCA2-E3-gRNA8)。质粒pKG-U6gRNA(OCA2-E3-gRNA8)将会表达SEQ ID NO.42所示的OCA2-E3-gRNA8。
SEQ ID NO.42:
GGAGAACAGCUCAGCUGAGUguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu
5)将合成的OCA2-E3-gRNA9-S和OCA2-E3-gRNA9-A混合并进行退火,得到具有粘性末端的双链DNA分子。将具有粘性末端的双链DNA分子和载体骨架连接,得到质粒pKG-U6gRNA(OCA2-E3-gRNA9)。质粒pKG-U6gRNA(OCA2-E3-gRNA9)将会表达SEQ ID NO.43所示的OCA2-E3-gRNA9。
SEQ ID NO.43:
UCAUCACUGCUGAUGAACCUguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu
6)将合成的OCA2-E3-gRNA10-S和OCA2-E3-gRNA10-A混合并进行退火,得到具有粘性末端的双链DNA分子。将具有粘性末端的双链DNA分子和载体骨架连接,得到质粒pKG-U6gRNA(OCA2-E3-gRNA10)。质粒pKG-U6gRNA(OCA2-E3-gRNA10)将会表达SEQ ID NO.44所示的OCA2-E3-gRNA10。
SEQ ID NO.44:
AGCCCUGCUGAGCACAGAGUguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu
4.1.3gRNA载体鉴定
从LB平板上挑取单克隆置入加有相应抗生素的LB培养液内,37℃恒温摇床内培养12-16h后小提质粒后送通用公司测序,经序列比对确认pKG-U6gRNA(OCA2-E3-gRNA5)、pKG-U6gRNA(OCA2-E3-gRNA6)、pKG-U6gRNA(OCA2-E3-gRNA7)、pKG-U6gRNA(OCA2-E3-gRNA8)、pKG-U6gRNA(OCA2-E3-gRNA9)和pKG-U6gRNA(OCA2-E3-gRNA10)载体均构建成功。
4.2猪原代成纤维细胞制备
同实施例2中2.2。
4.3使用构建好的gRNA质粒、Cas9质粒(pKG-GE3)共转染猪原代成纤维细胞。
4.3.1共转染分组情况
第一组:将质粒pKG-U6gRNA(OCA2-E3-gRNA5)、质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.92μg质粒pKG-U6gRNA(OCA2-E3-gRNA5):1.08μg质粒pKG-GE3,其中pKG-U6gRNA(OCA2-E3-gRNA5)与pKG-GE3摩尔比为3:1。
第二组:将质粒pKG-U6gRNA(OCA2-E3-gRNA6)、质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.92μg质粒pKG-U6gRNA(OCA2-E3-gRNA6):1.08μg质粒pKG-GE3,其中pKG-U6gRNA(OCA2-E3-gRNA6)与pKG-GE3摩尔比为3:1。
第三组:将质粒pKG-U6gRNA(OCA2-E3-gRNA7)、质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.92μg质粒pKG-U6gRNA(OCA2-E3-gRNA7):1.08μg质粒pKG-GE3,其中pKG-U6gRNA(OCA2-E3-gRNA7)与pKG-GE3摩尔比为3:1。
第四组:将质粒pKG-U6gRNA(OCA2-E3-gRNA8)、质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.92μg质粒pKG-U6gRNA(OCA2-E3-gRNA8):1.08μg质粒pKG-GE3,其中pKG-U6gRNA(OCA2-E3-gRNA8)与pKG-GE3摩尔比为3:1。
第五组:将质粒pKG-U6gRNA(OCA2-E3-gRNA9)、质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.92μg质粒pKG-U6gRNA(OCA2-E3-gRNA9):1.08μg质粒pKG-GE3,其中pKG-U6gRNA(OCA2-E3-gRNA9)与pKG-GE3摩尔比为3:1。
第六组:将质粒pKG-U6gRNA(OCA2-E3-gRNA10)、质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.92μg质粒pKG-U6gRNA(OCA2-E3-gRNA10):1.08μg质粒pKG-GE3,其中pKG-U6gRNA(OCA2-E3-gRNA10)与pKG-GE3摩尔比为3:1。
第七组:猪原代成纤维细胞,同等电转参数不加质粒进行电转染操作。
4.3.2共转染操作方法
同实施例2中2.3.2。
4.4 OCA2基因不同靶点gRNA的编辑效率分析
4.4.1分别向步骤4.3.2中收集在1.5mL离心管中的7组细胞加入10μL KAPA2G裂解液裂解细胞提取细胞基因组DNA。
KAPA2G裂解液配制体系如下:
10X extract Buffer 1μL
Enzyme 0.2μL
ddH2O 8.8μL
75℃15min—95℃5min—4℃,反应结束后基因组DNA于-20℃保存;
4.4.2采用前述针对OCA2基因E3引物OCA2-E3g-F1/OCA2-E3g-R473进行检测突变,PCR目的产物长度为519bp;
4.4.3使用常规PCR反应扩增OCA2靶点基因;
4.4.4将PCR反应产物进行1%琼脂糖凝胶电泳,如图13,将目的产物及其附近产物切胶回收后送测序公司进行测序,然后将测序结果利用网页版Synthego ICE工具分析测序峰图得出OCA2-E3-gRNA5、OCA2-E3-gRNA6、OCA2-E3-gRNA7、OCA2-E3-gRNA8、OCA2-E3-gRNA9、OCA2-E3-gRNA10不同靶点的编辑效率依次为32%、14%、2%、0、5%、3%。结果表明,OCA2-E3-gRNA5编辑效率最高,优选为最优靶点。
实施例5构建OCA2基因敲除的从江香猪单细胞克隆
5.1猪原代成纤维细胞制备
同实施例2中2.2。
5.2使用构建好的OCA2-E3-gRNA5质粒、pKG-GE3质粒共转染猪原代成纤维细胞
同实施例2中2.3.2,但不使用0.25%(Gibco)的胰蛋白酶消化并收集细胞于1.5mL离心管中。
5.3 OCA2基因敲除单细胞克隆株的筛选
5.3.1将步骤5.2所得电转48h的群体细胞,使用胰蛋白酶进行消化,完全培养基中和,500g离心5min,去上清,将沉淀用200μL完全培养基重悬,并适当稀释,用口吸管挑取单克隆转移到100μL完全培养基的96孔板中;
5.3.2 37℃,5%CO2、5%O2的恒温培养箱中进行培养,每2~3天换一次细胞培养基,期间用显微镜观察每孔细胞生长情况,排除无细胞及非单细胞克隆的孔;
5.3.3待96孔板的孔中细胞长满孔底,使用胰蛋白酶消化并收集细胞,其中2/3细胞接种到含有完全培养基的6孔板中,剩余的1/3的细胞收集在1.5mL离心管中;
5.3.4待6孔板长至80%汇合度时使用0.25%(Gibco)的胰蛋白酶消化并收集细胞,使用细胞冻存液(90%完全培养基+10%DMSO,体积比)将细胞冻存。
5.4 OCA2基因敲除的细胞鉴定
5.4.1将步骤5.3收集在1.5mL离心管中得到的细胞,然后在细胞中加入10μLKAPA2G裂解液裂解细胞提取细胞基因组DNA。
KAPA2G裂解液配制体系如下:
10X extract Buffer 1μL
Enzyme 0.2μL
ddH2O 8.8μL
75℃15min—95℃5min—4℃,反应结束后基因组DNA于-20℃保存;
5.4.2采用前述针对OCA2基因E3引物OCA2-E3g-F1/OCA2-E3g-R473进行检测突变,PCR目的产物长度为519bp;
5.4.3使用PCR常规反应扩增OCA2靶点基因;
5.4.4将PCR反应产物进行电泳,电泳结果如图14,泳道编号与单细胞克隆编号一致。回收PCR扩增产物并测序。
5.4.5将测序结果与OCA2靶点信息进行比对,从而判断该重组细胞是否为OCA2基因敲除。
编号为5、7、8、14、16、17的单细胞克隆的基因型为双等位相同变异的纯合突变型。编号为6的单细胞克隆的基因型为双等位不同变异的纯合突变型。编号为2、3、13、15、19的单细胞克隆的基因型为杂合突变型。编号为1、4、9、10、11、12、18、20的单细胞克隆的基因型为纯合野生型。得到的OCA2基因编辑单细胞克隆的比率为60%。
示例性的测序比对结果如图15至图18,其中图15是克隆号为OCA2-1的正向测序与已公布序列的比对结果,判定为野生型;图16是克隆号为OCA2-5的正向测序与已公布序列的比对结果,判定为双等位相同变异的纯合突变型;图17是克隆号为OCA2-2的正向测序与已公布序列的比对结果,判定为杂合突变;图18是克隆号为OCA2-6的正向测序和反向测序同时与已公布序列的比对结果,判定为双等位不同变异的纯合突变型。
通过具体序列的分析,OCA2各单细胞克隆基因型如表1:
表1 OCA2基因敲除从江香猪成纤维细胞的单细胞克隆基因型鉴定
以上对本发明进行了详述。对于本领域技术人员来说,在不脱离本发明的宗旨和范围,以及无需进行不必要的实验情况下,可在等同参数、浓度和条件下,在较宽范围内实施本发明。虽然本发明给出了特殊的实施例,应该理解为,可以对本发明作进一步的改进。总之,按本发明的原理,本申请欲包括任何变更、用途或对本发明的改进,包括脱离了本申请中已公开范围,而用本领域已知的常规技术进行的改变。按以下附带的权利要求的范围,可以进行一些基本特征的应用。
序列表
<110> 南京启真基因工程有限公司
<120> 用于OCA2基因突变的CRISPR系统及其在构建白化病克隆猪核供体细胞中的应用
<160> 44
<170> SIPOSequenceListing 1.0
<210> 1
<211> 8484
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 1
gagggcctat ttcccatgat tccttcatat ttgcatatac gatacaaggc tgttagagag 60
ataattggaa ttaatttgac tgtaaacaca aagatattag tacaaaatac gtgacgtaga 120
aagtaataat ttcttgggta gtttgcagtt ttaaaattat gttttaaaat ggactatcat 180
atgcttaccg taacttgaaa gtatttcgat ttcttggctt tatatatctt gtggaaagga 240
cgaaacaccg ggtcttcgag aagacctgtt ttagagctag aaatagcaag ttaaaataag 300
gctagtccgt tatcaacttg aaaaagtggc accgagtcgg tgcttttttg ttttagagct 360
agaaatagca agttaaaata aggctagtcc gtttttagcg cgtgcgccaa ttctgcagac 420
aaatggctct agaggtaccc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 480
ccaacgaccc ccgcccattg acgtcaatag taacgccaat agggactttc cattgacgtc 540
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 600
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tgtgcccagt 660
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 720
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 780
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 840
ggggggggcg gggcgagggg cggggcgggg cgaggcggag aggtgcggcg gcagccaatc 900
agagcggcgc gctccgaaag tttcctttta tggcgaggcg gcggcggcgg cggccctata 960
aaaagcgaag cgcgcggcgg gcgggagtcg ctgcgcgctg ccttcgcccc gtgccccgct 1020
ccgccgccgc ctcgcgccgc ccgccccggc tctgactgac cgcgttactc ccacaggtga 1080
gcgggcggga cggcccttct cctccgggct gtaattagct gagcaagagg taagggttta 1140
agggatggtt ggttggtggg gtattaatgt ttaattacct ggagcacctg cctgaaatca 1200
ctttttttca ggttggaccg gtgccaccat ggactataag gaccacgacg gagactacaa 1260
ggatcatgat attgattaca aagacgatga cgataagatg gccccaaaga agaagcggaa 1320
ggtcggtatc cacggagtcc cagcagccga caagaagtac agcatcggcc tggacatcgg 1380
caccaactct gtgggctggg ccgtgatcac cgacgagtac aaggtgccca gcaagaaatt 1440
caaggtgctg ggcaacaccg accggcacag catcaagaag aacctgatcg gagccctgct 1500
gttcgacagc ggcgaaacag ccgaggccac ccggctgaag agaaccgcca gaagaagata 1560
caccagacgg aagaaccgga tctgctatct gcaagagatc ttcagcaacg agatggccaa 1620
ggtggacgac agcttcttcc acagactgga agagtccttc ctggtggaag aggataagaa 1680
gcacgagcgg caccccatct tcggcaacat cgtggacgag gtggcctacc acgagaagta 1740
ccccaccatc taccacctga gaaagaaact ggtggacagc accgacaagg ccgacctgcg 1800
gctgatctat ctggccctgg cccacatgat caagttccgg ggccacttcc tgatcgaggg 1860
cgacctgaac cccgacaaca gcgacgtgga caagctgttc atccagctgg tgcagaccta 1920
caaccagctg ttcgaggaaa accccatcaa cgccagcggc gtggacgcca aggccatcct 1980
gtctgccaga ctgagcaaga gcagacggct ggaaaatctg atcgcccagc tgcccggcga 2040
gaagaagaat ggcctgttcg gaaacctgat tgccctgagc ctgggcctga cccccaactt 2100
caagagcaac ttcgacctgg ccgaggatgc caaactgcag ctgagcaagg acacctacga 2160
cgacgacctg gacaacctgc tggcccagat cggcgaccag tacgccgacc tgtttctggc 2220
cgccaagaac ctgtccgacg ccatcctgct gagcgacatc ctgagagtga acaccgagat 2280
caccaaggcc cccctgagcg cctctatgat caagagatac gacgagcacc accaggacct 2340
gaccctgctg aaagctctcg tgcggcagca gctgcctgag aagtacaaag agattttctt 2400
cgaccagagc aagaacggct acgccggcta cattgacggc ggagccagcc aggaagagtt 2460
ctacaagttc atcaagccca tcctggaaaa gatggacggc accgaggaac tgctcgtgaa 2520
gctgaacaga gaggacctgc tgcggaagca gcggaccttc gacaacggca gcatccccca 2580
ccagatccac ctgggagagc tgcacgccat tctgcggcgg caggaagatt tttacccatt 2640
cctgaaggac aaccgggaaa agatcgagaa gatcctgacc ttccgcatcc cctactacgt 2700
gggccctctg gccaggggaa acagcagatt cgcctggatg accagaaaga gcgaggaaac 2760
catcaccccc tggaacttcg aggaagtggt ggacaagggc gcttccgccc agagcttcat 2820
cgagcggatg accaacttcg ataagaacct gcccaacgag aaggtgctgc ccaagcacag 2880
cctgctgtac gagtacttca ccgtgtataa cgagctgacc aaagtgaaat acgtgaccga 2940
gggaatgaga aagcccgcct tcctgagcgg cgagcagaaa aaggccatcg tggacctgct 3000
gttcaagacc aaccggaaag tgaccgtgaa gcagctgaaa gaggactact tcaagaaaat 3060
cgagtgcttc gactccgtgg aaatctccgg cgtggaagat cggttcaacg cctccctggg 3120
cacataccac gatctgctga aaattatcaa ggacaaggac ttcctggaca atgaggaaaa 3180
cgaggacatt ctggaagata tcgtgctgac cctgacactg tttgaggaca gagagatgat 3240
cgaggaacgg ctgaaaacct atgcccacct gttcgacgac aaagtgatga agcagctgaa 3300
gcggcggaga tacaccggct ggggcaggct gagccggaag ctgatcaacg gcatccggga 3360
caagcagtcc ggcaagacaa tcctggattt cctgaagtcc gacggcttcg ccaacagaaa 3420
cttcatgcag ctgatccacg acgacagcct gacctttaaa gaggacatcc agaaagccca 3480
ggtgtccggc cagggcgata gcctgcacga gcacattgcc aatctggccg gcagccccgc 3540
cattaagaag ggcatcctgc agacagtgaa ggtggtggac gagctcgtga aagtgatggg 3600
ccggcacaag cccgagaaca tcgtgatcga aatggccaga gagaaccaga ccacccagaa 3660
gggacagaag aacagccgcg agagaatgaa gcggatcgaa gagggcatca aagagctggg 3720
cagccagatc ctgaaagaac accccgtgga aaacacccag ctgcagaacg agaagctgta 3780
cctgtactac ctgcagaatg ggcgggatat gtacgtggac caggaactgg acatcaaccg 3840
gctgtccgac tacgatgtgg accatatcgt gcctcagagc tttctgaagg acgactccat 3900
cgacaacaag gtgctgacca gaagcgacaa gaaccggggc aagagcgaca acgtgccctc 3960
cgaagaggtc gtgaagaaga tgaagaacta ctggcggcag ctgctgaacg ccaagctgat 4020
tacccagaga aagttcgaca atctgaccaa ggccgagaga ggcggcctga gcgaactgga 4080
taaggccggc ttcatcaaga gacagctggt ggaaacccgg cagatcacaa agcacgtggc 4140
acagatcctg gactcccgga tgaacactaa gtacgacgag aatgacaagc tgatccggga 4200
agtgaaagtg atcaccctga agtccaagct ggtgtccgat ttccggaagg atttccagtt 4260
ttacaaagtg cgcgagatca acaactacca ccacgcccac gacgcctacc tgaacgccgt 4320
cgtgggaacc gccctgatca aaaagtaccc taagctggaa agcgagttcg tgtacggcga 4380
ctacaaggtg tacgacgtgc ggaagatgat cgccaagagc gagcaggaaa tcggcaaggc 4440
taccgccaag tacttcttct acagcaacat catgaacttt ttcaagaccg agattaccct 4500
ggccaacggc gagatccgga agcggcctct gatcgagaca aacggcgaaa ccggggagat 4560
cgtgtgggat aagggccggg attttgccac cgtgcggaaa gtgctgagca tgccccaagt 4620
gaatatcgtg aaaaagaccg aggtgcagac aggcggcttc agcaaagagt ctatcctgcc 4680
caagaggaac agcgataagc tgatcgccag aaagaaggac tgggacccta agaagtacgg 4740
cggcttcgac agccccaccg tggcctattc tgtgctggtg gtggccaaag tggaaaaggg 4800
caagtccaag aaactgaaga gtgtgaaaga gctgctgggg atcaccatca tggaaagaag 4860
cagcttcgag aagaatccca tcgactttct ggaagccaag ggctacaaag aagtgaaaaa 4920
ggacctgatc atcaagctgc ctaagtactc cctgttcgag ctggaaaacg gccggaagag 4980
aatgctggcc tctgccggcg aactgcagaa gggaaacgaa ctggccctgc cctccaaata 5040
tgtgaacttc ctgtacctgg ccagccacta tgagaagctg aagggctccc ccgaggataa 5100
tgagcagaaa cagctgtttg tggaacagca caagcactac ctggacgaga tcatcgagca 5160
gatcagcgag ttctccaaga gagtgatcct ggccgacgct aatctggaca aagtgctgtc 5220
cgcctacaac aagcaccggg ataagcccat cagagagcag gccgagaata tcatccacct 5280
gtttaccctg accaatctgg gagcccctgc cgccttcaag tactttgaca ccaccatcga 5340
ccggaagagg tacaccagca ccaaagaggt gctggacgcc accctgatcc accagagcat 5400
caccggcctg tacgagacac ggatcgacct gtctcagctg ggaggcgaca aaaggccggc 5460
ggccacgaaa aaggccggcc aggcaaaaaa gaaaaagtaa gaattcctag agctcgctga 5520
tcagcctcga ctgtgccttc tagttgccag ccatctgttg tttgcccctc ccccgtgcct 5580
tccttgaccc tggaaggtgc cactcccact gtcctttcct aataaaatga ggaaattgca 5640
tcgcattgtc tgagtaggtg tcattctatt ctggggggtg gggtggggca ggacagcaag 5700
ggggaggatt gggaagagaa tagcaggcat gctggggagc ggccgcagga acccctagtg 5760
atggagttgg ccactccctc tctgcgcgct cgctcgctca ctgaggccgg gcgaccaaag 5820
gtcgcccgac gcccgggctt tgcccgggcg gcctcagtga gcgagcgagc gcgcagctgc 5880
ctgcaggggc gcctgatgcg gtattttctc cttacgcatc tgtgcggtat ttcacaccgc 5940
atacgtcaaa gcaaccatag tacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg 6000
tggttacgcg cagcgtgacc gctacacttg ccagcgcctt agcgcccgct cctttcgctt 6060
tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc 6120
tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgatttgg 6180
gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg 6240
agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc aactctatct 6300
cgggctattc ttttgattta taagggattt tgccgatttc ggtctattgg ttaaaaaatg 6360
agctgattta acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaattttat 6420
ggtgcactct cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc 6480
caacacccgc tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag 6540
ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg 6600
cgagacgaaa gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg 6660
tttcttagac gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat 6720
ttttctaaat acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc 6780
aataatattg aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct 6840
tttttgcggc attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag 6900
atgctgaaga tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta 6960
agatccttga gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc 7020
tgctatgtgg cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca 7080
tacactattc tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg 7140
atggcatgac agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg 7200
ccaacttact tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca 7260
tgggggatca tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa 7320
acgacgagcg tgacaccacg atgcctgtag caatggcaac aacgttgcgc aaactattaa 7380
ctggcgaact acttactcta gcttcccggc aacaattaat agactggatg gaggcggata 7440
aagttgcagg accacttctg cgctcggccc ttccggctgg ctggtttatt gctgataaat 7500
ctggagccgg tgagcgtgga agccgcggta tcattgcagc actggggcca gatggtaagc 7560
cctcccgtat cgtagttatc tacacgacgg ggagtcaggc aactatggat gaacgaaata 7620
gacagatcgc tgagataggt gcctcactga ttaagcattg gtaactgtca gaccaagttt 7680
actcatatat actttagatt gatttaaaac ttcattttta atttaaaagg atctaggtga 7740
agatcctttt tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag 7800
cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa 7860
tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag 7920
agctaccaac tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg 7980
ttcttctagt gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat 8040
acctcgctct gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta 8100
ccgggttgga ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg 8160
gttcgtgcac acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc 8220
gtgagctatg agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa 8280
gcggcagggt cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc 8340
tttatagtcc tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt 8400
caggggggcg gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct 8460
tttgctggcc ttttgctcac atgt 8484
<210> 2
<211> 10476
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 2
gagggcctat ttcccatgat tccttcatat ttgcatatac gatacaaggc tgttagagag 60
ataattggaa ttaatttgac tgtaaacaca aagatattag tacaaaatac gtgacgtaga 120
aagtaataat ttcttgggta gtttgcagtt ttaaaattat gttttaaaat ggactatcat 180
atgcttaccg taacttgaaa gtatttcgat ttcttggctt tatatatctt gtggaaagga 240
cgaaacaccg ggtcttcgag aagacctgtt ttagagctag aaatagcaag ttaaaataag 300
gctagtccgt tatcaacttg aaaaagtggc accgagtcgg tgcttttttc tagcgcgtgc 360
gccaattctg cagacaaatg gctctagagg tacccgttac ataacttacg gtaaatggcc 420
cgcctggctg accgcccaac gacccccgcc cattgacgtc aatagtaacg ccaataggga 480
ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 540
aagtgtatca tatgccaagt acgcccccta ttgacgtcaa tgacggtaaa tggcccgcct 600
ggcattgtgc ccagtacatg accttatggg actttcctac ttggcagtac atctacgtat 660
tagtcatcgc tattaccatg ggggcagagc gcacatcgcc cacagtcccc gagaagttgg 720
ggggaggggt cggcaattga tccggtgcct agagaaggtg gcgcggggta aactgggaaa 780
gtgatgtcgt gtactggctc cgcctttttc ccgagggtgg gggagaaccg tatataagtg 840
cagtagtcgc cgtgaacgtt ctttttcgca acgggtttgc cgccagaaca caggttggac 900
cggtgccacc atggactata aggaccacga cggagactac aaggatcatg atattgatta 960
caaagacgat gacgataaga tggcccccaa aaagaaacga aaggtgggtg ggtccccaaa 1020
gaagaagcgg aaggtcggta tccacggagt cccagcagcc gacaagaagt acagcatcgg 1080
cctggacatc ggcaccaact ctgtgggctg ggccgtgatc accgacgagt acaaggtgcc 1140
cagcaagaaa ttcaaggtgc tgggcaacac cgaccggcac agcatcaaga agaacctgat 1200
cggagccctg ctgttcgaca gcggcgaaac agccgaggcc acccggctga agagaaccgc 1260
cagaagaaga tacaccagac ggaagaaccg gatctgctat ctgcaagaga tcttcagcaa 1320
cgagatggcc aaggtggacg acagcttctt ccacagactg gaagagtcct tcctggtgga 1380
agaggataag aagcacgagc ggcaccccat cttcggcaac atcgtggacg aggtggccta 1440
ccacgagaag taccccacca tctaccacct gagaaagaaa ctggtggaca gcaccgacaa 1500
ggccgacctg cggctgatct atctggccct ggcccacatg atcaagttcc ggggccactt 1560
cctgatcgag ggcgacctga accccgacaa cagcgacgtg gacaagctgt tcatccagct 1620
ggtgcagacc tacaaccagc tgttcgagga aaaccccatc aacgccagcg gcgtggacgc 1680
caaggccatc ctgtctgcca gactgagcaa gagcagacgg ctggaaaatc tgatcgccca 1740
gctgcccggc gagaagaaga atggcctgtt cggaaacctg attgccctga gcctgggcct 1800
gacccccaac ttcaagagca acttcgacct ggccgaggat gccaaactgc agctgagcaa 1860
ggacacctac gacgacgacc tggacaacct gctggcccag atcggcgacc agtacgccga 1920
cctgtttctg gccgccaaga acctgtccga cgccatcctg ctgagcgaca tcctgagagt 1980
gaacaccgag atcaccaagg cccccctgag cgcctctatg atcaagagat acgacgagca 2040
ccaccaggac ctgaccctgc tgaaagctct cgtgcggcag cagctgcctg agaagtacaa 2100
agagattttc ttcgaccaga gcaagaacgg ctacgccggc tacattgacg gcggagccag 2160
ccaggaagag ttctacaagt tcatcaagcc catcctggaa aagatggacg gcaccgagga 2220
actgctcgtg aagctgaaca gagaggacct gctgcggaag cagcggacct tcgacaacgg 2280
cagcatcccc caccagatcc acctgggaga gctgcacgcc attctgcggc ggcaggaaga 2340
tttttaccca ttcctgaagg acaaccggga aaagatcgag aagatcctga ccttccgcat 2400
cccctactac gtgggccctc tggccagggg aaacagcaga ttcgcctgga tgaccagaaa 2460
gagcgaggaa accatcaccc cctggaactt cgaggaagtg gtggacaagg gcgcttccgc 2520
ccagagcttc atcgagcgga tgaccaactt cgataagaac ctgcccaacg agaaggtgct 2580
gcccaagcac agcctgctgt acgagtactt caccgtgtat aacgagctga ccaaagtgaa 2640
atacgtgacc gagggaatga gaaagcccgc cttcctgagc ggcgagcaga aaaaggccat 2700
cgtggacctg ctgttcaaga ccaaccggaa agtgaccgtg aagcagctga aagaggacta 2760
cttcaagaaa atcgagtgct tcgactccgt ggaaatctcc ggcgtggaag atcggttcaa 2820
cgcctccctg ggcacatacc acgatctgct gaaaattatc aaggacaagg acttcctgga 2880
caatgaggaa aacgaggaca ttctggaaga tatcgtgctg accctgacac tgtttgagga 2940
cagagagatg atcgaggaac ggctgaaaac ctatgcccac ctgttcgacg acaaagtgat 3000
gaagcagctg aagcggcgga gatacaccgg ctggggcagg ctgagccgga agctgatcaa 3060
cggcatccgg gacaagcagt ccggcaagac aatcctggat ttcctgaagt ccgacggctt 3120
cgccaacaga aacttcatgc agctgatcca cgacgacagc ctgaccttta aagaggacat 3180
ccagaaagcc caggtgtccg gccagggcga tagcctgcac gagcacattg ccaatctggc 3240
cggcagcccc gccattaaga agggcatcct gcagacagtg aaggtggtgg acgagctcgt 3300
gaaagtgatg ggccggcaca agcccgagaa catcgtgatc gaaatggcca gagagaacca 3360
gaccacccag aagggacaga agaacagccg cgagagaatg aagcggatcg aagagggcat 3420
caaagagctg ggcagccaga tcctgaaaga acaccccgtg gaaaacaccc agctgcagaa 3480
cgagaagctg tacctgtact acctgcagaa tgggcgggat atgtacgtgg accaggaact 3540
ggacatcaac cggctgtccg actacgatgt ggaccatatc gtgcctcaga gctttctgaa 3600
ggacgactcc atcgacaaca aggtgctgac cagaagcgac aagaaccggg gcaagagcga 3660
caacgtgccc tccgaagagg tcgtgaagaa gatgaagaac tactggcggc agctgctgaa 3720
cgccaagctg attacccaga gaaagttcga caatctgacc aaggccgaga gaggcggcct 3780
gagcgaactg gataaggccg gcttcatcaa gagacagctg gtggaaaccc ggcagatcac 3840
aaagcacgtg gcacagatcc tggactcccg gatgaacact aagtacgacg agaatgacaa 3900
gctgatccgg gaagtgaaag tgatcaccct gaagtccaag ctggtgtccg atttccggaa 3960
ggatttccag ttttacaaag tgcgcgagat caacaactac caccacgccc acgacgccta 4020
cctgaacgcc gtcgtgggaa ccgccctgat caaaaagtac cctaagctgg aaagcgagtt 4080
cgtgtacggc gactacaagg tgtacgacgt gcggaagatg atcgccaaga gcgagcagga 4140
aatcggcaag gctaccgcca agtacttctt ctacagcaac atcatgaact ttttcaagac 4200
cgagattacc ctggccaacg gcgagatccg gaagcggcct ctgatcgaga caaacggcga 4260
aaccggggag atcgtgtggg ataagggccg ggattttgcc accgtgcgga aagtgctgag 4320
catgccccaa gtgaatatcg tgaaaaagac cgaggtgcag acaggcggct tcagcaaaga 4380
gtctatcctg cccaagagga acagcgataa gctgatcgcc agaaagaagg actgggaccc 4440
taagaagtac ggcggcttcg acagccccac cgtggcctat tctgtgctgg tggtggccaa 4500
agtggaaaag ggcaagtcca agaaactgaa gagtgtgaaa gagctgctgg ggatcaccat 4560
catggaaaga agcagcttcg agaagaatcc catcgacttt ctggaagcca agggctacaa 4620
agaagtgaaa aaggacctga tcatcaagct gcctaagtac tccctgttcg agctggaaaa 4680
cggccggaag agaatgctgg cctctgccgg cgaactgcag aagggaaacg aactggccct 4740
gccctccaaa tatgtgaact tcctgtacct ggccagccac tatgagaagc tgaagggctc 4800
ccccgaggat aatgagcaga aacagctgtt tgtggaacag cacaagcact acctggacga 4860
gatcatcgag cagatcagcg agttctccaa gagagtgatc ctggccgacg ctaatctgga 4920
caaagtgctg tccgcctaca acaagcaccg ggataagccc atcagagagc aggccgagaa 4980
tatcatccac ctgtttaccc tgaccaatct gggagcccct gccgccttca agtactttga 5040
caccaccatc gaccggaaga ggtacaccag caccaaagag gtgctggacg ccaccctgat 5100
ccaccagagc atcaccggcc tgtacgagac acggatcgac ctgtctcagc tgggaggcga 5160
caaaaggccg gcggccacga aaaaggccgg ccaggcaaaa aagaaaaagg gcggctccaa 5220
gcggcctgcc gcgacgaaga aagcgggaca ggccaagaaa aagaaaggat ccggcgcaac 5280
aaacttctct ctgctgaaac aagccggaga tgtcgaagag aatcctggac cggtgagcaa 5340
gggcgaggag ctgttcaccg gggtggtgcc catcctggtc gagctggacg gcgacgtaaa 5400
cggccacaag ttcagcgtgt ccggcgaggg cgagggcgat gccacctacg gcaagctgac 5460
cctgaagttc atctgcacca ccggcaagct gcccgtgccc tggcccaccc tcgtgaccac 5520
cctgacctac ggcgtgcagt gcttcagccg ctaccccgac cacatgaagc agcacgactt 5580
cttcaagtcc gccatgcccg aaggctacgt ccaggagcgc accatcttct tcaaggacga 5640
cggcaactac aagacccgcg ccgaggtgaa gttcgagggc gacaccctgg tgaaccgcat 5700
cgagctgaag ggcatcgact tcaaggagga cggcaacatc ctggggcaca agctggagta 5760
caactacaac agccacaacg tctatatcat ggccgacaag cagaagaacg gcatcaaggt 5820
gaacttcaag atccgccaca acatcgagga cggcagcgtg cagctcgccg accactacca 5880
gcagaacacc cccatcggcg acggccccgt gctgctgccc gacaaccact acctgagcac 5940
ccagtccgcc ctgagcaaag accccaacga gaagcgcgat cacatggtcc tgctggagtt 6000
cgtgaccgcc gccgggatca ctctcggcat ggacgagctg tacaagggct ccggcgaggg 6060
caggggaagt cttctaacat gcggggacgt ggaggaaaat cccggcccaa ccgagtacaa 6120
gcccacggtg cgcctcgcca cccgcgacga cgtccccagg gccgtacgca ccctcgccgc 6180
cgcgttcgcc gactaccccg ccacgcgcca caccgtcgat ccggaccgcc acatcgagcg 6240
ggtcaccgag ctgcaagaac tcttcctcac gcgcgtcggg ctcgacatcg gcaaggtgtg 6300
ggtcgcggac gacggcgccg cggtggcggt ctggaccacg ccggagagcg tcgaagcggg 6360
ggcggtgttc gccgagatcg gcccgcgcat ggccgagttg agcggttccc ggctggccgc 6420
gcagcaacag atggaaggcc tcctggcgcc gcaccggccc aaggagcccg cgtggttcct 6480
ggccaccgtc ggagtctcgc ccgaccacca gggcaagggt ctgggcagcg ccgtcgtgct 6540
ccccggagtg gaggcggccg agcgcgccgg ggtgcccgcc ttcctggaga cctccgcgcc 6600
ccgcaacctc cccttctacg agcggctcgg cttcaccgtc accgccgacg tcgaggtgcc 6660
cgaaggaccg cgcacctggt gcatgacccg caagcccggt gcctgaacgc gttaagtcga 6720
caatcaacct ctggattaca aaatttgtga aagattgact ggtattctta actatgttgc 6780
tccttttacg ctatgtggat acgctgcttt aatgcctttg tatcatgcta ttgcttcccg 6840
tatggctttc attttctcct ccttgtataa atcctggttg ctgtctcttt atgaggagtt 6900
gtggcccgtt gtcaggcaac gtggcgtggt gtgcactgtg tttgctgacg caacccccac 6960
tggttggggc attgccacca cctgtcagct cctttccggg actttcgctt tccccctccc 7020
tattgccacg gcggaactca tcgccgcctg ccttgcccgc tgctggacag gggctcggct 7080
gttgggcact gacaattccg tggtgttgtc ggggaaatca tcgtcctttc cttggctgct 7140
cgcctgtgtt gccacctgga ttctgcgcgg gacgtccttc tgctacgtcc cttcggccct 7200
caatccagcg gaccttcctt cccgcggcct gctgccggct ctgcggcctc ttccgcgtct 7260
tcgccttcgc cctcagacga gtcggatctc cctttgggcc gcctccccgc gtcgacttta 7320
agaccaatga cttacaaggc agctgtagat cttagccact ttttaaaaga aaagggggga 7380
ctggaagggc taattcactc ccaacgaaga caagatctgc tttttgcttg tactgggtct 7440
ctctggttag accagatctg agcctgggag ctctctggct aactagggaa cccactgctt 7500
aagcctcaat aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac 7560
tctggtaact agagatccct cagacccttt tagtcagtgt ggaaaatctc tagcagggcc 7620
cgtttaaacc cgctgatcag cctcgactgt gccttctagt tgccagccat ctgttgtttg 7680
cccctccccc gtgccttcct tgaccctgga aggtgccact cccactgtcc tttcctaata 7740
aaatgaggaa attgcatcgc attgtctgag taggtgtcat tctattctgg ggggtggggt 7800
ggggcaggac agcaaggggg aggattggga agacaatagc aggcatgctg gggatgcggt 7860
gggctctatg gcctgcaggg gcgcctgatg cggtattttc tccttacgca tctgtgcggt 7920
atttcacacc gcatacgtca aagcaaccat agtacgcgcc ctgtagcggc gcattaagcg 7980
cggcgggtgt ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ttagcgcccg 8040
ctcctttcgc tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc 8100
taaatcgggg gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa 8160
aacttgattt gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc 8220
ctttgacgtt ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac 8280
tcaactctat ctcgggctat tcttttgatt tataagggat tttgccgatt tcggtctatt 8340
ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgt 8400
ttacaatttt atggtgcact ctcagtacaa tctgctctga tgccgcatag ttaagccagc 8460
cccgacaccc gccaacaccc gctgacgcgc cctgacgggc ttgtctgctc ccggcatccg 8520
cttacagaca agctgtgacc gtctccggga gctgcatgtg tcagaggttt tcaccgtcat 8580
caccgaaacg cgcgagacga aagggcctcg tgatacgcct atttttatag gttaatgtca 8640
tgataataat ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg cgcggaaccc 8700
ctatttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct 8760
gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat ttccgtgtcg 8820
cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg 8880
tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc gaactggatc 8940
tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca atgatgagca 9000
cttttaaagt tctgctatgt ggcgcggtat tatcccgtat tgacgccggg caagagcaac 9060
tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca gtcacagaaa 9120
agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata accatgagtg 9180
ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag ctaaccgctt 9240
ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg gagctgaatg 9300
aagccatacc aaacgacgag cgtgacacca cgatgcctgt agcaatggca acaacgttgc 9360
gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta atagactgga 9420
tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct ggctggttta 9480
ttgctgataa atctggagcc ggtgagcgtg gaagccgcgg tatcattgca gcactggggc 9540
cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag gcaactatgg 9600
atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat tggtaactgt 9660
cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt taatttaaaa 9720
ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt 9780
cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt 9840
ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt 9900
tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga 9960
taccaaatac tgttcttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag 10020
caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata 10080
agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg 10140
gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga 10200
gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca 10260
ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa 10320
acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt 10380
tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac 10440
ggttcctggc cttttgctgg ccttttgctc acatgt 10476
<210> 3
<211> 3120
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 3
gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt 60
cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt 120
tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat 180
aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt 240
ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg 300
ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga 360
tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc 420
tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac 480
actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg 540
gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca 600
acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg 660
gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg 720
acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg 780
gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag 840
ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg 900
gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct 960
cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac 1020
agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact 1080
catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga 1140
tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 1200
cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct 1260
gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc 1320
taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgttc 1380
ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 1440
tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 1500
ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 1560
cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 1620
agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 1680
gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 1740
atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 1800
gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1860
gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1920
ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1980
cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc 2040
cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca 2100
acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc 2160
cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg 2220
accatgatta cgccaagctt gcatgcaggc ctctgcagtc gacgggcccg ggatccgatg 2280
ataaacatgt gagggcctat ttcccatgat tccttcatat ttgcatatac gatacaaggc 2340
tgttagagag ataattggaa ttaatttgac tgtaaacaca aagatattag tacaaaatac 2400
gtgacgtaga aagtaataat ttcttgggta gtttgcagtt ttaaaattat gttttaaaat 2460
ggactatcat atgcttaccg taacttgaaa gtatttcgat ttcttggctt tatatatctt 2520
gtggaaagga cgaaacaccg ggtcttcgag aagacctgtt ttagagctag aaatagcaag 2580
ttaaaataag gctagtccgt tatcaacttg aaaaagtggc accgagtcgg tgcttttttc 2640
tagcgcgtgc gccaattctg cagacaaatg gctctagagg tacccataga tctagatgca 2700
ttcgcgaggt accgagctcg aattcactgg ccgtcgtttt acaacgtcgt gactgggaaa 2760
accctggcgt tacccaactt aatcgccttg cagcacatcc ccctttcgcc agctggcgta 2820
atagcgaaga ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg aatggcgaat 2880
ggcgcctgat gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcatatggt 2940
gcactctcag tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa 3000
cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg 3060
tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga 3120
<210> 4
<211> 175
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 4
tgtggaaagg acgaaacacc gggtcttcga gaagacctgt tttagagcta gaaatagcaa 60
gttaaaataa ggctagtccg ttatcaactt gaaaaagtgg caccgagtcg gtgctttttt 120
ctagcgcgtg cgccaattct gcagacaaat ggctctagag gtacccgtta cataa 175
<210> 5
<211> 554
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 5
tctgcagaca aatggctcta gaggtacccg ttacataact tacggtaaat ggcccgcctg 60
gctgaccgcc caacgacccc cgcccattga cgtcaatagt aacgccaata gggactttcc 120
attgacgtca atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt 180
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 240
gtgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 300
tcgctattac catgggggca gagcgcacat cgcccacagt ccccgagaag ttggggggag 360
gggtcggcaa ttgatccggt gcctagagaa ggtggcgcgg ggtaaactgg gaaagtgatg 420
tcgtgtactg gctccgcctt tttcccgagg gtgggggaga accgtatata agtgcagtag 480
tcgccgtgaa cgttcttttt cgcaacgggt ttgccgccag aacacaggtt ggaccggtgc 540
caccatggac tata 554
<210> 6
<211> 447
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 6
ccagaacaca ggttggaccg gtgccaccat ggactataag gaccacgacg gagactacaa 60
ggatcatgat attgattaca aagacgatga cgataagatg gcccccaaaa agaaacgaaa 120
ggtgggtggg tccccaaaga agaagcggaa ggtcggtatc cacggagtcc cagcagccga 180
caagaagtac agcatcggcc tggacatcgg caccaactct gtgggctggg ccgtgatcac 240
cgacgagtac aaggtgccca gcaagaaatt caaggtgctg ggcaacaccg accggcacag 300
catcaagaag aacctgatcg gagccctgct gttcgacagc ggcgaaacag ccgaggccac 360
ccggctgaag agaaccgcca gaagaagata caccagacgg aagaaccgga tctgctatct 420
gcaagagatc ttcagcaacg agatggc 447
<210> 7
<211> 2727
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 7
cggcggccac gaaaaaggcc ggccaggcaa aaaagaaaaa gggcggctcc aagcggcctg 60
ccgcgacgaa gaaagcggga caggccaaga aaaagaaagg atccggcgca acaaacttct 120
ctctgctgaa acaagccgga gatgtcgaag agaatcctgg accggtgagc aagggcgagg 180
agctgttcac cggggtggtg cccatcctgg tcgagctgga cggcgacgta aacggccaca 240
agttcagcgt gtccggcgag ggcgagggcg atgccaccta cggcaagctg accctgaagt 300
tcatctgcac caccggcaag ctgcccgtgc cctggcccac cctcgtgacc accctgacct 360
acggcgtgca gtgcttcagc cgctaccccg accacatgaa gcagcacgac ttcttcaagt 420
ccgccatgcc cgaaggctac gtccaggagc gcaccatctt cttcaaggac gacggcaact 480
acaagacccg cgccgaggtg aagttcgagg gcgacaccct ggtgaaccgc atcgagctga 540
agggcatcga cttcaaggag gacggcaaca tcctggggca caagctggag tacaactaca 600
acagccacaa cgtctatatc atggccgaca agcagaagaa cggcatcaag gtgaacttca 660
agatccgcca caacatcgag gacggcagcg tgcagctcgc cgaccactac cagcagaaca 720
cccccatcgg cgacggcccc gtgctgctgc ccgacaacca ctacctgagc acccagtccg 780
ccctgagcaa agaccccaac gagaagcgcg atcacatggt cctgctggag ttcgtgaccg 840
ccgccgggat cactctcggc atggacgagc tgtacaaggg ctccggcgag ggcaggggaa 900
gtcttctaac atgcggggac gtggaggaaa atcccggccc aaccgagtac aagcccacgg 960
tgcgcctcgc cacccgcgac gacgtcccca gggccgtacg caccctcgcc gccgcgttcg 1020
ccgactaccc cgccacgcgc cacaccgtcg atccggaccg ccacatcgag cgggtcaccg 1080
agctgcaaga actcttcctc acgcgcgtcg ggctcgacat cggcaaggtg tgggtcgcgg 1140
acgacggcgc cgcggtggcg gtctggacca cgccggagag cgtcgaagcg ggggcggtgt 1200
tcgccgagat cggcccgcgc atggccgagt tgagcggttc ccggctggcc gcgcagcaac 1260
agatggaagg cctcctggcg ccgcaccggc ccaaggagcc cgcgtggttc ctggccaccg 1320
tcggagtctc gcccgaccac cagggcaagg gtctgggcag cgccgtcgtg ctccccggag 1380
tggaggcggc cgagcgcgcc ggggtgcccg ccttcctgga gacctccgcg ccccgcaacc 1440
tccccttcta cgagcggctc ggcttcaccg tcaccgccga cgtcgaggtg cccgaaggac 1500
cgcgcacctg gtgcatgacc cgcaagcccg gtgcctgaac gcgttaagtc gacaatcaac 1560
ctctggatta caaaatttgt gaaagattga ctggtattct taactatgtt gctcctttta 1620
cgctatgtgg atacgctgct ttaatgcctt tgtatcatgc tattgcttcc cgtatggctt 1680
tcattttctc ctccttgtat aaatcctggt tgctgtctct ttatgaggag ttgtggcccg 1740
ttgtcaggca acgtggcgtg gtgtgcactg tgtttgctga cgcaaccccc actggttggg 1800
gcattgccac cacctgtcag ctcctttccg ggactttcgc tttccccctc cctattgcca 1860
cggcggaact catcgccgcc tgccttgccc gctgctggac aggggctcgg ctgttgggca 1920
ctgacaattc cgtggtgttg tcggggaaat catcgtcctt tccttggctg ctcgcctgtg 1980
ttgccacctg gattctgcgc gggacgtcct tctgctacgt cccttcggcc ctcaatccag 2040
cggaccttcc ttcccgcggc ctgctgccgg ctctgcggcc tcttccgcgt cttcgccttc 2100
gccctcagac gagtcggatc tccctttggg ccgcctcccc gcgtcgactt taagaccaat 2160
gacttacaag gcagctgtag atcttagcca ctttttaaaa gaaaaggggg gactggaagg 2220
gctaattcac tcccaacgaa gacaagatct gctttttgct tgtactgggt ctctctggtt 2280
agaccagatc tgagcctggg agctctctgg ctaactaggg aacccactgc ttaagcctca 2340
ataaagcttg ccttgagtgc ttcaagtagt gtgtgcccgt ctgttgtgtg actctggtaa 2400
ctagagatcc ctcagaccct tttagtcagt gtggaaaatc tctagcaggg cccgtttaaa 2460
cccgctgatc agcctcgact gtgccttcta gttgccagcc atctgttgtt tgcccctccc 2520
ccgtgccttc cttgaccctg gaaggtgcca ctcccactgt cctttcctaa taaaatgagg 2580
aaattgcatc gcattgtctg agtaggtgtc attctattct ggggggtggg gtggggcagg 2640
acagcaaggg ggaggattgg gaagacaata gcaggcatgc tggggatgcg gtgggctcta 2700
tggcctgcag gggcgcctga tgcggta 2727
<210> 8
<211> 410
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 8
gataaacatg tgagggccta tttcccatga ttccttcata tttgcatata cgatacaagg 60
ctgttagaga gataattgga attaatttga ctgtaaacac aaagatatta gtacaaaata 120
cgtgacgtag aaagtaataa tttcttgggt agtttgcagt tttaaaatta tgttttaaaa 180
tggactatca tatgcttacc gtaacttgaa agtatttcga tttcttggct ttatatatct 240
tgtggaaagg acgaaacacc gggtcttcga gaagacctgt tttagagcta gaaatagcaa 300
gttaaaataa ggctagtccg ttatcaactt gaaaaagtgg caccgagtcg gtgctttttt 360
ctagcgcgtg cgccaattct gcagacaaat ggctctagag gtacccatag 410
<210> 9
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 9
agttatggca gaactcagtg 20
<210> 10
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 10
caccgagtta tggcagaact cagtg 25
<210> 11
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 11
aaaccactga gttctgccat aactc 25
<210> 12
<211> 23
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 12
ccccatccaa agtttttaaa gga 23
<210> 13
<211> 23
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 13
tgtggcagat gtcacagttt agg 23
<210> 14
<211> 100
<212> RNA
<213> 人工序列(Artificial Sequence)
<400> 14
aguuauggca gaacucagug guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 15
<211> 845
<212> PRT
<213> 猪(Sus scrofa)
<400> 15
Met Arg Leu Glu Asn Arg Glu Gly Arg Pro Thr Ser Gly Val Leu Glu
1 5 10 15
Met Glu Leu Pro Gln Ala Ser Ala Pro Ser Arg Ala Gly Leu Gly Ser
20 25 30
Leu Gly Leu Val Gly Leu Asp Ser Ser Asn His Arg Pro Gln Gln Gly
35 40 45
Gly Ser Lys Ala Gly Ser Arg Gly Pro Tyr Leu Ser Gly Ala Ala Gly
50 55 60
Gln Ser Cys Trp Val Pro Met Asp Gln Asp Phe Gly Pro Phe Leu Thr
65 70 75 80
Glu Arg Arg Ser His Cys Pro Phe Pro Lys His Phe Ser Ser Arg Ser
85 90 95
Lys Asp Pro Cys Phe Thr Glu Asn Thr Pro Leu Leu Gly Ser Phe Ser
100 105 110
Gln Glu Glu Gly Ser Arg Cys Met Pro Val Tyr His Pro Glu Phe Ile
115 120 125
Thr Ala Asp Glu Pro Trp Glu Asn Ser Ser Ala Glu Trp Glu Gly Gly
130 135 140
Ala Leu Leu Ser Thr Glu Leu Ala Val Ser Ser Gly Ser Ala Ser Thr
145 150 155 160
Glu Lys Gly Glu Leu Leu Asp Ser Ala His Ile Arg Cys His Leu Ser
165 170 175
Lys Leu Arg Cys Cys Val Gln Trp Leu Lys Val Ser Gly Leu Phe Val
180 185 190
Phe Val Val Leu Cys Ser Ile Leu Phe Ser Leu Tyr Pro Asp Gln Gly
195 200 205
Lys Phe Trp Gln Leu Leu Ala Val Ser Pro Leu Glu Ser Tyr Ser Val
210 215 220
Asn Leu Ser Ser His Ala Asp Ser Met Leu Leu Gln Val Asp Leu Ala
225 230 235 240
Gly Ala Leu Val Ala Ser Gly Pro Ser His Leu Gly Lys Glu Glu His
245 250 255
Val Ala Val Glu Val Thr Gln Ala Asn Ala Pro Gly Ser Arg Arg Arg
260 265 270
Arg Pro Gln Gln Val Thr His Asn Trp Thr Ile Phe Leu Asn Pro Ser
275 280 285
Gly Gly Glu His Thr Val Met Ser Arg Thr Phe Glu Val Leu Ser Arg
290 295 300
Glu Pro Val Ser Ile Asn Ile Arg Ala Ser Leu Gln Gln Thr Gln Ile
305 310 315 320
Val Pro Leu Leu Met Ala His Gln Tyr Leu Arg Ala Ser Ile Glu Ala
325 330 335
Gln Val Thr Ile Ala Ala Val Ile Leu Ala Gly Val Tyr Val Leu Ile
340 345 350
Ile Phe Glu Ile Val His Arg Thr Leu Ala Ala Met Leu Gly Ser Leu
355 360 365
Ala Ala Leu Ala Ala Leu Ala Val Ile Gly Asp Arg Pro Thr Leu Thr
370 375 380
Gln Val Val Glu Trp Ile Asp Phe Glu Thr Leu Ala Leu Leu Phe Gly
385 390 395 400
Met Met Ile Leu Val Ala Ile Phe Ser Glu Thr Gly Phe Phe Asp Tyr
405 410 415
Cys Ala Val Lys Ala Tyr Gln Leu Ser Arg Gly Arg Val Trp Ala Met
420 425 430
Ile Ile Met Leu Cys Leu Ile Ala Ala Val Leu Ser Ala Phe Leu Asp
435 440 445
Asn Val Thr Thr Ala Leu Leu Phe Thr Pro Val Thr Ile Arg Leu Cys
450 455 460
Glu Val Leu Asn Leu Asp Pro Arg Gln Val Leu Ile Ala Glu Val Ile
465 470 475 480
Phe Thr Asn Ile Gly Gly Ala Ala Thr Ala Ile Gly Asp Pro Pro Asn
485 490 495
Val Ile Ile Val Ser Asn Gln Glu Leu Arg Lys Met Gly Leu Asp Phe
500 505 510
Ala Gly Phe Thr Ala His Met Phe Ala Gly Ile Cys Phe Val Leu Leu
515 520 525
Phe Ser Phe Pro Leu Leu Arg Leu Leu Tyr Trp Asn Arg Lys Leu Tyr
530 535 540
Asn Lys Glu Pro Ser Glu Ile Val Glu Leu Lys His Glu Ile His Val
545 550 555 560
Trp Arg Leu Thr Ala Gln Arg Ile Ser Pro Ala Ser His Glu Glu Thr
565 570 575
Ala Val Arg Gly Leu Leu Leu Glu Lys Val Leu Ser Leu Glu Arg Leu
580 585 590
Leu Ala Arg Arg Leu His Ser Phe His Arg Gln Ile Ser Gln Glu Asp
595 600 605
Lys Asn Trp Glu Thr Asn Ile Gln Glu Leu Gln Lys Lys His Arg Ile
610 615 620
Ser Asp Arg Thr Leu Leu Thr Lys Cys Val Thr Val Leu Gly Leu Val
625 630 635 640
Ile Phe Met Phe Phe Leu Asn Ser Phe Val Pro Gly Val His Leu Asp
645 650 655
Leu Gly Trp Ile Ala Ile Leu Gly Ala Ile Trp Leu Leu Ile Leu Ala
660 665 670
Asp Ile His Asp Phe Glu Ile Ile Leu His Arg Val Glu Trp Ala Thr
675 680 685
Leu Leu Phe Phe Ala Ala Leu Phe Ile Leu Met Glu Ala Leu Ala His
690 695 700
Leu His Leu Ile Glu Tyr Val Gly Glu Gln Thr Ala Leu Leu Ile Lys
705 710 715 720
Met Val Pro Glu Asp Gln Arg Leu Ala Ala Ala Ile Ile Val Val Val
725 730 735
Trp Val Ser Ala Ile Ala Ser Ser Leu Ile Asp Asn Ile Pro Phe Thr
740 745 750
Ala Thr Met Ile Pro Val Leu Leu Asn Leu Ser Arg Asp Pro Glu Ile
755 760 765
Ser Leu Pro Ala Pro Pro Leu Met Tyr Ala Leu Ala Leu Gly Ala Cys
770 775 780
Leu Gly Gly Asn Gly Thr Leu Ile Gly Ala Ser Ala Asn Val Val Cys
785 790 795 800
Ala Gly Ile Ala Glu Gln His Gly Tyr Gly Phe Ser Phe Met Glu Phe
805 810 815
Phe Arg Leu Gly Phe Pro Met Met Ile Val Ser Cys Met Val Gly Met
820 825 830
Cys Tyr Leu Leu Val Ala His Val Val Met Gly Trp Asn
835 840 845
<210> 16
<211> 1500
<212> DNA
<213> 猪(Sus scrofa)
<400> 16
ataagtactg aaggagttgt tcagaagccc agggagtcaa ccaaaagtaa atggatgcat 60
ccacatggaa ttctgataaa agtttatgat tctccttgat tcatccttct aaatataagt 120
cctgtcttgg gcccatctaa tccagaacta atgttcccca ctcctgctca tgtttgtgtg 180
gatttttggt aagaaaccag gtagactacc tttcccacag ctccattaca gaaagatacc 240
tacacagagg aggaaaggct gagactactc actgtacttt ggggtgagta tagcagaacc 300
aggtattgag aactcagaga caatcaaaga gaaagcaaaa aatgaaatga gctggagctg 360
gtctcagctg gaaacagttc cagactcaga acaagctgat gtttcttgaa ggtctacaat 420
ttattttgaa gattgagtga tcccctggaa gatctccctc aggagccagg ggtctgtgat 480
gtcaagaaag actcactccc accagtttga cagtactacg gtgtctgtgg tcctatttca 540
tactcaccgc tatgatgaaa tgttgttttg gagatgctac tatttctgtg aaaataacca 600
caaagcctcg atttccaggt cacggtgcat gcctgtttac cgtccagagt tcatcactgc 660
tgatgaacct tgggagaaca gctcagctga gtgggaggga ggagccctgc tgagcacaga 720
gttggcagtt tcttctggat ctgcctccac agagaagggg gagcttctgg acagcgctca 780
catcaggtgc catctttcca agctaaggta ggaagacttg gagagccacc cttgacacag 840
ggctgagggt cttaccctca tgacatcctt accaaggaca gggtagaaat aaggaagaag 900
gctaggtagt actttactgg aaggtaatct tccttttctc caggtcctgg aagacagatc 960
taaggtatcc tgcaaaccag cgtgtagccc cagaggctac tagaagccca attacagacc 1020
aaaatgattc acatcctgta ttcctctcct tcctattgtg tggctgagat cacgcccatg 1080
aggtttgcta atggtattca ttcgtttcaa cagccttcgt ggtgttttag aagttggtat 1140
tagcagtgat tagaaggggt ctgcaacagg cggtggaaaa gttctttacc ggctgcaggg 1200
ggcgtgggtg gggctgcaag aggttacggc tacgcatgca gaggtgggtg cgcatgcgca 1260
ggggacctga gctctttttg tagagcgtgg tcagaatgtg ttcaccctgc tttgcccgct 1320
gggacttcca tgtgaccgtt tacgtgacgg aagggttctc ttgagatctg tgtttgaaag 1380
tggcatttcc aggaacagaa taaggcttgg ggaccaggac caccaaggtc cccattctgt 1440
atctaccctg aggtatctga ggacccagca gggccaggca tgtcctccgc aactatctcc 1500
<210> 17
<211> 23
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 17
aggggtctgt gatgtcaaga aag 23
<210> 18
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 18
tacacgctgg tttgcaggat 20
<210> 19
<211> 20
<212> DNA
<213> 人工序类(Artificial Sequence)
<400> 19
actacggtgt ctgtggtcct 20
<210> 20
<211> 23
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 20
ttggtaagga tgtcatgagg gta 23
<210> 21
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 21
cagaagctcc cccttctctg 20
<210> 22
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 22
tcagctgagc tgttctccca 20
<210> 23
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 23
agctcagctg agtgggaggg 20
<210> 24
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 24
ggagaacagc tcagctgagt 20
<210> 25
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 25
tcatcactgc tgatgaacct 20
<210> 26
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 26
agccctgctg agcacagagt 20
<210> 27
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 27
caccgcagaa gctccccctt ctctg 25
<210> 28
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 28
aaaccagaga agggggagct tctgc 25
<210> 29
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 29
caccgtcagc tgagctgttc tccca 25
<210> 30
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 30
aaactgggag aacagctcag ctgac 25
<210> 31
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 31
caccgagctc agctgagtgg gaggg 25
<210> 32
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 32
aaacccctcc cactcagctg agctc 25
<210> 33
<211> 24
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 33
caccggagaa cagctcagct gagt 24
<210> 34
<211> 24
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 34
aaacactcag ctgagctgtt ctcc 24
<210> 35
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 35
caccgtcatc actgctgatg aacct 25
<210> 36
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 36
aaacaggttc atcagcagtg atgac 25
<210> 37
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 37
caccgagccc tgctgagcac agagt 25
<210> 38
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 38
aaacactctg tgctcagcag ggctc 25
<210> 39
<211> 100
<212> RNA
<213> 人工序列(Artificial Sequence)
<400> 39
cagaagcucc cccuucucug guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 40
<211> 100
<212> RNA
<213> 人工序列(Artificial Sequence)
<400> 40
ucagcugagc uguucuccca guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 41
<211> 100
<212> RNA
<213> 人工序列(Artificial Sequence)
<400> 41
agcucagcug agugggaggg guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 42
<211> 100
<212> RNA
<213> 人工序列(Artificial Sequence)
<400> 42
ggagaacagc ucagcugagu guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 43
<211> 100
<212> RNA
<213> 人工序列(Artificial Sequence)
<400> 43
ucaucacugc ugaugaaccu guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 44
<211> 100
<212> RNA
<213> 人工序列(Artificial Sequence)
<400> 44
agcccugcug agcacagagu guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
Claims (2)
1.一种用于猪OCA2基因编辑的CRISPR/Cas9系统,其特征在于包含Cas9表达载体和针对猪OCA2基因的gRNA表达载体;所述的针对猪OCA2基因的gRNA表达载体以全序列如SEQ IDNO.3所示的pKG-U6gRNA为载体骨架,将SEQ ID NO.27和SEQ ID NO.28所示的单链DNA退火得到的具有粘性末端的双链DNA分子插入载体骨架pKG-U6gRNA的BbsI限制性内切酶位点所得,所述的gRNA表达载体表达SEQ ID NO.39所示的gRNA,该gRNA的靶点序列如SEQ IDNO.21所示;所述的Cas9表达载体为质粒全序列如SEQ ID NO.2所示的pU6gRNA-eEF1a-mNLS-hSpCas9-EGFP-PURO载体;所述的gRNA表达载体和Cas9表达载体的摩尔比为3:1。
2.权利要求1所述的CRISPR/Cas9系统在构建眼、皮肤白化病克隆猪核供体细胞中的应用。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011489465.XA CN112680444B (zh) | 2020-12-16 | 2020-12-16 | 用于oca2基因突变的crispr系统及其在构建白化病克隆猪核供体细胞中的应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011489465.XA CN112680444B (zh) | 2020-12-16 | 2020-12-16 | 用于oca2基因突变的crispr系统及其在构建白化病克隆猪核供体细胞中的应用 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112680444A CN112680444A (zh) | 2021-04-20 |
CN112680444B true CN112680444B (zh) | 2023-07-21 |
Family
ID=75448274
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011489465.XA Active CN112680444B (zh) | 2020-12-16 | 2020-12-16 | 用于oca2基因突变的crispr系统及其在构建白化病克隆猪核供体细胞中的应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112680444B (zh) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104263754A (zh) * | 2014-08-29 | 2015-01-07 | 中国科学院广州生物医药与健康研究院 | 白化病模型猪的重构卵及其构建方法和模型猪的构建方法 |
CN104651399A (zh) * | 2014-12-31 | 2015-05-27 | 广西大学 | 一种利用CRISPR/Cas系统在猪胚胎细胞中实现基因敲除的方法 |
CN107937345A (zh) * | 2017-11-16 | 2018-04-20 | 山东蓝思种业股份有限公司 | 一种制备同时敲除cd163基因和cd13基因的猪成纤维细胞的方法 |
CN108588123A (zh) * | 2018-05-07 | 2018-09-28 | 南京医科大学 | CRISPR/Cas9载体组合在制备基因敲除猪的血液制品中的应用 |
CN108795902A (zh) * | 2018-07-05 | 2018-11-13 | 深圳三智医学科技有限公司 | 一种安全高效的CRISPR/Cas9基因编辑技术 |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11240997B2 (en) * | 2019-04-09 | 2022-02-08 | Shandong Landsee Genetics Co., Ltd. | Method for preparing porcine fibroblasts with both CD163 gene and CD13 gene being knocked-out |
-
2020
- 2020-12-16 CN CN202011489465.XA patent/CN112680444B/zh active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104263754A (zh) * | 2014-08-29 | 2015-01-07 | 中国科学院广州生物医药与健康研究院 | 白化病模型猪的重构卵及其构建方法和模型猪的构建方法 |
CN104651399A (zh) * | 2014-12-31 | 2015-05-27 | 广西大学 | 一种利用CRISPR/Cas系统在猪胚胎细胞中实现基因敲除的方法 |
CN107937345A (zh) * | 2017-11-16 | 2018-04-20 | 山东蓝思种业股份有限公司 | 一种制备同时敲除cd163基因和cd13基因的猪成纤维细胞的方法 |
CN108588123A (zh) * | 2018-05-07 | 2018-09-28 | 南京医科大学 | CRISPR/Cas9载体组合在制备基因敲除猪的血液制品中的应用 |
CN108795902A (zh) * | 2018-07-05 | 2018-11-13 | 深圳三智医学科技有限公司 | 一种安全高效的CRISPR/Cas9基因编辑技术 |
Also Published As
Publication number | Publication date |
---|---|
CN112680444A (zh) | 2021-04-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112779292B (zh) | 构建瘦肉率高、生长快且抗蓝耳病和系列腹泻病的优质猪核移植供体细胞的方法及其应用 | |
CN112522261B (zh) | 用于制备lmna基因突变的扩张型心肌病克隆猪核供体细胞的crispr系统及其应用 | |
CN112779291B (zh) | 构建瘦肉率高、生长快、繁殖力高且抗系列流行病的优质猪核移植供体细胞的方法及其应用 | |
CN112522264B (zh) | 一种导致先天性耳聋的CRISPR/Cas9系统及其在制备模型猪核供体细胞中的应用 | |
CN112522260B (zh) | Crispr系统及其在制备ttn基因突变的扩张型心肌病克隆猪核供体细胞中的应用 | |
CN114958762B (zh) | 一种构建神经组织特异过表达人源snca的帕金森病模型猪的方法及应用 | |
CN112877362A (zh) | 构建高繁殖力、抗蓝耳病和系列腹泻病的优质猪核移植供体细胞的基因编辑系统及其应用 | |
CN112522313B (zh) | 用于构建TPH2基因突变的抑郁症克隆猪核供体细胞的CRISPR/Cas9系统 | |
CN113046388B (zh) | 用于构建双基因联合敲除的动脉粥样硬化猪核移植供体细胞的crispr系统及其应用 | |
CN112522309B (zh) | 重症免疫缺陷猪源重组细胞及其制备方法和试剂盒 | |
CN112877363A (zh) | 一种构建瘦肉率高、生长快且繁殖力高的优质猪核移植供体细胞的基因编辑系统及其应用 | |
CN112680444B (zh) | 用于oca2基因突变的crispr系统及其在构建白化病克隆猪核供体细胞中的应用 | |
CN112899306B (zh) | Crispr系统及其在构建gabrg2基因突变的克隆猪核供体细胞中的应用 | |
CN112813101B (zh) | 一种构建瘦肉率高、生长快的优质猪核移植供体细胞的基因编辑系统及其应用 | |
CN112795566B (zh) | 用于构建骨质疏松症克隆猪核供体细胞系的opg基因编辑系统及其应用 | |
CN112575033B (zh) | Crispr系统及其在构建scn1a基因突变的癫痫性脑病克隆猪核供体细胞中的应用 | |
CN112680453B (zh) | Crispr系统及其在构建stxbp1突变的癫痫性脑病克隆猪核供体细胞中的应用 | |
CN112608941B (zh) | 用于构建mc4r基因突变的肥胖症猪核移植供体细胞的crispr系统及其应用 | |
CN112522311B (zh) | 用于adcy3基因编辑的crispr系统及其在构建肥胖症猪核移植供体细胞中应用 | |
CN113584078B (zh) | 用于双靶标基因编辑的crispr系统及其在构建抑郁症猪核移植供体细胞中的应用 | |
CN112522202B (zh) | 制备addi四个基因联合敲除的重症免疫缺陷猪源重组细胞的方法及其专用试剂盒 | |
CN112522258B (zh) | Il2rg基因和ada基因联合敲除的重组细胞及其在制备免疫缺陷猪模型中的应用 | |
CN112522257B (zh) | 用于制备rrip四个基因联合敲除的重症免疫缺陷猪源重组细胞的系统 | |
CN112522256B (zh) | CRISPR/Cas9系统及其在构建抗肌萎缩蛋白基因缺陷的猪源重组细胞中的应用 | |
CN112522255B (zh) | CRISPR/Cas9系统及其在构建胰岛素受体底物基因缺陷的猪源重组细胞中的应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |